PydanticAI is a Python agent framework designed to make it less painful to build production-grade applications with Generative AI. It brings the same ergonomic design and developer experience to GenAI that FastAPI brought to web development.
Portkey enhances PydanticAI with production-readiness features, turning your experimental agents into robust systems by providing:
Complete observability of every agent step, tool use, and interaction
Built-in reliability with fallbacks, retries, and load balancing
Cost tracking and optimization to manage your AI spend
Access to 200+ LLMs through a single integration
Guardrails to keep agent behavior safe and compliant
Version-controlled prompts for consistent agent performance
Create a Portkey API key with optional budget/rate limits from the Portkey dashboard. You can attach configurations for reliability, caching, and more to this key.
3. Configure Portkey Client
For a simple setup, first configure the Portkey client that will be used with PydanticAI:
from portkey_ai import AsyncPortkey

# Set up Portkey client with appropriate metadata for tracking
portkey_client = AsyncPortkey(
    api_key="YOUR_PORTKEY_API_KEY",
    virtual_key="YOUR_PROVIDER_VIRTUAL_KEY",  # Optional, if using virtual keys
    trace_id="unique-trace-id",               # Optional, for request tracing
    metadata={                                # Optional, for request segmentation
        "app_env": "production",
        "_user": "user_123"                   # Special _user field for user analytics
    }
)
What are Virtual Keys? Virtual keys in Portkey securely store your LLM provider API keys (OpenAI, Anthropic, etc.) in an encrypted vault. They allow for easier key rotation and budget management. Learn more about virtual keys here.
4. Connect to PydanticAI
After setting up your Portkey client, you can integrate it with PydanticAI by connecting it to a model provider:
from pydantic_ai import Agent
from pydantic_ai.models.openai import OpenAIModel
from pydantic_ai.providers.openai import OpenAIProvider

# Connect Portkey client to a PydanticAI model via its provider
agent = Agent(
    model=OpenAIModel(
        model_name="gpt-4o",
        provider=OpenAIProvider(openai_client=portkey_client),
    ),
    system_prompt="You are a helpful assistant."
)
Let’s create a simple structured output agent with PydanticAI and Portkey. This agent will respond to a query about Formula 1 and return structured data:
from portkey_ai import AsyncPortkey
from pydantic import BaseModel, Field
from pydantic_ai import Agent
from pydantic_ai.models.openai import OpenAIModel
from pydantic_ai.providers.openai import OpenAIProvider

# Set up Portkey client with tracing and metadata
portkey_client = AsyncPortkey(
    api_key="YOUR_PORTKEY_API_KEY",
    virtual_key="YOUR_OPENAI_VIRTUAL_KEY",
    trace_id="f1-data-request",
    metadata={"app_env": "production", "_user": "user_123"}
)

# Define structured output using Pydantic
class F1GrandPrix(BaseModel):
    gp_name: str = Field(description="Grand Prix name, e.g. `Emilia Romagna Grand Prix`")
    year: int = Field(description="The year of the Grand Prix")
    constructor_winner: str = Field(description="The winning constructor of the Grand Prix")
    podium: list[str] = Field(description="Names of the podium drivers (1st, 2nd, 3rd)")

# Create the agent with a structured output type
f1_gp_agent = Agent[None, F1GrandPrix](
    model=OpenAIModel(
        model_name="gpt-4o",
        provider=OpenAIProvider(openai_client=portkey_client),
    ),
    output_type=F1GrandPrix,
    system_prompt="Assist the user by providing data about the specified Formula 1 Grand Prix"
)

# Run the agent
async def main():
    result = await f1_gp_agent.run("Las Vegas 2023")
    print(result.output)

if __name__ == "__main__":
    import asyncio
    asyncio.run(main())
The output will be a structured F1GrandPrix object with all fields properly typed and validated:
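For the "Las Vegas 2023" query above, a successful run would print something roughly like the following (the exact field values depend on the model's response):

gp_name='Las Vegas Grand Prix' year=2023 constructor_winner='Red Bull Racing' podium=['Max Verstappen', 'Charles Leclerc', 'Sergio Perez']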
PydanticAI supports multimodal inputs including images. Here’s how to use Portkey with a vision model:
from portkey_ai import AsyncPortkey
from pydantic_ai import Agent, ImageUrl
from pydantic_ai.models.openai import OpenAIModel
from pydantic_ai.providers.openai import OpenAIProvider

# Set up Portkey client
portkey_client = AsyncPortkey(
    api_key="YOUR_PORTKEY_API_KEY",
    virtual_key="YOUR_OPENAI_VIRTUAL_KEY",
    trace_id="vision-request",
    metadata={"request_type": "image_analysis"}
)

# Create a vision-capable agent
vision_agent = Agent(
    model=OpenAIModel(
        model_name="gpt-4o",  # Vision-capable model
        provider=OpenAIProvider(openai_client=portkey_client),
    ),
    system_prompt="Analyze images and provide detailed descriptions."
)

# Process an image
result = vision_agent.run_sync([
    'What company is this logo from?',
    ImageUrl(url='https://iili.io/3Hs4FMg.png'),
])
print(result.output)
Visit your Portkey dashboard to see detailed logs of this image analysis request, including token usage and costs.
PydanticAI provides a powerful tools system that integrates seamlessly with Portkey. Here’s how to create an agent with tools:
import random

from portkey_ai import AsyncPortkey
from pydantic_ai import Agent, RunContext
from pydantic_ai.models.openai import OpenAIModel
from pydantic_ai.providers.openai import OpenAIProvider

# Set up Portkey client
portkey_client = AsyncPortkey(
    api_key="YOUR_PORTKEY_API_KEY",
    virtual_key="YOUR_OPENAI_VIRTUAL_KEY",
    trace_id="dice-game-session",
    metadata={"game_type": "dice"}
)

# Create an agent with dependency injection (player name)
dice_agent = Agent(
    model=OpenAIModel(
        model_name="gpt-4o",
        provider=OpenAIProvider(openai_client=portkey_client),
    ),
    deps_type=str,  # Dependency type (player name as string)
    system_prompt=(
        "You're a dice game host. Roll the die and see if it matches "
        "the user's guess. If so, tell them they're a winner. "
        "Use the player's name in your response."
    ),
)

# Define a plain tool (no context needed)
@dice_agent.tool_plain
def roll_die() -> str:
    """Roll a six-sided die and return the result."""
    return str(random.randint(1, 6))

# Define a tool that uses the dependency
@dice_agent.tool
def get_player_name(ctx: RunContext[str]) -> str:
    """Get the player's name."""
    return ctx.deps

# Run the agent
dice_result = dice_agent.run_sync('My guess is 4', deps='Anne')
print(dice_result.output)
Portkey logs each tool call separately, allowing you to analyze the full execution path of your agent, including both LLM calls and tool invocations.
PydanticAI excels at creating multi-agent systems where agents can call each other. Here’s how to integrate Portkey with a multi-agent setup:
This multi-agent system uses three specialized agents:
search_agent - Orchestrates the flow and validates flight selections
extraction_agent - Extracts structured flight data from raw text
seat_preference_agent - Interprets user’s seat preferences
With Portkey integration, you get:
Unified tracing across all three agents
Token and cost tracking for the entire workflow
Ability to set usage limits across the entire system
Observability of both AI and human interaction points
Here's the full implementation, showing how these agents interact through a shared Portkey client:
import datetime
from dataclasses import dataclass
from typing import Literal

from pydantic import BaseModel, Field
from rich.prompt import Prompt

from pydantic_ai import Agent, ModelRetry, RunContext
from pydantic_ai.messages import ModelMessage
from pydantic_ai.models.openai import OpenAIModel
from pydantic_ai.providers.openai import OpenAIProvider
from pydantic_ai.usage import Usage, UsageLimits
from portkey_ai import AsyncPortkey

# Set up Portkey client with a shared trace ID for connected tracing
portkey_client = AsyncPortkey(
    api_key="YOUR_PORTKEY_API_KEY",
    virtual_key="YOUR_OPENAI_VIRTUAL_KEY",
    trace_id="flight-booking-session",
    metadata={"app_type": "flight_booking"}
)

# Define structured output types
class FlightDetails(BaseModel):
    """Details of the most suitable flight."""
    flight_number: str
    price: int
    origin: str = Field(description='Three-letter airport code')
    destination: str = Field(description='Three-letter airport code')
    date: datetime.date

class NoFlightFound(BaseModel):
    """When no valid flight is found."""

class SeatPreference(BaseModel):
    row: int = Field(ge=1, le=30)
    seat: Literal['A', 'B', 'C', 'D', 'E', 'F']

class Failed(BaseModel):
    """Unable to extract a seat selection."""

# Dependencies for flight search
@dataclass
class Deps:
    web_page_text: str
    req_origin: str
    req_destination: str
    req_date: datetime.date

# This agent is responsible for controlling the flow of the conversation
search_agent = Agent[Deps, FlightDetails | NoFlightFound](
    model=OpenAIModel(
        model_name="gpt-4o",
        provider=OpenAIProvider(openai_client=portkey_client),
    ),
    output_type=FlightDetails | NoFlightFound,  # type: ignore
    retries=4,
    system_prompt=(
        'Your job is to find the cheapest flight for the user on the given date. '
    ),
    instrument=True,  # Enable instrumentation for better tracing
)

# This agent is responsible for extracting flight details from web page text
extraction_agent = Agent(
    model=OpenAIModel(
        model_name="gpt-4o",
        provider=OpenAIProvider(openai_client=portkey_client),
    ),
    output_type=list[FlightDetails],
    system_prompt='Extract all the flight details from the given text.',
)

# This agent is responsible for extracting the user's seat selection
seat_preference_agent = Agent[None, SeatPreference | Failed](
    model=OpenAIModel(
        model_name="gpt-4o",
        provider=OpenAIProvider(openai_client=portkey_client),
    ),
    output_type=SeatPreference | Failed,  # type: ignore
    system_prompt=(
        "Extract the user's seat preference. "
        'Seats A and F are window seats. '
        'Row 1 is the front row and has extra leg room. '
        'Rows 14, and 20 also have extra leg room. '
    ),
)

@search_agent.tool
async def extract_flights(ctx: RunContext[Deps]) -> list[FlightDetails]:
    """Get details of all flights."""
    # Pass the usage to track nested agent calls
    result = await extraction_agent.run(ctx.deps.web_page_text, usage=ctx.usage)
    return result.output

@search_agent.output_validator
async def validate_output(
    ctx: RunContext[Deps], output: FlightDetails | NoFlightFound
) -> FlightDetails | NoFlightFound:
    """Procedural validation that the flight meets the constraints."""
    if isinstance(output, NoFlightFound):
        return output

    errors: list[str] = []
    if output.origin != ctx.deps.req_origin:
        errors.append(
            f'Flight should have origin {ctx.deps.req_origin}, not {output.origin}'
        )
    if output.destination != ctx.deps.req_destination:
        errors.append(
            f'Flight should have destination {ctx.deps.req_destination}, not {output.destination}'
        )
    if output.date != ctx.deps.req_date:
        errors.append(f'Flight should be on {ctx.deps.req_date}, not {output.date}')

    if errors:
        raise ModelRetry('\n'.join(errors))
    else:
        return output

# Sample flight data (in a real application, this would be from a web scraper)
flights_web_page = """
1. Flight SFO-AK123
- Price: $350
- Origin: San Francisco International Airport (SFO)
- Destination: Ted Stevens Anchorage International Airport (ANC)
- Date: January 10, 2025

2. Flight SFO-AK456
- Price: $370
- Origin: San Francisco International Airport (SFO)
- Destination: Fairbanks International Airport (FAI)
- Date: January 10, 2025

... more flights ...
"""

# Main application flow
async def main():
    # Restrict how many requests this app can make to the LLM
    usage_limits = UsageLimits(request_limit=15)

    deps = Deps(
        web_page_text=flights_web_page,
        req_origin='SFO',
        req_destination='ANC',
        req_date=datetime.date(2025, 1, 10),
    )
    message_history: list[ModelMessage] | None = None
    usage: Usage = Usage()

    # Run the agent until a satisfactory flight is found
    while True:
        result = await search_agent.run(
            f'Find me a flight from {deps.req_origin} to {deps.req_destination} on {deps.req_date}',
            deps=deps,
            usage=usage,
            message_history=message_history,
            usage_limits=usage_limits,
        )
        if isinstance(result.output, NoFlightFound):
            print('No flight found')
            break
        else:
            flight = result.output
            print(f'Flight found: {flight}')
            answer = Prompt.ask(
                'Do you want to buy this flight, or keep searching? (buy/*search)',
                choices=['buy', 'search', ''],
                show_choices=False,
            )
            if answer == 'buy':
                seat = await find_seat(usage, usage_limits)
                await buy_tickets(flight, seat)
                break
            else:
                message_history = result.all_messages(
                    output_tool_return_content='Please suggest another flight'
                )

async def find_seat(usage: Usage, usage_limits: UsageLimits) -> SeatPreference:
    message_history: list[ModelMessage] | None = None
    while True:
        answer = Prompt.ask('What seat would you like?')
        result = await seat_preference_agent.run(
            answer,
            message_history=message_history,
            usage=usage,
            usage_limits=usage_limits,
        )
        if isinstance(result.output, SeatPreference):
            return result.output
        else:
            print('Could not understand seat preference. Please try again.')
            message_history = result.all_messages()

async def buy_tickets(flight_details: FlightDetails, seat: SeatPreference):
    print(f'Purchasing flight {flight_details=!r} {seat=!r}...')
Portkey preserves all the type safety of PydanticAI while adding production monitoring and reliability.
Portkey provides comprehensive observability for your PydanticAI agents, helping you understand exactly what’s happening during each execution.
Traces provide a hierarchical view of your agent’s execution, showing the sequence of LLM calls, tool invocations, and state transitions.
# Add trace_id to enable hierarchical tracing in Portkey
portkey_client = AsyncPortkey(
    api_key="YOUR_PORTKEY_API_KEY",
    virtual_key="YOUR_LLM_PROVIDER_VIRTUAL_KEY",
    trace_id="unique-session-id",  # Add unique trace ID
    metadata={"request_type": "user_query"}
)
Portkey logs every interaction with LLMs, including:
Complete request and response payloads
Latency and token usage metrics
Cost calculations
Tool calls and function executions
All logs can be filtered by metadata, trace IDs, models, and more, making it easy to debug specific agent runs.
Portkey provides built-in dashboards that help you:
Track cost and token usage across all agent runs
Analyze performance metrics like latency and success rates
Identify bottlenecks in your agent workflows
Compare different agent configurations and LLMs
You can filter and segment all metrics by custom metadata to analyze specific agent types, user groups, or use cases.
Add custom metadata to your PydanticAI agent calls to enable powerful filtering and segmentation:
portkey_client = AsyncPortkey(
    api_key="YOUR_PORTKEY_API_KEY",
    virtual_key="YOUR_LLM_PROVIDER_VIRTUAL_KEY",
    metadata={
        "agent_type": "weather_agent",
        "environment": "production",
        "_user": "user_123",  # Special _user field for user analytics
        "request_source": "mobile_app"
    }
)
This metadata can be used to filter logs, traces, and metrics on the Portkey dashboard, allowing you to analyze specific agent runs, users, or environments.
2. Reliability - Keep Your Agents Running Smoothly
When running agents in production, things can go wrong - API rate limits, network issues, or provider outages. Portkey’s reliability features ensure your agents keep running smoothly even when problems occur.
It’s simple to enable fallback in your PydanticAI agents by using a Portkey Config:
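As a minimal sketch (assuming Portkey's standard config schema and two placeholder virtual keys), a fallback config tries the first target and automatically switches to the second if the request fails:

from portkey_ai import AsyncPortkey

# Sketch of a fallback config: try OpenAI first, fall back to Anthropic on failure.
# "openai-virtual-key" and "anthropic-virtual-key" are placeholder names.
fallback_config = {
    "strategy": {"mode": "fallback"},
    "targets": [
        {"virtual_key": "openai-virtual-key", "override_params": {"model": "gpt-4o"}},
        {"virtual_key": "anthropic-virtual-key", "override_params": {"model": "claude-3-5-sonnet-20240620"}},
    ],
}

portkey_client = AsyncPortkey(
    api_key="YOUR_PORTKEY_API_KEY",
    config=fallback_config,  # A saved config ID from the dashboard also works here
)

The resulting client is passed to OpenAIProvider exactly as in the earlier examples, so no agent code changes are required.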
Portkey’s Prompt Engineering Studio helps you create, manage, and optimize the prompts used in your PydanticAI agents. Instead of hardcoding prompts or instructions, use Portkey’s prompt rendering API to dynamically fetch and apply your versioned prompts.
Manage prompts in Portkey's Prompt Library
Prompt Playground is a place to compare, test and deploy perfect prompts for your AI application. It’s where you experiment with different models, test variables, compare outputs, and refine your prompt engineering strategy before deploying to production. It allows you to:
Iteratively develop prompts before using them in your agents
Test prompts with different variables and models
Compare outputs between different prompt versions
Collaborate with team members on prompt development
This visual environment makes it easier to craft effective prompts for each step in your PydanticAI agent’s workflow.
The Prompt Render API retrieves your prompt templates with all parameters configured:
from portkey_ai import Portkey, AsyncPortkey
from pydantic_ai import Agent
from pydantic_ai.models.openai import OpenAIModel
from pydantic_ai.providers.openai import OpenAIProvider

# Initialize Portkey clients
portkey_admin = Portkey(api_key="YOUR_PORTKEY_API_KEY")
portkey_client = AsyncPortkey(api_key="YOUR_PORTKEY_API_KEY", virtual_key="YOUR_OPENAI_VIRTUAL_KEY")

# Retrieve prompt using the render API
prompt_data = portkey_admin.prompts.render(
    prompt_id="YOUR_PROMPT_ID",
    variables={
        "user_input": "Tell me about artificial intelligence"
    }
)

# Use the rendered prompt in your PydanticAI agent
agent = Agent(
    model=OpenAIModel(
        model_name="gpt-4o",
        provider=OpenAIProvider(openai_client=portkey_client),
    ),
    system_prompt=prompt_data.data.messages[0]["content"]  # Use the rendered prompt
)

result = agent.run_sync("Tell me about artificial intelligence")
print(result.output)
You can:
Create multiple versions of the same prompt
Compare performance between versions
Roll back to previous versions if needed
Specify which version to use in your code:
# Use a specific prompt version
prompt_data = portkey_admin.prompts.render(
    prompt_id="YOUR_PROMPT_ID@version_number",
    variables={
        "user_input": "Tell me about quantum computing"
    }
)
Portkey prompts use Mustache-style templating for easy variable substitution:
You are an AI assistant helping with {{task_type}}.

User question: {{user_input}}

Please respond in a {{tone}} tone and include {{required_elements}}.
When rendering, simply pass the variables:
prompt_data = portkey_admin.prompts.render(
    prompt_id="YOUR_PROMPT_ID",
    variables={
        "task_type": "research",
        "user_input": "Tell me about quantum computing",
        "tone": "professional",
        "required_elements": "recent academic references"
    }
)
Track individual users through your PydanticAI agents using Portkey’s metadata system.
What is Metadata in Portkey?
Metadata allows you to associate custom data with each request, enabling filtering, segmentation, and analytics. The special _user field is specifically designed for user tracking.
from portkey_ai import AsyncPortkey
from pydantic_ai import Agent
from pydantic_ai.models.openai import OpenAIModel
from pydantic_ai.providers.openai import OpenAIProvider

# Configure client with user tracking
portkey_client = AsyncPortkey(
    api_key="YOUR_PORTKEY_API_KEY",
    virtual_key="YOUR_OPENAI_VIRTUAL_KEY",
    metadata={
        "_user": "user_123",  # Special _user field for user analytics
        "user_tier": "premium",
        "user_company": "Acme Corp",
        "session_id": "abc-123"
    }
)

# Create agent with Portkey client
agent = Agent(
    model=OpenAIModel(
        model_name="gpt-4o",
        provider=OpenAIProvider(openai_client=portkey_client),
    ),
    system_prompt="You are a helpful assistant."
)
Filter Analytics by User
With metadata in place, you can filter analytics by user and analyze performance metrics on a per-user basis:
Filter analytics by user
This enables:
Per-user cost tracking and budgeting
Personalized user analytics
Team or organization-level metrics
Environment-specific monitoring (staging vs. production)
PydanticAI supports multiple LLM providers, and Portkey extends this capability by providing access to over 200 LLMs through a unified interface. You can easily switch between different models without changing your core agent logic:
from portkey_ai import AsyncPortkey
from pydantic_ai import Agent
from pydantic_ai.models.openai import OpenAIModel
from pydantic_ai.providers.openai import OpenAIProvider

# OpenAI with Portkey
portkey_openai = AsyncPortkey(
    api_key="YOUR_PORTKEY_API_KEY",
    virtual_key="YOUR_OPENAI_VIRTUAL_KEY"
)

# Anthropic with Portkey
portkey_anthropic = AsyncPortkey(
    api_key="YOUR_PORTKEY_API_KEY",
    virtual_key="YOUR_ANTHROPIC_VIRTUAL_KEY"
)

# Create agents with different models
openai_agent = Agent(
    model=OpenAIModel(
        model_name="gpt-4o",
        provider=OpenAIProvider(openai_client=portkey_openai),
    ),
    system_prompt="You are a helpful assistant."
)

anthropic_agent = Agent(
    model=OpenAIModel(
        model_name="claude-3-opus-20240229",
        # Portkey exposes an OpenAI-compatible interface, so the Anthropic-backed
        # client is still passed as openai_client
        provider=OpenAIProvider(openai_client=portkey_anthropic),
    ),
    system_prompt="You are a helpful assistant."
)

# Choose which agent to use based on your needs
active_agent = openai_agent  # or anthropic_agent
result = active_agent.run_sync("Tell me about quantum computing.")
print(result.output)
Portkey provides access to LLMs from providers including:
OpenAI (GPT-4o, GPT-4 Turbo, etc.)
Anthropic (Claude 3.5 Sonnet, Claude 3 Opus, etc.)
After setting up your Portkey API key with the attached config, connect it to your PydanticAI agents:
from portkey_ai import AsyncPortkey
from pydantic_ai import Agent
from pydantic_ai.models.openai import OpenAIModel
from pydantic_ai.providers.openai import OpenAIProvider

# Configure Portkey client with your API key
portkey_client = AsyncPortkey(
    api_key="YOUR_PORTKEY_API_KEY"  # The API key with attached config from step 3
)

# Create agent with Portkey client
agent = Agent(
    model=OpenAIModel(
        model_name="gpt-4o",
        provider=OpenAIProvider(openai_client=portkey_client),
    ),
    system_prompt="You are a helpful assistant."
)
As your AI usage scales, controlling which teams can access specific models becomes crucial. Portkey Configs provide this control layer, letting you attach model access rules, budget and rate limits, and reliability policies to each team's API key. A sketch of such a config is shown below.
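As a sketch (assuming Portkey's standard config schema), a team-level config attached to an API key might combine retries, caching, and a pinned provider:

from portkey_ai import AsyncPortkey

# Hypothetical team config: retry transient failures, cache repeated requests,
# and route through a specific provider virtual key
team_config = {
    "retry": {"attempts": 3},
    "cache": {"mode": "simple"},
    "virtual_key": "YOUR_OPENAI_VIRTUAL_KEY",
}

portkey_client = AsyncPortkey(
    api_key="TEAM_MEMBER_PORTKEY_API_KEY",  # Each team member gets their own key
    config=team_config,                     # Or attach this config to the key in the dashboard
)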
After distributing API keys to your team members, your enterprise-ready PydanticAI setup is ready to go. Each team member can now use their designated API keys with appropriate access levels and budget controls.
Portkey adds production-readiness to PydanticAI through comprehensive observability (traces, logs, metrics), reliability features (fallbacks, retries, caching), and access to 200+ LLMs through a unified interface. This makes it easier to debug, optimize, and scale your agent applications, all while preserving PydanticAI’s strong type safety.
Yes! Portkey integrates seamlessly with existing PydanticAI applications. You just need to replace your client initialization code with the Portkey-enabled version. The rest of your agent code remains unchanged and continues to benefit from PydanticAI’s strong typing.
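In practice the swap is typically a one-line change in how the underlying client is built. A sketch, assuming your existing code passes an AsyncOpenAI client to OpenAIProvider:

from openai import AsyncOpenAI
from portkey_ai import AsyncPortkey
from pydantic_ai.providers.openai import OpenAIProvider

# Before: plain OpenAI client
provider = OpenAIProvider(openai_client=AsyncOpenAI(api_key="OPENAI_API_KEY"))

# After: Portkey-enabled client; the rest of the agent code is unchanged
provider = OpenAIProvider(
    openai_client=AsyncPortkey(
        api_key="YOUR_PORTKEY_API_KEY",
        virtual_key="YOUR_OPENAI_VIRTUAL_KEY",
    )
)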
Portkey supports all PydanticAI features, including structured outputs, tool use, multi-agent systems, and more. It adds observability and reliability without limiting any of the framework’s functionality.
Yes, Portkey allows you to use a consistent trace_id across multiple agents and requests to track the entire workflow. This is especially useful for multi-agent systems where you want to understand the full execution path.
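For example (a sketch with a hypothetical shared trace ID), two separately configured clients can reuse the same trace_id so their requests appear under one trace in the dashboard:

from portkey_ai import AsyncPortkey

SESSION_TRACE_ID = "support-session-42"  # Hypothetical shared trace ID for the whole workflow

research_client = AsyncPortkey(
    api_key="YOUR_PORTKEY_API_KEY",
    virtual_key="YOUR_OPENAI_VIRTUAL_KEY",
    trace_id=SESSION_TRACE_ID,
    metadata={"agent_name": "research_agent"},
)

writer_client = AsyncPortkey(
    api_key="YOUR_PORTKEY_API_KEY",
    virtual_key="YOUR_ANTHROPIC_VIRTUAL_KEY",
    trace_id=SESSION_TRACE_ID,
    metadata={"agent_name": "writer_agent"},
)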
Portkey allows you to add custom metadata to your agent runs, which you can then use for filtering. Add fields like agent_name, agent_type, or session_id to easily find and analyze specific agent executions.
Yes! Portkey uses your own API keys for the various LLM providers. It securely stores them as virtual keys, allowing you to easily manage and rotate keys without changing your code.