May-king it production ready✨
In May, we shipped the kind of upgrades that help you move your AI agents into production fast and stay in control — whether you’re scaling, securing AI behavior, or bringing new models to your apps.
We launched deep integrations with agent frameworks like PydanticAI and the OpenAI Agents SDK, added enterprise-grade controls to Claude Code, made it simpler to call remote MCP servers, and much more!
Here’s everything new this month:
| Area | Key Highlights |
|---|---|
| Platform | • Full HTTP method support (GET, PUT, DELETE) • OTel analytics export to your stack • OpenAI Computer Use Tool support • Multimodal embedding support (Vertex AI) |
| Enterprise | • Deep Azure AI ecosystem integration (Foundry, APIM, Marketplace) • Claude Code with enterprise controls (rate limits, observability) • Model whitelist guardrail for org/env control |
| Integrations | • Expanded AI agent frameworks (PydanticAI, OpenAI SDK, Strands) • Support for latest models (Claude 4, Grok 3, Gemini 2.5) & new providers • AI coding assistant integrations (Cline, Roo Code) • Remote MCP server & Arize Phoenix tracing support |
| Security | • New Prompt Security guardrails (injection, data protection) • JWT validator input guardrail • PANW Prisma AIRS plugin for real-time risk blocking |
| Resources | • New solution pages (AWS Bedrock, GovCloud) • New cookbooks (OpenAI Computer Use, Llama Prompt Ops) |
AI agent frameworks are helping teams prototype faster, but taking agents to production requires real infrastructure. Portkey integrates with leading frameworks to bring interoperability, observability, reliability, and cost management to your agent workflows.
PydanticAI is a Python framework that brings FastAPI-like ergonomics to building AI agents.
OpenAI Agents SDK helps teams ship production-grade agents with built-in planning, memory, and tool use.
Strands Agents is a lightweight agent framework built by AWS to simplify agent development.
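Whichever framework you use, the common pattern is pointing an OpenAI-compatible client at Portkey’s gateway. Here’s a minimal, stdlib-only sketch of how such a request is shaped (the header names follow Portkey’s `x-portkey-*` convention; the keys are placeholders):

```python
import json
import urllib.request

PORTKEY_GATEWAY_URL = "https://api.portkey.ai/v1"

def build_gateway_request(path: str, payload: dict,
                          portkey_api_key: str, provider: str) -> urllib.request.Request:
    """Build (but don't send) a chat request routed through Portkey's gateway.
    Header names follow Portkey's x-portkey-* convention; keys are placeholders."""
    return urllib.request.Request(
        PORTKEY_GATEWAY_URL + path,
        data=json.dumps(payload).encode(),
        headers={
            "Content-Type": "application/json",
            "x-portkey-api-key": portkey_api_key,  # placeholder
            "x-portkey-provider": provider,
        },
        method="POST",
    )

req = build_gateway_request(
    "/chat/completions",
    {"model": "gpt-4o-mini", "messages": [{"role": "user", "content": "Hello"}]},
    portkey_api_key="YOUR_PORTKEY_API_KEY",  # placeholder
    provider="openai",
)
```

Because every framework’s traffic funnels through the same gateway, you get one place for logging, fallbacks, and cost tracking across all of them.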
Tracing Integrations: Arize AI
For teams consolidating observability into Arize, you can now view Portkey’s logs directly in Arize Phoenix for unified trace views across your LLM workflows.
Portkey now supports calling remote MCP servers: tools maintained by developers and organizations across the internet and exposed to MCP clients via the Responses API. Read more about the integration here.
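As an illustrative sketch, a Responses API payload that attaches a remote MCP server looks roughly like this (the server label and URL are placeholders, not a real endpoint):

```python
# Illustrative Responses API payload attaching a remote MCP server.
# server_label / server_url are placeholders; send this to the gateway's
# /v1/responses endpoint with your usual Portkey headers.
payload = {
    "model": "gpt-4o",
    "input": "Which tools does this server expose?",
    "tools": [
        {
            "type": "mcp",
            "server_label": "my_remote_server",       # placeholder
            "server_url": "https://example.com/mcp",  # placeholder
            "require_approval": "never",
        }
    ],
}
```

The model can then discover and call the server’s tools mid-response, with the round trips visible in your Portkey logs.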
More than half of Fortune 500 companies use Azure OpenAI. But building GenAI apps in the enterprise is still messy: cost attribution, routing logic, usage tracking, model evaluation… all scattered.
With Portkey’s deep integration into the Azure AI ecosystem (OpenAI, Foundry, APIM, Marketplace), teams can now build, scale, and govern GenAI apps without leaving their existing cloud setup.
Our customers are vouching for it!
Bring enterprise-grade visibility, governance, and access control to Claude Code.
Supercharge your AI-powered terminal with cost tracking, access controls, and observability.
Add security, compliance, and real-time analytics to your code assistant workflows.
Add essential enterprise controls to Goose’s powerful autonomous coding capabilities.
Portkey now supports embedding APIs from Vertex AI for text, image, and video, across multiple languages.
Read more about the implementation here.
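As a hedged sketch, a unified `/embeddings` request through the gateway looks like this (`text-multilingual-embedding-002` is one of Vertex AI’s multilingual text embedding models; image and video inputs use provider-specific shapes covered in the docs):

```python
# Hedged sketch of a unified /embeddings request body through the gateway.
# Image/video embeddings use provider-specific input shapes (see docs).
payload = {
    "model": "text-multilingual-embedding-002",  # Vertex AI multilingual model
    "input": ["Bonjour le monde", "Hello world"],
}
```

Send it to the gateway’s `/v1/embeddings` route with your Portkey headers, exactly as you would for any other provider.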
Multi-label support for prompts
You can now assign multiple labels to a single prompt version, making it easy to promote a version across environments like staging and production.
Gateway to any API
Portkey now supports GET, PUT, and DELETE HTTP methods in addition to POST, allowing you to route requests to any external or self-hosted provider endpoint. This means you can connect to custom APIs directly through Portkey with full observability for every call.
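A minimal, stdlib-only sketch of what this enables (the custom-host header follows Portkey’s `x-portkey-*` convention, and the target URL and key are hypothetical placeholders):

```python
import urllib.request

def gateway_request(path: str, method: str = "GET") -> urllib.request.Request:
    """Build (but don't send) a request through Portkey's gateway with any
    HTTP method. The custom host points at a hypothetical internal API;
    both header values are placeholders."""
    return urllib.request.Request(
        "https://api.portkey.ai/v1" + path,
        method=method,
        headers={
            "x-portkey-api-key": "YOUR_PORTKEY_API_KEY",                  # placeholder
            "x-portkey-custom-host": "https://internal-api.example.com",  # hypothetical
        },
    )

# Every call is logged with full observability, regardless of method.
list_req = gateway_request("/items")                 # GET
delete_req = gateway_request("/items/42", "DELETE")  # DELETE
```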
OTel Integration (Analytics Data)
You can now export Portkey analytics to any OpenTelemetry (OTel)-compatible collector, integrating easily into your existing observability stack.
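For example, a generic OpenTelemetry Collector pipeline that receives what Portkey exports over OTLP and forwards it to your backend might look like this (illustrative only; swap the exporter for the one your stack uses):

```yaml
# Illustrative OTel Collector config: receive OTLP data and forward it.
receivers:
  otlp:
    protocols:
      http:
exporters:
  debug: {}          # replace with your backend's exporter (e.g. Datadog, Jaeger)
service:
  pipelines:
    traces:
      receivers: [otlp]
      exporters: [debug]
```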
Improvements
• Support for gpt-image-1

This is what keeps us going!
Prompt Security guardrails: Integrate with Prompt Security to detect prompt injection and prevent sensitive data exposure in both prompts and responses.
JWT validator guardrail: Added as an input guardrail to validate incoming JWT tokens before requests are sent to the LLM.
PANW Prisma AIRS Plugin: Portkey now integrates with Palo Alto Networks’ AIRS (AI Runtime Security) to enforce guardrails that block risky prompts or model responses based on real-time security analysis.
Model whitelist guardrail: Restrict or deny specific models at the org, environment, or request level using a flexible whitelist/blacklist guardrail.
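To give a feel for what the JWT validator input guardrail checks before a request reaches the LLM, here is a minimal HS256 validation sketch in plain Python (illustrative only; Portkey’s actual implementation and configuration are handled in the platform):

```python
import base64
import hashlib
import hmac
import json
import time

def b64url_decode(segment: str) -> bytes:
    # JWT segments drop base64 padding; restore it before decoding.
    return base64.urlsafe_b64decode(segment + "=" * (-len(segment) % 4))

def validate_jwt(token: str, secret: bytes) -> bool:
    """Minimal HS256 JWT check: well-formed, signature valid, not expired.
    Illustrative sketch; a real guardrail also checks issuer, audience, etc."""
    try:
        header_b64, payload_b64, sig_b64 = token.split(".")
    except ValueError:
        return False  # not a three-part JWT
    signing_input = f"{header_b64}.{payload_b64}".encode()
    expected = hmac.new(secret, signing_input, hashlib.sha256).digest()
    if not hmac.compare_digest(expected, b64url_decode(sig_b64)):
        return False  # signature mismatch
    claims = json.loads(b64url_decode(payload_b64))
    return claims.get("exp", 0) > time.time()  # reject expired tokens
```

Requests carrying tokens that fail any of these checks are stopped at the gateway, before a single LLM token is generated.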
No frills. No hype. Just serious safety.
A special thanks to our community contributors this month:
Provision and manage LLM access across your entire org from a single admin panel. Centralized controls. Granular permissions. Stay tuned.