Portkey supports the Google Vertex AI CountTokens API, which returns the token count for a given input before any request is sent to the model. This lets you estimate costs, keep prompts within token limits, and plan token usage in your applications. The count returned by this operation matches the token count that would be charged if the same input were sent to the model in a generation request.

Using Count Tokens with Portkey

Portkey supports Vertex AI’s CountTokens endpoint through the Anthropic-compatible format, allowing you to use the same API signature across Vertex AI, Bedrock, and Anthropic providers.
import anthropic

client = anthropic.Anthropic(
    api_key="dummy",  # unused; Portkey authenticates via the x-portkey-api-key header
    default_headers={"x-portkey-api-key": "YOUR_PORTKEY_API_KEY"},
    base_url="https://api.portkey.ai/v1"
)

response = client.messages.count_tokens(
    model="@your-vertex-provider-slug/your-model-name",
    system="You are a scientist",
    messages=[{
        "role": "user",
        "content": "Hello, Claude"
    }],
)

print(response.input_tokens)  # token count for the request above
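The returned count can be compared against the model's context window before committing to a generation request. A minimal pre-flight check sketch; the context limit and output budget below are illustrative values, not figures from this page:

```python
def fits_in_context(input_tokens: int, context_limit: int, output_budget: int) -> bool:
    """Return True when the prompt plus the reserved output budget
    fits inside the model's context window."""
    return input_tokens + output_budget <= context_limit

# Hypothetical 200k-token context window with 8,192 tokens reserved for output:
print(fits_in_context(1_000, 200_000, 8_192))    # True  — prompt fits
print(fits_in_context(195_000, 200_000, 8_192))  # False — would overflow
```

In practice you would pass `response.input_tokens` from the count_tokens call as the first argument and trim or summarize the prompt when the check fails.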

Use Cases

  • Estimate costs before sending inference requests
  • Optimize prompts to fit within token limits
  • Plan for token usage in your applications
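Because the count matches what would be charged for the same input in a generation request, it can feed a simple pre-flight cost estimate. A sketch, assuming per-million-token prices you would look up for your specific model; the rates below are placeholders, not real pricing:

```python
def estimate_cost_usd(input_tokens: int, expected_output_tokens: int,
                      input_price_per_mtok: float, output_price_per_mtok: float) -> float:
    """Rough request cost: token counts times per-million-token prices."""
    return (input_tokens * input_price_per_mtok
            + expected_output_tokens * output_price_per_mtok) / 1_000_000

# Placeholder rates ($3/MTok in, $15/MTok out) applied to a counted prompt:
print(estimate_cost_usd(12_000, 1_000, 3.0, 15.0))  # 0.051
```

Here `input_tokens` would come from the count_tokens response, while `expected_output_tokens` is your own estimate, since output length is not known until generation runs.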