
OpenAI Integration (Python)

Alpha (v0.1.0) — This package is implemented and tested but not yet published to PyPI. Install from source or wait for the public release. See the Python overview for setup instructions.
The mutagent-openai package provides a drop-in wrapper around the official OpenAI Python client that automatically traces all chat completion calls.

Installation

Once published to PyPI:
pip install mutagent-openai
This installs mutagent-openai along with its dependencies: mutagent-tracing (core SDK) and openai (>= 1.0.0).

Quick Start

1. Initialize tracing

from mutagent_tracing import init_tracing

init_tracing(api_key="your-mutagent-api-key")
2. Use MutagentOpenAI instead of OpenAI

from mutagent_openai import MutagentOpenAI

client = MutagentOpenAI(api_key="your-openai-key")

response = client.chat.completions.create(
    model="gpt-4",
    messages=[{"role": "user", "content": "Hello!"}],
)

print(response.choices[0].message.content)
Every call to client.chat.completions.create() is automatically traced and sent to MutagenT. No additional code changes required.

Full Example

import os
from mutagent_tracing import init_tracing, shutdown_tracing
from mutagent_openai import MutagentOpenAI

# Initialize MutagenT tracing
init_tracing(
    api_key=os.environ["MUTAGENT_API_KEY"],
    environment="production",
)

# Create the traced OpenAI client
client = MutagentOpenAI(api_key=os.environ["OPENAI_API_KEY"])

# Use exactly like the standard OpenAI client
response = client.chat.completions.create(
    model="gpt-4",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "What is the capital of France?"},
    ],
    temperature=0.7,
)

print(response.choices[0].message.content)

# Flush remaining spans on exit
shutdown_tracing()

Streaming

Streaming responses are fully supported. The wrapper accumulates streamed content and records the complete response in the span when the stream finishes.
stream = client.chat.completions.create(
    model="gpt-4",
    messages=[{"role": "user", "content": "Write a haiku about Python."}],
    stream=True,
)

for chunk in stream:
    # Some chunks (e.g. a final usage-only chunk) may have no choices
    if chunk.choices and chunk.choices[0].delta.content:
        print(chunk.choices[0].delta.content, end="", flush=True)
Streaming spans capture the accumulated output text after the stream completes. Token usage metrics are recorded when available from the model response.

What Gets Traced

Each call to chat.completions.create() generates a span with:

- Input Messages: all messages sent to the model (system, user, assistant)
- Output Messages: the model's response content
- Token Usage: input tokens, output tokens, and total tokens
- Model Info: model name and provider (openai)
- Latency: request duration in milliseconds
- Errors: error messages with stack traces on failure

Span Details

| Field | Description | Example |
| --- | --- | --- |
| kind | Span type | llm.chat |
| name | Model name | gpt-4 |
| input.messages | Input chat messages | System + user messages |
| output.messages | Response messages | Assistant response |
| metrics.model | Model identifier | gpt-4-0613 |
| metrics.provider | Provider name | openai |
| metrics.input_tokens | Prompt tokens | 150 |
| metrics.output_tokens | Completion tokens | 50 |
| metrics.total_tokens | Total tokens | 200 |
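Putting these fields together, a recorded span might look like the dictionary below. The exact wire format is internal to MutagenT; treat this shape as an illustrative sketch assembled from the field names in the table, with example values.

```python
# Hypothetical span payload, assembled from the documented field names.
span = {
    "kind": "llm.chat",
    "name": "gpt-4",
    "input": {
        "messages": [
            {"role": "system", "content": "You are a helpful assistant."},
            {"role": "user", "content": "What is the capital of France?"},
        ]
    },
    "output": {
        "messages": [
            {"role": "assistant", "content": "The capital of France is Paris."}
        ]
    },
    "metrics": {
        "model": "gpt-4-0613",
        "provider": "openai",
        "input_tokens": 150,
        "output_tokens": 50,
        "total_tokens": 200,
    },
}

# Token totals are internally consistent: total = input + output.
assert span["metrics"]["total_tokens"] == (
    span["metrics"]["input_tokens"] + span["metrics"]["output_tokens"]
)
```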

Constructor Options

MutagentOpenAI accepts the same parameters as the official openai.OpenAI client:
| Parameter | Type | Description |
| --- | --- | --- |
| api_key | str \| None | OpenAI API key (falls back to OPENAI_API_KEY env var) |
| organization | str \| None | OpenAI organization ID |
| base_url | str \| None | Custom API base URL |
| timeout | float \| None | Request timeout in seconds |
| max_retries | int \| None | Maximum retry count for failed requests |
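Because the wrapper accepts the same parameters as the underlying client, configuring it looks exactly like configuring openai.OpenAI. The snippet below sketches the forwarding pattern with a stand-in inner client so it runs on its own; the `InnerClient` stub is hypothetical, while the real wrapper delegates to openai.OpenAI.

```python
class InnerClient:
    """Stand-in for openai.OpenAI, so this sketch is self-contained."""

    def __init__(self, **kwargs):
        self.kwargs = kwargs


class TracedClient:
    """Drop-in wrapper: accepts the inner client's parameters unchanged
    and forwards them on construction."""

    def __init__(self, **kwargs):
        self._inner = InnerClient(**kwargs)


# Configure exactly as you would the plain client
client = TracedClient(api_key="sk-placeholder", timeout=30.0, max_retries=2)
print(client._inner.kwargs["timeout"])  # → 30.0
```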

Error Handling

When an API call fails, the span is automatically recorded with an ERROR status and the error message is captured:
try:
    response = client.chat.completions.create(
        model="gpt-4",
        messages=[{"role": "user", "content": "Hello!"}],
    )
except Exception as e:
    # Span is already recorded with status=ERROR
    print(f"Request failed: {e}")
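The record-on-failure behavior can be modeled as a small wrapper that marks the span ERROR and re-raises. This is a conceptual sketch of the pattern, not the mutagent implementation; `traced_call`, `record`, and the span dict are hypothetical.

```python
recorded = []


def record(span):
    """Stand-in exporter: collect finished spans."""
    recorded.append(span)


def traced_call(fn, *args, **kwargs):
    """Run fn, recording a span with OK or ERROR status, then re-raise on failure."""
    span = {"kind": "llm.chat", "status": None, "error": None}
    try:
        result = fn(*args, **kwargs)
        span["status"] = "OK"
        return result
    except Exception as exc:
        span["status"] = "ERROR"
        span["error"] = str(exc)
        raise  # the caller still sees the original exception
    finally:
        record(span)  # span is exported whether the call succeeded or failed


def failing_call():
    raise RuntimeError("rate limit exceeded")


try:
    traced_call(failing_call)
except RuntimeError:
    pass

print(recorded[0]["status"])  # → ERROR
```

The key property is that the span is recorded in a `finally` block, so tracing never swallows the exception and never loses the span.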

Nested Spans

MutagentOpenAI works with the core tracing SDK’s context propagation. If you create a parent span, OpenAI calls will automatically nest under it:
from mutagent_tracing import start_span, end_span, SpanOptions, SpanEndOptions, SpanKind, SpanStatus

# Create a parent span
parent = start_span(SpanOptions(kind=SpanKind.CHAIN, name="my-pipeline"))

# This call is automatically nested under the parent span
response = client.chat.completions.create(
    model="gpt-4",
    messages=[{"role": "user", "content": "Hello!"}],
)

# End the parent span
if parent:
    end_span(parent, SpanEndOptions(status=SpanStatus.OK))
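Under the hood, this nesting relies on context propagation: the active span is tracked in ambient context, and new spans adopt it as their parent. A minimal sketch of that mechanism using Python's contextvars follows; the span dicts and helper names are hypothetical, not the mutagent API.

```python
import contextvars
import itertools

# Ambient "current span" context, as contextvar-based tracing SDKs typically do
_current_span = contextvars.ContextVar("current_span", default=None)
_ids = itertools.count(1)


def start_span(name):
    """Open a span, adopting the ambient span (if any) as its parent."""
    span = {"id": next(_ids), "name": name, "parent_id": None}
    parent = _current_span.get()
    if parent is not None:
        span["parent_id"] = parent["id"]
    span["_token"] = _current_span.set(span)  # make this span the ambient one
    return span


def end_span(span):
    """Close a span, restoring the previous ambient span."""
    _current_span.reset(span.pop("_token"))


parent = start_span("my-pipeline")
child = start_span("llm.chat")  # automatically nests under "my-pipeline"
end_span(child)
end_span(parent)

print(child["parent_id"] == parent["id"])  # → True
```

Because the parent is discovered from context rather than passed explicitly, wrapped OpenAI calls nest correctly without any extra arguments.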

TypeScript Equivalent

For the TypeScript/Node.js OpenAI integration, see the TypeScript Integrations.