This page defines the tracing standards required to unlock important capabilities in LangSmith. Structuring your traces correctly gives you:
  1. A rich debugging experience: LangSmith provides structured rendering of message lists, making it easier to visualize and understand the interaction history.
  2. Access to LangSmith features: Features like Polly, LangSmith Fetch, and multi-turn evals require properly structured traces to work correctly.

Understanding Threads, Traces and Runs

LangSmith has three levels that work together to capture your agent’s behavior:

Threads

A Thread groups multiple interactions together so you can see the history over time. In the LangSmith UI, threads can be viewed in the “Threads” tab within a Tracing Project.
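For example, with the Python SDK you can group traces into a thread by attaching a shared metadata key to each trace; LangSmith recognizes session_id, thread_id, and conversation_id. A minimal sketch, where agent_turn is an illustrative stand-in for your own turn function:

import uuid

from langsmith import traceable

# One ID per conversation; every trace that carries it joins the same thread.
thread_id = str(uuid.uuid4())

@traceable(name="agent_turn")
def agent_turn(messages: list[dict]) -> dict:
    # ... run one turn of your agent here ...
    return {"messages": messages}

# Attach the thread metadata at call time via langsmith_extra.
agent_turn(
    [{"role": "user", "content": "What's the status of the Tokyo shipment?"}],
    langsmith_extra={"metadata": {"session_id": thread_id}},
)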

Traces

A Trace represents a single request/response cycle, also known as a “turn”. It contains everything that happened during one execution of an agent. Multiple traces, grouped together, form a thread. In the LangSmith UI, a trace is the “parent” node in the execution tree.

Runs

A Run is an individual operation within a trace: an LLM call, a tool execution, middleware, or any other step in your agent’s process. One or more runs make up a trace. In the LangSmith UI, runs are “children” and “grandchildren” (and onward as needed) nodes in the tree. If a trace fails, you would examine individual runs to identify which step had an error.
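To illustrate, nesting @traceable functions is one way this tree arises: the outermost call becomes the trace's root run, and each function it invokes is recorded as a child run. A sketch with illustrative function names:

from langsmith import traceable

@traceable(run_type="tool")
def lookup_tracking_number(shipment: str) -> str:
    # Child run (tool): placeholder result for illustration.
    return "JP-4411"

@traceable(run_type="llm")
def call_model(messages: list[dict]) -> dict:
    # Child run (llm): call your model provider here.
    return {"role": "assistant", "content": "It arrived at port."}

@traceable  # Parent: this call is the root of the trace.
def handle_turn(question: str) -> dict:
    lookup_tracking_number("Tokyo")
    return call_model([{"role": "user", "content": question}])

handle_turn("What's the status of the Tokyo shipment?")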
[Image: LangSmith UI showing the threads table.]

Trace Structure

Structure your traces so each one is self-contained and replayable:
  • Input: The new message or instruction. For example: What’s the status of the Tokyo shipment?
  • Output: The state of the interaction after the turn, i.e. the agent’s current memory (interaction history + new messages) once the turn is complete. For example: What’s the status of the Tokyo shipment? -> Checked the shipping logs -> Found the Tokyo tracking number -> Confirmed it arrived at port -> Told the user.
The benefit of this approach is that any trace can be understood independently, without needing to search for prior context. Both the inputs and outputs of a trace must have messages as a top-level key that represents the interaction.
Input:
{
  "messages": [
    // User's current request
  ],
  "additional_fields":{
    // Optional additional fields
  }
}
Output:
{
  "messages": [
    // History from prior turns + user request + LLM response
  ],
  "additional_fields":{
    // Optional additional fields
  }
}
You can think of the output as the interaction state after the turn, which is why outputs should include the full message history, not just the latest response. The output is a complete “receipt” of what happened in that turn. View Threads for more information on how to configure threads and the expected trace structure within a thread.
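As a sketch of this shape (names are illustrative), a turn function can accept the prior history plus the new message under a top-level messages key and return the full, updated history the same way:

from langsmith import traceable

@traceable(name="agent_turn")
def agent_turn(inputs: dict) -> dict:
    # Inputs and outputs both keep "messages" as the top-level key.
    messages = inputs["messages"]
    # ... call the model / tools; a faked assistant reply for illustration ...
    reply = {"role": "assistant", "content": [{"type": "text", "text": "It arrived at port."}]}
    # The output is the full interaction state after the turn, not just the reply.
    return {"messages": messages + [reply]}

history = [{"role": "user", "content": [{"type": "text", "text": "What's the status of the Tokyo shipment?"}]}]
agent_turn({"messages": history})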

Message Format Standards

In the above section, we covered the expected structure of traces. This section covers the expected structure of messages within a trace.
If you don’t log your LLM traces in the suggested formats, you will still be able to log the data to LangSmith, but it may not be processed or rendered in expected ways, and you will not be able to use some features, like Polly, LangSmith Fetch and multi-turn evals.

Required Structure

The messages key should follow the LangChain, OpenAI Chat Completions, or Anthropic messages formats. If you’re using other models, or tracing a custom model, you’ll need to modify the structure of the messages array to follow one of the supported schemas for best results.
If you’re using LangChain OSS to call language models or LangSmith wrappers (OpenAI, Anthropic), this formatting is handled automatically.
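For instance, wrapping an OpenAI client with the LangSmith wrapper traces each call as an LLM run with messages already in a supported format (the model name below is illustrative):

from langsmith.wrappers import wrap_openai
from openai import OpenAI

# Every chat call on the wrapped client becomes a traced LLM run
# with inputs/outputs recorded in a supported message format.
client = wrap_openai(OpenAI())

client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "What's the capital of France?"}],
)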

Schema

Examples

inputs = {
  "messages": [
    {
      "role": "user",
      "content": [
        {
          "type": "text",
          "text": "What's the capital of France?"
        }
      ]
    }
  ]
}

outputs = {
  "messages": [
    {
      "role": "assistant",
      "content": [
        {
          "type": "text",
          "text": "The capital of France is Paris."
        },
        {
          "type": "reasoning",
          "text": "This is a straightforward geography question..."
        }
      ]
    }
  ]
}

FAQ

How do I log a custom input or output format?

If you’re using a custom input or output format, you can transform it into a LangSmith-compatible structure by passing process_inputs and process_outputs to the @traceable decorator. For details, refer to the Log LLM calls page.
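A minimal sketch of that approach, assuming a custom format with prompt and completion fields (the field names and transform functions are illustrative):

from langsmith import traceable

def to_langsmith_inputs(inputs: dict) -> dict:
    # Custom input format -> {"messages": [...]}.
    return {"messages": [{"role": "user", "content": inputs["prompt"]}]}

def to_langsmith_outputs(outputs: dict) -> dict:
    # Custom output format -> {"messages": [...]}.
    return {"messages": [{"role": "assistant", "content": outputs["completion"]}]}

@traceable(
    run_type="llm",
    process_inputs=to_langsmith_inputs,
    process_outputs=to_langsmith_outputs,
)
def call_llm(prompt: str) -> dict:
    # ... call your model with the custom format ...
    return {"completion": "The capital of France is Paris."}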
How should I trace streaming LLM calls?

Log the final accumulated result after streaming completes:
from langsmith import traceable
from openai import OpenAI

client = OpenAI()

@traceable(run_type="llm")
def call_llm_streaming(messages):
    chunks = []

    # Stream the response, accumulating text deltas as they arrive
    for chunk in client.chat.completions.create(
        model="gpt-4",
        messages=messages,
        stream=True
    ):
        chunks.append(chunk.choices[0].delta.content or "")

    # Log the complete response once streaming finishes
    full_text = "".join(chunks)
    return {
        "role": "assistant",
        "content": [{"type": "text", "text": full_text}]
    }
Don’t log individual chunks—log the complete message once streaming finishes.
How do I log tool calls?

Include all tool calls and their results in the message sequence:
outputs = {
  "messages": [
    {
      "role": "assistant",
      "content": [
        {"type": "tool_call", "name": "search", "args": {...}, "id": "call_1"},
        {"type": "tool_call", "name": "calculate", "args": {...}, "id": "call_2"}
      ]
    },
    {
      "role": "tool",
      "tool_call_id": "call_1",
      "content": [{"type": "text", "text": "Search results..."}]
    },
    {
      "role": "tool",
      "tool_call_id": "call_2",
      "content": [{"type": "text", "text": "Calculation result..."}]
    },
    {
      "role": "assistant",
      "content": [{"type": "text", "text": "Based on the search and calculation..."}]
    }
  ]
}
The tool_call_id links each result to its corresponding call.
How do I control what trace data is stored, and for how long?

Traces are stored according to retention policies, which can only be modified if you are on self-hosted LangSmith. To limit what gets stored:
  1. Use references instead of raw data: Store files externally and use the id field, as in the sketch after this list.
  2. Filter before logging: LangSmith provides multiple approaches to protecting your data before it’s sent to the backend.
  3. Configure Trace Deletion: Set up trace deletion rules.
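A sketch of item 1: log a lightweight reference to externally stored content rather than the raw bytes. The content-block fields below are illustrative, not a fixed LangSmith schema:

# Instead of inlining megabytes of raw file content in the trace...
outputs = {
    "messages": [
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "Summarize the attached report."},
                # ...log a reference to the externally stored file.
                {"type": "file", "id": "s3://my-bucket/reports/q3.pdf"},
            ],
        }
    ]
}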
