fix: account for cached prompt tokens in OTEL spans #452

Draft
simonvdk-mistral wants to merge 5 commits into main from hydra/OBS-1423/session-a63fceb33e32

Conversation

@simonvdk-mistral
Contributor

Summary

  • emit gen_ai.usage.cache_read.input_tokens when cached prompt token counts are present in usage payloads
  • support the current payload shapes exposed by the SDK and API (prompt_tokens_details.cached_tokens, prompt_token_details.cached_tokens, and num_cached_tokens)
  • add non-streaming and streaming OTEL regression tests for cached token usage
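The three payload shapes listed above can be handled by a single extraction helper. The sketch below is illustrative only: the helper name and the plain-dict usage payload are assumptions, not the SDK's actual code.

```python
from typing import Optional


def extract_cached_prompt_tokens(usage: dict) -> Optional[int]:
    """Return the cached prompt token count from a raw usage payload, if any."""
    # Nested detail objects: both spellings have been observed in payloads.
    for key in ("prompt_tokens_details", "prompt_token_details"):
        details = usage.get(key)
        if isinstance(details, dict):
            cached = details.get("cached_tokens")
            if isinstance(cached, int):
                return cached
    # Legacy top-level field, used only as a fallback.
    num_cached = usage.get("num_cached_tokens")
    if isinstance(num_cached, int):
        return num_cached
    return None
```

Returning None when no cache-token field is present lets the caller skip emitting the span attribute entirely for endpoints that do not report cached tokens.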

Testing

  • uv run pytest src/mistralai/extra/tests/test_otel_tracing.py -k 'cached_prompt_tokens or num_cached_tokens or streaming_chat_completion_enriches_span or test_chat_completion_basic'
  • uv run ruff check src/mistralai/extra/observability/otel.py src/mistralai/extra/tests/test_otel_tracing.py

Linear: OBS-1423

Co-authored-by: mistral-hydra[bot] <mistral-hydra[bot]@users.noreply.github.com>
Comment on lines +275 to +282
if isinstance(prompt_token_details, dict):
    cached_tokens = prompt_token_details.get("cached_tokens")
    if isinstance(cached_tokens, int):
        return cached_tokens

num_cached_tokens = usage.get("num_cached_tokens")
if isinstance(num_cached_tokens, int):
    return num_cached_tokens
Contributor Author
How did you arbitrate the priority between the two (prompt token details and number of cached tokens)?

Contributor Author

I made the priority explicit in code in b6cce3d: prefer prompt_tokens_details.cached_tokens when present, and only fall back to top-level num_cached_tokens for payloads that expose the legacy field instead.

Contributor Author

Ok, maybe the spec is not settled yet. The UsageInfo model I linked in the Linear ticket is specific to a voice endpoint, and all the other endpoints (chat completion, conversation, etc.) do not define the cached-token attributes in their models yet.

Let's put this PR on hold for a bit; I will come back to it later.

Contributor Author

Understood. I am not making a follow-up code change from this comment. The current branch only records gen_ai.usage.cache_read.input_tokens when the raw usage payload actually contains one of the cache-token fields, so endpoints whose generated models do not expose those fields today remain unaffected. I will leave the PR here and wait for your follow-up on whether you want to keep or revert this behavior once the models/spec settle.
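The conditional behavior described above can be sketched as follows. This is a hedged illustration, not the branch's actual code: the helper name is hypothetical, and span stands for any object with an OTEL-style set_attribute method.

```python
def record_cache_read_tokens(span, usage: dict) -> None:
    """Set gen_ai.usage.cache_read.input_tokens only when the payload has it."""
    cached = None
    details = usage.get("prompt_tokens_details") or usage.get("prompt_token_details")
    if isinstance(details, dict) and isinstance(details.get("cached_tokens"), int):
        cached = details["cached_tokens"]
    elif isinstance(usage.get("num_cached_tokens"), int):
        cached = usage["num_cached_tokens"]
    if cached is not None:
        # Payloads without any cache-token field never reach this line,
        # so spans from endpoints lacking the field remain unaffected.
        span.set_attribute("gen_ai.usage.cache_read.input_tokens", cached)
```

This is why endpoints whose generated models do not expose the fields today see no change in their emitted spans.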

mistral-hydra bot added 4 commits March 27, 2026 16:50