fix: resolve architectural gaps in praisonai wrapper (fixes #1623) by praisonai-triage-agent[bot] · Pull Request #1624 · MervinPraison/PraisonAI

praisonai-triage-agent · 2026-05-07T08:17:02Z

Fixes #1623

Summary

This PR addresses the 3 critical architectural gaps identified in the wrapper layer:

Issue 1: LLM Registry Empty Built-in Providers ✅

Problem: _register_builtin_providers() was a no-op, causing create_llm_provider("openai/gpt-4o") to fail
Solution: Added LiteLLM-backed providers for openai, anthropic, google with proper aliases
Result: Public API now works as documented

Issue 2: DB Adapter Silent Stubs ✅

Problem: get_runs() and get_traces() returned empty lists instead of reading state store data
Solution: Implemented proper state store scanning with fallback for legacy stores
Result: Methods now return actual persisted data with proper error handling

Issue 3: LangfuseSink Span Correlation Issues ✅

Problem: TOOL_END picked last matching span instead of proper LIFO correlation for parallel tools
Solution: Replaced UUID-based matching with stack-based LIFO correlation per (agent, tool) pair
Result: Parallel tool execution now correctly correlates start/end events

Additional Improvements

Replaced print statements with structured logging in LangfuseSink
Added proper warning messages when state store is not configured
All fixes maintain backward compatibility and follow protocol-driven architecture

Testing

Validated all 3 issues were present as reported
Tested fixes with example calls showing successful operation
All files pass syntax checks

Generated with Claude Code

Summary by CodeRabbit

Release Notes

Bug Fixes
- Run history retrieval now returns persisted session records from the configured state store instead of empty results.
New Features
- Built-in language model providers are now registered and available with common aliases upon initialization.
Improvements
- Enhanced tool execution tracking and error reporting in observability layer.

- Fix Issue 1: LLM registry now registers built-in providers (openai, anthropic, google) with LiteLLM backend - Fix Issue 2: DB adapter get_runs() and get_traces() now read from state store instead of returning empty lists - Fix Issue 3: LangfuseSink uses LIFO stack correlation for proper parallel tool span matching and replaces print with logging All fixes maintain backward compatibility and follow protocol-driven architecture per AGENTS.md. Co-authored-by: MervinPraison <MervinPraison@users.noreply.github.com>

MervinPraison · 2026-05-07T08:17:16Z

@coderabbitai review

MervinPraison · 2026-05-07T08:17:17Z

/review

qodo-code-review · 2026-05-07T08:17:21Z

ⓘ You've reached your Qodo monthly free-tier limit. Reviews pause until next month — upgrade your plan to continue now, or link your paid account if you already have one.

coderabbitai · 2026-05-07T08:17:23Z

✅ Actions performed

Review triggered.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

coderabbitai · 2026-05-07T08:17:47Z

Important

Review skipped

Bot user detected.

To trigger a single review, invoke the @coderabbitai review command.

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 410130cc-b88a-44e0-a14b-fba346836a81

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Use the checkbox below for a quick retry:

🔍 Trigger review

📝 Walkthrough

Walkthrough

This PR implements three independent wrapper gaps identified in issue #1623: PraisonAIDB.get_runs now retrieves and returns persisted run records from the state store; the LLM registry registers built-in providers (OpenAI, Anthropic, Google) via a LiteLLM wrapper; Langfuse observability improves tool span correlation using LIFO stacks and replaces print-based error handling with structured logging.

Changes

Run Persistence Access

Layer / File(s)	Summary
Run Record Retrieval `src/praisonai/praisonai/db/adapter.py`	`get_runs()` now scans state-store keys by `run:{session_id}:` prefix, loads matching records, sorts by `started_at` descending, and returns results capped by optional `limit` instead of always returning `[]`.

LLM Registry Initialization

Layer / File(s)	Summary
Built-in Provider Registration `src/praisonai/praisonai/llm/registry.py`	`_register_builtin_providers` now defines a lazy LiteLLM-backed provider wrapper and registers factories for `openai`, `anthropic`, and `google` with common aliases (`oai`, `claude`, `gemini`, `google_genai`); removes the prior no-op `pass` placeholder.

Observability & Tool Span Improvements

Layer / File(s)	Summary
State Infrastructure `src/praisonai/praisonai/observability/langfuse.py`	Adds imports for `defaultdict` and dataclass utilities; introduces `_tool_stacks` to track in-flight tool spans via `(agent_key, tool_name)` keyed LIFO stacks, replacing UUID-based per-invocation lookup.
Tool Lifecycle Correlation `src/praisonai/praisonai/observability/langfuse.py`	Tool span creation pushes spans onto `(agent_key, tool_name)` stacks; finalization pops LIFO to correlate `TOOL_END` and update metadata/status/duration, eliminating O(n) `_spans` scan logic.
Error Handling `src/praisonai/praisonai/observability/langfuse.py`	Replaces `print()` error reporting in `emit`, tool finalization, and `flush` with `logger.exception()` for structured, observable error tracking.

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~20 minutes

Possibly related PRs

MervinPraison/PraisonAI#1461: Both PRs modify LangfuseSink internals—changing span/tool correlation and event handling (e.g., _spans/_tool_stacks and tool lifecycle logic).
MervinPraison/PraisonAI#1597: Both PRs modify src/praisonai/praisonai/llm/registry.py in complementary ways—the main PR adds built-in provider registrations while the retrieved PR changes registry singleton/locking behavior.

Poem

🐰 Three gaps now closed with care so neat,
Runs persist where once was but retreat,
LLMs now have built-in friends,
Tool spans stack 'til work suspends,
Logs replace the print, complete!

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 72.22% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (4 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The PR title clearly and concisely identifies the main change—fixing architectural gaps in the praisonai wrapper by addressing LLM registry, DB adapter, and LangfuseSink issues as specified in issue `#1623`.
Linked Issues check	✅ Passed	All three objectives from issue `#1623` have been directly addressed: LLM registry now registers built-in providers with LiteLLM, get_runs() retrieves persisted state, and LangfuseSink uses LIFO stack-based tool correlation with structured logging.
Out of Scope Changes check	✅ Passed	All changes are directly scoped to the three architectural gaps specified in `#1623`; no unrelated modifications or scope creep detected across the affected files.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch claude/issue-1623-20260507-0811

Tip

💬 Introducing Slack Agent: The best way for teams to turn conversations into code.

Slack Agent is built on CodeRabbit's deep understanding of your code, so your team can collaborate across the entire SDLC without losing context.

Generate code and open pull requests
Plan features and break down work
Investigate incidents and troubleshoot customer tickets together
Automate recurring tasks and respond to alerts with triggers
Summarize progress and report instantly

Built for teams:

Shared memory across your entire org—no repeating context
Per-thread sandboxes to safely plan and execute work
Governance built-in—scoped access, auditability, and budget controls

One agent for your entire SDLC. Right inside Slack.

👉 Get started

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

MervinPraison · 2026-05-07T08:18:01Z

@copilot Do a thorough review of this PR. Read ALL existing reviewer comments above from Qodo, Coderabbit, and Gemini first — incorporate their findings.

Review areas:

Bloat check: Are changes minimal and focused? Any unnecessary code or scope creep?
Security: Any hardcoded secrets, unsafe eval/exec, missing input validation?
Performance: Any module-level heavy imports? Hot-path regressions?
Tests: Are tests included? Do they cover the changes adequately?
Backward compat: Any public API changes without deprecation?
Code quality: DRY violations, naming conventions, error handling?
Address reviewer feedback: If Qodo, Coderabbit, or Gemini flagged valid issues, include them in your review
Suggest specific improvements with code examples where possible

greptile-apps · 2026-05-07T08:20:38Z

Greptile Summary

This PR fixes three architectural gaps in the PraisonAI wrapper: activating built-in LLM providers via LiteLLM-backed factories, implementing get_runs()/get_traces() with real state-store scanning, and replacing UUID-keyed tool span lookups with a LIFO stack for correct parallel-tool correlation in LangfuseSink.

db/adapter.py: on_run_start() now writes to the state store before the _conversation_store guard, so state-only deployments persist run records; get_runs() and get_traces() scan the store with proper sorting and limit=0 handled correctly via max(limit, 0).
llm/registry.py: _register_builtin_providers() now registers openai/anthropic/google factories with LiteLLM backing; the provider config key is filtered from completion() kwargs.
observability/langfuse.py: Tool span correlation switched from UUID-keyed _spans entries to _tool_stacks[(agent_key, tool_name)] lists; close() and _handle_agent_end() both drain in-flight stacks on teardown, though close() contains a redundant second cleanup loop after the first _tool_stacks.clear().

Confidence Score: 5/5

Safe to merge; all three fixes are narrowly scoped, backward-compatible, and covered by new unit tests.

The DB adapter and registry changes are straightforward and well-tested. The LangfuseSink LIFO refactor is correct; the only defect is a dead-code duplicate cleanup loop in close() that has no runtime impact.

langfuse.py — dead-code duplicate _tool_stacks loop in close() is harmless but worth cleaning up before it causes confusion in future edits.

Important Files Changed

Filename	Overview
src/praisonai/praisonai/db/adapter.py	Fixed write-side gap in on_run_start() so state is written even when _conversation_store is absent; implemented get_runs() and get_traces() with scan_prefix fallback, sorting, and correct limit=0 handling.
src/praisonai/praisonai/llm/registry.py	Replaced the no-op _register_builtin_providers() with LiteLLM-backed factories for openai/anthropic/google; provider key is correctly filtered out of completion kwargs.
src/praisonai/praisonai/observability/langfuse.py	Replaced UUID-keyed spans with a LIFO stack per (agent_key, tool_name) for correct parallel-tool correlation; close() gains _tool_stacks cleanup but contains a dead-code duplicate loop after the first clear().
src/praisonai/tests/unit/test_db_adapter_runs_traces.py	New unit tests for get_runs() and get_traces() covering sorting, session/user filtering, and limit=0 edge case.
src/praisonai/tests/unit/test_langfuse_sink.py	Tests updated to use the new _tool_stacks structure; bridge test now asserts span.end() is called rather than a weaker length check.
src/praisonai/tests/unit/llm/test_provider_registry.py	Registry tests updated to expect built-in providers present at init and to use unique names to avoid collision with the newly registered openai/anthropic/google entries.

Sequence Diagram

sequenceDiagram
    participant Agent
    participant PraisonAIDB
    participant StateStore
    participant LangfuseSink

    Agent->>PraisonAIDB: "on_agent_start() -> _init_stores()"
    PraisonAIDB->>StateStore: initialize
    Agent->>PraisonAIDB: on_run_start(session_id, run_id)
    PraisonAIDB->>StateStore: set(run:session:run_id, data)
    Agent->>LangfuseSink: emit(AGENT_START)
    LangfuseSink->>LangfuseSink: create root span in _spans[agent_key]
    Agent->>LangfuseSink: emit(TOOL_START, tool_name)
    LangfuseSink->>LangfuseSink: push to _tool_stacks[(agent_key, tool_name)]
    Agent->>LangfuseSink: emit(TOOL_END, tool_name)
    LangfuseSink->>LangfuseSink: pop from _tool_stacks, span.update().end()
    Agent->>LangfuseSink: emit(AGENT_END)
    LangfuseSink->>LangfuseSink: drain dangling stacks, end root span
    Agent->>PraisonAIDB: on_run_end(session_id, run_id)
    PraisonAIDB->>StateStore: update run status to completed
    Note over PraisonAIDB,StateStore: get_runs() scans run:session:* keys

_{Reviews (3): Last reviewed commit: "fix: tighten wrapper edge cases and alig..." | Re-trigger Greptile}

coderabbitai

Actionable comments posted: 8

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@src/praisonai/praisonai/db/adapter.py`:
- Around line 349-352: In get_runs (and likewise in get_traces) wrap each
per-key read from self._state_store.get(k) in a try/except so a single backend
error or malformed payload doesn't abort the whole loop: call
self._state_store.get(k) inside a try block, catch and log the exception,
continue to the next key; after a successful read, perform a type/check (e.g.,
ensure run_data is a dict or expected Run/Trace type and contains required
fields) before appending to runs, and skip/log entries that fail validation so
only valid records are added.
- Line 355: The slice logic incorrectly treats limit=0 as "no limit"; update the
conditional that currently uses "if limit" to an explicit None check (e.g., "if
limit is not None") so that return runs[:limit] is used when limit==0; apply
this change to the occurrences in adapter.py that use "return runs[:limit] if
limit else runs" (including the instance around line 355 and the similar one
near line 542) and ensure the variable names 'runs' and 'limit' are preserved.

In `@src/praisonai/praisonai/llm/registry.py`:
- Around line 280-286: The except ImportError block that re-raises a new
ImportError should preserve the original traceback by chaining the exception;
modify the import block that attempts "import litellm" in registry.py so that
you catch the ImportError as e and re-raise the new ImportError(...) from e
(i.e., use "except ImportError as e: raise ImportError(... ) from e") to retain
the original context.
- Around line 280-286: The ImportError raised when lazy-importing litellm
discards the original traceback; update the except block to capture the caught
exception (e.g., except ImportError as e) and re-raise the new ImportError using
exception chaining (raise ImportError("LiteLLM is required for built-in
providers. Install with: pip install litellm") from e) so the original error
context is preserved when importing litellm in registry.py.
- Around line 288-292: The call to litellm.completion in the registry (the
return in the method that builds the LiteLLM request) incorrectly forwards
self.config which always contains a "provider" key, causing downstream APIs to
receive a stray provider kwarg and return 400s; fix by making a copy of
self.config with the "provider" key removed (e.g., filter out "provider" from
self.config) and then merge that sanitized config with kwargs when calling
litellm.completion (reference the litellm.completion invocation and self.config
in registry.py to locate the change).
- Around line 288-292: The call to litellm.completion in the method leaks
self.config which always contains a "provider" key, causing TypeError; before
forwarding kwargs to litellm.completion (the call that uses model=full_model or
self.model_id), strip out the "provider" entry from self.config (e.g., copy
self.config and pop("provider") or build a new dict excluding that key) and then
pass that cleaned config plus **kwargs; if you need to convey provider info use
the model string (self.model_id/full_model) or pass it via an explicit
custom_llm_provider param instead of letting "provider" be forwarded.

In `@src/praisonai/praisonai/observability/langfuse.py`:
- Around line 77-80: The close() implementation currently ends only root spans
in self._spans but leaves in-flight tool spans stored in self._tool_stacks,
causing orphaned observations and retained references; update close() in the
Langfuse observability class to iterate over self._tool_stacks (keys are tuples
of (agent_key, tool_name)) and for each stacked span end/close any remaining
spans (using the same span-end method used for tool exit), then clear the stack
and finally clear self._tool_stacks and self._spans; ensure thread-safety by
performing this cleanup while holding self._lock and reuse existing span-ending
logic to preserve consistency.
- Around line 210-223: The current _handle_tool_end uses (agent_key, tool_name)
and stack.pop(), which mis-correlates concurrent calls of the same tool; change
correlation to first try matching by a unique call identifier on the event
(e.g., event.call_id or similar) stored with the span when pushing into
self._tool_stacks entries, and only fall back to LIFO by tool name if no
matching call_id entry is found; update where spans are pushed (the code that
appends to self._tool_stacks[(agent_key, tool_name)]) to store the call_id with
the span and modify _handle_tool_end to locate and remove the span by call_id
before falling back to pop().

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 1560d2e2-d23e-48f5-b90d-a7a3050fea24

📥 Commits

Reviewing files that changed from the base of the PR and between e33b4b4 and f6659b6.

📒 Files selected for processing (3)

src/praisonai/praisonai/db/adapter.py
src/praisonai/praisonai/llm/registry.py
src/praisonai/praisonai/observability/langfuse.py

coderabbitai · 2026-05-07T08:21:02Z

+        for k in keys:
+            run_data = self._state_store.get(k)
+            if run_data:
+                runs.append(run_data)


⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Guard store reads so one bad record doesn’t fail the whole query.

A backend/read error (or malformed payload) in a single key currently aborts the entire get_runs/get_traces call. Add per-key exception handling and type checks to keep reads resilient.

Suggested fix

runs = [] for k in keys: - run_data = self._state_store.get(k) - if run_data: - runs.append(run_data) + try: + run_data = self._state_store.get(k) + except Exception as exc: + logger.warning("Failed to load run key '%s': %s", k, exc) + continue + if isinstance(run_data, dict): + runs.append(run_data)

traces = [] for k in keys: - trace_data = self._state_store.get(k) - if trace_data: - traces.append(trace_data) + try: + trace_data = self._state_store.get(k) + except Exception as exc: + logger.warning("Failed to load trace key '%s': %s", k, exc) + continue + if isinstance(trace_data, dict): + traces.append(trace_data)

Also applies to: 532-535

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@src/praisonai/praisonai/db/adapter.py` around lines 349 - 352, In get_runs (and likewise in get_traces) wrap each per-key read from self._state_store.get(k) in a try/except so a single backend error or malformed payload doesn't abort the whole loop: call self._state_store.get(k) inside a try block, catch and log the exception, continue to the next key; after a successful read, perform a type/check (e.g., ensure run_data is a dict or expected Run/Trace type and contains required fields) before appending to runs, and skip/log entries that fail validation so only valid records are added.

coderabbitai · 2026-05-07T08:21:02Z

+                runs.append(run_data)
+
+        runs.sort(key=lambda r: r.get("started_at", 0), reverse=True)
+        return runs[:limit] if limit else runs


⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

Handle limit=0 correctly in slice logic.

Current truthy check treats limit=0 as “no limit” and returns all results. Use an explicit None check.

Suggested fix

- return runs[:limit] if limit else runs + return runs[:limit] if limit is not None else runs

- return traces[:limit] if limit else traces + return traces[:limit] if limit is not None else traces

Also applies to: 542-542

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@src/praisonai/praisonai/db/adapter.py` at line 355, The slice logic incorrectly treats limit=0 as "no limit"; update the conditional that currently uses "if limit" to an explicit None check (e.g., "if limit is not None") so that return runs[:limit] is used when limit==0; apply this change to the occurrences in adapter.py that use "return runs[:limit] if limit else runs" (including the instance around line 355 and the similar one near line 542) and ensure the variable names 'runs' and 'limit' are preserved.

coderabbitai · 2026-05-07T08:21:02Z

+            try:
+                import litellm  # lazy
+            except ImportError:
+                raise ImportError(
+                    "LiteLLM is required for built-in providers. "
+                    "Install with: pip install litellm"
+                )


⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

Missing exception chaining in except clause (Ruff B904).

Re-raising inside an except block without from loses the original traceback context.

🛠️ Proposed fix

- except ImportError: - raise ImportError( - "LiteLLM is required for built-in providers. " - "Install with: pip install litellm" - ) + except ImportError as err: + raise ImportError( + "LiteLLM is required for built-in providers. " + "Install with: pip install litellm" + ) from err

🧰 Tools

🪛 Ruff (0.15.12)

[warning] 283-286: Within an except clause, raise exceptions with raise ... from err or raise ... from None to distinguish them from errors in exception handling

(B904)

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@src/praisonai/praisonai/llm/registry.py` around lines 280 - 286, The except ImportError block that re-raises a new ImportError should preserve the original traceback by chaining the exception; modify the import block that attempts "import litellm" in registry.py so that you catch the ImportError as e and re-raise the new ImportError(...) from e (i.e., use "except ImportError as e: raise ImportError(... ) from e") to retain the original context.

⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

Missing exception chaining in except clause (Ruff B904).

Re-raising inside an except block without from discards the original traceback.

🛠️ Proposed fix

- except ImportError: - raise ImportError( - "LiteLLM is required for built-in providers. " - "Install with: pip install litellm" - ) + except ImportError as err: + raise ImportError( + "LiteLLM is required for built-in providers. " + "Install with: pip install litellm" + ) from err

🧰 Tools

🪛 Ruff (0.15.12)

[warning] 283-286: Within an except clause, raise exceptions with raise ... from err or raise ... from None to distinguish them from errors in exception handling

(B904)

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@src/praisonai/praisonai/llm/registry.py` around lines 280 - 286, The ImportError raised when lazy-importing litellm discards the original traceback; update the except block to capture the caught exception (e.g., except ImportError as e) and re-raise the new ImportError using exception chaining (raise ImportError("LiteLLM is required for built-in providers. Install with: pip install litellm") from e) so the original error context is preserved when importing litellm in registry.py.

coderabbitai · 2026-05-07T08:21:02Z

+            return litellm.completion(
+                model=full_model or self.model_id,
+                messages=[{"role": "user", "content": prompt}],
+                **{**self.config, **kwargs},
+            )


⚠️ Potential issue | 🟠 Major | ⚡ Quick win

provider key forwarded to downstream provider API, causing 400 errors at runtime.

self.config always contains {"provider": "<prefix>"} (set via cfg.setdefault in _make_litellm_factory). Spreading **self.config into litellm.completion(...) passes provider as a kwarg. LiteLLM assumes any non-OpenAI param is provider-specific and passes it in the request body as a kwarg — so provider="openai" gets forwarded verbatim to the OpenAI/Anthropic/Google API, which will reject it with a 400 error on every generate() call.

Strip the internal provider key before forwarding config to LiteLLM:

🐛 Proposed fix

- full_model = f"{self.config.get('provider', '')}/{self.model_id}".strip("/") - return litellm.completion( - model=full_model or self.model_id, - messages=[{"role": "user", "content": prompt}], - **{**self.config, **kwargs}, - ) + provider_prefix = self.config.get("provider", "") + full_model = f"{provider_prefix}/{self.model_id}".strip("/") or self.model_id + passthrough_config = {k: v for k, v in self.config.items() if k != "provider"} + return litellm.completion( + model=full_model, + messages=[{"role": "user", "content": prompt}], + **{**passthrough_config, **kwargs}, + )

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@src/praisonai/praisonai/llm/registry.py` around lines 288 - 292, The call to litellm.completion in the registry (the return in the method that builds the LiteLLM request) incorrectly forwards self.config which always contains a "provider" key, causing downstream APIs to receive a stray provider kwarg and return 400s; fix by making a copy of self.config with the "provider" key removed (e.g., filter out "provider" from self.config) and then merge that sanitized config with kwargs when calling litellm.completion (reference the litellm.completion invocation and self.config in registry.py to locate the change).

⚠️ Potential issue | 🟠 Major | ⚡ Quick win

🧩 Analysis chain

🌐 Web query:

Does litellm.completion accept a provider keyword argument?

💡 Result:

No. liteLLM’s completion() does not define or document a provider keyword argument; instead, it uses custom_llm_provider (and/or the provider prefix in model) to choose the backend provider. In the published completion() signature, the parameter list includes custom_llm_provider: Optional[str] = None and then kwargs, but it does not include any provider parameter name. [1] So passing provider=... would not be an officially supported argument for litellm.completion(); if you need to select a provider, use model (e.g., openai/gpt-4o) and/or custom_llm_provider=.... [1][2]

Citations:

1: https://docs.litellm.ai/completion/input

2: https://berriai-litellm.mintlify.app/advanced/custom-providers

provider key leaked into litellm.completion kwargs will cause TypeError at runtime.

self.config always contains {"provider": "<prefix>", ...} (set via cfg.setdefault("provider", provider_prefix) in the factory). Spreading **self.config into litellm.completion(...) passes provider as a keyword argument, which LiteLLM does not accept — every call to generate() will raise a TypeError.

The provider key should be excluded before forwarding config to litellm.completion. Use the model string (e.g., "openai/gpt-4o") to specify the provider, or pass custom_llm_provider if needed, but do not pass provider as a direct kwarg.

🐛 Proposed fix

- full_model = f"{self.config.get('provider', '')}/{self.model_id}".strip("/") - return litellm.completion( - model=full_model or self.model_id, - messages=[{"role": "user", "content": prompt}], - **{**self.config, **kwargs}, - ) + provider_prefix = self.config.get("provider", "") + full_model = f"{provider_prefix}/{self.model_id}".strip("/") if provider_prefix else self.model_id + passthrough_config = {k: v for k, v in self.config.items() if k != "provider"} + return litellm.completion( + model=full_model, + messages=[{"role": "user", "content": prompt}], + **{**passthrough_config, **kwargs}, + )

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@src/praisonai/praisonai/llm/registry.py` around lines 288 - 292, The call to litellm.completion in the method leaks self.config which always contains a "provider" key, causing TypeError; before forwarding kwargs to litellm.completion (the call that uses model=full_model or self.model_id), strip out the "provider" entry from self.config (e.g., copy self.config and pop("provider") or build a new dict excluding that key) and then pass that cleaned config plus **kwargs; if you need to convey provider info use the model string (self.model_id/full_model) or pass it via an explicit custom_llm_provider param instead of letting "provider" be forwarded.

coderabbitai · 2026-05-07T08:21:02Z

+        self._spans: Dict[str, Any] = {}   # agent_key -> root span
+        # (agent_key, tool_name) -> stack of in-flight tool spans
+        self._tool_stacks: Dict[Tuple[str, str], List[Any]] = defaultdict(list)
        self._lock = threading.Lock()


⚠️ Potential issue | 🟠 Major | ⚡ Quick win

_tool_stacks lifecycle is not cleaned up on shutdown.

close() ends root spans only; in-flight tool spans in _tool_stacks are never ended/cleared. This can leave orphaned observations and retained references.

Proposed fix in close()

with self._lock: + for stack in self._tool_stacks.values(): + while stack: + try: + stack.pop().end() + except Exception: + pass + self._tool_stacks.clear() for span in self._spans.values(): try: span.end() except Exception: pass

Also applies to: 299-307

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@src/praisonai/praisonai/observability/langfuse.py` around lines 77 - 80, The close() implementation currently ends only root spans in self._spans but leaves in-flight tool spans stored in self._tool_stacks, causing orphaned observations and retained references; update close() in the Langfuse observability class to iterate over self._tool_stacks (keys are tuples of (agent_key, tool_name)) and for each stacked span end/close any remaining spans (using the same span-end method used for tool exit), then clear the stack and finally clear self._tool_stacks and self._spans; ensure thread-safety by performing this cleanup while holding self._lock and reuse existing span-ending logic to preserve consistency.

coderabbitai · 2026-05-07T08:21:02Z

+        # Push to stack for LIFO correlation
+        self._tool_stacks[(agent_key, tool_name)].append(tool_span)

    def _handle_tool_end(self, event: ActionEvent, agent_name: str) -> None:
        """Handle TOOL_END -> end tool span observation."""
        agent_key = f"{event.agent_id or agent_name}-{agent_name}"
        tool_name = event.tool_name or "unknown-tool"
-
-        # Find the most recent matching tool span
-        tool_key = None
-        for key in self._spans:
-            if key.startswith(f"{agent_key}:{tool_name}:") and key != agent_key:
-                tool_key = key
-
-        if not tool_key:
+        stack = self._tool_stacks.get((agent_key, tool_name))
+        if not stack:
            return
-
-        tool_span = self._spans.pop(tool_key, None)
-        if tool_span:
+        tool_span = stack.pop()
+        if not stack:
+            self._tool_stacks.pop((agent_key, tool_name), None)
+        try:


⚠️ Potential issue | 🟠 Major | ⚡ Quick win

LIFO-by-name can mis-correlate concurrent same-tool calls.

Using (agent_key, tool_name) + stack.pop() breaks if two overlapping calls of the same tool finish out of order; TOOL_END may close the wrong span.

Proposed fix (prefer call-id correlation, fallback to LIFO)

- # Push to stack for LIFO correlation - self._tool_stacks[(agent_key, tool_name)].append(tool_span) + tool_call_id = None + if event.metadata: + tool_call_id = event.metadata.get("tool_call_id") + if tool_call_id: + # self._tool_spans_by_id: Dict[str, Any] + self._tool_spans_by_id[tool_call_id] = tool_span + else: + # Fallback when no explicit id exists + self._tool_stacks[(agent_key, tool_name)].append(tool_span)

- stack = self._tool_stacks.get((agent_key, tool_name)) - if not stack: - return - tool_span = stack.pop() - if not stack: - self._tool_stacks.pop((agent_key, tool_name), None) + tool_call_id = event.metadata.get("tool_call_id") if event.metadata else None + if tool_call_id and tool_call_id in self._tool_spans_by_id: + tool_span = self._tool_spans_by_id.pop(tool_call_id) + else: + stack = self._tool_stacks.get((agent_key, tool_name)) + if not stack: + return + tool_span = stack.pop() + if not stack: + self._tool_stacks.pop((agent_key, tool_name), None)

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@src/praisonai/praisonai/observability/langfuse.py` around lines 210 - 223, The current _handle_tool_end uses (agent_key, tool_name) and stack.pop(), which mis-correlates concurrent calls of the same tool; change correlation to first try matching by a unique call identifier on the event (e.g., event.call_id or similar) stored with the span when pushing into self._tool_stacks entries, and only fall back to LIFO by tool name if no matching call_id entry is found; update where spans are pushed (the code that appends to self._tool_stacks[(agent_key, tool_name)]) to store the call_id with the span and modify _handle_tool_end to locate and remove the span by call_id before falling back to pop().

MervinPraison · 2026-05-07T08:27:56Z

@claude You are the FINAL architecture reviewer. If the branch is under MervinPraison/PraisonAI (not a fork), you are able to make modifications to this branch and push directly. SCOPE: Focus ONLY on Python packages (praisonaiagents, praisonai). Do NOT modify praisonai-rust or praisonai-ts. Read ALL comments above from Gemini, Qodo, CodeRabbit, and Copilot carefully before responding.

Phase 1: Review per AGENTS.md

Protocol-driven: check heavy implementations vs core SDK
Backward compatible: ensure zero feature regressions
Performance: no hot-path regressions

Phase 2: FIX Valid Issues
4. For any VALID bugs or architectural flaws found by Gemini, CodeRabbit, Qodo, Copilot, or any other reviewer: implement the fix
5. Push all code fixes directly to THIS branch (do NOT create a new PR)
6. Comment a summary of exact files modified and what you skipped

Phase 3: Final Verdict
7. If all issues are resolved, approve the PR / close the Issue
8. If blocking issues remain, request changes / leave clear action items

praisonai-triage-agent · 2026-05-07T08:28:17Z

- Fix LiteLLM provider TypeError by stripping 'provider' key from config - Fix DB adapter write-side gap allowing state store writes without conversation store - Fix tool stacks cleanup on LangfuseSink shutdown - Fix limit=0 logic error in get_runs/get_traces - Add exception chaining for ImportError - Add error handling for state store reads Co-authored-by: Mervin Praison <MervinPraison@users.noreply.github.com>

Agent-Logs-Url: https://github.com/MervinPraison/PraisonAI/sessions/62fe6186-39b5-43e2-85e3-491cc09facdf Co-authored-by: MervinPraison <454862+MervinPraison@users.noreply.github.com>

Copilot · 2026-05-07T08:39:31Z

@copilot Do a thorough review of this PR. Read ALL existing reviewer comments above from Qodo, Coderabbit, and Gemini first — incorporate their findings.

Review areas:...

Completed a full review and applied focused fixes in 45955e1 (plus 790a0b4).

Bloat: scope remains limited to the 3 touched wrapper modules + directly related tests.
Security: no secrets/eval/exec paths added; CodeQL found no alerts.
Performance: preserved lazy LiteLLM loading and kept heavy deps out of module-level imports.
Tests: added/updated coverage for built-in provider behavior, Langfuse stack correlation, and DB adapter run/trace retrieval limits/filtering; targeted suite passes (69 passed).
Backward compat: no public API removals/renames; behavior now aligns with documented provider creation.
Code quality: improved error handling/logging around Langfuse cleanup and provider kwargs filtering.
Reviewer feedback: incorporated actionable CodeRabbit items; Qodo had no findings (limit message), and there were no substantive Gemini comments to action.

No UI changes were made in this PR.

praisonai-triage-agent Bot mentioned this pull request May 7, 2026

Wrapper gaps: LLM registry has no built-ins, DB run/trace reads are silent stubs, Langfuse parallel tool spans mis-match #1623

Open

Copilot started work on behalf of MervinPraison May 7, 2026 08:18 View session

greptile-apps Bot reviewed May 7, 2026

View reviewed changes

Comment thread src/praisonai/praisonai/llm/registry.py Outdated

Comment thread src/praisonai/praisonai/db/adapter.py Outdated

Comment thread src/praisonai/praisonai/db/adapter.py Outdated

coderabbitai Bot reviewed May 7, 2026

View reviewed changes

praisonai-triage-agent Bot force-pushed the claude/issue-1623-20260507-0811 branch from 300ecb7 to e386411 Compare May 7, 2026 08:31

Copilot AI and others added 2 commits May 7, 2026 08:38

test: align wrapper tests and harden provider/langfuse edge cases

790a0b4

Agent-Logs-Url: https://github.com/MervinPraison/PraisonAI/sessions/62fe6186-39b5-43e2-85e3-491cc09facdf Co-authored-by: MervinPraison <454862+MervinPraison@users.noreply.github.com>

fix: tighten wrapper edge cases and align unit coverage

45955e1

Agent-Logs-Url: https://github.com/MervinPraison/PraisonAI/sessions/62fe6186-39b5-43e2-85e3-491cc09facdf Co-authored-by: MervinPraison <454862+MervinPraison@users.noreply.github.com>

Copilot finished work on behalf of MervinPraison May 7, 2026 08:40

Copilot AI requested a review from MervinPraison May 7, 2026 08:40

Uh oh!

Conversation

praisonai-triage-agent Bot commented May 7, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Issue 1: LLM Registry Empty Built-in Providers ✅

Issue 2: DB Adapter Silent Stubs ✅

Issue 3: LangfuseSink Span Correlation Issues ✅

Additional Improvements

Testing

Summary by CodeRabbit

Release Notes

Uh oh!

MervinPraison commented May 7, 2026

Uh oh!

MervinPraison commented May 7, 2026

Uh oh!

qodo-code-review Bot commented May 7, 2026

Uh oh!

coderabbitai Bot commented May 7, 2026

Uh oh!

coderabbitai Bot commented May 7, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review skipped

Walkthrough

Changes

Estimated code review effort

Possibly related PRs

Poem

❌ Failed checks (1 warning)

Uh oh!

MervinPraison commented May 7, 2026

Uh oh!

greptile-apps Bot commented May 7, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Greptile Summary

Confidence Score: 5/5

Important Files Changed

Sequence Diagram

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot May 7, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot May 7, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot May 7, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot May 7, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot May 7, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot May 7, 2026

Choose a reason for hiding this comment

Uh oh!

MervinPraison commented May 7, 2026

Uh oh!

praisonai-triage-agent Bot commented May 7, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

FINAL Architecture Review & Fix Implementation ✅

✅ FINAL VERDICT: APPROVED WITH FIXES APPLIED

Architecture Review Summary ✅

Critical Issues Fixed 🔧

Commit Applied 📝

Testing Recommendations 🧪

Uh oh!

Copilot AI commented May 7, 2026

Uh oh!

Reviewers

Assignees

Labels

praisonai-triage-agent Bot commented May 7, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented May 7, 2026 •

edited

Loading

greptile-apps Bot commented May 7, 2026 •

edited

Loading

praisonai-triage-agent Bot commented May 7, 2026 •

edited

Loading