
Add Langfuse AutoGen Tracing to example #104

Open
SajidAlamQB wants to merge 10 commits into main from feat/add-autogen-support-to-langfusetrace

Conversation

Contributor

@SajidAlamQB commented Jan 28, 2026

Related PR: kedro-org/kedro-plugins#1288

This PR adds an example that uses LangfuseTraceDataset with mode: autogen to trace AutoGen agent conversations.

For the credentials:

openai:
  base_url: "https://api.openai.com/v1"
  api_key: "sk-..."

langfuse_credentials:
  public_key: "pk-..."
  secret_key: "sk-..."
  endpoint: "https://cloud.langfuse.com/api/public/otel/v1/traces"

Dependencies:
Install the following manually, as the dataset change is not released yet:
uv pip install opentelemetry-sdk opentelemetry-exporter-otlp-proto-http

Run example:
kedro run --pipeline autogen --params user_id=3

Contributor

Copilot AI left a comment


Pull request overview

This PR adds Langfuse tracing support for AutoGen agent conversations using OpenTelemetry. It enables tracing of the AutoGen-based response generation pipeline by introducing a setup node that configures OpenTelemetry with a Langfuse span processor, and wrapping agent execution in custom spans to capture input/output attributes.

Changes:

  • Added setup_autogen_tracing function to configure OpenTelemetry tracing with Langfuse
  • Modified generate_response to create tracing spans around agent execution with relevant attributes
  • Updated requirements.txt to include the langfuse-langfusetracedataset-autogen extra for kedro-datasets
  • Added autogen_tracer_langfuse dataset configuration in genai-config.yml

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 3 comments.

  • kedro-agentic-workflows/src/kedro_agentic_workflows/pipelines/response_generation_autogen/pipeline.py: Added setup_tracing_node to the pipeline and a tracer_provider input to generate_response_node
  • kedro-agentic-workflows/src/kedro_agentic_workflows/pipelines/response_generation_autogen/nodes.py: Implemented the setup_autogen_tracing function and added OpenTelemetry span instrumentation to generate_response
  • kedro-agentic-workflows/requirements.txt: Added the langfuse-langfusetracedataset-autogen extra to the kedro-datasets dependency
  • kedro-agentic-workflows/conf/base/genai-config.yml: Added the autogen_tracer_langfuse dataset configuration and updated comments to include autogen mode


Contributor

Copilot AI left a comment


Copilot encountered an error and was unable to review this pull request. You can try again by re-requesting a review.

Contributor

@ElenaKhaustova left a comment


It doesn't seem to be working for me 😕

  1. I've run pip install ".[langfuse-langfusetracedataset-autogen]" from the feat/add-autogen-support-to-langfusetrace branch
  2. Installed the project requirements
  3. Double-checked that I have everything in place:
langfuse==3.12.1
opentelemetry-api==1.39.1
opentelemetry-exporter-otlp-proto-common==1.39.1
opentelemetry-exporter-otlp-proto-http==1.39.1
opentelemetry-proto==1.39.1
opentelemetry-sdk==1.39.1
opentelemetry-semantic-conventions==0.60b1

And I still get an error:

DatasetError: AutoGen mode requires langfuse with OpenTelemetry support. Install with: pip install 'langfuse[opentelemetry]'

@SajidAlamQB, can you please provide step-by-step installation instructions so I can try once again?

@SajidAlamQB
Contributor Author

@SajidAlamQB, can you please provide step-by-step installation instructions so I can try once again?

Hi @ElenaKhaustova, thanks for the review. I've fixed the installation issue.

Installation instructions:

In kedro-academy:

pip install -r requirements.txt 
# Install kedro from main branch instead
pip install git+https://github.com/kedro-org/kedro.git

In kedro-datasets:

cd kedro-datasets
pip install -e ".[langfuse-langfusetracedataset-autogen]"

Then in kedro-academy run:

kedro run --pipeline autogen --params user_id=3

I've verified traces appear correctly in the Langfuse dashboard:

[Screenshot: traces in the Langfuse dashboard]

Contributor

@ElenaKhaustova left a comment


Thank you, @SajidAlamQB, this works for me now.

When testing, I've noticed the following warning. Do we understand why it is raised?

[Screenshot: warning raised during the run]

@ravi-kumar-pilla
Contributor

Hi @SajidAlamQB, I tried running the example in a Python 3.12 env and there seem to be some issues with pydantic. Did you try this example on Python 3.12? Is there a specific pydantic version we need to install?

DatasetError: An exception occurred when parsing config for dataset 'llm':
The `__modify_schema__` method is not supported in Pydantic v2. Use `__get_pydantic_json_schema__` instead 
in class `SecretStr`.

For further information visit https://errors.pydantic.dev/2.12/u/custom-json-schema

Thank you

@SajidAlamQB
Contributor Author

Thank you, @SajidAlamQB, this works for me now.

When testing, I've noticed the following warning. Do we understand why it is raised?

It comes from OpenTelemetry, because we always create a new provider. We should check whether a provider is already set, and only create a new one if needed.

@SajidAlamQB
Contributor Author

Hi @SajidAlamQB, I tried running the example in a Python 3.12 env and there seem to be some issues with pydantic. Did you try this example on Python 3.12? Is there a specific pydantic version we need to install?

Thanks @ravi-kumar-pilla, I have only tried with 3.11. This doesn't really sound specific to our dataset; I think there are some known Pydantic v2 compatibility issues in LangChain/AutoGen. Maybe we pin pydantic below 2.10, but let me play around with it.

@SajidAlamQB
Contributor Author

SajidAlamQB commented Feb 4, 2026

Hi @SajidAlamQB, I tried running the example in a Python 3.12 env and there seem to be some issues with pydantic. Did you try this example on Python 3.12? Is there a specific pydantic version we need to install?

Hey @ravi-kumar-pilla, I tested on Python 3.12 and wasn't able to recreate your issue:

pip show pydantic langchain-core:

Name: pydantic
Version: 2.12.5
---
Name: langchain-core
Version: 0.3.83

What version of langchain-core were you using?

@ravi-kumar-pilla
Contributor

What version of langchain-core were you using?

Name: langchain-core
Version: 0.3.80


Name: pydantic
Version: 2.12.5

@ravi-kumar-pilla
Contributor

Hi Sajid, I tried with langchain-core 0.3.83 as well and still have issues. I will try to test with a new environment and let you know how it goes. Thank you

@ravi-kumar-pilla
Contributor

Hi @SajidAlamQB ,

The pydantic issue is resolved. I am getting a new error:

DatasetError: Failed while loading data from dataset kedro_agentic_workflows.datasets.autogen_model_client.OpenAIChatCompletionClientDataset(openai_api_base='***', openai_api_key='***', model='gpt-4o', temperature=0.0).
The api_key client option must be set either by passing api_key to the client or by setting the OPENAI_API_KEY environment variable

We usually set the credentials like below, I believe. Is there a change in how this is configured? Should I pass api_key?

openai:
  openai_api_base: ""
  openai_api_key: ""
  
langfuse_credentials:
  public_key: ""
  secret_key: ""
  host: ""
  endpoint: ""
  openai:
    openai_api_base: ""
    openai_api_key: ""

@ElenaKhaustova
Contributor

@SajidAlamQB

It runs for me without issues on both Python 3.11 and Python 3.12.

However, I've noticed a difference in how the execution graph looks right now:

[Screenshot: current execution graph in Langfuse]

And this is how it looked before, when I tested it a few days ago:

[Screenshot: previous execution graph in Langfuse]

Any idea why this is the case? The example remains the same, so the difference must come from changes on the dataset side.

@SajidAlamQB
Contributor Author

DatasetError: Failed while loading data from dataset kedro_agentic_workflows.datasets.autogen_model_client.OpenAIChatCompletionClientDataset(openai_api_base='***', openai_api_key='***', model='gpt-4o', temperature=0.0).
The api_key client option must be set either by passing api_key to the client or by setting the OPENAI_API_KEY environment variable

We usually set the credentials like below I believe. Is there a change in how this is configured ? Should I pass api_key ?

This is resolved now; I've added a credentials mapping.
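For illustration, a mapping like the following (a hypothetical helper, not the PR's actual code) would accept both the OpenAI-default keys and the legacy openai_* names discussed above:

```python
def map_openai_credentials(creds: dict) -> dict:
    """Normalize credential keys to the names the OpenAI client expects.

    Hypothetical sketch: accepts either base_url/api_key (OpenAI defaults)
    or openai_api_base/openai_api_key (the older convention).
    """
    return {
        "api_key": creds.get("api_key") or creds.get("openai_api_key"),
        "base_url": creds.get("base_url") or creds.get("openai_api_base"),
    }
```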

@SajidAlamQB
Contributor Author

Any idea why this is the case? The example remains the same, so the difference must come from changes on the dataset side.

My runs have always had the same visualisation as the image at the top of your post. The trace data is identical, with the same spans and hierarchy, but the graph layout difference might just be Langfuse rendering; I don't think it's related to any dataset changes.

@ElenaKhaustova
Contributor

@ravi-kumar-pilla, @SajidAlamQB

After we moved langchain.ChatOpenAIDataset from experimental to kedro-datasets, we follow the format that OpenAI supports by default, which is

openai:
    base_url: <openai-api-base>  # Optional, defaults to OpenAI default
    api_key: <openai-api-key>   # Optional if OPENAI_API_KEY is set

https://docs.kedro.org/projects/kedro-datasets/en/kedro-datasets-9.1.1/api/kedro_datasets/langchain.ChatOpenAIDataset/#kedro_datasets.langchain.ChatOpenAIDataset--example-usage-for-the-yaml-api

I would suggest keeping it, since it's a demo project and we don't want to overload the logic with key swapping.

@ElenaKhaustova
Contributor

@SajidAlamQB

My runs have always had the same visualisation

Did they look the same with the openlit approach?

https://langfuse.com/integrations/frameworks/autogen

@ravi-kumar-pilla
Contributor

ravi-kumar-pilla commented Feb 5, 2026

https://docs.kedro.org/projects/kedro-datasets/en/kedro-datasets-9.1.1/api/kedro_datasets/langchain.ChatOpenAIDataset/#kedro_datasets.langchain.ChatOpenAIDataset--example-usage-for-the-yaml-api

Hi @ElenaKhaustova, thanks for letting me know. I looked at our LangfuseTraceDataset implementation for other modes like openai, and we have this pattern there. So I asked @SajidAlamQB if we can stick to the same pattern (not in the example starter here, but in kedro-datasets).

Since we now follow what OpenAI recommends, we need to update the usage across LangfuseTraceDataset for the other modes as well. Maybe in this PR or a separate one, @SajidAlamQB.

Thank you

@ravi-kumar-pilla
Contributor

Hi @SajidAlamQB ,

It works well. Thank you

I also see the graph issue Elena mentioned, but the tracing is working fine.

[Screenshot: execution graph in Langfuse]

@ElenaKhaustova
Contributor

@ravi-kumar-pilla

we need to update the usage across LangfuseTraceDataset for other modes as well

Yes, it's a good point! I'll do it in a separate PR.

@SajidAlamQB
Contributor Author

@SajidAlamQB

My runs have always had the same visualisation

Did they look the same with the openlit approach?

https://langfuse.com/integrations/frameworks/autogen

@ElenaKhaustova Just tried with OpenLit; it didn't have that render issue:

[Screenshot: execution graph with OpenLit]

Should we switch Langfuse AutoGen back to OpenLit for the better trace visualisation? It would simplify the code but differ from the Opik implementation (Opik has no OpenLit equivalent).

@ElenaKhaustova
Contributor

Should we switch Langfuse AutoGen back to OpenLit for the better trace visualisation? It would simplify the code but differ from the Opik implementation (Opik has no OpenLit equivalent).

Yes, I'm definitely for it, so we sacrifice alignment, not user experience.

@SajidAlamQB
Contributor Author

SajidAlamQB commented Feb 5, 2026

Switched from raw OTLP tracing to OpenLit for AutoGen after discussing with @ElenaKhaustova.

Also split configs and requirements so Langfuse and Opik tutorials can be installed independently (they have conflicting openai version pins — OpenLit requires <2.0, Opik requires >=2.x).
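For illustration, the split could be sketched like this. The requirements-langfuse.txt name appears in the install steps below; requirements-opik.txt and the exact package lists are assumptions, with only the openai constraints taken from the conflict described above:

```text
# requirements-langfuse.txt (OpenLit-based tracing path)
openai<2.0
openlit

# requirements-opik.txt (hypothetical name; Opik-based tracing path)
openai>=2.0
opik
```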

Installation:

Install from Academy:
pip install -r requirements-langfuse.txt

If testing against local kedro-datasets changes:
cd /path/to/kedro-plugins/kedro-datasets
pip install -e ".[langfuse-langfusetracedataset-autogen]"

Run:
(put credentials into base)
kedro run --pipeline autogen --params user_id=3 --env langfuse

Tested on both Python 3.11 and 3.12.

@ElenaKhaustova @ravi-kumar-pilla ready for re-review when you get a chance.

@ravi-kumar-pilla
Contributor

Hi @SajidAlamQB, it works fine, but the execution graph still looks like below, and I followed the steps you mentioned here.

[Screenshot: execution graph in Langfuse]

@SajidAlamQB
Contributor Author

Hi @SajidAlamQB, it works fine, but the execution graph still looks like below, and I followed the steps you mentioned here.

Very strange; the rendering just seems intermittent.

@ElenaKhaustova
Contributor

@SajidAlamQB

I followed your steps from above, and running this

kedro run --pipeline autogen --params user_id=3 --env langfuse

gives me the following error now:
[Screenshot: error traceback]

It looks like it's coming from this line (https://github.com/kedro-org/kedro-plugins/pull/1288/changes)

openlit.init(tracer=langfuse._otel_tracer, disable_batch=True, disable_metrics=True, disabled_instrumentors=["httpx"])

I tested with Python 3.11 and had the following library versions:

autogen-agentchat==0.7.5
autogen-core==0.7.5
autogen-ext==0.7.5
langfuse==3.12.1
openai==1.109.1
openlit==1.36.8

Based on where we are now, I see the following next steps to proceed:

  1. If the above error is fixed (perhaps it's the openlit version?), then I can test once again if rendering looks fine
  2. We need to look through the issues in the langfuse's repo/ google if someone else is experiencing the same issue with the rendering, and if there's a workaround
  3. Perhaps we can open an issue in their repo to get some response
  4. Based on 1, 2 and 3, we'll decide how to proceed and with what solution.

@SajidAlamQB
Contributor Author

  1. If the above error is fixed (perhaps it's the openlit version?), then I can test once again if rendering looks fine

Yes, it's the new openlit version 1.36.8, released 5 hours ago, that broke this. 1.36.7 works; I can pin it for now.
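As a sketch, the pin could look like the following requirements line; the exact constraint style is an assumption, beyond holding openlit at 1.36.7:

```text
openlit==1.36.7  # 1.36.8 breaks openlit.init(tracer=...) with Langfuse
```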

@ElenaKhaustova
Contributor

1.36.7

Yeah, pinning the version works, but the rendering looks the same as what @ravi-kumar-pilla posted above 😕

[Screenshot: execution graph in Langfuse]

@ElenaKhaustova
Contributor

In their tutorial (https://langfuse.com/integrations/frameworks/autogen), they say

Now, we initialize the OpenLit instrumentation. OpenLit automatically captures AutoGen operations and exports OpenTelemetry (OTel) spans to Langfuse.

and they don't wrap the runs as we do:

 with tracer.start_as_current_span("response_generation") as span:

Does it look the same if following their example?

Can we please look through the issues in Langfuse's repo to see if someone else is experiencing the same rendering issue and whether there's a workaround?

@SajidAlamQB
Contributor Author

Can we please look through the issues in Langfuse's repo to see if someone else is experiencing the same rendering issue and whether there's a workaround?

I've looked through Langfuse issues and found several related ones:

#9427: "The langfuse graph is not plotting correctly": the exact same issue. Their response was: "This is a known limitation with Langfuse's agent graph visualization... The graph view is still in beta and doesn't always reflect the true execution flow for complex or nested agent setups"

#10721: "LangGraph CallbackHandler Observations Do Not Nest Under Active Parent Span": a similar, related nesting issue.

There were a few other related ones as well.

Our trace structure is correct; the JSON has the correct structure. The issue is Langfuse's graph rendering, which is in beta and has known bugs. No workaround exists on our side, unfortunately.

@SajidAlamQB
Contributor Author

SajidAlamQB commented Feb 6, 2026

 with tracer.start_as_current_span("response_generation") as span:

Does it look the same if following their example?

Can we please look through the issues in Langfuse's repo to see if someone else is experiencing the same rendering issue and whether there's a workaround?

I tested this already. Without the manual start_as_current_span wrapper, the result is worse: each AutoGen operation becomes a separate trace at depth 0.

@ElenaKhaustova
Contributor

@SajidAlamQB, thanks for looking into this!

Let's then return to the original OTLP solution and add a note that there may be rendering issues.
