feat(datasets): Extend `LangfuseTraceDataset` to support AutoGen tracing by SajidAlamQB · Pull Request #1288 · kedro-org/kedro-plugins

SajidAlamQB · 2026-01-20T14:52:39Z

Description

Related to: #1276

To test for QA, use the kedro-academy example: kedro-org/kedro-academy#104

Adds an autogen mode to LangfuseTraceDataset, enabling OpenTelemetry-based tracing for AutoGen agent pipelines via Langfuse's OTLP endpoint.

Development notes

Added autogen mode to LangfuseTraceDataset that returns a configured OpenTelemetry Tracer
_build_autogen_tracer() sets up an OTLP exporter
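
As a rough illustration of the exporter setup noted above, the sketch below builds the endpoint URL and Basic-auth header that an OTLP/HTTP span exporter would need in order to send spans to Langfuse. The helper name `build_langfuse_otlp_config`, its parameters, the example keys, and the default host are assumptions for illustration and are not taken from this PR. The `/api/public/otel/v1/traces` path and the `public_key:secret_key` Basic-auth scheme follow Langfuse's OpenTelemetry documentation as I understand it, so check them against your deployment.

```python
import base64


def build_langfuse_otlp_config(
    public_key: str,
    secret_key: str,
    host: str = "https://cloud.langfuse.com",  # assumed default; self-hosted setups differ
) -> dict:
    """Return endpoint and headers for an OTLP/HTTP span exporter (hypothetical helper)."""
    # Langfuse authenticates OTLP requests with HTTP Basic auth over "public:secret".
    token = base64.b64encode(f"{public_key}:{secret_key}".encode()).decode()
    return {
        "endpoint": f"{host.rstrip('/')}/api/public/otel/v1/traces",
        "headers": {"Authorization": f"Basic {token}"},
    }


# Placeholder keys, for illustration only.
config = build_langfuse_otlp_config("pk-lf-example", "sk-lf-example")
print(config["endpoint"])
```

A dict like this could then be passed as keyword arguments to `OTLPSpanExporter` from `opentelemetry.exporter.otlp.proto.http.trace_exporter` and attached to a `TracerProvider` through a `BatchSpanProcessor`. Whether `_build_autogen_tracer()` does exactly this is not shown in this thread.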

Checklist

Opened this PR as a 'Draft Pull Request' if it is work-in-progress
Updated the documentation to reflect the code changes
Updated jsonschema/kedro-catalog-X.XX.json if necessary
Added a description of this change in the relevant RELEASE.md file
Added tests to cover my changes
Received approvals from at least half of the TSC (required for adding a new, non-experimental dataset)

Signed-off-by: Sajid Alam <sajid_alam@mckinsey.com>

ElenaKhaustova

Left a small comment; other than that, the implementation looks good 👍

Could you please also open a PR in the academy project applying autogen mode to this pipeline https://github.com/kedro-org/kedro-academy/tree/main/kedro-agentic-workflows/src/kedro_agentic_workflows/pipelines/response_generation_autogen so it is easy for reviewers to test?

Also don't forget to update RELEASE.md

ravi-kumar-pilla · 2026-01-26T17:43:57Z

Hi @SajidAlamQB, the implementation looks good. It would be nice to have some QA steps or, as Elena mentioned, some way to test this out. Thank you

ElenaKhaustova

Need some help to clarify how to install: kedro-org/kedro-academy#104 (review)

ElenaKhaustova

Thank you, @SajidAlamQB, changes made make sense to me! I left a suggestion regarding the implementation.

I tested it with the academy project, and it works now. I left a question regarding the warning produced: kedro-org/kedro-academy#104 (review)

And another general question is regarding the OTLP approach we chose. Is it because we try to align with the autogen mode implementation for OpikTraceDataset? Otherwise, this approach (https://langfuse.com/integrations/frameworks/autogen) looks much easier and requires only configuration through Langfuse, as we already do for other modes.

I also wonder what the difference is between those two approaches in terms of the end result, and if you had a chance to explore it?

SajidAlamQB · 2026-02-03T15:49:40Z

> And another general question is regarding the OTLP approach we chose. Is it because we try to align with the autogen mode implementation for OpikTraceDataset? Otherwise, this approach (https://langfuse.com/integrations/frameworks/autogen) looks much easier and requires only configuration through Langfuse, as we already do for other modes.
>
> I also wonder what the difference is between those two approaches in terms of the end result, and if you had a chance to explore it?

Yes, the main reason for the OTLP approach was to stay consistent with Opik, which didn't have an equivalent integration, so its autogen mode uses OTLP directly.

I think for the initial implementation it makes sense to keep OTLP for consistency, but we could explore adding an openlit mode or enhancing the autogen mode for Langfuse specifically in a follow-up if those other features are needed.

For the endpoint, that makes sense; I'll make it configurable.

ElenaKhaustova · 2026-02-03T15:57:49Z

@SajidAlamQB

> what the difference is between those two approaches in terms of the end result?

I mean, is there any notable difference at all, aside from the configuration?

SajidAlamQB · 2026-02-03T16:01:53Z

> @SajidAlamQB
>
> > what the difference is between those two approaches in terms of the end result?
>
> I mean, is there any notable difference at all, aside from the configuration?

The OpenLit approach just gives more detailed traces out of the box, but otherwise there's not much difference tbh.

ElenaKhaustova

Thank you, @SajidAlamQB!

I've unresolved the comment about the endpoint as it does not seem to be solved. Also added a few suggestions on how it can be done.

ravi-kumar-pilla · 2026-02-05T02:36:40Z

Hi @SajidAlamQB ,

The code looks good and it works well with the test project in kedro-academy. To be consistent, we need to change how credentials are handled in the other modes (either in this PR or in a separate one).

Thank you

ankatiyar

Code looks good overall, I'll let Elena and Ravi do the final approvals :)

SajidAlamQB · 2026-02-09T14:56:43Z

Hey team, this PR went through a few different iterations, so just to make it clear:

We explored two approaches for AutoGen tracing with Langfuse:

Approach 1: OpenLit (attempted, reverted)
Tried using OpenLit as shown in Langfuse's AutoGen tutorial.

Trace hierarchy was breaking without manual spans: without wrapping agent calls in tracer.start_as_current_span(), each AutoGen operation became a separate trace at depth 0 instead of being nested under a parent.

Graph visualisation issues: Even with the correct trace hierarchy, Langfuse's graph view renders multi-agent workflows incorrectly. This is a known Langfuse limitation (see issues below).
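
To make the trace-hierarchy point above concrete, here is a toy model of how a "current span" context turns separate operations into children of one parent. It is a stdlib-only stand-in and not the OpenTelemetry API: this `start_as_current_span` is a hypothetical re-implementation that only records nesting depth, and the agent and span names are made up for illustration.

```python
import contextvars
from contextlib import contextmanager

# Toy stand-in for OpenTelemetry's "current span" context; not the real API.
_current_depth = contextvars.ContextVar("current_depth", default=-1)
recorded = []  # (span name, depth) pairs in start order


@contextmanager
def start_as_current_span(name):
    """Record a span one level below whatever span is currently active."""
    depth = _current_depth.get() + 1
    recorded.append((name, depth))
    token = _current_depth.set(depth)
    try:
        yield
    finally:
        # Restore the previous "current span" when this one ends.
        _current_depth.reset(token)


def agent_call(name):
    """Hypothetical agent operation that emits its own span."""
    with start_as_current_span(name):
        pass


# Without a parent span, every operation starts its own root (depth 0).
agent_call("planner")
agent_call("responder")
unwrapped = list(recorded)

# Wrapped in a parent span, the same operations nest at depth 1.
recorded.clear()
with start_as_current_span("response_generation"):
    agent_call("planner")
    agent_call("responder")
wrapped = list(recorded)

print(unwrapped)
print(wrapped)
```

In the unwrapped run both agent calls land at depth 0, which mirrors the "separate trace at depth 0" symptom. In the wrapped run they sit at depth 1 under the parent.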

Approach 2: OTLP (current implementation)
Reverted to direct OpenTelemetry OTLP export, which has no additional dependencies beyond opentelemetry-sdk and opentelemetry-exporter-otlp-proto-http.

Provides a stable API, aligns with the Opik setup, and produces the correct trace structure.

I've added a note in the docstring that Langfuse's graph visualisation is in beta and may not render complex multi-agent workflows correctly. Also opened an issue on their side:

langfuse/langfuse#11941

Other related issues:

langfuse/langfuse#9427
langfuse/langfuse#10721
langfuse/langfuse#9648

ElenaKhaustova

Thank you, @SajidAlamQB, implementation looks good to me!

One minor thing that I've noticed is that docs are not rendered properly:

add autogen support for langfuse trace

769a5bc

Signed-off-by: Sajid Alam <sajid_alam@mckinsey.com>

SajidAlamQB changed the title ~~Extend LangfuseTraceDataset to support AutoGen tracing~~ feat(datasets): Extend LangfuseTraceDataset to support AutoGen tracing Jan 20, 2026

SajidAlamQB marked this pull request as ready for review January 21, 2026 15:51

ElenaKhaustova reviewed Jan 22, 2026

Comment thread kedro-datasets/kedro_datasets_experimental/langfuse/langfuse_trace_dataset.py

ElenaKhaustova requested review from ankatiyar and ravi-kumar-pilla January 22, 2026 10:21

SajidAlamQB and others added 2 commits January 23, 2026 12:50

Merge branch 'main' into feat/add-auto-gen-support-to-langfuse

64346cf

changes based on review

90d0e4d

Signed-off-by: Sajid Alam <sajid_alam@mckinsey.com>

SajidAlamQB requested a review from ElenaKhaustova January 28, 2026 15:02

Update pyproject.toml

9dcde34

Signed-off-by: Sajid Alam <sajid_alam@mckinsey.com>

SajidAlamQB mentioned this pull request Jan 28, 2026

Add Langfuse AutoGen Tracing to example kedro-org/kedro-academy#104

Open

ElenaKhaustova reviewed Jan 29, 2026

Comment thread kedro-datasets/pyproject.toml

ElenaKhaustova reviewed Jan 29, 2026

Comment thread kedro-datasets/kedro_datasets_experimental/langfuse/langfuse_trace_dataset.py Outdated

ElenaKhaustova reviewed Jan 29, 2026

Comment thread kedro-datasets/kedro_datasets_experimental/langfuse/langfuse_trace_dataset.py Outdated

ElenaKhaustova reviewed Jan 29, 2026

SajidAlamQB added 8 commits January 30, 2026 09:31

Update pyproject.toml

980b170

Signed-off-by: Sajid Alam <sajid_alam@mckinsey.com>

Update pyproject.toml

31bfa3a

Signed-off-by: Sajid Alam <sajid_alam@mckinsey.com>

Update langfuse_trace_dataset.py

96d8062

Signed-off-by: Sajid Alam <sajid_alam@mckinsey.com>

add to langfuse group

e472ca1

Signed-off-by: Sajid Alam <sajid_alam@mckinsey.com>

update docstring

14269c1

Signed-off-by: Sajid Alam <sajid_alam@mckinsey.com>

Update test_langfuse_trace_dataset.py

3ded6e4

Signed-off-by: Sajid Alam <sajid_alam@mckinsey.com>

Update langfuse_trace_dataset.py

b574b1f

Signed-off-by: Sajid Alam <sajid_alam@mckinsey.com>

Update test_langfuse_trace_dataset.py

7d08069

Signed-off-by: Sajid Alam <sajid_alam@mckinsey.com>

ElenaKhaustova reviewed Feb 2, 2026

Comment thread kedro-datasets/kedro_datasets_experimental/langfuse/langfuse_trace_dataset.py Outdated

fix opentelemetry warning

688f0a0

Signed-off-by: Sajid Alam <sajid_alam@mckinsey.com>

ElenaKhaustova mentioned this pull request Feb 3, 2026

feat(datasets): Add autogen support for OpiktraceDataset #1295

Merged

6 tasks

Merge branch 'main' into feat/add-auto-gen-support-to-langfuse

35ea57b

Signed-off-by: Sajid Alam <90610031+SajidAlamQB@users.noreply.github.com>

ElenaKhaustova reviewed Feb 3, 2026

Comment thread kedro-datasets/kedro_datasets_experimental/langfuse/langfuse_trace_dataset.py Outdated

SajidAlamQB added 2 commits February 4, 2026 12:39

Update langfuse_trace_dataset.py

669ea55

Signed-off-by: Sajid Alam <sajid_alam@mckinsey.com>

lint and fix tests

fcecb67

Signed-off-by: Sajid Alam <sajid_alam@mckinsey.com>

SajidAlamQB requested a review from ElenaKhaustova February 4, 2026 13:07

ElenaKhaustova reviewed Feb 4, 2026

SajidAlamQB added 2 commits February 4, 2026 14:25

endpoint from user

58aa9eb

Signed-off-by: Sajid Alam <sajid_alam@mckinsey.com>

update docstring for self-hosted

56dc163

Signed-off-by: Sajid Alam <sajid_alam@mckinsey.com>

SajidAlamQB requested a review from ElenaKhaustova February 4, 2026 19:05

ravi-kumar-pilla approved these changes Feb 5, 2026

Update langfuse_trace_dataset.py

49b2d07

Signed-off-by: Sajid Alam <sajid_alam@mckinsey.com>

ankatiyar reviewed Feb 5, 2026

Comment thread kedro-datasets/kedro_datasets_experimental/langfuse/langfuse_trace_dataset.py

ankatiyar reviewed Feb 5, 2026

SajidAlamQB and others added 7 commits February 5, 2026 15:38

replace with openlit

aac306b

Signed-off-by: Sajid Alam <sajid_alam@mckinsey.com>

lint

94ffb36

Signed-off-by: Sajid Alam <sajid_alam@mckinsey.com>

fix tests

d2b61a7

Signed-off-by: Sajid Alam <sajid_alam@mckinsey.com>

fix ci

b40dfd5

Signed-off-by: Sajid Alam <sajid_alam@mckinsey.com>

pin openlit <1.36.8

da65e0b

Signed-off-by: Sajid Alam <sajid_alam@mckinsey.com>

revert back to otlp

7507dc4

Signed-off-by: Sajid Alam <sajid_alam@mckinsey.com>

Merge branch 'main' into feat/add-auto-gen-support-to-langfuse

8611432

ElenaKhaustova approved these changes Feb 9, 2026

SajidAlamQB and others added 3 commits February 10, 2026 12:48

Merge branch 'main' into feat/add-auto-gen-support-to-langfuse

9127a97

docstring fix

511d1b4

Signed-off-by: Sajid Alam <sajid_alam@mckinsey.com>

Merge branch 'feat/add-auto-gen-support-to-langfuse' of https://githu…

8e71a2d

…b.com/kedro-org/kedro-plugins into feat/add-auto-gen-support-to-langfuse Signed-off-by: Sajid Alam <sajid_alam@mckinsey.com>

SajidAlamQB merged commit 33364c9 into main Feb 10, 2026
28 checks passed

SajidAlamQB deleted the feat/add-auto-gen-support-to-langfuse branch February 10, 2026 14:05

SajidAlamQB mentioned this pull request Feb 10, 2026

kedro-datasets: Extend LangfuseTraceDataset to support AutoGen tracing #1276

Closed

SajidAlamQB mentioned this pull request Mar 23, 2026

Clean up Langfuse AutoGen Tracing example for merge kedro-org/kedro-academy#113

Open

6 tasks


4 participants
