
OpikEvaluationDataset test branch #128

Open
lrcouto wants to merge 5 commits into main from opik-evaluation


lrcouto commented on Apr 17, 2026

Reopened from #125


This branch tests OpikEvaluationDataset (kedro-org/kedro-plugins#1364). The PR exists to make the differences from main easier to see.

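How to test

First, add the Opik and OpenAI credentials to the project's credentials file (in a Kedro project this is typically conf/local/credentials.yml):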

opik_credentials:
  api_key: "opik-api-key"
  workspace: "workspace-name"

openai:
  openai_api_key: "openai-api-key"

Then run the pipeline and check the result in the Opik dashboard:

kedro run --pipeline intent_detection_evaluation_opik --params model_name=gpt-4o

Expected in the Opik dashboard:

  • A new dataset named evaluations/intent_agent_evaluation appears under Datasets
  • Items are visible with auto-generated UUIDs (human-readable IDs are stripped; this is expected)
  • An experiment run appears under Experiments with scoring results per item
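To make the ID behaviour above concrete, here is a minimal sketch of what stripping human-readable IDs could look like. It is not taken from OpikEvaluationDataset: the helper name `strip_human_ids`, the `id` key, and the sample item fields are hypothetical, and `uuid.uuid4` only stands in for whatever ID scheme Opik applies on its side.

```python
import uuid


def strip_human_ids(items, id_key="id"):
    """Return copies of the items with id_key replaced by a fresh UUID string.

    Hypothetical helper: the real dataset may drop the ID and let Opik assign one.
    """
    prepared = []
    for item in items:
        # Copy every field except the human-readable ID; the input is not mutated.
        body = {key: value for key, value in item.items() if key != id_key}
        prepared.append({"id": str(uuid.uuid4()), **body})
    return prepared


# Illustrative item; the field names are assumptions, not the project's schema.
local_items = [{"id": "greeting-001", "input": "hello", "expected_output": "greeting"}]
remote_ready = strip_human_ids(local_items)
```

The point is only that the ID shown in the dashboard is not expected to match the one in the local file, while the rest of the item content carries over unchanged.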

Run the pipeline a second time without changing the local file. Because items are deduplicated by content hash, no new items are created in Opik and the item count stays the same.

Expected: dataset item count unchanged; a new experiment run is recorded.
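For reviewers unfamiliar with the idea, content hash deduplication can be sketched roughly as follows. This is an illustration under stated assumptions, not the code in kedro-plugins#1364: the function names, the choice of SHA-256 over sorted-key JSON, and the sample fields are all hypothetical.

```python
import hashlib
import json


def content_hash(item):
    """Stable SHA-256 digest of an item's JSON form, independent of key order."""
    canonical = json.dumps(item, sort_keys=True, separators=(",", ":"), ensure_ascii=False)
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


def select_new_items(local_items, remote_hashes):
    """Return the local items whose content hash is not already known remotely."""
    seen = set(remote_hashes)
    new_items = []
    for item in local_items:
        digest = content_hash(item)
        if digest not in seen:
            seen.add(digest)  # also guards against duplicates within the local file
            new_items.append(item)
    return new_items


# Illustrative items; the field names are assumptions, not the project's schema.
items = [
    {"input": "hello", "expected_output": "greeting"},
    {"input": "cancel my order", "expected_output": "cancellation"},
]
first_run = select_new_items(items, remote_hashes=set())
remote = {content_hash(item) for item in first_run}
second_run = select_new_items(items, remote_hashes=remote)
```

Under this scheme the first run uploads both items and the second run selects none, which matches the unchanged item count described above. An item is treated as new only if its content differs.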


Then, add a new item to data/intent_detection/evaluation/intent_evaluation.json and rerun. The new item should appear in the remote dataset after load.
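If editing the file by hand is inconvenient, a small helper along these lines could append the item. This is a sketch that assumes the evaluation file holds a top-level JSON list; the PR does not show the file's schema, so the helper name `append_item` and the example fields are hypothetical.

```python
import json
from pathlib import Path


def append_item(path, item):
    """Append item to the JSON list stored at path and return the new item count."""
    file = Path(path)
    items = json.loads(file.read_text(encoding="utf-8"))
    if not isinstance(items, list):
        # Assumption: the evaluation file is a list of items, not a keyed object.
        raise ValueError("expected a top-level JSON list")
    items.append(item)
    file.write_text(
        json.dumps(items, indent=2, ensure_ascii=False) + "\n", encoding="utf-8"
    )
    return len(items)
```

For example, `append_item("data/intent_detection/evaluation/intent_evaluation.json", {"input": "where is my parcel?", "expected_output": "tracking"})` would add one item, where the two field names are placeholders to be replaced by the keys the existing items actually use.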


lrcouto and others added 4 commits April 17, 2026 13:37
lrcouto changed the title from "Opik evaluation" to "OpikEvaluationDataset test branch" Apr 17, 2026
lrcouto marked this pull request as ready for review April 17, 2026 16:50
lrcouto moved this to To Do in Kedro 🔶 May 5, 2026

3 participants