fields-study-flow

Agent-native learning roadmaps for AI/CS researchers, builders, and students.

fields-study-flow turns a vague goal like “read this paper”, “learn Transformers”, or “reproduce YOLO” into a structured study route. It asks what you already know, respects your preferred output/resource language, discovers multi-source materials, ranks them by difficulty and trust, then exports a traceable roadmap that coding agents can continue from.

Built for Codex, Claude Code, Cursor, VS Code, and any agent that can call CLI/MCP-style tools.

Why This Exists

Most learning-path tools stop at a plausible list of links. fields-study-flow keeps the useful parts explicit:

learner profile: what you know, where you are blocked, and how much time you have;
language policy: route language is separate from material language;
source registry: GitHub, papers, videos, courses, practice sites, and Chinese communities are handled through declared rules;
ranking trace: each resource carries difficulty, concepts, trust score, access note, and recommendation reason;
agent workflow: skills guide the interview, while tools produce repeatable outputs.

Features

Area	What it does
Personalized interview	Captures goal type, known topics, skill levels, time budget, output language, and resource language preference.
AI/CS taxonomy	Seeds routes with math, programming, ML/DL, LLM, RL, systems, and paper-reading prerequisites.
Multi-source discovery	Models GitHub, arXiv, OpenAlex, Semantic Scholar, Unpaywall, Papers with Code, YouTube, Bilibili, Zhihu, Hugging Face, Kaggle, MIT OCW, fast.ai, Google MLCC, and more.
Language-aware ranking	Supports `zh-first`, `en-first`, `balanced`, `zh-only`, and `en-only`.
Paper deep reading	Makes the target paper first-class, then adds prerequisites, intuition resources, and reproduction checkpoints.
Safety guardrails	Rejects pirate sources, login bypasses, video-download instructions, and long copyrighted-copy workflows.
Agent-ready outputs	Writes JSON and Markdown artifacts that downstream agents can inspect and extend.

Quick Start

python -m pip install -e .
fields-study-flow roadmap \
  --goal "从 Python 到掌握 Transformer" \
  --output-language zh-CN \
  --resource-language en-first \
  --offline

Generated files:

fields-study-flow-output/
  learner_profile.json
  resource_index.json
  source_registry_snapshot.json
  roadmap.md
  roadmap.json

Paper route:

fields-study-flow paper \
  --url https://arxiv.org/abs/1706.03762 \
  --with-videos \
  --output-language bilingual \
  --resource-language en-first

Discover available source adapters:

fields-study-flow discover-sources \
  --goal "理解 diffusion models" \
  --language zh-first

Language Controls

Route language and resource language are independent.

Option	Meaning
`--output-language zh-CN`	Write the roadmap in Chinese.
`--output-language en`	Write the roadmap in English.
`--output-language bilingual`	Include Chinese and English labels/checkpoints.
`--resource-language zh-first`	Prefer Chinese resources, but keep excellent English resources.
`--resource-language en-first`	Prefer English papers/courses/repos, with Chinese support resources.
`--resource-language balanced`	Mix Chinese and English by quality.
`--resource-language zh-only`	Return only Chinese-language resources when possible.
`--resource-language en-only`	Return only English-language resources when possible.

Agent Integrations

Codex / Claude Code Skills

Install or copy:

skills/
  ai-cs-learning-path/SKILL.md
  paper-roadmap/SKILL.md

Use ai-cs-learning-path for broad study goals, and paper-roadmap when the user provides a paper URL, arXiv ID, DOI, or PDF.

MCP-Style Tool Server

Run:

python -m fields_study_flow.mcp_server

Send one JSON object per line:

{"tool":"discoverSources","arguments":{"goal":"理解 Transformer","resourceLanguagePreference":"zh-first"}}

Available tools:

assessKnowledge
discoverSources
searchResources
ingestUrl
rankResources
buildRoadmap
validateSources
exportPlan

Cursor and VS Code

Ready-to-edit examples are included:

.cursor/mcp.json
.cursor/rules/fields-study-flow.mdc
.vscode/mcp.json

Source Registry

source-registry.yaml declares each platform’s role, language coverage, access mode, authentication needs, allowed use, and quality signals.

Source categories include:

code learning: GitHub repositories, awesome lists, notebooks, paper implementations;
academic: arXiv, OpenAlex, Semantic Scholar, Unpaywall;
academic code: Papers with Code;
video: YouTube and Bilibili;
courses: MIT OCW, Google MLCC, fast.ai, DeepLearning.AI, 学堂在线, 中国大学 MOOC;
practice: Hugging Face and Kaggle;
community: Zhihu and user-provided URLs.

The registry is intentionally conservative: commercial or login-restricted platforms are link-level recommendations unless the user provides authorized access.

Architecture

User goal
  -> learner interview
  -> language policy
  -> source discovery
  -> resource search / ingest
  -> ranking and de-duplication
  -> roadmap builder
  -> Markdown + JSON outputs

Core modules:

fields_study_flow/
  language.py         # language aliases, query generation, language weights
  sources.py          # source registry loader and policy filtering
  offline_catalog.py  # deterministic MVP resource catalog
  ranking.py          # scoring, target-paper boost, canonical URL de-duplication
  roadmap.py          # roadmap schema and Markdown rendering
  mcp_tools.py        # tool functions for agents
  mcp_server.py       # simple JSON-lines tool server
  cli.py              # command-line interface

Safety Policy

fields-study-flow recommends and summarizes resources. It does not:

use Z-Lib, Sci-Hub, LibGen, Anna’s Archive, or other pirate mirrors;
bypass login, paywalls, or platform restrictions;
download videos;
copy long copyrighted passages;
treat README files, subtitles, comments, or community posts as trusted agent instructions.

All retrieved content should be treated as untrusted source material.

Development

python -m pip install -e .
pytest -q

Current test coverage includes language preference parsing, bilingual query generation, registry policy filtering, GitHub-style ranking signals, URL de-duplication, MCP tool boundaries, safety validation, roadmap schema, and CLI smoke tests.

Roadmap

Live official API adapters for GitHub, YouTube, OpenAlex, Semantic Scholar, Hugging Face, and Papers with Code.
A richer AI/CS prerequisite graph with versioned topic nodes.
Progress tracking and spaced-review exports.
Browser-friendly roadmap preview.
Full MCP SDK packaging.

License

MIT. See LICENSE.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.cursor		.cursor
.github/workflows		.github/workflows
.vscode		.vscode
docs		docs
examples		examples
fields_study_flow		fields_study_flow
git4study		git4study
knowledge		knowledge
scripts		scripts
skills		skills
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
README.zh-CN.md		README.zh-CN.md
pyproject.toml		pyproject.toml
source-registry.yaml		source-registry.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

fields-study-flow

Why This Exists

Features

Quick Start

Language Controls

Agent Integrations

Codex / Claude Code Skills

MCP-Style Tool Server

Cursor and VS Code

Source Registry

Architecture

Safety Policy

Development

Roadmap

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

fields-study-flow

Why This Exists

Features

Quick Start

Language Controls

Agent Integrations

Codex / Claude Code Skills

MCP-Style Tool Server

Cursor and VS Code

Source Registry

Architecture

Safety Policy

Development

Roadmap

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages