Skip to content
Merged
Show file tree
Hide file tree
Changes from 107 commits
Commits
Show all changes
115 commits
Select commit Hold shift + click to select a range
cd3c368
Update PDF blueprint architecture diagram
kheiss-uwzoo Feb 19, 2026
70b5a80
Merge remote-tracking branch 'upstream/main'
kheiss-uwzoo Feb 24, 2026
7f0248c
Merge remote-tracking branch 'upstream/main'
kheiss-uwzoo Feb 25, 2026
0dd5f1b
Merge remote-tracking branch 'upstream/main'
kheiss-uwzoo Feb 26, 2026
dea2770
Merge remote-tracking branch 'upstream/main'
kheiss-uwzoo Feb 27, 2026
3ff2f1f
Merge remote-tracking branch 'upstream/main'
kheiss-uwzoo Feb 27, 2026
a886244
Merge remote-tracking branch 'upstream/main'
kheiss-uwzoo Mar 2, 2026
b44f7ad
Merge remote-tracking branch 'upstream/main'
kheiss-uwzoo Mar 2, 2026
addf637
Merge remote-tracking branch 'upstream/main'
kheiss-uwzoo Mar 2, 2026
5900322
Merge remote-tracking branch 'upstream/main'
kheiss-uwzoo Mar 4, 2026
d12df70
Merge remote-tracking branch 'upstream/main'
kheiss-uwzoo Mar 4, 2026
67e674b
Merge remote-tracking branch 'upstream/main'
kheiss-uwzoo Mar 10, 2026
83c3c42
Merge remote-tracking branch 'upstream/main'
kheiss-uwzoo Mar 10, 2026
371d883
Introduce release branch 26.03 with version 26.3.0-RC1
jdye64 Mar 11, 2026
4af706f
Merge remote-tracking branch 'upstream/main'
kheiss-uwzoo Mar 11, 2026
a5812fa
Merge remote-tracking branch 'upstream/main'
kheiss-uwzoo Mar 11, 2026
6ecb070
Merge remote-tracking branch 'upstream/main'
kheiss-uwzoo Mar 11, 2026
72173fc
Release prep: Update version to 26.03.0-RC1 (#1574)
jdye64 Mar 11, 2026
852910c
(retriever) Add .split() for text chunking by token count (#1547) (#1…
edknv Mar 11, 2026
64c694b
(retriever) add documentation for image file support (#1571) (#1577)
edknv Mar 11, 2026
d38abb2
[26.03] Refactor get_*_model_name to avoid caching fallback model nam…
charlesbluca Mar 11, 2026
fbd2e28
[26.03] (helm) More nemotron rebranding (#1581)
charlesbluca Mar 11, 2026
ba92f69
Merge remote-tracking branch 'upstream/main'
kheiss-uwzoo Mar 11, 2026
1835ba7
Add source_id column back to lancedb
jdye64 Mar 12, 2026
db03ed7
upmerge
jperez999 Mar 11, 2026
5cbf38e
fix reranker in inproc (#1588)
jperez999 Mar 12, 2026
6459e60
Add source_id to output columns
jdye64 Mar 12, 2026
ed95c44
fix in process extract to handle txt (#1589)
jperez999 Mar 12, 2026
9568b50
Release prep: 26.03.0-RC2 (#1591)
jdye64 Mar 12, 2026
4a8301e
Increase default Redis TTL from 1-2h to 48h to prevent job expiry dur…
jioffe502 Mar 11, 2026
4f4e512
Add Helm RTX PRO 4500 override, extend obj-det warmup batch size over…
charlesbluca Mar 12, 2026
41d2b07
Merge remote-tracking branch 'upstream/main'
kheiss-uwzoo Mar 12, 2026
be53306
(retriever) update nemotron_parse extraction method (#1599) (#1604)
edknv Mar 12, 2026
491aed0
(retriever) auto-route image files in .extract() for both inprocess a…
edknv Mar 12, 2026
82088d7
Dump libfreetype source in release container (#1600) (#1606)
charlesbluca Mar 12, 2026
10c7435
Unit test failure fixes (#1607)
jdye64 Mar 12, 2026
11662db
Fix markdown outputs for batch and inprocess. (#1601)
jioffe502 Mar 12, 2026
02c2dcd
(retriever) update pre/post-processing for improved recall (#1596) (#…
edknv Mar 12, 2026
f55a733
Remove get_hf_revision logic from code not inside the nemo_retriever …
jdye64 Mar 13, 2026
c00b6bf
Merge remote-tracking branch 'upstream/main'
kheiss-uwzoo Mar 13, 2026
83a936c
Added air gap instructions to helm file (#1616)
kheiss-uwzoo Mar 13, 2026
4d9ce5f
fix for network call reranking (#1619)
jperez999 Mar 13, 2026
0a60c1a
Release prep: Update versions to 26.3.0-RC4 (#1620)
jdye64 Mar 13, 2026
86cda76
Updated RNs to show forthcoming changes (#1623)
kheiss-uwzoo Mar 13, 2026
e5e3b36
update rns (#1624)
kheiss-uwzoo Mar 13, 2026
ce8133d
Fix score (#1627)
jperez999 Mar 14, 2026
8908e21
rm assert on rerank and readme (#1628)
jperez999 Mar 14, 2026
7d112c3
cherry-pick 15b2bc05681599329276e46e83edfa0f15bb4318 from main
randerzander Mar 16, 2026
823775d
Release prep: update version references to 26.3.0 (#1638)
jdye64 Mar 16, 2026
7b54385
26.03 RNs (#1641)
kheiss-uwzoo Mar 17, 2026
b7be9ba
update quickstart library mode (#1642)
kheiss-uwzoo Mar 18, 2026
1c6ec79
update release version from 26.1.3 to 26.3.0 on Release Notes (#1643)
kheiss-uwzoo Mar 18, 2026
cfd0b72
Kheiss/bullets (#1644)
kheiss-uwzoo Mar 18, 2026
818de0a
Update README.md
kheiss-uwzoo Mar 18, 2026
671d78a
Updating & simplifying main README (#1647) (#1650)
jperez999 Mar 18, 2026
85168e2
updates to release notes to fix bullets and doc link (#1651)
kheiss-uwzoo Mar 18, 2026
4075ae9
Kheiss/5970976 (#1652)
kheiss-uwzoo Mar 18, 2026
ebb1253
Kheiss/5966534 (#1653)
kheiss-uwzoo Mar 18, 2026
924a18e
Kheiss/5970976 - change location of air gap documentation (#1656)
kheiss-uwzoo Mar 18, 2026
4129d5b
Revert doc naming changes
jdye64 Mar 19, 2026
22d58bf
Confirmed product naming of NeMo Retriever Library in files and code …
kheiss-uwzoo Mar 19, 2026
17e0148
update helm file (#1679)
kheiss-uwzoo Mar 20, 2026
3d4fdae
updated quickstart to current version following reversion (#1683)
kheiss-uwzoo Mar 23, 2026
b1f56bb
Kheiss/quickstart lib mode update (#1682)
kheiss-uwzoo Mar 23, 2026
19e77e1
Update RNs to current version (#1687)
kheiss-uwzoo Mar 23, 2026
0e0bebc
Kheiss/update quickstart (#1688)
kheiss-uwzoo Mar 23, 2026
77cb39a
update reference diagram for overview (#1689)
kheiss-uwzoo Mar 23, 2026
56c2c51
fixed reference information about name change from nv-ingest to NeMo …
kheiss-uwzoo Mar 23, 2026
6758c17
changed opening note to NVIDIA Ingest (nv-ingest) has been renamed N…
kheiss-uwzoo Mar 23, 2026
3db9a49
remove duplicate caption() section with wrong parameters (NVBug 60006…
kheiss-uwzoo Mar 23, 2026
f0f9e97
Kheiss/6000618 (#1694)
kheiss-uwzoo Mar 23, 2026
cf22e8c
fix syntax (#1696)
kheiss-uwzoo Mar 23, 2026
cc33bea
Kheiss/6000353 - update links to Helm chart (#1697)
kheiss-uwzoo Mar 23, 2026
fa30ff8
Document RTX PRO 4500 Blackwell (GB203) in hardware support matrix 59…
kheiss-uwzoo Mar 23, 2026
726340c
fixed the contributing.md (#1706)
sosahi Mar 24, 2026
ad96fc9
add contributing.md back to repository (#1709)
kheiss-uwzoo Mar 24, 2026
bcaf8f3
Kheiss/6000353 - update links to older RNs (#1712)
kheiss-uwzoo Mar 24, 2026
486a0de
Kheiss/5966538 - document Python 3.12+ as a prerequisite for NeMo Ret…
kheiss-uwzoo Mar 24, 2026
f07e881
Aligns NeMo Retriever Library extraction docs with the current defaul…
kheiss-uwzoo Mar 25, 2026
f6e5869
Align nemotron-parse overview with three methods (NVBug 5965574); (#1…
kheiss-uwzoo Mar 25, 2026
998f26b
Kheiss/updates0325 (#1734)
kheiss-uwzoo Mar 25, 2026
a6ef79a
removed duplication of the word NVIDIA (#1736)
kheiss-uwzoo Mar 26, 2026
a07ac1d
removed reference to zipking (#1737)
kheiss-uwzoo Mar 26, 2026
fd1353a
Fixed bug 5966370 (#1744)
kheiss-uwzoo Mar 27, 2026
c63daab
Align production GPU examples with support matrix (NVBug 5965601) (#…
kheiss-uwzoo Mar 30, 2026
9dc88b5
Kheiss/5966722 (#1743)
kheiss-uwzoo Mar 30, 2026
6c3c2a6
Updated files per bugs 5970369, 5966307, and 5966925 (#1740)
kheiss-uwzoo Mar 30, 2026
53262b4
Align VLM caption model and MinIO defaults with runtime (#1739)
kheiss-uwzoo Mar 30, 2026
1a91164
added licensing info to documentation (#1750)
kheiss-uwzoo Mar 30, 2026
b5d7b96
updated quickstart guide file per 5966239 (#1751)
kheiss-uwzoo Mar 30, 2026
4744677
update support matrix to add footnotes
kheiss-uwzoo Mar 30, 2026
e8759e2
update support matrix to add footnotes (#1752)
kheiss-uwzoo Mar 30, 2026
f39912f
Merge remote-tracking branch 'upstream/26.03' into 26.03
kheiss-uwzoo Mar 30, 2026
29f787b
Kheiss/5966297update (#1758)
kheiss-uwzoo Mar 31, 2026
c5e1c22
Align VLM caption model, fix V2 ingest() example, document run_pipel…
kheiss-uwzoo Mar 31, 2026
7461ce4
Merge remote-tracking branch 'upstream/26.03' into 26.03
kheiss-uwzoo Mar 31, 2026
d56a8cb
Merge branch '26.03' into main
kheiss-uwzoo Apr 2, 2026
3e80634
Merge remote-tracking branch 'upstream/main'
kheiss-uwzoo Apr 2, 2026
7f73df3
Merge remote-tracking branch 'upstream/main'
kheiss-uwzoo Apr 14, 2026
4ce21b5
Merge remote-tracking branch 'upstream/main'
kheiss-uwzoo Apr 15, 2026
21a756b
Creating NRL only posting for GitHub
kheiss-uwzoo Apr 15, 2026
e7cb523
Merge branch 'main' into kheiss/NRLonly
kheiss-uwzoo Apr 16, 2026
31117de
NRL centric GitHub pages
kheiss-uwzoo Apr 16, 2026
ebd9fd5
ci(docs): add NRL GitHub Pages workflow, mkdocs config, and helper sc…
kheiss-uwzoo Apr 16, 2026
17f2504
docs: add NVIDIA logo icon to NRL staging overrides for MkDocs build
kheiss-uwzoo Apr 16, 2026
648b597
docs(nrl): emit site root index.html via redirect to Library overview
kheiss-uwzoo Apr 16, 2026
3eac3ca
Apply suggestion from @greptile-apps[bot]
kheiss-uwzoo Apr 17, 2026
1482710
docs: NRL workflows, navigation, and rename note across extraction pages
kheiss-uwzoo Apr 17, 2026
286921b
docs: fix broken internal links and anchors for MkDocs NRL build
kheiss-uwzoo Apr 17, 2026
97f4e49
docs: update internal links for clarity in reranking documentation
kheiss-uwzoo Apr 17, 2026
5718b99
docs: replace instructional 'see' with 'refer to' in extraction topics
kheiss-uwzoo Apr 17, 2026
a43263c
NRL only doc updates
kheiss-uwzoo Apr 20, 2026
8a3b8be
Merge branch 'main' into kheiss/NRLonly
kheiss-uwzoo Apr 20, 2026
22476ee
Update docs/mkdocs.nrl-github-pages.yml
kheiss-uwzoo Apr 20, 2026
4e3001c
Update docs/docs/extraction/overview.md
kheiss-uwzoo Apr 20, 2026
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
102 changes: 102 additions & 0 deletions .github/workflows/nrl-docs-github-pages.yml
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What is the point of this new workflow? There is already a workflow that does the same thing

Copy link
Copy Markdown
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Existing Pages workflows run the full Docker/Sphinx docs build; this one is a lightweight NRL-only MkDocs path for staging/nightly without that pipeline.

Original file line number Diff line number Diff line change
@@ -0,0 +1,102 @@
# NeMo Retriever Library (NRL) documentation only — GitHub Pages staging / nightly.
# Does not run the full Docker + Sphinx pipeline (no nv-ingest / nv-ingest-api HTML API dump).
name: NRL documentation — GitHub Pages (staging)

on:
push:
branches:
- main
paths:
- "docs/**"
- "nemo_retriever/**"
- ".github/workflows/nrl-docs-github-pages.yml"
schedule:
# Nightly (UTC): pick up doc changes even if no pushes
- cron: "0 7 * * *"
workflow_dispatch:

permissions:
contents: read
pages: write
id-token: write
Comment thread
kheiss-uwzoo marked this conversation as resolved.

concurrency:
group: pages-nrl-staging
cancel-in-progress: false

jobs:
build:
name: Build NRL docs (staging)
runs-on: ubuntu-latest
uses: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683 # v4.2.2
- name: Checkout
uses: actions/checkout@v4
Comment thread
kheiss-uwzoo marked this conversation as resolved.

- name: Configure Pages
id: pages
uses: actions/configure-pages@v5

- name: Set up Python
uses: actions/setup-python@v5
with:
python-version: "3.12"
cache: pip
cache-dependency-path: docs/requirements.txt

- name: Install Python dependencies
run: |
python -m pip install --upgrade pip
pip install -r docs/requirements.txt
pip install -e ./nemo_retriever

- name: Print NRL site navigation (pre-deploy)
run: python docs/scripts/print_nrl_mkdocs_nav.py

- name: Write nav + scan summary for the workflow run
run: |
{
echo "### NRL GitHub Pages — site navigation"
echo
echo '```'
python docs/scripts/print_nrl_mkdocs_nav.py
echo '```'
echo
echo "### Non-NRL / legacy reference scan (excerpt)"
echo "Full report is attached as an artifact."
echo
echo '```'
python docs/scripts/scan_non_nrl_doc_references.py | head -n 120
echo '```'
} >> "$GITHUB_STEP_SUMMARY"

- name: Scan for non-NRL references (full report)
run: python docs/scripts/scan_non_nrl_doc_references.py | tee non-nrl-review.txt

- name: Upload non-NRL scan artifact
uses: actions/upload-artifact@v4
with:
name: non-nrl-content-review
path: non-nrl-review.txt

- name: Build MkDocs (NRL only)
working-directory: docs
env:
SITE_URL: ${{ steps.pages.outputs.base_url }}
run: mkdocs build -f mkdocs.nrl-github-pages.yml --strict

- name: Upload Pages artifact
uses: actions/upload-pages-artifact@v3
with:
path: docs/site

deploy:
name: Deploy to GitHub Pages
needs: build
runs-on: ubuntu-latest
environment:
name: github-pages
url: ${{ steps.deployment.outputs.page_url }}
steps:
- name: Deploy
id: deployment
uses: actions/deploy-pages@v4
11 changes: 11 additions & 0 deletions docs/docs/extraction/agentic-retrieval-concept.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,11 @@
# Agentic retrieval (concept)

Agentic retrieval means **iterative, tool-driven** retrieval: an agent plans steps, issues searches, may refine filters, and optionally reranks until it has enough context to answer.

NeMo Retriever Library focuses on document ingestion, embeddings, vector stores, hybrid search, and reranking. Orchestration frameworks call these building blocks from your application.

**Related**

- [Workflow: Agentic retrieval](workflow-agentic-retrieval.md)
- [Semantic and hybrid retrieval](semantic-hybrid-retrieval.md)
- Framework examples: [LangChain, LlamaIndex, Haystack](integrations-langchain-llamaindex-haystack.md)
4 changes: 2 additions & 2 deletions docs/docs/extraction/audio.md
Original file line number Diff line number Diff line change
Expand Up @@ -22,7 +22,7 @@ Currently, you can extract speech from the following file types:

[NeMo Retriever Library](overview.md) supports extracting speech from audio files for Retrieval Augmented Generation (RAG) applications.
Similar to how the multimodal document extraction pipeline leverages object detection and image OCR microservices,
NeMo Retriever leverages the [parakeet-1-1b-ctc-en-us ASR NIM microservice](https://docs.nvidia.com/nim/speech/latest/asr/deploy-asr-models/parakeet-ctc-en-us.html)
NeMo Retriever Library uses the [parakeet-1-1b-ctc-en-us ASR NIM microservice](https://docs.nvidia.com/nim/speech/latest/asr/deploy-asr-models/parakeet-ctc-en-us.html)
to transcribe speech to text, which is then embedded by using the NeMo Retriever embedding NIM.

!!! important
Expand Down Expand Up @@ -136,4 +136,4 @@ Instead of running the pipeline locally, you can use NVCF to perform inference b

- [Support Matrix](support-matrix.md)
- [Troubleshoot Nemo Retriever Extraction](troubleshoot.md)
- [Use the Python API](nv-ingest-python-api.md)
- [Use the Python API](python-api-reference.md)
32 changes: 32 additions & 0 deletions docs/docs/extraction/choose-your-path.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,32 @@
# Choose your path

Use this page to pick documentation and deployment options that match your goal.

## I want to run locally or embed the library

1. [Prerequisites](prerequisites.md) and [Support matrix](support-matrix.md)
2. [Deploy (Library mode)](quickstart-library-mode.md)
3. [Use the Python API](python-api-reference.md) or [Use the CLI](cli-reference.md)

## I want a Kubernetes / Helm deployment

1. [Prerequisites](prerequisites.md)
2. [Deploy (Helm Chart)](helm.md)
3. [Environment variables](environment-config.md) and [Troubleshoot](troubleshoot.md) as needed

## I want examples and notebooks

1. [Jupyter Notebooks](notebooks.md)
2. [Integrate with LangChain, LlamaIndex, Haystack](integrations-langchain-llamaindex-haystack.md)

## I need API details and keys

1. [Get your API key](ngc-api-key.md)
2. [API reference](nemo-retriever-api-reference.md) and [V2 API guide](v2-api-guide.md) if applicable

## I am tuning performance or cost

1. [Benchmarking and performance](benchmarking.md)
2. [Telemetry](telemetry.md)
3. [Throughput is dataset-dependent](throughput-is-dataset-dependent.md)
4. [Evaluate on your data](evaluate-on-your-data.md)
2 changes: 1 addition & 1 deletion docs/docs/extraction/chunking.md
Original file line number Diff line number Diff line change
Expand Up @@ -106,6 +106,6 @@ If you are building the container yourself and want to pre-download this model,

## Related Topics

- [Use the Python API](nv-ingest-python-api.md)
- [Use the Python API](python-api-reference.md)
- [NeMo Retriever Library V2 API Guide](v2-api-guide.md)
- [Environment Variables](environment-variables.md)
27 changes: 27 additions & 0 deletions docs/docs/extraction/concepts.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,27 @@
# Concepts

These terms appear throughout NeMo Retriever Library documentation.

## Job

A **job** is a unit of work you submit with a JSON description: a document payload (or reference) and a list of **ingestion tasks** to run on that payload. Results are retrieved as structured metadata and annotations.

## Pipeline and tasks

NeMo Retriever Library does **not** run one static pipeline on every document. You configure **tasks** such as parsing, chunking, embedding, storage, and filtering per job. Related topics: [Customize your pipeline](user-defined-functions.md), [user-defined stages](user-defined-stages.md).

## Extraction metadata

Output is typically a **JSON dictionary** listing extracted objects (text regions, tables, images, and so on), processing notes, and timing or trace data. Field-level detail is in the [metadata reference](content-metadata.md).

## Embeddings and retrieval

Optionally, the library can compute **embeddings** for extracted content and store vectors in a database such as [LanceDB](https://lancedb.com/) or [Milvus](https://milvus.io/) for downstream **semantic or hybrid search** in your application.

## Deployment modes

- **Library mode** — Run without the full container stack where appropriate ([quickstart](quickstart-library-mode.md)).
- **Helm / Kubernetes** — [Helm-based deployment](helm.md) for cluster operations.
- **Notebooks** — [Jupyter examples](notebooks.md) for experimentation and RAG demos.

For a concise comparison, see [Choose your path](choose-your-path.md).
2 changes: 1 addition & 1 deletion docs/docs/extraction/custom-metadata.md
Original file line number Diff line number Diff line change
Expand Up @@ -56,7 +56,7 @@ meta_df.to_csv(file_path)
### Example: Add Custom Metadata During Ingestion

The following example adds custom metadata during ingestion.
For more information about the `Ingestor` class, see [Use the Python API](nv-ingest-python-api.md).
For more information about the `Ingestor` class, see [Use the Python API](python-api-reference.md).
For more information about the `vdb_upload` method, see [Upload Data](data-store.md).

```python
Expand Down
12 changes: 6 additions & 6 deletions docs/docs/extraction/data-store.md
Original file line number Diff line number Diff line change
Expand Up @@ -20,10 +20,10 @@ It does not store the embeddings for images.

!!! tip "Storing Extracted Images"

To persist extracted images, tables, and chart renderings to disk or object storage, use the `store` task in addition to `vdb_upload`. The `store` task supports any fsspec-compatible backend (local filesystem, S3, GCS, etc.). For details, refer to [Store Extracted Images](nv-ingest-python-api.md#store-extracted-images).
To persist extracted images, tables, and chart renderings to disk or object storage, use the `store` task in addition to `vdb_upload`. The `store` task supports any fsspec-compatible backend (local filesystem, S3, GCS, etc.). For details, refer to [Store Extracted Images](python-api-reference.md#store-extracted-images).

NeMo Retriever Library supports uploading data by using the [Ingestor.vdb_upload API](nv-ingest-python-api.md).
Currently, data upload is not supported through the [CLI](nv-ingest_cli.md).
NeMo Retriever Library supports uploading data by using the [Ingestor.vdb_upload API](python-api-reference.md).
Currently, data upload is not supported through the [CLI](cli-reference.md).



Expand Down Expand Up @@ -140,7 +140,7 @@ You can delete all collections by deleting that volume, and then restarting the

!!! tip

When you use the `vdb_upload` method, the behavior of the upload depends on the `return_failures` parameter of the `ingest` method. For details, refer to [Capture Job Failures](nv-ingest-python-api.md#capture-job-failures).
When you use the `vdb_upload` method, the behavior of the upload depends on the `return_failures` parameter of the `ingest` method. For details, refer to [Capture Job Failures](python-api-reference.md#capture-job-failures).

To upload to Milvus, use code similar to the following to define your `Ingestor`.

Expand Down Expand Up @@ -179,7 +179,7 @@ For more information, refer to [Build a Custom Vector Database Operator](https:/

## Related Topics

- [Use the NeMo Retriever Library Python API](nv-ingest-python-api.md)
- [Store Extracted Images](nv-ingest-python-api.md#store-extracted-images)
- [Use the NeMo Retriever Library Python API](python-api-reference.md)
- [Store Extracted Images](python-api-reference.md#store-extracted-images)
- [Environment Variables](environment-config.md)
- [Troubleshoot Nemo Retriever Extraction](troubleshoot.md)
7 changes: 7 additions & 0 deletions docs/docs/extraction/embedding-nims-models.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,7 @@
# Embedding NIMs and models

Embeddings turn extracted text and multimodal content into vectors for semantic search. NeMo Retriever Library integrates with NVIDIA NIM microservices for embedding. Model names and compatibility vary by release; see the [Support matrix](support-matrix.md) and the [NVIDIA NIM catalog](https://build.nvidia.com/).

For multimodal or VLM embeddings, see [Multimodal embeddings (VLM)](vlm-embed.md).

After embedding, content is stored in a vector database; see [Vector databases](data-store.md). RAG-style collections are created and populated through your pipeline configuration and harness runs. For details, see [Benchmarking](benchmarking.md) and the [data store](data-store.md) documentation for your backend.
2 changes: 1 addition & 1 deletion docs/docs/extraction/environment-config.md
Original file line number Diff line number Diff line change
Expand Up @@ -23,7 +23,7 @@ You can specify these in your .env file or directly in your environment.
| `OTEL_EXPORTER_OTLP_ENDPOINT` | `http://otel-collector:4317` <br/> | The endpoint for the OpenTelemetry exporter, used for sending telemetry data. |
| `REDIS_INGEST_TASK_QUEUE` | `ingest_task_queue` <br/> | The name of the task queue in Redis where tasks are stored and processed. |
| `REDIS_POOL_SIZE` | - `50` (default) <br/> - `100` <br/> - `200` <br/> | Maximum Redis connection pool size. Increase for high-concurrency workloads processing many documents in parallel. Default of 50 works well for most deployments. |
| `IMAGE_STORAGE_URI` | `s3://nv-ingest/artifacts/store/images` <br/> | Default fsspec-compatible URI for the `store` task. Supports `s3://`, `file://`, `gs://`, etc. See [Store Extracted Images](nv-ingest-python-api.md#store-extracted-images). |
| `IMAGE_STORAGE_URI` | `s3://nv-ingest/artifacts/store/images` <br/> | Default fsspec-compatible URI for the `store` task. Supports `s3://`, `file://`, `gs://`, etc. See [Store Extracted Images](python-api-reference.md#store-extracted-images). |
| `IMAGE_STORAGE_PUBLIC_BASE_URL` | `https://assets.example.com/images` <br/> | Optional HTTP(S) base URL for serving stored images. |


Expand Down
17 changes: 17 additions & 0 deletions docs/docs/extraction/evaluate-on-your-data.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,17 @@
# Evaluate on your data

Retrieval and ingestion performance **depend on your documents**, hardware, and pipeline settings. Use the following when measuring quality and throughput on **your** datasets.

## Benchmarking and baselines

Start with [Benchmarking](benchmarking.md) for methodology and baseline expectations. Combine with [Telemetry](telemetry.md) to observe production-like runs.

## Throughput and dataset effects

Read [Throughput is dataset-dependent](throughput-is-dataset-dependent.md) for why raw numbers from generic benchmarks may not match your corpus (layout complexity, file types, image density, and so on).

## Operational tuning

- [Resource scaling modes](scaling-modes.md)
- [Support matrix](support-matrix.md) for supported configurations
- [Troubleshoot](troubleshoot.md) when results or performance diverge from expectations
9 changes: 9 additions & 0 deletions docs/docs/extraction/extraction-charts-infographics.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,9 @@
# Charts and infographics

Charts and infographic regions are classified as graphic elements and processed with the corresponding NVIDIA NIM workflows (for example, **yolox-graphic-elements** in current releases). Outputs use the same metadata schema as other extracted objects.

**Related**

- [What is NeMo Retriever Library?](overview.md)
- [Support matrix](support-matrix.md)
- [Multimodal embeddings (VLM)](vlm-embed.md) when you treat graphics as images for embedding
9 changes: 9 additions & 0 deletions docs/docs/extraction/extraction-ocr-scanned.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,9 @@
# OCR and scanned documents

Scanned PDFs and image-only pages rely on OCR and hybrid paths that combine native text extraction with OCR when needed. For extract methods such as `ocr` and `pdfium_hybrid`, see the [Python API reference](python-api-reference.md).

**Related**

- [Text and layout extraction](text-layout-extraction.md)
- [Nemotron Parse](nemoretriever-parse.md)
- [Throughput is dataset-dependent](throughput-is-dataset-dependent.md)
9 changes: 9 additions & 0 deletions docs/docs/extraction/extraction-tables.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,9 @@
# Tables

NeMo Retriever Library detects tables as structured page elements, processes them through the appropriate NIMs, and exports formats suitable for downstream RAG (including Markdown-oriented representations where configured). Availability depends on pipeline and model configuration; see the [Support matrix](support-matrix.md).

**Related**

- [What is NeMo Retriever Library?](overview.md) for artifact classification
- [Nemotron Parse](nemoretriever-parse.md) for advanced visual parsing
- [Metadata reference](content-metadata.md)
8 changes: 4 additions & 4 deletions docs/docs/extraction/faq.md
Original file line number Diff line number Diff line change
Expand Up @@ -32,7 +32,7 @@ For more information, refer to [Data Upload](data-store.md).
For images that `nemoretriever-page-elements-v3` does not classify as tables, charts, or infographics,
you can use our VLM caption task to create a dense caption of the detected image.
That caption is then be embedded along with the rest of your content.
For more information, refer to [Extract Captions from Images](nv-ingest-python-api.md#extract-captions-from-images).
For more information, refer to [Extract Captions from Images](python-api-reference.md#extract-captions-from-images).



Expand Down Expand Up @@ -74,10 +74,10 @@ For examples of `*_ENDPOINT` variables, refer to [nv-ingest/docker-compose.yaml]
See the [Profile Information](quickstart-guide.md#profile-information) section
for information about the optional NIM components of the pipeline.

You can configure the `extract`, `caption`, and other tasks by using the [Ingestor API](nv-ingest-python-api.md).
You can configure the `extract`, `caption`, and other tasks by using the [Ingestor API](python-api-reference.md).

To choose what types of content to extract, use code similar to the following.
For more information, refer to [Extract Specific Elements from PDFs](nv-ingest-python-api.md#extract-specific-elements-from-pdfs).
For more information, refer to [Extract Specific Elements from PDFs](python-api-reference.md#extract-specific-elements-from-pdfs).

```python
Ingestor(client=client)
Expand All @@ -93,7 +93,7 @@ Ingestor(client=client)
```

To generate captions for images, use code similar to the following.
For more information, refer to [Extract Captions from Images](nv-ingest-python-api.md#extract-captions-from-images).
For more information, refer to [Extract Captions from Images](python-api-reference.md#extract-captions-from-images).

```python
Ingestor(client=client)
Expand Down
14 changes: 14 additions & 0 deletions docs/docs/extraction/getting-started-about.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,14 @@
# About getting started

This section walks you from **access and prerequisites** through **first deployment** and **hands-on notebooks**.

Typical order:

1. [Get your API key](ngc-api-key.md) (NGC / API access as required by your workflow).
2. Confirm [Prerequisites](prerequisites.md) and the [Support matrix](support-matrix.md) for your OS, GPU, and software stack.
3. Deploy using one of:
- [Library mode](quickstart-library-mode.md) (without full stack containers where appropriate)
- [Helm Chart](helm.md) for Kubernetes environments
4. Explore [Jupyter Notebooks](notebooks.md) for end-to-end examples.

If you are new to the product, read [What is NeMo Retriever Library?](overview.md), [Key features](key-features.md), and [Concepts](concepts.md) under **Introduction** first.
14 changes: 14 additions & 0 deletions docs/docs/extraction/hosted-nims-when-to-use.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,14 @@
# When to use NVIDIA-hosted NIMs

[NVIDIA-hosted NIMs](https://build.nvidia.com/) run inference on NVIDIA-managed infrastructure. You call models with API keys (see [Get your API key](ngc-api-key.md)) without operating GPU nodes yourself.

Consider hosted NIMs when:

- You want the fastest path to try models and iterate without installing drivers, containers, or the [NIM Operator](https://docs.nvidia.com/nim-operator/latest/index.html) on your own clusters.
- Latency to NVIDIA endpoints works for your region and use case.
- Your compliance and data policies allow document or query content in the hosted service (confirm with your security review).

For more information, see the following pages:

- [NVIDIA NIM catalog](https://build.nvidia.com/)
- [Compare deployment options](choose-your-path.md)
22 changes: 22 additions & 0 deletions docs/docs/extraction/how-to-use-this-documentation.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,22 @@
# How to use this documentation

Use the sections below as a reading order that matches how you run NeMo Retriever Library.

## NeMo Retriever Library (local or embedded)

Start with the [Introduction](overview.md), [Concepts](concepts.md), and [Get started](getting-started-about.md) pages. Then follow [Prerequisites](prerequisites.md), [Quickstart: Library mode](quickstart-library-mode.md), and either the [Python API](python-api-reference.md) or [CLI](cli-reference.md). For deeper topics, see [Core workflows](v2-api-guide.md) and [Multimodal extraction](supported-file-types.md).

## Microservices, Helm, and production clusters

Follow [Choose your deployment](choose-your-path.md), [Deploy (Helm Chart)](helm.md), [Environment variables](environment-config.md), and the [V2 API guide](v2-api-guide.md). For operations topics, see [Scaling modes](scaling-modes.md), [Ray logging](ray-logging.md), [Telemetry](telemetry.md), and [Benchmarking](benchmarking.md).

## NVIDIA Blueprints and end-to-end RAG

For solution-level patterns, read [End-to-end RAG with NVIDIA Blueprints](resources-links.md), including links to [NVIDIA AI Blueprints](resources-links.md). These docs cover ingestion, embedding, and retrieval primitives that Blueprints combine into full applications.

## Related

The following pages supplement this overview:

- [About getting started](getting-started-about.md), for a step-by-step first deployment
- [Release notes](releasenotes-nv-ingest.md)
Loading
Loading