Document VLM Captioning for Infographics by kheiss-uwzoo · Pull Request #1369 · NVIDIA/nv-ingest

kheiss-uwzoo · 2026-02-04T16:57:05Z

Summary

Added comprehensive documentation explaining how to use VLMs to caption infographics across multiple documentation files.

The changes provide:

Clear explanations of VLM captioning for infographics
Code examples showing practical implementation
Proper VLM acronym definitions
Cross-references between related documentation sections

Changes Made

Added new section: "VLM Captioning for Infographics Example"

Explains why infographics benefit from VLM captioning
Provides three approaches:
Approach 1: Extract and caption infographics (generates text descriptions)
Approach 2: Embed infographics as images (preserves visual characteristics)
Combining Both: Use both captioning and embedding together
Includes complete code examples for each approach
Added guidance on when to use each method

nv-ingest-python-api.md (Enhanced existing section)

Enhanced "Extract Captions from Images" section to explicitly mention infographics
Added new subsection: "Captioning Infographics"
Includes code example showing extract_infographics=True with .caption()
Added note about requiring the vlm profile
Cross-referenced to vlm-embed.md

quickstart-guide.md (Enhanced profile documentation + example)

Updated VLM profile description to mention infographics
Added explanation that profile enables .caption() method
Added new subsection: "Example: Using the VLM Profile for Infographic Captioning"
Includes docker compose command with both retrieval and vlm profiles
Provides complete end-to-end Python example
Added tip explaining benefits of VLM captioning for complex visuals

support-matrix.md (Clarified VLM feature)

Defined VLM acronym on first use (line 27)
Updated VLM feature description to include infographics
Ensures consistency across documentation

Documentation Standards

✅ Cross-references added between related sections
✅ Code examples are complete and runnable
✅ Progressive detail: quick reference → detailed examples → comprehensive guide
✅ Multiple entry points for users to discover this information

##Files Modified

docs/docs/extraction/vlm-embed.md (+98 lines)
docs/docs/extraction/nv-ingest-python-api.md (~30 lines modified/added)
docs/docs/extraction/quickstart-guide.md (~50 lines modified/added)
docs/docs/extraction/support-matrix.md (~1 line modified)

docs/docs/extraction/nv-ingest-python-api.md

docs/docs/extraction/quickstart-guide.md

docs/docs/extraction/vlm-embed.md

nkmcalli

A few suggestions for you

Co-authored-by: nkmcalli <[email protected]>

added VLM information

15be91b

kheiss-uwzoo requested a review from a team as a code owner February 4, 2026 16:57

kheiss-uwzoo requested review from jdye64, jperez999, nkmcalli and sosahi February 4, 2026 16:57

kheiss-uwzoo added the doc Improvements or additions to documentation label Feb 4, 2026

nkmcalli reviewed Feb 4, 2026

View reviewed changes

nkmcalli requested changes Feb 4, 2026

View reviewed changes

nkmcalli assigned kheiss-uwzoo Feb 4, 2026

kheiss-uwzoo and others added 11 commits February 4, 2026 11:10

Update docs/docs/extraction/nv-ingest-python-api.md

2320ada

Co-authored-by: nkmcalli <[email protected]>

Update docs/docs/extraction/nv-ingest-python-api.md

a6184e5

Co-authored-by: nkmcalli <[email protected]>

Update docs/docs/extraction/nv-ingest-python-api.md

1a8e16e

Co-authored-by: nkmcalli <[email protected]>

Update docs/docs/extraction/quickstart-guide.md

fb35504

Co-authored-by: nkmcalli <[email protected]>

Update docs/docs/extraction/quickstart-guide.md

d79f824

Co-authored-by: nkmcalli <[email protected]>

Update docs/docs/extraction/quickstart-guide.md

8d2ccdd

Co-authored-by: nkmcalli <[email protected]>

Update docs/docs/extraction/vlm-embed.md

09db2a5

Co-authored-by: nkmcalli <[email protected]>

Update docs/docs/extraction/vlm-embed.md

e2c5568

Co-authored-by: nkmcalli <[email protected]>

Update docs/docs/extraction/vlm-embed.md

f1b613a

Co-authored-by: nkmcalli <[email protected]>

Update docs/docs/extraction/vlm-embed.md

3aa09a3

Co-authored-by: nkmcalli <[email protected]>

Merge branch 'main' into kheiss/vlm-info-capt

702ebdf

kheiss-uwzoo requested a review from nkmcalli February 4, 2026 19:13

nkmcalli approved these changes Feb 4, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Document VLM Captioning for Infographics#1369

Document VLM Captioning for Infographics#1369
kheiss-uwzoo wants to merge 12 commits intoNVIDIA:mainfrom
kheiss-uwzoo:kheiss/vlm-info-capt

kheiss-uwzoo commented Feb 4, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

nkmcalli left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

kheiss-uwzoo commented Feb 4, 2026

Summary

Changes Made

Documentation Standards

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

nkmcalli left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants