Paper Visualizer (Anthropic Skill)

Turn ArXiv Papers into High-Fidelity Technical Schematics. A specialized Anthropic Skill that architects professional diagrams for research papers, optimized for Nano Banana Pro.

🌐 Official Website: https://wilsonwukz.github.io/paper-visualizer-skill/

Transformer Architecture - Anthropic Console

Figure 1: The "Golden Schema" generated via Anthropic Console (Claude 3.5 Sonnet). Note the precise recursive structure of the Encoder/Decoder stacks and the detailed "Multi-Head Attention" insets.

Introduction

"Why can't AI draw my architecture correctly?"

Researchers and Engineers often struggle to visualize complex systems. While standard generative AI excels at art, it fundamentally fails at Scientific Logic and Topological Consistency, often producing "hallucinated" connections or gibberish text.

Paper Visualizer bridges this gap. It acts as a Structural Architect middleware that:

Decodes the PDF: Reads the raw academic text to extract the logical topology (e.g., Is it a cyclic loop? A parallel stream? A hierarchical tree?).
Visual Tokenization: Translates abstract concepts (e.g., "Residual Connection") into concrete visual tokens (e.g., "Curved bypass arrow with (+) symbol").
Strict Layout Enforcement: Outputs a structured, coordinate-based prompt that forces Nano Banana Pro to obey physical laws.

Key Features

6 Cognitive Layout Engines: Automatically selects the best visual topology for your paper:
- Linear Pipeline (for CNNs/Preprocessing)
- Parallel Dual-Stream (for Transformers/Siamese Networks)
- Central Hub (for Agents/RL)
- Cyclic Loop (for Optimization/GANs)
- Hierarchical Stack (for FPNs/UNets)
- Matrix Grid (for Ablation Studies)
Typography Guardrails: Enforces sans-serif hierarchy rules to minimize text artifacts, ensuring that main labels (e.g., "ENCODER") remain legible.
Nano Banana Pro Optimized: Specifically tuned to leverage Nano Banana Pro's strengths in text rendering and structural adherence.

Gallery: Style Variants

This skill supports different aesthetic outputs based on the configuration passed to Nano Banana Pro.

Variant A: "The Textbook Standard" (Precision Focus)

(See Figure 1 above)

Pipeline: Claude 3.5 Sonnet → Nano Banana Pro
Style: Clean, Academic, White Background. Perfect for Paper Submissions (LaTeX).

Variant B: "The Tech Presentation" (Impact Focus)

Figure 2: The same Transformer architecture rendered with a "Sci-Fi/High-Tech" aesthetic via GPT-4o logic. Ideal for Conference Slides, Posters, and Pitch Decks.

Benchmark & Validation

We strictly evaluate this skill across different environments to ensure robustness.

Environment	Logic Model	Logic Adherence	Detail Insets	Log Output
Anthropic Console	Claude 3.5 Sonnet	Excellent	Perfect	View Log
ChatGPT Web	GPT-4o	Very Good	Good	View Log

Observation: Claude 3.5 Sonnet tends to follow the "Detail Inset" (Zone 7 & 8) instructions more strictly, making it the recommended engine for complex architectures.

Installation & Usage

How to Use

Download the core skill file: skills/visual-architect/SKILL.md.
Add it to your Project Knowledge (Claude Desktop / Cursor) or System Instructions.
Trigger: "Generate a visual schema for this paper's methodology."

How It Works (The Prompt Engineering)

This skill forces the LLM to output a structured JSON-like Markdown block, bypassing its usual "chatty" nature:

[LAYOUT CONFIGURATION]
* Selected Layout: Parallel Dual-Stream
* Composition Logic: Left column = Encoder... Right column = Decoder...

[ZONE 1: INPUT]
* Visual Structure: A stack of 3 realistic paper icons...
...

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
skills/visual-architect		skills/visual-architect
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
index.html		index.html
marketplace.json		marketplace.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Paper Visualizer (Anthropic Skill)

Introduction

Key Features

Gallery: Style Variants

Variant A: "The Textbook Standard" (Precision Focus)

Variant B: "The Tech Presentation" (Impact Focus)

Benchmark & Validation

Installation & Usage

How to Use

How It Works (The Prompt Engineering)

About

Uh oh!

Releases

Packages

Languages

License

WilsonWukz/paper-visualizer-skill

Folders and files

Latest commit

History

Repository files navigation

Paper Visualizer (Anthropic Skill)

Introduction

Key Features

Gallery: Style Variants

Variant A: "The Textbook Standard" (Precision Focus)

Variant B: "The Tech Presentation" (Impact Focus)

Benchmark & Validation

Installation & Usage

How to Use

How It Works (The Prompt Engineering)

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages