VIBE (Visual Instruction Based Editor) implementation for ComfyUI. Edit images using natural language instructions.

ComfyUI-VIBE 🎨


Implementation of VIBE (Visual Instruction Based Editor) as a custom node for ComfyUI. Edit images using natural language instructions (e.g., "make it winter", "change the dog to a cat").

VIBE leverages the efficient Sana1.5-1.6B diffusion model and Qwen3-VL-2B-Instruct for fast, high-quality image manipulation.

✨ Features

  • Instruction-based Editing: No complex prompting required.
  • Latent Support: Connect an Empty Latent Image to define output resolution.
  • Fast Inference: Powered by Sana1.5's linear attention.
  • Local Model Support: Runs entirely on your hardware.

⚡ Easy Installation (via ComfyUI Manager)

  1. Load the example workflow (see the Example Workflow section) into ComfyUI.
  2. Open ComfyUI Manager and click "Install Missing Custom Nodes".
  3. Restart ComfyUI after the installation finishes.
  4. Once reloaded, locate the VIBE Image Editor node (in Step 2) and click the "Check / Download Model" button. This will automatically download the necessary weights.

🖼️ Example Workflow

Drag and drop this image into ComfyUI to load the workflow:

[Image: ComfyUI VIBE Workflow]


🚀 Installation

  1. Clone the repository: Navigate to your ComfyUI/custom_nodes folder and run:

    git clone https://github.com/ato-zen/ComfyUI-VIBE
  2. Install dependencies: Open a terminal in the ComfyUI-VIBE folder and run:

    pip install -r requirements.txt

📂 Model Setup

This node looks for models in: ComfyUI/models/vibe/

  1. Create the target folder (run from the ComfyUI root directory):

    mkdir -p models/vibe
    cd models/vibe
  2. Download the weights: Clone from Hugging Face (requires git-lfs):

    git clone https://huggingface.co/iitolstykh/VIBE-Image-Edit
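If git-lfs is not available, the same repository can probably be fetched from Python instead. The sketch below is not part of this project: it assumes the `huggingface_hub` package is installed, and the helper names `vibe_model_dir` and `download_vibe` are hypothetical. It only mirrors the folder layout described in this section.

```python
from pathlib import Path

# Repository named in the git clone command above.
REPO_ID = "iitolstykh/VIBE-Image-Edit"


def vibe_model_dir(comfy_root):
    """Return the folder the node is documented to read from."""
    return Path(comfy_root) / "models" / "vibe" / "VIBE-Image-Edit"


def download_vibe(comfy_root):
    """Download the weights into models/vibe/VIBE-Image-Edit (hypothetical helper)."""
    # Imported lazily so this file still loads when huggingface_hub is missing.
    from huggingface_hub import snapshot_download

    target = vibe_model_dir(comfy_root)
    target.mkdir(parents=True, exist_ok=True)
    # local_dir places the files directly in the target folder.
    snapshot_download(repo_id=REPO_ID, local_dir=str(target))
    return target
```

Calling `download_vibe("path/to/ComfyUI")` should leave the files where the git clone would have put them. The "Check / Download Model" button described earlier remains the documented route.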

The resulting structure should look like:

📂 ComfyUI/
└── 📂 models/
    └── 📂 vibe/
        └── 📂 VIBE-Image-Edit/
            ├── model_index.json    
            ├── 📂 scheduler/
            ├── 📂 text_encoder/
            ├── 📂 tokenizer/
            ├── 📂 transformer/
            └── 📂 vae/
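A quick way to confirm the layout is to check for each entry in the tree above. This is a minimal sketch, not part of the node itself: the function name `missing_vibe_entries` is hypothetical, and it only tests that the entries exist, not that the weights inside them are complete.

```python
from pathlib import Path

# Entries taken from the directory tree above.
EXPECTED_FILES = ("model_index.json",)
EXPECTED_DIRS = ("scheduler", "text_encoder", "tokenizer", "transformer", "vae")


def missing_vibe_entries(comfy_root):
    """Return expected entries that are absent under models/vibe/VIBE-Image-Edit."""
    model_dir = Path(comfy_root) / "models" / "vibe" / "VIBE-Image-Edit"
    # Files first, then folders (folders are reported with a trailing slash).
    missing = [name for name in EXPECTED_FILES if not (model_dir / name).is_file()]
    missing += [name + "/" for name in EXPECTED_DIRS if not (model_dir / name).is_dir()]
    return missing
```

Calling `missing_vibe_entries("path/to/ComfyUI")` returns an empty list when every entry is present. Any names it returns point to an incomplete clone, which can happen when git-lfs was not installed before cloning.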

📜 Credits & Acknowledgements

ComfyUI Node implementation by ato-zen.
