Skip to content

LYL1015/JarvisEvo

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

21 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

JarvisArt Icon

JarvisEvo: Towards a Self-Evolving Photo Editing Agent with Synergistic Editor-Evaluator Optimization

Paper Project Page Model Weights ArtEdit-Bench ๆœบๅ™จไน‹ๅฟƒ ้‡ๅญไฝ

Tencent Hunyuan, Xiamen University

*Equal Contributions โ€ Project Leader โ™ฃCorresponding Author
๐Ÿ’ก We also have other image editing agents that may interest you โœจ.

[NeurIPS' 2025] JarvisArt: Liberating Human Artistic Creativity via an Intelligent Photo Retouching Agent
Yunlong Lin, Zixu Lin and Kunjie Lin, etc.
github github arXiv Project Page Hugging Face Space

๐Ÿ“ฎ News

  • [2025.12.29] We are grateful for the coverage by ๆœบๅ™จไน‹ๅฟƒ (link) and ้‡ๅญไฝ (link). Thank you for the support!
  • [2025.12.16] ๐ŸŽ‰ JarvisEvo's project page, paper are now available!

๐ŸŽช Open-source Plan

  • Create repo and project page
  • Release Inference code and checkpoints
  • Release Agent-to-Lightroom Protocol (server-client communication protocol for multi-machine, multi-GPU training with distributed Lightroom instances)
  • Release ArtEdit-Bench
  • Release SFT training code
  • Release SEPO, RFT training code

๐Ÿงญ Table of Contents

๐Ÿงญ Overview

JarvisArt Teaser
JarvisEvo performs interleaved multimodal Chain-of-Thought (iMCoT) reasoning for image editing, which marries multi-step planning, dynamic tool orchestration, and iterative visual feedback. This closed-loop workflow incorporates self-evaluation and refinement to ensure the final output is both visually compelling and faithful to the creative vision. By seamlessly integrating professional tools like Adobe Lightroom for precision adjustments and Qwen-Image-Edit for generative tasks, the system achieves a unique synergy of expert- level refinement and creative synthesis.

๐Ÿ“ Key Features

JarvisArt Teaser

๐Ÿง  Interleaved Multimodal Chain-of-Thought (iMCoT)

Closed-Loop Reasoning: "Thinks" with both text and images, validating steps against visual feedback to minimize hallucinations and error propagation.

๐Ÿ”„ Synergistic Editor-Evaluator Optimization (SEPO)

Self-Evolving Framework: A dual-loop reinforcement learning system where the model acts as both editor and evaluator, refining strategies via intrinsic rewards without relying on static external models.

๐ŸŽจ Unified Preservative & Generative Editing

Comprehensive Toolset: Seamlessly integrates Adobe Lightroom (200+ tools) for precise adjustments and Qwen-Image-Edit for creative synthesis (object removal, style transfer), handling the full spectrum of editing tasks.

๐Ÿชž Self-Reflective Learning Mechanism

Autonomous Improvement: Automatically generates reflection trajectories upon suboptimal results, enabling the model to learn from mistakes and continuously optimize its tool selection logic.

๐Ÿ“Š Visual Comparison

JarvisEvo
Comparison with ChatGPT x Adobe Photoshop
JarvisEvo
Comparison with Leading Image Editing Models

๐Ÿ’ป Getting Started

For batch inference, please follow:

For training, please follow:

For evaluation, please follow:

For Agent-to-Lightroom Protocol Detail, please follow:

๐Ÿ™ Acknowledgements

We would like to express our gratitude to LLaMA-Factory for their valuable open-source contributions which have provided important technical references for our work.

๐ŸŒค๏ธ Discussion Group

If you have any questions during the trial, running or deployment, feel free to join our WeChat group discussion! If you have any ideas or suggestions for the project, you are also welcome to join our WeChat group discussion!

WeChat Group

Scan QR code to join WeChat group discussion

๐Ÿ“ง Contact

For any questions or inquiries, please reach out to us:


๐Ÿ“š Citation

If you find JarvisEvo useful in your research, please consider citing:

@article{lin2025jarvisevo,
  title={JarvisEvo: Towards a Self-Evolving Photo Editing Agent with Synergistic Editor-Evaluator Optimization},
  author={Lin, Yunlong and Wang, Linqing and Lin, Kunjie and Lin, Zixu and Gong, Kaixiong and Li, Wenbo and Lin, Bin and Li, Zhenxi and Zhang, Shiyi and Peng, Yuyang and others},
  journal={arXiv preprint arXiv:2511.23002},
  year={2025}
}

๐Ÿ“œ License

JarvisEvo is released under the Apache License 2.0.

About

๐Ÿ”ฅ JarvisEvo: Towards a Self-Evolving Photo Editing Agent with Synergistic Editor-Evaluator Optimization

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •  

Languages