# audiovisual-uderstanding

Here is 1 public repository matching this topic...

JavisVerse / JavisGPT

[NeurIPS'25 Spotlight] Official implementation of "JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation"

audio-video multimodal mllm multimodal-large-language-models sounding-video-generation joint-audio-video-generation audiovisual-uderstanding unified-mllm audiovisual-synchronization

Updated Jan 10, 2026
Language: Python
