[CVPR 2025] EgoLife: Towards Egocentric Life Assistant
-
Updated
Mar 19, 2025 - Python
[CVPR 2025] EgoLife: Towards Egocentric Life Assistant
[ICCV 2025] Explore the Limits of Omni-modal Pretraining at Scale
[NeurIPS 2025] Official Repo of Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration
[ICML 2026] AVTrack: Audio-Visual Tracking in Human-centric Complex Scenes
[ICML'26] Toward Human-like Audio-Visual Intelligence of Omni-MLLMs
📰 Must-read papers on Unified Multimodal Understanding and Generation (constantly updating 🤗).
Omni-Modality Processing, Understanding, and Generation
Track human speakers in complex scenes using this audio-visual instance segmentation dataset.
Omnimodal RWKV-v7 browser runtime — vision, audio, SheafMemory, AutopoieticOptimizer. Succeeded by EvaROSA.
Add a description, image, and links to the omnimodal topic page so that developers can more easily learn about it.
To associate your repository with the omnimodal topic, visit your repo's landing page and select "manage topics."