omnimodal

Star

Here are 10 public repositories matching this topic...

EvolvingLMMs-Lab / EgoLife

Star

[CVPR 2025] EgoLife: Towards Egocentric Life Assistant

rag egocentric-vision omnimodal

Updated Mar 19, 2025
Python

invictus717 / MiCo

Star

[ICCV 2025] Explore the Limits of Omni-modal Pretraining at Scale

deep-learning scale-up multimodal pretraining multimodal-large-language-models omnimodal

Updated Sep 2, 2024
Python

aim-uofa / Omni-R1

Star

[NeurIPS 2025] Official Repo of Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration

rl omnimodal mllms grpo neurips-2025

Updated Dec 3, 2025
Python

JaaackHongggg / WorldSense

Star

WorldSense: Evaluating Real-world Omnimodal Understanding for Multimodal LLMs

omnimodal mllms

Updated May 7, 2026
JavaScript

FudanCVL / AVTrack

Star

[ICML 2026] AVTrack: Audio-Visual Tracking in Human-centric Complex Scenes

tracking computer-vision segmentation audiovisual icml omnimodal multomodal

Updated May 14, 2026
Python

FudanCVL / AVI-Bench

Star

[ICML'26] Toward Human-like Audio-Visual Intelligence of Omni-MLLMs

agent benchmark avi reasoning icml multimodal audio-visual llm mllm omnimodal audio-visual-intelligence

Updated May 28, 2026
Python

RainBowLuoCS / Awesome-Unified-Multimodal-Understanding-and-Generation

Star

📰 Must-read papers on Unified Multimodal Understanding and Generation (constantly updating 🤗).

awesome-list papers any-to-any multimodal-large-language-models omnimodal

Updated Jun 13, 2025

kyegomez / CELESTIAL-1

Sponsor

Star

Omni-Modality Processing, Understanding, and Generation

openai attention multi-modal multimodality attention-is-all-you-need attention-mechanisms multimodal multimodal-deep-learning gpt-4 gpt4 omnimodal

Updated May 3, 2024
Python

marie-jeannesotho844 / AVTrack

Star

Track human speakers in complex scenes using this audio-visual instance segmentation dataset.

linux crawler database spider computer-vision hacking loading magnet-link segmentation magnet hacking-tool qtav icml javlibrary osint-python omnimodal multomodal

Updated Jun 16, 2026

ConsciousNode / HTMLNLM-Evangelion

Star

Omnimodal RWKV-v7 browser runtime — vision, audio, SheafMemory, AutopoieticOptimizer. Succeeded by EvaROSA.

javascript machine-learning single-file browser-native rwkv omnimodal

Updated May 26, 2026
HTML

Improve this page

Add a description, image, and links to the omnimodal topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the omnimodal topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

omnimodal

Here are 10 public repositories matching this topic...

EvolvingLMMs-Lab / EgoLife

invictus717 / MiCo

aim-uofa / Omni-R1

JaaackHongggg / WorldSense

FudanCVL / AVTrack

FudanCVL / AVI-Bench

RainBowLuoCS / Awesome-Unified-Multimodal-Understanding-and-Generation

kyegomez / CELESTIAL-1

marie-jeannesotho844 / AVTrack

ConsciousNode / HTMLNLM-Evangelion

Improve this page

Add this topic to your repo