Official PyTorch implementation of "Video Summarization with Large Language Models" (CVPR 2025).
-
Updated
Oct 7, 2025 - Python
Official PyTorch implementation of "Video Summarization with Large Language Models" (CVPR 2025).
Unofficial mirror and notes for FastVLM (CVPR 2025) efficient vision encoder.
Unofficial PyTorch reproduction for OSDFace: One-Step Diffusion Model for Face Restoration.
Unofficial PyTorch reproduction for ReCapture: Generative Video Camera Controls for User-Provided Videos using Masked Video Fine-Tuning.
Unofficial PyTorch reproduction for DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation.
Unofficial PyTorch reproduction for DexGrasp Anything: Towards Universal Robotic Dexterous Grasping with Physics Awareness.
Unofficial PyTorch reproduction for 3DTopia-XL: Scaling High-quality 3D Asset Generation via Primitive Diffusion.
Unofficial PyTorch reproduction for URWKV: Unified RWKV Model with Multi-state Perspective for Low-light Image Restoration.
Unofficial PyTorch reproduction for UniRestore: Unified Perceptual and Task-Oriented Image Restoration Model Using Diffusion Prior.
Unofficial PyTorch reproduction for Dual Prompting Image Restoration with Diffusion Transformers.
Unofficial PyTorch reproduction for Exploring Simple Open-Vocabulary Semantic Segmentation.
Unofficial PyTorch reproduction for Reversing Flow for Image Restoration.
Unofficial PyTorch reproduction for VideoDirector: Precise Video Editing via Text-to-Video Models.
Unofficial PyTorch reproduction for Feat2GS: Probing Visual Foundation Models with Gaussian Splatting.
Unofficial PyTorch reproduction for Long Video Diffusion Generation with Segmented Cross-Attention and Content-Rich Video Data Curation.
Unofficial PyTorch reproduction for Tartan IMU: A Light Foundation Model for Inertial Positioning in Robotics.
Unofficial PyTorch reproduction for FlashGS: Efficient 3D Gaussian Splatting for Large-scale and High-resolution Rendering.
Unofficial PyTorch reproduction for DeSiRe-GS: 4D Street Gaussians for Static-Dynamic Decomposition and Surface Reconstruction for Urban Driving Scenes.
Unofficial PyTorch reproduction for Chat2SVG: Vector Graphics Generation with Large Language Models and Image Diffusion Models.
Unofficial PyTorch reproduction for SketchVideo: Sketch-based Video Generation and Editing.
Add a description, image, and links to the cvpr-2025 topic page so that developers can more easily learn about it.
To associate your repository with the cvpr-2025 topic, visit your repo's landing page and select "manage topics."