MCG-NJU/LongVPO-Stage2-InternVL3-8B • Video-Text-to-Text • 8B
Computer Vision; Video Understanding; Action Recognition
SteadyDancer: Harmonized and Coherent Human Image Animation with First-Frame Preservation
SAM 2++: Tracking Anything at Any Granularity