Towards the Aha Moment of Vision-Language Models
AI & ML interests
None defined yet.
Recent Activity
View all activity
models
9
MMInstruction/Qwen2-VL-72B-Video-T3
73B
•
Updated
•
13
MMInstruction/Giraffe
8B
•
Updated
•
8
•
2
MMInstruction/LongVA-7B-Video-T3
8B
•
Updated
•
7
MMInstruction/Qwen-VL-ArXivCap
Text Generation
•
Updated
•
34
•
4
MMInstruction/Qwen-VL-ArXivQA
Text Generation
•
Updated
•
24
•
4
MMInstruction/Silkie
Text Generation
•
Updated
•
29
•
12
MMInstruction/YingVLM
Updated
•
8
•
1
MMInstruction/YingVLM-zh
Updated
•
7
MMInstruction/YingVLM-Video
Updated
•
5
datasets
17
MMInstruction/stock_factors
Viewer
•
Updated
•
48.2M
•
816
MMInstruction/OSWorld-G
Viewer
•
Updated
•
510
•
155
•
6
MMInstruction/VL-RewardBench
Viewer
•
Updated
•
1.25k
•
616
•
14
MMInstruction/Video-T3-QA
Viewer
•
Updated
•
162k
•
164
•
2
MMInstruction/SuperClevr_Val
Viewer
•
Updated
•
5k
•
148
•
1
MMInstruction/Clevr_CoGenT_TrainA_R1
Viewer
•
Updated
•
37.8k
•
212
•
48
MMInstruction/Clevr_CoGenT_TrainA_70K_Complex
Viewer
•
Updated
•
70k
•
554
•
8
MMInstruction/Clevr_CoGenT_ValB
Viewer
•
Updated
•
5k
•
20
•
2
MMInstruction/Clevr_CoGenT_ValA
Viewer
•
Updated
•
5k
•
286
•
1
MMInstruction/Clevr_CoAgent_TrainA_R1
Viewer
•
Updated
•
2.5k
•
21