vincent's picture

vincent

Realcat

·

AI & ML interests

Focusing on SLAM, SfM, and Visual Localization.

Recent Activity

liked a Space 2 days ago

lch01/StreamVGGT

liked a model 2 days ago

lch01/StreamVGGT

upvoted a paper 2 days ago

InfiniteVGGT: Visual Geometry Grounded Transformer for Endless Streams

View all activity

Organizations

None yet

upvoted a paper 2 days ago

InfiniteVGGT: Visual Geometry Grounded Transformer for Endless Streams

Paper • 2601.02281 • Published 19 days ago • 33

upvoted a paper 2 months ago

RoMa v2: Harder Better Faster Denser Feature Matching

Paper • 2511.15706 • Published Nov 19, 2025 • 8

upvoted a collection 3 months ago

Emu3.5

Native Multimodal Models are World Learners 🌍 • 4 items • Updated about 1 month ago • 73

upvoted a paper 6 months ago

EarthCrafter: Scalable 3D Earth Generation via Dual-Sparse Latent Diffusion

Paper • 2507.16535 • Published Jul 22, 2025 • 21

upvoted a paper 8 months ago

Probing the 3D Awareness of Visual Foundation Models

Paper • 2404.08636 • Published Apr 12, 2024 • 14

upvoted an article 9 months ago

Article

Vision Language Models Explained

Apr 11, 2024

•

511

upvoted 2 collections 10 months ago

VideoChat-R1

VideoChat-R1: Enhancing Spatio-Temporal Perception via Reinforcement Fine-Tuning • 4 items • Updated Sep 28, 2025 • 8

Llama 4

Llama 4 release • 13 items • Updated Apr 29, 2025 • 686

upvoted an article about 1 year ago

Article

Use Models from the Hugging Face Hub in LM Studio

Nov 28, 2024

•

141

upvoted a collection about 1 year ago

Cosmos-Tokenizer

A suite of image and video tokenizers • 13 items • Updated 4 days ago • 43

upvoted a paper over 1 year ago

BetterDepth: Plug-and-Play Diffusion Refiner for Zero-Shot Monocular Depth Estimation

Paper • 2407.17952 • Published Jul 25, 2024 • 32