huggingface-projects (Huggingface Projects)

pcuenq

updated a dataset about 4 hours ago

huggingface-projects/drlc-leaderboard-data

Viewer • Updated about 4 hours ago • 48.2k • 8.93k • 2

AdinaY

posted an update about 13 hours ago

Post

136

Wechat AI is shipping!

WeDLM 🔥 A new language model that generates tokens in parallel, making it faster than standard LLMs , with the same Transformer setup!
https://huggingface.co/collections/tencent/wedlm

✨ 7B/8B - Base & Instruct
✨ Apache 2.0

AdinaY

posted an update about 15 hours ago

Post

627

Qwen just released two new model series: Qwen3-VL-Embedding & Qwen3-VL-Reranker 🚀

✨ 2B / 8B - Apache2.0
✨ 30+ languages
✨ Supported text, images, screenshots, videos, and arbitrary multimodal combinations

Qwen3-VL-Embedding: Flexible vector sizes (64–2048)
https://huggingface.co/collections/Qwen/qwen3-vl-embedding
Qwen3-VL-Reranker: Built for recall>rerank pipelines
https://huggingface.co/collections/Qwen/qwen3-vl-reranker

sergiopaniego

posted an update about 16 hours ago

Post

608

New GRPO + TRL free Colab notebook out! 🔥

Fine-tune 7B+ models on T4 GPUs thanks to a ton of memory optimizations for GRPO

7B model uses only 9.2 GB VRAM (~7× reduction) 🤯

Try the notebook here 👉 https://colab.research.google.com/github/huggingface/trl/blob/main/examples/notebooks/sft_trl_lora_qlora.ipynb

akhaliq

submitted a paper to Daily Papers about 17 hours ago

ResTok: Learning Hierarchical Residuals in 1D Visual Tokenizers for Autoregressive Image Generation

Paper • 2601.03955 • Published 1 day ago • 1

sergiopaniego

updated a dataset about 20 hours ago

huggingface-projects/Deep-RL-Course-Certification

Viewer • Updated about 20 hours ago • 1.65k • 1.29k • 16

AdinaY

posted an update 1 day ago

Post

249

MOSS Transcribe Diarize 🔊 A multimodal model for Speaker-Attributed, Time-Stamped Transcription from OpenMOSS.

MOSS Transcribe Diarize: Accurate Transcription with Speaker Diarization (2601.01554)
OpenMOSS-Team/MOSS-transcribe-diarize

✨ Single-pass end-to-end SATS
✨ 128k context, ~90 min audio
✨ Robust to overlap & noise

1 reply

·

AdinaY

posted an update 2 days ago

Post

189

Daily Papers just got an AI reading assistant 🔥

You can ask any question you want: clarify a paragraph, get a short summary...all without leaving the page!

✨ Powered by HuggingChat + Hugging Face MCP server

victor

updated a Space 3 days ago

AI Video Composer - Natural Language FFMPEG

🏞

633

Describe what you want, AI writes the FFMPEG command

AdinaY

posted an update 4 days ago

Post

1746

Chinese open source AI in December 2025 was about the stack coming together: open, end to end, and ready to ship 🔥

https://huggingface.co/collections/zh-ai-community/december-2025-china-open-source-highlights

✨ Big wave of foundation models: still scaling, but efficiency, reasoning, and deployment now matter more than size
- DeepSeek-V3.2
- Z.ai GLM-4.7
- MiniMax-M2.1
- Xiaomi: MiMo-V2-Flash

✨ Multimodal reasoning is now default
- Z.ai GLM-4.6V
- Z.ai AutoGLM-Phone 9B
- Bytedance: Dolphin-v2

✨ Image & video: editable assets and real workflows
- Qwen-Image-Layered / Image-2512
- Meituan: LongCat-Image & Image Edit
- AIDC: Ovis-Image-7B
- Live-Avatar / LongCat-Video-Avatar
- HY-WorldPlay / RealVideo

✨ Audio goes edge ready
- GLM-ASR-Nano / Fun-ASR-Nano
- GLM-TTS / VoxCPM1.5
- CosyVoice 0.5B

✨ The quiet backbone: data & infrastructure
- Finch (FinWorkBench)
- Tencent ARC: TimeLens-100K
- BIGAI: TongSIM-Asset
- MiniMax: VTP-Base

✨ Also congrats on Minimax and Z.ai announced their IPOs and Moonshot announced a new $500M funding round 🔥

Like everyone else, I was OOO at the end of December, so feel free to share (in comments or PR) any I missed in this list!

pcuenq

posted an update 4 days ago

Post

2424

👉 What happened in AI in 2025? 👈

We prepared the 2025 version of the HF AI Timeline Grid, highlighting open vs API-based model releases, and allowing you to browse and filter by access, modality, and release type!

Play with it here:
2025-ai-timeline/2025-ai-timeline

Here's my personal quarterly TL;DR:

1️⃣ Q1 — Learning to Reason
Deepseek not only releases a top-notch reasoning model, but shows how to train them and compete with closed frontier models. OpenAI debuts Deep Research.

Significant milestones: DeepSeek R1 & R1-Zero, Qwen 2.5 VL, OpenAI Deep Research, Gemini 2.5 Pro (experimental)

2️⃣ Q2 — Multimodality and Coding
More LLMs embrace multimodality by default, and there's a surge in coding agents. Strong vision, audio, and generative models emerge.

Significant milestones: Llama 4, Qwen 3, Imagen 4, OpenAI Codex, Google Jules, Claude 4

3️⃣ Q3 — "Gold" rush, OpenAI opens up, the community goes bananas
Flagship models get gold in Math olympiads and hard benchmarks. OpenAI releases strong open source models and Google releases the much anticipated nano-banana for image generation and editing. Agentic workflows become commonplace.

Significant milestones: Gemini and OpenAI IMO Gold, gpt-oss, Gemini 2.5 Flash Image, Grok 4, Claude Sonnet 4.5

4️⃣ Q4 — Mistral returns, leaderboard hill-climbing
Mistral is back with updated model families. All labs release impressive models to wrap up the year!

Significant milestones: Claude Opus 4.5, DeepSeek Math V2, FLUX 2, GPT 5.1, Kimi K2 Thinking, Nano Banana Pro, GLM 4.7, Gemini 3, Mistral 3, MiniMax M2.1 🤯

Credits
🙏 NHLOCAL for the source data https://github.com/NHLOCAL/AiTimeline

🫡 @reach-vb for the original idea, design and recipe

🙌 @ariG23498 and yours truly for compiling and verifying the 2025 edition

🥳 Here's to 2026, wishing it becomes the best year ever for open releases and on-device-first use-cases! 🥂

1 reply

·

AdinaY

posted an update 4 days ago

Post

1864

MiniMax M2.1 blog is out🔥
https://huggingface.co/blog/MiniMaxAI/multilingual-and-multi-task-coding-with-strong-gen

Only a year into open source, MiniMax is already making a great impact. Not only through solid models/products, but also by how well the team uses community platforms like Hugging Face.

HF Teams, blogs, Daily Papers, Spaces as project pages, and always experimenting with new ways to engage. Super impressive!

AdinaY

posted an update 5 days ago

Post

3570

2025.1 - DeepSeek entered the scene, backed by High Flyer Quant
2026.1 - IQuest enters the game, backed by Uniquant Quant 📈 and launching IQuest-Coder on huggingface
https://huggingface.co/collections/IQuestLab/iquest-coder

✨ 40B models: Instruct / Thinking / Loop
✨ Loop = MoE-level performance with only ~5% extra training cost
✨ Native 128K context

1 reply

·

sergiopaniego

posted an update 7 days ago

Post

2456

The list of hands-on notebooks (some beginner-friendly!) to get started with fine-tuning using TRL keeps growing!!

• SFT
• GRPO
• Tool calling & agents
• RL environments with OpenEnv
• LLMs and VLMs
✨ Many run on FREE Colab, making it super easy to get started fast!

https://github.com/huggingface/trl/tree/main/examples/notebooks

akhaliq

submitted 2 papers to Daily Papers 7 days ago

FlowBlending: Stage-Aware Multi-Model Sampling for Fast and High-Fidelity Video Generation

Paper • 2512.24724 • Published 9 days ago • 6

Dream2Flow: Bridging Video Generation and Open-World Manipulation with 3D Object Flow

Paper • 2512.24766 • Published 9 days ago • 7

AdinaY

submitted a paper to Daily Papers 8 days ago

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published 9 days ago • 230

sergiopaniego

posted an update 10 days ago

Post

367

As the year comes to an end, it’s a good moment to catch up on some of the best long-form pieces published by the @huggingface team.

I’ve gathered them all here if you want to read or save them for later:
https://huggingface.co/collections/sergiopaniego/research-and-long-form-blog-posts

sergiopaniego

posted an update 11 days ago

Post

2210

This super detailed tutorial by @Paulescu is pure gold 🪙 "Fine-tuning a Small Language Model for browser control with GRPO and OpenEnv"

LFM2-350M ( @LiquidAI ) + BrowserGym (OpenEnv) + GRPO (TRL) for learning browser control 🤝

https://paulabartabajo.substack.com/p/fine-tuning-lfm2-350m-for-browser

sergiopaniego

posted an update 17 days ago

Post

427

if you’re on holidays 🎄 and want some reading, here are blogs I contributed to this year:

🦄 VLMs in TRL: https://huggingface.co/blog/trl-vlm-alignment
🦖 VLMs in 2025: https://huggingface.co/blog/vlms-2025
👾 tokenization in transformers v5: https://huggingface.co/blog/tokenizers
🛸 faster transformers: https://huggingface.co/blog/faster-transformers

Huggingface Projects

AI & ML interests

Recent Activity

huggingface-projects/drlc-leaderboard-data

ResTok: Learning Hierarchical Residuals in 1D Visual Tokenizers for Autoregressive Image Generation

huggingface-projects/Deep-RL-Course-Certification

AI Video Composer - Natural Language FFMPEG

FlowBlending: Stage-Aware Multi-Model Sampling for Fast and High-Fidelity Video Generation

Dream2Flow: Bridging Video Generation and Open-World Manipulation with 3D Object Flow

mHC: Manifold-Constrained Hyper-Connections

AI & ML interests

Recent Activity

Team members 20

huggingface-projects's activity

AI Video Composer - Natural Language FFMPEG