do

cocodd

AI & ML interests

None yet

Recent Activity

liked a Space about 2 months ago

HuggingFaceTB/smol-training-playbook

liked a Space 2 months ago

HuggingFaceH4/on-policy-distillation

liked a dataset 4 months ago

withmartian/routerbench

View all activity

Organizations

None yet

liked a Space about 2 months ago

The Smol Training Playbook

📚

2.73k

The secrets to building world-class LLMs

liked a Space 2 months ago

Unlocking On-Policy Distillation for Any Model Family

📝

Apply on-policy distillation to any model family

liked a dataset 4 months ago

withmartian/routerbench

Updated Mar 27, 2024 • 269 • 21

upvoted a paper 5 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24 • 316

liked a model 5 months ago

Qwen/Qwen-Image

Text-to-Image • Updated Aug 18 • 247k • • 2.31k

liked a Space 5 months ago

The Ultra-Scale Playbook

🌌

3.6k

The ultimate guide to training LLM on large GPU Clusters

liked 4 datasets 6 months ago

upvoted a paper 7 months ago

BizFinBench: A Business-Driven Real-World Financial Benchmark for Evaluating LLMs

Paper • 2505.19457 • Published May 26 • 64

liked a Space 10 months ago

FineWeb: decanting the web for the finest text data at scale

🍷

1.24k

Generate high-quality text data for LLMs using FineWeb

do