-
OpenMoE: An Early Effort on Open Mixture-of-Experts Language Models
Paper • 2402.01739 • Published • 28 -
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
Paper • 2402.03300 • Published • 140 -
Rethinking Interpretability in the Era of Large Language Models
Paper • 2402.01761 • Published • 23 -
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference
Paper • 2412.13663 • Published • 159
Michel Chaduteau
michadu
·
AI & ML interests
None yet
Recent Activity
commented on
a paper
about 17 hours ago
The Devil Behind Moltbook: Anthropic Safety is Always Vanishing in Self-Evolving AI Societies
updated
a collection
4 months ago
LLM_papers
updated
a collection
about 2 years ago
LLM_papers
Organizations
None yet