π§ LFM2.5 Collection Collection of Instruct, Base, and Japanese LFM2.5-1.2B models. β’ 19 items β’ Updated about 23 hours ago β’ 43
nvidia/Llama-3.1-Nemotron-Safety-Guard-8B-v3 Text Generation β’ 8B β’ Updated Oct 28, 2025 β’ 599 β’ 8
DEER: Draft with Diffusion, Verify with Autoregressive Models Paper β’ 2512.15176 β’ Published 21 days ago β’ 42
Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs Paper β’ 2512.07525 β’ Published 30 days ago β’ 57
Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss Paper β’ 2512.23447 β’ Published 9 days ago β’ 93
Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows Paper β’ 2512.16969 β’ Published 20 days ago β’ 111
DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI Paper β’ 2512.16676 β’ Published 19 days ago β’ 202
Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer Paper β’ 2511.22699 β’ Published Nov 27, 2025 β’ 224
Running 96 The Eiffel Tower Llama π 96 Explore the Eiffel Tower Llama experiment with open-source models