Article • GGML and llama.cpp join HF to ensure the long-term progress of Local AI • 8 days ago • 469
Article • How to Use Multiple GPUs in Hugging Face Transformers: Device Map vs Tensor Parallelism • 16 days ago • 16
Article • Introducing Daggr: Chain apps programmatically, inspect visually • about 1 month ago • 103
Article • Small Yet Mighty: Improve Accuracy In Multimodal Search and Visual Document Retrieval with Llama Nemotron RAG Models • Jan 6 • 25
Article • The Engineering Handbook for GRPO + LoRA with Verl: Training Qwen2.5 on Multi-GPU • Jan 2 • 15
Article • Tokenization in Transformers v5: Simpler, Clearer, and More Modular • Dec 18, 2025 • 120
Article • Transformers v5: Simple model definitions powering the AI ecosystem • Dec 1, 2025 • 302
RF-DETR: Neural Architecture Search for Real-Time Detection Transformers Paper • 2511.09554 • Published Nov 12, 2025 • 9
Article • Llasa Goes RL: Training LLaSA with GRPO for Improved Prosody and Expressiveness • Nov 5, 2025 • 12