QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management Paper • 2512.12967 • Published 17 days ago • 103
Kimi Linear: An Expressive, Efficient Attention Architecture Paper • 2510.26692 • Published Oct 30, 2025 • 119
Kimi Linear: An Expressive, Efficient Attention Architecture Paper • 2510.26692 • Published Oct 30, 2025 • 119
FuseRL: Dense Preference Optimization for Heterogeneous Model Fusion Paper • 2504.06562 • Published Apr 9, 2025
ThinkSwitcher: When to Think Hard, When to Think Fast Paper • 2505.14183 • Published May 20, 2025 • 1