KVzip: Query-Agnostic KV Cache Compression with Context Reconstruction Paper • 2505.23416 • Published May 29, 2025 • 12
AttentionPredictor: Temporal Pattern Matters for Efficient LLM Inference Paper • 2502.04077 • Published Feb 6, 2025 • 1
PaperDebugger: A Plugin-Based Multi-Agent System for In-Editor Academic Writing, Review, and Editing Paper • 2512.02589 • Published Dec 2, 2025 • 67
CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning Paper • 2512.02551 • Published Dec 2, 2025 • 12
WUSH: Near-Optimal Adaptive Transforms for LLM Quantization Paper • 2512.00956 • Published Nov 30, 2025 • 19
SpeContext: Enabling Efficient Long-context Reasoning with Speculative Context Sparsity in LLMs Paper • 2512.00722 • Published Nov 30, 2025 • 15
Rethinking Prompt Design for Inference-time Scaling in Text-to-Visual Generation Paper • 2512.03534 • Published Dec 3, 2025 • 20
UltraImage: Rethinking Resolution Extrapolation in Image Diffusion Transformers Paper • 2512.04504 • Published about 1 month ago • 17
SCALE: Selective Resource Allocation for Overcoming Performance Bottlenecks in Mathematical Test-time Scaling Paper • 2512.00466 • Published Nov 29, 2025 • 9
SR-GRPO: Stable Rank as an Intrinsic Geometric Reward for Large Language Model Alignment Paper • 2512.02807 • Published Dec 2, 2025 • 8
Steering Vision-Language-Action Models as Anti-Exploration: A Test-Time Scaling Approach Paper • 2512.02834 • Published Dec 2, 2025 • 40
ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning Paper • 2512.05111 • Published about 1 month ago • 47
Stabilizing Reinforcement Learning with LLMs: Formulation and Practices Paper • 2512.01374 • Published Dec 1, 2025 • 95
WorldMM: Dynamic Multimodal Memory Agent for Long Video Reasoning Paper • 2512.02425 • Published Dec 2, 2025 • 24
TV2TV: A Unified Framework for Interleaved Language and Video Generation Paper • 2512.05103 • Published about 1 month ago • 18