Zheng Zhang's picture

Zheng Zhang

qpz

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

upvoted a paper 7 months ago

Mol-R1: Towards Explicit Long-CoT Reasoning in Molecule Discovery

authored a paper 7 months ago

Seed1.5-Thinking: Advancing Superb Reasoning Models with Reinforcement Learning

View all activity

Organizations

upvoted a paper 2 days ago

CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

Paper • 2602.24286 • Published 5 days ago • 70

upvoted a paper 7 months ago

Mol-R1: Towards Explicit Long-CoT Reasoning in Molecule Discovery

Paper • 2508.08401 • Published Aug 11, 2025 • 42

authored a paper 7 months ago

Seed1.5-Thinking: Advancing Superb Reasoning Models with Reinforcement Learning

Paper • 2504.13914 • Published Apr 10, 2025 • 5

commented a paper 10 months ago

A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce

Paper • 2504.11343 • Published Apr 15, 2025 • 20 •

upvoted a paper 12 months ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18, 2025 • 144

New activity in deepseek-ai/DeepSeek-R1-Distill-Qwen-32B about 1 year ago

Generate crashed by repeatedly generating <think>

#35 opened about 1 year ago by

liked a model over 2 years ago

CofeAI/FLM-101B

Text Generation • Updated Jan 10 • 17 • 92

updated 4 models about 3 years ago

ConvLab/gpt2-medium-nlg-tm1_tm2_tm3

Updated Dec 26, 2022

ConvLab/gpt2-medium-nlg-multiwoz21

Updated Dec 26, 2022

ConvLab/gpt2-medium-nlg-multiwoz21_sgd_tm1_tm2_tm3

Updated Dec 26, 2022

ConvLab/gpt2-medium-nlg-sgd

Updated Dec 26, 2022