Haoze Wu's picture

3 20 3

Haoze Wu

WaitHZ

·

https://waithz.github.io/

AI & ML interests

Modular DL, Complex Reasoning

Recent Activity

upvoted a paper 29 days ago

InnoGym: Benchmarking the Innovation Potential of AI Agents

upvoted a paper 29 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

upvoted a paper about 1 month ago

DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning

View all activity

Organizations

upvoted 2 papers 29 days ago

InnoGym: Benchmarking the Innovation Potential of AI Agents

Paper • 2512.01822 • Published Dec 1, 2025 • 35

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published about 1 month ago • 242

upvoted a paper about 1 month ago

DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning

Paper • 2511.22570 • Published Nov 27, 2025 • 84

liked a model about 1 month ago

deepseek-ai/DeepSeek-V3.2

Text Generation • 685B • Updated Dec 1, 2025 • 115k • • 1.05k

upvoted a paper about 2 months ago

From f(x) and g(x) to f(g(x)): LLMs Learn New Skills in RL by Composing Old Ones

Paper • 2509.25123 • Published Sep 29, 2025 • 20

authored a paper 2 months ago

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Paper • 2510.25726 • Published Oct 29, 2025 • 45

upvoted a paper 2 months ago

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Paper • 2510.25726 • Published Oct 29, 2025 • 45

liked a dataset 2 months ago

hkust-nlp/Toolathlon-Trajectories

Preview • Updated 27 days ago • 3.79k • 18

upvoted a paper 2 months ago

LightMem: Lightweight and Efficient Memory-Augmented Generation

Paper • 2510.18866 • Published Oct 21, 2025 • 110

upvoted a collection 3 months ago

DeepSeek-V3.2

4 items • Updated Dec 1, 2025 • 511

updated 2 collections 4 months ago

MATH-Benchmark

5 items • Updated Sep 15, 2025

MATH-Training

2 items • Updated Sep 15, 2025

upvoted a paper 4 months ago

WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents

Paper • 2509.06501 • Published Sep 8, 2025 • 79

authored 2 papers 4 months ago

ReCode: Updating Code API Knowledge with Reinforcement Learning

Paper • 2506.20495 • Published Jun 25, 2025 • 9

Model-Task Alignment Drives Distinct RL Outcomes

Paper • 2508.21188 • Published Aug 28, 2025 • 8

upvoted a paper 4 months ago

Model-Task Alignment Drives Distinct RL Outcomes

Paper • 2508.21188 • Published Aug 28, 2025 • 8

updated a collection 6 months ago

ReCode

2 items • Updated Jul 21, 2025