Jiafei Lyu

dmux

https://dmksjfl.github.io/

dmksjfl

AI & ML interests

Reinforcement Learning

Recent Activity

liked a model 22 days ago

biang889/ProAct

upvoted a paper 1 day ago

Exploratory Memory-Augmented LLM Agent via Hybrid On- and Off-Policy Optimization

Paper • 2602.23008 • Published 2 days ago • 27

upvoted a paper 22 days ago

ProAct: Agentic Lookahead in Interactive Environments

Paper • 2602.05327 • Published 23 days ago • 25

upvoted a paper about 2 months ago

Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models

Paper • 2512.24618 • Published Dec 31, 2025 • 151

upvoted a paper 3 months ago

EntroPIC: Towards Stable Long-Term Training of LLMs via Entropy Stabilization with Proportional-Integral Control

Paper • 2511.15248 • Published Nov 19, 2025 • 7