arxiv:2504.00891
Jiafei Lyu
dmux
AI & ML interests
Reinforcement Learning
Recent Activity
upvoted a paper about 8 hours ago
Exploratory Memory-Augmented LLM Agent via Hybrid On- and Off-Policy Optimization liked
a model 21 days ago
biang889/ProAct upvoted a paper 21 days ago
ProAct: Agentic Lookahead in Interactive Environments