Ivy
FURUF
AI & ML interests
NLP RL
Recent Activity
upvoted
a
paper
1 day ago
Shaping capabilities with token-level data filtering
upvoted
a
paper
3 days ago
Reinforcement Learning via Self-Distillation
upvoted
a
paper
23 days ago
DiffCoT: Diffusion-styled Chain-of-Thought Reasoning in LLMs
Organizations
None yet