Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Kun LI's picture
2 6 14

Kun LI

inNexus

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago
TreePS-RAG: Tree-based Process Supervision for Reinforcement Learning in Agentic RAG
upvoted a paper 4 months ago
Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward
upvoted a paper 4 months ago
Reinforcement Learning on Pre-Training Data
View all activity

Organizations

None yet

Collections 1

NLP
  • Self-Evaluation Improves Selective Generation in Large Language Models

    Paper • 2312.09300 • Published Dec 14, 2023 • 16
  • Prefix Grouper: Efficient GRPO Training through Shared-Prefix Forward

    Paper • 2506.05433 • Published Jun 5, 2025 • 4
NLP
  • Self-Evaluation Improves Selective Generation in Large Language Models

    Paper • 2312.09300 • Published Dec 14, 2023 • 16
  • Prefix Grouper: Efficient GRPO Training through Shared-Prefix Forward

    Paper • 2506.05433 • Published Jun 5, 2025 • 4

models 0

None public yet

datasets 0

None public yet
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs