Aayush

Aayushfaced

AI & ML interests

None yet

Recent Activity

liked a model 21 days ago

openai-community/gpt2

upvoted an article about 2 months ago

We Got Claude to Fine-Tune an Open Source LLM

upvoted an article about 2 months ago

New in llama.cpp: Model Management


Organizations

None yet

liked a model 21 days ago

openai-community/gpt2

Text Generation • 0.1B • Updated Feb 19, 2024 • 7.01M • 3.11k

upvoted 2 articles about 2 months ago

We Got Claude to Fine-Tune an Open Source LLM

Article • Dec 4, 2025 • 587

New in llama.cpp: Model Management

Article • Dec 11, 2025 • 116

upvoted 2 collections 2 months ago

NVIDIA Nemotron V2

Collection

Open, Production-ready Enterprise Models. Nvidia Open Model license. • 9 items • Updated 5 days ago • 102

Inference Optimized Checkpoints (with Model Optimizer)

Collection

A collection of generative models quantized and optimized for inference with Model Optimizer. • 51 items • Updated 1 day ago • 82

upvoted an article 2 months ago

🤔👀🎬🖥️📖 Kimi-VL-A3B-Thinking-2506: A Quick Navigation

Article • Jun 21, 2025

liked a Space 3 months ago

The Smol Training Playbook

📚 • The secrets to building world-class LLMs • 2.95k

liked 3 Spaces 4 months ago

FineWeb: decanting the web for the finest text data at scale

🍷 • Generate high-quality text data for LLMs using FineWeb • 1.28k

Robot Learning: A Tutorial

📝 • Learn about modern robot learning techniques and applications • 322

The Ultra-Scale Playbook

🌌 • The ultimate guide to training LLM on large GPU Clusters • 3.67k

upvoted 2 papers 4 months ago

UI-S1: Advancing GUI Automation via Semi-online Reinforcement Learning

Paper • 2509.11543 • Published Sep 15, 2025 • 49

Scaling Agents via Continual Pre-training

Paper • 2509.13310 • Published Sep 16, 2025 • 117

liked a dataset 5 months ago

InternRobotics/OmniWorld

Viewer • Updated 27 days ago • 6.35B • 25.7k • 80

upvoted 7 papers 5 months ago

AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning

Paper • 2509.08755 • Published Sep 10, 2025 • 57

A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10, 2025 • 190

LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models

Paper • 2403.13372 • Published Mar 20, 2024 • 177

A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems

Paper • 2508.07407 • Published Aug 10, 2025 • 98
