Hugging Face
Xu Zhihao
naiweizi
3 followers · 0 following
AI & ML interests
Trustworthy AI
Recent Activity
Authored a paper (about 15 hours ago): AOrchestra: Automating Sub-Agent Creation for Agentic Orchestration
Upvoted a paper (3 days ago): AOrchestra: Automating Sub-Agent Creation for Agentic Orchestration
Upvoted a paper (4 days ago): ASTRA: Automated Synthesis of agentic Trajectories and Reinforcement Arenas
Organizations
None yet
naiweizi's models (12)
naiweizi/r1-qwen-7b-sft_meta • 8B • Updated Nov 21, 2025
naiweizi/R1-Qwen-7B-SFT-Meta • Updated Nov 21, 2025
naiweizi/R1-Qwen-1_5B-Cold_Start-OpenR1_Math-priority • 2B • Updated Jul 18, 2025
naiweizi/dpo-harmless_saferlhf • Updated Jun 18, 2025
naiweizi/mistral-dpo-helpful-vanilla-1e-4 • Updated May 6, 2025
naiweizi/mistral-dpo-harmless-vanilla-2e-4 • Updated May 6, 2025
naiweizi/test • Text Generation • 8B • Updated Apr 21, 2025 • 1
naiweizi/dpo-harmless_helpful-vanilla • Updated Apr 14, 2025
naiweizi/dpo-harmless_helpful-rc_armo • Updated Apr 14, 2025
naiweizi/dpo-harmless_helpful-mixed • Updated Apr 14, 2025
naiweizi/dpo-harmless_helpful-rc_armo_mistral • Updated Apr 14, 2025
naiweizi/qwen2.5-instruct-sft_helpsteer2 • 8B • Updated Mar 14, 2025 • 1