Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
15
23
19
Wei Xiong
weqweasdas
Follow
dangkai-nk's profile picture
xinyut's profile picture
Laihaoran's profile picture
20 followers
·
21 following
https://weixiongust.github.io/WeiXiongUST/index.html
AI & ML interests
Machine learning, RLHF
Organizations
weqweasdas
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a model
12 months ago
RLHFlow/Llama3.1-8B-PRM-Deepseek-Data
Text Generation
•
8B
•
Updated
May 10, 2025
•
3.9k
•
•
37
liked
a dataset
about 1 year ago
RLHFlow/RLHFlow-SFT-Dataset-ver2
Viewer
•
Updated
Nov 2, 2024
•
2.32M
•
62
•
5
liked
a model
about 1 year ago
RLHFlow/Llama3.1-8B-PRM-Mistral-Data
Text Generation
•
8B
•
Updated
Nov 9, 2024
•
247
•
•
10
liked
10 models
over 1 year ago
NCSOFT/Llama-3-OffsetBias-RM-8B
Text Classification
•
8B
•
Updated
Sep 6, 2024
•
29
•
23
RLHFlow/LLaMA3-SFT
Text Generation
•
8B
•
Updated
Nov 3, 2024
•
70
•
•
10
RLHFlow/LLaMA3-iterative-DPO-final
Text Generation
•
8B
•
Updated
Oct 14, 2024
•
39
•
•
41
RLHFlow/ArmoRM-Llama3-8B-v0.1
Text Classification
•
8B
•
Updated
Sep 23, 2024
•
12.2k
•
184
RLHFlow/pair-preference-model-LLaMA3-8B
Text Generation
•
8B
•
Updated
Oct 14, 2024
•
59
•
•
38
Salesforce/LLaMA-3-8B-SFR-RM-R
Text Classification
•
8B
•
Updated
Jan 21, 2025
•
23
•
11
Salesforce/LLaMA-3-8B-SFR-SFT-R
Text Generation
•
8B
•
Updated
Jan 21, 2025
•
11
•
8
Salesforce/LLaMA-3-8B-SFR-Iterative-DPO-R
Text Generation
•
8B
•
Updated
Jan 21, 2025
•
49
•
78
sfairXC/FsfairX-LLaMA3-RM-v0.1
Text Classification
•
8B
•
Updated
Oct 14, 2024
•
775
•
60
sfairXC/FsfairX-Zephyr-Chat-v0.1
Text Generation
•
7B
•
Updated
Apr 24, 2024
•
15
•
8
liked
a model
almost 2 years ago
weqweasdas/RM-Mistral-7B
Text Classification
•
7B
•
Updated
Mar 31, 2024
•
942
•
24
liked
a Space
almost 2 years ago
Running
417
Reward Bench Leaderboard
📐
417
Display and analyze reward model evaluation results
liked
2 models
almost 2 years ago
weqweasdas/RM-Gemma-7B
Text Classification
•
9B
•
Updated
Mar 22, 2024
•
38
•
8
weqweasdas/RM-Gemma-2B
Text Classification
•
3B
•
Updated
Mar 22, 2024
•
469
•
25
liked
a model
over 2 years ago
weqweasdas/hh_rlhf_rm_open_llama_3b
Text Classification
•
Updated
Feb 25, 2024
•
100
•
17
liked
a Space
over 2 years ago
Runtime error
Featured
66
Robin 7b
🔥
66