Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
Low Horng Jiun
NickolasLow1
Follow
0 followers
ยท
10 following
AI & ML interests
None yet
Recent Activity
reacted
to
sergiopaniego
's
post
with ๐ฅ
11 days ago
New REPL environment in OpenEnv available! โจ Used in the Recursive Language Models (RLM) paper by Alex Zhang. Ready for inference & post-training using trajectories. Handles long contexts: > Run Python code in a sandbox > Make recursive calls to LMs > Explore data programmatically > Return final result Docs: https://meta-pytorch.org/OpenEnv/environments/repl/ Inference script: https://github.com/meta-pytorch/OpenEnv/blob/main/examples/repl_oolong_simple.py
reacted
to
sergiopaniego
's
post
with ๐
2 months ago
Interested in RL training environments? We just released a beginner-friendly walkthrough notebook! Train a model to play Wordle using TRL + OpenEnv (TextArena) + GRPO + vLLM. happy learning! ๐ฑ Notebook: https://github.com/huggingface/trl/blob/main/examples/notebooks/openenv_wordle_grpo.ipynb OpenEnv guide in TRL: https://huggingface.co/docs/trl/main/en/openenv
updated
a model
2 months ago
NickolasLow1/Qwen2.5-7B-Instruct
View all activity
Organizations
None yet
NickolasLow1
's Spaces
1
Sort:ย Recently updated
Sleeping
Qwen2.5 7B Instruct
๐
Visualize tracking metrics and media from experiments