Low Horng Jiun's picture

1

Low Horng Jiun

NickolasLow1

·

AI & ML interests

None yet

Recent Activity

reacted to sergiopaniego's post with 🔥 11 days ago

New REPL environment in OpenEnv available! ✨ Used in the Recursive Language Models (RLM) paper by Alex Zhang. Ready for inference & post-training using trajectories. Handles long contexts: > Run Python code in a sandbox > Make recursive calls to LMs > Explore data programmatically > Return final result Docs: https://meta-pytorch.org/OpenEnv/environments/repl/ Inference script: https://github.com/meta-pytorch/OpenEnv/blob/main/examples/repl_oolong_simple.py

reacted to sergiopaniego's post with 👍 2 months ago

Interested in RL training environments? We just released a beginner-friendly walkthrough notebook! Train a model to play Wordle using TRL + OpenEnv (TextArena) + GRPO + vLLM. happy learning! 🌱 Notebook: https://github.com/huggingface/trl/blob/main/examples/notebooks/openenv_wordle_grpo.ipynb OpenEnv guide in TRL: https://huggingface.co/docs/trl/main/en/openenv

updated a model 2 months ago

NickolasLow1/Qwen2.5-7B-Instruct

View all activity

Organizations

None yet

NickolasLow1 's Spaces 1

Qwen2.5 7B Instruct

Visualize tracking metrics and media from experiments