hanspeterlyngsoeraaschoujensen/DeepScaleR-1.5B-lora-256-scaling_factor_5.0-mask_cosine_0.0 Updated Sep 19, 2025
hanspeterlyngsoeraaschoujensen/DeepScaleR-1.5B-lora-256-scaling_factor_5.0-mask_cosine_0.00_0.90 Updated Aug 29, 2025
hanspeterlyngsoeraaschoujensen/llm-finetune-DeepScaleR-1.5B-Preview-128-new-tokens-scaling-factor-5.0-mask-cosi Updated Aug 27, 2025
hanspeterlyngsoeraaschoujensen/Qwen3_1.7B-fineweb_edu-train-ctx2048_layer_4 Updated Sep 24, 2025 • 54
hanspeterlyngsoeraaschoujensen/Qwen3_1.7B-fineweb_edu-train-ctx2048_layer_2 Updated Sep 24, 2025 • 61
hanspeterlyngsoeraaschoujensen/Qwen3_1.7B-fineweb_edu-train-ctx2048_layer_0 Updated Sep 24, 2025 • 40
hanspeterlyngsoeraaschoujensen/Qwen3_0.6B-fineweb_edu-train-ctx2048_layer_4 Updated Sep 24, 2025 • 66
hanspeterlyngsoeraaschoujensen/Qwen3_0.6B-fineweb_edu-train-ctx2048_layer_2 Updated Sep 24, 2025 • 63
hanspeterlyngsoeraaschoujensen/Qwen3_0.6B-fineweb_edu-train-ctx2048_layer_0 Updated Sep 24, 2025 • 58
hanspeterlyngsoeraaschoujensen/OpenR1-Math-every_n_tokens250-spacy_segmenter-basic_strategy Updated Aug 26, 2025 • 93.5k • 32