ThinkPRM
Process Reward Models that Think -- https://arxiv.org/abs/2504.16828

launch/ThinkPRM-1.5B • Text Generation • 2B • Updated Jun 25, 2025 • 75 • 3
launch/ThinkPRM-7B • Text Generation • 8B • Updated May 17, 2025 • 9 • 1
launch/ThinkPRM-14B • Text Generation • 15B • Updated Jul 1, 2025 • 205 • 5
mradermacher/ThinkPRM-7B-i1-GGUF • 8B • Updated Jul 11, 2025 • 2.21k