RedHatAI/starcoder2-15b-quantized.w8a16
Text Generation
• 4B • Updated
• 6
RedHatAI/Qwen3-Next-80B-A3B-Thinking-FP8-block
Text Generation
• 80B • Updated
• 13
RedHatAI/Phi-3-mini-128k-instruct-quantized.w8a8
Text Generation
• 4B • Updated
• 23
RedHatAI/Qwen3-Next-80B-A3B-Thinking-NVFP4
Text Generation
• Updated
• 230
RedHatAI/Qwen3-8B-FP8-block
Text Generation
• 8B • Updated
• 182
Text Generation
• 358B • Updated
• 3
RedHatAI/Kimi-K2-Thinking
Text Generation
• Updated
• 5
RedHatAI/Llama-3.1-8B-Instruct-speculator.eagle3
Text Generation
• 1.0B • Updated
• 11.7k
• 1
RedHatAI/Qwen3-30B-A3B-Instruct-2507.w4a16
Text Generation
• 5B • Updated
• 851
• 1
RedHatAI/Qwen3-30B-A3B-Instruct-2507-speculator.eagle3
Text Generation
• 0.5B • Updated
• 127k
• 1
RedHatAI/Llama-3.3-70B-Instruct-FP8-dynamic
Text Generation
• 71B • Updated
• 55.5k
• 14
RedHatAI/Meta-Llama-3.1-8B-Instruct-FP8-dynamic
Text Generation
• 8B • Updated
• 19k
• 9
RedHatAI/Qwen3-Next-80B-A3B-Instruct-NVFP4
Text Generation
• Updated
• 388
• 5
RedHatAI/Qwen3-VL-32B-Instruct-NVFP4
Text Generation
• 20B • Updated
• 15.5k
• 5
RedHatAI/Qwen3-VL-32B-Instruct-FP8-dynamic
Text Generation
• 33B • Updated
• 361
• 1
RedHatAI/Qwen3-VL-32B-Instruct-FP8-block
Text Generation
• 33B • Updated
• 23
RedHatAI/Llama-4-Maverick-17B-128E-Instruct-NVFP4
Text Generation
• 229B • Updated
• 619
• 2
RedHatAI/Qwen3-30B-A3B-NVFP4
Text Generation
• 17B • Updated
• 11.2k
• 2
RedHatAI/Qwen3-235B-A22B-NVFP4
Text Generation
• 136B • Updated
• 18
• 1
RedHatAI/Qwen3-235B-A22B-Instruct-2507-NVFP4
Text Generation
• 136B • Updated
• 1.12k
• 4
RedHatAI/Mistral-Small-3.2-24B-Instruct-2506-NVFP4
Text Generation
• 14B • Updated
• 14k
• 6
RedHatAI/Qwen3-235B-A22B-Instruct-2507-speculator.eagle3
Text Generation
• 1B • Updated
• 1.03k
RedHatAI/gpt-oss-20b-speculator.eagle3
Text Generation
• 0.9B • Updated
• 14.5k
• 7
RedHatAI/Llama-4-Maverick-17B-128E-Instruct-speculator.eagle3
RedHatAI/Qwen3-32B-speculator.eagle3
Text Generation
• 2B • Updated
• 6.26k
• 5
RedHatAI/Qwen3-14B-speculator.eagle3
Text Generation
• 1B • Updated
• 295
RedHatAI/Qwen3-8B-speculator.eagle3
Text Generation
• 1B • Updated
• 73.6k
• 2
RedHatAI/Llama-3.3-70B-Instruct-speculator.eagle3
Text Generation
• 2B • Updated
• 179
• 1
RedHatAI/Llama-3.3-70B-Instruct-NVFP4
Text Generation
• 41B • Updated
• 592
• 1
RedHatAI/Llama-3.1-70B-Instruct-NVFP4
Text Generation
• 41B • Updated
• 94