Inference Optimization
community
models 92
inference-optimization/test_qwen3_next_mtp • Updated • 6
inference-optimization/test_tencentbac_fastmtp • Updated • 5
inference-optimization/Qwen3-30B-A3B-Instruct-2507-NVFP4 • 17B • Updated • 14
inference-optimization/Qwen3-30B-A3B-Instruct-2507-FP8-Dynamic • 31B • Updated • 14
inference-optimization/Qwen3-30B-A3B-Instruct-2507-FP8-Block • 31B • Updated • 11
inference-optimization/Qwen3-30B-A3B-Instruct-2507-6.5bits • 25B • Updated • 12
inference-optimization/Qwen3-30B-A3B-Instruct-2507-5bits • 20B • Updated • 13
inference-optimization/Qwen3-30B-A3B-Instruct-2507-5.5bits • 22B • Updated • 11
inference-optimization/Qwen3-30B-A3B-Instruct-2507-7bits • 27B • Updated • 15
inference-optimization/Qwen3-30B-A3B-Instruct-2507-6.75bits • 26B • Updated • 15
datasets 0
None public yet