microsoft/bitnet-b1.58-2B-4T • Text Generation • 0.8B • Updated • 6.21k • 1.28k
Generate high-quality text data for LLMs using FineWeb
The ultimate guide to training LLMs on large GPU clusters
Calculate memory usage for model configurations