NanoQuant: Efficient Sub-1-Bit Quantization of Large Language Models Paper โข 2602.06694 โข Published 14 days ago โข 15 โข 5
TheBloke/WizardLM-Uncensored-Falcon-40B-GPTQ Text Generation โข 42B โข Updated Aug 21, 2023 โข 11 โข 60