nvidia/Llama-3_1-Nemotron-51B-Instruct

Text Generation

Can Llama-3.1-Nemotron-40B-Instruct be released as well?

#24 opened about 1 year ago by

What is the context size this model was trained on?

#23 opened about 1 year ago by

Modified llama.cpp to generate GGUFs for Llama-3_1-Nemotron-51

#22 opened about 1 year ago by

Documentation about the linear attention used in some layers of this model?

#21 opened about 1 year ago by

Comparison to the 70B model?

#20 opened about 1 year ago by

Update README.md

#11 opened over 1 year ago by

vLLM compatible?

#10 opened over 1 year ago by

AttributeError: 'DeciLMConfig'

#9 opened over 1 year ago by

fp8 / int8 inference - use bitsandbytes or awq

#8 opened over 1 year ago by

GGUF possible?

#5 opened over 1 year ago by

fine-tuning

#1 opened over 1 year ago by