aws-neuron/optimum-neuron-cache
License: apache-2.0
Branch: main
Contributors: 4
History: 13581 commits
Latest commit: "Synchronizing local compiler cache." by dacorvo (HF Staff), 27f802f (verified), about 12 hours ago
Name | Size | Last commit | Updated
inference-cache-config/ | | Add llama3 configurations with longer sequences | about 22 hours ago
neuronxcc-2.19.8089.0+8ab9f450/ | | Synchronizing local compiler cache. | about 1 month ago
neuronxcc-2.20.9961.0+0acef03a/ | | Synchronizing local compiler cache. | about 1 month ago
neuronxcc-2.21.18209.0+043b1bf7/ | | Synchronizing local compiler cache. | 9 days ago
neuronxcc-2.21.33363.0+82129205/ | | Synchronizing local compiler cache. | about 12 hours ago
neuronxcc-2.22.12471.0+b4a00d10/ | | Synchronizing local compiler cache. | 18 days ago
.gitattributes | 1.95 MB | Synchronizing local compiler cache. | about 12 hours ago
README.md | 1.27 kB | Add SageMaker deployment instructions | almost 2 years ago