Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
RedHatAI
/
NVIDIA-Nemotron-Nano-9B-v2-quantized.w4a16
like
5
Follow
Red Hat AI
1.88k
Text Generation
Transformers
Safetensors
PyTorch
6 languages
nemotron_h
nvidia
int4
quantized
llm-compressor
compressed-tensors
red hat
conversational
custom_code
arxiv:
2210.17323
License:
nvidia-open-model-license
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
NVIDIA-Nemotron-Nano-9B-v2-quantized.w4a16
6.47 GB
2 contributors
History:
9 commits
robgreenberg3
Update README.md
f61c1e1
verified
15 days ago
.gitattributes
1.57 kB
Upload folder using huggingface_hub
4 months ago
README.md
11.3 kB
Update README.md
15 days ago
config.json
2.44 kB
Update config.json
23 days ago
configuration_nemotron_h.py
12.2 kB
Upload folder using huggingface_hub
4 months ago
generation_config.json
158 Bytes
Upload folder using huggingface_hub
4 months ago
gsm8k_5shot.txt
50.2 kB
Upload folder using huggingface_hub
4 months ago
model-00001-of-00002.safetensors
4.97 GB
xet
Upload folder using huggingface_hub
4 months ago
model-00002-of-00002.safetensors
1.48 GB
xet
Upload folder using huggingface_hub
4 months ago
model.safetensors.index.json
49.2 kB
Upload folder using huggingface_hub
4 months ago
modeling_nemotron_h.py
78.8 kB
Upload folder using huggingface_hub
4 months ago
nemotron_toolcall_parser_no_streaming.py
3.72 kB
Upload folder using huggingface_hub
4 months ago
recipe.yaml
680 Bytes
Upload folder using huggingface_hub
4 months ago
special_tokens_map.json
422 Bytes
Upload folder using huggingface_hub
4 months ago
tokenizer.json
17.1 MB
xet
Upload folder using huggingface_hub
4 months ago
tokenizer_config.json
181 kB
Upload folder using huggingface_hub
4 months ago