seonglae
/

llama-2-7b-chat-hf-gptq

Text Generation

Model card Files Files and versions

llama-2-7b-chat-hf-gptq

7.8 GB

1 contributor

History: 9 commits

seonglae's picture

build: AutoGPTQ for meta-llama/Llama-2-7b-chat-hf: 4bits, gr128, desc_act=False

bcece48 over 2 years ago

.gitattributes
1.52 kB

initial commit over 2 years ago
README.md
993 Bytes

Update README.md over 2 years ago
added_tokens.json
21 Bytes

build: AutoGPTQ for meta-llama/Llama-2-7b-chat-hf: 4bits, gr128, desc_act=False over 2 years ago
config.json
624 Bytes

build: AutoGPTQ for meta-llama/Llama-2-7b-chat-hf: 4bits, gr128, desc_act=False over 2 years ago
generation_config.json
192 Bytes

build: AutoGPTQ for meta-llama/Llama-2-7b-chat-hf: 4bits, gr128, desc_act=False over 2 years ago
gptq_model-4bit-128g.bin
3.9 GB
xet

build: AutoGPTQ for meta-llama/Llama-2-7b-chat-hf: 4bits, gr128, desc_act=False over 2 years ago
gptq_model-4bit-128g.safetensors
3.9 GB
xet

build: AutoGPTQ for meta-llama/Llama-2-7b-chat-hf: 4bits, gr128, desc_act=False over 2 years ago
quantize_config.json
224 Bytes

build: AutoGPTQ for meta-llama/Llama-2-7b-chat-hf: 4bits, gr128, desc_act=False over 2 years ago
special_tokens_map.json
414 Bytes

build: AutoGPTQ for meta-llama/Llama-2-7b-chat-hf: 4bits, gr128, desc_act=False over 2 years ago
tokenizer.json
1.84 MB

build: AutoGPTQ for meta-llama/Llama-2-7b-chat-hf: 4bits, gr128, desc_act=False over 2 years ago
tokenizer.model
500 kB
xet

build: AutoGPTQ for meta-llama/Llama-2-7b-chat-hf: 4bits, gr128, desc_act=False over 2 years ago
tokenizer_config.json
719 Bytes

build: AutoGPTQ for meta-llama/Llama-2-7b-chat-hf: 4bits, gr128, desc_act=False over 2 years ago