Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
philschmid
/
falcon-40b-instruct-GPTQ-inference-endpoints
like
2
Text Generation
Transformers
tiiuae/falcon-refinedweb
English
RefinedWeb
custom_code
arxiv:
4 papers
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
falcon-40b-instruct-GPTQ-inference-endpoints
2 contributors
History:
6 commits
philschmid
Update handler.py
abdc7a2
almost 2 years ago
.gitattributes
Safe
1.48 kB
Duplicate from TheBloke/falcon-40b-instruct-GPTQ
almost 2 years ago
README.md
Safe
14.2 kB
Duplicate from TheBloke/falcon-40b-instruct-GPTQ
almost 2 years ago
config.json
Safe
721 Bytes
Duplicate from TheBloke/falcon-40b-instruct-GPTQ
almost 2 years ago
configuration_RW.py
Safe
2.51 kB
Duplicate from TheBloke/falcon-40b-instruct-GPTQ
almost 2 years ago
generation_config.json
Safe
111 Bytes
Duplicate from TheBloke/falcon-40b-instruct-GPTQ
almost 2 years ago
gptq_model-4bit--1g.safetensors
Safe
22.5 GB
LFS
Duplicate from TheBloke/falcon-40b-instruct-GPTQ
almost 2 years ago
handler.py
Safe
1.5 kB
Update handler.py
almost 2 years ago
modelling_RW.py
Safe
47.1 kB
Duplicate from TheBloke/falcon-40b-instruct-GPTQ
almost 2 years ago
quantize_config.json
Safe
183 Bytes
Duplicate from TheBloke/falcon-40b-instruct-GPTQ
almost 2 years ago
requirements.txt
Safe
92 Bytes
Update requirements.txt
almost 2 years ago
special_tokens_map.json
Safe
281 Bytes
Duplicate from TheBloke/falcon-40b-instruct-GPTQ
almost 2 years ago
tokenizer.json
Safe
2.73 MB
Duplicate from TheBloke/falcon-40b-instruct-GPTQ
almost 2 years ago
tokenizer_config.json
Safe
220 Bytes
Duplicate from TheBloke/falcon-40b-instruct-GPTQ
almost 2 years ago