philschmid/falcon-40b-instruct-GPTQ-inference-endpoints
Text Generation
Transformers
tiiuae/falcon-refinedweb
English
RefinedWeb
custom_code
Inference Endpoints
arxiv: 4 papers
License: apache-2.0
main
2 contributors
History: 6 commits
Latest commit: philschmid (HF staff), "Update handler.py", abdc7a2, over 1 year ago
| File | Scan | Size | Last commit | Updated |
| --- | --- | --- | --- | --- |
| .gitattributes | Safe | 1.48 kB | Duplicate from TheBloke/falcon-40b-instruct-GPTQ | over 1 year ago |
| README.md | Safe | 14.2 kB | Duplicate from TheBloke/falcon-40b-instruct-GPTQ | over 1 year ago |
| config.json | Safe | 721 Bytes | Duplicate from TheBloke/falcon-40b-instruct-GPTQ | over 1 year ago |
| configuration_RW.py | Safe | 2.51 kB | Duplicate from TheBloke/falcon-40b-instruct-GPTQ | over 1 year ago |
| generation_config.json | Safe | 111 Bytes | Duplicate from TheBloke/falcon-40b-instruct-GPTQ | over 1 year ago |
| gptq_model-4bit--1g.safetensors | Safe | 22.5 GB (LFS) | Duplicate from TheBloke/falcon-40b-instruct-GPTQ | over 1 year ago |
| handler.py | Safe | 1.5 kB | Update handler.py | over 1 year ago |
| modelling_RW.py | Safe | 47.1 kB | Duplicate from TheBloke/falcon-40b-instruct-GPTQ | over 1 year ago |
| quantize_config.json | Safe | 183 Bytes | Duplicate from TheBloke/falcon-40b-instruct-GPTQ | over 1 year ago |
| requirements.txt | Safe | 92 Bytes | Update requirements.txt | over 1 year ago |
| special_tokens_map.json | Safe | 281 Bytes | Duplicate from TheBloke/falcon-40b-instruct-GPTQ | over 1 year ago |
| tokenizer.json | Safe | 2.73 MB | Duplicate from TheBloke/falcon-40b-instruct-GPTQ | over 1 year ago |
| tokenizer_config.json | Safe | 220 Bytes | Duplicate from TheBloke/falcon-40b-instruct-GPTQ | over 1 year ago |