Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
philschmid
/
falcon-40b-instruct-GPTQ-inference-endpoints
like
2
Text Generation
Transformers
tiiuae/falcon-refinedweb
English
RefinedWeb
custom_code
text-generation-inference
Inference Endpoints
arxiv:
4 papers
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
falcon-40b-instruct-GPTQ-inference-endpoints
2 contributors
History:
6 commits
philschmid
HF staff
Update handler.py
abdc7a2
over 1 year ago
.gitattributes
1.48 kB
Duplicate from TheBloke/falcon-40b-instruct-GPTQ
over 1 year ago
README.md
14.2 kB
Duplicate from TheBloke/falcon-40b-instruct-GPTQ
over 1 year ago
config.json
721 Bytes
Duplicate from TheBloke/falcon-40b-instruct-GPTQ
over 1 year ago
configuration_RW.py
2.51 kB
Duplicate from TheBloke/falcon-40b-instruct-GPTQ
over 1 year ago
generation_config.json
111 Bytes
Duplicate from TheBloke/falcon-40b-instruct-GPTQ
over 1 year ago
gptq_model-4bit--1g.safetensors
22.5 GB
LFS
Duplicate from TheBloke/falcon-40b-instruct-GPTQ
over 1 year ago
handler.py
1.5 kB
Update handler.py
over 1 year ago
modelling_RW.py
47.1 kB
Duplicate from TheBloke/falcon-40b-instruct-GPTQ
over 1 year ago
quantize_config.json
183 Bytes
Duplicate from TheBloke/falcon-40b-instruct-GPTQ
over 1 year ago
requirements.txt
92 Bytes
Update requirements.txt
over 1 year ago
special_tokens_map.json
281 Bytes
Duplicate from TheBloke/falcon-40b-instruct-GPTQ
over 1 year ago
tokenizer.json
2.73 MB
Duplicate from TheBloke/falcon-40b-instruct-GPTQ
over 1 year ago
tokenizer_config.json
220 Bytes
Duplicate from TheBloke/falcon-40b-instruct-GPTQ
over 1 year ago