ethzanalytics
/
mpt-7b-storywriter-sharded
Analytics Club at ETH Zürich
Text Generation
Transformers
PyTorch
the_pile_books3
English
mpt
mosaicML
sharded
story
custom_code
text-generation-inference
License:
apache-2.0
26347e8
mpt-7b-storywriter-sharded
/
modeling_mpt.py
Commit History
✨ gradient checkpointing
ae54cae
pszemraj
committed on
May 9, 2023
add MPTBlock to _no_split_modules
0688e28
pszemraj
committed on
May 8, 2023
format
76b1322
pszemraj
committed on
May 8, 2023
initial support for device_map=auto
304970e
pszemraj
committed on
May 8, 2023
add sharded checkpoint
7ab236e
peter szemraj
committed on
May 8, 2023