Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
microsoft
/
Phi-3-small-128k-instruct
like
171
Follow
Microsoft
5,476
Text Generation
Transformers
Safetensors
multilingual
phi3small
nlp
code
conversational
custom_code
License:
mit
Model card
Files
Files and versions
Community
34
Train
Use this model
refs/pr/33
Phi-3-small-128k-instruct
9 contributors
History:
11 commits
moidhassan
Fix for RuntimeError: FlashAttention only support fp16 and bf16 data type during fine tuning.
5b7216f
verified
2 months ago
.gitattributes
Safe
1.52 kB
chore(root): Initial files upload.
7 months ago
CODE_OF_CONDUCT.md
Safe
444 Bytes
Initial commit
7 months ago
LICENSE
Safe
1.08 kB
chore(root): Initial files upload.
7 months ago
NOTICE.md
Safe
1.77 kB
chore(root): Initial files upload.
7 months ago
README.md
Safe
19.4 kB
Update README.md
3 months ago
SECURITY.md
Safe
2.66 kB
Initial commit
7 months ago
cl100k_base.tiktoken
Safe
1.68 MB
chore(root): Initial files upload.
7 months ago
config.json
Safe
4.02 kB
Add attention_bias to make TGI work (#4)
6 months ago
configuration_phi3_small.py
Safe
12.3 kB
chore(root): Initial files upload.
7 months ago
generation_config.json
Safe
142 Bytes
chore(root): Initial files upload.
7 months ago
model-00001-of-00004.safetensors
Safe
4.83 GB
LFS
chore(root): Initial files upload.
7 months ago
model-00002-of-00004.safetensors
Safe
4.8 GB
LFS
chore(root): Initial files upload.
7 months ago
model-00003-of-00004.safetensors
Safe
4.8 GB
LFS
chore(root): Initial files upload.
7 months ago
model-00004-of-00004.safetensors
Safe
352 MB
LFS
chore(root): Initial files upload.
7 months ago
model.safetensors.index.json
Safe
32.1 kB
chore(root): Initial files upload.
7 months ago
modeling_phi3_small.py
Safe
48 kB
Move flash_attn assert from __init__ into calling func (#32)
3 months ago
positional_embedding.py
Safe
11.7 kB
Fix for RuntimeError: FlashAttention only support fp16 and bf16 data type during fine tuning.
2 months ago
special_tokens_map.json
Safe
99 Bytes
chore(root): Initial files upload.
7 months ago
tokenization_phi3_small.py
Safe
11.5 kB
Update tokenization_phi3_small.py (#14)
6 months ago
tokenizer_config.json
Safe
638 Bytes
chore(root): Initial files upload.
7 months ago
triton_blocksparse_attention_layer.py
Safe
7.2 kB
chore(root): Initial files upload.
7 months ago
triton_flash_blocksparse_attn.py
Safe
82.5 kB
Resolve - 196 [rank0]: triton.runtime.autotuner.OutOfResources: out of resource: shared memory, Required: 180224, Hardware limit: 101376. Reducing block sizes or `num_stages` may help.
2 months ago