Using the fine-tuned Mistral-7B-v0.1 for inference, when encountering the backslash escape character '\', the inference stalls, but after a few minutes, it continues generating."
#173 opened 24 days ago
by
xp123
Create mistral
#172 opened 29 days ago
by
Marselgarifullin888
Mistral AI Research
4
#171 opened 29 days ago
by
Siva82
Please grant me access
#170 opened about 1 month ago
by
BugFixer
MISTRAL7B-MODEL TYPE
#169 opened about 1 month ago
by
adnanbaig
Update README.md
#168 opened about 2 months ago
by
allakri
please grant me access
1
#167 opened about 2 months ago
by
Alex341x1
Help in downloading Mistral 7B
1
#166 opened 2 months ago
by
AmitKumarMallick1980
unexpected response even it is Fine tuned with custom dataset
4
#165 opened 3 months ago
by
coder05
[AUTOMATED] Model Memory Requirements
#164 opened 3 months ago
by
model-sizer-bot
[AUTOMATED] Model Memory Requirements
#163 opened 3 months ago
by
model-sizer-bot
When testing model API using curl, It crashes.
#162 opened 4 months ago
by
yashhirulkar
Exception: data did not match any variant of untagged enum PyPreTokenizerTypeWrapper at line 40 column 3
5
#161 opened 4 months ago
by
Naiel
Updating all readmes to be up to date !
#160 opened 4 months ago
by
pandora-s
Update README.md
#159 opened 4 months ago
by
infinityinfluence
Getting weird (same) response everytime through Mistral7B
5
#157 opened 5 months ago
by
pawankumar-108
Request: DOI
1
#156 opened 5 months ago
by
SriK007
hamzaalgohary
#155 opened 5 months ago
by
hamzaalgohary
Mistral sliding_window implementation and flash_attn_func
#154 opened 5 months ago
by
SadRick
size of hidden layers and sliding window attention - dimension is the same, 4096. Is that for a reason?
2
#153 opened 5 months ago
by
keval-sha
Need help, Getting HfHubHTTPError 429 Client Error: Too Many Requests for url
1
#152 opened 6 months ago
by
ashishomi89
Request: DOI
#151 opened 6 months ago
by
irkan
Update README.md
#150 opened 6 months ago
by
WzY1924561588
Service Unavailable
#149 opened 6 months ago
by
glitterllama
Unable to access Mistral-7B-v0.1 from AWS sagemaker
#148 opened 6 months ago
by
Tecena
Cannot access get repo
9
#147 opened 6 months ago
by
Dav22
KeyError: 'base_model.model.model.layers.0.mlp.down_proj.lora_A.weight'
#146 opened 7 months ago
by
DevSelego
Inference: CUDA out of memory error
#145 opened 7 months ago
by
Tecena
Issue with HuggingFace pipeline with RouterOutputParser OutputParserException: Got invalid return object. Expected key destination to be present, but got {}
#144 opened 7 months ago
by
Jyotiyadav
ModelError: An error occurred (ModelError) when calling the InvokeEndpoint
#143 opened 7 months ago
by
Tecena
End of sentence (</s>) does not appear to be predicted in reasoning prompts
2
#142 opened 7 months ago
by
psneto
KeyError:'mistral"' while finetuning mistral-7B-v0.1 in aws sagemaker
2
#141 opened 7 months ago
by
Tecena
Finetuning produces noisy output
#139 opened 7 months ago
by
sriramk750
When will be v0.2 updated in Huggingface?
#138 opened 7 months ago
by
SlytherinGe
Unsupervised training of Mistral for Domain-Specific Inference
1
#135 opened 8 months ago
by
H2dddhxh
Lora fine tuning for text classification with Peft
#134 opened 8 months ago
by
farbodKMSE
Easiest way to fine tune Mistral-7B
1
#133 opened 8 months ago
by
exnrt
Keep Responding in the wrong language despite the prompt template instructing to reply in a specific language
6
#132 opened 8 months ago
by
tdecae
Finetune Mistral 7B full parameters without LORA
2
#131 opened 8 months ago
by
HuggingPanda
Very long response time
4
#130 opened 8 months ago
by
farbodKMSE
Fine Tuning for Classification
6
#129 opened 9 months ago
by
MUHAMMAD-SOHAIL-ZZU
Unable to inference beyond sliding window length
#128 opened 9 months ago
by
kreas
How to finetune this model mistralai/Mistral-7B-v0.1 and also merge the weights
5
#126 opened 9 months ago
by
yeniceriSGK
Pretrain?
3
#125 opened 9 months ago
by
limha
Mistral 7B produces different results when we hit via postman api
7
#124 opened 9 months ago
by
DivyaKanniah
Load and extract the model for language modeling
1
#123 opened 9 months ago
by
theodp
Unexpected keyword 'rope_scaling' while loading model
3
#122 opened 9 months ago
by
gandhipratik65j
Kernel crashed while loading checkpoint shards
3
#121 opened 10 months ago
by
clemennntt