Mike Cooper-Stachowsky
mstachow
AI & ML interests
None yet
Recent Activity
New activity
about 1 month ago
microsoft/Phi-3.5-MoE-instruct
mstachow's activity
Model consistently gets into a loop to repeat itself if there is too much in the context window
2
#48 opened about 1 month ago
by
mstachow
Multiple GPU error
4
#9 opened 3 months ago
by
Toukesu
What counts as "text"?
2
#1 opened 3 months ago
by
mstachow
What is the significance of the parameters input_size and max_num?
5
#6 opened 4 months ago
by
mstachow
Model is extremely sensitive to the word "ignore"
4
#14 opened 4 months ago
by
mstachow
What is the context length of this model?
1
#2 opened 4 months ago
by
mstachow
Model keeps adding new input, even when using pipeline?
2
#1 opened 6 months ago
by
mstachow
It is not making use of my GPU.
1
#1 opened about 1 year ago
by
JeisonJimenez
Deeply confused about how this is running on my system - is it GPU or CPU?
#1 opened about 1 year ago
by
mstachow
Special tokens to control pausing?
3
#3 opened about 1 year ago
by
mstachow
What is the maximum token length of the model?
1
#2 opened about 1 year ago
by
mstachow
Possibility to use on CUDA?
10
#12 opened over 1 year ago
by
mstachow
Out of memory error, but both system and GPU have plenty of memory
5
#37 opened about 1 year ago
by
mstachow
A note on preventing model hallucinations
#18 opened over 1 year ago
by
mstachow
Model hallucinates almost every run
3
#15 opened over 1 year ago
by
mstachow
Working Hacky Code
2
#16 opened over 1 year ago
by
JHenzi
Model sometimes repeats itself and glitches during speech.
1
#13 opened over 1 year ago
by
mstachow
What does "recommended for better performance" mean?
2
#1 opened over 1 year ago
by
mstachow