Migel Tissera
migtissera
AI & ML interests
PhD in Deep Learning (2013-2016). I build intelligent systems using neural networks.
Co-founder and CTO, Metaspectral.
Ethereum (ETH): 0xF9843939B3a2527Bb50B8D4bee241713081A5372
migtissera's activity
Adding Evaluation Results
#2 opened 13 days ago
by
leaderboard-pr-bot
Adding Evaluation Results
#1 opened 28 days ago
by
leaderboard-pr-bot
Adding Evaluation Results
#3 opened 28 days ago
by
leaderboard-pr-bot
[Suggestion] Explain more clearly how "Tess" models differ from the base model in the Model Card.
3
#5 opened about 1 month ago
by
AaronFeng753
Which template should I use in Ollama
2
#2 opened about 1 month ago
by
AaronFeng753
A/B Test of Base vs Fine-Tune
3
#4 opened about 1 month ago
by
alby13
Axolotl training configuration
2
#2 opened about 1 month ago
by
levguy
Benchmarks for WRN-2?
1
#1 opened about 1 month ago
by
Tonic
add 405B basemodel
3
#3 opened about 2 months ago
by
cfahlgren1
Add meta-data for the model tree
#4 opened about 2 months ago
by
multimodalart
Can I run this on my raspberry pi?
1
#2 opened about 2 months ago
by
SicariusSicariiStuff
The base model doesn't generate coherently
4
#9 opened 3 months ago
by
migtissera
Questions about how to run
2
#2 opened about 2 months ago
by
DontPlanToEnd
How is the model different from Meta's?
1
#2 opened about 2 months ago
by
CloudMarked
Purpose?
1
#1 opened about 2 months ago
by
ID0M
Is this the same model as Mistral Large 2407?
2
#1 opened about 2 months ago
by
Iommed
Typos in example conversation
1
#1 opened about 2 months ago
by
PositronicLlama
What is the FSDP value for `fsdp_transformer_layer_cls_to_wrap`?
#4 opened about 2 months ago
by
migtissera
Adding Evaluation Results
#1 opened 2 months ago
by
leaderboard-pr-bot
Adding Evaluation Results
#6 opened 3 months ago
by
leaderboard-pr-bot
Adding Evaluation Results
#1 opened 3 months ago
by
leaderboard-pr-bot
Model Eval Failed: Tess-v2.5.2-Qwen2-72B
3
#826 opened 3 months ago
by
migtissera
Update migtissera/Tess-v2.5-Phi-3-medium-128k-14B_eval_request_False_float16_Original.json
2
#23 opened 3 months ago
by
migtissera
Is this model more coherent?
1
#1 opened 3 months ago
by
UniversalLove333
Model failure: migtissera/Tess-v2.5.2-Qwen2-72B model
8
#799 opened 3 months ago
by
migtissera
Update migtissera/Llama-3-70B-Synthia-v3.5_eval_request_False_float16_Original.json
#24 opened 3 months ago
by
migtissera
Can't repro MMLU: sliding window attention implementation seems broken
3
#11 opened 3 months ago
by
dzhulgakov
Any plans for Tess 2.5.2 QLoRA?
3
#1 opened 3 months ago
by
PositiveJay
eos token in gguf
9
#1 opened 4 months ago
by
mradermacher
EOS token is getting printed
9
#1 opened 4 months ago
by
migtissera
Adding `safetensors` variant of this model
2
#5 opened 4 months ago
by
SFconvertbot
Excellent model
1
#1 opened 4 months ago
by
FiditeNemini
problem with index
2
#4 opened 4 months ago
by
prudant
Missing bos_token_id in config.json
4
#2 opened 4 months ago
by
TouchNight
Nice work! Is there a plan to open-source the datasets?
2
#1 opened 4 months ago
by
yixinsong
How Was This Created?
1
#2 opened 4 months ago
by
Kquant03
AGIEval and others
3
#19 opened 4 months ago
by
migtissera
Licence?
1
#1 opened 4 months ago
by
migtissera
No good way to identify number of activated parameters causes Mixtral evaluation failures
32
#680 opened 6 months ago
by
0-hero
Model evaluation failed after a few days
1
#778 opened 4 months ago
by
migtissera
[bot] Conversion to Parquet
#1 opened 4 months ago
by
parquet-converter
Cc-by-nc?
2
#4 opened 4 months ago
by
migtissera
good model
1
#1 opened 5 months ago
by
Utochi
Kudos, impressive model
1
#5 opened 8 months ago
by
fblgit
Training
1
#2 opened 5 months ago
by
freegheist
Any chance of releasing the LoRA or QLoRA?
1
#1 opened 6 months ago
by
jukofyork
`enhanced_instruction` contains the `response`
10
#2 opened 6 months ago
by
xzuyn
Great job Migel!!!!
1
#1 opened 6 months ago
by
dillfrescott
Please, authorize access for the base weight!
39
#5 opened 6 months ago
by
Undi95
Question (Why not ChatML?)
5
#5 opened 6 months ago
by
mrfakename
Set sliding window to null to match Mistral-7B-Instruct-v0.2
1
#4 opened 6 months ago
by
veden
Clarification: Is it Yi-34b-200k v2, or v1?
1
#1 opened 6 months ago
by
SabinStargem
what kind of dataset is Tess?
2
#1 opened 6 months ago
by
rombodawg
Trailing space after "ASSISTANT:"
1
#3 opened 6 months ago
by
algorithm
Instruct-finetuning dataset
1
#6 opened 6 months ago
by
Andriy
Dataset
2
#2 opened 6 months ago
by
mrfakename