198 11

Migel Tissera PRO

migtissera

https://discord.gg/MxMaEp79Wr

migtissera

AI & ML interests

PhD in Deep Learning (2013-2016). I build intelligent systems using neural networks. Co-founder and CTO, Metaspectral. | Ethereum (ETH): 0xF9843939B3a2527Bb50B8D4bee241713081A5372

Recent Activity

updated a model 15 days ago

migtissera/Tess-R1-Limerick-Llama-3.1-70B

updated a model 16 days ago

migtissera/Tess-R1-Limerick-Llama-3.1-70B

updated a model 16 days ago

migtissera/Tess-R1-Limerick-Llama-3.1-70B

Organizations

migtissera's activity

New activity in WhiteRabbitNeo/Llama-3.1-WhiteRabbitNeo-2-70B about 2 months ago

Prompt format

#3 opened about 2 months ago by

AIGUYCONTENT

New activity in migtissera/Trinity-2-Codestral-22B 2 months ago

Adding Evaluation Results

#2 opened 2 months ago by

leaderboard-pr-bot

New activity in migtissera/Trinity-2-Codestral-22B-v0.2 3 months ago

Adding Evaluation Results

#1 opened 3 months ago by

leaderboard-pr-bot

New activity in migtissera/Tess-3-Mistral-Nemo-12B 3 months ago

Adding Evaluation Results

#3 opened 3 months ago by

leaderboard-pr-bot

New activity in migtissera/Tess-3-Llama-3.1-405B 3 months ago

[Suggestion] Explain more clearly how "Tess" models differ from the base model in the Model Card.

#5 opened 3 months ago by

AaronFeng753

New activity in migtissera/Tess-3-Mistral-Nemo-12B 3 months ago

Which template shoud I use in Ollama

#2 opened 3 months ago by

AaronFeng753

New activity in migtissera/Tess-3-Llama-3.1-70B 3 months ago

A/B Test of Base vs Fine-Tune

#4 opened 3 months ago by

alby13

New activity in migtissera/Tess-v2.5-Phi-3-medium-128k-14B 3 months ago

Axolotl training configuration

#2 opened 3 months ago by

levguy

New activity in WhiteRabbitNeo/Llama-3.1-WhiteRabbitNeo-2-8B 3 months ago

Benchmarks for WRN-2?

#1 opened 3 months ago by

Tonic

New activity in migtissera/Tess-3-Llama-3.1-405B 3 months ago

add 405B basemodel

#3 opened 3 months ago by

cfahlgren1

Add meta-data for the model tree

#4 opened 3 months ago by

multimodalart

New activity in migtissera/Tess-3-Mistral-Nemo-12B 3 months ago

Kudos

#1 opened 3 months ago by

anxcat

New activity in migtissera/Tess-3-Llama-3.1-405B 3 months ago

Can I run this on my raspberry pi?

#2 opened 3 months ago by

SicariusSicariiStuff

New activity in google/gemma-2-27b 3 months ago

The base model doesn't generate coherently

#9 opened 5 months ago by

migtissera

New activity in migtissera/Tess-3-Mistral-Large-2-123B 3 months ago

Questions about how to run

#2 opened 3 months ago by

DontPlanToEnd

New activity in migtissera/Tess-3-Llama-3.1-70B 3 months ago

How is the model different from Meta's?

#2 opened 3 months ago by

CloudMarked

New activity in migtissera/Tess-3-Llama-3.1-405B 3 months ago

Purpose?

#1 opened 3 months ago by

ID0M

New activity in migtissera/Tess-3-Mistral-Large-2-123B 3 months ago

Is this the same model as Mistral Large 2407?

#1 opened 4 months ago by

Iommed

New activity in migtissera/Tess-3-Llama-3.1-70B 4 months ago

Typos in example conversation

#1 opened 4 months ago by

PositronicLlama

New activity in deepseek-ai/DeepSeek-V2-Chat-0628 4 months ago

What is the FSDP value for `fsdp_transformer_layer_cls_to_wrap`?

#4 opened 4 months ago by

migtissera