Santiago Viquez
santiviquez
AI & ML interests
ML @ NannyML. A bit of everything. NLP, RL, and, of course, tabular. In the GenAI era, how can you not love tabular data? Educational content and OSS.
Articles
Organizations
Posts
21
Post
1027
They: you need ground truth to measure performance!
NannyML: hold my beer...
Post
936
Just published a new article
https://huggingface.co/blog/santiviquez/data-drift-estimate-model-performance
Collections
1
Collection of LLM hallucination and evaluation papers that I've been exploring and implementing. Some of them have my comments and annotated doodles.
- Looking for a Needle in a Haystack: A Comprehensive Study of Hallucinations in Neural Machine Translation
  Paper • 2208.05309 • Published • 1
- LLM-Eval: Unified Multi-Dimensional Automatic Evaluation for Open-Domain Conversations with Large Language Models
  Paper • 2305.13711 • Published • 2
- Semantic Uncertainty: Linguistic Invariances for Uncertainty Estimation in Natural Language Generation
  Paper • 2302.09664 • Published • 2
- BARTScore: Evaluating Generated Text as Text Generation
  Paper • 2106.11520 • Published • 1
Models
19
santiviquez/flan-t5-small-ppo
Reinforcement Learning • Updated • 2
santiviquez/reward_modeling_anthropic_hh
Text Classification • Updated • 5
santiviquez/quora-qa-flan-t5-small
Text2Text Generation • Updated • 21
santiviquez/t5-small-finetuned-samsum-en
Summarization • Updated • 4
santiviquez/bart-base-finetuned-samsum-en
Summarization • Updated • 2
santiviquez/amazon-reviews-sentiment-bert-base-uncased-6000-samples
Updated
santiviquez/amazon-reviews-sentiment-distilbert-base-uncased-6000-samples
Text Classification • Updated • 4
santiviquez/amazon-reviews-finetuning-distilbert-base-uncased
Text Classification • Updated • 9
santiviquez/amazon-reviews-finetuning-distilbert-base-uncased_books
Text Classification • Updated • 4
santiviquez/amazon-reviews-finetuning-bert-base-sentiment
Text Classification • Updated • 5