Oscar Sainz's picture

26 2 5

Oscar Sainz

OSainz

·

https://osainz59.github.io/

AI & ML interests

Artificial Inteligence, Natural Language Processing, Information Extraction, Zero and Few Shot Learning.

Organizations

OSainz's activity

New activity in HiTZ/latxa-70b-v1.2 4 months ago

missing tokenizer?

#1 opened 4 months ago by

New activity in CONDA-Workshop/Data-Contamination-Database 6 months ago

GPT-3.5 HumanEval_R CodeForces2305 contamination based on https://arxiv.org/abs/2402.15938

#28 opened 6 months ago by

Add reports from Benchmarking paper "Benchmark Leakage in Large Language Models"

#27 opened 6 months ago by

Update contamination_report.csv

#26 opened 6 months ago by

Update contamination.csv

#25 opened 6 months ago by

Add data from "An Open-Source Data Contamination Report for Large Language Models"

#5 opened 7 months ago by

Add Reports Based on "Llemma: An Open Language Model For Mathematics"

#23 opened 6 months ago by

add flores contamination in xP3

#20 opened 7 months ago by

Add Aquila model series which have gsm8k test set contamination

#21 opened 6 months ago by

GPT-3.5 Spider contamination based on https://arxiv.org/pdf/2402.08100

#18 opened 7 months ago by

Should indirect data leakages be included in the Data Contamination Database?

#19 opened 7 months ago by

New activity in CONDA-Workshop/Data-Contamination-Database 7 months ago

File fixes and cleaning

#17 opened 7 months ago by

Superglue/RealNews Contamination based on "Noise-Robust De-Duplication at Scale"

#15 opened 7 months ago by

Mistral 7B Arc Easy Contamination based on "Proving Test Set Contamination in Black Box Language Models"

#14 opened 7 months ago by

Added Contamination Evidence from GPT4 Tech Report using String matching on GPT-4

#11 opened 7 months ago by

GPT-3.5Turbo HumanEval Contamination based on "Generalization or Memorization: Data Contamination and Trustworthy Evaluation for Large Language Models"

#16 opened 7 months ago by

Added Contamination Evidence on MMLU of ChatGPT/GPT4 from "Investigating data contamination in modern benchmarks for large language models"

#10 opened 7 months ago by

Added Contamination Info on Old Models: GPT3, FLAN, GLaM, PaLM, PaLM 2

#13 opened 7 months ago by

Contamination results based on "Data Contamination Quiz"

#9 opened 7 months ago by

shahriargolchin

Code contamination in HumanEval and MBPP

#12 opened 7 months ago by