Victor Gallego's picture

Victor Gallego

vicgalle

·

https://github.com/vicgalle

AI & ML interests

Preference fine-tuning, alignment & synthetic data. Building LLMs in general!

Recent Activity

liked a model 7 days ago

Qwen/Qwen2.5-Coder-32B-Instruct

liked a dataset 12 days ago

lingjie23/TexAes

updated a collection 25 days ago

Configurable Safety Tuning ⚙️

Organizations

vicgalle's activity

upvoted an article 27 days ago

Article

VLM Art Analysis

By

•

Oct 4

• 11

upvoted a collection about 1 month ago

steiner-preview

Reasoning models trained on synthetic data using reinforcement learning. • 3 items • Updated Oct 20 • 23

upvoted 2 papers about 1 month ago

Do LLMs Have Political Correctness? Analyzing Ethical Biases and Jailbreak Vulnerabilities in AI Systems

Paper • 2410.13334 • Published Oct 17 • 12

Differential Transformer

Paper • 2410.05258 • Published Oct 7 • 166

upvoted a collection about 2 months ago

Llama 3.2 Re-upload

10 items • Updated Sep 25 • 11

upvoted 2 papers 2 months ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19 • 135

Hermes 3 Technical Report

Paper • 2408.11857 • Published Aug 15 • 38

upvoted an article 3 months ago

Article

Tensor Parallelism

By

•

Aug 20

• 10

upvoted a collection 3 months ago

Hermes 3

The Hermes 3 Series of Models • 8 items • Updated Aug 23 • 91

upvoted a paper 4 months ago

WalledEval: A Comprehensive Safety Evaluation Toolkit for Large Language Models

Paper • 2408.03837 • Published Aug 7 • 17

upvoted a collection 4 months ago

Llama 3.1

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Sep 25 • 622

upvoted 3 articles 4 months ago

Article

Announcing Finance Commons and the Bad Data Toolbox: Pioneering Open Data and Advanced Document Processing

By

•

Jul 19

• 17

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16

• 265

Article

The Rise of Agentic Data Generation

By

•

Jul 15

• 78

upvoted a paper 4 months ago

BM25S: Orders of magnitude faster lexical search via eager sparse scoring

Paper • 2407.03618 • Published Jul 4 • 11

upvoted 2 papers 5 months ago

Scaling Synthetic Data Creation with 1,000,000,000 Personas

Paper • 2406.20094 • Published Jun 28 • 95

Symbolic Learning Enables Self-Evolving Agents

Paper • 2406.18532 • Published Jun 26 • 11

upvoted a collection 5 months ago

Probably DPO datasets

A collection of datasets that probably support DPO • 146 items • Updated Jun 26 • 12

upvoted 2 papers 5 months ago

TinyStyler: Efficient Few-Shot Text Style Transfer with Authorship Embeddings

Paper • 2406.15586 • Published Jun 21 • 2

Model Merging and Safety Alignment: One Bad Model Spoils the Bunch

Paper • 2406.14563 • Published Jun 20 • 29