Victor Jotham Ashioya

ashioyajotham

https://linktr.ee/ashioyajotham

AI & ML interests

NLP, AI Safety (red-teaming is my go-to), alignment, and hallucination in LLMs.

Recent Activity

liked a Space about 1 month ago

Qwen/Qwen2.5-Math-Demo

Organizations

None yet

ashioyajotham's activity

upvoted a paper 3 months ago

Sapiens: Foundation for Human Vision Models

Paper • 2408.12569 • Published Aug 22 • 89

upvoted 3 papers 6 months ago

Automatic Data Curation for Self-Supervised Learning: A Clustering-Based Approach

Paper • 2405.15613 • Published May 24 • 13

An Introduction to Vision-Language Modeling

Paper • 2405.17247 • Published May 27 • 85

Beyond Scaling Laws: Understanding Transformer Performance with Associative Memory

Paper • 2405.08707 • Published May 14 • 27

upvoted a paper 8 months ago

LLM Agent Operating System

Paper • 2403.16971 • Published Mar 25 • 65

upvoted 11 papers 9 months ago

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

Paper • 2403.09611 • Published Mar 14 • 124

Algorithmic progress in language models

Paper • 2403.05812 • Published Mar 9 • 18

Stealing Part of a Production Language Model

Paper • 2403.06634 • Published Mar 11 • 90

Common 7B Language Models Already Possess Strong Math Capabilities

Paper • 2403.04706 • Published Mar 7 • 16

How Far Are We from Intelligent Visual Deductive Reasoning?

Paper • 2403.04732 • Published Mar 7 • 18

SaulLM-7B: A pioneering Large Language Model for Law

Paper • 2403.03883 • Published Mar 6 • 75

Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models

Paper • 2402.17177 • Published Feb 27 • 88

FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models

Paper • 2402.10986 • Published Feb 16 • 77

Reformatted Alignment

Paper • 2402.12219 • Published Feb 19 • 16

RLVF: Learning from Verbal Feedback without Overgeneralization

Paper • 2402.10893 • Published Feb 16 • 10

Chain-of-Thought Reasoning Without Prompting

Paper • 2402.10200 • Published Feb 15 • 101

upvoted 4 papers 10 months ago

Scaling Laws for Fine-Grained Mixture of Experts

Paper • 2402.07871 • Published Feb 12 • 11

Policy Improvement using Language Feedback Models

Paper • 2402.07876 • Published Feb 12 • 5

DeAL: Decoding-time Alignment for Large Language Models

Paper • 2402.06147 • Published Feb 5 • 7

Direct Language Model Alignment from Online AI Feedback

Paper • 2402.04792 • Published Feb 7 • 29