hiyata (Alan)

upvoted 5 papers 22 days ago

AutoDetect: Towards a Unified Framework for Automated Weakness Detection in Large Language Models

Paper • 2406.16714 • Published 23 days ago • 10

upvoted an article 28 days ago

Article

Introduction to State Space Models (SSM)

By

•

Jun 11

• 52

upvoted 21 papers about 1 month ago

KAN: Kolmogorov-Arnold Networks

Paper • 2404.19756 • Published Apr 30 • 102

StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation

Paper • 2405.01434 • Published May 2 • 50

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Paper • 2405.01535 • Published May 2 • 108

What matters when building vision-language models?

Paper • 2405.02246 • Published May 3 • 92

Your Transformer is Secretly Linear

Paper • 2405.12250 • Published May 19 • 145

DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data

Paper • 2405.14333 • Published May 23 • 29

Not All Language Model Features Are Linear

Paper • 2405.14860 • Published May 23 • 39

Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models

Paper • 2405.15574 • Published May 24 • 52

An Introduction to Vision-Language Modeling

Paper • 2405.17247 • Published May 27 • 78

Phased Consistency Model

Paper • 2405.18407 • Published May 28 • 44

Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality

Paper • 2405.21060 • Published May 31 • 61

Block Transformer: Global-to-Local Language Modeling for Fast Inference

Paper • 2406.02657 • Published Jun 4 • 36

Tx-LLM: A Large Language Model for Therapeutics

Paper • 2406.06316 • Published Jun 10 • 13

Towards a Personal Health Large Language Model

Paper • 2406.06474 • Published Jun 10 • 14

Husky: A Unified, Open-Source Language Agent for Multi-Step Reasoning

Paper • 2406.06469 • Published Jun 10 • 22

Improve Mathematical Reasoning in Language Models by Automated Process Supervision

Paper • 2406.06592 • Published Jun 5 • 17

Hibou: A Family of Foundational Vision Transformers for Pathology

Paper • 2406.05074 • Published Jun 7 • 6

Chimera: Effectively Modeling Multivariate Time Series with 2-Dimensional State Space Models

Paper • 2406.04320 • Published Jun 6 • 7

Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing

Paper • 2406.08464 • Published Jun 12 • 48

Depth Anything V2

Paper • 2406.09414 • Published Jun 13 • 88

Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling

Paper • 2406.07522 • Published Jun 11 • 35

upvoted 2 papers 8 months ago

FusionFrames: Efficient Architectural Aspects for Text-to-Video Generation Pipeline

Paper • 2311.13073 • Published Nov 22, 2023 • 53

GAIA: a benchmark for General AI Assistants

Paper • 2311.12983 • Published Nov 21, 2023 • 176

upvoted a paper 9 months ago

BitNet: Scaling 1-bit Transformers for Large Language Models

Paper • 2310.11453 • Published Oct 17, 2023 • 95

Alan

AI & ML interests

Organizations

hiyata's activity

Evaluating D-MERIT of Partial-annotation on Information Retrieval

DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation

Confidence Regulation Neurons in Language Models

ClotheDreamer: Text-Guided Garment Generation with 3D Gaussians

AutoDetect: Towards a Unified Framework for Automated Weakness Detection in Large Language Models

Introduction to State Space Models (SSM)

KAN: Kolmogorov-Arnold Networks

StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

What matters when building vision-language models?

Your Transformer is Secretly Linear

DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data

Not All Language Model Features Are Linear

Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models

An Introduction to Vision-Language Modeling

Phased Consistency Model

Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality

Block Transformer: Global-to-Local Language Modeling for Fast Inference

Tx-LLM: A Large Language Model for Therapeutics

Towards a Personal Health Large Language Model

Husky: A Unified, Open-Source Language Agent for Multi-Step Reasoning

Improve Mathematical Reasoning in Language Models by Automated Process Supervision

Hibou: A Family of Foundational Vision Transformers for Pathology

Chimera: Effectively Modeling Multivariate Time Series with 2-Dimensional State Space Models

Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing

Depth Anything V2

Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling

FusionFrames: Efficient Architectural Aspects for Text-to-Video Generation Pipeline

GAIA: a benchmark for General AI Assistants

BitNet: Scaling 1-bit Transformers for Large Language Models