MichaelBarryUK (Michael Barry) – Community Activity

commented a paper about 1 month ago

Automated Design of Agentic Systems

Paper • 2408.08435 • Published Aug 15 • 38 •

3

commented 3 papers about 2 months ago

LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs

Paper • 2408.07055 • Published Aug 13 • 65 •

4

Transformer Explainer: Interactive Learning of Text-Generative Models

Paper • 2408.04619 • Published Aug 8 • 152 •

17

Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models

Paper • 2408.02085 • Published Aug 4 • 17 •

4

commented a paper 2 months ago

MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities

Paper • 2408.00765 • Published Aug 1 • 12 •

9

New activity in black-forest-labs/FLUX.1-dev 2 months ago

Commercial?

3

#7 opened 2 months ago by

MichaelBarryUK

commented a paper 2 months ago

TinyLlama: An Open-Source Small Language Model

Paper • 2401.02385 • Published Jan 4 • 89 •

14

commented 15 papers 3 months ago

Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients

Paper • 2407.08296 • Published Jul 11 • 31 •

3

Autoregressive Speech Synthesis without Vector Quantization

Paper • 2407.08551 • Published Jul 11 • 13 •

4

Inference Performance Optimization for Large Language Models on CPUs

Paper • 2407.07304 • Published Jul 10 • 52 •

7

ProgressGym: Alignment with a Millennium of Moral Progress

Paper • 2406.20087 • Published Jun 28 • 3 •

2

Wavelets Are All You Need for Autoregressive Image Generation

Paper • 2406.19997 • Published Jun 28 • 28 •

5

Chain-of-Knowledge: Integrating Knowledge Reasoning into Large Language Models by Learning from Knowledge Graphs

Paper • 2407.00653 • Published Jun 30 • 11 •

2

E2 TTS: Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS

Paper • 2406.18009 • Published Jun 26 • 18 •

3

RegMix: Data Mixture as Regression for Language Model Pre-training

Paper • 2407.01492 • Published Jul 1 • 33 •

7

MMEvalPro: Calibrating Multimodal Benchmarks Towards Trustworthy and Efficient Evaluation

Paper • 2407.00468 • Published Jun 29 • 35 •

2

Simulating Classroom Education with LLM-Empowered Agents

Paper • 2406.19226 • Published Jun 27 • 28 •

9

MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression

Paper • 2406.14909 • Published Jun 21 • 13 •

4

Long Code Arena: a Set of Benchmarks for Long-Context Code Models

Paper • 2406.11612 • Published Jun 17 • 21 •

3

commented 5 papers 4 months ago

TroL: Traversal of Layers for Large Language and Vision Models

Paper • 2406.12246 • Published Jun 18 • 34 •

2

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

Paper • 2406.11931 • Published Jun 17 • 56 •

3

Breaking the Attention Bottleneck

Paper • 2406.10906 • Published Jun 16 • 4 •

4

Designing a Dashboard for Transparency and Control of Conversational AI

Paper • 2406.07882 • Published Jun 12 • 9 •

4

Are We Done with MMLU?

Paper • 2406.04127 • Published Jun 6 • 37 •

1

commented 2 papers 5 months ago

LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report

Paper • 2405.00732 • Published Apr 29 • 118 •

9

Octopus v4: Graph of language models

Paper • 2404.19296 • Published Apr 30 • 118 •

19

New activity in lmstudio-community/Meta-Llama-3-8B-Instruct-GGUF 5 months ago

I have a few questions for the quantized model quality.

8

#5 opened 5 months ago by

HannahKim

commented 4 papers 5 months ago

OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework

Paper • 2404.14619 • Published Apr 22 • 124 •

14

FlashSpeech: Efficient Zero-Shot Speech Synthesis

Paper • 2404.14700 • Published Apr 23 • 29 •

4

The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions

Paper • 2404.13208 • Published Apr 19 • 38 •

9

Music Consistency Models

Paper • 2404.13358 • Published Apr 20 • 12 •

3

commented 9 papers 6 months ago

Dynamic Typography: Bringing Words to Life

Paper • 2404.11614 • Published Apr 17 • 41 •

4

LLoCO: Learning Long Contexts Offline

Paper • 2404.07979 • Published Apr 11 • 19 •

2

JetMoE: Reaching Llama2 Performance with 0.1M Dollars

Paper • 2404.07413 • Published Apr 11 • 35 •

4

Rho-1: Not All Tokens Are What You Need

Paper • 2404.07965 • Published Apr 11 • 83 •

14

MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies

Paper • 2404.06395 • Published Apr 9 • 21 •

1

AutoWebGLM: Bootstrap And Reinforce A Large Language Model-based Web Navigating Agent

Paper • 2404.03648 • Published Apr 4 • 23 •

3

PointInfinity: Resolution-Invariant Point Diffusion Models

Paper • 2404.03566 • Published Apr 4 • 13 •

1

ReFT: Representation Finetuning for Language Models

Paper • 2404.03592 • Published Apr 4 • 88 •

17

Octopus v2: On-device language model for super agent

Paper • 2404.01744 • Published Apr 2 • 56 •

8

commented 3 papers 7 months ago

Learning to Decode Collaboratively with Multiple Language Models

Paper • 2403.03870 • Published Mar 6 • 17 •

6

Orca-Math: Unlocking the potential of SLMs in Grade School Math

Paper • 2402.14830 • Published Feb 16 • 24 •

3

Neural Network Diffusion

Paper • 2402.13144 • Published Feb 20 • 94 •

10

commented 14 papers 8 months ago

OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset

Paper • 2402.10176 • Published Feb 15 • 34 •

4

In Search of Needles in a 10M Haystack: Recurrent Memory Finds What LLMs Miss

Paper • 2402.10790 • Published Feb 16 • 40 •

8

BitDelta: Your Fine-Tune May Only Be Worth One Bit

Paper • 2402.10193 • Published Feb 15 • 17 •

5

How to Train Data-Efficient LLMs

Paper • 2402.09668 • Published Feb 15 • 38 •

4

Premise Order Matters in Reasoning with Large Language Models

Paper • 2402.08939 • Published Feb 14 • 24 •

3

BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data

Paper • 2402.08093 • Published Feb 12 • 54 •

9

More Agents Is All You Need

Paper • 2402.05120 • Published Feb 3 • 51 •

5

Grandmaster-Level Chess Without Search

Paper • 2402.04494 • Published Feb 7 • 65 •

8

Direct Language Model Alignment from Online AI Feedback

Paper • 2402.04792 • Published Feb 7 • 27 •

3

BiLLM: Pushing the Limit of Post-Training Quantization for LLMs

Paper • 2402.04291 • Published Feb 6 • 48 •

3

Multi-line AI-assisted Code Authoring

Paper • 2402.04141 • Published Feb 6 • 9 •

2

ReGAL: Refactoring Programs to Discover Generalizable Abstractions

Paper • 2401.16467 • Published Jan 29 • 8 •

2

Transfer Learning for Text Diffusion Models

Paper • 2401.17181 • Published Jan 30 • 14 •

3

Proactive Detection of Voice Cloning with Localized Watermarking

Paper • 2401.17264 • Published Jan 30 • 16 •

4

Michael Barry

AI & ML interests

Organizations

MichaelBarryUK's activity

Commercial?

I have a few questions for the quantized model quality.