Aryanne's picture

Aryanne

Aryanne

·

AI & ML interests

LLMs, AI, GPU/CPU poor, any help is welcome 0x190ac445974a989a87dd223f212a76ca0090c804

Organizations

Aryanne's activity

upvoted a paper about 1 month ago

Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis

Paper • 2410.08261 • Published Oct 10 • 49

upvoted a paper about 2 months ago

Were RNNs All We Needed?

Paper • 2410.01201 • Published Oct 2 • 47

upvoted a collection about 2 months ago

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated 26 days ago • 472

upvoted a collection 2 months ago

Llama3-8B-1.58

A trio of powerful models: fine-tuned from Llama3-8b-Instruct, with BitNet architecture! • 3 items • Updated Sep 14 • 12

upvoted an article 2 months ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Sep 18

• 201

upvoted 2 articles 3 months ago

Article

Introduction to ggml

Aug 13

• 113

Article

Probabilistic Fractal Activation Function (P-FAF) and Its Advantages Over Traditional Word Vectorization

By

•

Feb 8

• 5

upvoted a collection 5 months ago

MatMulfree LM

Pre-trined models for Matmulfree LM. • 4 items • Updated Jun 10 • 25

upvoted 3 papers 8 months ago

Text2Avatar: Text to 3D Human Avatar Generation with Codebook-Driven Body Controllable Attribute

Paper • 2401.00711 • Published Jan 1 • 2

Arcee's MergeKit: A Toolkit for Merging Large Language Models

Paper • 2403.13257 • Published Mar 20 • 20

Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking

Paper • 2403.09629 • Published Mar 14 • 72

upvoted 2 papers 9 months ago

ShortGPT: Layers in Large Language Models are More Redundant Than You Expect

Paper • 2403.03853 • Published Mar 6 • 62

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27 • 602

upvoted 2 papers 10 months ago

SliceGPT: Compress Large Language Models by Deleting Rows and Columns

Paper • 2401.15024 • Published Jan 26 • 69

Universal Neurons in GPT2 Language Models

Paper • 2401.12181 • Published Jan 22 • 5

upvoted a collection 10 months ago

Testing Might be broken

testing only models, • 10 items • Updated Feb 3 • 1

upvoted a paper 11 months ago

LLM in a flash: Efficient Large Language Model Inference with Limited Memory

Paper • 2312.11514 • Published Dec 12, 2023 • 258

upvoted a collection 12 months ago

Merged Models

Using mergekit • 10 items • Updated Mar 1 • 2

upvoted 2 collections about 1 year ago

StableLM (.gguf)

Models based on StableLM Models by Stability AI • 19 items • Updated Nov 27, 2023 • 3

Honorable mentions

Some models I've made and I liked but isn't part of a serie. • 10 items • Updated Feb 4 • 6