nbroad (Nicholas Broad)

upvoted 2 collections 3 days ago

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 11 items • Updated 4 days ago • 287

ReLiK: Retrieve, Read and LinK

Collection

A blazing fast and lightweight Information Extraction model for Entity Linking and Relation Extraction. • 20 items • Updated Aug 8 • 18

upvoted a paper 11 days ago

jina-embeddings-v3: Multilingual Embeddings With Task LoRA

Paper • 2409.10173 • Published 14 days ago • 21

upvoted a collection 11 days ago

Llama 3.1 GPTQ, AWQ, and BNB Quants

Collection

Optimised Quants for high-throughput deployments! Compatible with Transformers, TGI & VLLM 🤗 • 9 items • Updated 4 days ago • 51

upvoted a collection 16 days ago

NIM Serverless Inference API

Collection

Models in this collection are available for inference via a serverless API powered by NVIDIA NIM. • 8 items • Updated 3 days ago • 18

upvoted a paper 18 days ago

General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Paper • 2409.01704 • Published 27 days ago • 76

upvoted a paper 24 days ago

The Future of Open Human Feedback

Paper • 2408.16961 • Published Aug 15 • 19

upvoted an article about 1 month ago

Article

The 5 Most Under-Rated Tools on Hugging Face

Aug 22

• 80

upvoted 3 articles about 2 months ago

Article

Introducing TextImage Augmentation for Document Images

Aug 6

• 29

Article

XetHub is joining Hugging Face!

Aug 8

• 77

Article

Serverless Inference with Hugging Face and NVIDIA NIMs

Jul 29

• 26

upvoted 2 collections 2 months ago

Llama 3.1

Collection

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated 4 days ago • 584

Yi-1.5 (2024/05)

Collection

10 items • Updated May 20 • 89

upvoted 2 articles 2 months ago

Article

How NuminaMath Won the 1st AIMO Progress Prize

Jul 11

• 93

Article

TGI Multi-LoRA: Deploy Once, Serve 30 Models

Jul 18

• 44

upvoted 3 articles 3 months ago

Article

Inference for PROs

Sep 22, 2023

• 40

Article

Announcing New Hugging Face and Keras NLP integration

Jul 10

• 29

Article

Experimenting with Automatic PII Detection on the Hub using Presidio

Jul 10

• 23

upvoted a paper 3 months ago

Instruction Pre-Training: Language Models are Supervised Multitask Learners

Paper • 2406.14491 • Published Jun 20 • 85

upvoted an article 3 months ago

Article

Ethics and Society Newsletter #6: Building Better AI: The Importance of Data Quality

Jun 24

• 30

upvoted 2 papers 3 months ago

Should You Mask 15% in Masked Language Modeling?

Paper • 2202.08005 • Published Feb 16, 2022 • 1

The Prompt Report: A Systematic Survey of Prompting Techniques

Paper • 2406.06608 • Published Jun 6 • 52

upvoted an article 3 months ago

Article

Making sense of this mess

Jun 7

• 14

upvoted 2 articles 4 months ago

Article

From DeepSpeed to FSDP and Back Again with Hugging Face Accelerate

Jun 13

• 41

Article

Uncensor any LLM with abliteration

Jun 13

• 332

upvoted a paper 4 months ago

Tx-LLM: A Large Language Model for Therapeutics

Paper • 2406.06316 • Published Jun 10 • 13

upvoted 6 articles 4 months ago

Article

Space secrets security update

May 31

• 50

Article

Training and Finetuning Embedding Models with Sentence Transformers v3

May 28

• 148

Article

Benchmarking Text Generation Inference

May 29

• 27

Article

AI has a problem with objectifying women

May 24

• 54

Article

Let's talk about LLM evaluation

May 23

• 106

Article

Build AI on premise with Dell Enterprise Hub

May 21

• 17

upvoted 2 articles 5 months ago

Article

License to Call: Introducing Transformers Agents 2.0

May 13

• 108

Article

2024-04-22 - Hub Incident Post Mortem

May 17

• 17

upvoted a paper 5 months ago

LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report

Paper • 2405.00732 • Published Apr 29 • 118

upvoted an article 5 months ago

Article

Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints

May 1

• 62

upvoted 2 papers 5 months ago

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22 • 250

Layer Skip: Enabling Early Exit Inference and Self-Speculative Decoding

Paper • 2404.16710 • Published Apr 25 • 57

upvoted 4 collections 7 months ago

upvoted 2 papers 7 months ago

OmniACT: A Dataset and Benchmark for Enabling Multimodal Generalist Autonomous Agents for Desktop and Web

Paper • 2402.17553 • Published Feb 27 • 21

LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens

Paper • 2402.13753 • Published Feb 21 • 111

upvoted 3 papers 8 months ago

Scaling Laws for Downstream Task Performance of Large Language Models

Paper • 2402.04177 • Published Feb 6 • 17

The Stack: 3 TB of permissively licensed source code

Paper • 2211.15533 • Published Nov 20, 2022 • 5

Spotting LLMs With Binoculars: Zero-Shot Detection of Machine-Generated Text

Paper • 2401.12070 • Published Jan 22 • 42

upvoted 2 papers 9 months ago

Pheme: Efficient and Conversational Speech Generation

Paper • 2401.02839 • Published Jan 5 • 16

Improving Text Embeddings with Large Language Models

Paper • 2401.00368 • Published Dec 31, 2023 • 79

upvoted a paper 10 months ago

Zephyr: Direct Distillation of LM Alignment

Paper • 2310.16944 • Published Oct 25, 2023 • 120

upvoted a paper 11 months ago

OBELICS: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents

Paper • 2306.16527 • Published Jun 21, 2023 • 47

upvoted 3 papers 12 months ago

In-Context Pretraining: Language Modeling Beyond Document Boundaries

Paper • 2310.10638 • Published Oct 16, 2023 • 28

RMT: Retentive Networks Meet Vision Transformers

Paper • 2309.11523 • Published Sep 20, 2023 • 33

Efficient Streaming Language Models with Attention Sinks

Paper • 2309.17453 • Published Sep 29, 2023 • 13

upvoted 6 papers about 1 year ago

Kosmos-2.5: A Multimodal Literate Model

Paper • 2309.11419 • Published Sep 20, 2023 • 50

PDFTriage: Question Answering over Long, Structured Documents

Paper • 2309.08872 • Published Sep 16, 2023 • 53

DeepSpeed-Chat: Easy, Fast and Affordable RLHF Training of ChatGPT-like Models at All Scales

Paper • 2308.01320 • Published Aug 2, 2023 • 44

Retentive Network: A Successor to Transformer for Large Language Models

Paper • 2307.08621 • Published Jul 17, 2023 • 170

Becoming self-instruct: introducing early stopping criteria for minimal instruct tuning

Paper • 2307.03692 • Published Jul 5, 2023 • 24

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Paper • 2211.05100 • Published Nov 9, 2022 • 28

Nicholas Broad PRO

Articles

Accelerating Document AI
