Article Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models 7 days ago • 112
Article XLSCOUT Unveils ParaEmbed 2.0: a Powerful Embedding Model Tailored for Patents and IP with Expert Support from Hugging Face 6 days ago • 8
Article Build Agentic Workflow using OpenAGI and HuggingFace models By lucifertrj • 5 days ago • 5
Hallucination-Free? Assessing the Reliability of Leading AI Legal Research Tools Paper • 2405.20362 • Published May 30 • 2
DataComp: In search of the next generation of multimodal datasets Paper • 2304.14108 • Published Apr 27, 2023 • 2
Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation Paper • 2406.06525 • Published 20 days ago • 60
MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark Paper • 2406.01574 • Published 27 days ago • 42
Article Extracting Concepts from LLMs: Anthropic’s recent discoveries 📖 By m-ric • 11 days ago • 25
Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training Paper • 2401.05566 • Published Jan 10 • 24
Article How Sempre Health is leveraging the Expert Acceleration Program to accelerate their ML roadmap May 19, 2022 • 1
Metacognitive Prompting Improves Understanding in Large Language Models Paper • 2308.05342 • Published Aug 10, 2023 • 2
Large Language Models Struggle to Learn Long-Tail Knowledge Paper • 2211.08411 • Published Nov 15, 2022 • 3
Enhancing Retrieval-Augmented Large Language Models with Iterative Retrieval-Generation Synergy Paper • 2305.15294 • Published May 24, 2023 • 1
No Language Left Behind: Scaling Human-Centered Machine Translation Paper • 2207.04672 • Published Jul 11, 2022 • 1
Article Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints May 1 • 58
HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question Answering Paper • 1809.09600 • Published Sep 25, 2018 • 2
Fast Inference from Transformers via Speculative Decoding Paper • 2211.17192 • Published Nov 30, 2022 • 3
Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads Paper • 2401.10774 • Published Jan 19 • 50
Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding Paper • 2402.12374 • Published Feb 19 • 3
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone Paper • 2404.14219 • Published Apr 22 • 240
AutoCrawler: A Progressive Understanding Web Agent for Web Crawler Generation Paper • 2404.12753 • Published Apr 19 • 39
Training-Free Long-Context Scaling of Large Language Models Paper • 2402.17463 • Published Feb 27 • 19
Article Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent Apr 22 • 75
Meta Llama 3 Collection This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Apr 18 • 614
Hydragen: High-Throughput LLM Inference with Shared Prefixes Paper • 2402.05099 • Published Feb 7 • 17
Article Introducing Idefics2: A Powerful 8B Vision-Language Model for the community Apr 15 • 144
LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale Paper • 2208.07339 • Published Aug 15, 2022 • 4
GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers Paper • 2210.17323 • Published Oct 31, 2022 • 6