cool-papers - a dhuynh95 Collection

Models
Datasets
Spaces
Posts
Docs
Pricing
Log In
Sign Up

dhuynh95 's Collections

cool-papers

updated 9 days ago

Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset

Paper • 2403.09029 • Published Mar 14 • 54
LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression

Paper • 2403.12968 • Published Mar 19 • 24
RAFT: Adapting Language Model to Domain Specific RAG

Paper • 2403.10131 • Published Mar 15 • 67
Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking

Paper • 2403.09629 • Published Mar 14 • 72
Simple and Scalable Strategies to Continually Pre-train Large Language Models

Paper • 2403.08763 • Published Mar 13 • 49
Language models scale reliably with over-training and on downstream tasks

Paper • 2403.08540 • Published Mar 13 • 14
Algorithmic progress in language models

Paper • 2403.05812 • Published Mar 9 • 18
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Paper • 2403.05530 • Published Mar 8 • 60
TnT-LLM: Text Mining at Scale with Large Language Models

Paper • 2403.12173 • Published Mar 18 • 19
Larimar: Large Language Models with Episodic Memory Control

Paper • 2403.11901 • Published Mar 18 • 32
Reverse Training to Nurse the Reversal Curse

Paper • 2403.13799 • Published Mar 20 • 13
When Do We Not Need Larger Vision Models?

Paper • 2403.13043 • Published Mar 19 • 25
MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?

Paper • 2403.14624 • Published Mar 21 • 51
FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions

Paper • 2403.15246 • Published Mar 22 • 8
Can large language models explore in-context?

Paper • 2403.15371 • Published Mar 22 • 32
The Unreasonable Ineffectiveness of the Deeper Layers

Paper • 2403.17887 • Published Mar 26 • 78
Long-form factuality in large language models

Paper • 2403.18802 • Published Mar 27 • 24
BioMedLM: A 2.7B Parameter Language Model Trained On Biomedical Text

Paper • 2403.18421 • Published Mar 27 • 22
Octopus v2: On-device language model for super agent

Paper • 2404.01744 • Published Apr 2 • 57
Poro 34B and the Blessing of Multilinguality

Paper • 2404.01856 • Published Apr 2 • 13
Long-context LLMs Struggle with Long In-context Learning

Paper • 2404.02060 • Published Apr 2 • 35
Training LLMs over Neurally Compressed Text

Paper • 2404.03626 • Published Apr 4 • 21
CodeEditorBench: Evaluating Code Editing Capability of Large Language Models

Paper • 2404.03543 • Published Apr 4 • 15
Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models

Paper • 2404.02575 • Published Apr 3 • 47
Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing

Paper • 2404.12253 • Published Apr 18 • 53
Compression Represents Intelligence Linearly

Paper • 2404.09937 • Published Apr 15 • 27
Flamingo: a Visual Language Model for Few-Shot Learning

Paper • 2204.14198 • Published Apr 29, 2022 • 14
Executable Code Actions Elicit Better LLM Agents

Paper • 2402.01030 • Published Feb 1 • 27
Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models

Paper • 2406.04271 • Published Jun 6 • 28
To Believe or Not to Believe Your LLM

Paper • 2406.02543 • Published Jun 4 • 32
Husky: A Unified, Open-Source Language Agent for Multi-Step Reasoning

Paper • 2406.06469 • Published Jun 10 • 24
HuatuoGPT-Vision, Towards Injecting Medical Visual Knowledge into Multimodal LLMs at Scale

Paper • 2406.19280 • Published Jun 27 • 60
Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems

Paper • 2407.01370 • Published Jul 1 • 85
Planetarium: A Rigorous Benchmark for Translating Text to Structured Planning Languages

Paper • 2407.03321 • Published Jul 3 • 15
Training Task Experts through Retrieval Based Distillation

Paper • 2407.05463 • Published Jul 7 • 7
InverseCoder: Unleashing the Power of Instruction-Tuned Code LLMs with Inverse-Instruct

Paper • 2407.05700 • Published Jul 8 • 9
AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?

Paper • 2407.15711 • Published Jul 22 • 9
MindSearch: Mimicking Human Minds Elicits Deep AI Searcher

Paper • 2407.20183 • Published Jul 29 • 38
OmniParser for Pure Vision Based GUI Agent

Paper • 2408.00203 • Published Aug 1 • 23
Amuro & Char: Analyzing the Relationship between Pre-Training and Fine-Tuning of Large Language Models

Paper • 2408.06663 • Published Aug 13 • 15
To Code, or Not To Code? Exploring Impact of Code in Pre-training

Paper • 2408.10914 • Published Aug 20 • 41
Building and better understanding vision-language models: insights and future directions

Paper • 2408.12637 • Published Aug 22 • 118
MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?

Paper • 2408.13257 • Published Aug 23 • 25
General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Paper • 2409.01704 • Published Sep 3 • 81
ContextCite: Attributing Model Generation to Context

Paper • 2409.00729 • Published Sep 1 • 13
Attention Heads of Large Language Models: A Survey

Paper • 2409.03752 • Published Sep 5 • 87
A Controlled Study on Long Context Extension and Generalization in LLMs

Paper • 2409.12181 • Published Sep 18 • 43
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning

Paper • 2409.12183 • Published Sep 18 • 36
Attention Prompting on Image for Large Vision-Language Models

Paper • 2409.17143 • Published Sep 25 • 7
Can Models Learn Skill Composition from Examples?

Paper • 2409.19808 • Published Sep 29 • 8
Law of the Weakest Link: Cross Capabilities of Large Language Models

Paper • 2409.19951 • Published Sep 30 • 53
LLaVA-Critic: Learning to Evaluate Multimodal Models

Paper • 2410.02712 • Published Oct 3 • 34
From Medprompt to o1: Exploration of Run-Time Strategies for Medical Challenge Problems and Beyond

Paper • 2411.03590 • Published 12 days ago • 9

Collection guide
Browse collections

Company

© Hugging Face

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs