PromptTTS 2: Describing and Generating Voices with Text Prompt Paper • 2309.02285 • Published Sep 5, 2023 • 11
Retentive Network: A Successor to Transformer for Large Language Models Paper • 2307.08621 • Published Jul 17, 2023 • 170
In-context Autoencoder for Context Compression in a Large Language Model Paper • 2307.06945 • Published Jul 13, 2023 • 27
PolyLM: An Open Source Polyglot Large Language Model Paper • 2307.06018 • Published Jul 12, 2023 • 25
Empowering Cross-lingual Behavioral Testing of NLP Models with Typological Features Paper • 2307.05454 • Published Jul 11, 2023 • 6
Collaborative Score Distillation for Consistent Visual Synthesis Paper • 2307.04787 • Published Jul 4, 2023 • 28
Sketch-A-Shape: Zero-Shot Sketch-to-3D Shape Generation Paper • 2307.03869 • Published Jul 8, 2023 • 22
Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong General Audio Event Taggers Paper • 2307.03183 • Published Jul 6, 2023 • 10
Lost in the Middle: How Language Models Use Long Contexts Paper • 2307.03172 • Published Jul 6, 2023 • 35
SkipDecode: Autoregressive Skip Decoding with Batching and Caching for Efficient LLM Inference Paper • 2307.02628 • Published Jul 5, 2023 • 10
Training Models to Generate, Recognize, and Reframe Unhelpful Thoughts Paper • 2307.02768 • Published Jul 6, 2023 • 14
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding Paper • 2307.02499 • Published Jul 4, 2023 • 15
Focused Transformer: Contrastive Training for Context Scaling Paper • 2307.03170 • Published Jul 6, 2023 • 11
What Matters in Training a GPT4-Style Language Model with Multimodal Inputs? Paper • 2307.02469 • Published Jul 5, 2023 • 12
Open-Source Large Language Models Outperform Crowd Workers and Approach ChatGPT in Text-Annotation Tasks Paper • 2307.02179 • Published Jul 5, 2023 • 7
Building Cooperative Embodied Agents Modularly with Large Language Models Paper • 2307.02485 • Published Jul 5, 2023 • 11
DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models Paper • 2307.02421 • Published Jul 5, 2023 • 34
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis Paper • 2307.01952 • Published Jul 4, 2023 • 80
Flacuna: Unleashing the Problem Solving Power of Vicuna using FLAN Fine-Tuning Paper • 2307.02053 • Published Jul 5, 2023 • 23
LongNet: Scaling Transformers to 1,000,000,000 Tokens Paper • 2307.02486 • Published Jul 5, 2023 • 80
Meta-training with Demonstration Retrieval for Efficient Few-shot Learning Paper • 2307.00119 • Published Jun 30, 2023 • 6
JourneyDB: A Benchmark for Generative Image Understanding Paper • 2307.00716 • Published Jul 3, 2023 • 18
Improving Language Plasticity via Pretraining with Active Forgetting Paper • 2307.01163 • Published Jul 3, 2023 • 6
Large Language Models are Effective Text Rankers with Pairwise Ranking Prompting Paper • 2306.17563 • Published Jun 30, 2023 • 9
An Efficient Sparse Inference Software Accelerator for Transformer-based Language Models on CPUs Paper • 2306.16601 • Published Jun 28, 2023 • 4
ArrayBot: Reinforcement Learning for Generalizable Distributed Manipulation through Touch Paper • 2306.16857 • Published Jun 29, 2023 • 5
Automatic Calibration and Error Correction for Large Language Models via Pareto Optimal Self-Supervision Paper • 2306.16564 • Published Jun 28, 2023 • 3
Towards Language Models That Can See: Computer Vision Through the LENS of Natural Language Paper • 2306.16410 • Published Jun 28, 2023 • 27
HyenaDNA: Long-Range Genomic Sequence Modeling at Single Nucleotide Resolution Paper • 2306.15794 • Published Jun 27, 2023 • 17
Understanding In-Context Learning via Supportive Pretraining Data Paper • 2306.15091 • Published Jun 26, 2023 • 6
LeanDojo: Theorem Proving with Retrieval-Augmented Language Models Paper • 2306.15626 • Published Jun 27, 2023 • 16
Extending Context Window of Large Language Models via Positional Interpolation Paper • 2306.15595 • Published Jun 27, 2023 • 53
Beyond Scale: the Diversity Coefficient as a Data Quality Metric Demonstrates LLMs are Pre-trained on Formally Diverse Data Paper • 2306.13840 • Published Jun 24, 2023 • 11
Swin-Free: Achieving Better Cross-Window Attention and Efficiency with Size-varying Window Paper • 2306.13776 • Published Jun 23, 2023 • 5
Supervised Pretraining Can Learn In-Context Reinforcement Learning Paper • 2306.14892 • Published Jun 26, 2023 • 8
Faster Segment Anything: Towards Lightweight SAM for Mobile Applications Paper • 2306.14289 • Published Jun 25, 2023 • 15
H_2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models Paper • 2306.14048 • Published Jun 24, 2023 • 11
On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes Paper • 2306.13649 • Published Jun 23, 2023 • 15
Opportunities and Risks of LLMs for Scalable Deliberation with Polis Paper • 2306.11932 • Published Jun 20, 2023 • 6
RepoFusion: Training Code Models to Understand Your Repository Paper • 2306.10998 • Published Jun 19, 2023 • 14