Cho's picture

Cho

1980Dragon

·

AI & ML interests

None yet

Organizations

None yet

1980Dragon's activity

upvoted 2 papers about 1 year ago

PromptTTS 2: Describing and Generating Voices with Text Prompt

Paper • 2309.02285 • Published Sep 5, 2023 • 11

Efficient RLHF: Reducing the Memory Usage of PPO

Paper • 2309.00754 • Published Sep 1, 2023 • 13

upvoted 18 papers over 1 year ago

AlpaGasus: Training A Better Alpaca with Fewer Data

Paper • 2307.08701 • Published Jul 17, 2023 • 22

Retentive Network: A Successor to Transformer for Large Language Models

Paper • 2307.08621 • Published Jul 17, 2023 • 170

In-context Autoencoder for Context Compression in a Large Language Model

Paper • 2307.06945 • Published Jul 13, 2023 • 27

PolyLM: An Open Source Polyglot Large Language Model

Paper • 2307.06018 • Published Jul 12, 2023 • 25

Empowering Cross-lingual Behavioral Testing of NLP Models with Typological Features

Paper • 2307.05454 • Published Jul 11, 2023 • 6

Collaborative Score Distillation for Consistent Visual Synthesis

Paper • 2307.04787 • Published Jul 4, 2023 • 28

Large Language Models for Supply Chain Optimization

Paper • 2307.03875 • Published Jul 8, 2023 • 17

RLTF: Reinforcement Learning from Unit Test Feedback

Paper • 2307.04349 • Published Jul 10, 2023 • 4

Sketch-A-Shape: Zero-Shot Sketch-to-3D Shape Generation

Paper • 2307.03869 • Published Jul 8, 2023 • 22

Large Language Models as General Pattern Machines

Paper • 2307.04721 • Published Jul 10, 2023 • 14

Teaching Arithmetic to Small Transformers

Paper • 2307.03381 • Published Jul 7, 2023 • 17

Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong General Audio Event Taggers

Paper • 2307.03183 • Published Jul 6, 2023 • 10

Lost in the Middle: How Language Models Use Long Contexts

Paper • 2307.03172 • Published Jul 6, 2023 • 36

A Survey on Evaluation of Large Language Models

Paper • 2307.03109 • Published Jul 6, 2023 • 42

SkipDecode: Autoregressive Skip Decoding with Batching and Caching for Efficient LLM Inference

Paper • 2307.02628 • Published Jul 5, 2023 • 10

Training Models to Generate, Recognize, and Reframe Unhelpful Thoughts

Paper • 2307.02768 • Published Jul 6, 2023 • 14

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

Paper • 2307.02499 • Published Jul 4, 2023 • 15

Focused Transformer: Contrastive Training for Context Scaling

Paper • 2307.03170 • Published Jul 6, 2023 • 11