leonardlin
's Collections
multilingual
updated
LLaMA Beyond English: An Empirical Study on Language Capability Transfer
Paper
•
2401.01055
•
Published
•
54
YAYI 2: Multilingual Open-Source Large Language Models
Paper
•
2312.14862
•
Published
•
13
Order Matters in the Presence of Dataset Imbalance for Multilingual
Learning
Paper
•
2312.06134
•
Published
•
2
TaCo: Enhancing Cross-Lingual Transfer for Low-Resource Languages in
LLMs through Translation-Assisted Chain-of-Thought Processes
Paper
•
2311.10797
•
Published
Distilling Efficient Language-Specific Models for Cross-Lingual Transfer
Paper
•
2306.01709
•
Published
•
1
Hyperpolyglot LLMs: Cross-Lingual Interpretability in Token Embeddings
Paper
•
2311.18034
•
Published
Okapi: Instruction-tuned Large Language Models in Multiple Languages
with Reinforcement Learning from Human Feedback
Paper
•
2307.16039
•
Published
•
4
Monolingual or Multilingual Instruction Tuning: Which Makes a Better
Alpaca
Paper
•
2309.08958
•
Published
•
2
A Paradigm Shift in Machine Translation: Boosting Translation
Performance of Large Language Models
Paper
•
2309.11674
•
Published
•
31
Extrapolating Large Language Models to Non-English by Aligning Languages
Paper
•
2308.04948
•
Published
•
1
JaColBERT and Hard Negatives, Towards Better Japanese-First Embeddings
for Retrieval: Early Technical Report
Paper
•
2312.16144
•
Published
•
3
From Base to Conversational: Japanese Instruction Dataset and Tuning
Large Language Models
Paper
•
2309.03412
•
Published
•
1
Analyzing Syntactic Generalization Capacity of Pre-trained Language
Models on Japanese Honorific Conversion
Paper
•
2306.03055
•
Published
Efficient Finetuning Large Language Models For Vietnamese Chatbot
Paper
•
2309.04646
•
Published
•
1
Paper
•
2309.16609
•
Published
•
34
Parameter-Efficient Neural Reranking for Cross-Lingual and Multilingual
Retrieval
Paper
•
2204.02292
•
Published
•
1
Composable Sparse Fine-Tuning for Cross-Lingual Transfer
Paper
•
2110.07560
•
Published
•
1
Comparison between parameter-efficient techniques and full fine-tuning:
A case study on multilingual news article classification
Paper
•
2308.07282
•
Published
•
1
Transfer to a Low-Resource Language via Close Relatives: The Case Study
on Faroese
Paper
•
2304.08823
•
Published
•
1
Steering Large Language Models for Machine Translation with Finetuning
and In-Context Learning
Paper
•
2310.13448
•
Published
•
1
Skywork: A More Open Bilingual Foundation Model
Paper
•
2310.19341
•
Published
•
5
InstructAlign: High-and-Low Resource Language Alignment via Continual
Crosslingual Instruction Tuning
Paper
•
2305.13627
•
Published
•
1
Not All Languages Are Created Equal in LLMs: Improving Multilingual
Capability by Cross-Lingual-Thought Prompting
Paper
•
2305.07004
•
Published
•
1
Prompt-Tuning Can Be Much Better Than Fine-Tuning on Cross-lingual
Understanding With Multilingual Language Models
Paper
•
2210.12360
•
Published
•
1
Hyper-X: A Unified Hypernetwork for Multi-Task Multilingual Transfer
Paper
•
2205.12148
•
Published
•
2
Multilingual Large Language Models Are Not (Yet) Code-Switchers
Paper
•
2305.14235
•
Published
Contextual Code Switching for Machine Translation using Language Models
Paper
•
2312.13179
•
Published
Are Multilingual Models Effective in Code-Switching?
Paper
•
2103.13309
•
Published
Reducing language context confusion for end-to-end code-switching
automatic speech recognition
Paper
•
2201.12155
•
Published
A Vocabulary-Free Multilingual Neural Tokenizer for End-to-End Task
Learning
Paper
•
2204.10815
•
Published
A Multi-dimensional Evaluation of Tokenizer-free Multilingual Pretrained
Models
Paper
•
2210.07111
•
Published
Language Model Tokenizers Introduce Unfairness Between Languages
Paper
•
2305.15425
•
Published
•
1
Multilingual Text Representation
Paper
•
2309.00949
•
Published
How Robust is Neural Machine Translation to Language Imbalance in
Multilingual Tokenizer Training?
Paper
•
2204.14268
•
Published
Unified model for code-switching speech recognition and language
identification based on a concatenated tokenizer
Paper
•
2306.08753
•
Published
•
1
Cerbero-7B: A Leap Forward in Language-Specific LLMs Through Enhanced
Chat Corpus Generation and Evaluation
Paper
•
2311.15698
•
Published
Towards Better Instruction Following Language Models for Chinese:
Investigating the Impact of Training Data and Evaluation
Paper
•
2304.07854
•
Published
•
1
Eliciting the Translation Ability of Large Language Models via
Multilingual Finetuning with Translation Instructions
Paper
•
2305.15083
•
Published
•
2
XLM-V: Overcoming the Vocabulary Bottleneck in Multilingual Masked
Language Models
Paper
•
2301.10472
•
Published
Crosslingual Generalization through Multitask Finetuning
Paper
•
2211.01786
•
Published
•
2
PolyLM: An Open Source Polyglot Large Language Model
Paper
•
2307.06018
•
Published
•
25
A Shocking Amount of the Web is Machine Translated: Insights from
Multi-Way Parallelism
Paper
•
2401.05749
•
Published
•
6
Is ChatGPT A Good Translator? Yes With GPT-4 As The Engine
Paper
•
2301.08745
•
Published
Breaking the Curse of Multilinguality with Cross-lingual Expert Language
Models
Paper
•
2401.10440
•
Published
MaLA-500: Massive Language Adaptation of Large Language Models
Paper
•
2401.13303
•
Published
•
11
CroissantLLM: A Truly Bilingual French-English Language Model
Paper
•
2402.00786
•
Published
•
25
Multilingual E5 Text Embeddings: A Technical Report
Paper
•
2402.05672
•
Published
•
20
Aya Dataset: An Open-Access Collection for Multilingual Instruction
Tuning
Paper
•
2402.06619
•
Published
•
53
Do Llamas Work in English? On the Latent Language of Multilingual
Transformers
Paper
•
2402.10588
•
Published
•
1
Simple and Scalable Strategies to Continually Pre-train Large Language
Models
Paper
•
2403.08763
•
Published
•
48
Getting the most out of your tokenizer for pre-training and domain
adaptation
Paper
•
2402.01035
•
Published
•
2
How Good is Your Tokenizer? On the Monolingual Performance of
Multilingual Language Models
Paper
•
2012.15613
•
Published
•
1
Rethinking Tokenization: Crafting Better Tokenizers for Large Language
Models
Paper
•
2403.00417
•
Published
•
1
Fast Vocabulary Transfer for Language Model Compression
Paper
•
2402.09977
•
Published
•
2
Efficient Language Model Training through Cross-Lingual and Progressive
Transfer Learning
Paper
•
2301.09626
•
Published
•
2
Reuse Your Rewards: Reward Model Transfer for Zero-Shot Cross-Lingual
Alignment
Paper
•
2404.12318
•
Published
•
14
Measuring Taiwanese Mandarin Language Understanding
Paper
•
2403.20180
•
Published
•
4
Zero-Shot Tokenizer Transfer
Paper
•
2405.07883
•
Published
•
4
Continual Pre-Training for Cross-Lingual LLM Adaptation: Enhancing
Japanese Language Capabilities
Paper
•
2404.17790
•
Published
•
5
A Survey on Large Language Models with Multilingualism: Recent Advances
and New Frontiers
Paper
•
2405.10936
•
Published
•
1
Dynamic data sampler for cross-language transfer learning in large
language models
Paper
•
2405.10626
•
Published
•
4
Tagengo: A Multilingual Chat Dataset
Paper
•
2405.12612
•
Published
•
3
Sailor: Open Language Models for South-East Asia
Paper
•
2404.03608
•
Published
•
20
EXAONE 3.0 7.8B Instruction Tuned Language Model
Paper
•
2408.03541
•
Published
•
34
Trans-Tokenization and Cross-lingual Vocabulary Transfers: Language
Adaptation of LLMs for Low-Resource NLP
Paper
•
2408.04303
•
Published
•
9