Rethinking Multilingual Continual Pretraining: Data Mixing for Adapting LLMs Across Languages and Resources
Paper
•
2504.04152
•
Published
•
1
Rethinking Multilingual Continual Pretraining: Data Mixing for Adapting LLMs Across Languages and Resources