Training Long-Context, Multi-Turn Software Engineering Agents with Reinforcement Learning Paper ⢠2508.03501 ⢠Published Aug 5 ⢠59
AdaptiVocab: Enhancing LLM Efficiency in Focused Domains through Lightweight Vocabulary Adaptation Paper ⢠2503.19693 ⢠Published Mar 25 ⢠76
OpenScholar_V1 Collection The set of models, index, data associated with the paper "OpenScholar: Synthesizing Scientific Literature with Retrieval-Augmented LMs". ⢠8 items ⢠Updated Nov 22, 2024 ⢠42
Long Code Arena: a Set of Benchmarks for Long-Context Code Models Paper ⢠2406.11612 ⢠Published Jun 17, 2024 ⢠25
Large Language Model Distillation Doesn't Need a Teacher Paper ⢠2305.14864 ⢠Published May 24, 2023 ⢠3