Glyph: Scaling Context Windows via Visual-Text Compression Paper • 2510.17800 • Published Oct 20 • 67
Boundary-Guided Policy Optimization for Memory-efficient RL of Diffusion Large Language Models Paper • 2510.11683 • Published Oct 13 • 13
StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets? Paper • 2510.02209 • Published Oct 2 • 52
SIRI Collection Scaling Iterative Reinforcement Learning with Interleaved Compression • 5 items • Updated Sep 30 • 3
SIRI: Scaling Iterative Reinforcement Learning with Interleaved Compression Paper • 2509.25176 • Published Sep 29 • 13
CHARM: Control-point-based 3D Anime Hairstyle Auto-Regressive Modeling Paper • 2509.21114 • Published Sep 25 • 16
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models Paper • 2508.06471 • Published Aug 8 • 192
GLM-4.5 Collection GLM-4.5: An open-source large language model designed for intelligent agents by Z.ai • 11 items • Updated Aug 11 • 249
LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning Paper • 2506.18841 • Published Jun 23 • 56
OpenSAE-LLaMA-3.1-8B Collection OpenSAE checkpoints for LLaMA 3.1 8B base model • 38 items • Updated Jan 29 • 5
VerIF Collection RL trained models and datasets for instruction-following • 7 items • Updated Jun 12 • 5
VerIF: Verification Engineering for Reinforcement Learning in Instruction Following Paper • 2506.09942 • Published Jun 11 • 5
Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis Paper • 2506.04142 • Published Jun 4 • 27
SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models Paper • 2506.04180 • Published Jun 4 • 33
Hard Negative Contrastive Learning for Fine-Grained Geometric Understanding in Large Multimodal Models Paper • 2505.20152 • Published May 26 • 11