
Yuxian Gu

t1101675

https://t1101675.github.io/

AI & ML interests

Efficient methods for language models

Recent Activity

upvoted a paper 3 days ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

updated a Space 27 days ago

t1101675/trackio

published a Space 27 days ago

t1101675/trackio



upvoted a paper 3 days ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published 5 days ago • 75

upvoted 3 papers 2 months ago

LongLive: Real-time Interactive Long Video Generation

Paper • 2509.22622 • Published Sep 26 • 184

DC-Gen: Post-Training Diffusion Acceleration with Deeply Compressed Latent Space

Paper • 2509.25180 • Published Sep 29 • 6

DC-VideoGen: Efficient Video Generation with Deep Compression Video Autoencoder

Paper • 2509.25182 • Published Sep 29 • 37

upvoted 2 papers 6 months ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 262

Rectified Sparse Attention

Paper • 2506.04108 • Published Jun 4 • 10

upvoted a paper 8 months ago

BitNet b1.58 2B4T Technical Report

Paper • 2504.12285 • Published Apr 16 • 75

upvoted a paper 9 months ago

LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention

Paper • 2502.14866 • Published Feb 20 • 13

upvoted a paper 10 months ago

Instruction Pre-Training: Language Models are Supervised Multitask Learners

Paper • 2406.14491 • Published Jun 20, 2024 • 95

upvoted 2 papers 12 months ago

Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published Dec 13, 2024 • 108

NVILA: Efficient Frontier Visual Language Models

Paper • 2412.04468 • Published Dec 5, 2024 • 59

upvoted 2 papers about 1 year ago

MiniPLM: Knowledge Distillation for Pre-Training Language Models

Paper • 2410.17215 • Published Oct 22, 2024 • 16

Data Selection via Optimal Control for Language Models

Paper • 2410.07064 • Published Oct 9, 2024 • 9

upvoted 2 papers over 1 year ago

Compact Language Models via Pruning and Knowledge Distillation

Paper • 2407.14679 • Published Jul 19, 2024 • 39

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Paper • 2403.05530 • Published Mar 8, 2024 • 66

upvoted a paper almost 2 years ago

Towards Optimal Learning of Language Models

Paper • 2402.17759 • Published Feb 27, 2024 • 18

upvoted a paper about 2 years ago

An Emulator for Fine-Tuning Large Language Models using Small Language Models

Paper • 2310.12962 • Published Oct 19, 2023 • 13

upvoted 3 papers over 2 years ago
