InfLLM-V2: Dense-Sparse Switchable Attention for Seamless Short-to-Long Adaptation. arXiv:2509.24663, published Sep 29.
APB: Accelerating Distributed Long-Context Inference by Passing Compressed Context Blocks across GPUs. arXiv:2502.12085, published Feb 17.
FR-Spec: Accelerating Large-Vocabulary Language Models via Frequency-Ranked Speculative Sampling. arXiv:2502.14856, published Feb 20.