Tung-Lin Wu's picture

Tung-Lin Wu

tunglinwu

·

tunglinwood

AI & ML interests

None yet

Recent Activity

upvoted an article 3 days ago

Continuous batching from first principles

new activity 21 days ago

tencent/KaLM-Embedding-Gemma3-12B-2511:Rerank

upvoted a collection 4 months ago

View all activity

Organizations

None yet

upvoted an article 3 days ago

Article

Continuous batching from first principles

+1

16 days ago

•

263

New activity in tencent/KaLM-Embedding-Gemma3-12B-2511 21 days ago

Rerank

#5 opened 21 days ago by

upvoted a collection 4 months ago

DeepSeek-V3.1

4 items • Updated 13 days ago • 254

New activity in deepseek-ai/DeepSeek-R1-0528 6 months ago

Do you have deepseek-r1-0528-awq plan?

#68 opened 6 months ago by

upvoted 2 collections 8 months ago

Qwen3

84 items • Updated Aug 6 • 1.48k

GLM-4-0414

GLM-4-0414 series model • 8 items • Updated Jun 30 • 133

upvoted 2 papers 8 months ago

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

Paper • 2502.01061 • Published Feb 3 • 222

Training Sparse Mixture Of Experts Text Embedding Models

Paper • 2502.07972 • Published Feb 11 • 8

upvoted a collection 8 months ago

Qwen2.5-Omni

End-to-End Omni (text, audio, image, video, and natural speech interaction) model based Qwen2.5 • 7 items • Updated Jul 21 • 160

published a Space 8 months ago

Chatui

liked a Space 9 months ago

The Ultra-Scale Playbook

The ultimate guide to training LLM on large GPU Clusters

upvoted a paper 9 months ago

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5, 2024 • 137

upvoted an article 9 months ago

Article

Training and Finetuning Embedding Models with Sentence Transformers v3

May 28, 2024

•

261

liked a model 9 months ago

PharMolix/BioMedGPT-R1

Updated Mar 26 • 20 • 16

liked a Space 9 months ago

GAIA Leaderboard

Submit and evaluate models on GAIA leaderboard

upvoted a paper 9 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 249

liked a Space 9 months ago

Open Deep-Research

OpenAI's Deep Research, but open

liked a model 10 months ago

baichuan-inc/Baichuan-M1-14B-Instruct

14B • Updated Feb 20 • 2.36k • 67

upvoted 2 papers 10 months ago

Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment

Paper • 2502.16894 • Published Feb 24 • 32

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 429