1 29

Xiangyan Liu

xyliu6

AI & ML interests

None yet

Recent Activity

upvoted a paper about 5 hours ago

TwinFlow: Realizing One-step Generation on Large Models with Self-adversarial Flows

upvoted a paper about 1 month ago

Reasoning with Confidence: Efficient Verification of LLM Reasoning Steps via Uncertainty Heads

upvoted a paper about 1 month ago

Diffusion Language Models are Super Data Learners

View all activity

Organizations

None yet

upvoted a paper about 5 hours ago

TwinFlow: Realizing One-step Generation on Large Models with Self-adversarial Flows

Paper • 2512.05150 • Published 8 days ago • 62

upvoted 2 papers about 1 month ago

Reasoning with Confidence: Efficient Verification of LLM Reasoning Steps via Uncertainty Heads

Paper • 2511.06209 • Published Nov 9 • 17

Diffusion Language Models are Super Data Learners

Paper • 2511.03276 • Published Nov 5 • 124

upvoted 4 papers 2 months ago

updated a dataset 3 months ago

xyliu6/DSTVD-backup

Updated Sep 7 • 10

published a dataset 3 months ago

xyliu6/DSTVD-backup

Updated Sep 7 • 10

upvoted 3 papers 6 months ago

Balancing Truthfulness and Informativeness with Uncertainty-Aware Instruction Fine-Tuning

Paper • 2502.11962 • Published Feb 17 • 38

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 263

SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis

Paper • 2506.02096 • Published Jun 2 • 52

commented a paper 6 months ago

SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis

Paper • 2506.02096 • Published Jun 2 • 52 •

upvoted a paper 6 months ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2 • 187

upvoted 5 papers 7 months ago

Sherlock: Self-Correcting Reasoning in Vision-Language Models

Paper • 2505.22651 • Published May 28 • 49

Fostering Video Reasoning via Next-Event Prediction

Paper • 2505.22457 • Published May 28 • 29

Lifelong Safety Alignment for Language Models

Paper • 2505.20259 • Published May 26 • 23

Reinforcement Fine-Tuning Powers Reasoning Capability of Multimodal Large Language Models

Paper • 2505.18536 • Published May 24 • 18

Optimizing Anytime Reasoning via Budget Relative Policy Optimization

Paper • 2505.13438 • Published May 19 • 36

authored a paper 7 months ago

Towards Robust Multi-Modal Reasoning via Model Selection

Paper • 2310.08446 • Published Oct 12, 2023

Xiangyan Liu

AI & ML interests

Recent Activity

Organizations

xyliu6's activity