3 25 17

Shuangrui Ding

Mar2Ding

https://mark12ding.github.io/

AI & ML interests

None yet

Recent Activity

liked a Space 8 days ago

Tongyi-MAI/Z-Image-Turbo

upvoted a paper 21 days ago

Scalable Multi-Task Reinforcement Learning for Generalizable Spatial Intelligence in Visuomotor Agents

upvoted a paper 5 months ago

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction

View all activity

Organizations

None yet

liked a Space 8 days ago

Z Image Turbo

🏃

1.24k

Generate images from text prompts

upvoted a paper 21 days ago

Scalable Multi-Task Reinforcement Learning for Generalizable Spatial Intelligence in Visuomotor Agents

Paper • 2507.23698 • Published Jul 31 • 10

upvoted a paper 5 months ago

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction

Paper • 2507.15852 • Published Jul 21 • 38

upvoted a paper 6 months ago

Video World Models with Long-term Spatial Memory

Paper • 2506.05284 • Published Jun 5 • 55

updated 2 models 6 months ago

Mar2Ding/songcomposer_sft

Text Generation • Updated May 30 • 80 • 16

Mar2Ding/songcomposer_pretrain

Text Generation • Updated May 30 • 28 • 5

updated a dataset 6 months ago

Mar2Ding/songcompose_data

Viewer • Updated May 30 • 3 • 49 • 1

published a dataset 6 months ago

Mar2Ding/songcompose_data

Viewer • Updated May 30 • 3 • 49 • 1

liked 2 datasets 7 months ago

ChrisDing1105/MMIF-23k

Viewer • Updated Sep 16 • 22.6k • 228 • 12

Wiselnn/VideoRoPE

Updated Apr 7 • 197 • 2

upvoted a paper 9 months ago

Video-R1: Reinforcing Video Reasoning in MLLMs

Paper • 2503.21776 • Published Mar 27 • 79

updated a model 9 months ago

Mar2Ding/Dispider

10B • Updated Mar 11 • 49 • 2

liked a model 9 months ago

Mar2Ding/Dispider

10B • Updated Mar 11 • 49 • 2

commented a paper 9 months ago

Dispider: Enabling Video LLMs with Active Real-Time Interaction via Disentangled Perception, Decision, and Reaction

Paper • 2501.03218 • Published Jan 6 • 36 •

published a model 9 months ago

Mar2Ding/Dispider

10B • Updated Mar 11 • 49 • 2

upvoted a paper 9 months ago

Unified Reward Model for Multimodal Understanding and Generation

Paper • 2503.05236 • Published Mar 7 • 122

liked a Space 9 months ago

SAM2 Video Predictor

🔥

Segment and track objects in videos

upvoted a paper 9 months ago

Visual-RFT: Visual Reinforcement Fine-Tuning

Paper • 2503.01785 • Published Mar 3 • 85

upvoted 2 papers 10 months ago

OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference

Paper • 2502.18411 • Published Feb 25 • 74

Thus Spake Long-Context Large Language Model

Paper • 2502.17129 • Published Feb 24 • 73

Shuangrui Ding

AI & ML interests

Recent Activity

Organizations

Mar2Ding's activity

Z Image Turbo

SAM2 Video Predictor