Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
xu's picture
2 3

xu

xuzhaopan

AI & ML interests

None yet

Recent Activity

authored a paper 19 days ago
Can World Simulators Reason? Gen-ViRe: A Generative Visual Reasoning Benchmark
upvoted a paper 20 days ago
Can World Simulators Reason? Gen-ViRe: A Generative Visual Reasoning Benchmark
upvoted a paper 3 months ago
PEBench: A Fictitious Dataset to Benchmark Machine Unlearning for Multimodal Large Language Models
View all activity

Organizations

None yet

authored a paper 19 days ago

Can World Simulators Reason? Gen-ViRe: A Generative Visual Reasoning Benchmark

Paper • 2511.13853 • Published 22 days ago • 34
authored 3 papers 9 months ago

PEBench: A Fictitious Dataset to Benchmark Machine Unlearning for Multimodal Large Language Models

Paper • 2503.12545 • Published Mar 16 • 6

MPBench: A Comprehensive Multimodal Reasoning Benchmark for Process Errors Identification

Paper • 2503.12505 • Published Mar 16 • 11

ProJudge: A Multi-Modal Multi-Discipline Benchmark and Instruction-Tuning Dataset for MLLM-based Process Judges

Paper • 2503.06553 • Published Mar 9 • 7
authored a paper about 1 year ago

GATE OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation

Paper • 2411.18499 • Published Nov 27, 2024 • 18
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs