Walter Hugo Lopez Pinaya's picture

44 22

Walter Hugo Lopez Pinaya

Warvito

·

AI & ML interests

None yet

Recent Activity

updated a collection about 22 hours ago

upvoted a paper 8 days ago

Vision Bridge Transformer at Scale

updated a collection 8 days ago

View all activity

Organizations

upvoted 3 papers 8 days ago

Vision Bridge Transformer at Scale

Paper • 2511.23199 • Published 18 days ago • 43

Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length

Paper • 2512.04677 • Published 12 days ago • 166

Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer

Paper • 2511.22699 • Published 19 days ago • 198

upvoted a paper 13 days ago

TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models

Paper • 2512.02014 • Published 15 days ago • 65

upvoted a paper 27 days ago

MMaDA-Parallel: Multimodal Large Diffusion Language Models for Thinking-Aware Editing and Generation

Paper • 2511.09611 • Published Nov 12 • 68

upvoted 3 papers about 1 month ago

MetaCLIP 2: A Worldwide Scaling Recipe

Paper • 2507.22062 • Published Jul 29 • 36

Diffusion Language Models are Super Data Learners

Paper • 2511.03276 • Published Nov 5 • 124

Continuous Autoregressive Language Models

Paper • 2510.27688 • Published Oct 31 • 70

upvoted a paper about 2 months ago

Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction

Paper • 2404.02905 • Published Apr 3, 2024 • 74

upvoted a paper 2 months ago

Diffusion Transformers with Representation Autoencoders

Paper • 2510.11690 • Published Oct 13 • 165

upvoted 4 papers 3 months ago

Stochastic activations

Paper • 2509.22358 • Published Sep 26 • 2

SANA-Video: Efficient Video Generation with Block Linear Diffusion Transformer

Paper • 2509.24695 • Published Sep 29 • 45

LongLive: Real-time Interactive Long Video Generation

Paper • 2509.22622 • Published Sep 26 • 184

Seedream 4.0: Toward Next-generation Multimodal Image Generation

Paper • 2509.20427 • Published Sep 24 • 80

upvoted 6 papers 4 months ago

UltraMemV2: Memory Networks Scaling to 120B Parameters with Superior Long-Context Learning

Paper • 2508.18756 • Published Aug 26 • 36

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4 • 265

OmniHuman-1.5: Instilling an Active Mind in Avatars via Cognitive Simulation

Paper • 2508.19209 • Published Aug 26 • 42

Waver: Wave Your Way to Lifelike Video Generation

Paper • 2508.15761 • Published Aug 21 • 34

Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models

Paper • 2410.10733 • Published Oct 14, 2024 • 8

Improving the Diffusability of Autoencoders

Paper • 2502.14831 • Published Feb 20 • 2