Jiri Fajtl

ok1zjf

Recent Activity

upvoted 3 papers 7 days ago

WorldGen: From Text to Traversable and Interactive 3D Worlds

Paper • 2511.16825 • Published 18 days ago • 21

STARFlow-V: End-to-End Video Generative Modeling with Normalizing Flow

Paper • 2511.20462 • Published 13 days ago • 29

Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens

Paper • 2511.19418 • Published 14 days ago • 26

upvoted a paper about 2 months ago

Latent Diffusion Model without Variational Autoencoder

Paper • 2510.15301 • Published Oct 17 • 48

upvoted a paper 2 months ago

LLaVA-OneVision-1.5: Fully Open Framework for Democratized Multimodal Training

Paper • 2509.23661 • Published Sep 28 • 46

upvoted 7 papers 3 months ago

3D and 4D World Modeling: A Survey

Paper • 2509.07996 • Published Sep 4 • 58

A Vision-Language-Action-Critic Model for Robotic Real-World Reinforcement Learning

Paper • 2509.15937 • Published Sep 19 • 20

The Well: a Large-Scale Collection of Diverse Physics Simulations for Machine Learning

Paper • 2412.00568 • Published Nov 30, 2024 • 23

A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10 • 189

SPATIALGEN: Layout-guided 3D Indoor Scene Generation

Paper • 2509.14981 • Published Sep 18 • 27

DINOv3

Paper • 2508.10104 • Published Aug 13 • 285

Symbolic Graphics Programming with Large Language Models

Paper • 2509.05208 • Published Sep 5 • 46

upvoted a paper 4 months ago

ThinkAct: Vision-Language-Action Reasoning via Reinforced Visual Latent Planning

Paper • 2507.16815 • Published Jul 22 • 39