Model Card for PEARL-7B-Based on Qwen3-VL-8B-Instruct

Perceptual-Evidence Anchored Reinforced Learning for Multimodal Reasoning. arxiv.org/abs/2511.18437

Model Details

Model Description

This is a multimodal reasoning model.

  • Developed by: [Chi Zhang~[email protected]]
  • Finetuned from model [optional]: [Qwen3-VL-8B-Instruct]

Model Sources [optional]

  • Repository: [PEARL]
  • Paper: [More Information Needed]

Uses

VLMEvalkit

Training Details

EasyR1

Training Data

ViRL39k

Citation [optional]

Downloads last month
21
Safetensors
Model size
9B params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for Rex1090/PEARL-8B

Finetuned
(81)
this model
Quantizations
2 models