Rex1090
/

PEARL-8B

Model card Files Files and versions

Model Card for PEARL-7B-Based on Qwen3-VL-8B-Instruct

Perceptual-Evidence Anchored Reinforced Learning for Multimodal Reasoning. arxiv.org/abs/2511.18437

Model Details

Model Description

This is a multimodal reasoning model.

Developed by: [Chi Zhang~[email protected]]
Finetuned from model [optional]: [Qwen3-VL-8B-Instruct]

Model Sources [optional]

Repository: [PEARL]
Paper: [More Information Needed]

Uses

Training Details

Training Data

Citation [optional]

Downloads last month: 21

Safetensors

Model size

9B params

Tensor type

BF16

·

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Rex1090/PEARL-8B

Base model

Qwen/Qwen3-VL-8B-Instruct

Finetuned

(81)

this model

Quantizations