Model Card for PEARL-7B (Based on Qwen3-VL-8B-Instruct)
Perceptual-Evidence Anchored Reinforced Learning for Multimodal Reasoning. arxiv.org/abs/2511.18437
Model Details
Model Description
PEARL is a multimodal reasoning model fine-tuned from Qwen3-VL-8B-Instruct using Perceptual-Evidence Anchored Reinforced Learning.
- Developed by: Chi Zhang
- Finetuned from model: Qwen3-VL-8B-Instruct
Model Sources
- Repository: PEARL
- Paper: arxiv.org/abs/2511.18437
Uses
Training Details
Training Data
Citation