Inference Optimized Checkpoints (with Model Optimizer) Collection: a collection of generative models quantized and optimized for inference with TensorRT Model Optimizer • 45 items
Gemma 3 QAT Collection: Quantization-Aware Trained (QAT) Gemma 3 checkpoints. The models preserve quality comparable to half precision while using about 3x less memory • 15 items
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits • Paper • arXiv:2402.17764 • Published Feb 27, 2024
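The "1.58 bits" in the paper title refers to ternary weights: each weight takes one of the three values {-1, 0, +1}, which carries log2(3) ≈ 1.58 bits of information. Below is a minimal sketch of the absmean ternary quantization scheme described in the BitNet b1.58 paper, assuming PyTorch; the function name, epsilon value, and example tensor are illustrative, not taken from the paper's code.

```python
import torch

def absmean_ternary_quantize(w: torch.Tensor, eps: float = 1e-5):
    """Quantize a weight tensor to {-1, 0, +1} with a per-tensor scale.

    Sketch of the absmean scheme from the BitNet b1.58 paper:
    scale by the mean absolute value, round, and clip to [-1, 1].
    Each ternary weight carries log2(3) ~= 1.58 bits of information.
    """
    gamma = w.abs().mean()                           # per-tensor scale
    w_q = (w / (gamma + eps)).round().clamp(-1, 1)   # ternary values
    return w_q, gamma                                # dequantize as w_q * gamma


if __name__ == "__main__":
    w = torch.randn(4, 8)
    w_q, gamma = absmean_ternary_quantize(w)
    print(w_q)                                       # entries are -1, 0, or 1
    print((w_q * gamma - w).abs().mean())            # mean quantization error
```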