Haotong Qin

HaotongQin

https://htqin.github.io/

AI & ML interests

Model Compression, Efficient AIGC

authored a paper 2 months ago

Quantized Visual Geometry Grounded Transformer

Paper • 2509.21302 • Published Sep 25, 2025 • 8

authored a paper 7 months ago

QVGen: Pushing the Limit of Quantized Video Generative Models

Paper • 2505.11497 • Published May 16, 2025 • 4

authored 9 papers 11 months ago

How Good is Google Bard's Visual Understanding? An Empirical Study on Open Challenges

Paper • 2307.15016 • Published Jul 27, 2023

Accurate LoRA-Finetuning Quantization of LLMs via Information Retention

Paper • 2402.05445 • Published Feb 8, 2024 • 1

BiBERT: Accurate Fully Binarized BERT

Paper • 2203.06390 • Published Mar 12, 2022

DB-LLM: Accurate Dual-Binarization for Efficient LLMs

Paper • 2402.11960 • Published Feb 19, 2024 • 3

OHQ: On-chip Hardware-aware Quantization

Paper • 2309.01945 • Published Sep 5, 2023 • 1

BinaryDM: Towards Accurate Binarization of Diffusion Model

Paper • 2404.05662 • Published Apr 8, 2024 • 1

SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large Language Models

Paper • 2405.14917 • Published May 23, 2024 • 1

BiBench: Benchmarking and Analyzing Network Binarization

Paper • 2301.11233 • Published Jan 26, 2023

A Survey of Low-bit Large Language Models: Basics, Systems, and Algorithms

Paper • 2409.16694 • Published Sep 25, 2024

authored a paper over 1 year ago

How Good Are Low-bit Quantized LLaMA3 Models? An Empirical Study

Paper • 2404.14047 • Published Apr 22, 2024 • 45