Paper publication

Pushing Bit-Width Limits in LLM Quantization With Saliency-Guided Mix-Precision Allocation and Learnable Affine Transformation

发布时间:2026-01-19 浏览量:9
Shuoyu Ma, Wenrui Dai, Maida Cao, Shaohui Li, Ziyang Zheng, Chenglin Li, Junni Zou, Hongkai Xiong, IEEE Data Compression Conference (DCC 2026), Snowbird, Utah, USA, Mar. 24-27, 2026.