A novel compression technique ensuring comparable performance with 70% fewer parameters

Author: Amanda Kau (ORCID: 0009-0004-4949-9284)

Introduction

Large language models (LLMs) have grown steadily in size over the last few years.