A novel compression technique ensuring comparable performance with 70% fewer parameters

Author: Amanda Kau (ORCID: 0009-0004-4949-9284)

Introduction

The sizes of large language models (LLMs) have been steadily increasing over the last few years.