Merino: Entropy-driven Design for Generative Language Models on IoT Devices
Generative Large Language Models (LLMs) stand as a revolutionary advancement in the modern era of artificial intelligence (AI). However, directly deploying LLMs on resource-constrained hardware, such as Internet-of-Things (IoT) devices, is difficult due to their high computational cost. In this paper, we propose a novel information-entropy framework for designing mobile-friendly …