Ask a Question

Prefer a chat interface with context about you and your work?

Merino: Entropy-driven Design for Generative Language Models on IoT Devices

Merino: Entropy-driven Design for Generative Language Models on IoT Devices

Generative Large Language Models (LLMs) stand as a revolutionary advancement in the modern era of artificial intelligence (AI). However, directly deploying LLMs in resource-constrained hardware, such as Internet-of-Things (IoT) devices, is difficult due to their high computational cost. In this paper, we propose a novel information-entropy framework for designing mobile-friendly …