Ask a Question

Prefer a chat interface with context about you and your work?

MobiLlama: Towards Accurate and Lightweight Fully Transparent GPT

MobiLlama: Towards Accurate and Lightweight Fully Transparent GPT

"Bigger the better" has been the predominant trend in recent Large Language Models (LLMs) development. However, LLMs do not suit well for scenarios that require on-device processing, energy efficiency, low memory footprint, and response efficiency. These requisites are crucial for privacy, security, and sustainable deployment. This paper explores the "less …