Shopping MMLU: A Massive Multi-Task Online Shopping Benchmark for Large
Language Models
Shopping MMLU: A Massive Multi-Task Online Shopping Benchmark for Large
Language Models
Online shopping is a complex multi-task, few-shot learning problem with a wide and evolving range of entities, relations, and tasks. However, existing models and benchmarks are commonly tailored to specific tasks, falling short of capturing the full complexity of online shopping. Large Language Models (LLMs), with their multi-task and few-shot …