CS-Bench: A Comprehensive Benchmark for Large Language Models towards
Computer Science Mastery
CS-Bench: A Comprehensive Benchmark for Large Language Models towards
Computer Science Mastery
Computer Science (CS) stands as a testament to the intricacies of human intelligence, profoundly advancing the development of artificial intelligence and modern society. However, the current community of large language models (LLMs) overly focuses on benchmarks for analyzing specific foundational skills (e.g. mathematics and code generation), neglecting an all-round evaluation …