A Unified Evaluation Framework for Novelty Detection and Accommodation in NLP with an Instantiation in Authorship Attribution

Type: Article

Publication Date: 2023-01-01

Citations: 0

DOI: https://doi.org/10.18653/v1/2023.findings-acl.113

Abstract

State-of-the-art natural language processing models have been shown to achieve remarkable performance in ‘closed-world’ settings where all the labels in the evaluation set are known at training time. However, in real-world settings, ‘novel’ instances that do not belong to any known class are often observed. This renders the ability to deal with novelties crucial. To initiate a systematic research in this important area of ‘dealing with novelties’, we introduce NoveltyTask, a multi-stage task to evaluate a system’s performance on pipelined novelty ‘detection’ and ‘accommodation’ tasks. We provide mathematical formulation of NoveltyTask and instantiate it with the authorship attribution task that pertains to identifying the correct author of a given text. We use amazon reviews corpus and compile a large dataset (consisting of 250k instances across 200 authors/labels) for NoveltyTask. We conduct comprehensive experiments and explore several baseline methods for the task. Our results show that the methods achieve considerably low performance making the task challenging and leaving sufficient room for improvement. Finally, we believe our work will encourage research in this underexplored area of dealing with novelties, an important step en route to developing robust systems.

Locations

  • arXiv (Cornell University) - View - PDF
  • Findings of the Association for Computational Linguistics: ACL 2022 - View - PDF

Works That Cite This (0)

Action Title Year Authors

Works Cited by This (23)

Action Title Year Authors
+ PDF Chat Image-Based Recommendations on Styles and Substitutes 2015 Julian McAuley
Christopher Targett
Qinfeng Shi
Anton van den Hengel
+ Deep Learning for Anomaly Detection: A Survey 2019 Raghavendra Chalapathy
Sanjay Chawla
+ Can You Trust Your Model's Uncertainty? Evaluating Predictive Uncertainty Under Dataset Shift 2019 Yaniv Ovadia
Emily Fertig
Jie Ren
Zachary Nado
D. Sculley
Sebastian Nowozin
Joshua V. Dillon
Balaji Lakshminarayanan
Jasper Snoek
+ PDF Chat Explainable Authorship Verification in Social Media via Attention-based Similarity Learning 2019 Benedikt Boenninghoff
Steffen Hessler
Dorothea Kolossa
Robert M. Nickel
+ PDF Chat Conditional Gaussian Distribution Learning for Open Set Recognition 2020 Xin Sun
Zhenning Yang
Chi Zhang
Keck Voon Ling
Guohao Peng
+ PDF Chat Few-Shot Open-Set Recognition Using Meta-Learning 2020 Bo Liu
Hao Kang
Haoxiang Li
Gang Hua
Nuno Vasconcelos
+ Text Classification with Novelty Detection 2020 Qi Qin
Wenpeng Hu
Bing Liu
+ PDF Chat Open-world Learning and Application to Product Classification 2019 Hu Xu
Bing Liu
Lei Shu
Peilin Yu
+ A Unifying Framework for Formal Theories of Novelty:Framework, Examples and Discussion 2020 Terrance E. Boult
Przemyslaw A. Grabowicz
Derek S. Prijatelj
Roni Stern
Larry B. Holder
J. Alspector
Mohsen Jafarzadeh
Touqeer Ahmad
Akshay Raj Dhamija
Chunchun Li
+ PDF Chat The Pursuit of Knowledge: Discovering and Localizing Novel Categories using Dual Memory 2021 Sai Saketh Rambhatla
Rama Chellappa
Abhinav Shrivastava