Putting Question-Answering Systems into Practice

Type: Article

Publication Date: 2018-12-31

Citations: 29

DOI: https://doi.org/10.1145/3309706

Abstract

Traditional information retrieval (such as that offered by web search engines) impedes users with information overload from extensive result pages and the need to manually locate the desired information therein. Conversely, question-answering systems change how humans interact with information systems: users can now ask specific questions and obtain a tailored answer - both conveniently in natural language. Despite obvious benefits, their use is often limited to an academic context, largely because of expensive domain customizations, which means that the performance in domain-specific applications often fails to meet expectations. This paper proposes cost-efficient remedies: (i) we leverage metadata through a filtering mechanism, which increases the precision of document retrieval, and (ii) we develop a novel fuse-and-oversample approach for transfer learning in order to improve the performance of answer extraction. Here knowledge is inductively transferred from a related, yet different, tasks to the domain-specific application, while accounting for potential differences in the sample sizes across both tasks. The resulting performance is demonstrated with actual use cases from a finance company and the film industry, where fewer than 400 question-answer pairs had to be annotated in order to yield significant performance gains. As a direct implication to management, this presents a promising path to better leveraging of knowledge stored in information systems.

Locations

  • ACM Transactions on Management Information Systems - View
  • arXiv (Cornell University) - View - PDF
  • DataCite API - View

Similar Works

Action Title Year Authors
+ PDF Chat Putting Question-Answering Systems into Practice: Transfer Learning for Efficient Domain Customization 2018 Bernhard Kratzwald
Stefan Feuerriegel
+ Knowing More About Questions Can Help: Improving Calibration in Question Answering 2021 Shujian Zhang
Chengyue Gong
Eunsol Choi
+ Knowing More About Questions Can Help: Improving Calibration in Question Answering 2021 Shujian Zhang
Chengyue Gong
Eunsol Choi
+ C-MORE: Pretraining to Answer Open-Domain Questions by Consulting Millions of References 2022 Xiang Yue
Xiaoman Pan
Wenlin Yao
Dian Yu
Dong Yu
Jianshu Chen
+ C-MORE: Pretraining to Answer Open-Domain Questions by Consulting Millions of References 2022 Xiang Yue
Xiaoman Pan
Wenlin Yao
Dian Yu
Dong Yu
Jianshu Chen
+ Evaluating LLMs on Document-Based QA: Exact Answer Selection and Numerical Extraction using Cogtale dataset 2023 Zafaryab Rasool
Scott Barnett
Stefanus Kurniawan
Sherwin Balugo
Rajesh Vasa
Courtney Chesser
Alex Bahar‐Fuchs
+ Technical Question Answering across Tasks and Domains 2020 Wenhao Yu
Lingfei Wu
Yu Deng
Qingkai Zeng
Ruchi Mahindru
Sinem Güven
Meng Jiang
+ PDF Chat RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering 2024 Zihan Zhang
Meng Fang
Ling Chen
+ PDF Chat Cheap and Good? Simple and Effective Data Augmentation for Low Resource Machine Reading 2021 Hoang Van
Vikas Yadav
Mihai Surdeanu
+ PDF Chat Enhancing Question Answering on Charts Through Effective Pre-training Tasks 2024 Ashim Gupta
Vivek Gupta
Shuo Zhang
Yujie He
Ning Zhang
Shalin Shah
+ Evaluating LLMs on document-based QA: Exact answer selection and numerical extraction using CogTale dataset 2024 Zafaryab Rasool
Stefanus Kurniawan
Sherwin Balugo
Scott Barnett
Rajesh Vasa
Courtney Chesser
Benjamin M. Hampstead
Sylvie Belleville
Kon Mouzakis
Alex Bahar‐Fuchs
+ PDF Chat Aggregated Knowledge Model: Enhancing Domain-Specific QA with Fine-Tuned and Retrieval-Augmented Generation Models 2024 Fengchen Liu
Jae-Jin Jung
Wei Feinstein
Jeff DAmbrogia
G. Jung
+ PDF Chat When to Read Documents or QA History: On Unified and Selective Open-domain QA 2023 Kyungjae Lee
SangEun Han
Seung-won Hwang
Moontae Lee
+ PDF Chat Towards Better Generalization in Open-Domain Question Answering by Mitigating Context Memorization 2024 Zixuan Zhang
Revanth Gangi Reddy
Kevin Small
Tong Zhang
Heng Ji
+ PDF Chat PAQ: 65 Million Probably-Asked Questions and What You Can Do With Them 2021 Patrick A. Lewis
Yuxiang Wu
Linqing Liu
Pasquale Minervini
Heinrich Küttler
Aleksandra Piktus
Pontus Stenetorp
Sebastian Riedel
+ PAQ: 65 Million Probably-Asked Questions and What You Can Do With Them 2021 Patrick Lewis
Yuxiang Wu
Linqing Liu
Pasquale Minervini
Heinrich Küttler
Aleksandra Piktus
Pontus Stenetorp
Sebastian Riedel
+ PDF Chat Enhancing Large Language Model Performance To Answer Questions and Extract Information More Accurately 2024 Liang Zhang
Katherine Jijo
S. Pallam Setty
Eden Chung
Fatima Javid
Natan Vidra
Tommy Clifford
+ When to Read Documents or QA History: On Unified and Selective Open-domain QA 2023 Kyungjae Lee
SangEun Han
Seung-won Hwang
Moontae Lee
+ PAQ: 65 Million Probably-Asked Questions and What You Can Do With Them 2021 Patrick Lewis
Yuxiang Wu
Linqing Liu
Pasquale Minervini
Heinrich Küttler
Aleksandra Piktus
Pontus Stenetorp
Sebastian Riedel
+ Practical Annotation Strategies for Question Answering Datasets 2020 Bernhard Kratzwald
Xiang Yue
Huan Sun
Stefan Feuerriegel