Contrastive Domain Adaptation for Question Answering using Limited Text Corpora

Type: Article

Publication Date: 2021-01-01

Citations: 22

DOI: https://doi.org/10.18653/v1/2021.emnlp-main.754

Abstract

Question generation has recently shown impressive results in customizing question answering (QA) systems to new domains. These approaches circumvent the need for manually annotated training data from the new domain and, instead, generate synthetic question-answer pairs that are used for training. However, existing methods for question generation rely on large amounts of synthetically generated data and costly computational resources, which render these techniques largely inaccessible when the text corpus is of limited size. This is problematic as many niche domains rely on small text corpora, which naturally restricts the amount of synthetic data that can be generated. In this paper, we propose a novel framework for domain adaptation called contrastive domain adaptation for QA (CAQA). Specifically, CAQA combines techniques from question generation and domain-invariant learning to answer out-of-domain questions in settings with limited text corpora. Here, we train a QA system on both source data and generated data from the target domain with a contrastive adaptation loss that is incorporated in the training objective. With this combination, our model achieved considerable improvements compared to state-of-the-art baselines.
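
The training objective described in the abstract (a QA loss on source and generated target data, plus a contrastive adaptation term) can be illustrated with a minimal sketch. The abstract does not give the form of the loss, so everything below is an assumption made for illustration: the split of token features into "answer" and "other" classes, the use of a squared distance between class means as the discrepancy measure, the weight `beta`, and all function names are hypothetical and not taken from the paper.

```python
from typing import List, Sequence

Vector = Sequence[float]


def mean_vector(vectors: List[Vector]) -> List[float]:
    """Average a non-empty list of equal-length feature vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def sq_distance(a: Vector, b: Vector) -> float:
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def contrastive_adaptation_term(
    src_answer: List[Vector],
    src_other: List[Vector],
    tgt_answer: List[Vector],
    tgt_other: List[Vector],
) -> float:
    """Hypothetical discrepancy term (an assumption, not the paper's loss).

    It is small when same-class features align across domains and
    answer features stay far from non-answer features.
    """
    mu_sa, mu_so = mean_vector(src_answer), mean_vector(src_other)
    mu_ta, mu_to = mean_vector(tgt_answer), mean_vector(tgt_other)
    # Pull together: same class, different domain.
    intra = sq_distance(mu_sa, mu_ta) + sq_distance(mu_so, mu_to)
    # Push apart: answer vs. non-answer features across domains.
    inter = sq_distance(mu_sa, mu_to) + sq_distance(mu_ta, mu_so)
    return intra - inter


def training_objective(qa_loss: float, adaptation_term: float, beta: float = 0.1) -> float:
    """QA loss on source and synthetic target data plus the weighted adaptation term."""
    return qa_loss + beta * adaptation_term


# Toy two-dimensional features (made up for this example).
src_answer = [[1.0, 0.0], [1.0, 0.2]]
src_other = [[0.0, 1.0], [0.2, 1.0]]
tgt_answer = [[1.0, 0.1]]
tgt_other = [[0.1, 1.0]]

aligned = contrastive_adaptation_term(src_answer, src_other, tgt_answer, tgt_other)
# Swapping the target classes simulates features that are misaligned across domains.
misaligned = contrastive_adaptation_term(src_answer, src_other, tgt_other, tgt_answer)
```

In this toy setup the term is lower when target features line up with the matching source class than when the classes are swapped, which is the behavior a domain-invariant objective is meant to reward. A real system would compute such a term on encoder representations within each training batch and would likely use a kernel-based discrepancy rather than a plain distance between means.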

Locations

  • arXiv (Cornell University)
  • Repository for Publications and Research Data (ETH Zurich)
  • Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing

Similar Works

  • Contrastive Domain Adaptation for Question Answering using Limited Text Corpora (2021). Zhenrui Yue, Bernhard Kratzwald, Stefan Feuerriegel
  • Synthetic Question Value Estimation for Domain Adaptation of Question Answering (2022). Xiang Yue, Ziyu Yao, Huan Sun
  • Domain Adaptation for Question Answering via Question Classification (2022). Zhenrui Yue, Huimin Zeng, Ziyi Kou, Lanyu Shang, Dong Wang
  • QA Domain Adaptation using Hidden Space Augmentation and Self-Supervised Contrastive Adaptation (2022). Zhenrui Yue, Huimin Zeng, Bernhard Kratzwald, Stefan Feuerriegel, Dong Wang
  • Back-Training excels Self-Training at Unsupervised Domain Adaptation of Question Generation and Passage Retrieval (2021). Devang Kulshreshtha, Robert Belfer, Iulian Vlad Serban, Siva Reddy
  • Towards Domain Adaptation from Limited Data for Question Answering Using Deep Neural Networks (2019). Timothy J. Hazen, Shehzaad Dhuliawala, Daniel Boies
  • DomainInv: Domain Invariant Fine Tuning and Adversarial Label Correction For QA Domain Adaptation (2023). Anant Khandelwal
  • Source-Free Domain Adaptation for Question Answering with Masked Self-training (2022). M. Yin, B. Wang, Yanhan Dong, C. Ling
  • Source-Free Domain Adaptation for Question Answering with Masked Self-training (2024). M. Yin, Boyu Wang, Yue Dong, Charles X. Ling
  • End-to-End Synthetic Data Generation for Domain Adaptation of Question Answering Systems (2020). Siamak Shakeri, Cícero Nogueira dos Santos, Henry Zhu, Patrick Ng, Nan Feng, Zhiguo Wang, Ramesh Nallapati, Bing Xiang
  • End-to-End Synthetic Data Generation for Domain Adaptation of Question Answering Systems (2020). Siamak Shakeri, Cícero Nogueira dos Santos, Henghui Zhu, Patrick Ng, Nan Feng, Zhiguo Wang, Ramesh Nallapati, Bing Xiang
  • SimRAG: Self-Improving Retrieval-Augmented Generation for Adapting Large Language Models to Specialized Domains (2024). Ran Xu, Hui Liu, Sreyashi Nag, Zhenwei Dai, Yaochen Xie, Xianfeng Tang, Chen Luo, Yang Li, Joyce C. Ho, Carl Yang
  • Domain-agnostic Question-Answering with Adversarial Training (2019). Seanie Lee, Donggyu Kim, Jangwon Park
  • Adversarial Domain Adaptation for Machine Reading Comprehension (2019). Huazheng Wang, Zhe Gan, Xiaodong Liu, Jingjing Liu, Jianfeng Gao, Hongning Wang
  • Domain-agnostic Question-Answering with Adversarial Training (2019). Seanie Lee, Dong-Gyu Kim, Jangwon Park
  • Unsupervised LLM Adaptation for Question Answering (2024). Kuniaki Saito, Kihyuk Sohn, Chen‐Yu Lee, Yoshitaka Ushiku