CorDEL: A Contrastive Deep Learning Approach for Entity Linkage

Type: Article

Publication Date: 2020-11-01

Citations: 12

DOI: https://doi.org/10.1109/icdm50108.2020.00171

Abstract

Entity linkage (EL) is a critical problem in data cleaning and integration. In the past several decades, EL has typically been done by rule-based systems or traditional machine learning models with hand-curated features, both of which heavily depend on manual human inputs. With the ever-increasing growth of new data, deep learning (DL) based approaches have been proposed to alleviate the high cost of EL associated with the traditional models. Existing exploration of DL models for EL strictly follows the well-known twin-network architecture. However, we argue that the twin-network architecture is sub-optimal to EL, leading to inherent drawbacks of existing models. In order to address the drawbacks, we propose a novel and generic contrastive DL framework for EL. The proposed framework is able to capture both syntactic and semantic matching signals and pays attention to subtle but critical differences. Based on the framework, we develop a contrastive DL approach for EL, CorDEL, with a simple yet powerful variant called CorDEL-Sum. We evaluate CorDEL with extensive experiments conducted on both public benchmark datasets and a real-world dataset. CorDEL outperforms previous state-of-the-art models by 5.2% on public benchmark datasets. Moreover, CorDEL yields a 29.4% improvement over the current best DL model on the real-world dataset, while reducing the number of training parameters by 96.8%.

Locations

  • arXiv (Cornell University) - View - PDF
  • 2021 IEEE International Conference on Data Mining (ICDM) - View

Similar Works

Action Title Year Authors
+ CorDEL: A Contrastive Deep Learning Approach for Entity Linkage 2020 Zhengyang Wang
Bunyamin Sisman
Hao Wei
Dong Xin
Shuiwang Ji
+ Entity Linking Meets Deep Learning: Techniques and Solutions 2021 Wei Shen
Yuhan Li
Yinan Liu
Jiawei Han
Jianyong Wang
Xiaojie Yuan
+ Entity Linking Meets Deep Learning: Techniques and Solutions 2021 Wei Shen
Yuhan Li
Yinan Liu
Jiawei Han
Jianyong Wang
Xiaojie Yuan
+ A Critical Re-evaluation of Benchmark Datasets for (Deep) Learning-Based Matching Algorithms 2023 George Papadakis
Nishadi Kirielle
Peter Christen
Themis Palpanas
+ PDF Chat Fast Record Linkage for Company Entities 2019 Thomas Gschwind
Christoph Miksovic
Julian Minder
Кацярына Мирыленка
Paolo Scotton
+ PDF Chat Sudowoodo: Contrastive Self-supervised Learning for Multi-purpose Data Integration and Preparation 2023 Runhui Wang
Yuliang Li
Jin Wang
+ Deep Transfer Learning for Multi-source Entity Linkage via Domain Adaptation 2021 Di Jin
Bunyamin Sisman
Hao Wei
Dong Xin
Danai Koutra
+ PDF Chat Deep transfer learning for multi-source entity linkage via domain adaptation 2021 Di Jin
Bunyamin Sisman
Hao Wei
Xin Luna Dong
Danai Koutra
+ Sudowoodo: Contrastive Self-supervised Learning for Multi-purpose Data Integration and Preparation 2022 Runhui Wang
Yuliang Li
Jin Wang
+ Fast Record Linkage for Company Entities 2019 Thomas Gschwind
Christoph Miksovic
Julian Minder
Кацярына Мирыленка
Paolo Scotton
+ PDF Chat DeepER - Deep Entity Resolution. 2017 Muhammad Ebraheem
Saravanan Thirumuruganathan
Shafiq Joty
Mourad Ouzzani
Nan Tang
+ Fast Record Linkage for Company Entities. 2019 Thomas Gschwind
Christoph Miksovic
Julian Minder
Кацярына Мирыленка
Paolo Scotton
+ SC-Block: Supervised Contrastive Blocking within Entity Resolution Pipelines 2023 Alexander Brinkmann
Roee Shraga
Christian Bizer
+ PDF Chat FlexER: Flexible Entity Resolution for Multiple Intents 2023 Bar Genossar
Roee Shraga
Avigdor Gal
+ PDF Chat When GDD meets GNN: A Knowledge-driven Neural Connection for Effective Entity Resolution in Property Graphs 2024 Junwei Hu
Michael Bewong
Selasi Kwashie
Yidi Zhang
Vincent Mwintieru Nofong
John Wondoh
Zaiwen Feng
+ FlexER: Flexible Entity Resolution for Multiple Intents 2022 Bar Genossar
Roee Shraga
Avigdor Gal
+ IDEL: In-Database Entity Linking with Neural Embeddings 2018 Torsten Kilias
Alexander Löser
Felix A. Gers
Richard Koopmanschap
Ying Zhang
Martin Kersten
+ PDF Chat Heterogeneous Entity Matching with Complex Attribute Associations using BERT and Neural Networks 2023 Jiamin Lu
Shitao Wang
+ Heterogeneous Entity Matching with Complex Attribute Associations using BERT and Neural Networks 2023 Jiamin Lu
Shitao Wang
+ Heterogeneous Entity Matching with Complex Attribute Associations using BERT and Neural Networks 2023 Shitao Wang
Jiamin Lu