ClidSum: A Benchmark Dataset for Cross-Lingual Dialogue Summarization
ClidSum: A Benchmark Dataset for Cross-Lingual Dialogue Summarization
We present ClidSum, a benchmark dataset towards building cross-lingual summarization systems on dialogue documents. It consists of 67k+ dialogue documents and 112k+ annotated summaries in different target languages. Based on the proposed ClidSum, we introduce two benchmark settings for supervised and semi-supervised scenarios, respectively. We then build various baseline systems …