SiReRAG: Indexing Similar and Related Information for Multihop Reasoning

Type: Preprint

Publication Date: 2024-12-08

Citations: 0

DOI: https://doi.org/10.48550/arxiv.2412.06206

Abstract

Indexing is an important step towards strong performance in retrieval-augmented generation (RAG) systems. However, existing methods organize data based on either semantic similarity (similarity) or related information (relatedness), but do not cover both perspectives comprehensively. Our analysis reveals that modeling only one perspective results in insufficient knowledge synthesis, leading to suboptimal performance on complex tasks requiring multihop reasoning. In this paper, we propose SiReRAG, a novel RAG indexing approach that explicitly considers both similar and related information. On the similarity side, we follow existing work and explore some variants to construct a similarity tree based on recursive summarization. On the relatedness side, SiReRAG extracts propositions and entities from texts, groups propositions via shared entities, and generates recursive summaries to construct a relatedness tree. We index and flatten both similarity and relatedness trees into a unified retrieval pool. Our experiments demonstrate that SiReRAG consistently outperforms state-of-the-art indexing methods on three multihop datasets (MuSiQue, 2WikiMultiHopQA, and HotpotQA), with an average 1.9% improvement in F1 scores. As a reasonably efficient solution, SiReRAG enhances existing reranking methods significantly, with up to 7.8% improvement in average F1 scores.
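The indexing steps named in the abstract (group propositions via shared entities, summarize recursively, flatten both trees into one pool) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the `(text, entities)` input format, the fixed-size batching used in place of any clustering step, the choice to drop entities mentioned by a single proposition, and the caller-supplied `summarize` callable (standing in for an LLM call) are all assumptions made for illustration.

```python
from collections import defaultdict
from typing import Callable, Dict, List, Set, Tuple

# Hypothetical input type: (proposition text, entities it mentions).
Proposition = Tuple[str, Set[str]]
# Hypothetical summarizer: takes a batch of texts, returns one summary.
Summarizer = Callable[[List[str]], str]


def group_by_shared_entity(props: List[Proposition]) -> Dict[str, List[str]]:
    """Group proposition texts under each entity they mention.

    Assumption of this sketch: an entity mentioned by only one
    proposition links nothing together, so its group is dropped.
    """
    groups: Dict[str, List[str]] = defaultdict(list)
    for text, entities in props:
        for entity in sorted(entities):
            groups[entity].append(text)
    return {e: texts for e, texts in groups.items() if len(texts) > 1}


def recursive_summaries(nodes: List[str], summarize: Summarizer,
                        fanout: int = 2) -> List[str]:
    """Return every summary node built level by level above `nodes`.

    Batches of `fanout` consecutive nodes are summarized until a single
    root remains; fixed-size batching is a simplification.
    """
    if fanout < 2:
        raise ValueError("fanout must be at least 2")
    summaries: List[str] = []
    level = list(nodes)
    while len(level) > 1:
        level = [summarize(level[i:i + fanout])
                 for i in range(0, len(level), fanout)]
        summaries.extend(level)
    return summaries


def build_retrieval_pool(chunks: List[str], props: List[Proposition],
                         summarize: Summarizer, fanout: int = 2) -> List[str]:
    """Flatten a similarity tree and a relatedness tree into one pool."""
    # Similarity side: text chunks plus recursive summaries above them.
    similarity = list(chunks) + recursive_summaries(chunks, summarize, fanout)
    # Relatedness side: entity-grouped propositions plus their summaries.
    groups = group_by_shared_entity(props)
    aggregates = [" ".join(texts) for _, texts in sorted(groups.items())]
    relatedness = aggregates + recursive_summaries(aggregates, summarize, fanout)
    # Remove exact duplicates while preserving order.
    return list(dict.fromkeys(similarity + relatedness))
```

At query time, every entry in the returned pool would be embedded and ranked together, so a retriever can surface a raw chunk, an entity-centered group of propositions, or a higher-level summary from either tree.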

Locations

  • arXiv (Cornell University)

Similar Works

  • SiReRAG: Indexing Similar and Related Information for Multihop Reasoning (2024). Nan Zhang, Prafulla Kumar Choubey, Alexander R. Fabbri, Gabriel Bernadett-Shapiro, Rui Zhang, Prasenjit Mitra, Caiming Xiong, Chien-Sheng Wu
  • StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization (2024). Zhuoqun Li, Xuanang Chen, Haiyang Yu, Hongyu Lin, Yaojie Lu, Qiaoyu Tang, Fei Huang, Xianpei Han, Le Sun, Yongbin Li
  • HIRO: Hierarchical Information Retrieval Optimization (2024). Krish Goel, Mahek Chandak
  • HybGRAG: Hybrid Retrieval-Augmented Generation on Textual and Relational Knowledge Bases (2024). Meng-Chieh Lee, Qi Zhu, Costas Mavromatis, Zhen Han, Soji Adeshina, Vassilis N. Ioannidis, Huzefa Rangwala, Christos Faloutsos
  • TRACE the Evidence: Constructing Knowledge-Grounded Reasoning Chains for Retrieval-Augmented Generation (2024). Jinyuan Fang, Zaiqiao Meng, Craig Macdonald
  • Multi-Level Querying using A Knowledge Pyramid (2024). Rubing Chen, Xulu Zhang, Jiaxin Wu, Wenqi Fan, Xiao-Yong Wei, Qing Li
  • Hierarchical Retrieval-Augmented Generation Model with Rethink for Multi-hop Question Answering (2024). Xiaoming Zhang, Ming Wang, Xiaocui Yang, Daling Wang, Feng Shi, Yifei Zhang
  • CommunityKG-RAG: Leveraging Community Structures in Knowledge Graphs for Advanced Retrieval-Augmented Generation in Fact-Checking (2024). Rong-Ching Chang, Jiawei Zhang
  • BRIEF: Bridging Retrieval and Inference for Multi-hop Reasoning via Compression (2024). Yuankai Li, Jia-Chen Gu, Di Wu, Kai-Wei Chang, Nanyun Peng
  • LLMs Know What They Need: Leveraging a Missing Information Guided Framework to Empower Retrieval-Augmented Generation (2024). Keheng Wang, Feiyu Duan, Peiguang Li, Sirui Wang, Xunliang Cai
  • Assisting humans in complex comparisons: automated information comparison at scale (2024). Truman Yuen, Graham A. Watt, Yuri Lawryshyn
  • Retrieval-Generation Synergy Augmented Large Language Models (2023). Zhangyin Feng, Xiaocheng Feng, Dezhi Zhao, Maojin Yang, Bing Qin
  • Retrieval-Generation Synergy Augmented Large Language Models (2024). Zhangyin Feng, Xiaocheng Feng, Dezhi Zhao, Maojin Yang, Bing Qin
  • FRAG: A Flexible Modular Framework for Retrieval-Augmented Generation based on Knowledge Graphs (2025). Zhiqiang Gao, Yukun Cao, Huanye Wang, Ao Ke, Yuan Feng, X. H. Xie, S. Kevin Zhou
  • Retrieval Augmentation for Commonsense Reasoning: A Unified Approach (2022). Wenhao Yu, Chenguang Zhu, Zhihan Zhang, Shuohang Wang, Zhuosheng Zhang, Yuwei Fang, Meng Jiang
  • Atomic Fact Decomposition Helps Attributed Question Answering (2024). Zheng Yan, Jun Wang, Jiaoyan Chen, Xiaoli Li, Ru Li, Jeff Z. Pan
  • A Survey of Graph Retrieval-Augmented Generation for Customized Large Language Models (2025). Qinggang Zhang, Shengyuan Chen, Yuanchen Bei, Zheng Yuan, Huachi Zhou, Zijin Hong, Junnan Dong, Hao Chen, Yi Chang, Jimmy Xiangji Huang
  • From Local to Global: A Graph RAG Approach to Query-Focused Summarization (2024). Darren Edge, Ha Trinh, Newman Cheng, Joshua Bradley, Alex Chao, Apurva Mody, Steven Truitt, Jonathan Larson
  • Locate Then Ask: Interpretable Stepwise Reasoning for Multi-hop Question Answering (2022). Siyuan Wang, Zhongyu Wei, Zhihao Fan, Qi Zhang, Xuanjing Huang

Works That Cite This (0)

Works Cited by This (0)
