"John is 50 years old, can his son be 65?" Evaluating NLP Models' Understanding of Feasibility

Type: Preprint

Publication Date: 2022-01-01

Citations: 0

DOI: https://doi.org/10.48550/arxiv.2210.07471

Locations

  • arXiv (Cornell University) - View - PDF
  • DataCite API - View

Similar Works

Action Title Year Authors
+ Large Language Models Are Also Good Prototypical Commonsense Reasoners 2023 Chenin Li
Qianglong Chen
Yin Zhang
Yifei Zhang
Hongxiang Yao
+ Open-ended Commonsense Reasoning with Unrestricted Answer Scope 2023 Ling Chen
Xuchao Zhang
Xujiang Zhao
Yanchi Liu
Wei Cheng
Takao Osaki
Katsushi Matsuda
Haifeng Chen
Liang Zhao
+ PDF Chat Tree-of-Traversals: A Zero-Shot Reasoning Algorithm for Augmenting Black-box Language Models with Knowledge Graphs 2024 Elan Markowitz
Anil Ramakrishna
Jwala Dhamala
Ninareh Mehrabi
Charith Peris
Rahul Gupta
Kai-Wei Chang
Aram Galstyan
+ Faithful Question Answering with Monte-Carlo Planning 2023 Ruixin Hong
Hongming Zhang
Hong Zhao
Dong Yu
Changshui Zhang
+ Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models 2023 Lei Wang
Wanyu Xu
Yihuai Lan
Zhiqiang Hu
Yunshi Lan
Roy Ka-Wei Lee
Eeā€Peng Lim
+ Faithful Question Answering with Monte-Carlo Planning 2023 Ruixin Hong
Hongming Zhang
Hong Zhao
Dong Yu
Changshui Zhang
+ PDF Chat MoreHopQA: More Than Multi-hop Reasoning 2024 Julian Schnitzler
Xanh Ho
Jiahao Huang
Florian Boudin
Saku Sugawara
Akiko Aizawa
+ PDF Chat DARA: Decomposition-Alignment-Reasoning Autonomous Language Agent for Question Answering over Knowledge Graphs 2024 Haishuo Fang
Xiaodan Zhu
Iryna Gurevych
+ CAR: Conceptualization-Augmented Reasoner for Zero-Shot Commonsense Question Answering 2023 Weiqi Wang
Tianqing Fang
Wenxuan Ding
Baixuan Xu
Xin Liu
Yangqiu Song
Antoine Bosselut
+ Making Large Language Models Better Reasoners with Step-Aware Verifier 2022 Yifei Li
Zeqi Lin
Shizhuo Zhang
Qiang Fu
Bei Chen
Jianā€“Guang Lou
Weizhu Chen
+ PDF Chat Make LLMs better zero-shot reasoners: Structure-orientated autonomous reasoning 2024 Pengfei He
Zitao Li
Yue Xing
Yaling Li
Jiliang Tang
B. Ding
+ Faithful Chain-of-Thought Reasoning 2023 Qing Lyu
Shreya Havaldar
Adam Stein
Li Zhang
Delip Rao
Eric Wong
Marianna Apidianaki
Chris Callison-Burch
+ PDF Chat Large Language Models Still Face Challenges in Multi-Hop Reasoning with External Knowledge 2024 Haoyuan Zhang
+ PDF Chat A Chain-of-Thought Is as Strong as Its Weakest Link: A Benchmark for Verifiers of Reasoning Chains 2024 Alon Jacovi
Yonatan Bitton
Bernd Bohnet
Jonathan Herzig
Or Honovich
Michael S. Tseng
Michael Collins
Roee Aharoni
Mor Geva
+ It's Not Easy Being Wrong: Evaluating Process of Elimination Reasoning in Large Language Models 2023 Nishant Balepur
Shramay Palta
Rachel Rudinger
+ CAR: Conceptualization-Augmented Reasoner for Zero-Shot Commonsense Question Answering 2023 Weiqi Wang
Tianqing Fang
Wenxuan Ding
Baixuan Xu
Xin Liu
Yangqiu Song
Antoine Bosselut
+ Active Prompting with Chain-of-Thought for Large Language Models 2023 Shizhe Diao
Pengcheng Wang
Yong Lin
Tong Zhang
+ WikiWhy: Answering and Explaining Cause-and-Effect Questions 2022 Matthew Ho
Aditya Sharma
Justin S. Chang
Michael Saxon
Sharon Levy
Yujie Lu
William Yang Wang
+ True Detective: A Deep Abductive Reasoning Benchmark Undoable for GPT-3 and Challenging for GPT-4 2023 Maksym Del
Mark Fishel
+ True Detective: A Deep Abductive Reasoning Benchmark Undoable for GPT-3 and Challenging for GPT-4 2022 Maksym Del
Mark Fishel

Works That Cite This (0)

Action Title Year Authors

Works Cited by This (0)

Action Title Year Authors