+
PDF
Chat
|
Evaluating language models for mathematics through interactions
|
2024
|
Katherine M. Collins
Albert Q. Jiang
Simon Frieder
Lionel Wong
Miri Zilka
Umang Bhatt
Thomas Lukasiewicz
Yuhuai Wu
Joshua B. Tenenbaum
William Hart
|
+
PDF
Chat
|
Don't Trust: Verify -- Grounding LLM Quantitative Reasoning with
Autoformalization
|
2024
|
Jin Zhou
Charles Staats
Wenda Li
Christian Szegedy
Kilian Q. Weinberger
Yuhuai Wu
|
+
PDF
Chat
|
REFACTOR: Learning to Extract Theorems from Proofs
|
2024
|
Jin Zhou
Yuhuai Wu
Qiyang Li
Roger Grosse
|
+
|
Magnushammer: A Transformer-based Approach to Premise Selection
|
2023
|
Maciej Mikuła
Szymon Antoniak
Szymon Tworkowski
Albert Qiaochu Jiang
Jin Zhou
Christian Szegedy
Łukasz Kuciński
Piotr Miłoś
Yuhuai Wu
|
+
|
PaLM 2 Technical Report
|
2023
|
Rohan Anil
Andrew M. Dai
Orhan Fırat
Melvin Johnson
Dmitry Lepikhin
A. M. A. dos Passos
Siamak Shakeri
Emanuel Taropa
Paige Bailey
Zhifeng Chen
|
+
|
Lexinvariant Language Models
|
2023
|
Qian Huang
Eric Zelikman
Sarah Li Chen
Yuhuai Wu
Gregory Valiant
Percy Liang
|
+
|
Evaluating Language Models for Mathematics through Interactions
|
2023
|
Katherine M. Collins
Albert Q. Jiang
Simon Frieder
Lionel Wong
Miri Zilka
Umang Bhatt
Thomas Lukasiewicz
Yuhuai Wu
Joshua B. Tenenbaum
William E. Hart
|
+
|
Length Generalization in Arithmetic Transformers
|
2023
|
Samy Jelassi
Stéphane d’Ascoli
Carles Domingo-Enrich
Yuhuai Wu
Yuanzhi Li
François Charton
|
+
|
Focused Transformer: Contrastive Training for Context Scaling
|
2023
|
Szymon Tworkowski
Konrad Staniszewski
Mikołaj Pacek
Yuhuai Wu
Henryk Michalewski
Piotr Miłoś
|
+
PDF
Chat
|
Hierarchical Transformers Are More Efficient Language Models
|
2022
|
Piotr Nawrot
Szymon Tworkowski
Michał Tyrolski
Łukasz Kaiser
Yuhuai Wu
Christian Szegedy
Henryk Michalewski
|
+
|
Memorizing Transformers
|
2022
|
Yuhuai Wu
Markus N. Rabe
DeLesley Hutchins
Christian Szegedy
|
+
|
Block-Recurrent Transformers
|
2022
|
DeLesley Hutchins
Imanol Schlag
Yuhuai Wu
Ethan Dyer
Behnam Neyshabur
|
+
|
STaR: Bootstrapping Reasoning With Reasoning
|
2022
|
Eric Zelikman
Yuhuai Wu
Noah D. Goodman
|
+
|
Thor: Wielding Hammers to Integrate Language Models and Automated Theorem Provers
|
2022
|
Albert Q. Jiang
Wenda Li
Szymon Tworkowski
Konrad Czechowski
Tomasz Odrzygóźdź
Piotr Miłoś
Yuhuai Wu
Mateja Jamnik
|
+
|
Fast and Precise: Adjusting Planning Horizon with Adaptive Subgoal Search
|
2022
|
Michał Zawalski
Michał Tyrolski
Konrad Czechowski
Damian Stachura
Piotr Piękos
Tomasz Odrzygóźdź
Yuhuai Wu
Łukasz Kuciński
Piotr Miłoś
|
+
|
Autoformalization with Large Language Models
|
2022
|
Yuhuai Wu
Albert Q. Jiang
Wenda Li
Markus N. Rabe
Charles Staats
Mateja Jamnik
Christian Szegedy
|
+
|
Insights into Pre-training via Simpler Synthetic Tasks
|
2022
|
Yuhuai Wu
Felix Li
Percy Liang
|
+
|
Solving Quantitative Reasoning Problems with Language Models
|
2022
|
Aitor Lewkowycz
Anders Andreassen
David Dohan
Ethan Dyer
Henryk Michalewski
Vinay Ramasesh
Ambrose Slone
Cem Anil
Imanol Schlag
Theo Gutman-Solo
|
+
|
Draft, Sketch, and Prove: Guiding Formal Theorem Provers with Informal Proofs
|
2022
|
Albert Q. Jiang
Sean Welleck
Jin Zhou
Wenda Li
Jiacheng Liu
Mateja Jamnik
Timothée Lacroix
Yuhuai Wu
Guillaume Lample
|
+
|
Path Independent Equilibrium Models Can Better Exploit Test-Time Computation
|
2022
|
Cem Anil
Ashwini Pokle
Kaiqu Liang
Johannes Treutlein
Yuhuai Wu
Shaojie Bai
Zico Kolter
Roger Grosse
|
+
|
Holistic Evaluation of Language Models
|
2022
|
Percy Liang
Rishi Bommasani
Tong Lee
Dimitris Tsipras
Dilara Soylu
Michihiro Yasunaga
Yian Zhang
Deepak Narayanan
Yuhuai Wu
Ananya Kumar
|
+
|
Exploring Length Generalization in Large Language Models
|
2022
|
Cem Anil
Yuhuai Wu
Anders Andreassen
Aitor Lewkowycz
Vedant Misra
Vinay Ramasesh
Ambrose Slone
Guy Gur-Ari
Ethan Dyer
Behnam Neyshabur
|
+
|
Language Model Cascades
|
2022
|
David Dohan
Winnie Xu
Aitor Lewkowycz
Jacob Austin
David Bieber
Raphael Gontijo Lopes
Yuhuai Wu
Henryk Michalewski
Rif A. Saurous
Jascha Sohl‐Dickstein
|
+
|
Subgoal Search For Complex Reasoning Tasks
|
2021
|
Konrad Czechowski
Tomasz Odrzygóźdź
Marek Zbysiński
Michał Zawalski
Krzysztof Olejnik
Yuhuai Wu
Łukasz Kuciński
Piotr Miłoś
|
+
|
Learning to Give Checkable Answers with Prover-Verifier Games.
|
2021
|
Cem Anil
Guodong Zhang
Yuhuai Wu
Roger Grosse
|
+
|
Subgoal Search For Complex Reasoning Tasks
|
2021
|
Konrad Czechowski
Tomasz Odrzygóźdź
Marek Zbysiński
Michał Zawalski
Krzysztof Olejnik
Yuhuai Wu
Łukasz Kuciński
Piotr Miłoś
|
+
PDF
Chat
|
Learning Branching Heuristics for Propositional Model Counting
|
2021
|
Pashootan Vaezipoor
Gil Lederman
Yuhuai Wu
Chris J. Maddison
Roger Grosse
Sanjit A. Seshia
Fahiem Bacchus
|
+
|
INT: An Inequality Benchmark for Evaluating Generalization in Theorem Proving
|
2021
|
Yuhuai Wu
Albert Qiaochu Jiang
Jimmy Ba
Roger Grosse
|
+
|
Proof Artifact Co-training for Theorem Proving with Language Models.
|
2021
|
Jesse Michael Han
Jason Rute
Yuhuai Wu
Edward W. Ayers
Stanislas Polu
|
+
|
LIME: Learning Inductive Bias for Primitives of Mathematical Reasoning.
|
2021
|
Yuhuai Wu
Markus N. Rabe
Wenda Li
Jimmy Ba
Roger Grosse
Christian Szegedy
|
+
PDF
Chat
|
LIME: Learning Inductive Bias for Primitives of Mathematical Reasoning
|
2021
|
Yuhuai Wu
Markus Rabe
Wenda Li
Jimmy Ba
Roger Grosse
Christian Szegedy
|
+
|
Nonlinear Invariant Risk Minimization: A Causal Approach
|
2021
|
Chaochao Lu
Yuhuai Wu
José Miguel Hernández-Lobato
Bernhard Schölkopf
|
+
|
LIME: Learning Inductive Bias for Primitives of Mathematical Reasoning
|
2021
|
Yuhuai Wu
Markus N. Rabe
Wenda Li
Jimmy Ba
Roger Grosse
Christian Szegedy
|
+
|
On the Opportunities and Risks of Foundation Models
|
2021
|
Rishi Bommasani
Drew A. Hudson
Ehsan Adeli
Russ B. Altman
Simran Arora
Sydney von Arx
Michael S. Bernstein
Jeannette Bohg
Antoine Bosselut
Emma Brunskill
|
+
|
Hierarchical Transformers Are More Efficient Language Models
|
2021
|
Piotr Nawrot
Szymon Tworkowski
Michał Tyrolski
Łukasz Kaiser
Yuhuai Wu
Christian Szegedy
Henryk Michalewski
|
+
|
Learning to Give Checkable Answers with Prover-Verifier Games
|
2021
|
Cem Anil
Guodong Zhang
Yuhuai Wu
Roger Grosse
|
+
|
Subgoal Search For Complex Reasoning Tasks
|
2021
|
Konrad Czechowski
Tomasz Odrzygóźdź
Marek Zbysiński
Michał Zawalski
Krzysztof Olejnik
Yuhuai Wu
Łukasz Kuciński
Piotr Miłoś
|
+
|
Proof Artifact Co-training for Theorem Proving with Language Models
|
2021
|
Jesse Michael Han
Jason Rute
Yuhuai Wu
Edward W. Ayers
Stanislas Polu
|
+
|
Learning Branching Heuristics for Propositional Model Counting
|
2020
|
Pashootan Vaezipoor
Gil Lederman
Yuhuai Wu
Chris J. Maddison
Roger Grosse
Edward A. Lee
Sanjit A. Seshia
Fahiem Bacchus
|
+
|
Modelling High-Level Mathematical Reasoning in Mechanised Declarative Proofs
|
2020
|
Wenda Li
Yu Lei
Yuhuai Wu
Lawrence C. Paulson
|
+
PDF
Chat
|
Discrete Equidecomposability and Ehrhart Theory of Polygons
|
2020
|
Paxton Turner
Yuhuai Wu
|
+
|
INT: An Inequality Benchmark for Evaluating Generalization in Theorem Proving
|
2020
|
Yuhuai Wu
Albert Qiaochu Jiang
Jimmy Ba
Roger Grosse
|
+
|
The Scattering Compositional Learner: Discovering Objects, Attributes, Relationships in Analogical Reasoning
|
2020
|
Yuhuai Wu
Honghua Dong
Roger Grosse
Jimmy Ba
|
+
|
IsarStep: a Benchmark for High-level Mathematical Reasoning
|
2020
|
Wenda Li
Lei Yu
Yuhuai Wu
Lawrence C. Paulson
|
+
|
Learning Branching Heuristics for Propositional Model Counting
|
2020
|
Pashootan Vaezipoor
G. Lederman
Yuhuai Wu
Chris J. Maddison
Roger Grosse
Edward Lee
Sanjit A. Seshia
Fahiem Bacchus
|
+
|
ACTRCE: Augmenting Experience via Teacher's Advice For Multi-Goal Reinforcement Learning
|
2019
|
Harris Chan
Yuhuai Wu
Jamie Kiros
Sanja Fidler
Jimmy Ba
|
+
|
Concurrent Meta Reinforcement Learning
|
2019
|
Emilio Parisotto
Soham Ghosh
Sai Yalamanchi
Varsha Chinnaobireddy
Yuhuai Wu
Ruslan Salakhutdinov
|
+
|
Options as responses: Grounding behavioural hierarchies in multi-agent RL
|
2019
|
Alexander Sasha Vezhnevets
Yuhuai Wu
Rémi Leblond
Joel Z. Leibo
|
+
|
Understanding Short-Horizon Bias in Stochastic Meta-Optimization
|
2018
|
Yuhuai Wu
Mengye Ren
Renjie Liao
Roger Grosse
|
+
|
An Empirical Analysis of Proximal Policy Optimization with Kronecker-factored Natural Gradients
|
2018
|
Jiaming Song
Yuhuai Wu
|
+
|
Some Considerations on Learning to Explore via Meta-Reinforcement Learning
|
2018
|
Bradly C. Stadie
Ge Yang
Rein Houthooft
Xi Chen
Yan Duan
Yuhuai Wu
Pieter Abbeel
Ilya Sutskever
|
+
|
Understanding Short-Horizon Bias in Stochastic Meta-Optimization
|
2018
|
Yuhuai Wu
Mengye Ren
Renjie Liao
Roger Grosse
|
+
|
Backpropagation through the Void: Optimizing control variates for black-box gradient estimation
|
2017
|
Will Grathwohl
Dami Choi
Yuhuai Wu
Geoffrey Roeder
David Duvenaud
|
+
|
Sticking the Landing: An Asymptotically Zero-Variance Gradient Estimator for Variational Inference.
|
2017
|
Geoffrey Roeder
Yuhuai Wu
David Duvenaud
|
+
|
Sticking the Landing: Simple, Lower-Variance Gradient Estimators for Variational Inference
|
2017
|
Geoffrey Roeder
Yuhuai Wu
David Duvenaud
|
+
|
Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation
|
2017
|
Yuhuai Wu
Elman Mansimov
S. Matthew Liao
Roger Grosse
Jimmy Ba
|
+
|
Sticking the Landing: Simple, Lower-Variance Gradient Estimators for Variational Inference
|
2017
|
Geoffrey Roeder
Yuhuai Wu
David Duvenaud
|
+
|
Backpropagation through the Void: Optimizing control variates for black-box gradient estimation
|
2017
|
Will Grathwohl
Dami Choi
Yuhuai Wu
Geoffrey Roeder
David Duvenaud
|
+
|
On Multiplicative Integration with Recurrent Neural Networks
|
2016
|
Yuhuai Wu
Saizheng Zhang
Ying Zhang
Yoshua Bengio
Ruslan Salakhutdinov
|
+
|
Path-Normalized Optimization of Recurrent Neural Networks with ReLU Activations
|
2016
|
Behnam Neyshabur
Yuhuai Wu
Ruslan Salakhutdinov
Nathan Srebro
|
+
|
Architectural Complexity Measures of Recurrent Neural Networks
|
2016
|
Saizheng Zhang
Yuhuai Wu
Tong Che
Zhouhan Lin
Roland Memisevic
Ruslan Salakhutdinov
Yoshua Bengio
|
+
|
On the Quantitative Analysis of Decoder-Based Generative Models
|
2016
|
Yuhuai Wu
Yuri Burda
Ruslan Salakhutdinov
Roger Grosse
|
+
|
On Multiplicative Integration with Recurrent Neural Networks
|
2016
|
Yuhuai Wu
Saizheng Zhang
Ying Zhang
Yoshua Bengio
Russ R. Salakhutdinov
|
+
|
Path-Normalized Optimization of Recurrent Neural Networks with ReLU Activations
|
2016
|
Behnam Neyshabur
Yuhuai Wu
Ruslan Salakhutdinov
Nathan Srebro
|
+
|
Path-Normalized Optimization of Recurrent Neural Networks with ReLU Activations
|
2016
|
Behnam Neyshabur
Yuhuai Wu
Ruslan Salakhutdinov
Nathan Srebro
|
+
|
STDP as presynaptic activity times rate of change of postsynaptic activity
|
2015
|
Yoshua Bengio
Thomas Mesnard
Asja Fischer
Saizheng Zhang
Yuhuai Wu
|
+
|
Discrete Equidecomposability and Ehrhart Theory of Polygons
|
2014
|
Paxton Turner
Yuhuai Wu
|
+
|
Conditions for Discrete Equidecomposability of Polygons
|
2014
|
Paxton Turner
Yuhuai Wu
|
+
|
Discrete Equidecomposability and Ehrhart Theory of Polygons
|
2014
|
Paxton Turner
Yuhuai Wu
|
+
|
Estimation and testing in an imperfect-inspection model
|
1993
|
Muni S. Srivastava
Yuhuai Wu
|