Gal Dalal

Follow

Generating author description...

All published works
Action Title Year Authors
+ PDF Chat Gradient Boosting Reinforcement Learning 2024 Benjamin Fuhrer
Chen Tessler
Gal Dalal
+ PDF Chat PlaMo: Plan and Move in Rich 3D Physical Environments 2024 Assaf Hallak
Gal Dalal
Chen Tessler
Kelly Guo
Shie Mannor
Gal Chechik
+ PDF Chat Tree Search-Based Policy Optimization under Stochastic Execution Delay 2024 David Valensi
Esther Derman
Shie Mannor
Gal Dalal
+ PDF Chat Exploration-Driven Policy Optimization in RLHF: Theoretical Insights on Efficient Data Utilization 2024 Yihan Du
Anna Winnicki
Gal Dalal
Shie Mannor
Ramakrishnan Srikant
+ PDF Chat Planning and Learning with Adaptive Lookahead 2023 Aviv Rosenberg
Assaf Hallak
Shie Mannor
Gal Chechik
Gal Dalal
+ PDF Chat Implementing Reinforcement Learning Datacenter Congestion Control in NVIDIA NICs 2023 Benjamin Fuhrer
Yuval Shpigelman
Chen Tessler
Shie Mannor
Gal Chechik
Eitan Zahavi
Gal Dalal
+ SoftTreeMax: Exponential Variance Reduction in Policy Gradient via Tree Search 2023 Gal Dalal
Assaf Hallak
Gugan Thoppe
Shie Mannor
Gal Chechik
+ On the Products of Stochastic and Diagonal Matrices 2023 Assaf Hallak
Gal Dalal
+ PDF Chat Reinforcement Learning for Datacenter Congestion Control 2022 Chen Tessler
Yuval Shpigelman
Gal Dalal
Amit Mandelbaum
Doron Haritan Kazakov
Benjamin Fuhrer
Gal Chechik
Shie Mannor
+ PDF Chat Reinforcement Learning for Datacenter Congestion Control 2022 Chen Tessler
Yuval Shpigelman
Gal Dalal
Amit Mandelbaum
Doron Haritan Kazakov
Benjamin Fuhrer
Gal Chechik
Shie Mannor
+ Planning and Learning with Adaptive Lookahead 2022 Aviv Rosenberg
Assaf Hallak
Shie Mannor
Gal Chechik
Gal Dalal
+ Implementing Reinforcement Learning Datacenter Congestion Control in NVIDIA NICs 2022 Benjamin Fuhrer
Yuval Shpigelman
Chen Tessler
Shie Mannor
Gal Chechik
Eitan Zahavi
Gal Dalal
+ SoftTreeMax: Policy Gradient with Tree Search 2022 Gal Dalal
Assaf Hallak
Shie Mannor
Gal Chechik
+ Reinforcement Learning with a Terminator 2022 Guy Tennenholtz
Nadav Merlis
Lior Shani
Shie Mannor
Uri Shalit
Gal Chechik
Assaf Hallak
Gal Dalal
+ PDF Chat Improve Agents without Retraining: Parallel Tree Search with Off-Policy Correction 2021 Gal Dalal
Assaf Hallak
Steven Dalton
Iuri Frosio
Shie Mannor
Gal Chechik
+ Acting in Delayed Environments with Non-Stationary Markov Policies 2021 Esther Derman
Gal Dalal
Shie Mannor
+ Acting in Delayed Environments with Non-Stationary Markov Policies 2021 Gal Dalal
Esther Derman
Shie Mannor
+ Improve Agents without Retraining: Parallel Tree Search with Off-Policy Correction 2021 Assaf Hallak
Gal Dalal
Steven Dalton
Iuri Frosio
Shie Mannor
Gal Chechik
+ On Covariate Shift of Latent Confounders in Imitation and Reinforcement Learning 2021 Guy Tennenholtz
Assaf Hallak
Gal Dalal
Shie Mannor
Gal Chechik
Uri Shalit
+ Reinforcement Learning for Datacenter Congestion Control 2021 Chen Tessler
Yuval Shpigelman
Gal Dalal
Amit Mandelbaum
Doron Haritan Kazakov
Benjamin Fuhrer
Gal Chechik
Shie Mannor
+ PDF Chat A Tale of Two-Timescale Reinforcement Learning with the Tightest Finite-Time Bound 2020 Gal Dalal
Balázs Szörényi
Gugan Thoppe
+ The Architectural Implications of Distributed Reinforcement Learning on CPU-GPU Systems 2020 Ahmet Inci
Evgeny Bolotin
Yaosheng Fu
Gal Dalal
Shie Mannor
David Nellans
Diana Marculescu
+ PDF Chat How to Combine Tree-Search Methods in Reinforcement Learning 2019 Yonathan Efroni
Gal Dalal
Bruno Scherrer
Shie Mannor
+ PDF Chat Chance-Constrained Outage Scheduling Using a Machine Learning Proxy 2019 Gal Dalal
Elad Gilboa
Shie Mannor
Louis Wehenkel
+ A Tale of Two-Timescale Reinforcement Learning with the Tightest Finite-Time Bound 2019 Gal Dalal
Balázs Szörényi
Gugan Thoppe
+ PDF Chat Convergence of Online and Approximate Multiple-Step Lookahead Policy Iteration 2018 Yonathan Efroni
Gal Dalal
Bruno Scherrer
Shie Mannor
+ PDF Chat Unit Commitment Using Nearest Neighbor as a Short-Term Proxy 2018 Gal Dalal
Elad Gilboa
Shie Mannor
Louis Wehenkel
+ PDF Chat Finite Sample Analyses for TD(0) With Function Approximation 2018 Gal Dalal
Balázs Szörényi
Gugan Thoppe
Shie Mannor
+ Chance-Constrained Outage Scheduling using a Machine Learning Proxy 2018 Gal Dalal
Elad Gilboa
Shie Mannor
Louis Wehenkel
+ Safe Exploration in Continuous Action Spaces 2018 Gal Dalal
Krishnamurthy Dvijotham
Matej Vecerík
Todd Hester
Cosmin Păduraru
Yuval Tassa
+ Beyond the One Step Greedy Approach in Reinforcement Learning 2018 Yonathan Efroni
Gal Dalal
Bruno Scherrer
Shie Mannor
+ Multiple-Step Greedy Policies in Online and Approximate Reinforcement Learning 2018 Yonathan Efroni
Gal Dalal
Bruno Scherrer
Shie Mannor
+ Anderson Acceleration for Reinforcement Learning 2018 Yonathan Efroni
Gal Dalal
Bruno Scherrer
Shie Mannor
+ Finite Sample Analysis for TD(0) with Linear Function Approximation. 2017 Gal Dalal
Balázs Szörényi
Gugan Thoppe
Shie Mannor
+ PDF Chat Supervised learning for optimal power flow as a real-time proxy 2017 Raphael Canyasse
Gal Dalal
Shie Mannor
+ Concentration Bounds for Two Timescale Stochastic Approximation with Applications to Reinforcement Learning 2017 Gal Dalal
Balázs Szörényi
Gugan Thoppe
Shie Mannor
+ Two-Timescale Stochastic Approximation Convergence Rates with Applications to Reinforcement Learning 2017 Gal Dalal
Balázs Szörényi
Gugan Thoppe
Shie Mannor
+ Finite Sample Analyses for TD(0) with Function Approximation 2017 Gal Dalal
Balázs Szörényi
Gugan Thoppe
Shie Mannor
+ Finite Sample Analysis of Two-Timescale Stochastic Approximation with Applications to Reinforcement Learning 2017 Gal Dalal
Balázs Szörényi
Gugan Thoppe
Shie Mannor
+ Supervised Learning for Optimal Power Flow as a Real-Time Proxy 2016 Raphael Canyasse
Gal Dalal
Shie Mannor
+ PDF Chat Distributed scenario-based optimization for asset management in a hierarchical decision making environment 2016 Gal Dalal
Elad Gilboa
Shie Mannor
+ Hierarchical Decision Making In Electricity Grid Management 2016 Gal Dalal
Elad Gilboa
Shie Mannor
+ Unit Commitment using Nearest Neighbor as a Short-Term Proxy 2016 Gal Dalal
Elad Gilboa
Shie Mannor
Louis Wehenkel
+ Supervised Learning for Optimal Power Flow as a Real-Time Proxy 2016 Raphael Canyasse
Gal Dalal
Shie Mannor
+ PDF Chat Reinforcement learning for the unit commitment problem 2015 Gal Dalal
Shie Mannor
Common Coauthors
Commonly Cited References
Action Title Year Authors # of times referenced
+ PDF Chat Distributed scenario-based optimization for asset management in a hierarchical decision making environment 2016 Gal Dalal
Elad Gilboa
Shie Mannor
5
+ Finite-Sample Analysis of Proximal Gradient TD Algorithms 2020 Bo Liu
Ji Liu
Mohammad Ghavamzadeh
Sridhar Mahadevan
Marek Petrik
5
+ YALMIP : a toolbox for modeling and optimization in MATLAB 2005 Johan Löfberg
5
+ Hierarchical Decision Making In Electricity Grid Management 2016 Gal Dalal
Elad Gilboa
Shie Mannor
5
+ PDF Chat Improved and Generalized Upper Bounds on the Complexity of Policy Iteration 2016 Bruno Scherrer
4
+ PDF Chat The Arcade Learning Environment: An Evaluation Platform for General Agents 2013 Marc G. Bellemare
Yavar Naddaf
Joel Veness
Michael Bowling
4
+ Proximal Policy Optimization Algorithms 2017 John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
4
+ PDF Chat Impact of forecast errors on expansion planning of power systems with a renewables target 2015 Salvador Pineda
Juan M. Morales
Trine Krogh Boomsma
4
+ Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm 2017 David Silver
Thomas Hubert
Julian Schrittwieser
Ioannis Antonoglou
Matthew Lai
Arthur Guez
Marc Lanctot
Laurent Sifre
Dharshan Kumaran
Thore Graepel
4
+ Asynchronous Methods for Deep Reinforcement Learning 2016 Volodymyr Mnih
Adrià Puigdomènech Badia
Mehdi Mirza
Alex Graves
Tim Harley
Timothy Lillicrap
David Silver
Koray Kavukcuoglu
3
+ A Concentration Bound for Stochastic Approximation via Alekseev's Formula. 2015 Gugan Thoppe
Vivek S. Borkar
3
+ Unit Commitment using Nearest Neighbor as a Short-Term Proxy 2016 Gal Dalal
Elad Gilboa
Shie Mannor
Louis Wehenkel
3
+ PDF Chat Ordinary Differential Equations and Dynamical Systems 2012 Gerald Teschl
3
+ Beyond the One Step Greedy Approach in Reinforcement Learning 2018 Yonathan Efroni
Gal Dalal
Bruno Scherrer
Shie Mannor
3
+ PDF Chat Learning to Simulate Dynamic Environments With GameGAN 2020 Seung Wook Kim
Yuhao Zhou
Jonah Philion
Antonio Torralba
Sanja Fidler
3
+ PDF Chat Learning from the hindsight plan — Episodic MPC improvement 2017 Aviv Tamar
Garrett Thomas
Tianhao Zhang
Sergey Levine
Pieter Abbeel
3
+ Nearest neighbor pattern classification 1967 Thomas M. Cover
Peter E. Hart
2
+ Delay-Aware Model-Based Reinforcement Learning for Continuous Control 2020 Baiming Chen
Mengdi Xu
Liang Li
Ding Zhao
2
+ PDF Chat Mastering Atari, Go, chess and shogi by planning with a learned model 2020 Julian Schrittwieser
Ioannis Antonoglou
Thomas Hubert
Karen Simonyan
Laurent Sifre
Simon Schmitt
Arthur Guez
Edward Lockhart
Demis Hassabis
Thore Graepel
2
+ Think Too Fast Nor Too Slow: The Computational Trade-off Between Planning And Reinforcement Learning 2020 Thomas M. Moerland
Anna Deichler
Simone Baldi
Joost Broekens
Catholijn M. Jonker
2
+ Thinking While Moving: Deep Reinforcement Learning with Concurrent Control 2020 Ted Xiao
Eric Jang
Dmitry Kalashnikov
Sergey Levine
Julian Ibarz
Karol Hausman
Alexander Herzog
2
+ Delay-Aware Multi-Agent Reinforcement Learning. 2020 Baiming Chen
Mengdi Xu
Zuxin Liu
Liang Li
Ding Zhao
2
+ A Survey on Metric Learning for Feature Vectors and Structured Data 2013 Aurélien Bellet
Amaury Habrard
Marc Sebban
2
+ PDF Chat Finite Sample Analyses for TD(0) With Function Approximation 2018 Gal Dalal
Balázs Szörényi
Gugan Thoppe
Shie Mannor
2
+ PDF Chat Exponential Lower Bounds for Policy Iteration 2010 John Fearnley
2
+ Integer Quantization for Deep Learning Inference: Principles and Empirical Evaluation 2020 Hao Wu
Patrick Judd
Xiaojie Zhang
Mikhail Isaev
Paulius Micikevicius
2
+ Trust Region Policy Optimization 2015 John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
2
+ PDF Chat Impulse control problem on finite horizon with execution delay 2008 Benjamin Bruder
Huyên Pham
2
+ Performance Bounds for Lambda Policy Iteration and Application to the Game of Tetris 2007 Bruno Scherrer
2
+ GPU-Accelerated Robotic Simulation for Distributed Reinforcement Learning 2018 Jacky Liang
Viktor Makoviychuk
Ankur Handa
Nuttapong Chentanez
Miles Macklin
Dieter Fox
2
+ Survey of Nearest Neighbor Techniques 2010 Nitin Bhatia
Vandana Bharti
2
+ PDF Chat Transport-Entropy inequalities and deviation estimates for stochastic approximation schemes 2013 Max Fathi
Noufel Frikha
2
+ 26ms Inference Time for ResNet-50: Towards Real-Time Execution of all DNNs on Smartphone 2019 Wei Niu
Xiaolong Ma
Yanzhi Wang
Bin Ren
2
+ PDF Chat How to Combine Tree-Search Methods in Reinforcement Learning 2019 Yonathan Efroni
Gal Dalal
Bruno Scherrer
Shie Mannor
2
+ PDF Chat Convergence rate of linear two-time-scale stochastic approximation 2004 Vijay R. Konda
John N. Tsitsiklis
2
+ PDF Chat Applied logistic regression 1990 David W. Hosmer
Stanley Lemeshow
2
+ Continuous control with deep reinforcement learning 2015 Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
Nicolas Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
2
+ Applied Linear Regression Models 1997 2
+ At Human Speed: Deep Reinforcement Learning with Action Delay 2018 Vlad Firoiu
Tina Ju
Josh Tenenbaum
2
+ Momentum and Stochastic Momentum for Stochastic Gradient, Newton, Proximal Point and Subspace Descent Methods 2017 Nicolas Loizou
Peter Richtárik
2
+ General Hit-and-Run Monte Carlo sampling for evaluating multidimensional integrals 1996 Ming‐Hui Chen
Bruce W. Schmeiser
2
+ PDF Chat Concentration bounds for stochastic approximations 2012 Noufel Frikha
Stéphane Menozzi
2
+ Towards Safety-Aware Computing System Design in Autonomous Vehicles 2019 Hengyu Zhao
Yubo Zhang
Pingfan Meng
Hui Shi
Li Erran Li
Tiancheng Lou
Jishen Zhao
2
+ IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures 2018 Lasse Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymir Mnih
Tom Ward
Yotam Doron
Vlad Firoiu
Tim Harley
Iain Dunning
2
+ PDF Chat On-line Policy Improvement using Monte-Carlo Search 2025 Gerald Tesauro
Gregory R. Galperin
2
+ Finite Sample Analysis of Two-Timescale Stochastic Approximation with Applications to Reinforcement Learning 2017 Gal Dalal
Balázs Szörényi
Gugan Thoppe
Shie Mannor
2
+ Challenges of Real-World Reinforcement Learning 2019 Gabriel Dulac-Arnold
Daniel J. Mankowitz
Todd Hester
2
+ Continuous control with deep reinforcement learning 2016 Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
Nicolas Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
2
+ On Divergences and Informations in Statistics and Information Theory 2006 Friedrich Liese
Igor Vajda
1
+ Contextual Markov Decision Processes 2015 Assaf Hallak
Dotan Di Castro
Shie Mannor
1