Keith W. Ross

All published works
Title Year Authors
+ Finite-Sample Analysis of the Monte Carlo Exploring Starts Algorithm for Reinforcement Learning 2024 Suei-Wen Chen
Keith W. Ross
Pierre Youssef
+ The Prevalence of Neural Collapse in Neural Multivariate Regression 2024 George Andriopoulos
Zixuan Dong
Li Guo
Zifan Zhao
Keith W. Ross
+ Cross Entropy versus Label Smoothing: A Neural Collapse Perspective 2024 Li Guo
Keith W. Ross
Zifan Zhao
George Andriopoulos
Shuyang Ling
Yufeng Xu
Zixuan Dong
+ Pre-training with Synthetic Data Helps Offline Reinforcement Learning 2023 Zecheng Wang
Che Wang
Zixuan Dong
Keith W. Ross
+ VRL3: A Data-Driven Framework for Visual Deep Reinforcement Learning 2022 Che Wang
Xufang Luo
Keith W. Ross
Dongsheng Li
+ On the Convergence of Monte Carlo UCB for Random-Length Episodic MDPs 2022 Zixuan Dong
Che Wang
Keith W. Ross
+ Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance 2021 Yanqiu Wu
Xinyue Chen
Che Wang
Yiming Zhang
Zijian Zhou
Keith W. Ross
+ Randomized Ensembled Double Q-Learning: Learning Fast Without a Model 2021 Xinyue Chen
Che Wang
Zijian Zhou
Keith W. Ross
+ On-Policy Deep Reinforcement Learning for the Average-Reward Criterion 2021 Yiming Zhang
Keith W. Ross
+ On the Convergence of the Monte Carlo Exploring Starts Algorithm for Reinforcement Learning 2020 Che Wang
Keith W. Ross
+ First Order Constrained Optimization in Policy Space 2020 Yiming Zhang
Quan Vuong
Keith W. Ross
+ BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning 2019 Xinyue Chen
Zijian Zhou
Zheng Wang
Che Wang
Yanqiu Wu
Keith W. Ross
+ Boosting Soft Actor-Critic: Emphasizing Recent Experience without Forgetting the Past 2019 Che Wang
Keith W. Ross
+ Striving for Simplicity and Performance in Off-Policy DRL: Output Normalization and Non-Uniform Sampling 2019 Che Wang
Yanqiu Wu
Quan Vuong
Keith W. Ross
+ Multi-task Batch Reinforcement Learning with Metric Learning 2019 Jiachen Li
Quan Vuong
Shuang Liu
Minghua Liu
Kamil Ciosek
Keith W. Ross
Henrik I. Christensen
Hao Su
+ Supervised Policy Update 2018 Quan Vuong
Yiming Zhang
Keith W. Ross
+ Efficient entropy for policy gradient with multi-dimensional action space 2018 Yiming Zhang
Quan Vuong
Kenny Song
Xiao Yue Gong
Keith W. Ross
+ Sensing the Chinese Diaspora: How Mobile Apps Can Provide Insights into Global Migration Flows 2018 Minhui Xue
Xin Yuan
Heather Lee
Keith W. Ross
+ Efficient Entropy for Policy Gradient with Multidimensional Action Space 2018 Yiming Zhang
Quan Vuong
Kenny Song
X Gong
Keith W. Ross
+ Supervised Policy Update for Deep Reinforcement Learning 2018 Quan Vuong
Yiming Zhang
Keith W. Ross
+ Smoke Screener or Straight Shooter: Detecting Elite Sybil Attacks in User-Review Social Networks 2018 Haizhong Zheng
Minhui Xue
Hao Lü
Shuang Hao
Haojin Zhu
Xiaohui Liang
Keith W. Ross
+ Mining Anonymity: Identifying Sensitive Accounts on Twitter 2017 Sai Teja Peddinti
Keith W. Ross
Justin Cappos
+ Smoke Screener or Straight Shooter: Detecting Elite Sybil Attacks in User-Review Social Networks 2017 Haizhong Zheng
Minhui Xue
Hao Lü
Shuang Hao
Haojin Zhu
Xiaohui Liang
Keith W. Ross
+ I Know Where You are and What You are Sharing: Exploiting P2P Communications to Invade Users' Privacy 2011 Stevens Le Blond
Chao Zhang
Arnaud Legout
Keith W. Ross
Walid Dabbous
+ Characterizing Video Responses in Social Networks 2008 Fabrício Benevenuto
Fernando Duarte
Tiago Rodrigues
Virgílio Almeida
Jussara M. Almeida
Keith W. Ross
+ A decomposition approximation method for multiclass BCMP queueing networks with multiple-server stations 1994 Bruno Baynat
Yves Dallery
Keith W. Ross
+ Reduced load approximations for multirate loss networks 1993 Po Yin Chung
Keith W. Ross
+ Variability Sensitive Markov Decision Processes 1992 Melike Baykal‐Gürsoy
Keith W. Ross
+ Monte Carlo Summation Applied to Product-Form Loss Networks 1992 Keith W. Ross
Jie Wang
+ A sample path theory for time-average Markov decision processes 1987 Keith W. Ross
Ravi Varadarajan
+ Closed subgroups of locally compact Abelian groups 1964 Keith W. Ross
+ Topologies induced by groups of characters 1964 W. W. Comfort
Keith W. Ross
Commonly Cited References
Title Year Authors # of times referenced
+ Proximal Policy Optimization Algorithms 2017 John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
7
+ Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor 2018 Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
6
+ Deep reinforcement learning with double Q-Learning 2016 Hado van Hasselt
Arthur Guez
David Silver
6
+ Soft Actor-Critic Algorithms and Applications 2018 Tuomas Haarnoja
Aurick Zhou
Kristian Hartikainen
George Tucker
Sehoon Ha
Jie Tan
Vikash Kumar
Henry Zhu
Abhishek Gupta
Pieter Abbeel
5
+ Continuous control with deep reinforcement learning 2015 Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
Nicolas Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
5
+ Addressing Function Approximation Error in Actor-Critic Methods 2018 Scott Fujimoto
Herke van Hoof
David Meger
5
+ Deep Reinforcement Learning That Matters 2017 Peter Henderson
Riashat Islam
Philip Bachman
Joëlle Pineau
Doina Precup
David Meger
4
+ Reproducibility of Benchmarked Deep Reinforcement Learning Tasks for Continuous Control 2017 Riashat Islam
Peter Henderson
Maziar Gomrokchi
Doina Precup
4
+ Trust Region Policy Optimization 2015 John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
4
+ Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction 2019 Aviral Kumar
Justin Fu
Matthew Soh
George Tucker
Sergey Levine
3
+ Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation 2017 Yuhuai Wu
Elman Mansimov
Shun Liao
Roger Grosse
Jimmy Ba
3
+ Prioritized Experience Replay 2015 Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
3
+ Supervised Policy Update for Deep Reinforcement Learning 2018 Quan Vuong
Yiming Zhang
Keith W. Ross
2
+ Social Turing Tests: Crowdsourcing Sybil Detection 2012 Gang Wang
Manish Mohanlal
Christo Wilson
Xiao Wang
Miriam J. Metzger
Hai-Tao Zheng
Ben Y. Zhao
2
+ Benchmarking Model-Based Reinforcement Learning 2019 Tingwu Wang
Xuchan Bao
Ignasi Clavera
Jerrick Hoang
Yeming Wen
Eric Langlois
Shunshi Zhang
Guodong Zhang
Pieter Abbeel
Jimmy Ba
2
+ Asynchronous Methods for Deep Reinforcement Learning 2016 Volodymyr Mnih
Adrià Puigdomènech Badia
Mehdi Mirza
Alex Graves
Tim Harley
Timothy Lillicrap
David Silver
Koray Kavukcuoglu
2
+ D4RL: Datasets for Deep Data-Driven Reinforcement Learning 2020 Justin Fu
Aviral Kumar
Ofir Nachum
George Tucker
Sergey Levine
2
+ Projection-Based Constrained Policy Optimization 2020 Tsung-Yen Yang
Justinian Rosca
Karthik Narasimhan
Peter J. Ramadge
2
+ Concrete Problems in AI Safety 2016 Dario Amodei
Chris Olah
Jacob Steinhardt
Paul F. Christiano
John Schulman
Dan Mané
2
+ Reward Constrained Policy Optimization 2018 Chen Tessler
Daniel J. Mankowitz
Shie Mannor
2
+ Deep Exploration via Bootstrapped DQN 2016 Ian Osband
Charles Blundell
Alexander Pritzel
Benjamin Van Roy
2
+ Rainbow: Combining Improvements in Deep Reinforcement Learning 2017 Matteo Hessel
Joseph Modayil
Hado van Hasselt
Tom Schaul
Georg Ostrovski
Will Dabney
Daniel Horgan
Bilal Piot
Mohammad Gheshlaghi Azar
David Silver
2
+ Adversarial Feature Selection Against Evasion Attacks 2015 Fei Zhang
Patrick P. K. Chan
Battista Biggio
Daniel Yeung
Fabio Roli
2
+ The Arcade Learning Environment: An Evaluation Platform for General Agents 2013 Marc G. Bellemare
Yavar Naddaf
Joel Veness
Michael Bowling
2
+ Fast unfolding of communities in large networks 2008 Vincent D. Blondel
Jean‐Loup Guillaume
Renaud Lambiotte
Etienne Lefebvre
2
+ Sample Efficient Actor-Critic with Experience Replay 2016 Ziyu Wang
Victor Bapst
Nicolas Heess
Volodymyr Mnih
Rémi Munos
Koray Kavukcuoglu
Nando de Freitas
2
+ Benchmarking Deep Reinforcement Learning for Continuous Control 2016 Yan Duan
Xi Chen
Rein Houthooft
John Schulman
Pieter Abbeel
2
+ Continuous control with deep reinforcement learning 2016 Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
Nicolas Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
2
+ Paying for Likes? Understanding Facebook Like Fraud Using Honeypots 2014 Emiliano De Cristofaro
Arik Friedman
Guillaume Jourjon
Mohamed Ali Kâafar
Muhammad Shafiq
2
+ Remember and Forget for Experience Replay 2018 Guido Novati
Petros Koumoutsakos
2
+ Implied costs in loss networks 1989 P. J. Hunt
2
+ bcp: An R Package for Performing a Bayesian Analysis of Change Point Problems 2007 Chandra Erdman
John W. Emerson
2
+ Distributed Prioritized Experience Replay 2018 Dan Horgan
John Quan
David Budden
Gabriel Barth-Maron
Matteo Hessel
Hado van Hasselt
David Silver
2
+ Curiosity-Driven Experience Prioritization via Density Estimation 2019 Rui Zhao
Volker Tresp
2
+ Constrained Markov Decision Processes 1999 Eitan Altman
2
+ Asymptotic analysis and computational methods for a class of simple, circuit-switched networks with blocking 1987 Debasis Mitra
2
+ Benchmarking Batch Deep Reinforcement Learning Algorithms 2019 Scott Fujimoto
Edoardo Conti
Mohammad Ghavamzadeh
Joëlle Pineau
2
+ Why M Heads are Better than One: Training a Diverse Ensemble of Deep Networks 2015 Stefan Lee
Senthil Purushwalkam
Michael Cogswell
David Crandall
Dhruv Batra
1
+ SybilBelief: A Semi-Supervised Learning Approach for Structure-Based Sybil Detection 2014 Neil Zhenqiang Gong
Mario Frank
Prateek Mittal
1
+ Learning Algorithms for Markov Decision Processes with Average Cost 2001 Jinane Abounadi
Dimitri P. Bertsekas
Vivek S. Borkar
1
+ A Class of Closed Markovian Queuing Networks: Integral Representations, Asymptotic Expansions, and Generalizations 1981 James McKenna
Debasis Mitra
K. G. Ramakrishnan
1
+ Why social networks are different from other types of networks 2003 M. E. J. Newman
Juyong Park
1
+ An Introduction to Causal Inference 2010 Judea Pearl
1
+ Security Evaluation of Pattern Classifiers under Attack 2013 Battista Biggio
Giorgio Fumera
Fabio Roli
1
+ Extensions and computational aspects of an iterative method 1982 Raymond A. Marie
Patricia M. Snyder
William J. Stewart
1
+ RL$^2$: Fast Reinforcement Learning via Slow Reinforcement Learning 2016 Yan Duan
John Schulman
Xi Chen
Peter L. Bartlett
Ilya Sutskever
Pieter Abbeel
1
+ Causal inference in statistics: An overview 2009 Judea Pearl
1
+ PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation 2017 Charles R. Qi
Hao Su
Kaichun Mo
Leonidas Guibas
1
+ Deep Variational Information Bottleneck 2016 Alexander A. Alemi
Ian Fischer
Joshua V. Dillon
Kevin Murphy
1
+ Comparison of perturbation bounds for the stationary distribution of a Markov chain 2001 Grace E. Cho
Carl D. Meyer
1