Keith W. Ross

Generating author description...

All published works

Action	Title	Year	Authors
+ PDF Chat	Finite-Sample Analysis of the Monte Carlo Exploring Starts Algorithm for Reinforcement Learning	2024	Suei-Wen Chen Keith W. Ross Pierre Youssef
+ PDF Chat	The Prevalence of Neural Collapse in Neural Multivariate Regression	2024	George Andriopoulos Zixuan Dong Li Guo Zifan Zhao Keith W. Ross
+ PDF Chat	Cross Entropy versus Label Smoothing: A Neural Collapse Perspective	2024	Li Guo Keith W. Ross Zifan Zhao George Andriopoulos Shuyang Ling Yufeng Xu Zixuan Dong
+	Pre-training with Synthetic Data Helps Offline Reinforcement Learning	2023	Zecheng Wang Che Wang Zixuan Dong Keith W. Ross
+	VRL3: A Data-Driven Framework for Visual Deep Reinforcement Learning	2022	Wang Che Xufang Luo Keith W. Ross Dongsheng Li
+	On the Convergence of Monte Carlo UCB for Random-Length Episodic MDPs	2022	Zixuan Dong Che Wang Keith W. Ross
+ PDF Chat	Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance	2021	Yanqiu Wu Xinyue Chen Che Wang Yiming Zhang Zijian Zhou Keith W. Ross
+ PDF Chat	Randomized Ensembled Double Q-Learning: Learning Fast Without a Model	2021	Xinyue Chen Che Wang Zijian Zhou Keith W. Ross
+	Randomized Ensembled Double Q-Learning: Learning Fast Without a Model	2021	Xinyue Chen Che Wang Zijian Zhou Keith W. Ross
+	On-Policy Deep Reinforcement Learning for the Average-Reward Criterion	2021	Yiming Zhang Keith W. Ross
+	Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance	2021	Yanqiu Wu Xinyue Chen Che Wang Yiming Zhang Zijian Zhou Keith W. Ross
+ PDF Chat	On the Convergence of the Monte Carlo Exploring Starts Algorithm for Reinforcement Learning	2020	Che Wang Keith W. Ross
+	On the Convergence of the Monte Carlo Exploring Starts Algorithm for Reinforcement Learning	2020	Wang Che Keith W. Ross
+	First Order Constrained Optimization in Policy Space	2020	Yiming Zhang Quan Vuong Keith W. Ross
+	BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning	2019	Xinyue Chen Zijian Zhou Zheng Wang Che Wang Yanqiu Wu Keith W. Ross
+	Boosting Soft Actor-Critic: Emphasizing Recent Experience without Forgetting the Past	2019	Che Wang Keith W. Ross
+	Striving for Simplicity and Performance in Off-Policy DRL: Output Normalization and Non-Uniform Sampling	2019	Che Wang Yanqiu Wu Quan Vuong Keith W. Ross
+	Multi-task Batch Reinforcement Learning with Metric Learning	2019	Jiachen Li Quan Vuong Shuang Liu Minghua Liu Kamil Ciosek Keith W. Ross Henrik I. Christensen Hao Su
+	BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning	2019	Xinyue Chen Zijian Zhou Zheng Wang Che Wang Yanqiu Wu Keith W. Ross
+	Supervised Policy Update.	2018	Quan Vuong Yiming Zhang Keith W. Ross
+	Efficient entropy for policy gradient with multi-dimensional action space	2018	Yiming Zhang Quan Vuong Kenny Song Xiao Yue Gong Keith W. Ross
+	Sensing the Chinese Diaspora: How Mobile Apps Can Provide Insights into Global Migration Flows	2018	Minhui Xue Xin Yuan Heather Lee Keith W. Ross
+	Efficient Entropy for Policy Gradient with Multidimensional Action Space	2018	Yiming Zhang Quan Vuong Kenny Song X Gong Keith W. Ross
+	Supervised Policy Update for Deep Reinforcement Learning	2018	Quan Vuong Yiming Zhang Keith W. Ross
+	Smoke Screener or Straight Shooter: Detecting Elite Sybil Attacks in User-Review Social Networks	2018	Haizhong Zheng Minhui Xue Hao Lü Shuang Hao Haojin Zhu Xiaohui Liang Keith W. Ross
+	Mining Anonymity: Identifying Sensitive Accounts on Twitter	2017	Sai Teja Peddinti Keith W. Ross Justin Cappos
+	Smoke Screener or Straight Shooter: Detecting Elite Sybil Attacks in User-Review Social Networks	2017	Haizhong Zheng Minhui Xue Hao Lü Shuang Hao Haojin Zhu Xiaohui Liang Keith W. Ross
+	I Know Where You are and What You are Sharing: Exploiting P2P Communications to Invade Users' Privacy	2011	Stevens Le Blond Chao Zhang Arnaud Legout Keith W. Ross Walid Dabbous
+	Characterizing Video Responses in Social Networks	2008	Fabrício Benevenuto Fernando Duarte Tiago Rodrigues Virgı́lio Almeida Jussara M. Almeida Keith W. Ross
+	A decomposition approximation method for multiclass BCMP queueing networks with multiple-server stations	1994	Bruno Baynat Yves Dallery Keith W. Ross
+	Reduced load approximations for multirate loss networks	1993	Po Yin Chung Keith W. Ross
+	Variability Sensitive Markov Decision Processes	1992	Melike Baykal‐Gürsoy Keith W. Ross
+	Monte Carlo Summation Applied to Product-Form Loss Networks	1992	Keith W. Ross Jie Wang
+	A sample path theory for time-average Markov decision processes	1987	Keith W. Ross Ravi Varadarajan
+ PDF Chat	Closed subgroups of locally compact Abelian groups	1964	Keith W. Ross
+ PDF Chat	Topologies induced by groups of characters	1964	W. W. Comfort Keith W. Ross

Common Coauthors

Coauthor	Papers Together
Che Wang	11
Quan Vuong	7
Xinyue Chen	6
Yanqiu Wu	5
Zijian Zhou	4
Yiming Zhang	4
Zixuan Dong	3
Minhui Xue	3
Haizhong Zheng	2
Shuang Hao	2
Li Guo	2
Haojin Zhu	2
Yiming Zhang	2
George Andriopoulos	2
Kenny Song	2
Yiming Zhang	2
Zheng Wang	2
Zijian Zhou	2
Xiaohui Liang	2
Zifan Zhao	2
Minghua Liu	1
Virgı́lio Almeida	1
Sai Teja Peddinti	1
Heather Lee	1
Henrik I. Christensen	1
Pierre Youssef	1
Zixuan Dong	1
Melike Baykal‐Gürsoy	1
Wang Che	1
Jussara M. Almeida	1
Fabrício Benevenuto	1
Xiao Yue Gong	1
Shuang Liu	1
Jiachen Li	1
Tiago Rodrigues	1
Walid Dabbous	1
W. W. Comfort	1
Jie Wang	1
Dongsheng Li	1
Chao Zhang	1
Xin Yuan	1
Po Yin Chung	1
Xufang Luo	1
Yufeng Xu	1
X Gong	1
Hao Su	1
Ravi Varadarajan	1
Wang Che	1
Hao Lü	1
Hao Lü	1

Commonly Cited References

Action	Title	Year	Authors	# of times referenced
+	Proximal Policy Optimization Algorithms	2017	John Schulman Filip Wolski Prafulla Dhariwal Alec Radford Oleg Klimov	7
+	Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor	2018	Tuomas Haarnoja Aurick Zhou Pieter Abbeel Sergey Levine	6
+	Deep reinforcement learning with double Q-Learning	2016	Hado van Hasselt Arthur Guez David Silver	6
+	Soft Actor-Critic Algorithms and Applications	2018	Tuomas Haarnoja Aurick Zhou Kristian Hartikainen George Tucker Sehoon Ha Jie Tan Vikash Kumar Henry Zhu Abhishek Gupta Pieter Abbeel	5
+	Continuous control with deep reinforcement learning	2015	Timothy Lillicrap Jonathan J. Hunt Alexander Pritzel Nicolas Heess Tom Erez Yuval Tassa David Silver Daan Wierstra	5
+	Addressing Function Approximation Error in Actor-Critic Methods	2018	Scott Fujimoto Herke van Hoof David Meger	5
+	Deep Reinforcement Learning That Matters	2017	Peter Henderson Riashat Islam Philip Bachman Joëlle Pineau Doina Precup David Meger	4
+	Reproducibility of Benchmarked Deep Reinforcement Learning Tasks for Continuous Control	2017	Riashat Islam Peter Henderson Maziar Gomrokchi Doina Precup	4
+	Trust Region Policy Optimization	2015	John Schulman Sergey Levine Philipp Moritz Michael I. Jordan Pieter Abbeel	4
+	Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction	2019	Aviral Kumar Justin Fu Matthew Soh George Tucker Sergey Levine	3
+	Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation	2017	Yuhuai Wu Elman Mansimov S. Matthew Liao Roger Grosse Jimmy Ba	3
+	Prioritized Experience Replay	2015	Tom Schaul John Quan Ioannis Antonoglou David Silver	3
+	Supervised Policy Update for Deep Reinforcement Learning	2018	Quan Vuong Yiming Zhang Keith W. Ross	2
+	Social Turing Tests: Crowdsourcing Sybil Detection	2012	Gang Wang Manish Mohanlal Christo Wilson Xiao Wang Miriam J. Metzger Hai-Tao Zheng Ben Y. Zhao	2
+	Benchmarking Model-Based Reinforcement Learning	2019	Tingwu Wang Xuchan Bao Ignasi Clavera Jerrick Hoang Yeming Wen Eric Langlois Shunshi Zhang Guodong Zhang Pieter Abbeel Jimmy Ba	2
+	Asynchronous Methods for Deep Reinforcement Learning	2016	Volodymyr Mnih Adrià Puigdomènech Badia Mehdi Mirza Alex Graves Tim Harley Timothy Lillicrap David Silver Koray Kavukcuoglu	2
+	D4RL: Datasets for Deep Data-Driven Reinforcement Learning	2020	Justin Fu Aviral Kumar Ofir Nachum George Tucker Sergey Levine	2
+	Projection-Based Constrained Policy Optimization	2020	Tsung-Yen Yang Justinian Rosca Karthik Narasimhan Peter J. Ramadge	2
+	Concrete Problems in AI Safety	2016	Dario Amodei Chris Olah Jacob Steinhardt Paul F. Christiano John Schulman Dan Mané	2
+	Reward Constrained Policy Optimization	2018	Chen Tessler Daniel J. Mankowitz Shie Mannor	2
+	Deep Exploration via Bootstrapped DQN	2016	Ian Osband Charles Blundell Alexander Pritzel Benjamin Van Roy	2
+	Rainbow: Combining Improvements in Deep Reinforcement Learning	2017	Matteo Hessel Joseph Modayil Hado van Hasselt Tom Schaul Georg Ostrovski Will Dabney Daniel Horgan Bilal Piot Mohammad Gheshlaghi Azar David Silver	2
+ PDF Chat	Adversarial Feature Selection Against Evasion Attacks	2015	Fei Zhang Patrick P. K. Chan Battista Biggio Daniel Yeung Fabio Roli	2
+ PDF Chat	The Arcade Learning Environment: An Evaluation Platform for General Agents	2013	Marc G. Bellemare Yavar Naddaf Joel Veness Michael Bowling	2
+ PDF Chat	Fast unfolding of communities in large networks	2008	Vincent D. Blondel Jean‐Loup Guillaume Renaud Lambiotte Etienne Lefebvre	2
+	Sample Efficient Actor-Critic with Experience Replay	2016	Ziyu Wang Victor Bapst Nicolas Heess Volodymyr Mnih Rémi Munos Koray Kavukcuoglu Nando de Freitas	2
+	Benchmarking Deep Reinforcement Learning for Continuous Control	2016	Yan Duan Xi Chen Rein Houthooft John Schulman Pieter Abbeel	2
+	Continuous control with deep reinforcement learning	2016	Timothy Lillicrap Jonathan J. Hunt Alexander Pritzel Nicolas Heess Tom Erez Yuval Tassa David Silver Daan Wierstra	2
+	Paying for Likes? Understanding Facebook Like Fraud Using Honeypots	2014	Emiliano De Cristofaro Arik Friedman Guillaume Jourjon Mohamed Ali Kâafar Muhammad Shafiq	2
+	Remember and Forget for Experience Replay	2018	Guido Novati Petros Koumoutsakos	2
+	Implied costs in loss networks	1989	P. J. Hunt	2
+	<b>bcp</b>: An<i>R</i>Package for Performing a Bayesian Analysis of Change Point Problems	2007	Chandra Erdman John W. Emerson	2
+	Distributed Prioritized Experience Replay	2018	Dan Horgan John Quan David Budden Gabriel Barth-Maron Matteo Hessel Hado van Hasselt David Silver	2
+	Curiosity-Driven Experience Prioritization via Density Estimation	2019	Rui Zhao Volker Tresp	2
+	Constrained Markov Decision Processes	1999	Eitan Altman	2
+	Asymptotic analysis and computational methods for a class of simple, circuit-switched networks with blocking	1987	Debasis Mitra	2
+	Benchmarking Batch Deep Reinforcement Learning Algorithms	2019	Scott Fujimoto Edoardo Conti Mohammad Ghavamzadeh Joëlle Pineau	2
+	Why M Heads are Better than One: Training a Diverse Ensemble of Deep Networks	2015	Stefan Lee Senthil Purushwalkam Michael Cogswell David Crandall Dhruv Batra	1
+ PDF Chat	SybilBelief: A Semi-Supervised Learning Approach for Structure-Based Sybil Detection	2014	Neil Zhenqiang Gong Mario Frank Prateek Mittal	1
+ PDF Chat	Learning Algorithms for Markov Decision Processes with Average Cost	2001	Jinane Abounadi Dimitri P. Bertsekas Vivek S. Borkar	1
+	A Class of Closed Markovian Queuing Networks: Integral Representations, Asymptotic Expansions, and Generalizations*	1981	James McKenna Debasis Mitra K. G. Ramakrishnan	1
+ PDF Chat	Why social networks are different from other types of networks	2003	M. E. J. Newman Juyong Park	1
+ PDF Chat	An Introduction to Causal Inference	2010	Judea Pearl	1
+ PDF Chat	Security Evaluation of Pattern Classifiers under Attack	2013	Battista Biggio Giorgio Fumera Fabio Roli	1
+	Extensions and computational aspects of an iterative method	1982	Raymond A. Marie Patricia M. Snyder William J. Stewart	1
+	RL$^2$: Fast Reinforcement Learning via Slow Reinforcement Learning	2016	Yan Duan John Schulman Xi Chen Peter L. Bartlett Ilya Sutskever Pieter Abbeel	1
+ PDF Chat	Causal inference in statistics: An overview	2009	Judea Pearl	1
+ PDF Chat	PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation	2017	Raffaelli Charles Hao Su Kaichun Mo Leonidas Guibas	1
+	Deep Variational Information Bottleneck	2016	Alexander A. Alemi Ian Fischer Joshua V. Dillon Kevin Murphy	1
+	Comparison of perturbation bounds for the stationary distribution of a Markov chain	2001	Grace E. Cho Carl D. Meyer	1