Zhengxing Chen

Follow

Generating author description...

All published works
Action Title Year Authors
+ PDF Chat Self-Generated Critiques Boost Reward Modeling for Language Models 2024 Yue Yu
Zhengxing Chen
Aston Zhang
Liang Soon Tan
Chenguang Zhu
Richard Yuanzhe Pang
Yundi Qian
Xuewei Wang
Suchin Gururangan
Chao Zhang
+ PDF Chat Law of the Weakest Link: Cross Capabilities of Large Language Models 2024 Ming Zhong
Aston Zhang
Xuewei Wang
Rui Li
Wenhan Xiong
Chenguang Zhu
Zhengxing Chen
Liang Tan
Chloe Bi
Michael Lewis
+ PDF Chat The Llama 3 Herd of Models 2024 Abhimanyu Dubey
Abhinav Jauhri
Abhinav Pandey
Abhishek Kadian
Ahmad Al-Dahle
Aiesha Letman
Akhil Mathur
Alan Schelten
Amy Yang
Angela Fan
+ AutoML for Large Capacity Modeling of Meta's Ranking Systems 2023 Hang Yin
Kuang-Hung Liu
Mengying Sun
Yuxin Chen
Buyun Zhang
Jiang Liu
Vivek Kumar Sehgal
Rudresh Rajnikant Panchal
Eugen Hotaj
Xi Liu
+ Rankitect: Ranking Architecture Search Battling World-class Engineers at Meta Scale 2023 Wei Wen
Kuang-Hung Liu
Igor Fedorov
Xin Zhang
Hang Yin
Weiwei Chu
Kaveh Hassani
Mengying Sun
Jiang Liu
Xu Wang
+ Personalized Execution Time Optimization for the Scheduled Jobs 2022 Yang Liu
Juan Wang
Zhengxing Chen
Ian Fox
Imani Mufti
Jason Sukumaran
Bao-Kun He
Xiling Sun
Feng Liang
+ NASRec: Weight Sharing Neural Architecture Search for Recommender Systems 2022 Tunhou Zhang
Dehua Cheng
Yuchen He
Zhengxing Chen
Xiaoliang Dai
Xiong Liang
Feng Yan
Hai Li
Yiran Chen
Wei Wen
+ PDF Chat A Validation Tool for Designing Reinforcement Learning Environments 2021 Ruiyang Xu
Zhengxing Chen
+ PDF Chat Reinforcement Learning-based Product Delivery Frequency Control 2021 Liu Yang
Zhengxing Chen
Kittipat Virochsiri
Juan Wang
Jiahao Wu
Liang Feng
+ A Validation Tool for Designing Reinforcement Learning Environments 2021 Ruiyang Xu
Zhengxing Chen
+ Band-limited Soft Actor Critic Model 2020 Miguel Vázquez-Martin del Campo
Zhengxing Chen
Luke Kung
Kittipat Virochsiri
Jianyu Wang
+ Reinforcement Learning-based Product Delivery Frequency Control 2020 Yang Liu
Zhengxing Chen
Kittipat Virochsiri
Juan Wang
Jiahao Wu
Liang Feng
+ PDF Chat Q-DeckRec: A Fast Deck Recommendation System for Collectible Card Games 2018 Zhengxing Chen
Christopher Amato
Truong-Huy D. Nguyen
Seth Cooper
Yizhou Sun
Magy Seif El‐Nasr
+ Modeling Game Avatar Synergy and Opposition through Embedding in Multiplayer Online Battle Arena Games 2018 Zhengxing Chen
Yuyu Xu
Truong-Huy D. Nguyen
Yizhou Sun
Magy Seif El‐Nasr
+ Q-DeckRec: A Fast Deck Recommendation System for Collectible Card Games 2018 Zhengxing Chen
Christopher Amato
Truong-Huy D. Nguyen
Seth Cooper
Yizhou Sun
Magy Seif El‐Nasr
+ The Art of Drafting: A Team-Oriented Hero Recommendation System for Multiplayer Online Battle Arena Games 2018 Zhengxing Chen
Truong-Huy D. Nguyen
Yuyu Xu
Christopher Amato
Seth Cooper
Yizhou Sun
Magy Seif El‐Nasr
+ Player Skill Decomposition in Multiplayer Online Battle Arenas 2017 Zhengxing Chen
Yizhou Sun
Magy Seif El‐Nasr
Truong-Huy D. Nguyen
+ EOMM: An Engagement Optimized Matchmaking Framework 2017 Zhengxing Chen
Su Xue
John F. Kolen
Navid Aghdaie
Kazi A. Zaman
Yizhou Sun
Magy Seif El‐Nasr
Common Coauthors
Commonly Cited References
Action Title Year Authors # of times referenced
+ Player Skill Decomposition in Multiplayer Online Battle Arenas 2017 Zhengxing Chen
Yizhou Sun
Magy Seif El‐Nasr
Truong-Huy D. Nguyen
3
+ PDF Chat A Markovian Decision Process 1957 Richard Bellman
3
+ Learning to Learn without Gradient Descent by Gradient Descent 2016 Yutian Chen
Matthew W. Hoffman
Sergio Gómez Colmenarejo
Misha Denil
Timothy Lillicrap
Matt Botvinick
Nando de Freitas
2
+ Learning to Optimize Neural Nets 2017 Ke Li
Jitendra Malik
2
+ Prioritized Experience Replay 2015 Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
2
+ Learning to learn by gradient descent by gradient descent 2016 Marcin Andrychowicz
Misha Denil
Sergio Luis Suárez Gómez
Matthew W. Hoffman
David Pfau
Tom Schaul
Brendan Shillingford
Nando de Freitas
2
+ Neural Architecture Search with Reinforcement Learning 2016 Barret Zoph
Quoc V. Le
2
+ Learning to Learn 1998 Jonathan Baxter
2
+ A Generalized Bradley-Terry Model: From Group Competition to Individual Skill 2004 Tzu-Kuo Huang
Chih‐Jen Lin
Ruby C. Weng
2
+ Learning Complex Dexterous Manipulation with Deep Reinforcement Learning and Demonstrations 2017 Aravind Rajeswaran
Vikash Kumar
Abhishek Gupta
John Schulman
Emanuel Todorov
Sergey Levine
1
+ Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor 2018 Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
1
+ Addressing Function Approximation Error in Actor-Critic Methods 2018 Scott Fujimoto
Herke van Hoof
David Meger
1
+ Smoothed Action Value Functions for Learning Gaussian Policies 2018 Ofir Nachum
Mohammad Norouzi
George Tucker
Dale Schuurmans
1
+ None 1999 Reuven Y. Rubinstein
1
+ Horizon: Facebook's Open Source Applied Reinforcement Learning Platform 2018 Jason Gauci
Edoardo Conti
Yitao Liang
Kittipat Virochsiri
Yuchen He
Zachary Kaden
Vivek Narayanan
Xiaohui Ye
1
+ PDF Chat Frequency Principle: Fourier Analysis Sheds Light on Deep Neural Networks 2020 Zhi-Qin John Xu Zhi-Qin John Xu
Yaoyu Zhang Yaoyu Zhang
T. Luo
Yanyang Xiao
Zheng Ma Zheng
1
+ Leveraging exploration in off-policy algorithms via normalizing flows 2019 Bogdan Mazoure
Thang Doan
Audrey Durand
R Devon Hjelm
Joëlle Pineau
1
+ Exploring Cyberbullying and Other Toxic Behavior in Team Competition Online Games 2015 Haewoon Kwak
Jeremy Blackburn
Seungyeop Han
1
+ Distributed Representations of Words and Phrases and their Compositionality 2013 Tomáš Mikolov
Ilya Sutskever
Kai Chen
Greg S. Corrado
Jay B. Dean
1
+ Dueling Network Architectures for Deep Reinforcement Learning 2015 Ziyu Wang
Tom Schaul
Matteo Hessel
Hado van Hasselt
Marc Lanctot
Nando de Freitas
1
+ Learning Complex Dexterous Manipulation with Deep Reinforcement Learning and Demonstrations 2018 Aravind Rajeswaran
Vikash Kumar
Abhishek Gupta
Giulia Vezzani
John Schulman
Emanuel Todorov
Sergey Levine
1
+ Prioritized Experience Replay 2015 Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
1
+ Reinforcement Learning Applications 2019 Yuxi Li
1
+ A Kernel Loss for Solving the Bellman Equation 2019 Yihao Feng
Lihong Li
Qiang Liu
1
+ A Convergence Result for Regularized Actor-Critic Methods 2019 Wesley A. Suttle
Zhuoran Yang
Kaiqing Zhang
Ji Liu
1
+ Principal Component Analysis 2005 Ian T. Jolliffe
1
+ PDF Chat Budget Constrained Bidding by Model-free Reinforcement Learning in Display Advertising 2018 Di Wu
Xiujun Chen
Xun Yang
Hao Wang
Qing Tan
Xiaoxun Zhang
Jian Xu
Kun Gai
1
+ PDF Chat Sim-to-Real Transfer of Robotic Control with Dynamics Randomization 2018 Xue Bin Peng
Marcin Andrychowicz
Wojciech Zaremba
Pieter Abbeel
1
+ PDF Chat Real-Time Bidding by Reinforcement Learning in Display Advertising 2017 Han Cai
Kan Ren
Weinan Zhang
Kleanthis Malialis
Jun Wang
Yong Yu
Defeng Guo
1
+ Distributed Representations of Words and Phrases and their Compositionality 2013 Tomáš Mikolov
Ilya Sutskever
Kai Chen
Greg S. Corrado
Jeffrey Dean
1
+ Deep reinforcement learning for page-wise recommendations 2018 Xiangyu Zhao
Long Xia
Liang Zhang
Zhuoye Ding
Dawei Yin
Jiliang Tang
1
+ Pattern Recognition and Machine Learning 2007 Christopher Bishop
1
+ PDF Chat Additive logistic regression: a statistical view of boosting (With discussion and a rejoinder by the authors) 2000 Jerome H. Friedman
Trevor Hastie
Robert Tibshirani
1
+ Tensor Decompositions and Applications 2009 Tamara G. Kolda
Brett W. Bader
1
+ PDF Chat Parameter Estimation in Large Dynamic Paired Comparison Experiments 1999 Mark E. Glickman
1
+ PDF Chat Online Stochastic Matching: Beating 1-1/e 2009 Jon Feldman
Aranyak Mehta
Vahab Mirrokni
S. Muthukrishnan
1
+ Benchmarking Deep Reinforcement Learning for Continuous Control 2016 Yan Duan
Xi Chen
Rein Houthooft
John Schulman
Pieter Abbeel
1
+ Real-time eSports Match Result Prediction 2017 Yifan Yang
Tian Qin
Yu-Heng Lei
1
+ Learning to Learn without Gradient Descent by Gradient Descent 2016 Yutian Chen
Matthew W. Hoffman
Sergio Gómez Colmenarejo
Misha Denil
Timothy Lillicrap
Matt Botvinick
Nando de Freitas
1