Shunsuke Aihara

Follow

Generating author description...

Common Coauthors
Commonly Cited References
Action Title Year Authors # of times referenced
+ PDF Chat Unbiased offline evaluation of contextual-bandit-based news article recommendation algorithms 2011 Lihong Li
Wei Chu
John Langford
Xuanhui Wang
3
+ PDF Chat Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation 2014 Ross Girshick
Jeff Donahue
Trevor Darrell
Jitendra Malik
3
+ PDF Chat Efficient Counterfactual Learning from Bandit Feedback 2019 Yusuke Narita
Shota Yasui
Kohei Yata
3
+ On the Design of Estimators for Bandit Off-Policy Evaluation 2019 Nikos Vlassis
Aurélien Bibaut
Maria Dimakopoulou
Tony Jebara
3
+ PDF Chat A contextual-bandit approach to personalized news article recommendation 2010 Lihong Li
Wei Chu
John Langford
Robert E. Schapire
3
+ PDF Chat Deep Residual Learning for Image Recognition 2016 Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
3
+ PDF Chat Counterfactual Evaluation of Slate Recommendations with Sequential Reward Interactions 2020 James O. McInerney
Brian M. Brost
Praveen Chandar
Rishabh Mehrotra
Benjamin Carterette
3
+ Doubly Robust Policy Evaluation and Optimization 2014 Miroslav Dudı́k
Dumitru Erhan
John Langford
Lihong Li
3
+ PDF Chat The offset tree for learning with partial labels 2009 Alina Beygelzimer
John Langford
3
+ Optimal and Adaptive Off-policy Evaluation in Contextual Bandits 2016 Yu-Xiang Wang
Alekh Agarwal
Miroslav DudĂ­k
3
+ Scikit-learn: Machine Learning in Python 2012 FabiĂĄn Pedregosa
Gaël Varoquaux
Alexandre Gramfort
Vincent Michel
Bertrand Thirion
Olivier Grisel
Mathieu Blondel
Peter Prettenhofer
Ron J. Weiss
Vincent Dubourg
2
+ Off-Policy Evaluation and Learning for External Validity under a Covariate Shift 2020 Masahiro Kato
Masatoshi Uehara
Shota Yasui
2
+ Benchmarking Graph Neural Networks 2020 Vijay Prakash Dwivedi
Chaitanya K. Joshi
Thomas Laurent
Yoshua Bengio
Xavier Bresson
2
+ Learning from Logged Implicit Exploration Data 2010 Alex Strehl
John Langford
Sham M. Kakade
Lihong Li
2
+ Finite-time Analysis of Globally Nonstationary Multi-Armed Bandits 2021 Junpei Komiyama
Edouard Fouché
Junya Honda
2
+ A Practical Guide of Off-Policy Evaluation for Bandit Problems 2020 Masahiro Kato
Kenshi Abe
Kaito Ariu
Shota Yasui
2
+ PDF Chat Evaluating the Robustness of Off-Policy Evaluation 2021 Yuta Saito
Takuma Udagawa
Haruka Kiyohara
Kazuki Mogi
Yusuke Narita
Kei Tateno
2
+ PDF Chat Fully convolutional networks for semantic segmentation 2015 Jonathan Long
Evan Shelhamer
Trevor Darrell
2
+ Doubly Robust Off-policy Value Evaluation for Reinforcement Learning 2015 Nan Jiang
Lihong Li
2
+ RecoGym: A Reinforcement Learning Environment for the problem of Product Recommendation in Online Advertising 2018 David Rohde
Stephen Bonner
Travis Dunlop
Flavian Vasile
Alexandros Karatzoglou
2
+ Representation Balancing MDPs for Off-policy Policy Evaluation 2018 Yao Liu
Omer Gottesman
Aniruddh Raghu
Matthieu Komorowski
A. Aldo Faisal
Finale Doshi‐Velez
Emma Brunskill
2
+ PDF Chat Effective Evaluation Using Logged Bandit Feedback from Multiple Loggers 2017 Aman Agarwal
Soumya Basu
Tobias Schnabel
Thorsten Joachims
2
+ Adapting multi-armed bandits policies to contextual bandits scenarios 2018 David CortĂ©s‐Polo
2
+ PDF Chat Demystifying Double Robustness: A Comparison of Alternative Strategies for Estimating a Population Mean from Incomplete Data 2007 Joseph Kang
Joseph L. Schafer
2
+ Large-scale Validation of Counterfactual Learning Methods: A Test-Bed 2016 Damien Lefortier
Adith Swaminathan
Xiaotao Gu
Thorsten Joachims
Maarten de Rijke
2
+ Empirical Study of Off-Policy Policy Evaluation for Reinforcement Learning 2019 Cameron Voloshin
Hoang Le
Nan Jiang
Yisong Yue
2
+ PDF Chat Debiased Off-Policy Evaluation for Recommendation Systems 2021 Yusuke Narita
Shota Yasui
Kohei Yata
2
+ WaveNet: A Generative Model for Raw Audio 2016 AĂ€ron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
Andrew Senior
Koray Kavukcuoglu
1
+ Training Very Deep Networks 2015 Rupesh K. Srivastava
Klaus Greff
JĂŒrgen Schmidhuber
1
+ Quasi-Recurrent Neural Networks 2016 James Bradbury
Stephen Merity
Caiming Xiong
Richard Socher
1
+ PDF Chat Building End-To-End Dialogue Systems Using Generative Hierarchical Neural Network Models 2016 Iulian Vlad Serban
Alessandro Sordoni
Yoshua Bengio
Aaron Courville
Joëlle Pineau
1
+ Character-level Convolutional Networks for Text Classification 2015 Xiang Zhang
Junbo Zhao
Yann LeCun
1
+ Policy Evaluation and Optimization with Continuous Treatments 2018 Nathan Kallus
Angela Zhou
1
+ PDF Chat Tacotron: Towards End-to-End Speech Synthesis 2017 Yuxuan Wang
RJ Skerry-Ryan
Daisy Stanton
Yonghui Wu
Ron J. Weiss
Navdeep Jaitly
Zongheng Yang
Ying Xiao
Zhifeng Chen
Samy Bengio
1
+ Multi-Scale Context Aggregation by Dilated Convolutions 2015 Fisher Yu
Vladlen Koltun
1
+ PDF Chat StackGAN: Text to Photo-Realistic Image Synthesis with Stacked Generative Adversarial Networks 2017 Han Zhang
Tao Xu
Hongsheng Li
Shaoting Zhang
Xiaogang Wang
Xiaolei Huang
Dimitris Metaxas
1
+ Adam: A Method for Stochastic Optimization 2014 Diederik P. Kingma
Jimmy Ba
1
+ Neural Machine Translation by Jointly Learning to Align and Translate 2015 Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
1
+ Efficiently Breaking the Curse of Horizon: Double Reinforcement Learning in Infinite-Horizon Processes. 2019 Nathan Kallus
Masatoshi Uehara
1
+ Towards Optimal Off-Policy Evaluation for Reinforcement Learning with Marginalized Importance Sampling 2019 Tengyang Xie
Yifei Ma
Yu-Xiang Wang
1
+ PDF Chat Efficiently Breaking the Curse of Horizon in Off-Policy Evaluation with Double Reinforcement Learning 2022 Nathan Kallus
Masatoshi Uehara
1
+ Doubly robust off-policy evaluation with shrinkage 2020 Yi Su
Maria Dimakopoulou
Akshay Krishnamurthy
Miroslav DudĂ­k
1
+ PDF Chat Offline A/B Testing for Recommender Systems 2018 Alexandre Gilotte
Clément CalauzÚnes
Thomas Nedelec
Alexandre Abraham
Simon Dollé
1
+ Off-Policy Evaluation and Learning for External Validity under a Covariate Shift 2020 Masatoshi Uehara
Masahiro Kato
Shota Yasui
1
+ PDF Chat Open Bandit Dataset and Pipeline: Towards Realistic and Reproducible Off-Policy Evaluation 2020 Yuta Saito
Shunsuke Aihara
Megumi Matsutani
Yusuke Narita
1
+ Benchmarks for Deep Off-Policy Evaluation 2021 Justin Fu
Mohammad Norouzi
Ofir Nachum
George Tucker
ziyu wang
Alexander Novikov
Mengjiao Yang
Michael R. Zhang
Yutian Chen
Aviral Kumar
1
+ A Neural Conversational Model 2015 Oriol Vinyals
Quoc V. Le
1
+ Neural Machine Translation in Linear Time 2016 Nal Kalchbrenner
Lasse Espeholt
Karen Simonyan
AĂ€ron van den Oord
Alex Graves
Koray Kavukcuoglu
1
+ PDF Chat Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification 2015 Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
1
+ Convolutional Neural Networks for Sentence Classification 2014 Yoon Kim
1