Shunsuke Aihara
All published works
Title
Year
Authors
A Large-scale Open Dataset for Bandit Algorithms
2020
Yuta Saito
Shunsuke Aihara
Megumi Matsutani
Yusuke Narita
Large-scale Open Dataset, Pipeline, and Benchmark for Bandit Algorithms
2020
Yuta Saito
Shunsuke Aihara
Megumi Matsutani
Yusuke Narita
Open Bandit Dataset and Pipeline: Towards Realistic and Reproducible Off-Policy Evaluation
2020
Yuta Saito
Shunsuke Aihara
Megumi Matsutani
Yusuke Narita
Open Bandit Dataset and Pipeline: Towards Realistic and Reproducible Off-Policy Evaluation
2020
Yuta Saito
Shunsuke Aihara
Megumi Matsutani
Yusuke Narita
Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks with Guided Attention
2018
H. Tachibana
Katsuya Uenoyama
Shunsuke Aihara
Computational Constancy Measures of Texts: Yule's K and Rényi's Entropy
2015
Kumiko Tanaka-Ishii
Shunsuke Aihara
Common Coauthors
Coauthor
Papers Together
Megumi Matsutani
4
Yuta Saito
4
Yusuke Narita
4
H. Tachibana
1
Kumiko Tanaka-Ishii
1
Katsuya Uenoyama
1
Commonly Cited References
Title
Year
Authors
# of times referenced
Unbiased offline evaluation of contextual-bandit-based news article recommendation algorithms
2011
Lihong Li
Wei Chu
John Langford
Xuanhui Wang
3
Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation
2014
Ross Girshick
Jeff Donahue
Trevor Darrell
Jitendra Malik
3
Efficient Counterfactual Learning from Bandit Feedback
2019
Yusuke Narita
Shota Yasui
Kohei Yata
3
On the Design of Estimators for Bandit Off-Policy Evaluation
2019
Nikos Vlassis
Aurélien Bibaut
Maria Dimakopoulou
Tony Jebara
3
A contextual-bandit approach to personalized news article recommendation
2010
Lihong Li
Wei Chu
John Langford
Robert E. Schapire
3
Deep Residual Learning for Image Recognition
2016
Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
3
Counterfactual Evaluation of Slate Recommendations with Sequential Reward Interactions
2020
James O. McInerney
Brian M. Brost
Praveen Chandar
Rishabh Mehrotra
Benjamin Carterette
3
Doubly Robust Policy Evaluation and Optimization
2014
Miroslav Dudík
Dumitru Erhan
John Langford
Lihong Li
3
The offset tree for learning with partial labels
2009
Alina Beygelzimer
John Langford
3
Optimal and Adaptive Off-policy Evaluation in Contextual Bandits
2016
Yu-Xiang Wang
Alekh Agarwal
Miroslav Dudík
3
Scikit-learn: Machine Learning in Python
2012
Fabián Pedregosa
Gaël Varoquaux
Alexandre Gramfort
Vincent Michel
Bertrand Thirion
Olivier Grisel
Mathieu Blondel
Peter Prettenhofer
Ron J. Weiss
Vincent Dubourg
2
Off-Policy Evaluation and Learning for External Validity under a Covariate Shift
2020
Masahiro Kato
Masatoshi Uehara
Shota Yasui
2
Benchmarking Graph Neural Networks
2020
Vijay Prakash Dwivedi
Chaitanya K. Joshi
Thomas Laurent
Yoshua Bengio
Xavier Bresson
2
Learning from Logged Implicit Exploration Data
2010
Alex Strehl
John Langford
Sham M. Kakade
Lihong Li
2
Finite-time Analysis of Globally Nonstationary Multi-Armed Bandits
2021
Junpei Komiyama
Edouard Fouché
Junya Honda
2
A Practical Guide of Off-Policy Evaluation for Bandit Problems
2020
Masahiro Kato
Kenshi Abe
Kaito Ariu
Shota Yasui
2
Evaluating the Robustness of Off-Policy Evaluation
2021
Yuta Saito
Takuma Udagawa
Haruka Kiyohara
Kazuki Mogi
Yusuke Narita
Kei Tateno
2
Fully convolutional networks for semantic segmentation
2015
Jonathan Long
Evan Shelhamer
Trevor Darrell
2
Doubly Robust Off-policy Value Evaluation for Reinforcement Learning
2015
Nan Jiang
Lihong Li
2
RecoGym: A Reinforcement Learning Environment for the problem of Product Recommendation in Online Advertising
2018
David Rohde
Stephen Bonner
Travis Dunlop
Flavian Vasile
Alexandros Karatzoglou
2
Representation Balancing MDPs for Off-policy Policy Evaluation
2018
Yao Liu
Omer Gottesman
Aniruddh Raghu
Matthieu Komorowski
A. Aldo Faisal
Finale Doshi-Velez
Emma Brunskill
2
Effective Evaluation Using Logged Bandit Feedback from Multiple Loggers
2017
Aman Agarwal
Soumya Basu
Tobias Schnabel
Thorsten Joachims
2
Adapting multi-armed bandits policies to contextual bandits scenarios
2018
David Cortés-Polo
2
Demystifying Double Robustness: A Comparison of Alternative Strategies for Estimating a Population Mean from Incomplete Data
2007
Joseph Kang
Joseph L. Schafer
2
Large-scale Validation of Counterfactual Learning Methods: A Test-Bed
2016
Damien Lefortier
Adith Swaminathan
Xiaotao Gu
Thorsten Joachims
Maarten de Rijke
2
Empirical Study of Off-Policy Policy Evaluation for Reinforcement Learning
2019
Cameron Voloshin
Hoang Le
Nan Jiang
Yisong Yue
2
Debiased Off-Policy Evaluation for Recommendation Systems
2021
Yusuke Narita
Shota Yasui
Kohei Yata
2
WaveNet: A Generative Model for Raw Audio
2016
Aäron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
Andrew Senior
Koray Kavukcuoglu
1
Training Very Deep Networks
2015
Rupesh K. Srivastava
Klaus Greff
Jürgen Schmidhuber
1
Quasi-Recurrent Neural Networks
2016
James Bradbury
Stephen Merity
Caiming Xiong
Richard Socher
1
Building End-To-End Dialogue Systems Using Generative Hierarchical Neural Network Models
2016
Iulian Vlad Serban
Alessandro Sordoni
Yoshua Bengio
Aaron Courville
Joëlle Pineau
1
Character-level Convolutional Networks for Text Classification
2015
Xiang Zhang
Junbo Zhao
Yann LeCun
1
Policy Evaluation and Optimization with Continuous Treatments
2018
Nathan Kallus
Angela Zhou
1
Tacotron: Towards End-to-End Speech Synthesis
2017
Yuxuan Wang
RJ Skerry-Ryan
Daisy Stanton
Yonghui Wu
Ron J. Weiss
Navdeep Jaitly
Zongheng Yang
Ying Xiao
Zhifeng Chen
Samy Bengio
1
Multi-Scale Context Aggregation by Dilated Convolutions
2015
Fisher Yu
Vladlen Koltun
1
StackGAN: Text to Photo-Realistic Image Synthesis with Stacked Generative Adversarial Networks
2017
Han Zhang
Tao Xu
Hongsheng Li
Shaoting Zhang
Xiaogang Wang
Xiaolei Huang
Dimitris Metaxas
1
Adam: A Method for Stochastic Optimization
2014
Diederik P. Kingma
Jimmy Ba
1
Neural Machine Translation by Jointly Learning to Align and Translate
2015
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
1
Efficiently Breaking the Curse of Horizon: Double Reinforcement Learning in Infinite-Horizon Processes
2019
Nathan Kallus
Masatoshi Uehara
1
Towards Optimal Off-Policy Evaluation for Reinforcement Learning with Marginalized Importance Sampling
2019
Tengyang Xie
Yifei Ma
Yu-Xiang Wang
1
Efficiently Breaking the Curse of Horizon in Off-Policy Evaluation with Double Reinforcement Learning
2022
Nathan Kallus
Masatoshi Uehara
1
Doubly robust off-policy evaluation with shrinkage
2020
Yi Su
Maria Dimakopoulou
Akshay Krishnamurthy
Miroslav Dudík
1
Offline A/B Testing for Recommender Systems
2018
Alexandre Gilotte
Clément Calauzènes
Thomas Nedelec
Alexandre Abraham
Simon Dollé
1
Off-Policy Evaluation and Learning for External Validity under a Covariate Shift
2020
Masatoshi Uehara
Masahiro Kato
Shota Yasui
1
Open Bandit Dataset and Pipeline: Towards Realistic and Reproducible Off-Policy Evaluation
2020
Yuta Saito
Shunsuke Aihara
Megumi Matsutani
Yusuke Narita
1
Benchmarks for Deep Off-Policy Evaluation
2021
Justin Fu
Mohammad Norouzi
Ofir Nachum
George Tucker
Ziyu Wang
Alexander Novikov
Mengjiao Yang
Michael R. Zhang
Yutian Chen
Aviral Kumar
1
A Neural Conversational Model
2015
Oriol Vinyals
Quoc V. Le
1
Neural Machine Translation in Linear Time
2016
Nal Kalchbrenner
Lasse Espeholt
Karen Simonyan
Aäron van den Oord
Alex Graves
Koray Kavukcuoglu
1
Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification
2015
Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
1
Convolutional Neural Networks for Sentence Classification
2014
Yoon Kim
1