Minjie Cai

Commonly Cited References
Title Year Authors # of times referenced
Predicting Gaze in Egocentric Video by Learning Task-Dependent Attention Transition 2018 Yifei Huang
Minjie Cai
Zhenqiang Li
Yoichi Sato
4
SCA-CNN: Spatial and Channel-Wise Attention in Convolutional Networks for Image Captioning 2017 Long Chen
Hanwang Zhang
Jun Xiao
Liqiang Nie
Jian Shao
Wei Liu
Tat‐Seng Chua
4
Very Deep Convolutional Networks for Large-Scale Image Recognition 2014 Karen Simonyan
Andrew Zisserman
4
Boosted Attention: Leveraging Human Attention for Image Captioning 2018 Shi Chen
Qi Zhao
3
SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation 2017 Vijay Badrinarayanan
A. C. Kendall
Roberto Cipolla
3
UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild 2012 Khurram Soomro
Amir Zamir
Mubarak Shah
3
Temporal Segment Networks: Towards Good Practices for Deep Action Recognition 2016 Limin Wang
Yuanjun Xiong
Zhe Wang
Yu Qiao
Dahua Lin
Xiaoou Tang
Luc Van Gool
3
The Evolution of First Person Vision Methods: A Survey 2015 Alejandro Betancourt
Pietro Morerio
Carlo S. Regazzoni
Matthias Rauterberg
3
Going Deeper into First-Person Activity Recognition 2016 Minghuang Ma
Haoqi Fan
Kris Kitani
3
Two-Stream Convolutional Networks for Action Recognition in Videos 2014 Karen Simonyan
Andrew Zisserman
3
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering 2018 Peter Anderson
Xiaodong He
Chris Buehler
Damien Teney
Mark Johnson
Stephen Jay Gould
Lei Zhang
3
Top-Down Visual Saliency Guided by Captions 2017 Vasili Ramanishka
Abir Das
Jianming Zhang
Kate Saenko
2
Shallow and Deep Convolutional Networks for Saliency Prediction 2016 Junting Pan
Elisa Sayrol
Xavier Giró-i-Nieto
Kevin McGuinness
Noel E. O’Connor
2
Learning Deep Features for Discriminative Localization 2016 Bolei Zhou
Aditya Khosla
Àgata Lapedriza
Aude Oliva
Antonio Torralba
2
Deep Residual Learning for Image Recognition 2016 Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
2
Tool Detection and Operative Skill Assessment in Surgical Videos Using Region-Based Convolutional Neural Networks 2018 Amy Jin
Serena Yeung
Jeffrey K. Jopling
Jonathan Krause
Dan E. Azagury
Arnold Milstein
Li Fei-Fei
2
Attend and Interact: Higher-Order Object Interactions for Video Understanding 2018 Chih‐Yao Ma
Asim Kadav
Iain Melvin
Zsolt Kira
Ghassan AlRegib
Hans Peter Graf
2
EgoSampling: Fast-forward and stereo for egocentric videos 2015 Yair Poleg
Tavi Halperin
Chetan Arora
Bezalel Peleg
2
Hierarchical Multi-scale Attention Networks for action recognition 2017 Shiyang Yan
Jeremy S. Smith
Wenjin Lu
Bailing Zhang
2
Non-local Neural Networks 2018 Xiaolong Wang
Ross Girshick
Abhinav Gupta
Kaiming He
2
Pooled motion features for first-person videos 2015 Michael S. Ryoo
Brandon Rothrock
Larry Matthies
2
LSTA: Long Short-Term Attention for Egocentric Action Recognition 2019 Swathikiran Sudhakaran
Sérgio Escalera
Oswald Lanz
2
Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling 2014 Jun‐Young Chung
Çaǧlar Gülçehre
Kyunghyun Cho
Yoshua Bengio
2
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset 2017 João Carreira
Andrew Zisserman
2
Realtime Multi-person 2D Pose Estimation Using Part Affinity Fields 2017 Zhe Cao
Tomas Simon
Shih-En Wei
Yaser Sheikh
2
FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks 2017 Eddy Ilg
N. Michael Mayer
Tonmoy Saikia
Margret Keuper
Alexey Dosovitskiy
Thomas Brox
2
Mutual Context Network for Jointly Estimating Egocentric Gaze and Actions 2019 Yifei Huang
Zhenqiang Li
Minjie Cai
Yoichi Sato
2
Two-Stream Convolutional Networks for Action Recognition in Videos 2014 Karen Simonyan
Andrew Zisserman
2
Squeeze-and-Excitation Networks 2019 Jie Hu
Li Shen
Samuel Albanie
Gang Sun
Enhua Wu
2
Video and accelerometer-based motion analysis for automated surgical skills assessment 2018 Aneeq Zia
Yachna Sharma
Vinay Bettadapura
Eric L. Sarin
Irfan Essa
2
CBAM: Convolutional Block Attention Module 2018 Sanghyun Woo
Jongchan Park
Joon‐Young Lee
In So Kweon
2
The Kinetics Human Action Video Dataset 2017 Andrew Zisserman
João Carreira
Karen Simonyan
Will Kay
Brian Zhang
Chloe Hillier
Sudheendra Vijayanarasimhan
Fabio Viola
T.C. Green
Trevor Back
2
Attention is All We Need: Nailing Down Object-centric Attention for Egocentric Activity Recognition 2018 Swathikiran Sudhakaran
Oswald Lanz
2
Ego-surfing first person videos 2015 Ryo Yonetani
Kris Kitani
Yoichi Sato
2
ActionVLAD: Learning Spatio-Temporal Aggregation for Action Classification 2017 Rohit Girdhar
Deva Ramanan
Abhinav Gupta
Josef Šivic
Bryan Russell
2
StyleBank: An Explicit Representation for Neural Image Style Transfer 2017 Dongdong Chen
Lu Yuan
Jing Liao
Nenghai Yu
Gang Hua
2
Learning to score the figure skating sports videos 2018 Chengming Xu
Yanwei Fu
Bing Zhang
Zitian Chen
Yu–Gang Jiang
Xiangyang Xue
2
Visual Dynamics: Stochastic Future Generation via Layered Cross Convolutional Networks 2018 Tianfan Xue
Jiajun Wu
Katherine L. Bouman
William T. Freeman
2
Am I a Baller? Basketball Performance Assessment from First-Person Videos 2017 Gedas Bertasius
Hyun Soo Park
Stella X. Yu
Jianbo Shi
2
Who's Better? Who's Best? Pairwise Deep Ranking for Skill Determination 2018 Hazel Doughty
Dima Damen
Walterio Mayol‐Cuevas
2
Object-Part Attention Model for Fine-Grained Image Classification 2017 Yuxin Peng
Xiangteng He
Junjie Zhao
2
Hierarchical Recurrent Neural Network for Video Summarization 2017 Bin Zhao
Xuelong Li
Xiaoqiang Lu
2
Long-Term Feature Banks for Detailed Video Understanding 2019 Chao-Yuan Wu
Christoph Feichtenhofer
Haoqi Fan
Kaiming He
Philipp Krähenbühl
Ross Girshick
2
Human Attention in Visual Question Answering: Do Humans and Deep Networks Look at the Same Regions? 2017 Abhishek Das
Harsh Agrawal
Larry Zitnick
Devi Parikh
Dhruv Batra
2
On the Role of Event Boundaries in Egocentric Activity Recognition from Photostreams 2018 Alejandro Cartas
Estefanía Talavera
Petia Radeva
Mariella Dimiccoli
2
Attention Clusters: Purely Attention Based Local Feature Integration for Video Classification 2018 Xiang Long
Chuang Gan
Gerard de Melo
Jiajun Wu
Xiao Liu
Shilei Wen
2
Residual Attention Network for Image Classification 2017 Fei Wang
Mengqing Jiang
Chen Qian
Shuo Yang
Cheng Li
Honggang Zhang
Xiaogang Wang
Xiaoou Tang
2
Action Recognition using Visual Attention 2015 Shikhar Sharma
Ryan Kiros
Ruslan Salakhutdinov
2
Learning to Score Olympic Events 2017 Paritosh Parmar
Brendan Morris
2
Digging Deeper Into Egocentric Gaze Prediction 2019 Hamed R. Tavakoli
Esa Rahtu
Juho Kannala
Ali Borji
2