Minjie Cai

Generating author description...

All published works

Action	Title	Year	Authors
+	Uncertainty-Aware Model Adaptation for Unsupervised Cross-Domain Object Detection	2021	Minjie Cai Minyi Luo Xionghu Zhong Hao Chen
+ PDF Chat	Mutual Context Network for Jointly Estimating Egocentric Gaze and Action	2020	Yifei Huang Minjie Cai Zhenqiang Li Feng Lu Yoichi Sato
+ PDF Chat	What I See Is What You See	2019	Huangyue Yu Minjie Cai Yunfei Liu Feng Lu
+ PDF Chat	Manipulation-Skill Assessment from Videos with Spatial Attention Network	2019	Zhenqiang Li Yifei Huang Minjie Cai Yoichi Sato
+	Mutual Context Network for Jointly Estimating Egocentric Gaze and Actions	2019	Yifei Huang Zhenqiang Li Minjie Cai Yoichi Sato
+	Manipulation-skill Assessment from Videos with Spatial Attention Network	2019	Zhenqiang Li Yifei Huang Minjie Cai Yoichi Sato
+	What I See Is What You See: Joint Attention Learning for First and Third Person Video Co-analysis	2019	Huangyue Yu Minjie Cai Yunfei Liu Feng Lu
+ PDF Chat	Predicting Gaze in Egocentric Video by Learning Task-Dependent Attention Transition	2018	Yifei Huang Minjie Cai Zhenqiang Li Yoichi Sato
+	Understanding hand-object manipulation by modeling the contextual relationship between actions, grasp types and object attributes	2018	Minjie Cai Kris Kitani Yoichi Sato
+	Predicting Gaze in Egocentric Video by Learning Task-dependent Attention Transition	2018	Yifei Huang Minjie Cai Zhenqiang Li Yoichi Sato

Common Coauthors

Coauthor	Papers Together
Yoichi Sato	6
Zhenqiang Li	6
Yifei Huang	6
Feng Lu	3
Huangyue Yu	2
Yunfei Liu	2
Yoichi Sato	1
Kris Kitani	1
Minyi Luo	1
Hao Chen	1
Xionghu Zhong	1

Commonly Cited References

Action	Title	Year	Authors	# of times referenced
+ PDF Chat	Predicting Gaze in Egocentric Video by Learning Task-Dependent Attention Transition	2018	Yifei Huang Minjie Cai Zhenqiang Li Yoichi Sato	4
+ PDF Chat	SCA-CNN: Spatial and Channel-Wise Attention in Convolutional Networks for Image Captioning	2017	Long Chen Hanwang Zhang Jun Xiao Liqiang Nie Jian Shao Wei Liu Tat‐Seng Chua	4
+	Very Deep Convolutional Networks for Large-Scale Image Recognition	2014	Karen Simonyan Andrew Zisserman	4
+ PDF Chat	Boosted Attention: Leveraging Human Attention for Image Captioning	2018	Shi Chen Qi Zhao	3
+	SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation	2017	Vijay Badrinarayanan A. C. Kendall Roberto Cipolla	3
+	UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild	2012	Khurram Soomro Amir Zamir Mubarak Shah	3
+ PDF Chat	Temporal Segment Networks: Towards Good Practices for Deep Action Recognition	2016	Limin Wang Yuanjun Xiong Zhe Wang Yu Qiao Dahua Lin Xiaoou Tang Luc Van Gool	3
+ PDF Chat	The Evolution of First Person Vision Methods: A Survey	2015	Alejandro Betancourt Pietro Morerio Carlo S. Regazzoni Matthias Rauterberg	3
+ PDF Chat	Going Deeper into First-Person Activity Recognition	2016	Minghuang Ma Haoqi Fan Kris Kitani	3
+	Two-Stream Convolutional Networks for Action Recognition in Videos	2014	Karen Simonyan Andrew Zisserman	3
+ PDF Chat	Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering	2018	Peter Anderson Xiaodong He Chris Buehler Damien Teney Mark Johnson Stephen Jay Gould Lei Zhang	3
+ PDF Chat	Top-Down Visual Saliency Guided by Captions	2017	Vasili Ramanishka Abir Das Jianming Zhang Kate Saenko	2
+ PDF Chat	Shallow and Deep Convolutional Networks for Saliency Prediction	2016	Junting Pan Elisa Sayrol Xavier Giró-i-Nieto Kevin McGuinness Noel E. O’Connor	2
+ PDF Chat	Learning Deep Features for Discriminative Localization	2016	Bolei Zhou Aditya Khosla Àgata Lapedriza Aude Oliva Antonio Torralba	2
+ PDF Chat	Deep Residual Learning for Image Recognition	2016	Kaiming He Xiangyu Zhang Shaoqing Ren Jian Sun	2
+ PDF Chat	Tool Detection and Operative Skill Assessment in Surgical Videos Using Region-Based Convolutional Neural Networks	2018	Amy Jin Serena Yeung Jeffrey K. Jopling Jonathan Krause Dan E. Azagury Arnold Milstein Li Fei-Fei	2
+ PDF Chat	Attend and Interact: Higher-Order Object Interactions for Video Understanding	2018	Chih‐Yao Ma Asim Kadav Iain Melvin Zsolt Kira Ghassan AlRegib Hans Peter Graf	2
+ PDF Chat	EgoSampling: Fast-forward and stereo for egocentric videos	2015	Yair Poleg Tavi Halperin Chetan Arora Bezalel Peleg	2
+ PDF Chat	Hierarchical Multi-scale Attention Networks for action recognition	2017	Shiyang Yan Jeremy S. Smith Wenjin Lu Bailing Zhang	2
+ PDF Chat	Non-local Neural Networks	2018	Xiaolong Wang Ross Girshick Abhinav Gupta Kaiming He	2
+ PDF Chat	Pooled motion features for first-person videos	2015	Michael S. Ryoo Brandon Rothrock Larry Matthies	2
+ PDF Chat	LSTA: Long Short-Term Attention for Egocentric Action Recognition	2019	Swathikiran Sudhakaran Sérgio Escalera Oswald Lanz	2
+	Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling	2014	Jun‐Young Chung Çaǧlar Gülçehre Kyunghyun Cho Yoshua Bengio	2
+ PDF Chat	Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset	2017	João Carreira Andrew Zisserman	2
+ PDF Chat	Realtime Multi-person 2D Pose Estimation Using Part Affinity Fields	2017	Zhe Cao Tomas Simon Shih-En Wei Yaser Sheikh	2
+ PDF Chat	FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks	2017	Eddy Ilg N. Michael Mayer Tonmoy Saikia Margret Keuper Alexey Dosovitskiy Thomas Brox	2
+	Mutual Context Network for Jointly Estimating Egocentric Gaze and Actions	2019	Yifei Huang Zhenqiang Li Minjie Cai Yoichi Sato	2
+	Two-Stream Convolutional Networks for Action Recognition in Videos	2014	Karen Simonyan Andrew Zisserman	2
+ PDF Chat	Squeeze-and-Excitation Networks	2019	Jie Hu Li Shen Samuel Albanie Gang Sun Enhua Wu	2
+ PDF Chat	Video and accelerometer-based motion analysis for automated surgical skills assessment	2018	Aneeq Zia Yachna Sharma Vinay Bettadapura Eric L. Sarin Irfan Essa	2
+ PDF Chat	CBAM: Convolutional Block Attention Module	2018	Sanghyun Woo Jongchan Park Joon‐Young Lee In So Kweon	2
+	The Kinetics Human Action Video Dataset	2017	Andrew Zisserman João Carreira Karen Simonyan Will Kay Brian Zhang Chloe Hillier Sudheendra Vijayanarasimhan Fabio Viola T.C. Green Trevor Back	2
+	Attention is All We Need: Nailing Down Object-centric Attention for Egocentric Activity Recognition	2018	Swathikiran Sudhakaran Oswald Lanz	2
+	Ego-surfing first person videos	2015	Ryo Yonetani Kris Kitani Yoichi Sato	2
+ PDF Chat	ActionVLAD: Learning Spatio-Temporal Aggregation for Action Classification	2017	Rohit Girdhar Deva Ramanan Abhinav Gupta Josef Šivic Bryan Russell	2
+ PDF Chat	StyleBank: An Explicit Representation for Neural Image Style Transfer	2017	Dongdong Chen Lu Yuan Jing Liao Nenghai Yu Gang Hua	2
+	Learning to score the figure skating sports videos	2018	Chengming Xu Yanwei Fu Bing Zhang Zitian Chen Yu–Gang Jiang Xiangyang Xue	2
+ PDF Chat	Visual Dynamics: Stochastic Future Generation via Layered Cross Convolutional Networks	2018	Tianfan Xue Jiajun Wu Katherine L. Bouman William T. Freeman	2
+ PDF Chat	Am I a Baller? Basketball Performance Assessment from First-Person Videos	2017	Gedas Bertasius Hyun Soo Park Stella X. Yu Jianbo Shi	2
+ PDF Chat	Who's Better? Who's Best? Pairwise Deep Ranking for Skill Determination	2018	Hazel Doughty Dima Damen Walterio Mayol‐Cuevas	2
+ PDF Chat	Object-Part Attention Model for Fine-Grained Image Classification	2017	Yuxin Peng Xiangteng He Junjie Zhao	2
+ PDF Chat	Hierarchical Recurrent Neural Network for Video Summarization	2017	Bin Zhao Xuelong Li Xiaoqiang Lu	2
+ PDF Chat	Long-Term Feature Banks for Detailed Video Understanding	2019	Chao-Yuan Wu Christoph Feichtenhofer Haoqi Fan Kaiming He Philipp Krähenbühl Ross Girshick	2
+	Human Attention in Visual Question Answering: Do Humans and Deep Networks Look at the Same Regions?	2017	Abhishek Das Harsh Agrawal Larry Zitnick Devi Parikh Dhruv Batra	2
+	On the Role of Event Boundaries in Egocentric Activity Recognition from Photostreams	2018	Alejandro Cartas Estefanía Talavera Petia Radeva Mariella Dimiccoli	2
+ PDF Chat	Attention Clusters: Purely Attention Based Local Feature Integration for Video Classification	2018	Xiang Long Chuang Gan Gerard de Melo Jiajun Wu Xiao Liu Shilei Wen	2
+ PDF Chat	Residual Attention Network for Image Classification	2017	Fei Wang Mengqing Jiang Chen Qian Shuo Yang Cheng Li Honggang Zhang Xiaogang Wang Xiaoou Tang	2
+	Action Recognition using Visual Attention	2015	Shikhar Sharma Ryan Kiros Ruslan Salakhutdinov	2
+ PDF Chat	Learning to Score Olympic Events	2017	Paritosh Parmar Brendan Morris	2
+ PDF Chat	Digging Deeper Into Egocentric Gaze Prediction	2019	Hamed R. Tavakoli Esa Rahtu Juho Kannala Ali Borji	2