+
PDF
Chat
|
Predicting Gaze in Egocentric Video by Learning Task-Dependent Attention Transition
|
2018
|
Yifei Huang
Minjie Cai
Zhenqiang Li
Yoichi Sato
|
4
|
+
PDF
Chat
|
SCA-CNN: Spatial and Channel-Wise Attention in Convolutional Networks for Image Captioning
|
2017
|
Long Chen
Hanwang Zhang
Jun Xiao
Liqiang Nie
Jian Shao
Wei Liu
TatâSeng Chua
|
4
|
+
|
Very Deep Convolutional Networks for Large-Scale Image Recognition
|
2014
|
Karen Simonyan
Andrew Zisserman
|
4
|
+
PDF
Chat
|
Boosted Attention: Leveraging Human Attention for Image Captioning
|
2018
|
Shi Chen
Qi Zhao
|
3
|
+
|
SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
|
2017
|
Vijay Badrinarayanan
A. C. Kendall
Roberto Cipolla
|
3
|
+
|
UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild
|
2012
|
Khurram Soomro
Amir Zamir
Mubarak Shah
|
3
|
+
PDF
Chat
|
Temporal Segment Networks: Towards Good Practices for Deep Action Recognition
|
2016
|
Limin Wang
Yuanjun Xiong
Zhe Wang
Yu Qiao
Dahua Lin
Xiaoou Tang
Luc Van Gool
|
3
|
+
PDF
Chat
|
The Evolution of First Person Vision Methods: A Survey
|
2015
|
Alejandro Betancourt
Pietro Morerio
Carlo S. Regazzoni
Matthias Rauterberg
|
3
|
+
PDF
Chat
|
Going Deeper into First-Person Activity Recognition
|
2016
|
Minghuang Ma
Haoqi Fan
Kris Kitani
|
3
|
+
|
Two-Stream Convolutional Networks for Action Recognition in Videos
|
2014
|
Karen Simonyan
Andrew Zisserman
|
3
|
+
PDF
Chat
|
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering
|
2018
|
Peter Anderson
Xiaodong He
Chris Buehler
Damien Teney
Mark Johnson
Stephen Jay Gould
Lei Zhang
|
3
|
+
PDF
Chat
|
Top-Down Visual Saliency Guided by Captions
|
2017
|
Vasili Ramanishka
Abir Das
Jianming Zhang
Kate Saenko
|
2
|
+
PDF
Chat
|
Shallow and Deep Convolutional Networks for Saliency Prediction
|
2016
|
Junting Pan
Elisa Sayrol
Xavier GirĂł-i-Nieto
Kevin McGuinness
Noel E. OâConnor
|
2
|
+
PDF
Chat
|
Learning Deep Features for Discriminative Localization
|
2016
|
Bolei Zhou
Aditya Khosla
Ăgata Lapedriza
Aude Oliva
Antonio Torralba
|
2
|
+
PDF
Chat
|
Deep Residual Learning for Image Recognition
|
2016
|
Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
|
2
|
+
PDF
Chat
|
Tool Detection and Operative Skill Assessment in Surgical Videos Using Region-Based Convolutional Neural Networks
|
2018
|
Amy Jin
Serena Yeung
Jeffrey K. Jopling
Jonathan Krause
Dan E. Azagury
Arnold Milstein
Li Fei-Fei
|
2
|
+
PDF
Chat
|
Attend and Interact: Higher-Order Object Interactions for Video Understanding
|
2018
|
ChihâYao Ma
Asim Kadav
Iain Melvin
Zsolt Kira
Ghassan AlRegib
Hans Peter Graf
|
2
|
+
PDF
Chat
|
EgoSampling: Fast-forward and stereo for egocentric videos
|
2015
|
Yair Poleg
Tavi Halperin
Chetan Arora
Bezalel Peleg
|
2
|
+
PDF
Chat
|
Hierarchical Multi-scale Attention Networks for action recognition
|
2017
|
Shiyang Yan
Jeremy S. Smith
Wenjin Lu
Bailing Zhang
|
2
|
+
PDF
Chat
|
Non-local Neural Networks
|
2018
|
Xiaolong Wang
Ross Girshick
Abhinav Gupta
Kaiming He
|
2
|
+
PDF
Chat
|
Pooled motion features for first-person videos
|
2015
|
Michael S. Ryoo
Brandon Rothrock
Larry Matthies
|
2
|
+
PDF
Chat
|
LSTA: Long Short-Term Attention for Egocentric Action Recognition
|
2019
|
Swathikiran Sudhakaran
SĂŠrgio Escalera
Oswald Lanz
|
2
|
+
|
Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling
|
2014
|
JunâYoung Chung
Ăaǧlar GĂźlçehre
Kyunghyun Cho
Yoshua Bengio
|
2
|
+
PDF
Chat
|
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
|
2017
|
JoĂŁo Carreira
Andrew Zisserman
|
2
|
+
PDF
Chat
|
Realtime Multi-person 2D Pose Estimation Using Part Affinity Fields
|
2017
|
Zhe Cao
Tomas Simon
Shih-En Wei
Yaser Sheikh
|
2
|
+
PDF
Chat
|
FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks
|
2017
|
Eddy Ilg
N. Michael Mayer
Tonmoy Saikia
Margret Keuper
Alexey Dosovitskiy
Thomas Brox
|
2
|
+
|
Mutual Context Network for Jointly Estimating Egocentric Gaze and Actions
|
2019
|
Yifei Huang
Zhenqiang Li
Minjie Cai
Yoichi Sato
|
2
|
+
|
Two-Stream Convolutional Networks for Action Recognition in Videos
|
2014
|
Karen Simonyan
Andrew Zisserman
|
2
|
+
PDF
Chat
|
Squeeze-and-Excitation Networks
|
2019
|
Jie Hu
Li Shen
Samuel Albanie
Gang Sun
Enhua Wu
|
2
|
+
PDF
Chat
|
Video and accelerometer-based motion analysis for automated surgical skills assessment
|
2018
|
Aneeq Zia
Yachna Sharma
Vinay Bettadapura
Eric L. Sarin
Irfan Essa
|
2
|
+
PDF
Chat
|
CBAM: Convolutional Block Attention Module
|
2018
|
Sanghyun Woo
Jongchan Park
JoonâYoung Lee
In So Kweon
|
2
|
+
|
The Kinetics Human Action Video Dataset
|
2017
|
Andrew Zisserman
JoĂŁo Carreira
Karen Simonyan
Will Kay
Brian Zhang
Chloe Hillier
Sudheendra Vijayanarasimhan
Fabio Viola
T.C. Green
Trevor Back
|
2
|
+
|
Attention is All We Need: Nailing Down Object-centric Attention for Egocentric Activity Recognition
|
2018
|
Swathikiran Sudhakaran
Oswald Lanz
|
2
|
+
|
Ego-surfing first person videos
|
2015
|
Ryo Yonetani
Kris Kitani
Yoichi Sato
|
2
|
+
PDF
Chat
|
ActionVLAD: Learning Spatio-Temporal Aggregation for Action Classification
|
2017
|
Rohit Girdhar
Deva Ramanan
Abhinav Gupta
Josef Ĺ ivic
Bryan Russell
|
2
|
+
PDF
Chat
|
StyleBank: An Explicit Representation for Neural Image Style Transfer
|
2017
|
Dongdong Chen
Lu Yuan
Jing Liao
Nenghai Yu
Gang Hua
|
2
|
+
|
Learning to score the figure skating sports videos
|
2018
|
Chengming Xu
Yanwei Fu
Bing Zhang
Zitian Chen
YuâGang Jiang
Xiangyang Xue
|
2
|
+
PDF
Chat
|
Visual Dynamics: Stochastic Future Generation via Layered Cross Convolutional Networks
|
2018
|
Tianfan Xue
Jiajun Wu
Katherine L. Bouman
William T. Freeman
|
2
|
+
PDF
Chat
|
Am I a Baller? Basketball Performance Assessment from First-Person Videos
|
2017
|
Gedas Bertasius
Hyun Soo Park
Stella X. Yu
Jianbo Shi
|
2
|
+
PDF
Chat
|
Who's Better? Who's Best? Pairwise Deep Ranking for Skill Determination
|
2018
|
Hazel Doughty
Dima Damen
Walterio MayolâCuevas
|
2
|
+
PDF
Chat
|
Object-Part Attention Model for Fine-Grained Image Classification
|
2017
|
Yuxin Peng
Xiangteng He
Junjie Zhao
|
2
|
+
PDF
Chat
|
Hierarchical Recurrent Neural Network for Video Summarization
|
2017
|
Bin Zhao
Xuelong Li
Xiaoqiang Lu
|
2
|
+
PDF
Chat
|
Long-Term Feature Banks for Detailed Video Understanding
|
2019
|
Chao-Yuan Wu
Christoph Feichtenhofer
Haoqi Fan
Kaiming He
Philipp Krähenbßhl
Ross Girshick
|
2
|
+
|
Human Attention in Visual Question Answering: Do Humans and Deep Networks Look at the Same Regions?
|
2017
|
Abhishek Das
Harsh Agrawal
Larry Zitnick
Devi Parikh
Dhruv Batra
|
2
|
+
|
On the Role of Event Boundaries in Egocentric Activity Recognition from Photostreams
|
2018
|
Alejandro Cartas
EstefanĂa Talavera
Petia Radeva
Mariella Dimiccoli
|
2
|
+
PDF
Chat
|
Attention Clusters: Purely Attention Based Local Feature Integration for Video Classification
|
2018
|
Xiang Long
Chuang Gan
Gerard de Melo
Jiajun Wu
Xiao Liu
Shilei Wen
|
2
|
+
PDF
Chat
|
Residual Attention Network for Image Classification
|
2017
|
Fei Wang
Mengqing Jiang
Chen Qian
Shuo Yang
Cheng Li
Honggang Zhang
Xiaogang Wang
Xiaoou Tang
|
2
|
+
|
Action Recognition using Visual Attention
|
2015
|
Shikhar Sharma
Ryan Kiros
Ruslan Salakhutdinov
|
2
|
+
PDF
Chat
|
Learning to Score Olympic Events
|
2017
|
Paritosh Parmar
Brendan Morris
|
2
|
+
PDF
Chat
|
Digging Deeper Into Egocentric Gaze Prediction
|
2019
|
Hamed R. Tavakoli
Esa Rahtu
Juho Kannala
Ali Borji
|
2
|