+
PDF
Chat
|
Deep Residual Learning for Image Recognition
|
2016
|
Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
|
14
|
+
PDF
Chat
|
Unsupervised Visual Representation Learning by Context Prediction
|
2015
|
Carl Doersch
Abhinav Gupta
Alexei A. Efros
|
9
|
+
PDF
Chat
|
Visually Indicated Sounds
|
2016
|
Andrew Owens
Phillip Isola
Josh H. McDermott
Antonio Torralba
Edward H. Adelson
William T. Freeman
|
7
|
+
PDF
Chat
|
Audio-Visual Scene Analysis with Self-Supervised Multisensory Features
|
2018
|
Andrew Owens
Alexei A. Efros
|
7
|
+
PDF
Chat
|
Ambient Sound Provides Supervision for Visual Learning
|
2016
|
Andrew Owens
Jiajun Wu
Josh H. McDermott
William T. Freeman
Antonio Torralba
|
7
|
+
PDF
Chat
|
Learning Deep Features for Discriminative Localization
|
2016
|
Bolei Zhou
Aditya Khosla
Ă€gata Lapedriza
Aude Oliva
Antonio Torralba
|
7
|
+
PDF
Chat
|
Learning to Localize Sound Source in Visual Scenes
|
2018
|
Arda Senocak
Tae-Hyun Oh
Junsik Kim
Ming–Hsuan Yang
In So Kweon
|
6
|
+
PDF
Chat
|
Image-to-Image Translation with Conditional Adversarial Networks
|
2017
|
Phillip Isola
Jun-Yan Zhu
Tinghui Zhou
Alexei A. Efros
|
6
|
+
PDF
Chat
|
Unsupervised Learning of Visual Representations Using Videos
|
2015
|
Xiaolong Wang
Abhinav Gupta
|
6
|
+
PDF
Chat
|
VoxCeleb2: Deep Speaker Recognition
|
2018
|
Joon Son Chung
Arsha Nagrani
Andrew Zisserman
|
6
|
+
PDF
Chat
|
Momentum Contrast for Unsupervised Visual Representation Learning
|
2020
|
Kaiming He
Haoqi Fan
Yuxin Wu
Saining Xie
Ross Girshick
|
6
|
+
PDF
Chat
|
Learning Image Representations Tied to Ego-Motion
|
2015
|
Dinesh Jayaraman
Kristen Grauman
|
6
|
+
PDF
Chat
|
ImageNet Large Scale Visual Recognition Challenge
|
2015
|
Olga Russakovsky
Jia Deng
Hao Su
Jonathan Krause
Sanjeev Satheesh
Sean Ma
Zhiheng Huang
Andrej Karpathy
Aditya Khosla
Michael S. Bernstein
|
5
|
+
PDF
Chat
|
Learning Rich Features for Image Manipulation Detection
|
2018
|
Peng Zhou
Xintong Han
Vlad I. Morariu
Larry S. Davis
|
5
|
+
PDF
Chat
|
Jointly Discovering Visual Objects and Spoken Words from Raw Sensory Input
|
2018
|
David Harwath
AdriĂ Recasens
DĂdac SurĂs
Galen Chuang
Antonio Torralba
James Glass
|
5
|
+
PDF
Chat
|
Looking to listen at the cocktail party
|
2018
|
Ariel Ephrat
Inbar Mosseri
Oran Lang
Tali Dekel
Kevin Wilson
Avinatan Hassidim
William T. Freeman
Michael Rubinstein
|
5
|
+
PDF
Chat
|
Learning to Separate Object Sounds by Watching Unlabeled Video
|
2018
|
Ruohan Gao
Rogério Feris
Kristen Grauman
|
5
|
+
|
Learning visual groups from co-occurrences in space and time
|
2015
|
Phillip Isola
Daniel Zoran
Dilip Krishnan
Edward H. Adelson
|
5
|
+
PDF
Chat
|
Learning Correspondence From the Cycle-Consistency of Time
|
2019
|
Xiaolong Wang
Allan Jabri
Alexei A. Efros
|
5
|
+
PDF
Chat
|
CNN architectures for large-scale audio classification
|
2017
|
Shawn Hershey
Sourish Chaudhuri
Daniel P. W. Ellis
Jort F. Gemmeke
Aren Jansen
Robert C. Moore
Manoj Plakal
Devin Platt
Rif A. Saurous
Bryan Seybold
|
5
|
+
|
Adam: A Method for Stochastic Optimization
|
2014
|
Diederik P. Kingma
Jimmy Ba
|
4
|
+
PDF
Chat
|
Aligned and non-aligned double JPEG detection using convolutional neural networks
|
2017
|
Mauro Barni
Luca Bondi
Nicolò Bonettini
Paolo Bestagini
A. Costanzo
Marco Maggini
Benedetta Tondi
Stefano Tubaro
|
4
|
+
|
Very Deep Convolutional Networks for Large-Scale Image Recognition
|
2014
|
Karen Simonyan
Andrew Zisserman
|
4
|
+
PDF
Chat
|
Designing deep networks for surface normal estimation
|
2015
|
Xiaolong Wang
David F. Fouhey
Abhinav Gupta
|
4
|
+
PDF
Chat
|
PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume
|
2018
|
Deqing Sun
Xiaodong Yang
Ming-Yu Liu
Jan Kautz
|
4
|
+
PDF
Chat
|
The Conversation: Deep Audio-Visual Speech Enhancement
|
2018
|
Triantafyllos Afouras
Joon Son Chung
Andrew Zisserman
|
4
|
+
PDF
Chat
|
Tracking Emerges by Colorizing Videos
|
2018
|
Carl Vondrick
Abhinav Shrivastava
Alireza Fathi
Sergio Guadarrama
Kevin Murphy
|
4
|
+
PDF
Chat
|
Semantic Image Synthesis With Spatially-Adaptive Normalization
|
2019
|
Taesung Park
Ming-Yu Liu
Ting-Chun Wang
Jun-Yan Zhu
|
4
|
+
PDF
Chat
|
Learning Sight from Sound: Ambient Sound Provides Supervision for Visual Learning
|
2018
|
Andrew Owens
Jiajun Wu
Josh H. McDermott
William T. Freeman
Antonio Torralba
|
4
|
+
PDF
Chat
|
Localization of JPEG Double Compression Through Multi-domain Convolutional Neural Networks
|
2017
|
Irene Amerini
Tiberio Uricchio
Lamberto Ballan
Roberto Caldelli
|
4
|
+
PDF
Chat
|
Two-Stream Neural Networks for Tampered Face Detection
|
2017
|
Peng Zhou
Xintong Han
Vlad I. Morariu
Larry S. Davis
|
4
|
+
PDF
Chat
|
DeMoN: Depth and Motion Network for Learning Monocular Stereo
|
2017
|
Benjamin Ummenhofer
Huizhong Zhou
Jonas Uhrig
N. Michael Mayer
Eddy Ilg
Alexey Dosovitskiy
Thomas Brox
|
4
|
+
PDF
Chat
|
Telling Left From Right: Learning Spatial Correspondence of Sight and Sound
|
2020
|
Karren Yang
Bryan Russell
Justin Salamon
|
4
|
+
PDF
Chat
|
A Style-Based Generator Architecture for Generative Adversarial Networks
|
2019
|
Tero Karras
Samuli Laine
Timo Aila
|
4
|
+
PDF
Chat
|
Self-Supervised Moving Vehicle Tracking With Stereo Sound
|
2019
|
Chuang Gan
Hang Zhao
Peihao Chen
David Cox
Antonio Torralba
|
4
|
+
PDF
Chat
|
Fighting Fake News: Image Splice Detection via Learned Self-Consistency
|
2018
|
Minyoung Huh
Andy Liu
Andrew Owens
Alexei A. Efros
|
4
|
+
PDF
Chat
|
Learning a Discriminative Model for the Perception of Realism in Composite Images
|
2015
|
Jun-Yan Zhu
Philipp Krähenbühl
Eli Shechtman
Alexei A. Efros
|
4
|
+
PDF
Chat
|
Perceptual Losses for Real-Time Style Transfer and Super-Resolution
|
2016
|
Justin Johnson
Alexandre Alahi
Li Fei-Fei
|
4
|
+
PDF
Chat
|
Seeing Through Noise: Visually Driven Speaker Separation And Enhancement
|
2018
|
Aviv Gabbay
Ariel Ephrat
Tavi Halperin
Bezalel Peleg
|
4
|
+
PDF
Chat
|
Looking to listen at the cocktail party: a speaker-independent audio-visual model for speech separation
|
2018
|
Ariel Ephrat
Inbar Mosseri
Oran Lang
Tali Dekel
Kevin Wilson
Avinatan Hassidim
William T. Freeman
Michael Rubinstein
|
4
|
+
PDF
Chat
|
Audio-Visual Event Localization in Unconstrained Videos
|
2018
|
Yapeng Tian
Jing Shi
Bochen Li
Zhiyao Duan
Chenliang Xu
|
4
|
+
PDF
Chat
|
Predicting Depth, Surface Normals and Semantic Labels with a Common Multi-scale Convolutional Architecture
|
2015
|
David Eigen
Rob Fergus
|
3
|
+
PDF
Chat
|
Learning a Predictable and Generative Vector Representation for Objects
|
2016
|
Rohit Girdhar
David F. Fouhey
Mikel RodrĂguez
Abhinav Gupta
|
3
|
+
PDF
Chat
|
Fully convolutional networks for semantic segmentation
|
2015
|
Jonathan Long
Evan Shelhamer
Trevor Darrell
|
3
|
+
PDF
Chat
|
Photographic Image Synthesis with Cascaded Refinement Networks
|
2017
|
Qifeng Chen
Vladlen Koltun
|
3
|
+
PDF
Chat
|
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
|
2017
|
JoĂŁo Carreira
Andrew Zisserman
|
3
|
+
PDF
Chat
|
Factoring Shape, Pose, and Layout from the 2D Image of a 3D Scene
|
2018
|
Shubham Tulsiani
Saurabh Gupta
David F. Fouhey
Alexei A. Efrosefros
Jitendra Malik
|
3
|
+
PDF
Chat
|
Split-Brain Autoencoders: Unsupervised Learning by Cross-Channel Prediction
|
2017
|
Richard Zhang
Phillip Isola
Alexei A. Efros
|
3
|
+
PDF
Chat
|
Non-local Neural Networks
|
2018
|
Xiaolong Wang
Ross Girshick
Abhinav Gupta
Kaiming He
|
3
|
+
PDF
Chat
|
Deep clustering: Discriminative embeddings for segmentation and separation
|
2016
|
John R. Hershey
Zhuo Chen
Jonathan Le Roux
Shinji Watanabe
|
3
|