+
|
Generation and Comprehension of Unambiguous Object Descriptions
|
2015
|
Junhua Mao
Jonathan Huang
Alexander Toshev
Oana Camburu
Alan Yuille
Kevin Murphy
|
1
|
+
|
Efficient Inference in Fully Connected CRFs with Gaussian Edge Potentials
|
2012
|
Philipp Krähenbühl
Vladlen Koltun
|
1
|
+
PDF
Chat
|
Semantic Pose Using Deep Networks Trained on Synthetic RGB-D
|
2015
|
Jérémie Papon
Markus Schoeler
|
1
|
+
PDF
Chat
|
DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
|
2017
|
Liang-Chieh Chen
George Papandreou
Iasonas Kokkinos
Kevin Murphy
Alan Yuille
|
1
|
+
PDF
Chat
|
Modeling Context in Referring Expressions
|
2016
|
Licheng Yu
Patrick Poirson
Shan Yang
Alexander C. Berg
Tamara L. Berg
|
1
|
+
PDF
Chat
|
Enriching Word Vectors with Subword Information
|
2017
|
Piotr Bojanowski
Édouard Grave
Armand Joulin
Tomáš Mikolov
|
1
|
+
PDF
Chat
|
Feature Pyramid Networks for Object Detection
|
2017
|
Tsung-Yi Lin
Piotr Dollár
Ross Girshick
Kaiming He
Bharath Hariharan
Serge Belongie
|
1
|
+
PDF
Chat
|
ScanNet: Richly-Annotated 3D Reconstructions of Indoor Scenes
|
2017
|
Angela Dai
Anne Lynn S. Chang
Manolis Savva
Maciej Halber
Thomas Funkhouser
Matthias Nießner
|
1
|
+
PDF
Chat
|
Deformable Convolutional Networks
|
2017
|
Jifeng Dai
Haozhi Qi
Yuwen Xiong
Yi Li
Guodong Zhang
Han Hu
Yichen Wei
|
1
|
+
|
The Kinetics Human Action Video Dataset
|
2017
|
Andrew Zisserman
João Carreira
Karen Simonyan
Will Kay
Brian Zhang
Chloe Hillier
Sudheendra Vijayanarasimhan
Fabio Viola
T.C. Green
Trevor Back
|
1
|
+
|
Open3D: A Modern Library for 3D Data Processing
|
2018
|
Qian-Yi Zhou
Jaesik Park
Vladlen Koltun
|
1
|
+
PDF
Chat
|
Depth Estimation via Affinity Learned with Convolutional Spatial Propagation Network
|
2018
|
Xinjing Cheng
Peng Wang
Ruigang Yang
|
1
|
+
PDF
Chat
|
Sparse and Dense Data with CNNs: Depth Completion and Semantic Segmentation
|
2018
|
Maximilian Jaritz
Raoul de Charette
Émilie Wirbel
Xavier Perrotton
Fawzi Nashashibi
|
1
|
+
PDF
Chat
|
DeepHPS: End-to-end Estimation of 3D Hand Pose and Shape by Learning from Synthetic Depth
|
2018
|
Jameel Malik
Ahmed Elhayek
Fabrizio Nunnari
Kiran Varanasi
Kiarash Tamaddon
Alexis Héloir
Didier Stricker
|
1
|
+
PDF
Chat
|
YouTube-VOS: Sequence-to-Sequence Video Object Segmentation
|
2018
|
Ning Xu
Linjie Yang
Yuchen Fan
Shuicheng Yan
Dingcheng Yue
Yuchen Liang
Brian Price
Scott Cohen
Thomas S. Huang
|
1
|
+
|
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
|
2018
|
Jacob Devlin
Ming‐Wei Chang
Kenton Lee
Kristina Toutanova
|
1
|
+
PDF
Chat
|
GANerated Hands for Real-Time 3D Hand Tracking from Monocular RGB
|
2018
|
Franziska Mueller
Florian Bernard
Oleksandr Sotnychenko
Dushyant Mehta
Srinath Sridhar
Dan Casas
Christian Theobalt
|
1
|
+
|
Decoupled Weight Decay Regularization
|
2017
|
Ilya Loshchilov
Frank Hutter
|
1
|
+
|
ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes
|
2017
|
Angela Dai
Anne Lynn S. Chang
Manolis Savva
Maciej Halber
Thomas Funkhouser
Matthias Nießner
|
1
|
+
|
Efficient Inference in Fully Connected CRFs with Gaussian Edge Potentials
|
2012
|
Philipp Krähenbühl
Vladlen Koltun
|
1
|
+
PDF
Chat
|
Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks
|
2017
|
Jun-Yan Zhu
Taesung Park
Phillip Isola
Alexei A. Efros
|
1
|
+
PDF
Chat
|
V-Net: Fully Convolutional Neural Networks for Volumetric Medical Image Segmentation
|
2016
|
Fausto Milletarì
Nassir Navab
Seyed‐Ahmad Ahmadi
|
1
|
+
PDF
Chat
|
TossingBot: Learning to Throw Arbitrary Objects With Residual Physics
|
2020
|
Andy Zeng
Shuran Song
Johnny Lee
Alberto Rodríguez
Thomas Funkhouser
|
1
|
+
PDF
Chat
|
Generation and Comprehension of Unambiguous Object Descriptions
|
2016
|
Junhua Mao
Jonathan Huang
Alexander Toshev
Oana Camburu
Alan Yuille
Kevin Murphy
|
1
|
+
PDF
Chat
|
Focal Loss for Dense Object Detection
|
2017
|
Tsung-Yi Lin
Priya Goyal
Ross Girshick
Kaiming He
Piotr Dollár
|
1
|
+
PDF
Chat
|
Actor and Action Video Segmentation from a Sentence
|
2018
|
Kirill Gavrilyuk
Amir Ghodrati
Zhenyang Li
Cees G. M. Snoek
|
1
|
+
PDF
Chat
|
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
|
2017
|
João Carreira
Andrew Zisserman
|
1
|
+
|
Dynamic Routing Between Capsules
|
2017
|
Sara Sabour
Nicholas Frosst
Geoffrey E. Hinton
|
1
|
+
PDF
Chat
|
RGBD Datasets: Past, Present and Future
|
2016
|
Michael Firman
|
1
|
+
|
Semi-Supervised Classification with Graph Convolutional Networks
|
2016
|
Thomas Kipf
Max Welling
|
1
|
+
|
RoBERTa: A Robustly Optimized BERT Pretraining Approach
|
2019
|
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
Mike Lewis
Luke Zettlemoyer
Veselin Stoyanov
|
1
|
+
PDF
Chat
|
Segmenting Unknown 3D Objects from Real Depth Images using Mask R-CNN Trained on Synthetic Data
|
2019
|
Michael Danielczuk
Matthew Matl
Saurabh Gupta
Andrew C. Li
Andrew Lee
Jeffrey Mahler
Ken Goldberg
|
1
|
+
PDF
Chat
|
Self-Supervised Sparse-to-Dense: Self-Supervised Depth Completion from LiDAR and Monocular Camera
|
2019
|
Fangchang Ma
Guilherme Venturelli Cavalheiro
Sertaç Karaman
|
1
|
+
|
XLNet: Generalized Autoregressive Pretraining for Language Understanding
|
2019
|
Zhilin Yang
Zihang Dai
Yiming Yang
Jaime Carbonell
Ruslan Salakhutdinov
Quoc V. Le
|
1
|
+
PDF
Chat
|
ABC: A Big CAD Model Dataset for Geometric Deep Learning
|
2019
|
Sebastian Koch
Albert Matveev
Zhongshi Jiang
Francis Williams
Alexey Artemov
Evgeny Burnaev
Marc Alexa
Denis Zorin
Daniele Panozzo
|
1
|
+
|
DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter
|
2019
|
Victor Sanh
Lysandre Debut
Julien Chaumond
Thomas Wolf
|
1
|
+
|
ClearGrasp: 3D Shape Estimation of Transparent Objects for Manipulation
|
2019
|
Shreeyak S. Sajjan
Matthew R. Moore
Mike Pan
Ganesh Nagaraja
Johnny Lee
Andy Zeng
Shuran Song
|
1
|
+
PDF
Chat
|
Digging Into Self-Supervised Monocular Depth Estimation
|
2019
|
Clément Godard
Oisin Mac Aodha
Michael Firman
Gabriel Brostow
|
1
|
+
PDF
Chat
|
CSPN++: Learning Context and Resource Aware Convolutional Spatial Propagation Networks for Depth Completion
|
2020
|
Xinjing Cheng
Peng Wang
Chenye Guan
Ruigang Yang
|
1
|
+
PDF
Chat
|
Clear Grasp: 3D Shape Estimation of Transparent Objects for Manipulation
|
2020
|
Shreeyak S. Sajjan
Matthew R. Moore
Mike Pan
Ganesh Nagaraja
Johnny Lee
Andy Zeng
Shuran Song
|
1
|
+
PDF
Chat
|
End-to-End Object Detection with Transformers
|
2020
|
Nicolas Carion
Francisco Massa
Gabriel Synnaeve
Nicolas Usunier
Alexander Kirillov
Sergey Zagoruyko
|
1
|
+
PDF
Chat
|
Conditional Convolutions for Instance Segmentation
|
2020
|
Zhi Tian
Chunhua Shen
Hao Chen
|
1
|
+
|
End-to-End Video Instance Segmentation with Transformers
|
2020
|
Yuqing Wang
Zhaoliang Xu
Xinlong Wang
Chunhua Shen
Baoshan Cheng
Hao Shen
Huaxia Xia
|
1
|
+
PDF
Chat
|
Non-local Spatial Propagation Network for Depth Completion
|
2020
|
Jinsun Park
Kyungdon Joo
Zhe Hu
Chi-Kuei Liu
In So Kweon
|
1
|
+
PDF
Chat
|
Referring Segmentation in Images and Videos with Cross-Modal Self-Attention Network
|
2021
|
Linwei Ye
Mrigank Rochan
Zhi Liu
Xiaoqin Zhang
Yang Wang
|
1
|
+
PDF
Chat
|
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
|
2021
|
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng Zhang
Stephen Lin
Baining Guo
|
1
|
+
PDF
Chat
|
MDETR - Modulated Detection for End-to-End Multi-Modal Understanding
|
2021
|
Aishwarya Kamath
Mannat Singh
Yann LeCun
Gabriel Synnaeve
Ishan Misra
Nicolas Carion
|
1
|
+
PDF
Chat
|
MaX-DeepLab: End-to-End Panoptic Segmentation with Mask Transformers
|
2021
|
Huiyu Wang
Yukun Zhu
Hartwig Adam
Alan Yuille
Liang-Chieh Chen
|
1
|
+
|
Rethinking Cross-modal Interaction from a Top-down Perspective for Referring Video Object Segmentation
|
2021
|
Liang Chen
Yu Wu
Tianfei Zhou
Wenguan Wang
Zongxin Yang
Yunchao Wei
Yi Yang
|
1
|
+
PDF
Chat
|
End-to-End Video Instance Segmentation with Transformers
|
2021
|
Yuqing Wang
Zhaoliang Xu
Xinlong Wang
Chunhua Shen
Baoshan Cheng
Hao Shen
Huaxia Xia
|
1
|