+
|
Unified Representation Learning for Cross Model Compatibility
|
2020
|
Chien-Yi Wang
Ya-Liang Chang
Shang-Ta Yang
Dong Chen
Shang‐Hong Lai
|
2
|
+
PDF
Chat
|
Asymmetric metric learning for knowledge transfer
|
2021
|
Mateusz Budnik
Yannis Avrithis
|
2
|
+
PDF
Chat
|
Learning Compatible Embeddings
|
2021
|
Qiang Meng
Chixiang Zhang
Xiaoqiang Xu
Feng Zhou
|
2
|
+
PDF
Chat
|
Towards Backward-Compatible Representation Learning
|
2020
|
Yantao Shen
Yuanjun Xiong
Wei Xia
Stefano Soatto
|
2
|
+
|
Privacy-Preserving Model Upgrades with Bidirectional Compatible Training in Image Retrieval
|
2022
|
Shupeng Su
Binjie Zhang
Yixiao Ge
Xuyuan Xu
Yexin Wang
Chun Yuan
Ying Shan
|
2
|
+
PDF
Chat
|
Google Landmarks Dataset v2 – A Large-Scale Benchmark for Instance-Level Recognition and Retrieval
|
2020
|
Tobias Weyand
André Araujo
Bingyi Cao
Jack Sim
|
2
|
+
PDF
Chat
|
ArcFace: Additive Angular Margin Loss for Deep Face Recognition
|
2019
|
Jiankang Deng
Jia Guo
Niannan Xue
Stefanos Zafeiriou
|
2
|
+
|
Revisiting Oxford and Paris: Large-Scale Image Retrieval Benchmarking
|
2018
|
Filip Radenović
Ahmet İşcen
Giorgos Tolias
Yannis Avrithis
Ondřej Chum
|
2
|
+
PDF
Chat
|
CosFace: Large Margin Cosine Loss for Deep Face Recognition
|
2018
|
Hao Wang
Yitong Wang
Zheng Zhou
Xing Ji
Dihong Gong
Jingchao Zhou
Zhifeng Li
Wei Liu
|
1
|
+
PDF
Chat
|
Localizing Moments in Video with Natural Language
|
2017
|
Lisa Anne Hendricks
Oliver Wang
Eli Shechtman
Josef Šivic
Trevor Darrell
Bryan Russell
|
1
|
+
PDF
Chat
|
TS-LSTM and temporal-inception: Exploiting spatiotemporal dynamics for activity recognition
|
2018
|
Chih‐Yao Ma
Min-Hung Chen
Zsolt Kira
Ghassan AlRegib
|
1
|
+
PDF
Chat
|
End-to-End Dense Video Captioning with Masked Transformer
|
2018
|
Luowei Zhou
Yingbo Zhou
Jason J. Corso
Richard Socher
Caiming Xiong
|
1
|
+
PDF
Chat
|
To Find Where You Talk: Temporal Sentence Localization in Video with Attention Based Location Regression
|
2019
|
Yitian Yuan
Tao Mei
Wenwu Zhu
|
1
|
+
|
Attention is All you Need
|
2017
|
Ashish Vaswani
Noam Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan N. Gomez
Łukasz Kaiser
Illia Polosukhin
|
1
|
+
PDF
Chat
|
SphereFace: Deep Hypersphere Embedding for Face Recognition
|
2017
|
Weiyang Liu
Yandong Wen
Zhiding Yu
Ming Li
Bhiksha Raj
Le Song
|
1
|
+
PDF
Chat
|
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
|
2017
|
João Carreira
Andrew Zisserman
|
1
|
+
PDF
Chat
|
Neural Aggregation Network for Video Face Recognition
|
2017
|
Jiaolong Yang
Peiran Ren
Dongqing Zhang
Dong Chen
Fang Wen
Hongdong Li
Gang Hua
|
1
|
+
PDF
Chat
|
Video Action Transformer Network
|
2019
|
Rohit Girdhar
João Carreira
Carl Doersch
Andrew Zisserman
|
1
|
+
PDF
Chat
|
Fine-Tuning CNN Image Retrieval with No Human Annotation
|
2018
|
Filip Radenović
Giorgos Tolias
Ondřej Chum
|
1
|
+
PDF
Chat
|
Dense-Captioning Events in Videos
|
2017
|
Ranjay Krishna
Kenji Hata
Frederic Ren
Li Fei-Fei
Juan Carlos Niebles
|
1
|
+
PDF
Chat
|
TALL: Temporal Activity Localization via Language Query
|
2017
|
Jiyang Gao
Chen Sun
Zhenheng Yang
Ram Nevatia
|
1
|
+
|
Transformer-XL: Attentive Language Models beyond a Fixed-Length Context
|
2019
|
Zihang Dai
Zhilin Yang
Yiming Yang
Jaime Carbonell
Quoc V. Le
Ruslan Salakhutdinov
|
1
|
+
PDF
Chat
|
Read, Watch, and Move: Reinforcement Learning for Temporally Grounding Natural Language Descriptions in Videos
|
2019
|
Dongliang He
Xiang Zhao
Jizhou Huang
Fu Li
Xiao Liu
Shilei Wen
|
1
|
+
PDF
Chat
|
Semantic Conditioned Dynamic Modulation for Temporal Sentence Grounding in Videos
|
2020
|
Yitian Yuan
Lin Ma
Jingwen Wang
Wei Liu
Wenwu Zhu
|
1
|
+
PDF
Chat
|
FCOS: Fully Convolutional One-Stage Object Detection
|
2019
|
Zhi Tian
Chunhua Shen
Hao Chen
Tong He
|
1
|
+
PDF
Chat
|
Temporally Grounding Language Queries in Videos by Contextual Boundary-Aware Prediction
|
2020
|
Jingwen Wang
Lin Ma
Wenhao Jiang
|
1
|
+
PDF
Chat
|
Multi-Modality Latent Interaction Network for Visual Question Answering
|
2019
|
Peng Gao
Haoxuan You
Zhanpeng Zhang
Xiaogang Wang
Hongsheng Li
|
1
|
+
|
Universal Domain Adaptation through Self Supervision
|
2020
|
Kuniaki Saito
Donghyun Kim
Stan Sclaroff
Kate Saenko
|
1
|
+
PDF
Chat
|
Learning Spatiotemporal Features with 3D Convolutional Networks
|
2015
|
Du Tran
Lubomir Bourdev
Rob Fergus
Lorenzo Torresani
Manohar Paluri
|
1
|
+
|
A Better Use of Audio-Visual Cues: Dense Video Captioning with Bi-modal Transformer.
|
2020
|
Vladimir Iashin
Esa Rahtu
|
1
|
+
PDF
Chat
|
Fine-Grained Video-Text Retrieval With Hierarchical Graph Reasoning
|
2020
|
Shizhe Chen
Yida Zhao
Qin Jin
Qi Wu
|
1
|
+
PDF
Chat
|
Dense Regression Network for Video Grounding
|
2020
|
Runhao Zeng
Haoming Xu
Wenbing Huang
Peihao Chen
Mingkui Tan
Chuang Gan
|
1
|
+
|
3rd Place Solution to "Google Landmark Retrieval 2020"
|
2020
|
Ke Mei
Lei Li
Jinchang Xu
Yanhua Cheng
Yugeng Lin
|
1
|
+
|
1st Place Solution to Google Landmark Retrieval 2020
|
2020
|
SeungKee Jeon
|
1
|
+
|
Supporting large-scale image recognition with out-of-domain samples
|
2020
|
Christof Henkel
Philipp Singer
|
1
|
+
PDF
Chat
|
Self-supervising Fine-Grained Region Similarities for Large-Scale Image Localization
|
2020
|
Yixiao Ge
Haibo Wang
Feng Zhu
Rui Zhao
Hongsheng Li
|
1
|
+
PDF
Chat
|
Positive-Congruent Training: Towards Regression-Free Model Updates
|
2021
|
Sijie Yan
Yuanjun Xiong
Kaustav Kundu
Shuo Yang
Siqi Deng
Meng Wang
Wei Xia
Stefano Soatto
|
1
|
+
|
Hot-Refresh Model Upgrades with Regression-Alleviating Compatible Training in Image Retrieval
|
2022
|
Binjie Zhang
Yixiao Ge
Yantao Shen
Yu Li
Chun Yuan
Xuyuan Xu
Yexin Wang
Ying Shan
|
1
|
+
|
Towards Universal Backward-Compatible Representation Learning
|
2022
|
Binjie Zhang
Yixiao Ge
Yantao Shen
Shupeng Su
Fanzi Wu
Chun Yuan
Xuyuan Xu
Yexin Wang
Ying Shan
|
1
|
+
PDF
Chat
|
Learning Backward Compatible Embeddings
|
2022
|
Weihua Hu
Rajas Bansal
Kaidi Cao
Nikhil Rao
Karthik Subbian
Jure Leskovec
|
1
|
+
PDF
Chat
|
Forward Compatible Training for Large-Scale Embedding Retrieval Systems
|
2022
|
Vivek Ramanujan
Pavan Kumar Anasosalu Vasu
Ali Farhadi
Oncel Tuzel
Hadi Pouransari
|
1
|
+
PDF
Chat
|
Proposal-free Temporal Moment Localization of a Natural-Language Query in Video using Guided Attention
|
2020
|
Cristian Rodríguez-Opazo
Edison Marrese-Taylor
Fatemeh Sadat Saleh
Hongdong Li
Stephen Jay Gould
|
1
|
+
PDF
Chat
|
Deep Residual Learning for Image Recognition
|
2016
|
Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
|
1
|
+
|
Learning without Forgetting
|
2017
|
Zhizhong Li
Derek Hoiem
|
1
|
+
PDF
Chat
|
Deep Reconstruction-Classification Networks for Unsupervised Domain Adaptation
|
2016
|
Muhammad Ghifary
W. Bastiaan Kleijn
Mengjie Zhang
David Balduzzi
Wen Li
|
1
|
+
|
A Structured Self-attentive Sentence Embedding
|
2017
|
Zhouhan Lin
Minwei Feng
Cícero Nogueira dos Santos
Mo Yu
Bing Xiang
Bowen Zhou
Yoshua Bengio
|
1
|
+
PDF
Chat
|
ECO: Efficient Convolutional Network for Online Video Understanding
|
2018
|
Mohammadreza Zolfaghari
Kamaljeet Singh
Thomas Brox
|
1
|
+
|
MUREL: Multimodal Relational Reasoning for Visual Question Answering
|
2019
|
Rémi Cadène
Hedi Ben-younes
Matthieu Cord
Nicolas Thome
|
1
|
+
|
Large-scale Landmark Retrieval/Recognition under a Noisy and Diverse Dataset
|
2019
|
Kohei Ozaki
Shuhei Yokoo
|
1
|