+
|
Deep Speaker: an End-to-End Neural Speaker Embedding System
|
2017
|
Chao Li
Xiaokong Ma
Bing Jiang
Xiangang Li
Xuewei Zhang
Xiao Liu
Ying Cao
Ajay Kannan
Zhenyao Zhu
|
11
|
+
PDF
Chat
|
The NIST 2010 speaker recognition evaluation
|
2010
|
Alvin F. Martín
Craig S. Greenberg
|
10
|
+
PDF
Chat
|
Deep Residual Learning for Image Recognition
|
2016
|
Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
|
7
|
+
PDF
Chat
|
A Simple Model for Detection of Rare Sound Events
|
2018
|
Weiran Wang
Chieh-Chi Kao
Chao Wang
|
5
|
+
PDF
Chat
|
R-CRNN: Region-based Convolutional Recurrent Neural Network for Audio Event Detection
|
2018
|
Chieh-Chi Kao
Weiran Wang
Ming Sun
Chao Wang
|
5
|
+
PDF
Chat
|
Stacked Hourglass Networks for Human Pose Estimation
|
2016
|
Alejandro Newell
Kaiyu Yang
Jia Deng
|
5
|
+
|
In Defense of the Triplet Loss for Person Re-Identification
|
2017
|
Alexander Hermans
Lucas Beyer
Bastian Leibe
|
3
|
+
PDF
Chat
|
End-to-end text-dependent speaker verification
|
2016
|
Georg Heigold
Ignacio López Moreno
Samy Bengio
Noam Shazeer
|
3
|
+
PDF
Chat
|
Deep Speaker Feature Learning for Text-Independent Speaker Verification
|
2017
|
Lantian Li
Yixiang Chen
Ying Shi
Zhiyuan Tang
Dong Wang
|
3
|
+
|
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
|
2015
|
Sergey Ioffe
Christian Szegedy
|
3
|
+
|
Attention Is All You Need
|
2017
|
Ashish Vaswani
Noam Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan N. Gomez
Łukasz Kaiser
Illia Polosukhin
|
2
|
+
PDF
Chat
|
6-DoF object pose from semantic keypoints
|
2017
|
Georgios Pavlakos
Xiaowei Zhou
Aaron Chan
Konstantinos G. Derpanis
Kostas Daniilidis
|
2
|
+
PDF
Chat
|
Additive Margin Softmax for Face Verification
|
2018
|
Wang Feng
Jian Cheng
Weiyang Liu
Haijun Liu
|
2
|
+
PDF
Chat
|
Unsupervised Detection of Anomalous Sound Based on Deep Learning and the Neyman–Pearson Lemma
|
2018
|
Yuma Koizumi
Shoichiro Saito
Hisashi Uematsu
Yuta Kawachi
Noboru Harada
|
2
|
+
PDF
Chat
|
Learning How to Listen: A Temporal-Frequential Attention Model for Sound Event Detection
|
2019
|
Yuhan Shen
Kexin He
Wei-Qiang Zhang
|
2
|
+
PDF
Chat
|
CornerNet: Detecting Objects as Paired Keypoints
|
2018
|
Hei Law
Jia Deng
|
2
|
+
PDF
Chat
|
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
|
2016
|
Shaoqing Ren
Kaiming He
Ross Girshick
Jian Sun
|
2
|
+
PDF
Chat
|
Unifying Isolated and Overlapping Audio Event Detection with Multi-label Multi-task Convolutional Recurrent Neural Networks
|
2019
|
Huy Phan
Oliver Y. Chén
Philipp Koch
Lam Pham
Ian McLoughlin
Alfred Mertins
Maarten De Vos
|
2
|
+
PDF
Chat
|
SphereFace: Deep Hypersphere Embedding for Face Recognition
|
2017
|
Weiyang Liu
Yandong Wen
Zhiding Yu
Ming Li
Bhiksha Raj
Le Song
|
2
|
+
PDF
Chat
|
Vehicle Pose and Shape Estimation Through Multiple Monocular Vision
|
2018
|
Wenhao Ding
Shuaijun Li
Guilin Zhang
Xiangyu Lei
Huihuan Qian
|
2
|
+
PDF
Chat
|
What makes audio event detection harder than classification?
|
2017
|
Huy Phan
Philipp Koch
Fabrice Katzberg
Marco Maaß
Radoslaw Mazur
Ian McLoughlin
Alfred Mertins
|
2
|
+
PDF
Chat
|
Large-Scale Weakly Supervised Audio Classification Using Gated Convolutional Neural Network
|
2018
|
Yong Xu
Qiuqiang Kong
Wenwu Wang
Mark D. Plumbley
|
2
|
+
PDF
Chat
|
An End-to-End Trainable Neural Network for Image-Based Sequence Recognition and Its Application to Scene Text Recognition
|
2016
|
Baoguang Shi
Xiang Bai
Cong Yao
|
2
|
+
PDF
Chat
|
Deep multimodal learning for Audio-Visual Speech Recognition
|
2015
|
Youssef Mroueh
Etienne Marcheret
Vaibhava Goel
|
2
|
+
PDF
Chat
|
Rethinking Feature Distribution for Loss Functions in Image Classification
|
2018
|
Weitao Wan
Yuanyi Zhong
Tianpeng Li
Jiansheng Chen
|
2
|
+
PDF
Chat
|
Residual Attention Network for Image Classification
|
2017
|
Fei Wang
Mengqing Jiang
Chen Qian
Shuo Yang
Cheng Li
Honggang Zhang
Xiaogang Wang
Xiaoou Tang
|
2
|
+
PDF
Chat
|
Effective Approaches to Attention-based Neural Machine Translation
|
2015
|
Thang Luong
Hieu Pham
Christopher D. Manning
|
2
|
+
PDF
Chat
|
Generalized End-to-End Loss for Speaker Verification
|
2018
|
Li Wan
Quan Wang
Alan Papir
Ignacio López Moreno
|
2
|
+
|
MUSAN: A Music, Speech, and Noise Corpus
|
2015
|
David Snyder
Guoguo Chen
Daniel Povey
|
2
|
+
|
Sound Event Detection in Multichannel Audio Using Spatial and Harmonic Features
|
2017
|
Sharath Adavanne
Giambattista Parascandolo
Pasi Pertilä
Toni Heittola
Tuomas Virtanen
|
2
|
+
|
A report on sound event detection with different binaural features
|
2017
|
Sharath Adavanne
Tuomas Virtanen
|
2
|
+
PDF
Chat
|
CosFace: Large Margin Cosine Loss for Deep Face Recognition
|
2018
|
Hao Wang
Yitong Wang
Zheng Zhou
Xing Ji
Dihong Gong
Jingchao Zhou
Zhifeng Li
Wei Liu
|
2
|
+
|
DNN and CNN with Weighted and Multi-task Loss Functions for Audio Event Detection
|
2017
|
Huy Phan
Martin Krawczyk-Becker
Timo Gerkmann
Alfred Mertins
|
2
|
+
|
Learning How to Listen: A Temporal-Frequential Attention Model for Sound Event Detection
|
2018
|
Yuhan Shen
Kexin He
Wei-Qiang Zhang
|
2
|
+
PDF
Chat
|
Recurrent neural networks for polyphonic sound event detection in real life recordings
|
2016
|
Giambattista Parascandolo
Heikki Huttunen
Tuomas Virtanen
|
2
|
+
|
MelNet: A Generative Model for Audio in the Frequency Domain
|
2019
|
Sean Vasquez
Mike Lewis
|
2
|
+
|
How to Improve Your Speaker Embeddings Extractor in Generic Toolkits
|
2018
|
Hossein Zeinali
Lukáš Burget
Johan Rohdin
Themos Stafylakis
Jaň Černocký
|
2
|
+
PDF
Chat
|
End-to-end attention-based large vocabulary speech recognition
|
2016
|
Dzmitry Bahdanau
Jan Chorowski
Dmitriy Serdyuk
Philémon Brakel
Yoshua Bengio
|
2
|
+
PDF
Chat
|
Ring Loss: Convex Feature Normalization for Face Recognition
|
2018
|
Yutong Zheng
Dipan K. Pal
Marios Savvides
|
2
|
+
PDF
Chat
|
Cascaded Pyramid Network for Multi-person Pose Estimation
|
2018
|
Yilun Chen
Zhicheng Wang
Yuxiang Peng
Zhiqiang Zhang
Gang Yu
Jian Sun
|
2
|
+
|
Learning towards Minimum Hyperspherical Energy
|
2018
|
Weiyang Liu
Rongmei Lin
Zhen Liu
Lixin Liu
Zhiding Yu
Bo Dai
Le Song
|
2
|
+
PDF
Chat
|
Sampling Matters in Deep Embedding Learning
|
2017
|
Chao-Yuan Wu
R. Manmatha
Alexander J. Smola
Philipp Krähenbühl
|
2
|
+
PDF
Chat
|
Conversational Analysis Using Utterance-level Attention-based Bidirectional Recurrent Neural Networks
|
2018
|
Chandrakant Bothe
Sven Magg
Cornelius Weber
Stefan Wermter
|
2
|
+
|
Disentangling by Partitioning: A Representation Learning Framework for Multimodal Sensory Data.
|
2018
|
Wei-Ning Hsu
James Glass
|
2
|
+
|
Large-Margin Softmax Loss for Convolutional Neural Networks
|
2016
|
Weiyang Liu
Yandong Wen
Zhiding Yu
Meng Yang
|
2
|
+
PDF
Chat
|
TristouNet: Triplet loss for speaker turn embedding
|
2017
|
Hervé Bredin
|
2
|
+
PDF
Chat
|
VoxCeleb2: Deep Speaker Recognition
|
2018
|
Joon Son Chung
Arsha Nagrani
Andrew Zisserman
|
2
|
+
PDF
Chat
|
End-to-End attention based text-dependent speaker verification
|
2016
|
Shi-Xiong Zhang
Zhuo Chen
Yong Zhao
Jinyu Li
Yifan Gong
|
2
|
+
PDF
Chat
|
Convolutional Recurrent Neural Networks for Polyphonic Sound Event Detection
|
2017
|
Emre Çakır
Giambattista Parascandolo
Toni Heittola
Heikki Huttunen
Tuomas Virtanen
|
2
|
+
|
Deep Factorization for Speech Signal
|
2017
|
Dong Wang
Lantian Li
Ying Shi
Yixiang Chen
Zhiyuan Tang
|
1
|