Follow
Sanjeev Khudanpur
Title
Cited by
Cited by
Year
Recurrent neural network based language model.
T Mikolov, M Karafiát, L Burget, J Cernocký, S Khudanpur
Interspeech 2 (3), 1045-1048, 2010
79102010
Librispeech: an asr corpus based on public domain audio books
V Panayotov, G Chen, D Povey, S Khudanpur
2015 IEEE international conference on acoustics, speech and signal …, 2015
63682015
X-vectors: Robust dnn embeddings for speaker recognition
D Snyder, D Garcia-Romero, G Sell, D Povey, S Khudanpur
2018 IEEE international conference on acoustics, speech and signal …, 2018
30692018
Extensions of recurrent neural network language model
T Mikolov, S Kombrink, L Burget, J Černocký, S Khudanpur
2011 IEEE international conference on acoustics, speech and signal …, 2011
16722011
Audio augmentation for speech recognition.
T Ko, V Peddinti, D Povey, S Khudanpur
Interspeech 2015, 3586, 2015
13502015
A time delay neural network architecture for efficient modeling of long temporal contexts.
V Peddinti, D Povey, S Khudanpur
Interspeech, 3214-3218, 2015
13172015
Deep neural network embeddings for text-independent speaker verification.
D Snyder, D Garcia-Romero, D Povey, S Khudanpur
Interspeech 2017, 999-1003, 2017
10442017
A study on data augmentation of reverberant speech for robust speech recognition
T Ko, V Peddinti, D Povey, ML Seltzer, S Khudanpur
2017 IEEE international conference on acoustics, speech and signal …, 2017
10322017
Purely sequence-trained neural networks for ASR based on lattice-free MMI.
D Povey, V Peddinti, D Galvez, P Ghahremani, V Manohar, X Na, Y Wang, ...
Interspeech, 2751-2755, 2016
9892016
Semi-orthogonal low-rank matrix factorization for deep neural networks.
D Povey, G Cheng, Y Wang, K Li, H Xu, M Yarmohammadi, S Khudanpur
Interspeech, 3743-3747, 2018
6032018
Deep neural network-based speaker embeddings for end-to-end speaker verification
D Snyder, P Ghahremani, D Povey, D Garcia-Romero, Y Carmiel, ...
2016 IEEE spoken language technology workshop (SLT), 165-170, 2016
4372016
Jhu-isi gesture and skill assessment working set (jigsaws): A surgical activity dataset for human motion modeling
Y Gao, SS Vedula, CE Reiley, N Ahmidi, B Varadarajan, HC Lin, L Tao, ...
MICCAI workshop: M2cai 3 (2014), 3, 2014
4322014
Improving deep neural network acoustic models using generalized maxout networks
X Zhang, J Trmal, D Povey, S Khudanpur
2014 IEEE international conference on acoustics, speech and signal …, 2014
4012014
Parallel training of DNNs with natural gradient and parameter averaging
D Povey, X Zhang, S Khudanpur
arXiv preprint arXiv:1410.7455, 2014
3982014
A pitch extraction algorithm tuned for automatic speech recognition
P Ghahremani, B BabaAli, D Povey, K Riedhammer, J Trmal, ...
2014 IEEE international conference on acoustics, speech and signal …, 2014
3892014
Speaker recognition for multi-speaker conversations using x-vectors
D Snyder, D Garcia-Romero, G Sell, A McCree, D Povey, S Khudanpur
ICASSP 2019-2019 IEEE International conference on acoustics, speech and …, 2019
3622019
Developments and directions in speech recognition and understanding, Part 1 [DSP Education]
JM Baker, L Deng, J Glass, S Khudanpur, CH Lee, N Morgan, ...
IEEE Signal processing magazine 26 (3), 75-80, 2009
3622009
Highway long short-term memory rnns for distant speech recognition
Y Zhang, G Chen, D Yu, K Yao, S Khudanpur, J Glass
2016 IEEE international conference on acoustics, speech and signal …, 2016
3612016
A smorgasbord of features for statistical machine translation
FJ Och, D Gildea, S Khudanpur, A Sarkar, K Yamada, A Fraser, S Kumar, ...
Proceedings of the Human Language Technology Conference of the North …, 2004
3612004
CHiME-6 challenge: Tackling multispeaker speech recognition for unsegmented recordings
S Watanabe, M Mandel, J Barker, E Vincent, A Arora, X Chang, ...
arXiv preprint arXiv:2004.09249, 2020
3032020
The system can't perform the operation now. Try again later.
Articles 1–20