Voicefilter: Targeted voice separation by speaker-conditioned spectrogram masking Q Wang, H Muckenhirn, K Wilson, P Sridhar, Z Wu, J Hershey, ... arXiv preprint arXiv:1810.04826, 2018 | 439 | 2018 |
E-branchformer: Branchformer with enhanced merging for speech recognition K Kim, F Wu, Y Peng, J Pan, P Sridhar, KJ Han, S Watanabe 2022 IEEE Spoken Language Technology Workshop (SLT), 84-91, 2023 | 96 | 2023 |
Improving Keyword Spotting and Language Identification via Neural Architecture Search at Scale. H Mazzawi, X Gonzalvo, A Kracun, P Sridhar, N Subrahmanya, ... Interspeech, 1278-1282, 2019 | 45 | 2019 |
Structured pruning of self-supervised pre-trained models for speech recognition and understanding Y Peng, K Kim, F Wu, P Sridhar, S Watanabe ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 28 | 2023 |
Tuplemax loss for language identification L Wan, P Sridhar, Y Yu, Q Wang, IL Moreno ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 25 | 2019 |
A comparative study on e-branchformer vs conformer in speech recognition, translation, and understanding tasks Y Peng, K Kim, F Wu, B Yan, S Arora, W Chen, J Tang, S Shon, P Sridhar, ... arXiv preprint arXiv:2305.11073, 2023 | 15 | 2023 |
Multi-mode transformer transducer with stochastic future context K Kim, F Wu, P Sridhar, KJ Han, S Watanabe arXiv preprint arXiv:2106.09760, 2021 | 9 | 2021 |
Context-aware fine-tuning of self-supervised speech models S Shon, F Wu, K Kim, P Sridhar, K Livescu, S Watanabe ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 6 | 2023 |
Improving ASR Contextual Biasing with Guided Attention J Tang, K Kim, S Shon, F Wu, P Sridhar ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 5 | 2024 |
Voicefilter: Targeted voice separation by speaker-conditioned spectrogram masking HR Muckenhirn, IL Moreno, J Hershey, K Wilson, P Sridhar, Q Wang, ... Conference of the International Speech Communication Association, 2019 | 5 | 2019 |
Targeted voice separation by speaker conditioned on spectrogram masking Q Wang, P Sridhar, IL Moreno, H Muckenhirn US Patent 11,217,254, 2022 | 4 | 2022 |
An experimental study into spectral and geometric approaches to data clustering P Sridhar Master’s thesis, Carnegie Mellon University, 2015 | 4 | 2015 |
DiscreteSLU: A Large Language Model with Self-Supervised Discrete Speech Units for Spoken Language Understanding S Shon, K Kim, YT Hsu, P Sridhar, S Watanabe, K Livescu arXiv preprint arXiv:2406.09345, 2024 | 3 | 2024 |
Targeted voice separation by speaker conditioned on spectrogram masking Q Wang, P Sridhar, IL Moreno, H Muckenhirn US Patent 11,922,951, 2024 | 2 | 2024 |
Generative Context-Aware Fine-Tuning of Self-Supervised Speech Models S Shon, K Kim, P Sridhar, YT Hsu, S Watanabe, K Livescu ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 1 | 2024 |
Targeted voice separation by speaker conditioned on spectrogram masking Q Wang, P Sridhar, IL Moreno, H Muckenhirn US Patent App. 18/594,833, 2024 | | 2024 |
Convolution-Augmented Parameter-Efficient Fine-Tuning for Speech Recognition K Kim, S Shon, YT Hsu, P Sridhar, K Livescu, S Watanabe Proc. Interspeech 2024, 2830-2834, 2024 | | 2024 |
Training and/or using a language selection model for automatically determining language for speech recognition of spoken utterance L Wan, Y Yu, P Sridhar, IL Moreno, Q Wang US Patent 11,646,011, 2023 | | 2023 |
Stochastic future context for speech processing K Kim, F Wu, P Sridhar, KJ Han US Patent App. 17/530,139, 2022 | | 2022 |
Training and/or using a language selection model for automatically determining language for speech recognition of spoken utterance L Wan, Y Yu, P Sridhar, IL Moreno, Q Wang US Patent 11,410,641, 2022 | | 2022 |