Urmăriți
Xiong Xiao
Xiong Xiao
Principal Applied scientist, Microsoft
Adresă de e-mail confirmată pe microsoft.com
Titlu
Citat de
Citat de
Anul
Wavlm: Large-scale self-supervised pre-training for full stack speech processing
S Chen, C Wang, Z Chen, Y Wu, S Liu, Z Chen, J Li, N Kanda, T Yoshioka, ...
IEEE Journal of Selected Topics in Signal Processing 16 (6), 1505-1518, 2022
3782022
A learning-based approach to direction of arrival estimation in noisy and reverberant environments
X Xiao, S Zhao, X Zhong, DL Jones, ES Chng, H Li
2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015
2752015
Deep beamforming networks for multi-channel speech recognition
X Xiao, S Watanabe, H Erdogan, L Lu, J Hershey, ML Seltzer, G Chen, ...
2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016
1902016
Continuous speech separation: Dataset and analysis
Z Chen, T Yoshioka, L Lu, T Zhou, Z Meng, Y Luo, J Wu, X Xiao, J Li
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
1422020
Spoofing speech detection using high dimensional magnitude and phase features: the NTU approach for ASVspoof 2015 challenge.
X Xiao, X Tian, S Du, H Xu, E Chng, H Li
Interspeech, 2052-2056, 2015
1402015
Synthetic speech detection using temporal modulation feature
Z Wu, X Xiao, ES Chng, H Li
2013 IEEE international conference on acoustics, speech and signal …, 2013
1352013
Multi-channel overlapped speech recognition with location guided speech extraction network
Z Chen, X Xiao, T Yoshioka, H Erdogan, J Li, Y Gong
2018 IEEE Spoken Language Technology Workshop (SLT), 558-565, 2018
1082018
On time-frequency mask estimation for MVDR beamforming with application in robust speech recognition
X Xiao, S Zhao, DL Jones, ES Chng, H Li
2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017
952017
Normalization of the speech modulation spectra for robust speech recognition
X Xiao, ES Chng, H Li
IEEE Transactions on Audio, Speech, and Language Processing 16 (8), 1662-1674, 2008
892008
Unified architecture for multichannel end-to-end speech recognition with neural beamforming
T Ochiai, S Watanabe, T Hori, JR Hershey, X Xiao
IEEE Journal of Selected Topics in Signal Processing 11 (8), 1274-1288, 2017
872017
Recognizing overlapped speech in meetings: A multichannel separation approach using neural networks
T Yoshioka, H Erdogan, Z Chen, X Xiao, F Alleva
arXiv preprint arXiv:1810.03655, 2018
802018
Single channel speech separation with constrained utterance level permutation invariant training using grid lstm
C Xu, W Rao, X Xiao, ES Chng, H Li
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
762018
Advances in online audio-visual meeting transcription
T Yoshioka, I Abramovski, C Aksoylar, Z Chen, M David, D Dimitriadis, ...
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
712019
Computerized intelligent assistant for conferences
A Diamant, KM Ben-Dor, E Krupka, R Halaly, Y Smolin, I Gurvich, ...
US Patent 10,867,610, 2020
652020
MASS: A Malay language LVCSR corpus resource
TP Tan, X Xiao, EK Tang, ES Chng, H Li
2009 Oriental COCOSDA International Conference on Speech Database and …, 2009
592009
Developing far-field speaker system via teacher-student learning
J Li, R Zhao, Z Chen, C Liu, X Xiao, G Ye, Y Gong
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
582018
Microsoft speaker diarization system for the voxceleb speaker recognition challenge 2020
X Xiao, N Kanda, Z Chen, T Zhou, T Yoshioka, S Chen, Y Zhao, G Liu, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
572021
Single-channel speech extraction using speaker inventory and attention network
X Xiao, Z Chen, T Yoshioka, H Erdogan, C Liu, D Dimitriadis, J Droppo, ...
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
572019
Spoofing detection from a feature representation perspective
X Tian, Z Wu, X Xiao, ES Chng, H Li
2016 IEEE International conference on acoustics, speech and signal …, 2016
552016
Speaker-aware training of LSTM-RNNs for acoustic modelling
T Tan, Y Qian, D Yu, S Kundu, L Lu, KC Sim, X Xiao, Y Zhang
2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016
522016
Sistemul nu poate realiza operația în acest moment. Încercați din nou mai târziu.
Articole 1–20