Benjamin Elizalde
Microsoft, Carnegie Mellon University
Verified email at microsoft.com
Title
Cited by
Year
YFCC100M: The new data in multimedia research
B Thomee, DA Shamma, G Friedland, B Elizalde, K Ni, D Poland, D Borth, ...
Communications of the ACM 59 (2), 64-73, 2016
Cited by 2325*, 2016
DCASE 2017 challenge setup: Tasks, datasets and baseline system
A Mesaros, T Heittola, A Diment, B Elizalde, A Shah, E Vincent, B Raj, ...
DCASE 2017-workshop on detection and classification of acoustic scenes and …, 2017
Cited by 583, 2017
CLAP: Learning audio concepts from natural language supervision
B Elizalde, S Deshmukh, M Al Ismail, H Wang
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by 308, 2023
Sound event detection in the DCASE 2017 challenge
A Mesaros, A Diment, B Elizalde, T Heittola, E Vincent, B Raj, T Virtanen
IEEE/ACM Transactions on Audio, Speech, and Language Processing 27 (6), 992-1006, 2019
Cited by 152, 2019
Pengi: An audio language model for audio tasks
S Deshmukh, B Elizalde, R Singh, H Wang
Advances in Neural Information Processing Systems 36, 18090-18108, 2023
Cited by 84, 2023
Cross modal audio search and retrieval with joint embeddings based on text and audio
B Elizalde, S Zarar, B Raj
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Cited by 61, 2019
Experimentation on the DCASE challenge 2016: Task 1 - acoustic scene classification and Task 3 - sound event detection in real life audio
B Elizalde, A Kumar, A Shah, R Badlani, E Vincent, B Raj, I Lane
Detection and Classification of Acoustic Scenes and Events 2016, 2016
Cited by 60*, 2016
Content-based representations of audio using siamese neural networks
P Manocha, R Badlani, A Kumar, A Shah, B Elizalde, B Raj
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
Cited by 57, 2018
The placing task: A large-scale geo-estimation challenge for social-media videos and images
J Choi, B Thomee, G Friedland, L Cao, K Ni, D Borth, B Elizalde, ...
Proceedings of the 3rd ACM multimedia workshop on geotagging and its …, 2014
Cited by 55, 2014
Audio Retrieval with WavText5K and CLAP Training
S Deshmukh, B Elizalde, H Wang
Proc. INTERSPEECH 2023, 2948-2952, 2023
Cited by 47, 2023
Audio concept classification with hierarchical deep neural networks
M Ravanelli, B Elizalde, K Ni, G Friedland
2014 22nd European Signal Processing Conference (EUSIPCO), 606-610, 2014
Cited by 35, 2014
Natural language supervision for general-purpose audio representations
B Elizalde, S Deshmukh, H Wang
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
Cited by 30, 2024
AudioPairBank: towards a large-scale tag-pair-based audio content analysis
S Säger, B Elizalde, D Borth, C Schulze, B Raj, I Lane
EURASIP Journal on Audio, Speech, and Music Processing 2018, 1-12, 2018
Cited by 30*, 2018
An approach for self-training audio event detectors using web data
B Elizalde, A Shah, S Dalmia, MH Lee, R Badlani, A Kumar, B Raj, I Lane
2017 25th European Signal Processing Conference (EUSIPCO), 1863-1867, 2017
Cited by 29*, 2017
An i-vector based approach for audio scene detection
B Elizalde, H Lei, G Friedland, N Peters
IEEE AASP Challenge on Detection and Classification of Acoustic Scenes and …, 2013
Cited by 29, 2013
Sound event classification using ontology-based neural networks
A Jimenez, B Elizalde, B Raj
Proceedings of the Annual Conference on Neural Information Processing Systems 9, 2018
Cited by 26, 2018
Audio-based multimedia event detection with DNNs and sparse sampling
K Ashraf, B Elizalde, F Iandola, M Moskewicz, J Bernd, G Friedland, ...
Proceedings of the 5th ACM on International Conference on Multimedia …, 2015
Cited by 26, 2015
There is no data like less data: Percepts for video concept detection on consumer-produced media
B Elizalde, G Friedland, H Lei, A Divakaran
Proceedings of the 2012 ACM international workshop on Audio and multimedia …, 2012
Cited by 26*, 2012
The YLI-MED corpus: Characteristics, procedures, and plans
J Bernd, D Borth, B Elizalde, G Friedland, H Gallagher, L Gottlieb, A Janin, ...
arXiv preprint arXiv:1503.04250, 2015
Cited by 21, 2015
An i-vector representation of acoustic environments for audio-based video event detection on user generated content
B Elizalde, H Lei, G Friedland
2013 IEEE International Symposium on Multimedia, 114-117, 2013
Cited by 21, 2013