Neil Zeghidour

Cited by

	All	Since 2019
Citations	3297	3147
h-index	25	25
i10-index	29	28

Citations per year

Year	Citations
2017	21
2018	112
2019	216
2020	291
2021	384
2022	514
2023	1141
2024	581

Public access (based on funding mandates)

Available	3 articles
Not available	0 articles

Co-authors

David Grangier	Apple Machine Learning Research	Verified email at apple.com
Zalán Borsos	Google DeepMind	Verified email at google.com
Usunier Nicolas	Facebook AI Research	Verified email at fb.com
Matt Sharifi	Google	Verified email at google.com
Olivier Teboul	Therapanacea	Verified email at therapanacea.eu
Gabriel Synnaeve	Research scientist at Facebook AI Research	Verified email at fb.com
Guillaume Lample	Mistral AI	Verified email at mistral.ai
Jesse H. Engel	Google DeepMind	Verified email at google.com
Olivier Pietquin	Cohere | ex Google DeepMind (On leave - Professor at University of Lille)	Verified email at univ-lille.fr
Emmanuel Dupoux	Professor of Cognitive Psychology, Ecole des Hautes Etudes en Sciences Sociales, Paris	Verified email at ehess.fr
Andrea Agostinelli	Google	Verified email at google.com
Antoine Caillon	Google DeepMind	Verified email at google.com
Adam Roberts	Google Brain	Verified email at google.com
Timo I. Denk	Google	Verified email at google.com
Eugene Kharitonov	Google DeepMind	Verified email at google.com
Raphaël Marinier	Google AI	Verified email at google.com
Ahmed Omran	Software Engineer, Google	Verified email at google.com
Jan Skoglund	Google, LLC	Verified email at google.com
Marc'Aurelio Ranzato	DeepMind	Verified email at google.com
Antoine Bordes	Helsing	Verified email at helsing.ai

Neil Zeghidour

Kyutai

Verified email at kyutai.org

Machine Learning Speech Recognition Audio Understanding


Title	Authors	Publication	Cited by	Year
Fader networks: Manipulating images by sliding attributes	G Lample, N Zeghidour, N Usunier, A Bordes, L Denoyer, MA Ranzato	Advances in Neural Information Processing Systems, 2017	593	2017
Soundstream: An end-to-end neural audio codec	N Zeghidour, A Luebs, A Omran, J Skoglund, M Tagliasacchi	IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 495-507, 2021	328	2021
Musiclm: Generating music from text	A Agostinelli, TI Denk, Z Borsos, J Engel, M Verzetti, A Caillon, Q Huang, ...	arXiv preprint arXiv:2301.11325, 2023	306	2023
Audiolm: a language modeling approach to audio generation	Z Borsos, R Marinier, D Vincent, E Kharitonov, O Pietquin, M Sharifi, ...	IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023	286	2023
Wavesplit: End-to-end speech separation by speaker clustering	N Zeghidour, D Grangier	IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 2840-2849, 2021	256	2021
Contrastive learning of general-purpose audio representations	A Saeed, D Grangier, N Zeghidour	ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	240	2021
LEAF: A Learnable Frontend for Audio Classification	N Zeghidour, O Teboul, FC Quitry, M Tagliasacchi	ICLR 2021, 2021	138	2021
Learning Filterbanks from Raw Speech for Phone Recognition	N Zeghidour, N Usunier, I Kokkinos, T Schatz, G Synnaeve, E Dupoux	ICASSP 2018, 2017	130	2017
Fully convolutional speech recognition	N Zeghidour, Q Xu, V Liptchinsky, N Usunier, G Synnaeve, R Collobert	arXiv preprint arXiv:1812.06864, 2018	109	2018
End-to-end speech recognition from the raw waveform	N Zeghidour, N Usunier, G Synnaeve, R Collobert, E Dupoux	Interspeech 2018, 2018	108	2018
Sing: Symbol-to-instrument neural generator	A Défossez, N Zeghidour, N Usunier, L Bottou, F Bach	Advances in Neural Information Processing Systems, 2018	77	2018
Audiopalm: A large language model that can speak and listen	PK Rubenstein, C Asawaroengchai, DD Nguyen, A Bapna, Z Borsos, ...	arXiv preprint arXiv:2306.12925, 2023	72	2023
Speak, read and prompt: High-fidelity text-to-speech with minimal supervision	E Kharitonov, D Vincent, Z Borsos, R Marinier, S Girgin, O Pietquin, ...	Transactions of the Association for Computational Linguistics 11, 1703-1718, 2023	68	2023
Joint learning of speaker and phonetic similarities with siamese networks	N Zeghidour, G Synnaeve, N Usunier, E Dupoux	INTERSPEECH, 1295-1299, 2016	63	2016
General-purpose, long-context autoregressive modeling with perceiver AR	C Hawthorne, A Jaegle, C Cangea, S Borgeaud, C Nash, M Malinowski, ...	International Conference on Machine Learning, 8535-8558, 2022	55	2022
A Deep Scattering Spectrum - Deep Siamese network Pipeline For Unsupervised Acoustic Modeling	N Zeghidour, G Synnaeve, M Versteegh, E Dupoux	2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016	49	2016
Learning strides in convolutional neural networks	R Riad, O Teboul, D Grangier, N Zeghidour	arXiv preprint arXiv:2202.01653, 2022	42	2022
To reverse the gradient or not: An empirical comparison of adversarial and multi-task learning in speech recognition	Y Adi, N Zeghidour, R Collobert, N Usunier, V Liptchinsky, G Synnaeve	ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	41	2019
Multi-instrument music synthesis with spectrogram diffusion	C Hawthorne, I Simon, A Roberts, N Zeghidour, J Gardner, E Manilow, ...	arXiv preprint arXiv:2206.05408, 2022	39	2022
Soundstorm: Efficient parallel audio generation	Z Borsos, M Sharifi, D Vincent, E Kharitonov, N Zeghidour, M Tagliasacchi	arXiv preprint arXiv:2305.09636, 2023	36	2023
