[Photo: Me in Montreal, December 2015.]

Hi, I’m Jon Gauthier. I’m a postdoctoral scholar working in the Chang Lab at the University of California, San Francisco. I build computational models of how people understand and produce language, and use them to explain human behavior and brain activity, combining methods from artificial intelligence, linguistics, neuroscience, and philosophy.

I completed my Ph.D. in cognitive science at the MIT Department of Brain and Cognitive Sciences in the Computational Psycholinguistics Laboratory. You can find my full CV here.

I had the good fortune to begin research at a young age, thanks to the generosity and support of my advisors and academic community. I’m interested in helping ambitious undergraduate students likewise break into the world of academia. Please get in touch!

Research

(Find me on Google Scholar for an up-to-date list.)

The neural dynamics of word recognition and integration.
@inproceedings{gauthier-levy-2023-neural,
  title = "The neural dynamics of word recognition and integration",
  author = "Gauthier, Jon and Levy, Roger",
  editor = "Bouamor, Houda and Pino, Juan and Bali, Kalika",
  booktitle = "Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing",
  month = dec,
  year = "2023",
  address = "Singapore",
  publisher = "Association for Computational Linguistics",
  url = "https://aclanthology.org/2023.emnlp-main.62",
  doi = "10.18653/v1/2023.emnlp-main.62",
  pages = "980--995",
  abstract = "Listeners recognize and integrate words in rapid and noisy everyday speech by combining expectations about upcoming content with incremental sensory evidence. We present a computational model of word recognition which formalizes this perceptual process in Bayesian decision theory. We fit this model to explain scalp EEG signals recorded as subjects passively listened to a fictional story, revealing both the dynamics of the online auditory word recognition process and the neural correlates of the recognition and integration of words. The model reveals distinct neural processing of words depending on whether or not they can be quickly recognized. While all words trigger a neural response characteristic of probabilistic integration --- voltage modulations predicted by a word's surprisal in context --- these modulations are amplified for words which require more than roughly 150 ms of input to be recognized. We observe no difference in the latency of these neural responses according to words' recognition times. Our results support a two-part model of speech comprehension, combining an eager and rapid process of word recognition with a temporally independent process of word integration. However, we also developed alternative models of the scalp EEG signal not incorporating word recognition dynamics which showed similar performance improvements. We discuss potential future modeling steps which may help to separate these hypotheses.",
}
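The central predictor in this model is surprisal: the negative log probability of a word given its context. For illustration only (this is a minimal sketch, not the paper's EEG analysis pipeline, and the choice of GPT-2 via the Hugging Face transformers library is mine), per-token surprisal can be computed like this:

    import torch
    from transformers import GPT2LMHeadModel, GPT2TokenizerFast

    tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
    model = GPT2LMHeadModel.from_pretrained("gpt2")
    model.eval()

    def token_surprisals(text):
        """Return (token, surprisal) pairs, in bits, for each token after the first."""
        ids = tokenizer(text, return_tensors="pt").input_ids
        with torch.no_grad():
            logits = model(ids).logits
        # Logits at position i predict token i+1, so align predictions with targets.
        log_probs = torch.log_softmax(logits[0, :-1], dim=-1)
        targets = ids[0, 1:]
        nats = -log_probs[torch.arange(targets.shape[0]), targets]
        bits = nats / torch.log(torch.tensor(2.0))  # convert nats to bits
        return list(zip(tokenizer.convert_ids_to_tokens(targets.tolist()), bits.tolist()))

    for token, surprisal in token_surprisals("The children went outside to play."):
        print(f"{token:>12} {surprisal:6.2f}")

When the tokenizer splits a word into multiple tokens, the word's surprisal is the sum of its tokens' surprisals.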
The neural dynamics of auditory word recognition and integration.
@inproceedings{gauthier2023neural,
  author = {Gauthier, Jon and Levy, Roger},
  title = {The neural dynamics of auditory word recognition and integration},
  booktitle = {Proceedings of the 7th Conference on Cognitive Computational Neuroscience},
  month = {August},
  year = {2023},
  address = {Oxford, England},
}
Probing self-supervised speech models for phonetic and phonemic information: a case study in aspiration.
@inproceedings{martin23_interspeech,
  author = {Kinan Martin and Jon Gauthier and Canaan Breiss and Roger Levy},
  title = {Probing self-supervised speech models for phonetic and phonemic information: a case study in aspiration},
  year = {2023},
  booktitle = {Proc. INTERSPEECH 2023},
  pages = {251--255},
  doi = {10.21437/Interspeech.2023-2359},
}
Language model acceptability judgements are not always robust to context.
Outstanding Paper award (top ~2% of accepted papers)
@inproceedings{sinha-etal-2023-language,
  title = "Language model acceptability judgements are not always robust to context",
  author = "Sinha, Koustuv and Gauthier, Jon and Mueller, Aaron and Misra, Kanishka and Fuentes, Keren and Levy, Roger and Williams, Adina",
  booktitle = "Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)",
  month = jul,
  year = "2023",
  address = "Toronto, Canada",
  publisher = "Association for Computational Linguistics",
  url = "https://aclanthology.org/2023.acl-long.333",
  pages = "6043--6063",
  abstract = "Targeted syntactic evaluations of language models ask whether models show stable preferences for syntactically acceptable content over minimal-pair unacceptable inputs. Our best syntactic evaluation datasets, however, provide substantially less linguistic context than models receive during pretraining. This mismatch raises an important question: how robust are models' syntactic judgements across different contexts? In this paper, we vary the input contexts based on: length, the types of syntactic phenomena it contains, and whether or not there are grammatical violations. We find that model judgements are generally robust when placed in randomly sampled linguistic contexts, but are unstable when contexts match the test stimuli in syntactic structure. Among all tested models (GPT-2 and five variants of OPT), we find that model performance is affected when we provided contexts with matching syntactic structure: performance significantly improves when contexts are acceptable, and it significantly declines when they are unacceptable. This effect is amplified by the length of the context, except for unrelated inputs. We show that these changes in model performance are not explainable by acceptability-preserving syntactic perturbations. This sensitivity to highly specific syntactic features of the context can only be explained by the models' implicit in-context learning abilities.",
}
SyntaxGym: An online platform for targeted syntactic evaluation of language models.
@inproceedings{gauthier-etal-2020-syntaxgym,
  title = "{S}yntax{G}ym: An Online Platform for Targeted Evaluation of Language Models",
  author = "Gauthier, Jon and Hu, Jennifer and Wilcox, Ethan and Qian, Peng and Levy, Roger",
  editor = "Celikyilmaz, Asli and Wen, Tsung-Hsien",
  booktitle = "Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations",
  month = jul,
  year = "2020",
  address = "Online",
  publisher = "Association for Computational Linguistics",
  url = "https://aclanthology.org/2020.acl-demos.10",
  doi = "10.18653/v1/2020.acl-demos.10",
  pages = "70--76",
  abstract = "Targeted syntactic evaluations have yielded insights into the generalizations learned by neural network language models. However, this line of research requires an uncommon confluence of skills: both the theoretical knowledge needed to design controlled psycholinguistic experiments, and the technical proficiency needed to train and deploy large-scale language models. We present SyntaxGym, an online platform designed to make targeted evaluations accessible to both experts in NLP and linguistics, reproducible across computing environments, and standardized following the norms of psycholinguistic experimental design. This paper releases two tools of independent value for the computational linguistics community: 1. A website, syntaxgym.org, which centralizes the process of targeted syntactic evaluation and provides easy tools for analysis and visualization; 2. Two command-line tools, `syntaxgym` and `lm-zoo`, which allow any user to reproduce targeted syntactic evaluations and general language model inference on their own machine.",
}
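At its core, a targeted syntactic evaluation of the kind SyntaxGym standardizes compares a model's surprisal on a minimal pair. The `syntaxgym` and `lm-zoo` tools implement this properly (see syntaxgym.org for their actual interfaces); the sketch below only illustrates the underlying comparison, reusing token_surprisals from the snippet above and a subject-verb agreement pair of my own choosing:

    def sentence_surprisal(text):
        """Total surprisal of a sentence in bits; lower means more probable."""
        return sum(surprisal for _, surprisal in token_surprisals(text))

    # The model "passes" this item if it finds the grammatical variant
    # more probable (i.e., less surprising) than the ungrammatical one.
    grammatical = "The keys to the cabinet are on the table."
    ungrammatical = "The keys to the cabinet is on the table."
    print(sentence_surprisal(grammatical) < sentence_surprisal(ungrammatical))

A real test suite aggregates this comparison over many items per syntactic phenomenon, and typically compares surprisal in a critical region (here, the verb) rather than over the whole sentence.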
A systematic assessment of syntactic generalization in neural language models.
@inproceedings{hu2020systematic,
  author = {Hu, Jennifer and Gauthier, Jon and Qian, Peng and Wilcox, Ethan and Levy, Roger},
  title = {A systematic assessment of syntactic generalization in neural language models},
  booktitle = {Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics},
  year = {2020},
}
On the predictive power of neural language models for human real-time comprehension behavior.
@inproceedings{wilcox-etal:2020-on-the-predictive-power,
  author = {Wilcox, Ethan Gotlieb and Gauthier, Jon and Hu, Jennifer and Qian, Peng and Levy, Roger P.},
  title = {On the Predictive Power of Neural Language Models for Human Real-Time Comprehension Behavior},
  booktitle = {Proceedings of the 42nd Annual Meeting of the Cognitive Science Society},
  year = {2020},
  pages = {1707--1713},
}
From mental representations to neural codes: A multilevel approach.
@article{gauthier2019from,
  author = {Gauthier, Jon and Loula, João and Pollock, Eli and Wilson, Tyler Brooke and Wong, Catherine},
  title = {From mental representations to neural codes: A multilevel approach},
  journal = {Behavioral and Brain Sciences},
  publisher = {Cambridge University Press},
  volume = {42},
  pages = {e228},
  year = {2019},
  doi = {10.1017/S0140525X19001390},
}
Linking human and artificial neural representations of language.
@inproceedings{gauthier2019linking,
  author = {Gauthier, Jon and Levy, Roger P.},
  title = {Linking human and artificial neural representations of language},
  booktitle = {Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing},
  month = {November},
  year = {2019},
  address = {Hong Kong},
  publisher = {Association for Computational Linguistics},
}
A rational model of syntactic bootstrapping.
@inproceedings{gauthier2019rational,
  author = {Gauthier, Jon and Levy, Roger and Tenenbaum, Joshua B.},
  title = {A rational model of syntactic bootstrapping},
  booktitle = {Proceedings of the 41st Annual Meeting of the Cognitive Science Society},
  month = {July},
  year = {2019},
  address = {Montreal, Canada},
}
Query-guided visual search.
Does the brain represent words? An evaluation of brain decoding studies of language understanding.
@inproceedings{gauthier2018brain,
  author = {Gauthier, Jon and Ivanova, Anna},
  title = {Does the brain represent words? An evaluation of brain decoding studies of language understanding},
  booktitle = {Proceedings of the 2nd Conference on Cognitive Computational Neuroscience},
  month = {September},
  year = {2018},
  address = {Philadelphia, Pennsylvania},
}
Word learning and the acquisition of syntactic–semantic overhypotheses.
@inproceedings{gauthier2018word,
  author = {Gauthier, Jon and Levy, Roger and Tenenbaum, Joshua B.},
  title = {Word learning and the acquisition of syntactic--semantic overhypotheses},
  booktitle = {Proceedings of the 40th Annual Meeting of the Cognitive Science Society},
  month = {July},
  year = {2018},
  address = {Madison, Wisconsin},
}
Are distributional representations ready for the real world? Evaluating word vectors for grounded perceptual meaning.
@misc{1705.11168,
  author = {Li Lucy and Jon Gauthier},
  title = {Are distributional representations ready for the real world? Evaluating word vectors for grounded perceptual meaning},
  year = {2017},
  eprint = {arXiv:1705.11168},
}
A paradigm for situated and goal-driven language learning.
@misc{1610.03585,
  author = {Jon Gauthier and Igor Mordatch},
  title = {A Paradigm for Situated and Goal-Driven Language Learning},
  year = {2016},
  eprint = {arXiv:1610.03585},
}
A fast unified model for parsing and sentence understanding.
@inproceedings{bowman2016fast,
  author = {Bowman, Samuel R. and Gauthier, Jon and Rastogi, Abhinav and Gupta, Raghav and Manning, Christopher D. and Potts, Christopher},
  title = {A Fast Unified Model for Parsing and Sentence Understanding},
  booktitle = {Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)},
  month = {August},
  year = {2016},
  address = {Berlin, Germany},
  publisher = {Association for Computational Linguistics},
  pages = {1466--1477},
  url = {http://www.aclweb.org/anthology/P16-1139},
}
Conditional generative adversarial networks for convolutional face generation.
@techreport{gauthier2015conditional,
  author = {Gauthier, Jon},
  title = {Conditional generative adversarial networks for convolutional face generation},
  year = {2015},
}
Exploiting long-distance context in transition-based dependency parsing with recurrent neural networks.

Currently reading

The Information: A History, a Theory, a Flood
tagged: mind-tickling and currently-reading
The Haskell Road to Logic, Maths and Programming
tagged: programming, haskell, functional-programming, math, mind-tickling, ...
Learning J
tagged: currently-reading and programming
On LISP: Advanced Techniques for Common LISP
tagged: currently-reading, lisp, and programming