Tal Linzen

2025

Jackson Petty, Michael Y. Hu, Wentao Wang, Shauli Ravfogel, William Merrill & Tal Linzen. RELIC: Evaluating Compositional Instruction Following via Language Recognition. []

Linlu Qiu, Fei Sha, Kelsey Allen, Yoon Kim, Tal Linzen & Sjoerd van Steenkiste. Bayesian Teaching Enables Probabilistic Reasoning in Large Language Models. []

Wentao Wang, Guangyuan Jiang, Tal Linzen & Brenden M. Lake. Rapid Word Learning Through Meta In-Context Learning. []

Anastasia Kobzeva, Suhas Arehalli, Tal Linzen & Dave Kush. Learning Filler-Gap Dependencies with Neural Language Models: Testing Island Sensitivity in Norwegian and English. Journal of Memory and Language. [] []

Michael Y. Hu, Jackson Petty, Chuan Shi, William Merrill & Tal Linzen. Between Circuits and Chomsky: Pre-pretraining on Formal Languages Imparts Linguistic Biases. ACL []

Lindia Tjuatja, Graham Neubig, Tal Linzen & Sophie Hao. What Goes Into a LM Acceptability Judgment? Rethinking the Impact of Frequency and Length. NAACL. [] [] []

Jackson Petty, Sjoerd van Steenkiste & Tal Linzen. How Does Code Pretraining Affect Language Model Task Performance? TMLR. [] []

Ethan Wilcox, Michael Y. Hu, Aaron Mueller, Alex Warstadt, Leshem Choshen, Chengxu Zhuang, Adina Williams, Ryan Cotterell & Tal Linzen. Bigger is not always better: The importance of human-scale language modeling for psycholinguistics. Journal of Memory and Language. [] []

Cara Leong & Tal Linzen. Testing learning hypotheses using neural networks by manipulating learning data. []

2024

Linlu Qiu, Fei Sha, Kelsey R Allen, Yoon Kim, Tal Linzen & Sjoerd van Steenkiste. Can Language Models Perform Implicit Bayesian Inference Over User Preference States? NeurIPS System 2 Reasoning At Scale Workshop. []

Michael Y. Hu, Aaron Mueller, Candace Ross, Adina Williams, Tal Linzen, Chengxu Zhuang, Ryan Cotterell, Leshem Choshen, Alex Warstadt, & Ethan Gotlieb Wilcox. Findings of the Second BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora. The 2nd BabyLM Challenge at CoNLL. []

Grusha Prasad & Tal Linzen. SPAWNing Structural Priming Predictions from a Cognitively Motivated Parser. CoNLL. [] []

William Merrill, Zhaofeng Wu, Norihito Naka, Yoon Kim & Tal Linzen. Can You Learn Semantics Through Next-Word Prediction? The Case of Entailment. ACL Findings. [] [] []

Matthew Mandelkern & Tal Linzen (2024). Do language models' words refer? Computational Linguistics. []

Suhas Arehalli & Tal Linzen (2024). Neural networks as cognitive models of the processing of syntactic constraints. Open Mind. [] []

Aaron Mueller, Albert Webson, Jackson Petty & Tal Linzen (2024). In-context Learning Generalizes, But Not Always Robustly: The Case of Syntax. NAACL. [] []

Tiwalayo Eisape, MH Tessler, Ishita Dasgupta, Fei Sha, Sjoerd van Steenkiste & Tal Linzen (2024). A Systematic Comparison of Syllogistic Reasoning in Humans and Language Models. NAACL. [] []

Jackson Petty, Sjoerd van Steenkiste, Ishita Dasgupta, Fei Sha, Dan Garrette & Tal Linzen (2024). The Impact of Depth on Compositional Generalization in Transformer Language Models. NAACL. [] []

Kuan-Jung Huang, Suhas Arehalli, Mari Kugemoto, Christian Muxica, Grusha Prasad, Brian Dillon & Tal Linzen (2024). Large-scale benchmark yields no evidence that language model surprisal explains syntactic disambiguation difficulty. Journal of Memory and Language. [] []

2023

Alex Warstadt, Aaron Mueller, Leshem Choshen, Ethan Wilcox, Chengxu Zhuang, Juan Ciro, Rafael Mosquera, Bhargavi Paranjabe, Adina Williams, Tal Linzen & Ryan Cotterell (2023). Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora. Proceedings of the BabyLM Challenge. []

Sophie Hao & Tal Linzen (2023). Verb Conjugation in Transformers Is Determined by Linear Encodings of Subject Number. Findings of EMNLP. [] [] []

Bingzhi Li, Lucia Donatelli, Alexander Koller, Tal Linzen, Yuekun Yao & Najoung Kim (2023). SLOG: A Structural Generalization Benchmark for Semantic Parsing. EMNLP. [] [] []

William Timkey & Tal Linzen (2023). A Language Model with Limited Memory Capacity Captures Interference in Human Sentence Processing. Findings of EMNLP. []

Aaron Mueller & Tal Linzen (2023). How to Plant Trees in Language Models: Data and Architectural Effects on the Emergence of Syntactic Inductive Biases. ACL. [] []

Aditya Yedetore, Tal Linzen, Robert Frank & R. Thomas McCoy (2023). How poor is the stimulus? Evaluating hierarchical generalization in neural networks trained on child-directed speech. ACL. [] [] []

R. Thomas McCoy, Paul Smolensky, Tal Linzen, Jianfeng Gao & Asli Celikyilmaz (2023). How much do language models copy from their training data? Evaluating linguistic novelty in text generation using RAVEN. TACL. [] [] []

Cara Leong & Tal Linzen (2023). Language models can learn exceptions to syntactic rules. Society for Computation in Linguistics. [] [] []

Anastasia Kobzeva, Suhas Arehalli, Tal Linzen & Dave Kush (2023). Neural Networks Can Learn Patterns of Island-insensitivity in Norwegian. Society for Computation in Linguistics. [] [] []

2022

William Merrill, Alex Warstadt & Tal Linzen (2022). Entailment semantics can be extracted from an ideal language model. CoNLL. [] [] []

Aaron Mueller, Yu Xia & Tal Linzen (2022). Causal analysis of syntactic agreement neurons in multilingual language models. CoNLL. [] [] []

Suhas Arehalli, Brian Dillon & Tal Linzen (2022). Syntactic surprisal from neural models predicts, but underestimates, human processing difficulty from syntactic ambiguities. CoNLL. [] [] []

Kristijan Armeni, Christopher Honey & Tal Linzen (2022). Characterizing verbatim short-term memory in neural language models. CoNLL. [] [] [

Nouha Dziri, Hannah Rashkin, Tal Linzen & David Reitter (2022). Evaluating attribution in dialogue systems: the BEGIN benchmark. TACL. [] [] []

Anastasia Kobzeva, Suhas Arehalli, Tal Linzen & Dave Kush (2022). LSTMs can learn basic wh- and relative clause dependencies in Norwegian. Cognitive Science Society. [] []

Sebastian Schuster & Tal Linzen (2022). When a sentence does not introduce a discourse entity, Transformer-based models still often refer to it. NAACL. [] [] []

Linlu Qiu, Peter Shaw, Panupong Pasupat, Paweł Krzysztof Nowak, Tal Linzen, Fei Sha, Kristina Toutanova (2022). Improving compositional generalization with latent structure and data augmentation. NAACL. [] [] []

Aaron Mueller, Robert Frank, Tal Linzen, Luheng Wang & Sebastian Schuster (2022). Coloring the blank slate: Pre-training imparts a hierarchical inductive bias to sequence-to-sequence models. Findings of ACL. [] [] []

Thibault Sellam, Steve Yadlowsky, Jason Wei, Naomi Saphra, Alexander D'Amour, Tal Linzen, Jasmijn Bastings, Iulia Turc, Jacob Eisenstein, Dipanjan Das, Ian Tenney & Ellie Pavlick (2022). The MultiBERTs: BERT reproductions for robustness analysis. ICLR. []

2021

Grusha Prasad & Tal Linzen (2021). Rapid syntactic adaptation in self-paced reading: detectable, but only with many participants. Journal of Experimental Psychology: Learning, Memory, and Cognition. [] [] []

Jason Wei, Dan Garrette, Tal Linzen & Ellie Pavlick (2021). Frequency effects on syntactic rule learning in Transformers. EMNLP. [] []

Alicia Parrish, William Huang, Omar Agha, Soo-Hwan Lee, Nikita Nangia, Alex Warstadt, Karmanya Aggarwal, Emily Allaway, Tal Linzen & Samuel R. Bowman (2021). Does putting a linguist in the loop improve NLU data collection? Findings of EMNLP. [] []

Alicia Parrish*, Sebastian Schuster*, Alex Warstadt*, Omar Agha, Soo-Hwan Lee, Zhuoye Zhao, Samuel R. Bowman & Tal Linzen (2021). NOPE: A corpus of naturally-occurring presuppositions in English. CoNLL. [] []

Shauli Ravfogel*, Grusha Prasad*, Tal Linzen & Yoav Goldberg (2021). Counterfactual interventions reveal the causal effect of relative clause representations on agreement prediction. CoNLL. [] []

Laura Aina & Tal Linzen (2021). The language model understood the prompt was ambiguous: probing syntactic uncertainty through generation. BlackboxNLP. [] []

Matthew Finlayson, Aaron Mueller, Stuart Shieber, Sebastian Gehrmann, Tal Linzen & Yonatan Belinkov (2021). Causal analysis of syntactic agreement mechanisms in neural language models. ACL. [] []

Marten van Schijndel & Tal Linzen (2021). Single-stage prediction models do not explain the magnitude of syntactic disambiguation difficulty. Cognitive Science. [] []

Charles Lovering, Rohan Jha, Tal Linzen & Ellie Pavlick (2021). Predicting inductive biases of pre-trained models. ICLR. [] [] [bib]

Karl Mulligan, Robert Frank & Tal Linzen (2021). Structure here, bias there: Hierarchical generalization by jointly learning syntactic transformations. Society for Computation in Linguistics. [bib] [] []

Tal Linzen & Marco Baroni (2021). Syntactic structure from deep learning. Annual Reviews of Linguistics. [bib] [] []

2020

Paul Soulos, R. Thomas McCoy, Tal Linzen & Paul Smolensky (2020). Discovering the compositional structure of vector representations with role learning networks. BlackboxNLP. [] []

R. Thomas McCoy, Junghyun Min & Tal Linzen (2020). BERTs of a feather do not generalize together: Large variability in generalization across models with similar test set performance. BlackboxNLP. [] []

Najoung Kim & Tal Linzen (2020). COGS: A compositional generalization challenge based on semantic interpretation. EMNLP. [] [] [bib]

R. Thomas McCoy, Erin Grant, Paul Smolensky, Tom Griffiths & Tal Linzen (2020). Universal linguistic inductive biases via meta-learning. Cognitive Science Society. [bib] [] []

Suhas Arehalli & Tal Linzen (2020). Neural language models capture some, but not all, agreement attraction effects. Cognitive Science Society. [bib] [] []

Naomi Havron, Camila Scaff, Maria Julia Carbajal, Tal Linzen, Axel Barrault & Anne Christophe (2020). Priming syntactic ambiguity resolution in children and adults. Language, Cognition and Neuroscience. [] []

Tal Linzen (2020). How can we accelerate progress towards human-like linguistic generalization? ACL. [] [bib]

Aaron Mueller, Garrett Nicolai, Panayiota Petrou-Zeniou, Natalia Talmina & Tal Linzen (2020). Cross-linguistic syntactic evaluation of word prediction models. ACL. [] [] [bib]

Junghyun Min, Richard T. McCoy, Dipanjan Das, Emily Pitler & Tal Linzen (2020). Syntactic data augmentation increases robustness to inference heuristics. ACL. [] [arXiv] [bib]

Michael Lepori, Tal Linzen & R. Thomas McCoy (2020). Representations of syntax [MASK] useful: Effects of constituency and dependency structure in recursive LSTMs. ACL. [] [] [bib]

R. Thomas McCoy, Robert Frank & Tal Linzen (2020). Does syntax need to grow on trees? Sources of hierarchical inductive bias in sequence-to-sequence networks. Transactions of the Association for Computational Linguistics, 8, 125–140. [] [] [bib]

Natalia Talmina & Tal Linzen (2020). Neural network learning of the Russian genitive of negation: optionality and structure sensitivity. Society for Computation in Linguistics (SCiL), 21. [] [] [bib]

2019

Marten van Schijndel, Aaron Mueller & Tal Linzen (2019). Quantity doesn't buy quality syntax with neural language models. EMNLP. [] [] [bib]

Grusha Prasad, Marten van Schijndel & Tal Linzen (2019). Using priming to uncover the organization of syntactic representations in neural language models. CoNLL. [] [bib]

R. Thomas McCoy, Ellie Pavlick & Tal Linzen (2019). Right for the wrong reasons: Diagnosing syntactic heuristics in natural language inference. ACL. [] [] [bib]

Afra Alishahi, Grzegorz Chrupała & Tal Linzen (2019). Analyzing and interpreting neural networks for NLP: A report on the first BlackboxNLP workshop. Journal of Natural Language Engineering, 25(4), 543–557. [] [] [bib]

Brenden Lake, Tal Linzen & Marco Baroni (2019). Human few-shot learning of compositional instructions. Cognitive Science Society. [] [] [bib]

Shauli Ravfogel, Yoav Goldberg & Tal Linzen (2019). Studying the inductive biases of RNNs with synthetic variations of natural languages. NAACL. [] [] [bib]

Najoung Kim, Roma Patel, Adam Poliak, Alex Wang, Patrick Xia, R. Thomas McCoy, Ian Tenney, Alexis Ross, Tal Linzen, Benjamin Van Durme, Samuel R. Bowman, Ellie Pavlick (2019). Probing what different NLP tasks teach machines about function word comprehension. Proceedings of the Eighth Joint Conference on Lexical and Computational Semantics (*SEM 2019), pages 235–249. [] [] [bib]

R. Thomas McCoy, Tal Linzen, Ewan Dunbar & Paul Smolensky (2019). RNNs implicitly implement tensor product representations. ICLR. [arXiv] []

Tal Linzen (2019). What can linguistics and deep learning contribute to each other? Response to Pater. Language. [] [] [bib]

R. Thomas McCoy & Tal Linzen (2019). Non-entailed subsequences as a challenge for natural language inference. Society for Computation in Linguistics (SCiL) (extended abstract). [] [bib]

Marten van Schijndel & Tal Linzen (2019). Can entropy explain successor surprisal effects in reading? Society for Computation in Linguistics (SCiL). [] [bib]

2018

Rebecca Marvin & Tal Linzen (2018). Targeted syntactic evaluation of language models. EMNLP. [] [] [video] [bib]

Marten van Schijndel & Tal Linzen (2018). A neural model of adaptation in reading. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP 2018), pages 4704–4710. [] [] [video] [bib]

Tal Linzen & Yohei Oseki (2018). The reliability of acceptability judgments across languages. Glossa: a journal of general linguistics, 3(1), 100. [] [] [bib] [data]

Laura Gwilliams, Tal Linzen, David Poeppel & Alec Marantz (2018). In spoken word recognition the future predicts the past. Journal of Neuroscience 38(35), 7585–7599. [] [] [bib]

Tal Linzen & Brian Leonard (2018). Distinct patterns of syntactic agreement errors in recurrent networks and humans. Proceedings of the 40th Annual Conference of the Cognitive Science Society, pages 692–697. [] [] [bib]

Marten van Schijndel & Tal Linzen (2018). Modeling garden path effects without explicit hierarchical syntax. Proceedings of the 40th Annual Conference of the Cognitive Science Society, pages 2600–2605. [] [] [bib]

R. Thomas McCoy, Robert Frank & Tal Linzen (2018). Revisiting the poverty of the stimulus: hierarchical generalization without a hierarchical bias in recurrent neural networks. Proceedings of the 40th Annual Conference of the Cognitive Science Society, pages 2093–2098. [] [] [bib]

Kristina Gulordava, Piotr Bojanowski, Edouard Grave, Tal Linzen, Marco Baroni (2018). Colorless green recurrent networks dream hierarchically. In Proceedings of the 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT), pages 1195–1205. [] [] [bib]

Laura Gwilliams, David Poeppel, Alec Marantz & Tal Linzen (2018). Phonological (un)certainty weights lexical activation. Proceedings of the 8th Workshop on Cognitive Modeling and Computational Linguistics (CMCL), pages 29–34. [] [] [bib]

James White, René Kager, Tal Linzen, Giorgos Markopoulos, Alexander Martin, Andrew Nevins, Sharon Peperkamp, Krisztina Polgárdi, Nina Topintzi & Ruben van de Vijver (2018). Preference for locality is affected by the prefix/suffix asymmetry: Evidence from artificial language learning. Proceedings of the 48th Annual Meeting of the North East Linguistic Society (NELS), pages 207–220. []

Itamar Kastner & Tal Linzen (2018). A morphosyntactic inductive bias in artificial language learning. Proceedings of the 48th Annual Meeting of the North East Linguistic Society (NELS), pages 81–90. [] [bib]

2017

Tal Linzen & Gillian Gallagher (2017). Rapid generalization in phonotactic learning. Laboratory Phonology: Journal of the Association for Laboratory Phonology 8(1): 24. [] [] [bib]

Émile Enguehard, Yoav Goldberg & Tal Linzen (2017). Exploring the Syntactic Abilities of RNNs with Multi-task Learning. Proceedings of the SIGNLL Conference on Computational Natural Language Learning (CoNLL), pages 3–14. [] [] [bib]

Tal Linzen, Noam Siegelman & Louisa Bogaerts (2017). Prediction and uncertainty in an artificial language. Proceedings of the 39th Annual Conference of the Cognitive Science Society, pages 2592–2597. [] [] [bib]

Gael Le Godais, Tal Linzen & Emmanuel Dupoux (2017). Comparing character-level neural language models using a lexical decision task. Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL): Volume 2, Short Papers, pages 125–130. [] [] [bib]

2016

Tal Linzen, Emmanuel Dupoux & Yoav Goldberg (2016). Assessing the ability of LSTMs to learn syntax-sensitive dependencies. Transactions of the Association for Computational Linguistics 4, 521–535. [] [] [bib]

Allyson Ettinger & Tal Linzen (2016). Evaluating vector space models using human semantic priming results. Proceedings of the First Workshop on Evaluating Vector Space Representations for NLP, 72–77. [] [] [bib]

Tal Linzen (2016). Issues in evaluating semantic spaces using word analogies. Proceedings of the First Workshop on Evaluating Vector Space Representations for NLP, 13–18. [] [] [bib]

Tal Linzen, Emmanuel Dupoux & Benjamin Spector (2016). Quantificational features in distributional word representations. Proceedings of the Fifth Joint Conference on Lexical and Computational Semantics (*SEM 2016), 1–11. [] [] [bib]

Einat Shetreet, Tal Linzen & Naama Friedmann (2016). Against all odds: exhaustive activation in lexical access of verb complementation options. Language, Cognition & Neuroscience 31(9), 1206–1214. [] [] [bib]

Tal Linzen (2016). The diminishing role of inalienability in the Hebrew Possessive Dative. Corpus Linguistics and Linguistic Theory 12(2), 325–354. [] [] [bib]

Tal Linzen & Florian Jaeger (2016). Uncertainty and expectation in sentence processing: Evidence from subcategorization distributions. Cognitive Science 40(6), 1382–1411. [] [] [bib]

2015

Joseph Fruchter*, Tal Linzen*, Masha Westerlund & Alec Marantz (2015). Lexical preactivation in basic linguistic phrases. Journal of Cognitive Neuroscience 27(10), 1912–1935. (* indicates equal contribution.) [] [] [bib]

Tal Linzen & Timothy O'Donnell (2015). A model of rapid phonotactic generalization. Proceedings of Empirical Methods in Natural Language Processing (EMNLP 2015), 1126–1131. [] [] [bib]

Maria Gouskova & Tal Linzen (2015). Morphological conditioning of phonological regularization. The Linguistic Review 32(3), 427–473. [] [lingbuzz] [] [bib]

Mira Ariel, Elitzur Dattner, John Du Bois & Tal Linzen (2015). Pronominal datives: The royal road to argument status. Studies in Language 39(2), 257–321. [] [] [bib]

2014

Tal Linzen & Florian Jaeger (2014). Investigating the role of entropy in sentence processing. Proceedings of the 2014 ACL Workshop on Cognitive Modeling and Computational Linguistics (CMCL), 10–18. [] [] [bib]

Tal Linzen & Gillian Gallagher (2014). The timecourse of generalization in phonotactic learning. Proceedings of Phonology 2013, ed. John Kingston, Claire Moore-Cantwell, Joe Pater, and Robert Staub. Washington, DC: Linguistic Society of America. [] [] [bib]

Tal Linzen (2014). Parallels between cross-linguistic and language-internal variation in Hebrew possessive constructions. Linguistics 52(3), 759–792. [] [] [bib]

Allyson Ettinger, Tal Linzen & Alec Marantz (2014). The role of morphology in phoneme prediction: Evidence from MEG. Brain and Language 129, 14–23. [] [] [bib]

2013

Tal Linzen, Alec Marantz & Liina Pylkkänen (2013). Syntactic context effects in single word recognition: An MEG study. The Mental Lexicon 8(2), 117–139. [link] [] [bib]

Tal Linzen, Sophia Kasyanenko & Maria Gouskova (2013). Lexical and phonological variation in Russian prepositions. Phonology 30(3), 453–515. [lingbuzz] [] [] [code and data] [bib]