Cognitive and computer scientists have found that child language development and the historical evolution of the world’s languages share a common cognitive foundation: a core knowledge base through which patterns of children’s language innovation can predict patterns of language evolution, and vice versa.
The scientists investigated processes of word meaning extension across populations and within individuals.
Published in Science by scientists at the University of Toronto, Universitat Pompeu Fabra and the Catalan Institution for Research and Advanced Studies, the paper is a first-of-its-kind step toward a unified theory of the lexicon and the mind examined across timescales. The result may also help predict how a word’s meaning will change in the future across different languages, in language learners and in machine learning.
For the study, the team focused on a common form of human lexical creativity known as word meaning extension, in which people use known words to express something new instead of creating new words. For example, in the historical evolution of English, the word “mouse” was extended from its rodent meaning to refer to a computer device. In language development, children as young as two years old can use the word “ball” to refer to a balloon, presumably because they haven’t yet acquired the word “balloon,” so they overextend the known word “ball” to name the new object.
“We investigated processes of word meaning extension across populations and within individuals, and at two very different timescales: in language change and evolution, which take place over hundreds and thousands of years, and in child language development during the first few months and years of life,” says last author Yang Xu, Associate Professor, Department of Computer Science, Cognitive Science Program, University of Toronto. “We found that these diverse processes are fundamentally the same, and that the creation of new word meanings relies on a shared foundation of knowledge grounded in human experience.”
First author Thomas Brochhagen, Assistant Professor, Department of Translation and Language Sciences, Universitat Pompeu Fabra says: “This possible relationship between individual learning and the evolution of languages in terms of how meaning is organized had not been demonstrated thus far, and our study does so on a large scale and in a generalized way.”
For the study, the researchers built a computational model that takes pairs of concepts as input, such as “ball” versus “balloon” and “door” versus “key,” and predicts how likely the two concepts are to be co-named under the same word.
To identify similarities between concepts, the model draws on four primary knowledge types grounded in human experience: visual, associative, taxonomic (how terms are organized in a hierarchy, like referring to an apple as a fruit), and affective (how pleasant and intense a term is, like “sunny”).
A pair of concepts such as “ball” and “balloon” would score high because of their similar visual features, whereas “door” and “key” would score high because they are thematically related and often occur together in daily scenarios. “Water” and “pencil” show little similarity on any of the four knowledge types, so that pair would receive a low score. As a result, the model would predict that neither word is likely to be extended to the other concept.
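The scoring idea described above can be illustrated with a minimal sketch. It assumes, purely for illustration, that the four similarity scores are combined through a weighted logistic function; the article does not say how the researchers' model combines them. The names `PairSimilarity`, `WEIGHTS`, `BIAS` and `co_naming_probability`, along with every number below, are hypothetical and are not taken from the study.

```python
import math
from dataclasses import dataclass


@dataclass
class PairSimilarity:
    """Similarity of two concepts on the four knowledge types (0 to 1)."""
    visual: float
    associative: float
    taxonomic: float
    affective: float


# Hypothetical weights and bias, chosen by hand for illustration only.
WEIGHTS = PairSimilarity(visual=2.0, associative=2.0, taxonomic=1.5, affective=0.5)
BIAS = -2.0


def co_naming_probability(sim: PairSimilarity) -> float:
    """Combine the four similarities into a score between 0 and 1."""
    z = (
        BIAS
        + WEIGHTS.visual * sim.visual
        + WEIGHTS.associative * sim.associative
        + WEIGHTS.taxonomic * sim.taxonomic
        + WEIGHTS.affective * sim.affective
    )
    return 1.0 / (1.0 + math.exp(-z))  # logistic function


# Made-up similarity values that mirror the examples in the text.
ball_balloon = co_naming_probability(PairSimilarity(0.9, 0.4, 0.3, 0.5))  # visually alike
door_key = co_naming_probability(PairSimilarity(0.2, 0.9, 0.3, 0.5))      # thematically related
water_pencil = co_naming_probability(PairSimilarity(0.1, 0.1, 0.1, 0.3))  # unrelated

for name, p in [("ball/balloon", ball_balloon), ("door/key", door_key),
                ("water/pencil", water_pencil)]:
    print(f"{name}: {p:.2f}")
```

With these invented inputs, the first two pairs score above 0.5 and “water”/“pencil” scores below it, matching the qualitative pattern the article describes.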
The team found that all four knowledge types contributed to word meaning extension, which indicates that the process relies on multifaceted, grounded knowledge drawn from people’s perceptual, affective and common-sense experience.
The researchers then performed a cross-predictive analysis: a model built exclusively from children’s word meaning extension data successfully predicted word meaning extension patterns in language evolution and language change, and vice versa.
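The logic of a cross-predictive analysis can be sketched as follows: fit a model on one data source, test it on the other, then swap the roles. The sketch uses a simple nearest-centroid classifier as a stand-in and is not the researchers' model. The two small datasets are invented toy values, and all names (`CHILD_PAIRS`, `HISTORY_PAIRS`, `fit`, `predict`, `accuracy`) are hypothetical.

```python
# Each entry: ((visual, associative, taxonomic, affective) similarity, co_named?)
# Toy values invented for illustration; not data from the study.
CHILD_PAIRS = [
    ((0.9, 0.4, 0.3, 0.5), True),
    ((0.2, 0.9, 0.3, 0.5), True),
    ((0.8, 0.7, 0.6, 0.4), True),
    ((0.1, 0.1, 0.1, 0.3), False),
    ((0.2, 0.1, 0.2, 0.6), False),
    ((0.1, 0.3, 0.1, 0.2), False),
]
HISTORY_PAIRS = [
    ((0.85, 0.3, 0.4, 0.5), True),
    ((0.3, 0.8, 0.5, 0.4), True),
    ((0.15, 0.2, 0.1, 0.4), False),
    ((0.1, 0.15, 0.2, 0.5), False),
]


def centroid(vectors):
    """Component-wise mean of a list of equal-length vectors."""
    return tuple(sum(col) / len(col) for col in zip(*vectors))


def fit(pairs):
    """Return (centroid of co-named pairs, centroid of other pairs)."""
    pos = centroid([x for x, y in pairs if y])
    neg = centroid([x for x, y in pairs if not y])
    return pos, neg


def sq_dist(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((p - q) ** 2 for p, q in zip(a, b))


def predict(model, x):
    """Predict co-naming if x lies closer to the co-named centroid."""
    pos, neg = model
    return sq_dist(x, pos) < sq_dist(x, neg)


def accuracy(model, pairs):
    """Fraction of pairs whose predicted label matches the true label."""
    return sum(predict(model, x) == y for x, y in pairs) / len(pairs)


# Train on child data and test on historical data, then the reverse.
child_to_history = accuracy(fit(CHILD_PAIRS), HISTORY_PAIRS)
history_to_child = accuracy(fit(HISTORY_PAIRS), CHILD_PAIRS)
print(child_to_history, history_to_child)
```

The toy data were constructed so that both directions classify every pair correctly. The design point is only that each model is tested on data from the other timescale, never on its own training data.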
The researchers also checked the robustness of these predictive models in languages other than English, verifying that the creation of new word meanings follows similar patterns in 1,400 different languages, including Spanish, Catalan, Basque, Galician, German, French, Portuguese, Dutch, Danish, Norwegian, Swahili, Arabic, Mandarin Chinese, Hindi and Korean.
Child overextension is typically studied in developmental psychology, whereas historical word meaning extension is typically studied in historical and computational linguistics.
Date: 08.12.2025
“By building this connection between the two fields, we find a core knowledge engine that supports lexical creativity in word meaning extension, which is fundamentally important to human cognition and linguistic communication of emerging meanings,” says Xu.
These computational models may also help researchers understand and facilitate second language acquisition by interpreting the errors that learners make in English and other languages, which could resemble how children and adults extend word meaning in their mother tongue.
Future research will further explore the origins and cognitive mechanisms of human lexical creativity, and the possibility of predicting new or emerging meaning in both human language development and machine learning systems.
“Developing a unified theory of the mind across timescales is a challenging undertaking — we don’t have access to human minds dating back hundreds or thousands of years,” says Xu. “Our study offers an alternative way for exploring this unification through the lexicon — a creative product of the human mind and the system of word-meaning mappings, for which we have data available to us.”
References: From language development to language evolution: A unified view of human lexical creativity; Science; DOI:10.1126/science.ade7981