Driving and suppressing the human language network using large language models

Greta Tuckute; Aalok Sathe; Shashank Srikant; Maya Taliaferro; Mingye Wang; Martin Schrimpf; Kendrick Kay; Evelina Fedorenko

doi:10.1038/s41562-023-01783-7

Center for Magnetic Resonance Research

Research output: Contribution to journal › Article › peer-review

2 Scopus citations

Abstract

Transformer models such as GPT generate human-like language and are predictive of human brain responses to language. Here, using functional-MRI-measured brain responses to 1,000 diverse sentences, we first show that a GPT-based encoding model can predict the magnitude of the brain response associated with each sentence. We then use the model to identify new sentences that are predicted to drive or suppress responses in the human language network. We show that these model-selected novel sentences indeed strongly drive and suppress the activity of human language areas in new individuals. A systematic analysis of the model-selected sentences reveals that surprisal and well-formedness of linguistic input are key determinants of response strength in the language network. These results establish the ability of neural network models to not only mimic human language but also non-invasively control neural activity in higher-level cortical areas, such as the language network.

Original language	English (US)
Pages (from-to)	544-561
Number of pages	18
Journal	Nature Human Behaviour
Volume	8
Issue number	3
DOIs	https://doi.org/10.1038/s41562-023-01783-7
State	Published - Mar 2024

Bibliographical note

Publisher Copyright:
© The Author(s), under exclusive licence to Springer Nature Limited 2024.

PubMed: MeSH publication types

Journal Article

Cite this

@article{e77fc110d017463eb600671a0f42039d,
  title = "Driving and suppressing the human language network using large language models",
  abstract = "Transformer models such as GPT generate human-like language and are predictive of human brain responses to language. Here, using functional-MRI-measured brain responses to 1,000 diverse sentences, we first show that a GPT-based encoding model can predict the magnitude of the brain response associated with each sentence. We then use the model to identify new sentences that are predicted to drive or suppress responses in the human language network. We show that these model-selected novel sentences indeed strongly drive and suppress the activity of human language areas in new individuals. A systematic analysis of the model-selected sentences reveals that surprisal and well-formedness of linguistic input are key determinants of response strength in the language network. These results establish the ability of neural network models to not only mimic human language but also non-invasively control neural activity in higher-level cortical areas, such as the language network.",
  author = "Greta Tuckute and Aalok Sathe and Shashank Srikant and Maya Taliaferro and Mingye Wang and Martin Schrimpf and Kendrick Kay and Evelina Fedorenko",
  note = "Publisher Copyright: {\textcopyright} The Author(s), under exclusive licence to Springer Nature Limited 2024.",
  year = "2024",
  month = mar,
  doi = "10.1038/s41562-023-01783-7",
  language = "English (US)",
  volume = "8",
  pages = "544--561",
  journal = "Nature Human Behaviour",
  issn = "2397-3374",
  publisher = "Nature Publishing Group",
  number = "3",
}

TY  - JOUR
T1  - Driving and suppressing the human language network using large language models
AU  - Tuckute, Greta
AU  - Sathe, Aalok
AU  - Srikant, Shashank
AU  - Taliaferro, Maya
AU  - Wang, Mingye
AU  - Schrimpf, Martin
AU  - Kay, Kendrick
AU  - Fedorenko, Evelina
PY  - 2024/3
Y1  - 2024/3
N2  - Transformer models such as GPT generate human-like language and are predictive of human brain responses to language. Here, using functional-MRI-measured brain responses to 1,000 diverse sentences, we first show that a GPT-based encoding model can predict the magnitude of the brain response associated with each sentence. We then use the model to identify new sentences that are predicted to drive or suppress responses in the human language network. We show that these model-selected novel sentences indeed strongly drive and suppress the activity of human language areas in new individuals. A systematic analysis of the model-selected sentences reveals that surprisal and well-formedness of linguistic input are key determinants of response strength in the language network. These results establish the ability of neural network models to not only mimic human language but also non-invasively control neural activity in higher-level cortical areas, such as the language network.
AB  - Transformer models such as GPT generate human-like language and are predictive of human brain responses to language. Here, using functional-MRI-measured brain responses to 1,000 diverse sentences, we first show that a GPT-based encoding model can predict the magnitude of the brain response associated with each sentence. We then use the model to identify new sentences that are predicted to drive or suppress responses in the human language network. We show that these model-selected novel sentences indeed strongly drive and suppress the activity of human language areas in new individuals. A systematic analysis of the model-selected sentences reveals that surprisal and well-formedness of linguistic input are key determinants of response strength in the language network. These results establish the ability of neural network models to not only mimic human language but also non-invasively control neural activity in higher-level cortical areas, such as the language network.
UR  - http://www.scopus.com/inward/record.url?scp=85181495375&partnerID=8YFLogxK
UR  - http://www.scopus.com/inward/citedby.url?scp=85181495375&partnerID=8YFLogxK
U2  - 10.1038/s41562-023-01783-7
DO  - 10.1038/s41562-023-01783-7
M3  - Article
C2  - 38172630
AN  - SCOPUS:85181495375
SN  - 2397-3374
VL  - 8
SP  - 544
EP  - 561
JO  - Nature Human Behaviour
JF  - Nature Human Behaviour
IS  - 3
ER  -
