One Text / Two Languages

How linguists and programmers approach code-switching Dr. Jacqueline Serigos, George Mason University


It is estimated that more than half of the world’s population is multilingual; therefore, it is not surprising that many texts and conversations incorporate multiple languages, resulting in a linguistic phenomenon called code-switching. This workshop covers how linguists gather, process and analyze code-switched data. We will explore the NPL pipeline for processing multilingual texts and discuss various approaches to language identification.


Jacqueline Serigos is an Assistant Professor of Spanish Linguistics in the Department of Modern and Classical Languages at George Mason University. Her research explores various forms of language contact, such as lexical borrowings, code-switching, and calques. She employs approaches from multiple linguistic subfields, including corpus linguistics, computational linguistics, sociolinguistics, and semantics, as well as the discipline of statistics. She received her BA in Anthropology and Latin American Studies from Emory University and a PhD in Hispanic Linguistics from the University of Texas at Austin.