ABSTRACT :

It is estimated that more than half of the world’s population is multilingual; therefore, it is not surprising that many texts and conversations incorporate multiple languages, resulting in a linguistic phenomenon called code-switching. This workshop covers how linguists gather, process and analyze code-switched data. We will explore the NPL pipeline for processing multilingual texts and discuss various approaches to language identification.

BIO:

Jacqueline Serigos is an Assistant Professor of Spanish Linguistics in the Department of Modern and Classical Languages at George Mason University. Her research explores various forms of language contact, such as lexical borrowings, code-switching, and calques. She employs approaches from multiple linguistic subfields, including corpus linguistics, computational linguistics, sociolinguistics, and semantics, as well as the discipline of statistics. She received her BA in Anthropology and Latin American Studies from Emory University and a PhD in Hispanic Linguistics from the University of Texas at Austin.

  • Georgetown Hackathon participants
    Georgetown Co-Hosts Successful Hackathon

    Thanks to all the students who came out for our first hackathon, co-hosted by Georgetown Analytics, GWU Data Science, QED Group, and the Center for Global Data Visualization.

  • Data Viz Challenge
    Georgetown to Co-Host Data Visualization Challenge

    In collaboration with GWU and CGDV, Georgetown Analytics invites you to participate in a Data Visualization Challenge on April 12, 2019.

  • Georgetown Analytics Hosts Deloitte Core Consulting Series
    Georgetown Analytics Hosts Deloitte Core Consulting Series

    The MS Analytics program is partnering with the Deloitte Foundation to bring an exciting opportunity to Georgetown's campus on March 22 & 29, 2019.

  • Linguistic Diversity Around The World
    One Text / Two Languages

    This workshop covered how linguists gather, process and analyze code-switched data, exploring the NPL pipeline for processing multilingual texts and discussing various approaches to language identification.