Automated Text Analysis for Political Science

Andrew Halterman

March 14, 2019 1:00PM E53-482

This workshop will give an overview of current and future text analysis techniques in political science, with a focus on how they fit into good research design and projects using machine learning for measurement. We'll talk through three phases in designing a text analysis project, focusing on how documents are represented computationally and what purpose different algorithms perform in different research designs. This workshop will focus on concepts in text analysis, but we'll also work through R code for preprocessing documents, fitting a topic model, and doing supervised learning on documents. No experience with R beyond Quant 1 is needed.