Voter Turnout Data Analysis

STA 210 - Regression Analysis

By Aidan Gildea, Jenny Huang, Bella Larsen, Jessie Bierschenk in Data Science R Shiny Regression Analysis

January 1, 0001

About this Project

In this project, we investigated voter turnout rates over a 14-year period. We used data from IPUMS and the American Statistical Association that that were originally collected from surveys conducted by the United States Census Bureau and the Bureau of Labor Statistics [1,2]. The surveys are part of a “Voting and Registration Supplement” of the Current Population Survey that occurs every two years after the November elections [3]. We used logistic regression and model selection processes (backward selection using AIC and drop-in-deviance tests) to predict whether a person voted or not based on predictor variables including sex, age, marital status, veteran status, citizenship status (native born or naturalized citizen), whether or not someone is Hispanic or Latinx, employment status and more. We assumed a hypothetical role of an organization planning to distribute mailers to encourage people to vote, with the goal of minimizing wasted resources by only sending mailers to the people who are not likely to vote using a decision threshold based on our final model.

This analysis’s code can be found on Github.

References: [1] The University of Minnesota. “What Is IPUMS?” Text. IPUMS, February 7, 2019. https://ipums.org/what-is-ipums. [2] American Statistical Association. “Fall Data Challenge: Get Out the Vote!” This Is Statistics. Accessed November 17, 2020. https://thisisstatistics.org/falldatachallenge/. [3] American Statistical Association. “ASA’s 2020 Fall Data Challenge: Get Out the Vote! Challenge Dataset FAQ,” n.d. https://thisisstatistics.org/wp-content/uploads/2020/09/2020-Fall-Data-Challenge-DatasetFAQ-PDF.pdf.

Posted on:
January 1, 0001
Length:
2 minute read, 230 words
Categories:
Data Science R Shiny Regression Analysis
Tags:
hugo-site
See Also:
Promoting Accessibility in Duke Data Visualization
Modeling the Visual Field For Young Glaucoma Patients
Classifying DNA Barcodes from the Lepidoptera Order