Skip to content

msatula/LSAT-Data-Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 

Repository files navigation

Law School Admissions Test - Data Analysis

This project is a data analysis report of LSAT (Law School Admission Test) multiple-choice questions, conducted for my final project in my Data Modeling II class at UW-Madison. For the project, we analyzed whether there was a strategy for guessing the answer to a multiple-choice question that is more likely to be correct than randomly guessing. This was a partner project with my classmate, Ben Kizaric.

Tech

  • R (programming language)
  • Posit (formerly R Studio)
  • Python

How we did it

We first acquired LSAT answer sheets from an official LSAT prep source. But since we couldn't get them as usable data files, we had to manually output the questions and answers ourselves into text files. After cleaning the data, we then used a self-written Python script to convert the text files into CSV files.

With CSV files, we analyzed the data in Posit, using R. We used analysis methods like monte-carlo simulations to verify our hypothesis about the data. We also visualized the data using various charts and graphs.

Unfortunately, we didn't discover any statistically significant answer-guessing strategies, but we did discover that the LSAT multiple-choice tests are much less likely to have the same letter answer be correct in a row, compared to a randomly-generated answer sheet.

How it could be improved

  • More recent data - The available answer sheets were several years old.
  • Explore further answer-guessing strategies.

To open

  1. Download LSAT Data Analysis STAT340.html
  2. Open the file in your web browser

About

Data analysis of the Law School Admission Test via R and Python

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages