Skip to Main Content

Research Tools: Clean Data

An index of useful tools and software for the research process.

Prepare raw data for analysis by correcting errors, filling in missing values, and formatting it consistently. Clean data helps ensure accurate results and is a key first step in any research involving datasets.

OpenRefine

An example of a worksheet within OpenRefine

 

Tool for cleaning messy data and transforming it between formats.


Type: Data Cleaning

Access: Free

Where Can I Access It?: Must be installed on personal computer

Availability: Desktop

Skill Level: Intermediate

Potential Use Cases: Students working with large CSVs or inconsistent datasets in digital humanities, survey analysis, or archival research.


Tabula

An example of Tabula worksheets

 

Extracts tabular data from PDFs into CSV or Excel formats.


Type: Data Cleaning

Access: Free

Where Can I Access It?: Must be installed on personal computer

Availability: Desktop

Skill Level: Basic

Potential Use Cases: Students extracting data from academic papers, public reports, or scanned documents for use in their assignments or research.


EasyMorph

An image of the EasyMorph workspace

 

Tool for automating data preparation without coding.


Type: Data Cleaning

Access: Free (Limited features available)

Where Can I Access It?: Must be installed on personal computer

Availability: Desktop

Skill Level: Intermediate

Potential Use Cases: Business or social science students organizing large datasets for reports or capstone projects without needing to code.


LSU Resources

Digital Humanities Guide

LSU Library's guide to Digital Humanities tools.

Further Resources