Skip to Main Content

Research Data Management

A guide on managing, organising, sharing and preserving research data

Tools for Cleaning and Wrangling

OpenRefine (https://openrefine.org/) is a tool for data cleaning and wrangling. You can check the OpenRefine for Social Science Data tutorial on Data Carpentry.

Messydates is an R package developed by Professor James Hollway at the Global Governance Centre to tidy up dates in your data files so they are compatible with other R packages.

Pandas is the most widely-used Python tool for data wrangling and analysis.

Transcription tools

Offline LLM/AI tools for transcription of sensitive data (require a powerful PC)

Online LLM/AI tools for interview transcription (more expensive and should not be used with sensitive data)

Tool for handwritten archives transcription