Data Cleaning with OpenRefine
December 13th @ 10:00 am - 11:30 am
Fall 2023 Research Data Management Seminar Series
Part of the data workflow is preparing the data for analysis. This includes data cleaning, where errors in the data are identified and corrected and formatting is made consistent. This step must be taken with the same care and attention to reproducibility as the analysis itself.
OpenRefine is a powerful, free, open-source tool with a graphical user interface (GUI) for cleaning and organizing data, with no coding required! Many users report that it saves them months of work they would otherwise spend making these edits by hand!
Do you work with data in spreadsheets? This workshop will teach you how to clean and standardize your tabular data efficiently and reproducibly using OpenRefine. Please bring your own laptop if available!
Instructors: Julie Goldman, Research Data Services, Countway Library
Meghan Kerr, Archivist and Records Manager, Center for the History of Medicine
Ashley Thomas, Health Science Data and Digital Services Librarian, Countway Library
This is an in-person workshop. Registration is limited and attendees must have a Harvard University ID (HUID) to enter the Countway Library.