lessons, e-learning

Introduction to Data Management Practices - Cleaning tabular data with OpenRefine

part of the data workflow is preparing the data for analysis. Some of this involves data cleaning, where errors in the data are identified and corrected, or formatting made consistent. This step must be taken with the same care and attention to reproducibility as the analysis.

OpenRefine is a powerful free and open source tool for working with messy data: cleaning it and transforming it from one format into another.

This lesson will teach you to use OpenRefine to effectively clean and format data and automatically track any changes that you make. Many people comment that this tool saves them literally months of work trying to make these edits by hand.

Licence: Creative Commons Attribution Share Alike 4.0 International

Contact: edu.intro-dm@nbis.se

Keywords: Open science, FAIR, Data management, CONVERGE

Target audience: Researcher in life sciences, PhD candidates, Scientists

Resource type: lessons, e-learning

Status: Active

Prerequisites:

This lesson requires a working copy of OpenRefine.

Authors: Elin Kronander, Stephan Nylinder

Contributors: Niclas Jareborg, Yvonne Kallberg, Mattias Strömberg, Wolmar Nyberg Åkerström, Erik Hedman, Markus Englund

Scientific topics: Data management, Open science, FAIR data


Activity log