Beyond Excel: how to start cleaning data with OpenRefine

Groves, Antony (2016) Beyond Excel: how to start cleaning data with OpenRefine. Multimedia Information and Technology, 42 (2). pp. 18-22. ISSN 1466-190X

PDF - Accepted Version
Available under License Creative Commons Attribution.

Abstract

Within our different roles as information professionals, we are all expected to handle larger and larger amounts of data, from the resources we manage to the analytics we collect. However, as this data grows, it can become harder to analyse. Ham explains that this is often due to errors and inconsistencies in the collection and management of data (2013, p.233); the time needed to learn how to analyse all of this information, and to carry out the analysis itself, adds to the difficulty. This guide aims to address some of these issues by introducing readers to OpenRefine (formerly Google Refine), open source software that can help to remove some of the errors and inconsistencies in datasets quickly and without requiring expert knowledge.

Item Type: Article
Schools and Departments: Professional Services > Library
Subjects: Z Bibliography. Library Science. Information Resources
Z Bibliography. Library Science. Information Resources > Z0665 Library Science. Information Science
Z Bibliography. Library Science. Information Resources > Z0719 Libraries (General)
Depositing User: Antony Groves
Date Deposited: 12 Aug 2016 06:53
Last Modified: 20 Oct 2017 12:23
URI: http://srodev.sussex.ac.uk/id/eprint/62368
