Open Refine and Messy Data

MISSING IMAGE

Material Information

Title:
Open Refine and Messy Data
Series Title:
Big Data, Little Data: Having it All A Research Data & Data Management 2013 Workshop
Physical Description:
Presentation slides
Language:
English
Creator:
Minson, Valrie
Publisher:
George A. Smathers Libraries, University of Florida
Place of Publication:
Gainesville, FL
Publication Date:

Notes

Abstract:
Presentation slides by resource expert presenter at the Big Data, Little Data Workshop on Oct. 3, 2013.
General Note:
Data Management / Curation Task Force Materials ( DMCTF Materials )

Record Information

Source Institution:
University of Florida
Holding Location:
University of Florida
Rights Management:
Applicable rights reserved.
System ID:
AA00017906:00006


This item is only available as the following downloads:


Full Text

PAGE 1

MSL George A. Smathers Libraries Marston Science Library OPEN REFINE: CLEAN YOUR MESSY DATA Valrie Minson Outreach Librarian for Agricultural Sciences

PAGE 2

MSL OpenRefine OpenRefine.org ( Google Refine) Open Source Runs locally on computer (privacy) Looks like Excel or Google Spreadsheets Data: clean it, transform it, extend it

PAGE 3

MSL Excel: my messy data

PAGE 4

MSL Data issues Human/free text errors Inconsistent journal titles Redundant citations Data (volume/issue) in wrong fields ARTICLES IN CAPS LOCK

PAGE 5

MSL Data in Open Refine

PAGE 6

MSL Use filters to edit data

PAGE 7

MSL Faceted: journal titles by count

PAGE 8

MSL

PAGE 9

MSL Filtering/faceting Use filters or facets to select subsets of data Journal of Agriculture Journ of Agriculture Journla of Agriculture Agriculture, Journal of Not just for Messy data

PAGE 10

MSL OpenRefine Expression Language (GREL) Transform list into table (create columns) Merge datasets Export into Excel, CSV, OpenOffice Google Spreadsheets, JSON, RDF, etc. Use with other systems (Excel, SPSS, etc.) Great videos

PAGE 11

MSL OpenRefine.org Valrie Minson vdavis@ufl.edu