This is the RefinePro knowledge base about OpenRefine. We have built it over the years and keep adding to it. From great tutorials and how-tos to handy GREL expressions and links to external resources, you will find here one of the most comprehensive lists of resources for learning OpenRefine.

For comprehensive documentation, refer to the official OpenRefine wiki.

Don't know where to get started? Search for a specific function below, or read our most popular articles from the right-side menu.

3.10.12

Grabbing Twitter Search Results into Google Refine And Exporting Conversations into Gephi (from around the web)

This neat tutorial explains how to import data directly from the Twitter API at the project creation stage, using the JSON format. The second part explains how to prepare the data for import into Gephi for data visualization.
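
To give a rough idea of the second step outside Refine, here is a minimal Python sketch that turns tweet JSON into an edge list. It is not taken from the tutorial: the sample data is made up, and the field names (`results`, `from_user`, `to_user`, `text`) are assumptions about how a search response might be shaped. The output is a plain `Source,Target` CSV, which is a layout Gephi can read as an edge list.

```python
import csv
import io
import json

# Hypothetical sample; the field names are assumptions, not a documented API.
SAMPLE = '''
{"results": [
  {"from_user": "alice", "to_user": "bob", "text": "@bob nice chart"},
  {"from_user": "bob", "to_user": null, "text": "Trying out Gephi"},
  {"from_user": "carol", "to_user": "alice", "text": "@alice agreed"}
]}
'''


def conversation_edges(raw_json):
    """Return (source, target) pairs for tweets addressed to another user."""
    tweets = json.loads(raw_json)["results"]
    # Tweets with no recipient are not part of a conversation, so skip them.
    return [(t["from_user"], t["to_user"]) for t in tweets if t.get("to_user")]


def edges_to_csv(edges):
    """Serialize the edges as CSV text with a Source,Target header row."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["Source", "Target"])
    writer.writerows(edges)
    return buffer.getvalue()


edges = conversation_edges(SAMPLE)
print(edges_to_csv(edges))
```

In Refine itself the same filtering would be done with facets and column operations; the sketch only shows the shape of the data at each end.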

via Delicious 

29.9.12

Use Google Refine to clean your data for Fulcrum (from around the web)

Fulcrum lets you create location-based data collection apps and deploy them to your mobile device. This tutorial shows how to use Google Refine to take advantage of the data you have collected with Fulcrum.

via Delicious http://docs.fulcrumapp.com/guides/cleaning-up-data-with-google-refine/

10.9.12

Error: smartSplit error: Un-terminated quoted field at end of CSV line

I am a big fan of the smartSplit function. It is really easy to understand and helps to quickly extract part of a string based on any separator character. However, if a cell contains a double quote sign (") when you use smartSplit, Google Refine will return the following error message:
Error: smartSplit error: Un-terminated quoted field at end of CSV line

Here is my workaround.
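
The details are in the full article. As a rough illustration of why the error appears, here is a small Python sketch (Python rather than GREL, and not necessarily the approach from the article). It assumes smartSplit behaves like a CSV parser, where an opening double quote starts a quoted field that must be closed. The helper names `split_fields` and `split_without_quotes` are made up for this example. One possible way around the problem is to remove the double quotes before splitting, at the cost of losing any quoted grouping.

```python
import csv


def split_fields(value, sep=",", strict=True):
    """Split one cell value with a CSV-style parser (quotes group text)."""
    # strict=True makes Python's parser reject an unterminated quoted field.
    return next(csv.reader([value], delimiter=sep, strict=strict))


def split_without_quotes(value, sep=","):
    """Drop double quotes first, then split; quoted grouping is lost."""
    return split_fields(value.replace('"', ""), sep)


cell = '12,"large widget,blue'  # the opening quote is never closed

try:
    split_fields(cell)
except csv.Error as exc:
    print("parser error:", exc)

print(split_without_quotes(cell))
```

In Refine, the equivalent idea would be to replace the quote character in the cell value before calling smartSplit.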

5.9.12

Google Refine Workshop (from around the web)

This tutorial and exercise will walk you through all of Google Refine's main functionality. It works through exercises so you can get hands-on quickly!

Data Journalism Workshop - New York (from around the web)

Google Hangout of HHNew York presenting Google Refine

11.8.12

Google Refine Uploader and Stats Extension



The Google Refine Uploader Extension allows you to export datasets from Google Refine and post them as JSON to web servers. It is intended for use with CouchDB. Please note that this extension is a work in progress. Feel free to join and help!


This extension is based on the Chicago Tribune Stats extension. A tutorial is available on their blog. Please note that the extension does not work with Google Refine 2.5. It should be tested with the 2.0 version available here.

If you have installed and tried either of these two extensions, I'll be pleased to hear from you!

Data Shaping in Google Refine – Generating New Rows from Multiple Values in a Single Column


A great tutorial on reshaping a data set using the transpose and fill down functions. The article also introduces the split multi-valued cells function, which splits and transposes in one shot.
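
For readers who think in code, here is a minimal Python sketch of the two operations. It is only an approximation of what Refine does, with made-up sample data and hypothetical helper names: splitting a multi-valued cell produces extra rows whose other columns are blank, and fill down then copies the last non-blank value into those blanks.

```python
def split_multi_valued(rows, column, sep=","):
    """Emit one row per value in `column`; extra rows get blank other cells."""
    out = []
    for row in rows:
        parts = [part.strip() for part in row[column].split(sep)]
        for index, part in enumerate(parts):
            # The first value stays on the original row; later values go
            # on new rows whose other columns are left blank.
            new_row = dict(row) if index == 0 else {key: "" for key in row}
            new_row[column] = part
            out.append(new_row)
    return out


def fill_down(rows, column):
    """Replace blank cells in `column` with the last non-blank value above."""
    out, last = [], None
    for row in rows:
        new_row = dict(row)
        if new_row.get(column):
            last = new_row[column]
        elif last is not None:
            new_row[column] = last
        out.append(new_row)
    return out


rows = [
    {"author": "Smith", "topics": "maps, graphs"},
    {"author": "Lee", "topics": "csv"},
]

split_rows = split_multi_valued(rows, "topics")
filled_rows = fill_down(split_rows, "author")
for row in filled_rows:
    print(row)
```

After both steps every row carries exactly one topic together with its author, which is the one-value-per-row layout most other tools expect.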