Google Refine (Formerly Freebase Gridworks): Open Source Power Tool for Data Wranglers

The Google Open Source Blog announced that the Freebase Gridworks project has been renamed "Google Refine."

Google Refine is a power tool for working with messy data sets, including cleaning up inconsistencies, transforming them from one format into another, and extending them with new data from external web services or other databases. V2.0 introduces a new extensions architecture, a reconciliation framework for linking records to other databases (like Freebase), and a ton of new transformation commands and expressions.
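To give a rough idea of what "cleaning up inconsistencies" can mean in practice, here is a minimal Python sketch that groups differently spelled values under a shared normalization key. It is only an illustration and not Google Refine's own code or API: the function names `fingerprint` and `cluster_values` and the sample data are made up for this example.

```python
import re
from collections import defaultdict


def fingerprint(value: str) -> str:
    """Hypothetical normalization key: lowercase, drop punctuation,
    then sort the unique tokens so word order does not matter."""
    cleaned = re.sub(r"[^\w\s]", "", value.strip().lower())
    return " ".join(sorted(set(cleaned.split())))


def cluster_values(values):
    """Group raw values that share the same normalization key."""
    groups = defaultdict(list)
    for v in values:
        groups[fingerprint(v)].append(v)
    # Keep only keys that cover more than one distinct spelling.
    return {k: vs for k, vs in groups.items() if len(set(vs)) > 1}


# Made-up sample data with inconsistent spellings of one company name.
messy = ["Acme Inc.", "acme inc", "Inc. Acme", "Globex Corp", "Initech"]
clusters = cluster_values(messy)
print(clusters)
```

Each resulting group lists spellings that probably refer to the same thing, so a person can review them and pick one canonical form. A real tool would offer several matching strategies; this sketch shows only one simple approach.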

To learn more about what you can do with Google Refine 2.0, watch the following screencasts:

The project and its code are available here. Changes from version 1.1 to 2.0 are listed here.

[Source]