OpenRefine extensions
You can install the following extensions to add functionalities to OpenRefine. We do not vouch for the function of any of these extensions. Some only work with earlier version of OpenRefine, or with third-party distributions. See how to install an extension.
Extension Name | Description | Authors | Latest compatible version |
---|---|---|---|
RDF Transform | Transform data into RDF formats. Based on the RDF extension. However, it has been thoroughly rewritten to incorporate the newer Java and JavaScript technologies, techniques, and process enhancements. | AtesComp | 3.6.0 3.7-SNAPSHOT |
OSM Extractor | Adds a new import option that allows you to import OpenStreetMap data into OpenRefine using the Overpass API. Also adds a new GREL function called getInteriorPoint() which extracts the interior point of any Geometry (represented using Well-Known Text). | Labian Gashi | 3.5.0 |
GeoJSON Export | Adds a Graphical User Interface (GUI) that allows you to export OpenRefine data to the GeoJSON format. Supports latitude/longitude coordinates and Well-known Text (WKT). | Labian Gashi | 3.5.0 |
FAIR metadata | Supports FAIR metadata by integrating with FAIR Data Point to store your data and export to FAIR. It is a successor of the legacy FAIRifier OpenRefine 2.6 fork. Manual. | FAIRDataTeam | 3.4.1 |
VIB-Bits | Adds history tools, pivot tool, cross() graphical user interface (GUI), and scatterplot tool. [Manual | ||
PDF](http://web.archive.org/web/20210817193059/http://data.bits.vib.be/hidden/g7dt6RjuUTU421dY2CwrGePGX/OpenRefine%20VIB-BITS%20plugin.pdf) | VIB-Bits | 3.4.1 | |
Named-Entity Recognition | Extracts identifiable or reconcilable entities from text fields. It supports AlchemyAPI, DBpedia Lookup, and Zemanta. Manual. | Ruben Verborgh and Stuart Kenny | 3.4.1 |
String-Transformers | Adds a collection of Java string transformers to Jython functions. Some generic, some aimed at botany and taxonomy. Download this and this and follow their installation instructions. | RBGKew | 3.4.1 |
RDF extension | RDF Refine - an improved fork for exporting RDF. Adds a graphical user interface (GUI) for exporting data in RDF format. | stkenny & DRI | 3.4.1 |
GOKb Utilities | Adds three new features: prepend rows (add new blank rows); trim all data (remove preceding/trailing whitespace from across all cells); extractHost() (GREL function that extracts a host name from a URL). | GOKb & Owen Stephens | 3.1 |
Stats extension for Google Refine 2.5+ | Computes elementary statistics on column data. | sparkica | 3.1 |
CKAN Storage Extension | Exports data to CKAN Storage 2.2+ using CKAN storage API. | Ontodia | 3.1 |
GeoRefine | Add geospatial processing GREL functions. | Ryan Baumann | 2.8 |
D2Refine | A clinical study metadata harmonization and validation workbench to 1) convert restrictions (specified in a spreadsheet-like interface) into a clinical model; and 2) provide reconciliation services leveraging the Common Terminology Services 2 (CTS2). Manual. | Deepak K. Sharma | 2.7 |
Legacy Extensions
These extensions were produced in the past and are good examples of desired uses of OpenRefine. They are maintained here for historical reference, and in case a contributor would like to renew them.
Extension Name | Description | Authors | Suggested compatible version |
---|---|---|---|
LODRefine | LODRefine is actually OpenRefine with integrated extensions that make transition from tabular data to Linked Data a bit easier. Integrated extensions are: RDF extension, DBpedia extension, CrowdFlower crowdsourcing extension, Stats extension | Sparkica | |
VIB-BITS Diff plugin | Adds a "diff" tool which compares two text values. Manual PDF. | VIB-Bits | 3.4.1. The VIB website states "We are investigating how to provide [the diff plugin] again with version 3.4 of OpenRefine." |
VIB-BITS Save facets plugin | Adds support for save facet settings and re-apply the same facets at a later time. Manual PDF. | VIB-Bits | Unknown. The VIB website states "We are investigating how to provide [the diff plugin] again with version 3.4 of OpenRefine." |
geoXtension | geoXtension is based on GDAL, Proj, and GEOS libraries. Adding this extension takes a lot of time, usage of the ready-made docker is recommended. | giTorto | (circa 2.6) |
geoXtension | geoXtension is based on GDAL, Proj, and GEOS libraries. Adding this extension takes a lot of time, usage of the ready-made docker is recommended. | giTorto | (circa 2.6) |
extraCTU-plugin | To extract e-mails, telephone numbers, URLs and identification numbers from text. | giTorto | (circa 2.7) |
Opentree | Displays phylogenetic trees from the Open Tree of Life. Manual. | nickynicolson | 2.5 |
BioVeL | Adds functions for cleaning & clustering biodiversity data. The entire package can be installed and run locally as well as on a dedicated server. | Botanic Garden and Botanical Museum Berlin-Dahlem | 2.5 |
DBpedia extension | Adds columns from DBpedia to reconciled data. Data has to be reconciled with DBpedia or SPARQL endpoint, which returns DBpedia resource types. | Zemanta | 2.5 |
Stats extension for Google Refine 2.1 | Computes elementary statistics on column data. | The Chicago Tribune | 2.1 |
Reconciliation services
OpenRefine can connect to several reconciliation services. You can find information about them at:
- Reconcilable Data Sources in our wiki
- Reconciliation Software at the Reconciliation API Census
- Reconciliation Services at the Reconciliation API testbench (generated by a SPARQL query on Wikidata)
Client libraries
You can automate some OpenRefine operations using one of the existing libraries. Those libraries are using the OpenRefine API.