
Introduction

OpenRefine is a power tool for working with messy data. Use it to improve data consistency, link data to registries like Wikidata, augment it with data from other sources, transform it into different formats for other tools to consume, and contribute it back to the original sources. OpenRefine is not a web service but a desktop application that runs on your own computer, so you can process sensitive data privately.

History

Work on packaging OpenRefine began in February 2021 and was completed almost a year later when OpenRefine finally entered the Debian main archive.

List of newly packaged dependencies

OpenRefine required several new Java dependencies.

Several updates of existing packages were also required: google-api-client-java, google-http-client-java, jsoup, libjuniversalchardet-java, jetty9, libokhttp-java, libowasp-esapi-java.

Installation

(testing/unstable):  apt install openrefine 

(bullseye-backports): Edit your /etc/apt/sources.list and add

deb http://deb.debian.org/debian bullseye-backports main 
deb-src http://deb.debian.org/debian bullseye-backports main

then install OpenRefine with

 apt install -t bullseye-backports openrefine 
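As an alternative to editing /etc/apt/sources.list by hand, the backports entries can be written to a separate file. The following is only a sketch: the helper name backports_lines and the file name under /etc/apt/sources.list.d/ are assumptions chosen for illustration, not something the package provides.

```shell
# Print the two backports entries for a given Debian suite.
# backports_lines is a hypothetical helper, not part of any package.
backports_lines() {
    suite="$1"
    printf 'deb http://deb.debian.org/debian %s-backports main\n' "$suite"
    printf 'deb-src http://deb.debian.org/debian %s-backports main\n' "$suite"
}

# Example (run as root; the target file name is an assumption):
# backports_lines bullseye > /etc/apt/sources.list.d/bullseye-backports.list
# apt update
# apt install -t bullseye-backports openrefine
```

Keeping the entries in their own file makes it easy to remove the backports source again later without touching the main sources.list.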

Docker

A Docker image for development purposes is available. You can find the Dockerfile in debian/examples inside the source package.

  1. docker pull apo1999/openrefine || Download the image

  2. docker run -d -p 3333:3333 --name openrefine-test apo1999/openrefine || Start the container in the background and publish port 3333

  3. docker exec -it openrefine-test /bin/bash || Inspect the container

  4. docker stop openrefine-test || Stop the container

  5. docker rm openrefine-test || Remove the container

  6. docker rmi apo1999/openrefine || Remove the image
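
Once the container has been started, it can take a moment before OpenRefine answers requests. The sketch below polls a URL with curl until it responds. It assumes that curl is installed and that the web interface is reachable at http://localhost:3333/, which follows from the port mapping used above but is not confirmed for this image. The function name wait_for_url is hypothetical.

```shell
# Poll a URL until it answers successfully or the attempts run out.
# Usage: wait_for_url URL [ATTEMPTS]   (default: 30 attempts, 1 second apart)
wait_for_url() {
    url="$1"
    attempts="${2:-30}"
    i=0
    while [ "$i" -lt "$attempts" ]; do
        # -s: silent, -f: fail on HTTP errors, -o /dev/null: discard the body
        if curl -sf -o /dev/null "$url"; then
            return 0
        fi
        i=$((i + 1))
        sleep 1
    done
    return 1
}

# Example (the URL is an assumption based on the published port):
# wait_for_url http://localhost:3333/ && echo "OpenRefine is up"
```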