Introduction
OpenRefine is a power tool for working with messy data. Use it to improve data consistency, link it to data registries like Wikidata, augment it with data from other sources, transform it into different formats for other tools to consume, and contribute it back to the original sources. OpenRefine is not a web service but a desktop application that runs on your own computer, so you can process sensitive data without it leaving your machine.
History
Work on packaging OpenRefine began in February 2021 and was completed almost a year later when OpenRefine finally entered the Debian main archive.
List of newly packaged dependencies
OpenRefine required several new Java dependencies.
Several updates of existing packages were also required: google-api-client-java, google-http-client-java, jsoup, libjuniversalchardet-java, jetty9, libokhttp-java, libowasp-esapi-java.
Installation
(testing/unstable): apt install openrefine
(bullseye-backports): Edit your /etc/apt/sources.list and add
deb http://deb.debian.org/debian bullseye-backports main
deb-src http://deb.debian.org/debian bullseye-backports main
then install OpenRefine with
apt install -t bullseye-backports openrefine
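The backports setup above can also be scripted. The following is a minimal sketch, not an official procedure: the helper name backports_lines is hypothetical, and the default mirror is simply the one used in the source lines above.

```shell
#!/bin/sh
# backports_lines SUITE [MIRROR]: print the deb and deb-src lines for SUITE.
# Hypothetical helper for illustration; it is not shipped with any package.
backports_lines() {
    suite="$1"
    mirror="${2:-http://deb.debian.org/debian}"
    printf 'deb %s %s main\n' "$mirror" "$suite"
    printf 'deb-src %s %s main\n' "$mirror" "$suite"
}
```

One possible use, assuming you prefer a separate file under /etc/apt/sources.list.d (the file name is an assumption): run backports_lines bullseye-backports | sudo tee /etc/apt/sources.list.d/bullseye-backports.list, then apt update so that APT sees the new suite before you install OpenRefine.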
Docker
A Docker image for development purposes is available. You can find the Dockerfile in debian/examples inside the source package.
docker pull apo1999/openrefine || Download the image
docker run -d -p 3333:3333 --name openrefine-test apo1999/openrefine || Start the container in the background and publish port 3333
docker exec -it openrefine-test /bin/bash || Inspect the container
docker stop openrefine-test || Stop the container
docker rm openrefine-test || Remove the container
docker rmi apo1999/openrefine || Remove the image
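After docker run, the container may need a few moments before OpenRefine answers on the published port. A minimal sketch of a retry helper follows; wait_for is a hypothetical name, and the sketch assumes only a POSIX shell.

```shell
#!/bin/sh
# wait_for TRIES CMD...: run CMD about once per second until it succeeds,
# giving up after TRIES attempts. Hypothetical helper for illustration.
wait_for() {
    tries="$1"
    shift
    while [ "$tries" -gt 0 ]; do
        # Succeed as soon as the probe command exits with status 0.
        if "$@" >/dev/null 2>&1; then
            return 0
        fi
        tries=$((tries - 1))
        # Pause between attempts, but not after the last one.
        if [ "$tries" -gt 0 ]; then
            sleep 1
        fi
    done
    return 1
}
```

For example, assuming curl is installed and the container was started with -p 3333:3333 as shown above: wait_for 30 curl -fsS http://localhost:3333/ && echo "OpenRefine is reachable". Keeping the probe as an argument means the same helper works with any other check you prefer.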