Brussels / 3 & 4 February 2024


Making OpenRefine more reproducible

OpenRefine is a data cleaning tool used in various fields such as data journalism, libraries, scientific research and the Wikimedia movement. Its point-and-click interface is appreciated for making sophisticated data cleaning operations accessible without programming. Although the tool already makes it possible to replay a series of operations on a new dataset, this feature lacks the robustness and flexibility it deserves. In this talk I will present our ongoing work on improving OpenRefine's reproducibility and seek feedback from the audience and the broader community on our design choices.


Antonin Delpeuch