Happy December! Didn't 2021 just fly by? For the last month of this year, we decided to share the most relevant and helpful OpenRefine November 2021 community announcements, releases, tutorials, and videos with you.
Don't forget to subscribe to our newsletter to get our monthly updates right in your inbox!
Happy November to all! This month, we shared the most relevant and helpful OpenRefine October 2021 community announcements, releases, tutorials, videos, and academic publications with you.
We wanted to wish you a very happy October! This month, we decided to share the most relevant and helpful OpenRefine September 2021 community announcements, releases, tutorials, videos, and academic publications with you.
We wish you a happy September! This month, we decided to share the most relevant and helpful OpenRefine August 2021 community announcements, releases, tutorials, videos, and academic publications with you.
Happy August! This month, we decided to share the most relevant and helpful OpenRefine July 2021 community announcements, releases, tutorials, videos, and academic publications with you.
Thanks to this StackOverflow question, I finally found a great use case to introduce the columnName variable in OpenRefine.
The columnName variable has been poorly documented (see previous discussions on SO and on the OpenRefine mailing list). The feature became really interesting with the All > Transform option available in OpenRefine 2.7 back in 2017 (yes, this blog post is long overdue!).
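To make the idea concrete, here is a rough Python analogy of what a single expression applied to several columns can do once it knows which column it is running in. OpenRefine evaluates GREL, not Python, so this is only a sketch: the helper names `transform_all` and `tag_with_column` and the sample data are hypothetical.

```python
# Illustrative analogy only: OpenRefine evaluates GREL, not Python.
# `transform_all` and `tag_with_column` are hypothetical names.

def transform_all(rows, columns, expression):
    """Apply expression(value, column_name) to each listed column of every row."""
    return [
        {
            name: (expression(value, name) if name in columns else value)
            for name, value in row.items()
        }
        for row in rows
    ]


def tag_with_column(value, column_name):
    # Roughly what a GREL expression like  columnName + ": " + value  would do.
    return f"{column_name}: {value}"


rows = [{"city": "Paris", "country": "France"}]  # made-up sample data
result = transform_all(rows, ["city", "country"], tag_with_column)
print(result)
```

The point of the analogy is that the same expression produces a different result in each column because the column name is passed in alongside the cell value.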
I just can't believe we have not yet published a tutorial for this essential OpenRefine feature!
When you select the edit option from the facet menu, you can edit all matching values in a column, similar to a search and replace function. Of course, the update is recorded in the OpenRefine history and can be replicated on a similar dataset.
You can use the facet edit option to quickly clean up typos or misspelled cells. It is also useful for handling duplicates that are either not detected by the clustering feature or caught in larger clusters with irrelevant matches.
In the video example below, we replace the value US with United States for 45 cells in one shot.
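For readers who think in code, the effect of a facet edit can be sketched in a few lines of Python. This is only an analogy of the behavior described above, not how OpenRefine implements it; the function names `facet_counts` and `edit_facet_value` and the sample rows are made up for illustration.

```python
from collections import Counter

# Analogy only: hypothetical helpers that mimic a text facet and its edit option.

def facet_counts(rows, column):
    """Count how many rows hold each distinct value, like a text facet does."""
    return Counter(row[column] for row in rows)


def edit_facet_value(rows, column, old, new):
    """Replace every cell equal to `old` with `new`; return how many cells changed."""
    changed = 0
    for row in rows:
        if row[column] == old:
            row[column] = new
            changed += 1
    return changed


rows = [
    {"country": "US"},
    {"country": "United States"},
    {"country": "US"},
    {"country": "Canada"},
]
before = facet_counts(rows, "country")
changed = edit_facet_value(rows, "country", "US", "United States")
after = facet_counts(rows, "country")
print(changed, dict(after))
```

Only cells whose whole value matches are touched, which is why this is safer than a blind substring replace when cleaning up variants of the same name.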
With spring at our doors, we have a new section in our monthly update: Academic Publication. Here we will list recent academic publications about OpenRefine, or papers that use OpenRefine in their process. This should be a trove of more advanced use cases promoting reproducible science.
Do not forget to subscribe to our newsletter to get our monthly update right in your mailbox.
2021 starts big with the release of the new user documentation for OpenRefine. It took over six months of hard work to migrate the documentation from the GitHub wiki to Docusaurus. It was also an opportunity to update the content.
Below, we also list new tutorials and videos about OpenRefine published through January, along with updates to the NER and RDF extensions.
Do not forget to subscribe to our newsletter to get our monthly update right in your mailbox.
Sometimes you just need to add a new empty column to your OpenRefine project. The process is straightforward, as we show in this video tutorial.
From any existing column, select the option Edit column and then Add a column based on this column...
By default, the GREL formula repeats the value from the column you selected.
Just replace the value with two quotes like this "" to create an empty string. You can add any text between the quotes. OpenRefine will add it for all the rows selected by your facet.
Give your new column a name and click OK,
and you are done!
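The steps above can be pictured with a short Python sketch. It is an analogy, not OpenRefine code: the function `add_column`, its parameters, and the sample rows are hypothetical, and the choice to leave rows outside the facet selection as `None` is an assumption made for this illustration.

```python
# Analogy only: `add_column` is a hypothetical helper, not an OpenRefine API.

def add_column(rows, new_name, value="", selected=lambda row: True):
    """Add `new_name` to every row; selected rows get `value`, others get None.

    Leaving unselected rows as None is an assumption made for this sketch.
    """
    for row in rows:
        row[new_name] = value if selected(row) else None
    return rows


rows = [
    {"name": "Alice", "country": "Canada"},
    {"name": "Bob", "country": "France"},
]

# Empty column on all rows: the equivalent of typing "" as the expression.
add_column(rows, "notes")

# Constant text, limited to rows matching a facet-like selection.
add_column(rows, "status", "to review", selected=lambda r: r["country"] == "France")
print(rows)
```

Passing an empty string gives the empty column described in the steps, while any other text between the quotes fills the selected rows with that constant.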
We made a quick video tutorial to show you the steps: