22.7.11

Remove or replace a specific character in a column

You want to remove a space or a specific character from a column, such as the # sign in front of some numbers.


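Two expressions do the same job:
  • value.split("#").join("")
    This splits the value at every # sign ("#") and joins the pieces back together with nothing between them ("").

  • replace(value, "#", "")
    This looks for every # sign ("#") and replaces it with nothing ("").

To remove spaces instead, use " " in place of "#" in either expression. Note that the quotes must be straight quotes ("), not curly ones.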
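
For readers who want to check that both approaches give the same result outside of Refine, here is a minimal sketch in plain Python that mirrors the two expressions. The function names and the sample value are made up for illustration; they are not part of Refine.

```python
# Plain-Python mirror of the two expressions above.
# Function names and the sample value are illustrative only.

def remove_by_split_join(value, char="#"):
    """Split at every occurrence of `char`, then join the pieces with nothing between them."""
    return "".join(value.split(char))


def remove_by_replace(value, char="#"):
    """Replace every occurrence of `char` with the empty string."""
    return value.replace(char, "")


sample = "#12 and #7"
print(remove_by_split_join(sample))  # 12 and 7
print(remove_by_replace(sample))     # 12 and 7
```

If you switch the expression language to Python / Jython in the expression editor, the second approach becomes a one-liner: return value.replace("#", "")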


3 comments:

  1. Hi, great Blog - do you know how I can remove or replace this html expression?

    a href="http://www.tweetdeck.com" rel="nofollow">TweetDeck</a

    I just need the "TweetDeck" information.

    Thanks

  2. I suppose that your column contains different URLs, so I would do this in 3 steps.

    1. Split your cell content at the > sign into two columns (see this post for more details: http://googlerefine.blogspot.com/2011/06/split-cell-content-into-multiple-column.html)

    2. Remove the column containing a href="http://www.tweetdeck.com" rel="nofollow"

    3. On the remaining column, run the following expression: replace(value, "", "")

    2. Run the following expression: replace(value, "</a", "")
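
The split-then-clean idea in this reply can be sketched in plain Python as follows. This is only an illustration under assumptions: each cell is assumed to hold a single link, with no > sign inside the attribute values, and the function name link_text is hypothetical.

```python
# Sketch of the approach described above: split at the first ">" and
# strip the closing-tag remnant. Assumes one link per cell.

def link_text(value):
    """Return the visible text of a single link such as ...>TweetDeck</a"""
    # Split once at the first ">" to separate the opening tag from the rest.
    parts = value.split(">", 1)
    if len(parts) < 2:
        # No ">" found: nothing to strip, return the cell unchanged.
        return value
    remainder = parts[1]
    # Remove the closing tag, whether or not its final ">" is present.
    return remainder.replace("</a>", "").replace("</a", "")


cell = 'a href="http://www.tweetdeck.com" rel="nofollow">TweetDeck</a'
print(link_text(cell))  # TweetDeck
```

The same function also handles a cell that still has its outer angle brackets, because the split only looks at the first > sign.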
