Extract HTML tag containing a string in OpenRefine?

There is nothing to add to the title. This is what I am trying to do. Any suggestions?

I looked at the docs on GitHub and searched widely on Google.

The best I got:

value.parseHtml().select('p[contains('xyz')]')

This leads to a syntax error.

2 answers

The select syntax is based on the selector syntax in jsoup, the Java HTML parser that OpenRefine uses ( http://jsoup.org/cookbook/extracting-data/selector-syntax )

In this case, I believe you need the syntax:

value.parseHtml().select("p:contains(xyz)")
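
Note that select() returns an array of matching elements rather than a string, so the result usually has to be turned into text before it can be stored in a cell. A minimal sketch, assuming you want the full markup of every matching p element joined with newlines (the variable name e is just a placeholder):

forEach(value.parseHtml().select("p:contains(xyz)"), e, e.toString()).join("\n")

Here toString() should give the element including its own tags, and htmlText() can be swapped in if only the text is needed. As far as I know, :contains is case-insensitive and also matches text inside descendant elements; jsoup's :containsOwn(xyz) limits the match to the element's own text.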

Owen
