Python regex vimeo id output from url

Question

Python regex vimeo id output from url

embed_url = 'http://www.vimeo.com/52422837' response = re.search(r'^(http://)?(www\.)?(vimeo\.com/)?([\/\d+])', embed_url) return response.group(4)

Answer:

I was hoping for

52422837

Does anyone have an idea? I'm really bad with regular expressions: S

+6

python url regex vimeo

Jeroen gerits Mar 08 '13 at 14:51

source share

4 answers

Do not reinvent the wheel!

 >>> import urlparse >>> urlparse.urlparse('http://www.vimeo.com/52422837') ParseResult(scheme='http', netloc='www.vimeo.com', path='/52422837', params='', query='', fragment='') >>> urlparse.urlparse('http://www.vimeo.com/52422837').path.lstrip("/") '52422837'

+10

Colonel panic Mar 08 '13 at 14:57

source share

To get everything after the last slash (assuming it is), the following regular expression should do this:

 [^/]*$

(Greedily captures everything to the end, which is not a slash).

+1

Steve chambers Mar 08 '13 at 14:56

source share

Have you tried to finish your regular expression with a dollar sign ($)?

0

Yann Mar 08 '13 at 14:54

source share

Martijn pieters · Accepted Answer · 2013-03-08T14:53:14+0000

Use \d+ (without brackets) according to the literal trait + numbers:

 response = re.search(r'^(http://)?(www\.)?(vimeo\.com/)?(\d+)', embed_url)

Result:

 >>> re.search(r'^(http://)?(www\.)?(vimeo\.com/)?(\d+)', embed_url).group(4) '52422837'

You used a group of characters ( [...] ) where it was not necessary. The pattern [\/\d+] matches exactly one of / , + or a digit.

Python regex vimeo id output from url

More articles: