Regular Expressions 101

Sponsors

Library entries

1
python

phone

This regex is decision phone number
Submitted by anonymous - a year ago
1
python

Test end

Test end
Submitted by anonymous - 10 months ago
1
python

link findall

Submitted by Kerem Göksu - 10 months ago
1
python

Don't match numbers longer than 5 length

using ^ and $
Submitted by anonymous - 8 months ago
1
python

ss001

Rep
Submitted by anonymous - 8 months ago
1
python

Remove HTML tags and comments

Remove HTML tags and comments
Submitted by Smit Patel - 7 months ago
1
python

Parse Data Fields

Parsing example
Submitted by anonymous - 4 months ago
1
python

ds100reg2

Data 100 Regular Expressions, adapted from Princeton
Submitted by anonymous - 4 months ago
1
python

Get domain name from URL

Get domain name from URL
Submitted by anonymous - 4 months ago
1
python

Extract time from text

Extract time from text
Submitted by anonymous - 4 months ago
1
python

CRON

Validades cron job time specification
Submitted by anonymous - 3 months ago

Distinguish torrent files (series vs movies)

Vote

80

Regular Expression
python

"
^ # get the title of this movie or series (?P<title> [-\w'"]+ # match separator to later replace into correct title (?P<separator> [ .] ) # note this *must* be lazy for the engine to work ltr not rtl (?: [-\w'"]+\2 )*? ) # start of movie vs serie check (?: # if this is an episode, lets match the season # number one way or another. if not, the year # of the movie (?: # series. can be a lot prettier if we used perl regex... # make sure this is not just a number in the title followed by our separator. # like, iron man 3 2013 or my.fictional.24.series (?! \d+ \2 ) # now try to match the season number (?: s (?: eason \2? )? )? (?P<season> \d\d? ) # needed to validate the last token is a dot, or whatever. (?: e\d\d? (?:-e?\d\d?)? | x\d\d? )? | # this is likely a movie, match the year (?P<year> [(\]]?\d{4}[)\]]? ) ) # make sure this ends with the separator, otherwise we # might be in the middle of something like "1080p" (?=\2) | # if we get here, this is likely still a movie. # match until one of the keywords (?= BOXSET | XVID | DIVX | LIMITED | UNRATED | PROPER | DTS | AC3 | AAC | BLU[ -]?RAY | HD(?:TV|DVD) | (?:DVD|B[DR]|WEB)RIP | \d+p | [hx]\.?264 ) )
"
gimx

Description

Loading markdown...
Submitted by Firas Dib - 8 years ago