Regular Expressions 101

Save & Share

Flavor

  • PCRE2 (PHP >=7.3)
  • PCRE (PHP <7.3)
  • ECMAScript (JavaScript)
  • Python
  • Golang
  • Java 8

Function

  • Match
  • Substitution
  • List
  • Unit Tests
"
^\$(?P<guid>[\w\d-]+).(.|\n)(?P<nome>.+)..(?P<url>https://[\w\-\.\/]+)..(?P<desc>\w.*)\(.+$
"
gm
^ asserts position at start of a line
\$ matches the character $ with index 3610 (2416 or 448) literally (case sensitive)
Named Capture Group guid
(?P<guid>[\w\d-]+)
Match a single character present in the list below
[\w\d-]
+ matches the previous token between one and unlimited times, as many times as possible, giving back as needed (greedy)
\w matches any word character (equivalent to [a-zA-Z0-9_])
\d matches a digit (equivalent to [0-9])
- matches the character - with index 4510 (2D16 or 558) literally (case sensitive)
. matches any character (except for line terminators)
2nd Capturing Group
(.|\n)
1st Alternative
.
. matches any character (except for line terminators)
2nd Alternative
\n
\n matches a line-feed (newline) character (ASCII 10)
Named Capture Group nome
(?P<nome>.+)
.
matches any character (except for line terminators)
+ matches the previous token between one and unlimited times, as many times as possible, giving back as needed (greedy)
. matches any character (except for line terminators)
. matches any character (except for line terminators)
Named Capture Group url
(?P<url>https://[\w\-\.\/]+)
https://
matches the characters https:// literally (case sensitive)
Match a single character present in the list below
[\w\-\.\/]
+ matches the previous token between one and unlimited times, as many times as possible, giving back as needed (greedy)
\w matches any word character (equivalent to [a-zA-Z0-9_])
\- matches the character - with index 4510 (2D16 or 558) literally (case sensitive)
\. matches the character . with index 4610 (2E16 or 568) literally (case sensitive)
\/ matches the character / with index 4710 (2F16 or 578) literally (case sensitive)
. matches any character (except for line terminators)
. matches any character (except for line terminators)
Named Capture Group desc
(?P<desc>\w.*)
\w matches any word character (equivalent to [a-zA-Z0-9_])
.
matches any character (except for line terminators)
* matches the previous token between zero and unlimited times, as many times as possible, giving back as needed (greedy)
\( matches the character ( with index 4010 (2816 or 508) literally (case sensitive)
.
matches any character (except for line terminators)
+ matches the previous token between one and unlimited times, as many times as possible, giving back as needed (greedy)
$ asserts position at the end of a line
Global pattern flags
g modifier: global. All matches (don't return after first match)
m modifier: multi line. Causes ^ and $ to match the begin/end of each line (not only begin/end of string)
Your regular expression does not match the subject string.

Regular Expression
No Match

r"
"
gm

Test String