Regular Expressions 101

Save & Share

Flavor

  • PCRE2 (PHP >=7.3)
  • PCRE (PHP <7.3)
  • ECMAScript (JavaScript)
  • Python
  • Golang
  • Java 8

Function

  • Match
  • Substitution
  • List
  • Unit Tests
`
^([a-zA-Z]*)[\s\t\x{00a0}]*\((["'\\\w-\/\s\t\x{00a0}]*)\)[\s\t\x{00a0}]*\:?[\s\t\x{00a0}]*(.*)
`
g
^ asserts position at start of the string
1st Capturing Group
([a-zA-Z]*)
Match a single character present in the list below
[a-zA-Z]
* matches the previous token between zero and unlimited times, as many times as possible, giving back as needed (greedy)
a-z matches a single character in the range between a (index 97) and z (index 122) (case sensitive)
A-Z matches a single character in the range between A (index 65) and Z (index 90) (case sensitive)
Match a single character present in the list below
[\s\t\x{00a0}]
* matches the previous token between zero and unlimited times, as many times as possible, giving back as needed (greedy)
\s matches any whitespace character (equivalent to [\r\n\t\f\v ])
\t matches a tab character (ASCII 9)
\x{00a0} matches the character   with index A016 (16010 or 2408) literally (case sensitive)
\( matches the character ( with index 4010 (2816 or 508) literally (case sensitive)
2nd Capturing Group
(["'\\\w-\/\s\t\x{00a0}]*)
Match a single character present in the list below
["'\\\w-\/\s\t\x{00a0}]
* matches the previous token between zero and unlimited times, as many times as possible, giving back as needed (greedy)
"'
matches a single character in the list "' (case sensitive)
\\ matches the character \ with index 9210 (5C16 or 1348) literally (case sensitive)
\w matches any word character (equivalent to [a-zA-Z0-9_])
- matches the character - with index 4510 (2D16 or 558) literally (case sensitive)
This hyphen is treated literally, which might be confusing for others. Consider escaping it or placing at the start or end of the class!
\/ matches the character / with index 4710 (2F16 or 578) literally (case sensitive)
\s matches any whitespace character (equivalent to [\r\n\t\f\v ])
\t matches a tab character (ASCII 9)
\x{00a0} matches the character   with index A016 (16010 or 2408) literally (case sensitive)
\) matches the character ) with index 4110 (2916 or 518) literally (case sensitive)
Match a single character present in the list below
[\s\t\x{00a0}]
* matches the previous token between zero and unlimited times, as many times as possible, giving back as needed (greedy)
\s matches any whitespace character (equivalent to [\r\n\t\f\v ])
\t matches a tab character (ASCII 9)
\x{00a0} matches the character   with index A016 (16010 or 2408) literally (case sensitive)
\:
matches the character : with index 5810 (3A16 or 728) literally (case sensitive)
? matches the previous token between zero and one times, as many times as possible, giving back as needed (greedy)
Match a single character present in the list below
[\s\t\x{00a0}]
* matches the previous token between zero and unlimited times, as many times as possible, giving back as needed (greedy)
\s matches any whitespace character (equivalent to [\r\n\t\f\v ])
\t matches a tab character (ASCII 9)
\x{00a0} matches the character   with index A016 (16010 or 2408) literally (case sensitive)
3rd Capturing Group
(.*)
.
matches any character (except for line terminators)
* matches the previous token between zero and unlimited times, as many times as possible, giving back as needed (greedy)
Global pattern flags
g modifier: global. All matches (don't return after first match)
Your regular expression does not match the subject string.

Regular Expression
No Match

`
`
g

Test String