Regular Expressions 101

Save & Share

Flavor

  • PCRE2 (PHP >=7.3)
  • PCRE (PHP <7.3)
  • ECMAScript (JavaScript)
  • Python
  • Golang
  • Java 8

Function

  • Match
  • Substitution
  • List
  • Unit Tests
/
<([^>\s]+)[^>]*>(?:\s*(?:br \/>| |&nbsp;|&thinsp;|&ensp;|&emsp;|&#8201;|&#8194;|&#8195;)\s*)*<\/\1>
/
gm
< matches the character < with index 6010 (3C16 or 748) literally (case sensitive)
1st Capturing Group
([^>\s]+)
Match a single character not present in the list below
[^>\s]
+ matches the previous token between one and unlimited times, as many times as possible, giving back as needed (greedy)
> matches the character > with index 6210 (3E16 or 768) literally (case sensitive)
\s matches any whitespace character (equivalent to [\r\n\t\f\v ])
Match a single character not present in the list below
[^>]
* matches the previous token between zero and unlimited times, as many times as possible, giving back as needed (greedy)
> matches the character > with index 6210 (3E16 or 768) literally (case sensitive)
> matches the character > with index 6210 (3E16 or 768) literally (case sensitive)
Non-capturing group
(?:\s*(?:br \/>| |&nbsp;|&thinsp;|&ensp;|&emsp;|&#8201;|&#8194;|&#8195;)\s*)*
* matches the previous token between zero and unlimited times, as many times as possible, giving back as needed (greedy)
\s
matches any whitespace character (equivalent to [\r\n\t\f\v ])
* matches the previous token between zero and unlimited times, as many times as possible, giving back as needed (greedy)
Non-capturing group
(?:br \/>| |&nbsp;|&thinsp;|&ensp;|&emsp;|&#8201;|&#8194;|&#8195;)
1st Alternative
br \/>
br
matches the characters br literally (case sensitive)
\/ matches the character / with index 4710 (2F16 or 578) literally (case sensitive)
> matches the character > with index 6210 (3E16 or 768) literally (case sensitive)
2nd Alternative
matches the character with index 3210 (2016 or 408) literally (case sensitive)
3rd Alternative
&nbsp;
&nbsp;
matches the characters &nbsp; literally (case sensitive)
4th Alternative
&thinsp;
&thinsp;
matches the characters &thinsp; literally (case sensitive)
5th Alternative
&ensp;
&ensp;
matches the characters &ensp; literally (case sensitive)
6th Alternative
&emsp;
&emsp;
matches the characters &emsp; literally (case sensitive)
7th Alternative
&#8201;
&#8201;
matches the characters &#8201; literally (case sensitive)
8th Alternative
&#8194;
&#8194;
matches the characters &#8194; literally (case sensitive)
9th Alternative
&#8195;
\s
matches any whitespace character (equivalent to [\r\n\t\f\v ])
* matches the previous token between zero and unlimited times, as many times as possible, giving back as needed (greedy)
< matches the character < with index 6010 (3C16 or 748) literally (case sensitive)
\/ matches the character / with index 4710 (2F16 or 578) literally (case sensitive)
\1 matches the same text as most recently matched by the 1st capturing group
> matches the character > with index 6210 (3E16 or 768) literally (case sensitive)
Global pattern flags
g modifier: global. All matches (don't return after first match)
m modifier: multi line. Causes ^ and $ to match the begin/end of each line (not only begin/end of string)
Your regular expression does not match the subject string.Try launching the debugger to find out why.

Regular Expression
No Match

/
/
gm

Test String

Substitution

Processing...