Regular Expressions 101

Save & Share

Current Version: 2
Update Regex
ctrl+⇧+s
Save new Regex
ctrl+s
Add to Community Library

Flavor

PCRE2 (PHP)
ECMAScript (JavaScript)
Python
Golang
Java
.NET 7.0 (C#)
Rust
PCRE (Legacy)
Regex Flavor Guide

Function

Match
Substitution
List
Unit Tests

Tools

Regular Expression
Processing...

Test String

Code Generator

Language

Generated Code

import re

regex = re.compile(r"""
	(?(DEFINE)
	  (?<XHTML>
	    (?&comment)*\s*
	    (?&DOCTYPE)
	    (?&comment)*\s*
	    (?&HTML)
	    (?&comment)*\s*
	  )
	
	  # HTML element
	  (?<HTML>
	    <html(?&attrs)?>(?&content)<\/html>
	  )
	
	  # Match content
	  (?<content>
	    \s*
	    (?:
	      ((?&tag) | [^<>]+)\s*
	    )*
	  )
	
	  # General tag
	  (?<tag>
	     <((?&tagname))(?&attrs)?\s*(?:
	       \/>|
	       >\s*(?&content)\s*
	       <\/\g'-1'>
	     )|(?&comment)
	  )
	
	  # Attributes
	  (?<attrs>\s+
	    # The name
	    (?&keyword) (
	      \s*=\s*
	      (?:
	        (?&keyword)|(?&string)
	      )
	    )?
	    (?&attrs)?
	  )
	
	  (?<string>
	    "(?:\\.|.)+?"|
	    '(?:\\.|.)+?'
	  )
	
	  (?<comment>
	    \s*<!--(.+?)-->\s*
	  )
	
	  # Match keyword
	  (?<keyword>[^\s\/>"'=]+)
	  # Match tag name
	  (?<tagname>(?!xml)[A-Za-z_][A-Za-z\d_.-]*)
	
	  # DOCTYPE expression
	  (?<DOCTYPE>
	     <!doctype\s+x?html\s*(public\s*(?&string))?(\s+(?&string))*>
	  )
	)
	
	^\s*(?&XHTML)\s*$
	""", flags=re.VERBOSE | re.IGNORECASE | re.DOTALL)

test_str = ("<!-- test -->\n"
	"<!DOCTYPE html \n"
	"     PUBLIC \"-//W3C//DTD XHTML 1.0 Frameset//EN\"\n"
	"     \"http://www.w3.org/TR/xhtml1/DTD/xhtml1-frameset.dtd\">\n"
	"  <html goat:style=\"''inva'lid css I know\">textttt<head></head><body><div class=\"onfoodstamps\"><div class=\"upper\">foo<p>ayyy</p>bar</div>baz</div><br/></head></html>")

matches = regex.finditer(test_str)

for match_num, match in enumerate(matches, start=1):
    print(f"Match {match_num} was found at {match.start()}-{match.end()}: {match.group()}")
    
    for group_num, group in enumerate(match.groups(), start=1):
        print(f"Group {group_num} found at {match.start(group_num)}-{match.end(group_num)}: {group}")

Please keep in mind that these code samples are automatically generated and are not guaranteed to work. If you find any syntax errors, feel free to submit a bug report. For a full regex reference for Python, please visit: https://docs.python.org/3/library/re.html

Regular Expressions 101

Save & Share

Flavor

Function

Tools

Explanation

Match Information

Quick Reference

Regular Expression
Processing...

Test String

Code Generator

Language

Generated Code

Save & Share

Flavor

Function

Tools

Explanation

Match Information

Quick Reference

Regular ExpressionProcessing...

Test String

Regular Expression
Processing...