Regular Expressions 101

Save & Manage Regex

Current Version: 2
Save & Share
Community Library

Flavor

PCRE2 (PHP)
ECMAScript (JavaScript)
Python
Golang
Java
.NET 7.0 (C#)
Rust
PCRE (Legacy)
Regex Flavor Guide

Function

Match
Substitution
List
Unit Tests

Tools

Regular Expression
Processing...

Test String

Code Generator

Language

Generated Code

import re

regex = re.compile(r"(?:([-‐‑‒–—―−⁃﹘﹣－])|(?:&(?:(?:#x(2d|201[0-5]|2212|2043|fe58|fe63|ff0d))|(?:#(45|820[89]|821[0123]|8722|8259|65112|65123|65293))|(hyphen|[nm]?dash|hybull|horbar|minus));?))", flags=re.MULTILINE | re.UNICODE)

test_str = ("This captures an entity even if it lacks the ';', which is commonly encountered in the wild.\n\n"
	"kbdash  &#x2d;                          &#45;           &#x2d   &#45    -;       -\n"
	"dash    &#x2010;        &dash;          &#8208;         &#x2010 &#8208  ‐;       ‐\n"
	"hyphen  &#x2011;        &hyphen;        &#8209;         &#x2011 &#8209  ‑;       ‑\n"
	"figure  &#x2012;                        &#8210;         &#x2012 &#8210  ‒;       ‒\n"
	"em      &#x2013;        &ndash;         &#8211;         &#x2013 &#8211  –;       –\n"
	"en      &#x2014;        &mdash;         &#8212;         &#x2014 &#8212  —;       —\n"
	"horbar  &#x2015;        &horbar;        &#8213;         &#x2015 &#8213  ―;       ―\n"
	"minus   &#x2212;        &minus;         &#8722;         &#x2212 &#8722  −;       −\n"
	"hybull  &#x2043;        &hybull;        &#8259;         &#x2043 &#8259  ⁃;       ⁃\n"
	"fe58    &#xfe58;                        &#65112;        &#xfe58 &#65112 ﹘;      ﹘\n"
	"fe63    &#xfe63;                        &#65123;        &#xfe63 &#65123 ﹣;      ﹣\n"
	"ff0d    &#xff0d;                        &#65293;        &#xff0d &#65293 －;      －\n\n"
	"(?:([-‐‑‒–—―−⁃﹘﹣－])|(?:&(?:(?:#x(2d|201[0-5]|2212|2043|fe58|fe63|ff0d))|(?:#(45|820[89]|821[0123]|8722|8259|65112|65123|65293))|(hyphen|[nm]?dash|hybull|horbar|minus));?))")

matches = regex.finditer(test_str)

for match_num, match in enumerate(matches, start=1):
    print(f"Match {match_num} was found at {match.start()}-{match.end()}: {match.group()}")
    
    for group_num, group in enumerate(match.groups(), start=1):
        print(f"Group {group_num} found at {match.start(group_num)}-{match.end(group_num)}: {group}")

Please keep in mind that these code samples are automatically generated and are not guaranteed to work. If you find any syntax errors, feel free to submit a bug report. For a full regex reference for Python, please visit: https://docs.python.org/3/library/re.html

Regular Expressions 101

Save & Manage Regex

Flavor

Function

Tools

Explanation

Match Information

Quick Reference

Regular Expression
Processing...

Test String

Code Generator

Language

Generated Code

Save & Manage Regex

Flavor

Function

Tools

Explanation

Match Information

Quick Reference

Regular ExpressionProcessing...

Test String

Regular Expression
Processing...