Regular Expressions 101

Community Patterns

22

Get path from any text

Created·2023-01-31 14:38
Updated·2023-07-23 20:17
Flavor·PCRE2 (PHP)
Recommended·
Get path (windows style) from any type of text (error message, e-mail corps ...), quoted or not. THIS IS THE SINGLE LINE VERSION ! If you want understand how it work or edit it, go https://regex101.com/r/7o2fyy Relative path are not supported The goal is to catch what "Look like" a path. See the limitations UNC path and prefix path like //./], [//?/] or [//./UNC/] are allowed some url path like [file:///C:/] or [file://] are allowed Catch path quoted with ["] and [']. But these quotes are include with the catch Quoted path is not concerned by limitations Limitations : (only unquoted path) [dot] and [space] is allowed, but not in a row [dot+space] or [space+dot at end of file name isn't catched INSIDE A NAME FILE (or last directory if it is a path to a directory) : [comma] is not supported (it stop the catch) after a first [dot], any [space] stop the catch after a [space], catch is stoped if next character is not a [letter], [digit] or [-] so, double [space] stop the catch Compatibility compatible PCRE, PCRE2 AutoHotkey : don't forget to escape "%" in "`%" /!\ Powershell and .Net /!\\ : this regex need some modification to be interpreted by powershell. You have to replace each (?&CapturGroupName) by \k. Use this powershell code to do this replacement : ` $powershellRegex = @' [Put here the regex to replace (?&CapturGroupName) with \k] '@ -replace '\(\?&(\w+)\)', '\k' ` This example code must return : [Put here the regex to replace \k with \k]
Submitted by nitrateag

Community Library Entry

1

Regular Expression
Created·2024-08-25 15:52
Updated·2024-08-26 08:27
Flavor·JavaScript

/
^[a-z0-9]+(?:[.\-_'+][a-z0-9]+)*@([a-z0-9]+(?:[-][a-z0-9]+)*\.)+[a-z]{2,}$
/
igm
Open regex in editor

Description

"/" - start of regular expression "^" - anchor to beginning of string

Group: [a-z0-9]+ "[a-z0-9]" - Latin character or digit "+" - occurring ONE or more times

Group: (?:[.-][a-z0-9]+)* "(" - start of group "?:[.-_'+][a-z0-9]" - first character of group is allowed to be single character "." OR "-", followed by Latin character or digit "+" - occurring ONE or more times ")" - end of group "*" - group occurs ZERO or more times

"@" - mandatory single at sign

Second-level domain group and below: ([a-z0-9]+(?:[-][a-z0-9]+).)+ "(" - start groups "[a-z0-9]" - Latin character or digit "+" - occurring one or more times "(" - beginning of subgroup "?:[-][a-z0-9]" - single hyphen mandatory for subgroup, followed by Latin character or digit "+" - one or more times ")" - end of subgroup "" - subgroup occurs ZERO or more times "." - mandatory single dot ")" - end of group "+" - occurring ONE or more times

Top-level domain group: [a-z]{2,}$ "[a-z]" - Latin character "{2,}" - occurring TWO or more times

"$" - anchor to end of line "/" - end of regular expression "i" - expression case-insensitive flag (makes ranges [a-z] and [a-zA-Z] equivalent)

"gm" - only for success validation multiline in Test string section, these flags not needed in checking once string value

Submitted by Diableros [https://github.com/Diableros]