💾 Archived View for gem.sdf.org › s.kaplan › cheatsheets › programming-languages › regex.md captured on 2024-06-16 at 12:47:01.

View Raw

More Information

⬅️ Previous capture (2023-09-28)

-=-=-=-=-=-=-

# Regular Expressions (Regex) Cheatsheet

## Overview
- Regular expressions (regex or regexp) are patterns used to match character combinations in strings.
- They are often used in programming languages for tasks such as search and replace, input validation, and parsing.
- Regex syntax can vary slightly between programming languages and tools.

## Basic Syntax
- Characters in a regex match themselves, unless they are special characters with a specific meaning.
- Special characters can be escaped using a backslash (`\`) to match the character itself.
- Character classes can be used to match a range of characters.
- Quantifiers can be used to match a specific number of characters.

## Examples
### Example 1: Matching an Email Address

// Example regex to match an email address

[\w.%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}

This regex will match an email address in the format `username@domain.com`.

### Example 2: Matching a Phone Number

// Example regex to match a phone number

\+?\d{1,3}[-. ]?\d{3}[-. ]?\d{4}

This regex will match a phone number in a variety of formats, including `123-456-7890`, `123.456.7890`, and `+1 123-456-7890`.

### Example 3: Matching a URL

// Example regex to match a URL

https?://[\w\-\.]+(:\d+)?(/[\w/_\.]*)?

This regex will match a URL in a variety of formats, including `http://www.example.com`, `https://example.com/path/to/page.html`, and `http://www.example.com:8080/path/to/page.html`.

## Special Characters
- `.`: Matches any single character except a newline.
- `^`: Matches the start of a string.
- `


gemini - kennedy.gemi.dev




: Matches the end of a string.
- `*`: Matches zero or more occurrences of the previous character.
- `+`: Matches one or more occurrences of the previous character.
- `?`: Matches zero or one occurrence of the previous character.
- `|`: Matches either the character on the left or the character on the right.
- `[]`: Matches any one character within the brackets.
- `()`: Groups characters together.

## Character Classes
- `\d`: Matches any digit character.
- `\w`: Matches any word character (alphanumeric and underscore).
- `\s`: Matches any whitespace character.
- `\D`: Matches any non-digit character.
- `\W`: Matches any non-word character.
- `\S`: Matches any non-whitespace character.

## Quantifiers
- `{n}`: Matches exactly n occurrences of the previous character.
- `{n,}`: Matches n or more occurrences of the previous character.
- `{n,m}`: Matches between n and m occurrences of the previous character.

## Advanced Syntax
- Lookaheads (`(?=...)`) and lookbehinds (`(?<=...)`) can be used to match characters based on what precedes or follows them.
- Backreferences (`\1`, `\2`, etc.) can be used to refer to previously matched groups.
- Flags can be used to modify the behavior of the regex (e.g. case-insensitive matching).

## Resources
- [RegexOne](https://regexone.com/) - Interactive regex tutorial
- [Regular-Expressions.info](https://www.regular-expressions.info/) - Comprehensive regex guide
- [Regex101](https://regex101.com/) - Online regex tester and debugger