From 7e8aced3e396c43a10d23c523205aea424885742 Mon Sep 17 00:00:00 2001
From: marko1olo
Date: Sat, 6 Jun 2026 08:07:48 +0400
Subject: [PATCH] docs: clarify regex-lite word boundaries

Fixes #1356
---
 regex-lite/src/lib.rs | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/regex-lite/src/lib.rs b/regex-lite/src/lib.rs
index 22b1141449..3a84589a3d 100644
--- a/regex-lite/src/lib.rs
+++ b/regex-lite/src/lib.rs
@@ -364,6 +364,8 @@ The precise set of differences at the syntax level:
 
 * The Perl character classes are limited to ASCII codepoints. That is,
 `\d` is `[0-9]`, `\s` is `[\t\n\v\f\r ]` and `\w` is `[0-9A-Za-z_]`.
+* Word boundary assertions are also limited to ASCII codepoints. For example,
+`\b` is defined in terms of `\w` which is itself ASCII-only.
 * Unicode character classes of the form `\p{...}` and `\P{...}` are not
 supported at all. Note though that things like `[^β]` are still supported and
 will match any Unicode scalar value except for `β`.
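
Reviewer note: the practical effect of the added bullet can be illustrated with a small std-only sketch. This is not regex-lite's implementation; the helper names `is_ascii_word_byte` and `is_ascii_word_boundary` are hypothetical and exist only to show what "`\b` is defined in terms of an ASCII-only `\w`" implies for non-ASCII text.

```rust
/// Hypothetical helper: true for the bytes an ASCII-only `\w` matches,
/// i.e. `[0-9A-Za-z_]`.
fn is_ascii_word_byte(b: u8) -> bool {
    b.is_ascii_alphanumeric() || b == b'_'
}

/// Hypothetical helper: true if an ASCII-only `\b` would hold at byte
/// offset `at` in `haystack`. A boundary exists when exactly one of the
/// neighbouring bytes is a word byte. Offsets past the end of the haystack
/// are treated as having no byte after them.
fn is_ascii_word_boundary(haystack: &str, at: usize) -> bool {
    let bytes = haystack.as_bytes();
    // Byte immediately before the offset, if any.
    let before = at > 0 && at <= bytes.len() && is_ascii_word_byte(bytes[at - 1]);
    // Byte immediately at the offset, if any.
    let after = at < bytes.len() && is_ascii_word_byte(bytes[at]);
    before != after
}
```

Under this sketch, `"café"` has a boundary at byte offset 3 (between `f` and `é`) because `é` is not in `[0-9A-Za-z_]`, and no boundary at the end of the string because the final byte of `é` is not a word byte either. A Unicode-aware `\b`, such as the default in the `regex` crate, would instead treat `é` as a word character.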