You signed in with another tab or window.Reload to refresh your session.You signed out in another tab or window.Reload to refresh your session.You switched accounts on another tab or window.Reload to refresh your session.Dismiss alert
* Use Presidio for masking* Improve PII to handle encoded content* Reject large encoded content as DOS* Handle hex errors* Fix structured output masking path
Copy file name to clipboardExpand all lines: docs/ref/checks/pii.md
+45-6Lines changed: 45 additions & 6 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -2,22 +2,33 @@
2
2
3
3
Detects personally identifiable information (PII) such as SSNs, phone numbers, credit card numbers, and email addresses using Microsoft's[Presidio library](https://microsoft.github.io/presidio/). Will automatically mask detected PII or block content based on configuration.
4
4
5
+
**Advanced Security Features:**
6
+
7
+
-**Unicode normalization**: Prevents bypasses using fullwidth characters (@) or zero-width spaces
8
+
-**Encoded PII detection**: Optionally detects PII hidden in Base64, URL-encoded, or hex strings
9
+
-**URL context awareness**: Detects emails in query parameters (e.g.,`GET /api?user=john@example.com`)
10
+
-**Custom recognizers**: Includes CVV/CVC codes and BIC/SWIFT codes beyond Presidio defaults
-**`entities`** (required): List of PII entity types to detect. See the full list of[supported entities](https://microsoft.github.io/presidio/supported_entities/).
27
+
-**`entities`** (required): List of PII entity types to detect. Includes:
28
+
- Standard Presidio entities: See the full list of[supported entities](https://microsoft.github.io/presidio/supported_entities/)