Redacts PII (personally identifiable information). This package uses the Stanford NER package to identify and scrub PII data. It redacts email addresses, SSNs, driver's license numbers, and passport numbers. It aggressively removes any number with more than 4 consecutive digits. Use AddToWhitelist to whitelist any pattern.
Musfiqur01/PIIRedact
Redacts PII (personally identifiable information). This package uses the Stanford NER package to identify and scrub names, organizations, and locations. It also redacts email addresses, SSNs, driver's license numbers, and passport numbers. It aggressively removes any number with more than 3 consecutive digits. Use AddToWhitelist to whitelist any pattern. To use this package, you must have Java installed.
Install the NuGet package to get started.
The usage is:
    var redactor = new PIIRedactor();
    var redactedData = redactor.GetRedactedData("My name is John Doe. My email is m@n.o");
The redacted string looks like: My name is xxxx xxx. My email is x@x.x
If you want to whitelist a pattern, e.g. any number with 6-8 consecutive digits, do it as follows (note the verbatim string, so that \b and \d reach the regex engine unescaped):
    redactor.AddToWhitelist(new RegexFinder(@"\b\d{6,8}\b"));
Similarly, to add a new redactable pattern, e.g. one that redacts any word of 6-8 consecutive digits:
    redactor.AddToWhitelist(new RegexFinder(@"\b\d{6,8}\b"));
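To make the interaction between the digit rule and a whitelist pattern concrete, here is a minimal, self-contained sketch that uses only System.Text.RegularExpressions. It is not the PIIRedact implementation: the class DigitRedactionSketch and its Redact method are hypothetical names invented for illustration. The 4-or-more-digit threshold is an assumption taken from the "more than 3 consecutive digits" wording above, and the package's actual behavior may differ.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;
using System.Text.RegularExpressions;

// Hypothetical stand-in for illustration only; not part of PIIRedact.
public static class DigitRedactionSketch
{
    // Runs of 4 or more digits, i.e. "more than 3 consecutive digits" (assumed threshold).
    private static readonly Regex LongDigitRun = new Regex(@"\d{4,}");

    public static string Redact(string input, IEnumerable<Regex> whitelist)
    {
        // Collect every span of the input that is covered by a whitelist pattern.
        var allowed = whitelist
            .SelectMany(w => w.Matches(input).Cast<Match>())
            .ToList();

        return LongDigitRun.Replace(input, m =>
        {
            // Keep the digit run only if it lies entirely inside a whitelisted match.
            bool keep = allowed.Any(a => a.Index <= m.Index &&
                                         m.Index + m.Length <= a.Index + a.Length);
            return keep ? m.Value : new string('x', m.Length);
        });
    }

    public static void Main()
    {
        // Verbatim string (@"...") so that \b and \d are passed to the regex engine as written.
        var whitelist = new[] { new Regex(@"\b\d{6,8}\b") };

        Console.WriteLine(Redact("Ticket 1234567, card 1234567890123456", whitelist));
        // Prints: Ticket 1234567, card xxxxxxxxxxxxxxxx
    }
}
```

The 7-digit number survives because it matches the whitelisted 6-8 digit pattern, while the 16-digit number has no word boundary inside it, so the whitelist pattern cannot match any part of it and the whole run is masked.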
To use this package, you must have Java installed. If you don't want to use Java, disable entity redaction by setting IncludeEntityRedaction = false;
We use AppVeyor for versioning.
Musfiqur Rahman
See also the list of contributors who participated in this project.
This project is licensed under the MIT License - see the LICENSE.md file for details.
This project uses the Stanford NER package.