Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
seanpedrickcase
/
document_redaction
like
5
Running
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
a9dcd2e
document_redaction
/
tools
Ctrl+K
Ctrl+K
3 contributors
History:
79 commits
seanpedrickcase
Moved review components to give more space for page. Extended zoom limits. Existing redaction labels should now appear in new redaction box dropdown.
a9dcd2e
5 months ago
__init__.py
Safe
0 Bytes
Initial commit
about 1 year ago
auth.py
Safe
2.65 kB
You can now have output redaction boxes in grey according to an environment variable. Review files are now saved every time page is changed.
5 months ago
aws_functions.py
Safe
7.37 kB
Fixed issue where redactions were sometimes not removing text underneath boxes. You can now redact in different colours from review page
6 months ago
aws_textract.py
Safe
11.8 kB
Fix bug to identify all handwriting labels. Now only concatenates entity_type boxes if they have different labels.
5 months ago
cli_redact.py
Safe
4.73 kB
Allowed for overwriting of default output folder in choose_and_run_redactor function.
6 months ago
custom_csvlogger.py
Safe
6.65 kB
Created custom csvlogger to try to overcome AWS Lambda's incompatibility with multithread locks
6 months ago
custom_image_analyser_engine.py
Safe
39.6 kB
App should now resize images that are too large before sending to Textract. Textract now more robust to failure. Improved reliability of json conversion to review dataframe
5 months ago
data_anonymise.py
Safe
20.9 kB
Added support for AWS Comprehend for PII identification. OCR and detection results now written to main output
7 months ago
file_conversion.py
Safe
36.1 kB
Corrected image resizing method for instances where the image is very large.
5 months ago
file_redaction.py
Safe
102 kB
Corrected image resizing method for instances where the image is very large.
5 months ago
helper_functions.py
Safe
11.8 kB
App should now resize images that are too large before sending to Textract. Textract now more robust to failure. Improved reliability of json conversion to review dataframe
5 months ago
load_spacy_model_custom_recognisers.py
Safe
6.69 kB
Greatly improved regex for direct matching with custom entities
5 months ago
presidio_analyzer_custom.py
Safe
4.94 kB
Added support for AWS Comprehend for PII identification. OCR and detection results now written to main output
7 months ago
redaction_review.py
14.2 kB
Moved review components to give more space for page. Extended zoom limits. Existing redaction labels should now appear in new redaction box dropdown.
5 months ago