Spaces: seanpedrickcase/document_redaction
document_redaction/tools (revision 3187788)
3 contributors
History: 80 commits
Latest commit: seanpedrickcase, "Dropdown choices for redactions are now listed correctly" (3187788, 2 months ago)
File | Scan | Size | Last commit message | Updated
__init__.py | Safe | 0 Bytes | Initial commit | 11 months ago
auth.py | Safe | 2.65 kB | You can now have output redaction boxes in grey according to an environment variable. Review files are now saved every time page is changed. | 2 months ago
aws_functions.py | Safe | 7.37 kB | Fixed issue where redactions were sometimes not removing text underneath boxes. You can now redact in different colours from review page | 3 months ago
aws_textract.py | Safe | 11.8 kB | Fix bug to identify all handwriting labels. Now only concatenates entity_type boxes if they have different labels. | 2 months ago
cli_redact.py | Safe | 4.73 kB | Allowed for overwriting of default output folder in choose_and_run_redactor function. | 4 months ago
custom_csvlogger.py | Safe | 6.65 kB | Created custom csvlogger to try to overcome AWS Lambda's incompatibility with multithread locks | 4 months ago
custom_image_analyser_engine.py | Safe | 39.6 kB | App should now resize images that are too large before sending to Textract. Textract now more robust to failure. Improved reliability of json conversion to review dataframe | 2 months ago
data_anonymise.py | Safe | 20.9 kB | Added support for AWS Comprehend for PII identification. OCR and detection results now written to main output | 5 months ago
file_conversion.py | Safe | 36.1 kB | Corrected image resizing method for instances where the image is very large. | 2 months ago
file_redaction.py | Safe | 102 kB | Corrected image resizing method for instances where the image is very large. | 2 months ago
helper_functions.py | Safe | 11.8 kB | App should now resize images that are too large before sending to Textract. Textract now more robust to failure. Improved reliability of json conversion to review dataframe | 2 months ago
load_spacy_model_custom_recognisers.py | Safe | 6.69 kB | Greatly improved regex for direct matching with custom entities | 2 months ago
presidio_analyzer_custom.py | Safe | 4.94 kB | Added support for AWS Comprehend for PII identification. OCR and detection results now written to main output | 5 months ago
redaction_review.py | Safe | 15.1 kB | Dropdown choices for redactions are now listed correctly | 2 months ago