Spaces: seanpedrickcase/document_redaction
document_redaction/tools
3 contributors
History: 88 commits
Latest commit 3cecbfa by seanpedrickcase, about 1 month ago: Fixed issues with gradio version 5.16. Fixed fuzzy search error with pages with no data.
| File | Scan | Size | Last commit message | Updated |
|---|---|---|---|---|
| __init__.py | Safe | 0 Bytes | Initial commit | 11 months ago |
| auth.py | Safe | 2.92 kB | Ensured the text ocr outputs have no line breaks at end. Multi-line custom text searches now possible. Files for review sent from redact button. Fixed image redaction (not review yet). Can get user pool details from headers. Gradio update. | about 2 months ago |
| aws_functions.py | Safe | 7.37 kB | Fixed issue where redactions were sometimes not removing text underneath boxes. You can now redact in different colours from review page | 3 months ago |
| aws_textract.py | Safe | 11.8 kB | Fix bug to identify all handwriting labels. Now only concatenates entity_type boxes if they have different labels. | 2 months ago |
| cli_redact.py | Safe | 4.73 kB | Allowed for overwriting of default output folder in choose_and_run_redactor function. | 4 months ago |
| custom_csvlogger.py | Safe | 6.65 kB | Created custom csvlogger to try to overcome AWS Lambda's incompatibility with multithread locks | 4 months ago |
| custom_image_analyser_engine.py | Safe | 49.2 kB | Fuzzy match implementation for deny list. Added option to merge multiple review files. Review files from redaction step should now include text. | about 2 months ago |
| data_anonymise.py | Safe | 21.1 kB | Zoom and rotate features from forked gradio_annotation package. Fixed csv/xlsx redaction. Updated guide on creating exe. | about 1 month ago |
| file_conversion.py | Safe | 40.6 kB | Corrected image coordinate translation when the pdf mediabox is not the same size as pdf page rectangle | about 1 month ago |
| file_redaction.py | Safe | 94.8 kB | Fixed issues with gradio version 5.16. Fixed fuzzy search error with pages with no data. | about 1 month ago |
| find_duplicate_pages.py | Safe | 9.63 kB | Added tab to be able to compare pages across multiple documents and redact duplicates | about 2 months ago |
| helper_functions.py | Safe | 14 kB | Fuzzy match implementation for deny list. Added option to merge multiple review files. Review files from redaction step should now include text. | about 2 months ago |
| load_spacy_model_custom_recognisers.py | Safe | 13.7 kB | Fixed issues with gradio version 5.16. Fixed fuzzy search error with pages with no data. | about 1 month ago |
| presidio_analyzer_custom.py | Safe | 4.94 kB | Added support for AWS Comprehend for PII identification. OCR and detection results now written to main output | 5 months ago |
| redaction_review.py | Safe | 28.3 kB | Fixed issues with gradio version 5.16. Fixed fuzzy search error with pages with no data. | about 1 month ago |