Spaces: seanpedrickcase/document_redaction
a265560
document_redaction/tools
3 contributors
History: 83 commits
seanpedrickcase
Added tab to be able to compare pages across multiple documents and redact duplicates
a265560
2 months ago
__init__.py
Safe
0 Bytes
Initial commit
11 months ago
auth.py
Safe
2.92 kB
Ensured the text ocr outputs have no line breaks at end. Multi-line custom text searches now possible. Files for review sent from redact button. Fixed image redaction (not review yet). Can get user pool details from headers. Gradio update.
2 months ago
aws_functions.py
Safe
7.37 kB
Fixed issue where redactions were sometimes not removing text underneath boxes. You can now redact in different colours from review page
3 months ago
aws_textract.py
Safe
11.8 kB
Fix bug to identify all handwriting labels. Now only concatenates entity_type boxes if they have different labels.
2 months ago
cli_redact.py
Safe
4.73 kB
Allowed for overwriting of default output folder in choose_and_run_redactor function.
4 months ago
custom_csvlogger.py
Safe
6.65 kB
Created custom csvlogger to try to overcome AWS Lambda's incompatibility with multithread locks
4 months ago
custom_image_analyser_engine.py
Safe
47.6 kB
Ensured the text ocr outputs have no line breaks at end. Multi-line custom text searches now possible. Files for review sent from redact button. Fixed image redaction (not review yet). Can get user pool details from headers. Gradio update.
2 months ago
data_anonymise.py
Safe
20.9 kB
Added support for AWS Comprehend for PII identification. OCR and detection results now written to main output
5 months ago
file_conversion.py
Safe
36.4 kB
Ensured the text ocr outputs have no line breaks at end. Multi-line custom text searches now possible. Files for review sent from redact button. Fixed image redaction (not review yet). Can get user pool details from headers. Gradio update.
2 months ago
file_redaction.py
91.7 kB
Added tab to be able to compare pages across multiple documents and redact duplicates
2 months ago
find_duplicate_pages.py
Safe
9.63 kB
Added tab to be able to compare pages across multiple documents and redact duplicates
2 months ago
helper_functions.py
12.7 kB
Added tab to be able to compare pages across multiple documents and redact duplicates
2 months ago
load_spacy_model_custom_recognisers.py
Safe
6.68 kB
Ensured the text ocr outputs have no line breaks at end. Multi-line custom text searches now possible. Files for review sent from redact button. Fixed image redaction (not review yet). Can get user pool details from headers. Gradio update.
2 months ago
presidio_analyzer_custom.py
Safe
4.94 kB
Added support for AWS Comprehend for PII identification. OCR and detection results now written to main output
5 months ago
redaction_review.py
17.5 kB
Added tab to be able to compare pages across multiple documents and redact duplicates
2 months ago