Space: seanpedrickcase/document_redaction
Revision: 4c95b3c
Path: document_redaction/tools
676 kB · 3 contributors · History: 152 commits
Latest commit: seanpedrickcase, "Fix for fuzzy matching" (4c95b3c), 2 months ago
| File | Scan | Size | Last commit message | Updated |
|---|---|---|---|---|
| `__init__.py` | Safe | 0 Bytes | Initial commit | over 1 year ago |
| `auth.py` | Safe | 2.46 kB | Added compatibility with gradio_image_annotation for passing through id and text properties to annotator. Corrected csv location for Textract api calls. Other minor changes | 6 months ago |
| `aws_functions.py` | Safe | 9.51 kB | Updated duplicate pages functionality. Improve redaction efficiency a little with concat method. Minor modification to documentation and interface | 5 months ago |
| `aws_textract.py` | Safe | 18.3 kB | Added support for other languages. Improved DynamoDB download | 2 months ago |
| `cli_redact.py` | Safe | 9.08 kB | Updated command line redaction script with more options | 2 months ago |
| `config.py` | | 22.2 kB | Corrected Tesseract data location | 2 months ago |
| `custom_csvlogger.py` | Safe | 12.9 kB | Updated packages. Corrected CSV logger headings, can now submit custom log csv names to S3. Started work on identifying and deduplicating at the line level | 4 months ago |
| `custom_image_analyser_engine.py` | | 68.7 kB | Corrected an issue with finding valid language entities for AWS comprehend redaction | 2 months ago |
| `data_anonymise.py` | | 43.1 kB | Updated command line redaction script with more options | 2 months ago |
| `example_cli_calls.txt` | Safe | 769 Bytes | Repaired Dockerfile hopefully finally | 2 months ago |
| `file_conversion.py` | | 109 kB | Updated command line redaction script with more options | 2 months ago |
| `file_redaction.py` | | 139 kB | Fix for fuzzy matching | 2 months ago |
| `find_duplicate_pages.py` | | 58.5 kB | Added support for other languages. Improved DynamoDB download | 2 months ago |
| `helper_functions.py` | Safe | 30.2 kB | Added support for other languages. Improved DynamoDB download | 2 months ago |
| `load_spacy_model_custom_recognisers.py` | | 24.9 kB | Fix for fuzzy matching | 2 months ago |
| `presidio_analyzer_custom.py` | Safe | 4.92 kB | More config options. Fixed some bugs with removing elements from review page and Adobe export. Some UI rearrangements | 7 months ago |
| `redaction_review.py` | | 94.7 kB | Updated command line redaction script with more options | 2 months ago |
| `textract_batch_call.py` | Safe | 28 kB | Expanded checks for out of range page cropboxes | 5 months ago |