Spaces: seanpedrickcase/document_redaction
document_redaction/tools (6f96988)
676 kB, 1 contributor
History: 153 commits
seanpedrickcase: Corrected some multiple xlsx/docx file redaction issues. package updates. (6f96988, 3 months ago)
| File | Scan | Size | Last commit message | Last updated |
| --- | --- | --- | --- | --- |
| __init__.py | Safe | 0 Bytes | Initial commit | over 1 year ago |
| auth.py | Safe | 2.46 kB | Added compatibility with gradio_image_annotation for passing through id and text properties to annotator. Corrected csv location for Textract api calls. Other minor changes | 7 months ago |
| aws_functions.py | Safe | 9.51 kB | Updated duplicate pages functionality. Improve redaction efficiency a little with concat method. Minor modification to documentation and interface | 5 months ago |
| aws_textract.py | Safe | 18.3 kB | Added support for other languages. Improved DynamoDB download | 3 months ago |
| cli_redact.py | Safe | 9.08 kB | Updated command line redaction script with more options | 3 months ago |
| config.py | - | 22.2 kB | Corrected Tesseract data location | 3 months ago |
| custom_csvlogger.py | Safe | 12.9 kB | Updated packages. Corrected CSV logger headings, can now submit custom log csv names to S3. Started work on identifying and deduplicating at the line level | 5 months ago |
| custom_image_analyser_engine.py | Safe | 68.6 kB | Corrected some multiple xlsx/docx file redaction issues. package updates. | 3 months ago |
| data_anonymise.py | - | 43.2 kB | Corrected some multiple xlsx/docx file redaction issues. package updates. | 3 months ago |
| example_cli_calls.txt | Safe | 769 Bytes | Repaired Dockerfile hopefully finally | 3 months ago |
| file_conversion.py | - | 109 kB | Updated command line redaction script with more options | 3 months ago |
| file_redaction.py | - | 139 kB | Fix for fuzzy matching | 3 months ago |
| find_duplicate_pages.py | - | 58.5 kB | Added support for other languages. Improved DynamoDB download | 3 months ago |
| helper_functions.py | Safe | 30.2 kB | Added support for other languages. Improved DynamoDB download | 3 months ago |
| load_spacy_model_custom_recognisers.py | - | 24.9 kB | Fix for fuzzy matching | 3 months ago |
| presidio_analyzer_custom.py | Safe | 4.92 kB | More config options. Fixed some bugs with removing elements from review page and Adobe export. Some UI rearrangements | 8 months ago |
| redaction_review.py | - | 94.8 kB | Corrected some multiple xlsx/docx file redaction issues. package updates. | 3 months ago |
| textract_batch_call.py | Safe | 28 kB | Expanded checks for out of range page cropboxes | 6 months ago |