Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
seanpedrickcase
/
document_redaction
like
7
Running
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
826ed50
document_redaction
3.8 MB
3 contributors
History:
264 commits
seanpedrickcase
Added further file limits to deduplication and file load functions
826ed50
about 1 month ago
.github
Updated CDK code for custom KMS keys, new VPCs. Minor package updates.
4 months ago
cdk
Updated documentation. Fix on ocr_output upload before pdf. Duplicate page fix
2 months ago
example_data
Added example data files. Greatly revised CLI redaction for redaction, deduplication, and AWS Textract batch calls. Various minor fixes and package updates.
about 1 month ago
src
Updated documentation. Fix on ocr_output upload before pdf. Duplicate page fix
2 months ago
tools
Added further file limits to deduplication and file load functions
about 1 month ago
.dockerignore
Safe
315 Bytes
Added example data files. Greatly revised CLI redaction for redaction, deduplication, and AWS Textract batch calls. Various minor fixes and package updates.
about 1 month ago
.gitattributes
Safe
296 Bytes
Added example data files. Greatly revised CLI redaction for redaction, deduplication, and AWS Textract batch calls. Various minor fixes and package updates.
about 1 month ago
.gitignore
Safe
345 Bytes
Added example data files. Greatly revised CLI redaction for redaction, deduplication, and AWS Textract batch calls. Various minor fixes and package updates.
about 1 month ago
DocRedactApp.spec
Safe
2 kB
Updated version numbers, gradio package version.
6 months ago
Dockerfile
Safe
4.72 kB
Fix to tabular redaction, added tabular deduplication. Updated cli call capability for both
about 2 months ago
README.md
Safe
69.6 kB
Fix to tabular redaction, added tabular deduplication. Updated cli call capability for both
about 2 months ago
_quarto.yml
Safe
662 Bytes
Added source files for quarto documentation website
5 months ago
app.py
Safe
167 kB
Added form, table, and layout extraction options to AWS Textract calls. Added options to config to bound document length, maximum table rows, etc.
about 1 month ago
cli_redact.py
54.2 kB
Added further file limits to deduplication and file load functions
about 1 month ago
entrypoint.sh
Safe
235 Bytes
Updated Dockerfile and entrypoint file to hopefully deal correctly with APP_MODE environment variable
11 months ago
example_config.env
Safe
927 Bytes
Updated documentation. Fix on ocr_output upload before pdf. Duplicate page fix
2 months ago
how_to_create_exe_dist.txt
Safe
3.22 kB
Updated version numbers, minor text revision
6 months ago
index.qmd
Safe
1.89 kB
Corrected some multiple xlsx/docx file redaction issues. package updates.
2 months ago
lambda_entrypoint.py
Safe
10 kB
Added form, table, and layout extraction options to AWS Textract calls. Added options to config to bound document length, maximum table rows, etc.
about 1 month ago
load_dynamo_logs.py
Safe
3.21 kB
Added support for other languages. Improved DynamoDB download
2 months ago
load_s3_logs.py
Safe
2.87 kB
Updated logging format for timestamps to be compatible with AWS. Added load_dynamo_logs.py example file.
6 months ago
pyproject.toml
Safe
1.75 kB
Added example data files. Greatly revised CLI redaction for redaction, deduplication, and AWS Textract batch calls. Various minor fixes and package updates.
about 1 month ago
requirements.txt
Safe
1.27 kB
Added example data files. Greatly revised CLI redaction for redaction, deduplication, and AWS Textract batch calls. Various minor fixes and package updates.
about 1 month ago