Spaces: seanpedrickcase/document_redaction
main / document_redaction (3.8 MB, 3 contributors, history: 265 commits)
Latest commit: Sean Pedrick-Case, "Merge pull request #66 from seanpedrick-case/dev" (fc6a4bb, unverified), about 11 hours ago
| Name | Scan | Size | Last commit message | Updated |
| --- | --- | --- | --- | --- |
| .github | - | - | Updated CDK code for custom KMS keys, new VPCs. Minor package updates. | 3 months ago |
| cdk | - | - | Updated documentation. Fix on ocr_output upload before pdf. Duplicate page fix | about 1 month ago |
| example_data | - | - | Added example data files. Greatly revised CLI redaction for redaction, deduplication, and AWS Textract batch calls. Various minor fixes and package updates. | about 12 hours ago |
| src | - | - | Updated documentation. Fix on ocr_output upload before pdf. Duplicate page fix | about 1 month ago |
| tools | - | - | Added form, table, and layout extraction options to AWS Textract calls. Added options to config to bound document length, maximum table rows, etc. | about 11 hours ago |
| .dockerignore | Safe | 315 Bytes | Added example data files. Greatly revised CLI redaction for redaction, deduplication, and AWS Textract batch calls. Various minor fixes and package updates. | about 12 hours ago |
| .gitattributes | Safe | 296 Bytes | Added example data files. Greatly revised CLI redaction for redaction, deduplication, and AWS Textract batch calls. Various minor fixes and package updates. | about 12 hours ago |
| .gitignore | Safe | 345 Bytes | Added example data files. Greatly revised CLI redaction for redaction, deduplication, and AWS Textract batch calls. Various minor fixes and package updates. | about 12 hours ago |
| DocRedactApp.spec | Safe | 2 kB | Updated version numbers, gradio package version. | 4 months ago |
| Dockerfile | Safe | 4.72 kB | Fix to tabular redaction, added tabular deduplication. Updated cli call capability for both | 5 days ago |
| README.md | Safe | 69.6 kB | Fix to tabular redaction, added tabular deduplication. Updated cli call capability for both | 5 days ago |
| _quarto.yml | Safe | 662 Bytes | Added source files for quarto documentation website | 3 months ago |
| app.py | - | 167 kB | Added form, table, and layout extraction options to AWS Textract calls. Added options to config to bound document length, maximum table rows, etc. | about 11 hours ago |
| cli_redact.py | - | 54.2 kB | Added form, table, and layout extraction options to AWS Textract calls. Added options to config to bound document length, maximum table rows, etc. | about 11 hours ago |
| entrypoint.sh | Safe | 235 Bytes | Updated Dockerfile and entrypoint file to hopefully deal correctly with APP_MODE environment variable | 10 months ago |
| example_config.env | Safe | 927 Bytes | Updated documentation. Fix on ocr_output upload before pdf. Duplicate page fix | about 1 month ago |
| how_to_create_exe_dist.txt | Safe | 3.22 kB | Updated version numbers, minor text revision | 5 months ago |
| index.qmd | Safe | 1.89 kB | Corrected some multiple xlsx/docx file redaction issues. package updates. | about 1 month ago |
| lambda_entrypoint.py | - | 10 kB | Added form, table, and layout extraction options to AWS Textract calls. Added options to config to bound document length, maximum table rows, etc. | about 11 hours ago |
| load_dynamo_logs.py | Safe | 3.21 kB | Added support for other languages. Improved DynamoDB download | about 1 month ago |
| load_s3_logs.py | Safe | 2.87 kB | Updated logging format for timestamps to be compatible with AWS. Added load_dynamo_logs.py example file. | 5 months ago |
| pyproject.toml | Safe | 1.75 kB | Added example data files. Greatly revised CLI redaction for redaction, deduplication, and AWS Textract batch calls. Various minor fixes and package updates. | about 12 hours ago |
| requirements.txt | Safe | 1.27 kB | Added example data files. Greatly revised CLI redaction for redaction, deduplication, and AWS Textract batch calls. Various minor fixes and package updates. | about 12 hours ago |