Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
seanpedrickcase
/
document_redaction
like
7
Running
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
bbf844d
document_redaction
4.02 MB
3 contributors
History:
281 commits
seanpedrickcase
Added examples to tops of various tabs to demonstrate basic functions (optional). Minor changes to example csv ocr output
bbf844d
about 1 month ago
.github
Corrected a polynomial regex issue. Reformatted code.
about 1 month ago
cdk
Fixed duplicate page argument mismatch. Readded Windows tests. Added refresh token options to cdk. Package updates
about 1 month ago
example_data
Added examples to tops of various tabs to demonstrate basic functions (optional). Minor changes to example csv ocr output
about 1 month ago
src
Updated documentation. Fix on ocr_output upload before pdf. Duplicate page fix
2 months ago
test
Fixes to test suites. Minor default package changes (paddleocr not required)
about 1 month ago
tools
Added examples to tops of various tabs to demonstrate basic functions (optional). Minor changes to example csv ocr output
about 1 month ago
.dockerignore
Safe
430 Bytes
General code changes and reformatting to address code vulnerabilities highlighted by codeQL scan, and black/ruff repplied to code. Fixes/optimisation of Github Actions
about 1 month ago
.gitattributes
Safe
296 Bytes
Added example data files. Greatly revised CLI redaction for redaction, deduplication, and AWS Textract batch calls. Various minor fixes and package updates.
about 1 month ago
.gitignore
Safe
460 Bytes
General code changes and reformatting to address code vulnerabilities highlighted by codeQL scan, and black/ruff repplied to code. Fixes/optimisation of Github Actions
about 1 month ago
DocRedactApp.spec
Safe
2 kB
Updated version numbers, gradio package version.
6 months ago
Dockerfile
Safe
4.72 kB
Fix to tabular redaction, added tabular deduplication. Updated cli call capability for both
about 2 months ago
README.md
Safe
69.6 kB
Updated Windows Tesseract install location for test
about 1 month ago
_quarto.yml
Safe
662 Bytes
Added source files for quarto documentation website
5 months ago
app.py
236 kB
Added examples to tops of various tabs to demonstrate basic functions (optional). Minor changes to example csv ocr output
about 1 month ago
cli_redact.py
Safe
59.6 kB
Fixed on deprecated Github workflow functions. Applied linter and formatter to code throughout. Added tests for GUI load.
about 1 month ago
entrypoint.sh
Safe
235 Bytes
Updated Dockerfile and entrypoint file to hopefully deal correctly with APP_MODE environment variable
11 months ago
example_config.env
Safe
927 Bytes
Updated documentation. Fix on ocr_output upload before pdf. Duplicate page fix
2 months ago
how_to_create_exe_dist.txt
Safe
3.22 kB
Updated version numbers, minor text revision
6 months ago
index.qmd
Safe
1.89 kB
Corrected some multiple xlsx/docx file redaction issues. package updates.
2 months ago
lambda_entrypoint.py
Safe
10.3 kB
Fixed on deprecated Github workflow functions. Applied linter and formatter to code throughout. Added tests for GUI load.
about 1 month ago
load_dynamo_logs.py
Safe
3.07 kB
Fixed on deprecated Github workflow functions. Applied linter and formatter to code throughout. Added tests for GUI load.
about 1 month ago
load_s3_logs.py
Safe
3.01 kB
Fixed on deprecated Github workflow functions. Applied linter and formatter to code throughout. Added tests for GUI load.
about 1 month ago
pyproject.toml
2.16 kB
Updated Windows Tesseract install location for test
about 1 month ago
requirements.txt
1.43 kB
Fixed duplicate page argument mismatch. Readded Windows tests. Added refresh token options to cdk. Package updates
about 1 month ago