Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
seanpedrickcase
/
document_redaction
like
5
Running
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
f0c28d7
document_redaction
3 contributors
History:
97 commits
seanpedrickcase
Updated packages. Reinstituted multithreading with page load, now with order protected. Smaller spacy model used for speed. Textract calls should now be faster
f0c28d7
3 months ago
.github
Initial commit
11 months ago
tld
Added TLDExtract cache files so that internet connection is not required
10 months ago
tools
Updated packages. Reinstituted multithreading with page load, now with order protected. Smaller spacy model used for speed. Textract calls should now be faster
3 months ago
.dockerignore
Safe
203 Bytes
Added logging, anonymising all Excel sheets, simple redaction tags, some Dockerfile optimisation
7 months ago
.gitignore
Safe
203 Bytes
Added logging, anonymising all Excel sheets, simple redaction tags, some Dockerfile optimisation
7 months ago
DocRedactApp_0.2.spec
Safe
1.16 kB
Added possibility to do authentication with AWS Cognito on load. Other minor changes.
8 months ago
Dockerfile
Safe
2.55 kB
Can now define queue size, max file size, and server port in environment variables
4 months ago
Dockerfile_old
Safe
2.37 kB
Updated Dockerfile and requirements to include relevant Lambda packages
4 months ago
README.md
Safe
10.8 kB
Improved time taken reporting and readme
5 months ago
app.py
Safe
37.2 kB
Started adding in support for custom deny list. Fixed textract call issue. Removed multithreading for now as it mixes up pages
3 months ago
entrypoint.sh
Safe
235 Bytes
Updated Dockerfile and entrypoint file to hopefully deal correctly with APP_MODE environment variable
4 months ago
how_to_create_exe_dist.txt
Safe
2.17 kB
Added possibility to do authentication with AWS Cognito on load. Other minor changes.
8 months ago
lambda_entrypoint.py
Safe
4.59 kB
Allowed for overwriting of default output folder in choose_and_run_redactor function.
4 months ago
requirements.txt
Safe
635 Bytes
Updated packages. Reinstituted multithreading with page load, now with order protected. Smaller spacy model used for speed. Textract calls should now be faster
3 months ago