Commit History

Merge pull request #37 from seanpedrick-case/dev
95ca426
unverified

Sean Pedrick-Case committed on

Added possibility of changing model and entity types in config file
bce761b

seanpedrickcase committed on

Merge pull request #36 from seanpedrick-case/remove_from_single_row_select
9d7cf92
unverified

Sean Pedrick-Case committed on

Added capability to redact all redactions with the same text based on the selected row. Rearranged buttons on review page a little. Improved page navigation efficiency.
c4e3724

seanpedrickcase committed on

Merge pull request #34 from seanpedrick-case/dev
b7d2635
unverified

Sean Pedrick-Case committed on

Merge pull request #33 from seanpedrick-case/xfdf_redaction_text
4011f21
unverified

Sean Pedrick-Case committed on

Now xfdf Adobe exports can export redacted text that is searchable in Acrobat
a91f87b

seanpedrickcase committed on

Merge pull request #29 from seanpedrick-case/dev
e06b754
unverified

Sean Pedrick-Case committed on

Updated version numbers, gradio package version.
20b655f

seanpedrickcase committed on

Expanded checks for out of range page cropboxes
5fcccbe

seanpedrickcase committed on

Merge pull request #27 from seanpedrick-case/dev
176c4ad
unverified

Sean Pedrick-Case committed on

Removed reference to chatbot in readme for now until it is improved
e27231e

seanpedrickcase committed on

Merge pull request #25 from seanpedrick-case/dev
c2a4864
unverified

Sean Pedrick-Case committed on

Updated gradio version. Minor changes to redactor function sequence. Minor formatting and wording changes.
5a21738

seanpedrickcase committed on

Added config options for compressing output PDFs, for choosing whether redacted output PDFs are returned at all, and for changing how long previous Textract jobs are shown
3bbf593

seanpedrickcase committed on

Merge pull request #23 from seanpedrick-case/dev
03d0cfd
unverified

Sean Pedrick-Case committed on

More checks on ocr outputs in redaction functions
97097ff

seanpedrickcase committed on

Merge pull request #21 from seanpedrick-case/dev
23f892d
unverified

Sean Pedrick-Case committed on

Corrected a couple of bugs. Now Textract whole document API call outputs will also load the input PDF into the app
10f46e9

seanpedrickcase committed on

Merge pull request #20 from seanpedrick-case/dev
baabf97
unverified

Sean Pedrick-Case committed on

Updated version numbers, minor text revision
69c2af9

seanpedrickcase committed on

Updated logging format for timestamps to be compatible with AWS. Added load_dynamo_logs.py example file.
94e514b

seanpedrickcase committed on

Merge pull request #19 from seanpedrick-case/dev
643a230
unverified

Sean Pedrick-Case committed on

Merge pull request #18 from seanpedrick-case/dev
d998102
unverified

Sean Pedrick-Case committed on

Minor changes for cost codes, package updates. Added pyproject.toml file
47a3a80

seanpedrickcase committed on

Now local OCR outputs can be saved to file and reloaded to save preparation time. Bug fixing in logs and tabular data redaction. Update to documentation
f93e49c

seanpedrickcase committed on

Improved logging format a little. Now possible to save logs to DynamoDB
0042e78

seanpedrickcase committed on

Improved efficiency of review page navigation, especially for large documents. Updated user guide
93b4c8a

seanpedrickcase committed on

Merge pull request #17 from seanpedrick-case/dev
4d4ca01
unverified

Sean Pedrick-Case committed on

Merge pull request #16 from seanpedrick-case/dev
4f93b3f
unverified

Sean Pedrick-Case committed on

Now loads in 'convert_textract_outputs_to_ocr_results' button whether textract_api options are enabled or not
a56b9b0

seanpedrickcase committed on

Added button to convert Textract API outputs to ocr_output files easily. Corrected Textract job file location
46bf91e

seanpedrickcase committed on

Added compatibility with gradio_image_annotation for passing through id and text properties to annotator. Corrected csv location for Textract api calls. Other minor changes
52c1a90

seanpedrickcase committed on

Merge pull request #15 from seanpedrick-case/dev
c27db98
unverified

Sean Pedrick-Case committed on

Changed reference for gradio_image_annotation to latest .whl file to allow for correct implementation in Docker containers
d370b1c

seanpedrickcase committed on

Added example .py file for accessing and downloading logs from s3
eada14e

seanpedrickcase committed on

Minor function documentation changes. Requirements update for new Gradio and version of Gradio annotator that allows for saving preferred redaction format and to include box id
f6e6d80

seanpedrickcase committed on

Merge pull request #14 from seanpedrick-case/dev
4a5cee5
unverified

Sean Pedrick-Case committed on

Corrected RUN_AWS_FUNCTIONS environment variable reference when downloading cost codes
818efbc

seanpedrickcase committed on

Made changes to hopefully resolve issue with downloading cost centre details from S3 to container
1418017

seanpedrickcase committed on

Fixed issue where S3 cost codes are defined but the local cost code location is not
7b345c3

seanpedrickcase committed on

Fixed issue in Docker containers built locally without correct folder permissions. Improved config file. Updated Gradio version to fix issue with selecting filtered rows. Minor bug fixes.
a33b955

seanpedrickcase committed on

Modified Dockerfile for correct logging folder ownership
0b9e789

seanpedrickcase committed on

Implemented Textract document API calls and associated output tracking/download. Fixes to config and cost code implementation. General minor bug fixes.
ed5f8c7

seanpedrickcase committed on

Merge pull request #13 from seanpedrick-case/dev
5203951
unverified

Sean Pedrick-Case committed on