Spaces: seanpedrickcase/topic_modelling
topic_modelling/funcs at 89c4d20
3 contributors
History: 38 commits
Latest commit: seanpedrickcase | Improved initial clean options. Now has option to return embeddings only. | 89c4d20 | 7 months ago
File | Scan | Size | Last commit message | Updated
__init__.py | Safe | 0 Bytes | first commit | over 1 year ago
anonymiser.py | Safe | 10.6 kB | App now retains original index following cleaning to allow for referring back to original data | 8 months ago
auth.py | Safe | 1.88 kB | Only aggregate topics not 'other', allowed for minimum sentence length, default max_topics now will auto aggregate topics. Added Cognito Auth functionality (boto3 with AWS). | 10 months ago
bertopic_vis_documents.py | Safe | 47.6 kB | Can split passages into sentences. Improved embedding, LLM representation models, improved zero shot capabilities | 11 months ago
clean_funcs.py | Safe | 6.54 kB | Improved initial clean options. Now has option to return embeddings only. | 7 months ago
embeddings.py | Safe | 3.37 kB | App now retains original index following cleaning to allow for referring back to original data | 8 months ago
helper_functions.py | Safe | 18.3 kB | App now retains original index following cleaning to allow for referring back to original data | 8 months ago
presidio_analyzer_custom.py | Safe | 4.18 kB | Added clean data options, improved re-representation options and visualisation. General format changes | over 1 year ago
prompts.py | Safe | 6.24 kB | Updated packages. Improve hierarchy vis. Better models - mixedbread and phi3. Now option to split texts into sentences before modelling. | 12 months ago
representation_model.py | Safe | 7.83 kB | Removed some requirements from Dockerfile for AWS deployment to reduce container size | 10 months ago
topic_core_funcs.py | Safe | 38.9 kB | Improved initial clean options. Now has option to return embeddings only. | 7 months ago