Spaces: seanpedrickcase/topic_modelling
Branch: main
Path: topic_modelling/funcs
3 contributors
History: 38 commits
Latest commit: 89c4d20 by seanpedrickcase, about 19 hours ago
Improved initial clean options. Now has option to return embeddings only.
File | Scan | Size | Last commit message | Updated
__init__.py | | 0 Bytes | first commit | 10 months ago
anonymiser.py | Safe | 10.6 kB | App now retains original index following cleaning to allow for referring back to original data | about 2 months ago
auth.py | Safe | 1.88 kB | Only aggregate topics not 'other', allowed for minimum sentence length, default max_topics now will auto aggregate topics. Added Cognito Auth functionality (boto3 with AWS). | 3 months ago
bertopic_vis_documents.py | Safe | 47.6 kB | Can split passages into sentences. Improved embedding, LLM representation models, improved zero shot capabilities | 5 months ago
clean_funcs.py | Safe | 6.54 kB | Improved initial clean options. Now has option to return embeddings only. | about 19 hours ago
embeddings.py | Safe | 3.37 kB | App now retains original index following cleaning to allow for referring back to original data | about 2 months ago
helper_functions.py | Safe | 18.3 kB | App now retains original index following cleaning to allow for referring back to original data | about 2 months ago
presidio_analyzer_custom.py | Safe | 4.18 kB | Added clean data options, improved re-representation options and visualisation. General format changes | 10 months ago
prompts.py | Safe | 6.24 kB | Updated packages. Improve hierarchy vis. Better models - mixedbread and phi3. Now option to split texts into sentences before modelling. | 5 months ago
representation_model.py | Safe | 7.83 kB | Removed some requirements from Dockerfile for AWS deployment to reduce container size | 3 months ago
topic_core_funcs.py | Safe | 38.9 kB | Improved initial clean options. Now has option to return embeddings only. | about 19 hours ago