BERTopic-ALI-LARGE / README.md
Tsunnami's picture
Add BERTopic model
dcabcf9 verified
|
raw
history blame
No virus
38.5 kB
---
tags:
- bertopic
library_name: bertopic
pipeline_tag: text-classification
---
# BERTopic-ALI-LARGE
This is a [BERTopic](https://github.com/MaartenGr/BERTopic) model.
BERTopic is a flexible and modular topic modeling framework that allows for the generation of easily interpretable topics from large datasets.
## Usage
To use this model, please install BERTopic:
```
pip install -U bertopic
```
You can use the model as follows:
```python
from bertopic import BERTopic
topic_model = BERTopic.load("Tsunnami/BERTopic-ALI-LARGE")
topic_model.get_topic_info()
```
## Topic overview
* Number of topics: 372
* Number of training documents: 19550
<details>
<summary>Click here for an overview of all topics.</summary>
| Topic ID | Topic Keywords | Topic Frequency | Label |
|----------|----------------|-----------------|-------|
| -1 | and - the - of - was - to | 10 | -1_and_the_of_was |
| 0 | proton - collisions - tev - boson - cms | 5490 | 0_proton_collisions_tev_boson |
| 1 | af - cardiac - heart - ventricular - patients | 528 | 1_af_cardiac_heart_ventricular |
| 2 | supply - innovation - chain - smes - business | 260 | 2_supply_innovation_chain_smes |
| 3 | firms - board - corporate - market - takeover | 194 | 3_firms_board_corporate_market |
| 4 | glaucoma - eyes - corneal - iop - eye | 152 | 4_glaucoma_eyes_corneal_iop |
| 5 | hiv - art - cd4 - antiretroviral - prep | 143 | 5_hiv_art_cd4_antiretroviral |
| 6 | cancer - lung - akt - cells - csc | 135 | 6_cancer_lung_akt_cells |
| 7 | photocatalytic - tio2 - visible - light - photocatalysts | 128 | 7_photocatalytic_tio2_visible_light |
| 8 | strain - streptomyces - mk - genus - micromonospora | 125 | 8_strain_streptomyces_mk_genus |
| 9 | mdd - depression - disorders - disorder - immune | 124 | 9_mdd_depression_disorders_disorder |
| 10 | synechocystis - cyanobacteria - phb - pcc - 6803 | 121 | 10_synechocystis_cyanobacteria_phb_pcc |
| 11 | lymphoma - survival - patients - pfs - os | 117 | 11_lymphoma_survival_patients_pfs |
| 12 | image - images - dataset - classification - convolutional | 113 | 12_image_images_dataset_classification |
| 13 | pressure - gpa - structure - electronic - phonon | 111 | 13_pressure_gpa_structure_electronic |
| 14 | power - pv - electricity - energy - generation | 106 | 14_power_pv_electricity_energy |
| 15 | ti - alloys - zr - alloy - microstructure | 106 | 15_ti_alloys_zr_alloy |
| 16 | osteogenic - differentiation - expression - bone - cells | 106 | 16_osteogenic_differentiation_expression_bone |
| 17 | groundwater - pb - waste - metals - cd | 105 | 17_groundwater_pb_waste_metals |
| 18 | undrained - soil - stability - anisotropic - clays | 103 | 18_undrained_soil_stability_anisotropic |
| 19 | text - word - words - classification - language | 98 | 19_text_word_words_classification |
| 20 | skin - psoriasis - acne - laser - sebum | 97 | 20_skin_psoriasis_acne_laser |
| 21 | electrochemical - sensor - detection - electrode - printed | 96 | 21_electrochemical_sensor_detection_electrode |
| 22 | brand - customer - intention - trust - purchase | 92 | 22_brand_customer_intention_trust |
| 23 | vaccine - vaccination - booster - bnt162b2 - coronavac | 90 | 23_vaccine_vaccination_booster_bnt162b2 |
| 24 | oer - electrocatalysts - orr - electrochemical - reduction | 87 | 24_oer_electrocatalysts_orr_electrochemical |
| 25 | shrimp - wssv - vpahpnd - monodon - penaeus | 86 | 25_shrimp_wssv_vpahpnd_monodon |
| 26 | learning - students - teachers - online - skills | 86 | 26_learning_students_teachers_online |
| 27 | parkinson - pd - tremor - movement - motor | 84 | 27_parkinson_pd_tremor_movement |
| 28 | biodiesel - catalyst - transesterification - reaction - methanol | 84 | 28_biodiesel_catalyst_transesterification_reaction |
| 29 | fault - rocks - triassic - basin - permian | 84 | 29_fault_rocks_triassic_basin |
| 30 | fish - reproductive - digestive - histological - pranburi | 77 | 30_fish_reproductive_digestive_histological |
| 31 | zinc - zn - batteries - zibs - electrolyte | 77 | 31_zinc_zn_batteries_zibs |
| 32 | elastic - crack - boundary - element - beam | 76 | 32_elastic_crack_boundary_element |
| 33 | antenna - db - photonic - mimo - soa | 76 | 33_antenna_db_photonic_mimo |
| 34 | music - cultural - thai - musical - arts | 74 | 34_music_cultural_thai_musical |
| 35 | silk - sf - scaffolds - fibroin - hydrogels | 73 | 35_silk_sf_scaffolds_fibroin |
| 36 | nov - species - sp - genus - snail | 73 | 36_nov_species_sp_genus |
| 37 | pla - pbs - pbat - poly - blend | 73 | 37_pla_pbs_pbat_poly |
| 38 | elegans - glutamate - neuroprotective - extracts - extract | 72 | 38_elegans_glutamate_neuroprotective_extracts |
| 39 | diabetes - sleep - osa - glucose - hba1c | 71 | 39_diabetes_sleep_osa_glucose |
| 40 | balance - fall - falls - training - sts | 71 | 40_balance_fall_falls_training |
| 41 | ceo2 - catalysts - catalyst - ni - co2 | 69 | 41_ceo2_catalysts_catalyst_ni |
| 42 | nasal - rhinosinusitis - rhinitis - saline - incs | 69 | 42_nasal_rhinosinusitis_rhinitis_saline |
| 43 | binding - cov - sars - protease - molecular | 68 | 43_binding_cov_sars_protease |
| 44 | dialysis - pd - peritoneal - peritonitis - kidney | 67 | 44_dialysis_pd_peritoneal_peritonitis |
| 45 | ash - cement - fly - geopolymer - concrete | 67 | 45_ash_cement_fly_geopolymer |
| 46 | antioxidant - rice - phenolic - flour - riceberry | 67 | 46_antioxidant_rice_phenolic_flour |
| 47 | coral - corals - reefs - gulf - species | 66 | 47_coral_corals_reefs_gulf |
| 48 | english - learners - l1 - corpus - lexical | 65 | 48_english_learners_l1_corpus |
| 49 | thalassemia - g6pd - transfusion - tdt - iron | 64 | 49_thalassemia_g6pd_transfusion_tdt |
| 50 | lumbar - fusion - spine - interbody - endoscopic | 64 | 50_lumbar_fusion_spine_interbody |
| 51 | schizophrenia - deficit - igm - iga - symptoms | 64 | 51_schizophrenia_deficit_igm_iga |
| 52 | tilapia - fish - tilv - oreochromis - nile | 64 | 52_tilapia_fish_tilv_oreochromis |
| 53 | fermented - lactic - bacillus - lab - strains | 63 | 53_fermented_lactic_bacillus_lab |
| 54 | malaria - health - adolescent - pakistan - breastfeeding | 63 | 54_malaria_health_adolescent_pakistan |
| 55 | political - asean - china - military - party | 62 | 55_political_asean_china_military |
| 56 | caries - dental - oral - health - children | 62 | 56_caries_dental_oral_health |
| 57 | cognitive - moca - hearing - dementia - memory | 62 | 57_cognitive_moca_hearing_dementia |
| 58 | alcohol - smoking - drinking - cannabis - tobacco | 61 | 58_alcohol_smoking_drinking_cannabis |
| 59 | land - rainfall - climate - basin - flood | 61 | 59_land_rainfall_climate_basin |
| 60 | mcr - coli - resistance - isolates - colistin | 61 | 60_mcr_coli_resistance_isolates |
| 61 | chitosan - nanoparticles - nps - alg - delivery | 60 | 61_chitosan_nanoparticles_nps_alg |
| 62 | compounds - isolated - cytotoxicity - ic50 - kb | 60 | 62_compounds_isolated_cytotoxicity_ic50 |
| 63 | english - students - language - reading - writing | 60 | 63_english_students_language_reading |
| 64 | nursing - nurses - nurse - competence - managerial | 59 | 64_nursing_nurses_nurse_competence |
| 65 | pyrolysis - catalyst - catalytic - ni - oil | 57 | 65_pyrolysis_catalyst_catalytic_ni |
| 66 | disaster - flood - tsunami - damage - bcm | 57 | 66_disaster_flood_tsunami_damage |
| 67 | fluorescence - fluorescent - ions - cu2 - sensor | 56 | 67_fluorescence_fluorescent_ions_cu2 |
| 68 | cellulose - cmc - hydrogels - nanocellulose - fibers | 54 | 68_cellulose_cmc_hydrogels_nanocellulose |
| 69 | salt - rice - stress - tolerance - kdml105 | 53 | 69_salt_rice_stress_tolerance |
| 70 | reactor - fluidized - sorbent - bed - solid | 53 | 70_reactor_fluidized_sorbent_bed |
| 71 | hawc - gamma - ray - observatory - tev | 52 | 71_hawc_gamma_ray_observatory |
| 72 | warehouse - cost - inventory - scheduling - problem | 52 | 72_warehouse_cost_inventory_scheduling |
| 73 | hpv - cervical - papillomavirus - women - cancer | 52 | 73_hpv_cervical_papillomavirus_women |
| 74 | hypertension - pressure - blood - ht - adherence | 51 | 74_hypertension_pressure_blood_ht |
| 75 | care - health - caregivers - older - nhi | 51 | 75_care_health_caregivers_older |
| 76 | blockchain - iot - bct - privacy - trust | 51 | 76_blockchain_iot_bct_privacy |
| 77 | polynomials - infinite - ideals - definable - if | 50 | 77_polynomials_infinite_ideals_definable |
| 78 | robot - grasping - stiffness - rehabilitation - actuator | 50 | 78_robot_grasping_stiffness_rehabilitation |
| 79 | quicke - braconidae - species - hymenoptera - nov | 50 | 79_quicke_braconidae_species_hymenoptera |
| 80 | electric - field - particle - cell - trapping | 50 | 80_electric_field_particle_cell |
| 81 | design - optimization - robust - nonlinear - feedback | 50 | 81_design_optimization_robust_nonlinear |
| 82 | pain - tka - knee - morphine - postoperative | 50 | 82_pain_tka_knee_morphine |
| 83 | furfural - hmf - catalysts - catalyst - hydroxymethylfurfural | 50 | 83_furfural_hmf_catalysts_catalyst |
| 84 | covid - 19 - pandemic - health - countries | 49 | 84_covid_19_pandemic_health |
| 85 | flow - drift - heat - wake - velocity | 49 | 85_flow_drift_heat_wake |
| 86 | retirement - older - health - life - happiness | 49 | 86_retirement_older_health_life |
| 87 | peasant - farmers - agroecology - farming - resilience | 49 | 87_peasant_farmers_agroecology_farming |
| 88 | bond - resin - zirconia - adhesive - ceramic | 49 | 88_bond_resin_zirconia_adhesive |
| 89 | job - employee - employees - turnover - organizational | 48 | 89_job_employee_employees_turnover |
| 90 | forest - mangrove - trees - tree - forests | 48 | 90_forest_mangrove_trees_tree |
| 91 | consortium - degradation - biodegradation - profenofos - degrading | 48 | 91_consortium_degradation_biodegradation_profenofos |
| 92 | extraction - gold - liquid - hg - mercury | 48 | 92_extraction_gold_liquid_hg |
| 93 | variants - exome - sequencing - genetic - variant | 48 | 93_variants_exome_sequencing_genetic |
| 94 | levan - levansucrase - inulosucrase - residues - maltose | 47 | 94_levan_levansucrase_inulosucrase_residues |
| 95 | cpv - dogs - canine - pdcov - virus | 46 | 95_cpv_dogs_canine_pdcov |
| 96 | spp - bartonella - parasites - immitis - cats | 46 | 96_spp_bartonella_parasites_immitis |
| 97 | sjs - hla - ten - reactions - scars | 45 | 97_sjs_hla_ten_reactions |
| 98 | glucosidase - inhibitory - inhibition - ic50 - compounds | 44 | 98_glucosidase_inhibitory_inhibition_ic50 |
| 99 | bim - construction - building - project - lca | 44 | 99_bim_construction_building_project |
| 100 | network - cloud - iot - wireless - traffic | 44 | 100_network_cloud_iot_wireless |
| 101 | macaques - macaque - tailed - macaca - fascicularis | 44 | 101_macaques_macaque_tailed_macaca |
| 102 | rollover - traffic - prediction - machine - tripped | 44 | 102_rollover_traffic_prediction_machine |
| 103 | gravity - stars - black - massive - wormhole | 43 | 103_gravity_stars_black_massive |
| 104 | rubber - nr - composites - vulcanization - cure | 43 | 104_rubber_nr_composites_vulcanization |
| 105 | co2 - mea - amine - amp - absorption | 42 | 105_co2_mea_amine_amp |
| 106 | pain - neck - office - back - workers | 42 | 106_pain_neck_office_back |
| 107 | anaerobic - wastewater - mbr - cod - bioreactor | 42 | 107_anaerobic_wastewater_mbr_cod |
| 108 | mutations - imperfecta - mutation - variants - heterozygous | 42 | 108_mutations_imperfecta_mutation_variants |
| 109 | inflation - supergravity - scalar - inflaton - cosmological | 41 | 109_inflation_supergravity_scalar_inflaton |
| 110 | pythiosis - insidiosum - pythium - fungal - keratitis | 41 | 110_pythiosis_insidiosum_pythium_fungal |
| 111 | galaxies - star - alma - stellar - agn | 41 | 111_galaxies_star_alma_stellar |
| 112 | energy - δln - emissions - sector - policy | 41 | 112_energy_δln_emissions_sector |
| 113 | falciparum - plasmodium - malaria - vivax - parasite | 41 | 113_falciparum_plasmodium_malaria_vivax |
| 114 | seizure - epilepsy - seizures - eeg - neurological | 41 | 114_seizure_epilepsy_seizures_eeg |
| 115 | stone - aldosterone - urinary - oxalate - urolithiasis | 40 | 115_stone_aldosterone_urinary_oxalate |
| 116 | petri - timed - nets - cpn - verification | 40 | 116_petri_timed_nets_cpn |
| 117 | oa - knee - synovial - rtl - osteoarthritis | 40 | 117_oa_knee_synovial_rtl |
| 118 | beams - concrete - shear - strengthened - ets | 40 | 118_beams_concrete_shear_strengthened |
| 119 | species - corolla - kidyoo - apocynaceae - ceropegia | 40 | 119_species_corolla_kidyoo_apocynaceae |
| 120 | tb - tuberculosis - mtb - mycobacterium - ltbi | 39 | 120_tb_tuberculosis_mtb_mycobacterium |
| 121 | pna - nucleic - dna - acpcpna - pyrrolidinyl | 39 | 121_pna_nucleic_dna_acpcpna |
| 122 | exercise - hiit - training - chest - plb | 39 | 122_exercise_hiit_training_chest |
| 123 | pm2 - pm10 - air - pollution - particulate | 38 | 123_pm2_pm10_air_pollution |
| 124 | gaas - insb - epitaxy - nanowires - quantum | 38 | 124_gaas_insb_epitaxy_nanowires |
| 125 | cmv - covid - 19 - recipients - transplant | 38 | 125_cmv_covid_19_recipients |
| 126 | microplastics - mps - microplastic - plastic - pollution | 38 | 126_microplastics_mps_microplastic_plastic |
| 127 | qol - version - validity - reliability - thai | 38 | 127_qol_version_validity_reliability |
| 128 | hbv - hepatitis - hbsag - chb - hbeag | 37 | 128_hbv_hepatitis_hbsag_chb |
| 129 | γcd - solubility - eye - cyclodextrin - aqueous | 37 | 129_γcd_solubility_eye_cyclodextrin |
| 130 | adsorption - adsorbent - removal - dye - zeolite | 37 | 130_adsorption_adsorbent_removal_dye |
| 131 | lupus - fcgriib - sle - mice - fcγriib | 37 | 131_lupus_fcgriib_sle_mice |
| 132 | transit - motorcycle - bangkok - bus - rha | 37 | 132_transit_motorcycle_bangkok_bus |
| 133 | anesthesia - incidents - anesthetic - perioperative - paad | 37 | 133_anesthesia_incidents_anesthetic_perioperative |
| 134 | pertussis - vaccine - vaccination - measles - tetanus | 37 | 134_pertussis_vaccine_vaccination_measles |
| 135 | microalgae - biomass - wastewater - algal - sp | 37 | 135_microalgae_biomass_wastewater_algal |
| 136 | aki - rrt - kidney - injury - acute | 36 | 136_aki_rrt_kidney_injury |
| 137 | forecasting - forecast - weather - arima - model | 35 | 137_forecasting_forecast_weather_arima |
| 138 | repair - tendon - arthroscopic - suture - pain | 35 | 138_repair_tendon_arthroscopic_suture |
| 139 | curcumin - cur - cisplatin - prodrug - curdg | 35 | 139_curcumin_cur_cisplatin_prodrug |
| 140 | dose - ct - radiation - kv - cbct | 35 | 140_dose_ct_radiation_kv |
| 141 | hcv - hepatitis - hbv - infection - anti | 35 | 141_hcv_hepatitis_hbv_infection |
| 142 | preeclampsia - placental - uterine - trimester - pregnant | 35 | 142_preeclampsia_placental_uterine_trimester |
| 143 | steam - reforming - h2 - cao - gasification | 34 | 143_steam_reforming_h2_cao |
| 144 | leishmania - martiniquensis - leishmaniasis - mundinia - sand | 34 | 144_leishmania_martiniquensis_leishmaniasis_mundinia |
| 145 | alu - dna - methylation - hypomethylation - epigenetic | 34 | 145_alu_dna_methylation_hypomethylation |
| 146 | oil - surfactant - wax - oilfield - recovery | 34 | 146_oil_surfactant_wax_oilfield |
| 147 | tsunami - sea - deposits - beach - sedimentary | 34 | 147_tsunami_sea_deposits_beach |
| 148 | eeg - emotion - bci - granger - emotions | 34 | 148_eeg_emotion_bci_granger |
| 149 | cd - inclusion - cyclodextrin - βcd - complexes | 34 | 149_cd_inclusion_cyclodextrin_βcd |
| 150 | artery - facial - mm - arteries - nerve | 34 | 150_artery_facial_mm_arteries |
| 151 | tac - tacrolimus - transplantation - kidney - rejection | 34 | 151_tac_tacrolimus_transplantation_kidney |
| 152 | ethanol - catalysts - dehydrogenation - catalyst - acetaldehyde | 34 | 152_ethanol_catalysts_dehydrogenation_catalyst |
| 153 | waste - management - msw - solid - municipal | 33 | 153_waste_management_msw_solid |
| 154 | curcuma - herbal - barcoding - species - speciosa | 33 | 154_curcuma_herbal_barcoding_species |
| 155 | species - tylototriton - verrucosus - snake - feihyla | 33 | 155_species_tylototriton_verrucosus_snake |
| 156 | mirifica - candollei - pueraria - puerarin - isoflavonoids | 32 | 156_mirifica_candollei_pueraria_puerarin |
| 157 | leptospira - leptospirosis - interrogans - lipl32 - serovar | 32 | 157_leptospira_leptospirosis_interrogans_lipl32 |
| 158 | gerd - esophageal - reflux - egj - achalasia | 32 | 158_gerd_esophageal_reflux_egj |
| 159 | nutrition - sepsis - septic - hpn - parenteral | 32 | 159_nutrition_sepsis_septic_hpn |
| 160 | biliary - drainage - ercp - eus - stent | 32 | 160_biliary_drainage_ercp_eus |
| 161 | pesticide - farmers - exposure - pesticides - op | 32 | 161_pesticide_farmers_exposure_pesticides |
| 162 | asd - autism - bpa - bdnf - genes | 31 | 162_asd_autism_bpa_bdnf |
| 163 | reaction - yields - synthesis - cyclization - thioglycosides | 31 | 163_reaction_yields_synthesis_cyclization |
| 164 | goats - dcad - milk - colostrum - crossbred | 31 | 164_goats_dcad_milk_colostrum |
| 165 | gut - candida - mice - sepsis - bg | 31 | 165_gut_candida_mice_sepsis |
| 166 | pda - polydiacetylene - reversible - nanocomposites - assemblies | 31 | 166_pda_polydiacetylene_reversible_nanocomposites |
| 167 | lignin - pyrolysis - htl - bio - biomass | 31 | 167_lignin_pyrolysis_htl_bio |
| 168 | ammonia - nitrification - nitrogen - nitrifying - biofilter | 30 | 168_ammonia_nitrification_nitrogen_nitrifying |
| 169 | surfactant - surfactants - detergency - washing - oil | 30 | 169_surfactant_surfactants_detergency_washing |
| 170 | fibrosis - nafld - liver - steatosis - fib | 30 | 170_fibrosis_nafld_liver_steatosis |
| 171 | gnss - positioning - pwv - cors - navigation | 30 | 171_gnss_positioning_pwv_cors |
| 172 | olp - oral - pain - oidp - ohrqol | 30 | 172_olp_oral_pain_oidp |
| 173 | dose - vmat - field - plans - beam | 30 | 173_dose_vmat_field_plans |
| 174 | breast - lesions - mri - mammography - pet | 30 | 174_breast_lesions_mri_mammography |
| 175 | acemannan - bone - prf - periodontal - bubaline | 29 | 175_acemannan_bone_prf_periodontal |
| 176 | ckd - cinacalcet - serum - bone - bko | 29 | 176_ckd_cinacalcet_serum_bone |
| 177 | pani - conductivity - electrical - pss - polyaniline | 29 | 177_pani_conductivity_electrical_pss |
| 178 | hiv - plhiv - plwh - nutritional - antiretroviral | 29 | 178_hiv_plhiv_plwh_nutritional |
| 179 | nov - sp - species - enghoff - genus | 29 | 179_nov_sp_species_enghoff |
| 180 | ibd - uc - colitis - bowel - crohn | 29 | 180_ibd_uc_colitis_bowel |
| 181 | sows - piglets - piglet - farrowing - colostrum | 29 | 181_sows_piglets_piglet_farrowing |
| 182 | dom - adsorption - fenton - dissolved - decolorization | 29 | 182_dom_adsorption_fenton_dissolved |
| 183 | ba - hcc - liver - mir - atresia | 29 | 183_ba_hcc_liver_mir |
| 184 | cov - sars - rbd - fc - vaccine | 29 | 184_cov_sars_rbd_fc |
| 185 | thyroid - ptc - braf - mutation - niftp | 29 | 185_thyroid_ptc_braf_mutation |
| 186 | web - software - code - test - bpmn | 28 | 186_web_software_code_test |
| 187 | grass - pretreatment - napier - fermentation - ethanol | 28 | 187_grass_pretreatment_napier_fermentation |
| 188 | mangostin - mangosteen - zebrafish - acanthamoeba - αm | 28 | 188_mangostin_mangosteen_zebrafish_acanthamoeba |
| 189 | implant - cais - implants - static - freehand | 28 | 189_implant_cais_implants_static |
| 190 | moral - crying - situational - gift - happiness | 27 | 190_moral_crying_situational_gift |
| 191 | adsorption - co2 - zif - carbon - mil | 27 | 191_adsorption_co2_zif_carbon |
| 192 | claudin - odontogenic - ameloblastoma - lesions - pten | 27 | 192_claudin_odontogenic_ameloblastoma_lesions |
| 193 | plantar - foot - fasciitis - ankle - stretching | 27 | 193_plantar_foot_fasciitis_ankle |
| 194 | perovskite - perovskites - solar - halide - pscs | 26 | 194_perovskite_perovskites_solar_halide |
| 195 | ysz - anode - scgz - electrolysis - electrolyte | 26 | 195_ysz_anode_scgz_electrolysis |
| 196 | allergens - allergen - der - ait - hdm | 26 | 196_allergens_allergen_der_ait |
| 197 | aclf - liver - cirrhosis - aarc - failure | 26 | 197_aclf_liver_cirrhosis_aarc |
| 198 | tourism - tourists - destination - food - visitors | 26 | 198_tourism_tourists_destination_food |
| 199 | sperm - semen - motility - spermatozoa - cryopreservation | 26 | 199_sperm_semen_motility_spermatozoa |
| 200 | dosing - mic - pharmacokinetic - meropenem - vancomycin | 26 | 200_dosing_mic_pharmacokinetic_meropenem |
| 201 | indoor - air - respiratory - pollution - exposure | 26 | 201_indoor_air_respiratory_pollution |
| 202 | cca - cholangiocarcinoma - lin28b - fluke - necroptosis | 25 | 202_cca_cholangiocarcinoma_lin28b_fluke |
| 203 | tdf - hiv - taf - switching - rpv | 25 | 203_tdf_hiv_taf_switching |
| 204 | battery - vanadium - vrfb - zinc - flow | 25 | 204_battery_vanadium_vrfb_zinc |
| 205 | user - recommendation - rating - recommender - filtering | 25 | 205_user_recommendation_rating_recommender |
| 206 | npc - ebv - nasopharyngeal - adc - imrt | 25 | 206_npc_ebv_nasopharyngeal_adc |
| 207 | earthquake - seismic - ground - liquefaction - northern | 24 | 207_earthquake_seismic_ground_liquefaction |
| 208 | images - segmentation - cnn - ai - eus | 24 | 208_images_segmentation_cnn_ai |
| 209 | gc - column - chromatography - 2d - comprehensive | 24 | 209_gc_column_chromatography_2d |
| 210 | amh - kisspeptin - testicular - gilts - pigs | 23 | 210_amh_kisspeptin_testicular_gilts |
| 211 | egfr - jak2 - tk - kinase - inhibitors | 23 | 211_egfr_jak2_tk_kinase |
| 212 | sars - cov - cats - wastewater - coronavirus | 23 | 212_sars_cov_cats_wastewater |
| 213 | middle - east - subfamily - species - iran | 23 | 213_middle_east_subfamily_species |
| 214 | cos - broilers - diet - supplementation - yolk | 22 | 214_cos_broilers_diet_supplementation |
| 215 | ckd - kidney - egfr - prevalence - glomerular | 22 | 215_ckd_kidney_egfr_prevalence |
| 216 | mxene - ldh - lithium - ti3c2 - mah | 22 | 216_mxene_ldh_lithium_ti3c2 |
| 217 | venom - snakebite - antivenom - antivenoms - snake | 22 | 217_venom_snakebite_antivenom_antivenoms |
| 218 | radon - thoron - bq - msv - dose | 22 | 218_radon_thoron_bq_msv |
| 219 | addiction - media - children - screen - smartphone | 22 | 219_addiction_media_children_screen |
| 220 | chikv - zikv - mosquitoes - denv - chikungunya | 22 | 220_chikv_zikv_mosquitoes_denv |
| 221 | methanol - dme - co2 - synthesis - process | 21 | 221_methanol_dme_co2_synthesis |
| 222 | tax - investment - export - firms - subsidies | 21 | 222_tax_investment_export_firms |
| 223 | theories - gauge - quiver - duality - 2n | 21 | 223_theories_gauge_quiver_duality |
| 224 | drug - network - associations - heterogeneous - similarity | 21 | 224_drug_network_associations_heterogeneous |
| 225 | lewis - metathesis - catalysts - sio2 - wo3 | 21 | 225_lewis_metathesis_catalysts_sio2 |
| 226 | plasma - hydrogenation - fame - discharge - margarine | 21 | 226_plasma_hydrogenation_fame_discharge |
| 227 | plant - antibody - produced - benthamiana - mab | 21 | 227_plant_antibody_produced_benthamiana |
| 228 | asfv - swine - csfv - fever - dt | 21 | 228_asfv_swine_csfv_fever |
| 229 | thermal - heat - cooling - gshp - pumps | 21 | 229_thermal_heat_cooling_gshp |
| 230 | complexes - coordination - ii - ligand - cu | 21 | 230_complexes_coordination_ii_ligand |
| 231 | probiotics - synbiotic - gut - obesity - microbiota | 21 | 231_probiotics_synbiotic_gut_obesity |
| 232 | sigma - process - defect - dmaic - defective | 20 | 232_sigma_process_defect_dmaic |
| 233 | tyrosinase - melanin - antityrosinase - extract - activity | 20 | 233_tyrosinase_melanin_antityrosinase_extract |
| 234 | residue - protein - hv1 - gyration - grained | 20 | 234_residue_protein_hv1_gyration |
| 235 | shape - memory - copolymers - benzoxazine - epoxy | 20 | 235_shape_memory_copolymers_benzoxazine |
| 236 | education - mooc - competencies - schools - policy | 20 | 236_education_mooc_competencies_schools |
| 237 | lichen - usnea - parmotrema - xanthone - nmr | 20 | 237_lichen_usnea_parmotrema_xanthone |
| 238 | shielding - csi - radiation - glass - ray | 20 | 238_shielding_csi_radiation_glass |
| 239 | bleeding - esd - ppi - rebleeding - endoscopic | 20 | 239_bleeding_esd_ppi_rebleeding |
| 240 | bounds - approximation - random - stein - theorem | 20 | 240_bounds_approximation_random_stein |
| 241 | toothpaste - fluoride - enamel - sdf - decontamination | 20 | 241_toothpaste_fluoride_enamel_sdf |
| 242 | ugt1a1 - slco1b1 - polymorphisms - cyp2d6 - pgx | 19 | 242_ugt1a1_slco1b1_polymorphisms_cyp2d6 |
| 243 | groundwater - recharge - aquifer - river - pumping | 19 | 243_groundwater_recharge_aquifer_river |
| 244 | series - outlier - time - motif - datasets | 19 | 244_series_outlier_time_motif |
| 245 | so - gauged - supergravity - gauge - supersymmetric | 19 | 245_so_gauged_supergravity_gauge |
| 246 | pesticide - ache - pesticides - organophosphate - sensor | 19 | 246_pesticide_ache_pesticides_organophosphate |
| 247 | fractional - chebyshev - equations - nonlocal - equation | 19 | 247_fractional_chebyshev_equations_nonlocal |
| 248 | sg - organoids - epithelial - gland - salivary | 19 | 248_sg_organoids_epithelial_gland |
| 249 | cd - edta - cadmium - soil - arsenic | 19 | 249_cd_edta_cadmium_soil |
| 250 | hydrate - methane - hydrates - formation - dissociation | 19 | 250_hydrate_methane_hydrates_formation |
| 251 | concrete - confined - ultimate - compressive - strength | 19 | 251_concrete_confined_ultimate_compressive |
| 252 | juno - neutrino - detector - pmts - scintillator | 19 | 252_juno_neutrino_detector_pmts |
| 253 | cephalometric - skeletal - convex - landmarks - landmark | 18 | 253_cephalometric_skeletal_convex_landmarks |
| 254 | hydroxyapatite - hap - eggshell - granules - calcium | 18 | 254_hydroxyapatite_hap_eggshell_granules |
| 255 | moea - assembly - line - problem - balancing | 18 | 255_moea_assembly_line_problem |
| 256 | electrochemical - immunosensor - crp - detection - electrode | 18 | 256_electrochemical_immunosensor_crp_detection |
| 257 | sicp - moulding - powder - vol - si3n4 | 18 | 257_sicp_moulding_powder_vol |
| 258 | ppp - infrastructure - projects - subcontractor - private | 18 | 258_ppp_infrastructure_projects_subcontractor |
| 259 | captcha - personality - voice - letters - eye | 18 | 259_captcha_personality_voice_letters |
| 260 | gii - hadv - norovirus - a71 - gastroenteritis | 18 | 260_gii_hadv_norovirus_a71 |
| 261 | stroke - ischemic - ais - nihss - nwu | 18 | 261_stroke_ischemic_ais_nihss |
| 262 | dredged - pavement - sediments - cement - opc | 18 | 262_dredged_pavement_sediments_cement |
| 263 | supercapacitor - graphene - capacitance - supercapacitors - density | 17 | 263_supercapacitor_graphene_capacitance_supercapacitors |
| 264 | corruption - cpi - prosperity - education - firearms | 17 | 264_corruption_cpi_prosperity_education |
| 265 | biofilm - chitosan - vanillin - fluoride - duwls | 17 | 265_biofilm_chitosan_vanillin_fluoride |
| 266 | cdtb - cdta - cdr1 - mpr1 - albicans | 17 | 266_cdtb_cdta_cdr1_mpr1 |
| 267 | access - cancer - drugs - screening - colorectal | 17 | 267_access_cancer_drugs_screening |
| 268 | masticatory - denture - bite - wearers - peanut | 17 | 268_masticatory_denture_bite_wearers |
| 269 | sulfonated - membrane - proton - speek - ether | 17 | 269_sulfonated_membrane_proton_speek |
| 270 | pr - prb - pra - progesterone - breast | 17 | 270_pr_prb_pra_progesterone |
| 271 | steel - girder - columns - concrete - girders | 17 | 271_steel_girder_columns_concrete |
| 272 | antimicrobial - antibiotic - amr - antibiotics - asp | 17 | 272_antimicrobial_antibiotic_amr_antibiotics |
| 273 | biorefinery - pulping - kraft - pulp - process | 17 | 273_biorefinery_pulping_kraft_pulp |
| 274 | methylation - hpv - cervical - hpv16 - promoter | 17 | 274_methylation_hpv_cervical_hpv16 |
| 275 | robot - automation - robots - autonomous - twin | 17 | 275_robot_automation_robots_autonomous |
| 276 | membranes - membrane - pvdf - tio2 - flux | 17 | 276_membranes_membrane_pvdf_tio2 |
| 277 | pharmacists - pharmacy - community - hds - pharmacist | 16 | 277_pharmacists_pharmacy_community_hds |
| 278 | canine - dogs - oral - tumors - lom | 16 | 278_canine_dogs_oral_tumors |
| 279 | formula - formulas - conditional - moments - swaps | 16 | 279_formula_formulas_conditional_moments |
| 280 | periodontitis - periodontal - flowcharts - cal - salivary | 16 | 280_periodontitis_periodontal_flowcharts_cal |
| 281 | peptides - ace - kda - fraction - scavenging | 16 | 281_peptides_ace_kda_fraction |
| 282 | iii - nzvi - removal - adsorption - arsenate | 16 | 282_iii_nzvi_removal_adsorption |
| 283 | dtmuv - ducks - duck - tembusu - mosquitoes | 16 | 283_dtmuv_ducks_duck_tembusu |
| 284 | sccmec - mrsa - methicillin - isolates - staphylococcus | 16 | 284_sccmec_mrsa_methicillin_isolates |
| 285 | death - acceptance - spiritual - buddhist - care | 16 | 285_death_acceptance_spiritual_buddhist |
| 286 | colonoscopy - adr - endoscopists - polyps - lci | 16 | 286_colonoscopy_adr_endoscopists_polyps |
| 287 | peritonitis - peritoneal - catheter - dialysis - pd | 16 | 287_peritonitis_peritoneal_catheter_dialysis |
| 288 | fuel - pemfc - converter - algorithm - optimization | 16 | 288_fuel_pemfc_converter_algorithm |
| 289 | glycerol - propanediol - hydrogenolysis - amo - layered | 16 | 289_glycerol_propanediol_hydrogenolysis_amo |
| 290 | covid - 19 - pandemic - perceived - anxiety | 15 | 290_covid_19_pandemic_perceived |
| 291 | rice - bioaccessible - arsenic - consumption - cadmium | 15 | 291_rice_bioaccessible_arsenic_consumption |
| 292 | vaginal - postmenopausal - menopause - women - genitourinary | 15 | 292_vaginal_postmenopausal_menopause_women |
| 293 | baumannii - colistin - fosfomycin - carbapenem - isolates | 15 | 293_baumannii_colistin_fosfomycin_carbapenem |
| 294 | bc - wound - cellulose - film - bacterial | 15 | 294_bc_wound_cellulose_film |
| 295 | tendon - fhl - peroneal - mkh - accessory | 15 | 295_tendon_fhl_peroneal_mkh |
| 296 | quantum - algorithm - classical - qaoa - grover | 15 | 296_quantum_algorithm_classical_qaoa |
| 297 | aki - kidney - injury - igfbp7 - acute | 15 | 297_aki_kidney_injury_igfbp7 |
| 298 | ripening - durian - fruit - auxin - ethylene | 15 | 298_ripening_durian_fruit_auxin |
| 299 | sars - cov - device - detection - covid | 15 | 299_sars_cov_device_detection |
| 300 | adsorption - h2 - zro2 - vo - tio | 15 | 300_adsorption_h2_zro2_vo |
| 301 | pla - nr - rubber - nsio2 - pcl | 15 | 301_pla_nr_rubber_nsio2 |
| 302 | trigeminal - cgrp - migraine - neurons - pain | 14 | 302_trigeminal_cgrp_migraine_neurons |
| 303 | release - drug - permeation - transdermal - dpnr | 14 | 303_release_drug_permeation_transdermal |
| 304 | titanium - ti - dlc - anodized - anodization | 14 | 304_titanium_ti_dlc_anodized |
| 305 | stock - neural - prediction - trading - market | 14 | 305_stock_neural_prediction_trading |
| 306 | codes - graphs - cayley - finite - rings | 14 | 306_codes_graphs_cayley_finite |
| 307 | resolution - super - image - network - attention | 14 | 307_resolution_super_image_network |
| 308 | lps - macrophages - sepsis - ezh2 - mice | 14 | 308_lps_macrophages_sepsis_ezh2 |
| 309 | hydrogen - succinic - production - fermentation - clostridium | 14 | 309_hydrogen_succinic_production_fermentation |
| 310 | nanoemulsion - cinnamon - clove - oil - nanoemulsions | 14 | 310_nanoemulsion_cinnamon_clove_oil |
| 311 | anorectal - defecation - anal - biofeedback - fi | 14 | 311_anorectal_defecation_anal_biofeedback |
| 312 | gini - income - lorenz - inequality - form | 14 | 312_gini_income_lorenz_inequality |
| 313 | leadership - teacher - school - teachers - principals | 14 | 313_leadership_teacher_school_teachers |
| 314 | sleep - pulse - snoring - psg - wave | 14 | 314_sleep_pulse_snoring_psg |
| 315 | polymerization - ethylene - hexene - thf - tnoa | 14 | 315_polymerization_ethylene_hexene_thf |
| 316 | micp - spores - urea - soil - calcite | 14 | 316_micp_spores_urea_soil |
| 317 | apis - honey - honeybee - mellifera - bee | 14 | 317_apis_honey_honeybee_mellifera |
| 318 | cr - vi - chromium - extraction - electromembrane | 14 | 318_cr_vi_chromium_extraction |
| 319 | sofc - fuel - system - soec - power | 14 | 319_sofc_fuel_system_soec |
| 320 | graph - edge - skirted - graphs - vertex | 14 | 320_graph_edge_skirted_graphs |
| 321 | smart - building - energy - bems - appliance | 14 | 321_smart_building_energy_bems |
| 322 | records - speleothem - isotope - δ18o - monsoon | 13 | 322_records_speleothem_isotope_δ18o |
| 323 | aeromonas - hydrophila - isolates - vah - veronii | 13 | 323_aeromonas_hydrophila_isolates_vah |
| 324 | fruits - insect - abundance - richness - bee | 13 | 324_fruits_insect_abundance_richness |
| 325 | fad - flavin - etfab - bcd - iso | 13 | 325_fad_flavin_etfab_bcd |
| 326 | muslim - malay - religious - muslims - ayutthaya | 13 | 326_muslim_malay_religious_muslims |
| 327 | wheezing - allergy - children - aaf - allergic | 13 | 327_wheezing_allergy_children_aaf |
| 328 | starch - starches - rice - glutinous - flour | 13 | 328_starch_starches_rice_glutinous |
| 329 | cbct - root - mandibular - arch - canal | 13 | 329_cbct_root_mandibular_arch |
| 330 | rabies - id - rabv - prophylaxis - vaccines | 13 | 330_rabies_id_rabv_prophylaxis |
| 331 | hnscc - methylation - pbmcs - squamous - oscc | 13 | 331_hnscc_methylation_pbmcs_squamous |
| 332 | transgender - depression - sexual - gender - dysfunction | 13 | 332_transgender_depression_sexual_gender |
| 333 | buddhist - ethics - ethical - modernity - buddhism | 13 | 333_buddhist_ethics_ethical_modernity |
| 334 | reservoir - decline - commingled - gas - layer | 13 | 334_reservoir_decline_commingled_gas |
| 335 | extract - tpl - antioxidant - hepg2 - indicum | 13 | 335_extract_tpl_antioxidant_hepg2 |
| 336 | blastocystis - stercoralis - entamoeba - infections - parasitic | 13 | 336_blastocystis_stercoralis_entamoeba_infections |
| 337 | spo2 - covid - affective - long - physio | 12 | 337_spo2_covid_affective_long |
| 338 | sinus - frontal - draf - petrous - endoscopic | 12 | 338_sinus_frontal_draf_petrous |
| 339 | poroelastic - foundations - rigid - dynamic - foundation | 12 | 339_poroelastic_foundations_rigid_dynamic |
| 340 | pd - fame - mcm - hydrogenation - sba | 12 | 340_pd_fame_mcm_hydrogenation |
| 341 | genistein - ovx - nash - rats - hfhf | 12 | 341_genistein_ovx_nash_rats |
| 342 | migrant - migration - workers - policy - inequalities | 12 | 342_migrant_migration_workers_policy |
| 343 | silver - colloids - cellulose - nanoparticles - cmcs | 12 | 343_silver_colloids_cellulose_nanoparticles |
| 344 | infliximab - secukinumab - p13 - psa - biosimilar | 12 | 344_infliximab_secukinumab_p13_psa |
| 345 | pleistocene - fossil - miocene - sung - khok | 12 | 345_pleistocene_fossil_miocene_sung |
| 346 | ballistic - polybenzoxazine - friction - composites - aramid | 12 | 346_ballistic_polybenzoxazine_friction_composites |
| 347 | crassna - leaves - mangiferin - extracts - tlc | 12 | 347_crassna_leaves_mangiferin_extracts |
| 348 | h2 - halophytica - production - hydrogenase - cyanobacterium | 12 | 348_h2_halophytica_production_hydrogenase |
| 349 | peri - implant - implantitis - abutment - tissue | 12 | 349_peri_implant_implantitis_abutment |
| 350 | prrsv - pigs - porcine - vaccinated - mlv | 12 | 350_prrsv_pigs_porcine_vaccinated |
| 351 | sensor - voc - sensing - methanol - vocs | 12 | 351_sensor_voc_sensing_methanol |
| 352 | pwm - overmodulation - converters - voltage - inverters | 12 | 352_pwm_overmodulation_converters_voltage |
| 353 | eca - 233 - asiatica - centella - madecassoside | 11 | 353_eca_233_asiatica_centella |
| 354 | nanomaterials - cuonps - toxicity - nanoparticles - plants | 11 | 354_nanomaterials_cuonps_toxicity_nanoparticles |
| 355 | pluripotency - pluripotent - ipscs - reprogramming - rabbit | 11 | 355_pluripotency_pluripotent_ipscs_reprogramming |
| 356 | crispr - sars - cov - cas12a - detection | 11 | 356_crispr_sars_cov_cas12a |
| 357 | crocodiles - freshwater - crocodylus - pharmacokinetic - intramuscular | 11 | 357_crocodiles_freshwater_crocodylus_pharmacokinetic |
| 358 | krfpc - antimalarial - hcshmt - mangiferin - thf | 11 | 358_krfpc_antimalarial_hcshmt_mangiferin |
| 359 | gvhd - nrm - acute - gdf - elafin | 11 | 359_gvhd_nrm_acute_gdf |
| 360 | jig - separation - hybrid - plastics - flotation | 11 | 360_jig_separation_hybrid_plastics |
| 361 | oyster - salmonella - contamination - coliforms - beaches | 11 | 361_oyster_salmonella_contamination_coliforms |
| 362 | oryzanol - lycopene - basil - mushroom - rbao | 11 | 362_oryzanol_lycopene_basil_mushroom |
| 363 | color - pigments - 3d - nm - reflectance | 11 | 363_color_pigments_3d_nm |
| 364 | uv - shelf - dmdc - juice - microbial | 10 | 364_uv_shelf_dmdc_juice |
| 365 | resonant - nonlinear - quantum - harmonic - weakly | 10 | 365_resonant_nonlinear_quantum_harmonic |
| 366 | fluorescence - gqds - fluorescent - fluorescein - go | 10 | 366_fluorescence_gqds_fluorescent_fluorescein |
| 367 | instances - minority - tree - class - classifier | 10 | 367_instances_minority_tree_class |
| 368 | coding - video - hevc - partitioning - vvc | 10 | 368_coding_video_hevc_partitioning |
| 369 | mindfulness - self - athletes - narcissistic - grit | 10 | 369_mindfulness_self_athletes_narcissistic |
| 370 | bats - bat - contact - borne - zoonotic | 10 | 370_bats_bat_contact_borne |
</details>
## Training hyperparameters
* calculate_probabilities: False
* language: None
* low_memory: False
* min_topic_size: 10
* n_gram_range: (1, 1)
* nr_topics: None
* seed_topic_list: None
* top_n_words: 10
* verbose: True
* zeroshot_min_similarity: 0.7
* zeroshot_topic_list: None
## Framework versions
* Numpy: 1.26.4
* HDBSCAN: 0.8.33
* UMAP: 0.5.6
* Pandas: 2.2.2
* Scikit-Learn: 1.4.2
* Sentence-transformers: 2.7.0
* Transformers: 4.40.2
* Numba: 0.59.1
* Plotly: 5.22.0
* Python: 3.10.12