BERTopic-ALI-LARGE / README.md
Tsunnami's picture
Add BERTopic model
dcabcf9 verified
metadata
tags:
  - bertopic
library_name: bertopic
pipeline_tag: text-classification

BERTopic-ALI-LARGE

This is a BERTopic model. BERTopic is a flexible and modular topic modeling framework that allows for the generation of easily interpretable topics from large datasets.

Usage

To use this model, please install BERTopic:

pip install -U bertopic

You can use the model as follows:

from bertopic import BERTopic
topic_model = BERTopic.load("Tsunnami/BERTopic-ALI-LARGE")

topic_model.get_topic_info()

Topic overview

  • Number of topics: 372
  • Number of training documents: 19550
Click here for an overview of all topics.
Topic ID Topic Keywords Topic Frequency Label
-1 and - the - of - was - to 10 -1_and_the_of_was
0 proton - collisions - tev - boson - cms 5490 0_proton_collisions_tev_boson
1 af - cardiac - heart - ventricular - patients 528 1_af_cardiac_heart_ventricular
2 supply - innovation - chain - smes - business 260 2_supply_innovation_chain_smes
3 firms - board - corporate - market - takeover 194 3_firms_board_corporate_market
4 glaucoma - eyes - corneal - iop - eye 152 4_glaucoma_eyes_corneal_iop
5 hiv - art - cd4 - antiretroviral - prep 143 5_hiv_art_cd4_antiretroviral
6 cancer - lung - akt - cells - csc 135 6_cancer_lung_akt_cells
7 photocatalytic - tio2 - visible - light - photocatalysts 128 7_photocatalytic_tio2_visible_light
8 strain - streptomyces - mk - genus - micromonospora 125 8_strain_streptomyces_mk_genus
9 mdd - depression - disorders - disorder - immune 124 9_mdd_depression_disorders_disorder
10 synechocystis - cyanobacteria - phb - pcc - 6803 121 10_synechocystis_cyanobacteria_phb_pcc
11 lymphoma - survival - patients - pfs - os 117 11_lymphoma_survival_patients_pfs
12 image - images - dataset - classification - convolutional 113 12_image_images_dataset_classification
13 pressure - gpa - structure - electronic - phonon 111 13_pressure_gpa_structure_electronic
14 power - pv - electricity - energy - generation 106 14_power_pv_electricity_energy
15 ti - alloys - zr - alloy - microstructure 106 15_ti_alloys_zr_alloy
16 osteogenic - differentiation - expression - bone - cells 106 16_osteogenic_differentiation_expression_bone
17 groundwater - pb - waste - metals - cd 105 17_groundwater_pb_waste_metals
18 undrained - soil - stability - anisotropic - clays 103 18_undrained_soil_stability_anisotropic
19 text - word - words - classification - language 98 19_text_word_words_classification
20 skin - psoriasis - acne - laser - sebum 97 20_skin_psoriasis_acne_laser
21 electrochemical - sensor - detection - electrode - printed 96 21_electrochemical_sensor_detection_electrode
22 brand - customer - intention - trust - purchase 92 22_brand_customer_intention_trust
23 vaccine - vaccination - booster - bnt162b2 - coronavac 90 23_vaccine_vaccination_booster_bnt162b2
24 oer - electrocatalysts - orr - electrochemical - reduction 87 24_oer_electrocatalysts_orr_electrochemical
25 shrimp - wssv - vpahpnd - monodon - penaeus 86 25_shrimp_wssv_vpahpnd_monodon
26 learning - students - teachers - online - skills 86 26_learning_students_teachers_online
27 parkinson - pd - tremor - movement - motor 84 27_parkinson_pd_tremor_movement
28 biodiesel - catalyst - transesterification - reaction - methanol 84 28_biodiesel_catalyst_transesterification_reaction
29 fault - rocks - triassic - basin - permian 84 29_fault_rocks_triassic_basin
30 fish - reproductive - digestive - histological - pranburi 77 30_fish_reproductive_digestive_histological
31 zinc - zn - batteries - zibs - electrolyte 77 31_zinc_zn_batteries_zibs
32 elastic - crack - boundary - element - beam 76 32_elastic_crack_boundary_element
33 antenna - db - photonic - mimo - soa 76 33_antenna_db_photonic_mimo
34 music - cultural - thai - musical - arts 74 34_music_cultural_thai_musical
35 silk - sf - scaffolds - fibroin - hydrogels 73 35_silk_sf_scaffolds_fibroin
36 nov - species - sp - genus - snail 73 36_nov_species_sp_genus
37 pla - pbs - pbat - poly - blend 73 37_pla_pbs_pbat_poly
38 elegans - glutamate - neuroprotective - extracts - extract 72 38_elegans_glutamate_neuroprotective_extracts
39 diabetes - sleep - osa - glucose - hba1c 71 39_diabetes_sleep_osa_glucose
40 balance - fall - falls - training - sts 71 40_balance_fall_falls_training
41 ceo2 - catalysts - catalyst - ni - co2 69 41_ceo2_catalysts_catalyst_ni
42 nasal - rhinosinusitis - rhinitis - saline - incs 69 42_nasal_rhinosinusitis_rhinitis_saline
43 binding - cov - sars - protease - molecular 68 43_binding_cov_sars_protease
44 dialysis - pd - peritoneal - peritonitis - kidney 67 44_dialysis_pd_peritoneal_peritonitis
45 ash - cement - fly - geopolymer - concrete 67 45_ash_cement_fly_geopolymer
46 antioxidant - rice - phenolic - flour - riceberry 67 46_antioxidant_rice_phenolic_flour
47 coral - corals - reefs - gulf - species 66 47_coral_corals_reefs_gulf
48 english - learners - l1 - corpus - lexical 65 48_english_learners_l1_corpus
49 thalassemia - g6pd - transfusion - tdt - iron 64 49_thalassemia_g6pd_transfusion_tdt
50 lumbar - fusion - spine - interbody - endoscopic 64 50_lumbar_fusion_spine_interbody
51 schizophrenia - deficit - igm - iga - symptoms 64 51_schizophrenia_deficit_igm_iga
52 tilapia - fish - tilv - oreochromis - nile 64 52_tilapia_fish_tilv_oreochromis
53 fermented - lactic - bacillus - lab - strains 63 53_fermented_lactic_bacillus_lab
54 malaria - health - adolescent - pakistan - breastfeeding 63 54_malaria_health_adolescent_pakistan
55 political - asean - china - military - party 62 55_political_asean_china_military
56 caries - dental - oral - health - children 62 56_caries_dental_oral_health
57 cognitive - moca - hearing - dementia - memory 62 57_cognitive_moca_hearing_dementia
58 alcohol - smoking - drinking - cannabis - tobacco 61 58_alcohol_smoking_drinking_cannabis
59 land - rainfall - climate - basin - flood 61 59_land_rainfall_climate_basin
60 mcr - coli - resistance - isolates - colistin 61 60_mcr_coli_resistance_isolates
61 chitosan - nanoparticles - nps - alg - delivery 60 61_chitosan_nanoparticles_nps_alg
62 compounds - isolated - cytotoxicity - ic50 - kb 60 62_compounds_isolated_cytotoxicity_ic50
63 english - students - language - reading - writing 60 63_english_students_language_reading
64 nursing - nurses - nurse - competence - managerial 59 64_nursing_nurses_nurse_competence
65 pyrolysis - catalyst - catalytic - ni - oil 57 65_pyrolysis_catalyst_catalytic_ni
66 disaster - flood - tsunami - damage - bcm 57 66_disaster_flood_tsunami_damage
67 fluorescence - fluorescent - ions - cu2 - sensor 56 67_fluorescence_fluorescent_ions_cu2
68 cellulose - cmc - hydrogels - nanocellulose - fibers 54 68_cellulose_cmc_hydrogels_nanocellulose
69 salt - rice - stress - tolerance - kdml105 53 69_salt_rice_stress_tolerance
70 reactor - fluidized - sorbent - bed - solid 53 70_reactor_fluidized_sorbent_bed
71 hawc - gamma - ray - observatory - tev 52 71_hawc_gamma_ray_observatory
72 warehouse - cost - inventory - scheduling - problem 52 72_warehouse_cost_inventory_scheduling
73 hpv - cervical - papillomavirus - women - cancer 52 73_hpv_cervical_papillomavirus_women
74 hypertension - pressure - blood - ht - adherence 51 74_hypertension_pressure_blood_ht
75 care - health - caregivers - older - nhi 51 75_care_health_caregivers_older
76 blockchain - iot - bct - privacy - trust 51 76_blockchain_iot_bct_privacy
77 polynomials - infinite - ideals - definable - if 50 77_polynomials_infinite_ideals_definable
78 robot - grasping - stiffness - rehabilitation - actuator 50 78_robot_grasping_stiffness_rehabilitation
79 quicke - braconidae - species - hymenoptera - nov 50 79_quicke_braconidae_species_hymenoptera
80 electric - field - particle - cell - trapping 50 80_electric_field_particle_cell
81 design - optimization - robust - nonlinear - feedback 50 81_design_optimization_robust_nonlinear
82 pain - tka - knee - morphine - postoperative 50 82_pain_tka_knee_morphine
83 furfural - hmf - catalysts - catalyst - hydroxymethylfurfural 50 83_furfural_hmf_catalysts_catalyst
84 covid - 19 - pandemic - health - countries 49 84_covid_19_pandemic_health
85 flow - drift - heat - wake - velocity 49 85_flow_drift_heat_wake
86 retirement - older - health - life - happiness 49 86_retirement_older_health_life
87 peasant - farmers - agroecology - farming - resilience 49 87_peasant_farmers_agroecology_farming
88 bond - resin - zirconia - adhesive - ceramic 49 88_bond_resin_zirconia_adhesive
89 job - employee - employees - turnover - organizational 48 89_job_employee_employees_turnover
90 forest - mangrove - trees - tree - forests 48 90_forest_mangrove_trees_tree
91 consortium - degradation - biodegradation - profenofos - degrading 48 91_consortium_degradation_biodegradation_profenofos
92 extraction - gold - liquid - hg - mercury 48 92_extraction_gold_liquid_hg
93 variants - exome - sequencing - genetic - variant 48 93_variants_exome_sequencing_genetic
94 levan - levansucrase - inulosucrase - residues - maltose 47 94_levan_levansucrase_inulosucrase_residues
95 cpv - dogs - canine - pdcov - virus 46 95_cpv_dogs_canine_pdcov
96 spp - bartonella - parasites - immitis - cats 46 96_spp_bartonella_parasites_immitis
97 sjs - hla - ten - reactions - scars 45 97_sjs_hla_ten_reactions
98 glucosidase - inhibitory - inhibition - ic50 - compounds 44 98_glucosidase_inhibitory_inhibition_ic50
99 bim - construction - building - project - lca 44 99_bim_construction_building_project
100 network - cloud - iot - wireless - traffic 44 100_network_cloud_iot_wireless
101 macaques - macaque - tailed - macaca - fascicularis 44 101_macaques_macaque_tailed_macaca
102 rollover - traffic - prediction - machine - tripped 44 102_rollover_traffic_prediction_machine
103 gravity - stars - black - massive - wormhole 43 103_gravity_stars_black_massive
104 rubber - nr - composites - vulcanization - cure 43 104_rubber_nr_composites_vulcanization
105 co2 - mea - amine - amp - absorption 42 105_co2_mea_amine_amp
106 pain - neck - office - back - workers 42 106_pain_neck_office_back
107 anaerobic - wastewater - mbr - cod - bioreactor 42 107_anaerobic_wastewater_mbr_cod
108 mutations - imperfecta - mutation - variants - heterozygous 42 108_mutations_imperfecta_mutation_variants
109 inflation - supergravity - scalar - inflaton - cosmological 41 109_inflation_supergravity_scalar_inflaton
110 pythiosis - insidiosum - pythium - fungal - keratitis 41 110_pythiosis_insidiosum_pythium_fungal
111 galaxies - star - alma - stellar - agn 41 111_galaxies_star_alma_stellar
112 energy - δln - emissions - sector - policy 41 112_energy_δln_emissions_sector
113 falciparum - plasmodium - malaria - vivax - parasite 41 113_falciparum_plasmodium_malaria_vivax
114 seizure - epilepsy - seizures - eeg - neurological 41 114_seizure_epilepsy_seizures_eeg
115 stone - aldosterone - urinary - oxalate - urolithiasis 40 115_stone_aldosterone_urinary_oxalate
116 petri - timed - nets - cpn - verification 40 116_petri_timed_nets_cpn
117 oa - knee - synovial - rtl - osteoarthritis 40 117_oa_knee_synovial_rtl
118 beams - concrete - shear - strengthened - ets 40 118_beams_concrete_shear_strengthened
119 species - corolla - kidyoo - apocynaceae - ceropegia 40 119_species_corolla_kidyoo_apocynaceae
120 tb - tuberculosis - mtb - mycobacterium - ltbi 39 120_tb_tuberculosis_mtb_mycobacterium
121 pna - nucleic - dna - acpcpna - pyrrolidinyl 39 121_pna_nucleic_dna_acpcpna
122 exercise - hiit - training - chest - plb 39 122_exercise_hiit_training_chest
123 pm2 - pm10 - air - pollution - particulate 38 123_pm2_pm10_air_pollution
124 gaas - insb - epitaxy - nanowires - quantum 38 124_gaas_insb_epitaxy_nanowires
125 cmv - covid - 19 - recipients - transplant 38 125_cmv_covid_19_recipients
126 microplastics - mps - microplastic - plastic - pollution 38 126_microplastics_mps_microplastic_plastic
127 qol - version - validity - reliability - thai 38 127_qol_version_validity_reliability
128 hbv - hepatitis - hbsag - chb - hbeag 37 128_hbv_hepatitis_hbsag_chb
129 γcd - solubility - eye - cyclodextrin - aqueous 37 129_γcd_solubility_eye_cyclodextrin
130 adsorption - adsorbent - removal - dye - zeolite 37 130_adsorption_adsorbent_removal_dye
131 lupus - fcgriib - sle - mice - fcγriib 37 131_lupus_fcgriib_sle_mice
132 transit - motorcycle - bangkok - bus - rha 37 132_transit_motorcycle_bangkok_bus
133 anesthesia - incidents - anesthetic - perioperative - paad 37 133_anesthesia_incidents_anesthetic_perioperative
134 pertussis - vaccine - vaccination - measles - tetanus 37 134_pertussis_vaccine_vaccination_measles
135 microalgae - biomass - wastewater - algal - sp 37 135_microalgae_biomass_wastewater_algal
136 aki - rrt - kidney - injury - acute 36 136_aki_rrt_kidney_injury
137 forecasting - forecast - weather - arima - model 35 137_forecasting_forecast_weather_arima
138 repair - tendon - arthroscopic - suture - pain 35 138_repair_tendon_arthroscopic_suture
139 curcumin - cur - cisplatin - prodrug - curdg 35 139_curcumin_cur_cisplatin_prodrug
140 dose - ct - radiation - kv - cbct 35 140_dose_ct_radiation_kv
141 hcv - hepatitis - hbv - infection - anti 35 141_hcv_hepatitis_hbv_infection
142 preeclampsia - placental - uterine - trimester - pregnant 35 142_preeclampsia_placental_uterine_trimester
143 steam - reforming - h2 - cao - gasification 34 143_steam_reforming_h2_cao
144 leishmania - martiniquensis - leishmaniasis - mundinia - sand 34 144_leishmania_martiniquensis_leishmaniasis_mundinia
145 alu - dna - methylation - hypomethylation - epigenetic 34 145_alu_dna_methylation_hypomethylation
146 oil - surfactant - wax - oilfield - recovery 34 146_oil_surfactant_wax_oilfield
147 tsunami - sea - deposits - beach - sedimentary 34 147_tsunami_sea_deposits_beach
148 eeg - emotion - bci - granger - emotions 34 148_eeg_emotion_bci_granger
149 cd - inclusion - cyclodextrin - βcd - complexes 34 149_cd_inclusion_cyclodextrin_βcd
150 artery - facial - mm - arteries - nerve 34 150_artery_facial_mm_arteries
151 tac - tacrolimus - transplantation - kidney - rejection 34 151_tac_tacrolimus_transplantation_kidney
152 ethanol - catalysts - dehydrogenation - catalyst - acetaldehyde 34 152_ethanol_catalysts_dehydrogenation_catalyst
153 waste - management - msw - solid - municipal 33 153_waste_management_msw_solid
154 curcuma - herbal - barcoding - species - speciosa 33 154_curcuma_herbal_barcoding_species
155 species - tylototriton - verrucosus - snake - feihyla 33 155_species_tylototriton_verrucosus_snake
156 mirifica - candollei - pueraria - puerarin - isoflavonoids 32 156_mirifica_candollei_pueraria_puerarin
157 leptospira - leptospirosis - interrogans - lipl32 - serovar 32 157_leptospira_leptospirosis_interrogans_lipl32
158 gerd - esophageal - reflux - egj - achalasia 32 158_gerd_esophageal_reflux_egj
159 nutrition - sepsis - septic - hpn - parenteral 32 159_nutrition_sepsis_septic_hpn
160 biliary - drainage - ercp - eus - stent 32 160_biliary_drainage_ercp_eus
161 pesticide - farmers - exposure - pesticides - op 32 161_pesticide_farmers_exposure_pesticides
162 asd - autism - bpa - bdnf - genes 31 162_asd_autism_bpa_bdnf
163 reaction - yields - synthesis - cyclization - thioglycosides 31 163_reaction_yields_synthesis_cyclization
164 goats - dcad - milk - colostrum - crossbred 31 164_goats_dcad_milk_colostrum
165 gut - candida - mice - sepsis - bg 31 165_gut_candida_mice_sepsis
166 pda - polydiacetylene - reversible - nanocomposites - assemblies 31 166_pda_polydiacetylene_reversible_nanocomposites
167 lignin - pyrolysis - htl - bio - biomass 31 167_lignin_pyrolysis_htl_bio
168 ammonia - nitrification - nitrogen - nitrifying - biofilter 30 168_ammonia_nitrification_nitrogen_nitrifying
169 surfactant - surfactants - detergency - washing - oil 30 169_surfactant_surfactants_detergency_washing
170 fibrosis - nafld - liver - steatosis - fib 30 170_fibrosis_nafld_liver_steatosis
171 gnss - positioning - pwv - cors - navigation 30 171_gnss_positioning_pwv_cors
172 olp - oral - pain - oidp - ohrqol 30 172_olp_oral_pain_oidp
173 dose - vmat - field - plans - beam 30 173_dose_vmat_field_plans
174 breast - lesions - mri - mammography - pet 30 174_breast_lesions_mri_mammography
175 acemannan - bone - prf - periodontal - bubaline 29 175_acemannan_bone_prf_periodontal
176 ckd - cinacalcet - serum - bone - bko 29 176_ckd_cinacalcet_serum_bone
177 pani - conductivity - electrical - pss - polyaniline 29 177_pani_conductivity_electrical_pss
178 hiv - plhiv - plwh - nutritional - antiretroviral 29 178_hiv_plhiv_plwh_nutritional
179 nov - sp - species - enghoff - genus 29 179_nov_sp_species_enghoff
180 ibd - uc - colitis - bowel - crohn 29 180_ibd_uc_colitis_bowel
181 sows - piglets - piglet - farrowing - colostrum 29 181_sows_piglets_piglet_farrowing
182 dom - adsorption - fenton - dissolved - decolorization 29 182_dom_adsorption_fenton_dissolved
183 ba - hcc - liver - mir - atresia 29 183_ba_hcc_liver_mir
184 cov - sars - rbd - fc - vaccine 29 184_cov_sars_rbd_fc
185 thyroid - ptc - braf - mutation - niftp 29 185_thyroid_ptc_braf_mutation
186 web - software - code - test - bpmn 28 186_web_software_code_test
187 grass - pretreatment - napier - fermentation - ethanol 28 187_grass_pretreatment_napier_fermentation
188 mangostin - mangosteen - zebrafish - acanthamoeba - αm 28 188_mangostin_mangosteen_zebrafish_acanthamoeba
189 implant - cais - implants - static - freehand 28 189_implant_cais_implants_static
190 moral - crying - situational - gift - happiness 27 190_moral_crying_situational_gift
191 adsorption - co2 - zif - carbon - mil 27 191_adsorption_co2_zif_carbon
192 claudin - odontogenic - ameloblastoma - lesions - pten 27 192_claudin_odontogenic_ameloblastoma_lesions
193 plantar - foot - fasciitis - ankle - stretching 27 193_plantar_foot_fasciitis_ankle
194 perovskite - perovskites - solar - halide - pscs 26 194_perovskite_perovskites_solar_halide
195 ysz - anode - scgz - electrolysis - electrolyte 26 195_ysz_anode_scgz_electrolysis
196 allergens - allergen - der - ait - hdm 26 196_allergens_allergen_der_ait
197 aclf - liver - cirrhosis - aarc - failure 26 197_aclf_liver_cirrhosis_aarc
198 tourism - tourists - destination - food - visitors 26 198_tourism_tourists_destination_food
199 sperm - semen - motility - spermatozoa - cryopreservation 26 199_sperm_semen_motility_spermatozoa
200 dosing - mic - pharmacokinetic - meropenem - vancomycin 26 200_dosing_mic_pharmacokinetic_meropenem
201 indoor - air - respiratory - pollution - exposure 26 201_indoor_air_respiratory_pollution
202 cca - cholangiocarcinoma - lin28b - fluke - necroptosis 25 202_cca_cholangiocarcinoma_lin28b_fluke
203 tdf - hiv - taf - switching - rpv 25 203_tdf_hiv_taf_switching
204 battery - vanadium - vrfb - zinc - flow 25 204_battery_vanadium_vrfb_zinc
205 user - recommendation - rating - recommender - filtering 25 205_user_recommendation_rating_recommender
206 npc - ebv - nasopharyngeal - adc - imrt 25 206_npc_ebv_nasopharyngeal_adc
207 earthquake - seismic - ground - liquefaction - northern 24 207_earthquake_seismic_ground_liquefaction
208 images - segmentation - cnn - ai - eus 24 208_images_segmentation_cnn_ai
209 gc - column - chromatography - 2d - comprehensive 24 209_gc_column_chromatography_2d
210 amh - kisspeptin - testicular - gilts - pigs 23 210_amh_kisspeptin_testicular_gilts
211 egfr - jak2 - tk - kinase - inhibitors 23 211_egfr_jak2_tk_kinase
212 sars - cov - cats - wastewater - coronavirus 23 212_sars_cov_cats_wastewater
213 middle - east - subfamily - species - iran 23 213_middle_east_subfamily_species
214 cos - broilers - diet - supplementation - yolk 22 214_cos_broilers_diet_supplementation
215 ckd - kidney - egfr - prevalence - glomerular 22 215_ckd_kidney_egfr_prevalence
216 mxene - ldh - lithium - ti3c2 - mah 22 216_mxene_ldh_lithium_ti3c2
217 venom - snakebite - antivenom - antivenoms - snake 22 217_venom_snakebite_antivenom_antivenoms
218 radon - thoron - bq - msv - dose 22 218_radon_thoron_bq_msv
219 addiction - media - children - screen - smartphone 22 219_addiction_media_children_screen
220 chikv - zikv - mosquitoes - denv - chikungunya 22 220_chikv_zikv_mosquitoes_denv
221 methanol - dme - co2 - synthesis - process 21 221_methanol_dme_co2_synthesis
222 tax - investment - export - firms - subsidies 21 222_tax_investment_export_firms
223 theories - gauge - quiver - duality - 2n 21 223_theories_gauge_quiver_duality
224 drug - network - associations - heterogeneous - similarity 21 224_drug_network_associations_heterogeneous
225 lewis - metathesis - catalysts - sio2 - wo3 21 225_lewis_metathesis_catalysts_sio2
226 plasma - hydrogenation - fame - discharge - margarine 21 226_plasma_hydrogenation_fame_discharge
227 plant - antibody - produced - benthamiana - mab 21 227_plant_antibody_produced_benthamiana
228 asfv - swine - csfv - fever - dt 21 228_asfv_swine_csfv_fever
229 thermal - heat - cooling - gshp - pumps 21 229_thermal_heat_cooling_gshp
230 complexes - coordination - ii - ligand - cu 21 230_complexes_coordination_ii_ligand
231 probiotics - synbiotic - gut - obesity - microbiota 21 231_probiotics_synbiotic_gut_obesity
232 sigma - process - defect - dmaic - defective 20 232_sigma_process_defect_dmaic
233 tyrosinase - melanin - antityrosinase - extract - activity 20 233_tyrosinase_melanin_antityrosinase_extract
234 residue - protein - hv1 - gyration - grained 20 234_residue_protein_hv1_gyration
235 shape - memory - copolymers - benzoxazine - epoxy 20 235_shape_memory_copolymers_benzoxazine
236 education - mooc - competencies - schools - policy 20 236_education_mooc_competencies_schools
237 lichen - usnea - parmotrema - xanthone - nmr 20 237_lichen_usnea_parmotrema_xanthone
238 shielding - csi - radiation - glass - ray 20 238_shielding_csi_radiation_glass
239 bleeding - esd - ppi - rebleeding - endoscopic 20 239_bleeding_esd_ppi_rebleeding
240 bounds - approximation - random - stein - theorem 20 240_bounds_approximation_random_stein
241 toothpaste - fluoride - enamel - sdf - decontamination 20 241_toothpaste_fluoride_enamel_sdf
242 ugt1a1 - slco1b1 - polymorphisms - cyp2d6 - pgx 19 242_ugt1a1_slco1b1_polymorphisms_cyp2d6
243 groundwater - recharge - aquifer - river - pumping 19 243_groundwater_recharge_aquifer_river
244 series - outlier - time - motif - datasets 19 244_series_outlier_time_motif
245 so - gauged - supergravity - gauge - supersymmetric 19 245_so_gauged_supergravity_gauge
246 pesticide - ache - pesticides - organophosphate - sensor 19 246_pesticide_ache_pesticides_organophosphate
247 fractional - chebyshev - equations - nonlocal - equation 19 247_fractional_chebyshev_equations_nonlocal
248 sg - organoids - epithelial - gland - salivary 19 248_sg_organoids_epithelial_gland
249 cd - edta - cadmium - soil - arsenic 19 249_cd_edta_cadmium_soil
250 hydrate - methane - hydrates - formation - dissociation 19 250_hydrate_methane_hydrates_formation
251 concrete - confined - ultimate - compressive - strength 19 251_concrete_confined_ultimate_compressive
252 juno - neutrino - detector - pmts - scintillator 19 252_juno_neutrino_detector_pmts
253 cephalometric - skeletal - convex - landmarks - landmark 18 253_cephalometric_skeletal_convex_landmarks
254 hydroxyapatite - hap - eggshell - granules - calcium 18 254_hydroxyapatite_hap_eggshell_granules
255 moea - assembly - line - problem - balancing 18 255_moea_assembly_line_problem
256 electrochemical - immunosensor - crp - detection - electrode 18 256_electrochemical_immunosensor_crp_detection
257 sicp - moulding - powder - vol - si3n4 18 257_sicp_moulding_powder_vol
258 ppp - infrastructure - projects - subcontractor - private 18 258_ppp_infrastructure_projects_subcontractor
259 captcha - personality - voice - letters - eye 18 259_captcha_personality_voice_letters
260 gii - hadv - norovirus - a71 - gastroenteritis 18 260_gii_hadv_norovirus_a71
261 stroke - ischemic - ais - nihss - nwu 18 261_stroke_ischemic_ais_nihss
262 dredged - pavement - sediments - cement - opc 18 262_dredged_pavement_sediments_cement
263 supercapacitor - graphene - capacitance - supercapacitors - density 17 263_supercapacitor_graphene_capacitance_supercapacitors
264 corruption - cpi - prosperity - education - firearms 17 264_corruption_cpi_prosperity_education
265 biofilm - chitosan - vanillin - fluoride - duwls 17 265_biofilm_chitosan_vanillin_fluoride
266 cdtb - cdta - cdr1 - mpr1 - albicans 17 266_cdtb_cdta_cdr1_mpr1
267 access - cancer - drugs - screening - colorectal 17 267_access_cancer_drugs_screening
268 masticatory - denture - bite - wearers - peanut 17 268_masticatory_denture_bite_wearers
269 sulfonated - membrane - proton - speek - ether 17 269_sulfonated_membrane_proton_speek
270 pr - prb - pra - progesterone - breast 17 270_pr_prb_pra_progesterone
271 steel - girder - columns - concrete - girders 17 271_steel_girder_columns_concrete
272 antimicrobial - antibiotic - amr - antibiotics - asp 17 272_antimicrobial_antibiotic_amr_antibiotics
273 biorefinery - pulping - kraft - pulp - process 17 273_biorefinery_pulping_kraft_pulp
274 methylation - hpv - cervical - hpv16 - promoter 17 274_methylation_hpv_cervical_hpv16
275 robot - automation - robots - autonomous - twin 17 275_robot_automation_robots_autonomous
276 membranes - membrane - pvdf - tio2 - flux 17 276_membranes_membrane_pvdf_tio2
277 pharmacists - pharmacy - community - hds - pharmacist 16 277_pharmacists_pharmacy_community_hds
278 canine - dogs - oral - tumors - lom 16 278_canine_dogs_oral_tumors
279 formula - formulas - conditional - moments - swaps 16 279_formula_formulas_conditional_moments
280 periodontitis - periodontal - flowcharts - cal - salivary 16 280_periodontitis_periodontal_flowcharts_cal
281 peptides - ace - kda - fraction - scavenging 16 281_peptides_ace_kda_fraction
282 iii - nzvi - removal - adsorption - arsenate 16 282_iii_nzvi_removal_adsorption
283 dtmuv - ducks - duck - tembusu - mosquitoes 16 283_dtmuv_ducks_duck_tembusu
284 sccmec - mrsa - methicillin - isolates - staphylococcus 16 284_sccmec_mrsa_methicillin_isolates
285 death - acceptance - spiritual - buddhist - care 16 285_death_acceptance_spiritual_buddhist
286 colonoscopy - adr - endoscopists - polyps - lci 16 286_colonoscopy_adr_endoscopists_polyps
287 peritonitis - peritoneal - catheter - dialysis - pd 16 287_peritonitis_peritoneal_catheter_dialysis
288 fuel - pemfc - converter - algorithm - optimization 16 288_fuel_pemfc_converter_algorithm
289 glycerol - propanediol - hydrogenolysis - amo - layered 16 289_glycerol_propanediol_hydrogenolysis_amo
290 covid - 19 - pandemic - perceived - anxiety 15 290_covid_19_pandemic_perceived
291 rice - bioaccessible - arsenic - consumption - cadmium 15 291_rice_bioaccessible_arsenic_consumption
292 vaginal - postmenopausal - menopause - women - genitourinary 15 292_vaginal_postmenopausal_menopause_women
293 baumannii - colistin - fosfomycin - carbapenem - isolates 15 293_baumannii_colistin_fosfomycin_carbapenem
294 bc - wound - cellulose - film - bacterial 15 294_bc_wound_cellulose_film
295 tendon - fhl - peroneal - mkh - accessory 15 295_tendon_fhl_peroneal_mkh
296 quantum - algorithm - classical - qaoa - grover 15 296_quantum_algorithm_classical_qaoa
297 aki - kidney - injury - igfbp7 - acute 15 297_aki_kidney_injury_igfbp7
298 ripening - durian - fruit - auxin - ethylene 15 298_ripening_durian_fruit_auxin
299 sars - cov - device - detection - covid 15 299_sars_cov_device_detection
300 adsorption - h2 - zro2 - vo - tio 15 300_adsorption_h2_zro2_vo
301 pla - nr - rubber - nsio2 - pcl 15 301_pla_nr_rubber_nsio2
302 trigeminal - cgrp - migraine - neurons - pain 14 302_trigeminal_cgrp_migraine_neurons
303 release - drug - permeation - transdermal - dpnr 14 303_release_drug_permeation_transdermal
304 titanium - ti - dlc - anodized - anodization 14 304_titanium_ti_dlc_anodized
305 stock - neural - prediction - trading - market 14 305_stock_neural_prediction_trading
306 codes - graphs - cayley - finite - rings 14 306_codes_graphs_cayley_finite
307 resolution - super - image - network - attention 14 307_resolution_super_image_network
308 lps - macrophages - sepsis - ezh2 - mice 14 308_lps_macrophages_sepsis_ezh2
309 hydrogen - succinic - production - fermentation - clostridium 14 309_hydrogen_succinic_production_fermentation
310 nanoemulsion - cinnamon - clove - oil - nanoemulsions 14 310_nanoemulsion_cinnamon_clove_oil
311 anorectal - defecation - anal - biofeedback - fi 14 311_anorectal_defecation_anal_biofeedback
312 gini - income - lorenz - inequality - form 14 312_gini_income_lorenz_inequality
313 leadership - teacher - school - teachers - principals 14 313_leadership_teacher_school_teachers
314 sleep - pulse - snoring - psg - wave 14 314_sleep_pulse_snoring_psg
315 polymerization - ethylene - hexene - thf - tnoa 14 315_polymerization_ethylene_hexene_thf
316 micp - spores - urea - soil - calcite 14 316_micp_spores_urea_soil
317 apis - honey - honeybee - mellifera - bee 14 317_apis_honey_honeybee_mellifera
318 cr - vi - chromium - extraction - electromembrane 14 318_cr_vi_chromium_extraction
319 sofc - fuel - system - soec - power 14 319_sofc_fuel_system_soec
320 graph - edge - skirted - graphs - vertex 14 320_graph_edge_skirted_graphs
321 smart - building - energy - bems - appliance 14 321_smart_building_energy_bems
322 records - speleothem - isotope - δ18o - monsoon 13 322_records_speleothem_isotope_δ18o
323 aeromonas - hydrophila - isolates - vah - veronii 13 323_aeromonas_hydrophila_isolates_vah
324 fruits - insect - abundance - richness - bee 13 324_fruits_insect_abundance_richness
325 fad - flavin - etfab - bcd - iso 13 325_fad_flavin_etfab_bcd
326 muslim - malay - religious - muslims - ayutthaya 13 326_muslim_malay_religious_muslims
327 wheezing - allergy - children - aaf - allergic 13 327_wheezing_allergy_children_aaf
328 starch - starches - rice - glutinous - flour 13 328_starch_starches_rice_glutinous
329 cbct - root - mandibular - arch - canal 13 329_cbct_root_mandibular_arch
330 rabies - id - rabv - prophylaxis - vaccines 13 330_rabies_id_rabv_prophylaxis
331 hnscc - methylation - pbmcs - squamous - oscc 13 331_hnscc_methylation_pbmcs_squamous
332 transgender - depression - sexual - gender - dysfunction 13 332_transgender_depression_sexual_gender
333 buddhist - ethics - ethical - modernity - buddhism 13 333_buddhist_ethics_ethical_modernity
334 reservoir - decline - commingled - gas - layer 13 334_reservoir_decline_commingled_gas
335 extract - tpl - antioxidant - hepg2 - indicum 13 335_extract_tpl_antioxidant_hepg2
336 blastocystis - stercoralis - entamoeba - infections - parasitic 13 336_blastocystis_stercoralis_entamoeba_infections
337 spo2 - covid - affective - long - physio 12 337_spo2_covid_affective_long
338 sinus - frontal - draf - petrous - endoscopic 12 338_sinus_frontal_draf_petrous
339 poroelastic - foundations - rigid - dynamic - foundation 12 339_poroelastic_foundations_rigid_dynamic
340 pd - fame - mcm - hydrogenation - sba 12 340_pd_fame_mcm_hydrogenation
341 genistein - ovx - nash - rats - hfhf 12 341_genistein_ovx_nash_rats
342 migrant - migration - workers - policy - inequalities 12 342_migrant_migration_workers_policy
343 silver - colloids - cellulose - nanoparticles - cmcs 12 343_silver_colloids_cellulose_nanoparticles
344 infliximab - secukinumab - p13 - psa - biosimilar 12 344_infliximab_secukinumab_p13_psa
345 pleistocene - fossil - miocene - sung - khok 12 345_pleistocene_fossil_miocene_sung
346 ballistic - polybenzoxazine - friction - composites - aramid 12 346_ballistic_polybenzoxazine_friction_composites
347 crassna - leaves - mangiferin - extracts - tlc 12 347_crassna_leaves_mangiferin_extracts
348 h2 - halophytica - production - hydrogenase - cyanobacterium 12 348_h2_halophytica_production_hydrogenase
349 peri - implant - implantitis - abutment - tissue 12 349_peri_implant_implantitis_abutment
350 prrsv - pigs - porcine - vaccinated - mlv 12 350_prrsv_pigs_porcine_vaccinated
351 sensor - voc - sensing - methanol - vocs 12 351_sensor_voc_sensing_methanol
352 pwm - overmodulation - converters - voltage - inverters 12 352_pwm_overmodulation_converters_voltage
353 eca - 233 - asiatica - centella - madecassoside 11 353_eca_233_asiatica_centella
354 nanomaterials - cuonps - toxicity - nanoparticles - plants 11 354_nanomaterials_cuonps_toxicity_nanoparticles
355 pluripotency - pluripotent - ipscs - reprogramming - rabbit 11 355_pluripotency_pluripotent_ipscs_reprogramming
356 crispr - sars - cov - cas12a - detection 11 356_crispr_sars_cov_cas12a
357 crocodiles - freshwater - crocodylus - pharmacokinetic - intramuscular 11 357_crocodiles_freshwater_crocodylus_pharmacokinetic
358 krfpc - antimalarial - hcshmt - mangiferin - thf 11 358_krfpc_antimalarial_hcshmt_mangiferin
359 gvhd - nrm - acute - gdf - elafin 11 359_gvhd_nrm_acute_gdf
360 jig - separation - hybrid - plastics - flotation 11 360_jig_separation_hybrid_plastics
361 oyster - salmonella - contamination - coliforms - beaches 11 361_oyster_salmonella_contamination_coliforms
362 oryzanol - lycopene - basil - mushroom - rbao 11 362_oryzanol_lycopene_basil_mushroom
363 color - pigments - 3d - nm - reflectance 11 363_color_pigments_3d_nm
364 uv - shelf - dmdc - juice - microbial 10 364_uv_shelf_dmdc_juice
365 resonant - nonlinear - quantum - harmonic - weakly 10 365_resonant_nonlinear_quantum_harmonic
366 fluorescence - gqds - fluorescent - fluorescein - go 10 366_fluorescence_gqds_fluorescent_fluorescein
367 instances - minority - tree - class - classifier 10 367_instances_minority_tree_class
368 coding - video - hevc - partitioning - vvc 10 368_coding_video_hevc_partitioning
369 mindfulness - self - athletes - narcissistic - grit 10 369_mindfulness_self_athletes_narcissistic
370 bats - bat - contact - borne - zoonotic 10 370_bats_bat_contact_borne

Training hyperparameters

  • calculate_probabilities: False
  • language: None
  • low_memory: False
  • min_topic_size: 10
  • n_gram_range: (1, 1)
  • nr_topics: None
  • seed_topic_list: None
  • top_n_words: 10
  • verbose: True
  • zeroshot_min_similarity: 0.7
  • zeroshot_topic_list: None

Framework versions

  • Numpy: 1.26.4
  • HDBSCAN: 0.8.33
  • UMAP: 0.5.6
  • Pandas: 2.2.2
  • Scikit-Learn: 1.4.2
  • Sentence-transformers: 2.7.0
  • Transformers: 4.40.2
  • Numba: 0.59.1
  • Plotly: 5.22.0
  • Python: 3.10.12