new data and app
Browse files- (bgc OR bonifacio global city).csv +199 -0
- (katip OR katipunan).csv +217 -0
- (pobla OR poblacion).csv +307 -0
- app.py +358 -176
- app_old.py +364 -0
- cubao.csv +236 -0
- requirements.txt +6 -2
- taft.csv +207 -0
(bgc OR bonifacio global city).csv
ADDED
@@ -0,0 +1,199 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
,tweet
|
2 |
+
0,"@AngelynSo5 Kasalanan ni vince, di pa nga kami nakakapunta jan inaaya na ko mag bgc😭"
|
3 |
+
1,@hiiitsnam parks around makati/bgc
|
4 |
+
2,wanna punta sa bgc at uminom ng caramel beer at kumain 🥹
|
5 |
+
3,@ooohhhKaySee @jisoo_bel @francinecarreld Si Seth nga Hindi pagud kahit galing Cebu deretso agad sa studio nag bgc pa lam na nila pahinga nila ang isat isa.
|
6 |
+
4,OMG ZACK TABUDLO IN BGC
|
7 |
+
5,@davidthewalrus ay i miss read it tot u were here sa bgc HSHSHHSSHHAHAAHHAHAHAJAHA
|
8 |
+
6,@ZeusNetworkTea @itsahnamac @RPSnackMealz Rollie is a bully too ! If they gone keep bringing her fat ass bck then they need to put another big btch in there 1 thts not scary ! A Tanisha(BGC) type of btch. Cause Rollie love tryna fight on the lil girls & DONT BE WINNING ! All outta breath saying “I beat your ass” & DIDNT
|
9 |
+
7,Ahhh siguro pumunta siguro sila dito feel ko din hahahaha kaya pala nag BGC https://t.co/Wm8Rd2RrEY
|
10 |
+
8,are we even allowed to check the launch of GM in bgc? I mean — are fans allowed hehe ㅜㅜ
|
11 |
+
9,@ooohhhKaySee @jisoo_bel @francinecarreld Ang ganda nga ng ngitin ni Chin kanina eh. Minsan naman talaga sa mga kuha video ni Celina nakaganun si Chin hahaha. May masama ba pakiramdam na nag bgc pa 😅
|
12 |
+
10,"KICKOFF SATURDAY: High school football continues on Saturday with great matchups such as Lamar HS vs. Sam Houston!
|
13 |
+
|
14 |
+
🏈 Stream all the Big Game Coverage through the free BGC app! Or watch live on https://t.co/gW89m6Ii27.... https://t.co/TX2SM5gxLP"
|
15 |
+
11,mind museum lang bakit nag bar hopping na sa bgc? ngek
|
16 |
+
12,they did lmaoo they got bgc shut the fxk down n got paid 😹😹😹 https://t.co/Y9BAFXOmM1
|
17 |
+
13,pagod na pagod katawang lupa — pampanga to bulacan to bgc to greenfield 😭😂 deserve ko ng cuddle ng baby ko. ay 🫣🤭
|
18 |
+
14,ang lala ng bgc trip 🤣🤣
|
19 |
+
15,BGC with Lime 🐈⬛ https://t.co/qOYlihERKL
|
20 |
+
16,"@jaaaykaaaay @soundslikekash @buckethat2tini @jimin3d @deeminkook @MIKEEEbond g sa bgc if you'll have me 🥹🫶🏼 (if you haven't tried bb.q, ang sarap ng chicken nila huhu)"
|
21 |
+
17,Same sila dalawa pero quality time daw dapat kaya nag date sa bgc https://t.co/XHOJWQVBGT
|
22 |
+
18,"Ngayon nalang ulit ako magpapaumaga sa bgc, sa st.lukes pa. Saet 😩"
|
23 |
+
19,Si Gelo nag paparty sa BGC. Grabe I FEEL SO OLD
|
24 |
+
20,"baka sakaling magkita pa tayo sa qc, sa up, SA KALSADA NG BGC📢📢📢"
|
25 |
+
21,Totoo ba ang BGC gubat? Makapunta nga
|
26 |
+
22,Huy! parang gusto ko na manood ng Eras Tour ni taylor! kasama na lang kulang HAHA! Anyone nearby sa bgc? #alterbgc #altertaguig #altermakati Mostly swiftie mga alter kaya i hashtag ko na HAHA https://t.co/cPVYqFwx0P
|
27 |
+
23,Na I-inggit ako sa mga nasa BGC ngayon😭😭 GIVE ME YOUR FOREVER I wanna hear it live😭
|
28 |
+
24,"Huy! May idadagdag ulit ako sa playlist soon! Bakit may naalala ko sa AM? Haha!
|
29 |
+
|
30 |
+
JAYDA ROCKS BGC https://t.co/BnpiJFhyjc"
|
31 |
+
25,"Take care of your butt. I really like hotel-style massage.
|
32 |
+
📍BGC https://t.co/jJlMcVE1GA"
|
33 |
+
26,"Trending parin ang bestgirl!
|
34 |
+
|
35 |
+
JAYDA ROCKS BGC https://t.co/XE5B513GiK"
|
36 |
+
27,Hahahha ate @iiamcelina update naman sa bgc time ninyo namiss namin kayo hahahaha
|
37 |
+
28,"AM.. Can’t wait to hear the full song.
|
38 |
+
|
39 |
+
JAYDA ROCKS BGC https://t.co/cgIb13LlWq"
|
40 |
+
29,"Well deserve!!keep on tweeting!
|
41 |
+
|
42 |
+
JAYDA ROCKS BGC https://t.co/PDUJXsIdPK"
|
43 |
+
30,@xg_luffy Hahaha oo nga sleep call lang daw yan hahahaha walang update sa BGC time nila ngayon
|
44 |
+
31,sa bgc lang pala ako makakahanap ng kamukha ni jungsu
|
45 |
+
32,@evilrashida @kaseyonnuh Definitely giving Janelle from bgc 😂
|
46 |
+
33,"AM initials reveal.
|
47 |
+
|
48 |
+
JAYDA ROCKS BGC https://t.co/Ie69qMTV8w"
|
49 |
+
34,"Sino muna yung AM, miss mam!
|
50 |
+
|
51 |
+
JAYDA ROCKS BGC https://t.co/RCKQYD6PpK"
|
52 |
+
35,@xg_luffy Hahahaha oo nga eh miss siguro plus kagabi hindi kasi nag dahan dahan si sethyboy natin hahahahaha nag overthink ako malala haahahaha panay suyo talaga yun sa BGC
|
53 |
+
36,"@FranSethBhlFn @imsethfedelin Nako seth kawawa naman yong rumored boyfriend ni chin na kasama nya sa bgc ngayon balita ko gwapo din yon yancy ba name non
|
54 |
+
|
55 |
+
#FranSeth"
|
56 |
+
37,"Still trending!
|
57 |
+
|
58 |
+
JAYDA ROCKS BGC https://t.co/NBez0E07f9"
|
59 |
+
38,@xg_luffy Oo nga ehh hahaha kaya panay tingin si seth ni chin ngayon panay suyo haha alam ko yung sa bgc date nila panay suyo yun hahahaaha
|
60 |
+
39,"Kinanta ni @jaydaavanzado ung bagong song niya na ang title is ""AM"". 🤔
|
61 |
+
|
62 |
+
JAYDA ROCKS BGC"
|
63 |
+
40,"Tang, BGC will now be among ELF's must-visit and must-dine destinations when in PH. 🍽 https://t.co/EW43A0R8wH"
|
64 |
+
41,Ay Sheeeet got a chance to hear you singing live @zacktabudlo here at BGC. https://t.co/irZw0syw2g
|
65 |
+
42,"10:00 PM | TAG UPDATE
|
66 |
+
|
67 |
+
Join 1st. One for an Epic Night on PPOP NIGHT!
|
68 |
+
|
69 |
+
🗓️ Saturday, October 14, 2023 | 8PM
|
70 |
+
📍 Buena Vida, 4F UNIQLO BLDG, BGC
|
71 |
+
|
72 |
+
PARTY ALL NIGHT WITH 1STONE
|
73 |
+
@1stoneOfficial #1stOne
|
74 |
+
#1stOneOnPPOPNIGHT #PPOPNIGHT https://t.co/3zqGiwVRPD"
|
75 |
+
43,"Is there anything you can’t do??? @jaydaavanzado
|
76 |
+
|
77 |
+
JAYDA ROCKS BGC https://t.co/ui7mksGe5Q"
|
78 |
+
44,"Last night before flight
|
79 |
+
bgc mode https://t.co/8V56ciWAF2"
|
80 |
+
45,"Cebu to its your lucky day to bgc and siguro nasa bahay nila chin to ngayon kasi wala pang update sa vlog ni chin eh kanina pa ako pa balik2x sa YouTube
|
81 |
+
|
82 |
+
NASA FRANSETH ANG SWERTE
|
83 |
+
#FranSeth https://t.co/HbMOIY2dc0"
|
84 |
+
46,@xg_luffy Kaya din siguro hindi umuwi binigyan niya nang time yung binibini niya nag BGC
|
85 |
+
47,wala ba pede mag ampon sakin dyan sa na taga makati or malapit sa bgc like huhuhuhuhuhu sheeett ang layo talaga ng workplace ko sa bahay namin ugh
|
86 |
+
48,today at bgc — #archivetwt https://t.co/dB7bogPOhx
|
87 |
+
49,Nagbabalak sia mag bgc oh
|
88 |
+
50,"We love you, mahal! You did a great job!
|
89 |
+
|
90 |
+
JAYDA ROCKS BGC https://t.co/9vPUmzI0j6"
|
91 |
+
51,"@Thoughtswd1am @fentybullets @babgirlsanthem the same way they were able to incorporate bgc into a routine. they could switch out lyrics and cater it to LJ, play on the “miami” theme city girls already have, like there’s a lot. trust if there’s a will, there’s a way"
|
92 |
+
52,"[ADMIN POST]
|
93 |
+
|
94 |
+
The @allriseph Team visited Tang, BGC today!
|
95 |
+
|
96 |
+
We ordered Hyukjae’s order & it is the ff:
|
97 |
+
- Tteokbokki
|
98 |
+
- Sundae
|
99 |
+
- Kimchi-jeon
|
100 |
+
|
101 |
+
They are so nice & accommodating when we asked if we can display his AllRisePH Round Fan Support!
|
102 |
+
|
103 |
+
@AllRiseSilver
|
104 |
+
#EUNHYUK #은혁 #이혁재 https://t.co/9KvY1QohA4"
|
105 |
+
53,"She made an effort para lumapit sa Team live dahil wala silang ticket and nasa labas sila ng barricade. What a beautiful soul.
|
106 |
+
|
107 |
+
JAYDA ROCKS BGC https://t.co/cFNfoLTC9A"
|
108 |
+
54,"Lumot represent! Thank you Jayda girl! We love you!
|
109 |
+
|
110 |
+
JAYDA ROCKS BGC https://t.co/8rCtUcGGlN"
|
111 |
+
55,spinning tonight at studio 28 bgc 10-1am! 🥳
|
112 |
+
56,@teamjaydauae JAYDA ROCKS BGC
|
113 |
+
57,"Jayda with our team live. Thank you @jaydaavanzado ang galing mo talaga at napaka ganda mo! Shout out mga lumot!
|
114 |
+
|
115 |
+
JAYDA ROCKS BGC https://t.co/YLykl7XJ4k"
|
116 |
+
58,BGC is never lacking in pretty girls and hot guys
|
117 |
+
59,Kakatawa kayo pinipilit paniwalain ang Franseth may pa ig story pa si baboy na nasa BGC sila effort yarn😅 parang kailan lang nag date si siopao at raheel sa BGC gagawin talaga lahat hindi lang masira ang career ni siopao demonyo talaga kayo ayaw pa aminin @francinecarreld
|
118 |
+
60,IS THIS IN BGC???? I AM 15 MINUTES AWAY https://t.co/ePCqocJ9Qk
|
119 |
+
61,"The power in her eyes!
|
120 |
+
|
121 |
+
JAYDA ROCKS BGC https://t.co/yx0c7XRG7R"
|
122 |
+
62,"Look whos trending!
|
123 |
+
|
124 |
+
@Team_JaydaOFC @JAYDAnationOFC @jaydaavanzado
|
125 |
+
|
126 |
+
JAYDA ROCKS BGC https://t.co/0tpk7Xs2if"
|
127 |
+
63,"Hearing a lot of praises and admirations.
|
128 |
+
|
129 |
+
JAYDA ROCKS BGC"
|
130 |
+
64,"BGC with Sweetheart ba?!
|
131 |
+
😜 https://t.co/tSpisynBjB"
|
132 |
+
65,"Our princess, but Aljon's queen!
|
133 |
+
|
134 |
+
JAYDA ROCKS BGC https://t.co/JKKr4tLn5K"
|
135 |
+
66,"Keep tweeting lumot!
|
136 |
+
|
137 |
+
JAYDA ROCKS BGC https://t.co/DQifuJLUYV"
|
138 |
+
67,"Trending na rin @jaydaavanzado
|
139 |
+
|
140 |
+
JAYDA ROCKS BGC https://t.co/5iTBDpik7W"
|
141 |
+
68,"Trending!!!
|
142 |
+
|
143 |
+
JAYDA ROCKS BGC https://t.co/komi4pEUti"
|
144 |
+
69,"Vocalist cum Instrumentalist? Baka Jayda yan. Ang galing galing mo!
|
145 |
+
|
146 |
+
CREDITS: Mama Eldz new IGS
|
147 |
+
|
148 |
+
JAYDA ROCKS BGC https://t.co/lcRypBTtks"
|
149 |
+
70,@panisluis joyride ka nlng muna sis hahahahaha or di kaya mag mrt ka papuntang ayala then may bus don papuntang bgc sabihin mo sa stopover ka
|
150 |
+
71,"POW will attend Gentle Monster Manila opening event on Oct 16th. The store is located at 5th Avenue, Bonifacio Global City, Taguig.
|
151 |
+
|
152 |
+
our boys are booked and busy😭🥹
|
153 |
+
|
154 |
+
#POW #파우 https://t.co/ZjjCAyQpG8 https://t.co/XdEngu7CbR"
|
155 |
+
72,"So proud of you, luv!!! @jaydaavanzado
|
156 |
+
|
157 |
+
JAYDA ROCKS BGC"
|
158 |
+
73,Nako! Pano na si Seth nito? Spotted pala ang rumored boyfriend ni francine sa BGC 😫 https://t.co/X2Ye1wx28F
|
159 |
+
74,"Am baxk! Hahaha!
|
160 |
+
|
161 |
+
JAYDA ROCKS BGC"
|
162 |
+
75,Ayan napagbigyn kayo niyo ni literal ang BGC 😂
|
163 |
+
76,@NayeonNabong di ko mapost yung sa akin kasi wala ako mahanap sa google ng poster 😂😂😂 BGC open field days
|
164 |
+
77,"@ate_jomzqatar Wait lang mi, nakipagdate pa kasi sa BFf nya nasa BGC pa 🤭"
|
165 |
+
78,"Umuwi from BGC si Jingliu!
|
166 |
+
|
167 |
+
Aabot na ba sa LC to next? We will see...
|
168 |
+
|
169 |
+
https://t.co/IiCFMfjTbr"
|
170 |
+
79,"@AlJayOfficial_ @jaydaavanzado We love you jayda!
|
171 |
+
|
172 |
+
JAYDA ROCKS BGC"
|
173 |
+
80,"You did great best girl as always @jaydaavanzado
|
174 |
+
|
175 |
+
JAYDA ROCKS BGC https://t.co/CG8jPTKjoa"
|
176 |
+
81,"It's the A5 Wagyu bowl for us.
|
177 |
+
https://t.co/9wiRhUxEXW"
|
178 |
+
82,"😭🥺🥰🐾
|
179 |
+
|
180 |
+
so happy with how the photos turned out! ang saya kaninang umaga huhu pagoda lang talaga mag-ikot sa BGC after hahahuhu tenkyu talaga @bumaBAgyo ♥️ https://t.co/wfh9F5gM6s"
|
181 |
+
83,@frankmajorfkj San ka? Nasa bgc ako.
|
182 |
+
84,"@aimaneth GM Aiman! Philippines, BGC 😎"
|
183 |
+
85,soft girl = mga poging tambay sa bgc playlist
|
184 |
+
86,@Kitty_pinkyyyyy @francinecarreld bka poh sa BGC ipalabas 😅
|
185 |
+
87,Bgc boi ang galawan ah https://t.co/oiz2ucRa0K
|
186 |
+
88,Ano kayang magandang iterinary kapag pumunta ko bgc bukas 🤔
|
187 |
+
89,"I remember a similar example from first year biology about fishes here in BC, but it was 5000 yrs.
|
188 |
+
|
189 |
+
This has always been in the back of my head when I consider NP chemodiversification.
|
190 |
+
|
191 |
+
In mNP BGC; I have often wondered if we are sometimes catching them in the middle of R&D https://t.co/yXzWzhpf23"
|
192 |
+
90,i have a job interview at bgc next week aAaAAaa wish me luck huhu sana po hindi ko kwestyunin mga desisyon ko sa buhay fkshdjshs *nahimatai*
|
193 |
+
91,Aattend ata sila sa event ni Jayda sa BGC. 🫢💙😊 https://t.co/gY3pW96LIk
|
194 |
+
92,"All this, plus delicious food from Fat Seed + Roastery (BGC), and we surely couldn't have asked for a better morning! 🥳😍
|
195 |
+
|
196 |
+
Special thanks to our stewards Leslie Uy, Danielle Parreño, and Isabel Chin for making this possible! 🥰"
|
197 |
+
93,@Dewgong_FTW wala naman. i just really don't like BGC
|
198 |
+
94,"New Arrivals Starts @ 49 ‼️ #ayalamallmarketmarket #ayalamarikina #taguig #shangrilamall #bataan #bgc #vistamallstarosa #robinsonsgentrias #robinsonslaspinas
|
199 |
+
#robinsonsplacenaga #laguna #sancarlostowncenter #vistamallbataan #ayalamarqueemall #angelespampanga #cavite #ayalamalls https://t.co/YzsM2n8BdC"
|
(katip OR katipunan).csv
ADDED
@@ -0,0 +1,217 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
,tweet
|
2 |
+
0,"a friend is looking for a fubu near katip!
|
3 |
+
|
4 |
+
preferrably vers/vers top, taller than 5’4” , and hygienic!"
|
5 |
+
1,"YOU'RE INVITED! 👻
|
6 |
+
|
7 |
+
Nightmares : Sangyeon & Q Halloween Birthday Beersleeve Event Teaser 🎃🍻
|
8 |
+
|
9 |
+
🗓️ November 18, 2023
|
10 |
+
📍Hangout Restobar Katipunan
|
11 |
+
|
12 |
+
Registration starts on October 21, 6 PM!
|
13 |
+
|
14 |
+
#SurviveTheNightBSE #THEBOYZ #SANGYEON #Q #SANGKYU https://t.co/l7Ld3uN5Bn"
|
15 |
+
2,gagu after 4 months??? see u again later katipunan. i hope this time ill have only good memories in you :((
|
16 |
+
3,@CanyonEveSimp @HilltopGamers at one katipunan! a walk away from lrt 2 katipunan
|
17 |
+
4,Driving around katipunan while listening to Rewind by Wonder Girls. All in God's Plan.
|
18 |
+
5,"Oo na, naa na pud lagi ko sa katip karong gabii 😭"
|
19 |
+
6,"@MeekoPod Dam... katipunan be a bit far from where i stay...
|
20 |
+
Tho not hard to get to by car 👀"
|
21 |
+
7,IN PHOTOS: The League of Filipino Students - Katipunan marched alongside the Ateneo community in celebration of One Big Pride–bringing along calls for junking VFA-EDCA and bringing justice to the brutal murder of Jennifer Laude at the hands of US marine Joseph Scott-Pemberton. https://t.co/RyIyRHQWQg
|
22 |
+
8,@_narakku near katipunan LRT!! ^^
|
23 |
+
9,katip invite pleasee huhuuu
|
24 |
+
10,"katip fgc has a new home 👀🥺🥹
|
25 |
+
|
26 |
+
Pls follow @HilltopGamers for future announcements 👀 https://t.co/aBywPwjVAz"
|
27 |
+
11,"The red side of Katipunan flexing its basketball dominance in #UAAPSeason86
|
28 |
+
|
29 |
+
@upmbt 5-0
|
30 |
+
@upwbt_ 5-0"
|
31 |
+
12,nothing like good food in katip in between study sessions :) #studytwt #studytwtph https://t.co/kbY0PS8k0R
|
32 |
+
13,"wts lfb blackpink funko pop keychain ph
|
33 |
+
|
34 |
+
1 rose, 1 lisa, 1 jisoo ✅
|
35 |
+
₱520 each (w/ case protector)
|
36 |
+
₱1500 if taking as set
|
37 |
+
|
38 |
+
✰ onhand
|
39 |
+
✰ payo gcash
|
40 |
+
|
41 |
+
mod: prio sdd
|
42 |
+
📍katip, qc https://t.co/KDDuD1Q5uQ"
|
43 |
+
14,📍 Ateneo-Katipunan! Anyone need grab car ride or out of town drive. DM me. Tnx #booking #privatedriver #grabdriver https://t.co/mpx8f22DAJ
|
44 |
+
15,"ㅤ
|
45 |
+
|
46 |
+
october 14 2023
|
47 |
+
https://t.co/CZPgMZFE0D — barista update⠀
|
48 |
+
katipunan avenue, quezon city
|
49 |
+
|
50 |
+
ㅤ https://t.co/M4JIGZOb6n"
|
51 |
+
16,"My weekend the past weeks/months:
|
52 |
+
Nagpunta sa bgc > megamall > araneta cubao > moa to paranaque > mckinley > katipunan.
|
53 |
+
|
54 |
+
🔜 office to megamall for team bowling > angeles > sangkyu cse > bgc with couz > chanyeol cse(?) > rizal for tb 😂😂😂 isa lang massabe ko: d na nakapagdiet. https://t.co/Nj39UfViwM"
|
55 |
+
17,"Every friday eh yung mga friends ko niyayaya din ako minsan gumala pero I always decline kasi gastos ulit yun for me.
|
56 |
+
|
57 |
+
Either punta sila sa BGC, Katip, Maginhawa, SMNE, etc.
|
58 |
+
|
59 |
+
Diretso uwi nalang ako."
|
60 |
+
18,"Update sa lebel sa Tubig!
|
61 |
+
As of 01:45PM | October 14, 2023
|
62 |
+
Source: LFEWS
|
63 |
+
|
64 |
+
Maayong Hapon!
|
65 |
+
|
66 |
+
Anaa gihapon sa NORMAL nga lebel ang tubig sa LIBUGANON DAM - BRGY. KATIPUNAN, KAPALONG base sa naka instalar nga Flood Camera.
|
67 |
+
|
68 |
+
""Early Warning, Early Action!""
|
69 |
+
|
70 |
+
Duty Personnel: loc / msm https://t.co/Te8OEmvi7P"
|
71 |
+
19,@DefenderDlsu Dapat magpabonfire sila pag naka isang set sila sa DSLU at NU..mukhang hangang sa battle of katipunan lang sila aabot.
|
72 |
+
20,"Ang Tibaksyunaryo ay unang ginawa ng ASAP-Katipunan ng UP Manila. 📷
|
73 |
+
|
74 |
+
#STANDUPNCPAG
|
75 |
+
#TunayPalabanMakabayan"
|
76 |
+
21,my katip friends are chatting me to hangout with them later sa walrus😭
|
77 |
+
22,natatalo ateneo tuwing inaagawan ako ng tricycle ng isang atenista sa katipunan and methinks daserb https://t.co/THrltBwmBV
|
78 |
+
23,Ang tanga lang nung #wagniyokaminakawansacampus sana manakawan kayo sa Katip/Ever/Pob/BGC/wherever the fuck you pretentious assholes go 🙄
|
79 |
+
24,sure na me sa katipunan laterrrrr see yaaa
|
80 |
+
25,ang funny talaga ng up golf club HAHHAHAHA eto topic namin kanina habang nagpapa sober da mcdo katip 🤣
|
81 |
+
26,@cheylwse sa taft at katipunan po
|
82 |
+
27,MMDA ALERT: Vehicular accident at C5 Nawasa Katipunan NB involving truck and motorcycle as of 3:11 PM. 1 lane occupied. MMDA enforcers on site. #mmda
|
83 |
+
28,@jvveon PLSSS WISH U WERE HEREEE KATIP PA NAMAN
|
84 |
+
29,"Sobrang lost ng ADMU today, sana makapag-adjust sila sa battle of Katipunan round 1 anek"
|
85 |
+
30,Katip casuals :) https://t.co/ggjEGgq9dP
|
86 |
+
31,"visit @HilltopGamers guys, near LRT katip station !! https://t.co/ZRPL8gcLCv"
|
87 |
+
32,"hello :DD may comshop po ba sa katipunan and magkano po per hour?
|
88 |
+
thankyew"
|
89 |
+
33,"I can do meet-ups if it is around Quezon City (Hanggang Katipunan area lang kaya ng powers ko) or San Juan Area. If you need more pictures or you want to negotiate the prices feel free to message me 😄
|
90 |
+
Shipping will be shouldered by the buyer.
|
91 |
+
Current Location: New Manila, Q.C"
|
92 |
+
34,If you're seeing this please please please start organizing protests for Palestine around España Taft and Diliman/Katipunan. We need to make it clear to the youth that we have to stand with Palestine now and always for whenever in our lives it is finally liberated
|
93 |
+
35,"wts lfb
|
94 |
+
|
95 |
+
on festival off concert
|
96 |
+
|
97 |
+
1 ub 403A ticket
|
98 |
+
- php 4790 (original price)
|
99 |
+
- can do meet up around sm north/katipunan
|
100 |
+
- payo (prio) or installment until nov 3/d-day
|
101 |
+
|
102 |
+
get na pls :("
|
103 |
+
36,"wts lfb
|
104 |
+
|
105 |
+
on festival off concert
|
106 |
+
|
107 |
+
1 ub 403A ticket
|
108 |
+
- php 4790 (original price)
|
109 |
+
- can do meet up around sm north/katipunan
|
110 |
+
- payo (prio) or installment until nov 3/d-day
|
111 |
+
|
112 |
+
huhu pls get it na po >< masaya kami katabi sa con ng friend ko https://t.co/UG0srrZYm4"
|
113 |
+
37,"I remember this was released at 6PM KST and playing the music video maybe 10 times before I left to go to some inuman with my blockmates in Blue Finn.
|
114 |
+
|
115 |
+
Shit yan ang proof ng Katip tanders. Blue Finn! https://t.co/Ga26uHgP0B"
|
116 |
+
38,"Going back to Ateneo today. It's been like, 5 to 6 years since I last went there. But this time, with someone special who also happens to become a fellow Atenean.
|
117 |
+
|
118 |
+
Hindi ko akalaing babalik ako ng Katipunan na may baong jowa hahahaha."
|
119 |
+
39,"Admittedly, mas less car-centric pa ang town proper ng Baguio kaysa Diliman-Katipunan."
|
120 |
+
40,apaka layo ng katipunan kalokaa
|
121 |
+
41,"Update sa lebel sa Tubig!
|
122 |
+
As of 06:04AM | October 14, 2023
|
123 |
+
Source: LFEWS
|
124 |
+
Maayong Buntag!
|
125 |
+
Anaa sa NORMAL nga lebel ang tubig sa LIBUGANON DAM - BRGY. KATIPUNAN, KAPALONG base sa naka instalar nga Flood Camera. https://t.co/wk9SZ7Grn7"
|
126 |
+
42,MMDA ALERT: Stalled Wing Van due to double flat tire at C5 Labor Hospital Katipunan NB as of 5:39 AM. 1 lane occupied. MMDA enforcers on site. #mmda
|
127 |
+
43,haha pis awt katipunan putangina mo z word mabulok ka dyan!
|
128 |
+
44,whos at katip tommm taraa lets sleep and cuddleee
|
129 |
+
45,#skl pero ang cute ng tita q kanina kasi nagjeep kami s may katipunan tas sabi niya sakin 5-ish years na siyang di nagkocommute so selfie kami nang selfie sa loobxhhdhshs
|
130 |
+
46,msg me if ull be around katip tom hmpp me wanna sleep and cuddle
|
131 |
+
47,sino nasa katipunan tom? lets sleep and cuddle pleasee
|
132 |
+
48,"Update sa lebel sa Tubig!
|
133 |
+
As of 11:02PM | October 13, 2023
|
134 |
+
Source: LFEWS
|
135 |
+
Maayong Gabii!
|
136 |
+
Nagpabilin sa NORMAL nga lebel ang tubig sa LIBUGANON DAM - BRGY. KATIPUNAN, KAPALONG base sa naka instalar nga Flood Camera. https://t.co/bcooxwxLkj"
|
137 |
+
49,"So #TSErasTourFilm is extremely extraordinary....i enjoyed it much,but after all...hihi..huhu...hirap ng sakayan. Ng lalakad ako nito,,basta makauwi lng....rn dumaan ako ng dunkin donut dito sa katipunan ave. https://t.co/Qv0JM6j48a"
|
138 |
+
50,i’m so happy when i pass by katip & see this. i’m glad they didn’t remove it 🥺 https://t.co/FftcR2QoXL
|
139 |
+
51,I hate commuting to Ateneo/katipunan with a burning passion
|
140 |
+
52,"Officials of the Caloocan city government and the Napolcom, and representatives of other local government units in Metro Manila tackled law enforcement issues at the PLEB Consultative Meeting at the Bulwagang Katipunan on Thursday, Oct. 12.
|
141 |
+
|
142 |
+
https://t.co/EaZvYVWSMJ"
|
143 |
+
53,"@euhhlala not really tho daghag comments like why are you in up, you’re in the wrong side of katip etc like ???
|
144 |
+
|
145 |
+
and kabalo man ta kinsa sala ngano ing ana ang demo sa up so why are they getting the hate??"
|
146 |
+
54,@hoshipsbw Hihi. Sabi ko na may bagong visual inspo for AW. Kuliglig na naman sa mga taga Katip 😘
|
147 |
+
55,i hate the katip traffic but i looove seeing this https://t.co/bFJeg1VnjY
|
148 |
+
56,uwing uwi na ko traffic pa rin sa cubao pa katipunan??????
|
149 |
+
57,"@w1thdiamonds another thing: lan kwai/rooftop(before it closed) every other friday, jackie running maroon fm and/or tv like the navy, lottie the som crushie, sometimes taivan+mari+the ust peeps would leave taft+españa & travel to see the rest of them in katip(and complain about the traffic)"
|
150 |
+
58,@w1thdiamonds bro i cry internally everyday when i pass the hellscape that is katipunan traffic
|
151 |
+
59,@Its_Me_Angela_ That katipunan ave traffic though. 🥲.
|
152 |
+
60,@katip_31 @Polizei_Ffm Straftat?! 😅
|
153 |
+
61,yooo qc katip g hmu bored #jakolph #zoomjakol #alterqc #alterph
|
154 |
+
62,"me submitting my ateneo application in katipunan, on friday the 13th 😁 #OBF"
|
155 |
+
63,"di na yan ireereview kasi based sa words ni Naveen, ang rules ng liga ay dapat within 48 hours after the game lang ang decision soooooo UP gets away with it again for the nth time HAHAHAHAHA
|
156 |
+
|
157 |
+
and ppl has the nerve to say the other side of Katipunan is the one playing dirty LOOOOL https://t.co/p6jLbv6k49"
|
158 |
+
64,yung poging nakatambay sa katipunan https://t.co/Wx0ssDW0J5
|
159 |
+
65,katipunan i will NOT! puke in your streets tonight
|
160 |
+
66,@AntonioShe46517 @ham_7_game @AngelGABv13 @S_sants @NateBar12224359 @DonnyniB @SatriaRafi05 @Yadiiiiraaa_ @Christo46511122 @UofLCuban @Sharadrusalla @ricardozamorac1 @AlexBenitez2306 @NNe_SR @trhouser @inakialdasoro @knk150506 @hatem5267 @BigHossKnows @Katip_hizli @foreveryounct Check my prоfile to get launсh 🛡️✅
|
161 |
+
67,"@allforyyj @wouziriri may jeep terminal po na walking distance from katip station, sakay kayo don then pagka u-turn nung jeep lampas uptc baba na kayo agad dun sa unang gate, andon na rin po yung gt toyota ^^"
|
162 |
+
68,"buong trip from un to d.jose + from recto to katip, nakatayo ako pls for one week na po itong workout na ito"
|
163 |
+
69,how to commute from katip station to gt toyota auditorium in up
|
164 |
+
70,"⚠ Practice defensive driving.
|
165 |
+
|
166 |
+
METROBASE Incident report at Major roads
|
167 |
+
from 2:00pm to 7:00pm
|
168 |
+
|
169 |
+
WHEN: October 13, 2023
|
170 |
+
|
171 |
+
Stalled Vehicle/s :
|
172 |
+
1. C5 Lanuza SB (Dumptruck; Flat tire; 2:12pm time reported; Assisted by MMDA Enforcers;
|
173 |
+
|
174 |
+
3. C5 Katipunan ave. Miriam SB (Closedvan; Flat… https://t.co/oiWZHbezxQ https://t.co/bcUOVKgRyn"
|
175 |
+
71,MABIBILAUKAN AKO SA MCDO KATIPUNAN KABULASTUGAN TALAGA https://t.co/qujGwy84OU
|
176 |
+
72,"ㅤ
|
177 |
+
|
178 |
+
https://t.co/UlLYKINytL sns update! ❦
|
179 |
+
1O月 13日 、 FRIDAY
|
180 |
+
Katipunan Avenue, QC.
|
181 |
+
|
182 |
+
ㅤ https://t.co/cdM08HYkPV"
|
183 |
+
73,"Update sa lebel sa Tubig!
|
184 |
+
As of 06:04PM | October 13, 2023
|
185 |
+
Source: LFEWS
|
186 |
+
Maayong Gabii!
|
187 |
+
Anaa sa NORMAL nga lebel ang tubig sa LIBUGANON DAM - BRGY. KATIPUNAN, KAPALONG base sa naka instalar nga Flood Camera. https://t.co/y2nUMrGzmi"
|
188 |
+
74,gago yung red light sa katipunan parang 3-4 minutes maybe 5 anong kapunyetahan to
|
189 |
+
75,"FARMerch yarn? 🌿🌾
|
190 |
+
|
191 |
+
Ngayong buwan ng magsasaka, sama-sama nating ipaglaban ang tunay na reporma sa lupa, lokal na produksyon at hustisya!
|
192 |
+
|
193 |
+
Dahil diyan, inilulunsad ng Katipunan ng mga Samahang Magbubukid sa Timog Katagalugan (KASAMA-TK) ang FARMerch, https://t.co/iTZypwhI8b"
|
194 |
+
76,Katip later
|
195 |
+
77,"I fully love that the spot in Katipunan that is known as ""the cursed land where none may plant their seed and have them prosper"" is now filled by D/IMSUM T/REATS"
|
196 |
+
78,@theblessedbhie oh pak napromote pa ang blessed bhiestro 🫶🏻 labyu madam kelan na ang ating katip date!!!!!
|
197 |
+
79,grindr katipunan isn’t doing its thing for me so i guess it’s time for twitter to do its thing heheheh https://t.co/sIs1I31ng7
|
198 |
+
80,“mabuhay ang pilipinas! mabuhay ang himagsikan! mabuhay ang katipunan!” https://t.co/Da6ZIBYGKX
|
199 |
+
81,MMDA ALERT: Vehicular accident at C5 Katipunan Ave. B. Serrano Intersection WB involving a van and motorcycle as of 1:25 AM. 1 lane occupied. MMDA enforcers on site. #Mmda
|
200 |
+
82,"Update sa lebel sa Tubig!
|
201 |
+
As of 01:02PM | October 13, 2023
|
202 |
+
Source: LFEWS
|
203 |
+
|
204 |
+
Maayong Hapon!
|
205 |
+
|
206 |
+
Anaa gihapon sa NORMAL nga lebel ang tubig sa LIBUGANON DAM - BRGY. KATIPUNAN, KAPALONG base sa naka instalar nga Flood Camera. https://t.co/JaPPdSqvLV"
|
207 |
+
83,"In this digital age, journalism combats the rampant and persisting threat of misinformation and disinformation. Let us listen as journalist and keynote speaker Atom Araullo shares his take on this pressing matter.
|
208 |
+
|
209 |
+
See you later here on Level Up at 3pm live on Radyo Katipunan. https://t.co/OpdQtzlFAA"
|
210 |
+
84,MMDA ALERT: Stalled truck due to mechanical problem at C5 Katipunan Uptown SB as of 11:40 AM. 1 lane occupied. MMDA enforcers on site. #mmda
|
211 |
+
85,Bakit walang tiwala sa ating katipunan pero bakit wala pera ang ating panliipunan ano nangyayari…. sa ating pang-inuman
|
212 |
+
86,what did andres bonifacio name the philippines revolutionary 😭 /gen — kataastaasan kagalanggalang na katipunan ng mga anak ng bayan (KKK) https://t.co/A2VNDbILnJ
|
213 |
+
87,"damn 8 ako umalis sa bahay, and mag teten na katip palang ako ahahaha, nywy ahahaha grabe talaga traffic dto"
|
214 |
+
88,mahangin talaga dito sa katip no?
|
215 |
+
89,MMDA ALERT: Vehicular accident at Katipunan Miriam NB involving 2 motorcycles as of 9:20 AM. 1 lane occupied. MMDA enforcer on site. #mmda
|
216 |
+
90,no break date near katip 💔💔💔
|
217 |
+
91,paano ba bumyahe papunta katipunan hHASHHS
|
(pobla OR poblacion).csv
ADDED
@@ -0,0 +1,307 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
,tweet
|
2 |
+
0,"Pobla, Makati hehe https://t.co/QAkWKcg82V"
|
3 |
+
1,otw pobla nagutom nako kakahintay kay bakla 🙄
|
4 |
+
2,Who’s around Pobla????
|
5 |
+
3,@EldornM Meron yan wag ka magalala Hahahahahahha kita kits sa pobla Hahahahahha
|
6 |
+
4,"lf fem, pobla area now, dm your rates
|
7 |
+
|
8 |
+
and for the love of god, no dp pls. I can book your ride prepaid
|
9 |
+
#alterpinayph #alterbooking #AlterMakat"
|
10 |
+
5,gusto ko sana sumama sa reddit peeps kaso pobla pa hahaha hmm
|
11 |
+
6,dapat magpunta aq pobla now pero ang sarap ng higa q ngayon HAHAHHSHDLDHDJD
|
12 |
+
7,Pobla Sabado now here at @UrusMakati asan na kau guys? Unli drinks until supply last till 4am! https://t.co/ua69y0wbQN
|
13 |
+
8,"cause we still got a lot of energy from the eras film
|
14 |
+
|
15 |
+
📍 POBLA"
|
16 |
+
9,"lf fem, pobla area now, dm your rates
|
17 |
+
|
18 |
+
#alterpinayph #alterbooking #AlterMakati"
|
19 |
+
10,"📢 Exciting news! An airplane carrying medical supplies from the WHO has just landed in Egypt for the people of Gaza. 🌍 Learn more about this crucial humanitarian effort: https://t.co/5X1bx3Kq8z. Together, we can make a difference! 💙 #HumanitarianAid #WHO #Gaza #LobosNews"
|
20 |
+
11,Edi nasa pobla sana ako mamaya kung maeenjoy ko weekend ko diba??? https://t.co/ESpYpMP7qA
|
21 |
+
12,"@KylieVogue___ @carlabrews @Laylaitera Mag apply keah aqo na kasambahay kay Donya Carla? $5/hour. Afford nemen nya at xia pinakamayabang este, ""pinakamayaman"" sa kanilang barangay Isla Poblacion, Pangil, Laguna. 😂😂😂"
|
22 |
+
13,"War crimes. Vonderleyen should be promptly sacked before an appearance at the Hague on war crimes charges:
|
23 |
+
|
24 |
+
https://t.co/dFCVD4NU8c"
|
25 |
+
14,Who’s gonna be at pobla tonight? Tara?
|
26 |
+
15,Tara sa Poblacion mamaya? 🥂 Free entrance ang mga nasa guestlist. Dm me for the guestlist 🪩
|
27 |
+
16,"Tara sa Disturbia Poblacion, Makati mamaya? 🥂 Free entrance ang mga nasa guestlist. Dm me for the guestlist 🪩"
|
28 |
+
17,"See you later 👋
|
29 |
+
|
30 |
+
Brief party 🎉
|
31 |
+
|
32 |
+
At makati poblacion
|
33 |
+
|
34 |
+
Contact: @AshtonDaks69 https://t.co/OO3LWHfdl7"
|
35 |
+
18,Nag sibat gani ko sa Poblacion para layo sa shot pag abot diri shot gyapon
|
36 |
+
19,"Okay final na, will be at bgc tomorrow hahaha. Malay niyo mag pobla ako, libre niyo ko drinks. Gift niyo na sa bday ghorl 😚🤗😇 https://t.co/h6bd7psFHF"
|
37 |
+
20,tangina takang taka na siguro magulang namin kung bakit buong araw tulog kme ni kuya asa pobla din kasi pala si gagu kagabi HSHDHXBDBDBSBSBXHHXHS
|
38 |
+
21,Tas asikaso ulit at punta namang bgc and pobla 😩
|
39 |
+
22,"EU's top diplomat Joseph Borrell breaks from EU Commission president von der Leyen and says Israel's retaliation doesn't comply with international law.
|
40 |
+
https://t.co/x5zgQBC3Zi"
|
41 |
+
23,Pobla Days with Avida boys https://t.co/YBrOkYonUg
|
42 |
+
24,"At around 1 O'clock in the afternoon on October 14, 2023, personnel of Peñarrubia MPS led by PSMS Milagros B. Ibrahim provided Police Presence and motorist assistance in Barangay Poblacion, Peñarrubia to pre-empt occurrence of crime and any untoward incidents around AOR. https://t.co/j4hJR3Xmdp"
|
43 |
+
25,"At around 8:00 AM, personnel of Peñarrubia MPS led by PMSg Jocelyn B. Caba conducted Police Presence/Area Security to avoid any untoward incidents that may happen and to maintain peace and order in Barangay Poblacion, Peñarrubia, Abra. https://t.co/DiS7EYtjLv"
|
44 |
+
26,"EU foreign policy chief @JosepBorrellF breaks ranks with @VonderLeyen and accuses Israel of committing war crimes.
|
45 |
+
|
46 |
+
Huge division now at the top of the EU.
|
47 |
+
|
48 |
+
https://t.co/rML95qRz5T"
|
49 |
+
27,"🇮🇱 #Israel announces to the population of 🇵🇸 Gaza two temporary southbound exit routes ⏳
|
50 |
+
|
51 |
+
""For your own safety and that of your loved ones, leave,"" Israeli authorities announce.
|
52 |
+
|
53 |
+
↘️
|
54 |
+
https://t.co/dUB1X6p78z"
|
55 |
+
28,"FOR SALE ""LANIE BALOGO""
|
56 |
+
|
57 |
+
PM ME FOR DETAILS OR IF YOU WANT TO COLLAB WITH HER CONTACT THIS PERSON.
|
58 |
+
|
59 |
+
Pacifico Jr Delfin Yalao
|
60 |
+
09452019207, 09122390128
|
61 |
+
Poblacion San Pablo Zamboanga del Sur
|
62 |
+
[email protected] https://t.co/wIoe1C2AUI"
|
63 |
+
29,saang cafe sa poblacion kaya aq pwede magreview KSKSKSKS
|
64 |
+
30,"@stvrlie @zZzsie hahahahaha the heck, eli! you're in pobla na naman? want ko rin uminom. dm me the deets 😉"
|
65 |
+
31,"hahhaa the counselor at lys is so funny every time 😭
|
66 |
+
him: kumusta ka?
|
67 |
+
me: inaantok
|
68 |
+
him: pumarty? saan ka pumarty
|
69 |
+
me: titahan sa poblacion po
|
70 |
+
him: ako sa timog"
|
71 |
+
32,"pobla hopping
|
72 |
+
|
73 |
+
sanctuary > polilya > apartment > tambai https://t.co/Gl3PH8Zg0q"
|
74 |
+
33,Who's going to Poblacion tonight? Anyone???
|
75 |
+
34,To be honest. Had fun last Wednesday episode in poblacion onto next? (Crossfingers) 🤞🏼🤞🏼
|
76 |
+
35,"@MacaesBruno I think Borrell has shown some sense this week on the topic.
|
77 |
+
|
78 |
+
https://t.co/egU6mOskou"
|
79 |
+
36,"Says parts of Israeli response counter international law
|
80 |
+
|
81 |
+
""There are elements of this response that certainly do not comply with international law. I think this is the third time I've said it. But I'm not saying it; The UN SG says it. It's no discovery.""
|
82 |
+
|
83 |
+
https://t.co/egU6mOskou"
|
84 |
+
37,"🚨 BREEDERS!
|
85 |
+
|
86 |
+
Undress your finest body and release your sexiness tonight at the “BRIEF PARTY”💙🩲
|
87 |
+
|
88 |
+
Catch me for an exclusive ORGYNUMAN
|
89 |
+
at Poblacion Makati!🥂
|
90 |
+
|
91 |
+
See you there, Breeders! 😈💦
|
92 |
+
|
93 |
+
📩 For more details about the event,
|
94 |
+
Kindly dm @AshtonDaks69 https://t.co/uednKM0kAx"
|
95 |
+
38,"@Weh227089049811 San sa poblacion,,malapit lang ba sa makati ave"
|
96 |
+
39,grabe bday celeb ni kang umabot kami gang pobla 😭
|
97 |
+
40,Tumatalab parin pala ang “Handle me with care.” sa mga taga Pobla 😂
|
98 |
+
41,"TONIGHT @ MAKATI
|
99 |
+
💪🔥 BRIEF-PARTY 🔥💪
|
100 |
+
|
101 |
+
💯 ORGYNUMAN 💯
|
102 |
+
|
103 |
+
OCTOBER 14 POBLACION MAKATI SATURDAY DOOR OPEN 6PM TO 6AM
|
104 |
+
|
105 |
+
PRE BOOKING (599,₱)
|
106 |
+
WALK-IN FEE ( 799,₱)
|
107 |
+
|
108 |
+
NO DRUGS NO VIDEO NO PICTURE TAKING DURING EVENT
|
109 |
+
FREE CONDOMS AND LUBE
|
110 |
+
|
111 |
+
INCLUSION:
|
112 |
+
💯 UNLIMITED DRINK
|
113 |
+
💯 COCKTAIL… https://t.co/BgHj3MXH0k https://t.co/1kwFJvB6Mk"
|
114 |
+
42,"will watch ts eras tour movie and hop to prolly pobla/maginghawa tonight.
|
115 |
+
|
116 |
+
oh how I love october month ❤️🩹"
|
117 |
+
43,3hrs tulog kaya ba mag pobla mamaya 🥲
|
118 |
+
44,"TONIGHT @ MAKATI
|
119 |
+
💪🔥 BRIEF-PARTY 🔥💪
|
120 |
+
|
121 |
+
💯 ORGYNUMAN 💯
|
122 |
+
|
123 |
+
OCTOBER 14 POBLACION MAKATI SATURDAY DOOR OPEN 6PM TO 6AM
|
124 |
+
|
125 |
+
PRE BOOKING (599,₱)
|
126 |
+
WALK-IN FEE ( 799,₱)
|
127 |
+
|
128 |
+
NO DRUGS NO VIDEO NO PICTURE TAKING DURING EVENT
|
129 |
+
FREE CONDOMS AND LUBE
|
130 |
+
|
131 |
+
INCLUSION:
|
132 |
+
💯 UNLIMITED DRINK
|
133 |
+
💯 COCKTAIL… https://t.co/ZkK02L8e3O https://t.co/m3rgkWwyVF"
|
134 |
+
45,"TONIGHT @ MAKATI
|
135 |
+
💪🔥 BRIEF-PARTY 🔥💪
|
136 |
+
|
137 |
+
💯 ORGYNUMAN 💯
|
138 |
+
|
139 |
+
OCTOBER 14 POBLACION MAKATI SATURDAY DOOR OPEN 6PM TO 6AM
|
140 |
+
|
141 |
+
PRE BOOKING (599,₱)
|
142 |
+
WALK-IN FEE ( 799,₱)
|
143 |
+
|
144 |
+
NO DRUGS NO VIDEO NO PICTURE TAKING DURING EVENT
|
145 |
+
FREE CONDOMS AND LUBE
|
146 |
+
|
147 |
+
INCLUSION:
|
148 |
+
💯 UNLIMITED DRINK
|
149 |
+
💯 COCKTAIL… https://t.co/ajOJyO5Kvz https://t.co/ufVN0CZZFx"
|
150 |
+
46,"TONIGHT @ MAKATI
|
151 |
+
💪🔥 BRIEF-PARTY 🔥💪
|
152 |
+
|
153 |
+
💯 ORGYNUMAN 💯
|
154 |
+
|
155 |
+
OCTOBER 14 POBLACION MAKATI SATURDAY DOOR OPEN 6PM TO 6AM
|
156 |
+
|
157 |
+
PRE BOOKING (599,₱)
|
158 |
+
WALK-IN FEE ( 799,₱)
|
159 |
+
|
160 |
+
NO DRUGS NO VIDEO NO PICTURE TAKING DURING EVENT
|
161 |
+
FREE CONDOMS AND LUBE
|
162 |
+
|
163 |
+
INCLUSION:
|
164 |
+
💯 UNLIMITED DRINK
|
165 |
+
💯 COCKTAIL… https://t.co/vlgKTVgILq https://t.co/p30UC42BuQ"
|
166 |
+
47,"TONIGHT @ MAKATI
|
167 |
+
💪🔥 BRIEF-PARTY 🔥💪
|
168 |
+
|
169 |
+
💯 ORGYNUMAN 💯
|
170 |
+
|
171 |
+
OCTOBER 14 POBLACION MAKATI SATURDAY DOOR OPEN 6PM TO 6AM
|
172 |
+
|
173 |
+
PRE BOOKING (599,₱)
|
174 |
+
WALK-IN FEE ( 799,₱)
|
175 |
+
|
176 |
+
NO DRUGS NO VIDEO NO PICTURE TAKING DURING EVENT
|
177 |
+
FREE CONDOMS AND LUBE
|
178 |
+
|
179 |
+
INCLUSION:
|
180 |
+
💯 UNLIMITED DRINK
|
181 |
+
💯 COCKTAIL… https://t.co/HHQZYokDur https://t.co/QT8Rf2jryA"
|
182 |
+
48,"TONIGHT @ MAKATI
|
183 |
+
💪🔥 BRIEF-PARTY 🔥💪
|
184 |
+
|
185 |
+
💯 ORGYNUMAN 💯
|
186 |
+
|
187 |
+
OCTOBER 14 POBLACION MAKATI SATURDAY DOOR OPEN 6PM TO 6AM
|
188 |
+
|
189 |
+
PRE BOOKING (599,₱)
|
190 |
+
WALK-IN FEE ( 799,₱)
|
191 |
+
|
192 |
+
NO DRUGS NO VIDEO NO PICTURE TAKING DURING EVENT
|
193 |
+
FREE CONDOMS AND LUBE
|
194 |
+
|
195 |
+
INCLUSION:
|
196 |
+
💯 UNLIMITED DRINK
|
197 |
+
💯 COCKTAIL… https://t.co/tV1yUxqbN5 https://t.co/stM2LF82ph"
|
198 |
+
49,"TONIGHT @ MAKATI
|
199 |
+
💪🔥 BRIEF-PARTY 🔥💪
|
200 |
+
|
201 |
+
💯 ORGYNUMAN 💯
|
202 |
+
|
203 |
+
OCTOBER 14 POBLACION MAKATI SATURDAY DOOR OPEN 6PM TO 6AM
|
204 |
+
|
205 |
+
PRE BOOKING (599,₱)
|
206 |
+
WALK-IN FEE ( 799,₱)
|
207 |
+
|
208 |
+
NO DRUGS NO VIDEO NO PICTURE TAKING DURING EVENT
|
209 |
+
FREE CONDOMS AND LUBE
|
210 |
+
|
211 |
+
INCLUSION:
|
212 |
+
💯 UNLIMITED DRINK
|
213 |
+
💯 COCKTAIL… https://t.co/AsvBfU73gb https://t.co/GzbScV3hwM"
|
214 |
+
50,"TONIGHT @ MAKATI
|
215 |
+
💪🔥 BRIEF-PARTY 🔥💪
|
216 |
+
|
217 |
+
💯 ORGYNUMAN 💯
|
218 |
+
|
219 |
+
OCTOBER 14 POBLACION MAKATI SATURDAY DOOR OPEN 6PM TO 6AM
|
220 |
+
|
221 |
+
PRE BOOKING (599,₱)
|
222 |
+
WALK-IN FEE ( 799,₱)
|
223 |
+
|
224 |
+
NO DRUGS NO VIDEO NO PICTURE TAKING DURING EVENT
|
225 |
+
FREE CONDOMS AND LUBE
|
226 |
+
|
227 |
+
INCLUSION:
|
228 |
+
💯 UNLIMITED DRINK
|
229 |
+
💯 COCKTAIL… https://t.co/R2fmvLp4d3 https://t.co/opqiWMhlCz"
|
230 |
+
51,"ℍ𝔸ℙℙ𝔼ℕ𝕀ℕ𝔾 ℕ𝕆𝕎| National ID registration at the 3rd Floor of the LDRRM Bldg., Poblacion, Paracelis. Registration is open Monday to Saturday, 8:00AM- 5:00PM.
|
231 |
+
|
232 |
+
#NationalID
|
233 |
+
#IbaNaAngKilala https://t.co/RnYLDGbEjO"
|
234 |
+
52,"Resumen
|
235 |
+
|
236 |
+
...
|
237 |
+
➡️ https://t.co/cEcdvICoX1"
|
238 |
+
53,@RadioGenoa If this continues escalating perhaps they will need to start using weapons to defend themselves and the población of this criminals.
|
239 |
+
54,"ℍ𝔸ℙℙ𝔼ℕ𝕀ℕ𝔾 ℕ𝕆𝕎| National ID registration and market day of Sabangan at San Alfonso Grounds, Poblacion, Sabangan.
|
240 |
+
|
241 |
+
For those who are not yet registered, visit our Fixed Registration Center at the Groundfloor of the Bontoc Municipal Capitol.
|
242 |
+
#NationalID
|
243 |
+
#IbaNaAngKilala https://t.co/HtnmhjPVUw"
|
244 |
+
55,"𝕀ℕ ℙℍ𝕆𝕋𝕆𝕊| National ID registration during the DSWD Family Development Session at the Municipal Capitol, Poblacion.
|
245 |
+
|
246 |
+
For those who are not yet registered, visit our Fixed Registration Center at the Groundfloor of the Bontoc Municipal Capitol.
|
247 |
+
#NationalID
|
248 |
+
#IbaNaAngKilala https://t.co/R6ORufhFDM"
|
249 |
+
56,"1/2: 2165 beneficiaries mula sa Brgy. Kasilawan, Brgy. Tejeros, Brgy. Sta. Cruz, Brgy. La Paz, at Brgy. Poblacion ang nakatanggap ng tulong mula sa ating Assistance to Individuals in Crisis Situations (AICS) program na ginanap ngayong araw. https://t.co/c1kG8BxJ2o"
|
250 |
+
57,"WANTED SCAMMER !!
|
251 |
+
|
252 |
+
Yalao Jr Pacifico Delfin
|
253 |
+
09452019207, 09122390128
|
254 |
+
Poblacion San Pablo Zamboanga del Sur
|
255 |
+
[email protected] https://t.co/LjwKaW04L1"
|
256 |
+
58,performing at Apotheka Poblacion tonight with @ladygagitaofcl 🫶🏼🫶🏼
|
257 |
+
59,Limgaw kaayo kog bar hopping sa pobla gabie haha
|
258 |
+
60,"Craving? Try Kurimi milktea, coffee, yogurt and snacks at Poblacion- M.H Del Pilar St., Silang, Cavite. Visit the 2nd Kurīmī Milk Tea branch of Atty. Vea Magsaysay #KurimiPoblacion #kurimi2D #kurimilicious https://t.co/StFxxLis73"
|
259 |
+
61,"Baka lang meron may alam here. Me and my cousins plan na mag party sa poblacion Oct 31 Halloween salubong. Any reco po where to stay? Like condotel, hotels or kahit transient na pwedi namin ma rent buong space."
|
260 |
+
62,THE ERASE TOUR APOTHEKA POBLACION TONIGHT. SOLD OUT!
|
261 |
+
63,"Palestinian President Mahmoud Abbas reaffirms to Blinken that they will not allow their population to leave Gaza.
|
262 |
+
#IsraelPalestineConflict
|
263 |
+
|
264 |
+
https://t.co/3zUqjhK6Jj"
|
265 |
+
64,Not so sleigh ang nangyare sa pobla ha 😩
|
266 |
+
65,"That day has come. I am in Poblacion at 430am running while people are still actively partying while some are passed out outside. Prepandemic me would've said, ""hi old"""
|
267 |
+
66,mingaw nauud kyko sakong uyab lami kaayo ilupad ug poblacion
|
268 |
+
67,"@MohammedPu50150 @GalindoKno52453 @ortizdiegoo8 @tunakorcak @blakerypel16 @KILLBARI @AlpheuBALDE @agboola_giwa @vkvjnaf @SorenHedman @adamsantaii @lichisup @poblacion_csic @el_forjador @gaolay @Gawanwb @dver_fioletovay @ahmetalibacak2 @SakukoGame @SmiLe_In_D_dark @bewarethehunt Fuck you mohammed, I’m gonna make Nagasaki look like a kids show if you keep this up"
|
269 |
+
68,Or sana may ma-meet akong cutie and matinong guys from Makati esp. sa Pobla na potential bebe ko. Huhu
|
270 |
+
69,hindi ako mapupunta sa poblacion
|
271 |
+
70,Traffic around Poblacion on Friday nights is crazy
|
272 |
+
71,girlie dont u ever stop working??? nageemail ka pa sakin ng 10pm ON A FRIDAY??? dont tell me nasa pobla ka habang tinatype mo yan pls nakakaloka GET A LIFE OUTSIDE OF WORK NAMAN HUY
|
273 |
+
72,gusto ko maranasan halloween party sa poblacion plsss di ako nakapunta last year kasi busy sa thesis at exam 😢
|
274 |
+
73,sadt naman here sa Pobla. gusto ko nlg mgkape pls
|
275 |
+
74,nag poblacion para kumain >>> uminom
|
276 |
+
75,what if pobla?🤔
|
277 |
+
76,Gagu napunta kami sa poblacion 😭
|
278 |
+
77,@ Poblacion init ng ulo ko dito na ako kumain
|
279 |
+
78,"@aipeiramr Mag usap naman po tayo, okay naman tayo eh. Hindi ko alam bakit ka po nagagalit saglit lang naman po kami sa Poblacion kanina"
|
280 |
+
79,Poblacion gaming tomorrow after a week of loaded work!!!!!!!! 😭😭😭 finallyyyyyyyyyyyyyyy
|
281 |
+
80,kadami naman nka NCTdream varsity jacket dito sa pobla 😅
|
282 |
+
81,i guess see u at pobla tom???
|
283 |
+
82,@jadedazes i become the poblacion girl of the convention world /j
|
284 |
+
83,"Continous conduct of PNP/COMELEC Checkpoint along Abra-Kalinga road particularly in So. Nagsangalan, Barangay Poblacion East, Pidigan, Abra. https://t.co/lTTdJKiE50"
|
285 |
+
84,"BUKAS NA PO ITO OCT 14 SATURDAY AT POBLACION MAKATI
|
286 |
+
SEE YOU BUKAS UGH 😍🍆
|
287 |
+
|
288 |
+
DM @AshtonDaks69
|
289 |
+
@pinoyspabuddy https://t.co/rJpbbMhG3O"
|
290 |
+
85,"@yanamnv gagi wag, mag pobla pa tauu 🙁🙁😭"
|
291 |
+
86,@_kyunhee tara ituloy ang pobla 👀 AHAHAHA
|
292 |
+
87,"Huhu grabe pota kapoyyy tgif walang paparty, miss ko na mag pobla"
|
293 |
+
88,Daming nag aaya may pop up may pobla hahaha shuta
|
294 |
+
89,guys pobla tayo mamaya
|
295 |
+
90,@smninews @indaysara Madame VP-Sarah muzta na po kayo me tweet po ako sa inyo si Jaime A. CABACANG po i2 Poblacion Munt city
|
296 |
+
91,Friday the 13th. Poblacion. Working. Waiting for PSJ updates. 😌 https://t.co/hjO21pVhMB
|
297 |
+
92,"Usap-usapan ngayon sa social media ang rambulan ng mga babae sa isang bar sa Poblacion, Makati.
|
298 |
+
|
299 |
+
https://t.co/0ephIV6FJW
|
300 |
+
|
301 |
+
#TunayNaTabloidista #AbanteNews"
|
302 |
+
93,"Happy Friday the 13th! 🍻
|
303 |
+
https://t.co/a3fwZ0uorS"
|
304 |
+
94,"@Jorge17125130 @MediavillaMarga En fin
|
305 |
+
https://t.co/lqGvl9DFSN"
|
306 |
+
95,"We know you're sooo ready to clock out.
|
307 |
+
https://t.co/0kvopK5jYH"
|
app.py
CHANGED
@@ -1,45 +1,78 @@
|
|
1 |
-
# Required Libraries
|
2 |
-
|
3 |
-
#Base and Cleaning
|
4 |
-
import json
|
5 |
-
import requests
|
6 |
import pandas as pd
|
7 |
-
import
|
8 |
-
import emoji
|
9 |
-
import regex
|
10 |
import re
|
11 |
-
import
|
12 |
-
from collections import Counter
|
13 |
-
import tqdm
|
14 |
-
from operator import itemgetter
|
15 |
-
|
16 |
-
#Visualizations
|
17 |
-
import plotly.express as px
|
18 |
-
import seaborn as sns
|
19 |
-
import matplotlib.pyplot as plt
|
20 |
-
import pyLDAvis.gensim
|
21 |
-
import chart_studio
|
22 |
-
import chart_studio.plotly as py
|
23 |
-
import chart_studio.tools as tls
|
24 |
-
|
25 |
-
#Natural Language Processing (NLP)
|
26 |
import spacy
|
27 |
import gensim
|
28 |
import json
|
|
|
|
|
29 |
from spacy.tokenizer import Tokenizer
|
|
|
|
|
|
|
30 |
from gensim.corpora import Dictionary
|
31 |
-
from gensim.models.ldamulticore import LdaMulticore
|
32 |
from gensim.models.coherencemodel import CoherenceModel
|
33 |
-
|
34 |
-
from sklearn.decomposition import LatentDirichletAllocation, TruncatedSVD
|
35 |
-
from sklearn.feature_extraction.text import CountVectorizer, TfidfVectorizer
|
36 |
-
from sklearn.model_selection import GridSearchCV
|
37 |
from pprint import pprint
|
38 |
-
|
|
|
|
|
|
|
|
|
39 |
from gensim.parsing.preprocessing import preprocess_string, strip_punctuation, strip_numeric
|
40 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
41 |
import gradio as gr
|
42 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
43 |
def give_emoji_free_text(text):
|
44 |
"""
|
45 |
Removes emoji's from tweets
|
@@ -59,6 +92,18 @@ def url_free_text(text):
|
|
59 |
text = re.sub(r'http\S+', '', text)
|
60 |
return text
|
61 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
62 |
# Tokenizer function
|
63 |
def tokenize(text):
|
64 |
"""
|
@@ -86,60 +131,8 @@ def tokenize(text):
|
|
86 |
|
87 |
return tokens
|
88 |
|
89 |
-
def
|
90 |
-
coherence_values = []
|
91 |
-
model_list = []
|
92 |
-
for num_topics in range(start, limit, step):
|
93 |
-
model = gensim.models.ldamodel.LdaModel(corpus=corpus,
|
94 |
-
num_topics=num_topics,
|
95 |
-
random_state=100,
|
96 |
-
chunksize=200,
|
97 |
-
passes=10,
|
98 |
-
per_word_topics=True,
|
99 |
-
id2word=id2word)
|
100 |
-
model_list.append(model)
|
101 |
-
coherencemodel = CoherenceModel(model=model, texts=texts, dictionary=dictionary, coherence='c_v')
|
102 |
-
coherence_values.append(coherencemodel.get_coherence())
|
103 |
-
|
104 |
-
return model_list, coherence_values
|
105 |
-
|
106 |
-
def compute_coherence_values2(corpus, dictionary, k, a, b):
|
107 |
-
lda_model = gensim.models.ldamodel.LdaModel(corpus=corpus,
|
108 |
-
id2word=id2word,
|
109 |
-
num_topics=num_topics,
|
110 |
-
random_state=100,
|
111 |
-
chunksize=200,
|
112 |
-
passes=10,
|
113 |
-
alpha=a,
|
114 |
-
eta=b,
|
115 |
-
per_word_topics=True)
|
116 |
-
coherence_model_lda = CoherenceModel(model=lda_model, texts=df['lemma_tokens'], dictionary=id2word, coherence='c_v')
|
117 |
-
|
118 |
-
return coherence_model_lda.get_coherence()
|
119 |
-
|
120 |
-
def assignTopic(l):
|
121 |
-
maxTopic = max(l,key=itemgetter(1))[0]
|
122 |
-
return maxTopic
|
123 |
-
|
124 |
-
def get_topic_value(row, i):
|
125 |
-
if len(row) == 1:
|
126 |
-
return row[0][1]
|
127 |
-
else:
|
128 |
-
return row[i][1]
|
129 |
-
|
130 |
-
def dataframeProcessing(dataset):
|
131 |
-
# Opening JSON file
|
132 |
-
f = open('stopwords-tl.json')
|
133 |
-
tlStopwords = json.loads(f.read())
|
134 |
-
stopwords = set(STOPWORDS)
|
135 |
-
stopwords.update(tlStopwords)
|
136 |
-
stopwords.update(['na', 'sa', 'ko', 'ako', 'ng', 'mga', 'ba', 'ka', 'yung', 'lang', 'di', 'mo', 'kasi'])
|
137 |
-
|
138 |
-
global df
|
139 |
-
df = pd.read_csv(dataset + '.csv')
|
140 |
df.rename(columns = {'tweet':'original_tweets'}, inplace = True)
|
141 |
-
df = df.apply(lambda row: row[df['language'].isin(['en'])])
|
142 |
-
df.reset_index(inplace=True)
|
143 |
|
144 |
# Apply the function above and get tweets free of emoji's
|
145 |
call_emoji_free = lambda x: give_emoji_free_text(x)
|
@@ -150,9 +143,13 @@ def dataframeProcessing(dataset):
|
|
150 |
#Create a new column with url free tweets
|
151 |
df['url_free_tweets'] = df['emoji_free_tweets'].apply(url_free_text)
|
152 |
|
153 |
-
|
154 |
-
|
155 |
-
|
|
|
|
|
|
|
|
|
156 |
|
157 |
# Tokenizer
|
158 |
tokenizer = Tokenizer(nlp.vocab)
|
@@ -185,19 +182,6 @@ def dataframeProcessing(dataset):
|
|
185 |
# Make tokens a string again
|
186 |
df['tokens_back_to_text'] = [' '.join(map(str, l)) for l in df['tokens']]
|
187 |
|
188 |
-
def get_lemmas(text):
|
189 |
-
'''Used to lemmatize the processed tweets'''
|
190 |
-
lemmas = []
|
191 |
-
|
192 |
-
doc = nlp(text)
|
193 |
-
|
194 |
-
# Something goes here :P
|
195 |
-
for token in doc:
|
196 |
-
if ((token.is_stop == False) and (token.is_punct == False)) and (token.pos_ != 'PRON'):
|
197 |
-
lemmas.append(token.lemma_)
|
198 |
-
|
199 |
-
return lemmas
|
200 |
-
|
201 |
df['lemmas'] = df['tokens_back_to_text'].apply(get_lemmas)
|
202 |
|
203 |
# Make lemmas a string again
|
@@ -206,60 +190,111 @@ def dataframeProcessing(dataset):
|
|
206 |
# Apply tokenizer
|
207 |
df['lemma_tokens'] = df['lemmas_back_to_text'].apply(tokenize)
|
208 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
209 |
# Create a id2word dictionary
|
210 |
-
global id2word
|
211 |
id2word = Dictionary(df['lemma_tokens'])
|
212 |
|
213 |
# Filtering Extremes
|
214 |
id2word.filter_extremes(no_below=2, no_above=.99)
|
215 |
-
print(len(id2word))
|
216 |
|
217 |
# Creating a corpus object
|
|
|
218 |
corpus = [id2word.doc2bow(d) for d in df['lemma_tokens']]
|
|
|
|
|
219 |
|
220 |
-
|
221 |
-
|
222 |
-
num_topics=5,
|
223 |
-
random_state=100,
|
224 |
-
chunksize=200,
|
225 |
-
passes=10,
|
226 |
-
per_word_topics=True)
|
227 |
|
228 |
-
|
229 |
-
|
230 |
|
231 |
-
|
232 |
-
|
233 |
-
|
234 |
-
|
|
|
|
|
235 |
texts=df['lemma_tokens'],
|
236 |
start=2,
|
237 |
limit=10,
|
238 |
-
step=1
|
|
|
|
|
|
|
239 |
|
240 |
-
|
|
|
241 |
|
242 |
-
|
243 |
-
num_topics = coherence_values.index(k_max) + 2
|
244 |
|
245 |
-
|
246 |
-
|
247 |
-
|
248 |
-
|
249 |
-
chunksize=200,
|
250 |
-
passes=10,
|
251 |
-
per_word_topics=True)
|
252 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
253 |
grid = {}
|
254 |
grid['Validation_Set'] = {}
|
255 |
|
|
|
|
|
|
|
|
|
|
|
256 |
alpha = [0.05, 0.1, 0.5, 1, 5, 10]
|
|
|
|
|
257 |
|
258 |
beta = [0.05, 0.1, 0.5, 1, 5, 10]
|
|
|
259 |
|
260 |
-
num_of_docs = len(
|
261 |
-
corpus_sets = [gensim.utils.ClippedCorpus(
|
262 |
-
|
263 |
corpus_title = ['75% Corpus', '100% Corpus']
|
264 |
model_results = {'Validation_Set': [],
|
265 |
'Alpha': [],
|
@@ -272,69 +307,87 @@ def dataframeProcessing(dataset):
|
|
272 |
for i in range(len(corpus_sets)):
|
273 |
for a in alpha:
|
274 |
for b in beta:
|
275 |
-
cv = compute_coherence_values2(corpus=corpus_sets[i],
|
|
|
|
|
|
|
|
|
276 |
model_results['Validation_Set'].append(corpus_title[i])
|
277 |
model_results['Alpha'].append(a)
|
278 |
model_results['Beta'].append(b)
|
279 |
model_results['Coherence'].append(cv)
|
280 |
|
281 |
-
|
282 |
pd.DataFrame(model_results).to_csv('lda_tuning_results_new.csv', index=False)
|
283 |
pbar.close()
|
284 |
|
285 |
params_df = pd.read_csv('lda_tuning_results_new.csv')
|
286 |
-
params_df = params_df[params_df.Validation_Set == '
|
287 |
params_df.reset_index(inplace=True)
|
288 |
-
|
289 |
max_params = params_df.loc[params_df['Coherence'].idxmax()]
|
290 |
max_coherence = max_params['Coherence']
|
291 |
max_alpha = max_params['Alpha']
|
292 |
max_beta = max_params['Beta']
|
293 |
-
|
294 |
-
|
295 |
-
|
296 |
-
|
297 |
-
|
298 |
-
|
299 |
-
|
300 |
-
|
301 |
-
|
302 |
-
|
303 |
-
|
|
|
|
|
304 |
coherence_model_lda = CoherenceModel(model=lda_model_final, texts=df['lemma_tokens'], dictionary=id2word,
|
305 |
-
|
306 |
coherence_lda = coherence_model_lda.get_coherence()
|
307 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
308 |
lda_topics = lda_model_final.show_topics(num_words=10)
|
309 |
|
310 |
topics = []
|
311 |
filters = [lambda x: x.lower(), strip_punctuation, strip_numeric]
|
312 |
-
lda_topics_string = ''
|
313 |
|
314 |
for topic in lda_topics:
|
315 |
-
print(topic)
|
316 |
-
lda_topics_string += 'Topic ' + str(topic[0]) + '\n' + str(topic[1]) + '\n\n'
|
317 |
topics.append(preprocess_string(topic[1], filters))
|
318 |
|
319 |
-
df['topic'] = [sorted(lda_model_final[
|
320 |
-
|
321 |
-
def sort_topics(l):
|
322 |
-
return(sorted(l, key=lambda x: x[1], reverse=True))
|
323 |
-
|
324 |
-
df['topic'] = df['topic'].apply(sort_topics)
|
325 |
-
df['topic_string'] = df['topic'].astype(str)
|
326 |
-
|
327 |
|
328 |
df = df[df['topic'].map(lambda d: len(d)) > 0]
|
329 |
-
df['
|
330 |
-
|
331 |
-
df['max_topic'] = df['topic'].map(lambda row: assignTopic(row))
|
332 |
|
|
|
333 |
topic_clusters = []
|
334 |
for i in range(num_topics):
|
335 |
topic_clusters.append(df[df['max_topic'].isin(([i]))])
|
336 |
topic_clusters[i] = topic_clusters[i]['original_tweets'].tolist()
|
337 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
338 |
for i in range(len(topic_clusters)):
|
339 |
tweets = df.loc[df['max_topic'] == i]
|
340 |
tweets['topic'] = tweets['topic'].apply(lambda x: get_topic_value(x, i))
|
@@ -343,22 +396,151 @@ def dataframeProcessing(dataset):
|
|
343 |
tweets_sorted.drop_duplicates(subset=['original_tweets'])
|
344 |
rep_tweets = tweets_sorted['original_tweets']
|
345 |
rep_tweets = [*set(rep_tweets)]
|
346 |
-
|
347 |
-
print(
|
348 |
-
|
349 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
350 |
|
351 |
-
|
352 |
-
|
353 |
-
|
354 |
-
|
355 |
-
|
356 |
-
|
357 |
-
|
358 |
-
|
359 |
-
|
360 |
-
|
361 |
-
|
362 |
-
|
363 |
-
|
364 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
import pandas as pd
|
2 |
+
import tweepy
|
|
|
|
|
3 |
import re
|
4 |
+
import emoji
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
5 |
import spacy
|
6 |
import gensim
|
7 |
import json
|
8 |
+
import string
|
9 |
+
|
10 |
from spacy.tokenizer import Tokenizer
|
11 |
+
from gensim.parsing.preprocessing import STOPWORDS as SW
|
12 |
+
from wordcloud import STOPWORDS
|
13 |
+
|
14 |
from gensim.corpora import Dictionary
|
|
|
15 |
from gensim.models.coherencemodel import CoherenceModel
|
16 |
+
import matplotlib.pyplot as plt
|
|
|
|
|
|
|
17 |
from pprint import pprint
|
18 |
+
|
19 |
+
import numpy as np
|
20 |
+
import tqdm
|
21 |
+
import pyLDAvis.gensim
|
22 |
+
|
23 |
from gensim.parsing.preprocessing import preprocess_string, strip_punctuation, strip_numeric
|
24 |
|
25 |
+
import torch
|
26 |
+
from transformers import T5ForConditionalGeneration,T5Tokenizer
|
27 |
+
from googletrans import Translator
|
28 |
+
|
29 |
+
from bertopic import BERTopic
|
30 |
+
from umap import UMAP
|
31 |
+
from sklearn.feature_extraction.text import CountVectorizer
|
32 |
+
|
33 |
+
from operator import itemgetter
|
34 |
+
|
35 |
import gradio as gr
|
36 |
|
37 |
+
bearer_token = 'AAAAAAAAAAAAAAAAAAAAACEigwEAAAAACoP8KHJYLOKCL4OyB9LEPV00VB0%3DmyeDROUvw4uipHwvbPPfnTuY0M9ORrLuXrMvcByqZhwo3SUc4F'
|
38 |
+
client = tweepy.Client(bearer_token=bearer_token)
|
39 |
+
nlp = spacy.load('en_core_web_lg')
|
40 |
+
|
41 |
+
def scrape(keyword_list):
|
42 |
+
if len(keyword_list) > 1:
|
43 |
+
keywords = '(' + ' OR '.join(keyword_list) + ')'
|
44 |
+
else:
|
45 |
+
keywords = keyword_list[0]
|
46 |
+
|
47 |
+
query = keywords + ' (lang:en OR lang:tl) -is:retweet'
|
48 |
+
max_results = 100
|
49 |
+
tweet_fields=['geo', 'id', 'lang', 'created_at']
|
50 |
+
expansions=['geo.place_id']
|
51 |
+
place_fields = ['contained_within', 'country', 'country_code', 'full_name', 'geo', 'id', 'name', 'place_type']
|
52 |
+
|
53 |
+
response = client.search_recent_tweets(
|
54 |
+
query=query,
|
55 |
+
max_results=max_results,
|
56 |
+
tweet_fields=tweet_fields,
|
57 |
+
expansions=expansions,
|
58 |
+
place_fields=place_fields
|
59 |
+
)
|
60 |
+
|
61 |
+
tweets = []
|
62 |
+
for x in response[0]:
|
63 |
+
tweets.append(str(x))
|
64 |
+
|
65 |
+
place_data = response[1]
|
66 |
+
|
67 |
+
global df
|
68 |
+
df = pd.DataFrame(tweets, columns=['tweet'])
|
69 |
+
|
70 |
+
return place_data
|
71 |
+
|
72 |
+
def get_example(dataset):
|
73 |
+
global df
|
74 |
+
df = pd.read_csv(dataset + '.csv')
|
75 |
+
|
76 |
def give_emoji_free_text(text):
|
77 |
"""
|
78 |
Removes emoji's from tweets
|
|
|
92 |
text = re.sub(r'http\S+', '', text)
|
93 |
return text
|
94 |
|
95 |
+
def get_lemmas(text):
|
96 |
+
'''Used to lemmatize the processed tweets'''
|
97 |
+
lemmas = []
|
98 |
+
|
99 |
+
doc = nlp(text)
|
100 |
+
|
101 |
+
for token in doc:
|
102 |
+
if ((token.is_stop == False) and (token.is_punct == False)) and (token.pos_ != 'PRON'):
|
103 |
+
lemmas.append(token.lemma_)
|
104 |
+
|
105 |
+
return lemmas
|
106 |
+
|
107 |
# Tokenizer function
|
108 |
def tokenize(text):
|
109 |
"""
|
|
|
131 |
|
132 |
return tokens
|
133 |
|
134 |
+
def cleaning():
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
135 |
df.rename(columns = {'tweet':'original_tweets'}, inplace = True)
|
|
|
|
|
136 |
|
137 |
# Apply the function above and get tweets free of emoji's
|
138 |
call_emoji_free = lambda x: give_emoji_free_text(x)
|
|
|
143 |
#Create a new column with url free tweets
|
144 |
df['url_free_tweets'] = df['emoji_free_tweets'].apply(url_free_text)
|
145 |
|
146 |
+
|
147 |
+
|
148 |
+
f = open('node_modules/stopwords-tl/stopwords-tl.json')
|
149 |
+
tlStopwords = json.loads(f.read())
|
150 |
+
stopwords = set(STOPWORDS)
|
151 |
+
stopwords.update(tlStopwords)
|
152 |
+
stopwords.update(['na', 'sa', 'ko', 'ako', 'ng', 'mga', 'ba', 'ka', 'yung', 'lang', 'di', 'mo', 'kasi'])
|
153 |
|
154 |
# Tokenizer
|
155 |
tokenizer = Tokenizer(nlp.vocab)
|
|
|
182 |
# Make tokens a string again
|
183 |
df['tokens_back_to_text'] = [' '.join(map(str, l)) for l in df['tokens']]
|
184 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
185 |
df['lemmas'] = df['tokens_back_to_text'].apply(get_lemmas)
|
186 |
|
187 |
# Make lemmas a string again
|
|
|
190 |
# Apply tokenizer
|
191 |
df['lemma_tokens'] = df['lemmas_back_to_text'].apply(tokenize)
|
192 |
|
193 |
+
def split_corpus(corpus, n):
|
194 |
+
for i in range(0, len(corpus), n):
|
195 |
+
corpus_split = corpus
|
196 |
+
yield corpus_split[i:i + n]
|
197 |
+
|
198 |
+
def compute_coherence_values_base_lda(dictionary, corpus, texts, limit, coherence, start=2, step=1):
|
199 |
+
coherence_values = []
|
200 |
+
model_list = []
|
201 |
+
for num_topics in range(start, limit, step):
|
202 |
+
model = gensim.models.ldamodel.LdaModel(corpus=corpus,
|
203 |
+
num_topics=num_topics,
|
204 |
+
random_state=100,
|
205 |
+
chunksize=200,
|
206 |
+
passes=10,
|
207 |
+
per_word_topics=True,
|
208 |
+
id2word=id2word)
|
209 |
+
model_list.append(model)
|
210 |
+
coherencemodel = CoherenceModel(model=model, texts=texts, dictionary=dictionary, coherence=coherence)
|
211 |
+
coherence_values.append(coherencemodel.get_coherence())
|
212 |
+
|
213 |
+
return model_list, coherence_values
|
214 |
+
|
215 |
+
def base_lda():
|
216 |
# Create a id2word dictionary
|
217 |
+
global id2word
|
218 |
id2word = Dictionary(df['lemma_tokens'])
|
219 |
|
220 |
# Filtering Extremes
|
221 |
id2word.filter_extremes(no_below=2, no_above=.99)
|
|
|
222 |
|
223 |
# Creating a corpus object
|
224 |
+
global corpus
|
225 |
corpus = [id2word.doc2bow(d) for d in df['lemma_tokens']]
|
226 |
+
global corpus_og
|
227 |
+
corpus_og = [id2word.doc2bow(d) for d in df['lemma_tokens']]
|
228 |
|
229 |
+
corpus_split = corpus
|
230 |
+
split_corpus(corpus_split, 5)
|
|
|
|
|
|
|
|
|
|
|
231 |
|
232 |
+
global coherence
|
233 |
+
coherence = 'c_v'
|
234 |
|
235 |
+
coherence_averages = [0] * 8
|
236 |
+
for i in range(5):
|
237 |
+
training_corpus = corpus_split
|
238 |
+
training_corpus.remove(training_corpus[i])
|
239 |
+
print(training_corpus[i])
|
240 |
+
model_list, coherence_values = compute_coherence_values_base_lda(dictionary=id2word, corpus=training_corpus,
|
241 |
texts=df['lemma_tokens'],
|
242 |
start=2,
|
243 |
limit=10,
|
244 |
+
step=1,
|
245 |
+
coherence=coherence)
|
246 |
+
for j in range(len(coherence_values)):
|
247 |
+
coherence_averages[j] += coherence_values[j]
|
248 |
|
249 |
+
limit = 10; start = 2; step = 1;
|
250 |
+
x = range(start, limit, step)
|
251 |
|
252 |
+
coherence_averages = [x / 5 for x in coherence_averages]
|
|
|
253 |
|
254 |
+
if coherence == 'c_v':
|
255 |
+
k_max = max(coherence_averages)
|
256 |
+
else:
|
257 |
+
k_max = min(coherence_averages, key=abs)
|
|
|
|
|
|
|
258 |
|
259 |
+
global num_topics
|
260 |
+
num_topics = coherence_averages.index(k_max) + 2
|
261 |
+
|
262 |
+
def compute_coherence_values2(corpus, dictionary, k, a, b):
|
263 |
+
lda_model = gensim.models.ldamodel.LdaModel(corpus=corpus,
|
264 |
+
id2word=id2word,
|
265 |
+
num_topics=num_topics,
|
266 |
+
random_state=100,
|
267 |
+
chunksize=200,
|
268 |
+
passes=10,
|
269 |
+
alpha=a,
|
270 |
+
eta=b,
|
271 |
+
per_word_topics=True)
|
272 |
+
coherence_model_lda = CoherenceModel(model=lda_model,
|
273 |
+
texts=df['lemma_tokens'],
|
274 |
+
dictionary=id2word,
|
275 |
+
coherence='c_v')
|
276 |
+
|
277 |
+
return coherence_model_lda.get_coherence()
|
278 |
+
|
279 |
+
def hyperparameter_optimization():
|
280 |
grid = {}
|
281 |
grid['Validation_Set'] = {}
|
282 |
|
283 |
+
min_topics = 1
|
284 |
+
max_topics = 10
|
285 |
+
step_size = 1
|
286 |
+
topics_range = range(min_topics, max_topics, step_size)
|
287 |
+
|
288 |
alpha = [0.05, 0.1, 0.5, 1, 5, 10]
|
289 |
+
# alpha.append('symmetric')
|
290 |
+
# alpha.append('asymmetric')
|
291 |
|
292 |
beta = [0.05, 0.1, 0.5, 1, 5, 10]
|
293 |
+
# beta.append('symmetric')
|
294 |
|
295 |
+
num_of_docs = len(corpus_og)
|
296 |
+
corpus_sets = [gensim.utils.ClippedCorpus(corpus_og, int(num_of_docs*0.75)),
|
297 |
+
corpus_og]
|
298 |
corpus_title = ['75% Corpus', '100% Corpus']
|
299 |
model_results = {'Validation_Set': [],
|
300 |
'Alpha': [],
|
|
|
307 |
for i in range(len(corpus_sets)):
|
308 |
for a in alpha:
|
309 |
for b in beta:
|
310 |
+
cv = compute_coherence_values2(corpus=corpus_sets[i],
|
311 |
+
dictionary=id2word,
|
312 |
+
k=num_topics,
|
313 |
+
a=a,
|
314 |
+
b=b)
|
315 |
model_results['Validation_Set'].append(corpus_title[i])
|
316 |
model_results['Alpha'].append(a)
|
317 |
model_results['Beta'].append(b)
|
318 |
model_results['Coherence'].append(cv)
|
319 |
|
320 |
+
pbar.update(1)
|
321 |
pd.DataFrame(model_results).to_csv('lda_tuning_results_new.csv', index=False)
|
322 |
pbar.close()
|
323 |
|
324 |
params_df = pd.read_csv('lda_tuning_results_new.csv')
|
325 |
+
params_df = params_df[params_df.Validation_Set == '75% Corpus']
|
326 |
params_df.reset_index(inplace=True)
|
327 |
+
params_df = params_df.replace(np.inf, -np.inf)
|
328 |
max_params = params_df.loc[params_df['Coherence'].idxmax()]
|
329 |
max_coherence = max_params['Coherence']
|
330 |
max_alpha = max_params['Alpha']
|
331 |
max_beta = max_params['Beta']
|
332 |
+
max_validation_set = max_params['Validation_Set']
|
333 |
+
|
334 |
+
global lda_model_final
|
335 |
+
lda_model_final = gensim.models.ldamodel.LdaModel(corpus=corpus_og,
|
336 |
+
id2word=id2word,
|
337 |
+
num_topics=num_topics,
|
338 |
+
random_state=100,
|
339 |
+
chunksize=200,
|
340 |
+
passes=10,
|
341 |
+
alpha=max_alpha,
|
342 |
+
eta=max_beta,
|
343 |
+
per_word_topics=True)
|
344 |
+
|
345 |
coherence_model_lda = CoherenceModel(model=lda_model_final, texts=df['lemma_tokens'], dictionary=id2word,
|
346 |
+
coherence='c_v')
|
347 |
coherence_lda = coherence_model_lda.get_coherence()
|
348 |
|
349 |
+
return coherence_lda
|
350 |
+
|
351 |
+
def assignMaxTopic(l):
|
352 |
+
maxTopic = max(l,key=itemgetter(1))[0]
|
353 |
+
return maxTopic
|
354 |
+
|
355 |
+
def assignTopic(l):
|
356 |
+
topics = []
|
357 |
+
for x in l:
|
358 |
+
topics.append(x[0])
|
359 |
+
|
360 |
+
def topic_assignment():
|
361 |
lda_topics = lda_model_final.show_topics(num_words=10)
|
362 |
|
363 |
topics = []
|
364 |
filters = [lambda x: x.lower(), strip_punctuation, strip_numeric]
|
|
|
365 |
|
366 |
for topic in lda_topics:
|
|
|
|
|
367 |
topics.append(preprocess_string(topic[1], filters))
|
368 |
|
369 |
+
df['topic'] = [sorted(lda_model_final[corpus_og][text][0]) for text in range(len(df['original_tweets']))]
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
370 |
|
371 |
df = df[df['topic'].map(lambda d: len(d)) > 0]
|
372 |
+
df['max_topic'] = df['topic'].map(lambda row: assignMaxTopic(row))
|
|
|
|
|
373 |
|
374 |
+
global topic_clusters
|
375 |
topic_clusters = []
|
376 |
for i in range(num_topics):
|
377 |
topic_clusters.append(df[df['max_topic'].isin(([i]))])
|
378 |
topic_clusters[i] = topic_clusters[i]['original_tweets'].tolist()
|
379 |
+
|
380 |
+
def get_topic_value(row, i):
|
381 |
+
if len(row) == 1:
|
382 |
+
return row[0][1]
|
383 |
+
else:
|
384 |
+
try:
|
385 |
+
return row[i][1]
|
386 |
+
except Exception as e:
|
387 |
+
print(e)
|
388 |
+
|
389 |
+
global top_tweets
|
390 |
+
top_tweets = []
|
391 |
for i in range(len(topic_clusters)):
|
392 |
tweets = df.loc[df['max_topic'] == i]
|
393 |
tweets['topic'] = tweets['topic'].apply(lambda x: get_topic_value(x, i))
|
|
|
396 |
tweets_sorted.drop_duplicates(subset=['original_tweets'])
|
397 |
rep_tweets = tweets_sorted['original_tweets']
|
398 |
rep_tweets = [*set(rep_tweets)]
|
399 |
+
top_tweets.append(rep_tweets[:5])
|
400 |
+
# print('Topic ', i)
|
401 |
+
# print(rep_tweets[:5])
|
402 |
+
|
403 |
+
def topic_summarization(topic_groups):
|
404 |
+
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
|
405 |
+
|
406 |
+
model = T5ForConditionalGeneration.from_pretrained("Michau/t5-base-en-generate-headline")
|
407 |
+
tokenizer = T5Tokenizer.from_pretrained("Michau/t5-base-en-generate-headline")
|
408 |
+
model = model.to(device)
|
409 |
+
translator = Translator()
|
410 |
+
|
411 |
+
headlines = []
|
412 |
+
for i in range(len(topic_groups)):
|
413 |
+
tweets = " ".join(topic_groups[i])
|
414 |
+
# print(tweets)
|
415 |
+
out = translator.translate(tweets, dest='en')
|
416 |
+
text = out.text
|
417 |
+
# print(tweets)
|
418 |
+
|
419 |
+
max_len = 256
|
420 |
+
|
421 |
+
encoding = tokenizer.encode_plus(text, return_tensors = "pt")
|
422 |
+
input_ids = encoding["input_ids"].to(device)
|
423 |
+
attention_masks = encoding["attention_mask"].to(device)
|
424 |
+
|
425 |
+
beam_outputs = model.generate(
|
426 |
+
input_ids = input_ids,
|
427 |
+
attention_mask = attention_masks,
|
428 |
+
max_length = 64,
|
429 |
+
num_beams = 3,
|
430 |
+
early_stopping = True,
|
431 |
+
)
|
432 |
+
|
433 |
+
result = tokenizer.decode(beam_outputs[0])
|
434 |
+
headlines += "Topic " + str(i) + " " + result
|
435 |
+
|
436 |
+
return headlines
|
437 |
+
|
438 |
+
def compute_coherence_value_bertopic(topic_model):
|
439 |
+
topic_words = [[words for words, _ in topic_model.get_topic(topic)] for topic in range(len(set(topics))-1)]
|
440 |
+
coherence_model = CoherenceModel(topics=topic_words,
|
441 |
+
texts=df['lemma_tokens'],
|
442 |
+
corpus=corpus,
|
443 |
+
dictionary=id2word,
|
444 |
+
coherence=coherence)
|
445 |
+
coherence_score = coherence_model.get_coherence()
|
446 |
+
|
447 |
+
return coherence_score
|
448 |
+
|
449 |
+
def base_bertopic():
|
450 |
+
df['lemma_tokens_string'] = df['lemma_tokens'].apply(lambda x: ' '.join(x))
|
451 |
+
global id2word
|
452 |
+
id2word = Dictionary(df['lemma_tokens'])
|
453 |
+
global corpus
|
454 |
+
corpus = [id2word.doc2bow(d) for d in df['lemma_tokens']]
|
455 |
+
|
456 |
+
global umap_model
|
457 |
+
umap_model = UMAP(n_neighbors=15,
|
458 |
+
n_components=5,
|
459 |
+
min_dist=0.0,
|
460 |
+
metric='cosine',
|
461 |
+
random_state=100)
|
462 |
|
463 |
+
base_topic_model = BERTopic(umap_model=umap_model, language="english", calculate_probabilities=True)
|
464 |
+
|
465 |
+
topics, probabilities = base_topic_model.fit_transform(df['lemma_tokens_string'])
|
466 |
+
|
467 |
+
try:
|
468 |
+
print(compute_coherence_value_bertopic(base_topic_model))
|
469 |
+
except:
|
470 |
+
print('Unable to generate meaningful topics (Base BERTopic model)')
|
471 |
+
|
472 |
+
def optimized_bertopic():
|
473 |
+
vectorizer_model = CountVectorizer(max_features=1_000, stop_words="english")
|
474 |
+
optimized_topic_model = BERTopic(umap_model=umap_model,
|
475 |
+
language="multilingual",
|
476 |
+
n_gram_range=(1, 3),
|
477 |
+
vectorizer_model=vectorizer_model,
|
478 |
+
calculate_probabilities=True)
|
479 |
+
|
480 |
+
topics, probabilities = optimized_topic_model.fit_transform(df['lemma_tokens_string'])
|
481 |
+
|
482 |
+
try:
|
483 |
+
print(compute_coherence_value_bertopic(optimized_topic_model))
|
484 |
+
except:
|
485 |
+
print('Unable to generate meaningful topics, base BERTopic model if possible')
|
486 |
+
|
487 |
+
rep_docs = optimized_topic_model.representative_docs_
|
488 |
+
|
489 |
+
global top_tweets
|
490 |
+
top_tweets = []
|
491 |
+
|
492 |
+
for topic in rep_docs:
|
493 |
+
if topic == -1:
|
494 |
+
print('test')
|
495 |
+
continue
|
496 |
+
topic_docs = rep_docs.get(topic)
|
497 |
+
|
498 |
+
tweets = []
|
499 |
+
for doc in topic_docs:
|
500 |
+
index = df.isin([doc]).any(axis=1).idxmax()
|
501 |
+
# print(index)
|
502 |
+
tweets.append(df.loc[index, 'original_tweets'])
|
503 |
+
print(tweets)
|
504 |
+
top_tweets.append(tweets)
|
505 |
+
|
506 |
+
def main(dataset, model):
|
507 |
+
keyword_list = dataset.split(',')
|
508 |
+
if dataset in examples:
|
509 |
+
get_example(keyword_list)
|
510 |
+
place_data = 'test'
|
511 |
+
else:
|
512 |
+
place_data = str(scrape(keyword_list))
|
513 |
+
cleaning()
|
514 |
+
if model == 'LDA':
|
515 |
+
base_lda()
|
516 |
+
coherence = hyperparameter_optimization()
|
517 |
+
topic_assignment()
|
518 |
+
else:
|
519 |
+
base_bertopic()
|
520 |
+
optimized_bertopic()
|
521 |
+
|
522 |
+
headlines = topic_summarization(top_tweets)
|
523 |
+
headlines = '\n'.join(str(h) for h in headlines)
|
524 |
+
|
525 |
+
|
526 |
+
|
527 |
+
return place_data, headlines
|
528 |
+
|
529 |
+
global examples
|
530 |
+
examples = [
|
531 |
+
["katip,katipunan"],
|
532 |
+
["bgc,bonifacio global city"],
|
533 |
+
["pobla,poblacion"],
|
534 |
+
["cubao"],
|
535 |
+
["taft"]
|
536 |
+
]
|
537 |
+
iface = gr.Interface(fn=main,
|
538 |
+
inputs=["text",
|
539 |
+
gr.Dropdown(["LDA",
|
540 |
+
"BERTopic"],
|
541 |
+
label="Model")
|
542 |
+
],
|
543 |
+
examples=examples
|
544 |
+
outputs=["text",
|
545 |
+
"text"])
|
546 |
+
iface.launch()
|
app_old.py
ADDED
@@ -0,0 +1,364 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
# Required Libraries
|
2 |
+
|
3 |
+
#Base and Cleaning
|
4 |
+
import json
|
5 |
+
import requests
|
6 |
+
import pandas as pd
|
7 |
+
import numpy as np
|
8 |
+
import emoji
|
9 |
+
import regex
|
10 |
+
import re
|
11 |
+
import string
|
12 |
+
from collections import Counter
|
13 |
+
import tqdm
|
14 |
+
from operator import itemgetter
|
15 |
+
|
16 |
+
#Visualizations
|
17 |
+
import plotly.express as px
|
18 |
+
import seaborn as sns
|
19 |
+
import matplotlib.pyplot as plt
|
20 |
+
import pyLDAvis.gensim
|
21 |
+
import chart_studio
|
22 |
+
import chart_studio.plotly as py
|
23 |
+
import chart_studio.tools as tls
|
24 |
+
|
25 |
+
#Natural Language Processing (NLP)
|
26 |
+
import spacy
|
27 |
+
import gensim
|
28 |
+
import json
|
29 |
+
from spacy.tokenizer import Tokenizer
|
30 |
+
from gensim.corpora import Dictionary
|
31 |
+
from gensim.models.ldamulticore import LdaMulticore
|
32 |
+
from gensim.models.coherencemodel import CoherenceModel
|
33 |
+
from gensim.parsing.preprocessing import STOPWORDS as SW
|
34 |
+
from sklearn.decomposition import LatentDirichletAllocation, TruncatedSVD
|
35 |
+
from sklearn.feature_extraction.text import CountVectorizer, TfidfVectorizer
|
36 |
+
from sklearn.model_selection import GridSearchCV
|
37 |
+
from pprint import pprint
|
38 |
+
from wordcloud import STOPWORDS
|
39 |
+
from gensim.parsing.preprocessing import preprocess_string, strip_punctuation, strip_numeric
|
40 |
+
|
41 |
+
import gradio as gr
|
42 |
+
|
43 |
+
def give_emoji_free_text(text):
|
44 |
+
"""
|
45 |
+
Removes emoji's from tweets
|
46 |
+
Accepts:
|
47 |
+
Text (tweets)
|
48 |
+
Returns:
|
49 |
+
Text (emoji free tweets)
|
50 |
+
"""
|
51 |
+
emoji_list = [c for c in text if c in emoji.EMOJI_DATA]
|
52 |
+
clean_text = ' '.join([str for str in text.split() if not any(i in str for i in emoji_list)])
|
53 |
+
return clean_text
|
54 |
+
|
55 |
+
def url_free_text(text):
|
56 |
+
'''
|
57 |
+
Cleans text from urls
|
58 |
+
'''
|
59 |
+
text = re.sub(r'http\S+', '', text)
|
60 |
+
return text
|
61 |
+
|
62 |
+
# Tokenizer function
|
63 |
+
def tokenize(text):
|
64 |
+
"""
|
65 |
+
Parses a string into a list of semantic units (words)
|
66 |
+
Args:
|
67 |
+
text (str): The string that the function will tokenize.
|
68 |
+
Returns:
|
69 |
+
list: tokens parsed out
|
70 |
+
"""
|
71 |
+
# Removing url's
|
72 |
+
pattern = r"http\S+"
|
73 |
+
|
74 |
+
tokens = re.sub(pattern, "", text) # https://www.youtube.com/watch?v=O2onA4r5UaY
|
75 |
+
tokens = re.sub('[^a-zA-Z 0-9]', '', text)
|
76 |
+
tokens = re.sub('[%s]' % re.escape(string.punctuation), '', text) # Remove punctuation
|
77 |
+
tokens = re.sub('\w*\d\w*', '', text) # Remove words containing numbers
|
78 |
+
# tokens = re.sub('@*!*$*', '', text) # Remove @ ! $
|
79 |
+
tokens = tokens.strip(',') # TESTING THIS LINE
|
80 |
+
tokens = tokens.strip('?') # TESTING THIS LINE
|
81 |
+
tokens = tokens.strip('!') # TESTING THIS LINE
|
82 |
+
tokens = tokens.strip("'") # TESTING THIS LINE
|
83 |
+
tokens = tokens.strip(".") # TESTING THIS LINE
|
84 |
+
|
85 |
+
tokens = tokens.lower().split() # Make text lowercase and split it
|
86 |
+
|
87 |
+
return tokens
|
88 |
+
|
89 |
+
def compute_coherence_values(dictionary, corpus, texts, limit, start=2, step=1):
|
90 |
+
coherence_values = []
|
91 |
+
model_list = []
|
92 |
+
for num_topics in range(start, limit, step):
|
93 |
+
model = gensim.models.ldamodel.LdaModel(corpus=corpus,
|
94 |
+
num_topics=num_topics,
|
95 |
+
random_state=100,
|
96 |
+
chunksize=200,
|
97 |
+
passes=10,
|
98 |
+
per_word_topics=True,
|
99 |
+
id2word=id2word)
|
100 |
+
model_list.append(model)
|
101 |
+
coherencemodel = CoherenceModel(model=model, texts=texts, dictionary=dictionary, coherence='c_v')
|
102 |
+
coherence_values.append(coherencemodel.get_coherence())
|
103 |
+
|
104 |
+
return model_list, coherence_values
|
105 |
+
|
106 |
+
def compute_coherence_values2(corpus, dictionary, k, a, b):
|
107 |
+
lda_model = gensim.models.ldamodel.LdaModel(corpus=corpus,
|
108 |
+
id2word=id2word,
|
109 |
+
num_topics=num_topics,
|
110 |
+
random_state=100,
|
111 |
+
chunksize=200,
|
112 |
+
passes=10,
|
113 |
+
alpha=a,
|
114 |
+
eta=b,
|
115 |
+
per_word_topics=True)
|
116 |
+
coherence_model_lda = CoherenceModel(model=lda_model, texts=df['lemma_tokens'], dictionary=id2word, coherence='c_v')
|
117 |
+
|
118 |
+
return coherence_model_lda.get_coherence()
|
119 |
+
|
120 |
+
def assignTopic(l):
|
121 |
+
maxTopic = max(l,key=itemgetter(1))[0]
|
122 |
+
return maxTopic
|
123 |
+
|
124 |
+
def get_topic_value(row, i):
|
125 |
+
if len(row) == 1:
|
126 |
+
return row[0][1]
|
127 |
+
else:
|
128 |
+
return row[i][1]
|
129 |
+
|
130 |
+
def dataframeProcessing(dataset):
|
131 |
+
# Opening JSON file
|
132 |
+
f = open('stopwords-tl.json')
|
133 |
+
tlStopwords = json.loads(f.read())
|
134 |
+
stopwords = set(STOPWORDS)
|
135 |
+
stopwords.update(tlStopwords)
|
136 |
+
stopwords.update(['na', 'sa', 'ko', 'ako', 'ng', 'mga', 'ba', 'ka', 'yung', 'lang', 'di', 'mo', 'kasi'])
|
137 |
+
|
138 |
+
global df
|
139 |
+
df = pd.read_csv(dataset + '.csv')
|
140 |
+
df.rename(columns = {'tweet':'original_tweets'}, inplace = True)
|
141 |
+
df = df.apply(lambda row: row[df['language'].isin(['en'])])
|
142 |
+
df.reset_index(inplace=True)
|
143 |
+
|
144 |
+
# Apply the function above and get tweets free of emoji's
|
145 |
+
call_emoji_free = lambda x: give_emoji_free_text(x)
|
146 |
+
|
147 |
+
# Apply `call_emoji_free` which calls the function to remove all emoji's
|
148 |
+
df['emoji_free_tweets'] = df['original_tweets'].apply(call_emoji_free)
|
149 |
+
|
150 |
+
#Create a new column with url free tweets
|
151 |
+
df['url_free_tweets'] = df['emoji_free_tweets'].apply(url_free_text)
|
152 |
+
|
153 |
+
# Load spacy
|
154 |
+
# Make sure to restart the runtime after running installations and libraries tab
|
155 |
+
nlp = spacy.load('en_core_web_lg')
|
156 |
+
|
157 |
+
# Tokenizer
|
158 |
+
tokenizer = Tokenizer(nlp.vocab)
|
159 |
+
|
160 |
+
|
161 |
+
# Custom stopwords
|
162 |
+
custom_stopwords = ['hi','\n','\n\n', '&', ' ', '.', '-', 'got', "it's", 'it’s', "i'm", 'i’m', 'im', 'want', 'like', '$', '@']
|
163 |
+
|
164 |
+
|
165 |
+
# Customize stop words by adding to the default list
|
166 |
+
STOP_WORDS = nlp.Defaults.stop_words.union(custom_stopwords)
|
167 |
+
|
168 |
+
# ALL_STOP_WORDS = spacy + gensim + wordcloud
|
169 |
+
ALL_STOP_WORDS = STOP_WORDS.union(SW).union(stopwords)
|
170 |
+
|
171 |
+
|
172 |
+
tokens = []
|
173 |
+
STOP_WORDS.update(stopwords)
|
174 |
+
|
175 |
+
for doc in tokenizer.pipe(df['url_free_tweets'], batch_size=500):
|
176 |
+
doc_tokens = []
|
177 |
+
for token in doc:
|
178 |
+
if token.text.lower() not in STOP_WORDS:
|
179 |
+
doc_tokens.append(token.text.lower())
|
180 |
+
tokens.append(doc_tokens)
|
181 |
+
|
182 |
+
# Makes tokens column
|
183 |
+
df['tokens'] = tokens
|
184 |
+
|
185 |
+
# Make tokens a string again
|
186 |
+
df['tokens_back_to_text'] = [' '.join(map(str, l)) for l in df['tokens']]
|
187 |
+
|
188 |
+
def get_lemmas(text):
|
189 |
+
'''Used to lemmatize the processed tweets'''
|
190 |
+
lemmas = []
|
191 |
+
|
192 |
+
doc = nlp(text)
|
193 |
+
|
194 |
+
# Something goes here :P
|
195 |
+
for token in doc:
|
196 |
+
if ((token.is_stop == False) and (token.is_punct == False)) and (token.pos_ != 'PRON'):
|
197 |
+
lemmas.append(token.lemma_)
|
198 |
+
|
199 |
+
return lemmas
|
200 |
+
|
201 |
+
df['lemmas'] = df['tokens_back_to_text'].apply(get_lemmas)
|
202 |
+
|
203 |
+
# Make lemmas a string again
|
204 |
+
df['lemmas_back_to_text'] = [' '.join(map(str, l)) for l in df['lemmas']]
|
205 |
+
|
206 |
+
# Apply tokenizer
|
207 |
+
df['lemma_tokens'] = df['lemmas_back_to_text'].apply(tokenize)
|
208 |
+
|
209 |
+
# Create a id2word dictionary
|
210 |
+
global id2word
|
211 |
+
id2word = Dictionary(df['lemma_tokens'])
|
212 |
+
|
213 |
+
# Filtering Extremes
|
214 |
+
id2word.filter_extremes(no_below=2, no_above=.99)
|
215 |
+
print(len(id2word))
|
216 |
+
|
217 |
+
# Creating a corpus object
|
218 |
+
corpus = [id2word.doc2bow(d) for d in df['lemma_tokens']]
|
219 |
+
|
220 |
+
lda_model = gensim.models.ldamodel.LdaModel(corpus=corpus,
|
221 |
+
id2word=id2word,
|
222 |
+
num_topics=5,
|
223 |
+
random_state=100,
|
224 |
+
chunksize=200,
|
225 |
+
passes=10,
|
226 |
+
per_word_topics=True)
|
227 |
+
|
228 |
+
pprint(lda_model.print_topics())
|
229 |
+
doc_lda = lda_model[corpus]
|
230 |
+
|
231 |
+
coherence_model_lda = CoherenceModel(model=lda_model, texts=df['lemma_tokens'], dictionary=id2word, coherence='c_v')
|
232 |
+
coherence_lda = coherence_model_lda.get_coherence()
|
233 |
+
|
234 |
+
model_list, coherence_values = compute_coherence_values(dictionary=id2word, corpus=corpus,
|
235 |
+
texts=df['lemma_tokens'],
|
236 |
+
start=2,
|
237 |
+
limit=10,
|
238 |
+
step=1)
|
239 |
+
|
240 |
+
k_max = max(coherence_values)
|
241 |
+
|
242 |
+
global num_topics
|
243 |
+
num_topics = coherence_values.index(k_max) + 2
|
244 |
+
|
245 |
+
lda_model = gensim.models.ldamodel.LdaModel(corpus=corpus,
|
246 |
+
id2word=id2word,
|
247 |
+
num_topics=num_topics,
|
248 |
+
random_state=100,
|
249 |
+
chunksize=200,
|
250 |
+
passes=10,
|
251 |
+
per_word_topics=True)
|
252 |
+
|
253 |
+
grid = {}
|
254 |
+
grid['Validation_Set'] = {}
|
255 |
+
|
256 |
+
alpha = [0.05, 0.1, 0.5, 1, 5, 10]
|
257 |
+
|
258 |
+
beta = [0.05, 0.1, 0.5, 1, 5, 10]
|
259 |
+
|
260 |
+
num_of_docs = len(corpus)
|
261 |
+
corpus_sets = [gensim.utils.ClippedCorpus(corpus, int(num_of_docs*0.75)),
|
262 |
+
corpus]
|
263 |
+
corpus_title = ['75% Corpus', '100% Corpus']
|
264 |
+
model_results = {'Validation_Set': [],
|
265 |
+
'Alpha': [],
|
266 |
+
'Beta': [],
|
267 |
+
'Coherence': []
|
268 |
+
}
|
269 |
+
if 1 == 1:
|
270 |
+
pbar = tqdm.tqdm(total=540)
|
271 |
+
|
272 |
+
for i in range(len(corpus_sets)):
|
273 |
+
for a in alpha:
|
274 |
+
for b in beta:
|
275 |
+
cv = compute_coherence_values2(corpus=corpus_sets[i], dictionary=id2word, k=num_topics, a=a, b=b)
|
276 |
+
model_results['Validation_Set'].append(corpus_title[i])
|
277 |
+
model_results['Alpha'].append(a)
|
278 |
+
model_results['Beta'].append(b)
|
279 |
+
model_results['Coherence'].append(cv)
|
280 |
+
|
281 |
+
pbar.update(1)
|
282 |
+
pd.DataFrame(model_results).to_csv('lda_tuning_results_new.csv', index=False)
|
283 |
+
pbar.close()
|
284 |
+
|
285 |
+
params_df = pd.read_csv('lda_tuning_results_new.csv')
|
286 |
+
params_df = params_df[params_df.Validation_Set == '100% Corpus']
|
287 |
+
params_df.reset_index(inplace=True)
|
288 |
+
|
289 |
+
max_params = params_df.loc[params_df['Coherence'].idxmax()]
|
290 |
+
max_coherence = max_params['Coherence']
|
291 |
+
max_alpha = max_params['Alpha']
|
292 |
+
max_beta = max_params['Beta']
|
293 |
+
|
294 |
+
lda_model_final = gensim.models.ldamodel.LdaModel(corpus=corpus,
|
295 |
+
id2word=id2word,
|
296 |
+
num_topics=7,
|
297 |
+
random_state=100,
|
298 |
+
chunksize=200,
|
299 |
+
passes=10,
|
300 |
+
alpha=max_alpha,
|
301 |
+
eta=max_beta,
|
302 |
+
per_word_topics=True)
|
303 |
+
|
304 |
+
coherence_model_lda = CoherenceModel(model=lda_model_final, texts=df['lemma_tokens'], dictionary=id2word,
|
305 |
+
coherence='c_v')
|
306 |
+
coherence_lda = coherence_model_lda.get_coherence()
|
307 |
+
|
308 |
+
lda_topics = lda_model_final.show_topics(num_words=10)
|
309 |
+
|
310 |
+
topics = []
|
311 |
+
filters = [lambda x: x.lower(), strip_punctuation, strip_numeric]
|
312 |
+
lda_topics_string = ''
|
313 |
+
|
314 |
+
for topic in lda_topics:
|
315 |
+
print(topic)
|
316 |
+
lda_topics_string += 'Topic ' + str(topic[0]) + '\n' + str(topic[1]) + '\n\n'
|
317 |
+
topics.append(preprocess_string(topic[1], filters))
|
318 |
+
|
319 |
+
df['topic'] = [sorted(lda_model_final[corpus][text][0]) for text in range(len(df['original_tweets']))]
|
320 |
+
|
321 |
+
def sort_topics(l):
|
322 |
+
return(sorted(l, key=lambda x: x[1], reverse=True))
|
323 |
+
|
324 |
+
df['topic'] = df['topic'].apply(sort_topics)
|
325 |
+
df['topic_string'] = df['topic'].astype(str)
|
326 |
+
|
327 |
+
|
328 |
+
df = df[df['topic'].map(lambda d: len(d)) > 0]
|
329 |
+
df['topic'][0]
|
330 |
+
|
331 |
+
df['max_topic'] = df['topic'].map(lambda row: assignTopic(row))
|
332 |
+
|
333 |
+
topic_clusters = []
|
334 |
+
for i in range(num_topics):
|
335 |
+
topic_clusters.append(df[df['max_topic'].isin(([i]))])
|
336 |
+
topic_clusters[i] = topic_clusters[i]['original_tweets'].tolist()
|
337 |
+
|
338 |
+
for i in range(len(topic_clusters)):
|
339 |
+
tweets = df.loc[df['max_topic'] == i]
|
340 |
+
tweets['topic'] = tweets['topic'].apply(lambda x: get_topic_value(x, i))
|
341 |
+
# tweets['topic'] = [row[i][1] for row in tweets['topic']]
|
342 |
+
tweets_sorted = tweets.sort_values('topic', ascending=False)
|
343 |
+
tweets_sorted.drop_duplicates(subset=['original_tweets'])
|
344 |
+
rep_tweets = tweets_sorted['original_tweets']
|
345 |
+
rep_tweets = [*set(rep_tweets)]
|
346 |
+
print('Topic ', i)
|
347 |
+
print(rep_tweets[:5])
|
348 |
+
|
349 |
+
output_df = df[['topic_string', 'original_tweets']].copy()
|
350 |
+
|
351 |
+
return lda_topics_string, output_df
|
352 |
+
|
353 |
+
def greet(name):
|
354 |
+
return "Hello " + name + "!!"
|
355 |
+
|
356 |
+
iface = gr.Interface(fn=dataframeProcessing,
|
357 |
+
inputs=gr.Dropdown(["katip-december",
|
358 |
+
"katipunan-december",
|
359 |
+
"bgc-december",
|
360 |
+
"bonifacio global city-december"],
|
361 |
+
label="Dataset"),
|
362 |
+
outputs=["text",
|
363 |
+
gr.Dataframe(headers=['topic_string', 'original_tweets'])])
|
364 |
+
iface.launch()
|
cubao.csv
ADDED
@@ -0,0 +1,236 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
,tweet
|
2 |
+
0,@HorizonSpaCubao @BaldzyP Yey! Cubao na ulit. I should try him na! 🌟🙌🏻
|
3 |
+
1,Renan available in cubao branch for walk in https://t.co/FbELfVlla3
|
4 |
+
2,"galang gala ako today, pauwi pa lang me HAJDBWKFJS araneta cubao lang usapan hanggang sa nakarating rin kami sa moa 😭"
|
5 |
+
3,New york cubao 😌
|
6 |
+
4,"anybody interested for on festival off concert tickets?
|
7 |
+
|
8 |
+
3 ub tickets (same rows)
|
9 |
+
- will be selling same price
|
10 |
+
- can meet up in cubao/ortigas/makati
|
11 |
+
|
12 |
+
dm if interested 😊"
|
13 |
+
5,See you bukas cubao ARAT MEET💦 DM ME‼️ https://t.co/M5Ag7E46Yr
|
14 |
+
6,"Wts lfb ph
|
15 |
+
|
16 |
+
A1 Twenty Five Live in Manila
|
17 |
+
Discounted Tickets for sale
|
18 |
+
|
19 |
+
- pwede ipalalamove now yung ticket free sf or meetup tomorrow sa 15th avenue cubao at 9am :)
|
20 |
+
|
21 |
+
🎫 Orchestra B left
|
22 |
+
🎰 4 slots only https://t.co/HiYVUVbg80"
|
23 |
+
7,Ok fully paid na ako for Batanes trip. Kailangan ko na lang mamalimos ng pocket money sa tulay ng Cubao. Emi 🥲🥹
|
24 |
+
8,ako na gusto patirin si coc nung sep. 15 dahil paikot ikot sa cubao kaso ang laki nya🥹 https://t.co/CyOYRx2sI2
|
25 |
+
9,@heyyannahh bukas ata meron sa cubao
|
26 |
+
10,"aala na ba talaga cubao side trip
|
27 |
+
#altercubao #alter #jakolph #zoomjakol #alterbagets"
|
28 |
+
11,@hump_fri @oohhhkaye @BembangBiik Alam na alam mo ang cubao ano? Wag kana mahiya matagal ng buking yan panahon pa ni marimar roxas
|
29 |
+
12,@DamascusBlade Oh YES. Arizona and the Rapture Royalties queens know how to lip sync in an unhinged manner so well. That’s Cubao drag 🥹🥹🥹
|
30 |
+
13,"quickie goodcatch bagets here cubao area only dm me
|
31 |
+
#altercubao #alter #jakolph #zoomjakol #alterbagets https://t.co/bDW2m80wvh"
|
32 |
+
14,"quickie jaks cubao quickie hmu bagets here
|
33 |
+
#altercubao #alter #jakolph #zoomjakol #alterbagets"
|
34 |
+
15,ang sarap magikot sa cubao HUHU
|
35 |
+
16,"jakol quickie cubao area dm me bagets here kattapos lang mag gym
|
36 |
+
#altercubao #alter #jakolph #zoomjakol #alterbagets"
|
37 |
+
17,"cubao area omly quickie hmu bagets here
|
38 |
+
#altercubao #alter #jakolph #zoomjakol #alterbagets"
|
39 |
+
18,"785 Aurora Blvd Cubao QC
|
40 |
+
☎️ 09054471994
|
41 |
+
📞 09998864822
|
42 |
+
|
43 |
+
Marvin https://t.co/E5uAUO0Ahn"
|
44 |
+
19,"cubao quickie hmu bagets here
|
45 |
+
#altercubao #alter #jakolph #zoomjakol #alterbagets"
|
46 |
+
20,"sino nasa cubao side trip dm me @
|
47 |
+
#alterbagets #alterph #altercubao #alterqc #zoomjakol #jakolph #jakol"
|
48 |
+
21,#AtinIto #WestPhSea EDSA Cubao North Bound https://t.co/NHPrJ9kF9i
|
49 |
+
22,tataxi sana ako pauwi dito sa cubao pero natatakot ako ahakajs
|
50 |
+
23,@avesouls heeseung cubao
|
51 |
+
24,@Mang_asar3 @oohhhkaye @BembangBiik Parang mas familiar ka? HAHAHAHA. Galing ka siguro don ano? HAHAHAHAHAHA. Alam ko lang sa Cubao ay terminal ng Van. 🤣
|
52 |
+
25,malupit chismis ko guys cubao gaming to
|
53 |
+
26,@joesalad90 HELLO San sa Cubao
|
54 |
+
27,"@SelectaPH sa ibang lugar po ang daming holo, dito sa cubao ang ilap 😭😭😭😭😭😭😭😭🤧🤧🤧🤧"
|
55 |
+
28,"sino asa cubao ngayon na up for quickie hmu
|
56 |
+
|
57 |
+
#alterqc #jakolph #altercubao #zoomjakol #jakol #alterjakol"
|
58 |
+
29,"Sup katatapos lang mag gym sino g side trip gateway cubao dm me
|
59 |
+
|
60 |
+
#alterqc #jakolph #altercubao #zoomjakol #jakol #alterjakol"
|
61 |
+
30,side trip gateway cubao#alterqc #jakolph #altercubao #zoomjakol #jakol #alterjakol
|
62 |
+
31,Sino asa cubao ngayin #alterqc #jakolph #altercubao #zoomjakol #jakol #alterjakol
|
63 |
+
32,HUWAG TULARAN FROM LAGUNA DUMAYO SA CUBAO DAPAT ANG HALFWAY AY SA SM MEGAMALL CHAROT https://t.co/KlMWhnmC77
|
64 |
+
33,"katatapos lang mag gym sino g side trip gateway cubao dm me
|
65 |
+
|
66 |
+
#alterqc #jakolph #altercubao #zoomjakol #jakol #alterjakol"
|
67 |
+
34,@jenjaemlovr sa cubao lang ?!
|
68 |
+
35,All I can say is fuck cubao ❤️
|
69 |
+
36,Sino asa cubao now pasabit
|
70 |
+
37,"WTS- LFB
|
71 |
+
|
72 |
+
Vinci PC-300
|
73 |
+
Kyler SID - 270
|
74 |
+
Winston ID picture -100
|
75 |
+
|
76 |
+
meet cubao farmers/araneta tom only
|
77 |
+
check out on shopee https://t.co/AQlFdC7oaK"
|
78 |
+
38,all roads lead to Cubao anw
|
79 |
+
39,hindi alam pa’no umuwi —> pumuntang cubao
|
80 |
+
40,walking new york cubao https://t.co/XgDSGc00im
|
81 |
+
41,"785 Aurora Blvd Cubao QC
|
82 |
+
☎️ 09054471994
|
83 |
+
📞 09998864822
|
84 |
+
|
85 |
+
Jelson https://t.co/87N3vsd01I"
|
86 |
+
42,"785 Aurora Blvd Cubao QC
|
87 |
+
☎️ 09054471994
|
88 |
+
📞 09998864822 https://t.co/JJmwNbFOh3"
|
89 |
+
43,"785 Aurora Blvd Cubao QC
|
90 |
+
☎️ 09054471994
|
91 |
+
📞 09998864822 https://t.co/gSfKK8v6wn"
|
92 |
+
44,"785 Aurora Blvd Cubao QC
|
93 |
+
☎️ 09054471994
|
94 |
+
📞 09998864822
|
95 |
+
|
96 |
+
Jordan 🔥 Matt https://t.co/4q7EP8j8I1"
|
97 |
+
45,"785 Aurora Blvd Cubao QC
|
98 |
+
☎️ 09054471994
|
99 |
+
📞 09998864822 https://t.co/SZfgoP1bP2"
|
100 |
+
46,"785 Aurora Blvd Cubao QC
|
101 |
+
☎️ 09054471994
|
102 |
+
📞 09998864822
|
103 |
+
|
104 |
+
Balmond https://t.co/L0XwCdTZLK"
|
105 |
+
47,Makabiyahe nga may event pala ngayon sa cubao andun si boss Park Seo-joon hahaha
|
106 |
+
48,Namimiss ko panoorin yung busy streets nang Araneta - Cubao 🥺
|
107 |
+
49,"ZB1 100 DAYS OF DEBUT!
|
108 |
+
|
109 |
+
Freebies for tomorrow's event at Cubao, QC. Just approach me when you see me ☺️
|
110 |
+
|
111 |
+
See you there zeroses! 🌹😘 https://t.co/rYhF3YPsuL https://t.co/OAoOxB1TAo"
|
112 |
+
50,@srslyclrsa IM STILL EMO NASA CUBAO EXPO AKO
|
113 |
+
51,"Nag punta ako ng cubao para mapagod, kasi yun yung goal! Mapagod para makatulog agad😭"
|
114 |
+
52,Cubao ibabaw 🤭 https://t.co/qj1WXQCPJm
|
115 |
+
53,Tara trip with pops. Timog-Morato-Cubao area
|
116 |
+
54,The Royal Class bus is set to take its first batch of trips this October. https://t.co/rDmAJIkz2y https://t.co/rCYWIkUBRV
|
117 |
+
55,"@theskinsensei After OBar, isunod mo Doc sa list mo ng puluntahan ang Rapture sa Cubao 😁"
|
118 |
+
56,cubao now
|
119 |
+
57,"@ army friends/moots na naghahanap ng pupuntahan na cse, cubao area lang 'to oh~ very accessible 😁"
|
120 |
+
58,@woosanstation gago ka pag balik ko ng cubao humanda ka
|
121 |
+
59,"METROBASE Traffic Update as of 7:00PM
|
122 |
+
|
123 |
+
WHEN: October 14, 2023
|
124 |
+
|
125 |
+
A. MAJOR ROADS:
|
126 |
+
|
127 |
+
*EDSA
|
128 |
+
SOUTHBOUND
|
129 |
+
- New york Cubao to Ortigas split, moderate to slow moving (due to volume of vehicles)
|
130 |
+
- Ortigas to Shaw split, moderate to slow moving (due to volume of vehicles)
|
131 |
+
- Boni, moderate to… https://t.co/1FcX7f3TKr https://t.co/bPqgHRN4vR"
|
132 |
+
60,"🖤 #PJMFreed 🤍
|
133 |
+
|
134 |
+
🧋 A FREE CUPSLEEVE EVENT CELEBRATING PARK JIMIN'S BIRTHDAY
|
135 |
+
|
136 |
+
🗓 October 21, 2023 (Saturday)
|
137 |
+
📍 Drink Lab Café, Araneta City (Cubao)
|
138 |
+
⏰ 12:00 NN - 6:30 PM
|
139 |
+
|
140 |
+
✨ 60 SLOTS ONLY ✨
|
141 |
+
|
142 |
+
✍🏼 REGISTER HERE: https://t.co/7sKcmAcecG https://t.co/F7rqDAlrRi"
|
143 |
+
61,I was the one with the La Luna Bag and She was the one with the Ken Bag and SB19 pins. She was with her niece and their from Montalban I am from San Mateo. Their going to vist her 2 children going to cavite while I was going to my School STI Cubao. +++
|
144 |
+
62,"To Ate Em,
|
145 |
+
|
146 |
+
If ever you read this, dito sa twt Thank you po for talking to me 🥹
|
147 |
+
I had a bad week, dahil sa acads and personal life circumstances happening. Talking to you while on the way to cubao in the mini bus, made things lighter for me. +++ https://t.co/N63zbS21h7"
|
148 |
+
63,"@khgbbovely Yep, yep. 'Yung sinuot mo sa Cubao right??? They both looked cute on you. 😊😊"
|
149 |
+
64,"Uwi na tayo..
|
150 |
+
Walang ganap dito sa cubao.."
|
151 |
+
65,@Carl_Cubao Hala d nman ako cute 😁
|
152 |
+
66,"My weekend the past weeks/months:
|
153 |
+
Nagpunta sa bgc > megamall > araneta cubao > moa to paranaque > mckinley > katipunan.
|
154 |
+
|
155 |
+
🔜 office to megamall for team bowling > angeles > sangkyu cse > bgc with couz > chanyeol cse(?) > rizal for tb 😂😂😂 isa lang massabe ko: d na nakapagdiet. https://t.co/Nj39UfViwM"
|
156 |
+
67,Dreamland hotel cubao🍆🍆🍆🍆🍆😎 https://t.co/BXpYvcobJV
|
157 |
+
68,@hwajoongyu HAHAHA TRUE 😭😭😭 hanggang baclaran lang kasi to Cubao Ave ang ini-MRT ko eh HAHAHHAHA
|
158 |
+
69,San goods magpa-massage? near Cubao sana huhu badly needed rn
|
159 |
+
70,"@hump_fri @oohhhkaye @BembangBiik Binuking mo nanaman ang troll farm nyo mga Ulahh Loooo Tah Bhu Ghuk 😂😂😂😂😂😂
|
160 |
+
|
161 |
+
Yan ba ang nasa Cubao? 😂😂😂😂😂"
|
162 |
+
71,"𝗙𝗢𝗥 𝗠𝗔𝗦𝗦𝗔𝗚𝗘 𝗕𝗢𝗢𝗞𝗜𝗡𝗚𝗦 & 𝗜𝗡𝗤𝗨𝗜𝗥𝗜𝗘𝗦, 𝗖𝗔𝗟𝗟 or 𝗧𝗘𝗫𝗧 our 𝗙𝗥𝗢𝗡𝗧 𝗗𝗘𝗦𝗞: 𝟬𝟵𝟭𝟵𝟬𝟵𝟵𝟳𝟬𝟰𝟱 | 𝟴𝟮𝟰𝟭𝟮𝟬𝟱𝟭.
|
163 |
+
|
164 |
+
𝗪𝗘𝗟𝗟𝗡𝗘𝗦𝗦𝗩𝗜𝗟𝗟𝗘 𝗠𝗮𝘀𝘀𝗮𝗴𝗲 𝗦𝗽𝗮
|
165 |
+
Unit 1, 88 New York Avenue, Cubao, Quezon City, Philippines https://t.co/Whos7dsyj8"
|
166 |
+
72,kabado bente kanina sa cubao pls bat kamuka ni jay :DD
|
167 |
+
73,"baka may pa free tix po kau jan, dito po lang po ako pakalat kalat sa cubao 🥹🥹🥹
|
168 |
+
|
169 |
+
park seo jun park seo joon in manila
|
170 |
+
|
171 |
+
#parkseojun #parkseojunmanila"
|
172 |
+
74,"@skz_ldl Sayang di tayo nag-abot haha! Dumaan lang ako, papunta ako Cubao, nood ng concert hihi"
|
173 |
+
75,Park Seo Jun caused so much traffic here in Cubao 😄 pero ok lang siya naman yun eh #AranetaCity
|
174 |
+
76,"baka may pa free tix po kau jan, dito po lang po ako pakalat kalat sa cubao 🥲🥲🥲
|
175 |
+
|
176 |
+
park seo jun park seo joon in manila
|
177 |
+
|
178 |
+
#parkseojun #parkseojunmanila https://t.co/vo2XwOWQep"
|
179 |
+
77,"baka may pa free tix po kau jan, dito po lang po ako pakalat kalat sa cubao 🥲🥲
|
180 |
+
|
181 |
+
park seo jun park seo joon in manila
|
182 |
+
|
183 |
+
#parkseojun #parkseojunmanila"
|
184 |
+
78,bakit ba kung sino-sino napapadpad sa cubao😭😭😭😭
|
185 |
+
79,"📋𝐁𝐨𝐨𝐤 𝐚 𝐌𝐀𝐒𝐒𝐀𝐆𝐄 𝐧𝐨𝐰.
|
186 |
+
|
187 |
+
Looking for sure Client 😊
|
188 |
+
✅ giving you good and Relaxing Massage😘
|
189 |
+
|
190 |
+
-May Service Clients from:
|
191 |
+
✅Pasay ✅Taguig ✅Cubao ✅ Mandaluyong
|
192 |
+
✅ BGC ✅Guadalupe ✅ Recto ✅ Pasig, and more
|
193 |
+
Messags or call
|
194 |
+
09292367409 https://t.co/TtyvIlK7wt"
|
195 |
+
80,"@Yujiseph If around cubao, Novotel is good.
|
196 |
+
|
197 |
+
If around manila, I'm not as familiar eh, but Bayview and manila hotel are nice.
|
198 |
+
|
199 |
+
These are the pricey ones tho. I haven't tried bnbs around the area but for sure it will be more cost effective"
|
200 |
+
81,Cubao Gateway now 📍 tara good s*x 💦
|
201 |
+
82,alam kong kakatapos lang kmagic pero may cse ba for gyeom around cubao? 🥹🥹
|
202 |
+
83,wala ba ads sa cubao si hee? 🙇
|
203 |
+
84,This Mcdo store in Cubao has been playing 6 Mariah Carey songs already. I know the one in charge of the songs is a gay. You can't convince either 💀😭
|
204 |
+
85,"Kuya Konduktor: /nag aaya ng pasahero pag Riverbanks at Cubao/ Tara, tara, tara! Sa Riverbanks na tara, marami rides doon!
|
205 |
+
Me na sa Rizal bababa: 😒😒😒😒"
|
206 |
+
86,"baka may pa free tix po kau jan, dito po lang po ako pakalat kalat sa cubao 🥲🥲🥲
|
207 |
+
|
208 |
+
park seo jun park seo joon in manila
|
209 |
+
|
210 |
+
#parkseojun #parkseojunmanila https://t.co/jS3eB05NLO"
|
211 |
+
87,Perfect shirt for Cubao Expo. 💜💜💜 #IU #아이유 #leejieun https://t.co/grwTBNhkpq
|
212 |
+
88,Hanapin kita dito sa Gateway Cubao. 😂 https://t.co/iIvss1prvX
|
213 |
+
89,"at cubao, waiting umandar ang bus https://t.co/nnkBMv9T5R"
|
214 |
+
90,@HorizonSpaCubao @BaldzyP Nice bumalik na si Nathan sa Cubao Branch. ☺️❤️
|
215 |
+
91,bakit puro senior citizen sa tropical hut cubao?
|
216 |
+
92,hello vinyl friends this vintage record store in cubao ex is on a 50% clearance sale! https://t.co/IH6UUPn1hR
|
217 |
+
93,@eunoyah lika na sa cubao
|
218 |
+
94,Where to go after Cubao? https://t.co/aMGGOfRE6W
|
219 |
+
95,Dito siya sa cubao baka sakaling makita si park seojun gumagala char HAHHAHAHAHA https://t.co/0x7rUCFqJb
|
220 |
+
96,"PTPA‼️
|
221 |
+
GOOD DAY PO IM MARCO A FREELANCE MASSEUR
|
222 |
+
|
223 |
+
AVAILABLE FOR HOME AND HOTEL SERVICE ONLY NO PLACE
|
224 |
+
|
225 |
+
•CAVITE
|
226 |
+
•PASAY
|
227 |
+
•MANILA
|
228 |
+
•CUBAO
|
229 |
+
•ETC
|
230 |
+
|
231 |
+
DM ME https://t.co/Vset9YYcAr"
|
232 |
+
97,"Ang astig ng sleeper bus aka #TheRoyalClass bus ng @VictoryLinerInc! Tara na’t mag-Baguio👋😃
|
233 |
+
|
234 |
+
https://t.co/2pHJTO81yb"
|
235 |
+
98,May susunod na sa character count ng Justice Lourdes Paredes San Diego Avenue (dating Main Avenue) sa Cubao. 👀 https://t.co/lgcqKzzNhM
|
236 |
+
99,@coderinthebox Hahahaha guni2 lang ni @alexanderrieta1 yan feeling nasa ibang bansa pero troll na Pah Tay Go Tom lang sa cubao ang Ulahh Loooo Tah Bhu Ghuk 😂😂😂😂
|
requirements.txt
CHANGED
@@ -9,6 +9,10 @@ chart_studio
|
|
9 |
autopep8
|
10 |
transformers
|
11 |
sentencepiece
|
12 |
-
bert-extractive-summarizer
|
13 |
tqdm
|
14 |
-
wordcloud
|
|
|
|
|
|
|
|
|
|
|
|
9 |
autopep8
|
10 |
transformers
|
11 |
sentencepiece
|
|
|
12 |
tqdm
|
13 |
+
wordcloud
|
14 |
+
gensim
|
15 |
+
transformers
|
16 |
+
googletrans==3.1.0a0
|
17 |
+
bertopic flair
|
18 |
+
scikit-learn==1.1.0
|
taft.csv
ADDED
@@ -0,0 +1,207 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
,tweet
|
2 |
+
0,@Pullma_Pfanger @Cushdog5 @mmpadellan Outdated and false information. It’s been confirmed. https://t.co/SlYORP8yyo
|
3 |
+
1,"Ewan ko, I usually buy food or treat my oomf somewhere to eat or relax.
|
4 |
+
I mean, dude, trimming and cleaning is not a joke. Not a reward but just to show them I appreciate them and their efforts.
|
5 |
+
|
6 |
+
PS. Di po sya taga Lasalle Taft pero dumayo
|
7 |
+
|
8 |
+
char 🤣 https://t.co/ykSrI0305v"
|
9 |
+
2,"📅 Today's @ 3:00 PM
|
10 |
+
Taft School vs. Choate Rosemary Hall School - Prep Schools High School Football
|
11 |
+
https://t.co/9BMGat7pqb
|
12 |
+
🎟️🎥Watch Now➠ https://t.co/5XLM3ckm43
|
13 |
+
https://t.co/9fSx3lyegG
|
14 |
+
sdslzdlgfd"
|
15 |
+
3,Ano hmm punta ka na taft potekk
|
16 |
+
4,galing talaga ng la salle taft one week na down pa din website nila angaling. 🤦🤦🤦
|
17 |
+
5,"28’ | Taft makes an important block after Richards had shifted into a yard of space to shoot.
|
18 |
+
|
19 |
+
🐯 1-1 ⚪️"
|
20 |
+
6,"📅 Today's @ 2:45 PM
|
21 |
+
Worcester Academy vs. Taft School - Prep Schools High School Boys Soccer
|
22 |
+
https://t.co/7ME6E6KoXu
|
23 |
+
🎟️🎥Watch Live➠ https://t.co/Sr46vANKOH
|
24 |
+
https://t.co/oUfMTqWyPe
|
25 |
+
cemcpsdzqg"
|
26 |
+
7,@dave_taft @markknoller The FCC cannot control the editorial decisions of local broadcast stations under the First Amendment
|
27 |
+
8,Taft Bike Club with the Mayor!!! @NISDTaft #wegopublic https://t.co/p0NpgUp5c8
|
28 |
+
9,grabe sobrang traffic sa taft and walang masakyan kanina huhu
|
29 |
+
10,@ProjectLincoln I'm wondering which president was fatter. @realDonaldTrump or William Howard Taft at 300 lbs?
|
30 |
+
11,"1’ | Ward forces an early corner from Taft.
|
31 |
+
|
32 |
+
🐯 0-0 ⚪️"
|
33 |
+
12,@_amdej tara lipat sa taft
|
34 |
+
13,"@ProjectLincoln Who wore it better, President Taft or President Daft? https://t.co/AB1BkHi6wt"
|
35 |
+
14,@tellitPAC @GovChristie We don’t need a Taft right now. Stop taking people’s money. He doesn’t stand a chance. He knows it and whatever organization this is knows it.
|
36 |
+
15,Taft Museum's new curator Ellen Roberts discusses plans for future exhibitions - Cincinnati Business Courier https://t.co/fa0qXvgAEN via @BusinessCourier #robertsduncanson
|
37 |
+
16,"Bequests of roughly $6 million each will support the endowments of the @CincyMuseum, @cincyartmuseum, and the Taft Museum of Art. https://t.co/kUtzWNHTrm"
|
38 |
+
17,Taft carfun anyone #altertaft
|
39 |
+
18,"Taft School vs. Choate Rosemary Hall School - Prep Schools High School Football
|
40 |
+
https://t.co/oIyPz3SpEA
|
41 |
+
🔴Live Now➠ https://t.co/ksSe0LNXLk
|
42 |
+
📅 Saturday, October 14, 2023 ⏰ 3:00 PM"
|
43 |
+
19,。:°ஐ went to a cafe in taft ! ࿐ ˚ . ✦ https://t.co/RtkMDptKlm
|
44 |
+
20,"Happy Birthday, Dwight D. Eisenhower! Here Hoover is seen with President Eisenhower at dedication ceremony of Bell Tower Memorial for Robert Taft. @ikelibrary (31-1959-17) #happybirthday https://t.co/XAymydrsIg"
|
45 |
+
21,"West Coast, Messed Coast™ — Kaepernick's Friend Is 'Rounding Up Jews' at Stanford
|
46 |
+
|
47 |
+
https://t.co/58ZeCGEpkD"
|
48 |
+
22,📍taft
|
49 |
+
23,@ysblldpn My conyo guy from Taft ☺️ (Taft b shea hayssss basta lozol)
|
50 |
+
24,"SPORTS: The DLSU Green Booters Team 1 blazed their way to the Ang Liga Season 19 Division 1 quarterfinals with a win-loss record of 3-1, following their victory over the CSB Blazers, 2-0, earlier tonight, October 14, at the UP Diliman Football pitch. https://t.co/gjQjoeZgB3"
|
51 |
+
25,"West Coast, Messed Coast™ — Kaepernick's Friend Is 'Rounding Up Jews' at Stanford https://t.co/N4H6DslTnn"
|
52 |
+
26,Andrea Mitchell say she misses the days when she could sit down for a whiskey & cigar with President Taft.
|
53 |
+
27,may nakasalubong ako around UN taft kanina na pogii 😭😭😭 i think he’s from NU based on his uniform since it’s blue hashaha basta nagka eye contact kami pota ang pogi like singkit sya naka salamin na moreno 😭
|
54 |
+
28,taft avenue sa gabi feeling ko nasa mv ako ni zild
|
55 |
+
29,"NISD WATER POLO BOYS AREA PLAYOFFS TODAY @ NISD SWIM CENTER & PSJA. GAMES @ 8, 9 & 1030 & 12. COME CATCH THE ACTION! @NISD @NISDBrandeis
|
56 |
+
@clark_swim
|
57 |
+
@HawksAquatics
|
58 |
+
@NISDHCHS
|
59 |
+
@Jay_Aquatics
|
60 |
+
@aquatics_oc
|
61 |
+
@taft_club
|
62 |
+
TICKETS: https://t.co/JladCMWctU
|
63 |
+
PSJA: https://t.co/owkfJ8LOHa https://t.co/RsEDAO2Pno"
|
64 |
+
30,"Worcester Academy vs. Taft School - Prep Schools High School Boys Soccer
|
65 |
+
https://t.co/BhK1dtd1sx
|
66 |
+
🔴Watch Now➠ https://t.co/mAZocrUeBF
|
67 |
+
📅 Saturday, October 14, 2023 ⏰ 2:45 PM"
|
68 |
+
31,@gary_taft Nice
|
69 |
+
32,idk kung paano ko isisingit ang birthday bash for hao pero ayoko pumayag na wala akong ganap for him so if you guys are inch pls lmk and need ko rin ng help sa pago-organize 👉👈🥹 jollibee kids party para pwede mag geng geng and this will be around taft lang !! https://t.co/FuLNnkAjmj
|
70 |
+
33,"Samantala, sunod na makahaharap ng Taft-based squad ang Ateneo Men’s Chess Team bukas, Oktubre 15, sa ganap na ika-1 ng hapon sa parehong lugar. https://t.co/XG5IvINvYO"
|
71 |
+
34,"LF / WTB HAMILTON PH TICKETS
|
72 |
+
|
73 |
+
3 HAMILTON tickets
|
74 |
+
no preferred seats (as long as seated together)
|
75 |
+
preferably physical ticket (but not required)
|
76 |
+
preferably meet up (taft area)"
|
77 |
+
35,Dudley Taft y Manu Lanvin mano a mano en Madrid https://t.co/1MogCn9Qbz @DudleyTaft @ManuLavin @blackmooncreep
|
78 |
+
36,"LF / WTB HAMILTON PH TICKETS
|
79 |
+
|
80 |
+
3 HAMILTON tickets
|
81 |
+
no preferred seats (as long as seated together)
|
82 |
+
preferably meet up (taft area)"
|
83 |
+
37,@MMDA Kelan kaya maaayos ang traffic sa EDSA corner Taft Avenue? Grabe ang loading and unloading sa kanto nito na sanhi ng heavy traffic.
|
84 |
+
38,"What are my turn-ons? Meet me on @Flirt4Free. I'll give you all the juicy details.
|
85 |
+
https://t.co/hJMwqFD9l3 https://t.co/jiUYcwWlGB"
|
86 |
+
39,"@taft_gvs plays against Choate under the 💡 on KJM Field
|
87 |
+
|
88 |
+
🦏⚽️ #taftathletics #gorhinos #gobigred #wearetaft https://t.co/h0agVh03ie"
|
89 |
+
40,"@threeinningfan Fascinating & informative! Was that William Howard Taft I saw in the Chief Justice position? Anyway, great job! 👍⚾️👍⚾️"
|
90 |
+
41,@MMDA Manila both lanes Taft hanggang Quiapo
|
91 |
+
42,"TODAY: Are ya ready kids? @TCTCincinnati presents The Spongebob Musical: Youth Edition at Taft Theatre 🌴 Get more info and tickets here: https://t.co/AiGNczxkQ3
|
92 |
+
|
93 |
+
2:00 PM SHOW
|
94 |
+
DOORS: 1:00 PM
|
95 |
+
SHOW: 2:00 PM
|
96 |
+
|
97 |
+
5:00PM SHOW
|
98 |
+
DOORS: 4:00 PM
|
99 |
+
SHOW: 5:00 PM https://t.co/gSh9HzJ7Ws"
|
100 |
+
43,Anybody in Taft?? Jk
|
101 |
+
44,taft people are wildd i swear
|
102 |
+
45,jgh from españa to taft for 2 FUCKING HOURS WTF
|
103 |
+
46,@babygurldex dont make me come all the way to taft eme
|
104 |
+
47,"Yung patience ng bf ko lord, please.
|
105 |
+
Akala mo naman di sanay na traffic sa taft"
|
106 |
+
48,"from Taft Manila, to Bulacan then to Araneta…what a very productive day you had and you made a lot of people happy!
|
107 |
+
|
108 |
+
#FiveBreakUpsAndARomance
|
109 |
+
#ALDENRichards
|
110 |
+
@aldenrichards02 https://t.co/eVL2YpdWjd"
|
111 |
+
49,"Travelled the entire length of Taft and headed straight to moa, all on foot. This is a new record for me. And my feet don’t even hurt yet :D"
|
112 |
+
50,"aaaa traffic wth, 2hrs na asa byahe pa taft lang ako ahshehahdhs"
|
113 |
+
51,may ukay ukay ba doon sa taft 🥹
|
114 |
+
52,Wtf may bok chicken sa taft?!?!
|
115 |
+
53,"SPORTS: DLSU Green Booters Team 1 successfully notched a win in their Taft derby match against the CSB Blazers, 2-0, in their Ang Liga Season 19 Division 1 matchup earlier tonight, October 14, at the UP Diliman Football Pitch. Isaac Anoh led the way with his first-half brace. https://t.co/tYOqTwh28L"
|
116 |
+
54,"Nasa MRT, Taft stationnn palangggg yawaaa"
|
117 |
+
55,"WTS LFB SMISKI
|
118 |
+
WTT LFT SMISKI
|
119 |
+
|
120 |
+
Have: SMISKI thinking
|
121 |
+
Want: SMISKI duck, daydreaming, nap time
|
122 |
+
Mop: gcash
|
123 |
+
Mod: flash/ggx/meetup
|
124 |
+
Loc: santa rosa laguna/taft
|
125 |
+
|
126 |
+
Fresh pull !!
|
127 |
+
With box and foil https://t.co/FWUQlbv8F7"
|
128 |
+
56,"Continuous STRIDE ⚡️
|
129 |
+
|
130 |
+
The DLSU Green Booters (12 pts) put on a statement after a hard-fought match against their Taft neighbor, the Benilde Blazers (5 pts), 1-0! 🏹
|
131 |
+
|
132 |
+
#GoLaSalle #AnimoLaSalle #DLSUSports https://t.co/Dnix6Qxyuw"
|
133 |
+
57,Walking the entire length of Taft otw to Moa https://t.co/G7lLIReYdP
|
134 |
+
58,Gago just witnessed a hostage taking dito aa taft 😭.
|
135 |
+
59,im @ taft na tf tf
|
136 |
+
60,@Suckerlingamist taft
|
137 |
+
61,@fewyearsearlier c hokkaido uni polsci blockm8 araw-araw ig story nasa ibang bansa tas boogsh mkakasalubong m somewhere sa taft tsaka si 🌹 lv bayong girlie na nagfeflex ng designer bags at credit cards sa mga di daw afford pero nakikiagaw ng scholarship stipend meganonnnn 🤣
|
138 |
+
62,sino nasa taft pahiram jacket pls andito ako sa projuice malapit na maging frozen parang hotdog
|
139 |
+
63,"ღ wts lfb ღ
|
140 |
+
|
141 |
+
🐨 koala - php 550 (payo php 500)
|
142 |
+
🐱 can also trade siamese to pig / lop ear hipper or same demand (nfs, wtt)
|
143 |
+
|
144 |
+
mop: gcash
|
145 |
+
mod: sdd or meet up around taft
|
146 |
+
|
147 |
+
🏷️ sonny angel wts koala reg wtt cat life series https://t.co/IdQxePA2MD"
|
148 |
+
64,https://t.co/8oGdk3fp5v goodcatch from taft 1on1 #zoomjakol
|
149 |
+
65,"SPORTS: DLSU Green Booters Team 1 take on Taft neighbor CSB Blazers at the UP Diliman Football Pitch today, October 14, in hopes of returning to winning form after their previous loss in the Ang Liga Season 19 Division 1. https://t.co/wdGsVPtEkv"
|
150 |
+
66,@cheylwse sa taft at katipunan po
|
151 |
+
67,"BATTLE OF TAFT ⚽️: The DLSU Green Booters are going up against familiar opponents, the CSB Blazers, as they look to get back up on the win column.
|
152 |
+
|
153 |
+
Follow @dlsusports for LIVE play-by-play updates! 🏹
|
154 |
+
|
155 |
+
#GoLaSalle #AnimoLaSalle"
|
156 |
+
68,"Taft School vs Choate Rosemary Hall School High School Football
|
157 |
+
🎥🔴Live Events: https://t.co/pQDPdf2IAz
|
158 |
+
Saturday, October 14 @ 3p.
|
159 |
+
@CRHFootball @goChoate @ChoateVBHockey @TaftFootballCT @TaftRhinos @TaftSchool @AndreCrews6 @TaftFootballCT https://t.co/xILtoS9Gls"
|
160 |
+
69,"@yeontoosiast Unnie Cafe, Taft! ✨"
|
161 |
+
70,"Granada Hills football runs through Taft – Daily News
|
162 |
+
https://t.co/pIuC865dxf"
|
163 |
+
71,"West Coast, Messed Coast™ — Kaepernick's Friend Is 'Rounding Up Jews' at Stanford https://t.co/4uD3XRRneW [How, pray tell, does this worthless subhuman continue to make his way into the news?! A mystery of life...]"
|
164 |
+
72,Granada Hills football runs through Taft https://t.co/9gJx5v3qKW
|
165 |
+
73,Granada Hills football runs through Taft https://t.co/7tvNTwuNJv
|
166 |
+
74,Granada Hills football runs through Taft https://t.co/Qk4459g3Mz
|
167 |
+
75,Granada Hills football runs through Taft https://t.co/GG9bEqzQqA
|
168 |
+
76,"bored in Taft
|
169 |
+
|
170 |
+
#altertaft #alterpasay #altervitocruz #alterquirino"
|
171 |
+
77,"@cyrilamarchand,@KhaitanCo,Taft Law, Foley & Lardner act on iLink Group's fundraising from TrueNorth
|
172 |
+
|
173 |
+
iLink Group has raised approximately ₹624.2 crore
|
174 |
+
https://t.co/cNL0KJpn7P"
|
175 |
+
78,"@PAGMETER Notice you have Wasco rank 9 in D5 they are 3-1 in SSL and will probably beat Taft and put them 4-1 SSL going into Shafter game
|
176 |
+
There Loses Are to
|
177 |
+
9-0 Tehachapi
|
178 |
+
7-1 Kennedy
|
179 |
+
6-2 Golden Valley
|
180 |
+
So what are chance’s they jump to D-4 playoff ?"
|
181 |
+
79,@altrmnl1 /*dumeretcho taft
|
182 |
+
80,argh bat 30 mins away pa yung dlsu taft sa monumento I wanna see ate Andrei
|
183 |
+
81,"Volvo 240 GLT
|
184 |
+
Altis 1.8 G ( manual version )
|
185 |
+
TAFT GT 4X4
|
186 |
+
YARIS 2004 J TYPE https://t.co/tZZpy7Iek8"
|
187 |
+
82,@vonderleyen Thank you Bill for Drei Wetter Taft
|
188 |
+
83,"@beinlibertarian I wonder if Taft, and JP Morgan had anything to do with that.. There was a lot of rich and powerful people on that boat. JP Morgan was even rumored to be expected on the boat, but he was not. This was the first 9-11. Shorty after in 1914 we go to war (WW1). https://t.co/lutOnwwVo9"
|
189 |
+
84,Who’s in taft tara?
|
190 |
+
85,who’s around taft rn 😗 g
|
191 |
+
86,@Taft_HS @TaftHSAthletics @CPLAthletics @network14cps @NadigNewspapers @ChiPubSchools Congratulations!!!
|
192 |
+
87,"Tomorrow’s Games:
|
193 |
+
|
194 |
+
2PM- (4-2) Sam Houston Hurricanes vs. (4-2) McCollum Cowboys @ Alamo Stadium
|
195 |
+
|
196 |
+
7PM- (4-2) Sotomayor Wildcats vs. (4-2) Taft Red Raiders @ Farris Stadium"
|
197 |
+
88,South Kern Sol | ‘Change is here in town’: Taft City School District is urged to pick maps Cherry or Plum in districting process https://t.co/P7vdafhf1V
|
198 |
+
89,"It’s final. Taft Union 34, Cesar Chavez 25"
|
199 |
+
90,"@Mormonger @benjaminpacini or even to defend them against their enemies.” (Ezra Taft Benson, America at the Crossroads, August 30, 1969.)"
|
200 |
+
91,"rainy weather in Taft = jakol ☔️😈🍆✊🏼💦
|
201 |
+
|
202 |
+
#altertaft #alterpasay #altervitocruz #alterquirino"
|
203 |
+
92,@Once_dre @jooye_onthebass HAHAHAHAHA SA TAFT
|
204 |
+
93,"When They Say They Side With the Hama-Nazis, Believe Them https://t.co/vSfKT7Mj5y"
|
205 |
+
94,"Final: Taft 34, Chavez 25"
|
206 |
+
95,"West Coast, Messed Coast™ — Kaepernick's Friend Is 'Rounding Up Jews' at Stanford https://t.co/aROloyHAUr"
|
207 |
+
96,"West Coast, Messed Coast™ — Kaepernick's Friend Is 'Rounding Up Jews' at Stanford https://t.co/aROloyI8JZ"
|