MarMont committed on
Commit
56685c8
1 Parent(s): 3c6d0ff

new data and app

Files changed (8)
  1. (bgc OR bonifacio global city).csv +199 -0
  2. (katip OR katipunan).csv +217 -0
  3. (pobla OR poblacion).csv +307 -0
  4. app.py +358 -176
  5. app_old.py +364 -0
  6. cubao.csv +236 -0
  7. requirements.txt +6 -2
  8. taft.csv +207 -0
(bgc OR bonifacio global city).csv ADDED
@@ -0,0 +1,199 @@
1
+ ,tweet
2
+ 0,"@AngelynSo5 Kasalanan ni vince, di pa nga kami nakakapunta jan inaaya na ko mag bgc😭"
3
+ 1,@hiiitsnam parks around makati/bgc
4
+ 2,wanna punta sa bgc at uminom ng caramel beer at kumain 🥹
5
+ 3,@ooohhhKaySee @jisoo_bel @francinecarreld Si Seth nga Hindi pagud kahit galing Cebu deretso agad sa studio nag bgc pa lam na nila pahinga nila ang isat isa.
6
+ 4,OMG ZACK TABUDLO IN BGC
7
+ 5,@davidthewalrus ay i miss read it tot u were here sa bgc HSHSHHSSHHAHAAHHAHAHAJAHA
8
+ 6,@ZeusNetworkTea @itsahnamac @RPSnackMealz Rollie is a bully too ! If they gone keep bringing her fat ass bck then they need to put another big btch in there 1 thts not scary ! A Tanisha(BGC) type of btch. Cause Rollie love tryna fight on the lil girls & DONT BE WINNING ! All outta breath saying “I beat your ass” & DIDNT
9
+ 7,Ahhh siguro pumunta siguro sila dito feel ko din hahahaha kaya pala nag BGC https://t.co/Wm8Rd2RrEY
10
+ 8,are we even allowed to check the launch of GM in bgc? I mean — are fans allowed hehe ㅜㅜ
11
+ 9,@ooohhhKaySee @jisoo_bel @francinecarreld Ang ganda nga ng ngitin ni Chin kanina eh. Minsan naman talaga sa mga kuha video ni Celina nakaganun si Chin hahaha. May masama ba pakiramdam na nag bgc pa 😅
12
+ 10,"KICKOFF SATURDAY: High school football continues on Saturday with great matchups such as Lamar HS vs. Sam Houston!
13
+
14
+ 🏈 Stream all the Big Game Coverage through the free BGC app! Or watch live on https://t.co/gW89m6Ii27.... https://t.co/TX2SM5gxLP"
15
+ 11,mind museum lang bakit nag bar hopping na sa bgc? ngek
16
+ 12,they did lmaoo they got bgc shut the fxk down n got paid 😹😹😹 https://t.co/Y9BAFXOmM1
17
+ 13,pagod na pagod katawang lupa — pampanga to bulacan to bgc to greenfield 😭😂 deserve ko ng cuddle ng baby ko. ay 🫣🤭
18
+ 14,ang lala ng bgc trip 🤣🤣
19
+ 15,BGC with Lime 🐈‍⬛ https://t.co/qOYlihERKL
20
+ 16,"@jaaaykaaaay @soundslikekash @buckethat2tini @jimin3d @deeminkook @MIKEEEbond g sa bgc if you'll have me 🥹🫶🏼 (if you haven't tried bb.q, ang sarap ng chicken nila huhu)"
21
+ 17,Same sila dalawa pero quality time daw dapat kaya nag date sa bgc https://t.co/XHOJWQVBGT
22
+ 18,"Ngayon nalang ulit ako magpapaumaga sa bgc, sa st.lukes pa. Saet 😩"
23
+ 19,Si Gelo nag paparty sa BGC. Grabe I FEEL SO OLD
24
+ 20,"baka sakaling magkita pa tayo sa qc, sa up, SA KALSADA NG BGC📢📢📢"
25
+ 21,Totoo ba ang BGC gubat? Makapunta nga
26
+ 22,Huy! parang gusto ko na manood ng Eras Tour ni taylor! kasama na lang kulang HAHA! Anyone nearby sa bgc? #alterbgc #altertaguig #altermakati Mostly swiftie mga alter kaya i hashtag ko na HAHA https://t.co/cPVYqFwx0P
27
+ 23,Na I-inggit ako sa mga nasa BGC ngayon😭😭 GIVE ME YOUR FOREVER I wanna hear it live😭
28
+ 24,"Huy! May idadagdag ulit ako sa playlist soon! Bakit may naalala ko sa AM? Haha!
29
+
30
+ JAYDA ROCKS BGC https://t.co/BnpiJFhyjc"
31
+ 25,"Take care of your butt. I really like hotel-style massage.
32
+ 📍BGC https://t.co/jJlMcVE1GA"
33
+ 26,"Trending parin ang bestgirl!
34
+
35
+ JAYDA ROCKS BGC https://t.co/XE5B513GiK"
36
+ 27,Hahahha ate @iiamcelina update naman sa bgc time ninyo namiss namin kayo hahahaha
37
+ 28,"AM.. Can’t wait to hear the full song.
38
+
39
+ JAYDA ROCKS BGC https://t.co/cgIb13LlWq"
40
+ 29,"Well deserve!!keep on tweeting!
41
+
42
+ JAYDA ROCKS BGC https://t.co/PDUJXsIdPK"
43
+ 30,@xg_luffy Hahaha oo nga sleep call lang daw yan hahahaha walang update sa BGC time nila ngayon
44
+ 31,sa bgc lang pala ako makakahanap ng kamukha ni jungsu
45
+ 32,@evilrashida @kaseyonnuh Definitely giving Janelle from bgc 😂
46
+ 33,"AM initials reveal.
47
+
48
+ JAYDA ROCKS BGC https://t.co/Ie69qMTV8w"
49
+ 34,"Sino muna yung AM, miss mam!
50
+
51
+ JAYDA ROCKS BGC https://t.co/RCKQYD6PpK"
52
+ 35,@xg_luffy Hahahaha oo nga eh miss siguro plus kagabi hindi kasi nag dahan dahan si sethyboy natin hahahahaha nag overthink ako malala haahahaha panay suyo talaga yun sa BGC
53
+ 36,"@FranSethBhlFn @imsethfedelin Nako seth kawawa naman yong rumored boyfriend ni chin na kasama nya sa bgc ngayon balita ko gwapo din yon yancy ba name non
54
+
55
+ #FranSeth"
56
+ 37,"Still trending!
57
+
58
+ JAYDA ROCKS BGC https://t.co/NBez0E07f9"
59
+ 38,@xg_luffy Oo nga ehh hahaha kaya panay tingin si seth ni chin ngayon panay suyo haha alam ko yung sa bgc date nila panay suyo yun hahahaaha
60
+ 39,"Kinanta ni @jaydaavanzado ung bagong song niya na ang title is ""AM"". 🤔
61
+
62
+ JAYDA ROCKS BGC"
63
+ 40,"Tang, BGC will now be among ELF's must-visit and must-dine destinations when in PH. 🍽 https://t.co/EW43A0R8wH"
64
+ 41,Ay Sheeeet got a chance to hear you singing live @zacktabudlo here at BGC. https://t.co/irZw0syw2g
65
+ 42,"10:00 PM | TAG UPDATE
66
+
67
+ Join 1st. One for an Epic Night on PPOP NIGHT!
68
+
69
+ 🗓️ Saturday, October 14, 2023 | 8PM
70
+ 📍 Buena Vida, 4F UNIQLO BLDG, BGC
71
+
72
+ PARTY ALL NIGHT WITH 1STONE
73
+ @1stoneOfficial #1stOne
74
+ #1stOneOnPPOPNIGHT #PPOPNIGHT https://t.co/3zqGiwVRPD"
75
+ 43,"Is there anything you can’t do??? @jaydaavanzado
76
+
77
+ JAYDA ROCKS BGC https://t.co/ui7mksGe5Q"
78
+ 44,"Last night before flight
79
+ bgc mode https://t.co/8V56ciWAF2"
80
+ 45,"Cebu to its your lucky day to bgc and siguro nasa bahay nila chin to ngayon kasi wala pang update sa vlog ni chin eh kanina pa ako pa balik2x sa YouTube
81
+
82
+ NASA FRANSETH ANG SWERTE
83
+ #FranSeth https://t.co/HbMOIY2dc0"
84
+ 46,@xg_luffy Kaya din siguro hindi umuwi binigyan niya nang time yung binibini niya nag BGC
85
+ 47,wala ba pede mag ampon sakin dyan sa na taga makati or malapit sa bgc like huhuhuhuhuhu sheeett ang layo talaga ng workplace ko sa bahay namin ugh
86
+ 48,today at bgc — #archivetwt https://t.co/dB7bogPOhx
87
+ 49,Nagbabalak sia mag bgc oh
88
+ 50,"We love you, mahal! You did a great job!
89
+
90
+ JAYDA ROCKS BGC https://t.co/9vPUmzI0j6"
91
+ 51,"@Thoughtswd1am @fentybullets @babgirlsanthem the same way they were able to incorporate bgc into a routine. they could switch out lyrics and cater it to LJ, play on the “miami” theme city girls already have, like there’s a lot. trust if there’s a will, there’s a way"
92
+ 52,"[ADMIN POST]
93
+
94
+ The @allriseph Team visited Tang, BGC today!
95
+
96
+ We ordered Hyukjae’s order & it is the ff:
97
+ - Tteokbokki
98
+ - Sundae
99
+ - Kimchi-jeon
100
+
101
+ They are so nice & accommodating when we asked if we can display his AllRisePH Round Fan Support!
102
+
103
+ @AllRiseSilver
104
+ #EUNHYUK #은혁 #이혁재 https://t.co/9KvY1QohA4"
105
+ 53,"She made an effort para lumapit sa Team live dahil wala silang ticket and nasa labas sila ng barricade. What a beautiful soul.
106
+
107
+ JAYDA ROCKS BGC https://t.co/cFNfoLTC9A"
108
+ 54,"Lumot represent! Thank you Jayda girl! We love you!
109
+
110
+ JAYDA ROCKS BGC https://t.co/8rCtUcGGlN"
111
+ 55,spinning tonight at studio 28 bgc 10-1am! 🥳
112
+ 56,@teamjaydauae JAYDA ROCKS BGC
113
+ 57,"Jayda with our team live. Thank you @jaydaavanzado ang galing mo talaga at napaka ganda mo! Shout out mga lumot!
114
+
115
+ JAYDA ROCKS BGC https://t.co/YLykl7XJ4k"
116
+ 58,BGC is never lacking in pretty girls and hot guys
117
+ 59,Kakatawa kayo pinipilit paniwalain ang Franseth may pa ig story pa si baboy na nasa BGC sila effort yarn😅 parang kailan lang nag date si siopao at raheel sa BGC gagawin talaga lahat hindi lang masira ang career ni siopao demonyo talaga kayo ayaw pa aminin @francinecarreld
118
+ 60,IS THIS IN BGC???? I AM 15 MINUTES AWAY https://t.co/ePCqocJ9Qk
119
+ 61,"The power in her eyes!
120
+
121
+ JAYDA ROCKS BGC https://t.co/yx0c7XRG7R"
122
+ 62,"Look whos trending!
123
+
124
+ @Team_JaydaOFC @JAYDAnationOFC @jaydaavanzado
125
+
126
+ JAYDA ROCKS BGC https://t.co/0tpk7Xs2if"
127
+ 63,"Hearing a lot of praises and admirations.
128
+
129
+ JAYDA ROCKS BGC"
130
+ 64,"BGC with Sweetheart ba?!
131
+ 😜 https://t.co/tSpisynBjB"
132
+ 65,"Our princess, but Aljon's queen!
133
+
134
+ JAYDA ROCKS BGC https://t.co/JKKr4tLn5K"
135
+ 66,"Keep tweeting lumot!
136
+
137
+ JAYDA ROCKS BGC https://t.co/DQifuJLUYV"
138
+ 67,"Trending na rin @jaydaavanzado
139
+
140
+ JAYDA ROCKS BGC https://t.co/5iTBDpik7W"
141
+ 68,"Trending!!!
142
+
143
+ JAYDA ROCKS BGC https://t.co/komi4pEUti"
144
+ 69,"Vocalist cum Instrumentalist? Baka Jayda yan. Ang galing galing mo!
145
+
146
+ CREDITS: Mama Eldz new IGS
147
+
148
+ JAYDA ROCKS BGC https://t.co/lcRypBTtks"
149
+ 70,@panisluis joyride ka nlng muna sis hahahahaha or di kaya mag mrt ka papuntang ayala then may bus don papuntang bgc sabihin mo sa stopover ka
150
+ 71,"POW will attend Gentle Monster Manila opening event on Oct 16th. The store is located at 5th Avenue, Bonifacio Global City, Taguig.
151
+
152
+ our boys are booked and busy😭🥹
153
+
154
+ #POW #파우 https://t.co/ZjjCAyQpG8 https://t.co/XdEngu7CbR"
155
+ 72,"So proud of you, luv!!! @jaydaavanzado
156
+
157
+ JAYDA ROCKS BGC"
158
+ 73,Nako! Pano na si Seth nito? Spotted pala ang rumored boyfriend ni francine sa BGC 😫 https://t.co/X2Ye1wx28F
159
+ 74,"Am baxk! Hahaha!
160
+
161
+ JAYDA ROCKS BGC"
162
+ 75,Ayan napagbigyn kayo niyo ni literal ang BGC 😂
163
+ 76,@NayeonNabong di ko mapost yung sa akin kasi wala ako mahanap sa google ng poster 😂😂😂 BGC open field days
164
+ 77,"@ate_jomzqatar Wait lang mi, nakipagdate pa kasi sa BFf nya nasa BGC pa 🤭"
165
+ 78,"Umuwi from BGC si Jingliu!
166
+
167
+ Aabot na ba sa LC to next? We will see...
168
+
169
+ https://t.co/IiCFMfjTbr"
170
+ 79,"@AlJayOfficial_ @jaydaavanzado We love you jayda!
171
+
172
+ JAYDA ROCKS BGC"
173
+ 80,"You did great best girl as always @jaydaavanzado
174
+
175
+ JAYDA ROCKS BGC https://t.co/CG8jPTKjoa"
176
+ 81,"It's the A5 Wagyu bowl for us.
177
+ https://t.co/9wiRhUxEXW"
178
+ 82,"😭🥺🥰🐾
179
+
180
+ so happy with how the photos turned out! ang saya kaninang umaga huhu pagoda lang talaga mag-ikot sa BGC after hahahuhu tenkyu talaga @bumaBAgyo ♥️ https://t.co/wfh9F5gM6s"
181
+ 83,@frankmajorfkj San ka? Nasa bgc ako.
182
+ 84,"@aimaneth GM Aiman! Philippines, BGC 😎"
183
+ 85,soft girl = mga poging tambay sa bgc playlist
184
+ 86,@Kitty_pinkyyyyy @francinecarreld bka poh sa BGC ipalabas 😅
185
+ 87,Bgc boi ang galawan ah https://t.co/oiz2ucRa0K
186
+ 88,Ano kayang magandang iterinary kapag pumunta ko bgc bukas 🤔
187
+ 89,"I remember a similar example from first year biology about fishes here in BC, but it was 5000 yrs.
188
+
189
+ This has always been in the back of my head when I consider NP chemodiversification.
190
+
191
+ In mNP BGC; I have often wondered if we are sometimes catching them in the middle of R&D https://t.co/yXzWzhpf23"
192
+ 90,i have a job interview at bgc next week aAaAAaa wish me luck huhu sana po hindi ko kwestyunin mga desisyon ko sa buhay fkshdjshs *nahimatai*
193
+ 91,Aattend ata sila sa event ni Jayda sa BGC. 🫢💙😊 https://t.co/gY3pW96LIk
194
+ 92,"All this, plus delicious food from Fat Seed + Roastery (BGC), and we surely couldn't have asked for a better morning! 🥳😍
195
+
196
+ Special thanks to our stewards Leslie Uy, Danielle Parreño, and Isabel Chin for making this possible! 🥰"
197
+ 93,@Dewgong_FTW wala naman. i just really don't like BGC
198
+ 94,"New Arrivals Starts @ 49 ‼️ #ayalamallmarketmarket #ayalamarikina #taguig #shangrilamall #bataan #bgc #vistamallstarosa #robinsonsgentrias #robinsonslaspinas
199
+ #robinsonsplacenaga #laguna #sancarlostowncenter #vistamallbataan #ayalamarqueemall #angelespampanga #cavite #ayalamalls https://t.co/YzsM2n8BdC"
(katip OR katipunan).csv ADDED
@@ -0,0 +1,217 @@
1
+ ,tweet
2
+ 0,"a friend is looking for a fubu near katip!
3
+
4
+ preferrably vers/vers top, taller than 5’4” , and hygienic!"
5
+ 1,"YOU'RE INVITED! 👻
6
+
7
+ Nightmares : Sangyeon & Q Halloween Birthday Beersleeve Event Teaser 🎃🍻
8
+
9
+ 🗓️ November 18, 2023
10
+ 📍Hangout Restobar Katipunan
11
+
12
+ Registration starts on October 21, 6 PM!
13
+
14
+ #SurviveTheNightBSE #THEBOYZ #SANGYEON #Q #SANGKYU https://t.co/l7Ld3uN5Bn"
15
+ 2,gagu after 4 months??? see u again later katipunan. i hope this time ill have only good memories in you :((
16
+ 3,@CanyonEveSimp @HilltopGamers at one katipunan! a walk away from lrt 2 katipunan
17
+ 4,Driving around katipunan while listening to Rewind by Wonder Girls. All in God's Plan.
18
+ 5,"Oo na, naa na pud lagi ko sa katip karong gabii 😭"
19
+ 6,"@MeekoPod Dam... katipunan be a bit far from where i stay...
20
+ Tho not hard to get to by car 👀"
21
+ 7,IN PHOTOS: The League of Filipino Students - Katipunan marched alongside the Ateneo community in celebration of One Big Pride–bringing along calls for junking VFA-EDCA and bringing justice to the brutal murder of Jennifer Laude at the hands of US marine Joseph Scott-Pemberton. https://t.co/RyIyRHQWQg
22
+ 8,@_narakku near katipunan LRT!! ^^
23
+ 9,katip invite pleasee huhuuu
24
+ 10,"katip fgc has a new home 👀🥺🥹
25
+
26
+ Pls follow @HilltopGamers for future announcements 👀 https://t.co/aBywPwjVAz"
27
+ 11,"The red side of Katipunan flexing its basketball dominance in #UAAPSeason86
28
+
29
+ @upmbt 5-0
30
+ @upwbt_ 5-0"
31
+ 12,nothing like good food in katip in between study sessions :) #studytwt #studytwtph https://t.co/kbY0PS8k0R
32
+ 13,"wts lfb blackpink funko pop keychain ph
33
+
34
+ 1 rose, 1 lisa, 1 jisoo ✅
35
+ ₱520 each (w/ case protector)
36
+ ₱1500 if taking as set
37
+
38
+ ✰ onhand
39
+ ✰ payo gcash
40
+
41
+ mod: prio sdd
42
+ 📍katip, qc https://t.co/KDDuD1Q5uQ"
43
+ 14,📍 Ateneo-Katipunan! Anyone need grab car ride or out of town drive. DM me. Tnx #booking #privatedriver #grabdriver https://t.co/mpx8f22DAJ
44
+ 15,"ㅤ
45
+
46
+ october 14 2023
47
+ https://t.co/CZPgMZFE0D — barista update⠀
48
+ katipunan avenue, quezon city
49
+
50
+ ㅤ https://t.co/M4JIGZOb6n"
51
+ 16,"My weekend the past weeks/months:
52
+ Nagpunta sa bgc > megamall > araneta cubao > moa to paranaque > mckinley > katipunan.
53
+
54
+ 🔜 office to megamall for team bowling > angeles > sangkyu cse > bgc with couz > chanyeol cse(?) > rizal for tb 😂😂😂 isa lang massabe ko: d na nakapagdiet. https://t.co/Nj39UfViwM"
55
+ 17,"Every friday eh yung mga friends ko niyayaya din ako minsan gumala pero I always decline kasi gastos ulit yun for me.
56
+
57
+ Either punta sila sa BGC, Katip, Maginhawa, SMNE, etc.
58
+
59
+ Diretso uwi nalang ako."
60
+ 18,"Update sa lebel sa Tubig!
61
+ As of 01:45PM | October 14, 2023
62
+ Source: LFEWS
63
+
64
+ Maayong Hapon!
65
+
66
+ Anaa gihapon sa NORMAL nga lebel ang tubig sa LIBUGANON DAM - BRGY. KATIPUNAN, KAPALONG base sa naka instalar nga Flood Camera.
67
+
68
+ ""Early Warning, Early Action!""
69
+
70
+ Duty Personnel: loc / msm https://t.co/Te8OEmvi7P"
71
+ 19,@DefenderDlsu Dapat magpabonfire sila pag naka isang set sila sa DSLU at NU..mukhang hangang sa battle of katipunan lang sila aabot.
72
+ 20,"Ang Tibaksyunaryo ay unang ginawa ng ASAP-Katipunan ng UP Manila. 📷
73
+
74
+ #STANDUPNCPAG
75
+ #TunayPalabanMakabayan"
76
+ 21,my katip friends are chatting me to hangout with them later sa walrus😭
77
+ 22,natatalo ateneo tuwing inaagawan ako ng tricycle ng isang atenista sa katipunan and methinks daserb https://t.co/THrltBwmBV
78
+ 23,Ang tanga lang nung #wagniyokaminakawansacampus sana manakawan kayo sa Katip/Ever/Pob/BGC/wherever the fuck you pretentious assholes go 🙄
79
+ 24,sure na me sa katipunan laterrrrr see yaaa
80
+ 25,ang funny talaga ng up golf club HAHHAHAHA eto topic namin kanina habang nagpapa sober da mcdo katip 🤣
81
+ 26,@cheylwse sa taft at katipunan po
82
+ 27,MMDA ALERT: Vehicular accident at C5 Nawasa Katipunan NB involving truck and motorcycle as of 3:11 PM. 1 lane occupied. MMDA enforcers on site. #mmda
83
+ 28,@jvveon PLSSS WISH U WERE HEREEE KATIP PA NAMAN
84
+ 29,"Sobrang lost ng ADMU today, sana makapag-adjust sila sa battle of Katipunan round 1 anek"
85
+ 30,Katip casuals :) https://t.co/ggjEGgq9dP
86
+ 31,"visit @HilltopGamers guys, near LRT katip station !! https://t.co/ZRPL8gcLCv"
87
+ 32,"hello :DD may comshop po ba sa katipunan and magkano po per hour?
88
+ thankyew"
89
+ 33,"I can do meet-ups if it is around Quezon City (Hanggang Katipunan area lang kaya ng powers ko) or San Juan Area. If you need more pictures or you want to negotiate the prices feel free to message me 😄
90
+ Shipping will be shouldered by the buyer.
91
+ Current Location: New Manila, Q.C"
92
+ 34,If you're seeing this please please please start organizing protests for Palestine around España Taft and Diliman/Katipunan. We need to make it clear to the youth that we have to stand with Palestine now and always for whenever in our lives it is finally liberated
93
+ 35,"wts lfb
94
+
95
+ on festival off concert
96
+
97
+ 1 ub 403A ticket
98
+ - php 4790 (original price)
99
+ - can do meet up around sm north/katipunan
100
+ - payo (prio) or installment until nov 3/d-day
101
+
102
+ get na pls :("
103
+ 36,"wts lfb
104
+
105
+ on festival off concert
106
+
107
+ 1 ub 403A ticket
108
+ - php 4790 (original price)
109
+ - can do meet up around sm north/katipunan
110
+ - payo (prio) or installment until nov 3/d-day
111
+
112
+ huhu pls get it na po >< masaya kami katabi sa con ng friend ko https://t.co/UG0srrZYm4"
113
+ 37,"I remember this was released at 6PM KST and playing the music video maybe 10 times before I left to go to some inuman with my blockmates in Blue Finn.
114
+
115
+ Shit yan ang proof ng Katip tanders. Blue Finn! https://t.co/Ga26uHgP0B"
116
+ 38,"Going back to Ateneo today. It's been like, 5 to 6 years since I last went there. But this time, with someone special who also happens to become a fellow Atenean.
117
+
118
+ Hindi ko akalaing babalik ako ng Katipunan na may baong jowa hahahaha."
119
+ 39,"Admittedly, mas less car-centric pa ang town proper ng Baguio kaysa Diliman-Katipunan."
120
+ 40,apaka layo ng katipunan kalokaa
121
+ 41,"Update sa lebel sa Tubig!
122
+ As of 06:04AM | October 14, 2023
123
+ Source: LFEWS
124
+ Maayong Buntag!
125
+ Anaa sa NORMAL nga lebel ang tubig sa LIBUGANON DAM - BRGY. KATIPUNAN, KAPALONG base sa naka instalar nga Flood Camera. https://t.co/wk9SZ7Grn7"
126
+ 42,MMDA ALERT: Stalled Wing Van due to double flat tire at C5 Labor Hospital Katipunan NB as of 5:39 AM. 1 lane occupied. MMDA enforcers on site. #mmda
127
+ 43,haha pis awt katipunan putangina mo z word mabulok ka dyan!
128
+ 44,whos at katip tommm taraa lets sleep and cuddleee
129
+ 45,#skl pero ang cute ng tita q kanina kasi nagjeep kami s may katipunan tas sabi niya sakin 5-ish years na siyang di nagkocommute so selfie kami nang selfie sa loobxhhdhshs
130
+ 46,msg me if ull be around katip tom hmpp me wanna sleep and cuddle
131
+ 47,sino nasa katipunan tom? lets sleep and cuddle pleasee
132
+ 48,"Update sa lebel sa Tubig!
133
+ As of 11:02PM | October 13, 2023
134
+ Source: LFEWS
135
+ Maayong Gabii!
136
+ Nagpabilin sa NORMAL nga lebel ang tubig sa LIBUGANON DAM - BRGY. KATIPUNAN, KAPALONG base sa naka instalar nga Flood Camera. https://t.co/bcooxwxLkj"
137
+ 49,"So #TSErasTourFilm is extremely extraordinary....i enjoyed it much,but after all...hihi..huhu...hirap ng sakayan. Ng lalakad ako nito,,basta makauwi lng....rn dumaan ako ng dunkin donut dito sa katipunan ave. https://t.co/Qv0JM6j48a"
138
+ 50,i’m so happy when i pass by katip & see this. i’m glad they didn’t remove it 🥺 https://t.co/FftcR2QoXL
139
+ 51,I hate commuting to Ateneo/katipunan with a burning passion
140
+ 52,"Officials of the Caloocan city government and the Napolcom, and representatives of other local government units in Metro Manila tackled law enforcement issues at the PLEB Consultative Meeting at the Bulwagang Katipunan on Thursday, Oct. 12.
141
+
142
+ https://t.co/EaZvYVWSMJ"
143
+ 53,"@euhhlala not really tho daghag comments like why are you in up, you’re in the wrong side of katip etc like ???
144
+
145
+ and kabalo man ta kinsa sala ngano ing ana ang demo sa up so why are they getting the hate??"
146
+ 54,@hoshipsbw Hihi. Sabi ko na may bagong visual inspo for AW. Kuliglig na naman sa mga taga Katip 😘
147
+ 55,i hate the katip traffic but i looove seeing this https://t.co/bFJeg1VnjY
148
+ 56,uwing uwi na ko traffic pa rin sa cubao pa katipunan??????
149
+ 57,"@w1thdiamonds another thing: lan kwai/rooftop(before it closed) every other friday, jackie running maroon fm and/or tv like the navy, lottie the som crushie, sometimes taivan+mari+the ust peeps would leave taft+españa & travel to see the rest of them in katip(and complain about the traffic)"
150
+ 58,@w1thdiamonds bro i cry internally everyday when i pass the hellscape that is katipunan traffic
151
+ 59,@Its_Me_Angela_ That katipunan ave traffic though. 🥲.
152
+ 60,@katip_31 @Polizei_Ffm Straftat?! 😅
153
+ 61,yooo qc katip g hmu bored #jakolph #zoomjakol #alterqc #alterph
154
+ 62,"me submitting my ateneo application in katipunan, on friday the 13th 😁 #OBF"
155
+ 63,"di na yan ireereview kasi based sa words ni Naveen, ang rules ng liga ay dapat within 48 hours after the game lang ang decision soooooo UP gets away with it again for the nth time HAHAHAHAHA
156
+
157
+ and ppl has the nerve to say the other side of Katipunan is the one playing dirty LOOOOL https://t.co/p6jLbv6k49"
158
+ 64,yung poging nakatambay sa katipunan https://t.co/Wx0ssDW0J5
159
+ 65,katipunan i will NOT! puke in your streets tonight
160
+ 66,@AntonioShe46517 @ham_7_game @AngelGABv13 @S_sants @NateBar12224359 @DonnyniB @SatriaRafi05 @Yadiiiiraaa_ @Christo46511122 @UofLCuban @Sharadrusalla @ricardozamorac1 @AlexBenitez2306 @NNe_SR @trhouser @inakialdasoro @knk150506 @hatem5267 @BigHossKnows @Katip_hizli @foreveryounct Check my prоfile to get launсh 🛡️✅
161
+ 67,"@allforyyj @wouziriri may jeep terminal po na walking distance from katip station, sakay kayo don then pagka u-turn nung jeep lampas uptc baba na kayo agad dun sa unang gate, andon na rin po yung gt toyota ^^"
162
+ 68,"buong trip from un to d.jose + from recto to katip, nakatayo ako pls for one week na po itong workout na ito"
163
+ 69,how to commute from katip station to gt toyota auditorium in up
164
+ 70,"⚠ Practice defensive driving.
165
+
166
+ METROBASE Incident report at Major roads
167
+ from 2:00pm to 7:00pm
168
+
169
+ WHEN: October 13, 2023
170
+
171
+ Stalled Vehicle/s :
172
+ 1. C5 Lanuza SB (Dumptruck; Flat tire; 2:12pm time reported; Assisted by MMDA Enforcers;
173
+
174
+ 3. C5 Katipunan ave. Miriam SB (Closedvan; Flat… https://t.co/oiWZHbezxQ https://t.co/bcUOVKgRyn"
175
+ 71,MABIBILAUKAN AKO SA MCDO KATIPUNAN KABULASTUGAN TALAGA https://t.co/qujGwy84OU
176
+ 72,"ㅤ
177
+
178
+ https://t.co/UlLYKINytL sns update! ❦
179
+ 1O月 13日 、 FRIDAY
180
+ Katipunan Avenue, QC.
181
+
182
+ ㅤ https://t.co/cdM08HYkPV"
183
+ 73,"Update sa lebel sa Tubig!
184
+ As of 06:04PM | October 13, 2023
185
+ Source: LFEWS
186
+ Maayong Gabii!
187
+ Anaa sa NORMAL nga lebel ang tubig sa LIBUGANON DAM - BRGY. KATIPUNAN, KAPALONG base sa naka instalar nga Flood Camera. https://t.co/y2nUMrGzmi"
188
+ 74,gago yung red light sa katipunan parang 3-4 minutes maybe 5 anong kapunyetahan to
189
+ 75,"FARMerch yarn? 🌿🌾
190
+
191
+ Ngayong buwan ng magsasaka, sama-sama nating ipaglaban ang tunay na reporma sa lupa, lokal na produksyon at hustisya!
192
+
193
+ Dahil diyan, inilulunsad ng Katipunan ng mga Samahang Magbubukid sa Timog Katagalugan (KASAMA-TK) ang FARMerch, https://t.co/iTZypwhI8b"
194
+ 76,Katip later
195
+ 77,"I fully love that the spot in Katipunan that is known as ""the cursed land where none may plant their seed and have them prosper"" is now filled by D/IMSUM T/REATS"
196
+ 78,@theblessedbhie oh pak napromote pa ang blessed bhiestro 🫶🏻 labyu madam kelan na ang ating katip date!!!!!
197
+ 79,grindr katipunan isn’t doing its thing for me so i guess it’s time for twitter to do its thing heheheh https://t.co/sIs1I31ng7
198
+ 80,“mabuhay ang pilipinas! mabuhay ang himagsikan! mabuhay ang katipunan!” https://t.co/Da6ZIBYGKX
199
+ 81,MMDA ALERT: Vehicular accident at C5 Katipunan Ave. B. Serrano Intersection WB involving a van and motorcycle as of 1:25 AM. 1 lane occupied. MMDA enforcers on site. #Mmda
200
+ 82,"Update sa lebel sa Tubig!
201
+ As of 01:02PM | October 13, 2023
202
+ Source: LFEWS
203
+
204
+ Maayong Hapon!
205
+
206
+ Anaa gihapon sa NORMAL nga lebel ang tubig sa LIBUGANON DAM - BRGY. KATIPUNAN, KAPALONG base sa naka instalar nga Flood Camera. https://t.co/JaPPdSqvLV"
207
+ 83,"In this digital age, journalism combats the rampant and persisting threat of misinformation and disinformation. Let us listen as journalist and keynote speaker Atom Araullo shares his take on this pressing matter.
208
+
209
+ See you later here on Level Up at 3pm live on Radyo Katipunan. https://t.co/OpdQtzlFAA"
210
+ 84,MMDA ALERT: Stalled truck due to mechanical problem at C5 Katipunan Uptown SB as of 11:40 AM. 1 lane occupied. MMDA enforcers on site. #mmda
211
+ 85,Bakit walang tiwala sa ating katipunan pero bakit wala pera ang ating panliipunan ano nangyayari…. sa ating pang-inuman
212
+ 86,what did andres bonifacio name the philippines revolutionary 😭 /gen — kataastaasan kagalanggalang na katipunan ng mga anak ng bayan (KKK) https://t.co/A2VNDbILnJ
213
+ 87,"damn 8 ako umalis sa bahay, and mag teten na katip palang ako ahahaha, nywy ahahaha grabe talaga traffic dto"
214
+ 88,mahangin talaga dito sa katip no?
215
+ 89,MMDA ALERT: Vehicular accident at Katipunan Miriam NB involving 2 motorcycles as of 9:20 AM. 1 lane occupied. MMDA enforcer on site. #mmda
216
+ 90,no break date near katip 💔💔💔
217
+ 91,paano ba bumyahe papunta katipunan hHASHHS
(pobla OR poblacion).csv ADDED
@@ -0,0 +1,307 @@
1
+ ,tweet
2
+ 0,"Pobla, Makati hehe https://t.co/QAkWKcg82V"
3
+ 1,otw pobla nagutom nako kakahintay kay bakla 🙄
4
+ 2,Who’s around Pobla????
5
+ 3,@EldornM Meron yan wag ka magalala Hahahahahahha kita kits sa pobla Hahahahahha
6
+ 4,"lf fem, pobla area now, dm your rates
7
+
8
+ and for the love of god, no dp pls. I can book your ride prepaid
9
+ #alterpinayph #alterbooking #AlterMakat"
10
+ 5,gusto ko sana sumama sa reddit peeps kaso pobla pa hahaha hmm
11
+ 6,dapat magpunta aq pobla now pero ang sarap ng higa q ngayon HAHAHHSHDLDHDJD
12
+ 7,Pobla Sabado now here at @UrusMakati asan na kau guys? Unli drinks until supply last till 4am! https://t.co/ua69y0wbQN
13
+ 8,"cause we still got a lot of energy from the eras film
14
+
15
+ 📍 POBLA"
16
+ 9,"lf fem, pobla area now, dm your rates
17
+
18
+ #alterpinayph #alterbooking #AlterMakati"
19
+ 10,"📢 Exciting news! An airplane carrying medical supplies from the WHO has just landed in Egypt for the people of Gaza. 🌍 Learn more about this crucial humanitarian effort: https://t.co/5X1bx3Kq8z. Together, we can make a difference! 💙 #HumanitarianAid #WHO #Gaza #LobosNews"
20
+ 11,Edi nasa pobla sana ako mamaya kung maeenjoy ko weekend ko diba??? https://t.co/ESpYpMP7qA
21
+ 12,"@KylieVogue___ @carlabrews @Laylaitera Mag apply keah aqo na kasambahay kay Donya Carla? $5/hour. Afford nemen nya at xia pinakamayabang este, ""pinakamayaman"" sa kanilang barangay Isla Poblacion, Pangil, Laguna. 😂😂😂"
22
+ 13,"War crimes. Vonderleyen should be promptly sacked before an appearance at the Hague on war crimes charges:
23
+
24
+ https://t.co/dFCVD4NU8c"
25
+ 14,Who’s gonna be at pobla tonight? Tara?
26
+ 15,Tara sa Poblacion mamaya? 🥂 Free entrance ang mga nasa guestlist. Dm me for the guestlist 🪩
27
+ 16,"Tara sa Disturbia Poblacion, Makati mamaya? 🥂 Free entrance ang mga nasa guestlist. Dm me for the guestlist 🪩"
28
+ 17,"See you later 👋
29
+
30
+ Brief party 🎉
31
+
32
+ At makati poblacion
33
+
34
+ Contact: @AshtonDaks69 https://t.co/OO3LWHfdl7"
35
+ 18,Nag sibat gani ko sa Poblacion para layo sa shot pag abot diri shot gyapon
36
+ 19,"Okay final na, will be at bgc tomorrow hahaha. Malay niyo mag pobla ako, libre niyo ko drinks. Gift niyo na sa bday ghorl 😚🤗😇 https://t.co/h6bd7psFHF"
37
+ 20,tangina takang taka na siguro magulang namin kung bakit buong araw tulog kme ni kuya asa pobla din kasi pala si gagu kagabi HSHDHXBDBDBSBSBXHHXHS
38
+ 21,Tas asikaso ulit at punta namang bgc and pobla 😩
39
+ 22,"EU's top diplomat Joseph Borrell breaks from EU Commission president von der Leyen and says Israel's retaliation doesn't comply with international law.
40
+ https://t.co/x5zgQBC3Zi"
41
+ 23,Pobla Days with Avida boys https://t.co/YBrOkYonUg
42
+ 24,"At around 1 O'clock in the afternoon on October 14, 2023, personnel of Peñarrubia MPS led by PSMS Milagros B. Ibrahim provided Police Presence and motorist assistance in Barangay Poblacion, Peñarrubia to pre-empt occurrence of crime and any untoward incidents around AOR. https://t.co/j4hJR3Xmdp"
43
+ 25,"At around 8:00 AM, personnel of Peñarrubia MPS led by PMSg Jocelyn B. Caba conducted Police Presence/Area Security to avoid any untoward incidents that may happen and to maintain peace and order in Barangay Poblacion, Peñarrubia, Abra. https://t.co/DiS7EYtjLv"
44
+ 26,"EU foreign policy chief @JosepBorrellF breaks ranks with @VonderLeyen and accuses Israel of committing war crimes.
45
+
46
+ Huge division now at the top of the EU.
47
+
48
+ https://t.co/rML95qRz5T"
49
+ 27,"🇮🇱 #Israel announces to the population of 🇵🇸 Gaza two temporary southbound exit routes ⏳
50
+
51
+ ""For your own safety and that of your loved ones, leave,"" Israeli authorities announce.
52
+
53
+ ↘️
54
+ https://t.co/dUB1X6p78z"
55
+ 28,"FOR SALE ""LANIE BALOGO""
56
+
57
+ PM ME FOR DETAILS OR IF YOU WANT TO COLLAB WITH HER CONTACT THIS PERSON.
58
+
59
+ Pacifico Jr Delfin Yalao
60
+ 09452019207, 09122390128
61
+ Poblacion San Pablo Zamboanga del Sur
62
+ [email protected] https://t.co/wIoe1C2AUI"
63
+ 29,saang cafe sa poblacion kaya aq pwede magreview KSKSKSKS
64
+ 30,"@stvrlie @zZzsie hahahahaha the heck, eli! you're in pobla na naman? want ko rin uminom. dm me the deets 😉"
65
+ 31,"hahhaa the counselor at lys is so funny every time 😭
66
+ him: kumusta ka?
67
+ me: inaantok
68
+ him: pumarty? saan ka pumarty
69
+ me: titahan sa poblacion po
70
+ him: ako sa timog"
71
+ 32,"pobla hopping
72
+
73
+ sanctuary > polilya > apartment > tambai https://t.co/Gl3PH8Zg0q"
74
+ 33,Who's going to Poblacion tonight? Anyone???
75
+ 34,To be honest. Had fun last Wednesday episode in poblacion onto next? (Crossfingers) 🤞🏼🤞🏼
76
+ 35,"@MacaesBruno I think Borrell has shown some sense this week on the topic.
77
+
78
+ https://t.co/egU6mOskou"
79
+ 36,"Says parts of Israeli response counter international law
80
+
81
+ ""There are elements of this response that certainly do not comply with international law. I think this is the third time I've said it. But I'm not saying it; The UN SG says it. It's no discovery.""
82
+
83
+ https://t.co/egU6mOskou"
84
+ 37,"🚨 BREEDERS!
85
+
86
+ Undress your finest body and release your sexiness tonight at the “BRIEF PARTY”💙🩲
87
+
88
+ Catch me for an exclusive ORGYNUMAN
89
+ at Poblacion Makati!🥂
90
+
91
+ See you there, Breeders! 😈💦
92
+
93
+ 📩 For more details about the event,
94
+ Kindly dm @AshtonDaks69 https://t.co/uednKM0kAx"
95
+ 38,"@Weh227089049811 San sa poblacion,,malapit lang ba sa makati ave"
96
+ 39,grabe bday celeb ni kang umabot kami gang pobla 😭
97
+ 40,Tumatalab parin pala ang “Handle me with care.” sa mga taga Pobla 😂
98
+ 41,"TONIGHT @ MAKATI
99
+ 💪🔥 BRIEF-PARTY 🔥💪
100
+
101
+ 💯 ORGYNUMAN 💯
102
+
103
+ OCTOBER 14 POBLACION MAKATI SATURDAY DOOR OPEN 6PM TO 6AM
104
+
105
+ PRE BOOKING (599,₱)
106
+ WALK-IN FEE ( 799,₱)
107
+
108
+ NO DRUGS NO VIDEO NO PICTURE TAKING DURING EVENT
109
+ FREE CONDOMS AND LUBE
110
+
111
+ INCLUSION:
112
+ 💯 UNLIMITED DRINK
113
+ 💯 COCKTAIL… https://t.co/BgHj3MXH0k https://t.co/1kwFJvB6Mk"
114
+ 42,"will watch ts eras tour movie and hop to prolly pobla/maginghawa tonight.
115
+
116
+ oh how I love october month ❤️‍🩹"
117
+ 43,3hrs tulog kaya ba mag pobla mamaya 🥲
118
+ 44,"TONIGHT @ MAKATI
119
+ 💪🔥 BRIEF-PARTY 🔥💪
120
+
121
+ 💯 ORGYNUMAN 💯
122
+
123
+ OCTOBER 14 POBLACION MAKATI SATURDAY DOOR OPEN 6PM TO 6AM
124
+
125
+ PRE BOOKING (599,₱)
126
+ WALK-IN FEE ( 799,₱)
127
+
128
+ NO DRUGS NO VIDEO NO PICTURE TAKING DURING EVENT
129
+ FREE CONDOMS AND LUBE
130
+
131
+ INCLUSION:
132
+ 💯 UNLIMITED DRINK
133
+ 💯 COCKTAIL… https://t.co/ZkK02L8e3O https://t.co/m3rgkWwyVF"
134
+ 45,"TONIGHT @ MAKATI
135
+ 💪🔥 BRIEF-PARTY 🔥💪
136
+
137
+ 💯 ORGYNUMAN 💯
138
+
139
+ OCTOBER 14 POBLACION MAKATI SATURDAY DOOR OPEN 6PM TO 6AM
140
+
141
+ PRE BOOKING (599,₱)
142
+ WALK-IN FEE ( 799,₱)
143
+
144
+ NO DRUGS NO VIDEO NO PICTURE TAKING DURING EVENT
145
+ FREE CONDOMS AND LUBE
146
+
147
+ INCLUSION:
148
+ 💯 UNLIMITED DRINK
149
+ 💯 COCKTAIL… https://t.co/ajOJyO5Kvz https://t.co/ufVN0CZZFx"
150
+ 46,"TONIGHT @ MAKATI
151
+ 💪🔥 BRIEF-PARTY 🔥💪
152
+
153
+ 💯 ORGYNUMAN 💯
154
+
155
+ OCTOBER 14 POBLACION MAKATI SATURDAY DOOR OPEN 6PM TO 6AM
156
+
157
+ PRE BOOKING (599,₱)
158
+ WALK-IN FEE ( 799,₱)
159
+
160
+ NO DRUGS NO VIDEO NO PICTURE TAKING DURING EVENT
161
+ FREE CONDOMS AND LUBE
162
+
163
+ INCLUSION:
164
+ 💯 UNLIMITED DRINK
165
+ 💯 COCKTAIL… https://t.co/vlgKTVgILq https://t.co/p30UC42BuQ"
166
+ 47,"TONIGHT @ MAKATI
167
+ 💪🔥 BRIEF-PARTY 🔥💪
168
+
169
+ 💯 ORGYNUMAN 💯
170
+
171
+ OCTOBER 14 POBLACION MAKATI SATURDAY DOOR OPEN 6PM TO 6AM
172
+
173
+ PRE BOOKING (599,₱)
174
+ WALK-IN FEE ( 799,₱)
175
+
176
+ NO DRUGS NO VIDEO NO PICTURE TAKING DURING EVENT
177
+ FREE CONDOMS AND LUBE
178
+
179
+ INCLUSION:
180
+ 💯 UNLIMITED DRINK
181
+ 💯 COCKTAIL… https://t.co/HHQZYokDur https://t.co/QT8Rf2jryA"
182
+ 48,"TONIGHT @ MAKATI
183
+ 💪🔥 BRIEF-PARTY 🔥💪
184
+
185
+ 💯 ORGYNUMAN 💯
186
+
187
+ OCTOBER 14 POBLACION MAKATI SATURDAY DOOR OPEN 6PM TO 6AM
188
+
189
+ PRE BOOKING (599,₱)
190
+ WALK-IN FEE ( 799,₱)
191
+
192
+ NO DRUGS NO VIDEO NO PICTURE TAKING DURING EVENT
193
+ FREE CONDOMS AND LUBE
194
+
195
+ INCLUSION:
196
+ 💯 UNLIMITED DRINK
197
+ 💯 COCKTAIL… https://t.co/tV1yUxqbN5 https://t.co/stM2LF82ph"
198
+ 49,"TONIGHT @ MAKATI
199
+ 💪🔥 BRIEF-PARTY 🔥💪
200
+
201
+ 💯 ORGYNUMAN 💯
202
+
203
+ OCTOBER 14 POBLACION MAKATI SATURDAY DOOR OPEN 6PM TO 6AM
204
+
205
+ PRE BOOKING (599,₱)
206
+ WALK-IN FEE ( 799,₱)
207
+
208
+ NO DRUGS NO VIDEO NO PICTURE TAKING DURING EVENT
209
+ FREE CONDOMS AND LUBE
210
+
211
+ INCLUSION:
212
+ 💯 UNLIMITED DRINK
213
+ 💯 COCKTAIL… https://t.co/AsvBfU73gb https://t.co/GzbScV3hwM"
214
+ 50,"TONIGHT @ MAKATI
215
+ 💪🔥 BRIEF-PARTY 🔥💪
216
+
217
+ 💯 ORGYNUMAN 💯
218
+
219
+ OCTOBER 14 POBLACION MAKATI SATURDAY DOOR OPEN 6PM TO 6AM
220
+
221
+ PRE BOOKING (599,₱)
222
+ WALK-IN FEE ( 799,₱)
223
+
224
+ NO DRUGS NO VIDEO NO PICTURE TAKING DURING EVENT
225
+ FREE CONDOMS AND LUBE
226
+
227
+ INCLUSION:
228
+ 💯 UNLIMITED DRINK
229
+ 💯 COCKTAIL… https://t.co/R2fmvLp4d3 https://t.co/opqiWMhlCz"
230
+ 51,"ℍ𝔸ℙℙ𝔼ℕ𝕀ℕ𝔾 ℕ𝕆𝕎| National ID registration at the 3rd Floor of the LDRRM Bldg., Poblacion, Paracelis. Registration is open Monday to Saturday, 8:00AM- 5:00PM.
231
+
232
+ #NationalID
233
+ #IbaNaAngKilala https://t.co/RnYLDGbEjO"
234
+ 52,"Resumen
235
+
236
+ ...
237
+ ➡️ https://t.co/cEcdvICoX1"
238
+ 53,@RadioGenoa If this continues escalating perhaps they will need to start using weapons to defend themselves and the población of this criminals.
239
+ 54,"ℍ𝔸ℙℙ𝔼ℕ𝕀ℕ𝔾 ℕ𝕆𝕎| National ID registration and market day of Sabangan at San Alfonso Grounds, Poblacion, Sabangan.
240
+
241
+ For those who are not yet registered, visit our Fixed Registration Center at the Groundfloor of the Bontoc Municipal Capitol.
242
+ #NationalID
243
+ #IbaNaAngKilala https://t.co/HtnmhjPVUw"
244
+ 55,"𝕀ℕ ℙℍ𝕆𝕋𝕆𝕊| National ID registration during the DSWD Family Development Session at the Municipal Capitol, Poblacion.
245
+
246
+ For those who are not yet registered, visit our Fixed Registration Center at the Groundfloor of the Bontoc Municipal Capitol.
247
+ #NationalID
248
+ #IbaNaAngKilala https://t.co/R6ORufhFDM"
249
+ 56,"1/2: 2165 beneficiaries mula sa Brgy. Kasilawan, Brgy. Tejeros, Brgy. Sta. Cruz, Brgy. La Paz, at Brgy. Poblacion ang nakatanggap ng tulong mula sa ating Assistance to Individuals in Crisis Situations (AICS) program na ginanap ngayong araw. https://t.co/c1kG8BxJ2o"
250
+ 57,"WANTED SCAMMER !!
251
+
252
+ Yalao Jr Pacifico Delfin
253
+ 09452019207, 09122390128
254
+ Poblacion San Pablo Zamboanga del Sur
255
+ [email protected] https://t.co/LjwKaW04L1"
256
+ 58,performing at Apotheka Poblacion tonight with @ladygagitaofcl 🫶🏼🫶🏼
257
+ 59,Limgaw kaayo kog bar hopping sa pobla gabie haha
258
+ 60,"Craving? Try Kurimi milktea, coffee, yogurt and snacks at Poblacion- M.H Del Pilar St., Silang, Cavite. Visit the 2nd Kurīmī Milk Tea branch of Atty. Vea Magsaysay #KurimiPoblacion #kurimi2D #kurimilicious https://t.co/StFxxLis73"
259
+ 61,"Baka lang meron may alam here. Me and my cousins plan na mag party sa poblacion Oct 31 Halloween salubong. Any reco po where to stay? Like condotel, hotels or kahit transient na pwedi namin ma rent buong space."
260
+ 62,THE ERASE TOUR APOTHEKA POBLACION TONIGHT. SOLD OUT!
261
+ 63,"Palestinian President Mahmoud Abbas reaffirms to Blinken that they will not allow their population to leave Gaza.
262
+ #IsraelPalestineConflict
263
+
264
+ https://t.co/3zUqjhK6Jj"
265
+ 64,Not so sleigh ang nangyare sa pobla ha 😩
266
+ 65,"That day has come. I am in Poblacion at 430am running while people are still actively partying while some are passed out outside. Prepandemic me would've said, ""hi old"""
267
+ 66,mingaw nauud kyko sakong uyab lami kaayo ilupad ug poblacion
268
+ 67,"@MohammedPu50150 @GalindoKno52453 @ortizdiegoo8 @tunakorcak @blakerypel16 @KILLBARI @AlpheuBALDE @agboola_giwa @vkvjnaf @SorenHedman @adamsantaii @lichisup @poblacion_csic @el_forjador @gaolay @Gawanwb @dver_fioletovay @ahmetalibacak2 @SakukoGame @SmiLe_In_D_dark @bewarethehunt Fuck you mohammed, I’m gonna make Nagasaki look like a kids show if you keep this up"
269
+ 68,Or sana may ma-meet akong cutie and matinong guys from Makati esp. sa Pobla na potential bebe ko. Huhu
270
+ 69,hindi ako mapupunta sa poblacion
271
+ 70,Traffic around Poblacion on Friday nights is crazy
272
+ 71,girlie dont u ever stop working??? nageemail ka pa sakin ng 10pm ON A FRIDAY??? dont tell me nasa pobla ka habang tinatype mo yan pls nakakaloka GET A LIFE OUTSIDE OF WORK NAMAN HUY
273
+ 72,gusto ko maranasan halloween party sa poblacion plsss di ako nakapunta last year kasi busy sa thesis at exam 😢
274
+ 73,sadt naman here sa Pobla. gusto ko nlg mgkape pls
275
+ 74,nag poblacion para kumain >>> uminom
276
+ 75,what if pobla?🤔
277
+ 76,Gagu napunta kami sa poblacion 😭
278
+ 77,@ Poblacion init ng ulo ko dito na ako kumain
279
+ 78,"@aipeiramr Mag usap naman po tayo, okay naman tayo eh. Hindi ko alam bakit ka po nagagalit saglit lang naman po kami sa Poblacion kanina"
280
+ 79,Poblacion gaming tomorrow after a week of loaded work!!!!!!!! 😭😭😭 finallyyyyyyyyyyyyyyy
281
+ 80,kadami naman nka NCTdream varsity jacket dito sa pobla 😅
282
+ 81,i guess see u at pobla tom???
283
+ 82,@jadedazes i become the poblacion girl of the convention world /j
284
+ 83,"Continous conduct of PNP/COMELEC Checkpoint along Abra-Kalinga road particularly in So. Nagsangalan, Barangay Poblacion East, Pidigan, Abra. https://t.co/lTTdJKiE50"
285
+ 84,"BUKAS NA PO ITO OCT 14 SATURDAY AT POBLACION MAKATI
286
+ SEE YOU BUKAS UGH 😍🍆
287
+
288
+ DM @AshtonDaks69
289
+ @pinoyspabuddy https://t.co/rJpbbMhG3O"
290
+ 85,"@yanamnv gagi wag, mag pobla pa tauu 🙁🙁😭"
291
+ 86,@_kyunhee tara ituloy ang pobla 👀 AHAHAHA
292
+ 87,"Huhu grabe pota kapoyyy tgif walang paparty, miss ko na mag pobla"
293
+ 88,Daming nag aaya may pop up may pobla hahaha shuta
294
+ 89,guys pobla tayo mamaya
295
+ 90,@smninews @indaysara Madame VP-Sarah muzta na po kayo me tweet po ako sa inyo si Jaime A. CABACANG po i2 Poblacion Munt city
296
+ 91,Friday the 13th. Poblacion. Working. Waiting for PSJ updates. 😌 https://t.co/hjO21pVhMB
297
+ 92,"Usap-usapan ngayon sa social media ang rambulan ng mga babae sa isang bar sa Poblacion, Makati.
298
+
299
+ https://t.co/0ephIV6FJW
300
+
301
+ #TunayNaTabloidista #AbanteNews"
302
+ 93,"Happy Friday the 13th! 🍻
303
+ https://t.co/a3fwZ0uorS"
304
+ 94,"@Jorge17125130 @MediavillaMarga En fin
305
+ https://t.co/lqGvl9DFSN"
306
+ 95,"We know you're sooo ready to clock out.
307
+ https://t.co/0kvopK5jYH"
app.py CHANGED
@@ -1,45 +1,78 @@
1
- # Required Libraries
2
-
3
- #Base and Cleaning
4
- import json
5
- import requests
6
  import pandas as pd
7
- import numpy as np
8
- import emoji
9
- import regex
10
  import re
11
- import string
12
- from collections import Counter
13
- import tqdm
14
- from operator import itemgetter
15
-
16
- #Visualizations
17
- import plotly.express as px
18
- import seaborn as sns
19
- import matplotlib.pyplot as plt
20
- import pyLDAvis.gensim
21
- import chart_studio
22
- import chart_studio.plotly as py
23
- import chart_studio.tools as tls
24
-
25
- #Natural Language Processing (NLP)
26
  import spacy
27
  import gensim
28
  import json
 
 
29
  from spacy.tokenizer import Tokenizer
 
 
 
30
  from gensim.corpora import Dictionary
31
- from gensim.models.ldamulticore import LdaMulticore
32
  from gensim.models.coherencemodel import CoherenceModel
33
- from gensim.parsing.preprocessing import STOPWORDS as SW
34
- from sklearn.decomposition import LatentDirichletAllocation, TruncatedSVD
35
- from sklearn.feature_extraction.text import CountVectorizer, TfidfVectorizer
36
- from sklearn.model_selection import GridSearchCV
37
  from pprint import pprint
38
- from wordcloud import STOPWORDS
 
 
 
 
39
  from gensim.parsing.preprocessing import preprocess_string, strip_punctuation, strip_numeric
40
 
 
 
 
 
 
 
 
 
 
 
41
  import gradio as gr
42

43
  def give_emoji_free_text(text):
44
  """
45
  Removes emoji's from tweets
@@ -59,6 +92,18 @@ def url_free_text(text):
59
  text = re.sub(r'http\S+', '', text)
60
  return text
61
 
 
 
 
 
 
 
 
 
 
 
 
 
62
  # Tokenizer function
63
  def tokenize(text):
64
  """
@@ -86,60 +131,8 @@ def tokenize(text):
86
 
87
  return tokens
88
 
89
- def compute_coherence_values(dictionary, corpus, texts, limit, start=2, step=1):
90
- coherence_values = []
91
- model_list = []
92
- for num_topics in range(start, limit, step):
93
- model = gensim.models.ldamodel.LdaModel(corpus=corpus,
94
- num_topics=num_topics,
95
- random_state=100,
96
- chunksize=200,
97
- passes=10,
98
- per_word_topics=True,
99
- id2word=id2word)
100
- model_list.append(model)
101
- coherencemodel = CoherenceModel(model=model, texts=texts, dictionary=dictionary, coherence='c_v')
102
- coherence_values.append(coherencemodel.get_coherence())
103
-
104
- return model_list, coherence_values
105
-
106
- def compute_coherence_values2(corpus, dictionary, k, a, b):
107
- lda_model = gensim.models.ldamodel.LdaModel(corpus=corpus,
108
- id2word=id2word,
109
- num_topics=num_topics,
110
- random_state=100,
111
- chunksize=200,
112
- passes=10,
113
- alpha=a,
114
- eta=b,
115
- per_word_topics=True)
116
- coherence_model_lda = CoherenceModel(model=lda_model, texts=df['lemma_tokens'], dictionary=id2word, coherence='c_v')
117
-
118
- return coherence_model_lda.get_coherence()
119
-
120
- def assignTopic(l):
121
- maxTopic = max(l,key=itemgetter(1))[0]
122
- return maxTopic
123
-
124
- def get_topic_value(row, i):
125
- if len(row) == 1:
126
- return row[0][1]
127
- else:
128
- return row[i][1]
129
-
130
- def dataframeProcessing(dataset):
131
- # Opening JSON file
132
- f = open('stopwords-tl.json')
133
- tlStopwords = json.loads(f.read())
134
- stopwords = set(STOPWORDS)
135
- stopwords.update(tlStopwords)
136
- stopwords.update(['na', 'sa', 'ko', 'ako', 'ng', 'mga', 'ba', 'ka', 'yung', 'lang', 'di', 'mo', 'kasi'])
137
-
138
- global df
139
- df = pd.read_csv(dataset + '.csv')
140
  df.rename(columns = {'tweet':'original_tweets'}, inplace = True)
141
- df = df.apply(lambda row: row[df['language'].isin(['en'])])
142
- df.reset_index(inplace=True)
143
 
144
  # Apply the function above and get tweets free of emoji's
145
  call_emoji_free = lambda x: give_emoji_free_text(x)
@@ -150,9 +143,13 @@ def dataframeProcessing(dataset):
150
  #Create a new column with url free tweets
151
  df['url_free_tweets'] = df['emoji_free_tweets'].apply(url_free_text)
152
 
153
- # Load spacy
154
- # Make sure to restart the runtime after running installations and libraries tab
155
- nlp = spacy.load('en_core_web_lg')
 
 
 
 
156
 
157
  # Tokenizer
158
  tokenizer = Tokenizer(nlp.vocab)
@@ -185,19 +182,6 @@ def dataframeProcessing(dataset):
185
  # Make tokens a string again
186
  df['tokens_back_to_text'] = [' '.join(map(str, l)) for l in df['tokens']]
187
 
188
- def get_lemmas(text):
189
- '''Used to lemmatize the processed tweets'''
190
- lemmas = []
191
-
192
- doc = nlp(text)
193
-
194
- # Something goes here :P
195
- for token in doc:
196
- if ((token.is_stop == False) and (token.is_punct == False)) and (token.pos_ != 'PRON'):
197
- lemmas.append(token.lemma_)
198
-
199
- return lemmas
200
-
201
  df['lemmas'] = df['tokens_back_to_text'].apply(get_lemmas)
202
 
203
  # Make lemmas a string again
@@ -206,60 +190,111 @@ def dataframeProcessing(dataset):
206
  # Apply tokenizer
207
  df['lemma_tokens'] = df['lemmas_back_to_text'].apply(tokenize)
208
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
209
  # Create a id2word dictionary
210
- global id2word
211
  id2word = Dictionary(df['lemma_tokens'])
212
 
213
  # Filtering Extremes
214
  id2word.filter_extremes(no_below=2, no_above=.99)
215
- print(len(id2word))
216
 
217
  # Creating a corpus object
 
218
  corpus = [id2word.doc2bow(d) for d in df['lemma_tokens']]
 
 
219
 
220
- lda_model = gensim.models.ldamodel.LdaModel(corpus=corpus,
221
- id2word=id2word,
222
- num_topics=5,
223
- random_state=100,
224
- chunksize=200,
225
- passes=10,
226
- per_word_topics=True)
227
 
228
- pprint(lda_model.print_topics())
229
- doc_lda = lda_model[corpus]
230
 
231
- coherence_model_lda = CoherenceModel(model=lda_model, texts=df['lemma_tokens'], dictionary=id2word, coherence='c_v')
232
- coherence_lda = coherence_model_lda.get_coherence()
233
-
234
- model_list, coherence_values = compute_coherence_values(dictionary=id2word, corpus=corpus,
 
 
235
  texts=df['lemma_tokens'],
236
  start=2,
237
  limit=10,
238
- step=1)
 
 
 
239
 
240
- k_max = max(coherence_values)
 
241
 
242
- global num_topics
243
- num_topics = coherence_values.index(k_max) + 2
244
 
245
- lda_model = gensim.models.ldamodel.LdaModel(corpus=corpus,
246
- id2word=id2word,
247
- num_topics=num_topics,
248
- random_state=100,
249
- chunksize=200,
250
- passes=10,
251
- per_word_topics=True)
252
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
253
  grid = {}
254
  grid['Validation_Set'] = {}
255
 
 
 
 
 
 
256
  alpha = [0.05, 0.1, 0.5, 1, 5, 10]
 
 
257
 
258
  beta = [0.05, 0.1, 0.5, 1, 5, 10]
 
259
 
260
- num_of_docs = len(corpus)
261
- corpus_sets = [gensim.utils.ClippedCorpus(corpus, int(num_of_docs*0.75)),
262
- corpus]
263
  corpus_title = ['75% Corpus', '100% Corpus']
264
  model_results = {'Validation_Set': [],
265
  'Alpha': [],
@@ -272,69 +307,87 @@ def dataframeProcessing(dataset):
272
  for i in range(len(corpus_sets)):
273
  for a in alpha:
274
  for b in beta:
275
- cv = compute_coherence_values2(corpus=corpus_sets[i], dictionary=id2word, k=num_topics, a=a, b=b)
 
 
 
 
276
  model_results['Validation_Set'].append(corpus_title[i])
277
  model_results['Alpha'].append(a)
278
  model_results['Beta'].append(b)
279
  model_results['Coherence'].append(cv)
280
 
281
- pbar.update(1)
282
  pd.DataFrame(model_results).to_csv('lda_tuning_results_new.csv', index=False)
283
  pbar.close()
284
 
285
  params_df = pd.read_csv('lda_tuning_results_new.csv')
286
- params_df = params_df[params_df.Validation_Set == '100% Corpus']
287
  params_df.reset_index(inplace=True)
288
-
289
  max_params = params_df.loc[params_df['Coherence'].idxmax()]
290
  max_coherence = max_params['Coherence']
291
  max_alpha = max_params['Alpha']
292
  max_beta = max_params['Beta']
293
-
294
- lda_model_final = gensim.models.ldamodel.LdaModel(corpus=corpus,
295
- id2word=id2word,
296
- num_topics=7,
297
- random_state=100,
298
- chunksize=200,
299
- passes=10,
300
- alpha=max_alpha,
301
- eta=max_beta,
302
- per_word_topics=True)
303
-
 
 
304
  coherence_model_lda = CoherenceModel(model=lda_model_final, texts=df['lemma_tokens'], dictionary=id2word,
305
- coherence='c_v')
306
  coherence_lda = coherence_model_lda.get_coherence()
307
 
 
 
 
 
 
 
 
 
 
 
 
 
308
  lda_topics = lda_model_final.show_topics(num_words=10)
309
 
310
  topics = []
311
  filters = [lambda x: x.lower(), strip_punctuation, strip_numeric]
312
- lda_topics_string = ''
313
 
314
  for topic in lda_topics:
315
- print(topic)
316
- lda_topics_string += 'Topic ' + str(topic[0]) + '\n' + str(topic[1]) + '\n\n'
317
  topics.append(preprocess_string(topic[1], filters))
318
 
319
- df['topic'] = [sorted(lda_model_final[corpus][text][0]) for text in range(len(df['original_tweets']))]
320
-
321
- def sort_topics(l):
322
- return(sorted(l, key=lambda x: x[1], reverse=True))
323
-
324
- df['topic'] = df['topic'].apply(sort_topics)
325
- df['topic_string'] = df['topic'].astype(str)
326
-
327
 
328
  df = df[df['topic'].map(lambda d: len(d)) > 0]
329
- df['topic'][0]
330
-
331
- df['max_topic'] = df['topic'].map(lambda row: assignTopic(row))
332
 
 
333
  topic_clusters = []
334
  for i in range(num_topics):
335
  topic_clusters.append(df[df['max_topic'].isin(([i]))])
336
  topic_clusters[i] = topic_clusters[i]['original_tweets'].tolist()
337
-
 
 
 
 
 
 
 
 
 
 
 
338
  for i in range(len(topic_clusters)):
339
  tweets = df.loc[df['max_topic'] == i]
340
  tweets['topic'] = tweets['topic'].apply(lambda x: get_topic_value(x, i))
@@ -343,22 +396,151 @@ def dataframeProcessing(dataset):
343
  tweets_sorted.drop_duplicates(subset=['original_tweets'])
344
  rep_tweets = tweets_sorted['original_tweets']
345
  rep_tweets = [*set(rep_tweets)]
346
- print('Topic ', i)
347
- print(rep_tweets[:5])
348
-
349
- output_df = df[['topic_string', 'original_tweets']].copy()
350
 
351
- return lda_topics_string, output_df
352
-
353
- def greet(name):
354
- return "Hello " + name + "!!"
355
-
356
- iface = gr.Interface(fn=dataframeProcessing,
357
- inputs=gr.Dropdown(["katip-december",
358
- "katipunan-december",
359
- "bgc-december",
360
- "bonifacio global city-december"],
361
- label="Dataset"),
362
- outputs=["text",
363
- gr.Dataframe(headers=['topic_string', 'original_tweets'])])
364
- iface.launch()
1
  import pandas as pd
2
+ import tweepy
 
 
3
  import re
4
+ import emoji
 
 
 
 
 
 
 
 
 
 
 
 
 
 
5
  import spacy
6
  import gensim
7
  import json
8
+ import string
9
+
10
  from spacy.tokenizer import Tokenizer
11
+ from gensim.parsing.preprocessing import STOPWORDS as SW
12
+ from wordcloud import STOPWORDS
13
+
14
  from gensim.corpora import Dictionary
 
15
  from gensim.models.coherencemodel import CoherenceModel
16
+ import matplotlib.pyplot as plt
 
 
 
17
  from pprint import pprint
18
+
19
+ import numpy as np
20
+ import tqdm
21
+ import pyLDAvis.gensim
22
+
23
  from gensim.parsing.preprocessing import preprocess_string, strip_punctuation, strip_numeric
24
 
25
+ import torch
26
+ from transformers import T5ForConditionalGeneration,T5Tokenizer
27
+ from googletrans import Translator
28
+
29
+ from bertopic import BERTopic
30
+ from umap import UMAP
31
+ from sklearn.feature_extraction.text import CountVectorizer
32
+
33
+ from operator import itemgetter
34
+
35
  import gradio as gr
36
 
37
+ import os
+ # Token redacted: read it from the environment instead of committing it (the variable name TWITTER_BEARER_TOKEN is an assumed convention)
+ bearer_token = os.environ['TWITTER_BEARER_TOKEN']
38
+ client = tweepy.Client(bearer_token=bearer_token)
39
+ nlp = spacy.load('en_core_web_lg')
40
+
41
+ def scrape(keyword_list):
42
+ if len(keyword_list) > 1:
43
+ keywords = '(' + ' OR '.join(keyword_list) + ')'
44
+ else:
45
+ keywords = keyword_list[0]
46
+
47
+ query = keywords + ' (lang:en OR lang:tl) -is:retweet'
48
+ max_results = 100
49
+ tweet_fields=['geo', 'id', 'lang', 'created_at']
50
+ expansions=['geo.place_id']
51
+ place_fields = ['contained_within', 'country', 'country_code', 'full_name', 'geo', 'id', 'name', 'place_type']
52
+
53
+ response = client.search_recent_tweets(
54
+ query=query,
55
+ max_results=max_results,
56
+ tweet_fields=tweet_fields,
57
+ expansions=expansions,
58
+ place_fields=place_fields
59
+ )
60
+
61
+ tweets = []
62
+ for x in response[0]:
63
+ tweets.append(str(x))
64
+
65
+ place_data = response[1]
66
+
67
+ global df
68
+ df = pd.DataFrame(tweets, columns=['tweet'])
69
+
70
+ return place_data
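The query construction inside `scrape` can be checked without calling the Twitter API if it is pulled out into a pure function. The following is a minimal sketch under that assumption; `build_query` is a hypothetical helper name and is not part of app.py, which inlines this logic.

```python
def build_query(keyword_list):
    """Build the recent-search query string the same way scrape() does.

    build_query is a hypothetical helper used only for illustration.
    """
    if not keyword_list:
        raise ValueError("keyword_list must contain at least one keyword")
    if len(keyword_list) > 1:
        # Several keywords are OR-ed together inside parentheses.
        keywords = '(' + ' OR '.join(keyword_list) + ')'
    else:
        keywords = keyword_list[0]
    # Restrict to English or Tagalog tweets and exclude retweets.
    return keywords + ' (lang:en OR lang:tl) -is:retweet'


if __name__ == "__main__":
    print(build_query(['pobla', 'poblacion']))
    print(build_query(['cubao']))
```

The parenthesised form for multiple keywords mirrors the CSV file names in this commit, such as `(pobla OR poblacion).csv`.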
71
+
72
+ def get_example(dataset):
73
+ global df
74
+ df = pd.read_csv(dataset + '.csv')
75
+
76
  def give_emoji_free_text(text):
77
  """
78
  Removes emoji's from tweets
 
92
  text = re.sub(r'http\S+', '', text)
93
  return text
94
 
95
+ def get_lemmas(text):
96
+ '''Used to lemmatize the processed tweets'''
97
+ lemmas = []
98
+
99
+ doc = nlp(text)
100
+
101
+ for token in doc:
102
+ if ((token.is_stop == False) and (token.is_punct == False)) and (token.pos_ != 'PRON'):
103
+ lemmas.append(token.lemma_)
104
+
105
+ return lemmas
106
+
107
  # Tokenizer function
108
  def tokenize(text):
109
  """
 
131
 
132
  return tokens
133
 
134
+ def cleaning():
135
  df.rename(columns = {'tweet':'original_tweets'}, inplace = True)
 
 
136
 
137
  # Apply the function above and get tweets free of emoji's
138
  call_emoji_free = lambda x: give_emoji_free_text(x)
 
143
  #Create a new column with url free tweets
144
  df['url_free_tweets'] = df['emoji_free_tweets'].apply(url_free_text)
145
 
146
+
147
+
148
+     with open('node_modules/stopwords-tl/stopwords-tl.json') as f:
149
+         tlStopwords = json.load(f)
150
+ stopwords = set(STOPWORDS)
151
+ stopwords.update(tlStopwords)
152
+ stopwords.update(['na', 'sa', 'ko', 'ako', 'ng', 'mga', 'ba', 'ka', 'yung', 'lang', 'di', 'mo', 'kasi'])
153
 
154
  # Tokenizer
155
  tokenizer = Tokenizer(nlp.vocab)
 
182
  # Make tokens a string again
183
  df['tokens_back_to_text'] = [' '.join(map(str, l)) for l in df['tokens']]
184
 
 
 
 
 
 
 
 
 
 
 
 
 
 
185
  df['lemmas'] = df['tokens_back_to_text'].apply(get_lemmas)
186
 
187
  # Make lemmas a string again
 
190
  # Apply tokenizer
191
  df['lemma_tokens'] = df['lemmas_back_to_text'].apply(tokenize)
192
 
193
+ def split_corpus(corpus, n):
194
+     """Yield successive chunks of n documents from corpus."""
195
+     for i in range(0, len(corpus), n):
196
+         yield corpus[i:i + n]
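Because `split_corpus` is a generator, it does no work until something iterates over it, and the chunks it yields have `n` documents each rather than forming `n` folds. The sketch below shows one way the chunks could be turned into held-out/training pairs without mutating the original list. It assumes a hypothetical helper `training_folds`, which is not part of app.py, and is not necessarily the validation scheme the author intended.

```python
def split_corpus(corpus, n):
    """Yield successive chunks of at most n documents."""
    for i in range(0, len(corpus), n):
        yield corpus[i:i + n]


def training_folds(corpus, k):
    """Yield (held_out, training) pairs for k roughly equal folds.

    training_folds is a hypothetical helper for illustration only.
    The input list is never modified.
    """
    if not corpus:
        return
    fold_size = -(-len(corpus) // k)  # ceiling division
    folds = list(split_corpus(corpus, fold_size))
    for i, held_out in enumerate(folds):
        # Concatenate every fold except the held-out one.
        training = [doc for j, fold in enumerate(folds) if j != i for doc in fold]
        yield held_out, training


if __name__ == "__main__":
    docs = list(range(10))
    for held_out, training in training_folds(docs, 5):
        print(held_out, training)
```

Building each training set as a new list keeps the shared corpus intact across iterations, so a separate untouched copy is not needed.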
197
+
198
+ def compute_coherence_values_base_lda(dictionary, corpus, texts, limit, coherence, start=2, step=1):
199
+ coherence_values = []
200
+ model_list = []
201
+ for num_topics in range(start, limit, step):
202
+ model = gensim.models.ldamodel.LdaModel(corpus=corpus,
203
+ num_topics=num_topics,
204
+ random_state=100,
205
+ chunksize=200,
206
+ passes=10,
207
+ per_word_topics=True,
208
+ id2word=dictionary)
209
+ model_list.append(model)
210
+ coherencemodel = CoherenceModel(model=model, texts=texts, dictionary=dictionary, coherence=coherence)
211
+ coherence_values.append(coherencemodel.get_coherence())
212
+
213
+ return model_list, coherence_values
214
+
215
+ def base_lda():
216
  # Create a id2word dictionary
217
+ global id2word
218
  id2word = Dictionary(df['lemma_tokens'])
219
 
220
  # Filtering Extremes
221
  id2word.filter_extremes(no_below=2, no_above=.99)
 
222
 
223
  # Creating a corpus object
224
+ global corpus
225
  corpus = [id2word.doc2bow(d) for d in df['lemma_tokens']]
226
+ global corpus_og
227
+ corpus_og = [id2word.doc2bow(d) for d in df['lemma_tokens']]
228
 
229
+ corpus_split = corpus
230
+ split_corpus(corpus_split, 5)
 
 
 
 
 
231
 
232
+ global coherence
233
+ coherence = 'c_v'
234
 
235
+ coherence_averages = [0] * 8
236
+ for i in range(5):
237
+         # Copy the list so that dropping a held-out document does not shrink the shared corpus
238
+         training_corpus = list(corpus_split)
239
+         training_corpus.pop(i)
240
+ model_list, coherence_values = compute_coherence_values_base_lda(dictionary=id2word, corpus=training_corpus,
241
  texts=df['lemma_tokens'],
242
  start=2,
243
  limit=10,
244
+ step=1,
245
+ coherence=coherence)
246
+ for j in range(len(coherence_values)):
247
+ coherence_averages[j] += coherence_values[j]
248
 
249
+ limit, start, step = 10, 2, 1
250
+ x = range(start, limit, step)
251
 
252
+ coherence_averages = [x / 5 for x in coherence_averages]
 
253
 
254
+ if coherence == 'c_v':
255
+ k_max = max(coherence_averages)
256
+ else:
257
+ k_max = min(coherence_averages, key=abs)
 
 
 
258
 
259
+ global num_topics
260
+ num_topics = coherence_averages.index(k_max) + 2
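The selection step above maps the position of the best averaged coherence score back to a topic count. A minimal sketch of that mapping follows, assuming a hypothetical function name `best_num_topics` and the same rule app.py uses: the maximum for `c_v`, and the value closest to zero for any other measure.

```python
def best_num_topics(coherence_averages, coherence='c_v', start=2, step=1):
    """Return the topic count whose averaged coherence score is best.

    best_num_topics is a hypothetical helper for illustration. Index 0 of
    coherence_averages corresponds to `start` topics, index 1 to
    `start + step`, and so on.
    """
    if coherence == 'c_v':
        # c_v: higher is better.
        best = max(coherence_averages)
    else:
        # Other measures (for example u_mass): app.py picks the score
        # closest to zero.
        best = min(coherence_averages, key=abs)
    return start + coherence_averages.index(best) * step


if __name__ == "__main__":
    print(best_num_topics([0.31, 0.42, 0.40]))
    print(best_num_topics([-2.5, -1.1, -3.0], coherence='u_mass'))
```

Passing `start` and `step` explicitly avoids the hard-coded `+ 2` offset, which is only correct while the search range begins at two topics with a step of one.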
261
+
262
+ def compute_coherence_values2(corpus, dictionary, k, a, b):
263
+ lda_model = gensim.models.ldamodel.LdaModel(corpus=corpus,
264
+ id2word=dictionary,
265
+ num_topics=k,
266
+ random_state=100,
267
+ chunksize=200,
268
+ passes=10,
269
+ alpha=a,
270
+ eta=b,
271
+ per_word_topics=True)
272
+ coherence_model_lda = CoherenceModel(model=lda_model,
273
+ texts=df['lemma_tokens'],
274
+ dictionary=id2word,
275
+ coherence='c_v')
276
+
277
+ return coherence_model_lda.get_coherence()
278
+
279
+ def hyperparameter_optimization():
280
  grid = {}
281
  grid['Validation_Set'] = {}
282
 
283
+ min_topics = 1
284
+ max_topics = 10
285
+ step_size = 1
286
+ topics_range = range(min_topics, max_topics, step_size)
287
+
288
  alpha = [0.05, 0.1, 0.5, 1, 5, 10]
289
+ # alpha.append('symmetric')
290
+ # alpha.append('asymmetric')
291
 
292
  beta = [0.05, 0.1, 0.5, 1, 5, 10]
293
+ # beta.append('symmetric')
294
 
295
+ num_of_docs = len(corpus_og)
296
+ corpus_sets = [gensim.utils.ClippedCorpus(corpus_og, int(num_of_docs*0.75)),
297
+ corpus_og]
298
  corpus_title = ['75% Corpus', '100% Corpus']
299
  model_results = {'Validation_Set': [],
300
  'Alpha': [],
 
307
  for i in range(len(corpus_sets)):
308
  for a in alpha:
309
  for b in beta:
310
+ cv = compute_coherence_values2(corpus=corpus_sets[i],
311
+ dictionary=id2word,
312
+ k=num_topics,
313
+ a=a,
314
+ b=b)
315
  model_results['Validation_Set'].append(corpus_title[i])
316
  model_results['Alpha'].append(a)
317
  model_results['Beta'].append(b)
318
  model_results['Coherence'].append(cv)
319
 
320
+ pbar.update(1)
321
  pd.DataFrame(model_results).to_csv('lda_tuning_results_new.csv', index=False)
322
  pbar.close()
323
 
324
  params_df = pd.read_csv('lda_tuning_results_new.csv')
325
+ params_df = params_df[params_df.Validation_Set == '75% Corpus']
326
  params_df.reset_index(inplace=True)
327
+ params_df = params_df.replace(np.inf, -np.inf)
328
  max_params = params_df.loc[params_df['Coherence'].idxmax()]
329
  max_coherence = max_params['Coherence']
330
  max_alpha = max_params['Alpha']
331
  max_beta = max_params['Beta']
332
+ max_validation_set = max_params['Validation_Set']
333
+
334
+ global lda_model_final
335
+ lda_model_final = gensim.models.ldamodel.LdaModel(corpus=corpus_og,
336
+ id2word=id2word,
337
+ num_topics=num_topics,
338
+ random_state=100,
339
+ chunksize=200,
340
+ passes=10,
341
+ alpha=max_alpha,
342
+ eta=max_beta,
343
+ per_word_topics=True)
344
+
345
  coherence_model_lda = CoherenceModel(model=lda_model_final, texts=df['lemma_tokens'], dictionary=id2word,
346
+ coherence='c_v')
347
  coherence_lda = coherence_model_lda.get_coherence()
348
 
349
+ return coherence_lda
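The alpha/beta search in `hyperparameter_optimization` is an exhaustive grid: every pair is scored and the pair with the highest coherence wins. The sketch below shows that pattern in isolation. It assumes a stand-in scoring function, `toy_score`, in place of `compute_coherence_values2`, so no LDA model is trained and the scores are not real coherence values.

```python
from itertools import product


def grid_search(alphas, betas, score):
    """Score every (alpha, beta) pair and return the best row and all rows."""
    results = [
        {'Alpha': a, 'Beta': b, 'Coherence': score(a, b)}
        for a, b in product(alphas, betas)
    ]
    best = max(results, key=lambda row: row['Coherence'])
    return best, results


def toy_score(a, b):
    """Stand-in for a coherence computation; peaks at alpha=0.5, beta=0.1."""
    return -abs(a - 0.5) - abs(b - 0.1)


if __name__ == "__main__":
    grid = [0.05, 0.1, 0.5, 1, 5, 10]
    best, results = grid_search(grid, grid, toy_score)
    print(len(results), best['Alpha'], best['Beta'])
```

Keeping the results in memory also removes the need to write the tuning table to CSV and read it back before selecting the best row.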
350
+
351
+ def assignMaxTopic(l):
352
+ maxTopic = max(l,key=itemgetter(1))[0]
353
+ return maxTopic
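`assignMaxTopic` expects a list of `(topic_id, probability)` pairs, which is the shape gensim returns for a document's topic distribution, and picks the id with the largest probability. A small self-contained sketch with made-up probabilities (the snake_case name is only for this example):

```python
from operator import itemgetter


def assign_max_topic(topic_probs):
    """Return the topic id with the highest probability."""
    # itemgetter(1) compares the pairs by probability; [0] takes the id.
    return max(topic_probs, key=itemgetter(1))[0]


if __name__ == "__main__":
    # Illustrative values only, not output from the model in app.py.
    doc_topics = [(0, 0.10), (1, 0.75), (2, 0.15)]
    print(assign_max_topic(doc_topics))
```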
354
+
355
+ def assignTopic(l):
356
+ topics = []
357
+ for x in l:
358
+         topics.append(x[0])
+     return topics
359
+
360
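+ def topic_assignment():
+     global df  # df is reassigned below; without this declaration it becomes a local name and raises UnboundLocalError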
361
  lda_topics = lda_model_final.show_topics(num_words=10)
362
 
363
  topics = []
364
  filters = [lambda x: x.lower(), strip_punctuation, strip_numeric]
 
365
 
366
  for topic in lda_topics:
 
 
367
  topics.append(preprocess_string(topic[1], filters))
368
 
369
+ df['topic'] = [sorted(lda_model_final[corpus_og][text][0]) for text in range(len(df['original_tweets']))]
 
 
 
 
 
 
 
370
 
371
  df = df[df['topic'].map(lambda d: len(d)) > 0]
372
+ df['max_topic'] = df['topic'].map(lambda row: assignMaxTopic(row))
 
 
373
 
374
+ global topic_clusters
375
  topic_clusters = []
376
  for i in range(num_topics):
377
  topic_clusters.append(df[df['max_topic'].isin(([i]))])
378
  topic_clusters[i] = topic_clusters[i]['original_tweets'].tolist()
379
+
380
+ def get_topic_value(row, i):
381
+ if len(row) == 1:
382
+ return row[0][1]
383
+ else:
384
+ try:
385
+ return row[i][1]
386
+ except Exception as e:
387
+ print(e)
388
+
389
+ global top_tweets
390
+ top_tweets = []
391
  for i in range(len(topic_clusters)):
392
  tweets = df.loc[df['max_topic'] == i]
393
  tweets['topic'] = tweets['topic'].apply(lambda x: get_topic_value(x, i))
 
396
  tweets_sorted.drop_duplicates(subset=['original_tweets'])
397
  rep_tweets = tweets_sorted['original_tweets']
398
  rep_tweets = [*set(rep_tweets)]
399
+ top_tweets.append(rep_tweets[:5])
400
+ # print('Topic ', i)
401
+ # print(rep_tweets[:5])
402
+
+ def topic_summarization(topic_groups):
+     device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
+
+     model = T5ForConditionalGeneration.from_pretrained("Michau/t5-base-en-generate-headline")
+     tokenizer = T5Tokenizer.from_pretrained("Michau/t5-base-en-generate-headline")
+     model = model.to(device)
+     translator = Translator()
+
+     headlines = []
+     for i in range(len(topic_groups)):
+         # Join the representative tweets of topic i into a single passage
+         tweets = " ".join(topic_groups[i])
+         # Translate the passage to English before generating a headline
+         out = translator.translate(tweets, dest='en')
+         text = out.text
+
+         max_len = 256
+
+         # Truncate to max_len tokens so long passages still fit the model input
+         encoding = tokenizer.encode_plus(text, max_length=max_len, truncation=True, return_tensors="pt")
+         input_ids = encoding["input_ids"].to(device)
+         attention_masks = encoding["attention_mask"].to(device)
+
+         beam_outputs = model.generate(
+             input_ids=input_ids,
+             attention_mask=attention_masks,
+             max_length=64,
+             num_beams=3,
+             early_stopping=True,
+         )
+
+         # Drop <pad> and </s> markers from the decoded headline
+         result = tokenizer.decode(beam_outputs[0], skip_special_tokens=True)
+         # Append one string per topic; `+=` with a string would extend the list one character at a time
+         headlines.append("Topic " + str(i) + " " + result)
+
+     return headlines
437
+
438
+ def compute_coherence_value_bertopic(topic_model):
439
+ topic_words = [[words for words, _ in topic_model.get_topic(topic)] for topic in range(len(set(topics))-1)]
440
+ coherence_model = CoherenceModel(topics=topic_words,
441
+ texts=df['lemma_tokens'],
442
+ corpus=corpus,
443
+ dictionary=id2word,
444
+ coherence=coherence)
445
+ coherence_score = coherence_model.get_coherence()
446
+
447
+ return coherence_score
448
+
449
+ def base_bertopic():
450
+ df['lemma_tokens_string'] = df['lemma_tokens'].apply(lambda x: ' '.join(x))
451
+ global id2word
452
+ id2word = Dictionary(df['lemma_tokens'])
453
+ global corpus
454
+ corpus = [id2word.doc2bow(d) for d in df['lemma_tokens']]
455
+
456
+ global umap_model
457
+ umap_model = UMAP(n_neighbors=15,
458
+ n_components=5,
459
+ min_dist=0.0,
460
+ metric='cosine',
461
+ random_state=100)
462
 
463
+ base_topic_model = BERTopic(umap_model=umap_model, language="english", calculate_probabilities=True)
464
+
465
+ topics, probabilities = base_topic_model.fit_transform(df['lemma_tokens_string'])
466
+
467
+ try:
468
+ print(compute_coherence_value_bertopic(base_topic_model))
469
+ except:
470
+ print('Unable to generate meaningful topics (Base BERTopic model)')
471
+
472
+ def optimized_bertopic():
473
+ vectorizer_model = CountVectorizer(max_features=1_000, stop_words="english")
474
+ optimized_topic_model = BERTopic(umap_model=umap_model,
475
+ language="multilingual",
476
+ n_gram_range=(1, 3),
477
+ vectorizer_model=vectorizer_model,
478
+ calculate_probabilities=True)
479
+
480
+ topics, probabilities = optimized_topic_model.fit_transform(df['lemma_tokens_string'])
481
+
482
+ try:
483
+ print(compute_coherence_value_bertopic(optimized_topic_model))
484
+ except:
485
+ print('Unable to generate meaningful topics, base BERTopic model if possible')
486
+
487
+ rep_docs = optimized_topic_model.representative_docs_
488
+
489
+ global top_tweets
490
+ top_tweets = []
491
+
492
+ for topic in rep_docs:
493
+ if topic == -1:
494
+ print('test')
495
+ continue
496
+ topic_docs = rep_docs.get(topic)
497
+
498
+ tweets = []
499
+ for doc in topic_docs:
500
+ index = df.isin([doc]).any(axis=1).idxmax()
501
+ # print(index)
502
+ tweets.append(df.loc[index, 'original_tweets'])
503
+ print(tweets)
504
+ top_tweets.append(tweets)
505
+
+ def main(dataset, model):
+     keyword_list = dataset.split(',')
+     # `examples` is a list of single-item lists, so compare against [dataset]
+     if [dataset] in examples:
+         get_example(keyword_list)
+         place_data = 'test'
+     else:
+         place_data = str(scrape(keyword_list))
+     cleaning()
+     if model == 'LDA':
+         base_lda()
+         coherence = hyperparameter_optimization()
+         topic_assignment()
+     else:
+         base_bertopic()
+         optimized_bertopic()
+
+     headlines = topic_summarization(top_tweets)
+     headlines = '\n'.join(str(h) for h in headlines)
+
+     return place_data, headlines
528
+
+ global examples
+ examples = [
+     ["katip,katipunan"],
+     ["bgc,bonifacio global city"],
+     ["pobla,poblacion"],
+     ["cubao"],
+     ["taft"]
+ ]
+ iface = gr.Interface(fn=main,
+                      inputs=["text",
+                              gr.Dropdown(["LDA",
+                                           "BERTopic"],
+                                          label="Model")
+                              ],
+                      examples=examples,
+                      outputs=["text",
+                               "text"])
+ iface.launch()
app_old.py ADDED
@@ -0,0 +1,364 @@
1
+ # Required Libraries
2
+
3
+ #Base and Cleaning
4
+ import json
5
+ import requests
6
+ import pandas as pd
7
+ import numpy as np
8
+ import emoji
9
+ import regex
10
+ import re
11
+ import string
12
+ from collections import Counter
13
+ import tqdm
14
+ from operator import itemgetter
15
+
16
+ #Visualizations
17
+ import plotly.express as px
18
+ import seaborn as sns
19
+ import matplotlib.pyplot as plt
20
+ import pyLDAvis.gensim
21
+ import chart_studio
22
+ import chart_studio.plotly as py
23
+ import chart_studio.tools as tls
24
+
25
+ #Natural Language Processing (NLP)
26
+ import spacy
27
+ import gensim
28
+ import json
29
+ from spacy.tokenizer import Tokenizer
30
+ from gensim.corpora import Dictionary
31
+ from gensim.models.ldamulticore import LdaMulticore
32
+ from gensim.models.coherencemodel import CoherenceModel
33
+ from gensim.parsing.preprocessing import STOPWORDS as SW
34
+ from sklearn.decomposition import LatentDirichletAllocation, TruncatedSVD
35
+ from sklearn.feature_extraction.text import CountVectorizer, TfidfVectorizer
36
+ from sklearn.model_selection import GridSearchCV
37
+ from pprint import pprint
38
+ from wordcloud import STOPWORDS
39
+ from gensim.parsing.preprocessing import preprocess_string, strip_punctuation, strip_numeric
40
+
41
+ import gradio as gr
42
+
43
+ def give_emoji_free_text(text):
44
+ """
45
+ Removes emoji's from tweets
46
+ Accepts:
47
+ Text (tweets)
48
+ Returns:
49
+ Text (emoji free tweets)
50
+ """
51
+ emoji_list = [c for c in text if c in emoji.EMOJI_DATA]
52
+ clean_text = ' '.join([str for str in text.split() if not any(i in str for i in emoji_list)])
53
+ return clean_text
54
+
55
+ def url_free_text(text):
56
+ '''
57
+ Cleans text from urls
58
+ '''
59
+ text = re.sub(r'http\S+', '', text)
60
+ return text
61
+
+ # Tokenizer function
+ def tokenize(text):
+     """
+     Parses a string into a list of semantic units (words)
+     Args:
+         text (str): The string that the function will tokenize.
+     Returns:
+         list: tokens parsed out
+     """
+     # Removing url's
+     pattern = r"http\S+"
+
+     # Each step works on the result of the previous one, not on the raw text
+     tokens = re.sub(pattern, "", text) # https://www.youtube.com/watch?v=O2onA4r5UaY
+     tokens = re.sub('[^a-zA-Z 0-9]', '', tokens)
+     tokens = re.sub('[%s]' % re.escape(string.punctuation), '', tokens) # Remove punctuation
+     tokens = re.sub(r'\w*\d\w*', '', tokens) # Remove words containing numbers
+     # tokens = re.sub('@*!*$*', '', text) # Remove @ ! $
+     tokens = tokens.strip(',') # TESTING THIS LINE
+     tokens = tokens.strip('?') # TESTING THIS LINE
+     tokens = tokens.strip('!') # TESTING THIS LINE
+     tokens = tokens.strip("'") # TESTING THIS LINE
+     tokens = tokens.strip(".") # TESTING THIS LINE
+
+     tokens = tokens.lower().split() # Make text lowercase and split it
+
+     return tokens
88
+
89
+ def compute_coherence_values(dictionary, corpus, texts, limit, start=2, step=1):
90
+ coherence_values = []
91
+ model_list = []
92
+ for num_topics in range(start, limit, step):
93
+ model = gensim.models.ldamodel.LdaModel(corpus=corpus,
94
+ num_topics=num_topics,
95
+ random_state=100,
96
+ chunksize=200,
97
+ passes=10,
98
+ per_word_topics=True,
99
+ id2word=id2word)
100
+ model_list.append(model)
101
+ coherencemodel = CoherenceModel(model=model, texts=texts, dictionary=dictionary, coherence='c_v')
102
+ coherence_values.append(coherencemodel.get_coherence())
103
+
104
+ return model_list, coherence_values
105
+
106
+ def compute_coherence_values2(corpus, dictionary, k, a, b):
107
+ lda_model = gensim.models.ldamodel.LdaModel(corpus=corpus,
108
+ id2word=id2word,
109
+ num_topics=num_topics,
110
+ random_state=100,
111
+ chunksize=200,
112
+ passes=10,
113
+ alpha=a,
114
+ eta=b,
115
+ per_word_topics=True)
116
+ coherence_model_lda = CoherenceModel(model=lda_model, texts=df['lemma_tokens'], dictionary=id2word, coherence='c_v')
117
+
118
+ return coherence_model_lda.get_coherence()
119
+
120
+ def assignTopic(l):
121
+ maxTopic = max(l,key=itemgetter(1))[0]
122
+ return maxTopic
123
+
124
+ def get_topic_value(row, i):
125
+ if len(row) == 1:
126
+ return row[0][1]
127
+ else:
128
+ return row[i][1]
129
+
130
+ def dataframeProcessing(dataset):
131
+ # Opening JSON file
132
+ f = open('stopwords-tl.json')
133
+ tlStopwords = json.loads(f.read())
134
+ stopwords = set(STOPWORDS)
135
+ stopwords.update(tlStopwords)
136
+ stopwords.update(['na', 'sa', 'ko', 'ako', 'ng', 'mga', 'ba', 'ka', 'yung', 'lang', 'di', 'mo', 'kasi'])
137
+
138
+ global df
139
+ df = pd.read_csv(dataset + '.csv')
140
+ df.rename(columns = {'tweet':'original_tweets'}, inplace = True)
141
+ df = df.apply(lambda row: row[df['language'].isin(['en'])])
142
+ df.reset_index(inplace=True)
143
+
144
+ # Apply the function above and get tweets free of emoji's
145
+ call_emoji_free = lambda x: give_emoji_free_text(x)
146
+
147
+ # Apply `call_emoji_free` which calls the function to remove all emoji's
148
+ df['emoji_free_tweets'] = df['original_tweets'].apply(call_emoji_free)
149
+
150
+ #Create a new column with url free tweets
151
+ df['url_free_tweets'] = df['emoji_free_tweets'].apply(url_free_text)
152
+
153
+ # Load spacy
154
+ # Make sure to restart the runtime after running installations and libraries tab
155
+ nlp = spacy.load('en_core_web_lg')
156
+
157
+ # Tokenizer
158
+ tokenizer = Tokenizer(nlp.vocab)
159
+
160
+
161
+ # Custom stopwords
162
+ custom_stopwords = ['hi','\n','\n\n', '&', ' ', '.', '-', 'got', "it's", 'it’s', "i'm", 'i’m', 'im', 'want', 'like', '$', '@']
163
+
164
+
165
+ # Customize stop words by adding to the default list
166
+ STOP_WORDS = nlp.Defaults.stop_words.union(custom_stopwords)
167
+
168
+ # ALL_STOP_WORDS = spacy + gensim + wordcloud
169
+ ALL_STOP_WORDS = STOP_WORDS.union(SW).union(stopwords)
170
+
171
+
172
+ tokens = []
173
+ STOP_WORDS.update(stopwords)
174
+
175
+ for doc in tokenizer.pipe(df['url_free_tweets'], batch_size=500):
176
+ doc_tokens = []
177
+ for token in doc:
178
+ if token.text.lower() not in STOP_WORDS:
179
+ doc_tokens.append(token.text.lower())
180
+ tokens.append(doc_tokens)
181
+
182
+ # Makes tokens column
183
+ df['tokens'] = tokens
184
+
185
+ # Make tokens a string again
186
+ df['tokens_back_to_text'] = [' '.join(map(str, l)) for l in df['tokens']]
187
+
188
+ def get_lemmas(text):
189
+ '''Used to lemmatize the processed tweets'''
190
+ lemmas = []
191
+
192
+ doc = nlp(text)
193
+
194
+ # Something goes here :P
195
+ for token in doc:
196
+ if ((token.is_stop == False) and (token.is_punct == False)) and (token.pos_ != 'PRON'):
197
+ lemmas.append(token.lemma_)
198
+
199
+ return lemmas
200
+
201
+ df['lemmas'] = df['tokens_back_to_text'].apply(get_lemmas)
202
+
203
+ # Make lemmas a string again
204
+ df['lemmas_back_to_text'] = [' '.join(map(str, l)) for l in df['lemmas']]
205
+
206
+ # Apply tokenizer
207
+ df['lemma_tokens'] = df['lemmas_back_to_text'].apply(tokenize)
208
+
209
+ # Create a id2word dictionary
210
+ global id2word
211
+ id2word = Dictionary(df['lemma_tokens'])
212
+
213
+ # Filtering Extremes
214
+ id2word.filter_extremes(no_below=2, no_above=.99)
215
+ print(len(id2word))
216
+
217
+ # Creating a corpus object
218
+ corpus = [id2word.doc2bow(d) for d in df['lemma_tokens']]
219
+
220
+ lda_model = gensim.models.ldamodel.LdaModel(corpus=corpus,
221
+ id2word=id2word,
222
+ num_topics=5,
223
+ random_state=100,
224
+ chunksize=200,
225
+ passes=10,
226
+ per_word_topics=True)
227
+
228
+ pprint(lda_model.print_topics())
229
+ doc_lda = lda_model[corpus]
230
+
231
+ coherence_model_lda = CoherenceModel(model=lda_model, texts=df['lemma_tokens'], dictionary=id2word, coherence='c_v')
232
+ coherence_lda = coherence_model_lda.get_coherence()
233
+
234
+ model_list, coherence_values = compute_coherence_values(dictionary=id2word, corpus=corpus,
235
+ texts=df['lemma_tokens'],
236
+ start=2,
237
+ limit=10,
238
+ step=1)
239
+
240
+ k_max = max(coherence_values)
241
+
242
+ global num_topics
243
+ num_topics = coherence_values.index(k_max) + 2
244
+
245
+ lda_model = gensim.models.ldamodel.LdaModel(corpus=corpus,
246
+ id2word=id2word,
247
+ num_topics=num_topics,
248
+ random_state=100,
249
+ chunksize=200,
250
+ passes=10,
251
+ per_word_topics=True)
252
+
253
+ grid = {}
254
+ grid['Validation_Set'] = {}
255
+
256
+ alpha = [0.05, 0.1, 0.5, 1, 5, 10]
257
+
258
+ beta = [0.05, 0.1, 0.5, 1, 5, 10]
259
+
260
+ num_of_docs = len(corpus)
261
+ corpus_sets = [gensim.utils.ClippedCorpus(corpus, int(num_of_docs*0.75)),
262
+ corpus]
263
+ corpus_title = ['75% Corpus', '100% Corpus']
264
+ model_results = {'Validation_Set': [],
265
+ 'Alpha': [],
266
+ 'Beta': [],
267
+ 'Coherence': []
268
+ }
269
+ if 1 == 1:
270
+ pbar = tqdm.tqdm(total=540)
271
+
272
+ for i in range(len(corpus_sets)):
273
+ for a in alpha:
274
+ for b in beta:
275
+ cv = compute_coherence_values2(corpus=corpus_sets[i], dictionary=id2word, k=num_topics, a=a, b=b)
276
+ model_results['Validation_Set'].append(corpus_title[i])
277
+ model_results['Alpha'].append(a)
278
+ model_results['Beta'].append(b)
279
+ model_results['Coherence'].append(cv)
280
+
281
+ pbar.update(1)
282
+ pd.DataFrame(model_results).to_csv('lda_tuning_results_new.csv', index=False)
283
+ pbar.close()
284
+
285
+ params_df = pd.read_csv('lda_tuning_results_new.csv')
286
+ params_df = params_df[params_df.Validation_Set == '100% Corpus']
287
+ params_df.reset_index(inplace=True)
288
+
289
+ max_params = params_df.loc[params_df['Coherence'].idxmax()]
290
+ max_coherence = max_params['Coherence']
291
+ max_alpha = max_params['Alpha']
292
+ max_beta = max_params['Beta']
293
+
294
+ lda_model_final = gensim.models.ldamodel.LdaModel(corpus=corpus,
295
+ id2word=id2word,
296
+ num_topics=7,
297
+ random_state=100,
298
+ chunksize=200,
299
+ passes=10,
300
+ alpha=max_alpha,
301
+ eta=max_beta,
302
+ per_word_topics=True)
303
+
304
+ coherence_model_lda = CoherenceModel(model=lda_model_final, texts=df['lemma_tokens'], dictionary=id2word,
305
+ coherence='c_v')
306
+ coherence_lda = coherence_model_lda.get_coherence()
307
+
308
+ lda_topics = lda_model_final.show_topics(num_words=10)
309
+
310
+ topics = []
311
+ filters = [lambda x: x.lower(), strip_punctuation, strip_numeric]
312
+ lda_topics_string = ''
313
+
314
+ for topic in lda_topics:
315
+ print(topic)
316
+ lda_topics_string += 'Topic ' + str(topic[0]) + '\n' + str(topic[1]) + '\n\n'
317
+ topics.append(preprocess_string(topic[1], filters))
318
+
319
+ df['topic'] = [sorted(lda_model_final[corpus][text][0]) for text in range(len(df['original_tweets']))]
320
+
321
+ def sort_topics(l):
322
+ return(sorted(l, key=lambda x: x[1], reverse=True))
323
+
324
+ df['topic'] = df['topic'].apply(sort_topics)
325
+ df['topic_string'] = df['topic'].astype(str)
326
+
327
+
328
+ df = df[df['topic'].map(lambda d: len(d)) > 0]
329
+ df['topic'][0]
330
+
331
+ df['max_topic'] = df['topic'].map(lambda row: assignTopic(row))
332
+
333
+ topic_clusters = []
334
+ for i in range(num_topics):
335
+ topic_clusters.append(df[df['max_topic'].isin(([i]))])
336
+ topic_clusters[i] = topic_clusters[i]['original_tweets'].tolist()
337
+
338
+ for i in range(len(topic_clusters)):
339
+ tweets = df.loc[df['max_topic'] == i]
340
+ tweets['topic'] = tweets['topic'].apply(lambda x: get_topic_value(x, i))
341
+ # tweets['topic'] = [row[i][1] for row in tweets['topic']]
342
+ tweets_sorted = tweets.sort_values('topic', ascending=False)
343
+ tweets_sorted.drop_duplicates(subset=['original_tweets'])
344
+ rep_tweets = tweets_sorted['original_tweets']
345
+ rep_tweets = [*set(rep_tweets)]
346
+ print('Topic ', i)
347
+ print(rep_tweets[:5])
348
+
349
+ output_df = df[['topic_string', 'original_tweets']].copy()
350
+
351
+ return lda_topics_string, output_df
352
+
353
+ def greet(name):
354
+ return "Hello " + name + "!!"
355
+
356
+ iface = gr.Interface(fn=dataframeProcessing,
357
+ inputs=gr.Dropdown(["katip-december",
358
+ "katipunan-december",
359
+ "bgc-december",
360
+ "bonifacio global city-december"],
361
+ label="Dataset"),
362
+ outputs=["text",
363
+ gr.Dataframe(headers=['topic_string', 'original_tweets'])])
364
+ iface.launch()
cubao.csv ADDED
@@ -0,0 +1,236 @@
1
+ ,tweet
2
+ 0,@HorizonSpaCubao @BaldzyP Yey! Cubao na ulit. I should try him na! 🌟🙌🏻
3
+ 1,Renan available in cubao branch for walk in https://t.co/FbELfVlla3
4
+ 2,"galang gala ako today, pauwi pa lang me HAJDBWKFJS araneta cubao lang usapan hanggang sa nakarating rin kami sa moa 😭"
5
+ 3,New york cubao 😌
6
+ 4,"anybody interested for on festival off concert tickets?
7
+
8
+ 3 ub tickets (same rows)
9
+ - will be selling same price
10
+ - can meet up in cubao/ortigas/makati
11
+
12
+ dm if interested 😊"
13
+ 5,See you bukas cubao ARAT MEET💦 DM ME‼️ https://t.co/M5Ag7E46Yr
14
+ 6,"Wts lfb ph
15
+
16
+ A1 Twenty Five Live in Manila
17
+ Discounted Tickets for sale
18
+
19
+ - pwede ipalalamove now yung ticket free sf or meetup tomorrow sa 15th avenue cubao at 9am :)
20
+
21
+ 🎫 Orchestra B left
22
+ 🎰 4 slots only https://t.co/HiYVUVbg80"
23
+ 7,Ok fully paid na ako for Batanes trip. Kailangan ko na lang mamalimos ng pocket money sa tulay ng Cubao. Emi 🥲🥹
24
+ 8,ako na gusto patirin si coc nung sep. 15 dahil paikot ikot sa cubao kaso ang laki nya🥹 https://t.co/CyOYRx2sI2
25
+ 9,@heyyannahh bukas ata meron sa cubao
26
+ 10,"aala na ba talaga cubao side trip
27
+ #altercubao #alter #jakolph #zoomjakol #alterbagets"
28
+ 11,@hump_fri @oohhhkaye @BembangBiik Alam na alam mo ang cubao ano? Wag kana mahiya matagal ng buking yan panahon pa ni marimar roxas
29
+ 12,@DamascusBlade Oh YES. Arizona and the Rapture Royalties queens know how to lip sync in an unhinged manner so well. That’s Cubao drag 🥹🥹🥹
30
+ 13,"quickie goodcatch bagets here cubao area only dm me
31
+ #altercubao #alter #jakolph #zoomjakol #alterbagets https://t.co/bDW2m80wvh"
32
+ 14,"quickie jaks cubao quickie hmu bagets here
33
+ #altercubao #alter #jakolph #zoomjakol #alterbagets"
34
+ 15,ang sarap magikot sa cubao HUHU
35
+ 16,"jakol quickie cubao area dm me bagets here kattapos lang mag gym
36
+ #altercubao #alter #jakolph #zoomjakol #alterbagets"
37
+ 17,"cubao area omly quickie hmu bagets here
38
+ #altercubao #alter #jakolph #zoomjakol #alterbagets"
39
+ 18,"785 Aurora Blvd Cubao QC
40
+ ☎️ 09054471994
41
+ 📞 09998864822
42
+
43
+ Marvin https://t.co/E5uAUO0Ahn"
44
+ 19,"cubao quickie hmu bagets here
45
+ #altercubao #alter #jakolph #zoomjakol #alterbagets"
46
+ 20,"sino nasa cubao side trip dm me @
47
+ #alterbagets #alterph #altercubao #alterqc #zoomjakol #jakolph #jakol"
48
+ 21,#AtinIto #WestPhSea EDSA Cubao North Bound https://t.co/NHPrJ9kF9i
49
+ 22,tataxi sana ako pauwi dito sa cubao pero natatakot ako ahakajs
50
+ 23,@avesouls heeseung cubao
51
+ 24,@Mang_asar3 @oohhhkaye @BembangBiik Parang mas familiar ka? HAHAHAHA. Galing ka siguro don ano? HAHAHAHAHAHA. Alam ko lang sa Cubao ay terminal ng Van. 🤣
52
+ 25,malupit chismis ko guys cubao gaming to
53
+ 26,@joesalad90 HELLO San sa Cubao
54
+ 27,"@SelectaPH sa ibang lugar po ang daming holo, dito sa cubao ang ilap 😭😭😭😭😭😭😭😭🤧🤧🤧🤧"
55
+ 28,"sino asa cubao ngayon na up for quickie hmu
56
+
57
+ #alterqc #jakolph #altercubao #zoomjakol #jakol #alterjakol"
58
+ 29,"Sup katatapos lang mag gym sino g side trip gateway cubao dm me
59
+
60
+ #alterqc #jakolph #altercubao #zoomjakol #jakol #alterjakol"
61
+ 30,side trip gateway cubao#alterqc #jakolph #altercubao #zoomjakol #jakol #alterjakol
62
+ 31,Sino asa cubao ngayin #alterqc #jakolph #altercubao #zoomjakol #jakol #alterjakol
63
+ 32,HUWAG TULARAN FROM LAGUNA DUMAYO SA CUBAO DAPAT ANG HALFWAY AY SA SM MEGAMALL CHAROT https://t.co/KlMWhnmC77
64
+ 33,"katatapos lang mag gym sino g side trip gateway cubao dm me
65
+
66
+ #alterqc #jakolph #altercubao #zoomjakol #jakol #alterjakol"
67
+ 34,@jenjaemlovr sa cubao lang ?!
68
+ 35,All I can say is fuck cubao ❤️
69
+ 36,Sino asa cubao now pasabit
70
+ 37,"WTS- LFB
71
+
72
+ Vinci PC-300
73
+ Kyler SID - 270
74
+ Winston ID picture -100
75
+
76
+ meet cubao farmers/araneta tom only
77
+ check out on shopee https://t.co/AQlFdC7oaK"
78
+ 38,all roads lead to Cubao anw
79
+ 39,hindi alam pa’no umuwi —> pumuntang cubao
80
+ 40,walking new york cubao https://t.co/XgDSGc00im
81
+ 41,"785 Aurora Blvd Cubao QC
82
+ ☎️ 09054471994
83
+ 📞 09998864822
84
+
85
+ Jelson https://t.co/87N3vsd01I"
86
+ 42,"785 Aurora Blvd Cubao QC
87
+ ☎️ 09054471994
88
+ 📞 09998864822 https://t.co/JJmwNbFOh3"
89
+ 43,"785 Aurora Blvd Cubao QC
90
+ ☎️ 09054471994
91
+ 📞 09998864822 https://t.co/gSfKK8v6wn"
92
+ 44,"785 Aurora Blvd Cubao QC
93
+ ☎️ 09054471994
94
+ 📞 09998864822
95
+
96
+ Jordan 🔥 Matt https://t.co/4q7EP8j8I1"
97
+ 45,"785 Aurora Blvd Cubao QC
98
+ ☎️ 09054471994
99
+ 📞 09998864822 https://t.co/SZfgoP1bP2"
100
+ 46,"785 Aurora Blvd Cubao QC
101
+ ☎️ 09054471994
102
+ 📞 09998864822
103
+
104
+ Balmond https://t.co/L0XwCdTZLK"
105
+ 47,Makabiyahe nga may event pala ngayon sa cubao andun si boss Park Seo-joon hahaha
106
+ 48,Namimiss ko panoorin yung busy streets nang Araneta - Cubao 🥺
107
+ 49,"ZB1 100 DAYS OF DEBUT!
108
+
109
+ Freebies for tomorrow's event at Cubao, QC. Just approach me when you see me ☺️
110
+
111
+ See you there zeroses! 🌹😘 https://t.co/rYhF3YPsuL https://t.co/OAoOxB1TAo"
112
+ 50,@srslyclrsa IM STILL EMO NASA CUBAO EXPO AKO
113
+ 51,"Nag punta ako ng cubao para mapagod, kasi yun yung goal! Mapagod para makatulog agad😭"
114
+ 52,Cubao ibabaw 🤭 https://t.co/qj1WXQCPJm
115
+ 53,Tara trip with pops. Timog-Morato-Cubao area
116
+ 54,The Royal Class bus is set to take its first batch of trips this October. https://t.co/rDmAJIkz2y https://t.co/rCYWIkUBRV
117
+ 55,"@theskinsensei After OBar, isunod mo Doc sa list mo ng puluntahan ang Rapture sa Cubao 😁"
118
+ 56,cubao now
119
+ 57,"@ army friends/moots na naghahanap ng pupuntahan na cse, cubao area lang 'to oh~ very accessible 😁"
120
+ 58,@woosanstation gago ka pag balik ko ng cubao humanda ka
121
+ 59,"METROBASE Traffic Update as of 7:00PM
122
+
123
+ WHEN: October 14, 2023
124
+
125
+ A. MAJOR ROADS:
126
+
127
+ *EDSA
128
+ SOUTHBOUND
129
+ - New york Cubao to Ortigas split, moderate to slow moving (due to volume of vehicles)
130
+ - Ortigas to Shaw split, moderate to slow moving (due to volume of vehicles)
131
+ - Boni, moderate to… https://t.co/1FcX7f3TKr https://t.co/bPqgHRN4vR"
132
+ 60,"🖤 #PJMFreed 🤍
133
+
134
+ 🧋 A FREE CUPSLEEVE EVENT CELEBRATING PARK JIMIN'S BIRTHDAY
135
+
136
+ 🗓 October 21, 2023 (Saturday)
137
+ 📍 Drink Lab Café, Araneta City (Cubao)
138
+ ⏰ 12:00 NN - 6:30 PM
139
+
140
+ ✨ 60 SLOTS ONLY ✨
141
+
142
+ ✍🏼 REGISTER HERE: https://t.co/7sKcmAcecG https://t.co/F7rqDAlrRi"
143
+ 61,I was the one with the La Luna Bag and She was the one with the Ken Bag and SB19 pins. She was with her niece and their from Montalban I am from San Mateo. Their going to vist her 2 children going to cavite while I was going to my School STI Cubao. +++
144
+ 62,"To Ate Em,
145
+
146
+ If ever you read this, dito sa twt Thank you po for talking to me 🥹
147
+ I had a bad week, dahil sa acads and personal life circumstances happening. Talking to you while on the way to cubao in the mini bus, made things lighter for me. +++ https://t.co/N63zbS21h7"
148
+ 63,"@khgbbovely Yep, yep. 'Yung sinuot mo sa Cubao right??? They both looked cute on you. 😊😊"
149
+ 64,"Uwi na tayo..
150
+ Walang ganap dito sa cubao.."
151
+ 65,@Carl_Cubao Hala d nman ako cute 😁
152
+ 66,"My weekend the past weeks/months:
153
+ Nagpunta sa bgc > megamall > araneta cubao > moa to paranaque > mckinley > katipunan.
154
+
155
+ 🔜 office to megamall for team bowling > angeles > sangkyu cse > bgc with couz > chanyeol cse(?) > rizal for tb 😂😂😂 isa lang massabe ko: d na nakapagdiet. https://t.co/Nj39UfViwM"
156
+ 67,Dreamland hotel cubao🍆🍆🍆🍆🍆😎 https://t.co/BXpYvcobJV
157
+ 68,@hwajoongyu HAHAHA TRUE 😭😭😭 hanggang baclaran lang kasi to Cubao Ave ang ini-MRT ko eh HAHAHHAHA
158
+ 69,San goods magpa-massage? near Cubao sana huhu badly needed rn
159
+ 70,"@hump_fri @oohhhkaye @BembangBiik Binuking mo nanaman ang troll farm nyo mga Ulahh Loooo Tah Bhu Ghuk 😂😂😂😂😂😂
160
+
161
+ Yan ba ang nasa Cubao? 😂😂😂😂😂"
162
+ 71,"𝗙𝗢𝗥 𝗠𝗔𝗦𝗦𝗔𝗚𝗘 𝗕𝗢𝗢𝗞𝗜𝗡𝗚𝗦 & 𝗜𝗡𝗤𝗨𝗜𝗥𝗜𝗘𝗦, 𝗖𝗔𝗟𝗟 or 𝗧𝗘𝗫𝗧 our 𝗙𝗥𝗢𝗡𝗧 𝗗𝗘𝗦𝗞: 𝟬𝟵𝟭𝟵𝟬𝟵𝟵𝟳𝟬𝟰𝟱 | 𝟴𝟮𝟰𝟭𝟮𝟬𝟱𝟭.
163
+
164
+ 𝗪𝗘𝗟𝗟𝗡𝗘𝗦𝗦𝗩𝗜𝗟𝗟𝗘 𝗠𝗮𝘀𝘀𝗮𝗴𝗲 𝗦𝗽𝗮
165
+ Unit 1, 88 New York Avenue, Cubao, Quezon City, Philippines https://t.co/Whos7dsyj8"
166
+ 72,kabado bente kanina sa cubao pls bat kamuka ni jay :DD
167
+ 73,"baka may pa free tix po kau jan, dito po lang po ako pakalat kalat sa cubao 🥹🥹🥹
168
+
169
+ park seo jun park seo joon in manila
170
+
171
+ #parkseojun #parkseojunmanila"
172
+ 74,"@skz_ldl Sayang di tayo nag-abot haha! Dumaan lang ako, papunta ako Cubao, nood ng concert hihi"
173
+ 75,Park Seo Jun caused so much traffic here in Cubao 😄 pero ok lang siya naman yun eh #AranetaCity
174
+ 76,"baka may pa free tix po kau jan, dito po lang po ako pakalat kalat sa cubao 🥲🥲🥲
175
+
176
+ park seo jun park seo joon in manila
177
+
178
+ #parkseojun #parkseojunmanila https://t.co/vo2XwOWQep"
179
+ 77,"baka may pa free tix po kau jan, dito po lang po ako pakalat kalat sa cubao 🥲🥲
180
+
181
+ park seo jun park seo joon in manila
182
+
183
+ #parkseojun #parkseojunmanila"
184
+ 78,bakit ba kung sino-sino napapadpad sa cubao😭😭😭😭
185
+ 79,"📋𝐁𝐨𝐨𝐤 𝐚 𝐌𝐀𝐒𝐒𝐀𝐆𝐄 𝐧𝐨𝐰.
186
+
187
+ Looking for sure Client 😊
188
+ ✅ giving you good and Relaxing Massage😘
189
+
190
+ -May Service Clients from:
191
+ ✅Pasay ✅Taguig ✅Cubao ✅ Mandaluyong
192
+ ✅ BGC ✅Guadalupe ✅ Recto ✅ Pasig, and more
193
+ Messags or call
194
+ 09292367409 https://t.co/TtyvIlK7wt"
195
+ 80,"@Yujiseph If around cubao, Novotel is good.
196
+
197
+ If around manila, I'm not as familiar eh, but Bayview and manila hotel are nice.
198
+
199
+ These are the pricey ones tho. I haven't tried bnbs around the area but for sure it will be more cost effective"
200
+ 81,Cubao Gateway now 📍 tara good s*x 💦
201
+ 82,alam kong kakatapos lang kmagic pero may cse ba for gyeom around cubao? 🥹🥹
202
+ 83,wala ba ads sa cubao si hee? 🙇
203
+ 84,This Mcdo store in Cubao has been playing 6 Mariah Carey songs already. I know the one in charge of the songs is a gay. You can't convince either 💀😭
204
+ 85,"Kuya Konduktor: /nag aaya ng pasahero pag Riverbanks at Cubao/ Tara, tara, tara! Sa Riverbanks na tara, marami rides doon!
205
+ Me na sa Rizal bababa: 😒😒😒😒"
206
+ 86,"baka may pa free tix po kau jan, dito po lang po ako pakalat kalat sa cubao 🥲🥲🥲
207
+
208
+ park seo jun park seo joon in manila
209
+
210
+ #parkseojun #parkseojunmanila https://t.co/jS3eB05NLO"
211
+ 87,Perfect shirt for Cubao Expo. 💜💜💜 #IU #아이유 #leejieun https://t.co/grwTBNhkpq
212
+ 88,Hanapin kita dito sa Gateway Cubao. 😂 https://t.co/iIvss1prvX
213
+ 89,"at cubao, waiting umandar ang bus https://t.co/nnkBMv9T5R"
214
+ 90,@HorizonSpaCubao @BaldzyP Nice bumalik na si Nathan sa Cubao Branch. ☺️❤️
215
+ 91,bakit puro senior citizen sa tropical hut cubao?
216
+ 92,hello vinyl friends this vintage record store in cubao ex is on a 50% clearance sale! https://t.co/IH6UUPn1hR
217
+ 93,@eunoyah lika na sa cubao
218
+ 94,Where to go after Cubao? https://t.co/aMGGOfRE6W
219
+ 95,Dito siya sa cubao baka sakaling makita si park seojun gumagala char HAHHAHAHAHA https://t.co/0x7rUCFqJb
220
+ 96,"PTPA‼️
221
+ GOOD DAY PO IM MARCO A FREELANCE MASSEUR
222
+
223
+ AVAILABLE FOR HOME AND HOTEL SERVICE ONLY NO PLACE
224
+
225
+ •CAVITE
226
+ •PASAY
227
+ •MANILA
228
+ •CUBAO
229
+ •ETC
230
+
231
+ DM ME https://t.co/Vset9YYcAr"
232
+ 97,"Ang astig ng sleeper bus aka #TheRoyalClass bus ng @VictoryLinerInc! Tara na’t mag-Baguio👋😃
233
+
234
+ https://t.co/2pHJTO81yb"
235
+ 98,May susunod na sa character count ng Justice Lourdes Paredes San Diego Avenue (dating Main Avenue) sa Cubao. 👀 https://t.co/lgcqKzzNhM
236
+ 99,@coderinthebox Hahahaha guni2 lang ni @alexanderrieta1 yan feeling nasa ibang bansa pero troll na Pah Tay Go Tom lang sa cubao ang Ulahh Loooo Tah Bhu Ghuk 😂😂😂😂
requirements.txt CHANGED
@@ -9,6 +9,10 @@ chart_studio
  autopep8
  transformers
  sentencepiece
- bert-extractive-summarizer
  tqdm
- wordcloud
+ wordcloud
+ gensim
+ transformers
+ googletrans==3.1.0a0
+ bertopic flair
+ scikit-learn==1.1.0
taft.csv ADDED
@@ -0,0 +1,207 @@
1
+ ,tweet
2
+ 0,@Pullma_Pfanger @Cushdog5 @mmpadellan Outdated and false information. It’s been confirmed. https://t.co/SlYORP8yyo
3
+ 1,"Ewan ko, I usually buy food or treat my oomf somewhere to eat or relax.
4
+ I mean, dude, trimming and cleaning is not a joke. Not a reward but just to show them I appreciate them and their efforts.
5
+
6
+ PS. Di po sya taga Lasalle Taft pero dumayo
7
+
8
+ char 🤣 https://t.co/ykSrI0305v"
9
+ 2,"📅 Today's @ 3:00 PM
10
+ Taft School vs. Choate Rosemary Hall School - Prep Schools High School Football
11
+ https://t.co/9BMGat7pqb
12
+ 🎟️🎥Watch Now➠ https://t.co/5XLM3ckm43
13
+ https://t.co/9fSx3lyegG
14
+ sdslzdlgfd"
15
+ 3,Ano hmm punta ka na taft potekk
16
+ 4,galing talaga ng la salle taft one week na down pa din website nila angaling. 🤦🤦🤦
17
+ 5,"28’ | Taft makes an important block after Richards had shifted into a yard of space to shoot.
18
+
19
+ 🐯 1-1 ⚪️"
20
+ 6,"📅 Today's @ 2:45 PM
21
+ Worcester Academy vs. Taft School - Prep Schools High School Boys Soccer
22
+ https://t.co/7ME6E6KoXu
23
+ 🎟️🎥Watch Live➠ https://t.co/Sr46vANKOH
24
+ https://t.co/oUfMTqWyPe
25
+ cemcpsdzqg"
26
+ 7,@dave_taft @markknoller The FCC cannot control the editorial decisions of local broadcast stations under the First Amendment
27
+ 8,Taft Bike Club with the Mayor!!! @NISDTaft #wegopublic https://t.co/p0NpgUp5c8
28
+ 9,grabe sobrang traffic sa taft and walang masakyan kanina huhu
29
+ 10,@ProjectLincoln I'm wondering which president was fatter. @realDonaldTrump or William Howard Taft at 300 lbs?
30
+ 11,"1’ | Ward forces an early corner from Taft.
31
+
32
+ 🐯 0-0 ⚪️"
33
+ 12,@_amdej tara lipat sa taft
34
+ 13,"@ProjectLincoln Who wore it better, President Taft or President Daft? https://t.co/AB1BkHi6wt"
35
+ 14,@tellitPAC @GovChristie We don’t need a Taft right now. Stop taking people’s money. He doesn’t stand a chance. He knows it and whatever organization this is knows it.
36
+ 15,Taft Museum's new curator Ellen Roberts discusses plans for future exhibitions - Cincinnati Business Courier https://t.co/fa0qXvgAEN via @BusinessCourier #robertsduncanson
37
+ 16,"Bequests of roughly $6 million each will support the endowments of the @CincyMuseum, @cincyartmuseum, and the Taft Museum of Art. https://t.co/kUtzWNHTrm"
38
+ 17,Taft carfun anyone #altertaft
39
+ 18,"Taft School vs. Choate Rosemary Hall School - Prep Schools High School Football
40
+ https://t.co/oIyPz3SpEA
41
+ 🔴Live Now➠ https://t.co/ksSe0LNXLk
42
+ 📅 Saturday, October 14, 2023 ⏰ 3:00 PM"
43
+ 19,。:°ஐ went to a cafe in taft ! ࿐ ˚ . ✦ https://t.co/RtkMDptKlm
44
+ 20,"Happy Birthday, Dwight D. Eisenhower! Here Hoover is seen with President Eisenhower at dedication ceremony of Bell Tower Memorial for Robert Taft. @ikelibrary (31-1959-17) #happybirthday https://t.co/XAymydrsIg"
+ 21,"West Coast, Messed Coast™ — Kaepernick's Friend Is 'Rounding Up Jews' at Stanford
+
+ https://t.co/58ZeCGEpkD"
+ 22,📍taft
+ 23,@ysblldpn My conyo guy from Taft ☺️ (Taft b shea hayssss basta lozol)
+ 24,"SPORTS: The DLSU Green Booters Team 1 blazed their way to the Ang Liga Season 19 Division 1 quarterfinals with a win-loss record of 3-1, following their victory over the CSB Blazers, 2-0, earlier tonight, October 14, at the UP Diliman Football pitch. https://t.co/gjQjoeZgB3"
+ 25,"West Coast, Messed Coast™ — Kaepernick's Friend Is 'Rounding Up Jews' at Stanford https://t.co/N4H6DslTnn"
+ 26,Andrea Mitchell say she misses the days when she could sit down for a whiskey & cigar with President Taft.
+ 27,may nakasalubong ako around UN taft kanina na pogii 😭😭😭 i think he’s from NU based on his uniform since it’s blue hashaha basta nagka eye contact kami pota ang pogi like singkit sya naka salamin na moreno 😭
+ 28,taft avenue sa gabi feeling ko nasa mv ako ni zild
+ 29,"NISD WATER POLO BOYS AREA PLAYOFFS TODAY @ NISD SWIM CENTER & PSJA. GAMES @ 8, 9 & 1030 & 12. COME CATCH THE ACTION! @NISD @NISDBrandeis
+ @clark_swim
+ @HawksAquatics
+ @NISDHCHS
+ @Jay_Aquatics
+ @aquatics_oc
+ @taft_club
+ TICKETS: https://t.co/JladCMWctU
+ PSJA: https://t.co/owkfJ8LOHa https://t.co/RsEDAO2Pno"
+ 30,"Worcester Academy vs. Taft School - Prep Schools High School Boys Soccer
+ https://t.co/BhK1dtd1sx
+ 🔴Watch Now➠ https://t.co/mAZocrUeBF
+ 📅 Saturday, October 14, 2023 ⏰ 2:45 PM"
+ 31,@gary_taft Nice
+ 32,idk kung paano ko isisingit ang birthday bash for hao pero ayoko pumayag na wala akong ganap for him so if you guys are inch pls lmk and need ko rin ng help sa pago-organize 👉👈🥹 jollibee kids party para pwede mag geng geng and this will be around taft lang !! https://t.co/FuLNnkAjmj
+ 33,"Samantala, sunod na makahaharap ng Taft-based squad ang Ateneo Men’s Chess Team bukas, Oktubre 15, sa ganap na ika-1 ng hapon sa parehong lugar. https://t.co/XG5IvINvYO"
+ 34,"LF / WTB HAMILTON PH TICKETS
+
+ 3 HAMILTON tickets
+ no preferred seats (as long as seated together)
+ preferably physical ticket (but not required)
+ preferably meet up (taft area)"
+ 35,Dudley Taft y Manu Lanvin mano a mano en Madrid https://t.co/1MogCn9Qbz @DudleyTaft @ManuLavin @blackmooncreep
+ 36,"LF / WTB HAMILTON PH TICKETS
+
+ 3 HAMILTON tickets
+ no preferred seats (as long as seated together)
+ preferably meet up (taft area)"
+ 37,@MMDA Kelan kaya maaayos ang traffic sa EDSA corner Taft Avenue? Grabe ang loading and unloading sa kanto nito na sanhi ng heavy traffic.
+ 38,"What are my turn-ons? Meet me on @Flirt4Free. I'll give you all the juicy details.
+ https://t.co/hJMwqFD9l3 https://t.co/jiUYcwWlGB"
+ 39,"@taft_gvs plays against Choate under the 💡 on KJM Field
+
+ 🦏⚽️ #taftathletics #gorhinos #gobigred #wearetaft https://t.co/h0agVh03ie"
+ 40,"@threeinningfan Fascinating & informative! Was that William Howard Taft I saw in the Chief Justice position? Anyway, great job! 👍⚾️👍⚾️"
+ 41,@MMDA Manila both lanes Taft hanggang Quiapo
+ 42,"TODAY: Are ya ready kids? @TCTCincinnati presents The Spongebob Musical: Youth Edition at Taft Theatre 🌴 Get more info and tickets here: https://t.co/AiGNczxkQ3
+
+ 2:00 PM SHOW
+ DOORS: 1:00 PM
+ SHOW: 2:00 PM
+
+ 5:00PM SHOW
+ DOORS: 4:00 PM
+ SHOW: 5:00 PM https://t.co/gSh9HzJ7Ws"
+ 43,Anybody in Taft?? Jk
+ 44,taft people are wildd i swear
+ 45,jgh from españa to taft for 2 FUCKING HOURS WTF
+ 46,@babygurldex dont make me come all the way to taft eme
+ 47,"Yung patience ng bf ko lord, please.
+ Akala mo naman di sanay na traffic sa taft"
+ 48,"from Taft Manila, to Bulacan then to Araneta…what a very productive day you had and you made a lot of people happy!
+
+ #FiveBreakUpsAndARomance
+ #ALDENRichards
+ @aldenrichards02 https://t.co/eVL2YpdWjd"
+ 49,"Travelled the entire length of Taft and headed straight to moa, all on foot. This is a new record for me. And my feet don’t even hurt yet :D"
+ 50,"aaaa traffic wth, 2hrs na asa byahe pa taft lang ako ahshehahdhs"
+ 51,may ukay ukay ba doon sa taft 🥹
+ 52,Wtf may bok chicken sa taft?!?!
+ 53,"SPORTS: DLSU Green Booters Team 1 successfully notched a win in their Taft derby match against the CSB Blazers, 2-0, in their Ang Liga Season 19 Division 1 matchup earlier tonight, October 14, at the UP Diliman Football Pitch. Isaac Anoh led the way with his first-half brace. https://t.co/tYOqTwh28L"
+ 54,"Nasa MRT, Taft stationnn palangggg yawaaa"
+ 55,"WTS LFB SMISKI
+ WTT LFT SMISKI
+
+ Have: SMISKI thinking
+ Want: SMISKI duck, daydreaming, nap time
+ Mop: gcash
+ Mod: flash/ggx/meetup
+ Loc: santa rosa laguna/taft
+
+ Fresh pull !!
+ With box and foil https://t.co/FWUQlbv8F7"
+ 56,"Continuous STRIDE ⚡️
+
+ The DLSU Green Booters (12 pts) put on a statement after a hard-fought match against their Taft neighbor, the Benilde Blazers (5 pts), 1-0! 🏹
+
+ #GoLaSalle #AnimoLaSalle #DLSUSports https://t.co/Dnix6Qxyuw"
+ 57,Walking the entire length of Taft otw to Moa https://t.co/G7lLIReYdP
+ 58,Gago just witnessed a hostage taking dito aa taft 😭.
+ 59,im @ taft na tf tf
+ 60,@Suckerlingamist taft
+ 61,@fewyearsearlier c hokkaido uni polsci blockm8 araw-araw ig story nasa ibang bansa tas boogsh mkakasalubong m somewhere sa taft tsaka si 🌹 lv bayong girlie na nagfeflex ng designer bags at credit cards sa mga di daw afford pero nakikiagaw ng scholarship stipend meganonnnn 🤣
+ 62,sino nasa taft pahiram jacket pls andito ako sa projuice malapit na maging frozen parang hotdog
+ 63,"ღ wts lfb ღ
+
+ 🐨 koala - php 550 (payo php 500)
+ 🐱 can also trade siamese to pig / lop ear hipper or same demand (nfs, wtt)
+
+ mop: gcash
+ mod: sdd or meet up around taft
+
+ 🏷️ sonny angel wts koala reg wtt cat life series https://t.co/IdQxePA2MD"
+ 64,https://t.co/8oGdk3fp5v goodcatch from taft 1on1 #zoomjakol
+ 65,"SPORTS: DLSU Green Booters Team 1 take on Taft neighbor CSB Blazers at the UP Diliman Football Pitch today, October 14, in hopes of returning to winning form after their previous loss in the Ang Liga Season 19 Division 1. https://t.co/wdGsVPtEkv"
+ 66,@cheylwse sa taft at katipunan po
+ 67,"BATTLE OF TAFT ⚽️: The DLSU Green Booters are going up against familiar opponents, the CSB Blazers, as they look to get back up on the win column.
+
+ Follow @dlsusports for LIVE play-by-play updates! 🏹
+
+ #GoLaSalle #AnimoLaSalle"
+ 68,"Taft School vs Choate Rosemary Hall School High School Football
+ 🎥🔴Live Events: https://t.co/pQDPdf2IAz
+ Saturday, October 14 @ 3p.
+ @CRHFootball @goChoate @ChoateVBHockey @TaftFootballCT @TaftRhinos @TaftSchool @AndreCrews6 @TaftFootballCT https://t.co/xILtoS9Gls"
+ 69,"@yeontoosiast Unnie Cafe, Taft! ✨"
+ 70,"Granada Hills football runs through Taft – Daily News
+ https://t.co/pIuC865dxf"
+ 71,"West Coast, Messed Coast™ — Kaepernick's Friend Is 'Rounding Up Jews' at Stanford https://t.co/4uD3XRRneW [How, pray tell, does this worthless subhuman continue to make his way into the news?! A mystery of life...]"
+ 72,Granada Hills football runs through Taft https://t.co/9gJx5v3qKW
+ 73,Granada Hills football runs through Taft https://t.co/7tvNTwuNJv
+ 74,Granada Hills football runs through Taft https://t.co/Qk4459g3Mz
+ 75,Granada Hills football runs through Taft https://t.co/GG9bEqzQqA
+ 76,"bored in Taft
+
+ #altertaft #alterpasay #altervitocruz #alterquirino"
+ 77,"@cyrilamarchand,@KhaitanCo,Taft Law, Foley & Lardner act on iLink Group's fundraising from TrueNorth
+
+ iLink Group has raised approximately ₹624.2 crore
+ https://t.co/cNL0KJpn7P"
+ 78,"@PAGMETER Notice you have Wasco rank 9 in D5 they are 3-1 in SSL and will probably beat Taft and put them 4-1 SSL going into Shafter game
+ There Loses Are to
+ 9-0 Tehachapi
+ 7-1 Kennedy
+ 6-2 Golden Valley
+ So what are chance’s they jump to D-4 playoff ?"
+ 79,@altrmnl1 /*dumeretcho taft
+ 80,argh bat 30 mins away pa yung dlsu taft sa monumento I wanna see ate Andrei
+ 81,"Volvo 240 GLT
+ Altis 1.8 G ( manual version )
+ TAFT GT 4X4
+ YARIS 2004 J TYPE https://t.co/tZZpy7Iek8"
+ 82,@vonderleyen Thank you Bill for Drei Wetter Taft
+ 83,"@beinlibertarian I wonder if Taft, and JP Morgan had anything to do with that.. There was a lot of rich and powerful people on that boat. JP Morgan was even rumored to be expected on the boat, but he was not. This was the first 9-11. Shorty after in 1914 we go to war (WW1). https://t.co/lutOnwwVo9"
+ 84,Who’s in taft tara?
+ 85,who’s around taft rn 😗 g
+ 86,@Taft_HS @TaftHSAthletics @CPLAthletics @network14cps @NadigNewspapers @ChiPubSchools Congratulations!!!
+ 87,"Tomorrow’s Games:
+
+ 2PM- (4-2) Sam Houston Hurricanes vs. (4-2) McCollum Cowboys @ Alamo Stadium
+
+ 7PM- (4-2) Sotomayor Wildcats vs. (4-2) Taft Red Raiders @ Farris Stadium"
+ 88,South Kern Sol | ‘Change is here in town’: Taft City School District is urged to pick maps Cherry or Plum in districting process https://t.co/P7vdafhf1V
+ 89,"It’s final. Taft Union 34, Cesar Chavez 25"
+ 90,"@Mormonger @benjaminpacini or even to defend them against their enemies.” (Ezra Taft Benson, America at the Crossroads, August 30, 1969.)"
+ 91,"rainy weather in Taft = jakol ☔️😈🍆✊🏼💦
+
+ #altertaft #alterpasay #altervitocruz #alterquirino"
+ 92,@Once_dre @jooye_onthebass HAHAHAHAHA SA TAFT
+ 93,"When They Say They Side With the Hama-Nazis, Believe Them https://t.co/vSfKT7Mj5y"
+ 94,"Final: Taft 34, Chavez 25"
+ 95,"West Coast, Messed Coast™ — Kaepernick's Friend Is 'Rounding Up Jews' at Stanford https://t.co/aROloyHAUr"
+ 96,"West Coast, Messed Coast™ — Kaepernick's Friend Is 'Rounding Up Jews' at Stanford https://t.co/aROloyI8JZ"