unb-lamfo-nlp-mcti
/

NLP-Recommendation-MCTI

Jmilagres committed on Dec 16, 2022

Commit

aeb84d9

1 Parent(s): d665fc0

Update README.md

Files changed (1)

README.md +7 -10

@@ -117,6 +117,7 @@ The databases (ml_100k, ml_1m and jester) are built-in the surprise package for
 Hyperparameters -
   `n_users` : number of simulated users in the database;
   `n_ratings` : number of simulated rating events in the database.
 This is a fictional dataset in which each rating is drawn uniformly at random (from 1 to 5) and assigned to one of the simulated users of the recommender system being designed in this research project.
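
As a rough illustration of the description above, the following minimal sketch draws `n_ratings` rating events with a uniformly distributed rating from 1 to 5. The function name `simulate_uniform_ratings`, the `n_items` parameter, and the uniform choice of user and item are assumptions made for this example; they are not taken from the project's code.

```python
import random


def simulate_uniform_ratings(n_users, n_ratings, n_items, seed=None):
    """Return a list of (user_id, item_id, rating) tuples.

    Hypothetical helper: user and item are picked uniformly at random
    (an assumption), and the rating is a uniform integer from 1 to 5.
    """
    rng = random.Random(seed)
    events = []
    for _ in range(n_ratings):
        user_id = rng.randrange(n_users)   # one of the simulated users
        item_id = rng.randrange(n_items)   # assumed: item chosen uniformly
        rating = rng.randint(1, 5)         # inclusive on both ends
        events.append((user_id, item_id, rating))
    return events


if __name__ == "__main__":
    for event in simulate_uniform_ratings(n_users=3, n_ratings=5, n_items=10, seed=0):
        print(event)
```

The resulting tuples could then be loaded into a pandas DataFrame or a surprise `Dataset`, depending on how the rest of the pipeline consumes ratings.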
@@ -136,17 +137,13 @@ This is a fictional dataset based in the choice of an uniformly distributed rand
 ```
 Hyperparameters -
-        `n_users` : number of simulated users in the database;
-        `n_ratings` : number of simulated rating events in the database.
-        This first LDA based dataset builds a model with K = `n_users` topics. LDA topics
-        are used as proxies for simulated users with different clusters of interest. At first
-        a random opportunity is chosen, then the amount of a randomly chosen topic inside the description
-        is multiplied by five. The ceiling operation of this result is the rating that the fictional user
-        will give to that opportunity.
-        Because the amount of each topic predicted by the model is dissolved among various topics,
-        it is very rare to find an opportunity that has a higher LDA value. The consequence is that this dataset
-        has really low volatility and the major part of ratings are equal to 1.
 ```python
     def read_lda_topics(self):

 Hyperparameters -
   `n_users` : number of simulated users in the database;
   `n_ratings` : number of simulated rating events in the database.
 This is a fictional dataset in which each rating is drawn uniformly at random (from 1 to 5) and assigned to one of the simulated users of the recommender system being designed in this research project.
 ```
 Hyperparameters -
+  `n_users` : number of simulated users in the database;
+  `n_ratings` : number of simulated rating events in the database.
+This first LDA-based dataset builds a model with K = `n_users` topics. LDA topics are used as proxies for simulated users with different clusters of interest. First a random opportunity is chosen, then the proportion of a randomly chosen topic in its description is multiplied by five. The ceiling of this result is the rating that the fictional user gives to that opportunity. Because the topic weight that the model predicts for each description is spread across many topics, an opportunity rarely has a high value for any single topic. As a consequence, this dataset has very low variability and most ratings are equal to 1.
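
The rating rule described above can be sketched as follows. This is a minimal illustration only: `simulate_lda_ratings` and the `doc_topic` argument (one row of topic proportions per opportunity description, one column per topic) are hypothetical names, and the clamp to a minimum rating of 1 for a zero proportion is an assumption. It also shows why low ratings dominate: any proportion of at most 0.2 maps to a rating of 1, and the proportions for one description sum to 1, so with many topics most weights fall in that range.

```python
import math
import random


def simulate_lda_ratings(doc_topic, n_ratings, seed=None):
    """Return a list of (user_id, item_id, rating) tuples.

    doc_topic[i][k] is assumed to hold the proportion of topic k in
    opportunity i; each topic stands in for one simulated user.
    """
    rng = random.Random(seed)
    n_items = len(doc_topic)
    n_users = len(doc_topic[0])  # K = n_users topics
    events = []
    for _ in range(n_ratings):
        item_id = rng.randrange(n_items)   # random opportunity
        user_id = rng.randrange(n_users)   # random topic, i.e. simulated user
        weight = doc_topic[item_id][user_id]
        # Ceiling of five times the topic proportion; the lower bound of 1
        # is an assumption for the case where the proportion is exactly 0.
        rating = max(1, math.ceil(5 * weight))
        events.append((user_id, item_id, rating))
    return events


if __name__ == "__main__":
    # Toy proportions for two opportunities and two topics (made up).
    toy = [[0.9, 0.1], [0.5, 0.5]]
    print(simulate_lda_ratings(toy, n_ratings=5, seed=0))
```

In practice `doc_topic` would come from the fitted LDA model's per-document topic distribution rather than from hand-written values.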
 ```python
     def read_lda_topics(self):