Closed RuihongQiu closed 3 years ago
I found out that in the preprocessing of S3Rec, there is a time filtering. Only interactions in 2019 will be considered. Are there any similar args can be used in RecBole?
Hi,
In RecBole 0.2.1, you can use lowest_val
and highest_val
to filter interactions.
Details can be found in our API Doc.
Thank you for the great suggestion.
Using the following config:
min_user_inter_num: 5
min_item_inter_num: 5
lowest_val:
timestamp: 1546264800
highest_val:
timestamp: 1577714400
Already have a similar result:
31 Jul 10:08 INFO yelp
The number of users: 30500
Average actions of users: 10.399750811502017
The number of items: 20069
Average actions of items: 15.805361769982062
The number of inters: 317182
The sparsity of the dataset: 99.94818172387231%
Hi,
Thank you for providing so many processed datasets.
I have a question when using the
yelp
dataset with RecBole.I mainly use it for the sequential recommendation.
The general statistics of
yelp
is like: From S3Rec paper From BERT4Rec paperThey say they both filter out the items and users appearing less than 5 times. I firstly download the processed dataset from google drive. When I set the dataset config in RecBole as:
The logging statistics of
yelp
is as:Why is it so large and so different from the other papers? Is that my config is wrong somewhere?