Yelp2018

class dhg.data.Yelp2018(data_root=None)[source]

Bases: dhg.data.base.BaseData

The Yelp2018 dataset is collected for user-item recommendation task. This dataset is adopted from the 2018 edition of the Yelp challenge. Wherein, the local businesses like restaurants and bars are viewed as the items.

The Content of the Yelp2018 dataset includes the following:

  • num_users: The number of users: \(31,668\).

  • num_items: The number of items: \(38,048\).

  • num_interactions: The number of interactions: \(1,561,406\).

  • train_adj_list: The train adjacency list.

  • test_adj_list: The test adjacency list.

Note

The first item of each line in the adj_list is the user id, and the rest is the item id.

Parameters

data_root (str, optional) – The data_root has stored the data. If set to None, this function will auto-download from server and save into the default direction ~/.dhg/datasets/. Defaults to None.