dhg.data
Base Class
- class dhg.data.BaseData(name, data_root=None)
The Base Class of all datasets.
self._content = {
    'item': {
        'upon': [
            {'filename': 'part1.pkl', 'md5': '', 'bk_url': None},
            {'filename': 'part2.pkl', 'md5': '', 'bk_url': None},
        ],
        'loader': loader_function,
        'preprocess': [datapipe1, datapipe2],
    },
    ...
}
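As a rough illustration of how one entry of such a structure might be consumed, the sketch below loads the files listed under 'upon' with the 'loader' function and then applies each 'preprocess' step in order. This is an assumption about the intended semantics, not DHG's actual implementation; `load_pickles` and `resolve_item` are hypothetical helpers written for this example.

```python
import pickle
import tempfile
from pathlib import Path


def load_pickles(paths):
    """Hypothetical loader: read every pickle file and concatenate the lists."""
    merged = []
    for path in paths:
        with open(path, "rb") as fh:
            merged.extend(pickle.load(fh))
    return merged


def resolve_item(spec, data_root):
    """Assumed semantics: load the 'upon' files, then run each 'preprocess' step in order."""
    paths = [Path(data_root) / f["filename"] for f in spec["upon"]]
    value = spec["loader"](paths)
    for step in spec.get("preprocess", []):
        value = step(value)
    return value


# Build two small pickle files so the example runs on its own.
data_root = Path(tempfile.mkdtemp())
for filename, payload in [("part1.pkl", [3, 1]), ("part2.pkl", [2])]:
    with open(data_root / filename, "wb") as fh:
        pickle.dump(payload, fh)

content = {
    "item": {
        "upon": [
            {"filename": "part1.pkl", "md5": "", "bk_url": None},
            {"filename": "part2.pkl", "md5": "", "bk_url": None},
        ],
        "loader": load_pickles,
        # Stand-ins for datapipe1 and datapipe2: sort, then freeze as a tuple.
        "preprocess": [sorted, tuple],
    },
}

item = resolve_item(content["item"], data_root)
```

Keeping the loader and the preprocessing steps as plain callables in the dictionary means a dataset can be described declaratively, with the file list, the parsing function, and the transformations all visible in one place.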
- property content
Return the content of the dataset.
- fetch_files(files)
Download the files if they do not already exist, and check them.
- Parameters
files (List[Dict[str, str]]) – The files to download. Each element of the list is a dict with at least two keys: filename and md5. If the extra key bk_url is provided, it is used to download the file from the backup URL.
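The download-and-check behaviour described above can be sketched as follows. This is a minimal, self-contained illustration and not the library's code: `fetch_files_sketch`, `md5_of`, the injected `download` callable, and the `default_url` placeholder are all hypothetical names, and a fake downloader is used so that the example needs no network access.

```python
import hashlib
import tempfile
from pathlib import Path


def md5_of(path):
    """Return the hex MD5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(8192), b""):
            digest.update(chunk)
    return digest.hexdigest()


def fetch_files_sketch(files, data_root, download, default_url="https://example.invalid/"):
    """Download each missing file, then verify its MD5 checksum."""
    data_root = Path(data_root)
    for spec in files:
        target = data_root / spec["filename"]
        if not target.exists():
            # Assumption: use the backup URL when the dict provides one.
            url = spec.get("bk_url") or default_url + spec["filename"]
            download(url, target)
        if md5_of(target) != spec["md5"]:
            raise ValueError(f"MD5 mismatch for {target.name}")


requested = []


def fake_download(url, target):
    """Stand-in for a real HTTP download: record the URL and write fixed bytes."""
    requested.append(url)
    target.write_bytes(b"hello")


root = tempfile.mkdtemp()
expected = hashlib.md5(b"hello").hexdigest()
files = [
    {"filename": "a.bin", "md5": expected},
    {"filename": "b.bin", "md5": expected, "bk_url": "https://backup.invalid/b.bin"},
]
fetch_files_sketch(files, root, fake_download)
```

Calling `fetch_files_sketch` a second time with the same arguments triggers no further downloads, because both files already exist and only the checksum comparison runs.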
Vertex Classification Datasets
- The Cora dataset is a citation network dataset for the vertex classification task.
- The PubMed dataset is a citation network dataset for the vertex classification task.
- The Citeseer dataset is a citation network dataset for the vertex classification task.
- The Cooking 200 dataset is collected from Yummly.com for the vertex classification task.
User-Item Recommender Datasets
- The MovieLens1M dataset is collected for the user-item recommendation task.
- The AmazonBook dataset is collected for the user-item recommendation task.
- The Yelp2018 dataset is collected for the user-item recommendation task.
- The Gowalla dataset is collected for the user-item recommendation task.
Contributions of new datasets are welcome!