Datasets
Sample datasets for use with PyKEEN, borrowed from https://github.com/ZhenfengLei/KGDatasets.
New datasets (inheriting from pykeen.datasets.base.Dataset) can be registered with PyKEEN using the pykeen.datasets group in Python entrypoints in your own setup.py or setup.cfg package configuration. They are loaded automatically with pkg_resources.iter_entry_points().
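As a rough sketch of the registration mechanism described above, the snippet below lists whatever is currently registered under the pykeen.datasets entry-point group. It uses importlib.metadata from the standard library instead of pkg_resources, which is a substitution made for this example, and the helper name list_dataset_entry_points is hypothetical. A package would typically declare an entry of the form "my_dataset = my_package.datasets:MyDataset" under that group, where all three names are placeholders.

```python
from importlib.metadata import entry_points


def list_dataset_entry_points(group="pykeen.datasets"):
    """Return a mapping of entry-point name to its "module:attr" target.

    Hypothetical helper for illustration; it is not part of PyKEEN.
    """
    eps = entry_points()
    if hasattr(eps, "select"):
        # Python 3.10+ returns an object with a select() method.
        selected = eps.select(group=group)
    else:
        # Python 3.8 and 3.9 return a plain dict keyed by group name.
        selected = eps.get(group, [])
    return {ep.name: ep.value for ep in selected}


if __name__ == "__main__":
    # Prints nothing if no installed package registers a dataset.
    for name, target in sorted(list_dataset_entry_points().items()):
        print(f"{name} -> {target}")
```

If no installed package declares anything in the group, the helper returns an empty mapping.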
Functions
Get the dataset.
Return whether the dataset is registered in PyKEEN.
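The two functions summarized above might be used together as in the following sketch. The names get_dataset and has_dataset, the pykeen.datasets import path, the dataset= keyword, and the "nations" key are assumptions, since the function names do not appear on this page; summarize_dataset is a hypothetical helper. The import is guarded so the snippet degrades gracefully when PyKEEN is not installed.

```python
def summarize_dataset(key):
    """Return (entities, relations, training triples), or None if unavailable.

    Hypothetical helper; the PyKEEN names used below are assumptions.
    """
    try:
        # Assumed import path for the two functions described above.
        from pykeen.datasets import get_dataset, has_dataset
    except ImportError:
        return None  # PyKEEN is not installed in this environment.
    if not has_dataset(key):
        return None  # The key is not registered.
    dataset = get_dataset(dataset=key)
    training = dataset.training
    return training.num_entities, training.num_relations, training.num_triples


if __name__ == "__main__":
    print(summarize_dataset("nations"))
```

Checking registration before loading avoids triggering a download or a lookup error for a key that PyKEEN does not know about.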
Classes
The Hetionet dataset is a large biological network.
The Kinships dataset.
The Nations dataset.
The OpenBioLink dataset.
The PyKEEN First Filtered OpenBioLink 2020 Dataset.
The PyKEEN Second Filtered OpenBioLink 2020 Dataset.
The low-quality variant of the OpenBioLink dataset.
The CoDEx small dataset.
The CoDEx medium dataset.
The CoDEx large dataset.
The OGB BioKG dataset.
The OGB WikiKG dataset.
The UMLS dataset.
The FB15k dataset.
The FB15k-237 dataset.
The WN18 dataset.
The WN18-RR dataset.
The YAGO3-10 dataset is a subset of YAGO3 that only contains entities with at least 10 relations.
The DRKG dataset.
The ConceptNet dataset from [speer2017].
The Clinical Knowledge Graph (CKG) dataset from [santos2020].
The CSKG dataset.
The DBpedia50 dataset.
Class Inheritance Diagram
Utility classes for constructing datasets.
Classes
Contains a lazy reference to a training, testing, and validation dataset.
A dataset that has already been loaded.
A dataset that has lazy loading.
Contains a lazy reference to a training, testing, and validation dataset.
Contains a lazy reference to a remote dataset that is loaded if needed.
A dataset with all three of train, test, and validation sets as URLs.
A remote dataset stored as a tar file.
A remote dataset stored as a zip file.
Contains a lazy reference to a remote dataset that is loaded if needed.
Loads a dataset that’s a single file inside a tar.gz archive.
This class is for when you’ve got a single TSV of edges and want them to get auto-split.
This class is for when you’ve got a single TSV of edges and want them to get auto-split.
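To make the idea of auto-splitting a single TSV of edges concrete, here is a minimal standard-library sketch. It is not PyKEEN's splitting algorithm: the function name split_edges, the 0.8/0.1/0.1 ratios, and the fixed seed are assumptions chosen for illustration, and a real implementation may do more, for example ensuring that every entity also appears in the training set.

```python
import csv
import io
import random


def split_edges(tsv_text, ratios=(0.8, 0.1, 0.1), seed=0):
    """Shuffle the rows of a TSV edge list and cut them into three parts.

    Hypothetical helper; returns (training, testing, validation) lists.
    """
    reader = csv.reader(io.StringIO(tsv_text), delimiter="\t")
    rows = [tuple(row) for row in reader if row]
    # A seeded generator keeps the split reproducible.
    random.Random(seed).shuffle(rows)
    n_train = int(len(rows) * ratios[0])
    n_test = int(len(rows) * ratios[1])
    training = rows[:n_train]
    testing = rows[n_train:n_train + n_test]
    # Whatever remains after the first two cuts becomes validation.
    validation = rows[n_train + n_test:]
    return training, testing, validation


if __name__ == "__main__":
    # Ten toy (head, relation, tail) edges.
    toy = "\n".join(f"e{i}\tr\te{i + 1}" for i in range(10))
    train, test, valid = split_edges(toy)
    print(len(train), len(test), len(valid))
```

Because the shuffle is seeded, calling the function twice on the same text yields the same three parts.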