OpenBioLink
class OpenBioLink(create_inverse_triples=False, eager=False)
Bases: pykeen.datasets.base.PackedZipRemoteDataset
The OpenBioLink dataset.
OpenBioLink is an open-source, reproducible framework for generating biological knowledge graphs for benchmarking link prediction. It is available on GitHub at https://github.com/openbiolink/openbiolink and published in [breit2020]. Four datasets are available; this class represents the high-quality, directed set.
[breit2020] Breit, A. (2020). OpenBioLink: A benchmarking framework for large-scale biomedical link prediction. Bioinformatics.
Initialize dataset.
- Parameters
url – The URL from which to download the dataset.
name – The name of the file. If not given, the name is taken from the end of the URL.
cache_root – An optional directory in which to store the extracted files. If none is given, the default PyKEEN directory is used. This is defined either by the environment variable PYKEEN_HOME or defaults to ~/.pykeen.
eager (bool) – Should the data be loaded eagerly? Defaults to false.
create_inverse_triples (bool) – Should inverse triples be created? Defaults to false.
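The parameters above can be illustrated with a minimal sketch. Both helper names, resolve_cache_root and load_openbiolink, are hypothetical and not part of PyKEEN. The first only mirrors the fallback order described for cache_root (explicit directory, then PYKEEN_HOME, then ~/.pykeen); PyKEEN's own resolution may differ in detail, for example by using subdirectories. The second wraps the constructor with the two flags shown in the class signature and assumes PyKEEN is installed when it is called.

```python
import os
from pathlib import Path
from typing import Optional, Union


def resolve_cache_root(cache_root: Optional[Union[str, Path]] = None) -> Path:
    """Hypothetical helper: mirror the documented cache_root fallback order."""
    if cache_root is not None:
        # An explicitly given directory takes precedence.
        return Path(cache_root).expanduser()
    # Otherwise use PYKEEN_HOME if it is set and non-empty, else ~/.pykeen.
    return Path(os.environ.get("PYKEEN_HOME") or "~/.pykeen").expanduser()


def load_openbiolink(create_inverse_triples: bool = False, eager: bool = False):
    """Hypothetical wrapper: build the dataset with the two documented flags."""
    # Imported lazily so this sketch can be imported without PyKEEN installed.
    from pykeen.datasets import OpenBioLink

    return OpenBioLink(create_inverse_triples=create_inverse_triples, eager=eager)
```

With PyKEEN installed, calling load_openbiolink(create_inverse_triples=True) should return the dataset object; the first call is expected to download the archive, so it needs network access.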