Hetionet

class Hetionet(create_inverse_triples=False, random_state=0, **kwargs)[source]

Bases: pykeen.datasets.base.SingleTabbedDataset

The Hetionet dataset is a large biological network.

In its publication [himmelstein2017], it is demonstrated to be useful for link prediction in drug repositioning and made publicly available through its GitHub repository in several formats. The link prediction algorithm showcased does not rely on embeddings, which leaves room for interesting comparison. One such comparison was made during the master’s thesis of Lingling Xu [xu2019].

For reproducibility, the random_state argument is set by default to 0. For permutation studies, you can change this.

himmelstein2017(1,2)

Himmelstein, D. S., et al (2017). Systematic integration of biomedical knowledge prioritizes drugs for repurposing. ELife, 6.

xu2019

Xu, L (2019) A Comparison of Learned and Engineered Features in Network-Based Drug Repositioning. Master’s Thesis.

Initialize the Hetionet dataset from [himmelstein2017].

Parameters
  • create_inverse_triples (bool) – Should inverse triples be created? Defaults to false.

  • random_state (Union[None, str, Generator]) – The random seed to use in splitting the dataset. Defaults to 0.

  • kwargs – keyword arguments passed to pykeen.datasets.base.SingleTabbedDataset.