CKG

class CKG(eager=False, create_inverse_triples=False, random_state=0, cache_root=None)

Bases: pykeen.datasets.base.TabbedDataset

The Clinical Knowledge Graph (CKG) dataset from [santos2020].

This dataset contains ~7.6 million nodes, 11 relations, and ~26 million triples.

santos2020

Santos, A., et al. (2020). Clinical Knowledge Graph Integrates Proteomics Data into Clinical Decision-Making. bioRxiv, 2020.05.09.084897.

Initialize dataset.

Parameters
  • cache_root (Optional[str]) – An optional directory in which to store the extracted files. If none is given, the default PyKEEN directory is used. This is set by the environment variable PYKEEN_HOME and otherwise defaults to ~/.pykeen.

  • eager (bool) – Should the data be loaded eagerly? Defaults to false.

  • create_inverse_triples (bool) – Should inverse triples be created? Defaults to false.
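
The fallback order described for cache_root can be illustrated with a short sketch. Both resolve_cache_root and load_ckg are hypothetical helpers written for this example, not part of PyKEEN. The first only mirrors the documented order (explicit argument, then PYKEEN_HOME, then ~/.pykeen) and is not the library's actual implementation. The second assumes CKG can be imported from pykeen.datasets and passes only the keyword arguments shown in the signature above.

```python
import os
from pathlib import Path


def resolve_cache_root(cache_root=None, environ=None):
    """Hypothetical helper: mirror the documented cache directory fallback.

    Order: explicit cache_root, then the PYKEEN_HOME environment variable,
    then ~/.pykeen. This is an illustration, not PyKEEN's own code.
    """
    environ = os.environ if environ is None else environ
    if cache_root is not None:
        return Path(cache_root).expanduser()
    pykeen_home = environ.get("PYKEEN_HOME")
    if pykeen_home:
        return Path(pykeen_home).expanduser()
    return Path.home() / ".pykeen"


def load_ckg(cache_root=None, create_inverse_triples=False):
    """Hypothetical wrapper: construct the CKG dataset lazily (eager=False)."""
    # Imported inside the function so the sketch can be read and imported
    # without PyKEEN installed. Assumes CKG is exposed by pykeen.datasets.
    from pykeen.datasets import CKG

    return CKG(
        cache_root=cache_root,
        eager=False,
        create_inverse_triples=create_inverse_triples,
    )
```

Calling load_ckg() is expected to fetch and extract the full dataset (about 26 million triples) on first use, so pointing cache_root or PYKEEN_HOME at a disk with enough free space may be worth doing beforehand.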