RemoteDataset

class RemoteDataset(url, relative_training_path, relative_testing_path, relative_validation_path, cache_root=None, eager=False, create_inverse_triples=False)

Bases: PathDataset

Contains a lazy reference to a remote dataset that is loaded if needed.

Initialize dataset.

Parameters:
  • url (str) – The URL from which to download the dataset.

  • relative_training_path (Union[str, PurePath]) – The path, relative to the cache root, to which the training file is extracted.

  • relative_testing_path (Union[str, PurePath]) – The path, relative to the cache root, to which the testing file is extracted.

  • relative_validation_path (Union[str, PurePath]) – The path, relative to the cache root, to which the validation file is extracted.

  • cache_root (Optional[str]) – An optional directory in which to store the extracted files. If none is given, the default PyKEEN directory is used. This is defined by the environment variable PYKEEN_HOME, or defaults to ~/.data/pykeen if that variable is not set.

  • eager (bool) – Whether to load the data eagerly, rather than on first use. Defaults to False.

  • create_inverse_triples (bool) – Whether to create inverse triples. Defaults to False.
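The fallback order described for cache_root, and the way the three relative paths are interpreted against it, can be sketched with the standard library alone. This is an illustration of the documented behavior, not PyKEEN's actual implementation: the helper names resolve_cache_root and resolve_split_path are hypothetical, and the real library may add further subdirectories below the root.

```python
import os
from pathlib import Path, PurePath
from typing import Optional, Union


def resolve_cache_root(cache_root: Optional[str] = None) -> Path:
    """Hypothetical helper mirroring the documented cache_root fallback order."""
    if cache_root is not None:
        # An explicitly passed directory takes precedence.
        return Path(cache_root).expanduser()
    # Otherwise use PYKEEN_HOME if it is set and non-empty ...
    env_home = os.environ.get("PYKEEN_HOME")
    if env_home:
        return Path(env_home).expanduser()
    # ... and fall back to ~/.data/pykeen.
    return Path.home() / ".data" / "pykeen"


def resolve_split_path(
    cache_root: Optional[str],
    relative_path: Union[str, PurePath],
) -> Path:
    """Hypothetical helper: join a relative split path onto the resolved cache root."""
    return resolve_cache_root(cache_root) / relative_path
```

For example, resolve_split_path("/tmp/cache", "train.txt") yields the path /tmp/cache/train.txt on a POSIX system, which is the kind of location relative_training_path refers to.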