UnpackedRemoteDataset

class UnpackedRemoteDataset(training_url, testing_url, validation_url, cache_root=None, force=False, eager=False, create_inverse_triples=False, load_triples_kwargs=None, download_kwargs=None)

Bases: PathDataset

A dataset whose training, testing, and validation sets are each given as a URL.

Initialize dataset.

Parameters
  • training_url (str) – The URL of the training file

  • testing_url (str) – The URL of the testing file

  • validation_url (str) – The URL of the validation file

  • cache_root (Optional[str]) – An optional directory to store the extracted files. If none is given, the default PyKEEN directory is used. This is defined either by the environment variable PYKEEN_HOME or defaults to ~/.data/pykeen.

  • force (bool) – If true, re-download the files even if they are already cached.

  • eager (bool) – Should the data be loaded eagerly? Defaults to false.

  • create_inverse_triples (bool) – Should inverse triples be created? Defaults to false.

  • load_triples_kwargs (Optional[Mapping[str, Any]]) – Arguments to pass through to TriplesFactory.from_path() and ultimately through to pykeen.triples.utils.load_triples().

  • download_kwargs (Optional[Mapping[str, Any]]) – Keyword arguments to pass to pystow.utils.download()
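The parameters above can be combined as in the following minimal sketch. It assumes the class is importable from pykeen.datasets.base; the base URL, the file names (train.tsv, test.tsv, valid.tsv), and the helper functions split_urls and make_dataset are hypothetical and exist only for illustration.

```python
from typing import Any, Dict, Optional

# Hypothetical base URL; replace it with a location that actually hosts the files.
BASE_URL = "https://example.org/my-kg"


def split_urls(base_url: str) -> Dict[str, str]:
    """Build the three URL arguments of the constructor from one base URL.

    The file names used here are assumptions, not a PyKEEN convention.
    """
    base = base_url.rstrip("/")
    return {
        "training_url": f"{base}/train.tsv",
        "testing_url": f"{base}/test.tsv",
        "validation_url": f"{base}/valid.tsv",
    }


def make_dataset(base_url: str = BASE_URL, cache_root: Optional[str] = None) -> Any:
    """Instantiate the dataset from three remote files (illustrative helper)."""
    # Imported lazily so this module can be imported without PyKEEN installed.
    # The import path is an assumption based on this page.
    from pykeen.datasets.base import UnpackedRemoteDataset

    return UnpackedRemoteDataset(
        **split_urls(base_url),
        cache_root=cache_root,        # None falls back to the default PyKEEN directory
        force=False,                  # reuse files that are already cached
        eager=False,                  # do not load the triples eagerly
        create_inverse_triples=True,  # also create inverse triples
    )
```

Calling make_dataset() with a base URL that really serves the three files should return a dataset object. Nothing is instantiated when the module is imported, so the sketch can be read and adapted without network access.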