UnpackedRemoteDataset

class UnpackedRemoteDataset(training_url: str, testing_url: str, validation_url: str, cache_root: str | None = None, force: bool = False, eager: bool = False, create_inverse_triples: bool = False, load_triples_kwargs: Mapping[str, Any] | None = None, download_kwargs: Mapping[str, Any] | None = None)

Bases: PathDataset

A dataset whose training, testing, and validation sets are each provided as a URL.

Initialize dataset.

Parameters:
  • training_url (str) – The URL of the training file

  • testing_url (str) – The URL of the testing file

  • validation_url (str) – The URL of the validation file

  • cache_root (str | None) – An optional directory to store the extracted files. If none is given, the default PyKEEN directory is used. This is defined either by the environment variable PYKEEN_HOME or defaults to ~/.data/pykeen.

  • force (bool) – If true, re-download the files even if they are already cached.

  • eager (bool) – Should the data be loaded eagerly? Defaults to false.

  • create_inverse_triples (bool) – Should inverse triples be created? Defaults to false.

  • load_triples_kwargs (Mapping[str, Any] | None) – Arguments to pass through to TriplesFactory.from_path() and ultimately through to pykeen.triples.utils.load_triples().

  • download_kwargs (Mapping[str, Any] | None) – Keyword arguments to pass to pystow.utils.download()
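
The parameters above might be combined as in the following minimal sketch. The base URL, the file names, and the helper names make_dataset_kwargs and load_dataset are hypothetical and used only for illustration. The import path pykeen.datasets.base and the delimiter option of load_triples() are assumptions that should be checked against the installed PyKEEN version.

```python
from typing import Any

# Hypothetical location of the three split files; replace with real URLs.
BASE_URL = "https://example.org/my-kg"


def make_dataset_kwargs(base_url: str, force: bool = False) -> dict[str, Any]:
    """Assemble keyword arguments for UnpackedRemoteDataset (illustrative helper)."""
    return {
        "training_url": f"{base_url}/train.tsv",
        "testing_url": f"{base_url}/test.tsv",
        "validation_url": f"{base_url}/valid.tsv",
        # cache_root is omitted, so the default PyKEEN directory is used.
        "force": force,
        "eager": False,
        "create_inverse_triples": False,
        # Passed through to TriplesFactory.from_path(); "delimiter" is assumed
        # to be an option accepted by pykeen.triples.utils.load_triples().
        "load_triples_kwargs": {"delimiter": "\t"},
    }


def load_dataset(kwargs: dict[str, Any]):
    """Construct the dataset; needs PyKEEN installed and network access."""
    # Assumed import path; imported lazily so the helper above works without PyKEEN.
    from pykeen.datasets.base import UnpackedRemoteDataset

    return UnpackedRemoteDataset(**kwargs)


kwargs = make_dataset_kwargs(BASE_URL)
```

Calling load_dataset(kwargs) would then build the dataset. With eager left at false, the files would presumably be downloaded and parsed only when one of the splits is first accessed.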