PackedZipRemoteDataset

class PackedZipRemoteDataset(relative_training_path, relative_testing_path, relative_validation_path, url=None, name=None, cache_root=None, eager=False, create_inverse_triples=False)[source]

Bases: LazyDataset

Contains a lazy reference to a remote dataset that is loaded if needed.

Initialize dataset.

Parameters:
  • relative_training_path (Union[str, PurePath]) – The path inside the zip file for the training data

  • relative_testing_path (Union[str, PurePath]) – The path inside the zip file for the testing data

  • relative_validation_path (Union[str, PurePath]) – The path inside the zip file for the validation data

  • url (Optional[str]) – The url where to download the dataset from

  • name (Optional[str]) – The name of the file. If not given, tries to get the name from the end of the URL

  • cache_root (Optional[str]) – An optional directory to store the extracted files. Is none is given, the default PyKEEN directory is used. This is defined either by the environment variable PYKEEN_HOME or defaults to ~/.pykeen.

  • eager (bool) – Should the data be loaded eagerly? Defaults to false.

  • create_inverse_triples (bool) – Should inverse triples be created? Defaults to false.

Raises:

ValueError – if there’s no URL specified and there is no data already at the calculated path

Attributes Summary

head_column

header

relation_column

sep

tail_column

Attributes Documentation

head_column: int = 0
header = None
relation_column: int = 1
sep = '\t'
tail_column: int = 2