PackedZipRemoteDataset
- class PackedZipRemoteDataset(relative_training_path: str | PurePath, relative_testing_path: str | PurePath, relative_validation_path: str | PurePath, url: str | None = None, name: str | None = None, cache_root: str | None = None, eager: bool = False, create_inverse_triples: bool = False)[source]
Bases:
LazyDataset
Contains a lazy reference to a remote dataset that is loaded if needed.
Initialize dataset.
- Parameters:
relative_training_path (str | pathlib.PurePath) – The path inside the zip file for the training data
relative_testing_path (str | pathlib.PurePath) – The path inside the zip file for the testing data
relative_validation_path (str | pathlib.PurePath) – The path inside the zip file for the validation data
url (str | None) – The url where to download the dataset from
name (str | None) – The name of the file. If not given, tries to get the name from the end of the URL
cache_root (Path) – An optional directory to store the extracted files. Is none is given, the default PyKEEN directory is used. This is defined either by the environment variable
PYKEEN_HOME
or defaults to~/.pykeen
.eager (bool) – Should the data be loaded eagerly? Defaults to false.
create_inverse_triples (bool) – Should inverse triples be created? Defaults to false.
- Raises:
ValueError – if there’s no URL specified and there is no data already at the calculated path
Attributes Summary
Attributes Documentation
- header = None
- sep = '\t'