UnpackedRemoteDataset
- class UnpackedRemoteDataset(training_url: str, testing_url: str, validation_url: str, cache_root: str | None = None, force: bool = False, eager: bool = False, create_inverse_triples: bool = False, load_triples_kwargs: Mapping[str, Any] | None = None, download_kwargs: Mapping[str, Any] | None = None)[source]
Bases:
PathDataset
A dataset with all three of train, test, and validation sets as URLs.
Initialize dataset.
- Parameters:
training_url (str) – The URL of the training file
testing_url (str) – The URL of the testing file
validation_url (str) – The URL of the validation file
cache_root (Path) – An optional directory to store the extracted files. Is none is given, the default PyKEEN directory is used. This is defined either by the environment variable
PYKEEN_HOME
or defaults to~/.data/pykeen
.force (bool) – If true, redownload any cached files
eager (bool) – Should the data be loaded eagerly? Defaults to false.
create_inverse_triples (bool) – Should inverse triples be created? Defaults to false.
load_triples_kwargs (Mapping[str, Any] | None) – Arguments to pass through to
TriplesFactory.from_path()
and ultimately through topykeen.triples.utils.load_triples()
.download_kwargs (Mapping[str, Any] | None) – Keyword arguments to pass to
pystow.utils.download()