AristoV4
- class AristoV4(**kwargs)[source]
Bases:
PackedZipRemoteDataset
The Aristo-v4 dataset from [chen2021].
Note
The dataset is based on the Aristo tuple KG from https://aclanthology.org/Q17-1017/.
Warning
While the original dataset is described as having 44,950 entities, after removing the entities not present in train, only 42,016 remain. Similarly, only 1,593 relations occur in training (from the original 1,605 ones). Consequently, some validation and testing triples are removed (originally: 20,000). Finally, only 242,567 of 242,594 original training triples are unique.
Initialize the Aristo-v4 dataset.
- Parameters:
kwargs – keyword arguments passed to
pykeen.datasets.base.ZipFileRemoteDataset
.