TabbedDataset

class TabbedDataset(cache_root: str | None = None, eager: bool = False, create_inverse_triples: bool = False, random_state: None | int | Generator = None)[source]

Bases: LazyDataset

This class is for when you’ve got a single TSV of edges and want them to get auto-split.

Initialize dataset.

Parameters:
  • cache_root (Path) – An optional directory to store the extracted files. Is none is given, the default PyKEEN directory is used. This is defined either by the environment variable PYKEEN_HOME or defaults to ~/.pykeen.

  • eager (bool) – Should the data be loaded eagerly? Defaults to false.

  • create_inverse_triples (bool) – Should inverse triples be created? Defaults to false.

  • random_state (TorchRandomHint) – An optional random state to make the training/testing/validation split reproducible.

Attributes Summary

ratios

Attributes Documentation

ratios: ClassVar[Sequence[float]] = (0.8, 0.1, 0.1)