OpenEA

class OpenEA(graph_pair='D_W', side='D', size='15K', version='V1', cache_root=None, eager=False, create_inverse_triples=False, random_state=0, split_ratios=(0.8, 0.1, 0.1), force=False)

Bases: pykeen.datasets.base.LazyDataset

The OpenEA dataset family.

Initialize the dataset.

Parameters
  • graph_pair (str) – The graph-pair within the dataset family (cf. GRAPH_PAIRS).

  • side (str) – The side of the graph-pair, a substring of the graph-pair selection (for example, “D” for the pair “D_W”).

  • size (str) – The size of the graphs (either “15K” or “100K”).

  • version (str) – The version of the pairing (either “V1” or “V2”). “V1” has lower connectivity in the graph than “V2”.

  • cache_root (Optional[str]) – The cache root.

  • eager (bool) – Whether to directly load the dataset, or defer it to the first access of a relevant attribute.

  • create_inverse_triples (bool) – Whether to create inverse triples.

  • random_state (Union[None, int, Generator]) – The random state used for splitting.

  • split_ratios (Tuple[float, float, float]) – The split ratios used for splitting the dataset into train / validation / test.

  • force (bool) – Whether to enforce re-download of existing files.
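A minimal usage sketch of the parameters above, assuming PyKEEN is installed and that the class can be imported from pykeen.datasets (an assumption based on the base class path, not stated on this page). The helper name load_openea_side is hypothetical. All keyword values are the defaults from the signature.

```python
def load_openea_side(side: str = "D"):
    """Build one side of the default "D_W" graph pair (hypothetical helper)."""
    # Imported inside the function so that defining the helper does not
    # itself require PyKEEN; the import location is an assumption.
    from pykeen.datasets import OpenEA

    return OpenEA(
        graph_pair="D_W",              # default graph pair from the signature
        side=side,                     # must be a side of the chosen pair
        size="15K",                    # either "15K" or "100K"
        version="V1",                  # either "V1" or "V2"
        eager=False,                   # defer loading to first attribute access
        create_inverse_triples=False,
        random_state=0,                # seed used for splitting
        split_ratios=(0.8, 0.1, 0.1),  # train / validation / test
    )
```

Because eager=False, calling load_openea_side("D") should only construct the object; the data is loaded on the first access of a relevant attribute.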

Raises

ValueError – If the graph pair or side is invalid.
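The kind of check that leads to this ValueError can be sketched as follows. This is an illustration, not PyKEEN's actual implementation: the function name check_pair_and_side and the allowed_pairs parameter are hypothetical, the real list of pairs lives in GRAPH_PAIRS, and splitting the pair name on the underscore to obtain its sides is an assumption.

```python
from typing import Sequence


def check_pair_and_side(
    graph_pair: str,
    side: str,
    allowed_pairs: Sequence[str] = ("D_W",),  # stand-in for GRAPH_PAIRS
) -> None:
    """Raise ValueError for an unknown graph pair or a side not in the pair."""
    if graph_pair not in allowed_pairs:
        raise ValueError(
            f"Invalid graph pair {graph_pair!r}; allowed: {list(allowed_pairs)}"
        )
    # Assumption: the sides are the underscore-separated parts of the pair name.
    sides = graph_pair.split("_")
    if side not in sides:
        raise ValueError(f"Invalid side {side!r}; allowed: {sides}")
```

Under these assumptions, check_pair_and_side("D_W", "D") returns silently, while check_pair_and_side("D_W", "X") raises ValueError.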

Attributes Summary

FIGSHARE_LINK

The link to the zip file

SHA512

The hex digest for the zip file

Attributes Documentation

FIGSHARE_LINK

The link to the zip file

SHA512: str = 'c1589f185f86e05c497de147b4d6c243c66775cb4b50c6b41ecc71b36cfafb4c9f86fbee94e1e78a7ee056dd69df1ce3fc210ae07dc64955ad2bfda7450545ef'

The hex digest for the zip file
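A hex digest like this one can be used to check a downloaded archive by hand. The sketch below uses only the standard-library hashlib module; the helper names sha512_hexdigest and matches_expected are hypothetical, and this is not necessarily how PyKEEN verifies the file internally.

```python
import hashlib
from pathlib import Path


def sha512_hexdigest(path: Path, chunk_size: int = 1 << 20) -> str:
    """Compute the SHA-512 hex digest of a file, reading it in chunks."""
    hasher = hashlib.sha512()
    with path.open("rb") as handle:
        # iter() with a sentinel stops once read() returns empty bytes.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest()


def matches_expected(path: Path, expected_hex: str) -> bool:
    """Return True if the file's digest equals the expected hex string."""
    return sha512_hexdigest(path) == expected_hex.lower()
```

Passing the path of the downloaded zip file and the SHA512 value above to matches_expected would then report whether the archive is intact.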