unleak

unleak(train: CoreTriplesFactory, *triples_factories: CoreTriplesFactory, n: None | int | float = None, minimum_frequency: float | None = None) Iterable[CoreTriplesFactory][source]

Unleak a train, test, and validate triples factory.

Parameters:
  • train (CoreTriplesFactory) – The target triples factory

  • triples_factories (CoreTriplesFactory) – All other triples factories (test, validate, etc.)

  • n (None | int | float) – Either the (integer) number of top relations to keep or the (float) percentage of top relationships to keep. If left none, frequent relations are not removed.

  • minimum_frequency (float | None) – The minimum overlap between two relations’ triples to consider them as inverses or duplicates. The default value, 0.97, is taken from Toutanova and Chen (2015), who originally described the generation of FB15k-237.

Returns:

A sequence of reindexed triples factories

Return type:

Iterable[CoreTriplesFactory]