ComplEx

class ComplEx(triples_factory, embedding_dim=200, automatic_memory_optimization=None, loss=None, preferred_device=None, random_seed=None, regularizer=None)[source]

Bases: pykeen.models.base.EntityRelationEmbeddingModel

An implementation of ComplEx [trouillon2016].

ComplEx is an extension of pykeen.models.DistMult that uses complex valued representations for the entities and relations. Entities and relations are represented as vectors \(\textbf{e}_i, \textbf{r}_i \in \mathbb{C}^d\), and the plausibility score is computed using the Hadamard product:

\[f(h,r,t) = Re(\mathbf{e}_h\odot\mathbf{r}_r\odot\mathbf{e}_t)\]

Which expands to:

\[f(h,r,t) = \left\langle Re(\mathbf{e}_h),Re(\mathbf{r}_r),Re(\mathbf{e}_t)\right\rangle + \left\langle Im(\mathbf{e}_h),Re(\mathbf{r}_r),Im(\mathbf{e}_t)\right\rangle + \left\langle Re(\mathbf{e}_h),Re(\mathbf{r}_r),Im(\mathbf{e}_t)\right\rangle - \left\langle Im(\mathbf{e}_h),Im(\mathbf{r}_r),Im(\mathbf{e}_t)\right\rangle\]

where \(Re(\textbf{x})\) and \(Im(\textbf{x})\) denote the real and imaginary parts of the complex valued vector \(\textbf{x}\). Because the Hadamard product is not commutative in the complex space, ComplEx can model anti-symmetric relations in contrast to DistMult.

See also

Official implementation: https://github.com/ttrouill/complex/

Initialize ComplEx.

Parameters
  • triples_factory (TriplesFactory) – TriplesFactory The triple factory connected to the model.

  • embedding_dim (int) – The embedding dimensionality of the entity embeddings.

  • automatic_memory_optimization (Optional[bool]) – bool Whether to automatically optimize the sub-batch size during training and batch size during evaluation with regards to the hardware at hand.

  • loss (Optional[Loss]) – OptionalLoss (optional) The loss to use. Defaults to SoftplusLoss.

  • preferred_device (Optional[str]) – str (optional) The default device where to model is located.

  • random_seed (Optional[int]) – int (optional) An optional random seed to set before the initialization of weights.

  • regularizer (Optional[Regularizer]) – BaseRegularizer The regularizer to use.

Attributes Summary

hpo_default

The default strategy for optimizing the model’s hyper-parameters

loss_default_kwargs

The default parameters for the default loss function class

regularizer_default_kwargs

The LP settings used by [trouillon2016] for ComplEx.

Methods Summary

interaction_function(h, r, t)

Evaluate the interaction function of ComplEx for given embeddings.

score_hrt(hrt_batch)

Forward pass.

Attributes Documentation

hpo_default: ClassVar[Mapping[str, Any]] = {'embedding_dim': {'high': 300, 'low': 50, 'q': 50, 'type': <class 'int'>}}

The default strategy for optimizing the model’s hyper-parameters

loss_default_kwargs: ClassVar[Optional[Mapping[str, Any]]] = {'reduction': 'mean'}

The default parameters for the default loss function class

regularizer_default_kwargs: ClassVar[Optional[Mapping[str, Any]]] = {'normalize': True, 'p': 2.0, 'weight': 0.01}

The LP settings used by [trouillon2016] for ComplEx.

Methods Documentation

static interaction_function(h, r, t)[source]

Evaluate the interaction function of ComplEx for given embeddings.

The embeddings have to be in a broadcastable shape.

Parameters
  • h (FloatTensor) – Head embeddings.

  • r (FloatTensor) – Relation embeddings.

  • t (FloatTensor) – Tail embeddings.

Return type

FloatTensor

Returns

shape: (…) The scores.

score_hrt(hrt_batch)[source]

Forward pass.

This method takes head, relation and tail of each triple and calculates the corresponding score.

Parameters

hrt_batch (LongTensor) – shape: (batch_size, 3), dtype: long The indices of (head, relation, tail) triples.

Raises

NotImplementedError – If the method was not implemented for this class.

Return type

FloatTensor

Returns

shape: (batch_size, 1), dtype: float The score for each triple.