get_relation_functionality_df

get_relation_functionality_df(*, dataset, parts=None, add_labels=True)[source]

Calculate the functionality and inverse functionality score per relation.

The (inverse) functionality was proposed in [wang2018]. It is defined as the number of unique head (tail) entities divided by the of triples in which the relation occurs. Thus, its value range is [0, 1]. Smaller values indicate that entities usually have more than one outgoing (incoming) triple with the corresponding relation type. Hence, the score is related to the relation cardinality types.

Parameters:
  • dataset (Dataset) – The dataset to investigate.

  • parts (Optional[Collection[str]]) – Only use certain parts of the dataset, e.g., train triples. Defaults to using all triples, i.e. {“training”, “validation”, “testing}.

  • add_labels (bool) – Whether to add relation labels (if available).

Return type:

DataFrame

Returns:

A dataframe with columns (relation_id | functionality | inverse functionality)

[wang2018]

Wang, Z., et al. (2018). Cross-lingual Knowledge Graph Alignment via Graph Convolutional Networks. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, 349–357.