get_tail_prediction_df¶
-
get_tail_prediction_df
(model, head_label, relation_label, *, add_novelties=True, remove_known=False, testing=None)[source]¶ Predict tails for the given head and relation (given by label).
- Parameters
model (
Model
) – A PyKEEN modelhead_label (
str
) – The string label for the head entityrelation_label (
str
) – The string label for the relationadd_novelties (
bool
) – Should the dataframe include a column denoting if the ranked tail entities correspond to novel triples?remove_known (
bool
) – Should non-novel triples (those appearing in the training set) be shown with the results? On one hand, this allows you to better assess the goodness of the predictions - you want to see that the non-novel triples generally have higher scores. On the other hand, if you’re doing hypothesis generation, they may pose as a distraction. If this is set to True, then non-novel triples will be removed and the column denoting novelty will be excluded, since all remaining triples will be novel. Defaults to false.testing (
Optional
[LongTensor
]) – The mapped_triples from the testing triples factory (TriplesFactory.mapped_triples)
- Return type
DataFrame
- Returns
shape: (k, 3) A dataframe with columns based on the settings or a tensor. Contains either the k highest scoring triples, or all possible triples if k is None.
The following example shows that after you train a model on the Nations dataset, you can score all entities w.r.t a given head entity and relation.
>>> from pykeen.pipeline import pipeline >>> from pykeen.models.predict import get_tail_prediction_df >>> result = pipeline( ... dataset='Nations', ... model='RotatE', ... ) >>> df = get_tail_prediction_df(result.model, 'brazil', 'accusation')