MedianRank
- class MedianRank[source]
Bases:
pykeen.metrics.ranking.RankBasedMetric
The median rank.
Attributes Summary
whether the metric needs binarized scores
whether it is increasing, i.e., larger values are better
Return the key for use in metric result dictionaries.
The name of the metric
whether the metric requires the number of candidates for each ranking task
the supported rank types.
synonyms for this metric
the value range
Methods Summary
__call__
(ranks[, num_candidates])Evaluate the metric.
expected_value
(num_candidates[, num_samples])Compute expected metric value.
Get the description.
get_link
()Get the link from the docdata.
Get the math notation for the range of this metric.
get_sampled_values
(num_candidates, num_samples)Calculate the metric on sampled rank arrays.
numeric_expected_value
(num_candidates, ...)Compute expected metric value by summation.
numeric_expected_value_with_ci
(**kwargs)Estimate expected value with confidence intervals.
numeric_variance
(num_candidates, ...)Compute variance by summation.
numeric_variance_with_ci
(**kwargs)Estimate variance with confidence intervals.
std
(num_candidates[, num_samples])Compute the standard deviation.
variance
(num_candidates[, num_samples])Compute variance.
Attributes Documentation
- needs_candidates: ClassVar[bool] = False
whether the metric requires the number of candidates for each ranking task
- supported_rank_types: ClassVar[Collection[typing_extensions.Literal[optimistic, realistic, pessimistic]]] = ('optimistic', 'realistic', 'pessimistic')
the supported rank types. Most of the time equal to all rank types
- value_range: ClassVar[pykeen.metrics.utils.ValueRange] = ValueRange(lower=1, lower_inclusive=True, upper=inf, upper_inclusive=False)
the value range
Methods Documentation
- expected_value(num_candidates, num_samples=None, **kwargs)
Compute expected metric value.
The expectation is computed under the assumption that each individual rank follows a discrete uniform distribution \(\mathcal{U}\left(1, N_i\right)\), where \(N_i\) denotes the number of candidates for ranking task \(r_i\).
- Parameters
num_candidates (
ndarray
) – the number of candidates for each individual rank computationnum_samples (
Optional
[int
]) – the number of samples to use for simulation, if no closed form expected value is implementedkwargs – additional keyword-based parameters passed to
get_sampled_values()
, if no closed form solution is available
- Return type
- Returns
the expected value of this metric
- Raises
NoClosedFormError – raised if a closed form expectation has not been implemented and no number of samples are given
Note
Prefers analytical solution, if available, but falls back to numeric estimation via summation, cf.
RankBasedMetric.numeric_expected_value()
.
- get_sampled_values(num_candidates, num_samples, generator=None, memory_intense=True)
Calculate the metric on sampled rank arrays.
- Parameters
- Return type
- Returns
shape: (num_samples,) the metric evaluated on num_samples sampled rank arrays
- numeric_expected_value(num_candidates, num_samples, **kwargs)
Compute expected metric value by summation.
The expectation is computed under the assumption that each individual rank follows a discrete uniform distribution \(\mathcal{U}\left(1, N_i\right)\), where \(N_i\) denotes the number of candidates for ranking task \(r_i\).
- Parameters
num_candidates (
ndarray
) – the number of candidates for each individual rank computationnum_samples (
int
) – the number of samples to use for simulationkwargs – additional keyword-based parameters passed to
get_sampled_values()
- Return type
- Returns
The estimated expected value of this metric
Warning
Depending on the metric, the estimate may not be very accurate and converge slowly, cf. https://docs.scipy.org/doc/scipy/reference/generated/scipy.stats.rv_discrete.expect.html
- numeric_expected_value_with_ci(**kwargs)
Estimate expected value with confidence intervals.
- Return type
- numeric_variance(num_candidates, num_samples, **kwargs)
Compute variance by summation.
The variance is computed under the assumption that each individual rank follows a discrete uniform distribution \(\mathcal{U}\left(1, N_i\right)\), where \(N_i\) denotes the number of candidates for ranking task \(r_i\).
- Parameters
num_candidates (
ndarray
) – the number of candidates for each individual rank computationnum_samples (
int
) – the number of samples to use for simulationkwargs – additional keyword-based parameters passed to
get_sampled_values()
- Return type
- Returns
The estimated variance of this metric
Warning
Depending on the metric, the estimate may not be very accurate and converge slowly, cf. https://docs.scipy.org/doc/scipy/reference/generated/scipy.stats.rv_discrete.expect.html
- numeric_variance_with_ci(**kwargs)
Estimate variance with confidence intervals.
- Return type
- std(num_candidates, num_samples=None, **kwargs)
Compute the standard deviation.
- Parameters
num_candidates (
ndarray
) – the number of candidates for each individual rank computationnum_samples (
Optional
[int
]) – the number of samples to use for simulation, if no closed form expected value is implementedkwargs – additional keyword-based parameters passed to
variance()
,
- Return type
- Returns
The standard deviation (i.e. the square root of the variance) of this metric
For a detailed explanation, cf.
RankBasedMetric.variance()
.
- variance(num_candidates, num_samples=None, **kwargs)
Compute variance.
The variance is computed under the assumption that each individual rank follows a discrete uniform distribution \(\mathcal{U}\left(1, N_i\right)\), where \(N_i\) denotes the number of candidates for ranking task \(r_i\).
- Parameters
num_candidates (
ndarray
) – the number of candidates for each individual rank computationnum_samples (
Optional
[int
]) – the number of samples to use for simulation, if no closed form expected value is implementedkwargs – additional keyword-based parameters passed to
get_sampled_values()
, if no closed form solution is available
- Return type
- Returns
The variance of this metric
- Raises
NoClosedFormError – Raised if a closed form variance has not been implemented and no number of samples are given
Note
Prefers analytical solution, if available, but falls back to numeric estimation via summation, cf.
RankBasedMetric.numeric_variance()
.