HitsAtK

class HitsAtK(k: int = 10)[source]

Bases: RankBasedMetric

The Hits @ k.

The hits @ k describes the fraction of true entities that appear in the first \(k\) entities of the sorted rank list. For individual ranks \(\{r_i\}_{i=1}^n\), it is given as:

\[H_k = \frac{1}{n} \sum \limits_{i=1}^{n} \mathbb{I}[r_i \leq k]\]

For example, if Google shows 20 results on the first page, then the percentage of results that are relevant is the hits @ 20. The hits @ k, regardless of \(k\), lies on the \([0, 1]\) where closer to 1 is better.

Warning

This metric does not differentiate between cases when the rank is larger than \(k\). This means that a miss with rank \(k+1\) and \(k+d\) where \(d \gg 1\) have the same effect on the final score. Therefore, it is less suitable for the comparison of different models.

For the expected values, we first note that

\[\mathbb{I}[r_i \leq k] \sim \textit{Bernoulli}(p_i)\]

with \(p_i = \min\{\frac{k}{C_i}, 1\}\), where \(C_i\) denotes the number of candidates for ranking task \(i\). Thus, we have

\[\mathbb{E}[\mathbb{I}[r_i \leq k]] = p_i\]

and

\[\mathbb{V}[\mathbb{I}[r_i \leq k]] = p_i \cdot (1 - p_i)\]

Hence, we obtain

\[\begin{split}\mathbb{E}[Hits@k] &= \mathbb{E}\left[\frac{1}{n} \sum \limits_{i=1}^{n} \mathbb{I}[r_i \leq k]\right] \\ &= \frac{1}{n} \sum \limits_{i=1}^{n} \mathbb{E}[\mathbb{I}[r_i \leq k]] \\ &= \frac{1}{n} \sum \limits_{i=1}^{n} p_i\end{split}\]

For the variance, we have

\[\begin{split}\mathbb{V}[Hits@k] &= \mathbb{V}\left[\frac{1}{n} \sum \limits_{i=1}^{n} \mathbb{I}[r_i \leq k]\right] \\ &= \frac{1}{n^2} \sum \limits_{i=1}^{n} \mathbb{V}\left[\mathbb{I}[r_i \leq k]\right] \\ &= \frac{1}{n^2} \sum \limits_{i=1}^{n} p_i(1 - p_i)\end{split}\]

Initialize the metric.

Parameters:: k (int) – the parameter \(k\) of number of top entries to consider

Attributes Summary

`binarize`	whether the metric needs binarized scores
`closed_expectation`	whether there is a closed-form solution of the expectation
`closed_variance`	whether there is a closed-form solution of the variance
`increasing`	whether it is increasing, i.e., larger values are better
`key`	Return the key for use in metric result dictionaries.
`name`	The name of the metric
`needs_candidates`	whether the metric requires the number of candidates for each ranking task
`supported_rank_types`	the supported rank types.
`supports_weights`	whether the metric supports weights
`synonyms`	synonyms for this metric
`value_range`	the value range

Methods Summary

`__call__`(ranks[, num_candidates, weights])	Evaluate the metric.
`expected_value`(num_candidates[, ...])	Compute expected metric value.
`extra_repr`()	Generate the extra repr, cf.
`get_description`()	Get the description.
`get_link`()	Get the link from the docdata.
`get_range`()	Get the math notation for the range of this metric.
`get_sampled_values`(num_candidates, num_samples)	Calculate the metric on sampled rank arrays.
`iter_extra_repr`()	Iterate over the components of the `extra_repr()`.
`numeric_expected_value`(**kwargs)	Compute expected metric value by summation.
`numeric_expected_value_with_ci`(**kwargs)	Estimate expected value with confidence intervals.
`numeric_variance`(**kwargs)	Compute variance by summation.
`numeric_variance_with_ci`(**kwargs)	Estimate variance with confidence intervals.
`std`(num_candidates[, num_samples, weights])	Compute the standard deviation.
`variance`(num_candidates[, num_samples, weights])	Compute variance.

Attributes Documentation

binarize: ClassVar[bool] = False: whether the metric needs binarized scores

closed_expectation: ClassVar[bool] = True: whether there is a closed-form solution of the expectation

closed_variance: ClassVar[bool] = True: whether there is a closed-form solution of the variance

increasing: ClassVar[bool] = True: whether it is increasing, i.e., larger values are better

key

name: ClassVar[str] = 'Hits @ K': The name of the metric

needs_candidates: ClassVar[bool] = False: whether the metric requires the number of candidates for each ranking task

supported_rank_types: ClassVar[Collection[Literal['optimistic', 'realistic', 'pessimistic']]] = ('optimistic', 'realistic', 'pessimistic'): the supported rank types. Most of the time equal to all rank types

supports_weights: ClassVar[bool] = True: whether the metric supports weights

synonyms: ClassVar[Collection[str]] = ('h@k', 'hits@k', 'h@', 'hits@', 'hits_at_', 'h_at_'): synonyms for this metric

value_range: ClassVar[ValueRange] = ValueRange(lower=0, lower_inclusive=True, upper=1, upper_inclusive=True): the value range

Methods Documentation

__call__(ranks: ndarray, num_candidates: ndarray | None = None, weights: ndarray | None = None) → float[source]

Evaluate the metric.

Parameters:

ranks (ndarray) – shape: s the individual ranks
num_candidates (ndarray | None) – shape: s the number of candidates for each individual ranking task
weights (ndarray | None) – shape: s the weights for the individual ranks

Return type:

float

expected_value(num_candidates: ndarray, num_samples: int | None = None, weights: ndarray | None = None, **kwargs) → float[source]

Compute expected metric value.

The expectation is computed under the assumption that each individual rank follows a discrete uniform distribution \(\mathcal{U}\left(1, C_i\right)\), where \(C_i\) denotes the number of candidates for ranking task \(r_i\).

Parameters:

num_candidates (ndarray) – the number of candidates for each individual rank computation
num_samples (int | None) – the number of samples to use for simulation, if no closed form expected value is implemented
weights (ndarray | None) – shape: s the weights for the individual ranking tasks
kwargs – additional keyword-based parameters passed to get_sampled_values(), if no closed form solution is available

Returns:

the expected value of this metric

Raises:

NoClosedFormError – raised if a closed form expectation has not been implemented and no number of samples are given

Return type:

float

Note

Prefers analytical solution, if available, but falls back to numeric estimation via summation, cf. RankBasedMetric.numeric_expected_value().

extra_repr() → str

Generate the extra repr, cf. :meth`torch.nn.Module.extra_repr`.

Returns:: the extra part of the repr()
Return type:: str

classmethod get_description() → str

Get the description.

Return type:: str

classmethod get_link() → str

Get the link from the docdata.

Return type:: str

classmethod get_range() → str

Get the math notation for the range of this metric.

Return type:: str

get_sampled_values(num_candidates: ndarray, num_samples: int, weights: ndarray | None = None, generator: Generator | None = None, memory_intense: bool = True) → ndarray

Calculate the metric on sampled rank arrays.

Parameters:

num_candidates (ndarray) – shape: s the number of candidates for each ranking task
num_samples (int) – the number of samples
weights (ndarray | None) – shape: s the weights for the individual ranking tasks
generator (Generator | None) – a random state for reproducibility
memory_intense (bool) – whether to use a more memory-intense, but more time-efficient variant

Returns:

shape: (num_samples,) the metric evaluated on num_samples sampled rank arrays

Return type:

ndarray

iter_extra_repr() → Iterable[str][source]

Iterate over the components of the extra_repr().

This method is typically overridden. A common pattern would be

def iter_extra_repr(self) -> Iterable[str]:
    yield from super().iter_extra_repr()
    yield "<key1>=<value1>"
    yield "<key2>=<value2>"

Returns:: an iterable over individual components of the extra_repr()
Return type:: Iterable[str]

numeric_expected_value(**kwargs) → float

Compute expected metric value by summation.

The expectation is computed under the assumption that each individual rank follows a discrete uniform distribution \(\mathcal{U}\left(1, C_i\right)\), where \(C_i\) denotes the number of candidates for ranking task \(r_i\).

Parameters:: kwargs – keyword-based parameters passed to get_sampled_values()
Returns:: The estimated expected value of this metric
Return type:: float

Warning

Depending on the metric, the estimate may not be very accurate and converge slowly, cf. https://docs.scipy.org/doc/scipy/reference/generated/scipy.stats.rv_discrete.expect.html

numeric_expected_value_with_ci(**kwargs) → ndarray

Estimate expected value with confidence intervals.

Return type:: ndarray

numeric_variance(**kwargs) → float

Compute variance by summation.

The variance is computed under the assumption that each individual rank follows a discrete uniform distribution \(\mathcal{U}\left(1, C_i\right)\), where \(C_i\) denotes the number of candidates for ranking task \(r_i\).

Parameters:: kwargs – keyword-based parameters passed to get_sampled_values()
Returns:: The estimated variance of this metric
Return type:: float

Warning

Depending on the metric, the estimate may not be very accurate and converge slowly, cf. https://docs.scipy.org/doc/scipy/reference/generated/scipy.stats.rv_discrete.expect.html

numeric_variance_with_ci(**kwargs) → ndarray

Estimate variance with confidence intervals.

Return type:: ndarray

std(num_candidates: ndarray, num_samples: int | None = None, weights: ndarray | None = None, **kwargs) → float

Compute the standard deviation.

Parameters:

num_candidates (ndarray) – the number of candidates for each individual rank computation
num_samples (int | None) – the number of samples to use for simulation, if no closed form expected value is implemented
weights (ndarray | None) – shape: s the weights for the individual ranking tasks
kwargs – additional keyword-based parameters passed to variance(),

Returns:

The standard deviation (i.e. the square root of the variance) of this metric

Return type:

float

For a detailed explanation, cf. RankBasedMetric.variance().

variance(num_candidates: ndarray, num_samples: int | None = None, weights: ndarray | None = None, **kwargs) → float[source]

Compute variance.

The variance is computed under the assumption that each individual rank follows a discrete uniform distribution \(\mathcal{U}\left(1, C_i\right)\), where \(C_i\) denotes the number of candidates for ranking task \(r_i\).

Parameters:

num_candidates (ndarray) – the number of candidates for each individual rank computation
num_samples (int | None) – the number of samples to use for simulation, if no closed form expected value is implemented
weights (ndarray | None) – shape: s the weights for the individual ranking tasks
kwargs – additional keyword-based parameters passed to get_sampled_values(), if no closed form solution is available

Returns:

The variance of this metric

Raises:

NoClosedFormError – Raised if a closed form variance has not been implemented and no number of samples are given

Return type:

float

Note

Prefers analytical solution, if available, but falls back to numeric estimation via summation, cf. RankBasedMetric.numeric_variance().