Leave-one-out error


Preliminary notations

With $X$ and $Y \subset \mathbb{R}$ being respectively an input and an output space, we consider a training set

$$S = \{ z_1 = (x_1, y_1), \ldots, z_m = (x_m, y_m) \}$$

of size $m$ in $Z = X \times Y$, drawn i.i.d. from an unknown distribution $D$. A learning algorithm is a function $f$ from $Z^m$ into $\mathcal{F} \subset Y^X$ which maps a training set $S$ onto a function $f_S$ from $X$ to $Y$. To avoid complex notation, we consider only deterministic algorithms. It is also assumed that the algorithm $f$ is symmetric with respect to $S$, i.e. it does not depend on the order of the elements in the training set. Furthermore, we assume that all functions are measurable and all sets are countable, which does not limit the interest of the results presented here.
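
To make the mapping $S \mapsto f_S$ concrete, here is a minimal Python sketch of one such algorithm; the choice of one-dimensional least squares through the origin is purely illustrative, not something the text prescribes.

```python
def fit(S):
    """A toy learning algorithm f: Z^m -> F, mapping a training set
    S = [(x_1, y_1), ..., (x_m, y_m)] to a hypothesis f_S: X -> Y.
    Illustrative choice: 1-D least squares through the origin. It is
    deterministic, and symmetric in S because the two sums below do
    not depend on the order of the examples.
    """
    num = sum(x * y for x, y in S)
    den = sum(x * x for x, y in S)
    w = num / den
    return lambda x: w * x  # the learned hypothesis f_S
```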

The loss of a hypothesis $f$ with respect to an example $z = (x, y)$ is then defined as $V(f, z) = V(f(x), y)$. The empirical error of $f$ is

$$I_S[f] = \frac{1}{m} \sum_{i=1}^{m} V(f, z_i).$$
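
As an illustration, the sketch below computes $I_S[f]$ for a fitted hypothesis; the squared loss is an assumed example of $V$, not the only admissible choice.

```python
def squared_loss(prediction, y):
    # An illustrative choice of V(f(x), y); any loss of this form works.
    return (prediction - y) ** 2

def empirical_error(f_S, S, V=squared_loss):
    """I_S[f] = (1/m) * sum_i V(f_S(x_i), y_i): the average loss on S."""
    return sum(V(f_S(x), y) for x, y in S) / len(S)
```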

The true error of $f$ is $I[f] = \mathbb{E}_z \, V(f, z)$.
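
Since $D$ is unknown, $I[f]$ cannot be computed exactly; when fresh examples can be sampled (e.g. in a simulation), it can be approximated by a Monte Carlo average. In this sketch, `draw_example` is a hypothetical sampler standing in for $D$.

```python
def true_error_estimate(f_S, draw_example, V, n=100_000):
    """Monte Carlo approximation of I[f] = E_z V(f, z), assuming
    `draw_example` returns fresh i.i.d. examples z = (x, y) from D."""
    total = 0.0
    for _ in range(n):
        x, y = draw_example()  # fresh example z = (x, y) ~ D
        total += V(f_S(x), y)
    return total / n
```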

Given a training set $S$ of size $m$, we will build, for all $i = 1, \ldots, m$, modified training sets as follows (both constructions are sketched in code after the list):

  • By removing the $i$-th element:
    $S^{|i} = \{ z_1, \ldots, z_{i-1}, z_{i+1}, \ldots, z_m \}$

  • By replacing the $i$-th element with a new example $z_i'$ drawn from $D$:
    $S^i = \{ z_1, \ldots, z_{i-1}, z_i', z_{i+1}, \ldots, z_m \}$
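
Both constructions are straightforward to express in code. The sketch below also shows how the removed-point sets are typically used: retraining on each $S^{|i}$ and averaging the loss on the held-out example $z_i$ gives the leave-one-out error $\frac{1}{m} \sum_{i=1}^{m} V(f_{S^{|i}}, z_i)$. The helpers `fit` and `squared_loss` are the illustrative sketches from above.

```python
def remove_ith(S, i):
    """S^{|i}: S with the i-th example removed (i is 1-based, as in the text)."""
    return S[:i - 1] + S[i:]

def replace_ith(S, i, z_prime):
    """S^i: S with the i-th example replaced by a new example z_i'."""
    return S[:i - 1] + [z_prime] + S[i:]

def leave_one_out_error(S, fit, V):
    """(1/m) * sum_i V(f_{S^{|i}}, z_i): train on each S^{|i} and evaluate
    the resulting hypothesis on the single held-out example z_i."""
    total = 0.0
    for i, (x_i, y_i) in enumerate(S, start=1):
        f_loo = fit(remove_ith(S, i))  # hypothesis trained without z_i
        total += V(f_loo(x_i), y_i)
    return total / len(S)

# Usage with the illustrative fit and squared_loss defined earlier:
S = [(1.0, 2.1), (2.0, 3.9), (3.0, 6.2)]
print(leave_one_out_error(S, fit, squared_loss))
```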
