Quantitative way to compare statistical models
In statistics, the deficiency is a measure to compare a statistical model with another statistical model. The concept was introduced in the 1960s by the french mathematician Lucien Le Cam, who used it to prove an approximative version of the Blackwell–Sherman–Stein theorem.[1][2] Closely related is the Le Cam distance, a pseudometric for the maximum deficiency between two statistical models. If the deficiency of a model in relation to is zero, then one says is better or more informative or stronger than .
Introduction
Le Cam defined the statistical model more abstract than a probability space with a family of probability measures. He also didn't use the term "statistical model" and instead used the term "experiment". In his publication from 1964 he introduced the statistical experiment to a parameter set as a triple consisting of a set , a vector lattice with unit and a family of normalized positive functionals on .[3][4] In his book from 1986 he omitted and .[5]
This article follows his definition from 1986 and uses his terminology to emphasize the generalization.
Basic concepts
Let be a parameter space. Given an abstract L1-space (i.e. a Banach lattice such that for elements also holds) consisting of lineare positive functionals . An experiment is a map of the form , such that . is the band induced by and therefore we use the notation . For a denote the . The topological dual of an L-space with the conjugated norm is called an abstract M-space. It's also a lattice with unit defined through for .
Let and be two L-space of two experiments and , then one calls a positive, norm-preserving linear map, i.e. for all , a transition. The adjoint of a transitions is a positive linear map from the dual space of into the dual space of , such that the unit of is the image of the unit of ist.[5]
Deficiency
Let be a parameter space and and be two experiments indexed by . Le and denote the corresponding L-spaces and let be the set of all transitions from to .
The deficiency of in relation to is the number defined in terms of inf sup:
- [6]
where denoted the total variation norm . The factor is just for computational purposes and is sometimes omitted.
Le Cam distance
The Le Cam distance is the following pseudometric
This induces an equivalence relation and when , then one says and are equivalent. The equivalent class of is also called the type of .
Often one is interested in families of experiments with and with . If as , then one says and are asymptotically equivalent.
Let be a parameter space and be the set of all types that are induced by , then the Le Cam distance is complete with respect to . The condition induces a partial order on , one says is better or more informative or stronger than .[6]
References
Bibliography