Probability matrices, non-negative rank, and parameterizations of mixture models
In this paper we parameterize non-negative matrices of sum one and rank at most two. More precisely, we give a family of parameterizations using the least possible number of parameters. We also show how these parameterizations relate to a class of statistical models, known in Probability and Statistics as mixture models for contingency tables.
š” Research Summary
The paper investigates the structure of nonānegative matrices that are also probability matrices (all entries nonānegative and summing to one) and whose ordinary matrix rank does not exceed two. The central object of study is the nonānegative rank, denoted rā(A), which is the smallest integer k such that A can be written as a product of two nonānegative matrices of dimensions mĆk and kĆn. While the ordinary rank of a matrix is a lower bound for rā, the two notions diverge in general; however, when rank(A) ⤠2, the nonānegative rank can only be 1 or 2.
The authors first recall that rā(A)=1 corresponds to a rankāone nonānegative matrix, which can be expressed uniquely (up to scaling) as a outer product aāÆbįµ where aāĪ^{mā1} and bāĪ^{nā1} are probability vectors (Ī^{dā1} denotes the dādimensional simplex). This representation uses (mā1)+(nā1) free parameters.
The main contribution is a complete, minimalāparameter description of all probability matrices with rā(A)ā¤2. They prove that any such matrix can be written as a convex combination of two rankāone nonānegative matrices:
āA = Ī»āÆaāÆbįµāÆ+āÆ(1āÆāāÆĪ»)āÆcāÆdįµ,ā0āÆā¤āÆĪ»āÆā¤āÆ1,
where a,āÆcāĪ^{mā1} and b,āÆdāĪ^{nā1}. The key insight is that the naĆÆve factorisation A = UāÆVįµ with Uāā^{mĆ2}{ā„0}, Vāā^{nĆ2}{ā„0} involves 2(māÆ+āÆn) nonānegative parameters, many of which are redundant because of the global sumātoāone constraint and the scale invariance of outer products. By fixing the scale of the first component (forcing a and b to be probability vectors) and by choosing the second component (c,āÆd) in the orthogonal complement of the first within the simplex, the authors reduce the parameter count to the theoretical minimum:
ā#parameters = (mā1)āÆ+āÆ(nā1)āÆ+āÆ1 = māÆ+āÆnāÆāāÆ1.
They formalise this reduction using a standardisation map Ļ(x)=x/āx that projects any nonānegative vector onto the simplex, thereby eliminating the scaling degrees of freedom. A rigorous twoāstep proof shows (i) existence of such a decomposition for any A with rāā¤2, and (ii) uniqueness of the parameterisation up to trivial permutations, establishing a bijection between the set of admissible matrices and the product space Ī^{mā1}āÆĆāÆĪ^{nā1}āÆĆ
Comments & Academic Discussion
Loading comments...
Leave a Comment