VC dimension of ellipsoids


Original Info

  • Title: VC dimension of ellipsoids
  • ArXiv ID: 1109.4347
  • Date: 2011-09-21
  • Authors: Yohji Akama and Kei Irie

Abstract

We establish that the VC dimension of the class of d-dimensional ellipsoids is (d^2+3d)/2, and that maximum likelihood estimation with N-component d-dimensional Gaussian mixture models induces a geometric class having VC dimension at least N(d^2+3d)/2.

Keywords: VC dimension; finite dimensional ellipsoid; Gaussian mixture model

Full Content

For sets $X \subseteq \mathbb{R}^d$ and $Y \subseteq X$, we say that a set $B \subseteq \mathbb{R}^d$ cuts $Y$ out of $X$ if $Y = X \cap B$. A class $C$ of subsets of $\mathbb{R}^d$ is said to shatter a set $X \subseteq \mathbb{R}^d$ if every $Y \subseteq X$ is cut out of $X$ by some $B \in C$. The vc dimension of $C$, denoted by $\mathrm{VCdim}(C)$, is defined to be the maximum $n$ (or $\infty$ if no such maximum exists) for which some subset of $\mathbb{R}^d$ of cardinality $n$ is shattered by $C$.
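The definitions of "cuts out" and "shatters" can be checked by brute force on small finite examples. The sketch below is only an illustration: the function names, the finite family of open intervals, and the point pool are hypothetical choices made here, and sets are represented as membership predicates. In dimension one the level sets of a Gaussian density are open intervals, so the value 2 found here agrees with $(d^2+3d)/2$ at $d = 1$.

```python
from itertools import combinations

def cuts(X, B):
    """Return the subset of X cut out by B, where B is a membership predicate."""
    return frozenset(x for x in X if B(x))

def shatters(X, C):
    """True if every subset of the finite set X is cut out by some member of C."""
    X = list(X)
    return len({cuts(X, B) for B in C}) == 2 ** len(X)

def vc_dim_on_pool(points, C):
    """Largest n such that some n-element subset of `points` is shattered by C.

    Subsets of a shattered set are shattered, so the search can stop at the
    first size for which no subset is shattered.
    """
    best = 0
    for n in range(1, len(points) + 1):
        if any(shatters(S, C) for S in combinations(points, n)):
            best = n
        else:
            break
    return best

# Hypothetical toy class: open intervals (lo, hi) with half-integer endpoints.
def open_interval(lo, hi):
    return lambda x: lo < x < hi

endpoints = [0.5, 1.5, 2.5, 3.5, 4.5]
intervals = [open_interval(lo, hi) for lo in endpoints for hi in endpoints]
points = [1, 2, 3, 4]

print(vc_dim_on_pool(points, intervals))  # prints 2
```

No three points on the line are shattered by intervals, because no interval contains the two outer points while excluding the middle one.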

The vc dimension of a class measures the complexity of the class and is employed in empirical process theory [4], statistical and computational learning theory [8,3], and discrete geometry [6]. Although asymptotic estimates of vc dimensions are known for many classes, exact values are known for only a few (e.g. the class of Euclidean balls [10] and the class of halfspaces [6]).

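In Section 2, we prove Theorem 1, which states that the vc dimension of the class of $d$-dimensional ellipsoids is $(d^2+3d)/2$.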

Here a covariance matrix of size $d$ is, by definition, a real, positive definite matrix. As in statistical learning theory [8], for a class $P$ of probability density functions we consider the class $D(P)$ of sets $\{x \in \mathbb{R}^d ; f(x) > s\}$ such that $f$ is any probability density function in $P$ and $s$ is any positive real number. Then $D(G_d)$, with $G_d$ the class of $d$-dimensional Gaussian distributions, is the class of $d$-dimensional ellipsoids.
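As a sketch of why the level sets of a single Gaussian density are ellipsoids, the following derivation rewrites the condition $f(x) > s$. The symbols $\mu$ (mean vector) and $\Sigma$ (covariance matrix), and the standard form of the density, are assumptions introduced here for illustration; they do not appear in the text above.

```latex
f(x) = (2\pi)^{-d/2} (\det\Sigma)^{-1/2}
       \exp\left(-\tfrac{1}{2}\, {}^t(x-\mu)\,\Sigma^{-1}(x-\mu)\right)

f(x) > s
\iff {}^t(x-\mu)\,\Sigma^{-1}(x-\mu) < r,
\qquad r := -2\log\left(s\,(2\pi)^{d/2}(\det\Sigma)^{1/2}\right)
```

Because $\Sigma^{-1}$ is positive definite, this set is an open ellipsoid centered at $\mu$ whenever $r > 0$, that is, whenever $s$ is below the maximum value $f(\mu)$ of the density.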

For a positive integer $N$, an $N$-component $d$-dimensional Gaussian mixture model [7] (an $(N, d)$-gmm) is, by definition, any probability distribution belonging to the convex hull of some $N$ $d$-dimensional Gaussian distributions. Suppose we are given a sample from a population $(N, d)$-gmm but the number $N$ of components is unknown. Selecting $N$ from the sample is an instance of Akaike’s model selection problem [1] (see [5] for a recent approach). The authors of [9] proposed choosing $N$ by the structural risk minimization principle [8], in which an important role is played by the vc dimension of the class $D((G_d)_N)$, with $(G_d)_N$ being the class of $(N, d)$-gmms. Our result is that the vc dimension of $D((G_d)_N)$ is greater than or equal to $N(d^2 + 3d)/2$.
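To get a feel for the sizes involved, the short sketch below tabulates the stated values. The function names are hypothetical, and the comparison with a parameter count is an observation added here rather than a claim from the paper: $(d^2+3d)/2$ equals the number of free entries in a mean vector ($d$) plus a symmetric $d \times d$ matrix ($d(d+1)/2$).

```python
def ellipsoid_vc_dim(d: int) -> int:
    """(d^2 + 3d)/2, the value stated for d-dimensional ellipsoids."""
    return (d * d + 3 * d) // 2  # d(d+3) is always even, so the division is exact

def gmm_vc_lower_bound(n_components: int, d: int) -> int:
    """N(d^2 + 3d)/2, the stated lower bound for N-component mixtures."""
    return n_components * ellipsoid_vc_dim(d)

def mean_and_covariance_parameters(d: int) -> int:
    """Free entries of a mean vector plus a symmetric d x d matrix."""
    return d + d * (d + 1) // 2

for d in range(1, 5):
    print(d, ellipsoid_vc_dim(d), gmm_vc_lower_bound(3, d))
```

For example, `ellipsoid_vc_dim(2)` is 5 and `gmm_vc_lower_bound(3, 2)` is 15.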

We will prove Theorem 1. For a positive integer $B$, a vector $a \in \mathbb{R}^B \setminus \{0\}$, and $c \in \mathbb{R}$, we write an affine function $\ell_{a,c}(x) := {}^t a x + c$ ($x \in \mathbb{R}^B$) and an open halfspace $H_{a,c} := \{x \in \mathbb{R}^B ; \ell_{a,c}(x) < 0\}$. We say a set $W \subseteq \mathbb{R}^B$ spans an affine subspace $H \subseteq \mathbb{R}^B$ if $H$ is the smallest affine subspace that contains $W$. The cardinality of a set $S$ is denoted by $|S|$. For a vector $a = {}^t(a_1, \ldots, a$ …

Proof. By an affine transformation we can assume without loss of generality that all the components of the vector $a$ are 1 and that $S$ is the canonical basis $\{e$ …

Proof. Let $B$ be the right-hand side. Let $\phi$ be a map $S^{d-1} \to \mathbb{R}^B$ which maps

there is some set $S \subset S^{d-1}$ such that $|S| = B$ and $\phi(S)$ spans the hyperplane. Let $a \in \mathbb{R}^B$ be a vector with the first $d$ components being 1 and the other components being 0. By Lemma 2, for any $\varepsilon > 0$ the family

By the definition of $\phi$, the class of sets defined by quadratic inequalities

But when $\varepsilon$ is sufficiently small, all of these sets are ellipsoids.

We verify the converse inequality.

Below, the convex hull of a set A is denoted by conv(A).

If there are $x = (u, x_B)$, $y = (u, y_B) \in S$ such that $x_B < y_B$, then for any $a \in \mathbb{R}^B$ with the last component nonnegative and for any $c \in \mathbb{R}$ we have $\ell_{a,c}(x) \le \ell_{a,c}(y)$, and thus $x \in H_{a,c} = \{x \in \mathbb{R}^B ; \ell_{a,c}(x) < 0\}$ whenever $y \in H_{a,c}$. This contradicts the assumption that $C$ shatters $S$, because no member of $C$ cuts $\{y\}$ out of $S$. Therefore, for the canonical projection $\pi :$

Applying Radon’s theorem [6] to the set $\pi(S) \subset \mathbb{R}^{B-1}$, we obtain a partition $(T_1, T_2)$ of $S$ such that we can take $y$ from $\mathrm{conv}(\pi(T_1)) \cap \mathrm{conv}(\pi(T_2))$. Then there are $z, z' \in \mathbb{R}$ such that $(y, z) \in \mathrm{conv}(T_1)$ and $(y, z') \in \mathrm{conv}(T_2)$. Because $C$ shatters $S$, there are some $a \in \mathbb{R}^B$ and some $c \in \mathbb{R}$ such that the last component $a_B$ of $a$ is nonnegative and the halfspace $H_{a,c} \in C$ cuts $T_1$ out of $S$. Thus we have $\ell_{a,c}(x) < 0$ for all $x \in \mathrm{conv}(T_1)$ while $\ell_{a,c}(x) \ge 0$ for all $x \in \mathrm{conv}(T_2)$, where $T_2 = S \setminus T_1$. Therefore $\ell_{a,c}(y, z) < \ell_{a,c}(y, z')$, that is, $a_B (z' - z) > 0$; since $a_B \ge 0$, this gives $a_B > 0$ and $z' > z$. On the other hand, some member $H_{a',c'} \in C$ cuts $T_2$ out of $S$. By similar reasoning, we have $z > z'$, which is a contradiction.
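The appeal to Radon’s theorem can be made concrete: any $n + 2$ points in $\mathbb{R}^n$ split into two groups whose convex hulls share a point (in the proof, $n = B - 1$). The sketch below is a minimal illustration using exact rational arithmetic; the function names and the sample points are hypothetical and are not taken from the paper. It finds a nonzero vector $\lambda$ with $\sum_i \lambda_i p_i = 0$ and $\sum_i \lambda_i = 0$, then splits the indices by the sign of $\lambda_i$.

```python
from fractions import Fraction

def null_vector(M):
    """Return a nonzero v with M v = 0, for a matrix with more columns than rows."""
    rows = [[Fraction(x) for x in row] for row in M]
    n_rows, n_cols = len(rows), len(rows[0])
    pivots = []  # pivots[i] is the pivot column of reduced row i
    r = 0
    for c in range(n_cols):
        pivot = next((i for i in range(r, n_rows) if rows[i][c] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        rows[r] = [x / rows[r][c] for x in rows[r]]
        for i in range(n_rows):
            if i != r and rows[i][c] != 0:
                factor = rows[i][c]
                rows[i] = [x - factor * y for x, y in zip(rows[i], rows[r])]
        pivots.append(c)
        r += 1
        if r == n_rows:
            break
    # Set one free variable to 1 and solve for the pivot variables.
    free = next(c for c in range(n_cols) if c not in pivots)
    v = [Fraction(0)] * n_cols
    v[free] = Fraction(1)
    for i, c in enumerate(pivots):
        v[c] = -rows[i][free]
    return v

def radon_partition(points):
    """Split n+2 points of R^n into two index lists whose convex hulls meet."""
    n = len(points[0])
    assert len(points) == n + 2
    # Column i is (p_i, 1): a null vector satisfies sum(l_i p_i) = 0, sum(l_i) = 0.
    M = [[p[k] for p in points] for k in range(n)] + [[1] * len(points)]
    lam = null_vector(M)
    pos = [i for i, l in enumerate(lam) if l > 0]
    rest = [i for i, l in enumerate(lam) if l <= 0]
    total = sum(lam[i] for i in pos)
    common = tuple(sum(lam[i] * points[i][k] for i in pos) / total for k in range(n))
    return pos, rest, common

pts = [(0, 0), (2, 0), (0, 2), (1, 1)]
pos, rest, common = radon_partition(pts)
print(pos, rest, common)
```

For these four points the partition separates the point (1, 1) from the other three, and the common point is (1, 1) itself, the midpoint of (2, 0) and (0, 2).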

Proof. Suppose $0 \notin \mathrm{conv}(A)$. Then for every finite subset $A'$ of $A$, $0 \notin \mathrm{conv}(A')$ and there is a hyperplane $J$ through 0 such that $\mathrm{conv}(A')$ is contained in one of the two open halfspaces determined by $J$. So there is a new rectangular coordinate system with the same origin as the old one, in which one of the coordinate axes is normal to $J$ and any $a \in A'$ is represented as $(a_1, \ldots, a_B)$ with $a_B > 0$. So $\mathrm{VCdim}(\{H_{a,c}\}_{a \in A', c \in \mathbb{R}}) \le B$ by Lemma 4, and thus $\mathrm{VCdim}(\{H_{a,c}\}_{a \in A, c \in \mathbb{R}}) \le B$, because shattering a finite set uses only finitely many halfspaces.

The proof of Theorem 1 is as follows. By Lemma 3, we have only to establish that the class of $d$-dimensional ellipsoids has vc dimension less than or equal to $B := (d^2 + 3d)/2$. Assume otherwise. For $a = {}^t(a_1, \ldots, a_B) \in \mathbb{R}^B$ and $x = {}^t(x_1, \ldots, x_d)$, define a quadratic form $q_a(x)$ and a quadratic polynomial $p_a(x)$ by

Let $A$ be the set of $a \in \mathbb{R}^B$ s

…(Full text truncated)…

Reference

This content is AI-processed based on ArXiv data.
