The VC (VapnikβChervonenkis) dimension is a measure of complexity of a set of functions that can be learned by a simple binary classification algorithm. To define it more formally, we need the concept of shattering.
Given a collection \(S = S_1, ..., S_M\) of subsets of a finite set \(U\), the VC-dimension of \(S\) is the size of the largest set \(X \subseteq U\) such that for every \(X' \subseteq X\), there is an \(i\) for which \(S_i \cap X = X'\). For this \(i\), we can write that \(X\) is shattered by \(S\). So, a Boolean circuit \(C\) that computes a function \(f:\{0, 1\}^m \times \{0, 1\}^n \rightarrow \{0, 1\}\) succinctly represents a collection \(S\) of \(π^π\) subsets of \(πΌ = \{π, π\}^π\) as follows: the set \(πΊ_π\) consists of exactly those elements \(x\) for which \(πͺ(π, π) = π\).
Using this notion, the language VC-DIMENSION is the set of pairs \((C, k)\) for which \(C\) represents a collection of subsets \(S\) whose VC dimension is at least \(k\). The purpose of this blog post is to show that this language has an intricate relationship with the polynomial hierarchy in computational complexity.
To explain this better, I'm going to introduce some notation:
Let \(\text{P}^A\) be the set of problems solvable in poly-time with a TM that uses an oracle for a problem in \(A\).
Let \(\text{NP}^A\) be the set of problems solvable in nondeterministic poly-time with a TM that uses an oracle for a problem in \(A\).
Let \(\text{coNP}^A\) be the set of problems solvable in co-nondeterministic poly-time with a TM that uses an oracle for a problem in \(A\).
Then, define \(\Delta_0^P := \Sigma_0^P := \Pi_0^P:=P\), where \(P\) is the set of problems that can be solved in poly-time. Then, for all \(i \geq 0\), define:
$$\Delta_{i+1}^P:=\text{P}^{\Sigma_i^P}$$
$$\Sigma_{i+1}^P:=\text{NP}^{\Sigma_i^P}$$
$$\Pi_{i+1}^P:=\text{coNP}^{\Sigma_i^P}$$
The result that I want to show here is that the VC-dimension is in \(\Sigma_3^P\), and that is the strongest form of problem in \(\Sigma_3^P\). So, any solution that can compute the VC-dimension can be used to compute any other problem in \(\Sigma_3^P\). The complexity-theoretic word for this phenomenon is
completenesss. That is to say, the V-dimension is \(\Sigma_3^P\)-complete.
Given an instance of VC-DIMENSION:
$$\text{VCβDIMENSION}=\{(πΆ,π):βX,|X|β₯m\text{ s.t. }βX' βπ,βπ\text{ s.t. }βxβX',C(i,x)=1\}$$
We can show that it is \(Ξ£_3π\)-complete by reducing from an instance of \(πππ΄π_3\) which is known to be \(Ξ£_3^P\)-complete. Therefore, let an arbitrary instance of \(πππ΄π_3\) be:
$$βπ₯_1 βπ₯_2 βπ₯_3 \text{ s.t. } \varphi(π₯_1,π₯_2,π₯_3)=1$$
We then define the universe \(U\) as the set \(\{0, 1\}^\ell \times \{1, 2, 3, ... \ell\}\) s.t. \(βπ₯_1, |π₯_1| = \ell\), we define the following subset:
$$π_{π₯_1} = \{π₯_1\} \times \{1,2,3,...\ell\}$$
Using this, we define the collection of sets, \(S\), succinctly represented by circuit \(C\) (note that the circuit would have to maintain this invariant \(βπ₯_1,π₯_2,π₯_3\), as:
1) If \(π(π₯_1,π₯_2,π₯_3)=1\), then \(π(π₯_1,π₯_2,π₯_3)=\{π₯_1\}Γ\{1,2,3,...\ell\},|π₯_1|=l\)
2) If \(π(π₯_1, π₯_2, π₯_3) = 0\), then \(π(π₯_1, π₯_2, π₯_3) = π\), where \(π\) is the empty set (note that we donβt need to
address this case here since \(π(π₯_1,π₯_2,π₯_3) = 0\) should really just do nothing, hence the empty set will suffice), where the name of each set is \((π₯_1, π₯_2, π₯_3)\).
We now define the reduction function \(f\):
Inputs: A set named \((π₯_1, π₯_2, π₯_3)\) and an element, \(a = (π₯'_1,π₯_2)\) of a potential subset \(π_{x_1}\).
Function: The isomorphism between the VC-DIMENSION instance \((πΆ,\ell)\) and \(πππ΄π_3\) instance \(π(π₯_1,π₯_2,π₯_3)\).
Deciding Procedure: We return βACCEPTβ if \(π β π(π₯_1, π₯_2, π₯_3)\) and βNOβ if \(π β π(π₯_1, π₯_2, π₯_3)\).
Therefore, we do the following:
If \((π(π₯_1, π₯_2, π₯_3) = 1) β§ (a β (x_1, \ell))\), return βACCEPTβ.
Else, return βREJECTβ.
Finally, the onus of the proof is to show that βyesβ instances of \(πππ΄π_3\) maps to βyesβ instances of the VC-
DIMENSION decision problem and vice versa, and that βnoβ instances of \(πππ΄π_3\) maps to βnoβ instances of the
VC-DIMENSION decision problem and vice versa:
To show that βyesβ maps to βyesβ:
To show that βyesβ \(πππ΄π_3\) instances map to βyesβ VC-DIMENSION instances:
Given a βyesβ instance of \(πππ΄π_3\), \(βπ₯_1 βπ₯_2 βπ₯_3 \text{ s.t. } π(π₯_1, π₯_2 , π₯_3 ) = 1\), the set \(π(π₯_1 , π₯_2 , π₯_3 ) = \{x_1\}\times x_2\).
Recall that the initial subset was:
$$π_{π₯_1} =\{π₯_1\}\times\{1,2,3,...\ell\}$$
However, since \(π(π₯_1, π₯_2, π₯_3) = \{π₯_1\} \times π₯_2\) and \(π(π₯_1, π₯_2, π₯_3) = 1\), the VC dimension of \(S\) would be \(β₯ π₯_2 = \ell\).
Hence, the VC dimension of \(S\) that is represented by circuit \(C\) would be \(β₯ \ell\), which satisfies the decision problem.
QED.
To show that βyesβ VC-DIMENSION instances map to βyesβ \(πππ΄π_3\) instances:
Given a βyesβ instance of VC-DIMENSION \((πΆ, π)\), then the set \(S\) represented by the circuit \(C\) has a size atleast \(m\).
Therefore, \(βπ, |π| β₯ π \text{ s.t. } π \text{ shatters } π\). It follows then by definition of shattering that \(β π \text{ s.t. } π β π_π\).
However, this can only happen if \(βπ₯_2, {π₯_1} \times π₯_2 β π\). This is the same thing as writing:
$$βπ₯_1 βπ₯_2 βπ₯_3 s.t.π(π₯_1,π₯_2,π₯_3) = 1,$$
which is a βyesβ instance of \(πππ΄π_3\).
QED.
To show that βnoβ maps to βnoβ, the contrapositive argument holds since a βyesβ instance of the VC-
DIMENSION problem being reduced to a βyesβ instance of \(πππ΄π_3\) implies that βnoβ instances of \(πππ΄π_3\) can be
reduced to βnoβ instances of VC-DIMENSION, and a βyesβ instance of \(πππ΄π_3\) being reduced to a βyesβ instance of VC-DIMENSION implies that a βnoβ instance of VC-DIMENSION can be reduced to a βnoβ instance of \(QSAT_3\).
Since we have shown that βyesβ maps to βyesβ and βnoβ maps to βnoβ, we have that \(β\) a reduction function from instances of \(πππ΄π_3\) to instances of the VC-DIMENSION problem and that \(β\) a reduction function from
instances of the VC-DIMENSION problem to instances of \(πππ΄π_3\). Additionally, since \(πππ΄π_3\) is known to be \(\Sigma_3^P\)-complete, it follows then that ππΆ-DIMENSION is also \(\Sigma_3^π\)-complete.
QED.