Sunday, 24 September 2006

linear algebra - Matrix approximation

I'll address the last question (about an a priori bound for epsilonepsilon).



If nggkggmnggkggm, the worst-case bound for epsilonepsilon is between c(m)cdotk2/(m1)c(m)cdotk2/(m1) and C(m)cdotk1/(m1)C(m)cdotk1/(m1) (probably near the former but I haven't checked this carefully). Note that the bound does not depend on nn.



Proof.
The columns of AA form a set SS of cardinality at most nn in mathbbRmmathbbRm. For a given epsilonepsilon, a suitable BB exists if and only if there is a subset TsubsetSTsubsetS of cardinality at most mm such that the convex hull conv(T)conv(T) majorizes the set (1epsilon)S(1epsilon)S in the following sense: for every vinSvinS, there is a point in conv(T)conv(T) which is component-wise greater than (1epsilon)v(1epsilon)v. And this majorization is implied by the following: conv(sym(T))conv(sym(T)) contains the set (1epsilon)S(1epsilon)S, or equivalently, the set (1epsilon)conv(sym(S))(1epsilon)conv(sym(S)), where by sym(X)sym(X) denotes the minimal origin-symmetric set containing XX, that is, sym(X)=XcupXsym(X)=XcupX.



Consider the polytope P=conv(sym(S))P=conv(sym(S)). We want to find a subset of its vertices of cardinality at most kk, such that their convex hull approximates PP up to (1epsilon)(1epsilon)-rescaling. This problem is invariant under linear transformations, and we may assume that PP has nonempty interior. Then Fritz John's theorem asserts that there is a linear transformation of mathbbRmmathbbRm which transforms PP to a body contained in the unit ball and containing the ball of radius 1/sqrtm1/sqrtm. For such a set, (1epsilon)(1epsilon)-scaling approximation follows from (epsilon/sqrtm)-approximation in the sense of Hausdorff distance. So it suffices to choose T to be an (epsilon/sqrtm)-net in S. Then a standard packing argument gives the above upper bound for epsilon.



On the other hand, if S is contained in the unit sphere and separated away from the coordinate hyperplanes, you must choose T to be a sqrtepsilon-net in S. This gives the lower bound; the "worst case" is a uniformly packed set of n=C(m)cdotk points on the sphere.



UPDATE.



Fritz John theorem, also known as John Ellipsoid Theorem, says that for any origin-symmetric convex body KsubsetmathbbRm, there is an ellipsoid E (also centered at the origin) such that EsubsetKsubsetsqrtmE. (There is a non-symmetric variant as well but the constant is worse.) The linear transformation that I used just sends E to the unit ball. There are lecture notes about John ellipsoid here and probably in many other sources.



Comparing scaling distance (see also Banach-Mazur distance) and Hausdorff distance between convex bodies is based on the following. The scaling distance is determined by the worst ratio of the support functions of the two bodies, and the Hausdorff distance is the maximum difference between the support functions. Once you captured the bodies between two balls, you can compare relative and absolute difference. This should be explained in any reasonable textbook in convex geometry; unfortunately I'm not an expert in textbooks, especially English-language ones.



By "packing argument" I mean variants of the following argument showing that for any epsilon, any subset S of the unit ball in mathbbRm contains an epsilon-net of cardinality at most (1+2/epsilon)m. Take a maximal epsilon-separated subset T of S, it is always an epsilon-net. Since T is epsilon-separated, the balls of radius epsilon/2 centered at the points of T are disjoint, hence the sum of their volumes is no greater than the volume of the (1+epsilon/2)-ball that contains them all. Writing the volume of an r-ball as c(m)cdotrm yields the result. This argument gives a rough estimate
epsilonle(2sqrtm+1)k1/m
in the original problem (up to errors in my quick computations). To improve the exponent one can consider the (m1)-dimensional surface of P rather that the whole ball.

No comments:

Post a Comment