mismatches between mathematical essences and data structures

Oct 13, 2009 20:15

Lately, I've been faced with problems of the following type...

computer_science

anonymous October 15 2009, 08:01:50 UTC
Is (1) just an example, or do you actually have a problem that requires some sort of optimization over the space of "k-flats"? If you have such a problem, you may be able to exploit its structure to simplify matters. For example, you can characterize the k-th largest eigenvalue of a Hermitian matrix as the solution of a certain non-linear min-max problem, in which the min is taken over the space of "k-flats" and the max is taken within each flat (minus the origin, actually). That's the result of the Courant-Fischer Minimax Theorem; you can take a look at its statement here: http://www.cs.tau.ac.il/~haima/gen_cf.pdf

My point is, exploiting the structure of your problem may lead to other, more useful characterizations of the solution set that wouldn't require you to actually find a representation for the set of k-flats... there are many different methods to compute eigenvalues...

By the way, it's me, Ives...

gustavolacerda October 15 2009, 17:59:21 UTC
Thanks, Ives!

Both of these are actual problems motivated by research ideas. #1 is like a regression problem: given points on a unit hypersphere centered at the origin, which k-dimensional "great circle" minimizes the residual sum of squares? The great circles correspond to the set of k-flats (aka "k-planes") through the origin (each circle is just the intersection of a flat with the sphere).

I don't see how eigenvalues might help here. Do you? I don't have a square matrix anywhere.

anonymous October 15 2009, 20:55:17 UTC
what do you mean by "residual sum of squares"? is it the sum of the squared distances to the k-flat? could it be the maximum of the squared distances? I will think about it...

Ives

gustavolacerda October 15 2009, 20:57:28 UTC
<< is it the sum of the squared distances to the k-flat? >>

yup. You could use either geodesic distance (ideal) or orthogonal distance. It probably doesn't matter which.
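For what it's worth, the two distances are monotonically related point by point: for a unit vector x and a flat through the origin with orthogonal projection Px, the geodesic distance θ from x to the flat's great circle satisfies sin θ = ||x − Px||. A quick numpy sketch of that identity (the setup here is my own, not from the thread):

```python
import numpy as np

rng = np.random.default_rng(1)
n, k = 4, 2

# Random k-flat through the origin: Q has orthonormal columns spanning it.
Q, _ = np.linalg.qr(rng.standard_normal((n, k)))

# Random point on the unit sphere.
x = rng.standard_normal(n)
x /= np.linalg.norm(x)

proj = Q @ (Q.T @ x)                   # orthogonal projection onto the flat
orth = np.linalg.norm(x - proj)        # orthogonal distance to the flat
geo = np.arcsin(np.clip(orth, 0.0, 1.0))  # geodesic distance to the great circle

# sin(theta) equals the orthogonal distance, and ||Px||^2 + orth^2 = 1.
assert abs(np.sin(geo) - orth) < 1e-9
assert abs(np.linalg.norm(proj) ** 2 + orth ** 2 - 1.0) < 1e-9
```

Since arcsin is increasing, each point's geodesic distance is an increasing function of its orthogonal distance; the sums of squares are still different objectives, though, so the minimizers need not coincide exactly.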

anonymous October 16 2009, 01:47:00 UTC
Did it... if you use the orthogonal distance (the regular Euclidean distance to the flat), it turns out that the solution is the same as the one given by PCA... that's where an eigendecomposition solves this problem... ;)

Just to fix notation: the solution (k-flat) corresponds to the vector space spanned by the eigenvectors associated with the k largest eigenvalues of the matrix \sum_i{x_i x_i^T}, where the x_i are the points on that sphere...
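A minimal numerical sketch of that claim (the dimensions and names here are my own, assuming unit-norm points x_i): build the scatter matrix, take the top-k eigenvectors, and check that the resulting flat beats random k-flats on the sum of squared orthogonal distances.

```python
import numpy as np

rng = np.random.default_rng(0)

# P points on the unit sphere in R^n; fit a k-flat through the origin.
n, P, k = 5, 200, 2
X = rng.standard_normal((P, n))
X /= np.linalg.norm(X, axis=1, keepdims=True)  # project onto the unit sphere

# M = sum_i x_i x_i^T  (n-by-n scatter matrix)
M = X.T @ X

# Eigenvectors for the k largest eigenvalues span the optimal k-flat.
eigvals, eigvecs = np.linalg.eigh(M)   # eigenvalues in ascending order
V = eigvecs[:, -k:]                    # n-by-k orthonormal basis of the flat

def ssq_orthogonal_dist(X, B):
    """Sum of squared orthogonal distances from rows of X to span(B)."""
    proj = X @ B @ B.T                 # orthogonal projection onto the flat
    return np.sum((X - proj) ** 2)

best = ssq_orthogonal_dist(X, V)

# Sanity check: the PCA flat is no worse than 100 random k-flats.
for _ in range(100):
    Q, _ = np.linalg.qr(rng.standard_normal((n, k)))
    assert best <= ssq_orthogonal_dist(X, Q) + 1e-9
```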

if you really want the proof, I guess I could take some time later to write it down for you or just meet me at UBC some day...

Nice exercise, though... I guess the geodesic distance would be much more complicated...

cheers,
Ives

gustavolacerda October 16 2009, 02:03:23 UTC
Thanks for working on this. The geodesic distance is easy to write down, and probably convex.

anonymous October 16 2009, 02:15:53 UTC
You're welcome... Regarding the geodesic distance: it's simple to write down, but optimizing it, I'd guess, wouldn't be so simple... Regarding convexity, I don't know what you could do, since the sphere is not convex. I have some ideas to circumvent this, but I'm not sure how useful they would be...

Ives

gustavolacerda October 16 2009, 02:20:17 UTC
the sphere isn't convex??

anonymous October 16 2009, 02:32:45 UTC
Nope... but the closed ball is... I mean, a convex subset of Euclidean space...
Maybe we're using the same name for different things, so let me make it clear:
for me, the sphere is the set of x such that ||x|| = 1, and the closed ball is the set of x such that ||x|| <= 1.
As a subset of the Euclidean vector space (inheriting the usual addition and scalar multiplication), the sphere is not convex, but the closed ball is...
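A tiny numerical illustration of the distinction (my own example): the midpoint of two sphere points, i.e. the convex combination with t = 1/2, leaves the sphere but stays inside the closed ball.

```python
import numpy as np

# Two points on the unit sphere in R^3.
x = np.array([1.0, 0.0, 0.0])
y = np.array([0.0, 1.0, 0.0])

mid = (x + y) / 2  # convex combination with t = 1/2

# ||mid|| = 1/sqrt(2) < 1: not on the sphere (so the sphere isn't convex),
# but still in the closed ball (which is convex).
assert np.isclose(np.linalg.norm(mid), 1 / np.sqrt(2))
assert np.linalg.norm(mid) < 1.0
```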
are you ok with that?

cheers,
Ives

gustavolacerda October 16 2009, 04:08:45 UTC
ah, right.

gustavolacerda October 16 2009, 04:07:59 UTC
This square matrix, is it n-by-n (where n is the dimension of the Euclidean space), or P-by-P (where P is the number of points)?
I think the former. When you take all n eigenvectors, the flat is the whole space, so you end up with the original hypersphere.

I now understand why PCA is the solution here.
