Machine Learning, etc: Sampling doubly stochastic matrices

Monday, December 17, 2007

Sampling doubly stochastic matrices

Stochastic matrices are easy to get -- just normalize the rows. Doubly stochastic matrices require more work -- simply normalizing columns/rows ~~will not converge~~ may take few dozen iterations to converge. One approach that works is to do constrained optimization, finding closest (in least squared sense) doubly stochastic matrix to given matrix. Another approach is to start with a permutation matrix and follow a Markov chain that modifies 4 elements randomly at each step in a way that keeps the matrix doubly stochastic. Here is the implementation of these two methods in Mathematica (notebook in the same directory)

http://www.yaroslavvb.com/research/qr/doubly-stochastic/doubly-stochastic.html

As Jeremy pointed out, Birkhoff's theorem provides necessary and sufficient conditions for a matrix to be doubly stochastic -- it must be a convex combination of permutation matrices. It seems hard to sample from the set of valid convex combinations though. For instance, here's a set of 3 dimensional slices through the set of valid convex combinations for 3x3 matrices (there are 6 permutation matrices, so the space is 5 dimensional).

Update: here's the updated notebook with 2 more algorithms, suggested by Dr.Teh and Jeremy below. Turns out that sampling random convex combinations is fairly easy -- almost all the time uniformly sampled convex combination will produce a doubly-stochastic matrix. However, matrices end up looking pretty uniform
http://www.yaroslavvb.com/research/qr/doubly-stochastic/doubly-stochastic3.html

12 comments:

Jeremy9:26 PM
Your second approach reminds me of the following theorem (taken from Combinatorial Matrix Theory by Brualdi and Ryser):

A matrix A is doubly stochastic iff there exists permutation matrices P_1, ..., P_t and real numbers c_1, ... , c_t such that

A = c_1 P_1 + ... + c_t P_t

and

c_1 + ... + c_t = 1.
ReplyDelete
Replies
m29:50 PM
I have a question: why won't iteratively normalizing columns and rows work? Isn't it IPF? cheers, -yw
ReplyDelete
Replies
Yaroslav Bulatov12:24 AM
Jeremy -- thanks for the pointer. I also found it as Thm. 8.7.1 in Horn and Johnson

Dr Teh -- you are right, it's IPF, I made a mistake of running it for too few iterations, if I run it longer it tends to converge (within 10^-10 of doubly stochastic) after 30 iterations for 3x3 matrices, less for larger matrices
http://www.yaroslavvb.com/research/qr/doubly-stochastic/doubly-stochastic2.html
ReplyDelete
Replies
Yaroslav Bulatov5:34 PM
In fact, convergence of IPF to doubly-stochastic matrix was proven back in 1964
ReplyDelete
Replies
Yaroslav Bulatov5:35 PM
(convergence is guaranteed only if all matrix entries are strictly positive)
ReplyDelete
Replies
dfacto7:47 AM
Isn't this simpler and equivalent? In Matlab notation: [T,~]=qr(rand(n));T=T.^2;
ReplyDelete
Replies
draj11:25 PM

Excellent machine learning blog,thanks for sharing...
Seo Internship in Bangalore
Smo Internship in Bangalore
Digital Marketing Internship Program in Bangalore
ReplyDelete
Replies
john11:12 PM
Great Article
IEEE final year projects on machine learning

JavaScript Training in Chennai

Final Year Project Centers in Chennai

JavaScript Training in Chennai

ReplyDelete
Replies
john11:14 PM
Great Article
IEEE final year projects on machine learning

JavaScript Training in Chennai

Final Year Project Centers in Chennai

JavaScript Training in Chennai

ReplyDelete
Replies
카지노사이트 모음 해외 카지노사이트1:58 AM
Thankjob for this fantastic post
ReplyDelete
Replies
토토사이트 스포츠 토토사이트 메이저사이트2:12 AM
Keep up the great work, good ones are hard to find.
ReplyDelete
Replies
파워볼사이트 농구토토 하는법 농구 토토2:12 AM
It was certainly helpful. your website is very helpful.
ReplyDelete
Replies

Add comment