distributed cur decomposition for...

DistributedCURDecompositionforBi-Clustering

StephenKline,KevinShawJune1,2016

{sakline,keshaw}@stanford.eduStanfordUniversity,CME323FinalProject

CURasalternativetoSVD– e.g.Biclustering

CompleteratingsforselectUsers

CompleteratingsforselectMovies

Movie Ratingssparse /huge

SVD– accuratebutheavy,Less interpretable(rotatedspace)

CUR– less accuratebutlight,Moreinterpretable*

*Asarchetypal usersandmovies

Movies

User-Movie Biclusters

Biclustering wasoriginally developedinthecontextofDNAmicroarrays

Biclustering alsohaspotential inotherareasandhas addedinterpretability

Source:SourceCodeforBiologyMedicine(April2013)- "Thenon-negativematrixfactorizationtoolboxforbiologicaldatamining"

ReviewofSVD:A=U∑VT

• PRO- Highaccuracyo ksingular values/vectors produce thebestk-rank

approximation toA

• CON - Highcomputation/spacerequirementso Inour biclustering applicationwithMovieLens

data,thedistributed SVDis “roughlysquare” -ARPACK (vs.“tallandskinny” – ATAtrick)

dense /fulldense /full

Asparse /huge

densebig

dense /bigsparse/small

BackgroundonA=CUR

• CURtradesaccuracy…... forcomputation/spacesavings

• C/R =cols/rows fromA• U =pseudo-inverse ofW

(intersectionof CandR)

sparsebig

sparse /bigdense/small

Asparse /huge

Pinv( ) Intersectionof CandR(callitW,verysmall)

• Col/RowSelect() algsamplesw/replacement (allowsduplicates)

• Pinv(W) calculatedviaSVD(W)

• Accuracybetterforlargedatasets

DesignDecisionsforDistributedCUR

sparsebig

sparse /bigdense/small

Asparse /huge

KeyDesignDecision:Distributetwoinstances ofAavoidingfutureall-to-allcommunications

Only necessary tostoreCandRassetofindices intoACompute Ulocally

Therearemultiple variations ofCUR.Weselected thealgorithmaspresented in:Drineas, et.al.,2006."FastMonteCarloAlgorithms forMatricesIII"which(forexample)does notremoveduplicatecols/rows assomeothers do.

Serialvs.DistributedCUR- Asymptotics

• BuildCandR:o Generateprobabilities– O(mn)o CreateCmatrix– O(mk)o CreateRmatrix– O(nk)

• ConstructU• ComputeCTC– O(mk2)• SVDofCTC– O(k3)• ComputeAandB– O(k3)• U=ABT – O(k3)

• BuildCandR:• Generateprobabilities– O(mn +p)cost,O(maxdense)time

o Create2RDDsbyRow/Colpartition– O(mn)cost,AtoAo Bothinstances:reducetoRow/Colsums—

O(maxdense)time,nocommunicationo Oneinstance:reduceRowsumtototal– O(p)cost,O(logp)timeo Broadcasttotaltocalculateprobs – O(p)cost,O(logp)time

• CreateC/Rmatriceso Locallysamplekrows/cols– O(k)o BroadcastsampletoRDDs– O(pk)cost,O(klogp)time

• ConstructU• SameasSerial(lessopportunitytodistribute)

Serial Distributed (communication cost andcomputation time)

Biclustering:DistributedCURvsSVD- Empirics

distributed cur decomposition for...

Documents

a stereoscopic look into the bulk · 2016. 9. 29. ·...

s16 operator manual - tennant co

9702 s16 ms_all

new large-scale matrix factorization: dsgd in...

s16 briefing for teams

introduction to distributed...

neural networks for video frame...

o'neill wetsuits s16 catalogue

stanford optical society · secretary tiffany huang,...

eld s16 bro - eldridgestreet.org

pregel and graphx - stanford...

kranonit s16 (python). sergey burma

exposure draft - nasba...s16 – 03 : monitoring mechanism...

superserver 1029ux-ll1-s16/-ll2-s16/-ll3-s16 quick ... ·...

graphic design examples s16

meyers s16

u2 5to integrados s16

kucharska s16 2_aug_2g14

8004 s16 er 11

s16-03 equipos virtuosos