Fast Matrix Multiplication = Calculating Tensor Ranknirkhe/talks/ma10presentation2.pdfIntroduction...

Introduction to the ProblemStrassen’s Algorithm

Intro. to Tensor RankBounding ω with Tensor Rank

Fast Matrix Multiplication = Calculating TensorRank

Caltech Math 10 Presentation

Chinmay Nirkhe

November 15th, 2016

Chinmay Nirkhe Fast Matrix Multiplication

Statement of the Problem

Question

Given two matrices A,B ∈ Cn×n, what is the algorithmiccomplexity of calculating C = AB?

Cij =∑k

AikBkj

Assume that arithmetic operations (i.e. +,×) of two values in C isconstant time (O(1)).

Big-O Notation

Definition (Big-O)

For functions f , g : N→ N, we say f ∈ O(g) (or f = O(g)) if∃ c > 0 s.t. f (n) ≤ cg(n) for all but finitely many n.

A naıve O(n3) algorithm

Calculating each Cij can be done in O(n) computations by

Cij =∑k

AikBkj

n2 such values Cij =⇒ n2 · O(n) = O(n3).

Definition of ω

Definition (Coefficient of Matrix Multiplication)

We define the coefficient of matrix multiplication ω as

ω = inf{α s.t. ∀ε > 0, ∃ alg. of complexity O(nα+ε)

Remark (Trivial Bounds)

2 ≤ ω ≤ 3.

We saw an O(n3) algorithm. C has n2 entries so it takes at leastO(n2) computations to just write C .

Definition of ω

Definition (Coefficient of Matrix Multiplication)

We define the coefficient of matrix multiplication ω as

ω = inf{α s.t. ∀ε > 0, ∃ alg. of complexity O(nα+ε)

Remark (Trivial Bounds)

2 ≤ ω ≤ 3.

We saw an O(n3) algorithm. C has n2 entries so it takes at leastO(n2) computations to just write C .

Goal revisited

The goal is to (ideally) show ω = 2. The latest result byCoppersmith-Winnograd is ω ≤ 2.3728639.

How do we improve ω?

Start by considering A,B ∈ C2×2. Then,C11 = A11B11 + A12B21

C12 = A11B12 + A12B22

C21 = A21B11 + A22B21

C22 = A21B12 + A22B22

Clearly there are 8 multiplications terms here. We are going toreduce this to 7 multiplications.

7 multiplications

M1 = (A11 + A22)(B11 + B22)

M2 = (A21 + A22)B11

M3 = A11(B12 − B22)

M4 = A22(B21 − B11)

M5 = (A11 + A12)B22

M6 = (A21 − A11)(B11 + B12)

M7 = (A12 − A22)(B21 + B22)

C11 = M1 + M4 −M5 + M7

C12 = M3 + M5

C21 = M2 + M4

C22 = M1 −M2 + M3 + M6

A bound on ω due to 7 multiplications and recursion

Consider matrices A,B ∈ C2m×2m ,m ∈ N. Write A =

(A11 A12

A21 A22

)for Aij ∈ C2m−1×2m−1

and B similarly. Use the 7 multiplication trickto recursively multiply A and B.

Let f (m) be the complexity of multiplying A,B ∈ C2m×2m .

f (m) = 7f (m − 1) + c0(2m × 2m)

f (m) = O(7m)

(A11 A12

A21 A22

f (m) = 7f (m − 1) + c0(2m × 2m)

f (m) = O(7m)

(A11 A12

A21 A22

f (m) = 7f (m − 1) + c0(2m × 2m)

f (m) = O(7m)

Let N = 2m, then f (m) = O(N log2 7).

Theorem (Strassen’s)

ω ≤ log2 7 ≈ 2.8074

Can we multiply C2×2 with < 7 multiplications?

Can we find a simpler way to multiply a 3× 3 matrix using kmultiplications s.t. log3 k < log2 7?

The Matrix Multiplication Tensor

Definition (The Matrix Multiplication Tensor)

For fixed n, the matrix multiplication tensor T ∈ Cnm×mp×pn

defined byTij ′,jk ′,ki ′ = 1{i=i ′,j=j ′,k=k ′}

for i , i ′, j , j ′, k , k ′ ∈ {1, . . . , n}.

Notation

The tensor is often notated as 〈n,m, p〉.

M.M. Tensor of 〈2, 2, 2〉

Strassen’s equations

M1 = (A11 + A22)(B11 + B22)

M2 = (A21 + A22)B11

M3 = A11(B12 − B22)

M4 = A22(B21 − B11)

M5 = (A11 + A12)B22

M6 = (A21 − A11)(B11 + B12)

M7 = (A12 − A22)(B21 + B22)

Rank of a Tensor

Definition (Rank 1 Tensor)

A tensor T ∈ Cnm×mp×np is a rank 1 tensor if it can be expressedas T = u ⊗ v ⊗ w where u ∈ Cnm, v ∈ Cmp,w ∈ Cnp. i.e.

Tijk = uivjwk

Definition (Tensor Rank)

For an arbitrary tensor T , the rank of T (not. R(T )) is theminimum number of rank 1 tensors summing to T .

Rank of a Tensor

Definition (Rank 1 Tensor)

A tensor T ∈ Cnm×mp×np is a rank 1 tensor if it can be expressedas T = u ⊗ v ⊗ w where u ∈ Cnm, v ∈ Cmp,w ∈ Cnp. i.e.

Tijk = uivjwk

Definition (Tensor Rank)

For an arbitrary tensor T , the rank of T (not. R(T )) is theminimum number of rank 1 tensors summing to T .

R(〈2, 2, 2〉) ≤ 7.

T1 = (1, 0, 0, 1)⊗ (1, 0, 0, 1)⊗ (1, 0, 0, 1)

T2 = (0, 0, 1, 1)⊗ (1, 0, 0, 0)⊗ (0, 0, 1,−1)

T3 = (1, 0, 0, 0)⊗ (0, 1, 0,−1)⊗ (0, 1, 0, 1)

T4 = (0, 0, 0, 1)⊗ (−1, 0, 1, 0)⊗ (1, 0, 1, 0)

T5 = (1, 1, 0, 0)⊗ (0, 0, 0, 1)⊗ (−1, 1, 0, 0)

T6 = (−1, 0, 1, 0)⊗ (1, 1, 0, 0)⊗ (0, 0, 0, 1)

T7 = (0, 1, 0,−1)⊗ (0, 0, 1, 1)⊗ (1, 0, 0, 0)

Strassen’s equations

M1 = (A11 + A22)(B11 + B22)

M2 = (A21 + A22)B11

M3 = A11(B12 − B22)

M4 = A22(B21 − B11)

M5 = (A11 + A12)B22

M6 = (A21 − A11)(B11 + B12)

M7 = (A12 − A22)(B21 + B22)

C11 = M1 + M4 −M5 + M7

C12 = M3 + M5

C21 = M2 + M4

C22 = M1 −M2 + M3 + M6

M.M. Tensor of 〈2, 2, 2〉

ω ≤ logn R(〈n, n, n〉)

Theorem

For any n > 2, if r = R(〈n, n, n〉) where 〈n, n, n〉 is the mat.mult.tensor for n × n matrices, then ω ≤ logn r

Let T =r∑`=1

u` ⊗ v` ⊗ w` be the rank decomposition of T . Divide

matrices A,B ∈ Cnm×nm into n × n equal parts A11, . . . ,Ann andB11, . . . ,Bnn. Compute (recursively)

n,n∑i ,j=1

u(ij)` Aij

n,n∑i ,j=1

v(ij)` Bij

r∑`=1

Theorem

Let T =r∑`=1

u` ⊗ v` ⊗ w` be the rank decomposition of T .

Divide

n,n∑i ,j=1

u(ij)` Aij

n,n∑i ,j=1

v(ij)` Bij

r∑`=1

Theorem

Let T =r∑`=1

matrices A,B ∈ Cnm×nm into n × n equal parts A11, . . . ,Ann andB11, . . . ,Bnn.

Compute (recursively)

n,n∑i ,j=1

u(ij)` Aij

n,n∑i ,j=1

v(ij)` Bij

r∑`=1

Theorem

Let T =r∑`=1

n,n∑i ,j=1

u(ij)` Aij

n,n∑i ,j=1

v(ij)` Bij

Cij =r∑`=1

Theorem

Let T =r∑`=1

n,n∑i ,j=1

u(ij)` Aij

n,n∑i ,j=1

v(ij)` Bij

r∑`=1

Calculation of M takes r multiplications of matrices of sizenm−1 × nm−1 + c0N

2 additions.

f (m) = rf (m − 1) + c0(nm × nm)

f (m) = O(rm) = O(n(logn r)m) =⇒ ω ≤ logm2

2 additions.

f (m) = rf (m − 1) + c0(nm × nm)

2 additions.

f (m) = rf (m − 1) + c0(nm × nm)

f (m) = O(rm) = O(n(logn r)m) =⇒

ω ≤ logm2

2 additions.

f (m) = rf (m − 1) + c0(nm × nm)

By the construction of the tensor,

Cab =∑

i ,i ′,j ,j ′

T(i ,j)(i ′,j ′)(a,b)AijBi ′j ′

=r∑`=1

∑i ,i ′,j ,j ′

T(`)(i ,j)(i ′,j ′)(a,b)AijBi ′j ′

=r∑`=1

w(ab)`

n,n∑i ,j=1

u(ij)` Aij

n,n∑i ,j=1

v(ij)` Bij

r∑`=1

w(ab)` M`

Cab =∑

i ,i ′,j ,j ′

=r∑`=1

∑i ,i ′,j ,j ′

T(`)(i ,j)(i ′,j ′)(a,b)AijBi ′j ′

=r∑`=1

w(ab)`

n,n∑i ,j=1

u(ij)` Aij

n,n∑i ,j=1

v(ij)` Bij

r∑`=1

w(ab)` M`

Cab =∑

i ,i ′,j ,j ′

=r∑`=1

∑i ,i ′,j ,j ′

T(`)(i ,j)(i ′,j ′)(a,b)AijBi ′j ′

=r∑`=1

w(ab)`

n,n∑i ,j=1

u(ij)` Aij

n,n∑i ,j=1

v(ij)` Bij

r∑`=1

w(ab)` M`

Does this help reduce ω bound?

Theorem

For n > 2, ω ≤ logn R(〈n, n, n〉).

The only other small n found for which this gives a stronger boundthan Strassen’s is n = 70, where R = 143, 640 so ω ≤ 2.80.

What about non cubic-tensors?

Does this help reduce ω bound?

Theorem

For n > 2, ω ≤ logn R(〈n, n, n〉).

The only other small n found for which this gives a stronger boundthan Strassen’s is n = 70, where R = 143, 640 so ω ≤ 2.80.

What about non cubic-tensors?

The Matrix Multiplication Tensor

Definition (The Matrix Multiplication Tensor)

For fixed n, the matrix multiplication tensor T ∈ Cnm×mp×pn

defined byTij ′,jk ′,ki ′ = 1{i=i ′,j=j ′,k=k ′}

for i , i ′, j , j ′, k , k ′ ∈ {1, . . . , n}.

Notation

The tensor is often notated as 〈n,m, p〉.

Bounding Non-Cubic Tensors

Lemma (Permutation invariant)

R(〈n,m, p〉) = R(〈π(n), π(m), π(p)〉) for any π ∈ S3.

Take the decomposition into rank 1 tensors and permute(u, v ,w) 7→ (π(u), π(v), π(w)) for each rank 1 tensor.

Lemma (Permutation invariant)

R(〈n,m, p〉) = R(〈π(n), π(m), π(p)〉) for any π ∈ S3.

Take the decomposition into rank 1 tensors and permute(u, v ,w) 7→ (π(u), π(v), π(w)) for each rank 1 tensor.

Lemma (Tensor Product Upper bound)

R(〈nn′,mm′, pp′〉) = R(〈n,m, p〉 ⊗ 〈n′,m′, p′〉)≤ R(〈n,m, p〉)R(〈n′,m′, p′〉).

Think of 〈nn′,mm′, pp′〉 as deconstruction into a tensor of tensors.

Let 〈n,m, p〉 =∑r

i=1 ui ⊗ vi ⊗ wi , 〈n′,m′, p′〉 =∑r ′

j=1 u′j ⊗ v ′j ⊗ w ′j .

〈nn′,mm′, pp′〉 =

r ,r ′∑i ,j=1

(ui ⊗ u′j)⊗ (vi ⊗ v ′j )⊗ (wi ⊗ w ′j )

i=1 ui ⊗ vi ⊗ wi , 〈n′,m′, p′〉 =∑r ′

j=1 u′j ⊗ v ′j ⊗ w ′j .

〈nn′,mm′, pp′〉 =

r ,r ′∑i ,j=1

(ui ⊗ u′j)⊗ (vi ⊗ v ′j )⊗ (wi ⊗ w ′j )

i=1 ui ⊗ vi ⊗ wi , 〈n′,m′, p′〉 =∑r ′

j=1 u′j ⊗ v ′j ⊗ w ′j .

〈nn′,mm′, pp′〉 =

r ,r ′∑i ,j=1

(ui ⊗ u′j)⊗ (vi ⊗ v ′j )⊗ (wi ⊗ w ′j )

General Tensor Bound

Theorem (General Bound)

ω ≤ 3 lognmp R(〈n,m, p〉)

R(〈nmp, nmp, nmp〉) ≤ R(〈n,m, p〉)R(〈m, p, n〉)R(〈p, n,m〉)≤ R(〈n,m, p〉)3

ω ≤ lognmp R(〈n,m, p〉)3 = 3 lognmp R(〈n,m, p〉)

The Quest for Tensor Rank and other methods

Computational bashing! People have tried to find tensors butnon-trivial in general and no serious improvements have beenachieved.

Border Rank

Let T (k) ∈ (C[εk ])nm×mp×np be a tensor such that as ε→ 0,T (k) = εkT + O(εk+1). Then, define R(T ) = mink R(T (k)).

Theorem

R(〈2, 2, 3) = 10 =⇒ ω ≤ 2.79

Border Rank

Let T (k) ∈ (C[εk ])nm×mp×np be a tensor such that as ε→ 0,T (k) = εkT + O(εk+1). Then, define R(T ) = mink R(T (k)).

Theorem

R(〈2, 2, 3) = 10 =⇒ ω ≤ 2.79

Group Theoretic Approach

Recently, the biggest push in this subject and the way thatCoppersmith and Winnograd achieved their lower bound was tofind the matrix multiplication tensor embedded within the Cayleytable of a group G and use properties of generators on the groupto bound ω. But for another day...

Fast Matrix Multiplication = Calculating Tensor Ranknirkhe/talks/ma10presentation2.pdfIntroduction...

Documents

Transcript of Fast Matrix Multiplication = Calculating Tensor Ranknirkhe/talks/ma10presentation2.pdfIntroduction...

Symmetry Energy, Tensor Force & Maximum … Energy, Tensor Force & Maximum Rotation of Neutron Stars ECT* EMMI Workshop “Neutron-rich matter & Neutron Stars” September 30 th –

Horndeski's Vector-Tensor Theory - UCMteorica.fis.ucm.es/iberiCOS2015/images/MThorsrud.pdf · Horndeski’s Vector-Tensor Theory Mikjel Thorsrud, University of Oslo March 30, 2015

BOUNDING AND NONBOUNDING MINIMAL PAIRS IN THEpmt6sbc/preprints/minimal.pdf · BOUNDING AND NONBOUNDING MINIMAL PAIRS IN THE ENUMERATION DEGREES S. BARRY COOPER, ANGSHENG LI, ANDREA

Low rank aproximation in traditional and novel tensor formats€¦ · Icanonical rank r ]DOF for a given tensor product basis - best N-term approximation (super adaptivity)! Ithere

Bounding sup-norms of cusp forms of large level

TENSOR SERIES PORTABLE PA TRE-1201 › Full_Range_Active_Speaker_Syst… · Description The Tensor TRE-1501 15-inch active loudspeaker enclosure offers 1400W of Class D power output

ON THE TENSOR SEMIGROUP OF AFFINE KAC-MOODY LIE …math.univ-lyon1.fr/~ressayre/PDFs/tensorconeKacMoody.pdf · TENSOR SEMIGROUP OF AFFINE KM LIE ALGEBRA 5 Contents 1. Introduction

A brief introduction into the Q-tensor theory of nematic ...users.uoa.gr › ~nalikako › Athens1.pdf · An introduction to the Q-tensor theory of nematic liquid crystals Defects:

Mining Large Time-evolving Graphs Using Matrix and Tensor ...christos/TALKS/SIGMOD-07-tutorial/tensor3.pdf · • Tensor tools •Cseudi sesta • Tensor Basics •Tucker – Tucker

Isospin dependent interactions in RMF & ρ NN tensor coupling

Ising model on random networks and canonical tensor modelyitpqip2014.ws/presentation/Sato.pdf · Ising model on random networks! and ! canonical tensor model Yuki Sato (Univ. of the

Tensor topology - homepages.inf.ed.ac.ukhomepages.inf.ed.ac.uk/cheunen/cvqt/slides/Enriquemoliner.pdf · Tensor topology Pau Enrique Moliner1 Chris Heunen 1 Sean Tull 2 1University

Appendix A Riemann Curvature Tensor - Springer978-3-319-01842-3/1.pdf · 244 Appendix A: Riemann Curvature Tensor components. Taking into account the symmetry under pairwise exchange

Tensor topologyconferences.inf.ed.ac.uk/clapscotland/cvqt/Enriquemoliner.pdf · Tensor topology Pau Enrique Moliner1 Chris Heunen 1 Sean Tull 2 1University of Edinburgh 2University

The Weyl Tensor - Universität zu Köln · I Algebraic equations for the traces of the Riemann Tensor ... Strong gravitational lensing for the photons coupled to Weyl tensor in ...

Tensors, ranks, and varieties. - areeweb.polito.it off/Kick off... · with vi 2Vi is called elementary tensor. 2/15 E. Carlini Tensors, ranks, and varieties. Tensor rank Note that

2.6 Tensor Analysis 133 - Panjab University

tensor-via-CRES - Physics Department at UMass AmherstProject‐8 technique Sept 27-30 2017 Tensor via cyclotron radiation Power from a single electron orbiting in a magnetic field

Bounding the photon mass with cosmological propagation of ...

Introduction to Deep Learning (Optimization …liqun/teaching/cs680_19f/dl2.pdfIntroduction to Deep Learning (Optimization Algorithms)!1 Topics SGD SGDM AdaGrad Adam AdaDelta RMSprop