breaking the coherence barrier: asymptotic incoherence and ...sampling strategy successfully...
TRANSCRIPT
Breaking the coherence barrier: asymptoticincoherence and asymptotic sparsity in compressed
sensingBen Adcock
Department of MathematicsPurdue University
Anders C. HansenDAMTP
University of Cambridge
Clarice PoonDAMTP
University of Cambridge
Bogdan RomanComputer Laboratory
University of Cambridge
AbstractāWe introduce a mathematical framework thatbridges a substantial gap between compressed sensing theory andits current use in real-world applications. Although completelygeneral, one of the principal applications for our framework isthe Magnetic Resonance Imaging (MRI) problem. Our theoryprovides a comprehensive explanation for the abundance ofnumerical evidence demonstrating the advantage of so-calledvariable density sampling strategies in compressive MRI. Be-sides this, another important conclusion of our theory is thatthe success of compressed sensing is resolution dependent. Atlow resolutions, there is little advantage over classical linearreconstruction. However, the situation changes dramatically oncethe resolution is increased, in which case compressed sensing canand will offer substantial benefits.
I. INTRODUCTION
In this paper we present a new mathematical frameworkfor overcoming the so-called coherence barrier in compressedsensing1. Our framework generalizes the three traditionalpillars of compressed sensingānamely, sparsity, incoherenceand uniform random subsamplingāto three new concepts:asymptotic sparsity, asymptotic incoherence and multilevelrandom subsampling. As we explain, asymptotic sparsityand asymptotic incoherence are more representative of real-world problemsāe.g. imagingāthan the usual assumptionsof sparsity and incoherence. For instance, problems in MRIare both asymptotically sparse and asymptotically incoherent,and hence amenable to our framework.
The second important contribution of the paper is an anal-ysis of a novel and intriguing effect that occurs in asymptoti-cally sparse and asymptotically incoherent problems. Namely,the success of compressed sensing is resolution dependent.
As suggested by their names, asymptotic incoherence andasymptotic sparsity are only truly witnessed for reasonablylarge problem sizes. When the problem size is small, thereis little to be gained from compressed sensing over classicallinear reconstruction techniques. However, once the resolutionof the problem is sufficiently large, compressed sensing canand will offer a substantial advantage.
The phenomenon, which we call resolution dependence, hastwo important consequences for practitioners seeking to use
1This paper is part of a larger project on subsampling in applications.Further details, as well as codes and numerical examples, can be found onthe project website http://subsample.org.
compressed sensing in applications:(i) Suppose one considers a compressed sensing experiment
where the sampling device, the object to be recovered, thesampling strategy and subsampling percentage are all fixed,but the resolution is allowed to vary. Resolution dependencemeans that a compressed sensing reconstruction done at highresolutions (e.g. 2048Ć 2048) will yield much higher qualitywhen compared to full sampling than one done at a lowresolution (e.g. 256Ć 256). Hence a practitioner carrying outan experiment at low resolution may well conclude that com-pressed sensing imparts limited benefits. However, a markedlydifferent conclusion would be reached if the same experimentwere to be performed at higher resolution.
(ii) Suppose we conduct a similar experiment, but wenow use the same total number of samples N (instead ofthe same percentage) at low resolution as we take at highresolution. Intriguingly, the above result still holds: namely,the higher resolution reconstruction will yield substantiallybetter results. This is true because the multilevel randomsampling strategy successfully exploits asymptotic sparsity andasymptotic incoherence. Thus, with the same amount of totaleffort, i.e. the number of measurements, compressed sensingwith multilevel sampling works as a resolution enhancer: itallows one to recover the fine details of an image in a waythat is not possible with the lower resolution reconstruction.
On a broader note, resolution dependence and its conse-quences suggest the following advisory: it is critical that sim-ulations with compressed sensing be carried out with a carefulunderstanding of the influence of the problem resolution.Naıve simulations with standard, low-resolution test imagesmay very well lead to incorrect conclusions about the efficacyof compressed sensing as a tool for image reconstruction.
An important application of our work is the problem ofMRI. This served as one of the original motivations forcompressed sensing, and continues to be a topic of substantialresearch. Some of the earliest work on this problemāinparticular, the research of Lustig et al. [1]ā[3]ādemonstratedthat, due to high coherence, the standard random samplingstrategies of compressed sensing theory lead to substandardreconstructions. Conversely, random sampling according tosome nonuniform density was shown empirically to leadto substantially improved reconstruction quality. It is now
standard in MR applications to use a variable density strategyto overcome the coherence barrier. See [2]ā[5].
This work has culminated in the extremely successful ap-plication of compressed sensing to MRI. However, a mathe-matical theory addressing these sampling strategies is largelylacking. Despite some recent work of Krahmer & Ward [6],a substantial gap remains between the standard theorems ofcompressed sensing and its implementation in such problems.Our framework aims to bridge precisely this gap. In particular,our theorems provide a mathematical foundation for com-pressed sensing for coherent problems, and gives credenceto the above empirical studies demonstrating the success ofnonuniform density sampling.
Whilst the MR problem will serve as our main application,we stress that our theory is extremely gen2āeral in that itholds for almost arbitrary sampling and sparsity systems. Inparticular, standard compressed sensing results, such as thoseof Candes, Romberg & Tao [7], Candes & Plan [8], are specificinstances of our main results.
For brevity, we shall provide only the most salient aspects ofour framework. A more detailed discussion and analysis, con-taining proofs, further discussion and numerical experimentscan be found in the paper [9] and the website [10].
II. BACKGROUND
A. Compressed sensingA typical setup in compressed sensing is as follows. Let
ĻjNj=1 and ĻjNj=1 be two orthonormal bases of CN , thesampling and sparsity bases respectively, and write
U = (uij)Ni,j=1 ā CNĆN , uij = ćĻj , Ļić.
Note that U is an isometry. The coherence of U is given by
Āµ(U) = maxi,j=1,...,N
|uij |2 ā [Nā1, 1]. (1)
We say that U is perfectly incoherent if Āµ(U) = Nā1.Let f ā CN be s-sparse in the basis ĻjNj=1. In other
words, f =āNj=1 xjĻj , and the vector x = (xj)
Nj=1 ā CN
satisfies |supp(x)| ā¤ s, where
supp(x) = j : xj 6= 0.
Suppose now we have access to the samples
fj = ćf, Ļjć, j = 1, . . . , N,
and let Ī© ā 1, . . . , N be of cardinality m and chosenuniformly at random. According to a result of Candes &Plan [8] and Adcock & Hansen [11], f can be recoveredexactly with probability exceeding 1 ā Īµ from the subset ofmeasurements fj : j ā Ī©, provided
m & Āµ(U) Ā·N Ā· s Ā·(1 + log(Īµā1)
)Ā· logN. (2)
In practice, recovery is achieved by solving the convex opti-mization problem:
minĪ·āCN
āĪ·āl1 subject to PĪ©UĪ· = PĪ©f , (3)
where f = (f1, . . . , fN )>, and PĪ© ā CNĆN is the diagonalprojection matrix with jth entry 1 if j ā Ī© and zero otherwise.
B. The coherence barrier
The key estimate (2) shows that the number of measure-ments m required is, up to a log factor, on the order of thesparsity s, provided the coherence Āµ(U) = O
(Nā1
). This is
the case, for example, when U is the DFT matrix; a problemwhich was studied in some of the first papers on compressedsensing [7] (this example is actually perfectly incoherent).
On the other hand, when Āµ(U) large, one cannot expectto reconstruct an s-sparse vector f from highly subsampledmeasurements, regardless of the recovery algorithm employed[8]. We refer to this as the coherence barrier.
The MRI problem gives an important instance of this barrier.If ĻjNj=1 is a discrete wavelet and ĻjNj=1 corresponds tothe tows of the NĆN discrete Fourier transform (DFT) matrix,then the matrix U = DFT Ā· DWTā1 satisfies Āµ(U) = O (1)for any N [6], [12]. Hence, although signals and images aretypically sparse in wavelet bases, they cannot be recoveredfrom highly subsampled measurements using the standardcompressed sensing algorithm.
III. NEW CONCEPTS
In order to overcome the coherence barrier, we require threenew concepts.
A. Asymptotic incoherence
Consider the above example. It is known that, whilst theglobal coherence Āµ(U) is O (1), the coherence decreases aseither the Fourier frequency or wavelet scale increases. Werefer to this property as asymptotic incoherence:
Definition 1. Let U ā CNĆN be an isometry. Then U isasymptotically incoherent if
limK,NāāK<N
Āµ(Pā„KU) = limK,NāāK<N
Āµ(UPā„K ) = 0, (4)
where Pā„K : CNĆN is the projection matrix corresponding tothe index set K + 1, . . . , N.
Note that, for the wavelet example discussed above, one hasĀµ(Pā„KU), Āµ(UPā„K ) = O
(Kā1
)[12].
B. Multilevel sampling
Asymptotic incoherence suggests a different subsamplingstrategy should be used instead of standard random sampling.High coherence in the first few rows of U means that importantinformation about the signal to be recovered may well becontained in its corresponding low frequency measurements.Hence to ensure good recovery we should fully sample theserows. Once outside of this region, when the coherence startsto decrease, we can begin to subsample. Thus, instead of sam-pling uniform at random, we consider the following multilevelrandom sampling scheme:
Definition 2. Let r ā N, N = (N1, . . . , Nr) ā Nr with 1 ā¤N1 < . . . < Nr, m = (m1, . . . ,mr) ā Nr, with mk ā¤Nk āNkā1, k = 1, . . . , r, and suppose that
Ī©k ā Nkā1 + 1, . . . , Nk, |Ī©k| = mk, k = 1, . . . , r,
are chosen uniformly at random, where N0 = 0. We refer tothe set Ī© = Ī©N,m := Ī©1 āŖ . . . āŖ Ī©r as an (N,m)-multilevelsampling scheme.
Note that similar sampling strategies are found in mostempirical studies on compressive MRI [2]ā[5]. Closely relatedstrategies were also considered in [12], as well as in [13].
C. Asymptotic sparsity in levelsIn the case of perfect incoherence, the standard random
sampling strategies of compressed sensing are ideally suitedfor sparse signals. However, in asymptotically incoherentsetting, the notion of sparsity can be substantially relaxed.
To explain this, let x = (xj)Nj=1 be vector of coefficients of a
signal f in the basis ĻjjāN. Suppose that x was very sparsein its entries j = 1, . . . ,M1. Since the matrix U is highlycoherent in its corresponding rows, there is no way we canexploit this sparsity to achieve subsampling. High coherenceforces us to sample fully the first M1 rows, otherwise we runthe risking of missing critical information about x.
This means that there is nothing to be gained from highsparsity of x in its first few entries. However, we can expect toachieve significant subsampling if x is asymptotically sparse:
Definition 3. For r ā N let M = (M1, . . . ,Mr) ā Nr with1 ā¤ M1 < . . . < Mr and s = (s1, . . . , sr) ā Nr, withsk ā¤Mk āMkā1, k = 1, . . . , r, where M0 = 0. We say thatx ā l2(N) is (s,M)-sparse if, for each k = 1, . . . , r,
āk := supp(x) ā© Mkā1 + 1, . . . ,Mk,
satisfies |āk| ā¤ sk.
As we next explain, signals possessing this sparsitypatternāwhich we henceforth refer to as being asymptoticallysparse in levelsāare ideally suited to multilevel samplingschemes. Roughly speaking, the number of measurements mk
required in each band Ī©k is determined by the sparsity of fin the corresponding band āk and the asymptotic coherence.
This naturally leads to the question: which types of imagesare asymptotically sparse? The answer is that most imagespossess exactly this sparsity structure. Natural, real-life imagesare asymptotically sparsity in wavelet bases. At coarse scales,most images are not sparse, but sparsity rapidly increases withrefinement.
IV. MAIN RESULTS
For brevity, we consider only the case of a two-levelsampling strategy. The multilevel case is described in [9].
Write ĀµK = Āµ(Pā„KU). We now have:
Theorem 4. Let U ā CNĆN be an isometry and x ā CN be(s,M)-sparse, where r = 2, s = (M1, s2), s = M1 + s2 andM = (M1,M2) with M2 = N . Suppose that
āPā„N1UPM1
ā ā¤ Ī³āM1
, (5)
for some 1 ā¤ N1 ā¤ N and Ī³ ā (0, 2/5], and that Ī³ ā¤s2āĀµN1 . For Īµ > 0, let m ā N satisfy
m & (N āN1) Ā· (log(sĪµā1) + 1) Ā· ĀµN1 Ā· s2 Ā· log (N) .
Let Ī© = Ī©N,m be a two-level sampling scheme, where N =(N1, N2) and m = (m1,m2) with N2 = N , m1 = N1 andm2 = m, and suppose that Ī¾ ā CN is a minimizer of (3),where f = Ux. Then, with probability exceeding 1 ā Īµ, Ī¾ isunique and Ī¾ = x.
Note that if f is not exactly sparse, and if the measurementsf = Ux+ z are corrupted by noise z satisfying āzā ā¤ Ī“, thenone can also prove that under essentially the same conditionsthe minimization
infĪ·āHāĪ·āl1 subject to āPĪ©UĪ· ā yā ā¤ Ī“. (6)
recovers f exactly, up to an error depending only on Ī“ and theerror of the best approximation Ļs,M(f) of x by an (s,M)-sparse vector. We refer to [9] for details.
A. Discussion
Theorem 4 demonstrates that asymptotic incoherence andtwo-level sampling overcomes the coherence barrier. To seethis, note the following:
(i) The condition āPā„N1UPM1
ā ā¤ 25āM1
(which is alwayssatisfied for some N1, since U is an isometry) implies thatfully sampling the first N1 measurements allows one to recoverthe first M1 coefficients of f .
(ii) To recover the remaining s2 coefficients we require, upto log factors, an additional
m2 & (N āN1) Ā· ĀµN1 Ā· s2,
measurements, taken randomly from the range M1 +1, . . . ,M2. In particular, if N1 is a fixed fraction of N ,and if ĀµN1 = O
(Nā1
1
), such as for wavelets with Fourier
measurements, then one requires only m2 & s2 additionalmeasurements to recover the sparse part of the signal.
(iii) When f is asymptotically sparse, such is the case fornatural images, then the relative size of s2 will become smalleras M and N grow. In particular, the percentage
(N1+m2
N
)Ć100
of measurements required will decrease. Hence the subsam-pling rate possible will improve as the problem resolutionbecomes larger (see Section V).
We remark that it is not necessary to know the sparsitystructure, i.e. the values s and M, of the image f in orderto implement the multilevel sampling technique. Given amultilevel scheme Ī© = Ī©N,m, the result of [9] governingasymptotically compressible signals demonstrates that f willbe recovered exactly up to an error on the order of Ļs,M(f),where s and M are determined implicitly by N, m and theconditions of the theorem. Of course, some a priori knowledgeof s and M will greatly assist in selecting the parameters Nand m so as to get the best recovery results. However, this isnot necessary for implementing the method.
V. RESOLUTION DEPENDENCE AND NUMERICAL RESULTS
As explained, natural, real life images are not sparse atcoarse wavelet scales, nor is there substantial asymptoticincoherence. Hence, regardless of how we choose to recover f ,there is little possibility for substantial subsampling. On the
128 256 512 1024 2048 40960
10
20
30
40
50
60
Min
sub
āsa
mpl
ing
% fo
r su
cces
s
Resolution level
Fig. 1. The minimum subsampling percentage p.
other hand, asymptotic incoherence and asymptotic sparsityboth kick in when the resolution increases. Multilevel sam-pling allows us to exploit these properties, and by doing sowe achieve far greater subsampling.
To illustrate this, consider the reconstruction of the 1Dimage f(t) = eātĻ[0.2,0.8](t), t ā [0, 1], from its Fouriersamples using orthonormal Haar wavelets. We use a two-levelscheme with p/2% fixed samples and p/2% random samples,where p is the total subsampling percentage, and search for thesmallest value of p such that the two-level sampling schemesucceeds: namely, it gives an error than that obtained by takingall possible samples of f .
In Figure 1 we plot p against the resolution level N .The difference between low resolution (N = 128) and highresolution (N = 4096) is clear and dramatic: the success of thereconstruction is highly resolution dependent. For the formerwe require nearly 60% of the samples, whereas with the latterthis figure is reduced to less than 10%.
Now consider a different experiment, where the total num-ber of measurements is fixed and equal to 5122 = 262144,but the sampling pattern is allowed to vary. Figure 2 displaysthe image to be recovered, which is the analytic phantom in-troduced by GuerquināKern, Lejeune, Pruessmann and Unserin [14]. For the purposes of comparison, artificial fine detailswere added to the image. In Figure 3 we display a segmentof the reconstruction. As is clear, compressed sensing withmultilevel sampling acts a resolution enhancer. By samplinghigher in the Fourier spectrum, one recovers a far better imagewhilst taking the same amount of measurements.
For further numerical examples, as well as a comprehensivediscussion of the multilevel sampling strategy used in Figure2, we refer to [9], [10].
REFERENCES
[1] M. Lustig, āSparse MRI,ā Ph.D. dissertation, Stanford University, 2008.[2] M. Lustig, D. L. Donoho, and J. M. Pauly, āSparse MRI: the application
of compressed sensing for rapid MRI imaging,ā Magn. Reson. Imaging,vol. 58, no. 6, pp. 1182ā1195, 2007.
[3] M. Lustig, D. L. Donoho, J. M. Santos, and J. M. Pauly, āCompressedSensing MRI,ā IEEE Signal Process. Mag., vol. 25, no. 2, pp. 72ā82,March 2008.
[4] G. Puy, J. P. Marques, R. Gruetter, J. Thiran, D. Van De Ville,P. Vandergheynst, and Y. Wiaux, āSpread spectrum Magnetic ResonanceImaging,ā IEEE Trans. Med. Imaging, vol. 31, no. 3, pp. 586ā598, 2012.
Fig. 2. The GPLU phantom.
Fig. 3. The reconstruction of the 2048 Ć 2048 GPLU phantom from5122 Fourier samples. Top: linear reconstruction using the first 5122 Fouriersamples and zero padding elsewhere. Bottom: multilevel compressed sensingreconstruction.
[5] G. Puy, P. Vandergheynst, and Y. Wiaux, āOn variable density com-pressive sampling,ā IEEE Signal Process. Letters, vol. 18, pp. 595ā598,2011.
[6] F. Krahmer and R. Ward, āCompressive imaging: stable and robustrecovery from variable density frequency samples,ā Preprint, 2012.
[7] E. J. Candes, J. Romberg, and T. Tao, āRobust uncertainty principles:exact signal reconstruction from highly incomplete frequency informa-tion,ā IEEE Trans. Inform. Theory, vol. 52, no. 2, pp. 489ā509, 2006.
[8] E. J. Candes and Y. Plan, āA probabilistic and RIPless theory ofcompressed sensing,ā IEEE Trans. Inform. Theory, vol. 57, no. 11, pp.7235ā7254, 2011.
[9] B. Adcock, A. C. Hansen, C. Poon, and B. Roman, āBreaking thecoherence barrier: asymptotic incoherence and asymptotic sparsity incompressed sensing,ā Arxiv 1302.0561, 2013.
[10] (2013) http://subsample.org.[11] B. Adcock and A. C. Hansen, āGeneralized sampling and infinite-
dimensional compressed sensing,ā Technical report NA2011/02, DAMTP,University of Cambridge, 2011.
[12] E. J. Candes and J. Romberg, āSparsity and incoherence in compressivesampling,ā Inverse Problems, vol. 23, no. 3, pp. 969ā985, 2007.
[13] V. Studer, J. Bobin, M. Chahid, H. Moussavi, E. Candes, and M. Dahan,āCompressive fluorescence microscopy for biological and hyperspectralimaging,ā Submitted, 2011.
[14] M. Guerquin-Kern, L. Lejeune, K. P. Pruessmann, and M. Unser, āRe-alistic analytical phantoms for parallel Magnetic Resonance Imaging,āIEEE Trans. Med. Imaging, vol. 31, no. 3, pp. 626ā636, 2012.