13
IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 15, NO. 10, OCTOBER 2006 3089 The Nonsubsampled Contourlet Transform: Theory, Design, and Applications Arthur L. da Cunha, Jianping Zhou, Member, IEEE, and Minh N. Do, Member, IEEE Abstract—In this paper, we develop the nonsubsampled contourlet transform (NSCT) and study its applications. The construction proposed in this paper is based on a nonsubsam- pled pyramid structure and nonsubsampled directional filter banks. The result is a flexible multiscale, multidirection, and shift- invariant image decomposition that can be efficiently implemented via the à trous algorithm. At the core of the proposed scheme is the nonseparable two-channel nonsubsampled filter bank (NSFB). We exploit the less stringent design condition of the NSFB to design filters that lead to a NSCT with better frequency selectivity and regularity when compared to the contourlet transform. We propose a design framework based on the mapping approach, that allows for a fast implementation based on a lifting or ladder structure, and only uses one-dimensional filtering in some cases. In addition, our design ensures that the corresponding frame elements are regular, symmetric, and the frame is close to a tight one. We assess the performance of the NSCT in image denoising and enhancement applications. In both applications the NSCT compares favorably to other existing methods in the literature. Index Terms—Contourlet transform, frames, image denoising, image enhancement, multidimensional filter banks, nonsubsam- pled filter banks. I. INTRODUCTION A NUMBER of image-processing tasks are efficiently carried out in the domain of an invertible linear transfor- mation. For example, image compression and denoising are efficiently done in the wavelet transform domain [1], [2]. An effective transform captures the essence of a given signal or a family of signals with few basis functions. The set of basis func- tions completely characterizes the transform and this set can be redundant or not, depending on whether the basis functions are linear dependent. By allowing redundancy, it is possible to enrich the set of basis functions so that the representation is more efficient in capturing some signal behavior. In addition, redundant representations are generally more flexible and easier to design. In applications such as denoising, enhancement, and Manuscript received May 20, 2005; revised November 16, 2005. This work was supported in part by a CAPES fellowship, Brazil, and in part by the U.S. National Science Foundation under Grant CCR-0237633 (CAREER). The as- sociate editor coordinating the review of this manuscript and approving it for publication was Prof. Phillipe Salembier. A. L. da Cunha is with the Department of Electrical and Computer Engi- neering and the Coordinated Science Laboratory, University of Illinois at Ur- bana-Champaign, Urbana, IL 61801 USA (e-mail: [email protected]). J. Zhou was with the Department of Electrical and Computer Engineering, University of Illinois at Urbana-Champaign, Urbana, IL 61801 USA. He is now with Texas Instruments, Austin, TX 78714 USA (e-mail: [email protected]). M. N. Do is with the Department of Electrical and Computer Engineering, the Coordinated Science Laboratory, and the Beckman Institute, University of Illinois at Urbana-Champaign, Urbana, IL 61801 USA (e-mail: minhdo@uiuc. edu). Digital Object Identifier 10.1109/TIP.2006.877507 contour detection, a redundant representation can significantly outperform a nonredundant one. Another important feature of a transform is its stability with respect to shifts of the input signal. The importance of the shift- invariance property in imaging applications dates back at least to Daugman [3] and was also advocated by Simoncelli et al. in [4]. An example that illustrates the importance of shift- invariance is image denoising by thresholding where the lack of shift-invariance causes pseudo-Gibbs phenomena around singularities [5]. Thus, most state-of-the-art wavelet denoising algorithms (see for example [6]–[8]) use an expansion with less shift sensitivity than the standard maximally decimated wavelet decomposition—the most common being the nonsub- sampled wavelet transform (NSWT) computed with the à trous algorithm [9]. 1 In addition to shift-invariance, it has been recognized that an efficient image representation has to account for the geometrical structure pervasive in natural scenes. In this direction, several representation schemes have recently been proposed [10]–[15]. The contourlet transform [14] is a multidirectional and mul- tiscale transform that is constructed by combining the Lapla- cian pyramid [16], [17] with the directional filter bank (DFB) proposed in [18]. The pyramidal filter bank structure of the contourlet transform has very little redundancy, which is im- portant for compression applications. However, designing good filters for the contourlet transfom is a difficult task. In addi- tion, due to downsamplers and upsamplers present in both the Laplacian pyramid and the DFB, the contourlet transform is not shift-invariant. In this paper, we propose an overcomplete transform that we call the nonsubsampled contourlet transform (NSCT). Our main motivation is to construct a flexible and efficient transform tar- geting applications where redundancy is not a major issue (e.g., denoising). The NSCT is a fully shift-invariant, multiscale, and multidirection expansion that has a fast implementation. The proposed construction leads to a filter-design problem that to the best of our knowledge has not been addressed elsewhere. The design problem is much less constrained than that of contourlets. This enables us to design filters with better frequency selec- tivity thereby achieving better subband decomposition. Using the mapping approach we provide a framework for filter design that ensures good frequency localization in addition to having a fast implementation through ladders steps. The NSCT has proven to be very efficient in image denoising and image en- hancement as we show in this paper. 1 Denoising by thresholding in the NSWT domain can also be realized by denoising multiple circular shifts of the signal with a critically sampled wavelet transform and then averaging the results. This has been termed cycle spinning after [5]. 1057-7149/$20.00 © 2006 IEEE

IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. …minhdo.ece.illinois.edu/publications/nsct.pdf · The Nonsubsampled Contourlet Transform: Theory, Design, and Applications ... based

  • Upload
    duongtu

  • View
    221

  • Download
    0

Embed Size (px)

Citation preview

Page 1: IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. …minhdo.ece.illinois.edu/publications/nsct.pdf · The Nonsubsampled Contourlet Transform: Theory, Design, and Applications ... based

IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 15, NO. 10, OCTOBER 2006 3089

The Nonsubsampled Contourlet Transform:Theory, Design, and Applications

Arthur L. da Cunha, Jianping Zhou, Member, IEEE, and Minh N. Do, Member, IEEE

Abstract—In this paper, we develop the nonsubsampledcontourlet transform (NSCT) and study its applications. Theconstruction proposed in this paper is based on a nonsubsam-pled pyramid structure and nonsubsampled directional filterbanks. The result is a flexible multiscale, multidirection, and shift-invariant image decomposition that can be efficiently implementedvia the à trous algorithm. At the core of the proposed scheme isthe nonseparable two-channel nonsubsampled filter bank (NSFB).We exploit the less stringent design condition of the NSFB todesign filters that lead to a NSCT with better frequency selectivityand regularity when compared to the contourlet transform. Wepropose a design framework based on the mapping approach,that allows for a fast implementation based on a lifting or ladderstructure, and only uses one-dimensional filtering in some cases.In addition, our design ensures that the corresponding frameelements are regular, symmetric, and the frame is close to a tightone. We assess the performance of the NSCT in image denoisingand enhancement applications. In both applications the NSCTcompares favorably to other existing methods in the literature.

Index Terms—Contourlet transform, frames, image denoising,image enhancement, multidimensional filter banks, nonsubsam-pled filter banks.

I. INTRODUCTION

ANUMBER of image-processing tasks are efficientlycarried out in the domain of an invertible linear transfor-

mation. For example, image compression and denoising areefficiently done in the wavelet transform domain [1], [2]. Aneffective transform captures the essence of a given signal or afamily of signals with few basis functions. The set of basis func-tions completely characterizes the transform and this set canbe redundant or not, depending on whether the basis functionsare linear dependent. By allowing redundancy, it is possible toenrich the set of basis functions so that the representation ismore efficient in capturing some signal behavior. In addition,redundant representations are generally more flexible and easierto design. In applications such as denoising, enhancement, and

Manuscript received May 20, 2005; revised November 16, 2005. This workwas supported in part by a CAPES fellowship, Brazil, and in part by the U.S.National Science Foundation under Grant CCR-0237633 (CAREER). The as-sociate editor coordinating the review of this manuscript and approving it forpublication was Prof. Phillipe Salembier.

A. L. da Cunha is with the Department of Electrical and Computer Engi-neering and the Coordinated Science Laboratory, University of Illinois at Ur-bana-Champaign, Urbana, IL 61801 USA (e-mail: [email protected]).

J. Zhou was with the Department of Electrical and Computer Engineering,University of Illinois at Urbana-Champaign, Urbana, IL 61801 USA. He is nowwith Texas Instruments, Austin, TX 78714 USA (e-mail: [email protected]).

M. N. Do is with the Department of Electrical and Computer Engineering,the Coordinated Science Laboratory, and the Beckman Institute, University ofIllinois at Urbana-Champaign, Urbana, IL 61801 USA (e-mail: [email protected]).

Digital Object Identifier 10.1109/TIP.2006.877507

contour detection, a redundant representation can significantlyoutperform a nonredundant one.

Another important feature of a transform is its stability withrespect to shifts of the input signal. The importance of the shift-invariance property in imaging applications dates back at leastto Daugman [3] and was also advocated by Simoncelli et al.in [4]. An example that illustrates the importance of shift-invariance is image denoising by thresholding where the lackof shift-invariance causes pseudo-Gibbs phenomena aroundsingularities [5]. Thus, most state-of-the-art wavelet denoisingalgorithms (see for example [6]–[8]) use an expansion withless shift sensitivity than the standard maximally decimatedwavelet decomposition—the most common being the nonsub-sampled wavelet transform (NSWT) computed with the à trousalgorithm [9].1

In addition to shift-invariance, it has been recognized that anefficient image representation has to account for the geometricalstructure pervasive in natural scenes. In this direction, severalrepresentation schemes have recently been proposed [10]–[15].The contourlet transform [14] is a multidirectional and mul-tiscale transform that is constructed by combining the Lapla-cian pyramid [16], [17] with the directional filter bank (DFB)proposed in [18]. The pyramidal filter bank structure of thecontourlet transform has very little redundancy, which is im-portant for compression applications. However, designing goodfilters for the contourlet transfom is a difficult task. In addi-tion, due to downsamplers and upsamplers present in both theLaplacian pyramid and the DFB, the contourlet transform is notshift-invariant.

In this paper, we propose an overcomplete transform that wecall the nonsubsampled contourlet transform (NSCT). Our mainmotivation is to construct a flexible and efficient transform tar-geting applications where redundancy is not a major issue (e.g.,denoising). The NSCT is a fully shift-invariant, multiscale, andmultidirection expansion that has a fast implementation. Theproposed construction leads to a filter-design problem that to thebest of our knowledge has not been addressed elsewhere. Thedesign problem is much less constrained than that of contourlets.This enables us to design filters with better frequency selec-tivity thereby achieving better subband decomposition. Usingthe mapping approach we provide a framework for filter designthat ensures good frequency localization in addition to havinga fast implementation through ladders steps. The NSCT hasproven to be very efficient in image denoising and image en-hancement as we show in this paper.

1Denoising by thresholding in the NSWT domain can also be realized bydenoising multiple circular shifts of the signal with a critically sampled wavelettransform and then averaging the results. This has been termed cycle spinningafter [5].

1057-7149/$20.00 © 2006 IEEE

Page 2: IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. …minhdo.ece.illinois.edu/publications/nsct.pdf · The Nonsubsampled Contourlet Transform: Theory, Design, and Applications ... based

3090 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 15, NO. 10, OCTOBER 2006

Fig. 1. Nonsubsampled contourlet transform. (a) NSFB structure that imple-ments the NSCT. (b) Idealized frequency partitioning obtained with the pro-posed structure.

The paper is structured as follows. In Section II, we describethe NSCT and its building blocks. We introduce a pyramid struc-ture that ensures the multiscale feature of the NSCT and the di-rectional filtering structure based on the DFB. The basic unit inour construction is the nonsubsampled filter bank (NSFB) whichis discussed in Section II. In Section III, we study the issues as-sociated with the NSFB design and implementation problems.Applications of the NSCT in image denoising and enhancementare discussed in Section IV. Concluding remarks are drawn inSection V.

Notation: Throughout this paper, a two-dimensional(2-D) filter is represented by its -transform where

. Evaluated on the unit sphere, a filter is denotedby where . Ifis a 2-D vector, then whereas if is a 2 2matrix, then with the columns of

. In this paper we often deal with zero-phase 2-D filters.On the unit sphere, such filters can be written as polynomialsin . We thus write for azero-phase filter in which and denote and ,respectively.

Abbreviations: A number of abbreviations are usedthroughout the paper.

NSCT: Nonsubsampled Contourlet Transform.NSFB: Nonsubsampled Filter Bank.NSDFB: Nonsubsampled Directional Filter Bank.NSP: Nonsubsampled Pyramid.NSWT: Nonsubsampled 2-D Wavelet Transform.LAS: Local Adaptive Shrinkage.

II. NONSUBSAMPLED CONTOURLETS AND FILTER BANKS

A. Nonsubsampled Contourlet Transform

Fig. 1(a) displays an overview of the proposed NSCT. Thestructure consists in a bank of filters that splits the 2-D fre-quency plane in the subbands illustrated in Fig. 1(b). Our pro-posed transform can thus be divided into two shift-invariantparts: 1) a nonsubsampled pyramid structure that ensures themultiscale property and 2) a nonsubsampled DFB structure thatgives directionality.

1) Nonsubsampled Pyramid (NSP): The multiscale propertyof the NSCT is obtained from a shift-invariant filtering structurethat achieves a subband decomposition similar to that of the

Fig. 2. Proposed nonsubsampled pyramid is a 2-D multiresolution expansionsimilar to the 1-D NSWT. (a) Three-stage pyramid decomposition. The lightergray regions denote the aliasing caused by upsampling. (b) Subbands on the 2-Dfrequency plane.

Laplacian pyramid. This is achieved by using two-channelnonsubsampled 2-D filter banks. Fig. 2 illustrates the proposednonsubsampled pyramid (NSP) decomposition withstages. Such expansion is conceptually similar to the one–di-mensional (1-D) NSWT computed with the à trous algorithm[9] and has redundancy, where denotes the numberof decomposition stages. The ideal passband support of thelow-pass filter at the th stage is the region .Accordingly, the ideal support of the equivalent high-passfilter is the complement of the low-pass, i.e., the region

. The filters forsubsequent stages are obtained by upsampling the filters ofthe first stage. This gives the multiscale property without theneed for additional filter design. The proposed structure is thusdifferent from the separable NSWT. In particular, one bandpassimage is produced at each stage resulting in redundancy.By contrast, the NSWT produces three directional images ateach stage, resulting in redundancy.

The 2-D pyramid proposed in [19] pp. 21 is obtained witha similar structure. Specifically, the NSFB of [19] is built fromlow-pass filter . One then sets , and thecorresponding synthesis filters . A similardecomposition can be obtained by removing the downsamplersand upsamplers in the Laplacian pyramid and then upsamplingthe filters accordingly. Those perfect reconstruction systems canbe seen as a particular case of our more general structure. Theadvantage of our construction is that it is general and as a re-sult, better filters can be obtained. In particular, in our design

and are low-pass and high-pass. Thus, they filtercertain parts of the noise spectrum in the processed pyramidcoefficients.

2) Nonsubsampled Directional Filter Bank (NSDFB): Thedirectional filter bank of Bamberger and Smith [18] is con-structed by combining critically-sampled two-channel fan filterbanks and resampling operations. The result is a tree-structuredfilter bank that splits the 2-D frequency plane into directionalwedges. A shift-invariant directional expansion is obtained witha nonsubsampled DFB (NSDFB). The NSDFB is constructedby eliminating the downsamplers and upsamplers in the DFB(see also [20]). This is done by switching off the downsam-plers/upsamplers in each two-channel filter bank in the DFB treestructure and upsampling the filters accordingly. This resultsin a tree composed of two-channel NSFBs. Fig. 3 illustrates a

Page 3: IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. …minhdo.ece.illinois.edu/publications/nsct.pdf · The Nonsubsampled Contourlet Transform: Theory, Design, and Applications ... based

DA CUNHA et al.: NSCT: THEORY, DESIGN, AND APPLICATIONS 3091

Fig. 3. Four-channel nonsubsampled directional filter bank constructed withtwo-channel fan filter banks. (a) Filtering structure. The equivalent filter in eachchannel is given by U (z) = U (z)U (z ). (b) Corresponding frequencydecomposition.

Fig. 4. Need for upsampling in the NSCT. (a) With no upsampling, the high-pass at higher scales will be filtered by the portion of the directional filter thathas “bad” response. (b) Upsampling ensures that filtering is done in the “good”region.

four channel decomposition. Note that in the second level, theupsampled fan filters , have checker-board fre-quency support, and when combined with the filters in the firstlevel give the four directional frequency decomposition shownin Fig. 3. The synthesis filter bank is obtained similarly. Justlike the critically sampled directional filter bank, all filter banksin the nonsubsampled directional filter bank tree structure areobtained from a single NSFB with fan filters [see Fig. 5(b)].Moreover, each filter bank in the NSDFB tree has the samecomputational complexity as that of the building-block NSFB.

3) Combining the Nonsubsampled Pyramid and Nonsub-sampled Directional Filter Bank in the NSCT: The NSCT isconstructed by combining the NSP and the NSDFB as shown inFig. 1(a). In constructing the NSCT, care must be taken when ap-plying the directional filters to the coarser scales of the pyramid.Due to the tree-structure nature of the NSDFB, the directional re-sponse at the lower and upper frequencies suffers from aliasingwhich can be a problem in the upper stages of the pyramid(see Fig. 8). This is illustrated in Fig. 4(a), where the passbandregion of the directional filter is labeled as “Good” or “Bad.”Thus, we see that for coarser scales, the high-pass channel ineffect is filtered with the bad portion of the directional filterpassband. This results in severe aliasing and in some observedcases a considerable loss of directional resolution.

We remedy this by judiciously upsampling the NSDFB fil-ters. Denote the th directional filter by . Then for higherscales, we substitute for where is chosen

Fig. 5. Two-channel NSFBs used in the NSCT. The system is two times re-dundant and the reconstruction is error free when the filters satisfy Bezout’sidentity. (a) Pyramid NSFB. (b) Fan NSFB.

to ensure that the good part of the response overlaps with thepyramid passband. Fig. 4(b) illustrates a typical example. Notethat this modification preserves perfect reconstruction. In a typ-ical five-scale decomposition, we upsample by the NSDFBfilters of the last two stages.

Filtering with the upsampled filters does not increase compu-tational complexity. Specifically, for a given sampling matrixand a 2-D filter , to obtain the output resulting fromfiltering with , we use the convolution formula

(1)

This is the à trous filtering algorithm [9] (“à trous” is Frenchfor “with holes”). Therefore, each filter in the NSDFB tree hasthe same complexity as that of the building-block fan NSFB.Likewise, each filtering stage of the NSP has the same com-plexity as that incurred by the first stage. Thus, the complexityof the NSCT is dictated by the complexity of the building-blockNSFBs. If each NSFB in both NSP and NSDFB requires op-erations per output sample, then for an image of pixels theNSCT requires about operations where denotes thenumber of subbands. For instance, if , a typical decom-position with four pyramid levels, 16 directions in the two finerscales, and eight directions in the two coarser scales would re-quire a total of 1536 operations per image pixel.

If the building block 2-channel NSFBs in the NSP andNSDFB are invertible, then clearly the NSCT is invertible. Italso underlies a frame expansion (see Section II-C). The frameelements are localized in space and oriented along a discrete setof directions. The NSCT is flexible in that it allows any numberof directions in each scale. In particular, it can satisfy theanisotropic scaling law—a key property in establishing theexpansion nonlinear approximation behavior [13], [14]. Thisproperty is ensured by doubling the number of directions in theNSDFB expansion at every other scale. The NSCT has redun-dancy given by , where denotes the number oflevels in the NSDFB at the th scale.

B. Nonsubsampled Filter Banks

At the core of the proposed NSCT structure is the 2-D two-channel NSFB. Shown in Fig. 5 are the NSFBs needed to con-struct the NSCT. In this paper, we focus exclusively on the FIRcase simply because it is easier to implement in multiple dimen-sions. For a general FIR two-channel NSFB, perfect reconstruc-

Page 4: IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. …minhdo.ece.illinois.edu/publications/nsct.pdf · The Nonsubsampled Contourlet Transform: Theory, Design, and Applications ... based

3092 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 15, NO. 10, OCTOBER 2006

tion is achieved provided the filters satisfy the Bezout identity,as follows:

(2)

The Bezout relation in (2) puts no constraint on the frequencyresponse of the filters involved. Therefore, to obtain good solu-tions, one has to impose additional conditions on the filters.

C. Frame Analysis of the NSCT

The NSFB can be interpreted in terms of analysis/synthesisoperators of frame systems. A family of vectors con-stitute a frame for a Hilbert space if there exist two positiveconstants such that for each we have

(3)

In the event that , the frame is said to be tight. The framebounds are the tightest positive constants satisfying (3).

Consider the NSFB of Fig. 5(a). The familyis a frame for if and only if there exist constants

such that [21]

(4)

Thus, the frame bounds of an NSFB can be computed by

(5)

where , and denote the essential infimum and es-sential supremum respectively. From (4), we see that the frameis tight whenever is almost everywhere constant. For FIRfilters, this means that .Such condition can only be met with linear phase FIR filters if

and are either trivial delays or combination of twodelays (for a formal proof, see [22, pp. 337–338] or [23]).

Because the NSFB is redundant, an infinite number of in-verses exist. Among them, the pseudoinverse is optimal in theleast-square sense [1]. Given a frame of analysis filters, the syn-thesis filters corresponding to the frame pseudoinverse are givenby for [21]. In this case, the syn-thesis filters form the dual frame with lower and upper framebounds given by and respectively. When the analysisfilters are FIR, then unless the frame is tight, the synthesis filtersof the pseudoinverse will be IIR.

From the above discussion, we gather two important points:1) linear phase filters and tight frames are mutually exclusiveand 2) the pseudoinverse is desirable, but is IIR if the frame isnot tight. Consequently, an FIR NSFB system with linear phasefilters and with synthesis filters corresponding to the pseudoin-verse is not possible. However, we can approximate the pseu-doinverse with FIR filters. For a given number of filter taps, thecloser the frame is to being tight, the better an FIR approxima-tion of the pseudoinverse can be [24]. Thus, in the designs of

TABLE IFRAME BOUNDS EVOLVING WITH SCALE FOR THE PYRAMID

FILTERS GIVEN IN EXAMPLE 1 IN SECTION III

the filters we seek linear phase filters that underly a frame thatis as close to a tight one as possible.

In a general FIR perfect reconstruction NSFB system, bothanalysis and synthesis filters form a frame. If we denote the anal-ysis and synthesis frame bounds by and respec-tively, the frames will be close to tight provided [24]

We always assume that the filters are normalized so that wehave . In case the pseudoinverse is used, then wealso have . The following result shows the NSCTis a frame operator for whenever the constituent NSFBseach forms a frame.

Proposition 1: In the NSCT, if the pyramid filter bank con-stitute a frame with frame bounds and , and the fan filtersconstitute a frame with frame bounds and , then the NSCTis a frame with bounds and , satisfying

Proof: See Appendix A:Remark 1: When both the pyramid and fan filter banks form

tight frames with bound 1, then , andfrom the above proposition, the NSCT is also a tight frame withbound 1.

The above estimates on and can be accurate in somecases, especially when the frame is close to a tight one and thenumber of levels is small (e.g., ). In general, however,they are not accurate estimates. Their purpose is more of givingan interval for the frame bounds rather than the actual values.Table I shows estimates for different number of scales. The ac-tual frame bound is computed from (4)–(5), whereas the esti-mates are the ones given according to Proposition 1.

III. FILTER DESIGN AND IMPLEMENTATION

The filter-design problem of the NSCT comprises the twobasic NSFBs displayed in Fig. 5. The goal is to design the filtersimposing the Bezout identity (i.e., perfect reconstruction) andenforcing other properties such as sharp frequency response,easy implementation, regularity of the frame elements, andtightness of the corresponding frames. It is also desirable thatthe filters are linear-phase.

Two-channel 1-D NSFBs that underly tight frames are de-signed in [21]. However, the design methodology of [21] is noteasy to extend to 2-D designs since it relies on spectral factoriza-tion which is hard in 2-D. If we relax the tightness constraint,

Page 5: IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. …minhdo.ece.illinois.edu/publications/nsct.pdf · The Nonsubsampled Contourlet Transform: Theory, Design, and Applications ... based

DA CUNHA et al.: NSCT: THEORY, DESIGN, AND APPLICATIONS 3093

Fig. 6. Lifting structure for the NSFB designed with the mapping approach.The 1-D prototype is factored with the Euclidean algorithm. The 2-D filters areobtained by replacing x 7! f(z).

then the design becomes more flexible. In addition, as we al-luded to earlier, non-tight filters can be linear phase.

An effective and simple way to design 2-D filters is the map-ping approach first proposed by McClellan [25] in the contextof digital filters and then used by several authors [23], [26]–[28]in the context of filter banks. In such an approach, the 2-D fil-ters are obtained from 1-D ones. In the context of NSFBs, a setof perfect reconstruction 2-D filters is obtained in the followingway.Step 1) Construct a set of 1-D polynomials

that satisfies theBezout identity.

Step 2) Given a 2-D FIR filter , thenare 2-D

filters satisfying the Bezout identity.Thus, one has to design the set of 1-D filters and the mapping

function so that the ideal responses are well approximatedwith a small number of filter coefficients. In the mappingapproach, one can control the frequency and phase responsesthrough the mapping function. Moreover, if the mappingfunction is zero-phase, then and it followsthat the mapped filters are also zero-phase. In this case, onthe unit sphere, the mapping function is a 2-D polynomial in

. We thus denote it by , where it isimplicit that .

A. Implementation Through Lifting

Filters designed with the mapping approach can be effi-ciently factored into a ladder [29] or lifting [30] structure thatsimplifies computations. To see this, assume without loss ofgenerality that the degree of the high-pass prototype poly-nomial is smaller than that of . Supposealso that there are synthesis filters andsuch that the Bezout identity is satisfied. In this case it followsthat . The Euclidean algorithm thenenables us to factor the filters in the following way [29], [30]:

(6)

As a result, we can obtain a 2-D factorization by replacingwith . This factorization characterizes every 2-D NSFB

derived from 1-D through the mapping method. Fig. 6 illustratesthe ladder structure with one stage.

In general, the lifting implementation at least halves thenumber of multiplications and additions of the direct form [30].

This can be shown by following the same reasoning as in thecritically-sampled case discussed in [30]. The complexity canbe reduced further if the lifting steps in the 1-D prototype aremonomials and the mapping filter has the form

(7)

for suitable , , and integers . Note that ifis a 1-D filter, then the 2-D filter for integers and

has the same complexity as that of . Therefore, filters ofthe form in (7) have the same complexity as that of separable fil-ters (i.e., filters of the form ) which amountsto two 1-D filtering operations. Notice that if is as in (7),then for an arbitrary sampling matrix , also has the formin (17). Consequently, all NSFBs in the NSDFB tree structurecan be implemented with 1-D operations whenever the buildingblock fan NSFB can be implemented with 1-D filtering opera-tions. The same reasoning applies to the NSFBs of the NSP.

B. Pyramid Filter Design

In the pyramid case, we impose line zeros at and. Notice that an th order line zero at ,

for example, amounts to a factor in the low-passfilter. Such zeros are useful to obtain good approximation of theideal frequency response of Fig. 5, in addition to imposing reg-ularity of the scaling function. We point out that for the approx-imation of smooth images, point zeros atwould suffice [31]. However, our experience shows that pointzeros alone do not guarantee a “reasonable” frequency responseof the pyramid filters. The following proposition characterizesthe mapping function that generates zeros in the resulting 2-Dfilters.

Proposition 2: Let be a polynomial with rootswhere each has multiplicity . Suppose we want a

mapping function such that

(8)

where is a bivariate polynomial. Thenhas the form in (8) if and only if

takes the form

(9)

for some root , where is a bivariatepolynomial, and are such that and

.Proof: See Appendix B:

The above result holds in general, even for point zeros (as op-posed to line zeros as in Proposition 2). We will explore thisextensively in the designs that follow.

Suppose the prototype filters each haszeros at . Then, in order to produce a suitable zero-phasemapping function for the pyramid NSFB we consider the classof maximally-flat filters given by the polynomials [32, p. 171]

(10)

Page 6: IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. …minhdo.ece.illinois.edu/publications/nsct.pdf · The Nonsubsampled Contourlet Transform: Theory, Design, and Applications ... based

3094 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 15, NO. 10, OCTOBER 2006

TABLE IIMAXIMALLY FLAT MAPPING POLYNOMIALS USED IN THE

DESIGN OF THE NONSUBSAMPLED FAN FILTER BANK

where controls the degree of flatness at and con-trols the degree flatness at . Following Proposition 2, wecan construct a family of mapping functions as

(11)

so that zero moments at and are guaranteed.Note that, except for the constant , has the form of(7) and hence can be implemented with 1-D filtering operationsonly.

C. Fan Filter Design

To design the fan filters idealized in Fig. 5(b), we use thesame methodology as in the pyramid case. The distinction oc-curs in the mapping function. The fan-shaped response can beobtained from a diamond-shaped response by simple modula-tion in one of the frequency variables. This modulation pre-serves the perfect reconstruction property. A useful family ofmapping functions for the diamond-shaped response is obtainedby imposing flatness around and inaddition to point zeros at . If the mapping function iszero-phase, this amounts to imposing flatness in a polynomialat and , and zeros at

. To obtain the fan mapping function, wesimply replace with , which corresponds to a shift byon the variable. If we use prototype 1-D filters with zeros at

, then similar to the pyramid case, we set

(12)

where the polynomials give the class of max-imally-flat half-band filters with diamond support. Aclosed-form expression for is given in [33].Table II displays the first six mapping functionsfor the resulting maximally-flat fan filter bank.

D. Design Examples

The design through mapping is based on a set of 1-D polyno-mials that satisfies the Bezout identity, i.e.,

The design can be simplified ifwe impose the restriction that and

. One advantage of this choice is thatthe frequency response of the filters can be controlled by thelow-pass branch—the high-pass will automatically have thecomplementary response. Another advantage is that, under an

additional condition, the frame bounds of the analysis and syn-thesis frames are the same and can be computed from the 1-Dprototypes. To see this, suppose that is the mapping functionand that with denoting therange of the mapping function . Then using (5), we have

(13)

(14)

A similar argument shows that

(15)

Example 1 (Pyramid Filters Very Close to Tight Ones): Inorder to get filters that are almost tight, we design the prototypes

and to be very close to each other. If we letand , then the

following filters can be checked to satisfy the Bezout identity:

The above filters are obtained from a ladder structure where theladder steps are given by , . To obtain the proto-type, we choose the constants and such that eachfilter has a zero at and, in addition, that

. We obtain

The lifting factorization of the prototype filters is given by

(16)with

Notice that this implementation has four multiplies/sample,whereas the direct form has seven multiplies/sample.

In this example we set, where is given in (11) so that each

of the filters has a fourth-order zero at and at. Since the ladder steps are monomials, the NSFB

can be implemented with 1-D filtering operations. The framebounds are computed using (13)–(15)

Page 7: IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. …minhdo.ece.illinois.edu/publications/nsct.pdf · The Nonsubsampled Contourlet Transform: Theory, Design, and Applications ... based

DA CUNHA et al.: NSCT: THEORY, DESIGN, AND APPLICATIONS 3095

Fig. 7. Magnitude response of the filters designed in Example 1 with maxi-mally-flat filters. The nonsubsampled pyramid filter bank underlies almost tightanalysis and synthesis frames.

Fig. 8. Fan filters designed with prototype filters of Example 1 and the fanmapping function of Table II.

Thus, we have and the frame is almost tight.The support size of is 13 13 whereas has supportsize 19 19. Fig. 7 shows the response of the resulting filters.

Example 2 (Maximally-Flat Fan Filters): Using the pro-totype 1-D filters of Example 1, we choosein Table II with . We then replace in (16) with

to obtain and . Fig. 8 shows themagnitude response of the filters. The support size of is21 21, whereas the support size of is 31 31.

E. Regularity of the NSCT Basis Functions

The regularity of the NSCT basis functions can be controlledby the NSP low-pass filters. Denoting by the scalinglow-pass filter used in the pyramid (we write insteadof for convenience), we have the associated scalingfunction

where convergence is in the weak sense. In our proposed designthe filter can be factored as

(17)

Notice that the remainder filter is not separable. There-fore, one cannot separate the regularity estimation as two 1-Dproblems. Nonetheless, a similar argument can be developedand an estimate of the 2-D regularity of the scaling filter isobtained.

Proposition 3: Let be a scaling filter as in (17) withthe corresponding scaling function . Let

Then

Proof: See Appendix C:As an example, consider the prototype filter in Example 1 andthe mapping . Theresulting filter has second-order zeros at and at

. It can be verified that andso that the regularity exponent2 is at leastfor and for . Thus the

corresponding scaling functions and wavelets are at least con-tinuous. We point out that better estimates are possible applyingsimilar 1-D techniques. For instance, one could prove a resultsimilar to Lemma 7.1.2 in [32, p. 217], as a consequence ofProposition 3.

Fig. 9 shows the basis functions of the NSCT obtained withthe filters designed via mapping. As the picture shows, the func-tions have a good degree of regularity.

IV. APPLICATIONS

A. Image Denoising

In order to illustrate the potential of the NSCT designed usingthe techniques previously discussed, we study additive whiteGaussian noise (AWGN) removal from images by means ofthresholding estimators.

1) Comparison to Other Transforms: To highlight the per-formance of the NSCT relative to other transforms, we performhard threshold on the subband coefficients of the various trans-forms. We choose the threshold

for each subband. This has been termed -sigma thresholdingin [34]. We set for the finest scale and for theremaining ones. We use five scales of decomposition for both

2The regularity exponent of a scaling function �(t) is the largest number �such that �(!) decays as fast as 1=(1 + j! j + j! j) .

Page 8: IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. …minhdo.ece.illinois.edu/publications/nsct.pdf · The Nonsubsampled Contourlet Transform: Theory, Design, and Applications ... based

3096 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 15, NO. 10, OCTOBER 2006

Fig. 9. Basis functions of the nonsubsampled contourlet transform. (a) Basis functions of the second stage of the pyramid. (b) Basis functions of third (top eight)and fourth (bottom eight) stages of the pyramid.

TABLE IIIDENOISING PERFORMANCE OF THE NSCT. THE LEFT-MOST COLUMNS ARE HARD THRESHOLDING AND THE RIGHT-MOST ONES SOFT ESTIMATORS. FOR HARD

THRESHOLDING, THE NSCT CONSISTENTLY OUTPERFORMS CURVELETS AND THE NSWT. THE NSCT-LAS PERFORMS ON A PAR WITH THE MORE

SOPHISTICATED ESTIMATOR BLS-GSM [8] AND IS SUPERIOR TO THE BIVSHRINK ESTIMATOR OF [7]

NSCT, contourlet transform (CT), and NSWT. For the NSCTand CT we use 4, 8, 8, 16, 16 directions in the scales fromcoarser to finer, respectively.

Table III (left columns) shows the PSNR results for varioustransforms and noise intensities. The results show the NSCT isconsistently superior to curvelets and NSWT in PSNR measure.For the “Barbara” image the NSCT yields improvements in ex-

cess of 1.90 dB in PSNR over the NSWT. The NSCT also is su-perior to the CT as the results show. Fig. 10 displays the recon-structed images using the the NSWT, curvelets and NSCT. Asthe figure shows, both the NSCT and the curvelet transform of-fers a better recovery of edge information relative to the NSWT.But improvements can be seen in the NSCT, particularly aroundthe eye.

Page 9: IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. …minhdo.ece.illinois.edu/publications/nsct.pdf · The Nonsubsampled Contourlet Transform: Theory, Design, and Applications ... based

DA CUNHA et al.: NSCT: THEORY, DESIGN, AND APPLICATIONS 3097

Fig. 10. Image denoising with the NSCT and hard thresholding. The noisyintensity is 20. (a) Original Lena image. (b) Denoised with the NSWT,PSNR = 31:40 dB. (c) Denoised with the curvelet transform and hard thresh-olding,PSNR = 31:52 dB. (d) Denoised with the NSCT,PSNR = 32:03 dB.

2) Comparison To Other Denoising Methods: We performsoft thresholding (shrinkage) independently in each subband.Following [6], we choose the threshold

where denotes the variance of the -th coefficient atthe -th directional subband of the -th scale, and is thenoise variance at scale and direction . It is shown in [6] thatshrinkage estimation with , and assuminggeneralized Gaussian distributed yields a risk within 5% of theoptimal Bayes risk. As studied in [35], contourlet coefficientsare well modelled by generalized Gaussian distributions. Thesignal variances are estimated locally using the neighboringcoefficients contained in a square window within each subbandand a maximum likelihood estimator. The noise variance ineach subband is inferred using a Monte Carlo technique wherethe variances are computed for a few normalized noise imagesand then averaged to stabilize the results. We refer to thismethod as local adaptive shrinkage (LAS). Effectively, ourLAS method is a simplified version of the denoising methodproposed in [36] that works in the NSCT or NSWT domain. Inthe LAS estimator we use four scales for both the NSCT andNSWT. For the NSCT we use 3, 3, 4, 4 directions in the scalesfrom coarser to finer, respectively.

To benchmark the performance of the NSCT-LAS scheme,we have used two of the best denoising methods in the lit-erature: 1) bivariate shrinkage with local variance estimation(BivShrink) [7]; 2) Bayes least squares with a Gaussianscale-mixture model (BLS-GSM) proposed in [8]. Table III

(right columns) shows the results obtained.3 The NSCT cou-pled with the LAS estimator (NSCT-LAS) produced verysatisfactory results. In particular, among the methods studied,the NSCT-LAS yields the best results for the “Barbara” image,being surpassed by the BLS-GSM method for the other images.Despite its slight loss in performance relative to BLS-GSM,we believe the NSCT has potential for better results. Thisis because by comparison, the BLS-GSM is a considerablyricher and more sophisticated estimation method than oursimple local thresholding estimator. However, studying morecomplex denoising methods in the NSCT domain is beyondthe scope of the present paper. Fig. 11 displays the denoisedimages with both BLS-GLM and NSCT-LAS methods. As thepictures show, the NSCT offers a slightly better reconstruction.In particular, the tablecloth texture is better recovered in theNSCT-LAS scheme.

We briefly mention that in denoising applications, one canreduce the redundancy of the NSCT by using critically sam-pled directional filter banks over the nonsubsampled pyramid.This results in a transform with redundancy which isconsiderably faster. There is however a loss in performance asTable IV shows. Nonetheless, in some applications, the smallperformance loss might be a good price to pay given the reducedredundancy of this alternative construction.

B. Image Enhancement

Existing image-enhancement methods amplify noise whenthey amplify weak edges since they cannot distinguish noisefrom weak edges. In the frequency domain, both weak edges andnoise produce low-magnitude coefficients. Since weak edges aregeometric structures and noise is not, we can use the NSCT todistinguish them.

The NSCT is shift-invariant so that each pixel of the transformsubbands corresponds to that of the original image in the samespatial location. Therefore, we gather the geometrical informa-tion pixel by pixel from the NSCT coefficients. We observe thatthere are three classes of pixels: strong edges, weak edges, andnoise. First, the strong edges correspond to those pixels withlarge magnitude coefficients in all subbands. Second, the weakedges correspond to those pixels with large magnitude coeffi-cients in some directional subbands but small magnitude co-efficients in other directional subbands within the same scale.Finally, the noise corresponds to those pixels with small magni-tude coefficients in all subbands. Based on this observation, wecan classify pixels into three categories by analyzing the dis-tribution of their coefficients in different subbands. One simpleway is to compute the mean (denoted by mean) and the max-imum (denoted by max) magnitude of the coefficients for eachpixel across directional subbands, and then classify it by

(18)

where is a parameter ranging from 1 to 5, and is the noisestandard deviation of the subbands at a specific pyramidal level.

3The PSNR values of the BivShrink method were obtained from the tables in[7]. In [7], the authors do not use the “Peppers” image as a test image, hence wedo not have a BivShrink column for “Peppers.”

Page 10: IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. …minhdo.ece.illinois.edu/publications/nsct.pdf · The Nonsubsampled Contourlet Transform: Theory, Design, and Applications ... based

3098 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 15, NO. 10, OCTOBER 2006

Fig. 11. Comparison between the NSCT-LAS and BLS-GSM denoising methods. The noise intensity is 20. (a) Original Barbara image. (b) Denoised with theBLS-GSM method of [8], PSNR = 30:28 dB. (c) Denoised with NSCT-LAS, PSNR = 30:60 dB.

TABLE IVRELATIVE LOSS IN PSNR PERFORMANCE (DB) WHEN USING THE CRITICALLY

SAMPLED DFB INSTEAD OF THE NSDFB IN THE NSCT-LAS METHOD

We first estimate the noise variance of the input image with therobust median operator [36] and then compute the noise vari-ance of each subband [34].

The goal of image enhancement is to amplify weak edges andto suppress noise. To this end, we modify the NSCT coefficientsaccording to the category of each pixel by a nonlinear mappingfunction (similar to [37])

(19)

where the input is the original coefficient, and is theamplifying gain. This function keeps the coefficients of strongedges, amplifies the coefficients of weak edges, and zeros thenoise coefficients.

We summarize our enhancement method using the NSCT inthe following algorithm.

1) Compute the NSCT of the input image for levels.2) Estimate the noise standard deviation of the input image.3) For each level of the pyramid:

a) Estimate the noise variance.b) At each pixel location, compute the mean and the

maximum magnitude of the corresponding coeffi-cients in all directional subbands at this level, andclassify each pixel according to (18) into “strongedges,” “weak edges,” or noise.

c) For each directional subband, use the nonlinear map-ping function given in (19) to modify the NSCT coef-ficients according to the classification.

4) Reconstruct the enhanced image from the modified NSCTcoefficients.

TABLE VDV AND BV COMPARISONS FOR ENHANCEMENT METHODS

We compare the enhancement results by the proposed algo-rithm with those by the NSWT. In the experiments, we choose

and . To evaluate the enhancement performanceobjectively, the detailed variance (DV) and background vari-ance (BV) were proposed in [38]. The DV and BV values repre-sent the variance of foreground and background pixels, respec-tively. A good enhancement methods should increase the DV ofthe original image but not the BV. We use the BV and DV tocompare the enhancement performance of the NSWT and theNSCT in Table V. Fig. 12 shows the results obtained for the“Barbara” image. We observe that our proposed algorithm of-fers better results in enhancing the weak edges in the textures.

V. CONCLUSION

We have developed a fully shift-invariant version of the con-tourlet transform, the NSCT. The design of the NSCT is re-duced to the design of a nonsubsampled pyramid filter bankand a nonsubsampled fan filter bank. We exploit this new lessstringent filter-design problem using a mapping approach, thusdispensing with the need for 2-D factorization. We also devel-oped a lifting/ladder structure for the 2-D NSFB. This structure,when coupled with the filters designed via mapping, provides avery efficient implementation that under some additional condi-tions can be reduced to 1-D filtering operations. Applicationsof our proposed transform in image denoising and enhance-ment were studied. In denoising, we studied the performanceof the NSCT when coupled with a hard thresholding estimatorand a local adaptive shrinkage. For hard thresholding, our re-sults indicate that the NSCT provides better performance thancompeting transform such as the NSWT and curvelets. Con-currently, our local adaptive shrinkage results are competitiveto other denoising methods. In particular, our results show that

Page 11: IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. …minhdo.ece.illinois.edu/publications/nsct.pdf · The Nonsubsampled Contourlet Transform: Theory, Design, and Applications ... based

DA CUNHA et al.: NSCT: THEORY, DESIGN, AND APPLICATIONS 3099

Fig. 12. Image enhancement with the NSCT. (a) Original zoomed “Barbara” image. (b) Enhanced by the NSWT. (c) Enhanced by the NSCT.

a fairly simple estimator in the NSCT domain yields compa-rable performance to state-of-the-art denoising methods that aremore sophisticated and complex. In image enhancement, theresults obtained with the NSCT are superior to those of theNSWT both visually and with respect to objective measure-ments. A MATLAB toolbox that implements the NSCT can bedownloaded from MATLAB Central (http://www.mathworks.com/matlabcentral/).

APPENDIX

A. Proof of Proposition 1

Consider the pyramid shown in Fig. 2(a). If , then wehave that . Now, suppose we havelevels and assume . Then if we furthersplit into and , noting that , we have

Thus, by induction, we conclude thatfor any . A similar argument shows that in the NSDFB

with stages, one has that so that

The bound for is proved similarly, by just reversing theinequalities.

B. Proof of Proposition 2

We prove the claim for the case in which the zeros ofare distinct. The proof for the case of repeated roots

can be handled similarly.Denote

(20)

Then, sufficiency follows by direct substitution of (9) in (20).We prove necessity by induction. Suppose

for somepolynomial . Note that for all

if and only if for some zero offor all . So, it follows:

(21)

which implies that witha polynomial. Suppose

where . By successively applying the chain rule fordifferentiation, we get

Because and the s are distinct, we haveand then

(22)

(23)

Page 12: IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. …minhdo.ece.illinois.edu/publications/nsct.pdf · The Nonsubsampled Contourlet Transform: Theory, Design, and Applications ... based

3100 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 15, NO. 10, OCTOBER 2006

where (23) follows by successively integrating (22). Com-bining (23) with (21) we obtain that

. By induction, we conclude that

If for some poly-nomial , then a similar argument shows that

where is a zero of . Thus, . There-fore

and so

and (9) is established with and .

C. Proof of Proposition 3

The proof follows the same lines as for the 1-D case (see [32,pp. 216]). Using the identity

we have

Because is continuously differentiable at , the samealso holds for . Now write in polar coordinateswhere . Since is continuously differ-entiable, for each is a continuously differentiablefunction of . Since , from the mean value theoremwe have that for , and

where .This gives

where we have used the inequality . Now chooseso that . We then obtain, for

so that for each

Now, putting it all together, we obtain

which completes the proof.

ACKNOWLEDGMENT

The authors would like to thank J. Laska for helping with theimplementation of the NSCT and Dr. J.-L. Starck for providingthe curvelet denoising software.

REFERENCES

[1] S. Mallat, A Wavelet Tour of Sinal Processing, 2nd ed. New York:Academic, 1999.

[2] M. Vetterli and J. Kovacevic, Wavelets and Subband Coding. Engle-wood Cliffs, NJ: Prentice-Hall, 1995.

[3] J. G. Daugman, “Uncertainty relation for resolution in space, spatial fre-quency, and orientation optimized by two-dimensional visual corticalfilters,” J. Opt Soc. Amer. A, vol. 2, no. 7, pp. 1160–1169, Jul. 1985.

[4] E. P. Simoncelli, W. T. Freeman, E. H. Adelson, and D. J. Heeger,“Shiftable multiscale transforms,” IEEE Trans. Inf. Theory, vol. 38, no.2, pp. 587–607, Mar. 1992.

[5] R. R. Coifman and D. L. Donoho, “Translation invariant de-noising,” inWavelets and Statistics, A. Antoniadis and G. Oppenheim, Eds. NewYork: Springer-Verlag, 1995, pp. 125–150.

[6] S. G. Chang, B. Yu, and M. Vetterli, “Adaptive wavelet thresholdingfor image denoising and compression,” IEEE Trans. Image Process.,vol. 9, no. 9, pp. 1532–1546, Sep. 2000.

[7] L. Sendur and I. W. Selesnick, “Bivariate shrinkage with local varianceestimation,” IEEE Signal Process. Lett., vol. 9, no. 12, pp. 438–441,Dec. 2002.

[8] J. Portilla, V. Strela, M. J. Wainwright, and E. P. Simoncelli, “Image de-noising using scale mixtures of Gaussians in the wavelet domain,” IEEETrans. Image Process., vol. 12, no. 11, pp. 1338–1351, Nov. 2003.

[9] M. J. Shensa, “The discrete wavelet transform: Wedding the à trous andMallat algorithms,” IEEE Trans. Signal Process., vol. 40, no. 10, pp.2464–2482, Oct. 1992.

[10] D. L. Donoho, “Wedgelets: Nearly minimax estimation of edges,” Ann.Statist., vol. 27, no. 3, pp. 859–897, 1999.

[11] M. B. Wakin, J. K. Romberg, H. Choi, and R. G. Baraniuk, “Wavelet-domain approximation and compression of piecewise smooth images,”IEEE Trans. Image Process., vol. 15, no. 5, pp. 1071–1087, May 2006.

[12] E. L. Pennec and S. Mallat, “Sparse geometric image representationwith bandelets,” IEEE Trans. Image Process., vol. 14, no. 4, pp.423–438, Apr. 2005.

Page 13: IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. …minhdo.ece.illinois.edu/publications/nsct.pdf · The Nonsubsampled Contourlet Transform: Theory, Design, and Applications ... based

DA CUNHA et al.: NSCT: THEORY, DESIGN, AND APPLICATIONS 3101

[13] E. J. Candès and D. L. Donoho, “New tight frames of curvelets andoptimal representations of objects with piecewise C singularities,”Commun. Pure Appl. Math, vol. 57, no. 2, pp. 219–266, Feb. 2004.

[14] M. N. Do and M. Vetterli, “The contourlet transform: An efficient di-rectional multiresolution image representation,” IEEE Trans. ImageProcess., vol. 14, no. 12, pp. 2091–2106, Dec. 2005.

[15] J. G. Rosiles and M. J. T. Smith, “A low complexity overcomplete di-rectional image pyramid,” in Proc. Int. Conf. Image Processing (ICIP),Barcelona, Spain, 2003, vol. 1, pp. 1049–1052.

[16] P. J. Burt and E. H. Adelson, “The Laplacian pyramid as a compactimage code,” IEEE Trans. Commun., vol. 31, no. 4, pp. 532–540, Apr.1983.

[17] M. N. Do and M. Vetterli, “Framing pyramids,” IEEE Trans. SignalProcess., vol. 51, no. 9, pp. 2329–2342, Sep. 2003.

[18] R. H. Bamberger and M. J. T. Smith, “A filter bank for the directionaldecomposition of images: Theory and design,” IEEE Trans. SignalProcess., vol. 40, no. 4, pp. 882–893, Apr. 1992.

[19] J. L. Starck, F. Murtagh, and A. Bijaoui, Image Processing and DataAnalysis. Cambridge, U.K.: Cambridge Univ. Press, 1998.

[20] M. A. U. Khan, M. K. Khan, and M. A. Khan, “Coronary angiogramimage enhancement using decimation-free directional filter banks,” inProc. Int. Conf. Acoustics, Speech, and Signal Proc. (ICASSP), Mon-treal, QC, Canada, 2004, pp. 441–444.

[21] Z. Cvetkovic and M. Vetterli, “Oversampled filler banks,” IEEE Trans.Signal Process., vol. 46, no. 5, pp. 1245–1255, May 1998.

[22] P. P. Vaidyanathan, Multirate Systems and Filterbanks. EnglewoodCliffs, NJ: Prentice-Hall, 1993.

[23] J. Kovacevic and M. Vetterli, “Nonseparable multidimensional perfectreconstruction filter banks and wavelet bases for ,” IEEE Trans. Inf.Theory, vol. 38, no. 2, pp. 533–555, Mar. 1992.

[24] H. Bölcskei, F. Hlawatsch, and H. G. Feichtinger, “Frame-theoreticanalysis of oversampled filter banks,” IEEE Trans. Signal Process., vol.46, no. 12, pp. 3256–3268, Dec. 1998.

[25] J. H. McClellan, “The design of two-dimensional digital filters bytransformation,” in Proc. 7th Annu. Princeton Conf. InformationSciences and Systems, 1973.

[26] S.-M. Phoong, C. W. Kim, P. P. Vaidyanathan, and R. Ansari, “Anew class of two-channel biorthogonal filter banks and wavelet bases,”IEEE Trans. Signal Process., vol. 43, no. 3, pp. 649–661, Mar. 1995.

[27] D. B. H. Tay and N. G. Kingsbury, “Flexible design of multidimen-sional perfect reconstruction FIR 2-band filters using transformationof variables,” IEEE Trans. Image Process., vol. 2, no. 4, pp. 466–480,Oct. 1993.

[28] A. L. Cunha and M. N. Do, “On two-channel filter banks with direc-tional vanishing moments,” IEEE Trans. Image Process., submitted forpublication.

[29] S. Mitra and R. Sherwood, “Digital ladder networks,” IEEE Trans.Audio Electroacoust., vol. AU-21, no. 1, pp. 30–36, Feb. 1973.

[30] W. Sweldens, “The lifting scheme: A custom-design construction ofbiorthogonal wavelets,” Appl. Comput. Harmon. Anal., vol. 3, no. 2,pp. 186–200, 1996.

[31] R. Jia, “Approximation properties of multivariate wavelets,” Math.Comput., vol. 67, pp. 647–665, 1998.

[32] I. Daubechies, Ten Lectures on Wavelets. Philadelphia, PA: SIAM,1992.

[33] T. Cooklev, T. Yosbida, and A. Nishihara, “Maximally flat half-banddiamond-shaped FIR filters using the Bernstein polynomial,” IEEETrans. Circuits Syst. II, vol. 40, no. 11, pp. 749–751, Nov. 1993.

[34] J.-L. Starck, E. J. Candès, and D. L. Donoho, “The curvelet transformfor image denoising,” IEEE Trans. Image Process., vol. 11, no. 6, pp.670–684, Jun. 2002.

[35] D. D.-Y. Po and M. N. Do, “Directional multiscale modeling of im-ages using the contourlet transform,” IEEE Trans. Image Process., sub-mitted for publication.

[36] S. G. Chang, B. Yu, and M. Vetterli, “Spatially adaptive wavelet thresh-olding with context modeling for image denoising,” IEEE Trans. ImageProcess., vol. 9, no. 9, pp. 1522–1531, Sep. 2000.

[37] K. V. Velde, “Multi-scale color image enhancement,” in Proc. IEEEInt. Conf. Image Processing, 1999, vol. 3, pp. 584–587.

[38] G. Ramponi, N. Strobel, S. K. Mitra, and T.-H. Yu, “Nonlinear unsharpmasking methods for image contrast enhancement,” J. Electron. Imag.,vol. 5, no. 3, pp. 353–366, Mar. 1996.

Arthur L. da Cunha received the B.S. degreein electrical engineering from the University ofBrasilia, Brasilia, Brazil, in 2000, and the M.S.degree in electrical engineering form the PontificalCatholic University of Rio de Janeiro (PUC-RIO),Rio de Janiero, Brazil, in 2002. He is currentlypursuing the Ph.D. degree in the Department ofElectrical and Computer Engineering, University ofIllinois at Urbana-Champaign, Urbana.

During the fall of 2002, he was a graduate studentat Georgia Institute of Technology, Atlanta. He is cur-

rently a Research Assistant with the Coordinated Science Laboratory, Univer-sity of Illinois at Urbana-Champaign. In the summer of 2005, he was a Re-search Engineer at NTT DoCoMo Labs USA, and subsequently worked in theAudiovisual Communication Lab, Swiss Federal Institute of Technology, Lau-sanne, Switzerland. His research interests include signal representation, estima-tion, and coding.

Mr. da Cunha has won several awards and grants from the Brazilian NationalResearch Council, including two graduate fellowships. In 2005, he won a BestStudent Paper Award at the ICASSP Conference (Philadelphia, PA).

Jianping Zhou (S’05–M’05) received the B.Sc.degree in applied mathematics from Xi’an JiaotongUniversity, Xi’an, China, in 1997, the M.Eng. degreein signal and information processing from PekingUniversity, Beijing, China, in 2000, and the Ph.D.degree in electrical engineering from the Universityof Illinois at Urbana-Champaign, Urbana, in 2005.

He is a Member of Technical Staff with the DSPSolution R&D Center, Texas Instruments, Dallas,TX. From 2002 to 2005, he was a Research Assistantwith the Coordinated Science Laboratory, University

of Illinois. From 2000 to 2002, he was an Assistant Researcher with theInternet Media Group and the Wireless and Networking Group, MicrosoftResearch Asia, Beijing. From 1997 to 2000, he was a Research Assistant withthe National Laboratory on Machine Perception, Beijing, China. His researchinterests include wavelets, computer vision, digital color processing, image andmultidmensional signal processing, multimedia coding and communication,and multimedia delivery over wireless Internet.

Dr. Zhou received an exceptional class scholarship and the “Model of Excel-lent Students,” the highest honor for students in Peking University in 2000, andan exceptional class scholarship and “Model of Excellent Students,” the highesthonor for students in Xi’an Jiaotong University, in 1995 and 1996.

Minh N. Do (M’02) was born in Thanh Hoa,Vietnam, in 1974. He received the B.Eng. degreein computer engineering from the University ofCanberra, Canberra, ACT, Australia, in 1997, andthe Dr.Sci. degree in communication systems fromthe Swiss Federal Institute of Technology Lausanne(EPFL), Lausanne, Switzerland, in 2001.

Since 2002, he has been an Assistant Professorwith the Department of Electrical and ComputerEngineering and a Research Assistant Professorwith the Coordinated Science Laboratory and the

Beckman Institute, both at the University of Illinois at Urbana-Champaign,Urbana. His research interests include wavelets, image and multidimensionalsignal processing. multiscale geometric analysis, and visual informationrepresentation.

Dr. Do received a Silver Medal from the 32nd International MathematicalOlympiad in 1991, a University Medal from the University of Canberra in 1997,the Best Doctoral Thesis Award from the EPFL in 2001, and a CAREER awardfrom the National Science Foundation in 2003.