Single Image Super-Resolution from
Transformed Self-Exemplars
Jia-Bin Huang    Abhishek Singh    Narendra Ahuja
Single Image Super-Resolution
• Recovering high-resolution image from low-resolution one
Spatial frequency
Amplitude
Super-Resolution
Sharpening
Multi-image vs. Single-image
Multi-image
Source: [Park et al. SPM 2003]
Single-image
Source: [Freeman et al. CG&A 2002]
External Example-based Super-Resolution
Learning to map from low-res to high-res patches
• Nearest neighbor [Freeman et al. CG&A 02]
• Neighborhood embedding [Chang et al. CVPR 04]
• Sparse representation [Yang et al. TIP 10]
• Kernel ridge regression [Kim and Kwon PAMI 10]
• Locally-linear regression [Yang and Yang ICCV 13] [Timofte et al. ACCV 14]
• Convolutional neural network [Dong et al. ECCV 14]
• Random forest [Schulter et al. CVPR 15]
External dictionary
Internal Example-based Super-Resolution
Low-res and high-res example pairs from patch recurrence across scale
• Non-local means with self-examples [Ebrahimi and Vrscay ICIRA 2007]
• Unified classical and example SR [Glasner et al. ICCV 2009]
• Local self-similarity [Freedman and Fattal TOG 2011]
• In-place regression [Yang et al. ICCV 2013]
• Nonparametric blind SR [Michaeli and Irani ICCV 2013]
• SR for noisy images [Singh et al. CVPR 2014]
• Sub-band self-similarity [Singh et al. ACCV 2014]
Internal dictionary
Motivation
• Internal dictionary
  • More “relevant” patches
  • Limited number of examples
• High-res patches are often available in the transformed domain
Symmetry Surface orientation Perspective distortion
Super-Resolution from Transformed Self-Exemplars
LR input image Matching error LR patch HR patch
Translation
Perspective
Ground truth LR/HR patch
Translation
Ground truth LR/HR patch
Affine transform
LR input image Matching error LR patch HR patch
Input low-res image
All-frequency band    Low-frequency band
Super-Resolution Scheme
Multi-scale version of [Freedman and Fattal TOG 2011]
Input low-res image
LR/HR example pairs
Super-Resolution Scheme
Multi-scale version of [Freedman and Fattal TOG 2011]
All-frequency band    Low-frequency band
Input low-res image
low-frequency band
?
All-frequency band
Input low-res image
All-frequency band    Low-frequency band
?
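The scheme above can be sketched in code. The idea: downsampling the input LR image yields a coarser version whose patches (low-frequency band) pair with the co-located patches of the input itself (all-frequency band), giving LR/HR training examples with no external data. This is a minimal illustrative sketch, not the paper's implementation; it stands in a 2×2 box filter and nearest-neighbor upsampling for the bicubic resampling actually used, and `build_internal_pairs` is a hypothetical helper name.

```python
import numpy as np

def build_internal_pairs(img, patch=5):
    """Pair each low-frequency patch with its co-located all-frequency patch.
    Downsampling: simple 2x2 box filter; upsampling: nearest-neighbor
    (stand-ins for the bicubic resampling used in practice)."""
    h, w = img.shape
    h, w = h - h % 2, w - w % 2          # crop to even size
    img = img[:h, :w]
    # 2x downsample: average each 2x2 block
    small = img.reshape(h // 2, 2, w // 2, 2).mean(axis=(1, 3))
    # upsample back to full size -> the low-frequency band of the input
    low_freq = np.repeat(np.repeat(small, 2, axis=0), 2, axis=1)
    lr, hr = [], []
    for y in range(0, h - patch + 1, patch):
        for x in range(0, w - patch + 1, patch):
            lr.append(low_freq[y:y + patch, x:x + patch])  # low-frequency example
            hr.append(img[y:y + patch, x:x + patch])       # all-frequency example
    return np.array(lr), np.array(hr)
```

At test time, the low-frequency band of the desired HR image is matched against the `lr` examples, and the corresponding `hr` patches supply the missing high-frequency content.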
Super-Resolution as Nearest Neighbor Field Estimation
Appearance cost Plane compatibility Scale cost
[Huang et al. SIGGRAPH 2014] Scale
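The three terms above combine into a single matching cost per candidate patch. The following is an illustrative sketch only: the weights `w_plane`, `w_scale` and the exact form of each penalty are assumptions for clarity, not the paper's values, and `patch_cost` is a hypothetical function name.

```python
import numpy as np

def patch_cost(target, source, plane_dev=0.0, scale=1.0,
               w_plane=1.0, w_scale=1.0):
    """Combined matching cost for a candidate source patch:
    - appearance: SSD between the target patch and the (already warped)
      source patch;
    - plane_dev: deviation of the patch transformation from the detected
      planar perspective (plane-compatibility term);
    - scale: scale change induced by the transformation; matches that come
      from a finer scale (scale < 1) are preferred, so magnifying
      transformations are penalized."""
    appearance = np.sum((target - source) ** 2)
    plane_term = w_plane * plane_dev
    scale_term = w_scale * max(0.0, scale - 1.0)
    return appearance + plane_term + scale_term
```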
Search Patch Transformation
• Generalized PatchMatch [Barnes et al. ECCV 2010]
  • Randomization
  • Spatial propagation
• Backward compatible: falls back to translation-only search when no planar structures are detected
Perspective    Similarity    Affine    [Huang et al. SIGGRAPH 2014]
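The randomization and spatial-propagation steps can be sketched with a minimal translation-only nearest-neighbor field, as in the original PatchMatch; the generalized version additionally samples affine/perspective transformation parameters in the random-search step. This is an illustrative sketch under that simplification, and `patchmatch_nnf` is a hypothetical name.

```python
import numpy as np

def patchmatch_nnf(src, dst, patch=5, iters=4, rng=None):
    """PatchMatch-style NNF estimation (translations only).
    Returns, for each src patch, the (y, x) of its best match in dst."""
    rng = np.random.default_rng(0) if rng is None else rng
    H, W = src.shape[0] - patch + 1, src.shape[1] - patch + 1
    Hd, Wd = dst.shape[0] - patch + 1, dst.shape[1] - patch + 1

    def cost(y, x, ny, nx):
        a = src[y:y + patch, x:x + patch]
        b = dst[ny:ny + patch, nx:nx + patch]
        return np.sum((a - b) ** 2)

    # randomization: start from a random field
    nnf = np.stack([rng.integers(0, Hd, (H, W)),
                    rng.integers(0, Wd, (H, W))], axis=-1)
    best = np.array([[cost(y, x, *nnf[y, x]) for x in range(W)]
                     for y in range(H)])

    for it in range(iters):
        d = 1 if it % 2 == 0 else -1          # alternate scan direction
        ys = range(H) if d == 1 else range(H - 1, -1, -1)
        for y in ys:
            xs = range(W) if d == 1 else range(W - 1, -1, -1)
            for x in xs:
                # spatial propagation: try shifted neighbor offsets
                for py, px in ((y - d, x), (y, x - d)):
                    if 0 <= py < H and 0 <= px < W:
                        ny = min(max(nnf[py, px, 0] + (y - py), 0), Hd - 1)
                        nx = min(max(nnf[py, px, 1] + (x - px), 0), Wd - 1)
                        c = cost(y, x, ny, nx)
                        if c < best[y, x]:
                            nnf[y, x], best[y, x] = (ny, nx), c
                # random search in shrinking windows around current best
                r = max(Hd, Wd)
                while r >= 1:
                    ny = int(np.clip(nnf[y, x, 0] + rng.integers(-r, r + 1), 0, Hd - 1))
                    nx = int(np.clip(nnf[y, x, 1] + rng.integers(-r, r + 1), 0, Wd - 1))
                    c = cost(y, x, ny, nx)
                    if c < best[y, x]:
                        nnf[y, x], best[y, x] = (ny, nx), c
                    r //= 2
    return nnf, best
```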
Results
Datasets – BSD 100 and Urban 100
Berkeley segmentation dataset (100 test images)
Urban image dataset from Flickr (100 test images)
Dataset – Set5, Set14, and Sun-Hays 80
Set5
Set 14 Sun-Hays 80 [Sun and Hays ICCP 12]
Ground-truth HR
SRCNN [Dong et al. ECCV 14] Glasner [Glasner et al. ICCV 2009]
Our result
Bicubic    SR Factor 4x
A+ [Timofte et al. ACCV 14]
Ground-truth HR    SR Factor 4x
SRCNN [Dong et al. ECCV 14] Glasner [Glasner et al. ICCV 2009]
Our result
Bicubic
A+ [Timofte et al. ACCV 14]
Ground-truth HR    SR Factor 4x
SRCNN [Dong et al. ECCV 14] Glasner [Glasner et al. ICCV 2009]
Our result
Bicubic
A+ [Timofte et al. ACCV 14]
Bicubic
SRCNN [Dong et al. ECCV 14] A+ [Timofte et al. ACCV 14]
Our result
Ground-truth HR
Sub-band [Singh et al. ACCV 2014]
Ground-truth
SRCNN [Dong et al. ECCV 14]
Glasner [Glasner et al. ICCV 2009]
Our result
Ground-truth HR
SRCNN [Dong et al. ECCV 14]
Glasner [Glasner et al. ICCV 2009]
Our result
Bicubic
SRCNN [Dong et al. ECCV 14] A+ [Timofte et al. ACCV 14]
Our result
Ground-truth HR
Sub-band [Singh et al. ACCV 2014]
Bicubic
SRCNN [Dong et al. ECCV 14] A+ [Timofte et al. ACCV 14]
Our result
Ground-truth HR
Sub-band [Singh et al. ACCV 2014]    Our result
Bicubic
SRCNN [Dong et al. ECCV 14] A+ [Timofte et al. ACCV 14]
Our result
Ground-truth HR
Glasner [Glasner et al. ICCV 2009]
Bicubic
SRCNN [Dong et al. ECCV 14] A+ [Timofte et al. ACCV 14]
Our result
Ground-truth HR
ScSR [Yang et al. TIP 10]
Bicubic
SRCNN [Dong et al. ECCV 14] A+ [Timofte et al. ACCV 14]
Our result
Ground-truth HR
ScSR [Yang et al. TIP 10]
Bicubic
SRCNN [Dong et al. ECCV 14] A+ [Timofte et al. ACCV 14]
Our result
Ground-truth HR
ScSR [Yang et al. TIP 10]
BSD 100 Dataset – SR factor 4x
Quantitative Results – Urban 100 dataset
Scale      Bicubic   ScSR     Kim and Kwon  Sub-band  Glasner  SRCNN    A+       Ours
2x - PSNR  26.66     28.26    28.74         28.34     27.85    28.65    28.87    29.38
4x - PSNR  23.14     24.02    24.20         24.19     23.58    24.14    24.34    24.82
2x - SSIM  0.8408    0.8828   0.8940        0.8820    0.8709   0.8909   0.8957   0.9032
4x - SSIM  0.6573    0.7024   0.7104        0.7115    0.6736   0.7047   0.7195   0.7386
~0.5 dB average PSNR improvement over the state-of-the-art methods
Quantitative Results – BSD 100 dataset
On par with the state-of-the-art methods
Scale      Bicubic   ScSR     Kim and Kwon  Sub-band  Glasner  SRCNN    A+       Ours
2x - PSNR  29.55     30.77    31.11         30.73     30.28    31.11    31.22    31.18
3x - PSNR  27.20     27.72    28.17         27.88     27.06    28.20    28.30    28.30
4x - PSNR  25.96     26.61    26.71         26.60     26.17    26.70    26.82    26.85
2x - SSIM  0.8425    0.8744   0.8840        0.8774    0.8621   0.8835   0.8862   0.8855
3x - SSIM  0.7382    0.7647   0.7788        0.7714    0.7368   0.7794   0.7836   0.7843
4x - SSIM  0.6672    0.6983   0.7027        0.7021    0.6747   0.7018   0.7089   0.7108
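For reference, the PSNR figures in these tables follow the standard definition; a minimal sketch (illustrative, not the evaluation script used in the paper):

```python
import numpy as np

def psnr(ref, test, peak=1.0):
    """Peak signal-to-noise ratio in dB, for images with values in [0, peak]."""
    mse = np.mean((ref.astype(np.float64) - test.astype(np.float64)) ** 2)
    return np.inf if mse == 0 else 10.0 * np.log10(peak ** 2 / mse)
```

Note that PSNR is logarithmic in MSE, so the ~0.5 dB gain on Urban 100 corresponds to roughly an 11% reduction in mean squared error (10^(-0.05) ≈ 0.891).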
Ground truth HR image
Input LR image (128 × 96)
Bicubic SR Factor 8x
Internet-scale scene matching [Sun and Hays ICCP 12] SR Factor 8x
#Training images
6.3 million
SRCNN [Dong et al. ECCV 14] SR Factor 8x
#Training images
395,909 from ImageNet
Our result SR Factor 8x
#Training images
1 LR input
Our result: coarse-to-fine super-resolution
Ground truth HR image
Input LR image (128 × 96)
Bicubic SR Factor 8x
Sparse coding [Yang et al. TIP 10] SR Factor 8x
SRCNN [Dong et al. ECCV 14] SR Factor 8x
Our result SR Factor 8x
Our result: coarse-to-fine super-resolution
Ground truth HR image
Input LR image (128 × 96)
Bicubic SR Factor 8x
Internet-scale scene matching [Sun and Hays ICCP 12]    SR Factor 8x
SRCNN [Dong et al. ECCV 14]    SR Factor 8x
Our result SR Factor 8x
Our result: coarse-to-fine super-resolution
Bicubic SR Factor 8x
SRCNN [Dong ECCV 2014] SR Factor 8x
Ours SR Factor 8x
Bicubic SR Factor 8x
SRCNN [Dong ECCV 2014] SR Factor 8x
Ours SR Factor 8x
Low-Res
TI-DTV [Fernandez-Granda
and Candes ICCV 2013]
Ours
SR Factor 4x
Low-Res
TI-DTV [Fernandez-Granda
and Candes ICCV 2013]
Ours
SR Factor 4x
Limitations – Blur Kernel Model
• Suffers from blur kernel mismatch
• Blind SR can estimate the kernel [Michaeli and Irani ICCV 2013] [Efrat et al. ICCV 2013]
• With the ground-truth kernel, we get significant improvement
• External example-based methods would need to retrain their models
Limitations
• Slow computation time
  • On average, 40 seconds to super-resolve a BSD 100 image by 2x on a 2.8 GHz, 12 GB RAM PC
SRF 4x
Ground truth HR Our result
SRCNN [Dong et al. ECCV 14]    A+ [Timofte et al. ACCV 14]
Conclusions
• Super-resolution based on transformed self-exemplars
• No training data, no feature extraction, no complicated learning algorithms
• Works particularly well on urban scenes
• On par with state-of-the-art on natural scenes
Code and data available: http://bit.ly/selfexemplarsr
See us at poster #82
Single Image Super-Resolution from Transformed Self-Exemplars
http://bit.ly/selfexemplarsr