Fast image scaling using HAAR

Page 1

CodeProject: Fast Dyadic Image Scaling with Haar Transform. Free so...

1 di 4

Multimedia Âť General Graphics Âť Image classes License (GPL)

http://www.codeproject.com/KB/graphics/Dyadic_down_sampling.as...

License: The GNU General Public VC8.0, C++, Windows, MFC, Visual Studio, VS2005, Dev

Fast Dyadic Image Scaling with Haar Transform

Posted : 18 Oct 2007 Updated : 18 Oct 2007 Views : 12,785

By Chesnokov Yuriy

This article demonstrates the use of Haar transform for dyadic image scaling with MMX optimization

6 votes for this Article. Popularity: 3.61 Rating: 4.64 out of 5

1 2 3 4 5

Download demo project - 6.86 KB Download source - 18.93 KB

Introduction This is the fast dyadic image down sampling class based on Haar transform. It extends BaseFWT2D class from my other article 2D Fast Wavelet Transform Library for Image Processing for this specific purpose. It uses MMX optimization and is applicable in the image processing field where you perform dyadic down sampling: 2, 4, 8, 16, 32 ... pow(2, N) times. I use that code as a preprocessing in the face detection process.

Background You need to be familiar with Haar transform.

Using the Code I've arranged console project allocating RGB array for 640x480 image and implementing several runs of down sampling to gather statistics and output average time for it. I used the precision time counter - I remember I downloaded it some long time ago from The Code Project. On my 2.2GHz TravelMate under licensed Vista it runs 5-6ms for down sampling this image to 80x60, eight times smaller. The classes in the project are: vec1D //1D vector wrapper

vec2D //2D vector wrapper

BaseFWT2D //abstract base class for 2D FWT

Haar : public BaseFWT2D //Haar based down sampling

ImageResize //provides RGB data down sampling

You can learn about vec1D and BaseFWT2D from my 2D Fast Wavelet Transform Library for Image Processing article and about vec2D from my other article 2D Vector Class Wrapper SSE Optimized for Math Operations. The ImageResize class contains three objects of class Haar for red, green and blue channels down

30/07/2008 16.30


CodeProject: Fast Dyadic Image Scaling with Haar Transform. Free so...

2 di 4

http://www.codeproject.com/KB/graphics/Dyadic_down_sampling.as...

sampling. First, you need to initialize the ImageResize object to specific width, height and down sampling ratio: void init(unsigned int w, unsigned int h, float zoom = 0.125f);

The zoom is the image down sampling factor, with resulting image down sampled by 1/zoom times. The default one (0.125f) provides 8 times down sampled image. You can down sample the image only with zoom equal to 1/2, 1/4, 1/8, ... 1/pow(2,N). Then you can proceed with down sampling incoming images with either of the overloaded functions: int resize(const unsigned char* pBGR);

int resize(const unsigned char* pR, const unsigned char* pG, const unsigned char* pB) const;

The first one takes RGB stream with the first byte in the triplet for blue channel and the last one for red. The second takes the RGB channels in separate buffers. //your bitmap data goes in that fashion //unsigned char* pBGR = new unsigned char[width*height*3]; unsigned int width = 640; unsigned int height = 480; float zoom = 0.25; ImageResize resize; resize.init(width, height, zoom); //keep resizing incoming data after initialization. resize.resize(pBGR);

To access down sampled image, the following functions are defined: char** getr() const;

char** getg() const;

char** getb() const;

Note they provide 2D char pointers to the data in char range -128 ... 127. //print out resized red channel char** pr = resize.getr(); for(unsigned int y = 0; y < height * zoom; y++) { for(unsigned int x = 0; x < width * zoom; x++) wprintf(L" %d", (pr[y][x] + 128)); wprintf(L"\n"); }

You can also access down sampled gray version of the RGB bitmap after resize() call with: inline const vec2D* gety() const;

It returns the pointer of vec2D type to it. I've written rgb2y(int r, int g, int b) function to convert a single RGB triplet to gray pixel with SSE optimization, however I use simple floating point arithmetic currently in that version of class and turn on the compiler's SSE optimization. It actually runs slightly faster than my SSE optimized function (have to look at that a moment later). The Haar extension to the BaseFWT2D is pretty simple. I've provided implementations for virtual functions BaseFWT2D::transrows() and BaseFWT2D::transcols() (I have not written it for

30/07/2008 16.30


CodeProject: Fast Dyadic Image Scaling with Haar Transform. Free so...

3 di 4

http://www.codeproject.com/KB/graphics/Dyadic_down_sampling.as...

BaseFWT2D::synthrows() and BaseFWT2D::synthcols() since this is a down sampling class and not up sampling yet). They are MMX optimized and the math behind Haar transform is that you take 2 consecutive pixels, and calculate their mean. So you first decrease the size of your image twice along the horizontal direction and the same along the vertical. It is easy when you do this column wise but with a single row, you have to select even and odd consecutive pixels and just average them in parallel. I do it this way: unsigned char* sour; __m64 m00FF; m00FF.m64_u64 = 0x00FF00FF00FF00FF; __m64 *msour = (__m64 *)sour; //even coeffs __m64 even = _mm_packs_pu16(_mm_and_si64 (*msour, m00FF), _mm_and_si64(*(msour + 1), m00FF)); //odd coeffs __m64 odd = _mm_packs_pu16(_mm_srli_pi16(*msour, 8), _mm_srli_pi16(*(msour + 1), 8)); msour += 2;

Points of Interest The Haar class could be modified with SSE2 integer intrinsic for even faster processing, I hope I can implement it later and submit the update, otherwise if someone interested is eager to modify it with SSE2 support, please let me know. I bet it could do the same 640x480 down sampling to 80x60 for about 1-2ms with SSE2.

History 18th October, 2007: Initial post

License This article, along with any associated source code and files, is licensed under The GNU General Public License (GPL)

About the Author Chesnokov Yuriy

Former Cambridge University post-doc (http://www-ucc.ch.cam.ac.uk/research/yc274-research.html) currently lives in Krasnodar, Russia and doing some contract research for third parties. Research intrests in digital signal processing in medicine, image and video processing, pattern recognition, AI methods, computer vision. You may approach me for the code/research development in the above areas (chesnokov_yuriy at mail dot ru, chesnokov.yuriy at gmail dot com). Publications:

Mvp

Complexity and spectral analysis of the heart rate variability dynamics for distant prediction of paroxysmal atrial fibrillation with artificial intelligence methods. Artificial Intelligence in Medicine. 2008. V43/2. PP. 151-165 (http://dx.doi.org/10.1016/j.artmed.2008.03.009) Face Detection C++ Library with Skin and Motion Analysis. Biometrics AIA 2007 TTS. 22 November 2007, Moscow, Russia. (http://www.dancom.ru/rus/AIA/2007TTS/ProgramAIA2007TTS.html) Screening Patients with Paroxysmal Atrial Fibrillation (PAF) from Non-PAF Heart Rhythm Using HRV Data Analysis. Computers in Cardiology 2007. V. 34. PP. 459–463 (http://www.cinc.org/Proceedings/2007/pdf/0459.pdf)

30/07/2008 16.30


CodeProject: Fast Dyadic Image Scaling with Haar Transform. Free so...

4 di 4

http://www.codeproject.com/KB/graphics/Dyadic_down_sampling.as...

Distant Prediction of Paroxysmal Atrial Fibrillation Using HRV Data Analysis. Computers in Cardiology 2007. V. 34. PP. 455-459 (http://www.cinc.org/Proceedings/2007/pdf/0455.pdf) Individually Adaptable Automatic QT Detector. Computers in Cardiology 2006. V. 33. PP. 337-341 http://www.cinc.org/Proceedings/2006/pdf/0337.pdf) Past/recent outsourcing code/research: www.ayonix.com - face recognition C/MATLAB www.system7.co.uk - CBIR C#/ASP.NET (cbir.system7.com) private enterprise in UK - pedestrian detection C++/CLI www.trulyintelligent.com - SpeechSieve consulting in AI www.devline.ru - video codecs C++ Occupation: Software Developer Location:

Russian Federation

Discussions and Feedback 18 messages have been posted for this article. Visit http://www.codeproject.com/KB/graphics/Dyadic_down_sampling.aspx to post and view comments on this article, or click here to get a print view with messages. PermaLink | Privacy | Terms of Use Last Updated: 18 Oct 2007 Editor: Deeksha Shenoy

Copyright 2007 by Chesnokov Yuriy Everything else Copyright Š CodeProject, 1999-2008 Web09 | Advertise on the Code Project

30/07/2008 16.30


Turn static files into dynamic content formats.

Create a flipbook
Issuu converts static files into: digital portfolios, online yearbooks, online catalogs, digital photo albums and more. Sign up and create your flipbook.