jrtechs
/
jrtechs-NodeJSBlog
mirror of https://github.com/jrtechs/NodeJSBlog.git

This blog post going over the basic image manipulation things you cando with Open CV. [Open CV](https://opencv.org/) is an open-sourcelibrary of computer vision tools. Open CV is written to be used inconjunction with deep learning frameworks like[TensorFlow](https://www.tensorflow.org/). This tutorial is going tobe using Python3, although you can also use Open CV with C++, Java,and [Matlab](https://www.mathworks.com/products/matlab.html) 
# Reading and Displaying Images

The first thing that you want to do when you start playing around withopen cv is to import the dependencies required. Most basic computervision projects with OpenCV will use NumPy and matplotlib. All imagesin Open CV are represented as NumPy matrices with shape (x, y, 3),with the data type uint8. This essentially means that every image is a2d matrix with three color channels for BGR where each pixel can havean intensity between 0 and 255. Zero is black where 255 is white ingrayscale. 

```python# Open cv library
import cv2
# numpy library for matrix manipulation
import numpy as np
# matplotlib for displaying the images 
from matplotlib import pyplot as plt```
Reading an image is as easy as using the "cv2.imread" function.  Ifyou simply try to print the image with Python's print function, youwill flood your terminal with a massive matrix. In this post, we aregoing to be using the infamous[Lenna](https://en.wikipedia.org/wiki/Lenna) image which has been usedin the Computer Vision field since 1973. 

```pythonlenna = cv2.imread('lenna.jpg')
# Prints a single pixel value
print(lenna[50][50])
# Prints the image dimensions
# (width, height, 3 -- BRG)
print(lenna.shape)```
    [ 89 104 220]    (440, 440, 3)

By now you might have noticed that I am saying "BRG" instead of "RGB";in Open CV colors are in the order of "BRG" instead of "RGB". Thismakes it particularly difficult when printing the images using adifferent library like matplotlib because they expect images to be inthe form "RGB". Thankfully for us we can use some functions in theOpen CV library to convert the color scheme. 

```pythondef printI(img):    rgb = cv2.cvtColor(img, cv2.COLOR_BGR2RGB)    plt.imshow(rgb)
printI(lenna)```

![png](media/cv1/output_6_0.png)

Going a step further with image visualization, we can use matplotlibto view images side by side to each other. This makes it easier tomake comparisons when running different algorithms on the same image. 

```pythondef printI3(i1, i2, i3):    fig = plt.figure()    ax1 = fig.add_subplot(1,3,1)    ax1.imshow(cv2.cvtColor(i1, cv2.COLOR_BGR2RGB))    ax2 = fig.add_subplot(1,3,2)    ax2.imshow(cv2.cvtColor(i2, cv2.COLOR_BGR2RGB))    ax3 = fig.add_subplot(1,3,3)    ax3.imshow(cv2.cvtColor(i3, cv2.COLOR_BGR2RGB))        def printI2(i1, i2):    fig = plt.figure()    ax1 = fig.add_subplot(1,2,1)    ax1.imshow(cv2.cvtColor(i1, cv2.COLOR_BGR2RGB))    ax2 = fig.add_subplot(1,2,2)    ax2.imshow(cv2.cvtColor(i2, cv2.COLOR_BGR2RGB))    ```
If we zero out the other colored layers and only left one channel, wecan visualize each channel individually. In the following examplenotice that image.copy() generates a deep-copy of the image matrix --this is a useful NumPy function. 

```pythondef generateBlueImage(image):    b = image.copy()    # set the green and red channels to 0    # note images are in BGR    b[:, :, 1] = 0    b[:, :, 2] = 0    return b

def generateGreenImage(image):    g = image.copy()    # sets the blue and red channels to 0    g[:, :, 0] = 0    g[:, :, 2] = 0    return g
def generateRedImage(image):    r = image.copy()    # sets the blue and green channels to 0    r[:, :, 0] = 0    r[:, :, 1] = 0    return r
def visualizeRGB(image):    printI3(generateRedImage(image), generateGreenImage(image), generateBlueImage(image))```

```pythonvisualizeRGB(lenna)```

![png](media/cv1/output_11_0.png)

# Grayscale Images

Converting a color image to grayscale reduces the dimensionalitybecause you are squishing each color layer into one channel. Open CVhas a built-in function to do this. 

```pythonglenna = cv2.cvtColor(lenna, cv2.COLOR_BGR2GRAY)printI(glenna)```

![png](media/cv1/output_14_0.png)

The builtin function works in most applications, however, yousometimes want more control in which color layers are weighted more ingenerating the grayscale image. To do that you can  

```pythondef generateGrayScale(image, rw = 0.25, gw = 0.5, bw = 0.25):    """    Image is the open cv image    w = weight to apply to each color layer    """    w = np.array([[[ bw, gw,  rw]]])    gray2 = cv2.convertScaleAbs(np.sum(image*w, axis=2))    return gray2```

```pythonprintI(generateGrayScale(lenna))```

![png](media/cv1/output_17_0.png)

Notice that the sum of the weights is equal to 1 if it above 1, itwould brighten the image but if it was below 1, it would darken theimage. 

```pythonprintI2(generateGrayScale(lenna, 0.1, 0.3, 0.1), generateGrayScale(lenna, 0.5, 0.6, 0.5))```

![png](media/cv1/output_19_0.png)

We could also use our function to display the grayscale output of eachcolor layer. 

```pythonprintI3(generateGrayScale(lenna, 1.0, 0.0, 0.0), generateGrayScale(lenna, 0.0, 1.0, 0.0), generateGrayScale(lenna, 0.0, 0.0, 1.0))```

![png](media/cv1/output_21_0.png)

Based on this output, the red layer is the brightest which makes sensebecause the majority of the image is in a pinkish/red tone.  
# Pixel Operations

Pixel operations are simply things that you do to every pixel in theimage. 
## Negative

To take the negative of an image, you simply invert the image. Ie: ifthe pixel was 0, it would now be 255, if the pixel was 0 it would nowbe 255. Since all the images are unsigned ints of length 8, rightonce, a pixel hits a boundary, it would automatically wrap over whichis convenient for us. With NumPy, if you subtract a number from amatrix, it would do that for every element in that matrix -- neat.Therefore if we wanted to invert an image we could just take 255 andsubtract it from the image. 

```pythoninvert_lenna = 255 - lennaprintI(invert_lenna)```

![png](media/cv1/output_25_0.png)

## Darken And Lighten

To brighten and darken an image you can add constants to the imagebecause that would push the image closer twords 0 and 255 which isblack and white. 

```pythonbright_bad_lenna = lenna + 25
printI(bright_bad_lenna)```

![png](media/cv1/output_28_0.png)

Notice that the image got brighter but in some parts the image gotinverted. This is because when we add two images, and we don't want towrap, we have to set a clipping threshold to be the 0 and 255. IE:when we add a constant to the image at pixel 240, we don't want it towrap back to 0, we just want it to retain a value of 255. Open CV hasbuilt-in functions for this. 

```pythondef brightenImg(img, num):    a = np.zeros(img.shape, dtype=np.uint8)    a[:] = num    return cv2.add(img, a)
def darkenImg(img, num):    a = np.zeros(img.shape, dtype=np.uint8)    a[:] = num    return cv2.subtract(img, a)
brighten_lenna = brightenImg(lenna, 50)darken_lenna = darkenImg(lenna, 50)
printI2(brighten_lenna, darken_lenna)```

![png](media/cv1/output_30_0.png)

## Contrast

Adjusting the contrast of an image is a matter of multiplying theimage by a constant. Multiplying by a number greater than 1 wouldincrease the contrast and multiplying by a number lower than 1 woulddecrease the contrast. 

```pythondef adjustContrast(img, amount):    """    changes the data type to float32 so we can adjust the contrast by    more than integers, then we need to clip the values and     convert data types at the end.    """    a = np.zeros(img.shape, dtype=np.float32)    a[:] = amount    b = img.astype(float)    c = np.multiply(a, b)    np.clip(c, 0, 255, out=c) # clips between 0 and 255    return c.astype(np.uint8)```

```pythonprintI2(adjustContrast(lenna, 0.8) ,adjustContrast(lenna, 1.3))```

![png](media/cv1/output_33_0.png)

# Noise

I most cases you don't want to add random noise to your image,however, in some algorithms, it becomes necessary to do for testing.Noise is anything that makes the image imperfect. In the "real world"this is usually in the form of dead pixels on your camera lens orother things distorting your view.  
## Salt and Pepper

Salt and pepper noise is adding random black and white pixels to yourimage. 

```pythonimport random
def uniformNoise(image, num):    img = image.copy()    h, w, c = img.shape    x = np.random.uniform(0,w,num)    y = np.random.uniform(0,h,num)
    for i in range(0, num):        r = 0 if random.randrange(0,2) == 0 else 255        img[int(x[i])][int(y[i])] = np.asarray([r, r, r])            return imgprintI2(uniformNoise(lenna, 1000), uniformNoise(lenna, 7000))```

![png](media/cv1/output_36_0.png)

# Image Denoising

It is possible to remove the salt and pepper noise from an image toclean it up. Unlike how my professor worded it, this is not"enhancing" the image, this is merely using filters that remove thenoise from the image by blurring it.  
## Moving Average

The moving average technique sets each pixel equal to the average ofits neighborhood. The bigger your neighborhood the more the image isblurred. 

```pythonbad_lenna = uniformNoise(lenna, 6000)
blur_lenna = cv2.blur(bad_lenna,(3,3))
printI2(bad_lenna, blur_lenna)```

![png](media/cv1/output_39_0.png)

As you can see, most of the noise was removed from the image but,imperfections were left. To see the effects of the filter size, youcan play around with it. 

```pythonblur_lenna_3 = cv2.blur(bad_lenna,(3,3))blur_lenna_8 = cv2.blur(bad_lenna,(8,8))printI2(blur_lenna_3, blur_lenna_8)```

![png](media/cv1/output_41_0.png)

## Median Filtering

Median filters transform every pixel by taking the median value of itsneighborhood. This is a lot better than average filters for noisereduction because it has less of a blurring effect and it is extremelywell at removing outliers like salt and pepper noise.  

```pythonmedian_lenna = cv2.medianBlur(bad_lenna,3)
printI2(bad_lenna, median_lenna)```

![png](media/cv1/output_43_0.png)

# Remarks

Open CV is a vastly powerful framework for image manipulation. Thispost only covered some of the more basic applications of Open CV.Future posts might explore some of the more advanced techniques incomputer vision like filters, Canny edge detection, template matching,and Harris Corner detection.