Share @ LinkedIn Facebook  python, imagemanipulation
Image Processing using Pillow

Note: This tutorial is inspired by Pillow Tutorial

What is PIL (Python Imaging Library)?

Python Imaging Library is used for loading images in memory and doing various transformations on the image. It supports loading images of various types like PNG, JPEG, etc. It can be used to do various transformations on image and then store that transformed image. PIL is generally used in deep learning where preprocessing of images is required as part of object detection tasks. Image loaded using PIL can be easily converted to Numpy array as many Machine Learning and Deep Learning libraries take numpy array as input. Various deep learning libraries use PIL behind the scene to do image pre-processing.

How to install PIL?

pip install Pillow

Importing PIL and using it for image manipulation

In [377]:
import PIL
from PIL import Image, ImageFilter, ImageEnhance, ImageSequence
import numpy as np
import secrets
import matplotlib.pyplot as plt


Loading an image in memory.

Image is the main class of PIL which is responsible for representing an image. Whenever you load an image in memory using PIL it'll be of class Image.

open() method is used to open a file. This method loads file lazily as it reads the header of an image file to find out image format, mode, and size. It does not load the whole image into memory which can be of few MB for images with higher resolution. This can be helpful to find out image metadata without loading it in memory.

Note: Please save a few png, jpeg images in a directory where jupyter notebook is kept to load images in memory and do various operations. Also, change the image name according to the below code to load your images.

In [ ]:'ml.jpeg')


In [379]:
img ='ml1.png').resize((380,250)); img2 ='ml.jpeg').resize((380,250))
type(img), type(img2)
(PIL.Image.Image, PIL.Image.Image)
  • size attribute of Image instance refers to the size of the image in pixels. (width, height)
  • mode attribute refers to in which mode image is loaded which is generally RGB, RGBA, CMYK, L, etc. One can get more information about mode, pallette, bands, etc over here
  • format refers to the format of the source file from which image was loaded, It'll be None for an image generated after processing an original image. One can read about formats supported over here
In [380]:
(img.size,img.mode, img.format,, (img2.size,img2.mode, img2.format,
(((380, 250), 'RGBA', None, {'srgb': 0, 'gamma': 0.45455, 'dpi': (96, 96)}),
 ((380, 250), 'RGB', None, {}))
In [ ]:
plt.hist(img.histogram(),bins=256,range=(0,255),color='red', label='Density of pixels within image')


  • show() function saves the image and then loads it in default image application on the machine.
  • convert() function is used to convert an image from one mode to another. Currently conversion only between images with mode types 'L', 'RGB' and 'CMYK'
In [382]:
img = img.convert('L')

Save image using Image class instance method save()

  • It's required to provide the file format in which to save the file. We can give format as a parameter to save() method or we can include it in the filename as well. save() method will fail if the format is not provided as part of a file name or as format parameter.
In [383]:'img_rgb.jpg');'temp',format="jpeg")

Creating thumbnail

  • We can use thumbnail() method of Image class to create thumbnail from an image. We need to provide a size parameter as a tuple (height, width).PIL will create an image that has the same aspect ratio as the original image. The resulting thumbnail will not be the same size as you provided as parameter. But it'll not be larger than that size.
  • Also make a note that thumbnail() method works in-place which means that it'll modify original image object rather than returning a new modified object.
  • one can use size attribute of Image class to check the size of the resulting image after converting to thumbnail.
  • size attribute of Image class always returns tuple in format (width, height).
In [384]:
(128, 84)

Resizing Image

resize() method is used to resize the method. It requires a new size of an image as a tuple first parameter. One can maximize and minimize image using this method.

In [385]:
img ='ml1.png').resize((380,250))

Cropping Image

crop() method is used to crop a particular part of an image. It requires a tuple of 4 integers (left, upper, right, lower). This number must be within a range of size of image otherwise error will be raised while cropping. Image is represented using a coordinated system with (0,0) as upper-left corner and (width, height) as a lower-right corner.

crop() method is lazy as it does not actually crop image until load() function is called to load a cropped image.

  • One can find width and height of resulting image using below formula:
    • width = right - left, height = lower - upper.

size attribute of Image class always returns tuple in format (width, height).

In [386]:
cropped_img = img.crop((100,100,250,150))
print('size of cropped image : %s'%str(cropped_img.size))
size of cropped image : (150, 50)

Flip/Rotate an Image

  • transpose() function is used to flip/rotate image in various angles. Various flip/rotate types are Image.FLIP_LEFT_RIGHT, Image.FLIP_TOP_BOTTOM, Image.ROTATE_180, Image.ROTATE_90, Image.ROTATE_270, Image.TRANSPOSE and Image.TRANSVERSE

  • rotate() function is used to rotate the image by certain degrees. It takes on a parameter which must be between (0,360).

In [ ]:


In [ ]:


Converting Image object to numpy array and vice-versa

  • We can directly give Image class instance as input to numpy.array() method. It'll convert image to numpy array based on mode and it'll create channels accordingly.

  • We can create Image instance from numpy array again with Image.fromarray() function.

In [389]:
nparr = np.array(img)
nparr.shape, nparr.dtype, nparr
((250, 380, 4), dtype('uint8'), array([[[  4,   6,  44, 255],
         [  4,   6,  44, 255],
         [  6,   8,  47, 255],
         [  6,   8,  47, 255]],

        [[  5,   7,  45, 255],
         [  5,   7,  45, 255],
         [  6,   8,  47, 255],
         [  6,   8,  47, 255]],


        [[  4,   7,  42, 255],
         [  4,   7,  42, 255],
         [  5,   7,  45, 255],
         [  5,   7,  46, 255]],

        [[  3,   6,  41, 255],
         [  3,   6,  41, 255],
         [  5,   7,  45, 255],
         [  5,   7,  46, 255]]], dtype=uint8))
In [ ]:
img2 = Image.fromarray(nparr); Image.fromarray(nparr)


In [391]:
Image.fromarray(np.random.randint(0,255,size=(150, 150, 3), dtype=np.uint8))

Create image from bytes and vice-versa.

  • frombytes() function allows us to create an image from a byte stream.
  • We need to specify the mode, size of image and data.
  • Please make a note that for RGB, RGBA modes, image data should be provided for all channels but size will be (width, height) only.
  • tobytes() function is used to create a byte stream from an image.
In [392]:
Image.frombytes('L', size=(100,100), data = secrets.token_bytes(10000))
In [393]:
Image.frombytes('RGB', size=(100,100), data = secrets.token_bytes(30000))
In [394]:
Image.frombytes('RGBA', size=(100,100), data = secrets.token_bytes(40000));
In [395]:
byte_stream = img.tobytes()
print('Size of bytes : %d'%len(byte_stream))
print('Image byte size from dimensions %d'%(img.size[0]*img.size[1]*len(img.getbands())))
Image.frombytes(img.mode, img.size, byte_stream)
Size of bytes : 380000
Image byte size from dimensions 380000

Splitting image into bands and merging bands in various order

  • split() method is used to split methods into bands which will (R,G,B) for RGB, (R,G,B,A) for RGBA, (C,M,Y,K) for CMYK and (L,) for L (Luminescence). Each of the bands returned itself will be an instance of Image class again so it makes easy to work on parts and then combine again.

  • merge() is used to merge different bands and create one unified image of those bands. It takes the first parameter as mode and the second parameter as tuple containing bands according to mode. bands must have the same size in order for a merge to work correctly.

In [ ]:
img =img.convert('RGB')
r, g, b = img.split()
Image.merge('L', (b,))


In [ ]:
Image.merge('RGBA', (r, g, b, b))


In [ ]:
Image.merge('RGB', (b, r, g))


Pasting images into another image

  • paste() method is used to paste cropped image/image to another image.
  • User needs to provide place where to paste the image with a tuple (left, upper, right, lower). Users can only provide (left, upper) as well for place and it'll work fine.
  • One can also use mask which can be used to make pasted images with transparency. The masked image will have values between (0, 255) with 0 values being totally transparent and 255 is opaque.
  • One can use this function to create a collage.
In [ ]:
img ='ml1.png').resize((380,230))
cropped_region = img.crop((100,100,150,150))


Applying various filters on image for effects

  • filter() method is used to apply various filters on an image like a blur, smooth, emboss, etc. It accepts one argument which is a filter. The filter is from class ImageFilter.
  • All effects generate by doing convolution operation on the image.Read about convolution to get idea about it.
  • All effects have (3,3) or (5,5) kernel which gets convolved on the image. Kernel refers to an array of that size.
In [400]:
kernel_size = ImageFilter.BLUR.filterargs[0]
print('Kernel Size : %s'%str(kernel_size))
print('Kernel : %s'%str(np.array(ImageFilter.BLUR.filterargs[3]).reshape(kernel_size)))
Kernel Size : (5, 5)
Kernel : [[1 1 ... 1 1]
 [1 0 ... 0 1]
 [1 0 ... 0 1]
 [1 1 ... 1 1]]
In [401]:
kernel_size = ImageFilter.EMBOSS.filterargs[0]
print('Kernel Size : %s'%str(kernel_size))
print('Kernel : %s'%str(np.array(ImageFilter.EMBOSS.filterargs[3]).reshape(kernel_size)))
Kernel Size : (3, 3)
Kernel : [[-1  0  0]
 [ 0  1  0]
 [ 0  0  0]]
In [402]:
kernel_size = ImageFilter.SMOOTH.filterargs[0]
print('Kernel Size : %s'%str(kernel_size))
print('Kernel : %s'%str(np.array(ImageFilter.SMOOTH.filterargs[3]).reshape(kernel_size)))
Kernel Size : (3, 3)
Kernel : [[1 1 1]
 [1 5 1]
 [1 1 1]]
In [403]:
kernel_size = ImageFilter.CONTOUR.filterargs[0]
print('Kernel Size : %s'%str(kernel_size))
print('Kernel : %s'%str(np.array(ImageFilter.CONTOUR.filterargs[3]).reshape(kernel_size)))
Kernel Size : (3, 3)
Kernel : [[-1 -1 -1]
 [-1  8 -1]
 [-1 -1 -1]]