Edge detection with the Sobel operator in Ruby


I was never much into image processing. Sure, like most programmers I dabbled into it for cropping images or doing some fancy-schmancy filtering effects stuff. I even wrote a Flickr clone for my last book which has a rather impressive photo editor (mashed up from Pixlr, not mine). But I never thought much on how those effects were done or who came up with them in the first place. That is until I met Irwin Sobel.

For those who know their image processing, this should ring bells immediately. Yes, it’s that Sobel. But a minute to give some background — Irwin is a colleague of mine working in the Mobile and Immersive Experience Lab in HP Labs. I was visiting about two weeks ago and was introduced to him and his current projects. Inevitably someone talked about the Sobel operator, a commonly used algorithm used for edge detection. I was, unfortunately, totally clueless about what it was. Not good. So not surprisingly I ended up Googling for ‘Sobel operator’ at the first possible chance and found out what it was.

The Sobel operator is an algorithm for edge detection in images. Edge detection for those who are not familiar with the term, is an image processing technique to discover the boundaries between regions in an image. It’s an important part of detecting features and objects in an image. Simply put, edge detection algorithms help us to determine and separate objects from background, in an image.

The Sobel operator does this in a rather clever way. An image gradient is a change in intensity (or color) of an image (I’m over simplifying but bear with me). An edge in an image occurs when the gradient is greatest  and the Sobel operator makes use of this fact to find the edges in an image. The Sobel operator calculates the approximate image gradient of each pixel by convolving the image with a pair of 3×3 filters. These filters estimate the gradients in the horizontal (x) and vertical (y) directions and the magnitude of the gradient is simply the sum of these 2 gradients.

The magnitude of the gradient, which is what we use, is calculated using:

That’s the simplified, 2-paragraph theory behind the algorithm. If this fascinates you, you should grab a couple of books on image processing and computer vision and go through them.

Let’s look at how to implement the Sobel operator. This is simply by creating the 2 filters and running them through each pixel in the image, starting from the left and going right. Note that because the filter is a 3×3 matrix, the pixels in the first and last rows as well as the first and last columns cannot be estimated so the output image will be a 1 pixel-depth smaller than the original image.

To calculate the pixel in the right side of the equation (the one with coordinates 1,1) the following equation is used:

output pixel [1,1] = ([0,0] x -1) + ([0,1] x 0) + ([0,2] x 1) + ([1,0] x -2) + ([1,1] x 0) + ([1,2] x 2) + ([2,0] x -1) + ([2,1] x 0) + ([2,2] x 1)

To simplify matters even more, the grayscale version of the original image is usually used.

Now let’s look at the Ruby implementation


require 'chunky_png'

class ChunkyPNG::Image
  def at(x,y)
    ChunkyPNG::Color.to_grayscale_bytes(self[x,y]).first
  end
end

img = ChunkyPNG::Image.from_file('engine.png')

sobel_x = [[-1,0,1],
           [-2,0,2],
           [-1,0,1]]

sobel_y = [[-1,-2,-1],
           [0,0,0],
           [1,2,1]]

edge = ChunkyPNG::Image.new(img.width, img.height, ChunkyPNG::Color::TRANSPARENT)

for x in 1..img.width-2
  for y in 1..img.height-2
    pixel_x = (sobel_x[0][0] * img.at(x-1,y-1)) + (sobel_x[0][1] * img.at(x,y-1)) + (sobel_x[0][2] * img.at(x+1,y-1)) +
              (sobel_x[1][0] * img.at(x-1,y))   + (sobel_x[1][1] * img.at(x,y))   + (sobel_x[1][2] * img.at(x+1,y)) +
              (sobel_x[2][0] * img.at(x-1,y+1)) + (sobel_x[2][1] * img.at(x,y+1)) + (sobel_x[2][2] * img.at(x+1,y+1))

    pixel_y = (sobel_y[0][0] * img.at(x-1,y-1)) + (sobel_y[0][1] * img.at(x,y-1)) + (sobel_y[0][2] * img.at(x+1,y-1)) +
              (sobel_y[1][0] * img.at(x-1,y))   + (sobel_y[1][1] * img.at(x,y))   + (sobel_y[1][2] * img.at(x+1,y)) +
              (sobel_y[2][0] * img.at(x-1,y+1)) + (sobel_y[2][1] * img.at(x,y+1)) + (sobel_y[2][2] * img.at(x+1,y+1))

    val = Math.sqrt((pixel_x * pixel_x) + (pixel_y * pixel_y)).ceil
    edge[x,y] = ChunkyPNG::Color.grayscale(val)
  end
end

edge.save('engine_edge.png')

First thing you’d notice is that I used a library called ChunkyPNG, which is PNG manipulation library that is implemented in pure Ruby. While wrappers over ImageMagick (like RMagick) is probably the defacto image processing and manipulation library in Ruby, I thought it’s kind of pointless to do a Sobel operator with ImageMagick since it already has its own edge detection implementation.

To simplify the implementation, I opened up the Image class in ChunkyPNG and added a new method that will return a grayscale pixel at a specific location. Then I created the 2 Sobel filters with arrays of arrays. I created 2 nested loops to iterate through each pixel column by column, then row by row and at each pixel I used the equation above to calculate the gradient by applying the x filter then the y filter. Finally I used the gradient and set a grayscale pixel based on the gradient value, on a new image.

Here you can see the original image, which I reused from the Wikipedia entry on Sobel operator.

And the edge detected image with the x filter applied only.

This is the edge detected image with the y filter only.

Finally this is the edge detected image with both x and y filters applied.

This short exercise might not be technically challenging but it made me appreciate the pioneers who invented things that we now take for granted. Here’s a final picture, one with myself and Irwin (he is the guy who’s sitting opposite me), and a bunch of other colleagues at HP Labs Palo Alto over lunch. Thanks Irwin, for the Sobel operator!

29 thoughts on “Edge detection with the Sobel operator in Ruby

    1. thanks! -What fatonicnul forms did you use for the galaxy model and point spread function models, and how did you determine these?We ended up using an exponential profile for the galaxy model and a moffat distribution for the PSF. We also experimented with add a de Vacouleurs component to the galaxy model. Our software was designed so that it would be easy to swap and combine models. To assess the quality of our models we studied the pixel-level residuals (stacked from thousands of images) between the predicted and observed image.-Which neural network implementation in TMVA did you use?The Multi-Layer Perceptron implementation with BFGS training.-What parameters for the network did you use?The parameters from the model fits were fed to the network as inputs. We trained the network to “learn” corrections to the fitted ellipticity values.-How did you select these parameters?Trial and error for the most part. We had trouble with convergence feeding ALL of the information we had available for each of the images at first. So we started from a smaller subset of the parameters and systematically added parameters and evaluated the results.

  1. I need a code for performing sobel operation in java where the input image is read from a file and edge detected image is stored in a file….plz reply…

  2. I like this post, however in your ruby implementation of the sobel operator, I don’t believe you perform any thresholding on your image, you merely set each pixel to it’s calculated magnitude value instead of setting a threshold and only setting an edge to a white value if the gradient is above a certain value. Would this not be the proper way to implement the sobel for edge detection?

  3. Great post, helped me understand it well! But I am struggling to find the original paper on Sobels’ work to reference, do you know of it?
    thanks

  4. I’m having a problem with edge detection using Sobel operator: it produces too many false edges, effect is shown on pictures below. I’m using a 3×3 sobel operator – first extracting vertical then horizontal, final output is magnitude of each filter output. Edges on synthetic images are extracted properly but natural images produce have too many false edges or “noise” even if image is preprocessed by applying blur or median filter. What might be cause of this? Is it implementation problem (then: why synthetic images are fine?) or I need to do some more preprocessing?

  5. The relation between two countries is not cordial but that air of antagonism
    has never stopped or forced any business from withdrawing any of their business initiatives.
    If you want to get into reselling of used printer cartridges, you will
    need to source for additional empty cartridges to make
    your efforts viable. Its weight is 122 grams and casing measures 111 x 51 x 13.

    Hi friends, how is all, and what you would like to
    say regarding this article, in my view its actually amazing
    for me.

  6. Thanks for another excellent post. The place else may just anybody
    get that type of info in such an ideal way of writing?

    I have a presentation subsequent week, and I’m at the look for such info.

  7. 私は大学生なのですが、数年後までにはアパレル業界で正社員として働きたいと考えています。
    アパレル業界でバリバリに働きたいです。
    どうすればユニクロとかで店員として働けるのでしょうか?

  8. Hi My doubt here is in a 3*3 matrix only the middle pixel value(1,1) is calculated?? I am rather confused how the output of other pixels (0,0..0,1) will be??
    Please help me out..

  9. Hi admin, do you monetize your site ? There is easy method to
    earn extra money every month, just search on youtube – How to earn with wordai
    4

  10. I have noticed you don’t monetize your website,
    don’t waste your traffic, you can earn additional bucks every month because you’ve got high
    quality content. If you want to know how to make extra money, search for: Boorfe’s tips best adsense alternative

Leave a comment