Join Bill Weinman for an in-depth discussion in this video Image analysis, part of Code Clinic: C++.
- Hello, and welcome to Code Clinic. My name is Bill Wyman. Code Clinic is a monthly course where a unique problem is introduced to a collection of lynda.com authors. In response, each author will create a solution using their programming language of choice. You can learn several things from Code Clinic, different approaches to solving a problem, the pros and cons of different languages and some tips and tricks to encorporate into your own coding practices. This month we'll work on a problem centered around image analysis. In one sense, this is simply data analysis.
Images are really nothing more than specialized and well-defined sets of data. An image consists of pixels. Pixels consist of data representing the color of the pixel and in some cases, the pixels transparency. The pixels are arranged in rows and columns. When assembled correctly, they represent an image. Our brains are very good at recognizing patterns, but computers are not. Think about captcha security devices, those puzzles you sometimes see when logging into a website. The captcha asks what letters and numbers are in the image, information obscured by random lines, sometimes overlapping transparent blocks of color.
All of these intersecting shapes make it difficult for a computer program to separate the background noise from the actual data. Another example is the test to determine color blindness. Letters and numbers are hidden in a circle filled with different colored dots. If you're colorblind, you'll not be able to see the numbers. For a computer program, this can be incredibly difficult as it requires detecting an edge, as well as recognizing the overall shape. It's difficult even for the most advanced programmer. In this problem we're trying to solve a common problem for many photographers, plagiarism.
A photographer will take a picture and post it on the internet only to discover someone has stolen their image and placed a subset of that image on their website. For example, here is an image and then a cropped version of that image. It would be extremely handy if there was a program searching the internet for cropped versions of an original image, so a photographer could protect their rights. In fact, Google image search will do just that, but we're curious how it works and what the required code might look like. Here's the challenge.
Given two images, determine if one image is a subset of the other image. We'll assume they're both JPEG files, that the resolution is the same, as well as the bit depth. We've provided a set of images. The program should return a table showing which images are cropped versions of other images. You may want to pause now and create a solution of your own. How would you solve the problem? In the next videos I'll show you how I solved this challenge.
Bill introduces challenges and provides an overview of his solutions in C++. Challenges include topics such as statistical analysis, searching directories for images, and accessing peripheral devices.
Visit other courses in the series to see how to solve the exact same challenges in languages like C#, Java, PHP, Python, and Ruby.
Skill Level Intermediate
Q: I am unable to access the Lake Pend Oreille data from outside the U.S.
A: A static copy of this data is provided here for lynda.com members outside of the U.S