Join David Gassner for an in-depth discussion in this video Identify the image subset, part of Code Clinic: C#.
Hello and welcome to Code Clinic. My name is David Gassner. Code Clinic is a monthly course where a unique problem is introduced to a collection of Lynda.com authors. In response, each author will create a solution using their programming language of choice. You can learn several things from Code Clinic: Different approaches to solving a problem, the pros and cons of different languages, and some tips and tricks to incorporate into your own coding practices.
This is a problem centered around image analysis. In one sense, this is simply data analysis. Images are really nothing more than specialized and well-defined sets of data. An image consists of pixels. Pixels consist of data, representing a color, and in some cases, transparency. The pixels are arranged in rows and columns. When assembled correctly they represent an image.
Our brains are very good at recognizing patterns, but computers are not. Think about CAPTCHA security devices, those puzzles you sometimes see when logging into a website. The CAPTCHA asks what letters and numbers are in the image. Information is obscured by random lines, or sometimes overlapping transparent blocks of color. All of those intersecting shapes make it difficult for a computer program to separate the background noise from the actual data.
Another example is the test to determine color blindness. Letters and numbers are hidden in a circle filled with different color dots. If you're color blind, you won't be able to see the numbers. For a computer program this can be incredibly difficult, as it requires detecting an edge, as well as recognizing the overall shape. It's difficult even for the most advanced programmer. In this problem we're trying to solve a common issue for many photographers: Plagiarism.
A photographer will take a picture and post it on the Internet, only to discover someone has stolen their image and placed a subset of that image on their website. For example, here is an image, and then a cropped version of the same image. It would be extremely handy to have a program searching the Internet for cropped versions of an original image, so that a photographer could protect their rights. In fact, Google Image Search will do just that, but we're curious how it works and what the required code might look like.
Here's the challenge: Given two images, determine if one image is a subset of the other image. We'll assume that they're both JPEG files, that the resolution is the same, as well as the bit depth. We've provided a set of images. The program should determine which images are cropped versions of other images. Perhaps you'll want to pause and create a solution of your own. How would you solve the problem? In the next videos I'll show you how I solved this challenge.
David introduces challenges and then provides an overview of his solutions in C#. Challenges include topics such as statistical analysis, searching directories for images, and accessing peripheral devices.
Visit other courses in the series to see how to solve the exact same challenges in languages like C++, Java, PHP, Python, and Ruby.
Skill Level Intermediate
Q: Why can't I access the Lake Pend Orielle site (http://lpo.dt.navy.mil)?
A: The Lake Pend Orielle site is not accessible in some geographical areas. We have contacted the owner of the server to try to resolve this issue.
Q: I am unable to access the Lake Pend Oreille data from outside the U.S.
A: A static copy of this data is provided here for lynda.com members outside of the U.S