Object finder for photographic images

6829384
Add to folder: View Folders  
Keywords to Highlight:

full-text

print

pdf

permalink

Inventors

Schneiderman, Henry
Kanade, Takeo

Application #

795208

Filed

Feb-28-2001

Published

Dec-7-2004

Current US Class

375/240.19
382/154
382/240
382/285

International Classes

G06K 009/00

Field of Search

382/115 382/118 382/159 382/180 382/209 382/154 382/162 382/165 382/285 382/240 706/21 706/52 700/83 375/240.19 345/810 345/840 348/42 348/51

Assignee

Carnegie Mellon University (Pittsburgh, PA)

Examiners

Mehta; Bhavesh M.

Attorney, Agent or Firm

Kirkpatrick & Lockhart LLP

US Patent References

5642431   Network-based syst...
5710833   Detection, recogniti...
6072893   Method and system...
6128397   Method for finding...
6134339   Method and appar...
6192145   Method and appar...
6211515   Adaptive non-unifo...
6272231   Wavelet-based faci...
6381280   Single chip motion...
6567081   Methods and arran...
6597739   Three-dimensional...
6671391   Pose-adaptive face...

Referenced by:

View Backward References

Other References

Amit et al., Discussion of the Paper "Arcing Classifiers" by Leo Breiman, The Annals of Statistics, vol. 26, No. 3, 1998, pp. 833-837. Breiman, L., Arcing Classifiers, The Annals of Statistics, vol. 26, No. 3, 1998, pp. 801-823. Burel et al., Detection and Localization of Faces on Digital Images, Pattern Recognition Letters 15, 1994, pp. 963-967. Burl et al., Recognition of Planar Object Classes, CVPR 1996, pp. 223-230. Colmenarez et al., Face Detection with Information-Based Maximum Discrimination, CVPR 1997, pp. 782-787. Cosman et al., Vector Quantization of Image Subbands: A Survey, IEEE Trans. On Image Processing, vol. 5, No. 2, Feb. 1996, pp. 202-225. Dietterich, T.G., Discussion of the Paper "Arcing Classifiers" by Leo Breiman, The Annals of Statistics, vol. 26, No. 3, 1998, pp. 838-841. Domingos et al., On the Optimality of the Simple Bayesian Classifier under Zero-One Loss, Machine Learning, 29, 1997, pp. 103-130. Freund et al., A Decision-theoretic Generalization of On-line Learning and an Application to Boosting, Journal of Computer and System Sciences, vol. 55, No. 1, 1997, pp. 119-139. Freund et al., Discussion of the Paper "Arcing Classifiers" by Leo Breiman, The Annals of Statistics, vol. 26, No. 3, 1998, pp. 824-832. Moghaddam et al., Probabilistic Visual Learning for Object Representation, IEEE Trans. on Pattern Analysis and Machine Intelligence vol. 19, No. 7, Jul. 1997, pp. 696-710. Osuna et al., Training Support Vector Machines: An Application to Face Detection, CVPR 1997, pp. 130-136. Roth et al., A SNoW-Based Face Detector, Neural Information Processing Systems, 1999, pp. 862-868. Rowley et al., Neural Network-Based Face Detection, IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 20, No. 1, Jan. 1998, pp. 23-38. Schapire et al., Improved Boosting Algorithms Using Confidence-rated Predictions, Machine Learning, 37(3), 1999, pp. 297-336. Schneiderman et al., Probabilistic Modeling of Local Appearance and Spatial Relationships for Object Recognition, CVPR 1998, pp. 45-51. Strang et al., Wavelets and Filter Banks, Wellesley-Cambridge Press, 1997, pp. 1-35, 103-142, 216-218. Sung et al., Example-based Learning for View-based Human Face Detection, M.I.T. AI Memo No. 1521, 1994, pp. 1-20. Sung, K., Learning and Example Selection for Object and Pattern Detection, M.I.T. AI Lab. Tech Report No. 1572, 1996, pp. 1-195. Vapnik, V.N., The Nature of Statistical Learning Theory, Springer, 1995, pp. 127-137. Yang et al., Human Face Detection in a Complex Background, Pattern Recognition, 27(1), 1994, pp. 53-63.

Citation

Cite This Patent

More From Subclass 154

5754680   Object shape expre...
5815593   Laser surface char...
5127061   Real-time three-di...
6236749   Image recognition...
5845006   Method of processi...
6839144   Optical displaceme...
6424877   Reproduction appa...
5870490   Apparatus for extra...
6496597   Generating image...
6496598   Image processing...
5604817   Tooth measurement...
4641351   Logical transform i...
 

More From Class 382

6434267   Interpretation of the...
6961480   Off-grid interpolatio...
6067377   Color image formi...
6721939   Electron beam shot...
6142943   Doppler ultrasound...
6877097   Security access met...
6915014   Image processing...
6658150   Image recognition...
5060082   Facsimile apparatus
5850481   Steganographic sys...
5881173   Bit-plane coding de...
6263095   Imaging method a...
 
Abstract
An object finder program for detecting presence of a 3D object in a 2D image containing a 2D representation of the 3D object. The object finder uses the wavelet transform of the input 2D image for object detection. A pre-selected number of view-based detectors are trained on sample images prior to performing the detection on an unknown image. These detectors then operate on the given input image and compute a quantized wavelet transform for the entire input image. The object detection then proceeds with sampling of the quantized wavelet coefficients at different image window locations on the input image and efficient look-up of pre-computed log-likelihood tables to determine object presence. The object finder's coarse-to-fine object detection strategy coupled with exhaustive object search across different positions and scales results in an efficient and accurate object detection scheme. The object finder detects a 3D object over a wide range in angular variation (e.g., 180 degrees) through the combination of a small number of detectors each specialized to a small range within this range of angular variation.
 
Claims
What is claimed is:

1. A method to detect presence of a 3D (three dimensional) object in a 2D (two dimensional) image containing a 2D representation of said 3D object, said method comprising:

receiving a digitized version of said 2D image;

selecting one or more view-based detectors;

for each view-based detector, computing a wavelet transform of said digitized version of said 2D image, wherein said wavelet transform generates a plurality of transform coefficients, and wherein each transform coefficient represents visual information from said 2D image that is localized in space, frequency, and orientation;

applying said one or more view-based detectors in parallel to respective plurality of transform coefficients, wherein each view-based detector is configured to:



Description
BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention broadly relates to image processing and image recognition, and more particularly, to a system and method for detecting presence of 3D (three dimensional) objects in a 2D (two dimensional) image containing 2D representation of the 3D objects.

2. Description of the Related Art

Object recognition is the problem of using computers to automatically locate objects in images, where an object can be any type of three dimensional physical entity such as a human face, automobile, airplane, etc. Object detection involves locating any object that belongs to a category such as the class of human faces, automobiles, etc. For example, a face detector would attempt to find all human faces in a photograph, but would not make finer distinctions such as identifying each face.

The challenge in object detection is coping with all the variations that can exist within a class of objects and the variations in visual appearance. FIG. 1A illustrates a picture slide 10 showing intra-class variations for human faces and cars. For example, cars vary in shape, size, coloring, and in small details such as the headlights, grill, and tires. Similarly, the class of human faces may contain human faces for males and females, young and old, bespectacled with plain eyeglasses or with sunglasses, etc. Also, the visual expression of a face may be different from human to human. One face may appear jovial whereas the other one may appear sad and gloomy. Visual appearance also depends on the surrounding environment and lighting conditions as illustrated by the picture slide 12 in FIG. 1B. Light sources will vary in their intensity, color, and location with respect to the object. Nearby objects may cast shadows on the object or reflect additional light on the object. Furthermore, the appearance of the object also depends on its pose; that is, its position and orientation with respect to the camera. FIG. 1C shows a picture slide 14 illustrating geometric variation among human faces. A person's race, age, gender, ethnicity, etc., may play a dominant role in defining the person's facial features. A side view of a human face will look much different than a frontal view.