US10043058B2 - Face detection, representation, and recognition - Google Patents

Face detection, representation, and recognition

Info

Publication number
US10043058B2
Authority
US
United States
Prior art keywords
face
program instructions
image
computer
determining
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active, expires
Application number
US15/065,021
Other versions
US20170262695A1 (en)
Inventor
Mohamed N. Ahmed
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hyundai Motor Co
Kia Corp
Original Assignee
International Business Machines Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by International Business Machines Corp
Priority to US15/065,021 (US10043058B2)
Assigned to INTERNATIONAL BUSINESS MACHINES CORPORATION (assignment of assignors interest; see document for details). Assignors: AHMED, MOHAMED N.
Publication of US20170262695A1
Priority to US15/933,894 (US10346676B2)
Application granted
Publication of US10043058B2
Assigned to HYUNDAI MOTOR COMPANY, KIA CORPORATION (assignment of assignors interest; see document for details). Assignors: INTERNATIONAL BUSINESS MACHINES CORPORATION
Legal status: Active


Abstract

In an approach to face recognition in an image, one or more computer processors receive an image that includes at least one face and one or more face parts. The one or more computer processors detect the one or more face parts in the image with a face component model. The one or more computer processors cluster the detected one or more face parts with one or more stored images. The one or more computer processors extract, from the clustered images, one or more face descriptors. The one or more computer processors determine a recognition score of the at least one face, based, at least in part, on the extracted one or more face descriptors.

Description

BACKGROUND OF THE INVENTION
The present invention relates generally to the field of computer based image analysis and recognition, and more particularly to robust face detection, representation, and recognition.
Face recognition is an increasingly important application of computer vision, particularly in areas such as security. However, accurate face recognition is often difficult because a person's face can look very different depending on pose, expression, illumination, and facial accessories. Face recognition has been approached with 3D model-based techniques and feature-based methods. The essential feature of every face recognition system is the similarity measure, where faces are considered similar if they belong to the same individual. A similarity measure is a real-valued function that quantifies the similarity between two objects. Typically such measures are in some sense the inverse of distance metrics: they take on large values for similar objects and either zero or a negative value for dissimilar objects. The similarity measure can be used to verify that two face images belong to the same person, or to classify novel images by determining to which of the given faces a new example is most similar.
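As a concrete illustration (not prescribed by the patent), a similarity measure can be built as the inverse of a distance metric over feature vectors; the function names and the threshold value below are hypothetical:

```python
import numpy as np

def similarity(a: np.ndarray, b: np.ndarray) -> float:
    # Large for similar feature vectors, near zero for dissimilar ones:
    # an inverse of the Euclidean distance between the two vectors.
    return 1.0 / (1.0 + np.linalg.norm(a - b))

def same_person(a: np.ndarray, b: np.ndarray, threshold: float = 0.5) -> bool:
    # Verification: declare a match when the similarity clears a tuned threshold.
    return similarity(a, b) >= threshold
```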
A face recognition system generally involves a face detection process for detecting the position and size of a face image included in an input image, a face parts detection process for detecting the positions of principal face parts from the detected face image, and a face identification process that identifies the face image (i.e., the person) by checking an image obtained by correcting the position and rotation of the face image based on the positions of the face parts against a registered image. Face detection is concerned with the problem of locating regions within a digital image or video sequence that have a high probability of representing a human face. Face detection includes a process of determining whether a human face is present in an input image, and may include determining a position and/or other features, properties, parameters, or values of parameters of the face within the input image.
Face recognition technology has achieved tremendous advancements in the last decade. However, many current automated tools perform best on well-posed, frontal facial photos taken for identification purposes. These tools may not be able to handle the sheer volume of possibly relevant videos and photographs captured in unconstrained environments. In such environments, factors like pose, illumination, partial occlusion, and varying facial expressions present a difficult challenge, even for state-of-the-art face recognition systems.
SUMMARY
Aspects of the present invention provide an approach for face recognition in an image. A first aspect of the present invention discloses a method including one or more computer processors receiving an image that includes at least one face and one or more face parts. The one or more computer processors detect the one or more face parts in the image with a face component model. The one or more computer processors cluster the detected one or more face parts with one or more stored images. The one or more computer processors extract, from the clustered images, one or more face descriptors. The one or more computer processors determine a recognition score of the at least one face, based, at least in part, on the extracted one or more face descriptors. The approach is advantageous because the face component model improves the accuracy and detection rate of state-of-the-art face detection in unconstrained environments and under partial occlusion.
A second aspect of the present invention discloses a computer program product including one or more computer readable storage devices and program instructions stored on the one or more computer readable storage devices. The stored program instructions include program instructions to receive an image that includes at least one face and one or more face parts. The stored program instructions include program instructions to detect the one or more face parts in the image with a face component model. The stored program instructions include program instructions to cluster the detected one or more face parts with one or more stored images. The stored program instructions include program instructions to extract, from the clustered images, one or more face descriptors. The stored program instructions include program instructions to determine a recognition score of the at least one face, based, at least in part, on the extracted one or more face descriptors.
A third aspect of the present invention discloses a computer system including one or more computer processors and one or more computer readable storage devices, wherein the program instructions are stored on the one or more computer readable storage devices for execution by at least one of the one or more computer processors. The stored program instructions include program instructions to receive an image that includes at least one face and one or more face parts. The stored program instructions include program instructions to detect the one or more face parts in the image with a face component model. The stored program instructions include program instructions to cluster the detected one or more face parts with one or more stored images. The stored program instructions include program instructions to extract, from the clustered images, one or more face descriptors. The stored program instructions include program instructions to determine a recognition score of the at least one face, based, at least in part, on the extracted one or more face descriptors.
In yet another aspect of the invention, detecting the one or more face parts with a face component model includes applying, by the one or more computer processors, a root filter to the image, initializing, by the one or more computer processors, a set of the one or more face parts, determining, by the one or more computer processors, whether a presence of one or more occluding objects is detected in the image that exceeds a threshold, and, in response to determining the presence of one or more occluding objects is detected in the image that exceeds a threshold, adding, by the one or more computer processors, the one or more occluding objects to the set of one or more face parts.
In yet another aspect of the invention, extracting, from the clustered images, one or more face descriptors further includes normalizing, by the one or more computer processors, the received image, comparing, by the one or more computer processors, the normalized image to one or more existing templates of facial features, determining, by the one or more computer processors, whether a minimum distance between the normalized image and the one or more existing templates exceeds a threshold, in response to determining the minimum distance between the normalized image and the one or more existing templates exceeds a threshold, determining, by the one or more computer processors, whether a quantity of the one or more existing templates is less than a pre-defined maximum quantity, and in response to determining the quantity of the one or more existing templates is less than a pre-defined maximum quantity, creating, by the one or more computer processors, a new template from the normalized image. An advantage of this approach is that by keeping the number of templates a fixed size, computation complexity of face matching and recognition does not increase with the number of images, but, instead, remains constant.
In yet another aspect of the invention, determining the recognition score of the at least one face further includes using, by the one or more computer processors, a trained convolution deep neural network. An advantage of this approach is that a trained convolution deep neural network can handle large-scale data with complex distribution.
BRIEF DESCRIPTION OF THE DRAWINGS
FIG. 1 is a functional block diagram illustrating a distributed data processing environment, in accordance with an embodiment of the present invention;
FIG. 2 is a flowchart depicting operational steps of a face detection module, in a face recognition engine, on a server computer within the distributed data processing environment of FIG. 1, for detecting face parts in an image, in accordance with an embodiment of the present invention;
FIG. 3 is a flowchart depicting operational steps of a face representation module, in the face recognition engine, on the server computer within the distributed data processing environment of FIG. 1, in accordance with an embodiment of the present invention;
FIG. 4 illustrates an example of a deep neural network used by a face recognition module in the face recognition engine, on the server computer within the distributed data processing environment of FIG. 1, in accordance with an embodiment of the present invention; and
FIG. 5 depicts a block diagram of components of the server computer executing the face recognition engine within the distributed data processing environment of FIG. 1, in accordance with an embodiment of the present invention.
DETAILED DESCRIPTION
Embodiments of the present invention recognize that efficiency can be gained in face recognition systems by implementing a face components model that can detect parts of the face instead of an entire face in an image when portions of the face in the image are difficult to detect due to full or partial occlusion, for example. Embodiments of the present invention also recognize that efficiency can be gained by implementing a fixed size face descriptor for face recognition. In addition, embodiments of the present invention recognize that efficiency can be gained by implementing a deep neural network for optimizing face feature extraction and classification. Implementation of embodiments of the invention may take a variety of forms, and exemplary implementation details are discussed subsequently with reference to the Figures.
FIG. 1 is a functional block diagram illustrating a distributed data processing environment, generally designated 100, in accordance with one embodiment of the present invention. The term “distributed” as used in this specification describes a computer system that includes multiple, physically distinct devices that operate together as a single computer system. FIG. 1 provides only an illustration of one implementation and does not imply any limitations with regard to the environments in which different embodiments may be implemented. Many modifications to the depicted environment may be made by those skilled in the art without departing from the scope of the invention as recited by the claims.
Distributed data processing environment 100 includes client computing device 104 and server computer 108, interconnected over network 102. Network 102 can be, for example, a telecommunications network, a local area network (LAN), a wide area network (WAN), such as the Internet, or a combination of the three, and can include wired, wireless, or fiber optic connections. Network 102 can include one or more wired and/or wireless networks that are capable of receiving and transmitting data, voice, and/or video signals, including multimedia signals that include voice, data, and video information. In general, network 102 can be any combination of connections and protocols that will support communications between client computing device 104, server computer 108, and other computing devices (not shown) within distributed data processing environment 100.
Client computing device 104 can be a laptop computer, a tablet computer, a smart phone, or any programmable electronic device capable of communicating with various components and devices within distributed data processing environment 100, via network 102. In general, client computing device 104 represents any programmable electronic device or combination of programmable electronic devices capable of executing machine readable program instructions and communicating with other computing devices (not shown) within distributed data processing environment 100 via a network, such as network 102. Client computing device 104 includes user interface 106. In one embodiment, client computing device 104 includes a camera for capturing images that the user may submit to server computer 108 for face detection and recognition.
User interface 106 provides an interface between a user of client computing device 104 and server computer 108. In one embodiment, user interface 106 may be a graphical user interface (GUI) or a web user interface (WUI) and can display text, documents, web browser windows, user options, application interfaces, and instructions for operation, and include the information (such as graphic, text, and sound) that a program presents to a user and the control sequences the user employs to control the program. In another embodiment, user interface 106 may also be mobile application software that provides an interface between a user of client computing device 104 and server computer 108. Mobile application software, or an “app,” is a computer program designed to run on smart phones, tablet computers and other mobile devices. User interface 106 enables a user of client computing device 104 to access server computer 108 for face detection and recognition processes.
Server computer 108 can be a standalone computing device, a management server, a web server, a mobile computing device, or any other electronic device or computing system capable of receiving, sending, and processing data. In other embodiments, server computer 108 can represent a server computing system utilizing multiple computers as a server system, such as in a cloud computing environment. In another embodiment, server computer 108 can be a laptop computer, a tablet computer, a netbook computer, a personal computer (PC), a desktop computer, a personal digital assistant (PDA), a smart phone, or any programmable electronic device capable of communicating with client computing device 104 and other computing devices (not shown) within distributed data processing environment 100 via network 102. In another embodiment, server computer 108 represents a computing system utilizing clustered computers and components (e.g., database server computers, application server computers, etc.) that act as a single pool of seamless resources when accessed within distributed data processing environment 100. Server computer 108 includes face recognition engine 110 and database 120. Server computer 108 may include internal and external hardware components, as depicted and described in further detail with respect to FIG. 5.
Face recognition engine 110 begins a face recognition process by performing face detection on images received, either for enrollment in a gallery or with a query for matching, using an approach that includes a face components model (FCM). In the FCM, face recognition engine 110 models faces as a collection of parts that include regular facial parts, such as eyes, nose, and mouth, and optional common occluding objects, including, but not limited to, sunglasses, caps, hands, and other faces. Face recognition engine 110 aligns and normalizes detected faces to a frontal view using a 3D mean shape. Face recognition engine 110 then extracts features from each face part. Face recognition engine 110 uses the extracted features to cluster the data based on feature-based similarity. During a face recognition process, face recognition engine 110 may extract a compact face descriptor (CFD) from the clustered faces to represent each subject in a gallery, where a gallery is a collection of images used for matching. The CFD includes a stack of M templates, where M is a maximum number of templates, for each face part. During a face recognition process, face recognition engine 110 extracts CFDs of query images in the same manner. In one embodiment, face recognition engine 110 computes a recognition score by calculating the distance between an extracted vector and one or more subjects, i.e., faces, in a gallery or database, using a sparse similarity measure. In another embodiment, face recognition engine 110 may compute a recognition score by training a deep neural network using the extracted templates and occlusion maps. In one embodiment, occlusion maps are pre-defined within face recognition engine 110 based on observation of a large number of face images, and may be stored in database 120. Occlusion maps define an area where one or more occluding objects are expected to cover parts of a face. For example, sunglasses are expected to cover the eyes.
In the depicted embodiment, face recognition engine 110 includes four components that perform the various functions of a face recognition process, as described above: face detection module 112, face clustering module 114, face representation module 116, and face recognition module 118. In another embodiment, face recognition engine 110 is a fully integrated tool that includes the functions of the previously listed components, but the components are not individual entities. In a further embodiment, face recognition engine 110 does not use face representation module 116. In yet another embodiment, one or more of the four components may be integrated within face recognition engine 110.
Face detection module 112 includes a face component model (FCM). The FCM models a face as a collection of facial parts, for example, two eyes, one nose, and one mouth, in addition to various occluding parts, such as sunglasses, caps, hands, etc. In one embodiment, the FCM may also detect special facial characteristics, including, but not limited to, scars, tattoos, piercings, and facial hair, as optional parts for a face. The FCM detects a face and suggests visible parts for use in the rest of a face recognition process. The FCM comprises two components. One component handles frontal and near frontal faces, i.e., faces with two eyes visible. Another component handles profile and near profile faces, i.e., faces with only one eye visible. Face detection module 112 recovers other poses by deformations embedded in the model, which allows for different parts to change locations or positions, for example, eyes may be open or closed. Each of the two components includes a root filter that captures a global appearance of a face in an image and several parts filters that capture a texture of different parts. In one embodiment, the parts filters capture the texture of different parts at twice the spatial resolution, i.e., zoom in to the image of the face by a factor of two. Face detection module 112 calculates an overall score of an image by adding the root score of the image to the sum of the parts scores belonging to a selected set of objects. The overall score is an indication of the confidence that the detected object is a particular face part. Using the FCM approach is advantageous because it improves the accuracy and detection rate of state-of-the-art face detection in unconstrained environments and under partial occlusion. Face detection module 112 is depicted and described in further detail with respect to FIG. 2.
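A minimal sketch of the scoring rule just described, assuming the root filter and the part filters have already produced per-object scores (the part names, score values, and function name below are illustrative, not from the patent):

```python
def fcm_overall_score(root_score, part_scores, selected_parts):
    # Overall score = root (global appearance) score + the sum of the
    # scores of the parts kept in the selected set.
    return root_score + sum(part_scores[p] for p in selected_parts)

# Hypothetical frontal face whose eyes are occluded by sunglasses.
scores = {"left_eye": 0.2, "right_eye": 0.9, "nose": 0.8,
          "mouth": 0.7, "sunglasses": 0.95}
visible = {"right_eye", "nose", "mouth", "sunglasses"}
print(fcm_overall_score(0.6, scores, visible))  # 0.6 + 3.35 = 3.95
```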
Face clustering module 114 clusters similarly appearing faces into subjects by following a process of several steps. The steps include facial landmark detection, face normalization, feature extraction, and clustering. Face clustering module 114 normalizes detected faces into either a frontal or profile pose using a 3D reconstructed shape, extracts feature vectors from the normalized faces, and clusters the feature vectors using a feature-based similarity measure. Face clustering module 114 performs landmark detection, i.e., detection of various points on a face, using one of a plurality of techniques known in the art. For example, face clustering module 114 may be based on a regularized boosted classifier coupled with a mixture of complex Bingham distributions. Face clustering module 114 may detect landmarks using an energy functional representation, and reduce false positives by regularizing a boosted classifier with a variance normalization factor. Face clustering module 114 may also model an appearance around each landmark using Haar-like features, as would be recognized by one skilled in the art.
Pose normalization is important in face recognition processes to overcome differences between images in a gallery and query images (to be matched to gallery images), for example head tilt or the direction the head is facing. Face clustering module 114 normalizes poses using one of a plurality of techniques known in the art. For example, in one embodiment, face clustering module 114 detects cropped faces based on detected landmarks and warps the faces to either the corresponding frontal or profile view based on the pose of the input image. Continuing the example, face clustering module 114 estimates a pose from the detected landmarks, uses the estimated pose to estimate a projection matrix to align a 3D mean shape to the same pose of the input face, and maps a texture from the 2D domain to color the 3D face vertices. Face clustering module 114 then generates one or more faces in either a frontal view or profile view based on the pose angle of the input image.
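The patent aligns faces with a 3D mean shape and an estimated projection matrix; as a much simpler 2D stand-in, the sketch below solves a least-squares similarity transform (scale, rotation, translation) mapping detected landmarks onto a hypothetical canonical frontal layout:

```python
import numpy as np

# Hypothetical canonical landmark positions (left eye, right eye, nose tip)
# for a 96x96 frontal crop.
CANONICAL = np.array([[30.0, 35.0], [66.0, 35.0], [48.0, 60.0]])

def estimate_similarity_transform(src: np.ndarray, dst: np.ndarray) -> np.ndarray:
    # Solve for [a, c, tx, ty] in x' = a*x - c*y + tx, y' = c*x + a*y + ty
    # by least squares over all landmark correspondences.
    n = len(src)
    A = np.zeros((2 * n, 4))
    A[0::2] = np.column_stack([src[:, 0], -src[:, 1], np.ones(n), np.zeros(n)])
    A[1::2] = np.column_stack([src[:, 1],  src[:, 0], np.zeros(n), np.ones(n)])
    a, c, tx, ty = np.linalg.lstsq(A, dst.reshape(-1), rcond=None)[0]
    return np.array([[a, -c, tx], [c, a, ty]])  # 2x3 matrix for warping

# Detected landmarks from a tilted face map onto the canonical layout.
detected = np.array([[40.0, 50.0], [70.0, 45.0], [55.0, 70.0]])
print(estimate_similarity_transform(detected, CANONICAL))
```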
After pose normalization, face clustering module 114 extracts one or more facial feature templates from the normalized faces. The facial feature template is a combination of features extracted from different facial components. Face clustering module 114 represents each facial component by concatenating descriptors around a number of landmarks that belong to the component at different image resolutions. For example, face clustering module 114 may represent a left eye by five landmarks, i.e., points: two corners, a center, a center upper, and a center lower. In one embodiment, the representation of each part may contain redundant information; however, an over-complete descriptor may improve accuracy. In a preferred embodiment, face clustering module 114 utilizes compact face descriptors (CFD), as will be described with respect to face representation module 116. In another embodiment, face clustering module 114 builds descriptors using one of a plurality of techniques known in the art. For example, in one embodiment, face clustering module 114 may build a descriptor using a combination of high-dimension local binary patterns and a histogram of oriented gradients (HOG) since the techniques offer a tradeoff between computational complexity and accuracy. In the example, face clustering module 114 performs a dimensionality reduction on the facial feature templates using principal component analysis (PCA) and probabilistic linear discriminative analysis (PLDA). After extracting the facial feature templates, face clustering module 114 clusters the facial feature templates obtained from different faces into subjects using one of a plurality of clustering approaches known in the art. For example, in one embodiment, face clustering module 114 may cluster the templates using a fuzzy C-means clustering algorithm.
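Descriptor construction (local binary patterns, HOG, PCA) is standard and omitted here; the sketch below implements only the final clustering step, a minimal fuzzy C-means over already-extracted feature vectors (all parameter values and the synthetic data are illustrative):

```python
import numpy as np

def fuzzy_c_means(X: np.ndarray, c: int, m: float = 2.0,
                  iters: int = 100, seed: int = 0):
    # X: (n_templates, n_features); returns cluster centers and the
    # (n_templates, c) fuzzy membership matrix U.
    rng = np.random.default_rng(seed)
    U = rng.random((len(X), c))
    U /= U.sum(axis=1, keepdims=True)
    for _ in range(iters):
        Um = U ** m
        centers = (Um.T @ X) / Um.sum(axis=0)[:, None]
        # Distance of every template to every center (epsilon avoids /0).
        d = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2) + 1e-9
        U = d ** (-2.0 / (m - 1.0))
        U /= U.sum(axis=1, keepdims=True)
    return centers, U

# Two synthetic groups of templates separate cleanly into two subjects.
X = np.vstack([np.random.default_rng(1).normal(0, 0.1, (10, 8)),
               np.random.default_rng(2).normal(3, 0.1, (10, 8))])
centers, U = fuzzy_c_means(X, c=2)
print(U.argmax(axis=1))
```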
In an embodiment where face recognition engine 110 includes face representation module 116, face representation module 116 creates a fixed size compact face descriptor (CFD) for representing subject faces. A CFD consists of a stack of M templates for each subject (at frontal and profile view), where M is a maximum number of templates, and corresponding features extracted from each face part. The CFD converges to the basis for representing the subject space at frontal and profile view independently when the enrolled data covers the extreme points in the subject space at different illumination conditions, expressions, and occlusion. An advantage of the CFD is that the CFD remains a fixed size, independent of the number of enrolled images and videos per subject, thus the computation complexity of face matching and recognition does not increase with the number of images, but, instead, remains constant. By using the CFD, face representation module 116 makes use of all imagery, which is advantageous over known solutions because face representation module 116 does not rely on a single best frame approach, and dynamically updates the CFD using newly enrolled images. In addition, by using the M-template approach, the CFD is more robust to variations of age, expression, face marks, and extreme illumination that cannot be corrected using common illumination normalization techniques. Another advantage of using the CFD is that the CFD can handle different problems, such as illumination, expression, age, etc., that are challenging for known approaches. For example, a difference of Gaussians approach may be used to preprocess illumination variation before feature extraction, but the approach fails in handling the full spectrum of illumination variation. A CFD, however, uses multiple images for the same subject under different expressions to handle different facial expressions. Face representation module 116 is depicted and described in further detail with respect to FIG. 3.
Face recognition module 118 uses inputs extracted and created by face detection module 112, face clustering module 114, and face representation module 116 to perform face recognition on an image presented by a user, via user interface 106. Face recognition module 118 aligns and normalizes test media, i.e., query images and video frames, to a frontal or profile view. Face recognition module 118 extracts features from each visible face part. Then face recognition module 118 computes a recognition score, i.e., a distance between the extracted vector and each subject in database 120, using a sparse similarity measure, and performs a matching process. In a preferred embodiment, face recognition module 118 determines the recognition score in the matching process by using a trained convolution deep neural network, as is known to one skilled in the art, to handle large-scale data with a complex distribution and to optimize face feature extraction and classification. In the embodiment, the convolution deep neural network may consist of N layers of convolution, where each layer is followed by maximum pooling, and the last layer is a fully connected layer. The input to the convolution deep neural network includes facial feature templates and occlusion maps detected by face detection module 112 using the FCM approach. An advantage of using the described convolution deep neural network architecture is that by using the feature combination, face recognition module 118 may achieve robust face recognition under occlusion as well as incorporating unusual characteristics in the recognition. In addition, using the described convolution deep neural network enables face recognition module 118 to handle large-scale data with complex distribution.
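A toy version of the scoring step, with a plain inverse Euclidean distance standing in for the patent's sparse similarity measure or trained network (the gallery contents and subject names are hypothetical):

```python
import numpy as np

def recognition_scores(query: np.ndarray, gallery: dict) -> dict:
    # Score the query against each subject's best-matching template.
    return {subject: max(1.0 / (1.0 + np.linalg.norm(query - t)) for t in templates)
            for subject, templates in gallery.items()}

gallery = {"subject_a": [np.zeros(4), np.ones(4)], "subject_b": [np.full(4, 3.0)]}
scores = recognition_scores(np.ones(4), gallery)
print(max(scores, key=scores.get))  # subject_a: an exact template match
```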
In another embodiment, face recognition module 118 may perform the matching process by comparing the input image to each image in database 120 one by one. Although functional, this method will likely take a long time, depending on the number of images in database 120. In another embodiment, face recognition module 118 may perform the matching process by using a known 1-N matching scheme. For example, face recognition module 118 may apply a nested cascade classifier to improve matching complexity and speed. Face recognition module 118 may divide template features into several nested stages from coarse to fine. If face recognition module 118 determines the resulting similarity score is smaller than a threshold of a current stage, then face recognition module 118 interrupts the matching process. Since the probability of two facial feature templates belonging to different subjects is high, face recognition module 118 rejects most of the matching pairs in the early stages of the cascade, which may improve the matching speed significantly.
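A sketch of the nested-cascade idea under the same toy similarity measure as above; the stage features, thresholds, and early-exit rule are illustrative assumptions:

```python
import numpy as np

def stage_similarity(q: np.ndarray, t: np.ndarray) -> float:
    return 1.0 / (1.0 + np.linalg.norm(q - t))

def cascade_match(query_stages, template_stages, thresholds):
    # Coarse-to-fine: compare stage features in order, and abort at the
    # first stage whose similarity falls below that stage's threshold.
    score = 0.0
    for q, t, th in zip(query_stages, template_stages, thresholds):
        score = stage_similarity(q, t)
        if score < th:
            return None  # early rejection: most non-matching pairs exit here
    return score         # survived every stage: a candidate match
```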
In a further embodiment, face recognition module 118 may perform the matching process by using an indexing approach based on geometric hashing, which has a constant time O(1) retrieval performance. In the embodiment, face recognition module 118 transforms CFDs to a lower dimensional index and records them in a hash table as a single entry associated with a particular ID. Face recognition module 118 converts the input image to a similar set of indices, retrieves corresponding IDs from the hash table, and uses a voting scheme to resolve the likely ID, thus enabling retrieval even if the input or gallery models are only partially defined.
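A much-simplified sketch of the index-and-vote mechanics (real geometric hashing derives basis-invariant indices; the quantization scheme and table layout here are assumptions for illustration):

```python
from collections import Counter
import numpy as np

def to_index(descriptor: np.ndarray, cell: float = 0.25) -> tuple:
    # Quantize a low-dimensional (e.g., PCA-reduced) descriptor into a
    # coarse integer index usable as a hash-table key.
    return tuple(np.floor(descriptor / cell).astype(int))

table = {}  # index -> subject ID

def enroll(subject_id: str, descriptors) -> None:
    for d in descriptors:
        table[to_index(d)] = subject_id

def identify(query_descriptors):
    # Constant-time lookups plus voting resolve the most likely ID, even
    # when only some of the query indices hit the table.
    keys = (to_index(d) for d in query_descriptors)
    votes = Counter(table[k] for k in keys if k in table)
    return votes.most_common(1)[0][0] if votes else None
```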
Database 120 is a repository for images used by face recognition engine 110, also known as a gallery. In the depicted embodiment, database 120 resides on server computer 108. In another embodiment, database 120 may reside elsewhere within distributed data processing environment 100 provided face recognition engine 110 has access to database 120. A database is an organized collection of data. Database 120 can be implemented with any type of storage device capable of storing data and configuration files that can be accessed and utilized by server computer 108, such as a database server, a hard disk drive, or a flash memory. Database 120 stores images of a plurality of subjects, i.e., faces, used to train face recognition engine 110 as well as query images submitted to face recognition engine 110, via user interface 106, for matching purposes. Database 120 may also store templates associated with one or more CFDs.
FIG. 2 is a flowchart depicting operational steps of face detection module 112, on server computer 108 within distributed data processing environment 100 of FIG. 1, for detecting face parts in an image, in accordance with an embodiment of the present invention.
Face detection module 112 applies a root filter (step 202). When face detection module 112 receives an image submitted by a user of client computing device 104, via user interface 106, for either a query or enrollment in a gallery, application of the root filter to a face in an image captures the global appearance of the face, which includes face parts and occluding objects. In a preferred embodiment, face detection module 112 uses an FCM to apply the root filter.
Face detection module 112 initializes a set of face parts (step 204). Face detection module 112 creates a set of face parts, such as two eyes, a nose, and a mouth. For example, face detection module 112 may create a set S that includes P1 as a left eye, P2 as a right eye, P3 as a nose, and P4 as a mouth. In one embodiment, face detection module 112 initializes the set of face parts based on expected locations of face parts in a global appearance or position of a face, i.e., two eyes, a nose, and a mouth are expected in a frontal position face image, whether or not occluding objects are present in the image. Face detection module 112 selects the face parts by comparing different subtypes of each part and selecting the best match. A subtype is a different template for each face part. For example, subtypes of an eye can be the eye open or the eye closed. Therefore subtypes may be included for each face part within a gallery.
Face detection module 112 determines whether an occluding object is detected that exceeds a pre-defined threshold (decision block 206). Face detection module 112 looks for additional objects in the image that may occlude the face parts. Examples of occluding objects include, but are not limited to, sunglasses, caps, hands, and other faces. In one embodiment, the pre-defined threshold is a percentage of pixels in the image. Exceeding the pre-defined threshold indicates the object's presence in the image. If face detection module 112 determines the image includes an occluding object that exceeds the pre-defined threshold (“yes” branch, decision block 206), then face detection module 112 adds the occluding object to the set of face parts (step 208). For example, if face detection module 112 detects an occluding object, such as sunglasses, in the image, then face detection module 112 adds the sunglasses as object P5 to set S.
Face detection module 112 determines whether the occluding object overlaps a face part (decision block 210). Face detection module 112 compares the occluding object to a corresponding face part according to one or more occlusion maps. In one embodiment, face detection module 112 determines whether a pre-defined percentage of the face part is missing from the image. For example, face detection module 112 may determine that at least 75 percent of the left eye is missing. If face detection module 112 determines the occluding object overlaps a face part (“yes” branch, decision block 210), then face detection module 112 removes the overlapped face part from the set (step 212). Face detection module 112 computes a score for the detection of a possible occluding object and then compares the score of the occluding object to a score of the overlapped object and keeps the object with the highest score in the set. In one embodiment, the score is a probability of detection, i.e., a confidence level. The confidence level may be measured by how close the object under consideration is to other subtypes in the gallery. For example, face detection module 112 computes a confidence that a detected object is a left eye. If an occluding object, such as sunglasses, has a higher score than an overlapped object, such as a left eye, then face detection module 112 keeps the sunglasses in the set of parts and removes the left eye from the set of parts. In another example, if an occluding object, such as a cap, has a lower score than an overlapped object, such as a left eye, then face detection module 112 keeps the left eye in the set of parts and removes the cap from the set of parts.
Face detection module 112 determines whether another occluding object is detected in the image (decision block 214). If face detection module 112 determines another occluding object is detected in the image (“yes” branch, decision block 214), then face detection module 112 returns to decision block 206.
If face detection module 112 determines the image does not include an occluding object that exceeds the pre-defined threshold (“no” branch, decision block 206), or if face detection module 112 determines the occluding object does not overlap a face part (“no” branch, decision block 210), or if face detection module 112 determines another occluding object is not detected in the image (“no” branch, decision block 214), then face detection module 112 completes the face detection process on the received image and ends. In one embodiment, the FCM performs all the steps described with respect to FIG. 2. Using the FCM is advantageous because the FCM improves the performance, e.g., the speed and accuracy, of face recognition engine 110, especially with images that include one or more occluding objects.
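The FIG. 2 flow condensed into a runnable sketch; the part names, confidences, and occlusion map below are hypothetical, and the patent's pixel-percentage threshold is simplified to a confidence cutoff:

```python
def detect_face_parts(part_scores, occluder_scores, occlusion_map,
                      presence_threshold=0.5):
    # part_scores / occluder_scores: {name: detection confidence}
    # occlusion_map: {occluder: [face parts it is expected to cover]}
    parts = dict(part_scores)                       # step 204: initialize set
    for obj, score in occluder_scores.items():
        if score < presence_threshold:              # decision block 206
            continue
        parts[obj] = score                          # step 208: add occluder
        for covered in occlusion_map.get(obj, []):  # decision block 210
            if covered in parts and parts[covered] < score:
                del parts[covered]                  # step 212: keep higher score
    return parts

# Sunglasses outscore both eyes, so both eyes drop out of the set.
print(detect_face_parts(
    {"left_eye": 0.3, "right_eye": 0.4, "nose": 0.8, "mouth": 0.7},
    {"sunglasses": 0.95},
    {"sunglasses": ["left_eye", "right_eye"]}))
```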
FIG. 3 illustrates operational steps of face representation module 116, in face recognition engine 110, on server computer 108 within distributed data processing environment 100 of FIG. 1, in accordance with an embodiment of the present invention.
Face representation module 116 receives an image (step 302). When a user of client computing device 104 enrolls a new image to the gallery, via user interface 106, face representation module 116 receives the image. In one embodiment, face detection module 112 also receives an image when a user of client computing device 104 submits an image, via user interface 106, as a query. In one embodiment, the received image is the same image referenced in FIG. 2.
Face representation module 116 normalizes the received image (step 304). In one embodiment, face representation module 116 normalizes the image using one of a plurality of techniques known in the art, as discussed with respect to face clustering module 114 in FIG. 1.
Face representation module 116 compares the normalized image to existing templates (step 306). Once face representation module 116 normalizes the image, face representation module 116 compares the normalized image to facial feature templates included in a CFD, and stored in database 120, for the same subject.
Face representation module 116 determines whether a minimum distance between features in the normalized image and features in the existing templates exceeds a threshold (decision block 308). The distance between the normalized image and existing templates is a measure of similarity. In one embodiment, face representation module 116 uses adaptive learning to choose the threshold by training an algorithm on similar and different faces to discover the threshold dynamically. In one embodiment, face representation module 116 measures the distance using a simple Euclidean distance, as would be recognized by one skilled in the art, to compute the distance between two vectors. In another embodiment, face representation module 116 measures the distance by training a Siamese neural network, as would be recognized by one skilled in the art, to compute the threshold and distance for similarity measurement. If face representation module 116 determines the minimum distance between the normalized image and the existing templates exceeds a threshold (“yes” branch, decision block 308), then face representation module 116 determines whether the quantity of existing templates is less than M (decision block 310). M is a maximum number of templates allowed in the stack of the compact face descriptor (CFD), thus keeping the CFD at a fixed size. In one embodiment, the user of client computing device 104 defines a value of M. In another embodiment, a system administrator may define a value of M.
If face representation module 116 determines the quantity of existing templates is less than M (“yes” branch, decision block 310), then face representation module 116 creates and appends a new template (step 312). If face representation module 116 determines the CFD includes less than the maximum number of templates, then face representation module 116 creates a new template from the normalized image and appends the new template to the existing templates in database 120.
If face representation module 116 determines the minimum distance between the normalized image and the existing templates does not exceed a threshold (“no” branch, decision block 308), or if face representation module 116 determines the quantity of existing templates is not less than M (“no” branch, decision block 310), then face representation module 116 updates the closest template (step 314). Face representation module 116 determines a matching score for each of the existing templates as compared to the normalized image. Face representation module 116 determines which of the existing templates has the highest matching score or similarity measurement, i.e., the minimum distance from the normalized image, and then updates the most similar template. Face representation module 116 regenerates features extracted from one or more facial parts to update the template in database 120.
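The FIG. 3 flow as a sketch; the Euclidean distance, the fixed threshold value, and the running-average update are stand-ins for the learned distance and template regeneration described above:

```python
import numpy as np

class CompactFaceDescriptor:
    # At most M templates per subject, so matching cost stays constant
    # no matter how many images are enrolled.
    def __init__(self, max_templates: int = 5):
        self.M = max_templates
        self.templates = []

    def enroll(self, feats: np.ndarray, threshold: float = 0.8) -> None:
        if not self.templates:
            self.templates.append(feats)
            return
        dists = [np.linalg.norm(feats - t) for t in self.templates]
        nearest = int(np.argmin(dists))
        if min(dists) > threshold and len(self.templates) < self.M:
            # Decision blocks 308/310: novel enough, and room left in the stack.
            self.templates.append(feats)
        else:
            # Step 314: refresh the closest (most similar) template.
            self.templates[nearest] = 0.5 * (self.templates[nearest] + feats)
```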
FIG. 4 illustrates an example of deep neural network 400 used by face recognition module 118 within distributed data processing environment 100 of FIG. 1, in accordance with an embodiment of the present invention.
In the depicted embodiment, deep neural network 400 is a convolution neural network (CNN), as would be recognized by one skilled in the art. Deep neural network 400 includes convolution layer 404, sub-sampling layer 406, convolution layer 408, sub-sampling layer 410, and fully connected multilayer perceptron 412. Convolution layers 404 and 408 consist of a rectangular grid of neurons, and require that the previous layer also be a rectangular grid of neurons. Each neuron takes inputs from a rectangular section of the previous layer; the weights for the input rectangular section are the same for each neuron in the convolution layer. Thus, a convolution layer is an image convolution of the previous layer, where the weights specify the convolution filter. In addition, there may be several grids in each convolution layer; each grid takes inputs from the grids in the previous layer, using potentially different filters.
Sub-sampling layers 406 and 410 are examples of pooling layers. A pooling layer takes rectangular blocks from the convolution layer and subsamples the blocks to produce a single output from that block. In the depicted embodiment, sub-sampling layers 406 and 410 are max-pooling layers, i.e., sub-sampling layers 406 and 410 take the maximum of the block they are pooling. In another embodiment, deep neural network 400 may perform pooling by instructing sub-sampling layers 406 and 410 to take the average of the block. In a further embodiment, deep neural network 400 may perform pooling by instructing sub-sampling layers 406 and 410 to use a learned linear combination of the neurons in the block.
Fully connected multilayer perceptron 412 performs high-level reasoning in deep neural network 400. As would be recognized by one skilled in the art, a multilayer perceptron (MLP) is a feedforward artificial neural network model that maps sets of input data onto a set of appropriate outputs. Fully connected multilayer perceptron 412 is not spatially located.
CFD 402 represents a stack of M facial feature templates, where, in the depicted simplified example, M is five. CFD 402 is the input to deep neural network 400. In one embodiment, one or more occlusion maps may also be input to deep neural network 400. Convolution layer 404 consists of a rectangular grid of neurons. Convolution layer 404 performs image convolution, i.e., filtering, on each of the facial feature templates in CFD 402 to extract a face representation. The right-pointing arrows between the depicted layers represent filter weights or parameters that deep neural network 400 trains. Deep neural network 400 adjusts the weights of the filter during training. The output of convolution layer 404 is a filtered image. Sub-sampling layer 406 divides the image into small rectangular blocks. The size of each block can vary. In one embodiment, deep neural network 400 uses a 3×3 pixel block size. In other embodiments, deep neural network 400 may use other block sizes. Sub-sampling layer 406 subsamples each block to produce a single output (maximum value) from that block. For example, if the input to sub-sampling layer 406 is 81×81 pixels, and the block size is 3×3 pixels, then the output of sub-sampling layer 406 is an image of 27×27 pixels.
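The 81×81 to 27×27 reduction can be checked with a few lines of numpy; the non-overlapping 3×3 max-pooling below mirrors the block size mentioned in the text:

```python
import numpy as np

def max_pool(img: np.ndarray, block: int = 3) -> np.ndarray:
    # Non-overlapping pooling: each block x block tile yields its maximum.
    h, w = img.shape
    img = img[:h - h % block, :w - w % block]  # trim any ragged border
    return img.reshape(h // block, block, w // block, block).max(axis=(1, 3))

x = np.random.rand(81, 81)
print(max_pool(x).shape)  # (27, 27), matching the example in the text
```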
Deep neural network 400 may use multiple concatenated convolution and sub-sampling layers to improve the feature selection task. In the depicted embodiment, deep neural network 400 uses two of each type of layer. For the recognition task, deep neural network 400 performs high-level reasoning via fully connected multilayer perceptron 412. Fully connected multilayer perceptron 412 takes each neuron in the previous layer (whether the previous layer is fully connected, pooling, or convolution) and connects each neuron to every single neuron within fully connected multilayer perceptron 412.
FIG. 5 depicts a block diagram of components of server computer 108 within distributed data processing environment 100 of FIG. 1, in accordance with an embodiment of the present invention. It should be appreciated that FIG. 5 provides only an illustration of one implementation and does not imply any limitations with regard to the environments in which different embodiments can be implemented. Many modifications to the depicted environment can be made.
Server computer 108 can include processor(s) 504, cache 514, memory 506, persistent storage 508, communications unit 510, input/output (I/O) interface(s) 512, and communications fabric 502. Communications fabric 502 provides communications between cache 514, memory 506, persistent storage 508, communications unit 510, and input/output (I/O) interface(s) 512. Communications fabric 502 can be implemented with any architecture designed for passing data and/or control information between processors (such as microprocessors, communications and network processors, etc.), system memory, peripheral devices, and any other hardware components within a system. For example, communications fabric 502 can be implemented with one or more buses.
Memory 506 and persistent storage 508 are computer readable storage media. In this embodiment, memory 506 includes random access memory (RAM). In general, memory 506 can include any suitable volatile or non-volatile computer readable storage media. Cache 514 is a fast memory that enhances the performance of processor(s) 504 by holding recently accessed data, and data near recently accessed data, from memory 506.
Program instructions and data used to practice embodiments of the present invention, e.g., face recognition engine 110 and database 120, are stored in persistent storage 508 for execution and/or access by one or more of the respective processor(s) 504 of server computer 108 via memory 506. In this embodiment, persistent storage 508 includes a magnetic hard disk drive. Alternatively, or in addition to a magnetic hard disk drive, persistent storage 508 can include a solid-state hard drive, a semiconductor storage device, a read-only memory (ROM), an erasable programmable read-only memory (EPROM), a flash memory, or any other computer readable storage media that is capable of storing program instructions or digital information.
The media used by persistent storage 508 may also be removable. For example, a removable hard drive may be used for persistent storage 508. Other examples include optical and magnetic disks, thumb drives, and smart cards that are inserted into a drive for transfer onto another computer readable storage medium that is also part of persistent storage 508.
Communications unit 510, in these examples, provides for communications with other data processing systems or devices, including resources of client computing device 104. In these examples, communications unit 510 includes one or more network interface cards. Communications unit 510 may provide communications through the use of either or both physical and wireless communications links. Face recognition engine 110 and database 120 may be downloaded to persistent storage 508 of server computer 108 through communications unit 510.
I/O interface(s) 512 allows for input and output of data with other devices that may be connected to server computer 108. For example, I/O interface(s) 512 may provide a connection to external device(s) 516 such as a keyboard, a keypad, a touch screen, a microphone, a digital camera, and/or some other suitable input device. External device(s) 516 can also include portable computer readable storage media such as, for example, thumb drives, portable optical or magnetic disks, and memory cards. Software and data used to practice embodiments of the present invention, e.g., face recognition engine 110 and database 120 on server computer 108, can be stored on such portable computer readable storage media and can be loaded onto persistent storage 508 via I/O interface(s) 512. I/O interface(s) 512 also connect to a display 518.
Display 518 provides a mechanism to display data to a user and may be, for example, a computer monitor. Display 518 can also function as a touchscreen, such as a display of a tablet computer.
The programs described herein are identified based upon the application for which they are implemented in a specific embodiment of the invention. However, it should be appreciated that any particular program nomenclature herein is used merely for convenience, and thus the invention should not be limited to use solely in any specific application identified and/or implied by such nomenclature.
The present invention may be a system, a method, and/or a computer program product. The computer program product may include a computer readable storage medium (or media) having computer readable program instructions thereon for causing a processor to carry out aspects of the present invention.
The computer readable storage medium can be any tangible device that can retain and store instructions for use by an instruction execution device. The computer readable storage medium may be, for example, but is not limited to, an electronic storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination of the foregoing. A non-exhaustive list of more specific examples of the computer readable storage medium includes the following: a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or Flash memory), a static random access memory (SRAM), a portable compact disc read-only memory (CD-ROM), a digital versatile disk (DVD), a memory stick, a floppy disk, a mechanically encoded device such as punch-cards or raised structures in a groove having instructions recorded thereon, and any suitable combination of the foregoing. A computer readable storage medium, as used herein, is not to be construed as being transitory signals per se, such as radio waves or other freely propagating electromagnetic waves, electromagnetic waves propagating through a waveguide or other transmission media (e.g., light pulses passing through a fiber-optic cable), or electrical signals transmitted through a wire.
Computer readable program instructions described herein can be downloaded to respective computing/processing devices from a computer readable storage medium or to an external computer or external storage device via a network, for example, the Internet, a local area network, a wide area network and/or a wireless network. The network may comprise copper transmission cables, optical transmission fibers, wireless transmission, routers, firewalls, switches, gateway computers and/or edge servers. A network adapter card or network interface in each computing/processing device receives computer readable program instructions from the network and forwards the computer readable program instructions for storage in a computer readable storage medium within the respective computing/processing device.
Computer readable program instructions for carrying out operations of the present invention may be assembler instructions, instruction-set-architecture (ISA) instructions, machine instructions, machine dependent instructions, microcode, firmware instructions, state-setting data, or either source code or object code written in any combination of one or more programming languages, including an object oriented programming language such as Smalltalk, C++ or the like, and conventional procedural programming languages, such as the “C” programming language or similar programming languages. The computer readable program instructions may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the latter scenario, the remote computer may be connected to the user's computer through any type of network, including a local area network (LAN) or a wide area network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet Service Provider). In some embodiments, electronic circuitry including, for example, programmable logic circuitry, field-programmable gate arrays (FPGA), or programmable logic arrays (PLA) may execute the computer readable program instructions by utilizing state information of the computer readable program instructions to personalize the electronic circuitry, in order to perform aspects of the present invention.
Aspects of the present invention are described herein with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the invention. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer readable program instructions.
These computer readable program instructions may be provided to a processor of a general purpose computer, a special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks. These computer readable program instructions may also be stored in a computer readable storage medium that can direct a computer, a programmable data processing apparatus, and/or other devices to function in a particular manner, such that the computer readable storage medium having instructions stored therein comprises an article of manufacture including instructions which implement aspects of the function/act specified in the flowchart and/or block diagram block or blocks.
The computer readable program instructions may also be loaded onto a computer, other programmable data processing apparatus, or other device to cause a series of operational steps to be performed on the computer, other programmable apparatus or other device to produce a computer implemented process, such that the instructions which execute on the computer, other programmable apparatus, or other device implement the functions/acts specified in the flowchart and/or block diagram block or blocks.
The flowchart and block diagrams in the Figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods, and computer program products according to various embodiments of the present invention. In this regard, each block in the flowchart or block diagrams may represent a module, a segment, or a portion of instructions, which comprises one or more executable instructions for implementing the specified logical function(s). In some alternative implementations, the functions noted in the blocks may occur out of the order noted in the Figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems that perform the specified functions or acts or carry out combinations of special purpose hardware and computer instructions.
The descriptions of the various embodiments of the present invention have been presented for purposes of illustration, but are not intended to be exhaustive or limited to the embodiments disclosed. Many modifications and variations will be apparent to those of ordinary skill in the art without departing from the scope and spirit of the invention. The terminology used herein was chosen to best explain the principles of the embodiment, the practical application or technical improvement over technologies found in the marketplace, or to enable others of ordinary skill in the art to understand the embodiments disclosed herein.

Claims (17)

What is claimed is:
1. A method for face recognition in an image, the method comprising:
receiving, by one or more computer processors, an image that includes at least one face and one or more face parts;
normalizing, by the one or more computer processors, the received image;
comparing, by the one or more computer processors, the normalized image to one or more existing templates of facial features in a first face descriptor of a plurality of face descriptors;
determining, by the one or more computer processors, whether a minimum distance between the normalized image and the one or more existing templates exceeds a threshold;
responsive to determining the minimum distance between the normalized image and the one or more existing templates exceeds a threshold, determining, by the one or more computer processors, whether a quantity of the one or more existing templates is less than a pre-defined maximum quantity;
responsive to determining the quantity of the one or more existing templates is less than a pre-defined maximum quantity, creating, by the one or more computer processors, a new template from the normalized image, wherein the new template is appended to the one or more existing templates in the first face descriptor;
detecting, by the one or more computer processors, the one or more face parts in the image with a face component model;
clustering, by the one or more computer processors, the detected one or more face parts with one or more stored images;
extracting, by the one or more computer processors, from the clustered images, one or more face descriptors of the plurality of face descriptors; and
determining, by the one or more computer processors, a recognition score of the at least one face, based, at least in part, on the extracted one or more face descriptors.
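A minimal sketch, in Python, of the template-management logic recited in claim 1 (and the update fallback of claim 4 below): if the normalized probe is far from every stored template and the gallery is not full, a new template is appended; otherwise the closest template is refreshed. The feature extractor, the Euclidean distance metric, and the threshold and maximum-quantity values are illustrative assumptions, not anything fixed by the claims.

import numpy as np

MAX_TEMPLATES = 5          # hypothetical pre-defined maximum quantity
NOVELTY_THRESHOLD = 0.6    # hypothetical minimum-distance threshold

def update_face_descriptor(templates, normalized_image, extract_features):
    """Append to, or update, the template list of one face descriptor."""
    probe = extract_features(normalized_image)           # 1-D feature vector
    distances = [np.linalg.norm(probe - t) for t in templates]
    if not distances or min(distances) > NOVELTY_THRESHOLD:
        if len(templates) < MAX_TEMPLATES:
            templates.append(probe)                      # claim 1: create a new template
        else:
            # claim 4: gallery is full, so refresh the most closely
            # matching template, here by averaging it with the probe
            closest = int(np.argmin(distances))
            templates[closest] = 0.5 * (templates[closest] + probe)
    return templates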
2. The method of claim 1, wherein detecting the one or more face parts with a face component model further comprises:
applying, by the one or more computer processors, a root filter to the image;
initializing, by the one or more computer processors, a set of the one or more face parts;
determining, by the one or more computer processors, whether a presence of one or more occluding objects is detected in the image that exceeds a threshold; and
responsive to determining the presence of one or more occluding objects is detected in the image that exceeds a threshold, adding, by the one or more computer processors, the one or more occluding objects to the set of one or more face parts.
3. The method of claim 2, further comprising:
determining, by the one or more computer processors, whether the one or more occluding objects overlap one or more face parts in the image; and
responsive to determining the one or more occluding objects overlap at least one of the one or more face parts in the image, removing, by the one or more computer processors, the one or more overlapped face parts from the set of the one or more face parts.
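A minimal sketch, in Python, of the occlusion handling recited in claims 2 and 3: a root filter proposes the face, a part set is initialized, occluding objects detected above a threshold are added to the part set, and any face part they overlap is removed. The detector callables, the box format, and both thresholds are illustrative assumptions.

def iou(a, b):
    """Intersection-over-union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0, ix2 - ix1) * max(0, iy2 - iy1)
    union = ((a[2] - a[0]) * (a[3] - a[1]) +
             (b[2] - b[0]) * (b[3] - b[1]) - inter)
    return inter / union if union else 0.0

def build_part_set(image, root_filter, part_detector, occluder_detector,
                   occluder_score_thresh=0.5, overlap_thresh=0.3):
    face_box = root_filter(image)              # coarse whole-face hypothesis
    parts = part_detector(image, face_box)     # e.g., eyes, nose, mouth
    occluders = [o for o in occluder_detector(image, face_box)
                 if o["score"] > occluder_score_thresh]
    parts.extend(occluders)                    # claim 2: occluders join the set
    # claim 3: drop any original face part that an occluder overlaps
    return [p for p in parts
            if p in occluders
            or all(iou(p["box"], o["box"]) <= overlap_thresh
                   for o in occluders)]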
4. The method of claim 1, further comprising, responsive to determining the quantity of the one or more existing templates is not less than a pre-defined maximum quantity, updating, by the one or more computer processors, a first template of the one or more existing templates, wherein the first template most closely matches the normalized image.
5. The method of claim 1, wherein determining the recognition score of the at least one face further comprises using, by the one or more computer processors, a trained convolutional deep neural network.
6. The method of claim 5, wherein the trained convolutional deep neural network uses as input at least one of the one or more face descriptors of the plurality of face descriptors or one or more occlusion maps.
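A minimal sketch, in Python with PyTorch, of the scoring step of claims 5 and 6: a trained convolutional deep neural network that takes a face descriptor and an occlusion map as input and emits per-identity recognition scores. The architecture, tensor shapes, and layer sizes are illustrative assumptions rather than anything the claims specify.

import torch
import torch.nn as nn

class RecognitionScorer(nn.Module):
    def __init__(self, descriptor_dim=128, map_size=32, num_identities=100):
        super().__init__()
        # convolutional branch over the occlusion map (1 x map_size x map_size)
        self.conv = nn.Sequential(
            nn.Conv2d(1, 8, kernel_size=3, padding=1), nn.ReLU(),
            nn.MaxPool2d(2), nn.Flatten())
        conv_out = 8 * (map_size // 2) ** 2
        # fusion head over [occlusion features ; face descriptor]
        self.head = nn.Sequential(
            nn.Linear(conv_out + descriptor_dim, 256), nn.ReLU(),
            nn.Linear(256, num_identities))

    def forward(self, descriptor, occlusion_map):
        fused = torch.cat([self.conv(occlusion_map), descriptor], dim=1)
        return self.head(fused).softmax(dim=1)  # per-identity scores

# usage with dummy inputs: one 128-D descriptor and one 32x32 occlusion map
model = RecognitionScorer()
scores = model(torch.randn(1, 128), torch.rand(1, 1, 32, 32))
print(int(scores.argmax(dim=1)), float(scores.max()))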
7. The method of claim 1, wherein the received image includes at least one object that occludes at least one of the one or more face parts.
8. A computer program product for face recognition in an image, the computer program product comprising:
one or more computer readable storage devices and program instructions stored on the one or more computer readable storage devices, the stored program instructions comprising:
program instructions to receive an image that includes at least one face and one or more face parts;
program instructions to normalize the received image;
program instructions to compare the normalized image to one or more existing templates of facial features in a first face descriptor of a plurality of face descriptors;
program instructions to determine whether a minimum distance between the normalized image and the one or more existing templates exceeds a threshold;
responsive to determining the minimum distance between the normalized image and the one or more existing templates exceeds a threshold, program instructions to determine whether a quantity of the one or more existing templates is less than a pre-defined maximum quantity;
responsive to determining the quantity of the one or more existing templates is less than a pre-defined maximum quantity, program instructions to create a new template from the normalized image, wherein the new template is appended to the one or more existing templates in the first face descriptor;
program instructions to detect the one or more face parts in the image with a face component model;
program instructions to cluster the detected one or more face parts with one or more stored images;
program instructions to extract, from the clustered images, one or more face descriptors of the plurality of face descriptors; and
program instructions to determine a recognition score of the at least one face, based, at least in part, on the extracted one or more face descriptors.
9. The computer program product of claim 8, wherein the program instructions to detect the one or more face parts with a face component model comprise:
program instructions to apply a root filter to the image;
program instructions to initialize a set of the one or more face parts;
program instructions to determine whether a presence of one or more occluding objects is detected in the image that exceeds a threshold; and
responsive to determining the presence of one or more occluding objects is detected in the image that exceeds a threshold, program instructions to add the one or more occluding objects to the set of one or more face parts.
10. The computer program product of claim 9, the stored program instructions further comprising:
program instructions to determine whether the one or more occluding objects overlap one or more face parts in the image; and
responsive to determining the one or more occluding objects overlap at least one of the one or more face parts in the image, program instructions to remove the one or more overlapped face parts from the set of the one or more face parts.
11. The computer program product of claim 8, the stored program instructions further comprising, responsive to determining the quantity of the one or more existing templates is not less than a pre-defined maximum quantity, program instructions to update a first template of the one or more existing templates, wherein the first template most closely matches the normalized image.
12. The computer program product of claim 8, wherein the program instructions to determine the recognition score of the at least one face comprise program instructions to use a trained convolutional deep neural network.
13. A computer system for face recognition in an image, the computer system comprising:
one or more computer processors;
one or more computer readable storage devices;
program instructions stored on the one or more computer readable storage devices for execution by at least one of the one or more computer processors, the stored program instructions comprising:
program instructions to receive an image that includes at least one face and one or more face parts;
program instructions to normalize the received image;
program instructions to compare the normalized image to one or more existing templates of facial features in a first face descriptor of a plurality of face descriptors;
program instructions to determine whether a minimum distance between the normalized image and the one or more existing templates exceeds a threshold;
responsive to determining the minimum distance between the normalized image and the one or more existing templates exceeds a threshold, program instructions to determine whether a quantity of the one or more existing templates is less than a pre-defined maximum quantity;
responsive to determining the quantity of the one or more existing templates is less than a pre-defined maximum quantity, program instructions to create a new template from the normalized image, wherein the new template is appended to the one or more existing templates in the first face descriptor;
program instructions to detect the one or more face parts in the image with a face component model;
program instructions to cluster the detected one or more face parts with one or more stored images;
program instructions to extract, from the clustered images, one or more face descriptors of the plurality of face descriptors; and
program instructions to determine a recognition score of the at least one face, based, at least in part, on the extracted one or more face descriptors.
14. The computer system of claim 13, wherein the program instructions to detect the one or more face parts with a face component model comprise:
program instructions to apply a root filter to the image;
program instructions to initialize a set of the one or more face parts;
program instructions to determine whether a presence of one or more occluding objects is detected in the image that exceeds a threshold; and
responsive to determining the presence of one or more occluding objects is detected in the image that exceeds a threshold, program instructions to add the one or more occluding objects to the set of one or more face parts.
15. The computer system of claim 14, the stored program instructions further comprising:
program instructions to determine whether the one or more occluding objects overlap one or more face parts in the image; and
responsive to determining the one or more occluding objects overlap at least one of the one or more face parts in the image, program instructions to remove the one or more overlapped face parts from the set of the one or more face parts.
16. The computer system of claim 13, the stored program instructions further comprising, responsive to determining the quantity of the one or more existing templates is not less than a pre-defined maximum quantity, program instructions to update a first template of the one or more existing templates, wherein the first template most closely matches the normalized image.
17. The computer system of claim 13, wherein the program instructions to determine the recognition score of the at least one face comprise program instructions to use a trained convolutional deep neural network.
US15/065,021 | priority 2016-03-09 | filed 2016-03-09 | Face detection, representation, and recognition | Active, adjusted expiration 2036-04-19 | US10043058B2 (en)

Priority Applications (2)

Application Number | Priority Date | Filing Date | Title
US15/065,021 (US10043058B2) | 2016-03-09 | 2016-03-09 | Face detection, representation, and recognition
US15/933,894 (US10346676B2) | 2016-03-09 | 2018-03-23 | Face detection, representation, and recognition

Applications Claiming Priority (1)

Application Number | Priority Date | Filing Date | Title
US15/065,021 (US10043058B2) | 2016-03-09 | 2016-03-09 | Face detection, representation, and recognition

Related Child Applications (1)

Application Number | Title | Priority Date | Filing Date
US15/933,894 (Continuation, US10346676B2) | Face detection, representation, and recognition | 2016-03-09 | 2018-03-23

Publications (2)

Publication Number | Publication Date
US20170262695A1 (en) | 2017-09-14
US10043058B2 (en) | 2018-08-07

Family

ID=59786904

Family Applications (2)

Application Number | Title | Priority Date | Filing Date
US15/065,021 (US10043058B2, Active, adjusted expiration 2036-04-19) | Face detection, representation, and recognition | 2016-03-09 | 2016-03-09
US15/933,894 (US10346676B2, Active) | Face detection, representation, and recognition | 2016-03-09 | 2018-03-23

Family Applications After (1)

Application Number | Title | Priority Date | Filing Date
US15/933,894 (US10346676B2, Active) | Face detection, representation, and recognition | 2016-03-09 | 2018-03-23

Country Status (1)

Country | Link
US (2) | US10043058B2 (en)

Families Citing this family (44)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
US10799168B2 (en)* | 2010-06-07 | 2020-10-13 | Affectiva, Inc. | Individual data sharing across a social network
US10558848B2 (en) | 2017-10-05 | 2020-02-11 | Duelight Llc | System, method, and computer program for capturing an image with correct skin tone exposure
CN106778453B (en)* | 2015-11-25 | 2020-05-12 | 腾讯科技(深圳)有限公司 | Method and device for detecting glasses wearing in face image
CN106056562B (en)* | 2016-05-19 | 2019-05-28 | 京东方科技集团股份有限公司 | A kind of face image processing process, device and electronic equipment
US10346464B2 (en)* | 2016-09-27 | 2019-07-09 | Canon Kabushiki Kaisha | Cross-modiality image matching method
US10198626B2 (en) | 2016-10-19 | 2019-02-05 | Snap Inc. | Neural networks for facial modeling
US20180129868A1 (en)* | 2016-11-08 | 2018-05-10 | Qualcomm Incorporated | System and method associated with object verification
US10474883B2 (en)* | 2016-11-08 | 2019-11-12 | Nec Corporation | Siamese reconstruction convolutional neural network for pose-invariant face recognition
US10235771B2 (en)* | 2016-11-11 | 2019-03-19 | Qualcomm Incorporated | Methods and systems of performing object pose estimation
CN106780662B (en)* | 2016-11-16 | 2020-09-18 | 北京旷视科技有限公司 | Face image generation method, device and device
CN106780658B (en) | 2016-11-16 | 2021-03-09 | 北京旷视科技有限公司 | Face feature adding method, device and equipment
WO2018126275A1 (en)* | 2016-12-30 | 2018-07-05 | Dirk Schneemann, LLC | Modeling and learning character traits and medical condition based on 3d facial features
US10832035B2 (en)* | 2017-06-22 | 2020-11-10 | Koninklijke Philips N.V. | Subject identification systems and methods
CN109214238B (en)* | 2017-06-30 | 2022-06-28 | 阿波罗智能技术(北京)有限公司 | Multi-target tracking method, device, equipment and storage medium
CN107633207B (en)* | 2017-08-17 | 2018-10-12 | 平安科技(深圳)有限公司 | AU characteristic recognition methods, device and storage medium
US10579785B2 (en)* | 2017-09-29 | 2020-03-03 | General Electric Company | Automatic authentification for MES system using facial recognition
CN107729928B (en)* | 2017-09-30 | 2021-10-22 | 百度在线网络技术(北京)有限公司 | Information acquisition method and device
US10510157B2 (en)* | 2017-10-28 | 2019-12-17 | Altumview Systems Inc. | Method and apparatus for real-time face-tracking and face-pose-selection on embedded vision systems
CN107669282A (en)* | 2017-11-19 | 2018-02-09 | 济源维恩科技开发有限公司 | A lie detector based on face recognition
CN108510061B (en)* | 2018-03-19 | 2022-03-29 | 华南理工大学 | Method for synthesizing face by multiple monitoring videos based on condition generation countermeasure network
CN110363047B (en)* | 2018-03-26 | 2021-10-26 | 普天信息技术有限公司 | Face recognition method and device, electronic equipment and storage medium
CN108764041B (en)* | 2018-04-25 | 2021-09-14 | 电子科技大学 | Face recognition method for lower shielding face image
CN108664920B (en)* | 2018-05-10 | 2022-12-20 | 东方网力科技股份有限公司 | Real-time large-scale cascading face clustering method and device
US11163981B2 (en)* | 2018-09-11 | 2021-11-02 | Apple Inc. | Periocular facial recognition switching
US11367305B2 (en) | 2018-09-28 | 2022-06-21 | Apple Inc. | Obstruction detection during facial recognition processes
RU2697646C1 (en) | 2018-10-26 | 2019-08-15 | Самсунг Электроникс Ко., Лтд. | Method of biometric authentication of a user and a computing device implementing said method
CN109598203A (en)* | 2018-11-06 | 2019-04-09 | 平安科技(深圳)有限公司 | Photo adding method, device, terminal and storage medium based on recognition of face
CN109800787B (en)* | 2018-12-14 | 2020-12-29 | 西安交通大学 | Image Template Matching Method Based on Relative Feature Distance Error Metric
RU2724783C1 (en)* | 2018-12-28 | 2020-06-25 | Акционерное общество "Лаборатория Касперского" | Candidate fingerprint matching and comparison system and method
CN109934114B (en)* | 2019-02-15 | 2023-05-12 | 重庆工商大学 | Finger vein template generation and updating algorithm and system
US10872258B2 (en) | 2019-03-15 | 2020-12-22 | Huawei Technologies Co., Ltd. | Adaptive image cropping for face recognition
CN110084191B (en)* | 2019-04-26 | 2024-02-23 | 广东工业大学 | Eye shielding detection method and system
US11080884B2 (en)* | 2019-05-15 | 2021-08-03 | Matterport, Inc. | Point tracking using a trained network
CN110414444A (en)* | 2019-07-31 | 2019-11-05 | 中国工商银行股份有限公司 | Face identification method and device
FR3101988B1 (en)* | 2019-10-10 | 2021-10-22 | Sispia | PROCESS FOR RECOGNIZING OBJECTS WITH INCREASED REPRESENTATIVITY
US11610436B2 (en)* | 2020-04-08 | 2023-03-21 | Nec Corporation | Efficient watchlist searching with normalized similarity
CN111340013B (en)* | 2020-05-22 | 2020-09-01 | 腾讯科技(深圳)有限公司 | Face recognition method and device, computer equipment and storage medium
CN111783592B (en)* | 2020-06-24 | 2022-07-08 | 杭州十域科技有限公司 | Face recognition method combining incomplete face information with position data
CN111797746B (en)* | 2020-06-28 | 2024-06-14 | 北京小米松果电子有限公司 | Face recognition method, device and computer readable storage medium
JP7512844B2 (en)* | 2020-10-29 | 2024-07-09 | オムロン株式会社 | Learning method, trained model, detection system, detection method, and program
US20220207282A1 (en)* | 2020-12-28 | 2022-06-30 | Fortinet, Inc. | Extracting regions of interest for object detection acceleration in surveillance systems
CN112418190B (en)* | 2021-01-21 | 2021-04-02 | 成都点泽智能科技有限公司 | Mobile terminal medical protective shielding face recognition method, device, system and server
CN113361649B (en)* | 2021-07-08 | 2024-04-02 | 南京邮电大学 | Autonomous ship navigation scene clustering method for improving fuzzy C-means algorithm
US12113897B2 (en) | 2022-08-11 | 2024-10-08 | Nametag Inc. | Systems and methods for storing biometric images as profile images in an authentication profile

Family Cites Families (16)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
US5729619A (en)* | 1995-08-08 | 1998-03-17 | Northrop Grumman Corporation | Operator identity, intoxication and drowsiness monitoring system and method
US6208379B1 (en)* | 1996-02-20 | 2001-03-27 | Canon Kabushiki Kaisha | Camera display control and monitoring system
US5991429A (en)* | 1996-12-06 | 1999-11-23 | Coffin; Jeffrey S. | Facial recognition system for security access and identification
US7522186B2 (en)* | 2000-03-07 | 2009-04-21 | L-3 Communications Corporation | Method and apparatus for providing immersive surveillance
US7161615B2 (en)* | 2001-11-30 | 2007-01-09 | Pelco | System and method for tracking objects and obscuring fields of view under video surveillance
US7715595B2 (en)* | 2002-01-16 | 2010-05-11 | Iritech, Inc. | System and method for iris identification using stereoscopic face recognition
JP3999561B2 (en)* | 2002-05-07 | 2007-10-31 | 松下電器産業株式会社 | Surveillance system and surveillance camera
US7421097B2 (en)* | 2003-05-27 | 2008-09-02 | Honeywell International Inc. | Face identification verification using 3 dimensional modeling
US8705808B2 (en)* | 2003-09-05 | 2014-04-22 | Honeywell International Inc. | Combined face and iris recognition system
US7428314B2 (en)* | 2003-12-03 | 2008-09-23 | Safehouse International Inc. | Monitoring an environment
GB2409029A (en) | 2003-12-11 | 2005-06-15 | Sony Uk Ltd | Face detection
US8224018B2 (en)* | 2006-01-23 | 2012-07-17 | Digimarc Corporation | Sensing data from physical objects
US8121356B2 (en)* | 2006-09-15 | 2012-02-21 | Identix Incorporated | Long distance multimodal biometric system and method
EP2198552A1 (en)* | 2007-09-28 | 2010-06-23 | Eye Controls, LLC. | Systems and methods for biometric identification
US10043058B2 (en)* | 2016-03-09 | 2018-08-07 | International Business Machines Corporation | Face detection, representation, and recognition
US10423830B2 (en) | 2016-04-22 | 2019-09-24 | Intel Corporation | Eye contact correction in real time using neural network based machine learning

Patent Citations (20)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
US6381346B1 (en)* | 1997-12-01 | 2002-04-30 | Wheeling Jesuit University | Three-dimensional face identification system
US7596247B2 (en)* | 2003-11-14 | 2009-09-29 | Fujifilm Corporation | Method and apparatus for object recognition using probability models
US8199979B2 (en)* | 2004-01-22 | 2012-06-12 | DigitalOptics Corporation Europe Limited | Classification system for consumer digital images using automatic workflow and face detection and recognition
US8116534B2 (en)* | 2006-05-29 | 2012-02-14 | Kabushiki Kaisha Toshiba | Face recognition apparatus and face recognition method
US8848975B2 (en)* | 2007-03-05 | 2014-09-30 | Seeing Machines Pty Ltd | Efficient and accurate 3D object tracking
US8331616B2 (en)* | 2007-08-27 | 2012-12-11 | Sony Corporation | Face image processing apparatus, face image processing method, and computer program
US20160292494A1 (en)* | 2007-12-31 | 2016-10-06 | Applied Recognition Inc. | Face detection and recognition
US20140112541A1 (en)* | 2008-03-24 | 2014-04-24 | Verint Systems Ltd. | Method and System for Edge Detection
US9210313B1 (en)* | 2009-02-17 | 2015-12-08 | Ikorongo Technology, LLC | Display device content selection through viewer identification and affinity prediction
US20110293189A1 (en)* | 2010-05-28 | 2011-12-01 | Microsoft Corporation | Facial Analysis Techniques
US9042599B2 (en)* | 2010-07-02 | 2015-05-26 | Intel Corporation | Techniques for face detection and tracking
US20140328509A1 (en)* | 2011-12-04 | 2014-11-06 | Digital Makeup Ltd | Digital makeup
US20130259310A1 (en)* | 2012-03-30 | 2013-10-03 | Canon Kabushiki Kaisha | Object detection method, object detection apparatus, and program
US20130266193A1 (en)* | 2012-04-09 | 2013-10-10 | Accenture Global Services Limited | Biometric matching technology
US20140270482A1 (en)* | 2013-03-15 | 2014-09-18 | Sri International | Recognizing Entity Interactions in Visual Media
US20150091900A1 (en)* | 2013-09-27 | 2015-04-02 | Pelican Imaging Corporation | Systems and Methods for Depth-Assisted Perspective Distortion Correction
US20150110349A1 (en)* | 2013-10-22 | 2015-04-23 | Samsung Electronics Co., Ltd. | Face tracking apparatuses and methods
US20150317511A1 (en)* | 2013-11-07 | 2015-11-05 | Orbeus, Inc. | System, method and apparatus for performing facial recognition
US20150169938A1 (en)* | 2013-12-13 | 2015-06-18 | Intel Corporation | Efficient facial landmark tracking using online shape regression method
US20160140146A1 (en)* | 2014-11-14 | 2016-05-19 | Zorroa Corporation | Systems and Methods of Building and Using an Image Catalog

Non-Patent Citations (10)

* Cited by examiner, † Cited by third party
Title
Ayazoglu et al., "Fast algorithms for structured robust principal component analysis", Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference, pp. 1704-1711.*
Cao et al., "Face Alignment by Explicit Shape Regression", International Journal of Computer Vision, Springer Science, Dec. 2012, pp. 1-14.*
Ding et al., "Continuous Pose Normalization for Pose-Robust Face Recognition", IEEE Signal Processing Letters, vol. 19, no. 11, Nov. 2012, pp. 721-724.
El Shafey et al., "A Scalable Formulation of Probabilistic Linear Discriminant Analysis: Applied to Face Recognition", IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 35, issue 7, Jul. 2013, pp. 1788-1794 (supplemental pp. 1-3).
Farag, "Face recognition in the wild", University of Louisville, ThinkIR: The University of Louisville's Institutional Repository, Electronic Theses and Dissertations, Dec. 2013, 123 pages.
Felzenszwalb et al., "Object Detection with Discriminatively Trained Part-Based Models", IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 32, no. 9, Sep. 2010, pp. 1627-1645.
Mostafa et al., "Dynamic Weighting of Facial Features for Automatic Pose-Invariant Face Recognition", 2012 Ninth Conference on Computer and Robot Vision, pp. 411-416, © 2012 IEEE.
Mostafa et al., "Pose Invariant Approach for Face Recognition at Distance", ECCV 2012, Part VI, LNCS 7577, pp. 15-28, © Springer-Verlag Berlin Heidelberg 2012.
Peng et al., "RASL: Robust Alignment by Sparse and Low-rank Decomposition for Linearly Correlated Images", Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference, pp. 1-8.*
Yan et al., "The Fastest Deformable Part Model for Object Detection", 2014 IEEE Conference on Computer Vision and Pattern Recognition, Jun. 24-28, 2014, pp. 2497-2504.

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
US12401911B2 (en) | 2014-11-07 | 2025-08-26 | Duelight Llc | Systems and methods for generating a high-dynamic range (HDR) pixel stream
US12401912B2 (en) | 2014-11-17 | 2025-08-26 | Duelight Llc | System and method for generating a digital image
US12418727B2 (en) | 2014-11-17 | 2025-09-16 | Duelight Llc | System and method for generating a digital image
US10346676B2 (en)* | 2016-03-09 | 2019-07-09 | International Business Machines Corporation | Face detection, representation, and recognition
US10497014B2 (en)* | 2016-04-22 | 2019-12-03 | Inreality Limited | Retail store digital shelf for recommending products utilizing facial recognition in a peer to peer network
US11157138B2 (en)* | 2017-05-31 | 2021-10-26 | International Business Machines Corporation | Thumbnail generation for digital images
US11169661B2 (en) | 2017-05-31 | 2021-11-09 | International Business Machines Corporation | Thumbnail generation for digital images
US10896318B2 (en)* | 2017-09-09 | 2021-01-19 | Apple Inc. | Occlusion detection for facial recognition processes
US20210133428A1 (en)* | 2017-09-09 | 2021-05-06 | Apple Inc. | Occlusion detection for facial recognition processes
US11521423B2 (en)* | 2017-09-09 | 2022-12-06 | Apple Inc. | Occlusion detection for facial recognition processes
US11683448B2 (en) | 2018-01-17 | 2023-06-20 | Duelight Llc | System, method, and computer program for transmitting face models based on face data points
US12445736B2 (en) | 2024-10-30 | 2025-10-14 | Duelight Llc | Systems and methods for generating a digital image

Also Published As

Publication number | Publication date
US10346676B2 (en) | 2019-07-09
US20170262695A1 (en) | 2017-09-14
US20180211101A1 (en) | 2018-07-26

Similar Documents

Publication | Title
US10346676B2 (en) | Face detection, representation, and recognition
US10776470B2 (en) | Verifying identity based on facial dynamics
CN108319938B (en) | High-quality training data preparation system for high-performance face recognition systems
EP3023911B1 (en) | Method and apparatus for recognizing object, and method and apparatus for training recognizer
Chakraborty et al. | An overview of face liveness detection
Rathod et al. | Automated attendance system using machine learning approach
US8064653B2 (en) | Method and system of person identification by facial image
CN109165593B (en) | Feature extraction and matching and template update for biometric authentication
US20170262472A1 (en) | Systems and methods for recognition of faces e.g. from mobile-device-generated images of faces
CN108090406B (en) | Face recognition method and system
WO2020164278A1 (en) | Image processing method and device, electronic equipment and readable storage medium
KR101117549B1 (en) | Face recognition system and method thereof
CN109815823B (en) | Data processing method and related product
Fang et al. | Real-time hand posture recognition using hand geometric features and fisher vector
US11475684B1 (en) | Methods and systems for performing noise-resistant computer vision techniques
CN116266419A (en) | Living body detection method and device and computer equipment
Ahmed et al. | Eye detection and localization in a facial image based on partial geometric shape of iris and eyelid under practical scenarios
Gürel et al. | Design of a face recognition system
CN115862055B (en) | Pedestrian re-recognition method and device based on contrast learning and countermeasure training
Ragesh et al. | Fast r-cnn based masked face recognition for access control system
JP7141518B2 (en) | Finger vein matching method, device, computer equipment, and storage medium
John et al. | Real time blink recognition from various head pose using single eye
Chandel et al. | A comparison of face landmark detection techniques
Sharma et al. | Face recognition using face alignment and PCA techniques: a literature survey
CN118537900A (en) | Face recognition method and device, electronic equipment and storage medium

Legal Events

Date | Code | Title | Description
AS | Assignment

Owner name: INTERNATIONAL BUSINESS MACHINES CORPORATION, NEW YORK

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:AHMED, MOHAMED N.;REEL/FRAME:037933/0747

Effective date: 20160304

STCF | Information on status: patent grant

Free format text: PATENTED CASE

AS | Assignment

Owner name: KIA CORPORATION, KOREA, REPUBLIC OF

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:INTERNATIONAL BUSINESS MACHINES CORPORATION;REEL/FRAME:058780/0252

Effective date: 20220111

Owner name: HYUNDAI MOTOR COMPANY, KOREA, REPUBLIC OF

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:INTERNATIONAL BUSINESS MACHINES CORPORATION;REEL/FRAME:058780/0252

Effective date: 20220111

MAFP | Maintenance fee payment

Free format text: PAYMENT OF MAINTENANCE FEE, 4TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1551); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

Year of fee payment: 4

