|
|
|
Face Recognition Technology Whitepaper |
|
SCOPE FingerTec presented an automatic face recognition algorithm by combining 2D and 3D local features ensure accuracy and security when used as an authentication method. FingerTec technology is the foundation for all face recognition solutions from FingerTec and operates seamlessly with many third-party security applications, smart cards and biometric readers on the market. This article describes the principles and advantages of FingerTec technology. |
|
|
Face recognition has become one of the most important biometrics authentication technologies in the past few years. Two main reasons for extensive attention on face recognition technology are: 1) Aptness in various applications including in content-based video processing system, law enforcement system and in security systems. A strong need for a robust automatic system is obvious due the widespread use of photo-ID for personal identification and security 2) although there are reliable methods of biometrics identifications existed such as fingerprint scans and iris scans, face recognition is proven effective for its user-friendliness. The system does not require its users to do anything; it is contactless. On top, as one of the core components, the maturity of the digital camera technology with competitive price is also a contributing factor to the strong emergence of face recognition technology. Most of the face recognition techniques have evolved in order to overcome two main challenges: illumination and pose variation. Either of these problems can cause serious performance degradation in a face recognition system. Illumination can change the appearance of an object drastically, and in the most of the cases these differences induced by illumination are larger than differences between individuals, what makes difficult the recognition task. The same statement is valid for pose variation. Usually, the training data used by face recognition systems are frontal view face images of individuals. Frontal view images contain more specific information of a face than profile or other pose angle images. The problem appears when the system has to recognize a rotated face using this frontal view training data. Furthermore, the appearance of a face can also change drastically if the illumination conditions vary. Therefore, pose and illumination (among other challenges) are the main causes for the degradation of 2D face recognition algorithms. Some of the new face recognition strategies tend to overcome both challenges from a 3D perspective. The 3D data points corresponding to the surface of the face may be acquired using different alternatives: a multi camera system (stereoscopy), structured light, range cameras or 3D laser and scanner devices. The main advantage of using 3D data is that depth information does not depend on pose and illumination and therefore the representation of the object do not change with these parameters, making the whole system more robust. However, the main drawback of the majority of 3D face recognition approaches is that they need all the elements of the system to be well calibrated and synchronized to acquire accurate 3D data (texture and depth maps). Moreover, most of them also require the cooperation or collaboration of the subject making them not useful for uncontrolled or semi-controlled scenarios where the only input of the algorithms will be a 2D intensity image acquired from a single camera. This article discusses two main algorithm families commonly used to recognize faces: two-dimensional based and three-dimensional based recognition. Both of these two algorithms recognize faces images in different ways; two-dimensional algorithm is based on information theory concepts, seeks a computational model that best describes a face by extracting the most relevant information contained in that face while three-dimensional facial geometry represents the internal anatomical structure of the face rather than its external appearance influenced by environmental factors. As will be shown in this article, both algorithms have advantages and disadvantages. FingerTec continued research and development work has led to a more accuracy and robust face recognition technology, the FingerTec Face Recognition solution.
Over the past decades, FingerTec has concentrated on developing face recognition methods within the framework of biometrics security systems and is now applying face recognition technology to other markets. FingerTec face recognition technology can be implemented as a functionally independent application, or seamlessly integrated into new or existing biometrics security solutions by system integrators and solution providers.
FingerTec face recognition presented a novel and highly descriptive 2D-3D mixed local feature and demonstrated its performance on a challenging interclass recognition problem. By combining the 2D and 3D local features, it provides high speed and high accuracy for facial detection and facial features extraction and achieved a significant improvement in performance. Moreover, the combined performance deterioration is significantly lower than that of the individual features. |
|
|
|
Two Dimensional Face Recognition - PCA |
Principal component analysis (PCA) is one of the widely used 2D face recognition algorithm. It is based on information theory concepts, seeks a computational model that best describes a face by extracting the most relevant information contained in that face. The Eigenfaces approach is a PCA method, in which a small set of characteristic pictures are used to describe the variation between face images. The goal is to find the eigenvectors (eigenfaces) of the covariance matrix of the distribution, spanned by training a set of face images. Later, every face image is represented by a linear combination of these eigenvectors. Recognition is performed by projecting a new image onto the subspace spanned by the eigenfaces and then classifying the face by comparing its position in the face space with the positions of known individuals. The PCA-eigenfaces system capture the image and change it to light and dark areas. Both the initial facial image and the facial image in question are also captured in a two-deimensional form. Then, the two images are compared according to the points of the two eigenface image. It picks out certain features and calculates the distances between them. The points are the facial features such as eyes, nose, mouth, bone curves, and other distinct features. The eigenface algorithm firstly forms overall average image. This is the image just adding all images and dividing by number of images in training set. And the eigenvectors of covariance matrix that is formed by combining all deviations of training set’s images from average iamge is formed in order to applu eigenfaces algorithm. After finding overall average image, the order is to find eigenvectors of the covariance matrix. Visualization of eigenvectors
is carried out simply applying a quantization that is if the found eigenvectors have components that are greater than 255 and smaller than 0 round them to 255, and 0 respectively. |
|
|
Figure 1: 2D Face Recognition Process |
|
|
Pros:
• Fast, needs lesser amount of memory for identification
• Image template size small |
|
|
Cons:
• 2D face recognition algorithm is sensitive to lighting, head orientations, facial expressions and makeup
• 2D images contain limited information. |
|
|
|
|
Three Dimensional Face Recognition |
Three-dimensional face recognition (3D face recognition) is a modality of facial recognition methods in which the three-dimensional geometry of the human face is used. 3D face recognition has the potential to achieve better accuracy than its 2D counterpart by measuring geometry of rigid features on the face. This avoids such pitfalls of 2D face recognition algorithms as change in lighting, different facial expressions, make-up and head orientation. Another approach is to use the 3D model to improve accuracy of traditional image based recognition by transforming the head into a known view. Additionally, most range scanners acquire both a 3D mesh and the corresponding texture. This allows combining the output of pure 3D matchers with the more traditional 2D face recognition algorithms, thus yielding better performance (as shown in FRVT 2006). The main technological limitation of 3D face recognition methods is the acquisition of 3D images, which usually requires a range camera. This is also a reason why 3D face recognition methods have emerged significantly later (in the late 1980s) than 2D methods. Recently commercial solutions have implemented depth perception by projecting a grid onto the face and integrating video capture of it into a high resolution 3D model. This allows for good recognition accuracy with low cost off-the-shelf components. |
|
|
Pros:
• 3D Representation of face is less susceptible to isometric deformations (expression changes)
• 3D approach overcomes
problem of large facial orientation changes
• 3D model retains all the information about the facial features, a more accurate
representation of the facial features leads to potentially
higer discriminating power
• Robustness to lighting and angles up to 45º |
|
|
Cons:
• Less nimble at processing large crowds templates
• Computational cost of
processin 3D data is higer than for 2D data |
|
|
|
Figure 2: 3D Face Recognition Process |
|
|
3D Face recognition system consists of four modules: Device Module, Data Processing Module, Feature Extraction Module and Matching Engine Module. The Device Module acquires initial 3D facial data by a 3D surface scanner VGA camera and transfers it to the processor. After receiving raw data (the distorted pattern on the target object), the Data Processing Module performs image filtering (noise reduction) and then instantly reconstructs the 3D surface, smoothing and Interpolating data to avoid holes and optimize the mesh. The Feature Extraction Module receives the optimized 3D surface for further feature vector (biometric template) extraction. During biometric template extraction, a proprietary twostage algorithm is used. At the first stage, the surface “semantic analysis” is performed, resulting in the location of key crania-facial landmarks (points) on the facial surface and the fitting of the surface to a generic topological map of the face. At the second stage, when the location of specific surface patches (eye-sockets, super ciliary’s arches, forehead zone, nasolabial zone, chin zone, etc.) is known, information about local surface curvature characteristics is extracted. This local curvature information is used further to build a single geometric descriptor and packs this data into a biometric template. The output of the module is a biometric template uniquely characterizing the person, which is used in the
next matching stage. The Matching Engine working in identification mode compares the extracted biometric template with all of the stored templates in the database and produces a similarity score for each of the stored templates. The template with the best similarity score is the top match. |
|
|
|
|
2D vs 3D Face Recognition |
|
Two Dimention 2D |
Three Dimention 3D |
Definition |
The eigenvectors of the two-dimensional vector space of possible faces of human beings. |
a class of methods that work on a 3D dataset, representing both face and head shape as range data or polygonal meshes. |
How it works |
An initial set of 2D face images were acquired. The Eigenfaces were calculated from the training set. Only M Eigenfaces corresponding to the M largest eigenvalues were retained. These Eigenfaces spanned the face space which constituted of the training set.The M Eigenfaceweights were calculated for each training image by projecting the image onto face space spanned by the Eigenfaces. Each face image then will be represented by M weights- an extremely compact representation. |
The Device Module acquires initial 3D facial data and transfers it to the processor. The Data Processing Module reconstructs the 3D surface for further recognition. The Feature Extraction Module builds the feature vector (biometric template), based on the 3D surface reconstructed for further use in the matching
process. The Matching Engine Module provides a comparison of acquired and previously enrolled biometric templates. |
Template size |
Small template size (800 bytes to 2 kilobytes). |
Compact biometric template extracted (2-4 kilobytes). |
Template Desctipton |
Face features locations, texture or combination. NOT a real measument. |
Description of face shape in 3D face geometry, full features description. Gound-based measurement (sub-milimetre) |
Liveness Testing |
May be spoofed by photo or video. |
NOT to be spoofed by video
or photo. |
FAR |
0.001 (FRVT2006 result) |
0.001 (FRVT2006 result) |
FRR |
0.010-0.017 (FRVT2006 result) |
0.005-0.015(FRVT2006 result) |
Accuracy |
Medium, fully depending to image resolution. |
High, not so depending to image resolution. |
Sensitivity |
Sensitive to lighting, pose, makeup or expressions. |
Insensitive to lighting, make up and pose up to 45º.Sensitive to expressions. |
Standard |
ANSI INCITS 385-2004ISO – 19794-5 FDIS |
ANSI INCITS 385-2004ISO/IEC JTC1 SC37 WG3 |
Leading Vendors |
Neven-Vision, Sagem, FingerTec |
Identix®, FingerTec |
|
|
FingerTec Face Recognition Algorithm |
Both 2D and 3D face recognition techniques are used in FingerTec face recognition algorithm technology. This leads to a new paradigm using the mixed 2D-3D face recognition systems where 3D data is used in the training but either 2D or 3D information can be used in the recognition depending on the scenario. Following this concept, where only part of the information (partial concept) is used in the recognition, a novel method is presented in this work. This has been called Partial Principal Component Analysis (P2CA) since it fuses the partial concept with the fundamentals of the well known PCA algorithms. Both strategies have been proven to be very robust in pose variation scenarios showing that the 3D training process retains all the spatial information of the face while the 2D picture effectively recovers the face information from the available data. Simulation results have shown recognition rates above 91% when using face images with a view range of 45º around the human face in the training stage and 2D face pictures taken from different angles (from -45º to +45º) in the recognition stage.
Benefits of the FingerTec face recognition algorihtm: |
General Block Diagram of 2D-3D Mixed Face Recognition |
|
|
|
|
1. |
FingerTec face recognition is able to process maximum 1 to 20000 faces for 1 to N identification,
there is no requirement to enter a name or a PIN. For the 1 to 1 identification, it can
be carried with ease within 0.9 second with 60000 faces. (Based on embedded machine
CPU, 630 MHz). |
|
|
2. |
The FingerTec face recognition current 2D-3D mixed approach provides a measure for
automatic and robust estimation of input stream quality. This measure is computationally
efficient and allows for estimation of the quality of input surface and attribution of it to one
of tow classes: face or not a face. This means that not only enrollment can be automatically
controlled but also all subsequent face acquisitions. |
|
|
3. |
FingerTec face recognition is more robust to different view angles between the enrollment
and captured shots, with robust recognition up to 45°. Therefore, FingerTec® approach
has the potential to work with higher accuracy in real work environments.; |
|
|
4. |
The FingerTec face recognition products cannot be spoofed by video or photograph images.
In addition, it is extremely difficult to fool the system with 3D dummy or mask, as
a precise stereo-lithographic model is required with the same sub-millimeter geometric
measurements. In addition, the light pattern in the near infrared range is reflected and diffracted
in a specific manner against human skin. |
|
|
5. |
FingerTec biometric template are optimized according to some given criteria, the image
polygonal mesh is built from the cloud of the 2D and 3D points and the size is less than 5
kilobytes which reduced the storage requirements and enhanced the processing time. |
|
|
6. |
FingerTec face recognition able to process 10-12 full capturing-matching cycles per second
for extremely low False Rejection Rates (FRR) and False Acceptance Rate (FAR) which is
the leader in the processing and accuracy. |
|
|
|
|
Algorithm Performance |
FingerTec has been gone through many tests based on different image capturing resolution, lighting environment, poses angles where 100000 faces images are obtained to test the performance of the FingerTec face recognition algorithm in the past 2 years. The summary of the results as below: |
Face Enrollment Results:
• 98.9% automatic enrollment
• 1.1% required manual support
• 0% failure to enroll
• average 5-10 seconds enrollment time |
|
Face Verification Results:
• False Accept Rate (FAR) <= 0.0001%
• False Reject Rate (FRR) <= 1%
• Matching speed <= 2 seconds |
|
Face processing is one of the most active research fields as demonstrated by more than 1000 publications that have appeared in different conferences and journals in the last few years. Additionally, it is also a mature topic with more than 30 years. Recently, a new trend of 3D face recognition approaches showed an increase in the recognition rate if 3D data is available. Nevertheless, cost of the set-up, acquisition time and cooperation of the subjects are still some of the requirements for obtaining accurate 3D data that may not be available during the recognition stage. Thus, we have presented a mixed 2D-3D face recognition philosophy, the system is trained with 3D data but it can use either 2D or 3D data in the test stage. We have presented the extension of the 2D statistical PCA method to a 2D-3D face recognition scheme (Partial Principal Component Analysis). However, this philosophy may be extended also to other face recognition statistical approaches like LDA or ICA with have shown a higher robustness in the presence of illumination variations. Additionally, we have presented an automatic approach for the creation of aligned virtual view images using nine different views. These aligned virtual view images are used as training data for the 2D-3D mixed technique. The virtual view image is created by using a cylindrical approximation for the real object surface. The alignment is done by global and local transformations of the whole image and face features, respectively. Results show an improvement in the recognition rate when using the local alignment procedure proposed. FingerTec will continuously run field test of the complete system to get statistics for continued improvement of the 2D-3D mixed face recognition performance. |
|
|
|