Off-line verification technique for Hindi signatures - IEEE Xplore

12 downloads 269 Views 590KB Size Report
Oct 9, 2013 - Abstract: Handwritten signature is one of the oldest biometric attributes used for authentication of an individual or a document. The purpose of ...
www.ietdl.org Published in IET Biometrics Received on 30th January 2013 Revised on 9th October 2013 Accepted on 9th October 2013 doi: 10.1049/iet-bmt.2013.0016

Special Issue: Selected papers from International Conference on Frontiers in Handwriting Recognition (ICFHR 2012) ISSN 2047-4938

Off-line verification technique for Hindi signatures Srikanta Pal1, Umapada Pal2, Michael Blumenstein1 1

School of Information and Communication Technology, Griffith University, Gold Coast, Queensland, Australia Computer Vision and Pattern Recognition Unit, Indian Statistical Institute, Kolkata, West Bengal 700108, India E-mail: srikanta.pal@griffithuni.edu.au

2

Abstract: Handwritten signature is one of the oldest biometric attributes used for authentication of an individual or a document. The purpose of this study is to present an empirical contribution towards the understanding of signature verification using a novel method involving off-line Hindi (Devnagari) signatures. Although research in the field of signature verification involving Western signatures has been well studied, there has been relatively little attention devoted to non-Western signatures such as Chinese, Japanese, Arabic, Persian etc. In this study, the performance of an off-line signature verification system involving Hindi signatures, whose style is distinct from Western scripts, was investigated. The gradient feature, Zernike moment features and SVMs were considered for verification. To the best of the authors’ knowledge, Hindi signatures investigated as part of a large dataset have never been used for the task of signature verification, and this research work is only the second important report using Hindi signatures in this area. An encouraging accuracy of 90.69% was obtained using gradient feature. The Hindi signature database employed for experimentation consisted of 2400 (100 × 24) genuine signatures and 3000 (100 × 30) forgeries. The error rates of 11.50% FRR and 7.12% FAR were obtained through experimentation using gradient features.

1

Introduction

Today, biometric technologies are increasingly being used to ensure identity verification or the authorisation of access to sensitive data. Among all of the biometric authentication systems, handwritten signatures have been accepted as an official means to verify personal identity for legal purposes on such documents as cheques, credit cards and wills [1]. Signature verification has been a topic of renewed intensive research over the past several years [2] because of the important role it plays in numerous areas, including in financial applications. An automatic signature verification system can be classified into two categories [3, 4] called on-line and off-line. In an on-line technique, signatures are created on a digitiser and dynamic information such as speed and pressure is captured in addition to a static image of the signature [5, 6]. In an off-line technique, signatures are inscribed on a piece of paper and then scanned to obtain the signature image [7]. Off-line signature verification deals with the verification of signatures, which appear in a static format [8]. Verification decisions are usually based on local or global features extracted from processed signatures. Excellent verification results can be achieved by comparing the robust features of the test signature with that of the user’s signature using appropriate classifiers [9]. The use of signatures has been one of the more convenient methods for the identification and verification of human beings. Signatures are considered a complete image with a special distribution of pixels, and a particular writing style. 182 & The Institution of Engineering and Technology 2013

It is not considered as a collection of letters and words [10]. A person’s signature may change radically during their lifetime. Great inconsistency can even be observed in signatures according to country, habits, psychological or mental state, physical and practical conditions [11]. The remainder of this paper is organised as follows. Some related work is described in Section 2. The concept of signature verification is introduced in Section 3. Section 4 deals with the importance of Hindi signature verification. The Hindi signature database developed for the current research is described in Section 5. Pre-processing steps are described in Section 6. Some notable properties of Devnagari script are introduced in Section 7. Section 8 briefly describes the feature extraction techniques employed in this work. Details of the classifiers used are presented in Section 9. The experimental settings are presented in Section 10. Results and discussion are given in Section 11. Section 12 deals with the comparison of performance. Error analyses are described in Section 13. Finally, conclusions and future work are discussed in Section 14.

2

Related work

There is a limited amount of work in the area involving off-line verification of non-Western signatures. A few of the signature verification techniques are taken into account and illustrated here. Pal et al. [12] presented a signature identification technique for bi-script off-line signatures. In that proposed signature identification method, the signatures of Bangla (Bengali) and English are considered for the IET Biom., 2013, Vol. 2, Iss. 4, pp. 182–190 doi: 10.1049/iet-bmt.2013.0016

www.ietdl.org identification process. That report was an important contribution involving the Indian regional language in the signature identification/verification area. In that experiment, different features such as under-sampled bitmaps, modified chain code features and the gradient feature were computed from both background and foreground components and were employed for the purpose of signature identification. Support vector machines (SVMs) and nearest neighbour (NN) techniques were considered as classifiers for signature identification in the proposed system. A database of 1554 English signatures and 1092 Bengali signatures were used to generate the experimental results. In one of the papers by Pal et al. [13], a signature verification system based on a local language in India was introduced. In that paper, the performance of an off-line signature verification system involving Bangla (Bengali) signatures, whose style is distinct from Western scripts, was investigated. The Bangla signature database employed in the experiments consisted of 3000 forgeries and 2400 genuine signatures. That Bangla signature database, used for the task of signature verification was a novel contribution to the field of off-line signature verification. The Gaussian grid feature extraction technique was employed for feature extraction and SVMs were considered for classification. In another paper by Pal et al. [14], multi-script signature identification and verification involving English and Hindi signatures was presented. The intention of that paper was to present an empirical contribution towards the understanding of multi-script signature verification. In that paper, multi-script signatures were identified first based on the signature script and after that the experiment for verification was performed based on the identified result. Two different results for identification and verification were determined and analysed. In recent times, Pal et al. [15] introduced a signature verification system employing Hindi signatures. The purpose of that paper was to present an investigation of the performance of a signature verification system involving Hindi off-line signatures. In that study, two important features such as: gradient feature, Zernike moment feature and SVM classifiers were employed. Thirty sets of Hindi signature images were considered for experimentation. Each set of signatures consisted of 30 skilled forgeries and 24 genuine signatures. For each signature set, the classifier was trained with 14 randomly chosen genuine signatures. The negative samples for training were the 20 skilled forgeries of signatures. For testing, the remaining genuine signatures (ten signatures) and remaining skilled forgeries (ten signatures) of the signatures considered, were employed. In the present proposed approach, 100 sets of Hindi signature images have been employed for experimentation. The skilled forgeries were not taken into consideration for training purposes in the proposed technique. The experimental settings were also different which is described in Section 10. Another non-English signature verification approach was presented by Pal et al. [16] where a Bangla signature database, which consisted of 5400 (100 × 54) signatures was employed for experimentation. The objective of that paper was to present an empirical contribution towards the understanding of a threshold-based signature verification method. Some techniques such as under-sampled bitmap, intersection/endpoint and directional chain code were employed for feature extraction. The NN method was considered for classification. For experimentation, the system was trained with a set of 12 genuine signatures of IET Biom., 2013, Vol. 2, Iss. 4, pp. 182–190 doi: 10.1049/iet-bmt.2013.0016

each individual (class) and a distance map using the Euclidian distances between all 12 genuine signatures of each class (121 distances) were calculated. The average distance of all 121 distances for each class was computed and this average distance of each class was considered as the acceptance/rejection threshold of that class. An encouraging accuracy was achieved using the threshold-based technique on a Bangla signature database. A signature verification competition for non-English on-line and off-line signatures was proposed by Liwicki et al. [17]. In that competition, the performance of 12 systems from 5 different institutes was evaluated based on Dutch and Chinese signatures. Our present signature verification work also involves Hindi off-line signatures, but there is a major difference with respect to a few aspects. The experimental technique presented in our proposed work for signature verification is totally different to the earlier one. The size of the signature dataset used for experimentation is also vastly different compared to the previous one. The accuracy obtained from the new experiments is encouraging. Apart from non-Western signature verification, it has been observed that a substantial amount work has been done in the area employing off-line verification of Western signatures. Armand et al. [18] presented an effective method to perform off-line signature verification and identification. Unique structural features were extracted from the signature’s contour. Using a publicly available database of 2106 signatures containing 936 genuine and 1170 forgeries, the verification rate of 91.12% was obtained. Ramachandra et al. [19] proposed an off-line signature verification system based on a cross-validation principle and graph matching. Schafer and Viriri [20] presented an off-line signature verification system based on the combination of feature sets. Some extracted features were: aspect ratio, centroid feature, four surface features, six surface features, number of edge points, transition features etc. The verification of signatures was accomplished using the Euclidean distance classifier. Signatures may be written in different languages and there is a need to undertake a systematic study in this area. From the above reviewed work, it may be noted that many published papers are available for Western signatures and only a few studies have been undertaken for signatures of Chinese, Japanese, Persian, Arabic etc. [21]. To the best of the authors’ knowledge, there are only a few published papers dealing with signatures of Indic scripts and languages. The present work of Hindi signature verification would be considered a novel contribution to the field of signature verification. Some signature samples of Hindi script are shown in Fig. 1.

Fig. 1 Examples of some Hindi signature samples 183

& The Institution of Engineering and Technology 2013

www.ietdl.org 3

Signature verification concept

In the last few decades many methods have been developed in the area of pattern recognition, specifically for the off-line signature verification problem. In general, to deal with the problem of off-line signature verification, researchers have investigated a commonly used approach which is based on two different patterns of classes, class 1 and class 2, where class 1 represents the genuine signature set, and class 2 represents the forged signature set. However, different types of forgeries are measured aside from skilled forgeries (random forgeries, simple forgeries, simulated forgeries, unskilled forgeries, casual forgeries etc.), which are discussed below. As different types of forgeries are being used for signature verification experimentation, the scope of signature databases used for approaches to verification, are not limited to genuine and skilled forgeries alone. Random forgeries have been widely used in verification. In this proposed approach, random forgeries are considered instead of skilled forgeries for training purposes during experimentation. Actually, genuine signatures were treated as random forgeries and this is further discussed in the experimental settings section. Usually two types of errors are considered in an off-line/ on-line signature verification system: The false rejection or Type-I error and the false acceptance or Type-II error. These error types are associated with two common types of error rates: the false rejection rate (FRR) which is the percentage of genuine signatures misclassified as forgeries, and the false acceptance rate (FAR) which is the percentage of forged signatures misclassified as genuine. According to Coetzer et al. [22], there are three basic types of forged signatures, which are often taken into account, and these are 1. Random forgery. The forger has no access to the genuine signature (not even the author’s name) and reproduces a random one. In many cases, the forgeries are the forger’s own genuine signature. 2. Simple forgery. The forger knows the author’s name and the script, but has no access to a sample of the signature. 3. Skilled forgery. The forger has access to one or more samples of the genuine signature and is able to reproduce it. According to Weiping et al. [23], forgeries may be classified into two basic classes: casual, skilled or traced, which are often considered in the signature verification field. Casual forgeries (also referred to as rough of random) are produced by only knowing the name of the person, whose signature is being forged, with no previous knowledge of the appearance of the genuine signature. In some cases, the forger may even use their own signature as

a forgery (substitution forgery). Casual forgeries are the most commonly found forgeries. In skilled forgeries, the forger knows the signature very well and has practiced the simulation process. Therefore a skilled forgery is very similar to the genuine signature and some effective features in casual forgery verification become ineffective in skilled signature verification. Detection of the simulated and disguised signatures was introduced by Liwicki et al. [24]. Fig. 2 outlines the three main phases of automatic signature verification: data acquisition and pre-processing, feature extraction and classification.

4

Importance of Hindi signature verification

In a multi-script and multi-lingual country like India, Hindi script is one of the official languages of the country, and is not only used for writing/reading purposes, but is also used for signing purposes. Hindi script is widely used for official reasons throughout India and it is the state local language in some of the states like Bihar, Uttar Pradesh, Madhya Pradesh (a state of India) as well as a national language in India. Experiments with Hindi signatures for verification can be considered as a new contribution to the area of signature verification. Hindi signature verification is quite different to other scripts with respect to the shape and structure of Hindi signature images. Most of the characters of Hindi hand-written words are touching. Hindi is written using Devnagari script. It has been noted that many characters of Devanagari script have a horizontal line at the upper part called a headline, which is called as irorekha in Devanagari. It joins two or more basic or compound characters to form a word. These headlines are present at the top portion of the characters and they are often used as supporting features in identifying Devanagari script. Hindi script is, less blocky and presents a more sinuous shape. Hindi signatures always contain textual components. The gradient feature used in this research is suitable for analysing Hindi script given such characteristics. One of the co-authors of this paper has attained encouraging accuracy analysing Hindi characters as presented in [25] where the gradient feature was considered. For Hindi signatures, there is no concept of an initial signature which is usually seen in English signatures. By initial signature we mean the first character of the first name followed by the last mane (family name).

5

Signature database

Although automatic signature verification has been an active research area for several decades, there has been no publicly available signature database for Hindi, the most popular official Indian script. Therefore a Hindi signature database was created for the purpose of this work. Hence, the research in automatic signature verification has long been constrained by the unavailability of a standard database. 5.1 Hindi signature collection and database preparation

Fig. 2 Process-flow of a general signature verification system 184 & The Institution of Engineering and Technology 2013

The signatures of Hindi script were considered for this signature verification approach. As there has been no public signature corpus available for Hindi script, it was necessary to create a database of Hindi signatures. The signatures were collected from Kolkata, India. The majority of the signatures were contributed by students. The remaining IET Biom., 2013, Vol. 2, Iss. 4, pp. 182–190 doi: 10.1049/iet-bmt.2013.0016

www.ietdl.org experiments for the purpose of performance comparison. Each signature set of this corpus consists of 24 genuine signatures and 30 simple forgeries. The reason 100 sets were used from the GPDS on this occasion, is due to the fact that the Hindi dataset described previously was comprised of 100 sets, and it was considered important to have equivalent signature numbers for experimentation and comparison between the two datasets.

6 Fig. 3 Genuine and forged signatures

signatures were collected from seniors of the locality. This Hindi signature database consists of 100 sets. Each set consists of 24 genuine signatures and 30 skilled forgeries. The signature images employed consisted of 100 sets whereby the writer number ranges from H-S-Set-001 to H-S-Set-100 (Hindi-Signature-Set). In order to collect the genuine signatures corresponding to each individual, a collection form was designed. The form contained 24 boxes where the signatures could be written. From each individual, 24 genuine signatures were collected. A total number of 2400 genuine signatures from 100 individuals were collected. For each contributor, all genuine specimens were collected in a single day’s writing session. In addition, only skilled forged signatures were collected for this proposed work. In order to produce the forgeries, the imitators were allowed to practice their forgeries as long as they wished with static images of genuine specimens. A total number of 3000 forged signatures were collected from the writers. Some genuine signature samples, along with their corresponding forgeries, are displayed in Fig. 3. 5.2

Pre-processing

The signatures to be processed by the system needed to be in a digital image format. Each signature was handwritten on a rectangular space of fixed size on a white sheet of paper. It was necessary to scan all signature document pages. At the very beginning, the images were captured in 256 level grey scale at 300 dpi and stored in TIFF format (tagged image file format) for the purpose of further processing. In the pre-processing step, a histogram-based threshold technique was applied for binarisation. In this step, the digitised grey-level image was converted to a two-tone image. Subsequently, a noise removal technique was applied to remove the salt and pepper noise and the signature images were extracted from the signature-collecting document forms. The signature collecting form containing 24 genuine signatures in binary format is shown in Fig. 4.The extracted binary signature images were stored in TIFF format. A typical scanned signature and its corresponding binary image are shown in Figs. 5 and 6, respectively.

Fig. 5 Scanned signature image

GPDS English database

Another database consisting of 100 sets from GPDS-160 [26] was also utilised for these experiments. As GPDS is a popular and publicly available dataset, it was considered for our

Fig. 6 Binary signature image

Fig. 4 Signature collection form with genuine signature images IET Biom., 2013, Vol. 2, Iss. 4, pp. 182–190 doi: 10.1049/iet-bmt.2013.0016

185

& The Institution of Engineering and Technology 2013

www.ietdl.org where Bn|m|s =

( − 1)s (S − 1)!      S! n + |m|/2 − S ! n − |m|/2 − S ! 

The complex Zernike moments of order n and repetition m are given by

Fig. 7 Basic characters of Devnagari script

Anm =

Properties of Devnagari scripts

Devnagari is an oriental script descended from Brahmi script [27]. It is the most popular official script and national language of India. In Hindi script, the writing direction is from left to right and there is no concept of upper/lower case. Hindi script has about 50 basic characters. These characters are presented in Fig. 7. Vowels in this script generally take a modified shape in most words and are called modifiers or allographs. Modifiers generally do not disturb the shape of basic characters in the middle zone of a line. If the shape is disturbed in the middle zone, we call the resultant shape a compound character. Vowel modifiers of Devnagari scripts are shown in Fig. 8. A text line in such scripts can be partitioned into three zones (upper, middle and lower). The upper zone denotes the portion above the headline, the middle zone denotes the portion between the headline and baseline and the lower zone is the portion below the baseline. The imaginary line separating the middle and lower zones is called the baseline.

8

Feature extraction

Feature extraction is a crucial step in any pattern recognition system. The Zernike feature and the gradient feature extraction technique are described as follows. 8.1

Zernike moments feature

Zernike polynomials are an orthogonal set of complex-valued polynomials  y Vnm (x, y) = Rnm (x, y) exp jm tan−1 x

(1)

where x2 + y2 ≤ 1,

j=

√ −1,

n + 1 ∗ f (x, y)Vnm (x, y) p x y

(5)

where x 2 + y 2 ≤ 1 and the symbol * denotes the complex conjugate operator [28]. The Zernike moments can be computed by the scale invariant central moments as follows

Fig. 8 Vowel modifiers of Devnagari

7

(4)

n ≥ 0,

|m| ≤ n

(2)

and n − |m| is even and radial polynomials {Rnm} are defined as

Anm

 

|m| n b   m n+1  d = ( − j) ·   d p k−|m| a=0 d=0 n−k=even



b B × G a n|m|s k−2a−d,2a+d

(6)

where b=

√ n − |m| − S and j = −1 2

(7)

8.2 Computation of 576-dimensional gradient features The grey-scale local-orientation histogram of the signature component is used for 576-dimensional feature extraction [29]. To obtain a 576-dimensional gradient-based feature vector, the following steps were executed. Step 1: 2 × 2 mean filtering is applied 5 times on the input image. Step 2: The grey scale image obtained in Step 1 is normalised so that the mean grey scale becomes zero with a maximum value of 1. Step 3: The normalised image is then segmented into 17 × 7 blocks. Compromising trade-off between accuracy and complexity, this block size is decided experimentally. To obtain the bounding box of the grey-scale image, the image is converted into a two-tone format using Otsu’s thresholding algorithm [30]. This will exclude unnecessary background information from the image. Step 4: A Roberts filter is then applied to the image to obtain the gradient image. The arc tangent of the gradient (direction of gradient) is quantised into 32 directions and the strength of the gradient is accumulated with each of the quantised directions. The strength of the gradient f(X, Y) is defined as follows f (X , Y ) =

 (Du)2 + (Dv)2

(8)

and the direction of gradient (θ(x, y)) is Rnm (x, y) =

(n−|m|/2  )

Bn|m|s (x2 + y2 )(n/2)−s

s=0

186 & The Institution of Engineering and Technology 2013

(3)

u(x, y) = tan−1

Dv Du

(9)

IET Biom., 2013, Vol. 2, Iss. 4, pp. 182–190 doi: 10.1049/iet-bmt.2013.0016

www.ietdl.org where Δu = g(x + 1, y + 1) − g(x, y) and Δv = g(x + 1, y) − g(x, y + 1) and g(x, y) is the grey level of point (x, y). Step 5: Histograms of the values of 32 quantised directions are computed for each of the 17 × 7 blocks. Step 6: The directional histogram of the 17 × 7 blocks is down-sampled into 9 × 4 blocks and 16 directions using Gaussian filters. Finally, a 9 × 4 × 16 = 576-dimensional feature vector is obtained.

there were 99 × 2 = 198 random forgeries employed for training. For testing, the remaining 12 genuine signatures and 30 skilled forgeries of the signature set being considered were employed. The number of samples for training and testing for experimentation with each signature set are shown in Table 1.

11 9

Classifier details

In our experiments, we have used SVMs as classifiers. SVMs have been originally defined for two-class problems and they look for the optimal hyper plane, which maximises the distance and the margin between the nearest examples of both classes, namely support vectors. Given a training database of M data: {xm|m = 1, …, M}, the linear SVM classifier is then defined as f (x) =



aj xj · x + b

(10)

j

where {xj} are the set of support vectors and the parameters αj and b have been determined by solving a quadratic problem [31]. The linear SVM can be extended to various non-linear variants; details can be found in [31, 32]. In our experiments, the RBF kernel SVM outperformed other non-linear SVM kernels, hence, we are reporting our verification results based on the RBF kernel only. Different gamma values for SVM training were established experimentally. No rule/formula was used for the selection of gamma values. The values of gamma were determined empirically until the desired accuracy was obtained. In this experiment, two different gamma values were employed for the SVM classifier for two different feature sets to obtain the maximum accuracy. If those two gamma values were interchanged and applied for experimentation, the system would not necessarily produce the desired outcome.

10

Experimental settings

In the field of signature verification, there is a sparseness of publicly available signature databases. The quality of the available databases also varies, as there has been no standard collection protocol. Besides, it is very costly to create a large corpus with different types of forgeries, especially skilled forgeries. In this experimental method of verification, genuine signatures were considered for training and skilled forgeries were considered for testing purposes only. For the experiments in the proposed research, our own database described in Section 5 was used. For each signature set, an SVM was trained with 12 randomly chosen genuine signatures. The negative samples for training (random forgeries) were the genuine signatures of 99 other signature sets. Two signatures were taken from each set. In total,

Results and discussion

An investigation of the performance of a signature verification system involving Hindi signatures is presented here. For training and testing of the system, 5400 Hindi signatures were employed. Among these signatures, 2400 (24 × 100) samples were genuine and 3000 (30 × 100) samples were skilled forgeries. The experimental results ofany signature verification system is associated with an FRR, FAR and AER (average error rate). Using the gradient feature and Zernike moment feature, an FRR, FAR and AER was calculated. At this operational point, SVMs produced an FRR of 11.50% and an FAR of 7.12% using the gradient feature whereas an FRR of 18.74% and an FAR of 14.60% were obtained using the Zernike moment feature. We obtained a better accuracy compared with our previous Bangla signature verification work [13] as well as our Hindi signature verification work [15]. A 5-fold crossed validation technique has been considered and the accuracy of 90.90% was obtained using gradient feature. From the experimental results obtained, it is also observed that the performance of the gradient feature is very encouraging, compared to the results obtained using the Zernike moment feature. The results obtained from these experiments using our Hindi dataset are comparatively better than the state-of-the-art methodology [11] and the accuracies are shown in Table 2. Three different accuracies from the experiments were calculated based on three different values of gamma of the SVM classifier using the gradient feature and the Zernike moment feature. The graphical representations of the three different outcomes achieved following experimentation of Table 2 Verification accuracies for different feature sets Features used gradient feature Zernike moment feature

Highest accuracy, % 90.69 83.33

Table 1 Number of signature samples used per set for experimentation

training testing

Genuine signature

Random forgeries

Skilled forgeries

12 12

198 n/a

n/a 30

IET Biom., 2013, Vol. 2, Iss. 4, pp. 182–190 doi: 10.1049/iet-bmt.2013.0016

Fig. 9 Comparison of results using different values of gamma for the SVM classifier using the gradient feature 187

& The Institution of Engineering and Technology 2013

www.ietdl.org

Fig. 10 Comparison of results using different values of gamma for the SVM classifier using the Zernike feature

Fig. 12 Comparison of different values of FRR, FAR and AER for different settings of gamma for the SVM classifier using the Zernike moment feature

Table 3 Value of FRR, FAR and AER using gradient feature Gamma 19 000 18 000 17 000

FRR, %, FAR, % and AER, % 11.50, 7.12 and 9.31 11.64, 7.22 and 9.43 11.90, 7.54 and 9.72

Table 4 Values of FRR, FAR and AER using the Zernike moment feature Gamma

FRR, %, FAR, % and AER, %

0.000001 0.000002 0.000003

18.74, 14.60 and 16.67 19.66, 15.20 and 17.43 20.10, 17.96 and 19.03

different values of gamma for SVMs are shown in Figs. 9 and 10, respectively. The FRR and the FAR rates changed rapidly for the Zernike moment feature with the value of gamma used in the SVM classifiers. Different values of gamma are chosen experimentally for obtaining lower error rates. The error rates obtained from the experiments using gradient and Zernike moments are shown in Tables 3 and 4, respectively. These error rates demonstrate the robustness of the gradient feature and Zernike moment feature extraction techniques for signature verification. The comparison of error rates

Fig. 13 FAR and FRR are represented by the ROC curve

using different values of gamma as part of SVM classifier training using the gradient feature and Zernike moment feature are shown graphically in Figs. 11 and 12. In the signature verification field, a receiver operating characteristic (ROC) is a graphical plot which illustrates the performance of the verification system. It is created by plotting the FAR against the FRR obtained in the experiments. In recent years, the use of the ROC curve has been widely adopted for the valuation of signature verification performance. A ROC space is defined by the FAR and FRR as x- and y-axes, respectively. The area under the curve (AUC) of the ROC estimates system performance by using a single value, since the AUC provides the probability that the classifier will rank an arbitrarily-selected positive sample higher than an arbitrarily chosen negative sample [33]. The performance of the system using the gradient feature is shown by the ROC curve in Fig. 13.

12

Fig. 11 Comparison of different values for FRR, FAR and AER for different values of gamma of the SVM classifier using the gradient feature 188 & The Institution of Engineering and Technology 2013

Comparison of performance

A comparison of different approaches with associated results have been considered, although the size of the data set, type of feature extraction technique and the methodology used for experimentation are not similar. An error rate of 15.57% AER was obtained as the verification result in a research work [16] where a Bangla (Bengali) signature database and the directional chain code features were employed. Through experimentation [13], an accuracy of 90.40% was obtained where Gaussian grid features were used for the IET Biom., 2013, Vol. 2, Iss. 4, pp. 182–190 doi: 10.1049/iet-bmt.2013.0016

www.ietdl.org Table 5 Performance comparisons with other published works Feature set

modified directional chain code [16] Gaussian grid features [13] gradient feature (proposed method)

Database Signature script

Data size

Bangla

5400 samples 5400 samples 5400 samples

Bangla Hindi

Accuracy, %

Fig. 16 Forged signature treated as a genuine signature using gradient feature 84.43 90.40 90.69

Fig. 17 Forged signature treated as a genuine signature using the Zernike moment feature

investigation. A 100 sets of Bangla signatures were employed in those experiments, consisting of 3000 forgeries and 2400 genuine signatures. Error rates of 10.77% FRR and 8.45% FAR were obtained as a result of the experiments. For further details pertaining to experimental settings and the number of training and testing samples used for these two approaches (employed for comparison) please see the papers [13, 16]. An encouraging accuracy of 90.69% was obtained from the proposed approach when the Hindi signature database employed for experimentation consisted of 2400 (100 × 24) genuine signatures and 3000 (100 × 30) forgeries. Error rates of 11.50% FRR and 7.12% FAR were found as a result of experimentation when the gradient features were used. Performance comparisons with other published works are shown in Table 5. For a comparative study about the performance between Hindi and Western signatures, the GPDS dataset has also been used for experimentation. An accuracy of 84.85% was obtained on 100 sets from the GPDS-160 [26] database using gradient features.The values of FRR, FAR and AER obtained in the experiments were 18.12, 12.18 and 15.15%, respectively, when similar experimental settings used for Hindi were also applied to the GPDS dataset.

13

Error analysis

In our approach, two categories of confusing samples were obtained using the classifier. The first category illustrates the genuine signature samples treated as forged signature samples. The second one illustrates the forged signature samples treated as genuine signature samples. The gradient and the Zernike moment feature based results obtained from these two categories are shown in Figs. 14–17. From the above results it is noted that the error rate obtained for the FRR is more than the error rate attained for FAR, that is, FRR is higher than FAR. The difference in number between the genuine and random forgery samples used for

Fig. 14 Genuine signature treated as forged signature using the gradient feature

Fig. 15 Genuine signature treated as forged signature using the Zernike moment feature IET Biom., 2013, Vol. 2, Iss. 4, pp. 182–190 doi: 10.1049/iet-bmt.2013.0016

training the classifier may account for the difference in the error rate between FRR and FAR.

14

Conclusions and future work

This paper demonstrates an investigation of the excellent performance of a signature verification system involving Hindi off-line signatures. Actually, the novel approach used for the experimental process with the combination of a large Hindi off-line signature dataset proffers an effective contribution to the field of non-Western signature verification. The substantially higher accuracy obtained following experimentation is also a great achievement. The gradient feature, Zernike moment feature and SVM classifiers were employed, and encouraging results were obtained. For a comparative study of performance, the GPDS dataset has also been used for experimentation. To the best of our knowledge, a large Hindi signature database has not previously been used for the task of signature verification and this is only the second important report using Hindi signatures in this area. The proposed off-line verification scheme is a new investigation for Hindi signatures in the field of off-line signature verification. In the near future, we plan to extend our work considering more samples of Hindi signatures with the exploration of different feature extraction techniques and classifiers. We also have a plan to prepare another larger Hindi off-line signature database in order to achieve a higher accuracy using different methods for experimentation in the signature verification field.

15

References

1 Chen, S., Srihari, S.: ‘Use of exterior contour and shape features in off-line signature verification’. Int. Conf. on Document Analysis and Recognition, 2005, pp. 1280–1284 2 Ferrer, M.A., Alonso, J.B., Travieso, C.M.: ‘Off-line geometric parameters for automatic signature verification using fixed-point arithmetic’, IEEE Trans. Pattern Anal. Mach. Intell., 2005, 27, (6), pp. 993–997 3 Madabusi, S., Srinivas, V., Bhaskaran, S., Balasubramanian, M.: ‘On-line and off-line signature verification using relative slope algorithm’. Int. Workshop on Measurement Systems for Homeland Security, 2005, pp. 11–15 4 Impedovo, D., Pirlo, G.: ‘Automatic signature verification: the state of the art’, IEEE Trans. Syst. Man Cybern. C, 2008, 38, (5), pp. 609–635 5 Plamondon, R., Lorette, G.: ‘Automatic signature verification and writer identification – the state of the art’, Pattern Recogn., 1989, 22, (2), pp. 107–131 6 Kholmatov, A., Yanikoglu, B.: ‘Identity authentication using improved online signature verification method’, Pattern Recognit. Lett., 2005, 26, (15), pp. 2400–2408 7 Kalera, M., Srihari, S., Xu, A.: ‘Offline signature verification and identification using distance statistics’, Int. J. Pattern Recognit. Artif. Intell., 2004, 18, (3), pp. 1339–1360 189

& The Institution of Engineering and Technology 2013

www.ietdl.org 8 Pottier, I., Burel, G.: ‘Identification and authentication of handwritten signatures with a connectionist approach’. Conf. on Neural Networks, 1994, pp. 2948–2951 9 Plamondon, R., Srihari, S.N.: ‘On-line and off-line handwriting recognition: a comprehensive survey’, IEEE Trans. Pattern Anal. Mach. Intell., 2000, 22, (1), pp. 63–84 10 Fang, B., Leung, C.H., Tang, Y.Y., Tse, K.W., Kwok, P.C.K., Wong, Y. K.: ‘Off-line signature verification by the tracking of feature and stroke positions’, Pattern Recognit., 2003, 36, (1), pp. 91–101 11 Franke, K.: ‘Analysis of authentic signatures and forgeries’. Int. Workshop on Computational Forensics, 2009, pp. 150–164 12 Pal, S., Alaei, A., Pal, U., Blumenstein, M.: ‘Off-line signature identification using background and foreground information’. Int. Conf. on Digital Image Computing, 2011, pp. 672–677 13 Pal, S., Nguyen, V., Blumenstein, M., Pal, U.: ‘Off-line Bangla signature verification’. Document Analysis Systems, 27–29 March 2012, pp. 282–286 14 Pal, S., Pal, U., Blumenstein, M.: ‘Hindi and English off-line signature identification and verification’. Int. Conf. on Advances in Computing, 2012, pp. 905–910 15 Pal, S., Blumenstein, M., Pal, U.: ‘Hindi off-line signature verification’. Int. Conf. on Frontiers in Handwriting Recognition, 2012, pp. 371–375 16 Pal, S., Alireza, A., Pal, U., Blumenstein, M.: ‘Off-line Bangla signature verification: an empirical study’. Int. Joint Conf. on Neural Networks (IJCNN 2013), Dallas, Texas, USA (Accepted) 17 Liwicki, M., Malik, M.I., van den Heuvel, C.E., et al.: ‘Signature verification competition for online and off-line skilled forgeries’. Proc. 11th ICDAR, 2011, pp. 1480–1484 18 Armand, S., Blumenstein, M., Muthukkumarasamy, V.: ‘Off-line signature verification based on the modified direction feature’. Int. Conf. on Pattern Recognition, 2006, pp. 509–512 19 Ramachandra, A.C., Pavithra, K., Yashasvini, K., Raja, K.B., Venugopal, K.R., Patnaik, L.M.: ‘Off-line signature verification based on cross-validation for graph matching’. Int. Conf. on Electrical and Electronics, 2008, pp. 17–22 20 Schafer, B., Viriri, S.: ‘An off-line signature verification system’. Int. Conf. on Signal and Image Processing Applications, 2009, pp. 95–100

190 & The Institution of Engineering and Technology 2013

21 Lv, H., Wang, W., Wang, C., Zhuo, Q.: ‘Off-line Chinese signature verification based on support vector machines’, Pattern Recognit. Lett., 2005, 26, (15), pp. 2390–2399 22 Coetzer, J., Herbst, B., Preez, J.D.: ‘Off-line signature verification using the discrete radon transform and a hidden markov model’, EURASIP J. Appl. Signal Process., 2004, 4, pp. 559–571 23 Weiping, H.O.U., Ye, X., Wang, K.: ‘A survey of off-line signature verification’. Proc. Int. Conf. on Intelligent Mechatronics and Automation, 2004, pp. 536–541 24 Liwicki, M., van den Heuvel, C.E., Found, B., Malik, M.I.: ‘Forensic signature verification competition 4nsigcomp2010 – detection of simulated and disguised signatures’. Proc. 12th ICFHR, Kolkata, India, 16–18 November 2010, pp. 715–720 25 Pal, U., Wakabayashi, T., Sharma, N., Kimura, F.: ‘Handwritten numeral recognition of six popular Indian scripts’. Proc. Ninth Int. Conf. on Document Analysis and Recognition, 2007, pp. 749–753 26 Ferrer, M.A., Alonso, J.B., Travieso, C.M.: ‘Offline geometric parameters for automatic signature verification using fixed-point arithmetic’, Trans. IEEE PAMI, 2005, 27, pp. 993–997 27 Chaudhuri, B.B., Pal, U.: ‘An OCR system to read two Indian language scripts: Bangla and Devnagari (Hindi)’. Int. Conf. on Document Analysis and Recognition, 1997, pp. 1011–1015 28 Teh, C.H., Chin, R.T.: ‘On image analysis by the methods of moments’, Pattern Anal. Mach. Intell., 1988, 10, (4), pp. 496–513 29 Ito, T., Ohyama, W., Wakabayashi, T., Kimura, F.: ‘Combination of signature verification techniques by SVM’. Int. Conf. on Frontiers in Handwriting Recognition, 2012, pp. 428–431 30 Otsu, N.: ‘A threshold selection method from gray-level histogram’, Syst. Man Cybern., 1979, 9, pp. 62–66 31 Vapnik, V.: ‘The nature of statistical learning theory’ (Springer Verlang, 1995) 32 Burges, C.: ‘A tutorial on support vector machines for pattern recognition’, Data Min. Knowl. Discov., 1998, 2, (2), pp. 1–43 33 Fawcett, T.: ‘An introduction to ROC analysis’, Pattern Recognit. Lett., 2006, 27, (8), pp. 861–874

IET Biom., 2013, Vol. 2, Iss. 4, pp. 182–190 doi: 10.1049/iet-bmt.2013.0016