Using Neural Networks for Marketing Research ... - Semantic Scholar

4 downloads 0 Views 522KB Size Report
Abstract: - This paper deals with problems of marketing research data classification by means of artificial. Neural Networks algorithms. Two basic methods are ...
Mathematical Methods and Techniques in Engineering and Environmental Science

Using Neural Networks for Marketing Research Data Classification JIRI STASTNY, PAVEL TURCINEK, ARNOST MOTYCKA Department of Informatics Mendel University in Brno Zemedelská 1, 61300 Brno CZECH REPUBLIC [email protected], [email protected], [email protected] Abstract: - This paper deals with problems of marketing research data classification by means of artificial Neural Networks algorithms. Two basic methods are described, classification with the aid of Multi-layer Perceptron neural network with Back-propagation algorithm and classification with the aid of Self-organizing (Kohonen's) maps. Finally, applicability of these algorithms is compared. These algorithms are applied over the data from a survey about consumer behavior in the food market in the Czech Republic. The limits of this approach are considered and possibilities of more complex structural recognition methods and semi-supervised learning utilization are suggested.

Key-Words: - Neural Networks, Multi-Layer Perceptron, Back Propagation, Self-Organizing Maps, Consumer Behavior, Classification. surveys in which consumers respond to specific questions.

1 Introduction The issue of consumer behavior falls into the field of marketing. Into issue of consumer behavior fall categories of recognition and understanding of how consumers think, feel, evaluate, choose among different alternatives, how consumers are influenced by their surroundings, how they act during the decision-making and purchasing, how is their behavior limited by their knowledge or ability to process information, what motivates them and how they differ in their decision-making in different ways depending on the importance or product interest [1]. Perception of information and communication technology is gradually transformed from something rather unique, bringing a competitive advantage in the market, to the necessity of conditioning the existence or not existence of business between the competitive business organizations [2]. Currently you cannot imagine marketing without the involvement of information technology. The development of the Internet greatly simplified data collection, and thus the volume of usable data to search for information has increased greatly. To research on consumer behavior data can be obtained from multiple sources. In secondary research are typically used in national and international sources, such as the Czech Statistical Office or Eurostat, that provide data in electronic form, and are easily accessible via the Web. In the primary research are most often used data from

ISBN: 978-1-61804-046-6

2 Problem Formulation This article will address the possibilities of using artificial intelligence tools [3, 4, 5, 6] over data on consumer behavior in the food market and assess their applicability to this type of data. Article will be based on primary data that was taken out of the research Department of Marketing and Sales at the Faculty of Business and Economics of Mendel University in Brno. These data were processed only by means of statistical methods. More details about this research are given in [7].

2.1 The use of layered Multi-Layer Perceptron neural network Multi-Layer Perceptron neural network (MLP) is an acyclic forward network. Neurons can be divided into disjunctive layers so that the output of each neuron of one layer are connected to the inputs of each neuron layer following. There are no links between non-neighboring layers of neurons or between neurons in the same layer. Each neuron has as many inputs as there are neurons in the lower layer. The input layer serves only to distribute input values to the first hidden layer. A network with one hidden layer and one output layer is known as a two-layer network, a network with two hidden layers of a three-layer, etc. [3, 8]. As a learning algorithm for MLP neural network is the most widely used back-propagation algorithm.

252

Mathematical Methods and Techniques in Engineering and Environmental Science

Back propagation algorithm is an iterative method where the network gets from an initial nonlearned state to the full learned one [3, 8, 9]. It is possible to describe the algorithm in the following way: random initialization of weights; repeat repeat choose_pattern _from_training_set; put_chosen_pattern_in_input_of_network; compute_outputs_of_network; compare_outputs_with_required_values; modify_weights; until all_patterns_from_traning_set_are_chosen; until total_error < criterion; Back-propagation algorithm is based on minimization of neural network energy given with the formula (SSE) [3, 8]:

1 n E = ∑ ( yi − d i ) 2 2 i =1

3 Problem Solution We decided to apply over the data two different approaches which artificial intelligence offers. First, we tried to classify the data using the MLP. Subsequently, we then focused on the use of Kohonen maps. We got the data file, which is described in detail in [7]. It is a set of answers to approximately thirty questions on consumer behavior in the food market, where respondents scored on a scale of one to ten, how the factor affects their consumer behavior (quality, price, origin of the product, etc.). These responses are then connected to the corresponding characteristics of respondents, such as age, gender, education, household type, etc.

3.1 Application of MLP Our goal was to test whether it is possible to classify the characteristics of respondents based on their answers to questions about their consumer behavior in the food market. For the actual calculation we used a software tool Weka 3.7 [12], which among other allows the classification by MLP. From the above characteristics, we tried to classify respondents' age and education. As is evident from the following tables, the results are very unsatisfactory. Table 1: Inclusion in the classification of age classification 18- 35- 25- 12- 65+ 5524 54 34 17 64 18-24 428 110 97 38 18 26 35-54 116 334 88 6 36 28 25-34 114 132 39 15 12 5 12-17 52 13 8 30 2 4 65 + 30 49 14 3 90 17 55-64 29 55 18 2 22 10

(1)

where n means the number of network outputs, yi means the i-th output and di means the i-th output required.

2.2 The use of Self-organizing (Kohonen's) maps

reality

Kohonen network is one of the networks that do not need to be trained by teacher. The basic idea of its function is cluster analysis that means the ability of network to find certain characteristics and dependencies in the presented training data with the absence of any outside information [5, 10, 11]. In the Kohonen network is besides the input layer output layer only. Number of inputs that come into the neuron is equal to the number of inputs to the Kohonen network. The weights of these inputs are used to encode the patterns which represent presented patterns. These neurons do not have own transfer function. The only operation performed by the neuron is the calculation of distances (deviations) from the model presented pattern encoded in the weights of the neuron according to the relation [5, 10]:

When we calculate the success of the correct classification, we find that only 44.5%. It is also similar for characteristic of education, as demonstrated in Table 2. Here is a 43.6% success rate. Table 2: Inclusion in the classification of education classification A B C D E F level A 1 7 2 14 0 6 level B 3 13 17 78 3 13 level C 4 12 54 166 8 46 level D 8 40 106 743 18 201 level E 0 3 15 43 3 28 level F 0 14 38 276 8 96

N −1

d = ∑ [ xi (t ) − wi (t )]2

(2)

i =0

reality

where xi(t) are the individual elements of the input pattern and wi(t) corresponding neuron weights, which are encoded patterns. Closer information can be found in [5, 10].

ISBN: 978-1-61804-046-6

253

Mathematical Methods and Techniques in Engineering and Environmental Science

Individual groups correspond to achievable levels of education in the Czech Republic, where F is the highest level.

Figure 1 is then displayed, a print screen from the Weka environment during data analysis.

Fig. 1: Print screen of Weka 3.7 environment Neural Network toolbox [10, 11]. Here we tried to train the network so that coalesced according to the characteristics of individual respondents. Results can be seen in Figures 2 and 3.

3.2 use of Kohonen's maps The second approach to data mentioned above was the use of self-organizing (Kohonen's) maps. For the calculation was used MATLAB with the use of

Fig. 2: Result by age

ISBN: 978-1-61804-046-6

254

Mathematical Methods and Techniques in Engineering and Environmental Science

way to group individual respondents' answers to the corresponding age categories were logically grouped. Figure 3 showing the results according to education is less satisfactory (small color difference shows smaller significant impact of education on customer behavior).

Coloring principle of individual components lies in the fact that the richer the color represents more similarity between the elements contained in this section. As is clear from Figure 2 application of Kohonen maps have already managed to find a satisfactory

Figure 3: Result by education

4 Conclusion The performed simulation experiments over the mentioned data it is clear that for a given application is crucial to use a suitable type of artificial neural networks. Classification results obtained by means of Multi-Layer Perceptron neural network are unsuitable for this type of data well as using other learning algorithms [9, 4]. In contrast, using self-organizing (Kohonen's) maps brought relatively satisfactory results, especially in the age characteristics. The characteristics according to education, where the results are less satisfactory, heterogeneity may be due to such independence of educational attainment on shopping behavior.

[2]

[3]

[4]

[5] Acknowledgments: This work has been supported by the research design of Mendel University in Brno MSM 6215648904/03.

[6]

References: [1] Solomon, M. R. Consumer Behavior. Buying, Having, and Being, Pearson Prenctice Hall.

ISBN: 978-1-61804-046-6

255

Saddle River 2004, 621 p., ISBN: 0-13123011-5. Chalupová, N., Motyčka, A., Situation and trends in trade-supporting information technologies. Acta Universitatis agriculture et silviculture Mendelianae Brunensis, 2008. Vol. LVI, no. 6, pp. 25-36. ISSN 1211-8516. Miehie, D., Spiegelhalter, D. J., Taylor, C. C., Machine Learning, Neural and Statistical Classification. Ellis Horwood, NY, 1994. Štencl, M., Šťastný, J., Neural network learning algorithms comparison on numerical prediction of real data. In MENDEL 2010, 16th International Conference on Soft Computing. Brno University of Technology, 2010, s. 280285. ISSN 1803-3814. Kohonen, T. Schroeder, M. R., Huang, T. S., Self-Organizing Maps. Secaucus, NJ, USA, Springer-Verlag New York, Inc., 2001. ISBN 3540679219. Balogh, Z., Klimeš, C., Turčáni, M., Instruction Adaptation Modelling Using Fuzzy Petri NETs, 2011. Distance Learning, Simulation and Communication 2011: CATE 2011 (Community – Army – Technology – Environment) IDET 2011, Brno: University of Defence, 2011. pp 22-29, ISBN 978-80-7231-695-3.

Mathematical Methods and Techniques in Engineering and Environmental Science

[7] Turčínková, J., Kalábová, J., Preferences of Moravian consumers when buying food. Acta Universitatis agriculture et silviculture Mendelianae Brunensis. 2011. Vol. LIX, No. 2, pp.371-376. [8] Sarle, W. S., Neural Networks and Statistical Models. Proceedings of the Nineteenth Annual SAS Users Group International Conference, Cary, NC: SAS Institute,1994, pp 1538-1550. [9] Škorpil, V., Šťastný, J., Comparison of Learning Algorithms. In 24 th Biennial Symposium on Communications. Kingston, CANADA, 2008. pp. 231-234. ISBN 978-14244-1945- 6. [10] Kohonen, T. et al. SOM PAK: The SelfOrganizing Map Program Package. Technical Report A31, Helsinki University of Technology, Laboratory of Computer and Information Science, 1996. on-line: http://www.cis.hut.fi/research/papers/som_tr96. ps.Z [31-08-11]. [11] FEJFAR, J., ŠŤASTNÝ, J., Time series clustering in large data sets. Acta Universitatis agriculturae et silviculturae Mendelianae Brunensis: Acta of Mendel University of agriculture and forestry Brno 2011. sv. 59, č. 2, ISSN 12118516.

[12] Weka. http://www.cs.waikato.ac.nz/ml/weka/ on-line [31-08-11].

ISBN: 978-1-61804-046-6

256