Free Essay

Genetic Algorithm

In:

Submitted By prachi12
Words 3528
Pages 15
IJCSI International Journal of Computer Science Issues, Vol. 7, Issue 4, No 2, July 2010 ISSN (Online): 1694‐0784 ISSN (Print): 1694‐0814

18

An Improved k-Nearest Neighbor Classification Using Genetic Algorithm
N. Suguna1, and Dr. K. Thanushkodi2
1

Professor in Computer Science and Engg, Akshaya College of Engineering and Technology, Coimbatore, Tamil Nadu, India.

2

Director, Akshaya College of Engineering and Technology, Coimbatore, Tamil Nadu, India.

Abstract k-Nearest Neighbor (KNN) is one of the most popular algorithms for pattern recognition. Many researchers have found that the KNN algorithm accomplishes very good performance in their experiments on different data sets. The traditional KNN text classification algorithm has three limitations: (i) calculation complexity due to the usage of all the training samples for classification, (ii) the performance is solely dependent on the training set, and (iii) there is no weight difference between samples. To overcome these limitations, an improved version of KNN is proposed in this paper. Genetic Algorithm (GA) is combined with KNN to improve its classification performance. Instead of considering all the training samples and taking k-neighbors, the GA is employed to take k-neighbors straightaway and then calculate the distance to classify the test samples. Before classification, initially the reduced feature set is received from a novel method based on Rough set theory hybrid with Bee Colony Optimization (BCO) as we have discussed in our earlier work. The performance is compared with the traditional KNN, CART and SVM classifiers.
Keywords: k-Nearest Neighbor, Genetic Algorithm, Support Vector Machine, Rough Set.

training samples must be calculated. When the number of training samples is less, the KNN classifier is no longer optimal, but if the training set contains a huge number of samples, the KNN classifier needs more time to calculate the similarities. This problem can be solved in 3 ways: reducing the dimensions of the feature space; using smaller data sets; using improved algorithm which can accelerate to [5]; 2. Dependency on the training set: The classifier is generated only with the training samples and it does not use any additional data. This makes the algorithm to depend on the training set excessively; it needs recalculation even if there is a small change on training set; 3. No weight difference between samples: All the training samples are treated equally; there is no difference between the samples with small number of data and huge number of data. So it doesn’t match the actual phenomenon where the samples have uneven distribution commonly. A wide variety of methods have been proposed to deal with these problems [6-9]. Another problem is that the classification algorithms will be confused with more number of features. Therefore, feature subset selection is implicitly or explicitly conducted for learning systems [10], [11]. There are two steps in neighborhood classifiers. First an optimal feature subspace is selected, which has a similar discriminating power as the original data, but the number of features is greatly reduced. Then the neighborhood classifier is applied. In this paper, we have used a novel method based on Rough set theory hybrid with Bee Colony Optimization (BCO) to select the optimal feature set as discussed in our previous work [12]. Then the proposed GKNN classifier is analyzed with this reduced feature set. In this paper, Genetic Algorithm (GA) is combined with kNearest Neighbor (KNN) algorithm called as Genetic KNN (GKNN), to overcome the limitations of traditional KNN. In traditional KNN algorithm, initially the distance between all the test and training samples are calculated and the k-

1. Introduction
Nearest neighbor search is one of the most popular learning and classification techniques introduced by Fix and Hodges [1], which has been proved to be a simple and powerful recognition algorithm. Cover and Hart [2] showed that the decision rule performs well considering that no explicit knowledge of the data is available. A simple generalization of this method is called K-NN rule, in which a new pattern is classified into the class with the most members present among the K nearest neighbors, can be used to obtain good estimates of the Bayes error and its probability of error asymptotically approaches the Bayes error [3]. The traditional KNN text classification has three limitations [4]: 1. High calculation complexity: To find out the k nearest neighbor samples, all the similarities between the

IJCSI International Journal of Computer Science Issues, Vol. 7, Issue 4, No 2, July 2010 www.IJCSI.org neighbors with greater distances are taken for classification. In our proposed method, by using GA, k-number of samples are chosen for each iteration and the classification accuracy is calculated as fitness. The highest accuracy is recorded each time. Thus, it is not required to calculate the similarities between all samples, and there is no need to consider the weight of the category. This paper is structured as follows: the following section presents the traditional KNN algorithm. Section 3 explains the proposed GKNN classifier. The comparative experiments and results are discussed in Section 4 and the work is concluded in Section 5.

19

Genetic algorithm (GA) [15], [16] is a randomized search and optimization technique guided by the principles of evolution and natural genetics, having a large amount of implicit parallelism. GAs perform search in complex, large and multimodal landscapes, and provide near-optimal solutions for objective or fitness function of an optimization problem. In GA, the parameters of the search space are encoded in the form of strings (called chromosomes). A collection of such strings is called a population. Initially, a random population is created, which represents different points in the search space. An objective and fitness function is associated with each string that represents the degree of goodness of the string. Based on the principle of survival of the fittest, a few of the strings are selected and each is assigned a number of copies that go into the mating pool. Biologically inspired operators like cross-over and mutation are applied on these strings to yield a new generation of strings. The process of selection, crossover and mutation continues for a fixed number of generations or till a termination condition is satisfied. An excellent survey of GA along with the programming structure used can be found in [16]. GA have applications in fields as diverse as VLSI design, image processing, neural networks, machine learning, job shop scheduling, etc. String Representation - Here the chromosomes are encoded with real numbers; the number of genes in each chromosome represents the samples in the training set. Each gene will have 5 digits for vector index and k number of genes. For example, if k=5, a sample chromosome may look as follows: 00100 10010 00256 01875 00098 Here, the 00098 represents, the 98th instance and the second gene say that the 1875 instance in the training sample. Once the initial population is generated now we are ready to apply genetic operators. With these k neighbors, the distance between each sample in the testing set is calculated and the accuracy is stored as the fitness values of this chromosome. Reproduction (selection) - The selection process selects chromosomes from the mating pool directed by the survival of the fittest concept of natural genetic systems. In the proportional selection strategy adopted in this article, a chromosome is assigned a number of copies, which is proportional to its fitness in the population, that go into the mating pool for further genetic operations. Roulette wheel selection is one common technique that implements the proportional selection strategy. Crossover - Crossover is a probabilistic process that exchanges information between two parent chromosomes for generating two child chromosomes. In this paper, single point crossover with a fixed crossover probability of pc is used. For chromosomes of length l, a random integer, called the crossover point, is generated in the range [1, l-1]. The portions of the chromosomes lying to the right of the crossover point are exchanged to produce two offspring. Mutation - Each chromosome undergoes mutation with a fixed probability pm. For binary representation of chromosomes, a bit position (or gene) is mutated by simply

2. KNN Classification Algorithm
In pattern recognition field, KNN is one of the most important non-parameter algorithms [13] and it is a supervised learning algorithm. The classification rules are generated by the training samples themselves without any additional data. The KNN classification algorithm predicts the test sample’s category according to the K training samples which are the nearest neighbors to the test sample, and judge it to that category which has the largest category probability. The process of KNN algorithm to classify sample X is [14]:  Suppose there are j training categories C1,C2,…,Cj and the sum of the training samples is N after feature reduction, they become m-dimension feature vector.  Make sample X to be the same feature vector of the form (X1, X2,…, Xm), as all training samples.  Calculate the similarities between all training samples and X. Taking the ith sample di (di1,di2,…,dim) as an example, the similarity SIM(X, di) is as following:

SIM ( X , d i ) 

X j 1 2

m

j

.d ij
2

 m   m    X j  .   d ij       j 1   j 1 



Choose k samples which are larger from N similarities of SIM(X, di), (i=1, 2,…, N), and treat them as a KNN collection of X. Then, calculate the probability of X belong to each category respectively with the following formula.

P( X , C j ) 

 SIM ( X , d ) . y(d , C i i d

j

)

Where y(di, Cj) is a category attribute function, which satisfied y(d,, Cj) =  

1, d i C j 0, d i C j

Judge sample X to be the category which has the largest P(X, Cj).

3. Improved KNN Classification Based On Genetic Algorithm

IJCSI International Journal of Computer Science Issues, Vol. 7, Issue 4, No 2, July 2010 www.IJCSI.org flipping its value. Since we are considering real numbers in this paper, a random position is chosen in the chromosome and replace by a random number between 0-9. After the genetic operators are applied, the local maximum fitness value is calculated and compared with global maximum. If the local maximum is greater than the global maximum then the global maximum is assigned with the local maximum, and the next iteration is continued with the new population. The cluster points will be repositioned corresponding to the chromosome having global maximum. Otherwise, the next iteration is continued with the same old population. This process is repeated for N number of iterations. From the following section, it is shown that our refinement algorithm improves the cluster quality. The algorithm is given as. 1. 2. Choose k number of samples from the training set to generate initial population (p1). Calculate the distance between training samples in each chromosome and testing samples, as fitness value. Choose the chromosome with highest fitness value store it as global maximum (Gmax). a. For i = 1 to L do i. Perform reproduction ii. Apply the crossover operator. iii. Perform mutation and get the new population. (p2) iv. Calculate the local maximum (Lmax). v. If Gmax < Lmax then a. Gmax = Lmax; b. p1 = p2; b. Repeat Output – the chromosome which obtains Gmax has the optimum K-neighbors and the corresponding labels are the classification results.

20

the performance with various k values on three different distance measures (1-norm, 2-norm and n-norm). Table 2 depicts the performance accuracy of our proposed classifier compared with traditional KNN, CART and SVM classifiers [17]. From the results it is shown that our proposed method outperforms the others with greater accuracy.

5. Conclusion
The KNN classifier is one of the most popular neighborhood classifier in pattern recognition. However, it has limitations such as: great calculation complexity, fully dependent on training set, and no weight difference between each class. To combat this, a novel method to improve the classification performance of KNN using Genetic Algorithm (GA) is proposed in this paper. Initially the reduced feature set is constructed from the samples using Rough set based Bee Colony Optimization (BeeRSAR). Next, our proposed GKNN classifier is applied for classification. The basic idea here is that, instead of calculating the similarities between all the training and test samples and then choosing k-neighbors for classification, by using GA, only k-neighbors are chosen at each iteration,, the similarities are calculated, the test samples are classified with these neighbors and the accuracy is calculated. This process is repeated for L number of times to reach greater accuracy; hence the calculation complexity of KNN is reduced and there is no need to consider the weight of the samples. The performance of GKNN classifier is tested with five different medical data collected from UCI data repository, and compared with traditional KNN, CART and SVM. The experiments and results show that our proposed method not only reduces the complexity of the KNN, also it improves the classification accuracy. References [1]
E. Fix, and J. Hodges, “Discriminatory analysis. Nonparametric discrimination: Consistency properties”. Technical Report 4, USAF School of Aviation Medicine, Randolph Field, Texas, 1951. T.M. Cover, and P.E. Hart, “Nearest neighbor pattern classification”, IEEE Transactions on Information Theory, 13, pp. 21–27, 1967. R.O. Duda, and P.E. Hart, Pattern classification and scene analysis, New York: Wiley, 1973. W. Yu, and W. Zhengguo, “A fast kNN algorithm for text categorization”, Proceedings of the Sixth International Conference on Machine Learning and Cybernetics, Hong Kong, pp. 3436-3441, 2007. W. Yi, B. Shi, and W. Zhang’ou, “A Fast KNN Algorithm Applied to Web Text Categorization”, Journal of The China Society for Scientific and Technical Information, 26(1), pp. 60-64, 2007. K.G. Anil, “On optimum choice of k in nearest neighbor classification”, Computational Statistics and Data Analysis, 50, pp. 3113–3123, 2006. E. Kushilevitz, R. Ostrovsky, and Y. Rabani, “Efficient search for approximate nearest neighbor in high dimensional spaces”. SIAM Journal on Computing, 30, pp. 457–474, 2000. M. Lindenbaum, S. Markovitch, and D. Rusakov, “Selective sampling for nearest neighbor classifiers”, Machine Learning, 54, pp. 125–152, 2004.

3.

4.

4. Experiments & Results
The performance of the reduct approaches discussed in this paper has been tested with 5 different medical datasets, downloaded from UCI machine learning data repository. Table 1 shows the details about the datasets and the reduced feature set used in this paper. Table 1 Datasets Used for Reduct No. of Features in Total No. of Total No. of the Reduced Set Instances Features (BeeRSAR [12]) 366 34 7 6 300 13 500 32 699 21 56 09 8 4 4

[2] [3] [4]

[5]

Dataset Name Dermatology Cleveland Heart HIV Lung Cancer Wisconsin

[6] [7]

[8] With the reduced feature set, the GKNN classifier is applied. Ten-fold cross validation method is performed for analyzing

IJCSI International Journal of Computer Science Issues, Vol. 7, Issue 4, No 2, July 2010 www.IJCSI.org [9] [10]
C. Zhou, Y. Yan, and Q. Chen, “Improving nearest neighbor classification with cam weighted distance”. Pattern Recognition, 39, pp. 635–645, 2006. D.P. Muni, and N.R.D. Pal, “Genetic programming for simultaneous feature selection and classifier design”, IEEE Transactions on Systems Man and Cybernetics Part B – Cybernetics, 36, pp. 106–117, 2006. J. Neumann, C. Schnorr, and G. Steidl, “Combined SVMbased feature selection and classification”, Machine Learning, 61, pp. 129–150, 2005. N. Suguna, and K. Thanushkodi, “A Novel Rough Set Reduct Algorithm Based on Bee Colony Optimization”, International Journal of Granular Computing, Rough Sets and Intelligent Systems, (Communicated) 2010. Belur V. Dasarathy, “Nearest Neighbor (NN) Norms: NN Pattern Classification Techniques”, Mc Graw-Hill Computer Science Series, IEEE Computer Society Press, Las Alamitos, California, pp. 217-224, 1991. Y. Lihua, D. Qi, and G. Yanjun, “Study on KNN Text Categorization Algorithm”, Micro Computer Information, 21, pp. 269-271, 2006. D.E. Goldberg, Genetic Algorithms in Search, Optimization and Machine Learning, Addison-Wesley, New York, 1989. L. Davis (Ed.), Handbook of Genetic Algorithms, Van Nostrand Reinhold, New York, 1991. Q. Hu, D. Yu and Z. Xie, Neighborhood Classifiers, Expert Systems with Applications, 2007.

21

Author Biographies
N.Suguna received her B.E degree in Computer Science and Engineering from Madurai Kamaraj University in 1999 and M.E. degree in Computer Science and Engineering from Bangalore University in 2003. She has more than a decade of teaching experience in various Engineering colleges in Tamil Nadu and Karnataka. She is currently with Akshaya College of Engineering and Technology, Coimbatore, Tamilnadu, India. Her research interests include Data Mining, Soft Computing and Object Oriented Systems. Dr.K.Thanushkodi has more than 35 years of teaching experience in various Government & Private Engineering Colleges. He has published 45 papers in International journals and conferences. He is currently guiding 15 research scholars in the area of Power System Engineering, Power Electronics and Computer Networks. He has been the Principal in-charge and Dean in Government College of Engineering Bargur. He has served as senate member in Periyar University, Salem. He has served as member of the research board, Anna University, Chennai. He Served as Member Academic Council, Anna University, Chennai. He is serving as Member, Board of Studies in Electrical Engineering, Anna University, Chennai. He is serving as Member, Board of Studies in Electrical and Electronics & Electronics and Communication Engineering, Amritha Viswa Vidya Peetham, Deemed University, Coimbatore. He is serving as Governing Council Member SACS MAVMM Engineering College, Madurai. He served as Professor and Head of E&I, EEE, CSE & IT Departments at Government College of Technology, Coimbatore. He is serving as Syndicate Member of Anna University, Coimbatore. Currently, he is the Director of Akshaya College of Engineering and Technology, Coimbatore.

[11] [12]

[13]

[14] [15] [16] [17]

Table 2. Performance Analysis of the Classifiers
Classifier KNN K Value 5 Distance Measure 1-norm 2-norm n-norm 1-norm 2-norm n-norm 1-norm 2-norm n-norm 1-norm 2-norm n-norm 1-norm 2-norm n-norm 1-norm 2-norm n-norm 1-norm 2-norm n-norm 1-norm 2-norm n-norm Dermatology 75.75 ± 0.02 82.98 ± 0.05 65.77 ± 0.18 66.03 ± 0.83 73.80 ± 0.70 63.79 ± 0.76 73.97 ± 0.94 68.24 ± 0.84 75.96 ± 0.68 70.75 ± 0.26 64.08 ± 0.40 63.80 ± 0.45 93.79 ± 0.15 94.24 ± 0.69 94.80 ± 0.67 94.51 ± 0.94 93.54 ± 0.31 94.88 ± 0.28 93.97 ± 0.19 93.57 ± 0.66 93.86 ± 0.12 93.65 ± 0.53 94.60 ± 0.26 93.80 ± 0.65 89.12 85.76 Cleveland Heart 74.27 ± 0.06 66.62 ± 0.71 73.46 ± 0.89 69.65 ± 0.89 67.60 ± 0.30 67.42 ± 0.24 73.86 ± 0.45 71.68 ± 0.92 71.77 ± 0.14 69.96 ± 0.80 69.00 ± 0.63 72.64 ± 0.69 87.18 ± 0.92 92.63 ± 0.35 85.64 ± 0.47 92.81 ± 0.45 96.67 ± 0.40 90.47 ± 0.19 93.42 ± 0.75 81.21 ± 0.50 85.56 ± 0.63 90.45 ± 0.61 87.87 ± 0.19 88.96 ± 0.85 85.23 83.23 HIV 67.59 ± 0.92 68.31 ± 0.71 66.85 ± 0.63 73.13 ± 0.39 73.53 ± 0.94 74.64 ± 0.76 74.69 ± 0.09 72.99 ± 0.08 66.83 ± 0.69 68.73 ± 0.33 72.71 ± 0.40 69.22 ± 0.50 81.33 ± 0.61 90.71 ± 0.07 91.86 ± 0.90 80.53 ± 0.65 82.85 ± 0.35 92.98 ± 0.17 82.99 ± 1.00 91.96 ± 1.00 91.64 ± 0.93 82.64 ± 0.85 87.64 ± 0.58 83.89 ± 0.58 82.33 80.45 Lung Cancer 68.86 ± 0.23 68.07 ± 0.66 68.64 ± 0.10 68.52 ± 0.96 70.07 ± 0.88 70.05 ± 0.49 68.27 ± 0.14 68.50 ± 0.95 69.37 ± 0.71 67.85 ± 0.27 69.98 ± 0.62 69.57 ± 0.87 86.24 ± 0.56 90.17 ± 0.69 88.76 ± 0.29 88.91 ± 0.08 84.86 ± 0.37 82.99 ± 0.87 83.23 ± 0.36 82.88 ± 0.27 86.69 ± 0.81 84.69 ± 0.58 87.51 ± 0.49 86.78 ± 0.35 87.01 82.29 Wisconsin Breast Cancer 76.42 ± 0.31 69.48 ± 0.66 76.07 ± 0.67 75.60 ± 0.84 68.94 ± 0.69 78.09 ± 0.48 77.36 ± 0.37 80.44 ± 0.93 68.14 ± 0.42 72.10 ± 0.85 67.07 ± 0.40 71.69 ± 0.63 97.56 ± 0.01 90.89 ± 0.98 87.96 ± 0.70 97.92 ± 0.51 92.63 ± 0.84 92.62 ± 0.81 89.73 ± 0.36 86.15 ± 0.41 95.70 ± 0.53 86.39 ± 0.47 94.28 ± 0.78 91.38 ± 0.84 86.77 85.34

10

15

20

GKNN

5

10

15

20

SVM CART

Similar Documents

Free Essay

Genetic Algorithms

... A Genetic Algorithm Approach to Solve the Shortest Path Problem for Road Maps Sachith Abeysundara*, Baladasan Giritharan+, Saluka Kodithuwakku◊ *Department of Statistics and Computer Science, Faculty of Science, University of Peradeniya, Sri Lanka Email: sachith@email.com Telephone: (+94) 81 2374652 + Department of Statistics and Computer Science, Faculty of Science, University of Peradeniya, Sri Lanka Email: bgiri@pdn.ac.lk ◊ Department of Statistics and Computer Science, Faculty of Science, University of Peradeniya, Sri Lanka Email: salukak@pdn.ac.lk Telephone: (+94) 81 2394260 Abstract—This paper presents a new genetic algorithm approach to solve the shortest path problem for road maps. This is based on the analogy of finding the shortest possible distance between two towns or cities in a graph or a map with potential connection, which means that the path distances are always positive. Typically this is represented by a graph with each node representing a city and each edge being a path between two cities and there exist some traditional algorithms that produce solutions for the problem. A new method is found to solve the shortest path problem using GAs. The algorithm has been tested for a road map containing more than 125 cities and the experimental results guarantee to provide acceptably good solutions for the given search space. HE shortest path problem is typical in the world of combinatorial systems. This research will attempt to apply a Genetic algorithm to solve...

Words: 2513 - Pages: 11

Free Essay

Eigrp

...Lockett, Roblee Page 1 of 48 6/3/2003 GENETIC ALGORITHM BASED DESIGN AND IMPLEMENTATION OF MULTIPLIERLESS TWODIMENSIONAL IMAGE FILTERS by Douglas J. Lockett and Christopher D. Roblee ********* Senior Capstone Design Project Submitted in partial fulfillment of the requirements for the degree of Bachelor of Science Department of Electrical and Computer Engineering Union College Steinmetz Hall Schenectady, New York 12308 U.S.A. Submitted May 30, 2003 Final Project Report Senior Capstone Design Project, Department of Electrical and Computer Engineering Union College, 2003. © 2003 Douglas Lockett, Christopher Roblee Lockett, Roblee Page 2 of 48 6/3/2003 Table of Contents: Abstract……………………………………………………………………………….3 1. Introduction…………………………………………………………………………..4 2. Theory of Multiplierless Arithmetic………………………………………………...5 3. Image Filters 3.1. Motivations for IIR vs. FIR……………………………………………………....7 3.2. Edge Detection …………………………………………………………………..8 3.3. Canny Edge Detection……………………………………………………………9 4. Genetic Algorithms 4.1. Motivations……………………………………………………………………...10 4.2. Basic Theory…………………………………………………………………….10 4.3. Description of the Designed Genetic Algorithm………………………………..13 4.3.1. Fitness Function Definition and Crossover Selection…………………...17 4.3.2. Magnitude Response and Relative Error………………………………...19 4.3.3. GA Parameters…………………………………………………………...19 5. Results 5.1. Magnitude Frequency Analysis ……………………………………………...…21 5.2. Spatial Analysis…………………………………………………………………24...

Words: 9389 - Pages: 38

Free Essay

Biometrics

...Optimization of biometric Fingerprint Recognition parameters using Genetic Algorithms Report submitted for CPSC - 6126 Fall 2014 Term Paper By Krishna Sindhuri Nagavolu 13th December, 2014 Optimization  of  biometric  fingerprint  recognition  parameters  using   Genetic  Algorithms   Abstract This research paper discusses about parameter optimization for biometric fingerprint recognition with the use of genetic algorithms. An accurate access control system is very important in domains like identification checks at airports, government organizations, FBI’s, scientific working groups like NASA, US defense department and driver’s license office. The main reason these organizations prefer Biometrics is because it measures physiological characteristics like fingerprints, iris patterns, facial recognition, retina recognition, ear recognition and DNA analysis. Based on the features of the biometric sensors used, the system can detect whether a person is an authorized user or not. Though there are many other methods for identification, biometric sensors are considered to be very reliable and accurate. The main objective of this paper is to build a fingerprint recognizer that is fully automatic and can minimize the errors caused while matching the fingerprints. Keywords: Biometrics, genetic algorithms, Parameter optimization, fingerprints                           2|Page Optimization ...

Words: 2966 - Pages: 12

Free Essay

Bluetooth Network Security

...TECHNOLOGY A seminar report on GENETIC ALGORITHMS Submitted by Pranesh S S 2SD06CS061 8th semester DEPARTMENT OF COMPUTER SCIENCE ENGINEERING 2009-10 1 VISHVESHWARAIAH TECHNOLOGICAL UNIVERSITY S.D.M COLLEGE OF ENGINEERING AND TECHNOLOGY DEPARTMENT OF COMPUTER SCIENCE ENGINEERING CERTIFICATE Certified that the seminar work entitled “GENETIC ALGORITHMS” is a bona fide work presented by Pranesh S S bearing USN NO 2SD06CS061 in a partial fulfilment for the award of degree of Bachelor of Engineering in Computer Science Engineering of the Vishveshwaraiah Technological University, Belgaum during the year 2009-10. The seminar report has been approved as it satisfies the academic requirements with respect to seminar work presented for the Bachelor of Engineering Degree. Staff in charge SANTOSH DESHPANDE SIR Name: Pranesh S S USN: 2SD06CS061 H.O.D 2 INDEX 1. INTRODUCTION 2. HISTORY 3. DEFINITION AND EXPLANATION 4. NEED FOR GENETIC ALGORITHM 5. IMPORTANCE OF GAOVER OTHER TECHNIQUES? 6. WORKING OF GA 6.1 BASIC DESCRIPTION 6.2 GENERAL ALGORITHM 7. IMPLEMENTATION 8. EXAMPLE-A SIMULATION BY HAND 9. ADVANTAGES AND DISADVANTAGES 10. CONCLUSION 11. REFERENCES 4 4 5 5 6 6 7 7 8 11 12 13 13 3 Abstract Genetic Algorithms have recently become a popular artificial technique for solving complex optimization problems and a sophisticated tool for machine learning. This paper provides an introduction to genetic algorithms and brief applicability to problems...

Words: 2516 - Pages: 11

Free Essay

Generic Algorithim for Travelling Salesman

...Introduction TSP (Travelling salesman problem) is an optimization problem that it is difficult to solve using classical methods. Different Genetic Algorithm (GA) have been right to solve the TSP each with advantages and disadvantages (Davis, 2005) In this research paper, I highlight a new algorithm by merging different genetic Algorithm results to the better solution for TSP. In amalgam algorithm, appropriateness of algorithm and traveled distance for TSP has been considered. Results obtained suggest that it does not quickly establish in the local optimum and enjoys a good speed for an inclusive answer (Fogel, 2010). New methods such as GAs, refrigeration algorithms, Artificial Neural Networks, and ACO (Ant Colony Optimization) to solve TSP problem, in recent past have been suggested. Both ACO and GAs is centered on repetitive (Goldenberg, 2005) ACO system was unfilled for the first time by Dorigoat al. to solve TSP. In ACO algorithms, people work together to find the solution. In collective intelligence algorithms, it uses the real life of creatures without putting in consideration the complex mechanisms in run their day to day life in all aspects as best as possible. GA is an iterative procedure that contains a population of individuals or chromosomes. Coding of randomly or heuristic by a string of symbols as a gene in possible solution is done. All possible solution in this search space is examined. When search space is large, GAs usually are used. People can select an...

Words: 3446 - Pages: 14

Free Essay

Hello

...2.4.6 Genetic Algorithms. Genetic algorithms (GAs) are techniques that can be applied in the solution of optimization issues that are centered on combinations and search. Over ages, natural populaces develop in accordance with the standards of “natural selection and survival of the fittest”, in light of the hereditary process of living creatures. For imitation of the procedure of development of living creatures, genetic algorithms are suited for working with a vast kind of true issues. Ordinarily, in nature, there is dependably competition between individuals who belong to a particular population, as a rule is brought about by the shortage of resources, females, domain, hierarchy and so forth. The creatures likewise compete for different reasons: one is sustenance. At the point when the resource is rare or when there is an upsurge in the number of individuals in a particular population, the battle for sustenance is more noteworthy. This will eradicate the weakest or less attuned people. This competition normally has an effect on the quest of partner for reproduction purposes, individuals who are more fruitful in this quest are those that are more prone to create posterity. As the best adjusted individuals sustain their posterity in progressive eras, driving thusly to great individuals to consolidate their qualities, enhancing the species; those individuals without accomplishment in this journey, they will create less posterity Genetic Algorithms utilize an immediate comparability...

Words: 1655 - Pages: 7

Free Essay

Nid Using Ga

...International Journal of Advanced Science and Technology Vol. 29, April, 2011 A Mutual Construction for IDS Using GA S. Selvakani Kandeeban Professor and Head, MCA Department, Francis Xavier Engineering College, Tirunelveli, Tamilnadu, India sselvakani@hotmail.com R. S. Rajesh Reader, Department of CSE, Manonmanium Sundaranar University,, Tirunelveli, Tamilnadu, India rs_rajesh@yahoo.co.in Abstract A variety of intrusion prevention techniques, such as user authentication using passwords, avoidance of programming errors and information protection, has been used to protect computer systems. However information prevention alone is not sufficient to protect our systems as those systems become even more complex with the rapid growth and expansion of Internet technology and local network systems. Moreover, programming errors, firewall configuration errors and ambiguous or undefined security policies add to the system’s complexity. An Intrusion Detection system (IDS) is therefore needed as another layer to protect computer systems. The IDS is one of the most important techniques of information dynamic security technology. It is defined as a process of monitoring the events occurring in a computer system or network and analyzing them to differentiate between normal activities of the system and behaviors that can be classified as suspicious or intrusive. Current Intrusion Detection Systems have several known shortcomings, such as low accuracy (registering high False Positives and...

Words: 2519 - Pages: 11

Free Essay

Ofdm

...CHAPTER: 1 INTRODUCTION 1.1 Some basics elements of communication systems: In [1] [21], it is mentioned that communication system means a system where transmission of data or information is done from one point to another by several processes. The processes consist of generation of an information signal, description of the information signal through a defined set of symbols, encoding of the symbols through communication channels, decoding and reproduction of original symbols and finally re-creation of the original information signal. All these features of a communication system can be described by three basic elements such as transmitter, channel and receiver. Figure 1.1: Basic structure of communication system 1.2 Wireless communication background In 1921, Detroit Michigan Police Dept. made the earliest significant use of Mobile radio in a vehicle in the United States. The system operated at a frequency close to 2 MHz. The channels soon became overcrowded. In 1940, new frequencies between 30 and 40 MHz were made available. Increasing the available channels encouraged a substantial buildup of police systems. Shortly thereafter other users found a need for this form of communication. Private individuals, companies and public agencies purchased and operated their own mobile units. In 1945, first public mobile telephone system in the U.S. was inaugurated in St. Louis, Missouri with three channels at 150 MHz. Six channels spaced 60 kHz apart were allocated for this service...

Words: 15258 - Pages: 62

Free Essay

Gnetic Algorithms

...Genetic Algorithms Basic Genetic Algorithm – Flow Chart 1. Initial Population 1. Initial Population ON ON | | GENERATE RANDOM POPULATION (POSSIBLE SOLUTIONS) | 2. Fitness Evaluation 2. Fitness Evaluation 3. Selection 3. Selection | | EVALUATE THE FITNESS OF EACH (BASED ON THE FITNESS FUNCTION) | | | CHOOSE PARENT FACTORS (BETTER FITNESS = BETTER CHANCE) | 4. Crossover 4. Crossover 5. Mutation 5. Mutation | | CROSSOVER THE PARENT TRAITS TO FORM NEW CHILDREN. (PROBABILITY) | | | MUTATION PROBABILITY APPLIED (MAINTAINS GENETIC DIVERSITY) | Acceptable? Acceptable? | | IF OPTIMIZATION CONDITIONS ARE NOT MET(REPEAT STEPS 2-5) * OR | Yes Yes End Process End Process | | IF THE MAXIMUM GENERATIONS ARE MET (TERMINATE) * OR | | | IF SATISFACTORY FITNESS LEVEL IS REACHED (END THE PROCESS) | KEY TERMS * INDIVIDUAL Any possible solution to the problem at hand, usually expressed in binary code * POPULATION Group of all individuals * CHROMOSOME Blueprint for an individual usually expressed in binary code. (Ex: 011011) * GENE An individual value in a chromosome, usually expressed as a “1” or “0” * PARENTS An original “individual” solution in the GA process that has passed the fitness function * CHILDREN A new solution to the problem formed through crossover and mutation from the parent solutions * SEARCH SPACE All possible solutions to the problem...

Words: 261 - Pages: 2

Premium Essay

Cost-Quality Trade Off

...multi-objective multi-mode model for solving discrete time–cost–quality trade-off problems (DTCQTPs) with preemption and generalized precedence relations. The proposed model has three unique features: (1) preemption of activities (with some restrictions as a minimum time before the first interruption, a maximum number of interruptions for each activity, and a maximum time between interruption and restarting); (2) simultaneous optimization of conflicting objectives (i.e., time, cost, and quality); and (3) generalized precedence relations between activities. These assumptions are often consistent with real-life projects. A customized, dynamic, and self-adaptive version of a multiobjective evolutionary algorithm is proposed to solve the scheduling problem. The proposed multi-objective evolutionary algorithm is...

Words: 11435 - Pages: 46

Premium Essay

Cost-Quality Trade Off

...multi-objective multi-mode model for solving discrete time–cost–quality trade-off problems (DTCQTPs) with preemption and generalized precedence relations. The proposed model has three unique features: (1) preemption of activities (with some restrictions as a minimum time before the first interruption, a maximum number of interruptions for each activity, and a maximum time between interruption and restarting); (2) simultaneous optimization of conflicting objectives (i.e., time, cost, and quality); and (3) generalized precedence relations between activities. These assumptions are often consistent with real-life projects. A customized, dynamic, and self-adaptive version of a multiobjective evolutionary algorithm is proposed to solve the scheduling problem. The proposed multi-objective evolutionary algorithm is...

Words: 11435 - Pages: 46

Premium Essay

Cluster Analysis with Nature Inspired Algorithams

...A COMPARITIVE STUDY OF CLUSTER ANALYSIS WITH NATURE INSPIRED ALGORITHMS A PROJECT REPORT Submitted by K.Vinodini 310126510043 I.Harshavardhan 310126510039 B.Prasanth kumar 310126510013 K.Sai Sivani 310126510042 in Partial Fulfillment of the requirements for the Award of the Degree of BACHELOR OF TECHNOLOGY in COMPUTER SCIENCE AND ENGINEERING DEPARTMENT OF COMPUTER SCIENCE AND SYSTEMS ENGINEERING Anil Neerukonda Institute of Technology and Science (ANITS) ANDHRA UNIVERSITY : VISAKHAPATNAM – 530003 APRIL 2014 ANIL NEERUKONDA INSTITUTE OF TECHNOLOGY AND SCIENCES ANDHRA UNIVERSITY : VISAKHAPATNAM-530 003 BONAFIDE CERTIFICATE Certified that this project report “A Comparative study of cluster analaysis with Nature Inspired Algorithms”is the bonafide work of “K.Vinodini, I.Harsha, B.V.PrasanthKumar, K.SaiSivani”who carried out the project work under my supervision. Signature Signature Dr S C Satapathy Dr S C Satapathy HEAD OF THE DEPARTMENT ...

Words: 9404 - Pages: 38

Free Essay

Ant Colony Optimisation

...Technological University, Hyderabad) Himayathnagar, C.B.Post, Moinabad, Hyderabad-5000 2 075. CERTIFICATE This is to certify that the Seminar Report on “Ant Colony Optimization”, is a bonafide Seminar work done by Ranjith Kumar A (06J11A0534), in partial fulfillment for the award of the degree Bachelor of Technology in “Computer Science engineering” J.N.T.U Hyderabad during the year 2010. Y.V.S Pragathi M.Tech Head of CSE Department 3 Abstract Ant Colony Optimization (ACO) has been successfully applied to those combinatorial optimization problems which can be translated into a graph exploration. Artificial ants build solutions step by step adding solution components that are represented by graph nodes. The existing ACO algorithms are suitable when the graph is not very large (thousands of nodes) but is not useful when the graph size can be a challenge for the computer memory and cannot be completely generated or stored in it. In this paper we study a new ACO model that overcomes the difficulties found when working with a huge construction graph. In addition to the description of the model, we analyze in the experimental section one technique used for dealing with this huge graph exploration. The results of the analysis can help to understand the meaning of the new parameters introduced and to decide which parameterization is more suitable for a given problem. For the experiments we use one real problem with capital importance in Software Engineering: refutation...

Words: 5585 - Pages: 23

Premium Essay

An Adaptive Differential Evolution Algorithm with Novel Mutation and Crossover Strategies for Global Numerical Optimization

...482 IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS—PART B: CYBERNETICS, VOL. 42, NO. 2, APRIL 2012 An Adaptive Differential Evolution Algorithm With Novel Mutation and Crossover Strategies for Global Numerical Optimization Sk. Minhazul Islam, Swagatam Das, Member, IEEE, Saurav Ghosh, Subhrajit Roy, and Ponnuthurai Nagaratnam Suganthan, Senior Member, IEEE Abstract—Differential evolution (DE) is one of the most powerful stochastic real parameter optimizers of current interest. In this paper, we propose a new mutation strategy, a fitnessinduced parent selection scheme for the binomial crossover of DE, and a simple but effective scheme of adapting two of its most important control parameters with an objective of achieving improved performance. The new mutation operator, which we call DE/current-to-gr_best/1, is a variant of the classical DE/current-to-best/1 scheme. It uses the best of a group (whose size is q% of the population size) of randomly selected solutions from current generation to perturb the parent (target) vector, unlike DE/current-to-best/1 that always picks the best vector of the entire population to perturb the target vector. In our modified framework of recombination, a biased parent selection scheme has been incorporated by letting each mutant undergo the usual binomial crossover with one of the p top-ranked individuals from the current population and not with the target vector with the same index as used in all variants of DE. A DE variant obtained...

Words: 11062 - Pages: 45

Premium Essay

Electrical Paper

...MEXICO a Abstract Motivated by the recent success of diverse approaches based on Differential Evolution (DE) to solve constrained numerical optimization problems, in this paper, the performance of this novel evolutionary algorithm is evaluated. Three experiments are designed to study the behavior of different DE variants on a set of benchmark problems by using different performance measures proposed in the specialized literature. The first experiment analyzes the behavior of four DE variants in 24 test functions considering dimensionality and the type of constraints of the problem. The second experiment presents a more in-depth analysis on two DE variants by varying two parameters (the scale factor F and the population size NP ), which control the convergence of the algorithm. From the results obtained, a simple but competitive combination of two DE variants is proposed and compared against state-of-the-art DE-based algorithms for constrained optimization in the third experiment. The study in this paper shows (1) important information about the behavior of DE in constrained search spaces and (2) the role of this knowledge in the correct combination of variants, based on their capabilities, to generate simple but competitive approaches. Keywords: Evolutionary Algorithms, Differential Evolution, Constrained Email addresses: emezura@lania.mx (Efr´n Mezura-Montes), e emiranda@bianni.unistmo.edu.mx (Mariana Edith...

Words: 22303 - Pages: 90