Free Essay

Neural Networks for Matching in Computer Vision

In:

Submitted By ad3sp0ti
Words 3666
Pages 15
Neural Networks for Matching in Computer Vision
Giansalvo Cirrincione1 and Maurizio Cirrincione2
Department of Electrical Engineering, Lab. CREA University of Picardie-Jules Verne 33, rue Saint Leu, 80039 Amiens - France exin@u-picardie.fr Universite de Technologie de Belfort-Montbeliard (UTBM) Rue Thierry MIEG, Belfort Cedex 90010, France maurizio.cirricione@utbm.fr
1

2

Abstract. A very important problem in computer vision is the matching of features extracted from pairs of images. At this proposal, a new neural network, the Double Asynchronous Competitor (DAC) is presented. It exploits the self-organization for solving the matching as a pattern recognition problem. As a consequence, a set of attributes is required for each image feature. The network is able to find the variety of the input space. DAC exploits two intercoupled neural networks and outputs the matches together with the occlusion maps of the pair of frames taken in consideration. DAC can also solve other matching problems.

1

Introduction

In computer vision, structure from motion (SFM) algorithms recover the motion and scene parameters by using a sequence of images (very often only a pair of images is needed). Several SFM techniques require the extraction of features (corners, lines and so on) from each frame. Then, it is necessary to find certain types of correspondences between images, i.e. to identify the image elements in different frames that correspond to the same element in the scene. This paper addresses this specific problem, also known as matching. The techniques for image matching can be continuous or discrete. The latter are feature-based methods (FBM) and treat the images as samples of the scene taken at discrete times. They allow accurate estimation of motion parameters and structure of the scene even under a relatively large motion, do not suffer much from the problem of varying image intensity and do not need intensity smoothness. However, they suffer from several problems, like occlusion, depth discontinuities, repetitive patterns in the scene. As a consequence, image matching belongs to the class of the so called inverse problems, which are known to be ill-posed. It can be converted to a well-posed problem by introducing constraints in the correspondence formulation. The commonly used constraints are the similarity (or compatibility) constraint (matching features must have similar attribute values), the uniqueness constraint (almost always, a given pixel or feature from one view can match no
B. Apolloni et al. (Eds.): KES 2007/WIRN 2007, Part I, LNAI 4692, pp. 688–695, 2007. c Springer-Verlag Berlin Heidelberg 2007

Neural Networks for Matching in Computer Vision

689

more than one pixel or feature from the other view), the epipolar constraint (for calibrated stereo rig), the continuity (or disparity) constraint (correspondences for neighboring points in one image must have similar disparities), the disparity gradient constraint (condition on the tangent to the object surface), the ordering constraint, the smooth motion constraint and several geometric and kinematic constraints (see [1] for reference). The formulation of the correspondence problem is well-suited to computation by a neural network. Indeed, it can be formulated as an optimization task where a cost function (energy equation), representing the constraints, is minimized. The minimization can be mapped into a Hopfield neural network [2] such that the energy function is the same as the Lyapunov function of the network, with the synaptic interconnection weights between the neurons representing the constraints imposed by the corresponding problem. These applications yield bad results for real images because of their dimensionality and high computational demand, suffer from the local minima and the possible improvements (e.g. simulated annealing) are too time consuming for real time applications. The most interesting applications are the Zhou and Chellappa neural network [3] for static stereo, the Sarigianidis and Pycock neural network [4] which uses a motion correspondence method and the Nasrabadi and Choo neural network [5] for stereo vision correspondence (the cost function is devised by implementing the continuity, uniqueness and 2D rigidity). Unlike the previous ones, the Bellando and Kothari neural network [6] uses topology preserving maps (see after), but works well only for toy problems. In the next section a novel neural network is proposed. It considers matching as a pattern recognition problem. Two examples of matching are then given, the first by using a pair of synthetic images, the second by using a road sequence.

2

The Double Asynchronous Competitor (DAC)

The Double Asynchronous competitor (DAC) exploits the features of the self organizing feature maps (SOFM, [7]) for solving the correspondence problem. The SOFM defines a topologically preserving mapping from a high-dimensional input data space into a low dimensional output space: the latter is usually a 2D lattice of neurons. The neurons are usually arranged on a rectangular grid and connected according to a given neighborhood relation. Each neuron possesses a weight vector whose dimensionality is the same of the input vector. The learning is unsupervised and its updating is winner-take-most : after presentation of the input vector x to the network, the neuron i whose weight wi is the nearest to the input, according to a predefined metric, is selected (winning neuron with index i∗ ). The winner-take-most approach updates both the winning neuron and its lattice neighbors according to: wi ←− wi + α(t)G( i, i∗ , t)(x − wi ) ∀i


(1)

where α(t) is the learning rate, usully decreasing with time, and G( i, i , t) is a weighting function (neighborhood kernel ) depending on the grid distance around the winning neuron. The kernel can be Gaussian:

690

G. Cirrincione and M. Cirrincione

G( i, i∗ , t) = e



wi −wi∗ ρ2 (t)

2 2

(2)

where the Euclidean distance is used and ρ(t) (decreasing with time) is the neighborhood radius. The SOFM quantizes the input distribution and is able to capture the underlying variety of the input space. The vector quantization is constrained by the grid neighborhood structure of the neurons. These units project the input distribution toward the neuron grid by respecting locally the concept of proximity: in this sense, SOFM is topologically preserving. DAC works for sequences of images without need of epipolar constraints. It is able to solve the matching problem even in the case of several moving objects and mobile camera. It also yields an occlusion map for each image. The following description considers the two-view problem, but can be easily extended to multiple views. DAC is composed of two SOFMs (NN12 and NN21) which are coupled. Every SOFM has a particular lattice: the neurons are placed in one image (2D lattice) at the geometrical position of the feature points and the training set is composed of the attribute vectors of the other image. An advantage of this approach is the possibility of using as many attributes as desired, without any care of eventual correlations: in every case, the network is able to detect the unknown variety of the input space. The choice of the attributes depends on the kind of image sequence. In the simulations, the SUSAN corner detector [8] has been used. The advantage of this choice is given by the fact that no explicit image derivatives are needed and the feature extraction is very quick. According to the value of its scalar parameter t (brightness difference threshold), corners or edge points (edgels) can be extracted. A lot of attributes can be attached to these features, e.g., the intensity (gray value) of the feature point, the edgeness as defined in [9], the positive and the negative cornerness as defined in [9], the USAN area computed by the SUSAN detector and the USAN centre of gravity w.r.t. the feature point [8], the measure of interest of the Moravec operator (computed ad hoc for the feature point, [10]), the Laplacian using 3x3 or 5x5 masks for the second derivatives [11], the second derivative in the direction of the gradient and in the direction orthogonal to the gradient [11] and the coordinates of the feature points, weighted far less than the other attributes in the training set. NN12 has the first image (feature points) as grid (neurons), the attribute vectors of feature points of the second image as training set (TS) and the attribute vectors of feature points of the first image as initial weight vectors. NN21 is the opposite: the second image as grid, the first image attribute vectors as training set and the second image attribute vectors as initial weight vectors. This choice for the initial conditions helps implementing the search for the correspondence similarity by means of the SOFM quantization process. The choice for the grid implies the continuity and 2D rigidity constraints thanks to the topologically preserving behavior of the SOFM. Indeed, the network seeks to 2D project the one image attribute vectors in order to respect the relative other image feature point positions. In a certain sense, it is a way of implementing the 2D rigidity constraint without need of introducing an a priori shift parameter as in the existing matching techniques. The coupling between NN12 and NN21 implements

Neural Networks for Matching in Computer Vision

691

the uniqueness constraint. The SOFMs used in DAC have Gaussian neighborhood kernels. An important difference with the SOFM learning is the possibility for an input vector of not having a winning neuron. In fact, a neuron is accepted as winner only if its weight vector stays into a hypersphere (search window in the attribute/input space) centered at the input vector, whose radius is given by the threshold δ (t) which decreases with time for increasing resolution. The association input vector-winning neuron is considered as a possible match.

Fig. 1. 1D illustration of the determination of the occlusion maps. The images are one-dimensional. The displacement fields illustrate correspondences between two 1D images.

There are several possible architectures for DAC. The basic arrangement is made of the series of NN21 and NN12 and a feedback from NN12 to NN21. One DAC epoch is equal to the sum of one NN21 epoch (half DAC epoch) and one NN12 epoch (half DAC epoch). Every half DAC epoch, a test is done for the setting of the first and second image occlusion maps. Fig.1 shows the testing for 1D images. The first half epoch (presentation of the whole image 1 training set) is given by the NN21 learning (right side of fig.1). This learning is coupled to the other SOFM. At the end of the epoch, a first approximated disparity map 2 → 1 is given by the input data and associated winning neurons. This first matching does not use occlusion information. Hence, this matching may jam the occluded parts of the second image into parts of the first image. The objective of this first half DAC epoch is to compute the occlusion map 1 (first image). The jamming generally will not affect the computation of the occlusion map 1, since the occluded regions of image 1 to be detected may only occur on the opposite side across the jammed region [9]. Those regions (input data/features) in the first image that have not been matched are occluded in the second image and are marked as occluded: every neuron/input vector has a state variable which is set to zero in case of occlusion, i.e. when no neuron wins. If the input vector (feature) is not occluded, its state variable is set to the number of the winning neuron. Hence, a test is done on the input data state variables. If an input vector is occluded, it is excluded from the TS for the successive N21 learnings and the corresponding

692

G. Cirrincione and M. Cirrincione

(i.e. attached to the same feature point of image 1) neuron in NN12 is inactivated and does not enter the N12 learning. This is recorded by a state variable set to zero for the corresponding neuron. This inactivation is represented by the arrow from N21 (right) to N12 (left) in fig.1. Once this inactivation is done, NN12 learns for one half DAC epoch with the remaining neurons (presentation of the whole N12 training set). A first approximated disparity map 1 → 2 is output. The results of this learning determine occlusion map 2 analogously. The same test is done on the NN12 input data. As a consequence, the occluded data are ruled out the TS and the corresponding neurons in the N21 grid are inactivated. This inactivation is represented by the arrow from N12 (left) to N21 (right) in fig.1. This procedure is recursive. The coupling between NN12 and NN21 is given by the following procedure (the coupling between NN21 and NN12 is analogous): 1. 2. 3. 4. 5. Enter vector k of the image 2 TS into NN12. Neuron f 2 wins (its weight vector must also be in the δ-hypersphere). Neuron f 2 and its Gaussian neighborhood learn. Select the corresponding couple in NN21: neuron k and input vector f 2. Neuron k and its Gaussian neighborhood learn only if the k weight vector is in the δ-hypersphere centered at f 2.

As a consequence of this coupling, one half epoch in a SOFM induces learning in the other one before the corresponding half epoch. This coupling implements the uniqueness constraint as shown in fig.2. Indeed, the asymmetry of DAC (one feature has only one winning neuron, but one neuron may win for more than one feature) and the interchange of the function of neuron and feature between the SOFMs imply the uniqueness.

Fig. 2. DAC uniqueness coupling property

Resuming, the only DAC parameters are the neighborhood radius ρ(t), the learning rate α(t) and the threshold δ (t) for the acceptance of the winner. Experience has suggested the following heuristics for setting a constant value for δ: merge the two training sets and compute all the possible Euclidean distances D between couples of vectors; then, set σ to (1.1 ÷ 1.2) dmin , where dmin is equal ¯ ¯ to D − σD and D and σD are the average and standard deviation of the set of distances D.

Neural Networks for Matching in Computer Vision

693

After every DAC epoch, a control is done on the DAC results as a stop criterion: 1. Every non occluded input vector/feature point from image 1 has a nonzero state variable representing the number of the associated winning neuron for image 2. 2. The winning neuron represents also the input vector for the feature point in the same image (i.e. image 2). 3. If the state variable of this input vector (image 2) coincides with the state variable of the input vector of image 1 (first step), then the match is accepted. 4. If all matches are accepted, then DAC stops and outputs the list of the correspondences and the lists of the two occlusion maps.

3

Simulations

The first simulation deals with a pair of synthetic images. Given a world reference frame xyz, centered at the center of projection of the normalized camera (unit focal distance) and of z-axis parallel to the optical axis, a certain number of points Pi = (xi , yi , zi ), where xi , yi ∈ [−5, 5] and zi ∈ [3, 10], has been selected randomly. These points are then translated of the vector [2, 3, 1] and rotated around the axis [0, 0, 1] of 0.1 rad. Then, the initial and final points are projected (perspective projection) into an image plane whose coordinates are parallel to the x- and y-axes of the world reference frame and whose centre is the intersection with the optical axis. Hence, the correspondences are known a priori. A white Gaussian noise (μ = 0, σ = 1) is then added to all image points. A comparison is made between the network of Nasrabadi and Choo and DAC. In the first network, the vertical disparity is also implemented in the compatibility measure and the selected parameters (used for the comparison) are λ = 1 and ϑ = 10 (for an explanation of these terms, see [5]). In the second network, the input vectors are only composed of the coordinates of the image points. The learning rate α(t) is a decreasing function (inversely proportional to the iteration number) from 0.3 to 0.03; after reaching the minimum, the function remains constant. at 0.03. The same function is used both for the Gaussian standard deviation ρ(t) and for the hypersphere ray δ(t), with initial values, respectively, 1.5 and 3 and same final value 0.03. Tab.1 shows the results, averaged over five experiments for every choice of the number of points. DAC is more accurate, both for noiseless and, above all, for noisy images, even if only two attributes can be used. Furthermore, it is by far faster than the other network and does not suffer from the problem of local minima. In fact, only two DAC epochs are sufficient for all the experiments. The second simulation considered here is fully presented in [12]. Two image frames are extracted from a road sequence of 100 frames, given by the CSTV CNR Computer Vision Laboratory of Turin (Italy). It has been captured by a camera mounted on the roof of a van (mobile laboratory MOBLAB) which slowly turns to left. The frames have 256 grey levels and pixel dimension 720 × 280. In the camera model (frame XY Z) for the test road image pair, the Y -axis has the vertical direction and is oriented upward. A certain number of feature points

694

G. Cirrincione and M. Cirrincione Table 1. Correct match percentages for the synthetic pair DAC noiseless noisy 100 % 100 % 100 % 100 % 100 % 98.3 % 100 % 96.9 % Nasrabadi & Choo noiseless noisy 98.2 % 96.4 % 95.2 % 90.6 % 92.1 % 87.5 % 89.1 % 77.2 %

points 10 30 70 100

Fig. 3. DAC results (arrows) for the test road image sequence, superimposed on the first frame. The black arrows are excluded by the CASEDEL EXIN neural network.

(corners) are extracted by the SUSAN corner finder. The image matching has been performed by using DAC, which uses all the attributes described before. In fact DAC is able to automatically capture the variety of the input space. The learning rate α(t) is constant and equal to 0.01; the Gaussian standard deviation ρ(t) is a decreasing function (inversely proportional to the iteration number) with initial value 150; the hypersphere ray δ(t) is constant and equal to 1. DAC converges after only two epochs and finds 31 correspondences. These correspondences are visualized as superimposed on the first frame in fig.3. In the neural SFM module presented in [12], the neural network called CASEDEL EXIN is able to characterize the cluster of black arrows in the upper left of fig.3 as cluster of mismatches (outliers) and so it refines the DAC matching. The presence of this group of black arrows is explained by the absence of detected corners for the advertising poster in the first frame [12]. Despite the claims of Stephen Smith, SUSAN does not show a good stability in this image sequence. This explains the inaccuracy of some matches.

4

Conclusion

This paper has presented a novel kind of neural network, DAC, which exploits the self-organization for solving the matching problem in SFM as a pattern

Neural Networks for Matching in Computer Vision

695

recognition problem. As a consequence, a set of attributes is required for each feature point. The network is able to find the variety of the input space. DAC is faster and more accurate than the Hopfield based neural techniques and is by far better than the BPN based neural techniques which would require nearly unlimited training sets. DAC is an open architecture in the sense that different schemas are possible and other self-organizing learning laws can be chosen. In [12] DAC is integrated with the EXIN SNN neural network, which is able to segment the image and has a similar learning law. Hence, it can be implemented directly in the DAC learning law and can refine the matching. Future work will deal firstly with the choice of other feature detectors, better if neural, which must be less unstable than SUSAN. A more detailed analysis of the limits and the convergence properties of DAC is still needed. Its coupling and temporal scheduling must be further investigated. Furthermore, the DAC matching technique can be extended to other problems as the template matching, the range point matching and the graph matching. The parallelization of the DAC architecture must be still analysed.

References
1. Faugeras, O.: Three-Dimensional Computer Vision: a Geometric Viewpoint. MIT Press, Cambridge, Massachusetts (USA) (1993) 2. Hopfield, J.: Neural networks and physical systems with emergent collective computational abilities. Proc. Nat. Acad. Sci. 79, 2554–2558 (1982) 3. Zhou, Y., Chellappa, R.: Artificial Neural Networks for Computer Vision. Research Notes in Neural Computing, vol. 5. Springer, Heidelberg (1992) 4. Sarigianidis, G., Pycock, D.: Motion correspondence using a neural network. British Machine Vision Conference, 649–658 (1993) 5. Nasrabadi, N., Choo, C.: Hopfield network for stereo vision correspondence. IEEE Trans. on Neural Networks 3(1), 5–12 (1992) 6. Bellando, J., Kothari, R.: On image correspondence using topology preserving mappings. In: Proc. Int. Conf. On Neural Networks, Washington, D.C (USA), pp. 1784–1789 (June 1996) 7. Kohonen, T.: Self-Organization and Associative Memory. Springer, Berlin (1989) 8. Smith, S., Brady, J.: Susan-a new approach to low level image processing. International Journal of Computer Vision 23(1), 45–78 (1997) 9. Weng, J., Huang, T., Ahuja, N.: Motion and Structure from Image Sequences. Springer Series in Information Sciences. Springer, Heidelberg (1993) 10. Moravec, H.: Visual mapping by a robot rover. In: Proc. Of the 6th Int. Joint Conf. On Artificial Intelligence, pp. 598–600 (1979) 11. Beaudet, P.: Rotational invariant image operators. In: Proc. Of the Int. Conf. On Pattern Recognition, pp. 579–583 (1978) 12. Cirrincione, G.: A Neural Approach to the Structure from Motion Problem. PhD thesis, LIS INPG Grenoble (December, 1998)

Similar Documents

Free Essay

Stereoscopic Building Reconstruction Using High-Resolution Satellite Image Data

...Stereoscopic Building Reconstruction Using High-Resolution Satellite Image Data Anonymous submission Abstract—This paper presents a novel approach for the generation of 3D building model from satellite image data. The main idea of 3D modeling is based on the grouping of 3D line segments. The divergence-based centroid neural network is employed in the grouping process. Prior to the grouping process, 3D line segments are extracted with the aid of the elevation information obtained by using area-based stereo matching of satellite image data. High-resolution IKONOS stereo images are utilized for the experiments. The experimental result proved the applicability and efficiency of the approach in dealing with 3D building modeling from high-resolution satellite imagery. Index Terms—building model, satellite image, 3D modeling, line segment, stereo I. I NTRODUCTION Extraction of 3D building model is one of the important problems in the generation of an urban model. The process aims to detect and describe the 3D rooftop model from complex scene of satellite imagery. The automated extraction of the 3D rooftop model can be considered as an essential process in dealing with 3D modeling in the urban area. There has been a significant body of research in 3D reconstruction from high-resolution satellite imagery. Even though a natural terrain can be successfully reconstructed in a precise manner by using correlation-based stereoscopic processing of satellite images [1], 3D building reconstruction...

Words: 2888 - Pages: 12

Free Essay

Cognitive Science

...Digital analog: Neuron: digital: spike or none; analog: number of spikes per second Sparsity: % of neurons fire in response to a stimulus How many objects a neuron respond: sparsity times total objects ANN: weight, more input more output : algorithm and representational Input times weight Not threshold to fire Turing test: computational Visual fields: left visual field: nasal left eye, temporal right eye, right hemisphere Right visual field: nasal right eye, temporal left eye Color blindness: missing cones; common: no L or M cone Cones not function at night One class of rods, see in the night Opponent processing: Red/green: (L-M): differences between those 2 cones/ if miss L, then can’t tell red from green Blue/yellow: (s-s+m/2) Explicit: conscious Episodic/semantic Implicit: skill memory LTP: stronger synaptic connection Long term: grow more receptors on post synapse anatomical Short term: amount of neurons Turing machine Single vs double dissociation Single: one manipulation Double: two manipulations Visual angle Grandmother cell a lot of cells respond for Halle Berry Do not respond only to Halle Berry Math: impossibly large number of neurons Only 100 images do not necessarily show that those cells only respond to one concept Size constancy: If no depth cue/ with out size constancy; then same visual angle same proximal size and same perceived size. s Alternative: different difficulties of those 2 tasks ...

Words: 4004 - Pages: 17

Premium Essay

Diabetic Retinopathy Literature Review

...Estimation of diabetic retinopathy with artery/vein classification in retinal images using Artificial Neural Network Leshmi Satheesh M.Tech Student,Dept. of Electronics & Communication Mohandas College of Engineering, Kerala University Trivandrum-695541, Kerala, India Email: leshmi24@gmail.com Abstract—Diabetic retinopathy is the single largest explanation for sight loss and visual impairment in eighteen to sixty five year olds . Damage of blood vessels in the eye and the formation of lesions in the retina are the earliest signs of diabetic retinopathy. Efficient image processing and analysis algorithms have to be developed for the automated screening programs to work robustly and effectively. For the detection of vascular changes...

Words: 3295 - Pages: 14

Premium Essay

An Evolution of Computer Science Research

...Abbreviated version of this report is published as "Trends in Computer Science Research" Apirak Hoonlor, Boleslaw K. Szymanski and M. Zaki, Communications of the ACM, 56(10), Oct. 2013, pp.74-83 An Evolution of Computer Science Research∗ Apirak Hoonlor, Boleslaw K. Szymanski, Mohammed J. Zaki, and James Thompson Abstract Over the past two decades, Computer Science (CS) has continued to grow as a research field. There are several studies that examine trends and emerging topics in CS research or the impact of papers on the field. In contrast, in this article, we take a closer look at the entire CS research in the past two decades by analyzing the data on publications in the ACM Digital Library and IEEE Xplore, and the grants awarded by the National Science Foundation (NSF). We identify trends, bursty topics, and interesting inter-relationships between NSF awards and CS publications, finding, for example, that if an uncommonly high frequency of a specific topic is observed in publications, the funding for this topic is usually increased. We also analyze CS researchers and communities, finding that only a small fraction of authors attribute their work to the same research area for a long period of time, reflecting for instance the emphasis on novelty (use of new keywords) and typical academic research teams (with core faculty and more rapid turnover of students and postdocs). Finally, our work highlights the dynamic research landscape in CS, with its focus constantly ...

Words: 15250 - Pages: 61

Premium Essay

Biometrics

...Summary. Various anthropometric studies have been conducted in the last decade in order to investigate how different physiological or behavioral human characteristics can be used as identity evidence to prove the individuality of each person. Some of these characteristics are: face, eyes, ears, teeth, fingers, hands, feet, veins, voice, signature, typing style and gait. Since the first biometric security systems appeared in the market, an increasing demand for novel techniques that will cover all different scenarios, has been observed. Every new method appears to outmatch some of its competitors but, at the same time, presents disadvantages compared to others. However, there is still no method that consists a single panacea to all different scenarios and demands for security. This is the reason for which researchers are on a continuous effort for more efficient and generic biometric modalities that can be used in various applications. In this chapter, emerging biometric modalities that appeared in the last years in order to improve the performance of biometric recognition systems, are presented. The presented methods are divided in two major categories, intrusive and non-intrusive ones, according to the level of user nuisance that each system sets off. 1 Introduction Biometric recognition is a well-known research area that aims to provide more efficient solutions to everyday growing human need for security. Biometrics refers to methods that can be used for uniquely recognizing...

Words: 12845 - Pages: 52

Free Essay

Deep Learning Wikipedia

...http://ml.memect.com Contents 1 Artificial neural network 1 1.1 Background . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 1.2 History . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2 1.2.1 Improvements since 2006 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2 Models . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 1.3.1 Network function . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 1.3.2 Learning . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 1.3.3 Learning paradigms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 1.3.4 Learning algorithms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 1.4 Employing artificial neural networks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 1.5 Applications . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 1.5.1 Real-life applications . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 1.5.2 Neural networks and neuroscience . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 1.6 Neural network software . . . . . . . . . . . . . . . . ...

Words: 55759 - Pages: 224

Free Essay

A Fusion Approach for EffiCient Human Skin Detection

...detection despite wide variation in ethnicity and illumination. To the best of our knowledge, this is the first method to employ fusion strategy for this purpose. Qualitative and quantitative results on three standard public datasets and a comparison with state-of-the-art methods have shown the effectiveness and robustness of the proposed approach. Index Terms—Color space, dynamic threshold, fusion strategy, skin detection. I. INTRODUCTION W ITH the progress of information society today, images have become more and more important. Among them, skin detection plays an important role in a wide range of image processing applications from face tracking, gesture analysis, content-based image retrieval systems to various human–computer interaction domains [1]–[6]. In these applications, the search space for...

Words: 5432 - Pages: 22

Free Essay

Nogotiation

...Artificial intelligence From Wikipedia, the free encyclopedia Jump to: navigation, search "AI" redirects here. For other uses, see Ai. For other uses, see Artificial intelligence (disambiguation). Artificial intelligence (AI) is the intelligence of machines and the branch of computer science that aims to create it. AI textbooks define the field as "the study and design of intelligent agents"[1] where an intelligent agent is a system that perceives its environment and takes actions that maximize its chances of success.[2] John McCarthy, who coined the term in 1955,[3] defines it as "the science and engineering of making intelligent machines."[4] AI research is highly technical and specialized, deeply divided into subfields that often fail to communicate with each other.[5] Some of the division is due to social and cultural factors: subfields have grown up around particular institutions and the work of individual researchers. AI research is also divided by several technical issues. There are subfields which are focussed on the solution of specific problems, on one of several possible approaches, on the use of widely differing tools and towards the accomplishment of particular applications. The central problems of AI include such traits as reasoning, knowledge, planning, learning, communication, perception and the ability to move and manipulate objects.[6] General intelligence (or "strong AI") is still among the field's long term goals.[7] Currently popular approaches include statistical...

Words: 7301 - Pages: 30

Free Essay

Artificial Neural Network for Biomedical Purpose

...ARTIFICIAL NEURAL NETWORKS METHODOLOGICAL ADVANCES AND BIOMEDICAL APPLICATIONS Edited by Kenji Suzuki Artificial Neural Networks - Methodological Advances and Biomedical Applications Edited by Kenji Suzuki Published by InTech Janeza Trdine 9, 51000 Rijeka, Croatia Copyright © 2011 InTech All chapters are Open Access articles distributed under the Creative Commons Non Commercial Share Alike Attribution 3.0 license, which permits to copy, distribute, transmit, and adapt the work in any medium, so long as the original work is properly cited. After this work has been published by InTech, authors have the right to republish it, in whole or part, in any publication of which they are the author, and to make other personal use of the work. Any republication, referencing or personal use of the work must explicitly identify the original source. Statements and opinions expressed in the chapters are these of the individual contributors and not necessarily those of the editors or publisher. No responsibility is accepted for the accuracy of information contained in the published articles. The publisher assumes no responsibility for any damage or injury to persons or property arising out of the use of any materials, instructions, methods or ideas contained in the book. Publishing Process Manager Ivana Lorkovic Technical Editor Teodora Smiljanic Cover Designer Martina Sirotic Image Copyright Bruce Rolff, 2010. Used under license from Shutterstock.com First published March, 2011 Printed in...

Words: 43079 - Pages: 173

Premium Essay

Job Performance

...Competency 1 1.2 Business Competency 2 1.3 System Competency 2 2.0 Four Major Information Systems 3 2.1 Management Information System 3 2.2 Knowledge Management System 4 2.3 Customer Relationship Management 4 2.4 Supply Chain Management System 5 3.0 Cloud Computing 5 3.1 Characteristics of Cloud Computing 6 3.1.1 Elasticity & Scalability 6 3.1.2 Provisioning 6 3.1.3 Standardisation 6 3.1.4 Billing and Service Usage 7 3.2 Issues with Cloud Computing before Implementation 7 4.0 Technology Review 7 5.0 Operating Systems in Personal Computers 8 5.1 Features of Microsoft Windows 8 9 6.0 Enterprise Systems 10 6.1 Benefits of implementing Enterprise Systems 10 6.2 Challenges caused by implementing Enterprise Systems 11 7.0 Intelligent Systems 11 7.1 Types of Intelligent Systems 12 7.1.1 Expert Systems 12 7.1.2 Artificial Neural Networks 12 7.1.3 Motion Controls 13 7.1.4 Genetic Algorithms 13 8.0 Web Services 13 9.0 Educational Institutions 14 10.0 Technological Safeguards 15 10.1 Encryption 15 Reference List 16 1.0 Technical, Business and System Competencies Information system is best defined as the an amalgamation of hardware, software, technical infrastructure and skilled employees which are prearranged to smooth the progress of planning, controlling, coordinating and decision making within an organisation....

Words: 4438 - Pages: 18

Free Essay

Strategy

...aid them about the existence of traffic signs to minimized unwanted circumstances during driving such as fatigue, poor sight and adverse weather conditions. Though a various number of traffic sign detection systems have been revised in literature; the need of design with a robust algorithm still remains open for further research. This paper purposes to design a system capable of performing traffic sign detection while considering variations of challenges such as color illumination, computational difficulty and functional constraints existed. Traffic sign detection is divided into three main parts namely; Pre-processing, Color segmentation and Thresholding. The color segmentation method is vital as it presents a detailed investigation of vision based color spaces in this case RGB, HSV and CMYK considering varying illumination conditions under different environments. This paper further highlights possible improvements to the proposed approaches for traffic sign detection. Keywords: Traffic sign, detection and recognition, HSV, RGB, Bhattacharyya Coefficient ©...

Words: 2939 - Pages: 12

Free Essay

Paper

...ABSTRACT The field of humanoids robotics is widely recognized as the current challenge for robotics research .The humanoid research is an approach to understand and realize the complex real world interactions between a robot, an environment, and a human. The humanoid robotics motivates social interactions such as gesture communication or co-operative tasks in the same context as the physical dynamics. This is essential for three-term interaction, which aims at fusing physical and social interaction at fundamental levels. People naturally express themselves through facial gestures and expressions. Our goal is to build a facial gesture human-computer interface fro use in robot applications. This system does not require special illumination or facial make-up. By using multiple Kalman filters we accurately predict and robustly track facial features. Since we reliably track the face in real-time we are also able to recognize motion gestures of the face. Our system can recognize a large set of gestures (13) ranging from “yes”, ”no” and “may be” to detecting winks, blinks and sleeping. Chapter 1 ROLE OF HUMANOIDS 1. INTRODUCTION.: The field of humanoids robotics, widely recognized as the current challenge for robotics research, is attracting the interest of many research groups worldwide. Important efforts have been devoted to the objective of developing humanoids and impressive results have been produced, from the technological point of view, especially...

Words: 3654 - Pages: 15

Free Essay

3d Animation Captcha

...distinguish between human users and computer programs, CAPTCHA (Completely Automated Public Turing test to tell Computers and Human Apart) mechanism is widely applied in websites such as accounts application website. While the major implementation of CAPTCHA method—2D still image verification code based on OCR technology is threatened by developing artificial intelligence and image recognition technologies. In this paper, we propose a new approach to implement CAPTCHA mechanism based on 3D Animation, utilizing the weakness of computer vision, which make it robust to computer attacks and convenient for users to recognize, and implemented this method to generate a 3D animation verification code. Keywords-CAPTCHA;VerificationCode;Moving Three-dimensional Animation I. Figure 1. objects; INTRODUCTION Internet is crucial to each respect of life all over the globe nowadays, through which we could retrieve and exchange information freely and efficiently. Given the fundamental relation between internet and people’ s life, vast malicious computer programs attack websites for profits, such as auto application for some mails’ accounts to send junk e-mails, etc. CAPTCHA (Completely Automated Public Turing test to tell Computers and Human Apart) system emerges to solve this problem by identifying end-users of internet whether a real person or an automated computer program[1][2][3]. It also prevents malicious computer program impropriating limited resources...

Words: 3406 - Pages: 14

Free Essay

Haru

...System Engineering Management ROBOT PAINTING USING ARTIFICIAL INTELLIGENCE SUBMITTED BY Hari priya Kapileswarapu 1504606 ADVISOR Steven Maher; Geoffrey Rodman OKLAHOMA CHRISTIAN UNIVERSITY ABTRACT: This paper describes about one the functionality of the robot and how robot paints an object using artificial intelligence. And objects like iron wall, wooden box, motor bike etc. Suppose if the robot has to paint a motor bike it has to identify the parts of the motor bike and has to paint the color to those parts. For example, if the rims of the wheels had to be painted in black color then the robot has to identify the rims and paint the black color. In the same way, if the chassis had to be painted with black color then the robot has to identify the chassis and paint it accordingly. In this way, the whole motor bike would be painted by the robot based on the parts. For painting the parts of the motor bike there are some methods like generating the optimum path and the trajectory methods. In this way, robot can paint the respective object based on the models with different colors. Index Terms— Artificial Intelligence, Dec ACKNOWLEDGEMENT: I feel it a great pleasure and honor to express our immense gratitude towards our esteemed guide, Geoffrey Rodman for standing by our side all through the implementation of the project. His able technical guidance and expertise have Contribute to the success of...

Words: 5654 - Pages: 23

Free Essay

Mech-Humanoid-Robot

...present a seminar report. It helped me a lot to realize of what we study for. Secondly, I would like to thank my parents who patiently helped me as i went through my work and helped to modify and eliminate some of the irrelevant or un-necessary stuffs. Thirdly, I would like to thank my friends who helped me to make my work more organized and well-stacked till the end. Next, I would thank Microsoft for developing such a wonderful tool like MS Word. It helped my work a lot to remain error-free. Last but clearly not the least,I would thank The Almighty for giving me strength to complete my report on time. 3 www.studymafia.org Content  Introduction  System architecture  Real-time facial gesture recognition system  The vision system  Tracking the face:  Conclusion  References 4 www.studymafia.org...

Words: 3784 - Pages: 16