Free Essay

Computational Linguistics

In: Science

Submitted By shagshogah
Words 880
Pages 4
Computational Linguistics Computational linguistics (CL) is a discipline between linguistics and computer science which is concerned with the computational aspects of the human language faculty. It belongs to the cognitive sciences and overlaps with the field of artificial intelligence (AI), a branch of computer science aiming at computational models of human cognition. Computational linguistics has applied and theoretical components. Theoretical CL takes up issues in theoretical linguistics and cognitive science. It deals with formal theories about the linguistic knowledge that a human needs for generating and understanding language. Today these theories have reached a degree of complexity that can only be managed by employing computers. Computational linguists develop formal models simulating aspects of the human language faculty and implement them as computer programmes. These programmes constitute the basis for the evaluation and further development of the theories. In addition to linguistic theories, findings from cognitive psychology play a major role in simulating linguistic competence. Within psychology, it is mainly the area of psycholinguistics that examines the cognitive processes constituting human language use. The relevance of computational modelling for psycholinguistic research is reflected in the emergence of a new subdiscipline: computational psycholinguistics.

Applied CL focusses on the practical outcome of modelling human language use. The methods, techniques, tools and applications in this area are often subsumed under the term language engineering or (human) language technology. Although existing CL systems are far from achieving human ability, they have numerous possible applications. The goal is to create software products that have some knowledge of human language. Such products are going to change our lives. They are urgently needed for improving human-machine interaction since the main obstacle in the interaction beween human and computer is a communication problem. Today's computers do not understand our language but computer languages are difficult to learn and do not correspond to the structure of human thought. Even if the language the machine understands and its domain of discourse are very restricted, the use of human language can increase the acceptance of software and the productivity of its users.

Natural language interfaces enable the user to communicate with the computer in French, English, German, or another human language. Some applications of such interfaces are database queries, information retrieval from texts, so-called expert systems, and robot control. Current advances in the recognition of spoken language improve the usability of many types of natural language systems. Communication with computers using spoken language will have a lasting impact upon the work environment, completely new areas of application for information technology will open up. However, spoken language needs to be combined with other modes of communication such as pointing with mouse or finger. If such multimodal communication is finally embedded in an effective general model of cooperation,we have succeeded in turning the machine into a partner. Much older than communication problems between human beings and machines are those between people with different mother tongues. One of the original aims of applied computational linguistics has always been fully automatic translation between human languages. From bitter experience scientists have realized that they are still far away from achieving the ambitious goal of translating unrestricted texts. Nevertheless computational linguists have created software systems that simplify the work of human translators and clearly improve their productivity. Less than perfect automatic translations can also be of great help to information seekers who have to search through large amounts of texts in foreign languages.

The rapid growth of the Internet/WWW and the emergence of the information society poses exciting new challenges to language technology. Although the new media combine text, graphics, sound and movies, the whole world of multimedia information can only be structured, indexed and navigated through language. For browsing, navigating, filtering and processing the information on the web, we need software that can get at the contents of documents. Language technology for content management is a necessary precondition for turning the wealth of digital information into collective knowledge. The increasing multilinguality of the web constitutes an additional challenge for our discipline. The global web can only be mastered with the help of multilingual tools for indexing and navigating. Systems for crosslingual information and knowledge management will surmount language barriers for e-commerce, education and international cooperation.

We still do not know very well how people produce and comprehend language. Yet our understanding of the intricate mechanisms that underly human language processing keeps growing. Modelling such mechanisms on a computer also helps us to discover and formally describe hidden properties of human language that are relevant for any kind of language processing including many useful software applications. Our long term goal is the deep understanding of human language and powerful intelligent linguistic applications. However, even today's language technologies full of clever short cuts and shallow processing techniques can be turned into badly needed software products.

For many students and practitioners of computational linguistics the special attraction of the discipline is the combination of expertise from the humanities, natural and behavioural sciences, and engineering. Scientific approaches and practical techniques come from linguistics, computer science, psychology, and mathematics. At some universities the subject is taught in computer science at others it belongs to linguistics or cognitive science. In addition there is a small but growing number of programs and departments dedicated solely to computational linguistics.

Similar Documents

Free Essay

None

...Evaluating Commercial Spoken Language Translation Software Harold SOMERS and Yuri SUGITA1 Centre for Computational Linguistics UMIST, PO Box 88 Manchester M60 1QD, England Harold.Somers@umist.ac.uk, sugita_yuri@yahoo.co.jp Abstract1 While spoken language translation remains a research goal, a crude form of it is widely available commercially for Japanese–English as a pipeline concatenation of speech-to-text recognition (SR), text-to-text translation (MT) and text-to-speech synthesis (SS). This paper proposes and illustrates an evaluation methodology for this noisy channel which tries to quantify the relative amount of degradation in translation quality due to each of the contributing modules. A small pilot experiment involving word-accuracy rate for the SR, and a fidelity evaluation for the MT and SS modules is proposed in which subjects are asked to paraphrase translated and/or synthesised sentences from a tourist’s phrasebook. Results show (as expected) that MT is the “noisiest” channel, with SS contributing least noise. The concatenation of the three channels is worse than could be predicted from the performance of each as individual tasks. 1. Introduction Evaluation is without doubt a major aspect of language engineering, including Machine Translation (MT). Although it is still true that no consensus exists regarding the best way to evaluate software, there is general agreement about some of the factors that must be taken into account when deciding what form an......

Words: 5225 - Pages: 21

Free Essay

Sentiment Analysis in Emotional

...eKNOW 2014 : The Sixth International Conference on Information, Process, and Knowledge Management Malay Semantic Text Processing Engine Benjamin Chu Min Xian MIMOS Berhad Kuala Lumpur, Malaysia mx.chu@mimos.my Liu Qiang MIMOS Berhad Kuala Lumpur, Malaysia qiang.liu@mimos.my Rohana Mahmud University of Malaya Kuala Lumpur, Malaysia rohanamahmud@um.edu.my Arun Anand Sadanandan MIMOS Berhad Kuala Lumpur, Malaysia arun.anand@mimos.my Kow Weng Onn MIMOS Berhad Kuala Lumpur, Malaysia kwonn@mimos.my Dickson Lukose MIMOS Berhad Kuala Lumpur, Malaysia dickson.lukose@mimos.my Abstract—Semantic Text Understanding is a process that transforms text into conceptual representation. In this paper, we propose a Text Understanding System for Malay Language. The system comprises of two components: Morphology Analyzer and Semantic Text Interpreter. Some initial evaluation experiments were conducted on these components to gain explanatory insights into its performance. All the current text processing systems we reviewed are focused on preliminary algorithms and rules associated to lexical, morphological and syntax analysis. In our paper, we developed an integrated approach for a text understanding system that has the ability to represent the semantics of the text. Keywords-Natural Language Understanding; Morphology Interpretation. Processing; Semantic Analysis; Semantic Text Text I. INTRODUCTION The development of fast algorithms to understand and exploit......

Words: 3718 - Pages: 15

Free Essay

Invitation

...Dear ***, I'm writing to you in my capacity as Program Committee Co-Chair for the 38th Annual Meeting of the Association for Computational Linguistics, to be held in Hong Kong from October 1--8, 2000. As you may know, the ACL conference is the premier international conference of note in the field of computational linguistics and natural language processing; I am writing to ask whether you would be willing to present a talk at the conference as an invited speaker. Invited talks will be one hour long, including a 10 minute question-answer session. We have not yet established on which day your talk would be scheduled; should you accept this invitation, there is some flexibility we can use to accommodate your own scheduling preferences (although it would be on one of the main conference session days, Tuesday October 3 through Friday October 6). In appreciation of your agreement to provide an invited talk, ACL would provide the cost of an economy class airfare from your home institution to the conference, hotel accommodations during the conference, and free registration to the conference. I will be away for an extended period of time and will not be able to read my email on a regular basis during this time. So please cc Professor Martha Palmer, an area chair and member of the ACL-2000 program committee, in your response. She has kindly agreed to coordinate the invited speaker sessions during my absence. I do very much hope that you will be able to accept this......

Words: 258 - Pages: 2

Free Essay

Forensic Science

...What is Forensic Linguistics? According to expert Dr. Carole E. Chaski forensic linguistics applies the theories, constructs and analytical methods of linguistics to questions which arise in civil, criminal and security investigations and adjudication. Dr. Chaski notes that an informed understanding of forensic-linguistics requires familiarity with the broader application of linguistics as a social science. This is because virtually every linguistic nuance has been examined within a legal context. These include:  Phonetics & phonology - the system of sounds in human language  Morphology - the minimal units of sounds which convey meanings  Syntax - the ways in which words can combine into phrases  Semantics & Syntax - word, phrasal and sentential meanings  Pragmatics - language use in conversation and other forms of discourse  Language change - the ways in which language evolves over time  Psycholinguistics - how the human cognitive system processes language  Neurolinguistics - human neuroanatomy and neuronal functioning  Sociolinguistics - identification of linguistic patterns constrained by social, class, racial and gender boundaries  Dialectology - identification of dialects related to regional boundaries  computational linguistics - tools through which computers can be used to perform linguistic tasks While there are many primary and secondary linguistic subfields, Dr Chaski makes the very important point that one should not be...

Words: 513 - Pages: 3

Premium Essay

Linguistics

...linguistics lɪŋgwɪstɪks/ noun 1. 1. the scientific study of language and its structure, including the study of grammar, syntax, and phonetics. Specific branches of linguistics include sociolinguistics, dialectology, psycholinguistics, computational linguistics, comparative linguistics, and structural linguistics. Discourse[edit] A discourse is a way of speaking that emerges within a certain social setting and is based on a certain subject matter. A particular discourse becomes a language variety when it is used in this way for a particular purpose, and is referred to as a register.[25] There may be certain lexical additions (new words) that are brought into play because of the expertise of the community of people within a certain domain of specialisation. Registers and discourses therefore differentiate themselves through the use of vocabulary, and at times through the use of style too. People in the medical fraternity, for example, may use some medical terminology in their communication that is specialised to the field of medicine. This is often referred to as being part of the "medical discourse", and so on. Dialect[edit] A dialect is a variety of language that is characteristic of a particular group among the language speakers.[26] The group of people who are the speakers of a dialect are usually bound to each other by social identity. This is what differentiates a dialect from a register or adiscourse, where in the latter case, cultural identity does not......

Words: 18394 - Pages: 74

Premium Essay

Tutututuut

...TACIT VERSUS CONSCIOUS KNOWLEDGE * Tacit vs. conscious knowledge * Descriptive vs. prescriptive rules * Each human language corresponds to a highly complex system of rules * Linguistics involves studying these rules and their implications SPEECH AND PHONEMES * The Language Faculty is a mental system that converts a physical signal (sound or sign) into meaning, and vice versa. * The Language Faculty makes several intermediate step between sound and meaning. * Phonemes are the smallest segment of speech that leads to meaningful contrasts between words is. * The properties of the vocal folds (the source), such as frequency and amplitude, are not critical to speech sounds. * It is the properties of the vocal tract (the filter) that are critical. * Formants are the highest amplitude peaks in the frequency spectrum created by the human vocal tract. Different formants lead to different speech sounds. ARTICULORY FEATURES AND CATEGORICAL PERCEPTION * Formants are the physical properties of phonemes that appear to be most critical to speech perception * However, we don’t use formants directly (we know this because of the three problems with formants). * Instead, during comprehension we convert formants into the articulatory features that the speaker used to create the formants. * Consonants are each made up of three features: place of articulation, manner of articulation, and voicing. * Vowels are each made up of two......

Words: 411 - Pages: 2

Free Essay

Thesis 2015

...CHAPTER 1 INTRODUCTION 1.1 Background of the Study "Language and Media" is a comprehensive introduction to how language interacts with media. It investigates the forms of language found in media discourse; how patterns in such language use contribute to recognizable media genres and styles; and, broader social themes and consequences that arise from media language. It uses a wide variety of real texts from the media specifically from Magazines. Boles (2012) believes that each magazine uses linguistic techniques these language features are creatively used in magazines by the writers to attract and entertain the target audience as they only have a small space and time to get the attention of the potential buyer of the product. Bell (2010) emphasizes the importance of the processes which produce media language, as stories are molded and modified by various hands. He stresses it is indeed stories that journalists and editors produce, not articles. These stories have viewpoint, values and structure that can be analyzed. He is concerned, too, with the role of the audience in influencing media language styles, and in understanding, forgetting or misconceiving the news presented to it. A feature story is a special human interest story article that is not closely tied to a recent news event. It focuses on particular people, places, and events, and it goes into great detail regarding concepts and ideas of specific market interest. The study primarily aims to provide...

Words: 1649 - Pages: 7

Premium Essay

Ambiguity

...Why Ambiguity? Ariel Cohen Ben-Gurion University arikc@bgu.ac.il All human languages are ambiguous. Ambiguity is not restricted to some special constructions that linguists are fond of discussing, but is quite ubiquitous. It is hard, in fact, to find a sentence that is not ambiguous. This fact is all too familiar to computational linguists: “One often hears in computational linguistics about completely unremarkable sentences with hundreds of parses, and that is in fact no exaggeration.” (Abney, 1996). The question is, simply, why? Why is language ambiguous? Krifka (2002) raises a similar question concerning vagueness. He shows that there are good reasons for language to allow, and even encourage vagueness. Vagueness, however, is not ambiguity. When we are told that the theater is far from here, we may be unsure as to the precise distance; but we know what the speaker intends, and we can draw inferences based on this, e.g., that we should take a cab instead of walking to the theater. But with an ambiguous term, the intended meaning is not merely insufficiently specified; it is not known, until the term is disambiguated. When we hear that John has a kid, we draw very different inferences if John is a father or if he owns a young goat. Thus, the advantages of vagueness do not seem to apply to ambiguity, and we are back to the question: why ambiguity? Some may say that it doesn’t matter. Humans possess very powerful mechanisms for disambiguation; these mechanisms resolve ambiguities,...

Words: 1571 - Pages: 7

Premium Essay

Second Language Acquisition

...The theoretical framework for teachers’ beliefs in this study is based on a computational model of second language acquisition proposed by Ellis (1998) that highlights four major stages that relates roughly to common beliefs on the process of how grammar should be taught. The first stage, input, stresses on the materials and the modification of input used in teaching grammar in the classroom. The second, explicit knowledge, provides explicit instruction; the third, output, emphasizes production practice; and the final stage, negative feedback, highlights the role of error correction. The model can be seen in Figure 1 with the four stages highlighted by the letters A, B, C, and D respectively. Figure _: A Computational Model of L2 Acquisition...

Words: 776 - Pages: 4

Premium Essay

Country Road

...Simply defined, linguistics is the scientific study of language. Though various types of language studies (including grammar and rhetoric) can be traced back over 2,500 years, the era of modern linguistics is barely two centuries old. Kicked off by the late-18th-century discovery that many European and Asian languages descended from a common tongue (Proto-Indo-European), modern linguistics was reshaped, first, by Ferdinand de Saussure (1857-1913) and more recently by Noam Chomsky (born 1928). The systematic study of the nature, structure, and variation of language. Major subfields of linguistics include phonetics, phonology, morphology, syntax, semantics, pragmatics, and discourse analysis. The founder of modern structural linguistics was Ferdinand de Saussure (1857-1913), whose most influential work, Course in General Linguistics, was edited by his students and published in 1916. Source:  An Introduction to Language by Victoria Fromkin and Robert Rodman, 6th Ed.) Part One: Introduction to Linguistics Every human knows at least one language, spoken or signed. Linguistics is the science of language, including the sounds, words, and grammar rules. Words in languages are finite, but sentences are not. It is this creative aspect of human language that sets it apart from animal languages, which are essentially responses to stimuli. The rules of a language, also called grammar, are learned as one acquires a language. These rules include phonology, the sound system, morphology,......

Words: 10632 - Pages: 43

Premium Essay

An Application for Automated Evaluation of Student Essay

...Criterion SM Online Essay Evaluation: An Application for Automated Evaluation of Student Essays Jill Burstein Educational Testing Service Rosedale Road, 18E Princeton, NJ 08541 jburstein@ets.org Martin Chodorow Department of Psychology Hunter College 695 Park Avenue New York, NY 10021 martin.chodorow@hunter.cuny.edu Claudia Leacock Educational Testing Service Rosedale Road, 18E Princeton, NJ 08541 cleacock@ets.org Abstract This paper describes a deployed educational technology application: the CriterionSM Online Essay Evaluation Service, a web-based system that provides automated scoring and evaluation of student essays. Criterion has two complementary applications: E-rater®, an automated essay scoring system and Critique Writing Analysis Tools, a suite of programs that detect errors in grammar, usage, and mechanics, that identify discourse elements in the essay, and that recognize elements of undesirable style. These evaluation capabilities provide students with feedback that is specific to their writing in order to help them improve their writing skills. Both applications employ natural language processing and machine learning techniques. All of these capabilities outperform baseline algorithms, and some of the tools agree with human judges as often as two judges agree with each other. 2. Application Description Criterion contains two complementary applications that are based on natural language processing (NLP) methods. The scoring application, e-rater®,......

Words: 5634 - Pages: 23

Premium Essay

Language Theories

...heories about how young children acquire and develop language Young children become amazingly proficient communicators during the first three years of life. As the Birth to Three Matters framework points out, they use 'the hundred languages of children' - body language (including facial expressions and dance); sign language (their own and family inventions as well as an officially recognised sign language); painting, drawing and mark-making; and oral expression. They have been acutely active listeners since their days in the womb, where they learned to recognise the speech patterns, tunes and tones of the languages used in their home contexts. Language theory research informs us that young children's language development is influenced by many factors, including having sensitive adults and older children around them who will listen and attend to their expressions and who will use and model appropriate language themselves. This has been called 'Motherese' by researchers led by Cathy Snow. Children's babbling during their first year includes the sounds of every world language and 'crib talk' demonstrates their intense interest in the sounds they hear around them. Although children with a hearing loss will stop babbling, if they grow up in a home with parents who can sign, they will follow the same patterns of development using their first language - signing - and will sign their first word at around the same age that hearing children speak theirs. Between two and three......

Words: 21453 - Pages: 86

Premium Essay

Linguistics

...Linguistics Introduction the scientific study of language. The word was first used in the middle of the 19th century to emphasize the difference between a newer approach to the study of language that was then developing and the more traditional approach of philology. The differences were and are largely matters of attitude, emphasis, and purpose. The philologist is concerned primarily with the historical development of languages as it is manifest in written texts and in the context of the associated literature and culture. The linguist, though he may be interested in written texts and in the development of languages through time, tends to give priority to spoken languages and to the problems of analyzing them as they operate at a given point in time. The field of linguistics may be divided in terms of three dichotomies: synchronic versus diachronic, theoretical versus applied, microlinguistics versus macrolinguistics. A synchronic description of a language describes the language as it is at a given time; a diachronic description is concerned with the historical development of the language and the structural changes that have taken place in it. The goal of theoretical linguistics is the construction of a general theory of the structure of language or of a general theoretical framework for the description of languages; the aim of applied linguistics is the application of the findings and techniques of the scientific study of language to practical tasks, especially to the......

Words: 30965 - Pages: 124

Premium Essay

Polysemy in Translation

...Hammouda Salhi University of Carthage, Tunisia hammouda_s@hotmail.com Abstract: This article investigates a topic at the interface between translation studies, lexical semantics and corpus linguistics. Its general aim is to show how translation studies could profit from the work done in both lexical semantics and corpus linguistics in an attempt to help ‘endear’ linguists to translators (Malmkjær, 1998). The specific objective is to capture the semantic and pragmatic behavior of the noun ‘destruction’ from its different translations into Arabic. The data are taken from an English-Arabic parallel corpus collected from UN texts and their translations (hereafter EAPCOUNT). While it seems that ‘destruction’ is monosemous, it turns out, after exploring its occurrences, to be highly polysemous and shows a case of complementary polysemy, where a number of alternations can be captured. These findings are broadly in line with the results reached in recent developments in lexical semantics, and more particularly the Generative Lexicon (GL) theory developed by James Pustejovsky. Some concrete suggestions are made at the end on how to enhance the relation between linguists and translators and their mutual cooperation. Key words: Lexical semantics, corpus linguistics, translation studies, complementary polysemy, coercion, parallel corpora, lexical ambiguities Résumé: Le présent article aborde un sujet à la croisée des études de traduction, de la......

Words: 8055 - Pages: 33

Premium Essay

Bes 219

...Introduction This paper will be discussing the information systems and software applications available for use in the accounting, marketing, and human resources departments that work within the constructs of an organization. It will discuss the selected applications, organizational departments, and how they relate to each other. In today’s business environment most if not all businesses rely heavily on computer technology, such as information systems and software applications. Many of these programs have made business faster, more reliable, and much more efficient than in the old days of everything being writen by hand written on paper by a person. Computer based business information systems are not the only form of information system, but keeping with the times computer information systems will be mainly discussed in this essay. A major player in software applications for human resources is PeopleSoft. The PeopleSoft application is designed to help with everyday production, find training tools, and locate documents. It also helps with business performance, and increased productivity due to its many features. The application has a vast number of features and updates including 200 industry specific updates. ("Oracle.com", 2011). Quick books, is a type of business accounting software. Quick books, is mostly used in the small business arena. It has a variety of accounting solutions such as, remote access, remote payroll and outsourcing, online banking and......

Words: 291 - Pages: 2