Free Essay

Speech Recognition

In:

Submitted By hilly21
Words 642
Pages 3
Speech Recognition The world of information technology is constantly making improvements and advancements. Throughout the past decade or so, we have experienced a whole new realm of technology, much of which was never even deemed imaginable. We have seen the development and continuous improvements in smart phones, whether its Wi-Fi connections, 3G, or even 4G. We have seen the enhancement in computer software and operating systems such as the new OS X Lion developed by Apple. While these extraordinary advancements have left many people wondering what is next, I believe the answer and next “big thing” will be the perfection of speech recognition. Speech recognition, also know as voice recognition or voice command, is a type of software which recognizes spoken words by the user and can interpret these words into a command. This is essentially a computer with though processing ability. However, this piece of technology has never been very efficient and in many of cases, has been avoided. It is often difficult for a user to speak slowly and clear enough for the system to recognize what is being said, causing frustration and a waste of time. It is also difficult for the software to recognize the wide array of accents which people have. According to a speech recognition research company called Type Well, speech recognition is only about 60% accurate. This shows that the development of an efficient and usable speech recognition product is still a few years away. Although the perfection of speech recognition software is yet to be seen, there is hope and many expectations for this to become very applicable for every day use. Speech recognition software can improve and make millions of lives easier by doing a being complete a few basic tasks. The most needed feature for future speech recognition products should be a speech to text command. According to EmergingTechnologies.com, “Speech to text software is perfect for anyone who wants to multitask. It makes it simple to dictate notes or documents verbally to the program that then turns those spoken words into written text” (“Emerging Technologies”). Not only could this be effective for mobile text messages, it would also be highly beneficial in the office or school setting when needing to type a report or memo. In order to do this the user would simply speak to the computer or phone and it would process the words on the screen just as in typing. This could also be ground-breaking in the world of automobiles because drivers who would normally text and risk their lives could now speak to their phones to accurately send a text message without ever losing sight of the road. Finally, according to Miss Ronda Field “it allows students with physical impairments, fine motor difficulties or visual impairments to generate text without the need for handwriting” (Field). Information technology is continuously changing for the betterment and increased efficiency of our society. Researchers work non-stop to find a better product to make our lives easier each and every day. With that said, speech recognition software can be and most likely will be a product that leads to an easier lifestyle. As Bill Gates said, referring to the future of speech recognition tecnology, “There are some things that we are always thinking about. For example, when will speech recognition be good enough for everybody to use that? And we have made a lot more progress this year on that. I think we will surprise people a bit on how well we will do on our speech recognition” (“BillGatesmicrosoft.com”).

References:

"Speech Recognition." Type Well. N.p., 2009. Web. .

Field, Ronda. "Speech/Text Software." Miss Fields Eduportal. Ronda Field, 2009. Web. .

"The Business Benefits of Speech Technology and Voice Recognition Technology." Emerging Technology. Emerging Technologies, 04 007 2009. Web. .

"Bill Gates Quotes." BillGatesmicrosoft.com. N.p., n.d. Web. .

Similar Documents

Free Essay

Short and Longterm Finical Impact of Speech Recognition

...management. There is a broad gap in understanding the role of communication services in health care delivery. Concepts about health and behaviors are made by communication, information, and technology that people relate with. Doctors use speech recognition as a form of communication not only for transcription and dictation but also as clinical decision support (just in case). In this paper, the author will discuss the effective and efficient, advantages and disadvantages, as well of the short-and-long term financial impact of speech recognition. Framework and propositions suggest that successful implement of the speech-recognition technology will positively affect performance in the health care industry (citation). Information is entered into the patient’s record with speech-recognition software by using add-ons. Physicians may have a patient management system; or be a part of a larger system such as, hospitals. Good speech-recognition will should meet standards and have feedback to the physician. Maximum assurance is provided for the Center for Medicare and Medicaid Services. More time is spent with the patient instead of paper work. Speech-recognition will also make business more efficient. By saying notes openly into EHRs, using speech recognition with the digital dictation systems, doctors can update information quickly and with lesser error (citation). Doctors will receive information about the patients test results faster. Test results and records are accessed rapidly by other...

Words: 394 - Pages: 2

Premium Essay

Nt1310 Unit 3 Speech Recognition System

...3 Speech Recognition System 3.1 Pocketsphinx The recognition framework used for acoustic modeling and recognition is Sphinx/pocketsphinx [11]. It was chosen because of the low processing and memory footprint: fast feedback to the user will be essential even when many clients connect at the same time to a server and many instances of the engine might be running in parallel. Due to the restriction in the current pocketsphinx decoder, maximum 128 word-classes can be used, therefore, the source code was modified to accommodate larger number of classes without any impact on the performance. 3.2 Training Configuration Acoustic model training and performance evaluation was conducted using Sphinx training tools. Customized procedure for model training and testing was established. The critical parameters as the word recognition performance (WER), real...

Words: 1460 - Pages: 6

Free Essay

Improving Assisted Technology

...overcome the unique challenges they face in today's modern, high communication world. While Assistive Technology is making strides to close the learning gap between persons with and without learning disabilities there is still a long way to go before technology provides a level playing field for these challenged individuals. Many of the issues with existing assistive technology revolves around clumsy, inefficient interfaces that struggle to find a balance between ease of use and sufficient complexity to ensure that the proper sequence of instructions is implemented. Machine learning is on the cutting edge of programming practices and presents some significant improvement possibilities in the areas of natural language processing, pattern recognition, and interface design. Machine learning has the potential to play a significant role in allowing assistive technologies to be more adaptive to persons with diverse sets of needs. This paper will attempt to define some specific areas of assistive technology that could benefit most from the application of machine learning. We will frame the definitions by aligning specific learning disabilities with current and future assistive technologies and then examining how the implementation of machine learning could improve upon them. Introduction The need for assistive technologies is undeniable with as many as 8 to 10 percent of children that are under the age of 18 in the United States having one form of learning disability or another.(NINDS)...

Words: 2619 - Pages: 11

Free Essay

Voice Activated Devices

...Dictation Speech recognition devices are widely used by physicians because they provide many advantages in the health care environment that they practice. Due to managed care, doctors are restricted in the amount of time they can spend with their patients because they use most of their time doing paperwork that is required of them. Speech recognition systems such as dictation programs and devices have brought a new outlook for the application of technology in healthcare organizations especially among physicians. Dictation programs and devices allow doctors to use the time formerly spent on record keeping to see more patients. Many programs and devices exist today that physicians can choose from. Every device or program offered by a medical vendor contains advantages and disadvantages. It is therefore imperative that physicians choose a product that best compliments their treatment practices. In the early days, the benefits of voice-activated programs and devices were limited by the lack of memory capacity and speed of personal computers. Early versions ran on mainframe computers and had a limited vocabulary. Discrete speech was the first application of this technology that was created. This technology used a discrete speaking style that required the speaker to pause between words so that the engine could identify each word accurately (Scott). Most users believed these short pauses to be impractical even though it was highly accurate. Discrete speech later became...

Words: 1915 - Pages: 8

Free Essay

Technical Report Topic Ideas

...Technical Report Topic Ideas any major:  technology management  issues in technical writing or communications  multi-cultural/multinational issues  competition for consumers  professional problem  professional code of ethics  implementing an ombudsman program  product liability  on site security--data, people or materials workplace violence outsourcing   new overtime regulations accounting:  inventory systems  pension/ stock option problems  corporate contributions to political parties  executive compensation  prevention of accounting fraud risk analysis agriculture:  land use management  genetically altered plants  control of crown gall in ornamental plants  methods of crop estimation/pricing/futures bioterrorism in crops   architecture:  options in environmental or natural disaster proof structures (floods, fires, earthquakes, etc.)  landscape designs for different environments (drought, boggy, etc.)  solar heating or cooling designs  lighting systems for large structures  restoration methods for old and/or historic buildings aviation:  wind shear problems and solutions  pilot retirement or retention  issues  training and safety procedures  global positioning systems  runway incursion solutions  aircraft fatigue  competing materials for aircraft structures screening/security issues options in aircraft for corporate use small airport management biology/pre-med liability insurance/malpractice reform options in diagnosis or treatment ...

Words: 682 - Pages: 3

Free Essay

Research and Critical Analysis Into Audio Transcription Processes

...Research and Critical Analysis into Audio Transcription processes By: Kiehne, Alexander Table of Contents Abstract……………………………………………………………………………….3 Topic Statement………………………………………………………………………3 Work Setting………………………………………………………………………….4 Situation Analysis…………………………………………………………………….5 Problem Analysis……………………………………………………………………..7 Plan of Action………………………………………………………………………...8 Background Research 1 – Similar software…………………………………………..8 Background Research 2 – Market for software……………………………………….9 Analysis……………………………………………………………………………….11 Critical Logs…………………………………………………………………………..12 Critical log 1………………………………..12 Critical log 2………………………………..12 Critical log 3……………………………….12 Critical log 4……………………………….13 References…………………………………………………………………………….14 Appendices……………………………………………………………………………15-16 Abstract In this document the possibility of a potential loop in the market is examined. In the field of linguistic services, the process of transcription is one of the major services requested, by the legal branch of State and Federal Government Agencies. The demand for such services is high as the court system heavily relies on written documents for official use. In time most documents transcribed could be performed by the use of sophisticated software that allows the user to accelerate the transcription process by feeding audio into a computer, allowing the software to transcribe the information. This...

Words: 2975 - Pages: 12

Free Essay

Bolta Mosam

...The BBN continuous speech recognition system :- In this paper, they describe BYBLOS, the BBN continuous speech recognition system. The system, designed for large vocabulary applications, integrates acoustic, phonetic, lexical, and linguistic knowledge sources to achieve high recognition performance. The basic approach it makes is the extensive use of robust context-dependent models of phonetic coarticulation using Hidden Markov Models (HMM). It describes the components of the BYBLOS system, including: signal processing frontend, dictionary, phonetic model training system, word model generator, grammar and decoder. In recognition experiments, it demonstrates consistently high word recognition performance on continuous speech across: speakers, task domains, and grammars of varying complexity. In speaker-dependent mode, where 15 minutes of speech is required for training to a speaker, 98.5% word accuracy has been achieved in continuous speech for a 350-word task, using grammars with perplexity ranging from 30 to 60. With only 15 seconds of training speech we demonstrate performance of 97% using a grammar. http://ieeexplore.ieee.org/xpl/freeabs_all.jsp?arnumber=1169748 Audio-visual modeling for bimodal speech recognition:- Audio-visual speech recognition is a novel extension of acoustic speech recognition and has received a lot of attention in the last few decades. The main motivation behind bimodal speech recognition is the bimodal characteristics of speech perception and production...

Words: 2115 - Pages: 9

Free Essay

Spech Recog

...robust feature extraction techniques for continuous speech recognition for Bengali Numerical digits system. The speech recognizers use a parametric form of a signal to get the most important distinguishable features of speech signal for recognition task. In this paper Linear predictive coding (LPC), Mel-frequency cepstral coefficients (MFCC), Perceptual linear prediction coefficients (PLP) along with a hybrid feature Bark Frequency Cepstral Coefficients (BFCC) is used for language Identification. Bark Frequency Cepstral Coefficients (BFCC) and Revised Perceptual Linear Prediction Coefficients (RPLP) were obtained from combination of MFCC and PLP. Two different classifiers, Vector Quantization (VQ) with Dynamic Time Warping (DTW) and Gaussian Mixture Model (GMM) were used for classification. The experiment shows better identification rate using hybrid feature extraction techniques compared to conventional feature extraction methods. BFCC has shown better performance than MFCC with both classifiers. RPLP along with GMM has shown best identification performance among all feature extraction techniques. Key words—Linear Predictive Coding(LPC), Perceptual Linear Prediction(PLP), Revised Perceptual Linear Prediction(RPLP), Bark Frequency Cepstral Coefficient (BFCC), Mel Frequency Cepstral Coefficient(MFCC), Vector Quantization(VQ), Gaussian Mixture Model(GMM), Dynamic Time Warping (DTW), Hidden Markov Model(HMM). Introduction:- Speech is the predominant mode of human communication....

Words: 1009 - Pages: 5

Premium Essay

Aims and Objectives of Research Project

... | Table of Contents Aims and Objectives: 2 Review of Current State of Proposed Area: 4 Major Milestones and Deliverables: 5 Scientific Risk Analysis: 7 Resources Needed and Architecture: 8 Architecture 9 Ethical, Legal, Professional Issues and Academic Misconduct: 10 Code of Ethics: 11 Bibliography 11 Aims and Objectives: The main of this study is to develop software that will allow voice command to be converted into text and then to be displayed on the output monitor or to be converted into command to perform a particular action, the primary aim will be to perform adequate amounts of research in the field of voice recognition in order to develop such tool that can be used to convert speech to text and voice to text command for a particular system to perform a task. The main objectives to achieve this will be to have a clear development plan with a clear software development cycle, having the software development cycle will not be enough on its own and constant monitoring of the development will be very critical objective in order to achieve the aim. The first aim will be to perform enough research in order to decide if there is a scope with in such development, there will be few objectives with in this some of the objectives will include performing thorough search to see if there is any demand with in such type of software, as well its future aspects to...

Words: 3015 - Pages: 13

Free Essay

Sr Corporation

...SR Corp was formed in 1986 to develop and commercialize advanced speech recognition technology. The company's mission was to deploy a new generation of speech transaction technologies, products, and systems that could be easily integrated into telephone and computer networks. The company's goal was to become the leader in a new realm of human communications. SR Corp was financed by private investors. For the past eight years the company focused on developing its core technologies of large-scale speech recognition systems. SR Corp based their solution requirements on feedback from large companies in their target market segment. The solutions developed by SR Corp are targeted at three distinct niches within the telephony segment of the speech recognition market: • Fortune 500 corporations • Telephone companies • Telephone switch OEM's Each niche market provides both opportunities and risks. SR Corp products have shown to be much further advanced then the leading market research firms and industry experts expected at this point in time. The company also had several other advantages including: • Seven US and foreign patents with other pending that will be in force into the next century • SR Corp technology is different then AT&T and other larger competitors • The product and solutions are distinguished by o Speaker-independent with continuous speech recognition. Internal testing proved the solution had a 98 to 99 percent accuracy...

Words: 2245 - Pages: 9

Free Essay

A Voice Guidance System for Autonomous Robot

...A VOICE GUIDANCE SYSTEM FOR AUTONOMOUS ROBOT Neha Dingwani Email: nehadingwani3@gmail.com Pranali Sonawane Email: pranalis93@gmail.com Sanjivani Yesade Email: Sanjivani.yasade@gmail.com Vishal Motwani Email: rvmotwani960@gmail.com ABSTRACT In this paper, a voice guidance system for autonomous robots is proposed as a project based on microcontroller. The proposed system consists of a microcontroller and voice recognition software that can recognize a limited number of voice patterns. The commands of autonomous robots are classified and are organized such that one voice recognition software can distinguish robot commands under each directory. Thus, the proposed system can distinguish more voice commands than one voice recognition processor can. I. ------------------------------------------------- INTRODUCTION This Project Describe a robot that can be operated by voice commands given from user. The project use speech recognition system for giving and processing voice commands Speech recognition, or speech-to-text, involves capturing and digitizing the sound waves, converting them to basic language units or phonemes. It is the ability of a computer to recognize general, naturally flowing voice from a wide variety of users. The robot will receive commands from user and do the actions like left, right, back, front etc. The robot will detect the obstacles, fire and gas using sensor and do the work like if robot detect obstacle it moves in different direction, if...

Words: 1797 - Pages: 8

Premium Essay

Discussion- Small Business It

...Discussion: Unlike large organizations, small organizations have been less active in integrating information technologies into their business operations. For example, some of the larger airliners use online information technologies to allow passengers to make reservation, buy a ticket, reserve a seat, check in, and even print their boarding passes online before they get to the airport. * Using the airlines example mentioned above, propose several possible IT solutions and how they would benefit a smaller airline to become more successful or attract more clients. * Tell us if the availability of information technology services has influenced your decision to travel on a particular airline. What airline was it? Response: When thinking about IT concepts that might benefit smaller airlines, a few ideas come to mind. Enterprise collaborative systems, this would allow better communication with employees which would in turn, increase production. When a customer is in need of assistance and the employee is unable to provide a response, instead of trying to contact one person at a time they could broadcast the issue to several employees which would provide multiple angles of aid. Also if a manager needs to relay a message to several employees for example weather delays he could easily accomplish this using an enterprise collaborative system. MIS (management information systems) which provides data to managers to help them make decisions would also benefit smaller airlines. It...

Words: 2725 - Pages: 11

Free Essay

Communication/Information Technology Paper

...he or she is needed or the administrative staff would rely on emails when communicating throughout the company. In researching voice recognition, this paper will include how this system affects communication in health care, the advantages and disadvantages of using the system, how efficient and effective communication is with this system, and what is the short and long term financial impact of the organization. Voice recognition is an electronic system in which the voice of a human is recognized by a machine such as a computer. In using the speech recognition systems, the system is pre-programmed with stored template words with each input of speaking is compared and the closest word or phoneme is given out. In using the voice system in health care, communication can be less complicated. When considering the use of handwriting in health care reading files or paperwork a doctor signed off on can be a puzzle in figuring out what was written. Handwriting documents gives an immediate access to a record, using the handwriting system documentation is not as comprehensive as a dictated note. Using voice recognition in communication ensures the doctor prompt and accurate documents. Voice recognition in healthcare is steadily improving will give a significant boost to the goal of 100 percent of all patient health records electronic. The voice recognition system is thought by many to be a new key technology to professional health care workers. This system has been identified as having an...

Words: 1009 - Pages: 5

Premium Essay

Sina Project

...Differentiation and Positioning Competitive Review Positioning Statement SWOT Analysis Distribution Channel Marketing Mix (4 P’s) Product Price 1. Budget Place Promotion Executive Summary DMCP, Inc. is preparing to launch a new product into the electronic technology product market called Sina. Our product offers a variety of voice recognition languages including accents which enables consumers to use the product without mobility. We are targeting specific household families with children from the ages of thirteen and up living in suburban areas in the United States. The primary marketing objective for DMCP, Inc. is to reach first-year U.S. sales of 6 million unites. DMCP, Inc. targets to achieve the financial objectives of first-year sales income of $35 million, and a gross profit of $19 million. Industry Product Overview Sina is a new product offered by DMCP, Inc. Sina is a product of a new company named DMCP, Inc. DMCP’s new consumer product is designed to disable the use of remote controls. In place of a remote control, Sina will enable your family’s voice to control the television settings. Sina is the first voice recognition software for television which includes different languages with the ability to comprehend accents. The software is exclusively designed to recognize foreign languages and accents.              Sina is in the form of a tiny black handheld box that has a cord which can be attached to the television. In addition, in the...

Words: 2986 - Pages: 12

Premium Essay

Blue Ant

...BlueAnt Bluetooth Speakerphone The BlueAnt Bluetooth Speakerphone was designed for cellphone users who excessively utilized their cellphone while driving (Shaw, 2010). With its sleek design the BlueAnt Bluetooth Speakerphone connects to the sun visor of the vehicle. Right out of the box it pairs itself with your cellphone and downloads your contacts. After pairing it is ready to be used. At this point the driver can use their voice to make and answer calls with their voice. Marketing Plan SECTION I: Executive Summary The BlueAnt Bluetooth speakerphone is a high-tech speakerphone designed to provide drivers who need to use their cellphones a safe way to do so. This speakerphone allows its users to do speech to text, answer and make phones calls through voice recognition. This device will appeal to those drivers who use their cellphones while operating their vehicle. It allows them to use their cellphone and still focus on the road. SECTION II: Situation...

Words: 1790 - Pages: 8