
Synopsis - GSSSIT


Wireless monitoring for industrial automation using speech recognition


SYNOPSIS

The main aim of the project is to design and develop a computer-based interactive system that uses speech in a real-time environment to monitor power stations and to control them.

Speaker recognition is the process of automatically recognizing who is speaking on the basis of individual information included in speech waves. Speaker identity is correlated with the physiological and behavioral characteristics of the speaker. These characteristics exist both in the spectral envelope and in the supra-segmental features (voice source characteristics and dynamic features spanning several segments).

A spectral envelope of the speech signal reconstructed from a truncated set of cepstral coefficients (MFCC, Mel-frequency cepstral coefficients) is much smoother than one reconstructed from LPC coefficients. It therefore provides a stable representation from one repetition to another of a particular speaker's utterances. As for the regression coefficients, typically the first- and second-order coefficients are extracted at every frame period to represent the spectral dynamics. These coefficients are derivatives of the time function of the cepstral coefficients. Vector quantization is applied to the features, and lastly the Euclidean distance is computed when comparing one speech signal with another.

Here we monitor the voltage at remote locations using wireless technology. This data is displayed on the LCD and is also fed to the PC, as shown in the block diagram. When the user wants to know the voltage status, all they need to do is say the word "status". The Matlab program then identifies the speech, reads the voltage status from the microcontroller, and announces it through the speaker, so that the user learns the status without touching the PC or pressing any buttons.
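The regression coefficients mentioned in the synopsis can be illustrated with a short sketch. The Python below is an illustration under stated assumptions, not the project's Matlab code: it assumes the commonly used linear-regression formula d_t = sum over n = 1..N of n (c_{t+n} - c_{t-n}), divided by 2 times the sum of n squared, a window of N = 2 frames, and edge frames repeated at the boundaries. The function name `delta` and the toy input are hypothetical.

```python
def delta(frames, N=2):
    """Regression (delta) coefficients for a list of feature frames.

    frames: list of equal-length lists, one cepstral vector per frame.
    N: half-width of the regression window in frames (assumed value).
    """
    if not frames:
        return []
    T = len(frames)
    dim = len(frames[0])
    denom = 2 * sum(n * n for n in range(1, N + 1))
    out = []
    for t in range(T):
        row = []
        for k in range(dim):
            acc = 0.0
            for n in range(1, N + 1):
                # Repeat the first/last frame when the window runs off the edge.
                ahead = frames[min(t + n, T - 1)][k]
                behind = frames[max(t - n, 0)][k]
                acc += n * (ahead - behind)
            row.append(acc / denom)
        out.append(row)
    return out


# Toy one-dimensional "cepstral" track that rises by 1 per frame.
ramp = [[0.0], [1.0], [2.0], [3.0], [4.0]]
first_order = delta(ramp)            # slope of the track
second_order = delta(first_order)    # slope of the slope
print(first_order[2])                # [1.0] at the centre frame
```

Applying `delta` twice gives the second-order coefficients, so one routine covers both orders mentioned above.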

BLOCK DIAGRAM

Transmitter side blocks: Voltage Source, Potentiometer, ADC, Driver, Encoder, LCD, PC, Serial Interface, MAX232, Micro controller, RF TX

Receiver side blocks: Device 1, Micro controller, Decoder, Device 2, Driver Unit, RF RX
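The path from potentiometer to ADC to microcontroller to PC implies that a raw ADC count has to be scaled to a voltage before it is shown or spoken. The following sketch shows one way this might look on the PC side. Everything specific in it is an assumption that the synopsis does not state: the 10-bit resolution, the 5.0 V reference, and the `V=<count>` text frame on the serial link are hypothetical, as are the function names.

```python
def adc_to_volts(count, bits=10, vref=5.0):
    """Scale a raw ADC count to volts (resolution and reference are assumed)."""
    max_count = (1 << bits) - 1
    if not 0 <= count <= max_count:
        raise ValueError(f"count {count} outside 0..{max_count}")
    return count * vref / max_count


def parse_status_line(line):
    """Parse a hypothetical 'V=<count>' frame received over the serial link."""
    key, _, value = line.strip().partition("=")
    if key != "V" or not value.isdigit():
        raise ValueError(f"unexpected frame: {line!r}")
    return adc_to_volts(int(value))


# Example: a mid-scale reading as it might arrive from the microcontroller.
volts = parse_status_line("V=512\r\n")
print(f"Voltage status: {volts:.2f} V")
```

If the measured source exceeds the ADC reference, the potentiometer acts as a divider and the result would additionally be multiplied by the divider ratio, which is not given in the synopsis.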

BLOCK DIAGRAM OF SPEECH RECOGNITION

Blocks: Microphone I/P or Audio Signal, Speech Signal, MFCC, Vector Quantization, Add to Database, Euclidean Distance, Speaker Recognition, Recognized Speaker, Output
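The matching step named in this diagram (vector quantization followed by a Euclidean distance comparison) can be sketched as follows. This is a minimal illustration, assuming the common scheme in which each enrolled speaker is stored as a small codebook of feature vectors and the test utterance is assigned to the codebook with the lowest average distortion. The synopsis does not spell out the exact procedure, and the speaker names, codebooks, and frames here are made-up toy values rather than real MFCC data.

```python
import math


def average_distortion(frames, codebook):
    """Mean Euclidean distance from each frame to its nearest codeword."""
    total = 0.0
    for frame in frames:
        total += min(math.dist(frame, code) for code in codebook)
    return total / len(frames)


def identify_speaker(frames, codebooks):
    """Return the name whose codebook gives the lowest average distortion."""
    return min(
        codebooks,
        key=lambda name: average_distortion(frames, codebooks[name]),
    )


# Hypothetical database: one tiny 2-D codebook per enrolled speaker.
codebooks = {
    "speaker_a": [[0.0, 0.0], [1.0, 1.0]],
    "speaker_b": [[5.0, 5.0], [6.0, 6.0]],
}

# Hypothetical feature frames from a test utterance.
test_frames = [[0.1, -0.1], [0.9, 1.2]]
print(identify_speaker(test_frames, codebooks))
```

In practice the codebooks would be trained from enrollment speech (for example with a clustering algorithm) and a distortion threshold would be needed to reject unknown speakers; both are outside this sketch.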

Software Requirements:

Windows, MPLAB, Matlab, HyperTerminal software, OSHON simulation software

Hardware Requirements:

PC, microphone, RS232 cable, MAX232, encoder, decoder, transmitter, receiver, relays, devices, power supply, PCB, LCD, potentiometer

Advantages:

No live contact with the voltages. Easy to operate. Human risks are minimized. Cost effective and time efficient. Human requirements are reduced.

Applications:

Used in monitoring voltages in high transmission lines. Used in data loggers. Used in test and measurement labs. Used in R&D. Used in insulator test labs.

Literature survey:

1) Parental control based on speaker class verification

This paper appears in: Consumer Electronics, IEEE Transactions on. Issue Date: August 2008. Volume: 54, Issue: 3. On page(s): 1244 - 1251. Location: Rosemont, IL, USA. ISSN: 0098-3063. INSPEC Accession Number: 10236604. Digital Object Identifier: 10.1109/TCE.2008.4637613. Date of Current Version: 07 October 2008.

Restricting children's access to materials unsuitable for them, such as violent scenes, is very important for parents. Devices such as televisions and computers therefore have a feature named Parental Control to define the contents children can access. The parental control setting must be protected from children, which is usually done by a password.

2) A Speaker Verification System Based on EMD

This paper appears in: Genetic and Evolutionary Computing, 2009. WGEC '09. 3rd International Conference on. Issue Date: 14-17 Oct. 2009. On page(s): 553 - 556. Location: Guilin. Print ISBN: 978-0-7695-3899-0. INSPEC Accession Number: 11106236. Digital Object Identifier: 10.1109/WGEC.2009.101. Date of Current Version: 02 February 2010.

Most speech utterance feature extraction methods are based on the assumptions that the utterance signal is short-term stable and that adjacent frames are independent of each other. This approach ignores the dynamic characteristics of the speech signal.

3) Matlab-based graphical user interface development for Basic Stamp 2 microcontroller projects

This paper appears in: American Control Conference, 2004. Proceedings of the 2004. Issue Date: 30 June-2 July 2004. On page(s): 3233 - 3238 vol.4. ISSN: 0743-1619. Print ISBN: 0-7803-8335-4. INSPEC Accession Number: 8434523. Digital Object Identifier: 10.1109/ACC.2004.182784. Date of Current Version: 02 May 2005.

Basic Stamp 2 (BS2) is a popular microcontroller used both in hobby and industrial projects. Similar to other microcontrollers, the BS2 programming environment lacks graphical user interface (GUI) capability. In this paper, we present an approach to endow the BS2 microcontroller with GUI capabilities by interfacing it with Matlab and by exploiting Matlab's abundant GUI tools.

4) MIRCOS - microcontroller-based real time control system toolbox for use with Matlab/Simulink

This paper appears in: Computer Aided Control System Design, 1999. Proceedings of the 1999 IEEE International Symposium on. Issue Date: 1999. On page(s): 267 - 272. Meeting Date: 22 Aug 1999 - 27 Aug 1999. Location: Kohala Coast, HI, USA. Print ISBN: 0-7803-5500-8. References Cited: 14. INSPEC Accession Number: 6461198. Digital Object Identifier: 10.1109/CACSD.1999.808659. Date of Current Version: 06 August 2002.

This paper presents the new MIRCOS toolbox for graphical programming and real time operation of the standard 16-bit microcontroller 80C166 using Matlab and Simulink. With the toolbox presented, rapid control prototyping is possible on this widely available hardware.

5) Development of a State-of-the-Art ECG DAS for Storing, Processing and Analysis Using MATLAB-Based GUI and Microprocessor

This paper appears in: Advances in Computing, Control, & Telecommunication Technologies, 2009. ACT '09. International Conference on. Issue Date: 28-29 Dec. 2009. On page(s): 570 - 572. Location: Trivandrum, Kerala. Print ISBN: 978-1-4244-5321-4. INSPEC Accession Number: 11058382. Digital Object Identifier: 10.1109/ACT.2009.144. Date of Current Version: 12 January 2010.

This paper illustrates a low-cost novel method for ECG signal acquisition, display and storage using a graphical user interface (GUI), which provides a user-friendly front end by using MATLAB-based toolsets. The analog ECG is first converted to its digital equivalent with the help of an ADC. Microcontrollers convert this data to a serial (RS232) format and transmit it to the PC for serial acquisition. The developed GUI performs the acquisition for future processing.
