Upload
phamthien
View
219
Download
3
Embed Size (px)
Citation preview
EUROPEAN SPEECH COMMUNICATION ASSOCIATION (ESCA)
5th EUROPEAN CONFERENCE ON SPEECH COMMUNICATION AND TECHNOLOGY
S2I2 SIse p t e m b e n&l 99,7i
UNDER THE AUSPICES OF
THE MINISTRY OF CULTURETHE MINISTRY OF THE AEGEAN
THE GENERAL SECRETARIAT OF SCIENCE AND RESEARCH
PROCEEDINGSVOLUME 1
ORGANIZER:UNIVERSITY OF PATRASWIRE COMMUNICATIONS LABORATORY261 10 Rion - Patras - Greece
KE¥NOYE SPEECHES !
Keynote Speech 1: Is Syntactic Structure ProsodicallyRecoverable? KN-1Speaker: Mario Rossi, Institutde Phonetique d'Aix-en-Provence, Laboratoire Parole et Langage, FranceChair: Louis Pols, ESCA, Univ. of Amsterdam, TheNetherlands
Keynote Speech 2: Conversational Interfaces: Advancesand Challenges KN-9Speaker: Victor Zue, MIT, USAChair: Paul Daalsgard, Aalborg University, Denmark
Keynote Speech 3: Prosodic Modelling in Text-to-SpeechSynthesis KN-19Speaker: Jan P. H. van Sanlen, Bell Labs-LucentTechnologies, USAChair: Hiroya Fujisaki, Science Univ. of Tokyo, Japan
Keynote Speech 4: Robust Speech Recognition: Review andPerspectivesChair: Joseph Mariani, LIMSI-CNRS, France
Speech 4A (8:35-9:00): Impact of the UnknownCommunication Channel on Automatic SpeechRecognition: A Review KN-29Speaker: Jean-Claude Junqua, Panasonic Technologies Inc.,USA
Speech 4B (9:00-9:25): Statistical Techniques for RobustASR: Review and Perspectives KN-33Speaker: Jerome BeUegarda, Apple Computer, USA
Speech 4C (9:25-9:50): Using Missing Feature Theory toActively Select Features for Robust Speech Recognitionwith Interruptions, Filtering and Noise KN-37Speaker: Richard Uppmann, Lincoln Laboratory MIT, USA
Keynote Speech 5: Perspectives of Speech TechnologyResearch Highlighted in Eurospcech '97Speaker: Wolfgang Hess, Univ. of Bonn, GermanyChair: George Kokkinakis, WCL, Univ. ofPatras, Greece
SESSION: MIA.Acoustic Modelling ICliair: Roger Moore, DRA. UK
M4A.1 Using Multiple Time Scales in a Multi-StreamSpeech Recognition System 3Stephane Dupont, *Herve BourlardFPMs-TCTS, Belgium*IDIAP, Switzerland
M4A.2 Speech Recognition Using HMM-State ConfusionCharacteristics 7Wakita Yurai, Harald Singer, Yoshinori SagisakaATR Interpreting Telecommunications Res. Labs., Japan
M4A3 Bottom-up and Top-down State Clustering forRobust Acoustic Modeling 11Cristina Chesta, Pietro Laface, *Franco RaveraPolitecnico di Torino, Italy*Centro Studi e Laboratori Telecomunicazioni (CSELT), Italy
M4A.4 Comparison of Optimization Methods forDiscriminative Trainining Criteria 15Ralf Schluter, W. Macherey, S. Kanthak, Hermann Ney, LutzWellingRWTH Aachen, Germany
M4A.5 Clustering Beyond Phoneme Contexts for SpeechRecognition 19Clark Z. Lee, Douglas O'ShaughnessyINRS Telecommunications, Canada
M4A.6 Influence of Outliers in Training the ParametricTrajectory Models for Speech Recognition 23Rathinavelu ChengalvarayanBell Labs-Lucent Technologies, USA
SESSION: M4BDynamic Ar^cuiatory MeasurementsCltair; Rene Cam* £NST, France
M4B.1 Adaptation of Natural Articulatory Movementsto the Control of the Command Parameters of a ProductionModel 27Laurence Candille, Henri MeloniCERI, France
M4B.2 Three-Dimensional Coarticulatory Strategies ofTongue Movement 31Maureen Stone, *Andrew Lundberg, *Edward Davis, RaoGullapalli, Moriel NessAiverUniv. of Maryland, USA*John Hopkins Univ., USA
M4B.3 From Laryngographic and Acoustic Signals toVoicing Gestures 35Nathalie Parlangeau, Regine Andre-ObrechtIRTT - Equipe IHMPT, France
M4B.4 Ultrasonographic Measurement of CricothyroidSpace in Speech 39Erkki Vilkman, Raija Takalo, Maatta Taisto, *Anne-MariaLaukkanen, Jaana Nummenranta, Lipponen TeroUniv. ofOulu, Finland*Univ. of Tampere, Finland
M4B.5 Coarticulation and Articulatory CompensationsStudied by Dynamic MRI 43Didier Demolin, M. George, V. Lecuit, *T. Metens, /JA.Soquet, tH. RaeymaekersUniv. Libre de Bruxelles, Belgium^Magnetic Resonance Unit.Hopital ErasmetPhilips Medical Systems, BelgiumtCollaborateur Scientifique-FNRS
SESSION! U4CLanguage IdentificationChair: MarcZisstmtn, MIT Lincoln Laboratory, USA
M4C.1 Predicting, Diagnosing and Improving AutomaticLanguage Identification Performance 51Marc A. ZissmanMIT Lincoln Laboratory, USA
M4C.2 Language Identification with Language-Independent Acoustic Models 55Cristobal Corredor-Ardoy, Jean Luc Gauvain, Marline Adda-Decker, Lori LamelLIMSI, France
M4C.3 Bayesian Methods for Language Verification 59Eluned S. Parris, Harvey Lloyd-Thomas, Michael Carey, *JerryH. WrightEnsigma Ltd, UK*Bristol Univ., UK.
M4C.4 Use of Recurrent Network for Unknown LanguageRejection in Language Identification System 63Hingkeung Kwan, Keikichi HiroseUniv. of Tokyo, Japan
M4C.5 Language-Identification Based on Cross-LanguageAcoustic Models and Optimized Information CombinationOve Andersen, Paul Dalsgaard 67Aalborg Univ., Denmark
M4C.6 Phonetic-Context Mapping in LanguageIdentification 71Jiri Navratil, Werner ZuehlkeTechnische Univ. Ilmenau, Germany
SI'S'Sinv M-H)Neural Network*, for Speech and LanguageProcessingChair Wiil/yang He\s, Univ. of Hewn, Germany
M4D.1 Discriminative Feature and Model Design forAutomatic Speech Recognition 75Mazin Rahim, Yoshua Bengio, Yann LeCunAT&T Labs-Research, USA
M4D.2 Large Vocabulary Speech Recognition with ContextDependent MMI-Connectionist/HMM Systems Using theWSJ Database 79Jorg Rottland, Christoph Neukirchen, Daniel Willett, GerhardRigollGerhard-Mercator-Univ. Duisburg, Germany
M4D J Automatic Selection of Segmental AcousticParameters by Means of Neural-Fuzzy Networks forReordering in N-best HMM Hypotheses 83Thierry Moudenc, Guy MercierFrance Telecom, France
M4D.5 A Connectionist Approach to Machine TranslationAsuncion Castano, *Francisco Casacuberta 91Universitat Jaume I, Spain*Univ. Politecnica de Valencia, Spain
M4D.6 Continuous Speech Recognition Using a ContextSensitive ANN and HMM2s 95Nicolas Pican, Jean-Francois Mari, Dominique FohrCRIN-CNRS & INRIA Lorraine, France
SESSION: MAATraining Techniques, Efficient Decoding in ASRCJiair: Jerome Belkgarda, Apple Computer, USA
MAA.1 Acoustic Modeling Based on the MDL Principle forSpeech Recognition 99Koichi Shinoda, Takao WatanabeNEC Corporation, Japan
MAA.2 Discriminative Utterance Verification UsingMultiple Confidence Measures 103Piyush Modi, Mazin RahimAT&T, USA
MAA.3 Subspace Distribution Clustering for ContinuousObservation Density Hidden Markov Models 107Enrico Bocchieri, Brian MakAT&T Labs-Research, USA
MAA.4 A Comparative Study of Methods for PhoneticDecision-Tree State Clustering I l lHJ. Nock, MJ.F. Gales, Steve YoungCambridge Univ., UK
MAA.5 Comparing Gaussian and Polynomial Classifica-tion in SCHMM-Based Recognition Systems 115Alfred Kaltenmeier, Jurgen FrankeDaimler BenzAG, Germany
MAA.6 Maximum Likelihood Successive State SplittingAlgorithm for Tied-Mixture HMNET 119AlexAndre Girardi, Harald Singer, Kiyohiro Shikano, SatoshiNakamuraNara Institute of Science and Technology, Japan
MAA.7 String-Level MCE for Continuous PhonemeRecognition 123Erik McDermott, Shigeni KatagiriATR Interpreting Telecommunications Res. Labs., Japan
MAA.8 HMM State Clustering Across Allophone ClassBoundaries 127Ze'ev Rivlin, Ananth Sankar, Harry BrattSRI International, USA
MAA.9 Weighted Determinization and Mini miration forLarge Vocabulary Speech Recognition 131Mehryar Mohri, Michael RileyAT&T Labs-Research, USA
MAA.10 Parallel Speech Recognition 135Steven Phillips, Anne RogersAT&T Labs-Research, USA
MAA.ll Fast Likelihood Computation Methods forContinuous Mixture Densities in Large Vocabulary SpeechRecognition 139Stefan Ortmanns, Thorsten Firzlaff, Hermann NeyRWTH Aachen Univ. of Technology, Germany
MAA.12 A Static Lexicon Network Representation forCross-Word Context Dependent Phones 143Kris Derouynck, Jacques Duchateau, Dirk Van CompernolleKU Leuven-ESAT, Belgium
MAA.13 Decision-Tree Based Quantization of the FeatureSpace of a Speech Recognizer 147Mukund Padmanabhan, L.R. Bahl, D. Nahamoo, De Sou2aPeterIBM, USA
MAA.14 Sub-Vector Clustering to Improve the Memoryand Speed Performance of the Acoustic LikelihoodComputation 151Mosur Ravishankar, *R. Bisiani, E. ThayerCarnegie Mellon Univ., USA*Univ. of Milan, Italy
MAA.15 The Incorporation of Path Merging in a DynamicNetwork Recogniser 155Simon HovellBT Laboratories, UK
MAA.16 Improvement on Connected Digits RecognitionUsing Duration Constraints in the Asynchronous DecodingScheme 159Miroslav NovakIBM, USA
MAA.17 Explicit Word Error Minimization in N-Best ListRescoring 163Andreas Stolcke, Yochai Konig, Mitchel WeintraubSRI International, USA
MAA.18 Efficient 2-Pass N-Best Decoder 167Long Nguyen, Richard SchwartzBBN Systems and Technologies, USA
MAA.19 A Memory Management Method for a LargeWord Network 171Tomohiro Iwasaki, Yoshiharu AbeMitsubishi Electric Corporation, Japan
SESSION: MAB -ProsodyCiwir: Nick Campbell, ATRt Japan
MAB.l Persistence of Prosodic Features Between Dialectaland Standard Italian Utterances in Six Sub-Varieties of aRegion of Southern Italy (Salento): First Assessments ofthe Results of a Recognition Test and an InstrumentalAnalysis 175Antonio RomanoUniv. Stendhal, France
MAB.2 Improving the Phonetic Annotation by Means ofProsodic Phrasing 179Halewijn Vereecken, *Annemie Vorstermans, Jean-PierreMartens, *Bert Van CoileUniv. of Gent, Belgium*L&H, Belgium
MAB.3 A Descriptive Study of Prosodic Phenomena inMPUR (West Papuan Phylum) 183Cecilia OdeLeiden Univ., The Netherlands
MAB.4 Automated Quantitative Analysis of Fo Contours ofUtterances From a German ToBI-Labeled SpeechDatabase 187HansJorg Mixdorff, *Hiroya FujisakiTU Dresden, Germany*Science Univ. of Tokyo, Japan
MAB.5 Identification and Automatic Generation ofProsodic Contours for Text-to-Speech Synthesis System inFrench 191De Tournemire StephanieFrance Telecom, France
MAB.6 Quantitative Analysis and Formulation of ToneConcatenation in Chinese FO Contours 195Jin-Fu Ni, Ren-Hua Wang, *Keikichi HiroseUniv. of Science and Technology of China, ROChina*Univ. of Tokyo, Japan
MAB.7 An Environment for the Labelling and Testing ofMelodic Aspects of Speech 199Christel Brindopke, Arno Pahde, Franz Kummert, GerhardSagererUniv. of Bielefeld, Germany
MAB.8 PROPAUSE: A Syntactico-Prosodic SystemDesigned to Assign Pauses 203David Casacuberta, Lourdes Aguilar, Rafael MarinUniversitat Autonoma de Barcelona, Spain
MAB.9 Integrated Dialog Act Segmentation andClassification Using Prosodic Features and LanguageModels 207Volker Warnke, *Ralf Kompe, Heinrich Niemann, Elmar NoethUniv. ofErlangen-NUremburg, Germany*Sony International (Europe), Germany
MAB.10 Evaluation of Prosodic Characteristics hi RetoldStories hi Dutch by Means of Semantic Scales 211Monique E. van Donzel, Florien J. Koopmans-van BeinumUniv. of Amsterdam, The Netherlands
MAB.ll Text-to-intonation in Spontaneous Swedish 215Gosta Bruce, Marcus Filipsson, Johan Frid, *Bjorn Granstr5m,*Kjell Gustafson, Merle Home, David HouseLund Univ., Sweden*KTH, Sweden
MAB.12 Synthesing Attitudes with Global Rhythmic andIntonation Contours 219Yann Morlec, Gerard Bailly, Veronique AubergeInstitut de la Communication Parlee, France
MAB.I3 Prosody-Particle Pairs as Discourse ControlSigns 223Dafydd Gibbon, Claudia SassenUniv. of Bielefeld, Germany
MAB.14 Focus Detection with Additional Information ofPhrase Boundaries and Sentence Mode 227Anja EisnerUniv. of Bonn,Germany
MAB.15 The Role of Prosody in Infants' Native-LanguageDiscrimination Abilities: The Case of Two PhonologicallyClose Languages 231Laura Bosch, Nuria Sebastian-GallesUniversitat de Barcelona, Spain
MAB.16 Prosodic Cycles and Interpersonal Synchrony inAmerican English and Swedish 235Eugene H. Buder, *Anders ErikssonUniv. of Memphis, USA*Ume& Univ., Sweden
MAB.17 Relating Prosody to Syntax: Boundary Signallingin Swedish 239Eva StrangertUmea Univ., Sweden
MAB.18 On Representation of Fundamental Frequency ofSpeech for Prosody Analysis Using Reliability FunctionMitsuru Nakai, Hiroshi Shimodaira 243Japan Advanced Institute of Science and Technology, Japan
MAB.19 Efficient Method of Establishing Words ToneDictionary for Korean TTS System 247Seong-hwan Kim, Jin-young KimChonnam National Univ., South Korea
MAB.20 Perception of Questions and Statements inNeapolitan Italian 251D'Imperio Mariapaola *David HouseOhio State University, USA*Lund Univ., Sweden
SESSION: Tl AKeyword and Topic SpottingChair: Joseph Mariani, LtMSl-CNRS, France
T1A.1 Key-Phrase Spotting Using An Integrated Lan-guage Model of N-Grams and Finite-State Grammar.... 255Lin Qiguang, Dave Lubensky, Michael Picheny, P. SrinivasaRaoIBM, USA
T1A.2 Efficient Methods for Detecting Keywords inContinuous Speech 259Jochen Junkawitsch, *Gunther Ruske, Harald HogeSiemens AG, Germany*Munich Univ.of Technology, Germany
T1A3 Providing Sublexical Constraints for Word SpottingWithin the ANGIE Framework 263Raymond Lau, Stephanie SeneffMIT Laboratory for Computer Science, USA
T1A.4 Usefulness of Phonetic Parameters in a RejectionProcedure of an IIMM Based Speech Recognition SystemKatarina Bartkova, Denis Jouvet 267France Telecom, France
T1A.5 Keyword Spotting Using FO Contour MatchingYoichi Yamashita, *Riichiro Mizoguchi 271Ritsumeikan Univ., Japan*Osaka Univ., Japan
T1A.6 A Frame and Segment Based Approach for TopicSpotting 275Elmar Noeth, Stefan Harbeck, Heinrich Niemann, VolkerWamkeUniv. of Erlangen-Nilremburg, Germany
SESSION; TIBRobustness in Recognition and Signal Processing ICtuiir: Helmut Mangold, Daimler Benz, Germany
Tlli.l Cyclic Autocorrelation-Based Linear PredictionAnalysis of Speech 279K.K. Paliwal, Yoshinori SagisakaATR Interpreting Telecommunications Res. Labs., Japan
TVBJ2 Novel FUler Acoustic Models for Connected DigitRecognition 283Ilija Zeljkovic, Shrikanth NarayananAT&T Labs-Research, USA
T1B3 A Non-Iterative Model-Adaptive E-CMN/PMCApproach for Speech Recognition in Car EnvironmentsMakoto Shozakai, Satoshi Nakamura, Kiyohiro ShikanoNara Institute of Science and Technology, Japan 287
T1B.4 Discriminative Feature Extraction for SpeechRecognition in Noise 291la Torre Angel de, Antonio M. Peinado, Antonio J. Rubio,Pedro GarciaUniversidad de Granada, Spain
T1B.5 Noise Robust Recognition Using Feature SelectiveModeling 295Michael K. Brendborg, Borge LindbergAalborg Univ., Denmark
T1B.6 Mixture Input Transformations for Adaptation ofHybrid Connectionist Speech Recognizers 299Victor AbrashSRI International, USA
SESSION: TICModelling of ProsodyOtair: Hiroya Fujisaki, Science Univ. of Tokyo, Japan
T1C.1 Metrical Representation of Demarcation andConstituency in Long Noun Phrases 303Christos Malliopoulos, *George MikrosNational Technical Univ. of Athens, GreeceVLSP, Greece
T1C.2 A System of Stylized Intonation Contours forGerman 307Hannes Pirker, *Kai Alter, Erhard Rank, John Matiasek, HaraldTrost, tGermnot KubinAustrian Research Institute for Artificial Intelligence (OFAI),Austria*Max-Planck-Institute of Cognitive Neuroscience, GermanyfVienna Univ. of Technology, Austria
T1C.3 A Method of Representing Fundamental FrequencyContours of Japanese Using Statistical Models of MoraicTransition 311Keikichi Hirose, Kouji IwanoUniv. of Tokyo, Japan
T1C.4 Modeling Arbitrarily Long Sentence-Spanning F0Contours by Parametric Concatenation of Word-SpanningPatterns 315Stavroula-Evita Fotinea, *Michael Vlahakis, *GeorgeCarayannisNational Technical Univ. of Athens, Greece*ILSP, Greece
T1C.5 Strong Interaction Between Factors InfluencingConsonant Duration 319R. J. J. H. van Son, *Jan P.H. van SantenUniv. of Amsterdam, The Netherlands*Bell Labs-Lucent Technologies, USA
T1C.6 Speech Tuning in Slovenian TTS 323Jerneja Gros, Nikola Pavesic, France MihelicUniv. of Ljubljana, Slovenia
SESSION: T1X> 'Microphone Arrays for Speech IMianecraetttCJtair: Alan Bradley, RMTT, Australia
T1D.1 Small Microphone Arrays with OptimizedDirectivity for Speech Enhancement 327Matthias DorbeckerAachen Univ. of Technology, Germany
T1D.2 Microphone Array Design Measures for Hands-FreeSpeech Recognition 331Masaaki Inoue, Satoshi Nakamura, Takeshi Yamada, KiyohiroShikanoNam Institute of Science and Technology, Japan
T1D.3 Noise Reduction by Paired Microphones 335Masato Akagi, Mitsunori MizumachiJapan Advanced Institute of Science and Technology, Japan
T1D.4 A Microphone Array for Speech EnhancementUsing Multiresolution Wavelet Transform 339Djamila MahmoudiLTS-DE, EPFL, Switzerland
T1D.5 A Two-Channel Adaptive Microphone Array withTarget Tracking 343Yoshifumi Nagata, Hiroyuki TsuboiToshiba Corporation, Japan
T1D.6 Use of Different Microphone Array Configurationsfor Hands-Free Speech Recognition in Noisy andReverberant Environment 347Diego Giuliani, Marco Matassoni, Maurizio Omologo,Piergiorgio SvaizerIstituto per la Ricerca Scientifica e Tecnologica (IRST), Italy
SBSSION: T2AMultilingual RecognitionChair: Ridtard Uppmatt, MIT Lincoln Lah, USA
T2A.1 YINHE: A Mandarin Chinese Version of the GalaxySystem 351Chao Wang, James R. Glass, Helen Meng, Joe Polifroni,Stephanie Seneff, Victor ZueMIT Laboratory for Computer Science, USA
T2A.2 Multilingual Speech Recognition for FlexibleVocabularies 355Patrizia Bonaventura, *Filippo Gallocchio, tGiorgio MiccaCentro Studi e Laboratori Telecomunicazioni (CSELT),Consultant, Italyf Centro Studi e Laboratori Telecomunicazioni (CSELT),Turin, Italy*Univ.di Padova, Italy
T2A3 A Study of Multilingual Speech Recognition 359Fuliang Weng, Harry Bratt, Leonardo Neumeyer, AndreasStolckeSRI International, USA
T2A.4 Multilingual Speech Recognition: The 1996 ByblosCallhomc System 363Jayadev Billa, Kristine Ma, John W. McDonough, GeorgeZavaliagkos, David R. Miller, Kenneth N. Ross, Amro El-JaroudiBBN Systems and Technologies, USA
T2A.5 Japanese LVCSR on the Spontaneous SchedulingTask with JANUS-3 367Tanja Schultz, Detlef Koll, Alex WaibelUniv. Karlsruhe, Germany
T2A.6 Fast Bootstrapping of LVCSR Systems withMultilingual Phoneme Sets 371Tanja Schultz, Alex WaibelUniv. Karlsruhe, Germany
SESSION: T2BLanguage Specific Speech AnalysisChair: John J. Ohala, Umv uf California, USA
T2B.1 Factors of Variation in the Production of theGerman Dorsal Fricative 375Bemd Pompino-Marschall, Christine MooshammerCenter for General Linguistics, Berlin, Germany
T2B.2 EPG and Aerodynamic Evidence for the Copro-duction and Coarticulation of Clicks in ISIZULU 379Kimberly ThomasUniv. of California, USA
T2B3 Formant Trajectory Dynamics in SwabianDiphthongs 383Anja GeumannMuenchen Univ., Germany
T2B.4 The Gestural Organization of Vowels andConsonants: A Cinefluorographic Study of ArticulatorGestures in Greenlandic 387Sidney A.J. WoodUniv. of Lund, Sweden
T2B.5 The Perception of Coronals in Western ArrernteVictoria B. Anderson 389Univ. of California, USA
T2B.6 Acoustic Modelling of American English M 393Carol Y. Espy-Wilson, *Shrikanth Narayanan, Suzanne E.Boyce, fAbeer AlwanBoston Univ., USA*AT&T Labs-Research, USAfUniv. of California, USA
SESSION! T2CFeature Estimation 1Chair: Paul Daalsgard, Aalborg University, Denmark
T2C.1 Acoustic Parameters Optimised for Recognition ofPhonetic Features 397Anya Vamich HansenAalborg Univ., Denmark
T2C.2 Heterogeneous Acoustic Measurements for PhoneticClassification 401Andrew K. Halberstadt, James R. GlassMIT, USA
T2C3 Cepstral-Time Matrices and LDA for ImprovedConnected Digit and Sub-Word Recognition AccuracyBenMilner 405BT Laboratories, UK
T2C.4 Data-Driven Design of Rasta-Like Filters 409Van Vuuren Sarel, / *Hynek HermanskyOregon Graduate Institute of Science and Technology, USAinternational Computer Science Institute, USA
T2C.5 Evaluating Feature Set Performance Using the F-Ratio and J-Measures 413Simon Nicholson, *Ben Milner, Stephen CoxUEA, UK*BT Laboratories, UK
T2C6 Robust Speech Parameters Located in theFrequency Domain 417Javier Hernando, Climent NadeuUniversitdt Politecnica de Catalunya, Spain
SESSION: T2DSpeech Coding IChair: fcabel Trancoso, tNESC, Portugal
T2D.1 A Simple and Efficient Algorithm for theCompression of MBROLA Segment Databases 421van der Vrecken Olivier, Nicolas Pierret, Thierry Dutoit,Vincent Pagel, Fabrice MalfrereTCTSLab, Belgium
T2D.2 A Segmental Formant Vocoder Based On LinearlyVarying Mixtures of Gaussians 425Parham Zolfaghari, Tony A. RobinsonCambridge Univ., UK
T2D.3 Voice Mimic System Using Articulatory Codebookfor Estimation of Vocal Tract Shape 429Samir Chennoukh, Daniel Sinder, Gael Richard, JamesFlanaganCAIP Center, Rutgers Univ., USA
T2D.4 Adaptive Transform Coding for Linear PredictiveResidual 433Damith J. Mudugamuwa, Alan B. BradleyRMTT, Australia
T2D.5 Performance Evaluation of Objective QualityMeasures for Coded Speech 437AkiraTakahashi, Nobuhiko Kitawaki, *Paolino Usai, t DavidAtkinsonNTT, Human Interface Labs, Japan*Centro Studi e Laboratori Telecomunicazioni (CSELT), ItalytNTIA, USA
T2D.6 Between Recognition and Synthesis - 300 Bits/Second Speech Coding 441Mohamed Ismail, Keith PontingSpeech Research Unit, UK
SESSION: TMAFeature Estimation II, Pitch and ProsodyCtutir: Egidia Giachin, Centro Studi e IxtboraioriTelecomunicazioni (CSELT), Italy
TMA.1 A Modified Zero-Crossing Method for PitchDetection in Presence of Interfering Sources 445Francois Gaillard, Frederic Berthommier, *Gang Feng, •Jean-Luc Schwartz1CP/1NPG, France*Institut de la Communication Parlee, France
TMA.2 Using Simulated Annealing ExpectationMaximization Algorithm for Hidden Markov ModelParameters Estimation 449Jacques Simonin, Chafic MokbelFrance Telecom, France
TMA3 Covariation of Subglottal Pressure, F0 and GlottalParameters 453Gunnar Fant, "Stellan Hertegard, Anita Kruckenberg, JohanIiljencrantsKTH, Sweden*Karolinska Hospital and Huddinge Univ. Hospital, Sweden
TMA.4 The Fractal Behaviour of Unvoiced Plosives: AMeans for Classification 457Anastasios Delopoulos, Maria RangoussiNTUA, Greece
TMA.5 A Method for Analysis of the Local Speech RateUsing an Inventory of Reference Units 461Sumio Ohno, Hiroya Fujisaki, Hideyuki TaguchiScience Univ. of Tokyo, Japan
TMA.6 Analysis and Modeling of Fundamental FrequencyContours of Greek Utterances 465Hiroya Fujisaki, Sumio Ohno, Takashi YagiScience Univ. of Tokyo, Japan
TMA.7 Characteristics of Slow, Average and Fast Speechand their Effects in Large Vocabulary Continuous SpeechRecognition 469Fernando Martinez, Daniel Tapias, Jorge Alvarez, Paloma LeonTelefonica I+D, Spain
TMA.8 Analysis of Children's Speech: Duration, Pitch, andFormants 473Sungbok Lee, Alexandra Potamianos, Shrikanth NarayananAT&T Labs-Research, USA
TMA.9 A Method of Measuring Formant Frequencies atHigh Fundamental Frequencies 477HarimutTraunmuller, *Anders ErikssonStockholm Univ., Sweden*Univ. ofUmea, Sweden
TMA.10 Analysis of Speaking Rate Variation in Stress-Timed Languages 481Tom Brondsted, Jens Printz MadsenAalborg Univ., Denmark
TMA.ll Automatic Identification of the PhonemeBoundaries Using a Mixed Parameter Model 485Paul Micallef, T e d ChiltonUniv. of Malta, Malta*Univ. of Surrey, UK
TMA.12 Pitch Detection Reliability Assessment forForensic Applications 489Sergey Koval, Veronika Bekasova, Michael Khitrov, AndreyRaevSt Petersburg State Univ., Russia
TMA.13 Efficient Estimation of Perceptual Features forSpeech Recognition 493Zhihong Hu, Etienne BarnardOregon Graduate Institute, USA
TMA.14 Towards Decomposing the Sources of Variabilityin Speech 497Narendranath Malayath, Hynek Hermansky, Alexander KainOregon Graduate Institute of Science and Technology, USA
TMA.15 Use of Vector-Valued Dynamic WeightingCoefficients for Speech Recognition: Maximum LikelihoodApproach 501Rathinavelu ChengalvarayanBell Labs-Lucent Technologies, USA
TMA.16 Automatic Segmentation: Data-Driven Units ofSpeech 505S.W. Beet, L. Baghai-RavaryAculab pic, UK
TMA.17 On Robust Time-Varying AR Speech AnalysisBased on T-Distribution 509Dejan BajicInstitute of Applied Mathematics and Electronics, Yugoslavia
TMA.18 A Simple Phoneme Energy Model for the GreekLanguage and its Application to Speech Recognition 513Dimhris Tambakas, Diana Tzima, Nikos Fakotakis, GeorgeKokkinakisUniv. ofPatras, Greece
TMA.19 A Macroscopic Analysis of an Emotional SpeechCorpus 517James E.H. Noad, Sandra P. Whiteside, Phil GreenUniv. of Sheffield, UK
TMA.20 Restoration of Pitch Pattern of Speech Based on aPitch Generation Model 521Hiroshi Shimodaira, Mitsuru Nakai, Akihiro KumataJapan Advanced Institute of Science and Technology, Japan
TMA.21 The Research of Correlation Between Pitch andSkin Galvanic Reaction at Change of Human EmotionalState 525A.V. Agranovski, O.Y. Berg, D.A. LednovSpetsvuzavtomatika Design Bureau, Russia
TMA.22 K-NN Versus Gaussian in HMM-BasedRecognition System 529Claude Montacie, Marie-Jose Caraty, Fabrice LefevreUniv. Pierre et Marie Curie - CNRS, France
TMA.23 Spectral Methods for Voice Source ParametersEstimation 533Boris Doval, Christophe D'Alessandro, Benoit DiardLIMSI-CNRS, France