Description |
1 online resource |
Series |
Communications in computer and information science ; 328 |
Contents |
Part 1. Speaker Characterization and Recognition -- Reliability Estimation of the Speaker Verification Decisions Using Bayesian Networks to Combine Information from Multiple Speech Quality Measures / Jesús Villalba, Eduardo Lleida, Alfonso Ortega and Antonio Miguel -- On the use of Total Variability and Probabilistic Linear Discriminant Analysis for Speaker Verification on Short Utterances / Javier González Domínguez, Rubén Zazo and Joaquin González-Rodríguez -- Cepstral Trajectories in Linguistic Units for Text-Independent Speaker Recognition / Javier Franco-Pedroso, Fernando Espinoza-Cuadros and Joaquin Gonzalez-Rodriguez -- Improving the Quality of Standard GMM-Based Voice Conversion Systems by Considering Physically Motivated Linear Transformations / Tudor-Cătălin Zorilă, Daniel Erro and Inma Hernaez -- Evaluation of a New Beam-Search Formant Tracking Algorithm in Noisy Environments / Dayana Ribas González, José Enrique García Laínez, Antonio Miguel, Alfonso Ortega Gimenez and Eduardo Lleida, et al |
|
Part 2. Audio and Speech Segmentation -- On the Influence of Automatic Segmentation and Clustering in Automatic Speech Recognition / Paula Lopez-Otero, Laura Docio-Fernandez, Carmen Garcia-Mateo and Antonio Cardenal-Lopez -- Preliminary Results of Alignment of Text and Audio in News and Songs / Darwin Patricio Córdova Lucero and Doroteo Torre Toledano -- Aligning Very Long Speech Signals to Bilingual Transcriptions of Parliamentary Sessions / Germán Bordel, Mikel Penagarikano, Luis Javier Rodríguez-Fuentes and María Amparo Varona Fernández -- Factor Analysis Segmentation and Classification in Broadcast News Domain / Diego Castán, Alfonso Ortega Giménez and Eduardo Lleida -- Prosodic and Phonetic Features for Speaking Styles Classification and Detection / Arlindo Veiga, Dirce Celorico, Jorge Proença, Sara Candeias and Fernando Perdigão |
|
Part 3. Pathology Detection and Speech Characterization -- Voice Pathology Detection on the Saarbrücken Voice Database with Calibration and Fusion of Scores Using MultiFocal Toolkit / David Martínez, Eduardo Lleida, Alfonso Ortega, Antonio Miguel and Jesús Villalba -- Score Level versus Audio Level Fusion for Voice Pathology Detection on the Saarbrücken Voice Database / David Martínez, Eduardo Lleida, Alfonso Ortega and Antonio Miguel -- Using HMM to Detect Speakers with Severe Obstructive Sleep Apnoea Syndrome / Ana Montero Benavides, José Luis Blanco, Alejandra Fernández, Rubén Fernandez Pozo and Doroteo Torre Toledano, et al. -- Acoustic Analysis of European Portuguese Oral Vowels Produced by Children / Catarina Oliveira, Maria Manuel Cunha, Samuel Silva, António Teixeira and Pedro Sá-Couto -- Impact of Age in ASR for the Elderly: Preliminary Experiments in European Portuguese / Thomas Pellegrini, Isabel Trancoso, Annika Hämäläinen, António Calado and Miguel Sales Dias, et al |
|
Part 4. Dialogue and Multimodal Systems -- Mutual Information and Perplexity Based Clustering of Dialogue Information for Dynamic Adaptation of Language Models / Juan Manuel Lucas-Cuesta, Fernando Fernández-Martínez, Tirso Moreno and Javier Ferreiros -- A Multilingual SLU System Based on Semantic Decoding of Graphs of Words / Marcos Calvo, Lluís-F. Hurtado, Fernando García and Emilio Sanchís -- Merging Intention and Emotion to Develop Adaptive Dialogue Systems / Zoraida Callejas, David Griol and Ramón López-Cózar Delgado -- Language Technology for Handwritten Text Recognition / Alejandro H. Toselli, Nicolás Serrano, Adrià Giménez-Pastor, Ihab Khoury and Alfons Juan, et al. -- Character-Based Handwritten Text Recognition of Multilingual Documents / Miguel A. del Agua, Nicolás Serrano, Jorge Civera and Alfons Juan |
|
Part 5. Robustness in Automatic Speech Recognition -- A Robust Pitch Extractor Based on DTW Lines and CASA with Application in Noisy Speech Recognition / Juan A. Morales-Cordovilla, Pablo Cabañas-Molero, Antonio M. Peinado and Victoria Sánchez -- Speech Denoising Using Non-negative Matrix Factorization with Kullback-Leibler Divergence and Sparseness Constraints / Jimmy Ludeña-Choez and Ascensión Gallardo-Antolín -- MMSE Feature Reconstruction Based on an Occlusion Model for Robust ASR / José A. González, Antonio M. Peinado and Ángel M. Gómez -- Automatic Speech Recognition Based on Ultrasonic Doppler Sensing for European Portuguese / João Freitas, António Teixeira, Francisco Vaz and Miguel Sales Dias |
|
Part 6. Applications of Speech and Language Technologies -- Integrating a State-of-the-Art ASR System into the Opencast Matterhorn Platform / Juan Daniel Valor Miró, Alejandro Pérez González de Martos, Jorge Civera and Alfons Juan -- Speech Reconstruction by Sparse Linear Prediction / Ján Koloda, Antonio M. Peinado and Victoria Sánchez -- Steganographic Pulse-Based Recovery for Robust ACELP Transmission over Erasure Channels / Domingo López-Oller, Angel M. Gomez, José Luis Pérez Córdoba, Bernd Geiser and Peter Vary -- A Proposal for a Visual Speech Animation System for European Portuguese / José Serra, Manuel Ribeiro, João Freitas, Verónica Orvalho and Miguel Sales Dias -- Online Learning of Log-Linear Weights in Interactive Machine Translation / Francisco-Javier López-Salcedo, Germán Sanchis-Trilles and Francisco Casacuberta |
Summary |
This volume constitutes the refereed proceedings of the Spanish Conference, IberSPEECH 2012: Joint VII "Jornadas en Tecnología del Habla" and III Iberian SLTech Workshop, held in Madrid, Spain, on November 21-23, 2012. The 29 revised papers were carefully reviewed and selected from 80 submissions. The papers are organized in topical sections on speaker characterization and recognition; audio and speech segmentation; pathology detection and speech characterization; dialogue and multimodal systems; robustness in automatic speech recognition; and applications of speech and language technologies. |
Analysis |
computer sciences |
|
man-machine interaction |
|
computers |
|
user interfaces |
|
pattern recognition |
|
publishing |
|
language |
|
linguistics |
|
artificial intelligence |
|
computer techniques |
|
arts |
|
Information and Communication Technology (General) |
Notes |
Print version record |
Subject |
Natural language processing (Computer science) -- Congresses
|
|
Iberian language -- Data processing -- Congresses
|
|
Speech processing systems -- Congresses
|
|
Computer science.
|
|
Natural language processing (Computer science)
|
|
Speech processing systems
|
Genre/Form |
proceedings (reports)
|
|
Conference papers and proceedings
|
Form |
Electronic book
|
Author |
Torre Toledano, Doroteo.
|
ISBN |
9783642352928 |
|
3642352928 |
|