Theory and applications of digital speech processing / Lawrence R. Rabiner, Ronald W. Schafer.

Por: Rabiner, Lawrence R, 1943-Colaborador(es): Schafer, Ronald W | , 1938-Tipo de material: TextoTextoEditor: New Jersey: Pearson, 2011Edición: 1st edDescripción: xiv, 1042 p.: il. ; 24 cmISBN: 9780136034285 (alk. paper) ; 0136034284 (alk. paper)Tema(s): DIGITALIZACIÓN -- VOZ | SISTEMAS DE PROCESAMIENTO DE LA VOZClasificación CDD: 006.454
Contenidos:
CHAPTER 1 Introduction to Digital Speech Processing 11.1 The Speech Signal 31.2 The Speech Stack 81.3 Applications of Digital Speech Processing 101.4 Comment on the References 151.5 Summary 17CHAPTER 2 Review of Fundamentals of Digital Signal Processing 182.1 Introduction 182.2 Discrete-Time Signals and Systems 182.3 Transform Representation of Signals and Systems 222.4 Fundamentals of Digital Filters 332.5 Sampling 442.6 Summary 56Problems 56CHAPTER 3 Fundamentals of Human Speech Production 673.1 Introduction 673.2 The Process of Speech Production 683.3 Short-Time Fourier Representation of Speech 813.4 Acoustic Phonetics 863.5 Distinctive Features of the Phonemes of American English 1083.6 Summary 110Problems 110CHAPTER 4 Hearing, Auditory Models, and Speech Perception 1244.1 Introduction 1244.2 The Speech Chain 1254.3 Anatomy and Function of the Ear 1274.4 The Perception of Sound 1334.5 Auditory Models 1504.6 Human Speech Perception Experiments 1584.7 Measurement of Speech Quality and Intelligibility 1624.8 Summary 166Problems 167CHAPTER 5 Sound Propagation in the Human Vocal Tract 1705.1 The Acoustic Theory of Speech Production 1705.2 Lossless Tube Models 2005.3 Digital Models for Sampled Speech Signals 2195.4 Summary 228Problems 228CHAPTER 6 Time-Domain Methods for Speech Processing 2396.1 Introduction 2396.2 Short-Time Analysis of Speech 2426.3 Short-Time Energy and Short-Time Magnitude 2486.4 Short-Time Zero-Crossing Rate 2576.5 The Short-Time Autocorrelation Function 2656.6 The Modified Short-Time Autocorrelation Function 2736.7 The Short-Time Average Magnitude Difference Function 2756.8 Summary 277Problems 278CHAPTER 7 Frequency-Domain Representations 2877.1 Introduction 2877.2 Discrete-Time Fourier Analysis 2897.3 Short-Time Fourier Analysis 2927.4 Spectrographic Displays 3127.5 Overlap Addition Method of Synthesis 3197.6 Filter Bank Summation Method of Synthesis 3317.7 Time-Decimated Filter Banks 3407.8 Two-Channel Filter Banks 3487.9 Implementation of the FBS Method Using the FFT 3587.10 OLA Revisited 3657.11 Modifications of the STFT 3677.12 Summary 379Problems 380CHAPTER 8 The Cepstrum and Homomorphic Speech Processing 3998.1 Introduction 3998.2 Homomorphic Systems for Convolution 4018.3 Homomorphic Analysis of the Speech Model 4178.4 Computing the Short-Time Cepstrum and Complex Cepstrumof Speech 4298.5 Homomorphic Filtering of Natural Speech 4408.6 Cepstrum Analysis of All-Pole Models 4568.7 Cepstrum Distance Measures 4598.8 Summary 466Problems 466CHAPTER 9 Linear Predictive Analysis of Speech Signals 4739.1 Introduction 4739.2 Basic Principles of Linear Predictive Analysis 4749.3 Computation of the Gain for the Model 4869.4 Frequency Domain Interpretations of Linear PredictiveAnalysis 4909.5 Solution of the LPC Equations 5059.6 The Prediction Error Signal 5279.7 Some Properties of the LPC Polynomial A(z) 5389.8 Relation of Linear Predictive Analysis to Lossless Tube Models 5469.9 Alternative Representations of the LP Parameters 5519.10 Summary 560Problems 560CHAPTER 10 Algorithms for Estimating Speech Parameters 57810.1 Introduction 57810.2 Median Smoothing and Speech Processing 58010.3 Speech-Background/Silence Discrimination 58610.4 A Bayesian Approach to Voiced/Unvoiced/Silence Detection 59510.5 Pitch Period Estimation (Pitch Detection) 60310.6 Formant Estimation 63510.7 Summary 645Problems 645CHAPTER 11 Digital Coding of Speech Signals 66311.1 Introduction 66311.2 Sampling Speech Signals 66711.3 A Statistical Model for Speech 66911.4 Instantaneous Quantization 67611.5 Adaptive Quantization 70611.6 Quantizing of Speech Model Parameters 71811.7 General Theory of Differential Quantization 73211.8 Delta Modulation 74311.9 Differential PCM (DPCM) 75911.10 Enhancements for ADPCM Coders 76811.11 Analysis-by-Synthesis Speech Coders 78311.12 Open-Loop Speech Coders 80611.13 Applications of Speech Coders 81411.14 Summary 819Problems 820CHAPTER 12 Frequency-Domain Coding of Speech and Audio 84212.1 Introduction 84212.2 Historical Perspective 84412.3 Subband Coding 85012.4 Adaptive Transform Coding 86112.5 A Perception Model for Audio Coding 86612.6 MPEG-1 Audio Coding Standard 88112.7 Other Audio Coding Standards 89412.8 Summary 894Problems 895CHAPTER 13 Text-to-Speech Synthesis Methods 90713.1 Introduction 90713.2 Text Analysis 90813.3 Evolution of Speech Synthesis Methods 91413.4 Early Speech Synthesis Approaches 91613.5 Unit Selection Methods 92613.6 TTS Future Needs 94213.7 Visual TTS 94313.8 Summary 947Problems 947CHAPTER 14 Automatic Speech Recognition and NaturalLanguage Understanding 95014.1 Introduction 95014.2 Basic ASR Formulation 95214.3 Overall Speech Recognition Process 95314.4 Building a Speech Recognition System 95414.5 The Decision Processes in ASR 95714.6 Step 3: The Search Problem 97114.7 Simple ASR System: Isolated Digit Recognition 97214.8 Performance Evaluation of Speech Recognizers 97414.9 Spoken Language Understanding 97714.10 Dialog Management and Spoken Language Generation 98014.11 User Interfaces 98314.12 Multimodal User Interfaces 98414.13 Summary 984Problems 985AppendicesA Speech and Audio Processing Demonstrations 993B Solution of Frequency-Domain Differential Equations 1005Bibliography 1009Index 1033
Resumen: Incluso después de más de 30 años, el libro de texto de 1978 Rabiner y Schafer sigue siendo uno de los más completos para la enseñanza de un curso de procesamiento del habla a nivel de postgrado de un semestre. El nuevo libro se las arregla para superar eso y es, sin duda representa una mejora con respecto a la antigua. Se duplica el contenido del clásico imprescindible, añade muchos nuevos desarrollos tecnológicos en los últimos años, y amplía las áreas de aplicación que han visto un enorme crecimiento en las últimas dos décadas. La inclusión de nuevos problemas y ejercicios de MATLAB, junto con muestras de voz del mundo real también facilita conveniente trampolín para el diseño de proyectos de clase en profundidad.
Etiquetas de esta biblioteca: No hay etiquetas de esta biblioteca para este título. Ingresar para agregar etiquetas.
    Valoración media: 0.0 (0 votos)
Tipo de ítem Ubicación actual Colección Signatura Info Vol Copia número Estado Fecha de vencimiento Código de barras Reserva de ítems
LIBRO - MATERIAL GENERAL LIBRO - MATERIAL GENERAL Biblioteca Jorge Álvarez Lleras
Colección / Fondo / Acervo / Resguardo 006.454 R116t (Navegar estantería) Ej. 1 1 Disponible 023110
Total de reservas: 0

Incluye Bibliografía e indices

CHAPTER 1 Introduction to Digital Speech Processing 11.1 The Speech Signal 31.2 The Speech Stack 81.3 Applications of Digital Speech Processing 101.4 Comment on the References 151.5 Summary 17CHAPTER 2 Review of Fundamentals of Digital Signal Processing 182.1 Introduction 182.2 Discrete-Time Signals and Systems 182.3 Transform Representation of Signals and Systems 222.4 Fundamentals of Digital Filters 332.5 Sampling 442.6 Summary 56Problems 56CHAPTER 3 Fundamentals of Human Speech Production 673.1 Introduction 673.2 The Process of Speech Production 683.3 Short-Time Fourier Representation of Speech 813.4 Acoustic Phonetics 863.5 Distinctive Features of the Phonemes of American English 1083.6 Summary 110Problems 110CHAPTER 4 Hearing, Auditory Models, and Speech Perception 1244.1 Introduction 1244.2 The Speech Chain 1254.3 Anatomy and Function of the Ear 1274.4 The Perception of Sound 1334.5 Auditory Models 1504.6 Human Speech Perception Experiments 1584.7 Measurement of Speech Quality and Intelligibility 1624.8 Summary 166Problems 167CHAPTER 5 Sound Propagation in the Human Vocal Tract 1705.1 The Acoustic Theory of Speech Production 1705.2 Lossless Tube Models 2005.3 Digital Models for Sampled Speech Signals 2195.4 Summary 228Problems 228CHAPTER 6 Time-Domain Methods for Speech Processing 2396.1 Introduction 2396.2 Short-Time Analysis of Speech 2426.3 Short-Time Energy and Short-Time Magnitude 2486.4 Short-Time Zero-Crossing Rate 2576.5 The Short-Time Autocorrelation Function 2656.6 The Modified Short-Time Autocorrelation Function 2736.7 The Short-Time Average Magnitude Difference Function 2756.8 Summary 277Problems 278CHAPTER 7 Frequency-Domain Representations 2877.1 Introduction 2877.2 Discrete-Time Fourier Analysis 2897.3 Short-Time Fourier Analysis 2927.4 Spectrographic Displays 3127.5 Overlap Addition Method of Synthesis 3197.6 Filter Bank Summation Method of Synthesis 3317.7 Time-Decimated Filter Banks 3407.8 Two-Channel Filter Banks 3487.9 Implementation of the FBS Method Using the FFT 3587.10 OLA Revisited 3657.11 Modifications of the STFT 3677.12 Summary 379Problems 380CHAPTER 8 The Cepstrum and Homomorphic Speech Processing 3998.1 Introduction 3998.2 Homomorphic Systems for Convolution 4018.3 Homomorphic Analysis of the Speech Model 4178.4 Computing the Short-Time Cepstrum and Complex Cepstrumof Speech 4298.5 Homomorphic Filtering of Natural Speech 4408.6 Cepstrum Analysis of All-Pole Models 4568.7 Cepstrum Distance Measures 4598.8 Summary 466Problems 466CHAPTER 9 Linear Predictive Analysis of Speech Signals 4739.1 Introduction 4739.2 Basic Principles of Linear Predictive Analysis 4749.3 Computation of the Gain for the Model 4869.4 Frequency Domain Interpretations of Linear PredictiveAnalysis 4909.5 Solution of the LPC Equations 5059.6 The Prediction Error Signal 5279.7 Some Properties of the LPC Polynomial A(z) 5389.8 Relation of Linear Predictive Analysis to Lossless Tube Models 5469.9 Alternative Representations of the LP Parameters 5519.10 Summary 560Problems 560CHAPTER 10 Algorithms for Estimating Speech Parameters 57810.1 Introduction 57810.2 Median Smoothing and Speech Processing 58010.3 Speech-Background/Silence Discrimination 58610.4 A Bayesian Approach to Voiced/Unvoiced/Silence Detection 59510.5 Pitch Period Estimation (Pitch Detection) 60310.6 Formant Estimation 63510.7 Summary 645Problems 645CHAPTER 11 Digital Coding of Speech Signals 66311.1 Introduction 66311.2 Sampling Speech Signals 66711.3 A Statistical Model for Speech 66911.4 Instantaneous Quantization 67611.5 Adaptive Quantization 70611.6 Quantizing of Speech Model Parameters 71811.7 General Theory of Differential Quantization 73211.8 Delta Modulation 74311.9 Differential PCM (DPCM) 75911.10 Enhancements for ADPCM Coders 76811.11 Analysis-by-Synthesis Speech Coders 78311.12 Open-Loop Speech Coders 80611.13 Applications of Speech Coders 81411.14 Summary 819Problems 820CHAPTER 12 Frequency-Domain Coding of Speech and Audio 84212.1 Introduction 84212.2 Historical Perspective 84412.3 Subband Coding 85012.4 Adaptive Transform Coding 86112.5 A Perception Model for Audio Coding 86612.6 MPEG-1 Audio Coding Standard 88112.7 Other Audio Coding Standards 89412.8 Summary 894Problems 895CHAPTER 13 Text-to-Speech Synthesis Methods 90713.1 Introduction 90713.2 Text Analysis 90813.3 Evolution of Speech Synthesis Methods 91413.4 Early Speech Synthesis Approaches 91613.5 Unit Selection Methods 92613.6 TTS Future Needs 94213.7 Visual TTS 94313.8 Summary 947Problems 947CHAPTER 14 Automatic Speech Recognition and NaturalLanguage Understanding 95014.1 Introduction 95014.2 Basic ASR Formulation 95214.3 Overall Speech Recognition Process 95314.4 Building a Speech Recognition System 95414.5 The Decision Processes in ASR 95714.6 Step 3: The Search Problem 97114.7 Simple ASR System: Isolated Digit Recognition 97214.8 Performance Evaluation of Speech Recognizers 97414.9 Spoken Language Understanding 97714.10 Dialog Management and Spoken Language Generation 98014.11 User Interfaces 98314.12 Multimodal User Interfaces 98414.13 Summary 984Problems 985AppendicesA Speech and Audio Processing Demonstrations 993B Solution of Frequency-Domain Differential Equations 1005Bibliography 1009Index 1033

Incluso después de más de 30 años, el libro de texto de 1978 Rabiner y Schafer sigue siendo uno de los más completos para la enseñanza de un curso de procesamiento del habla a nivel de postgrado de un semestre. El nuevo libro se las arregla para superar eso y es, sin duda representa una mejora con respecto a la antigua. Se duplica el contenido del clásico imprescindible, añade muchos nuevos desarrollos tecnológicos en los últimos años, y amplía las áreas de aplicación que han visto un enorme crecimiento en las últimas dos décadas. La inclusión de nuevos problemas y ejercicios de MATLAB, junto con muestras de voz del mundo real también facilita conveniente trampolín para el diseño de proyectos de clase en profundidad.

No hay comentarios en este titulo.

para colocar un comentario.