Àâòîðèçàöèÿ
Ïîèñê ïî óêàçàòåëÿì
Taylor P. — Text-to-Speech Synthesis
Îáñóäèòå êíèãó íà íàó÷íîì ôîðóìå
Íàøëè îïå÷àòêó? Âûäåëèòå åå ìûøêîé è íàæìèòå Ctrl+Enter
Íàçâàíèå: Text-to-Speech Synthesis
Àâòîð: Taylor P.
Àííîòàöèÿ: Text-to-Speech Synthesis provides a complete, end-to-end account of the process of generating speech by computer. Giving an in-depth explanation of all aspects of current speech synthesis technology, it assumes no specialized prior knowledge. Introductory chapters on linguistics, phonetics, signal processing and speech signals lay the foundation, with subsequent material explaining how this knowledge is put to use in building practical systems that generate speech. Including coverage of the very latest techniques such as unit selection, hidden Markov model synthesis, and statistical text analysis, explanations of the more traditional techniques such as format synthesis and synthesis by rule are also provided. Weaving together the various strands of this multidisciplinary field, the book is designed for graduate students in electrical engineering, computer science, and linguistics. It is also an ideal reference for practitioners in the fields of human communication interaction and telephony.
ßçûê:
Ðóáðèêà: Computer science /
Ñòàòóñ ïðåäìåòíîãî óêàçàòåëÿ: Ãîòîâ óêàçàòåëü ñ íîìåðàìè ñòðàíèö
ed2k: ed2k stats
Ãîä èçäàíèÿ: 2009
Êîëè÷åñòâî ñòðàíèö: 597
Äîáàâëåíà â êàòàëîã: 31.10.2010
Îïåðàöèè: Ïîëîæèòü íà ïîëêó |
Ñêîïèðîâàòü ññûëêó äëÿ ôîðóìà | Ñêîïèðîâàòü ID
Ïðåäìåòíûé óêàçàòåëü
Diphones from speech 414—415
Diphthongs 153 201
Discourse-neutral renderings 116
Discrete Fourier transform (DFT) 281—282
Discrete random variables 540—541
Discrete-time Fourier transform (DTFT) 280—281
Discrete-tube model, assumptions 337
Discreteness 15—16
Distinctiveness of speech issues 171—172
Downdrift/declination 230—233
Duality principle 15
Duration synthesis modelling 463—464
Dutch intonation school 237
Dynamic-system synthesis models 252—253
Dynamic-time-warping (DTW) technique 219 469
Ease of data acquisition, and synthesis with vocal-tract models 407
Egressive pulmonic air stream 147
Eigenface model 528
Electoglottography/laryngography 155 383
Electromagnetic articulography (EMA) 156
Electropalatography 155
Emotion axes 123
Emotional speech synthesis 529—531
Emotional speech synthesis with HMM techniques 531
Emotional speech synthesis with prosody control 529—530
Emotional speech synthesis with unit selection 531
Emotional speech synthesis with voice transformation 530
Emotional speech synthesis, describing emotion 529
emphasis 118—119
Encoding/decoding messages 18—19 21—22
Engine/rule separation 83
Engineering approach to TTS 4
entropy 546—547 552
Epoch detection 381—384
Epoch detection, electroglottograph 383
Epoch detection, epoch-detection algorithm (EDA) 381
Epoch detection, instant of glottal closure (IGC) 382—383
Epoch detection, laryngograph/laryngograph signals (Lx signals) 383—384
Epoch detection, pitch-synchronous analysis 381
Epoch manipulation for TD-PSOLA 417—420
Equivalent rectangular bandwidth (ERB) auditory scale 352
Euclidean distance 486
Euler's formula 266—268
Evaluation 522—526 see
Evaluation, about evaluation 522—523
Exceptions dictionaries 208
Expressive speech see "Emotional speech synthesis"
Feature geometry 183
Features and algorithms 79—82
Filter-bank speech analysis 352—353
Filters see "Digital filters"
Finite partial functions 72
Finite-impulse-response (FIR) filter 289
First-generation synthesis see "Vocal-tract models synthesis
Forced alignment 468
Form/message-to-speech synthesis 42
Formant synthesis 388—399
Formant synthesis about formant synthesis 388—389
Formant synthesis, consonant synthesising 392—394
Formant synthesis, copy synthesis technique 394—396
Formant synthesis, Klatt synthesiser 394—395
Formant synthesis, lumped-parameter speech generation model 389
Formant synthesis, parallel synthesisers 392
Formant synthesis, phonetic input 394—397
Formant synthesis, quality issues 397—399
Formant synthesis, serial/cascade synthesisers 391—392
Formant synthesis, single formant synthesis 390—391
Formant synthesis, sound sources 389—390
Formant tracking 370—372
Formants (speech resonance) 159—160
Fourier series/synthesis/analysis 265—266 269—271
Fourier transform 275—278
Fourier transform, discrete Fourier transform (DFT) 281—282
Fourier transform, discrete-time Fourier transform (DTFT) 280—281
Fourier transform, duality principle 278
Fourier transform, inverse Fourier transform 277—278
Fourier transform, scaling property 277
Fourier transform, sine function 277
Frame shift in speech analysis 346—347
frequency 264
Frequency domain 270—275 307
Frequency domain for digital signals 283—284
Frequency domain for pitch detection 381
Frequency domain, analysis/spectral analysis 156
Frequency, angular frequency 265
Fricatives 153
Fujisaki intonation model 227 239—242
Fujisaki superimpositional models, analysis with 250
Fujisaki superimpositional models, synthesis with 249
Fundamental frequency (FO) 148 265 see
Fundamental frequency (FO) and pitch 225
Fundamental frequency (FO) contour models 227—229
Fundamental frequency (FO) contour models, acoustic model 228
Fundamental frequency (FO) contour models, classifiers 228
Fundamental frequency (FO) contour models, regression algorithms 228
Fundamental frequency (FO) contour models, target points 228
Gaussian mixture models 469
Gaussian/normal distribution/bell curve 436—438 549
General partial-synthesis functions 496—497
Generative models 89—90
Glides 154—155
Glides, off-glides 155
Glides, on-glides 155
Glottis/glottal source 148 330—333
Glottis/glottal source, assumptions 338—339
Glottis/glottal source, glottal-flow derivative 333
Glottis/glottal source, Lijencrants — Fant model 332
Glottis/glottal source, open/return/closed phases 330—331
Glottis/glottal source, parameterisation of glottal-flow signals 379
Government phonology 183
Grapheme-to-phoneme (G2P) conversion 55 218—222
Grapheme-to-phoneme (G2P) conversion with decision trees 221
Grapheme-to-phoneme (G2P) conversion with support-vector machines 221
Grapheme-to-phoneme (G2P) conversion, dynamic time warping (DTW) 219
Grapheme-to-phoneme (G2P) conversion, G2P algorithms 208 218
Grapheme-to-phoneme (G2P) conversion, G2P alignment 219
Grapheme-to-phoneme (G2P) conversion, memory-based learning 220—221
Grapheme-to-phoneme (G2P) conversion, NetTalk algorithm 219—220
Grapheme-to-phoneme (G2P) conversion, neural networks 219—220
Grapheme-to-phoneme (G2P) conversion, pronunciation by analogy 220—221
Grapheme-to-phoneme (G2P) conversion, rule ordering 219
Grapheme-to-phoneme (G2P) conversion, rule-based techniques 218—219
Grapheme-to-phoneme (G2P) conversion, statistical techniques 221—222
Graphemes 28
Graphemes, definition 54—55
Graphemes, TTS models 39
Grice's maxims 20
Hand labelling 519—521
Hand written algorithms 80
Harmonic/noise models (HNMs) 426—429
Harmonics 148—149
Harvard sentences 523
Haskins sentences 523
Heterogeneous relation graph (HRG) formalism 72—75
Hidden Markov model (HMM) about the HMM 89—91 435 471—473
Hidden Markov model (HMM) and intonation synthesis 253—254
Hidden Markov model (HMM) and phrasing prediction 133—135
Hidden Markov model (HMM) formalism 435—456 see
Hidden Markov model (HMM) formalism about HMM formalism 435—436
Hidden Markov model (HMM) formalism as generative models 440—443
Hidden Markov model (HMM) formalism, acoustic representations 439—440
Hidden Markov model (HMM) formalism, backoff techniques 444
Hidden Markov model (HMM) formalism, Baum — Welch algorithm 449
Hidden Markov model (HMM) formalism, context-sensitive modelling 451—454
Hidden Markov model (HMM) formalism, covariance matrix 439
Hidden Markov model (HMM) formalism, decision trees 452—455
Hidden Markov model (HMM) formalism, delta delta/ acceleration coefficients 439
Hidden Markov model (HMM) formalism, delta/velocity coefficients 438—439
Hidden Markov model (HMM) formalism, diagonal covariance 439
Hidden Markov model (HMM) formalism, discrete state problems 454—455
Hidden Markov model (HMM) formalism, forced-alignment mode 448
Hidden Markov model (HMM) formalism, forward-backward algorithm 449—450
Hidden Markov model (HMM) formalism, generative nature issues 455—456
Hidden Markov model (HMM) formalism, independence of observations issues 454
Hidden Markov model (HMM) formalism, language models 444
Hidden Markov model (HMM) formalism, linearity problems 455
Hidden Markov model (HMM) formalism, recognising with HMMs 440—443
Hidden Markov model (HMM) formalism, self-transition probability 440
Hidden Markov model (HMM) formalism, smoothing techniques 444
Hidden Markov model (HMM) formalism, states of phone models 440
Hidden Markov model (HMM) formalism, training HMMs 448—451
Hidden Markov model (HMM) formalism, transition probabilities 440
Hidden Markov model (HMM) formalism, triphone models 451
Hidden Markov model (HMM) formalism, Viterbi algorithm 444—448
Hidden Markov models (HMMs), labelling databases with 465—468
Hidden Markov models (HMMs), labelling databases with, about labelling 465
Hidden Markov models (HMMs), labelling databases with, alignments quality measurement 470
Hidden Markov models (HMMs), labelling databases with, dynamic-time-warping (DTW) technique 469
Hidden Markov models (HMMs), labelling databases with, forced alignment 468
Hidden Markov models (HMMs), labelling databases with, Gaussian mixture models 469
Hidden Markov models (HMMs), labelling databases with, phone boundaries determination 468—470
Hidden Markov models (HMMs), labelling databases with, phone sequence determination 467—468
Hidden Markov models (HMMs), labelling databases with, word sequence determination 467
Hidden Markov models (HMMs), synthesis from 456—464 514
Hidden Markov models (HMMs), synthesis from, about synthesis from HMMs 456—457
Hidden Markov models (HMMs), synthesis from, acoustic representations 460—461
Hidden Markov models (HMMs), synthesis from, context-sensitive models 461—463
Hidden Markov models (HMMs), synthesis from, duration modelling 463—464
Hidden Markov models (HMMs), synthesis from, example systems 464
Hidden Markov models (HMMs), synthesis from, likeliest observations for a given state sequence 457—460
Hidden semi-Markov model (HSMM) 464
Homographs 56
Homographs, abbreviation homographs 54
Homographs, accidental homographs 54
Homographs, ambiguity issues 22 46
Homographs, decoding 98
Homographs, disambiguation 79 99—101
Homographs, homograph disambiguation 56
Homographs, part-of-speech homographs 54
Homographs, resolution of 53
Homographs, true homographs 54
Homonyms 58
Homonyms, pure homonyms 58
Homophones 56—57
Human communication 13—18 see "Verbal
Human communication, about human communication 13—14
Human communication, affective prosody 17
Human communication, augmentative prosody 18
Hunt and Black algorithm 477—479 504
Iconic communication 9—10
Impulse/noise models, classical LP prediction 378
Impulse/noise models, classical LP synthesis 400—401
Independence concept 543
Independent feature formulation (IFF) 485
Infinite-impulse-response (IIR) filter 289
Information-theoretic approach 23
Inside-outside algorithms 105
Instant of glottal closure (IGC) points 374 382—383
Integrated systems, future of 536
Intelligibility issues 3 48—49 510 523
International Phonetic Association (IPA), alphabet 163—165
International Phonetic Association (IPA), consonant chart 555
International Phonetic Association (IPA), symbol set (IPA alphabet) 163—165
Interpreted communication 8
Interpreting characters 69—71
Intonation and tune 121—122
Intonation and tune, prediction issues 139
Intonation behaviour 229—236
Intonation behaviour, boundary tones 236
Intonation behaviour, downdrift/declination 230—233
Intonation behaviour, nuclear accents 230
Intonation behaviour, pitch accents 230 234—236
Intonation behaviour, pitch range 233—234
Intonation behaviour, tune 229—230
Intonation synthesis 225—229
Intonation synthesis about intonation 225 259—261
Intonation synthesis, F0 and pitch 226
Intonation synthesis, F0 synthesis 229
Intonation synthesis, intonational form 226—227
Intonation synthesis, intonational synthesis 225
Intonation synthesis, micro-prosody 229
Intonation synthesis, pitch-accent languages 227
Intonation synthesis, tone languages 227
Intonation theories and models 236—245 250—254 see "Data-driven "Deterministic synthesis
Intonation theories and models, about data-driven models 250—251
Intonation theories and models, autosegmental-metrical (AM) model 237—239
Intonation theories and models, British school 227 236—237
Intonation theories and models, data driven models 250—254
Intonation theories and models, Dutch school 237
Intonation theories and models, F0 contour models 227—229
Intonation theories and models, Fujisaki model 227 239—242 250
Intonation theories and models, intonational phonology 237
Intonation theories and models, INTSINT model 239
Intonation theories and models, phonological versus phonetic versus acoustic 244—245
Intonation theories and models, purpose 244
Intonation theories and models, superimpositional models 242
Intonation theories and models, superimpositional versus linear 245
Intonation theories and models, Tilt model 227 242—244
Intonation theories and models, ToBI scheme 237
Intonation theories and models, tones versus shapes 245
Intonation theories and models, traditional model 236—237
Intonational phonology 121
Intonational phrases 114
INTSINT intonation model 239
Inverse filtering 372
IPA see "International Phonetic Association (IPA)"
ISO 8859 70
Java speech markup language 69
Join functions 497—504
Join functions about joining units 497—498
Join functions, acoustic-distance join costs 499—500
Join functions, categorical and acoustic join costs 500—501
Join functions, join classifiers 497 502—504
Join functions, join costs 497—498
Join functions, join detectability 498
Join functions, join probability 497
Join functions, macro-concatenation issue 497
Join functions, phone-class join costs 498—499
Join functions, probabilistic and sequence join function 501—502
Join functions, sequence join classifier 503
Join functions, singular-value decomposition (SVD) 502
Join functions, splicing costs 499
Kalman filter 252
Klatt deterministic rules 256—257
Klatt synthesiser 394—395
Kullback — Leibler distance 552
Labelling databases 519 see labelling
Labelling databases, automatic labelling 521
Labelling databases, avoiding explicit labels 521—522
Labelling databases, hand labelling 519—521
Labiodental constriction 153
Language models 444
Language models, N-gram language model 444
Language origin, and pronunciation 223
Laplace transform 283
Laryngograph/laryngograph signals (Lx signals) 383—384
Larynx 148
Laureate system 513
Least modification principle 477
Letter sequences, decoding 99
Levinson — Durbin recursion source-filter separation technique 361—362 367
Lexemes, inflected forms 59
Lexical phonology/word formation 179—181
Lexical stress 116 186—189
Lexicons 63 207—218
Lexicons as a relational database 210—211
Lexicons, compression, lossless and lossy 215
Lexicons, computer lexicons 207
Lexicons, exceptions dictionaries 208
Lexicons, formats 210—212
Ðåêëàìà