Taylor P. — Text-to-Speech Synthesis :: Электронная библиотека попечительского совета мехмата МГУ

Главная Ex Libris Книги Журналы Статьи Серии Каталог Wanted Загрузка ХудЛит Справка Поиск по индексам Поиск Форум

Авторизация

Поиск по указателям

Красота

Taylor P. — Text-to-Speech Synthesis

Taylor P. — Text-to-Speech Synthesis

Обсудите книгу на научном форуме

Нашли опечатку?
Выделите ее мышкой и нажмите Ctrl+Enter

Название: Text-to-Speech Synthesis

Автор: Taylor P.

Аннотация:

Text-to-Speech Synthesis provides a complete, end-to-end account of the process of generating speech by computer. Giving an in-depth explanation of all aspects of current speech synthesis technology, it assumes no specialized prior knowledge. Introductory chapters on linguistics, phonetics, signal processing and speech signals lay the foundation, with subsequent material explaining how this knowledge is put to use in building practical systems that generate speech. Including coverage of the very latest techniques such as unit selection, hidden Markov model synthesis, and statistical text analysis, explanations of the more traditional techniques such as format synthesis and synthesis by rule are also provided. Weaving together the various strands of this multidisciplinary field, the book is designed for graduate students in electrical engineering, computer science, and linguistics. It is also an ideal reference for practitioners in the fields of human communication interaction and telephony.

Язык:

Рубрика: Computer science/

Статус предметного указателя: Готов указатель с номерами страниц

ed2k: ed2k stats

Год издания: 2009

Количество страниц: 597

Добавлена в каталог: 31.10.2010

Операции: Положить на полку | Скопировать ссылку для форума | Скопировать ID

Предметный указатель

Diphones from speech      414—415
Diphthongs      153 201
Discourse-neutral renderings      116
Discrete Fourier transform (DFT)      281—282
Discrete random variables      540—541
Discrete-time Fourier transform (DTFT)      280—281
Discrete-tube model, assumptions      337
Discreteness      15—16
Distinctiveness of speech issues      171—172
Downdrift/declination      230—233
Duality principle      15
Duration synthesis modelling      463—464
Dutch intonation school      237
Dynamic-system synthesis models      252—253
Dynamic-time-warping (DTW) technique      219 469
Ease of data acquisition, and synthesis with vocal-tract models      407
Egressive pulmonic air stream      147
Eigenface model      528
Electoglottography/laryngography      155 383
Electromagnetic articulography (EMA)      156
Electropalatography      155
Emotion axes      123
Emotional speech synthesis      529—531
Emotional speech synthesis with HMM techniques      531
Emotional speech synthesis with prosody control      529—530
Emotional speech synthesis with unit selection      531
Emotional speech synthesis with voice transformation      530
Emotional speech synthesis, describing emotion      529
emphasis      118—119
Encoding/decoding messages      18—19 21—22
Engine/rule separation      83
Engineering approach to TTS      4
entropy      546—547 552
Epoch detection      381—384
Epoch detection, electroglottograph      383
Epoch detection, epoch-detection algorithm (EDA)      381
Epoch detection, instant of glottal closure (IGC)      382—383
Epoch detection, laryngograph/laryngograph signals (Lx signals)      383—384
Epoch detection, pitch-synchronous analysis      381
Epoch manipulation for TD-PSOLA      417—420
Equivalent rectangular bandwidth (ERB) auditory scale      352
Euclidean distance      486
Euler's formula      266—268
Evaluation      522—526 see
Evaluation, about evaluation      522—523
Exceptions dictionaries      208
Expressive speech      see "Emotional speech synthesis"
Feature geometry      183
Features and algorithms      79—82
Filter-bank speech analysis      352—353
Filters      see "Digital filters"
Finite partial functions      72
Finite-impulse-response (FIR) filter      289
First-generation synthesis      see "Vocal-tract models synthesis
Forced alignment      468
Form/message-to-speech synthesis      42
Formant synthesis      388—399
Formant synthesis about formant synthesis      388—389
Formant synthesis, consonant synthesising      392—394
Formant synthesis, copy synthesis technique      394—396
Formant synthesis, Klatt synthesiser      394—395
Formant synthesis, lumped-parameter speech generation model      389
Formant synthesis, parallel synthesisers      392
Formant synthesis, phonetic input      394—397
Formant synthesis, quality issues      397—399
Formant synthesis, serial/cascade synthesisers      391—392
Formant synthesis, single formant synthesis      390—391
Formant synthesis, sound sources      389—390
Formant tracking      370—372
Formants (speech resonance)      159—160
Fourier series/synthesis/analysis      265—266 269—271
Fourier transform      275—278
Fourier transform, discrete Fourier transform (DFT)      281—282
Fourier transform, discrete-time Fourier transform (DTFT)      280—281
Fourier transform, duality principle      278
Fourier transform, inverse Fourier transform      277—278
Fourier transform, scaling property      277
Fourier transform, sine function      277
Frame shift in speech analysis      346—347
frequency      264
Frequency domain      270—275 307
Frequency domain for digital signals      283—284
Frequency domain for pitch detection      381
Frequency domain, analysis/spectral analysis      156
Frequency, angular frequency      265
Fricatives      153
Fujisaki intonation model      227 239—242
Fujisaki superimpositional models, analysis with      250
Fujisaki superimpositional models, synthesis with      249
Fundamental frequency (FO)      148 265 see
Fundamental frequency (FO) and pitch      225
Fundamental frequency (FO) contour models      227—229
Fundamental frequency (FO) contour models, acoustic model      228
Fundamental frequency (FO) contour models, classifiers      228
Fundamental frequency (FO) contour models, regression algorithms      228
Fundamental frequency (FO) contour models, target points      228
Gaussian mixture models      469
Gaussian/normal distribution/bell curve      436—438 549
General partial-synthesis functions      496—497
Generative models      89—90
Glides      154—155
Glides, off-glides      155
Glides, on-glides      155
Glottis/glottal source      148 330—333
Glottis/glottal source, assumptions      338—339
Glottis/glottal source, glottal-flow derivative      333
Glottis/glottal source, Lijencrants — Fant model      332
Glottis/glottal source, open/return/closed phases      330—331
Glottis/glottal source, parameterisation of glottal-flow signals      379
Government phonology      183
Grapheme-to-phoneme (G2P) conversion      55 218—222
Grapheme-to-phoneme (G2P) conversion with decision trees      221
Grapheme-to-phoneme (G2P) conversion with support-vector machines      221
Grapheme-to-phoneme (G2P) conversion, dynamic time warping (DTW)      219
Grapheme-to-phoneme (G2P) conversion, G2P algorithms      208 218
Grapheme-to-phoneme (G2P) conversion, G2P alignment      219
Grapheme-to-phoneme (G2P) conversion, memory-based learning      220—221
Grapheme-to-phoneme (G2P) conversion, NetTalk algorithm      219—220
Grapheme-to-phoneme (G2P) conversion, neural networks      219—220
Grapheme-to-phoneme (G2P) conversion, pronunciation by analogy      220—221
Grapheme-to-phoneme (G2P) conversion, rule ordering      219
Grapheme-to-phoneme (G2P) conversion, rule-based techniques      218—219
Grapheme-to-phoneme (G2P) conversion, statistical techniques      221—222
Graphemes      28
Graphemes, definition      54—55
Graphemes, TTS models      39
Grice's maxims      20
Hand labelling      519—521
Hand written algorithms      80
Harmonic/noise models (HNMs)      426—429
Harmonics      148—149
Harvard sentences      523
Haskins sentences      523
Heterogeneous relation graph (HRG) formalism      72—75
Hidden Markov model (HMM) about the HMM      89—91 435 471—473
Hidden Markov model (HMM) and intonation synthesis      253—254
Hidden Markov model (HMM) and phrasing prediction      133—135
Hidden Markov model (HMM) formalism      435—456 see
Hidden Markov model (HMM) formalism about HMM formalism      435—436
Hidden Markov model (HMM) formalism as generative models      440—443
Hidden Markov model (HMM) formalism, acoustic representations      439—440
Hidden Markov model (HMM) formalism, backoff techniques      444
Hidden Markov model (HMM) formalism, Baum — Welch algorithm      449
Hidden Markov model (HMM) formalism, context-sensitive modelling      451—454
Hidden Markov model (HMM) formalism, covariance matrix      439
Hidden Markov model (HMM) formalism, decision trees      452—455
Hidden Markov model (HMM) formalism, delta delta/ acceleration coefficients      439
Hidden Markov model (HMM) formalism, delta/velocity coefficients      438—439
Hidden Markov model (HMM) formalism, diagonal covariance      439
Hidden Markov model (HMM) formalism, discrete state problems      454—455

Hidden Markov model (HMM) formalism, forced-alignment mode      448
Hidden Markov model (HMM) formalism, forward-backward algorithm      449—450
Hidden Markov model (HMM) formalism, generative nature issues      455—456
Hidden Markov model (HMM) formalism, independence of observations issues      454
Hidden Markov model (HMM) formalism, language models      444
Hidden Markov model (HMM) formalism, linearity problems      455
Hidden Markov model (HMM) formalism, recognising with HMMs      440—443
Hidden Markov model (HMM) formalism, self-transition probability      440
Hidden Markov model (HMM) formalism, smoothing techniques      444
Hidden Markov model (HMM) formalism, states of phone models      440
Hidden Markov model (HMM) formalism, training HMMs      448—451
Hidden Markov model (HMM) formalism, transition probabilities      440
Hidden Markov model (HMM) formalism, triphone models      451
Hidden Markov model (HMM) formalism, Viterbi algorithm      444—448
Hidden Markov models (HMMs), labelling databases with      465—468
Hidden Markov models (HMMs), labelling databases with, about labelling      465
Hidden Markov models (HMMs), labelling databases with, alignments quality measurement      470
Hidden Markov models (HMMs), labelling databases with, dynamic-time-warping (DTW) technique      469
Hidden Markov models (HMMs), labelling databases with, forced alignment      468
Hidden Markov models (HMMs), labelling databases with, Gaussian mixture models      469
Hidden Markov models (HMMs), labelling databases with, phone boundaries determination      468—470
Hidden Markov models (HMMs), labelling databases with, phone sequence determination      467—468
Hidden Markov models (HMMs), labelling databases with, word sequence determination      467
Hidden Markov models (HMMs), synthesis from      456—464 514
Hidden Markov models (HMMs), synthesis from, about synthesis from HMMs      456—457
Hidden Markov models (HMMs), synthesis from, acoustic representations      460—461
Hidden Markov models (HMMs), synthesis from, context-sensitive models      461—463
Hidden Markov models (HMMs), synthesis from, duration modelling      463—464
Hidden Markov models (HMMs), synthesis from, example systems      464
Hidden Markov models (HMMs), synthesis from, likeliest observations for a given state sequence      457—460
Hidden semi-Markov model (HSMM)      464
Homographs      56
Homographs, abbreviation homographs      54
Homographs, accidental homographs      54
Homographs, ambiguity issues      22 46
Homographs, decoding      98
Homographs, disambiguation      79 99—101
Homographs, homograph disambiguation      56
Homographs, part-of-speech homographs      54
Homographs, resolution of      53
Homographs, true homographs      54
Homonyms      58
Homonyms, pure homonyms      58
Homophones      56—57
Human communication      13—18 see "Verbal
Human communication, about human communication      13—14
Human communication, affective prosody      17
Human communication, augmentative prosody      18
Hunt and Black algorithm      477—479 504
Iconic communication      9—10
Impulse/noise models, classical LP prediction      378
Impulse/noise models, classical LP synthesis      400—401
Independence concept      543
Independent feature formulation (IFF)      485
Infinite-impulse-response (IIR) filter      289
Information-theoretic approach      23
Inside-outside algorithms      105
Instant of glottal closure (IGC) points      374 382—383
Integrated systems, future of      536
Intelligibility issues      3 48—49 510 523
International Phonetic Association (IPA), alphabet      163—165
International Phonetic Association (IPA), consonant chart      555
International Phonetic Association (IPA), symbol set (IPA alphabet)      163—165
Interpreted communication      8
Interpreting characters      69—71
Intonation and tune      121—122
Intonation and tune, prediction issues      139
Intonation behaviour      229—236
Intonation behaviour, boundary tones      236
Intonation behaviour, downdrift/declination      230—233
Intonation behaviour, nuclear accents      230
Intonation behaviour, pitch accents      230 234—236
Intonation behaviour, pitch range      233—234
Intonation behaviour, tune      229—230
Intonation synthesis      225—229
Intonation synthesis about intonation      225 259—261
Intonation synthesis, F0 and pitch      226
Intonation synthesis, F0 synthesis      229
Intonation synthesis, intonational form      226—227
Intonation synthesis, intonational synthesis      225
Intonation synthesis, micro-prosody      229
Intonation synthesis, pitch-accent languages      227
Intonation synthesis, tone languages      227
Intonation theories and models      236—245 250—254 see "Data-driven "Deterministic synthesis
Intonation theories and models, about data-driven models      250—251
Intonation theories and models, autosegmental-metrical (AM) model      237—239
Intonation theories and models, British school      227 236—237
Intonation theories and models, data driven models      250—254
Intonation theories and models, Dutch school      237
Intonation theories and models, F0 contour models      227—229
Intonation theories and models, Fujisaki model      227 239—242 250
Intonation theories and models, intonational phonology      237
Intonation theories and models, INTSINT model      239
Intonation theories and models, phonological versus phonetic versus acoustic      244—245
Intonation theories and models, purpose      244
Intonation theories and models, superimpositional models      242
Intonation theories and models, superimpositional versus linear      245
Intonation theories and models, Tilt model      227 242—244
Intonation theories and models, ToBI scheme      237
Intonation theories and models, tones versus shapes      245
Intonation theories and models, traditional model      236—237
Intonational phonology      121
Intonational phrases      114
INTSINT intonation model      239
Inverse filtering      372
IPA      see "International Phonetic Association (IPA)"
ISO 8859      70
Java speech markup language      69
Join functions      497—504
Join functions about joining units      497—498
Join functions, acoustic-distance join costs      499—500
Join functions, categorical and acoustic join costs      500—501
Join functions, join classifiers      497 502—504
Join functions, join costs      497—498
Join functions, join detectability      498
Join functions, join probability      497
Join functions, macro-concatenation issue      497
Join functions, phone-class join costs      498—499
Join functions, probabilistic and sequence join function      501—502
Join functions, sequence join classifier      503
Join functions, singular-value decomposition (SVD)      502
Join functions, splicing costs      499
Kalman filter      252
Klatt deterministic rules      256—257
Klatt synthesiser      394—395
Kullback — Leibler distance      552
Labelling databases      519 see labelling
Labelling databases, automatic labelling      521
Labelling databases, avoiding explicit labels      521—522
Labelling databases, hand labelling      519—521
Labiodental constriction      153
Language models      444
Language models, N-gram language model      444
Language origin, and pronunciation      223
Laplace transform      283
Laryngograph/laryngograph signals (Lx signals)      383—384
Larynx      148
Laureate system      513
Least modification principle      477
Letter sequences, decoding      99
Levinson — Durbin recursion source-filter separation technique      361—362 367
Lexemes, inflected forms      59
Lexical phonology/word formation      179—181
Lexical stress      116 186—189
Lexicons      63 207—218
Lexicons as a relational database      210—211
Lexicons, compression, lossless and lossy      215
Lexicons, computer lexicons      207
Lexicons, exceptions dictionaries      208
Lexicons, formats      210—212

1 2 3 4 5

Реклама

© Электронная библиотека попечительского совета мехмата МГУ, 2004-2026

Электронная библиотека мехмата МГУ

Valid HTML 4.01!

|

Valid CSS!

О проекте