Characteristics of speech
-
In this paper, we propose the Adapt-TTS model, which synthesizes high-quality audio from a small adaptive sample without training, to address these problems. The main contributions of the paper are: 1) an extracting mel-vector (EMV) architecture that better represents speaker characteristics and speech style; 2) an improved zero-shot model with a denoising diffusion component (a mel-spectrogram denoiser) that synthesizes new voices without training and with better quality (less noise).
15p
vimulcahy
18-09-2023
5
4
-
The ebook How to Teach Pronunciation: Part 1 contains the following chapters: Chapter 1, the description of speech; Chapter 2, teaching pronunciation; Chapter 3, vowels; Chapter 4, consonants; Chapter 5, word and sentence stress. Please refer to the documentation for more details.
93p
haojiubujain03
09-08-2023
11
4
-
The ebook Special Education: Part 1 presents the following content: special education: concept and nature; special education: objectives and need; special education: scope and types; physically challenged: definition, types, characteristics; identification, causes, problems of physically challenged; physically challenged: preventions, teaching strategies; ... Please refer to the documentation for more details.
110p
chankora
16-06-2023
5
1
-
This paper investigated Malay vowel variation in three districts (Perlis, Kelantan, and Terengganu) using spontaneous speech acquired in a natural setting. Eight (8) Malay vowels were collected from local male and female speakers residing in Perlis, Kelantan, and Terengganu.
16p
spiritedaway36
28-11-2021
26
2
-
An approach to forming informative features of the voice signal (VS) for the Vietnamese language, based on stationary autoregressive model coefficients, is described. An original VS segmentation algorithm, based on interval estimation of the numerical characteristics of speech samples, was developed to form areas of local stationarity in the voice signal.
11p
vivirginia2711
09-12-2020
23
3
-
The article is devoted to the development of approaches to the improvement of the efficiency of intercultural business communication, which requires a focus on the characteristics of speech that are intended to impress the participants in communication.
8p
orianahuynh
06-06-2020
22
1
-
This paper introduces new methods based on exponential families for modeling the correlations between words in text and speech. While previous work assumed the effects of word co-occurrence statistics to be constant over a window of several hundred words, we show that their influence is nonstationary on a much smaller time scale.
8p
bunthai_1
06-05-2013
47
6
-
In this work, we present an experimental analysis of a dialogue system for the automation of simple telephone services. Starting from the evaluation of a preliminary version of the system, we conclude that it is necessary to design a robust and flexible system able to use different dialogue control strategies depending on the characteristics of the user and the performance of the speech recognition module. Experimental results following the PARADISE framework show an important improvement both in terms of task success and dialogue cost for the proposed system. ...
4p
bunthai_1
06-05-2013
39
2
-
A computer program for synthesizing Japanese fundamental frequency contours implements our theory of Japanese intonation. This theory provides a complete qualitative description of the known characteristics of Japanese intonation, as well as a quantitative model of tone-scaling and timing precise enough to translate straightforwardly into a computational algorithm. An important aspect of the description is that various features of the intonation pattern are designated to be phonological properties of different types of phrasal units in a hierarchical organization.
8p
bungio_1
03-05-2013
33
1
-
In this paper we describe a flexible analysis-synthesis system which can be used for a number of studies in speech research. The main objective is to have a synthesis system whose characteristics can be controlled through a set of parameters to realize any desired voice characteristics. The basic synthesis scheme consists of two steps: generation of an excitation signal from pitch and gain contours, and excitation of the linear system model described by linear prediction coefficients. We show that a number of basic studies such as time expansion/...
4p
bungio_1
03-05-2013
42
1
-
The instance of the electric light may prove illuminating in this connection. The electric light is pure information. It is a medium without a message, as it were, unless it is used to spell out some verbal ad or name. This fact, characteristic of all media, means that the "content" of any medium is always another medium. The content of writing is speech, just as the written word is the content of print, and print is the content of the telegraph. If it is asked, "What is the content of speech?," it is necessary to say, "It is an...
72p
thamgiacongdong
02-05-2013
38
3
-
Determining the relationship between the intonational characteristics of an utterance and other features inferable from its text is important both for speech recognition and for speech synthesis. This work investigates the use of text analysis in predicting the location of intonational phrase boundaries in natural speech, through analyzing 298 utterances from the DARPA Air Travel Information Service database. For statistical modeling, we employ Classification and Regression Tree (CART) techniques. ...
8p
bunmoc_1
20-04-2013
35
4
-
Filled pauses are characteristic of spontaneous speech and can present considerable problems for speech recognition by being often recognized as short words. An um can be recognized as thumb or arm if the recognizer's language model does not adequately represent FP's. Recognition of quasi-spontaneous speech (medical dictation) is subject to this problem as well.
6p
bunrieu_1
18-04-2013
33
2
-
The detection of prosodic characteristics is an important aspect of both speech synthesis and speech recognition. Correct placement of pitch accents aids in more natural-sounding speech, while automatic detection of accents can contribute to better word-level recognition and better textual understanding. In this paper we investigate probabilistic, contextual, and phonological factors that influence pitch accent placement in natural, conversational speech in a sequence labeling setting.
7p
bunbo_1
17-04-2013
36
2
-
In this work, we provide an empirical analysis of differences in word use between genders in telephone conversations, which complements the considerable body of work in sociolinguistics concerned with gender linguistic differences. Experiments are performed on a large speech corpus of roughly 12000 conversations. We employ machine learning techniques to automatically categorize the gender of each speaker given only the transcript of his/her speech, achieving 92% accuracy. An analysis of the most characteristic words for each gender is also presented.
8p
bunbo_1
17-04-2013
60
2
-
A distributional method for part-of-speech induction is presented which, in contrast to most previous work, determines the part-of-speech distribution of syntactically ambiguous words without explicitly tagging the underlying text corpus. This is achieved by assuming that the word pair consisting of the left and right neighbor of a particular token is characteristic of the part of speech at this position, and by clustering the neighbor pairs on the basis of their middle words as observed in a large corpus.
4p
hongvang_1
16-04-2013
39
1
-
This paper presents a speech understanding component for enabling robust situated human-robot communication. The aim is to gain semantic interpretations of utterances that serve as a basis for multi-modal dialog management also in cases where the recognized word-stream is not grammatically correct. For the understanding process, we designed semantic processable units, which are adapted to the domain of situated communication. Our framework supports the specific characteristics of spontaneous speech used in combination with gestures in a real world scenario. ...
8p
hongvang_1
16-04-2013
41
1
-
In this work we address the problem of unsupervised part-of-speech induction by bringing together several strands of research into a single model. We develop a novel hidden Markov model incorporating sophisticated smoothing using a hierarchical Pitman-Yor process prior, providing an elegant and principled means of incorporating lexical characteristics.
10p
hongdo_1
12-04-2013
37
4
-
Conventional Automated Essay Scoring (AES) measures may cause severe problems when applied directly to scoring Automatic Speech Recognition (ASR) transcriptions, as they are error-sensitive and unsuited to the characteristics of ASR transcriptions. Therefore, we introduce a Finite State Transducer (FST) framework to avoid these shortcomings.
10p
nghetay_1
07-04-2013
56
2
-
Homer, in all probability, knew no rules of rhetoric, and was not tortured with the consideration of grammatical construction, and yet his verse will endure through time. If everybody possessed the genius of Homer, rules and cautions in writing would be unnecessary. To-day all men speak, and most men write, but it is observed that those who most closely follow Homer's method of writing without rules are most unlike Homer in the results. The ancient bard was a law unto himself; we need rules for our guidance.
188p
culao1122
08-01-2013
59
5