Temat: text-to-speech - Prolib Integro

Skocz do pozycji: 1.

Tytuł:: A STUDY OF TEXT-TO-SPEECH (TTS) IN CHILDREN’S ENGLISH LEARNING
Autorzy:: Huang, Yi-Ching
Liao, Lung-Chuan
Tematy:: Text-to-Speech
English spelling
self-directed learning; Pokaż więcej
Wydawca:: Uniwersytet Marii Curie-Skłodowskiej w Lublinie. IATEFL Poland Computer Special Interest Group
Powiązania:: https://bibliotekanauki.pl/articles/955941.pdf Link otwiera się w nowym oknie
Opis:: The purpose of this study was to explore the effects of the digital material incorporated into Text-to-Speech system for students’ English spelling. The digital material was made on the basis of the Spelling Bee vocabulary list (approximately 300 words) issued by the selected school. 21 third graders from a private bilingual school in Taiwan were selected for this study. This study employed four data collection techniques, including questionnaire, pre-test and post-test, informal observation and interview, and semi-structured individual interviews. The research results showed that the use of digital material fostered the students’ English spelling ability and their self-directed learning.
Dostawca treści:: Biblioteka Nauki

Artykuł

na półce

Skocz do pozycji: 2.

Tytuł:: Implementation of Polish speech synthesis for the BOSS system
Autorzy:: Demenko, G.
Mobius, B.
Klessa, K.
Tematy:: speech synthesis
text-to-speech (TTS)
unit selection
duration prediction; Pokaż więcej
Wydawca:: Polska Akademia Nauk. Czytelnia Czasopism PAN
Powiązania:: https://bibliotekanauki.pl/articles/200686.pdf Link otwiera się w nowym oknie
Opis:: The Bonn Open Synthesis System (BOSS) is an open-source software for the unit selection speech synthesis that has been used for the generation of high-quality German and Dutch speech. This article presents ongoing research and development aimed at adapting BOSS to the Polish language. In the first section, the origins and workings of the unit selection method for speech synthesis are explained. Section two details the structure of the Polish corpus and its segmental and prosodic annotation. The subsequent sections focus on the implementation of Polish TTS modules in the BOSS architecture (duration prediction and cost function) and the steps involved in preparing a new speech corpus for BOSS.
Dostawca treści:: Biblioteka Nauki

Artykuł

na półce

Skocz do pozycji: 3.

Tytuł:: Conditional Random Fields Applied to Arabic Orthographic-Phonetic Transcription
Autorzy:: Cherifi, El-Hadi
Guerti, Mhania
Tematy:: Orthographic-To-Phonetic Transcription
Conditional Random Fields
text-to-speech
Arabic speech synthesis
Modern Standard Arabic; Pokaż więcej
Wydawca:: Polska Akademia Nauk. Czasopisma i Monografie PAN
Powiązania:: https://bibliotekanauki.pl/articles/1953489.pdf Link otwiera się w nowym oknie
Opis:: Orthographic-To-Phonetic (O2P) Transcription is the process of learning the relationship between the written word and its phonetic transcription. It is a necessary part of Text-To-Speech (TTS) systems and it plays an important role in handling Out-Of-Vocabulary (OOV) words in Automatic Speech Recognition systems. The O2P is a complex task, because for many languages, the correspondence between the orthography and its phonetic transcription is not completely consistent. Over time, the techniques used to tackle this problem have evolved, from earlier rules based systems to the current more sophisticated machine learning approaches. In this paper, we propose an approach for Arabic O2P Conversion based on a probabilistic method: Conditional Random Fields (CRF). We discuss the results and experiments of this method apply on a pronunciation dictionary of the Most Commonly used Arabic Words, a database that we called (MCAW-Dic). MCAW-Dic contains over 35 000 words in Modern Standard Arabic (MSA) and their pronunciation, a database that we have developed by ourselves assisted by phoneticians and linguists from the University of Tlemcen. The results achieved are very satisfactory and point the way towards future innovations. Indeed, in all our tests, the score was between 11 and 15% error rate on the transcription of phonemes (Phoneme Error Rate). We could improve this result by including a large context, but in this case, we encountered memory limitations and calculation difficulties.
Dostawca treści:: Biblioteka Nauki

Artykuł

na półce

Skocz do pozycji: 4.

Tytuł:: BRINGING TTS SOFTWARE INTO THE CLASSROOM: THE EFFECT OF USING TEXT TO SPEECH SOFTWARE IN TEACHING READING FEATURES
Autorzy:: Meihami, Hussein
Husseini, Fateme
Tematy:: Computer-Assisted Language Learning
reading fluency
speech-to-talk software
teaching reading
text-to-speech; Pokaż więcej
Wydawca:: Uniwersytet Marii Curie-Skłodowskiej w Lublinie. IATEFL Poland Computer Special Interest Group
Powiązania:: https://bibliotekanauki.pl/articles/955750.pdf Link otwiera się w nowym oknie
Opis:: The aim of this experimental research is to investigate the effect of using Text-To-Speech Software (TTS), one of Computer Assisted Language Learning (CALL) resources in teaching reading, in particular, different aspects of reading fluency. In this study we investigated teaching and learning of word stress, word intonation, pitch contour, and fluency of English reading through TTS. It should be stated that comprehension had been a part of the program but wasn’t investigated in the study. The study indicated that word stress, word intonation, pitch contour, and fluency have significantly improved as a result of using TTS software.
Dostawca treści:: Biblioteka Nauki

Artykuł

na półce

Skocz do pozycji: 5.

Tytuł:: Harry Potter i Kamień Filozoficzny słowem malowany - czyli badanie odbioru filmu z audiodeskrypcją z syntezą mowy.
Harry Potter and the Philosophers Stone painted with words: research into reception of the film with text-to-speech audio description
Autorzy:: Drożdż-Kubik, Justyna
Opis:: Tematem niniejszej pracy jest niekonwencjonalna audiodeskrypcja, od tradycyjnej audiodeskrypcji różniąca się zastąpieniem lektora syntezatorem mowy. Zbadanie odbioru dubbingowanej zagranicznej produkcji audiowizualnej pt. „Harry Potter i Kamień Filozoficzny” z audiodeskrypcją czytaną głosem syntetycznym jest natomiast głównym celem pracy. Taka audiodeskrypcja została odczytana młodzieży z Ośrodka dla Dzieci Niewidomych i Słabowidzących w Krakowie w styczniu 2010 roku. W pracy przedstawione są wyniki badania sprawdzające opinię wychowanków dotyczącą wprowadzenia rozwiązania polegającego na odczytywaniu audiodeskrypcji przez syntezator mowy na stałe lub tylko na pewien czas, do momentu rozpowszechnienia się usługi audiodeskrypcji i zastąpienia syntezatora mowy lektorem. Osoby przeprowadzające badanie zainteresowane były również sprawdzeniem wpływu zredukowanej treści skryptu audiodeskrypcji, dokonanej w wyniku ograniczeń technicznych syntezatora mowy, na odbiór szczegółowości audiodeskrypcji.
An unconventional audio description that is audio description read out by a speech synthesis software, is a core topic of this master dissertation. The main aim of the dissertation is to study the perception of the text-to-speech audio description among visually impaired teenagers. In a study conducted in January 2010, blind and partially sighted teenagers from the education centre in Kraków listened to the audio described dubbed version of the Harry Potter and the Philosopher’s Stone. In the dissertation their opinion on the changed audio description script, which was altered in order to be read out by a speech synthesizer is presented together with their view on the possible acceptance of introducing text-to-speech audio description as a temporary and permanent option.
Dostawca treści:: Repozytorium Uniwersytetu Jagiellońskiego

Inne

na półce

Skocz do pozycji: 6.

Tytuł:: Inférences au pays de la prosodie
Inferences in the land of prosody
Autorzy:: Banyś, Wiesław
Tematy:: Implication
entailment
presupposition
implicature
prosody
theme
rheme
two-way implicative verbs
one-way implicative verbs
text-to-speech synthesis; Pokaż więcej
Wydawca:: Wydawnictwo Uniwersytetu Śląskiego
Powiązania:: https://bibliotekanauki.pl/articles/31341247.pdf Link otwiera się w nowym oknie
Opis:: When it comes to language, it’s not just the grammatical structure and the literal meaning of words that matter. The way a predicate imposes inferences on its propositional arguments is crucial to understanding the true meaning of a message. However, these inferences are influenced by many factors, such as prosody, world knowledge, speakers’ expectations regarding language use, situational stereotypes, and other implicit or contextual elements.In this paper, we examine the inferential status of the verbs referred to by Karttunen as “implicative verbs”. On the one hand, two-way implicative verbs, and on the other, one-way implicative verbs. The latter have been little studied from this perspective. This will lead us to highlight the fundamental role, too often forgotten, of prosody and focus/theme in this type of analysis and in determining the inferential status of predicates. Our analyses show that, once prosody has been considered, the classification of theoretically possible verb inferences accepted until now needs to be modified. We do not have 4 groups of one-way implicative verbs, as has been argued, but 2, namely the groups: [+/+/– // –/–] [affirmed > true or false // denied > false] of the type être capable, pouvoir and [+/+/– // –/+] [affirmed > true or false // denied > true] of the type hésiter à.The remaining two groups, considered as distinct and autonomous one-way implicative verbs with the suggested characteristics: [+ + // – +/–] [affirmed > true // denied > true or false] of the type forcer to and [+ – // – +/–] [affirmed > false // denied > true or false] of the type refuser de, belong to the canonical groups of two-way implicative verbs, respectively: forcer à to the group of verbs of the type réussir à: [+/+ // –/–] [affirmed > true // denied > false] and refuser de to the group of verbs of the type oublier de: [+/– // –/+] affirmed > false // denied > true].Naturally, this classification differentiation, important as it is, only reflects the different behaviour of certain types of predicate, and this is the most important element of these analyses with a view to automating the recognition of predicate inferences.
Dostawca treści:: Biblioteka Nauki

Artykuł

na półce

Skocz do pozycji: 7.

Tytuł:: Implementation of a voice-controlled smart home system
Implementacja sterowania głosem w inteligentnym domu
Autorzy:: Trela, Grzegorz
Opis:: Praca opisuje proces powstawania autorskiego systemu sterowania inteligentnym domem za pomocą komend głosowych wydawanych w języku polskim. Przedstawia asystentów głosowych takich jak: Amazon Alexa, Google Assistant, Samsung Bixby, Apple Siri oraz Microsoft Cortana. Prezentuje instalację wspominanych produktów na Raspberry Pi 3 B, a także tłumaczy poszczególne etapy implementacji opracowanego rozwiązania, napisanego w języku Python. Wyjaśnia sposób działania użytych narzędzi programistycznych takich jak: Snowboy, Wit.ai czy eSpeak. Ponadto podsumowuje wyniki przeprowadzonych testów, podkreślając wyróżniające cechy stworzonego asystenta na tle istniejącej konkurencji.
This thesis describes the process of creating an original voice-controlled smart home system supporting commands given in Polish. The first part contains a description of the following voice assistants: Amazon Alexa, Google Assistant, Samsung Bixby, Apple Siri and Microsoft Cortana. It provides a step by step tutorial on the installation of each of the aforementioned services on Raspberry Pi 3 B and describes various stages of the developed solution's implementation, written in Python. Furthermore, the thesis elaborates on the way development kits such as Snowboy, Wit.ai or eSpeak work. The last part of the paper provides the reader with a summary of the results, stressing the unique features of the created solution in comparison to other systems.
Dostawca treści:: Repozytorium Uniwersytetu Jagiellońskiego

Inne

na półce

Skocz do pozycji: 8.

Tytuł:: Zapisywanie symultaniczne - adekwatna forma wspierania edukacji, pracy oraz udziału w życiu społecznym i kulturalnym osób niesłyszących i słabosłyszących
Autorzy:: Domagała-Zyśk, Ewa
Wydawca:: Wydawnictwo Uniwersytetu Marii Curie-Skłodowskiej w Lublinie
Cytata wydawnicza:: Domagała-Zyśk E. (2017). Zapisywanie symultaniczne – adekwatna forma wspierania edukacji, pracy oraz udziału w życiu społecznym i kulturalnym osób niesłyszących i słabosłyszących. Lubelski Rocznik Pedagogiczny, 2,105-114.
Dostawca treści:: Repozytorium Centrum Otwartej Nauki

Artykuł

na półce

Skocz do pozycji: 9.

Tytuł:: Zarządzanie rozwojem systemów rozpoznawania mowy: problemy wydajności
Autorzy:: Kuligowska, Karolina
Kisielewicz, Paweł
Włodarz, Aleksandra
Tematy:: speech recognition system
speech-to-text performance
STT development
system rozpoznawania mowy
wydajność rozpoznawania mowy
rozwój rozpoznawania mowy; Pokaż więcej
Wydawca:: Uniwersytet Marii Curie-Skłodowskiej. Wydawnictwo Uniwersytetu Marii Curie-Skłodowskiej
Powiązania:: https://bibliotekanauki.pl/articles/610555.pdf Link otwiera się w nowym oknie
Opis:: Speech recognition enables the transformation of spoken words and sentences into text in digital form. This technology is a subject of numerous studies and commercial development for many years. The aim of this paper is to examine performance issues of speech recognition and to manage the development in this field. Thorough analysis of performance limitations of speech recognition systems we identified main 11 issues to overcome. They indicate the direction of managing development of speech recognition systems.
Rozpoznawanie mowy umożliwia przekształcanie wypowiadanych słów i zdań w tekst w formie cyfrowej. Technologia ta jest od wielu lat przedmiotem licznych badań naukowych oraz komercyjnych. Celem niniejszego artykułu jest zbadanie zagadnień dotyczących wydajności systemów rozpoznawania mowy i zarządzanie rozwojem tych systemów. Dogłębna analiza w zakresie ograniczeń wydajnościowych systemów rozpoznawania mowy pozwoliła na zidentyfikowanie problemów, które trzeba przezwyciężyć. Wskazują one kierunek zmian w zarządzaniu rozwojem systemów rozpoznawania mowy.
Dostawca treści:: Biblioteka Nauki

Artykuł

na półce

Skocz do pozycji: 10.

Tytuł:: Zapisywanie symultaniczne - adekwatna forma wspierania edukacji, pracy oraz udziału w życiu społecznym i kulturalnym osób niesłyszących i słabosłyszących
Autorzy:: Domagała-Zyśk, Ewa
Tematy:: deaf, hard of hearing, speech-to-text reporting, velotype, stenography, re-speaking
niesłyszący, słabosłyszący, zapisywanie symultaniczne, velotypia, stenotypia, respeaking; Pokaż więcej
Wydawca:: Uniwersytet Marii Curie-Skłodowskiej. Wydawnictwo Uniwersytetu Marii Curie-Skłodowskiej
Powiązania:: https://bibliotekanauki.pl/articles/606767.pdf Link otwiera się w nowym oknie
Opis:: This article presents speech-to-text reporting as an adequate form of support for education, work and full participation in social and cultural life of the deaf and hard of hearing. This form of support is known in Western Europe since the 80s of the twentieth century. However, there are a lot of discussions nowadays as for its scope and financing as well as the most effective forms of precise speech recording. The article discusses different types of speech-to-text reporting, shows the rules for creating the recordings and points out the advantages and difficulties of using this service.
Artykuł przedstawia usługę zapisywania symultanicznego jako adekwatną formę wsparcia edukacji, pracy zawodowej oraz pełnego uczestnictwa w życiu społecznym i kulturalnym osób niesłyszących i słabosłyszących. Taki rodzaj pomocy znany jest w Europie Zachodniej od lat 80. XX wieku, jednak wciąż trwają dyskusje nad możliwym zakresem jej stosowania i finansowania, a także jak najbardziej efektywnymi metodami precyzyjnego zapisu mowy. W artykule omówiono różne typy zapisywania symultanicznego, przedstawiono zasady tworzenia zapisu symultanicznego oraz wskazano na zalety i trudności stosowania tej usługi.
Dostawca treści:: Biblioteka Nauki

Artykuł

na półce

Informacja

Wyszukujesz frazę "text-to-speech" wg kryterium: Temat