Temat: audio-visual speech recognition

Skocz do pozycji: 1.

Tytuł:: Audio-Visual Speech Processing System for Polish Applicable to Human-Computer Interaction
Autorzy:: Jadczyk, T.
Tematy:: audio-visual speech recognition
visual features extraction
human-computer interaction; Pokaż więcej
Wydawca:: Akademia Górniczo-Hutnicza im. Stanisława Staszica w Krakowie. Wydawnictwo AGH
Powiązania:: https://bibliotekanauki.pl/articles/305828.pdf Link otwiera się w nowym oknie
Opis:: This paper describes audio-visual speech recognition system for Polish language and a set of performance tests under various acoustic conditions. We first present the overall structure of AVASR systems with three main areas: audio features extraction, visual features extraction and subsequently, audiovisual speech integration. We present MFCC features for audio stream with standard HMM modeling technique, then we describe appearance and shape based visual features. Subsequently we present two feature integration techniques, feature concatenation and model fusion. We also discuss the results of a set of experiments conducted to select best system setup for Polish, under noisy audio conditions. Experiments are simulating human-computer interaction in computer control case with voice commands in difficult audio environments. With Active Appearance Model (AAM) and multistream Hidden Markov Model (HMM) we can improve system accuracy by reducing Word Error Rate for more than 30%, comparing to audio-only speech recognition, when Signal-to-Noise Ratio goes down to 0dB.
Dostawca treści:: Biblioteka Nauki

Artykuł

na półce

Skocz do pozycji: 2.

Tytuł:: An exploratory case study to investigate perceived pronunciation errors in Thai primary school students using audio-visual speech recognition
Autorzy:: Graham, Steven
Tematy:: Speech recognition software
audio visual
English
computer assisted language learning; Pokaż więcej
Wydawca:: Uniwersytet Marii Curie-Skłodowskiej w Lublinie. IATEFL Poland Computer Special Interest Group
Powiązania:: https://bibliotekanauki.pl/articles/2087181.pdf Link otwiera się w nowym oknie
Opis:: An explorative case study has been conducted at a small rural school in the north east of Thailand to investigate the pronunciation errors that primary school students make when reading English aloud. This paper illustrates the opportunities and challenges of employing speech recognition software in rural classrooms by using it with specifically designed audio-visual materials based on the Thai curriculum to identify English language reading and pronunciation difficulties. A comparison is made between this study and published literature.
Dostawca treści:: Biblioteka Nauki

Artykuł

na półce

Informacja

Wyszukujesz frazę "audio-visual speech recognition" wg kryterium: Temat