Future speech interfaces with sensors and machine intelligence

Speech is the most spontaneous and natural means of communication, as well as the preferred modality for interacting with mobile or fixed electronic devices, but speech in-terfaces have drawbacks, such as a lack of user privacy; non-inclusivity for certain users; poor robustness in noisy conditions;...

Descripción completa

Detalles Bibliográficos
Otros Autores: Denby, Bruce , editor (editor), Wand, Michael , editor, Csapó, Tamás Gábor, editor
Formato: Libro electrónico
Idioma:Inglés
Publicado: Basel : MDPI [2023]
Materias:
Ver en Biblioteca Universitat Ramon Llull:https://discovery.url.edu/permalink/34CSUC_URL/1im36ta/alma991009740850806719
Tabla de Contenidos:
  • Bruce Denby, Tam ´as G ´abor Csap ´o and Michael Wand
  • Future Speech Interfaces with Sensors and Machine Intelligence
  • Reprinted from: Sensors 2023, 23, 1971, doi:10.3390/s23041971 1
  • Wentao Yu, Steffen Zeiler and Dorothea Kolossa
  • Reliability-Based Large-Vocabulary Audio-Visual Speech Recognition
  • Reprinted from: Sensors 2022, 22, 5501, doi:10.3390/s22155501 7
  • Sanghun Jeon and Mun Sang Kim
  • Noise-Robust Multimodal Audio-Visual Speech Recognition System for Speech-Based
  • Interaction Applications
  • Reprinted from: Sensors 2022, 22, 7738, doi:10.3390/s22207738 27
  • Sanghun Jeon and Mun Sang Kim
  • End-to-End Lip-Reading Open Cloud-Based Speech Architecture
  • Reprinted from: Sensors 2022, 22, 2938, doi:10.3390/s22082938 55
  • Sanghun Jeon and Mun Sang Kim
  • End-to-End Sentence-Level Multi-View Lipreading Architecture with Spatial Attention Module
  • Integrated Multiple CNNs and Cascaded Local Self-Attention-CTC
  • Reprinted from: Sensors 2022, 22, 3597, doi:10.3390/s22093597 77
  • Beiming Cao, Alan Wisler and Jun Wang
  • Speaker Adaptation on Articulation and Acoustics for Articulation-to-Speech Synthesis
  • Reprinted from: Sensors 2022, 22, 6056, doi:10.3390/s22166056 105
  • Tam´as G ´abor Csap ´o, G ´abor Gosztolya, L ´aszl ´o T´oth, Amin Honarmandi Shandiz and
  • Alexandra Mark ´o
  • Optimizing the Ultrasound Tongue Image Representation for Residual Network-Based
  • Articulatory-to-Acoustic Mapping
  • Reprinted from: Sensors 2022, 22, 8601, doi:10.3390/s22228601 121
  • David Ferreira, Samuel Silva, Francisco Curado and Ant ´onio Teixeira
  • Exploring Silent Speech Interfaces Based on Frequency-Modulated Continuous-Wave Radar
  • Reprinted from: Sensors 2022, 22, 649, doi:10.3390/s22020649 135
  • Sanghun Jeon, Ahmed Elsharkawy and Mun Sang Kim
  • Lipreading Architecture Based on Multiple Convolutional Neural Networks for Sentence-Level
  • Visual Speech Recognition
  • Reprinted from: Sensors 2022, 22, 72, doi:10.3390/s22010072 153
  • Alan Wrench and Jonathan Balch-Tomes
  • Beyond the Edge: Markerless Pose Estimation of Speech Articulators from Ultrasound and
  • Camera Images Using DeepLabCut
  • Reprinted from: Sensors 2022, 22, 1133, doi:10.3390/s22031133 173
  • Dan Oneat,a, Be ´ ˘ ata L ˝orincz, Adriana Stan and Horia Cucu
  • FlexLip: A Controllable Text-to-Lip System
  • Reprinted from: Sensors 2022, 22, 4104, doi:10.3390/s22114104 201
  • Laith H. Baniata, Isaac. K. E. Ampomah and Seyoung Park
  • A Transformer-Based Neural Machine Translation Model for Arabic Dialects That Utilizes
  • Subword Units
  • Reprinted from: Sensors 2021, 21, 6509, doi:10.3390/s21196509 217.