
Speech and Computer: 18th International Conference, SPECOM 2016, Budapest, Hungary, August 23-27, 2016, Proceedings

Ronzhin, Andrey ; Potapova, Rodmonga ; Németh, Géza ; Hutchison, David (Editor) ; Kanade, Takeo (Editor) ; Kittler, Josef (Editor) ; Kleinberg, Jon M (Editor) ; Mattern, Friedemann (Editor) ; Mitchell, John C (Editor) ; Naor, Moni (Editor) ; Pandu Rangan, C (Editor) ; Steffen, Bernhard (Editor) ; Terzopoulos, Demetri (Editor) ; Tygar, Doug (Editor) ; Weikum, Gerhard (Editor) ; Ronzhin, Andrey (Editor) ; Potapova, Rodmonga (Editor) ; Németh, Géza (Editor)

Lecture Notes in Computer Science

ISBN: 9783319439570 ; ISBN: 331943957X ; E-ISBN: 9783319439587 ; E-ISBN: 3319439588 ; DOI: 10.1007/978-3-319-43958-7

  • Title:
    Speech and Computer: 18th International Conference, SPECOM 2016, Budapest, Hungary, August 23-27, 2016, Proceedings
  • Author: Ronzhin, Andrey ; Potapova, Rodmonga ; Németh, Géza
  • Hutchison, David (Editor) ; Kanade, Takeo (Editor) ; Kittler, Josef (Editor) ; Kleinberg, Jon M (Editor) ; Mattern, Friedemann (Editor) ; Mitchell, John C (Editor) ; Naor, Moni (Editor) ; Pandu Rangan, C (Editor) ; Steffen, Bernhard (Editor) ; Terzopoulos, Demetri (Editor) ; Tygar, Doug (Editor) ; Weikum, Gerhard (Editor) ; Ronzhin, Andrey (Editor) ; Potapova, Rodmonga (Editor) ; Németh, Géza (Editor)
  • Subjects: Computer Science ; Artificial Intelligence (Incl. Robotics) ; Information Systems Applications (Incl. Internet) ; Pattern Recognition ; Information Storage and Retrieval ; Image Processing and Computer Vision ; Database Management ; Engineering
  • Is part of: Lecture Notes in Computer Science
  • Description: Intro -- Preface -- Organization -- Acknowledgments -- Contents -- Invited Talks -- Automatic Speech Recognition Based on Neural Networks -- 1 Introduction -- 2 Acoustic Model Integration -- 3 Neural Network Topologies -- 4 Training Criteria -- 5 Regularization -- 6 Optimization -- 7 Input Features -- 8 Multilingual Modeling -- 9 Adaptation and Normalization -- 10 Neural Network Based Language Modeling -- 11 Recent Developments - Integrated Modeling -- 12 Conclusions -- References -- Machine Processing of Dialogue States -- Speculations on Conversational Entropy -- 1 Introduction -- 2 The Herme Dialogues -- 3 Conversational Speech Synthesis -- 3.1 A Talking Fridge -- 3.2 Entropy (An Interlude) -- 4 A Notion of Conversational Entropy -- 5 Social Interactions and Signal Processing -- 5.1 Natural Human-Machine Conversational Interaction -- 6 Conclusions -- References -- Speech Recognition Challenges in the Car Navigation Industry -- Abstract -- 1 Infotainment in a Car -- 2 Most Popular Speech Features in a Car -- 3 Talking Cars: A Few TTS Questions in Car Navigation -- 3.1 Who has the Right to Speak? -- 3.2 Pre-recorded Voice or TTS? -- 3.3 Timing and Verbosity -- 3.4 Is Phonetical Data Always Beneficial for the Driver?
-- 3.5 Grammatically Correct Monolingual Sentences -- 3.6 Multilingual Sentences -- 3.7 Sound Quality -- 4 When the Driver is Speaking: ASR Solutions in Navigation -- 4.1 Contexts for On-board Recognizers -- 4.1.1 Voice Commands -- 4.1.2 Destination Entry -- 4.1.3 Address Points -- 4.2 Contexts for Server-Based Recognizers -- 4.2.1 Voice Commands on Client Side -- 4.2.2 Destination Entry on Client Side -- 5 Dialogue Systems in a Car -- 5.1 Professional Approach -- 5.2 Unique Dialogue System on Navigation Side -- 5.3 Similar Designs in the Industry -- 6 Limitations and Sub-optimal ASR Features -- 6.1 Connected Cars are Coming Soon -- 6.2 Separate Address Recognition and Address Disambiguation -- 6.3 Recognition of All Address Points in One-Shot -- 6.4 Address Search with NLU at Top Level -- 6.5 Places in a City -- 6.6 Step-by-Step Address Entry with the One-Shot Context -- 7 Open Questions -- 7.1 Address Entry in India -- 7.2 Fixing a Wrong Address in the USA -- 8 Vision -- Conference Papers -- A Comparison of Acoustic Features of Speech of Typically Developing Children and Children with Autis ... -- Abstract -- 1 Introduction -- 2 Method -- 2.1 Data Collection -- 2.2 Data Analysis -- 3 Result -- 3.1 Acoustic Features of TD and ASD Child Emotional Speech -- 3.2 Acoustic Features of TD and ASD Spontaneous Child Speech -- 3.3 Acoustic Features of TD and ASD Repetition vs. Spontaneous Child Speech -- 4 Discussion -- 5 Conclusions -- Acknowledgements -- References -- A Deep Neural Networks (DNN) Based Models for a Computer Aided Pronunciation Learning System -- Abstract -- 1 Introduction -- 2 Baseline System Description -- 2.1 Enhancements in Proposed System -- 3 Data Description -- 4 Experimental Setup -- 5 Results and Discussion -- 6 C.
  • Place of publication: Cham: Springer International Publishing
  • Publication year: 2016
  • Language: English
  • Identifier: ISBN: 9783319439570 ; ISBN: 331943957X ; E-ISBN: 9783319439587 ; E-ISBN: 3319439588 ; DOI: 10.1007/978-3-319-43958-7
