Speech Etiquette in Slavic Online Communities
by Lilia Duskaeva
This edited book focuses on speech etiquette, examining the rules that govern communication in various online communities: professional, female, and ethnospecific. The contributors analyze online communication in the Slavic languages Russian, Slovak, Polish, and Belarusian, showing how the concept of speech etiquette differs from the concept of politeness, although both reflect the relationship between people in interaction. Online communities are united on the basis of common informative or phatic illocutions among their participants, and their speech etiquette is manifested in stable forms of conducting discussions: stimulating and responding. Each group has its own ideas of unacceptable speech behavior and approaches to sanitation, and the rules of speech etiquette in each group determine the degree of rapport and distancing between the participants in discourse. The chapters in this book explore how rapport and distance are established through acts such as showing attention to the addressee and increasing his or her communicative status; reducing or increasing the illocutionary power of evaluations and motivations; and evaluating one’s own or someone else’s speech. The volume will be of interest to researchers studying online communication in such diverse fields as linguistics, sociology, anthropology, programming, and media studies.
Speech Processing and Soft Computing (SpringerBriefs in Speech Technology)
by Sid-Ahmed Selouani
Speech Processing and Soft Computing covers the synergy between speech technology and bio-inspired soft computing methods. Through practical cases, the author examines how soft computing may complement conventional techniques in speech enhancement and speech recognition in order to provide robust systems. The material is especially useful to graduate students and experienced researchers who are interested in expanding their horizons and investigating new research directions through a review of the theoretical and practical settings of soft computing methods in very recent speech applications.
Speech Processing in Embedded Systems
by Priyabrata Sinha
Speech processing has rapidly emerged as one of the most widespread and well-understood application areas in the broader discipline of digital signal processing. Besides the telecommunications applications that have hitherto been the largest users of speech processing algorithms, several non-traditional embedded processor applications are enhancing their functionality and user interfaces by utilizing various aspects of speech processing. "Speech Processing in Embedded Systems" describes several areas of speech processing, and the various algorithms and industry standards that address each of these areas. The topics covered include different types of speech compression, echo cancellation, noise suppression, speech recognition, and speech synthesis. In addition, this book explores various issues and considerations related to efficient implementation of these algorithms on real-time embedded systems, including the role played by processor CPU and peripheral functionality.
Speech Processing in Mobile Environments
by K. Sreenivasa Rao, Anil Kumar Vuppala
This book focuses on speech processing in the presence of low-bit-rate coding and varying background environments. The methods presented in the book exploit speech events that are robust in noisy environments. Accurate estimation of these crucial events is useful for carrying out various speech tasks such as speech recognition, speaker recognition, and speech rate modification in mobile environments. The authors provide insights into designing and developing robust methods to process speech in mobile environments. The book covers temporal and spectral enhancement methods to minimize the effect of noise, and examines methods and models for speech and speaker recognition applications in mobile environments.
Speech Recognition Using Articulatory and Excitation Source Features
by K. Sreenivasa Rao, Manjunath K E
This book discusses the contribution of articulatory and excitation source information in discriminating sound units. The authors focus on the excitation source component of speech, and on the dynamics of various articulators during speech production, to enhance speech recognition (SR) performance. Speech recognition is analyzed for read, extempore, and conversation modes of speech. Five groups of articulatory features (AFs) are explored for speech recognition, in addition to conventional spectral features. Each chapter provides the motivation for exploring the specific feature for the SR task, discusses the methods to extract those features, and finally suggests appropriate models to capture the sound-unit-specific knowledge from the proposed features. The authors close by discussing various combinations of spectral, articulatory, and source features, and the desired models to enhance the performance of SR systems.
Speech Spectrum Analysis
by Sean A. Fulop
The accurate determination of the speech spectrum, particularly for short frames, is commonly pursued in diverse areas including speech processing, recognition, and acoustic phonetics. With this book the author makes the subject of spectrum analysis understandable to a wide audience, including those with a solid background in general signal processing and those without such background. In keeping with these goals, this is not a book that replaces or attempts to cover the material found in a general signal processing textbook. Some essential signal processing concepts are presented in the first chapter, but even there the concepts are presented in a generally understandable fashion as far as is possible. Throughout the book, the focus is on applications to speech analysis; mathematical theory is provided for completeness, but these developments are set off in boxes for the benefit of those readers with sufficient background. Other readers may proceed through the main text, where the key results and applications are presented in general heuristic terms and illustrated with software routines and practical "show-and-tell" discussions of the results. At some points, the book refers to and uses the implementations in the Praat speech analysis software package, which has the advantages that it is used by many scientists around the world and is free and open source software. At other points, special software routines have been developed and made available to complement the book, and these are provided in the Matlab programming language. A reader who has the basic Matlab package will be able to implement the programs immediately on that platform; no extra "toolboxes" are required.
Speech Technology: Theory and Applications (Wiley Series In Agent Technology Ser. #11)
by Kristiina Jokinen, Fang Chen
This book gives an overview of the research and application of speech technologies in different areas. One of its special characteristics is that the authors take a broad view of the multiple research areas and a multidisciplinary approach to the topics. One of the goals of the book is to emphasize application: user experience, human factors, and usability issues are its focus.
Speech and Audio Processing for Coding, Enhancement and Recognition
by Tokunbo Ogunfunmi, Roberto Togneri, Madihally Sim Narasimha
This book describes the basic principles underlying the generation, coding, transmission, and enhancement of speech and audio signals, including advanced statistical and machine learning techniques for speech and speaker recognition, with an overview of the key innovations in these areas. Key research undertaken in speech coding, speech enhancement, speech recognition, emotion recognition, and speaker diarization is also presented, along with recent advances and new paradigms in these areas.
Speech and Audio Processing: A MATLAB®-based Approach
by Ian Vince Mcloughlin
With this comprehensive and accessible introduction to the field, you will gain all the skills and knowledge needed to work with current and future audio, speech, and hearing processing technologies. Topics covered include mobile telephony, human-computer interfacing through speech, medical applications of speech and hearing technology, electronic music, audio compression and reproduction, big data audio systems and the analysis of sounds in the environment. All of this is supported by numerous practical illustrations, exercises, and hands-on MATLAB examples on topics as diverse as psychoacoustics (including some auditory illusions), voice changers, speech compression, signal analysis and visualisation, stereo processing, low-frequency ultrasonic scanning, and machine learning techniques for big data. With its pragmatic and application driven focus, and concise explanations, this is an essential resource for anyone who wants to rapidly gain a practical understanding of speech and audio processing and technology.
Speech and Computer
by Rodmonga Potapova, Alexey Karpov, Iosif Mporas
This book constitutes the refereed proceedings of the 16th International Conference on Speech and Computer, SPECOM 2014, held in Novi Sad, Serbia. The 56 revised full papers presented together with 3 invited talks were carefully reviewed and selected from 100 initial submissions. SPECOM is a conference with a long tradition that attracts researchers in the area of computer speech processing (recognition, synthesis, understanding, etc.) and related domains (including signal processing, language and text processing, multi-modal speech processing, and human-computer interaction).
Speech and Computer
by Andrey Ronzhin, Rodmonga Potapova, Nikos Fakotakis
This book constitutes the refereed proceedings of the 17th International Conference on Speech and Computer, SPECOM 2015, held in Athens, Greece, in September 2015. The 59 revised full papers presented together with 2 invited talks were carefully reviewed and selected from 104 initial submissions. The papers cover a wide range of topics in the area of computer speech processing, such as recognition, synthesis, and understanding, and related domains including signal processing, language and text processing, multi-modal speech processing, and human-computer interaction.
Speech and Computer
by Andrey Ronzhin, Rodmonga Potapova, Géza Németh
This book constitutes the refereed proceedings of the 16th International Conference on Speech and Computer, SPECOM 2014, held in Novi Sad, Serbia. The 56 revised full papers presented together with 3 invited talks were carefully reviewed and selected from 100 initial submissions. SPECOM is a conference with a long tradition that attracts researchers in the area of computer speech processing (recognition, synthesis, understanding, etc.) and related domains (including signal processing, language and text processing, multi-modal speech processing, and human-computer interaction).
Speech and Computer: 20th International Conference, SPECOM 2018, Leipzig, Germany, September 18–22, 2018, Proceedings (Lecture Notes in Computer Science #11096)
by Rodmonga Potapova, Alexey Karpov, Oliver Jokisch
This book constitutes the proceedings of the 20th International Conference on Speech and Computer, SPECOM 2018, held in Leipzig, Germany, in September 2018. The 79 papers presented in this volume were carefully reviewed and selected from 132 submissions. The papers present current research in the area of computer speech processing, including recognition, synthesis, understanding, and related domains such as signal processing, language and text processing, computational paralinguistics, multi-modal speech processing, and human-computer interaction.
Speech and Computer: 21st International Conference, SPECOM 2019, Istanbul, Turkey, August 20–25, 2019, Proceedings (Lecture Notes in Computer Science #11658)
by Albert Ali Salah, Rodmonga Potapova, Alexey Karpov
This book constitutes the proceedings of the 21st International Conference on Speech and Computer, SPECOM 2019, held in Istanbul, Turkey, in August 2019. The 57 papers presented were carefully reviewed and selected from 86 submissions. The papers present current research in the area of computer speech processing including audio signal processing, automatic speech recognition, speaker recognition, computational paralinguistics, speech synthesis, sign language and multimodal processing, and speech and language resources.
Speech and Computer: 22nd International Conference, SPECOM 2020, St. Petersburg, Russia, October 7–9, 2020, Proceedings (Lecture Notes in Computer Science #12335)
by Rodmonga Potapova, Alexey Karpov
This book constitutes the proceedings of the 22nd International Conference on Speech and Computer, SPECOM 2020, held in St. Petersburg, Russia, in October 2020. The 65 papers presented were carefully reviewed and selected from 160 submissions. The papers present current research in the area of computer speech processing including speech science, speech technology, natural language processing, human-computer interaction, language identification, multimedia processing, human-machine interaction, deep learning for audio processing, computational paralinguistics, affective computing, speech and language resources, speech translation systems, text mining and sentiment analysis, voice assistants, etc. Due to the coronavirus pandemic, SPECOM 2020 was held as a virtual event.
Speech and Computer: 23rd International Conference, SPECOM 2021, St. Petersburg, Russia, September 27–30, 2021, Proceedings (Lecture Notes in Computer Science #12997)
by Rodmonga Potapova, Alexey Karpov
This book constitutes the proceedings of the 23rd International Conference on Speech and Computer, SPECOM 2021, held in St. Petersburg, Russia, in September 2021. The 74 papers presented were carefully reviewed and selected from 163 submissions. The papers present current research in the area of computer speech processing including audio signal processing, automatic speech recognition, speaker recognition, computational paralinguistics, speech synthesis, sign language and multimodal processing, and speech and language resources. Due to the COVID-19 pandemic, SPECOM 2021 was held as a hybrid event.
Speech and Computer: 24th International Conference, SPECOM 2022, Gurugram, India, November 14–16, 2022, Proceedings (Lecture Notes in Computer Science #13721)
by S. R. Mahadeva Prasanna, Alexey Karpov, K. Samudravijaya, Shyam S. Agrawal
This book constitutes the proceedings of the 24th International Conference on Speech and Computer, SPECOM 2022, held as a hybrid event in Gurugram, India, in November 2022. The 51 full and 9 short papers presented in this volume were carefully reviewed and selected from 99 submissions. The papers present current research in the area of computer speech processing including audio signal processing, automatic speech recognition, speaker recognition, computational paralinguistics, speech synthesis, sign language and multimodal processing, and speech and language resources.
Speech and Computer: 25th International Conference, SPECOM 2023, Dharwad, India, November 29 – December 2, 2023, Proceedings, Part I (Lecture Notes in Computer Science #14338)
by S. R. Mahadeva Prasanna, Alexey Karpov, Rajesh M. Hegde, K. Samudravijaya, Shyam S. Agrawal, K. T. Deepak
The two-volume proceedings set LNAI 14338 and 14339 constitutes the refereed proceedings of the 25th International Conference on Speech and Computer, SPECOM 2023, held in Dharwad, India, during November 29–December 2, 2023. The 94 papers included in these proceedings were carefully reviewed and selected from 174 submissions. They focus on all aspects of speech science and technology: automatic speech recognition; computational paralinguistics; digital signal processing; speech prosody; natural language processing; child speech processing; speech processing for medicine; industrial speech and language technology; speech technology for under-resourced languages; speech analysis and synthesis; speaker and language identification, verification and diarization.
Speech and Computer: 25th International Conference, SPECOM 2023, Dharwad, India, November 29 – December 2, 2023, Proceedings, Part II (Lecture Notes in Computer Science #14339)
by S. R. Mahadeva Prasanna, Alexey Karpov, Rajesh M. Hegde, K. Samudravijaya, Shyam S. Agrawal, K. T. Deepak
The two-volume proceedings set LNAI 14338 and 14339 constitutes the refereed proceedings of the 25th International Conference on Speech and Computer, SPECOM 2023, held in Dharwad, India, during November 29–December 2, 2023. The 94 papers included in these proceedings were carefully reviewed and selected from 174 submissions. They focus on all aspects of speech science and technology: automatic speech recognition; computational paralinguistics; digital signal processing; speech prosody; natural language processing; child speech processing; speech processing for medicine; industrial speech and language technology; speech technology for under-resourced languages; speech analysis and synthesis; speaker and language identification, verification and diarization.
Speech and Computer: 26th International Conference, SPECOM 2024, Belgrade, Serbia, November 25–28, 2024, Proceedings, Part I (Lecture Notes in Computer Science #15299)
by Alexey Karpov, Vlado Delić
The two-volume set LNAI 15299 and 15300 constitutes the refereed proceedings of the 26th International Conference on Speech and Computer, SPECOM 2024, held in Belgrade, Serbia, during November 25–28, 2024. The 53 full papers included in these proceedings were carefully reviewed and selected from 90 submissions. The book also contains two invited talks in full paper length. The papers are organized in the following topical sections: Volume I: Invited papers; automatic speech recognition; speech and language resources; speech synthesis and perception; and speech processing for medicine. Volume II: Computational paralinguistics; affective computing; speaker recognition; digital speech processing; natural language processing.
Speech and Computer: 26th International Conference, SPECOM 2024, Belgrade, Serbia, November 25–28, 2024, Proceedings, Part II (Lecture Notes in Computer Science #15300)
by Alexey Karpov, Vlado Delić
The two-volume set LNAI 15299 and 15300 constitutes the refereed proceedings of the 26th International Conference on Speech and Computer, SPECOM 2024, held in Belgrade, Serbia, during November 25–28, 2024. The 53 full papers included in these proceedings were carefully reviewed and selected from 90 submissions. The book also contains two invited talks in full paper length. The papers are organized in the following topical sections: Volume I: Invited papers; automatic speech recognition; speech and language resources; speech synthesis and perception; and speech processing for medicine. Volume II: Computational paralinguistics; affective computing; speaker recognition; digital speech processing; natural language processing.
Speech and Language Processing for Human-Machine Communications
by S. S. Agrawal, Amita Devi, Ritika Wason, Poonam Bansal
This volume comprises the select proceedings of the annual convention of the Computer Society of India. Divided into 10 topical volumes, the proceedings present papers on state-of-the-art research, surveys, and succinct reviews. The volumes cover diverse topics ranging from communications networks to big data analytics, and from system architecture to cyber security. This volume focuses on Speech and Language Processing for Human-Machine Communications. The contents of this book will be useful to researchers and students alike.
Speech and Language Technologies for Low-Resource Languages: First International Conference, SPELLL 2022, Kalavakkam, India, November 23–25, 2022, Proceedings (Communications in Computer and Information Science #1802)
by Thomas Mandl, Anand Kumar M, Bharathi Raja Chakravarthi, Bharathi B, Colm O’Riordan, Hema Murthy, Thenmozhi Durairaj
This book constitutes the refereed proceedings of the First International Conference on Speech and Language Technologies for Low-Resource Languages, SPELLL 2022, held in Kalavakkam, India, in November 2022. The 25 presented papers were thoroughly reviewed and selected from 70 submissions. The papers are organised in the following topical sections: language resources; language technologies; speech technologies; multimodal data analysis; fake news detection in low-resource languages (regional-fake); low-resource cross-domain, cross-lingual and cross-modal offensive content analysis (LC4).
Speech and Language Technologies for Low-Resource Languages: Second International Conference, SPELLL 2023, Perundurai, Erode, India, December 6–8, 2023, Revised Selected Papers (Communications in Computer and Information Science #2046)
by Preslav Nakov, Bharathi Raja Chakravarthi, Bharathi B, Miguel Ángel García Cumbreras, Salud María Jiménez Zafra, Malliga Subramanian, Kogilavani Shanmugavadivel
This book constitutes the refereed conference proceedings of the Second International Conference on Speech and Language Technologies for Low-Resource Languages, SPELLL 2023, held in Perundurai, Erode, India, during December 6–8, 2023. The 27 full papers and 6 short papers presented in this book were carefully reviewed and selected from 94 submissions. The papers are divided into the following topical sections: language resources; language technologies; speech technologies; and workshops (regional fake, MMLOW, LC4).
Speech-to-Speech Translation (SpringerBriefs in Computer Science)
by Yutaka Kidawara, Eiichiro Sumita, Hisashi Kawai
This book offers retrospective and prospective views of speech-to-speech translation, with detailed explanations of its component technologies: speech recognition, language translation, and speech synthesis. A speech-to-speech translation (S2S) system breaks language barriers, enabling any pair of people on the globe to communicate with each other, which is one of humankind's greatest dreams. People, society, and the economy connected by S2S will, without exception, demonstrate explosive growth. Japan initiated basic research on S2S in 1986; the idea then spread worldwide and was explored in depth by researchers over three decades. S2S applications can now be seen on smartphones and tablets around the world. Computational resources such as processors, memory, and wireless communication accelerate these computation-intensive systems, and the accumulation of digital speech and language data encourages recent approaches based on machine learning. Through field experiments following long research in laboratories, S2S systems have become well developed and are now ready to be used in daily life. A unique chapter of this book covers end-to-end evaluation, comparing system performance with human competence; the score from this evaluation indicates the effectiveness of the system. The book ends with one of the next focuses of S2S: simultaneous interpretation technology for lectures, broadcast news, and similar settings.