000 | 07716nam a22006615i 4500 | ||
---|---|---|---|
001 | 978-3-031-48312-7 | ||
003 | DE-He213 | ||
005 | 20240730202735.0 | ||
007 | cr nn 008mamaa | ||
008 | 231121s2023 sz | s |||| 0|eng d | ||
020 |
_a9783031483127 _9978-3-031-48312-7 |
||
024 | 7 |
_a10.1007/978-3-031-48312-7 _2doi |
|
050 | 4 | _aQ334-342 | |
050 | 4 | _aTA347.A78 | |
072 | 7 |
_aUYQ _2bicssc |
|
072 | 7 |
_aCOM004000 _2bisacsh |
|
072 | 7 |
_aUYQ _2thema |
|
082 | 0 | 4 |
_a006.3 _223 |
245 | 1 | 0 |
_aSpeech and Computer _h[electronic resource] : _b25th International Conference, SPECOM 2023, Dharwad, India, November 29 - December 2, 2023, Proceedings, Part II / _cedited by Alexey Karpov, K. Samudravijaya, K. T. Deepak, Rajesh M. Hegde, Shyam S. Agrawal, S. R. Mahadeva Prasanna. |
250 | _a1st ed. 2023. | ||
264 | 1 |
_aCham : _bSpringer Nature Switzerland : _bImprint: Springer, _c2023. |
|
300 |
_aXXVI, 568 p. 195 illus., 141 illus. in color. _bonline resource. |
||
336 |
_atext _btxt _2rdacontent |
||
337 |
_acomputer _bc _2rdamedia |
||
338 |
_aonline resource _bcr _2rdacarrier |
||
347 |
_atext file _bPDF _2rda |
||
490 | 1 |
_aLecture Notes in Artificial Intelligence, _x2945-9141 ; _v14339 |
|
505 | 0 | _aIndustrial Speech and Language Technology -- Analysing Breathing Patterns in Reading and Spontaneous Speech -- Audio-Visual Speaker Verification via Joint Cross Attention -- A Novel Scheme to Classify Read and Spontaneous Speech -- Analysis of a Hinglish ASR System's Performance for Fraud Detection -- Anomaly Detection in Speech: A Comprehensive Approach for Enhanced Speech Analysis -- CAPTuring Accents: An Approach to Personalize Pronunciation Training for Learners with Different L1 Backgrounds -- Speech Technology for Under-Resourced Languages -- Improvements in Language Modeling, Voice Activity Detection, and Lexicon in OpenASR21 Low Resource Languages -- Phone Durations Modeling for Livvi-Karelian ASR -- Significance of Indic Self-Supervised Speech Representations for Indic Under-Resourced ASR -- Study of Various End-to-End Keyword Spotting Systems on the Bengali language under Low-Resource Condition -- Bridging the Gap: Towards Linguistic Resource Development for the Low-Resource Lambani Language -- Studying the Effect of Frame-Level Concatenation of GFCC and TS-MFCC Features on Zero-Shot Children's ASR -- Code-Mixed Text-to-Speech Synthesis under Low-Resource Constraints -- An End-to-End TTS Model in Chhattisgarhi, a Low-Resource Indian Language -- An ASR Corpus in Chhattisgarhi, a Low Resource Indian Language -- Cross Lingual Style Transfer using Multiscale Loss Function for Soliga: A Low Resource Tribal Language -- Preliminary Analysis of Lambani Vowels and Vowel Classification using Acoustic Feature -- Curriculum Learning based Approach for Faster Convergence of TTS Model -- Rhythm Measures and Language Endangerment: the Case of Deori -- Konkani Phonetic Transcription System 1.0 -- Speech Analysis and Synthesis -- E-TTS: Expressive Text-to-Speech Synthesis for Hindi using Data Augmentation -- Direct vs Cascaded Speech-to-Speech Translation using Transformer -- Deep Learning based Speech Quality Assessment Focusing on Noise Effects -- Quantifying the Emotional Landscape of Music with Three Dimensions -- Analysis of Mandarin vs. English Language for Emotional Voice Conversion -- Audio DeepFake Detection Employing Multiple Parametric Exponential Linear Units -- A Comparison of Learned Representations with Jointly Optimized VAE and DNN for Syllable Stress Detection -- On the Asymptotic Behaviour of the Speech Signal -- Improvement of Audio-Visual Keyword Spotting System Accuracy using Excitation Source Feature -- Developing a Question Answering System on the material of Holocaust survivors' testimonies in Russian -- Enhancing Children's Short Utterance based ASV using Data Augmentation Techniques and Feature Concatenation Approach -- Studying the Effectiveness of Data Augmentation and Frequency-DomainLinear Prediction Coefficients in Children's Speaker Verification under Low-Resource Conditions -- Constant-Q based Harmonic and Pitch Features for Normal vs Pathological Infant Cry Classification -- Robustness of Whisper Features for Infant Cry Classification -- Speaker and Language Identification, Verification, and Diarization -- I-MSV 2022: Indic-Multilingual and Multi-Sensor Speaker Verification Challenge -- Multi-Task Learning over Mixup Variants for the Speaker Verification Task -- Exploring the Impact of Different Approaches for Spoken Dialect Identification of Konkani Language -- Adversarially Trained Hierarchical Attention Network for Domain-Invariant Spoken Language Identification -- Ensemble of Incremental System Enhancements for Robust Speaker Diarization in Code-Switched Real-Life Audios -- Enhancing Language Identification in Indian Context through Exploiting Learned Features with Wav2Vec2.0 -- Design and Development of Voice OTP Authentication System -- End-to-End Native Language Identification using a Modified Vision Transformer(ViT) from L2 English Speech -- Dialect Identification in Ao using Modulation-based Representation -- Self-Supervised Speaker Verification Employing Augmentation Mix and Self-Augmented Training-based Clustering. . | |
520 | _aThe two-volume proceedings set LNAI 14338 and 14339 constitutes the refereed proceedings of the 25th International Conference on Speech and Computer, SPECOM 2023, held in Dharwad, India, during November 29-December 2, 2023. The 94 papers included in these proceedings were carefully reviewed and selected from 174 submissions. They focus on all aspects of speech science and technology: automatic speech recognition; computational paralinguistics; digital signal processing; speech prosody; natural language processing; child speech processing; speech processing for medicine; industrial speech and language technology; speech technology for under-resourced languages; speech analysis and synthesis; speaker and language identification, verification and diarization. | ||
650 | 0 |
_aArtificial intelligence. _93407 |
|
650 | 0 |
_aImage processing _xDigital techniques. _94145 |
|
650 | 0 |
_aComputer vision. _9172811 |
|
650 | 0 |
_aComputer engineering. _910164 |
|
650 | 0 |
_aComputer networks . _931572 |
|
650 | 0 |
_aApplication software. _9172812 |
|
650 | 1 | 4 |
_aArtificial Intelligence. _93407 |
650 | 2 | 4 |
_aComputer Imaging, Vision, Pattern Recognition and Graphics. _931569 |
650 | 2 | 4 |
_aComputer Engineering and Networks. _9172813 |
650 | 2 | 4 |
_aComputer and Information Systems Applications. _9172814 |
700 | 1 |
_aKarpov, Alexey. _eeditor. _4edt _4http://id.loc.gov/vocabulary/relators/edt _9172815 |
|
700 | 1 |
_aSamudravijaya, K. _eeditor. _4edt _4http://id.loc.gov/vocabulary/relators/edt _9172816 |
|
700 | 1 |
_aDeepak, K. T. _eeditor. _4edt _4http://id.loc.gov/vocabulary/relators/edt _9172817 |
|
700 | 1 |
_aHegde, Rajesh M. _eeditor. _4edt _4http://id.loc.gov/vocabulary/relators/edt _9172818 |
|
700 | 1 |
_aAgrawal, Shyam S. _eeditor. _4edt _4http://id.loc.gov/vocabulary/relators/edt _9172819 |
|
700 | 1 |
_aPrasanna, S. R. Mahadeva. _eeditor. _4edt _4http://id.loc.gov/vocabulary/relators/edt _9172820 |
|
710 | 2 |
_aSpringerLink (Online service) _9172821 |
|
773 | 0 | _tSpringer Nature eBook | |
776 | 0 | 8 |
_iPrinted edition: _z9783031483110 |
776 | 0 | 8 |
_iPrinted edition: _z9783031483134 |
830 | 0 |
_aLecture Notes in Artificial Intelligence, _x2945-9141 ; _v14339 _9172822 |
|
856 | 4 | 0 | _uhttps://doi.org/10.1007/978-3-031-48312-7 |
912 | _aZDB-2-SCS | ||
912 | _aZDB-2-SXCS | ||
912 | _aZDB-2-LNC | ||
942 | _cELN | ||
999 |
_c97155 _d97155 |