Publication date: January 2018
Source: Computer Speech & Language, Volume 47
Author(s): Cristina Guerrero Flores, Georgina Tryfou, Maurizio Omologo
Moving from a single-microphone to a multi-microphone setting, distant speech recognition can benefit in many ways from the multiple instances of the same utterance. An effective approach, especially when the microphones are not arranged as an array, is channel selection (CS), which assumes that for each utterance there is at least one channel whose decoding yields better recognition results than the remaining channels. To identify this most favourable channel, one possible approach is to estimate the degree of distortion that characterizes each microphone signal. In a reverberant environment, this distortion can vary significantly across microphones, for instance due to the orientation of the speaker's head. In this work, we investigate the application of cepstral distance as a distortion measure, which turns out to be closely related to properties of the room acoustics such as reverberation time and direct-to-reverberant ratio. From this measure, a blind CS method is derived, which relies on a reference computed by averaging the log magnitude spectra of all the microphone signals. Another aim of our study is to propose a novel methodology for analyzing CS under a wide set of experimental conditions and setup variations, which depend on the sound source position, its orientation, and the microphone network configuration. Based on the use of prior information, we introduce an informed technique to predict CS performance. Experimental results show both the effectiveness of the proposed blind CS method and the value of the analysis methodology. The experiments were conducted using different sets of real and simulated data, the latter derived from both synthetic and measured impulse responses. The results demonstrate that the proposed blind CS method is closely related to the oracle selection of the best-recognized channel. Moreover, our method outperforms a state-of-the-art method, especially on real data.
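The blind CS idea described in the abstract (a reference built by averaging the log magnitude spectra of all channels, then a cepstral distance from each channel to that reference) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, frame length, hop size, number of cepstral coefficients, and the Euclidean form of the distance are all assumptions made for illustration, and the channels are assumed to be time-aligned and of equal length. The abstract does not say whether the selected channel is the one closest to or farthest from the reference, so the sketch exposes that choice as the hypothetical `pick` parameter.

```python
import numpy as np

def log_magnitude_spectra(signal, frame_len=512, hop=256, eps=1e-10):
    """Split the signal into Hann-windowed frames and return log|FFT| per frame."""
    n_frames = 1 + (len(signal) - frame_len) // hop
    window = np.hanning(frame_len)
    frames = np.stack([signal[i * hop : i * hop + frame_len] * window
                       for i in range(n_frames)])
    return np.log(np.abs(np.fft.rfft(frames, axis=1)) + eps)

def cepstral_distance(log_spec_a, log_spec_b, n_coeffs=24):
    """Mean per-frame Euclidean distance between truncated real cepstra.

    The real cepstrum is taken as the inverse FFT of the log magnitude
    spectrum; coefficient 0 (overall gain) is skipped. Both choices are
    assumptions of this sketch.
    """
    cep_a = np.fft.irfft(log_spec_a, axis=1)[:, 1 : n_coeffs + 1]
    cep_b = np.fft.irfft(log_spec_b, axis=1)[:, 1 : n_coeffs + 1]
    return float(np.mean(np.sqrt(np.sum((cep_a - cep_b) ** 2, axis=1))))

def channel_distances(channels):
    """Cepstral distance of every channel to the averaged log-spectrum reference."""
    log_specs = np.stack([log_magnitude_spectra(c) for c in channels])
    reference = log_specs.mean(axis=0)  # average over channels, per frame and bin
    return [cepstral_distance(ls, reference) for ls in log_specs]

def select_channel(channels, pick="max"):
    """Return (index, distances); `pick` is a hypothetical knob, not from the paper."""
    distances = channel_distances(channels)
    chooser = np.argmax if pick == "max" else np.argmin
    return int(chooser(distances)), distances
```

Calling `select_channel([mic1, mic2, mic3])` on equal-length NumPy arrays returns the index of the chosen channel together with the per-channel distances, so the selection rule can be inspected or swapped without recomputing the spectra.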