940 results for noisy speaker verification


Relevance:

30.00%

Publisher:

Abstract:

This study aims to help broaden the use of electronic portal imaging devices (EPIDs) for pre-treatment patient positioning verification, from photon-beam radiotherapy to photon- and electron-beam radiotherapy, by proposing and testing a method for acquiring clinically useful EPID images of patient anatomy using electron beams, with a view to enabling and encouraging further research in this area. EPID images used in this study were acquired using all available beams from a linac configured to deliver electron beams with nominal energies of 6, 9, 12, 16 and 20 MeV, as well as photon beams with nominal energies of 6 and 10 MV. A widely available, heterogeneous, approximately humanoid thorax phantom was used to provide an indication of the contrast and noise produced when imaging different types of tissue with comparatively realistic thicknesses. The acquired images were automatically calibrated and corrected for the effects of variations in the sensitivity of individual photodiodes, using a flood field image. For electron beam imaging, flood field EPID calibration images were acquired with and without blocks of water-equivalent plastic (with thicknesses approximately equal to the practical range of electrons in the plastic) placed upstream of the EPID, to filter out the primary electron beam, leaving only the bremsstrahlung photon signal. While the electron beam images acquired using a standard (unfiltered) flood field calibration were observed to be noisy and difficult to interpret, the electron beam images acquired using the filtered flood field calibration showed tissues and bony anatomy with levels of contrast and noise that were similar to the contrast and noise levels seen in the clinically acceptable photon beam EPID images.
The best electron beam imaging results (highest contrast, signal-to-noise and contrast-to-noise ratios) were achieved when the images were acquired using the higher-energy electron beams (16 and 20 MeV) and the EPID was calibrated using an intermediate (12 MeV) electron beam energy. These results demonstrate the feasibility of acquiring clinically useful EPID images of patient anatomy using electron beams and suggest important avenues for future investigation, thus enabling and encouraging further research in this area. There is manifest potential for the EPID imaging method proposed in this work to lead to the clinical use of electron beam imaging for geometric verification of electron treatments in the future.
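
The flood field correction mentioned in this abstract can be illustrated with a minimal sketch. It assumes the simplest form of the correction, dividing the raw image by the flood field normalised to unit mean, and it ignores dark-field (offset) subtraction. The function name and the 2x2 pixel values are hypothetical and are not taken from the study.

```python
def flood_field_correct(raw, flood):
    """Divide a raw image by the flood field normalised to unit mean.

    Both arguments are lists of rows of pixel values with the same shape.
    """
    n_pixels = sum(len(row) for row in flood)
    mean_flood = sum(sum(row) for row in flood) / n_pixels
    return [
        [r * mean_flood / f for r, f in zip(raw_row, flood_row)]
        for raw_row, flood_row in zip(raw, flood)
    ]


# Hypothetical 2x2 detector whose right-hand pixels are 25% more sensitive.
flood = [[80.0, 100.0], [80.0, 100.0]]
# A uniform object seen through the same uneven pixel sensitivities.
raw = [[40.0, 50.0], [40.0, 50.0]]

corrected = flood_field_correct(raw, flood)
```

After the correction every pixel of the uniform object carries the same value, which is the behaviour a sensitivity correction is meant to restore.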

Relevance:

30.00%

Publisher:

Abstract:

Retransmission protocols such as HDLC and TCP are designed to ensure reliable communication over noisy channels (i.e., channels that can corrupt messages). Thakkar et al. [15] have recently presented an algorithmic verification technique for deterministic streaming string transducer (DSST) models of such protocols. The verification problem is posed as equivalence checking between the specification and protocol DSSTs. In this paper, we argue that more general models need to be obtained using non-deterministic streaming string transducers (NSSTs). However, equivalence checking is undecidable for NSSTs. We present two classes where the models belong to a sub-class of NSSTs for which it is decidable. (C) 2015 Elsevier B.V. All rights reserved.

Relevance:

30.00%

Publisher:

Abstract:

This paper presents the maximum weighted stream posterior (MWSP) model as a robust and efficient stream integration method for audio-visual speech recognition in environments where the audio or video streams may be subjected to unknown and time-varying corruption. A significant advantage of MWSP is that it does not require any specific measurements of the signal in either stream to calculate appropriate stream weights during recognition, and as such it is modality-independent. This also means that MWSP complements and can be used alongside many of the other approaches that have been proposed in the literature for this problem. For evaluation we used the large XM2VTS database for speaker-independent audio-visual speech recognition. The extensive tests include both clean and corrupted utterances, with corruption added to the video stream, the audio stream, or both, using a variety of types (e.g., MPEG-4 video compression) and levels of noise. The experiments show that this approach gives excellent performance in comparison to another well-known dynamic stream weighting approach and to any fixed-weight integration approach, both in clean conditions and when noise is added to either stream. Furthermore, our experiments show that the MWSP approach dynamically selects suitable integration weights on a frame-by-frame basis according to the level of noise in the streams and also according to the naturally fluctuating relative reliability of the modalities even in clean conditions. The MWSP approach is shown to maintain robust recognition performance in all tested conditions, while requiring no prior knowledge about the type or level of noise.
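
The abstract does not spell out how the stream weights are chosen. The sketch below shows one plausible reading of the name "maximum weighted stream posterior", stated here as an assumption: for each frame, try a grid of weights, combine the audio and video log-likelihoods linearly, and keep the weight whose best class posterior is largest. The function names, the grid size and the example scores are all hypothetical.

```python
import math


def softmax(scores):
    """Turn log-scores into posteriors that sum to one."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def select_stream_weight(audio_ll, video_ll, grid=11):
    """Pick the audio weight in [0, 1] that maximises the top class posterior.

    audio_ll and video_ll hold per-class log-likelihoods for one frame.
    The video stream receives weight (1 - w).
    """
    best_weight, best_post = 0.0, -1.0
    for i in range(grid):
        w = i / (grid - 1)
        combined = [w * a + (1 - w) * v for a, v in zip(audio_ll, video_ll)]
        post = max(softmax(combined))
        if post > best_post:
            best_weight, best_post = w, post
    return best_weight, best_post


# Hypothetical frame: the audio scores are ambiguous (as if noisy),
# while the video scores clearly favour class 0.
noisy_audio = [-1.0, -1.1, -0.9]
clear_video = [-0.2, -3.0, -4.0]

weight_when_audio_noisy, _ = select_stream_weight(noisy_audio, clear_video)
weight_when_video_noisy, _ = select_stream_weight(clear_video, noisy_audio)
```

Under this reading no explicit noise measurement is needed: the weight moves towards whichever stream produces the more decisive posterior for the current frame.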

Relevance:

20.00%

Publisher:

Abstract:

Bid opening in e-auctions is efficient when a homomorphic secret sharing function is employed to seal the bids and homomorphic secret reconstruction is employed to open the bids. However, this high efficiency is based on an assumption: the bids are valid (e.g., within a special range). An undetected invalid bid can compromise correctness and fairness of the auction. Unfortunately, validity verification of the bids is ignored in the auction schemes employing homomorphic secret sharing (called homomorphic auctions in this paper). In this paper, an attack against the homomorphic auction in the absence of a bid validity check is presented and a necessary bid validity check mechanism is proposed. Then a batch cryptographic technique is introduced and applied to improve the efficiency of the bid validity check.
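
To make the homomorphic property concrete, here is a minimal sketch that assumes plain additive secret sharing modulo a prime. The abstract does not name the sharing scheme, so the scheme, the modulus and the encoding of a bid as a 0/1 indicator for a single price are all assumptions. The final lines are a hypothetical illustration of why unchecked bids are dangerous, not the attack described in the paper.

```python
import secrets

P = 2**61 - 1  # prime modulus, chosen only for this sketch


def share(value, n):
    """Split value into n additive shares modulo P."""
    parts = [secrets.randbelow(P) for _ in range(n - 1)]
    parts.append((value - sum(parts)) % P)
    return parts


def reconstruct(shares):
    """Recover the secret by adding all shares modulo P."""
    return sum(shares) % P


def open_sum(all_shares):
    """Each auctioneer adds the shares it holds, then the sums are combined.

    Only the total is revealed, never an individual bidder's value.
    """
    per_auctioneer = [sum(column) % P for column in zip(*all_shares)]
    return reconstruct(per_auctioneer)


# Three honest bidders say whether they bid at some price (1) or not (0),
# each splitting the indicator among three auctioneers.
honest = [share(b, 3) for b in (1, 0, 1)]
total_valid = open_sum(honest)

# A fourth bidder submits -1 (mod P), which lies outside the valid range
# and silently cancels one honest bid in the opened total.
with_cheat = honest + [share(-1 % P, 3)]
total_cheat = open_sum(with_cheat)
```

The opened totals are 2 and 1 respectively, and nothing in the sharing itself flags the out-of-range value, which is the gap a bid validity check has to close.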

Relevance:

20.00%

Publisher:

Abstract:

The effectiveness of higher-order spectral (HOS) phase features in speaker recognition is investigated by comparison with Mel-cepstral features on the same speech data. Unlike Mel-frequency cepstral coefficients (MFCCs), HOS phase features retain phase information from the Fourier spectrum. Gaussian mixture models are constructed from Mel-cepstral features and HOS features, respectively, for the same data from various speakers in the Switchboard telephone speech corpus. Feature clusters, model parameters and classification performance are analyzed. HOS phase features on their own provide a correct identification rate of about 97% on the chosen subset of the corpus. This is the same level of accuracy as provided by MFCCs. Cluster plots and model parameters are compared to show that HOS phase features can provide complementary information to better discriminate between speakers.
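
As a rough illustration of how Gaussian mixture models are typically used for speaker identification, the sketch below scores a sequence of feature vectors against one diagonal-covariance GMM per speaker and returns the best-scoring speaker. The diagonal-covariance form, the model parameters and the two-dimensional features are assumptions made for the example. Nothing here is taken from the paper, and feature extraction (MFCC or HOS phase) is out of scope.

```python
import math


def gmm_log_likelihood(frame, gmm):
    """Log-likelihood of one feature vector under a diagonal-covariance GMM.

    gmm is a list of (weight, means, variances) tuples, one per component.
    """
    terms = []
    for weight, means, variances in gmm:
        log_p = math.log(weight)
        for x, mu, var in zip(frame, means, variances):
            log_p += -0.5 * (math.log(2 * math.pi * var) + (x - mu) ** 2 / var)
        terms.append(log_p)
    # Log-sum-exp over components for numerical stability.
    peak = max(terms)
    return peak + math.log(sum(math.exp(t - peak) for t in terms))


def identify(frames, models):
    """Return the speaker whose GMM gives the highest total log-likelihood."""
    scores = {
        name: sum(gmm_log_likelihood(f, gmm) for f in frames)
        for name, gmm in models.items()
    }
    return max(scores, key=scores.get)


# Hypothetical two-component models over two-dimensional features.
models = {
    "speaker_a": [(0.5, [0.0, 0.0], [1.0, 1.0]), (0.5, [2.0, 2.0], [1.0, 1.0])],
    "speaker_b": [(0.5, [5.0, 5.0], [1.0, 1.0]), (0.5, [7.0, 7.0], [1.0, 1.0])],
}
frames = [[0.1, -0.2], [1.9, 2.2], [0.4, 0.3]]

best_speaker = identify(frames, models)
```

The same scoring routine works for any feature type, which is what allows a like-for-like comparison of two feature sets on the same data.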