168 results for Spectral Sensitivity
in Queensland University of Technology - ePrints Archive
Abstract:
This thesis presents an original approach to parametric speech coding at rates below 1 kbits/sec, primarily for speech storage applications. Essential processes considered in this research encompass efficient characterization of the evolutionary configuration of the vocal tract to follow phonemic features with high fidelity, representation of speech excitation using minimal parameters with minor degradation in the naturalness of synthesized speech, and finally, quantization of the resulting parameters at the nominated rates. For encoding speech spectral features, a new method relying on Temporal Decomposition (TD) is developed which efficiently compresses spectral information through interpolation between the most steady points over time trajectories of spectral parameters using a new basis function. The compression ratio provided by the method is independent of the updating rate of the feature vectors, and hence allows high resolution in tracking significant temporal variations of speech formants with no effect on the spectral data rate. Accordingly, regardless of the quantization technique employed, the method yields a high compression ratio without sacrificing speech intelligibility. Several new techniques for improving the performance of the interpolation of spectral parameters through phonetically-based analysis are proposed and implemented in this research, comprising event approximated TD, near-optimal shaping event approximating functions, efficient speech parametrization for TD on the basis of an extensive investigation originally reported in this thesis, and a hierarchical error minimization algorithm for decomposition of feature parameters which significantly reduces the complexity of the interpolation process.
Speech excitation in this work is characterized based on a novel Multi-Band Excitation paradigm which accurately determines the harmonic structure in the LPC (linear predictive coding) residual spectra, within individual bands, using the concept of Instantaneous Frequency (IF) estimation in the frequency domain. The model yields an effective two-band approximation to excitation and computes pitch and voicing with high accuracy as well. New methods for interpolative coding of pitch and gain contours are also developed in this thesis. For pitch, relying on the correlation between phonetic evolution and pitch variations during voiced speech segments, TD is employed to interpolate the pitch contour between critical points introduced by event centroids. This compresses the pitch contour in the ratio of about 1/10 with negligible error. To approximate the gain contour, a set of uniformly-distributed Gaussian event-like functions is used which reduces the amount of gain information to about 1/6 with acceptable accuracy. The thesis also addresses a new quantization method applied to spectral features on the basis of statistical properties and spectral sensitivity of spectral parameters extracted from TD-based analysis. The experimental results show that good quality speech, comparable to that of conventional coders at rates over 2 kbits/sec, can be achieved at rates of 650-990 bits/sec.
Abstract:
Intrinsically photosensitive retinal ganglion cells (ipRGCs) in the eye transmit the environmental light level, projecting to the suprachiasmatic nucleus (SCN) (Berson, Dunn & Takao, 2002; Hattar, Liao, Takao, Berson & Yau, 2002), the location of the circadian biological clock, and the olivary pretectal nucleus (OPN) of the pretectum, the start of the pupil reflex pathway (Hattar, Liao, Takao, Berson & Yau, 2002; Dacey, Liao, Peterson, Robinson, Smith, Pokorny, Yau & Gamlin, 2005). The SCN synchronizes the circadian rhythm, a cycle of biological processes coordinated to the solar day, and drives the sleep/wake cycle by controlling the release of melatonin from the pineal gland (Claustrat, Brun & Chazot, 2005). Encoded photic input from ipRGCs to the OPN also contributes to the pupil light reflex (PLR), the constriction and recovery of the pupil in response to light. IpRGCs control the post-illumination component of the PLR, the partial pupil constriction maintained for > 30 sec after a stimulus offset (Gamlin, McDougal, Pokorny, Smith, Yau & Dacey, 2007; Kankipati, Girkin & Gamlin, 2010; Markwell, Feigl & Zele, 2010). It is unknown if intrinsic ipRGC and cone-mediated inputs to ipRGCs show circadian variation in their photon-counting activity under constant illumination. If ipRGCs demonstrate circadian variation of the pupil response under constant illumination in vivo, when in vitro ipRGC activity does not (Weng, Wong & Berson, 2009), this would support central control of the ipRGC circadian activity. A preliminary experiment was conducted to determine the spectral sensitivity of the ipRGC post-illumination pupil response under the experimental conditions, confirming the successful isolation of the ipRGC response (Gamlin, et al., 2007) for the circadian experiment. In this main experiment, we demonstrate that ipRGC photon-counting activity has a circadian rhythm under constant experimental conditions, while direct rod and cone contributions to the PLR do not. 
Intrinsic ipRGC contributions to the post-illumination pupil response decreased 2:46 h prior to melatonin onset for our group model, with the peak ipRGC attenuation occurring 1:25 h after melatonin onset. Our results suggest a centrally controlled evening decrease in ipRGC activity, independent of environmental light, which is temporally synchronized (demonstrates a temporal phase-advanced relationship) to the SCN-mediated release of melatonin. In the future, the ipRGC post-illumination pupil response could be developed as a fast, non-invasive measure of circadian rhythm. This study establishes a basis for future investigation of cortical feedback mechanisms that modulate ipRGC activity.
Abstract:
Opsins are ancient molecules that enable animal vision by coupling to a vitamin-derived chromophore to form light-sensitive photopigments. The primary drivers of evolutionary diversification in opsins are thought to be visual tasks related to spectral sensitivity and color vision. Typically, only a few opsin amino acid sites affect photopigment spectral sensitivity. We show that opsin genes of the North American butterfly Limenitis arthemis have diversified along a latitudinal cline, consistent with natural selection due to environmental factors. We sequenced single nucleotide polymorphisms (SNPs) in the coding regions of the ultraviolet (UVRh), blue (BRh), and long-wavelength (LWRh) opsin genes from ten butterfly populations along the eastern United States and found that a majority of opsin SNPs showed significant clinal variation. Outlier detection and analysis of molecular variance indicated that many SNPs are under balancing selection and show significant population structure. This contrasts with what we found by analysing SNPs in the wingless and EF-1 alpha loci, and from neutral amplified fragment length polymorphisms, which show no evidence of significant locus-specific or genome-wide structure among populations. Using a combination of functional genetic and physiological approaches, including expression in cell culture, transgenic Drosophila, UV-visible spectroscopy, and optophysiology, we show that key BRh opsin SNPs that vary clinally have almost no effect on spectral sensitivity. Our results suggest that opsin diversification in this butterfly is more consistent with natural selection unrelated to spectral tuning. Some of the clinally varying SNPs may instead play a role in regulating opsin gene expression levels or the thermostability of the opsin protein. Lastly, we discuss the possibility that insect opsins might have important, yet-to-be-elucidated, adaptive functions in mediating animal responses to abiotic factors, such as temperature or photoperiod.
Abstract:
Purpose The post-illumination pupil response (PIPR) has been quantified using four metrics, but the spectral sensitivity of only one is known; here we determine the other three. To optimize the human PIPR measurement, we determine the protocol producing the largest PIPR, the duration of the PIPR, and the metric(s) with the lowest coefficient of variation. Methods The consensual pupil light reflex (PLR) was measured with a Maxwellian view pupillometer. - Experiment 1: Spectral sensitivity of four PIPR metrics [plateau, 6 s, area under curve (AUC) early and late recovery] was determined from a criterion PIPR to a 1 s pulse and fitted with a Vitamin A1 nomogram (λmax = 482 nm). - Experiment 2: The PLR was measured as a function of three stimulus durations (1 s, 10 s, 30 s), five irradiances spanning low to high melanopsin excitation levels (retinal irradiance: 9.8 to 14.8 log quanta.cm⁻².s⁻¹), and two wavelengths, one with high (465 nm) and one with low (637 nm) melanopsin excitation. Intra- and inter-individual coefficients of variation (CV) were calculated. Results The melanopsin (opn4) photopigment nomogram adequately describes the spectral sensitivity of all four PIPR metrics. The PIPR amplitude was largest with 1 s short wavelength pulses (≥ 12.8 log quanta.cm⁻².s⁻¹). The plateau and 6 s PIPR showed the least intra- and inter-individual CV (≤ 0.2). The maximum duration of the sustained PIPR was 83.0 ± 48.0 s (mean ± SD) for 1 s pulses and 180.1 ± 106.2 s for 30 s pulses (465 nm; 14.8 log quanta.cm⁻².s⁻¹). Conclusions All current PIPR metrics provide a direct measure of the intrinsic melanopsin photoresponse. To measure progressive changes in melanopsin function in disease, we recommend that the PIPR be measured using short duration pulses (e.g., ≤ 1 s) with high melanopsin excitation and analyzed with plateau and/or 6 s metrics. Our PIPR duration data provide a baseline for the selection of inter-stimulus intervals between consecutive pupil testing sequences.
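The abstract above reports intra- and inter-individual coefficients of variation without stating how they were computed. The following is a minimal sketch of one common convention, assuming the intra-individual CV is the average of each participant's own CV across repeated sessions and the inter-individual CV is the CV of the participant means. The function names and the PIPR values are hypothetical and are not taken from the study.

```python
from statistics import mean, stdev


def coefficient_of_variation(values):
    """Sample standard deviation divided by the mean."""
    return stdev(values) / mean(values)


def intra_individual_cv(sessions_by_participant):
    """Average of each participant's CV over their own repeated sessions (assumed convention)."""
    return mean(coefficient_of_variation(v) for v in sessions_by_participant.values())


def inter_individual_cv(sessions_by_participant):
    """CV of the per-participant means (assumed convention)."""
    return coefficient_of_variation([mean(v) for v in sessions_by_participant.values()])


# Hypothetical plateau PIPR amplitudes (% baseline constriction), three sessions each.
plateau_pipr = {
    "P1": [30.0, 32.0, 34.0],
    "P2": [24.0, 25.0, 26.0],
    "P3": [38.0, 40.0, 42.0],
}

intra_cv = intra_individual_cv(plateau_pipr)
inter_cv = inter_individual_cv(plateau_pipr)
print(f"intra-individual CV: {intra_cv:.3f}")
print(f"inter-individual CV: {inter_cv:.3f}")
```

Under this convention a metric would be compared against the abstract's CV ≤ 0.2 criterion on both numbers separately.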
Abstract:
Purpose The post-illumination pupil response (PIPR) has been quantified in the literature by four metrics. The spectral sensitivity of only one metric is known and this study quantifies the other three. To optimize the measurement of the PIPR in humans, we also determine the stimulus protocol producing the largest PIPR, the duration of the PIPR, and the metric(s) with the lowest coefficient of variation. Methods The consensual pupil light reflex (PLR) was measured with a Maxwellian view pupillometer (35.6° diameter stimulus). - Experiment 1: Spectral sensitivity of four PIPR metrics [plateau, 6 s, area under curve (AUC) early and late recovery] was determined from a criterion PIPR (n = 2 participants) to a 1 s pulse at five wavelengths (409-592 nm) and fitted with a Vitamin A nomogram (λmax = 482 nm). - Experiment 2: The PLR was measured in five healthy participants [29 to 42 years (mean = 32.6 years)] as a function of three stimulus durations (1 s, 10 s, 30 s), five irradiances spanning low to high melanopsin excitation levels (retinal irradiance: 9.8 to 14.8 log quanta.cm⁻².s⁻¹), and two wavelengths, one with high (465 nm) and one with low (637 nm) melanopsin excitation. Intra- and inter-individual coefficients of variation (CV) were calculated. Results The melanopsin (opn4) photopigment nomogram adequately described the spectral sensitivity derived from all four PIPR metrics. The largest PIPR amplitude was observed with 1 s short wavelength pulses (retinal irradiance ≥ 12.8 log quanta.cm⁻².s⁻¹). Of the 4 PIPR metrics, the plateau and 6 s PIPR showed the least intra- and inter-individual CV (≤ 0.2). The maximum duration of the sustained PIPR was 83.4 ± 48.0 s (mean ± SD) for 1 s pulses and 180.1 ± 106.2 s for 30 s pulses (465 nm; 14.8 log quanta.cm⁻².s⁻¹). Conclusions All current PIPR metrics provide a direct measure of intrinsic melanopsin retinal ganglion cell function.
To measure progressive changes in melanopsin function in disease, we recommend that the intrinsic melanopsin response be measured using a 1 s pulse with high melanopsin excitation and that the PIPR be analyzed with the plateau and/or 6 s metrics. Given that the PIPR can show a sustained constriction for as long as 3 minutes, our PIPR duration data provide a baseline for the selection of inter-stimulus intervals between consecutive pupil testing sequences.
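The abstract above fits its data with a Vitamin A nomogram peaking at 482 nm but does not say which template was used. As an illustration only, the sketch below evaluates the α-band of the Govardovskii et al. (2000) A1 visual pigment template, which is one widely used choice and an assumption here. The β-band and any pre-receptoral filtering (for example by the lens) are omitted, and the function name is hypothetical.

```python
import math


def a1_nomogram_alpha_band(wavelength_nm, lambda_max_nm=482.0):
    """Relative sensitivity from the alpha-band of the Govardovskii et al. (2000)
    A1 template (assumed template; beta-band and lens absorption are ignored)."""
    x = lambda_max_nm / wavelength_nm
    # Template constants; 'a' depends weakly on the peak wavelength.
    a = 0.8795 + 0.0459 * math.exp(-((lambda_max_nm - 300.0) ** 2) / 11940.0)
    big_a, big_b, b, big_c, c, d = 69.7, 28.0, 0.922, -14.9, 1.104, 0.674
    denominator = (
        math.exp(big_a * (a - x))
        + math.exp(big_b * (b - x))
        + math.exp(big_c * (c - x))
        + d
    )
    return 1.0 / denominator


# Compare the two stimulus wavelengths named in the abstract with the template peak.
for wavelength in (465.0, 482.0, 637.0):
    print(f"{wavelength:.0f} nm: {a1_nomogram_alpha_band(wavelength):.4f}")
```

With this template the 465 nm stimulus lies close to the peak while the 637 nm stimulus is several log units lower, which is consistent with the abstract's description of those wavelengths as high and low melanopsin excitation.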
Abstract:
The effects of small changes in flight-path parameters (primary and secondary flight paths, detector angles), and of displacement of the sample along the beam axis away from its ideal position, are examined for an inelastic time-of-flight (TOF) neutron spectrometer, emphasising the deep-inelastic regime. The aim was to develop a rational basis for deciding what measured shifts in the positions of spectral peaks could be regarded as reliable in the light of the uncertainties in the calibrated flight-path parameters. Uncertainty in the length of the primary or secondary flight path has the least effect on the positions of the peaks of H, D and He, which are dominated by the accuracy of the calibration of the detector angles. This aspect of the calibration of a TOF spectrometer therefore demands close attention to achieve reliable outcomes where the position of the peaks is of significant scientific interest, and it is discussed in detail. The corresponding sensitivities of the position of the peak of the Compton profile, J(y), to flight-path parameters and sample position are also examined, focusing on the comparability across experiments of results for H, D and He. We show that positioning the sample to within a few mm of the ideal position is required to ensure good comparability between experiments if data from detectors at high forward angles are to be reliably interpreted.
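The kind of sensitivity comparison described above can be sketched numerically under strong simplifying assumptions that are not stated in the abstract: an indirect-geometry instrument with a fixed final neutron energy (4.9 eV assumed), scattering from a free nucleus at rest, and approximate integer mass ratios for H, D and He. The geometry, the perturbation sizes and all function names are hypothetical.

```python
import math

NEUTRON_MASS_KG = 1.67492749804e-27
EV_TO_JOULE = 1.602176634e-19


def speed_from_energy(energy_ev):
    """Non-relativistic neutron speed (m/s) for a kinetic energy in eV."""
    return math.sqrt(2.0 * energy_ev * EV_TO_JOULE / NEUTRON_MASS_KG)


def velocity_ratio(mass_ratio, theta_rad):
    """v1/v0 for scattering through theta by a free nucleus at rest (mass_ratio = M/m)."""
    root = math.sqrt(mass_ratio ** 2 - math.sin(theta_rad) ** 2)
    return (math.cos(theta_rad) + root) / (mass_ratio + 1.0)


def recoil_peak_tof(l0_m, l1_m, theta_deg, mass_ratio, final_energy_ev=4.9):
    """Time of flight (s) of the recoil peak centre: t = L0/v0 + L1/v1."""
    v1 = speed_from_energy(final_energy_ev)
    v0 = v1 / velocity_ratio(mass_ratio, math.radians(theta_deg))
    return l0_m / v0 + l1_m / v1


def tof_shift(parameter, delta, **geometry):
    """Finite-difference shift of the peak position when one parameter changes by delta."""
    shifted = dict(geometry)
    shifted[parameter] += delta
    return recoil_peak_tof(**shifted) - recoil_peak_tof(**geometry)


# Hypothetical geometry and calibration uncertainties (1 cm in paths, 0.5 degrees in angle).
for name, ratio in (("H", 1.0), ("D", 2.0), ("He", 4.0)):
    geometry = dict(l0_m=11.0, l1_m=0.5, theta_deg=60.0, mass_ratio=ratio)
    shifts = {
        "L0 +1 cm": tof_shift("l0_m", 0.01, **geometry),
        "L1 +1 cm": tof_shift("l1_m", 0.01, **geometry),
        "theta +0.5 deg": tof_shift("theta_deg", 0.5, **geometry),
    }
    summary = ", ".join(f"{k}: {v * 1e6:+.3f} us" for k, v in shifts.items())
    print(f"{name}: {summary}")
```

Which parameter dominates in such a comparison depends on the perturbation sizes chosen, so the numbers printed here illustrate the method and do not reproduce the paper's results.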
Abstract:
The effectiveness of higher-order spectral (HOS) phase features in speaker recognition is investigated by comparison with Mel-Cepstral features on the same speech data. HOS phase features retain phase information from the Fourier spectrum, unlike Mel-frequency Cepstral coefficients (MFCC). Gaussian mixture models are constructed from Mel-Cepstral features and HOS features, respectively, for the same data from various speakers in the Switchboard telephone Speech Corpus. Feature clusters, model parameters and classification performance are analyzed. HOS phase features on their own provide a correct identification rate of about 97% on the chosen subset of the corpus. This is the same level of accuracy as provided by MFCCs. Cluster plots and model parameters are compared to show that HOS phase features can provide complementary information to better discriminate between speakers.
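The classification step described above can be sketched as follows, assuming diagonal-covariance Gaussian mixture models and a decision rule that picks the speaker whose model gives the highest average log-likelihood over the feature frames. The abstract specifies neither detail. Model training and feature extraction (MFCC or HOS phase) are not shown, and the model parameters, frames and names below are hypothetical.

```python
import math


def log_gaussian_diag(x, mean, var):
    """Log density of a Gaussian with diagonal covariance at feature vector x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )


def gmm_log_likelihood(x, gmm):
    """log sum_k w_k N(x; mean_k, var_k), computed with the log-sum-exp trick.
    gmm is a list of (weight, mean, var) tuples."""
    terms = [math.log(w) + log_gaussian_diag(x, m, v) for w, m, v in gmm]
    peak = max(terms)
    return peak + math.log(sum(math.exp(t - peak) for t in terms))


def identify_speaker(frames, speaker_models):
    """Return the best-scoring speaker and the average log-likelihood per speaker."""
    scores = {
        name: sum(gmm_log_likelihood(f, gmm) for f in frames) / len(frames)
        for name, gmm in speaker_models.items()
    }
    return max(scores, key=scores.get), scores


# Hypothetical two-component models over 2-D features.
speaker_models = {
    "speaker_a": [(0.5, [0.0, 0.0], [1.0, 1.0]), (0.5, [2.0, 2.0], [1.0, 1.0])],
    "speaker_b": [(0.5, [6.0, 6.0], [1.0, 1.0]), (0.5, [8.0, 8.0], [1.0, 1.0])],
}
test_frames = [[0.1, -0.2], [1.9, 2.2], [0.5, 0.4]]

best_speaker, scores = identify_speaker(test_frames, speaker_models)
print(best_speaker, scores)
```

The same scoring function works for any feature type, so one way to compare two feature sets would be to run it once per set on the same utterances.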