983 resultados para frequency features
Resumo:
Age-related Macular Degeneration (AMD) is one of the major causes of vision loss and blindness in ageing population. Currently, there is no cure for AMD, however early detection and subsequent treatment may prevent the severe vision loss or slow the progression of the disease. AMD can be classified into two types: dry and wet AMDs. The people with macular degeneration are mostly affected by dry AMD. Early symptoms of AMD are formation of drusen and yellow pigmentation. These lesions are identified by manual inspection of fundus images by the ophthalmologists. It is a time consuming, tiresome process, and hence an automated diagnosis of AMD screening tool can aid clinicians in their diagnosis significantly. This study proposes an automated dry AMD detection system using various entropies (Shannon, Kapur, Renyi and Yager), Higher Order Spectra (HOS) bispectra features, Fractional Dimension (FD), and Gabor wavelet features extracted from greyscale fundus images. The features are ranked using t-test, Kullback–Lieber Divergence (KLD), Chernoff Bound and Bhattacharyya Distance (CBBD), Receiver Operating Characteristics (ROC) curve-based and Wilcoxon ranking methods in order to select optimum features and classified into normal and AMD classes using Naive Bayes (NB), k-Nearest Neighbour (k-NN), Probabilistic Neural Network (PNN), Decision Tree (DT) and Support Vector Machine (SVM) classifiers. The performance of the proposed system is evaluated using private (Kasturba Medical Hospital, Manipal, India), Automated Retinal Image Analysis (ARIA) and STructured Analysis of the Retina (STARE) datasets. The proposed system yielded the highest average classification accuracies of 90.19%, 95.07% and 95% with 42, 54 and 38 optimal ranked features using SVM classifier for private, ARIA and STARE datasets respectively. This automated AMD detection system can be used for mass fundus image screening and aid clinicians by making better use of their expertise on selected images that require further examination.
Resumo:
Existing crowd counting algorithms rely on holistic, local or histogram based features to capture crowd properties. Regression is then employed to estimate the crowd size. Insufficient testing across multiple datasets has made it difficult to compare and contrast different methodologies. This paper presents an evaluation across multiple datasets to compare holistic, local and histogram based methods, and to compare various image features and regression models. A K-fold cross validation protocol is followed to evaluate the performance across five public datasets: UCSD, PETS 2009, Fudan, Mall and Grand Central datasets. Image features are categorised into five types: size, shape, edges, keypoints and textures. The regression models evaluated are: Gaussian process regression (GPR), linear regression, K nearest neighbours (KNN) and neural networks (NN). The results demonstrate that local features outperform equivalent holistic and histogram based features; optimal performance is observed using all image features except for textures; and that GPR outperforms linear, KNN and NN regression
Resumo:
This paper is about localising across extreme lighting and weather conditions. We depart from the traditional point-feature-based approach as matching under dramatic appearance changes is a brittle and hard thing. Point feature detectors are fixed and rigid procedures which pass over an image examining small, low-level structure such as corners or blobs. They apply the same criteria applied all images of all places. This paper takes a contrary view and asks what is possible if instead we learn a bespoke detector for every place. Our localisation task then turns into curating a large bank of spatially indexed detectors and we show that this yields vastly superior performance in terms of robustness in exchange for a reduced but tolerable metric precision. We present an unsupervised system that produces broad-region detectors for distinctive visual elements, called scene signatures, which can be associated across almost all appearance changes. We show, using 21km of data collected over a period of 3 months, that our system is capable of producing metric localisation estimates from night-to-day or summer-to-winter conditions.
Resumo:
This PhD research has provided novel solutions to three major challenges which have prevented the wide spread deployment of speaker recognition technology: (1) combating enrolment/ verification mismatch, (2) reducing the large amount of development and training data that is required and (3) reducing the duration of speech required to verify a speaker. A range of applications of speaker recognition technology from forensics in criminal investigations to secure access in banking will benefit from the research outcomes.
Resumo:
Objective Migraine is a highly disabling disease affecting a significant proportion of the Australian population. The Methylenetetrahydrofolate Reductase (MTHFR) C677T variant has been associated with increased levels of homocysteine and risk of migraine with aura (MA). Folic acid, Vitamin B6 and B12 supplementation has been previously shown to reduce increased levels of homocysteine and decrease migraine symptoms. However the influence of dietary folate intake on migraine has been unclear. The aim of the current study was to analyse the association of dietary folate intake in the form of dietary folate equivalent (DFE), folic acid (FA) and total food folate (TFF) on migraine frequency, severity and disability. Methods A cohort of 141 adult females of Caucasian descent with MA was genotyped for the MTHFRC677T variant using restriction enzyme digestion. Dietary folate information was collected from all participants and analysed using the “FoodWorks” 2009 package. Folate consumption was compared to migraine frequency, severity and disability using linear regression. Results A significant inverse relation was observed between DFE [R2= 0.201, P= 0.045, CI (-0.004, -0.001)] and FA [R2= 0.255, P= 0.036, 95% CI (-0.009, -0.002)] consumption and migraine frequency. It was also observed that in individuals with the CC genotype for the MTHFR C677T variant, migraine frequency was significantly linked to FA consumption [R2= 0.077, P= 0.029, CI (-0.009, -0.005)]. Conclusions The results from this study indicate that folate intake in the form of folic acid may influence migraine frequency in female MA sufferers.
Resumo:
This work deals with estimators for predicting when parametric roll resonance is going to occur in surface vessels. The roll angle of the vessel is modeled as a second-order linear oscillatory system with unknown parameters. Several algorithms are used to estimate the parameters and eigenvalues of the system based on data gathered experimentally on a 1:45 scale model of a tanker. Based on the estimated eigenvalues, the system predicts whether or not parametric roll occurred. A prediction accuracy of 100% is achieved for regular waves, and up to 87.5% for irregular waves.
Resumo:
Abnormal event detection has attracted a lot of attention in the computer vision research community during recent years due to the increased focus on automated surveillance systems to improve security in public places. Due to the scarcity of training data and the definition of an abnormality being dependent on context, abnormal event detection is generally formulated as a data-driven approach where activities are modeled in an unsupervised fashion during the training phase. In this work, we use a Gaussian mixture model (GMM) to cluster the activities during the training phase, and propose a Gaussian mixture model based Markov random field (GMM-MRF) to estimate the likelihood scores of new videos in the testing phase. Further-more, we propose two new features: optical acceleration, and the histogram of optical flow gradients; to detect the presence of any abnormal objects and speed violations in the scene. We show that our proposed method outperforms other state of the art abnormal event detection algorithms on publicly available UCSD dataset.
Resumo:
Fine-grained leaf classification has concentrated on the use of traditional shape and statistical features to classify ideal images. In this paper we evaluate the effectiveness of traditional hand-crafted features and propose the use of deep convolutional neural network (ConvNet) features. We introduce a range of condition variations to explore the robustness of these features, including: translation, scaling, rotation, shading and occlusion. Evaluations on the Flavia dataset demonstrate that in ideal imaging conditions, combining traditional and ConvNet features yields state-of-theart performance with an average accuracy of 97:3%�0:6% compared to traditional features which obtain an average accuracy of 91:2%�1:6%. Further experiments show that this combined classification approach consistently outperforms the best set of traditional features by an average of 5:7% for all of the evaluated condition variations.
Resumo:
Background: Hot air ballooning incidents are relatively rare, however, when they do occur they are likely to result in a fatality or serious injury. Human error is commonly attributed as the cause of hot air ballooning incidents; however, error in itself is not an explanation for safety failures. This research aims to identify, and establish the relative importance of factors contributing towards hot air ballooning incidents. Methods: Twenty-two Australian Ballooning Federation (ABF) incident reports were thematically coded using a bottom up approach to identify causal factors. Subsequently, 69 balloonists (mean 19.51 years’ experience) participated in a survey to identify additional causal factors and rate (out of seven) the perceived frequency and potential impact to ballooning operations of each of the previously identified causal factors. Perceived associated risk was calculated by multiplying mean perceived frequency and impact ratings. Results: Incident report coding identified 54 causal factors within nine higher level areas: Attributes, Crew resource management, Equipment, Errors, Instructors, Organisational, Physical Environment, Regulatory body and Violations. Overall, ‘weather’, ‘inexperience’ and ‘poor/inappropriate decisions’ were rated as having greatest perceived associated risk. Discussion: Although errors were nominated as a prominent cause of hot air ballooning incidents, physical environment and personal attributes are also particularly important for safe hot air ballooning operations. In identifying a range of causal factors the areas of weakness surrounding ballooning operations have been defined; it is hoped that targeted safety and training strategies can now be put into place removing these contributing factors and reducing the chance of pilot error.
Resumo:
Few studies have investigated the vocal communication of ratites, and none has investigated the spectral and temporal structure of vocalizations of Apteryx, the only extant ratite genus in New Zealand. We describe the long-range vocalization (whistle call) and vocal behavior of male and female North Island Brown Kiwi (Apteryx mantelli). Spontaneous calling by seven pairs was recorded in the field over a one-year period. Call notes produced by males were tonal in nature; the fundamental frequency was ~1.5 kHz, with overtones reaching up to ~13.0 kHz. Call notes produced by females contained a series of tightly packed, poorly defined harmonics and consisted of a fundamental frequency of ~0.1 kHz, with overtones reaching ~7.0 kHz. The amplitude within notes of females was concentrated into two prominent formants. Some individuals of pairs exhibited duetting behavior that resulted in alteration of the inter-note interval after the onset of the call of their mate. Our findings draw attention to the uniqueness of the North Island Brown Kiwi's vocalizations, and we uncovered some unexpected structural features that call for further investigation.
Resumo:
The intermediate leaf-nosed bat (Hipposideros larvatus) is a medium-sized bat distributed throughout the Indo-Malay region. In north-east India, bats identified as H. larvatus captured at a single cave emitted echolocation calls with a bimodal distribution of peak frequencies, around either 85 kHz or 98 kHz. Individuals echolocating at 85 kHz had larger ears and longer forearms than those echolocating at 98 kHz, although no differences were detected in either wing morphology or diet, suggesting limited resource partitioning. A comparison of mitochondrial control region haplotypes of the two phonic types with individuals sampled from across the Indo-Malay range supports the hypothesis that, in India, two cryptic species are present. The Indian 98-kHz phonic bats formed a monophyletic clade with bats from all other regional populations sampled, to the exclusion of the Indian 85-kHz bats. In India, the two forms showed 12–13% sequence divergence and we propose that the name Hipposideros khasiana for bats of the 85-kHz phonic type. Bats of the 98-kHz phonic type formed a monophyletic group with bats from Myanmar, and corresponded to Hipposideros grandis, which is suggested to be a species distinct from Hipposideros larvatus. Differences in echolocation call frequency among populations did not reflect phylogenetic relationships, indicating that call frequency is a poor indicator of evolutionary history. Instead, divergence in call frequency probably occurs in allopatry, possibly augmented by character displacement on secondary contact to facilitate intraspecific communication.
Resumo:
Many species of bat use ultrasonic frequency modulated (FM) pulses to measure the distance to objects by timing the emission and reception of each pulse. Echolocation is mainly used in flight. Since the flight speed of bats often exceeds 1% of the speed of sound, Doppler effects will lead to compression of the time between emission and reception as well as an elevation of the echo frequencies, resulting in a distortion of the perceived range. This paper describes the consequences of these Doppler effects on the ranging performance of bats using different pulse designs. The consequences of Doppler effects on ranging performance described in this paper assume bats to have a very accurate ranging resolution, which is feasible with a filterbank receiver. By modeling two receiver types, it was first established that the effects of Doppler compression are virtually independent of the receiver type. Then, used a cross-correlation model was used to investigate the effect of flight speed on Doppler tolerance and range–Doppler coupling separately. This paper further shows how pulse duration, bandwidth, function type, and harmonics influence Doppler tolerance and range–Doppler coupling. The influence of each signal parameter is illustrated using calls of several bat species. It is argued that range–Doppler coupling is a significant source of error in bat echolocation, and various strategies bats could employ to deal with this problem, including the use of range rate information are discussed.
Resumo:
We recorded echolocation calls from 14 sympatric species of bat in Britain. Once digitised, one temporal and four spectral features were measured from each call. The frequency-time course of each call was approximated by fitting eight mathematical functions, and the goodness of fit, represented by the mean-squared error, was calculated. Measurements were taken using an automated process that extracted a single call from background noise and measured all variables without intervention. Two species of Rhinolophus were easily identified from call duration and spectral measurements. For the remaining 12 species, discriminant function analysis and multilayer back-propagation perceptrons were used to classify calls to species level. Analyses were carried out with and without the inclusion of curve-fitting data to evaluate its usefulness in distinguishing among species. Discriminant function analysis achieved an overall correct classification rate of 79% with curve-fitting data included, while an artificial neural network achieved 87%. The removal of curve-fitting data improved the performance of the discriminant function analysis by 2 %, while the performance of a perceptron decreased by 2 %. However, an increase in correct identification rates when curve-fitting information was included was not found for all species. The use of a hierarchical classification system, whereby calls were first classified to genus level and then to species level, had little effect on correct classification rates by discriminant function analysis but did improve rates achieved by perceptrons. This is the first published study to use artificial neural networks to classify the echolocation calls of bats to species level. Our findings are discussed in terms of recent advances in recording and analysis technologies, and are related to factors causing convergence and divergence of echolocation call design in bats.