50 resultados para classification and regression tree
Resumo:
The HOM-C clustered prototype homeobox genes of Drosophila, and their counterparts, the HOX genes in humans, are highly conserved at the genomic level. These master regulators of development continue to be expressed throughout adulthood in various tissues and organs. The physiological and patho-physiological functions of this network of genes are being avidly pursued within the scientific community, but defined roles for them remain elusive. The order of expression of HOX genes within a cluster is co-ordinated during development, so that the 3' genes are expressed more anteriorly and earlier than the 5' genes. Mutations in HOXA13 and HOXD13 are associated with disorders of limb formation such as hand-foot-genital syndrome (HFGS), synpolydactyly (SPD), and brachydactyly. Haematopoietic progenitors express HOX genes in a pattern characteristic of the lineage and stage of differentiation of the cells. In leukaemia, dysregulated HOX gene expression can occur due to chromosomal translocations involving upstream regulators such as the MLL gene, or the fusion of a HOX gene to another gene such as the nucleoporin, NUP98. Recent investigations of HOX gene expression in leukaemia are providing important insights into disease classification and prediction of clinical outcome. Whereas the oncogenic potential of certain HOX genes in leukaemia has already been defined, their role in other neoplasms is currently being studied. Progress has been hampered by the experimental approach used in many studies in which the expression of small subsets of HOX genes was analysed, and complicated by the functional redundancy implicit in the HOX gene system. Attempts to elucidate the function of HOX genes in malignant transformation will be enhanced by a better understanding of their upstream regulators and downstream target genes.
Resumo:
Several studies have reported imitative deficits in autism spectrum disorder (ASD). However, it is still debated if imitative deficits are specific to ASD or shared with clinical groups with similar mental impairment and motor difficulties. We investigated whether imitative tasks can be used to discriminate ASD children from typically developing children (TD) and children with general developmental delay (GDD). We applied discriminant function analyses to the performance of these groups on three imitation tasks and tests of dexterity, motor planning, verbal skills, theory of mind (ToM). Analyses revealed two significant dimensions. The first represented impairment of dexterity and verbal ability, and discriminated TD from GDD children. Once these differences were accounted for, differences in ToM and the three imitation tasks accounted for a significant proportion of the remaining intergroup variance and discriminated the ASD group from other groups. Further analyses revealed that inclusion of imitative tasks increased the specificity and sensitivity of ASD classification and that imitative tasks considered alone were able to reliably discriminate ASD, TD and GDD. The results suggest that imitation and theory of mind impairment in autism may stem from a common domain of origin separate from general cognitive and motor skill.
Resumo:
For many applications of emotion recognition, such as virtual agents, the system must select responses while the user is speaking. This requires reliable on-line recognition of the user’s affect. However most emotion recognition systems are based on turnwise processing. We present a novel approach to on-line emotion recognition from speech using Long Short-Term Memory Recurrent Neural Networks. Emotion is recognised frame-wise in a two-dimensional valence-activation continuum. In contrast to current state-of-the-art approaches, recognition is performed on low-level signal frames, similar to those used for speech recognition. No statistical functionals are applied to low-level feature contours. Framing at a higher level is therefore unnecessary and regression outputs can be produced in real-time for every low-level input frame. We also investigate the benefits of including linguistic features on the signal frame level obtained by a keyword spotter.
Resumo:
The VLT-FLAMES Tarantula Survey (VFTS) is an ESO Large Programme that has obtained multi-epoch optical spectroscopy of over 800 massive stars in the 30 Doradus region of the Large Magellanic Cloud (LMC). Here we introduce our scientific motivations and give an overview of the survey targets, including optical and near-infrared photometry and comprehensive details of the data reduction. One of the principal objectives was to detect massive binary systems via variations in their radial velocities, thus shaping the multi-epoch observing strategy. Spectral classifications are given for the massive emission-line stars observed by the survey, including the discovery of a new Wolf-Rayet star (VFTS 682, classified as WN5h), 2' to the northeast of R136. To illustrate the diversity of objects encompassed by the survey, we investigate the spectral properties of sixteen targets identified by Gruendl & Chu from Spitzer photometry as candidate young stellar objects or stars with notable mid-infrared excesses. Detailed spectral classification and quantitative analysis of the O- and B-type stars in the VFTS sample, paying particular attention to the effects of rotational mixing and binarity, will be presented in a series of future articles to address fundamental questions in both stellar and cluster evolution.
Resumo:
Although cerebral palsy (CP) is the most common cause of motor deficiency in young children, it occurs in only 2 to 3 per 1000 live births. In order to monitor prevalence rates, especially within subgroups (birthweight, clinical type), it is necessary to study large populations. A network of CP surveys and registers was formed in 14 centres in eight countries across Europe. Differences in prevalence rates of CP in the centres prior to any work on harmonization of data are reported. The subsequent process to standardize the definition of CP, inclusion/exclusion criteria, classification, and description of children with CP is outlined. The consensus that was reached on these issues will make it possible to monitor trends in CP rate, to provide a framework for collaborative research, and a basis for services planning among European countries.
Resumo:
A 40mcore from Loagan Bunut,Malaysian Borneo, yielded a high-resolution early Holocene (11.3e6.75 ka) sequence of marginal-marine deposits. Palynological analysis showed relatively stable fire-regulated lowland forest through this time, with the local development and regression of mangrove vegetation. A general trend of rising rainfall and thus strengthening North East monsoonal circulation linked to the migration of the mean position of the ICTZ was interrupted by what may be episodes of drier climate and weakening monsoonal activity at 9250-8890, 7900 and 7600-7545 cal. BP. Magnetic susceptibility peaks suggestmarked short-term ENSO-style activity superimposed upon this record. Repeated markers for openand disturbed habitats, plus occasional imported and probably-cultivated taxa, point towards human impact from the earliest Holocene on the wet tropical forest at Loagan Bunut.
Resumo:
Background: There is growing interest in the potential utility of molecular diagnostics in improving the detection of life-threatening infection (sepsis). LightCycler® SeptiFast is a multipathogen probebased real-time PCR system targeting DNA sequences of bacteria and fungi present in blood samples within a few hours. We report here the protocol of the first systematic review of published clinical diagnostic accuracy studies of this technology when compared with blood culture in the setting of suspected sepsis. Methods/design: Data sources: the Cochrane Database of Systematic Reviews, the Database of Abstracts of Reviews of Effects (DARE), the Health Technology Assessment Database (HTA), the NHS Economic Evaluation Database (NHSEED), The Cochrane Library, MEDLINE, EMBASE, ISI Web of Science, BIOSIS Previews, MEDION and the Aggressive Research Intelligence Facility Database (ARIF). Study selection: diagnostic accuracy studies that compare the real-time PCR technology with standard culture results performed on a patient's blood sample during the management of sepsis. Data extraction: three reviewers, working independently, will determine the level of evidence, methodological quality and a standard data set relating to demographics and diagnostic accuracy metrics for each study. Statistical analysis/data synthesis: heterogeneity of studies will be investigated using a coupled forest plot of sensitivity and specificity and a scatter plot in Receiver Operator Characteristic (ROC) space. Bivariate model method will be used to estimate summary sensitivity and specificity. The authors will investigate reporting biases using funnel plots based on effective sample size and regression tests of asymmetry. Subgroup analyses are planned for adults, children and infection setting (hospital vs community) if sufficient data are uncovered. Dissemination: Recommendations will be made to the Department of Health (as part of an open-access HTA report) as to whether the real-time PCR technology has sufficient clinical diagnostic accuracy potential to move forward to efficacy testing during the provision of routine clinical care.
Towards an understanding of the causes and effects of software requirements change: two case studies
Resumo:
Changes to software requirements not only pose a risk to the successful delivery of software applications but also provide opportunity for improved usability and value. Increased understanding of the causes and consequences of change can support requirements management and also make progress towards the goal of change anticipation. This paper presents the results of two case studies that address objectives arising from that ultimate goal. The first case study evaluated the potential of a change source taxonomy containing the elements ‘market’, ‘organisation’, ‘vision’, ‘specification’, and ‘solution’ to provide a meaningful basis for change classification and measurement. The second case study investigated whether the requirements attributes of novelty, complexity, and dependency correlated with requirements volatility. While insufficiency of data in the first case study precluded an investigation of changes arising due to the change source of ‘market’, for the remainder of the change sources, results indicate a significant difference in cost, value to the customer and management considerations. Findings show that higher cost and value changes arose more often from ‘organisation’ and ‘vision’ sources; these changes also generally involved the co-operation of more stakeholder groups and were considered to be less controllable than changes arising from the ‘specification’ or ‘solution’ sources. Results from the second case study indicate that only ‘requirements dependency’ is consistently correlated with volatility and that changes coming from each change source affect different groups of requirements. We conclude that the taxonomy can provide a meaningful means of change classification, but that a single requirement attribute is insufficient for change prediction. A theoretical causal account of requirements change is drawn from the implications of the combined results of the two case studies.
Resumo:
Late age-related maculopathy (ARM) is responsible for the majority of blind registrations in the Western world among persons over 50 years of age. It has devastating effects on quality of life and independence and is becoming a major public health concern. Current treatment options are limited and most aim to slow progression rather than restore vision; therefore, early detection to identify those patients most suitable for these interventions is essential. In this work, we review the literature encompassing the investigation of visual function in ARM in order to highlight those visual function parameters which are affected very early in the disease process. We pay particular attention to measures of acuity, contrast sensitivity (CS), cone function, electrophysiology, visual adaptation, central visual field sensitivity and metamorphopsia. We also consider the impact of bilateral late ARM on visual function as well as the relationship between measures of vision function and self-reported visual functioning. Much interest has centred on the identification of functional changes which may predict progression to neovascular disease; therefore, we outline the longitudinal studies, which to date have reported dark-adaptation time, short-wavelength cone sensitivity, colour-match area effect, dark-adapted foveal sensitivity, foveal flicker sensitivity, slow recovery from glare and slower foveal electroretinogram implicit time as functional risk factors for the development of neovascular disease. Despite progress in this area, we emphasise the need for longitudinal studies designed in light of developments in disease classification and retinal imaging, which would ensure the correct classification of cases and controls, and provide increased understanding of the natural course and progression of the disease and further elucidate the structure-function relationships in this devastating disorder.
Resumo:
Understanding how communities assemble is a key challenge in ecology. Conflicting hypotheses suggest that plant traits within communities should show divergence to reflect strategies to reduce competition or convergence to reflect strong selection for the environmental conditions operating. Further hypotheses suggest that plant traits related to productivity show convergence within communities, but those related to disturbance show divergence. Data on functional diversity (FD ) of 12 traits from 30 communities ranging from arable fields, mown and grazed grasslands to moorland and woodland were employed to test this using randomisations tests and correlation and regression analysis. No traits showed consistent significant convergence or divergence in functional diversity. When correlated to measures of the environment, the most common pattern was for functional diversity to decline (7 out of 12 traits) and the degree of convergence (7 out of 12 traits) to increase as the levels of productivity (measured as primary productivity, soil nitrogen release and vegetation C:N) and disturbance increased. Convergence or a relationship between functional diversity and the environment was not seen for a number of important traits, such as LDMC and SLA, which are considered as key predictors of ecosystem function. The analysis indicates that taking into account functional diversity within a system may be a necessary part of predicting the relationship between plant traits and ecosystem function, and that this may be of particular importance within less productive and less disturbed systems. © 2011 Springer-Verlag.
Resumo:
Pollen grains are microscopic so their identification and quantification has, for decades, depended upon human observers using light microscopes: a labour-intensive approach. Modern improvements in computing and imaging hardware and software now bring automation of pollen analyses within reach. In this paper, we provide the first review in over 15 yr of progress towards automation of the part of palynology concerned with counting and classifying pollen, bringing together literature published from a wide spectrum of sources. We
consider which attempts offer the most potential for an automated palynology system for universal application across all fields of research concerned with pollen classification and counting. We discuss what is required to make the datasets of these automated systems as acceptable as those produced by human palynologists, and present suggestions for how automation will generate novel approaches to counting and classifying pollen that have hitherto been unthinkable.
Resumo:
Background: Tobacco smoke is a major risk to the health of its users and arsenic is among the components of smoke present at concentrations of toxicological concern. There are significant variations in human toxicity between inorganic and organic arsenic species and the aim of this study was to determine whether there are predictable relationships among major arsenic species in tobacco that could be useful for risk assessment.
Methods: 14 samples of tobacco were studied spanning a wide range of concentrations in samples from different geographical regions, including certified reference materials and cigarette products. Inorganic and major organic arsenic species were extracted from powdered tobacco samples by nitric acid using microwave digestion. Concentrations of arsenic species in these extracts were determined using HPLC-ICPMS.
Results: The concentrations of total inorganic arsenic species range from 144 to 3914 mu g kg(-1), while organic species dimethylarsinic acid (DMA) ranges from 21 to 176 mu g As kg(-1), and monomethylarsonic acid (MA) ranges from 30 to 116 mu g kg(-1). The percentage of species eluted compared to the total arsenic extracted ranges from 11.1 to 36.8% suggesting that some As species (possibly macro-molecules, strongly complexed or in organic forms) do not elute from the column. This low percentage of column-speciated arsenic is indicative that more complex forms of arsenic exist in the tobacco. All the analysed species correlate positively with total arsenic concentration over the whole compositional range and regression analysis indicates a consistent ratio of about 4:1 in favour of inorganic arsenic compared with MA + DMA.
Conclusions: The dominance of inorganic arsenic species among those components analysed is a marked feature of the diverse range of tobaccos selected for study. Such consistency is important in the context of a WHO expert panel recommendation to regulate tobacco crops and products using total arsenic concentration. If implemented more research would be required to develop models that accurately predict the smoker's exposure to reduced inorganic arsenic species on the basis of leaf or product concentration and product design features.
Resumo:
Shoeprint evidence collected from crime scenes can play an important role in forensic investigations. Usually, the analysis of shoeprints is carried out manually and is based on human expertise and knowledge. As well as being error prone, such a manual process can also be time consuming; thus affecting the usability and suitability of shoeprint evidence in a court of law. Thus, an automatic system for classification and retrieval of shoeprints has the potential to be a valuable tool. This paper presents a solution for the automatic retrieval of shoeprints which is considerably more robust than existing solutions in the presence of geometric distortions such as scale, rotation and scale distortions. It addresses the issue of classifying partial shoeprints in the presence of rotation, scale and noise distortions and relies on the use of two local point-of-interest detectors whose matching scores are combined. In this work, multiscale Harris and Hessian detectors are used to select corners and blob-like structures in a scale-space representation for scale invariance, while Scale Invariant Feature Transform (SIFT) descriptor is employed to achieve rotation invariance. The proposed technique is based on combining the matching scores of the two detectors at the score level. Our evaluation has shown that it outperforms both detectors in most of our extended experiments when retrieving partial shoeprints with geometric distortions, and is clearly better than similar work published in the literature. We also demonstrate improved performance in the face of wear and tear. As matter of fact, whilst the proposed work outperforms similar algorithms in the literature, it is shown that achieving good retrieval performance is not constrained by acquiring a full print from a scene of crime as a partial print can still be used to attain comparable retrieval results to those of using the full print. This gives crime investigators more flexibility is choosing the parts of a print to search for in a database of footwear.
Resumo:
Multivariate classification techniques have proven to be powerful tools for distinguishing experimental conditions in single sessions of functional magnetic resonance imaging (fMRI) data. But they are vulnerable to a considerable penalty in classification accuracy when applied across sessions or participants, calling into question the degree to which fine-grained encodings are shared across subjects. Here, we introduce joint learning techniques, where feature selection is carried out using a held-out subset of a target dataset, before training a linear classifier on a source dataset. Single trials of functional MRI data from a covert property generation task are classified with regularized regression techniques to predict the semantic class of stimuli. With our selection techniques (joint ranking feature selection (JRFS) and disjoint feature selection (DJFS)), classification performance during cross-session prediction improved greatly, relative to feature selection on the source session data only. Compared with JRFS, DJFS showed significant improvements for cross-participant classification. And when using a groupwise training, DJFS approached the accuracies seen for prediction across different sessions from the same participant. Comparing several feature selection strategies, we found that a simple univariate ANOVA selection technique or a minimal searchlight (one voxel in size) is appropriate, compared with larger searchlights.
Resumo:
Context. The Public European Southern Observatory Spectroscopic Survey of Transient Objects (PESSTO) began as a public spectroscopic survey in April 2012. PESSTO classifies transients from publicly available sources and wide-field surveys, and selects science targets for detailed spectroscopic and photometric follow-up. PESSTO runs for nine months of the year, January - April and August - December inclusive, and typically has allocations of 10 nights per month.
Aims. We describe the data reduction strategy and data products that are publicly available through the ESO archive as the Spectroscopic Survey data release 1 (SSDR1).
Methods. PESSTO uses the New Technology Telescope with the instruments EFOSC2 and SOFI to provide optical and NIR spectroscopy and imaging. We target supernovae and optical transients brighter than 20.5<sup>m</sup> for classification. Science targets are selected for follow-up based on the PESSTO science goal of extending knowledge of the extremes of the supernova population. We use standard EFOSC2 set-ups providing spectra with resolutions of 13-18 Å between 3345-9995 Å. A subset of the brighter science targets are selected for SOFI spectroscopy with the blue and red grisms (0.935-2.53 μm and resolutions 23-33 Å) and imaging with broadband JHK<inf>s</inf> filters.
Results. This first data release (SSDR1) contains flux calibrated spectra from the first year (April 2012-2013). A total of 221 confirmed supernovae were classified, and we released calibrated optical spectra and classifications publicly within 24 h of the data being taken (via WISeREP). The data in SSDR1 replace those released spectra. They have more reliable and quantifiable flux calibrations, correction for telluric absorption, and are made available in standard ESO Phase 3 formats. We estimate the absolute accuracy of the flux calibrations for EFOSC2 across the whole survey in SSDR1 to be typically ∼15%, although a number of spectra will have less reliable absolute flux calibration because of weather and slit losses. Acquisition images for each spectrum are available which, in principle, can allow the user to refine the absolute flux calibration. The standard NIR reduction process does not produce high accuracy absolute spectrophotometry but synthetic photometry with accompanying JHK<inf>s</inf> imaging can improve this. Whenever possible, reduced SOFI images are provided to allow this.
Conclusions. Future data releases will focus on improving the automated flux calibration of the data products. The rapid turnaround between discovery and classification and access to reliable pipeline processed data products has allowed early science papers in the first few months of the survey.