908 resultados para classification and regression tree


Relevância:

100.00% 100.00%

Publicador:

Resumo:

In modern farm systems the economic interests make reducing the risks related to transport practice an important goal. An increasing attention is directed to the welfare of animals in transit, also considering the new existing facilities. In recent years the results coming from the study of animal farm behaviour were used as tool to assess the welfare. In this thesis were analyzed behavioural patterns, jointly with blood variables, to evaluate the stress response of piglets and young bulls during transport. Since the animal behaviour could be different between individuals and these differences can affect animal responses to aversive situations, the individual behavioural characteristics were taken in account. Regarding young bulls, selected to genetic evaluation, the individual behaviour was investigated before, during and after transport, while for piglets was adopted a tested methodology classification and behavioural tests to observe their coping characteristics. The aim of this thesis was to analyse the behavioural and physiological response of young bulls and piglets to transport practice and to investigate if coping characteristics may affect how piglets cope with aversive situations. The thesis is composed by four experimental studies. The first one aims to identify the best existent methodology classification of piglets coping style between those that were credited in literature. The second one investigated the differences in response to novel situations of piglets with different coping styles. The last studies evaluated the stress response of piglets and young bulls to road transportation. The results obtained show that transport did not affect the behaviour and homeostasis of young animals which respond in a different way from adults. However the understanding of individual behavioural characteristic and the use of behavioural patterns, in addition to blood analyses, need to be more investigated in order to be useful tools to assess the animal response in aversive situation.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Advances in biomedical signal acquisition systems for motion analysis have led to lowcost and ubiquitous wearable sensors which can be used to record movement data in different settings. This implies the potential availability of large amounts of quantitative data. It is then crucial to identify and to extract the information of clinical relevance from the large amount of available data. This quantitative and objective information can be an important aid for clinical decision making. Data mining is the process of discovering such information in databases through data processing, selection of informative data, and identification of relevant patterns. The databases considered in this thesis store motion data from wearable sensors (specifically accelerometers) and clinical information (clinical data, scores, tests). The main goal of this thesis is to develop data mining tools which can provide quantitative information to the clinician in the field of movement disorders. This thesis will focus on motor impairment in Parkinson's disease (PD). Different databases related to Parkinson subjects in different stages of the disease were considered for this thesis. Each database is characterized by the data recorded during a specific motor task performed by different groups of subjects. The data mining techniques that were used in this thesis are feature selection (a technique which was used to find relevant information and to discard useless or redundant data), classification, clustering, and regression. The aims were to identify high risk subjects for PD, characterize the differences between early PD subjects and healthy ones, characterize PD subtypes and automatically assess the severity of symptoms in the home setting.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Atmospheric aerosol particles directly impact air quality and participate in controlling the climate system. Organic Aerosol (OA) in general accounts for a large fraction (10–90%) of the global submicron (PM1) particulate mass. Chemometric methods for source identification are used in many disciplines, but methods relying on the analysis of NMR datasets are rarely used in atmospheric sciences. This thesis provides an original application of NMR-based chemometric methods to atmospheric OA source apportionment. The method was tested on chemical composition databases obtained from samples collected at different environments in Europe, hence exploring the impact of a great diversity of natural and anthropogenic sources. We focused on sources of water-soluble OA (WSOA), for which NMR analysis provides substantial advantages compared to alternative methods. Different factor analysis techniques are applied independently to NMR datasets from nine field campaigns of the project EUCAARI and allowed the identification of recurrent source contributions to WSOA in European background troposphere: 1) Marine SOA; 2) Aliphatic amines from ground sources (agricultural activities, etc.); 3) Biomass burning POA; 4) Biogenic SOA from terpene oxidation; 5) “Aged” SOAs, including humic-like substances (HULIS); 6) Other factors possibly including contributions from Primary Biological Aerosol Particles, and products of cooking activities. Biomass burning POA accounted for more than 50% of WSOC in winter months. Aged SOA associated with HULIS was predominant (> 75%) during the spring-summer, suggesting that secondary sources and transboundary transport become more important in spring and summer. Complex aerosol measurements carried out, involving several foreign research groups, provided the opportunity to compare source apportionment results obtained by NMR analysis with those provided by more widespread Aerodyne aerosol mass spectrometers (AMS) techniques that now provided categorization schemes of OA which are becoming a standard for atmospheric chemists. Results emerging from this thesis partly confirm AMS classification and partly challenge it.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Bioinformatics, in the last few decades, has played a fundamental role to give sense to the huge amount of data produced. Obtained the complete sequence of a genome, the major problem of knowing as much as possible of its coding regions, is crucial. Protein sequence annotation is challenging and, due to the size of the problem, only computational approaches can provide a feasible solution. As it has been recently pointed out by the Critical Assessment of Function Annotations (CAFA), most accurate methods are those based on the transfer-by-homology approach and the most incisive contribution is given by cross-genome comparisons. In the present thesis it is described a non-hierarchical sequence clustering method for protein automatic large-scale annotation, called “The Bologna Annotation Resource Plus” (BAR+). The method is based on an all-against-all alignment of more than 13 millions protein sequences characterized by a very stringent metric. BAR+ can safely transfer functional features (Gene Ontology and Pfam terms) inside clusters by means of a statistical validation, even in the case of multi-domain proteins. Within BAR+ clusters it is also possible to transfer the three dimensional structure (when a template is available). This is possible by the way of cluster-specific HMM profiles that can be used to calculate reliable template-to-target alignments even in the case of distantly related proteins (sequence identity < 30%). Other BAR+ based applications have been developed during my doctorate including the prediction of Magnesium binding sites in human proteins, the ABC transporters superfamily classification and the functional prediction (GO terms) of the CAFA targets. Remarkably, in the CAFA assessment, BAR+ placed among the ten most accurate methods. At present, as a web server for the functional and structural protein sequence annotation, BAR+ is freely available at http://bar.biocomp.unibo.it/bar2.0.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

In many plant species, the genetic template of early life-stages is formed by animal-mediated pollination and seed dispersal and has profound impact on further recruitment and population dynamics. Understanding the impact of pollination and seed dispersal on genetic patterns is a central issue in plant population biology. In my thesis, I investigated (i) contemporary dispersal and gene flow distances as well as (ii) genetic diversity and spatial genetic structure (SGS) across subsequent recruitment stages in a population of the animal-pollinated and dispersed tree Prunus africana in Kakamega Forest, West Kenya. Using microsatellite markers and parentage analyses, I inferred distances of pollen dispersal (father-to-mother), seed dispersal/maternal gene flow (mother-to-offspring) as well as paternal gene flow (father-to-offspring) for four early life stages of the species (seeds and fruits, current year seedlings, seedlings ≤ 3yr, seedlings > 3yr). Distances of pollen and seed dispersal as well as paternal gene flow were significantly shorter than expected from the spatial arrangement of trees and sampling plots. They were not affected by the density of conspecific trees in the surrounding. At the propagule stage, mean pollen dispersal distances were considerably (23-fold) longer than seed dispersal distances, and paternal gene flow distances exceeded maternal gene flow by a factor of 25. Seed dispersal distances were remarkably restricted, potentially leading to a strong initial SGS. The initial genetic template created by pollination and seed dispersal was extensively altered during later recruitment stages. Potential Janzen-Connell effects led to markedly increasing distances between offspring and both parental trees in older life stages. This showed that distance and density-dependent mortality factors are not exclusively related to the mother tree, but also to the father. Across subsequent recruitment stages, the pollen to seed dispersal ratio and the paternal to maternal gene flow ratio dropped to 2.1 and 3.4, respectively, in seedlings > 3yr. The relative changes in effective pollen dispersal, seed dispersal, and paternal gene flow distances across recruitment stages elucidate the mechanisms affecting the contribution of the two processes pollen and seed dispersal to overall gene flow. Using the same six microsatellite loci, I analyzed genetic diversity and SGS across five life stages, from seed rain to adults. Levels of genetic diversity within the studied P. africana population were comparable to other Prunus species and did not vary across life stages. In congruence with the short seed dispersal distances, I found significant SGS in all life stages. SGS decreased from seed and early seedling stages to older juvenile stages, and it was higher in adults than in late juveniles of the next generation. A comparison of the data with direct assessments of contemporary gene flow patterns indicate that distance- or density-dependent mortality, potentially due to Janzen-Connell effects, led to the initial decrease in SGS. Intergeneration variation in SGS could have been driven by variation in demographic processes, the effect of overlapping generations, and local selection processes. Overall, my study showed that complex sequential processes during recruitment contribute to the spatial genetic structure of tree populations. It highlights the importance of a multistage perspective for a comprehensive understanding of the impact of animal-mediated pollen and seed dispersal on spatial population dynamics and genetic patterns of trees.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The study defines a new farm classification and identifies the arable land management. These aspects and several indicators are taken into account to estimate the sustainability level of farms, for organic and conventional regimes. The data source is Italian Farm Account Data Network (RICA) for years 2007-2011, which samples structural and economical information. An environmental data has been added to the previous one to better describe the farm context. The new farm classification describes holding by general informations and farm structure. The general information are: adopted regime and farm location in terms of administrative region, slope and phyto-climatic zone. The farm structures describe the presence of main productive processes and land covers, which are recorded by FADN database. The farms, grouped by homogeneous farm structure or farm typology, are evaluated in terms of sustainability. The farm model MAD has been used to estimate a list of indicators. They describe especially environmental and economical areas of sustainability. Finally arable lands are taken into account to identify arable land managements and crop rotations. Each arable land has been classified by crop pattern. Then crop rotation management has been analysed by spatial and temporal approaches. The analysis reports a high variability inside regimes. The farm structure influences indicators level more than regimes, and it is not always possible to compare the two regimes. However some differences between organic and conventional agriculture have been found. Organic farm structures report different frequency and geographical location than conventional ones. Also different connections among arable lands and farm structures have been identified.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The use of linear programming in various areas has increased with the significant improvement of specialized solvers. Linear programs are used as such to model practical problems, or as subroutines in algorithms such as formal proofs or branch-and-cut frameworks. In many situations a certified answer is needed, for example the guarantee that the linear program is feasible or infeasible, or a provably safe bound on its objective value. Most of the available solvers work with floating-point arithmetic and are thus subject to its shortcomings such as rounding errors or underflow, therefore they can deliver incorrect answers. While adequate for some applications, this is unacceptable for critical applications like flight controlling or nuclear plant management due to the potential catastrophic consequences. We propose a method that gives a certified answer whether a linear program is feasible or infeasible, or returns unknown'. The advantage of our method is that it is reasonably fast and rarely answers unknown'. It works by computing a safe solution that is in some way the best possible in the relative interior of the feasible set. To certify the relative interior, we employ exact arithmetic, whose use is nevertheless limited in general to critical places, allowing us to rnremain computationally efficient. Moreover, when certain conditions are fulfilled, our method is able to deliver a provable bound on the objective value of the linear program. We test our algorithm on typical benchmark sets and obtain higher rates of success compared to previous approaches for this problem, while keeping the running times acceptably small. The computed objective value bounds are in most of the cases very close to the known exact objective values. We prove the usability of the method we developed by additionally employing a variant of it in a different scenario, namely to improve the results of a Satisfiability Modulo Theories solver. Our method is used as a black box in the nodes of a branch-and-bound tree to implement conflict learning based on the certificate of infeasibility for linear programs consisting of subsets of linear constraints. The generated conflict clauses are in general small and give good rnprospects for reducing the search space. Compared to other methods we obtain significant improvements in the running time, especially on the large instances.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Purpose To evaluate geriatric assessment (GA) domains in relation to clinically important outcomes in older breast cancer survivors. Methods Six hundred sixty women diagnosed with primary breast cancer in four US geographic regions (Los Angeles, CA; Minnesota; North Carolina; and Rhode Island) were selected with disease stage I to IIIA, age ≥ 65 years at date of diagnosis, and permission from attending physician to contact. Data were collected over 7 years of follow-up from consenting patients' medical records, telephone interviews, physician questionnaires, and the National Death Index. Outcomes included self-reported treatment tolerance and all-cause mortality. Four GA domains were described by six individual measures, as follows: sociodemographic by adequate finances; clinical by Charlson comorbidity index (CCI) and body mass index; function by number of physical function limitations; and psychosocial by the five-item Mental Health Index (MHI5) and Medical Outcomes Study Social Support Survey (MOS-SSS). Associations were evaluated using t tests, χ2 tests, and regression analyses. Results In multivariable regression including age and stage, three measures from two domains (clinical and psychosocial) were associated with poor treatment tolerance; these were CCI ≥ 1 (odds ratio [OR] = 2.49; 95% CI, 1.18 to 5.25), MHI5 score less than 80 (OR = 2.36; 95% CI, 1.15 to 4.86), and MOS-SSS score less than 80 (OR = 3.32; 95% CI, 1.44 to 7.66). Four measures representing all four GA domains predicted mortality; these were inadequate finances (hazard ratio [HR] = 1.89; 95% CI, 1.24 to 2.88; CCI ≥ 1 (HR = 1.38; 95% CI, 1.01 to 1.88), functional limitation (HR = 1.40; 95% CI, 1.01 to 1.93), and MHI5 score less than 80 (HR = 1.34; 95% CI, 1.01 to 1.85). In addition, the proportion of women with these outcomes incrementally increased as the number of GA deficits increased. Conclusion This study provides longitudinal evidence that GA domains are associated with poor treatment tolerance and predict mortality at 7 years of follow-up, independent of age and stage of disease.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This paper describes informatics for cross-sample analysis with comprehensive two-dimensional gas chromatography (GCxGC) and high-resolution mass spectrometry (HRMS). GCxGC-HRMS analysis produces large data sets that are rich with information, but highly complex. The size of the data and volume of information requires automated processing for comprehensive cross-sample analysis, but the complexity poses a challenge for developing robust methods. The approach developed here analyzes GCxGC-HRMS data from multiple samples to extract a feature template that comprehensively captures the pattern of peaks detected in the retention-times plane. Then, for each sample chromatogram, the template is geometrically transformed to align with the detected peak pattern and generate a set of feature measurements for cross-sample analyses such as sample classification and biomarker discovery. The approach avoids the intractable problem of comprehensive peak matching by using a few reliable peaks for alignment and peak-based retention-plane windows to define comprehensive features that can be reliably matched for cross-sample analysis. The informatics are demonstrated with a set of 18 samples from breast-cancer tumors, each from different individuals, six each for Grades 1-3. The features allow classification that matches grading by a cancer pathologist with 78% success in leave-one-out cross-validation experiments. The HRMS signatures of the features of interest can be examined for determining elemental compositions and identifying compounds.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The objective of this study was to evaluate the association between nasolabial symmetry and aesthetics in children with complete unilateral cleft lip and palate (CUCLP). Frontal and basal photographs of 60 consecutively treated children with CUCLP (cleft group: 41 boys and 19 girls, mean (SD) age 11 (2) years) and 44 children without clefts (control group: 16 boys and 28 girls, mean (SD) age 11(2) years), were used for evaluation of nasolabial symmetry and aesthetics. Nasal and labial measurements were made to calculate the coefficient of asymmetry (CA). The 5-grade aesthetic index described by Asher-McDade et al. was used to evaluate nasolabial appearance. Correlation and regression analysis were used to identify an association between aesthetics and CA, sex, and the presence of CUCLP. Ten measurements in the cleft, and 2 in the control, group differed significantly between the cleft and non-cleft (or right and left) sides, respectively. The significantly higher values of 9 of 11 CA in the children with CUCLP indicated that they had more asymmetrical nasolabial areas than children without clefts. However, the regression analyses showed that only a few CA were associated with nasolabial aesthetics. In conclusion, nasolabial aesthetics and nasolabial symmetry seem to be only weakly associated in patients with CUCLP.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

PURPOSE: Neoadjuvant treatment is an accepted standard approach for treating locally advanced esophago-gastric adenocarcinomas. Despite a response of the primary tumor, a significant percentage dies from tumor recurrence. The aim of this retrospective exploratory study from two academic centers was to identify predictors of survival and recurrence in histopathologically responding patients. METHODS: Two hundred thirty one patients with adenocarcinomas (esophagus: n = 185, stomach: n = 46, cT3/4, cN0/+, cM0) treated with preoperative chemotherapy (n = 212) or chemoradiotherapy (n = 19) followed by resection achieved a histopathological response (regression 1a: no residual tumor (n = 58), and regression 1b < 10 % residual tumor (n = 173)). RESULTS: The estimated median overall survival was 92.4 months (5-year survival, 56.6 %) for all patients. For patients with regression 1a, median survival is not reached (5-year survival, 71.6 %) compared to patients with regression 1b with 75.3 months median (5-year survival, 52.2 %) (p = 0.031). Patients with a regression 1a had lymph node metastases in 19.0 versus 33.7 % in regression 1b. The ypT-category (p < 0.001), the M-category (p = 0.005), and the type of treatment (p = 0.04) were found to be independent prognostic factors in R0-resected patients. The recurrence rate was 31.7 % (n = 66) (local, 39.4 %; peritoneal carcinomatosis, 25.7 %; distant metastases, 50 %). Recurrence was predicted by female gender (p = 0.013), ypT-category (p = 0.007), and M-category (p = 0.003) in multivariate analysis. CONCLUSION: Response of the primary tumor does not guarantee recurrence-free long-term survival, but histopathological complete responders have better prognosis compared to partial responders. Established prognostic factors strongly influence the outcome, which could, in the future, be used for stratification of adjuvant treatment approaches. Increasing the rate of histopathological complete responders is a valid endpoint for future clinical trials investigating new drugs.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

A number of state-level pharmaceutical assistance programs have been established as a result of the growing recognition of the role of pharmaceuticals in the long-term care of the elderly. However, existing research does not provide a coherent expectation for patterns of use by rural and urban elderly. The data for this analysis are drawn from a larger study of the Pennsylvania Pharmaceutical Assistance Contract for the Elderly (PACE). PACE provides prescription medicines for elderly who meet income requirements. The research project was designed to assess the characteristics of PACE program participants and non-participants on a wide range of issues. Chi-square analysis and regression models were used to assess the association between rural and urban residence and access to the PACE Program. The results indicate that rural/urban status of the elderly is not a significant predictor of the use of PACE. Other traditional variables (e.g., health self-rating and physician visits) did predict difference in the pattern of use.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

BACKGROUND AND OBJECTIVE: Sleep disturbances are prevalent but often overlooked or underestimated. We suspected that sleep disorders might be particularly common among pharmacy customers, and that they could benefit from counselling. Therefore, we described the prevalence and severity of symptoms associated with sleep and wakefulness disorders among Swiss pharmacy customers, and estimated the need for counselling and treatment. METHODS: In 804 Swiss pharmacies (49% of all community pharmacies) clients were invited to complete the Stanford Sleep Disorders Questionnaire (SDQ), and the Epworth Sleepiness Scale (EPW). The SDQ was designed to classify symptoms of sleep and wakefulness into the four most prevalent disorders: sleep apnoea syndrome (SAS), insomnia in psychiatric disorders (PSY), periodic leg movement disorders/restless legs (RLS) and narcolepsy (NAR). Data were entered into an internet-linked database for analysis by an expert system as a basis for immediate counselling by the pharmacist. RESULTS: Of 4901 participants, 3238 (66.1%) were female, and 1663 (33.9%) were male. The mean age (SD) of females and males was 52.4 (18.05), and 55.1 (17.10) years, respectively. The percentages of female and male individuals above cut-off of SDQ subscales were 11.4% and 19.8% for sleep apnoea, 40.9% and 38.7% for psychiatric sleep disorders, 59.3% and 46.8% for restless legs, and 10.4% and 9.4% for narcolepsy respectively. The prevalence of an Epworth Sleepiness Scale score >11 was 16.5% in females, and 23.9% in males. Reliability assessed by Cronbach's alpha was 0.65 to 0.78 for SDQ subscales, and for the Epworth score. CONCLUSIONS: Symptoms of sleep and wakefulness disorders among Swiss pharmacy customers were highly prevalent. The SDQ and the Epworth Sleepiness Scale score had a satisfactory reliability to be useful for identification of pharmacy customers who might benefit from information and counselling while visiting pharmacies. The internet-based system proved to be a helpful tool for the pharmacist when counselling his customers in terms of diagnostic classification and severity of symptoms associated with the sleeping and waking state.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

In many applications the observed data can be viewed as a censored high dimensional full data random variable X. By the curve of dimensionality it is typically not possible to construct estimators that are asymptotically efficient at every probability distribution in a semiparametric censored data model of such a high dimensional censored data structure. We provide a general method for construction of one-step estimators that are efficient at a chosen submodel of the full-data model, are still well behaved off this submodel and can be chosen to always improve on a given initial estimator. These one-step estimators rely on good estimators of the censoring mechanism and thus will require a parametric or semiparametric model for the censoring mechanism. We present a general theorem that provides a template for proving the desired asymptotic results. We illustrate the general one-step estimation methods by constructing locally efficient one-step estimators of marginal distributions and regression parameters with right-censored data, current status data and bivariate right-censored data, in all models allowing the presence of time-dependent covariates. The conditions of the asymptotics theorem are rigorously verified in one of the examples and the key condition of the general theorem is verified for all examples.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

BACKGROUND: Duplications and deletions in the human genome can cause disease or predispose persons to disease. Advances in technologies to detect these changes allow for the routine identification of submicroscopic imbalances in large numbers of patients. METHODS: We tested for the presence of microdeletions and microduplications at a specific region of chromosome 1q21.1 in two groups of patients with unexplained mental retardation, autism, or congenital anomalies and in unaffected persons. RESULTS: We identified 25 persons with a recurrent 1.35-Mb deletion within 1q21.1 from screening 5218 patients. The microdeletions had arisen de novo in eight patients, were inherited from a mildly affected parent in three patients, were inherited from an apparently unaffected parent in six patients, and were of unknown inheritance in eight patients. The deletion was absent in a series of 4737 control persons (P=1.1x10(-7)). We found considerable variability in the level of phenotypic expression of the microdeletion; phenotypes included mild-to-moderate mental retardation, microcephaly, cardiac abnormalities, and cataracts. The reciprocal duplication was enriched in nine children with mental retardation or autism spectrum disorder and other variable features (P=0.02). We identified three deletions and three duplications of the 1q21.1 region in an independent sample of 788 patients with mental retardation and congenital anomalies. CONCLUSIONS: We have identified recurrent molecular lesions that elude syndromic classification and whose disease manifestations must be considered in a broader context of development as opposed to being assigned to a specific disease. Clinical diagnosis in patients with these lesions may be most readily achieved on the basis of genotype rather than phenotype.