122 resultados para data sheets
Resumo:
Background and aims Recent studies have adopted a broad definition of Sapindaceae that includes taxa traditionally placed in Aceraceae and Hippocastanaceae, achieving monophyly but yielding a family difficult to characterize and for which no obvious morphological synapomorphy exists. This expanded circumscription was necessitated by the finding that the monotypic, temperate Asian genus Xanthoceras, historically placed in Sapindaceae tribe Harpullieae, is basal within the group. Here we seek to clarify the relationships of Xanthoceras based on phylogenetic analyses using a dataset encompassing nearly 3/4 of sapindaceous genera, comparing the results with information from morphology and biogeography, in particular with respect to the other taxa placed in Harpullieae. We then re-examine the appropriateness of maintaining the current broad, morphologically heterogeneous definition of Sapindaceae and explore the advantages of an alternative family circumscription. Methods Using 243 samples representing 104 of the 142 currently recognized genera of Sapindaceae s. lat. (including all in Harpullieae), sequence data were analyzed for nuclear (ITS) and plastid (matK, rpoB, trnD-trnT, trnK-matK, trnL-trnF and trnS-trnG) markers, adopting the methodology of a recent family-wide study, performing single-gene and total evidence analyses based on maximum likelihood (ML) and maximum parsimony (MP) criteria, and applying heuristic searches developed for large datasets, viz, a new strategy implemented in RAxML (for ML) and the parsimony ratchet (for MP). Bootstrap analyses were performed for each method to test for congruence between markers. Key results Our findings support earlier suggestions that Harpullieae are polyphyletic: Xanthoceras is confirmed as sister to all other sampled taxa of Sapindaceae s. lat.; the remaining members belong to three other clades within Sapindaceae s. lat., two of which correspond respectively to the groups traditionally treated as Aceraceae and Hippocastanaceae, together forming a clade sister to the largely tropical Sapindaceae s. str., which is monophyletic and morphologically coherent provided Xanthoceras is excluded. Conclusion To overcome the difficulties of a broadly circumscribed Sapindaceae, we resurrect the historically recognized temperate families Aceraceae and Hippocastanaceae, and describe a new family, Xanthoceraceae, thus adopting a monophyletic and easily characterized circumscription of Sapindaceae nearly identical to that used for over a century.
Resumo:
To test hypotheses about the universality of personality traits, college students in 50 cultures identified an adult or college-aged man or woman whom they knew well and rated the 11,985 targets using the 3rd-person version of the Revised NEO Personality Inventory. Factor analyses within cultures showed that the normative American self-report structure was clearly replicated in most cultures and was recognizable in all. Sex differences replicated earlier self-report results, with the most pronounced differences in Western cultures. Cross-sectional age differences for 3 factors followed the pattern identified in self-reports, with moderate rates of change during college age and slower changes after age 40. With a few exceptions, these data support the hypothesis that features of personality traits are common to all human groups.
Resumo:
Analyzing functional data often leads to finding common factors, for which functional principal component analysis proves to be a useful tool to summarize and characterize the random variation in a function space. The representation in terms of eigenfunctions is optimal in the sense of L-2 approximation. However, the eigenfunctions are not always directed towards an interesting and interpretable direction in the context of functional data and thus could obscure the underlying structure. To overcome such difficulty, an alternative to functional principal component analysis is proposed that produces directed components which may be more informative and easier to interpret. These structural components are similar to principal components, but are adapted to situations in which the domain of the function may be decomposed into disjoint intervals such that there is effectively independence between intervals and positive correlation within intervals. The approach is demonstrated with synthetic examples as well as real data. Properties for special cases are also studied.
Resumo:
Imaging mass spectrometry (IMS) represents an innovative tool in the cancer research pipeline, which is increasingly being used in clinical and pharmaceutical applications. The unique properties of the technique, especially the amount of data generated, make the handling of data from multiple IMS acquisitions challenging. This work presents a histology-driven IMS approach aiming to identify discriminant lipid signatures from the simultaneous mining of IMS data sets from multiple samples. The feasibility of the developed workflow is evaluated on a set of three human colorectal cancer liver metastasis (CRCLM) tissue sections. Lipid IMS on tissue sections was performed using MALDI-TOF/TOF MS in both negative and positive ionization modes after 1,5-diaminonaphthalene matrix deposition by sublimation. The combination of both positive and negative acquisition results was performed during data mining to simplify the process and interrogate a larger lipidome into a single analysis. To reduce the complexity of the IMS data sets, a sub data set was generated by randomly selecting a fixed number of spectra from a histologically defined region of interest, resulting in a 10-fold data reduction. Principal component analysis confirmed that the molecular selectivity of the regions of interest is maintained after data reduction. Partial least-squares and heat map analyses demonstrated a selective signature of the CRCLM, revealing lipids that are significantly up- and down-regulated in the tumor region. This comprehensive approach is thus of interest for defining disease signatures directly from IMS data sets by the use of combinatory data mining, opening novel routes of investigation for addressing the demands of the clinical setting.
Resumo:
This is one of the few studies that have explored the value of baseline symptoms and health-related quality of life (HRQOL) in predicting survival in brain cancer patients. Baseline HRQOL scores (from the EORTC QLQ-C30 and the Brain Cancer Module (BN 20)) were examined in 490 newly diagnosed glioblastoma cancer patients for the relationship with overall survival by using Cox proportional hazards regression models. Refined techniques as the bootstrap re-sampling procedure and the computation of C-indexes and R(2)-coefficients were used to try and validate the model. Classical analysis controlled for major clinical prognostic factors selected cognitive functioning (P=0.0001), global health status (P=0.0055) and social functioning (P<0.0001) as statistically significant prognostic factors of survival. However, several issues question the validity of these findings. C-indexes and R(2)-coefficients, which are measures of the predictive ability of the models, did not exhibit major improvements when adding selected or all HRQOL scores to clinical factors. While classical techniques lead to positive results, more refined analyses suggest that baseline HRQOL scores add relatively little to clinical factors to predict survival. These results may have implications for future use of HRQOL as a prognostic factor in cancer patients.
Resumo:
Given the very large amount of data obtained everyday through population surveys, much of the new research again could use this information instead of collecting new samples. Unfortunately, relevant data are often disseminated into different files obtained through different sampling designs. Data fusion is a set of methods used to combine information from different sources into a single dataset. In this article, we are interested in a specific problem: the fusion of two data files, one of which being quite small. We propose a model-based procedure combining a logistic regression with an Expectation-Maximization algorithm. Results show that despite the lack of data, this procedure can perform better than standard matching procedures.
Resumo:
Despite their limited proliferation capacity, regulatory T cells (T(regs)) constitute a population maintained over the entire lifetime of a human organism. The means by which T(regs) sustain a stable pool in vivo are controversial. Using a mathematical model, we address this issue by evaluating several biological scenarios of the origins and the proliferation capacity of two subsets of T(regs): precursor CD4(+)CD25(+)CD45RO(-) and mature CD4(+)CD25(+)CD45RO(+) cells. The lifelong dynamics of T(regs) are described by a set of ordinary differential equations, driven by a stochastic process representing the major immune reactions involving these cells. The model dynamics are validated using data from human donors of different ages. Analysis of the data led to the identification of two properties of the dynamics: (1) the equilibrium in the CD4(+)CD25(+)FoxP3(+)T(regs) population is maintained over both precursor and mature T(regs) pools together, and (2) the ratio between precursor and mature T(regs) is inverted in the early years of adulthood. Then, using the model, we identified three biologically relevant scenarios that have the above properties: (1) the unique source of mature T(regs) is the antigen-driven differentiation of precursors that acquire the mature profile in the periphery and the proliferation of T(regs) is essential for the development and the maintenance of the pool; there exist other sources of mature T(regs), such as (2) a homeostatic density-dependent regulation or (3) thymus- or effector-derived T(regs), and in both cases, antigen-induced proliferation is not necessary for the development of a stable pool of T(regs). This is the first time that a mathematical model built to describe the in vivo dynamics of regulatory T cells is validated using human data. The application of this model provides an invaluable tool in estimating the amount of regulatory T cells as a function of time in the blood of patients that received a solid organ transplant or are suffering from an autoimmune disease.
Resumo:
Background Multiple logistic regression is precluded from many practical applications in ecology that aim to predict the geographic distributions of species because it requires absence data, which are rarely available or are unreliable. In order to use multiple logistic regression, many studies have simulated "pseudo-absences" through a number of strategies, but it is unknown how the choice of strategy influences models and their geographic predictions of species. In this paper we evaluate the effect of several prevailing pseudo-absence strategies on the predictions of the geographic distribution of a virtual species whose "true" distribution and relationship to three environmental predictors was predefined. We evaluated the effect of using a) real absences b) pseudo-absences selected randomly from the background and c) two-step approaches: pseudo-absences selected from low suitability areas predicted by either Ecological Niche Factor Analysis: (ENFA) or BIOCLIM. We compared how the choice of pseudo-absence strategy affected model fit, predictive power, and information-theoretic model selection results. Results Models built with true absences had the best predictive power, best discriminatory power, and the "true" model (the one that contained the correct predictors) was supported by the data according to AIC, as expected. Models based on random pseudo-absences had among the lowest fit, but yielded the second highest AUC value (0.97), and the "true" model was also supported by the data. Models based on two-step approaches had intermediate fit, the lowest predictive power, and the "true" model was not supported by the data. Conclusion If ecologists wish to build parsimonious GLM models that will allow them to make robust predictions, a reasonable approach is to use a large number of randomly selected pseudo-absences, and perform model selection based on an information theoretic approach. However, the resulting models can be expected to have limited fit.
Resumo:
It has been demonstrated in earlier studies that patients with a cochlear implant have increased abilities for audio-visual integration because the crude information transmitted by the cochlear implant requires the persistent use of the complementary speech information from the visual channel. The brain network for these abilities needs to be clarified. We used an independent components analysis (ICA) of the activation (H2 (15) O) positron emission tomography data to explore occipito-temporal brain activity in post-lingually deaf patients with unilaterally implanted cochlear implants at several months post-implantation (T1), shortly after implantation (T0) and in normal hearing controls. In between-group analysis, patients at T1 had greater blood flow in the left middle temporal cortex as compared with T0 and normal hearing controls. In within-group analysis, patients at T0 had a task-related ICA component in the visual cortex, and patients at T1 had one task-related ICA component in the left middle temporal cortex and the other in the visual cortex. The time courses of temporal and visual activities during the positron emission tomography examination at T1 were highly correlated, meaning that synchronized integrative activity occurred. The greater involvement of the visual cortex and its close coupling with the temporal cortex at T1 confirm the importance of audio-visual integration in more experienced cochlear implant subjects at the cortical level.
Resumo:
Spatial data on species distributions are available in two main forms, point locations and distribution maps (polygon ranges and grids). The first are often temporally and spatially biased, and too discontinuous, to be useful (untransformed) in spatial analyses. A variety of modelling approaches are used to transform point locations into maps. We discuss the attributes that point location data and distribution maps must satisfy in order to be useful in conservation planning. We recommend that before point location data are used to produce and/or evaluate distribution models, the dataset should be assessed under a set of criteria, including sample size, age of data, environmental/geographical coverage, independence, accuracy, time relevance and (often forgotten) representation of areas of permanent and natural presence of the species. Distribution maps must satisfy additional attributes if used for conservation analyses and strategies, including minimizing commission and omission errors, credibility of the source/assessors and availability for public screening. We review currently available databases for mammals globally and show that they are highly variable in complying with these attributes. The heterogeneity and weakness of spatial data seriously constrain their utility to global and also sub-global scale conservation analyses.
Resumo:
Position du problème: La mise en place de la tarification à l'activité pour les hôpitaux de court séjour pourrait entraîner une diminution des durées de séjour pour raisons financières. L'impact potentiel de ce phénomène sur la qualité des soins n'est pas connu. Les réadmissions identifiées à l'aide des données administratives hospitalières sont, pour certaines situations cliniques, des indicateurs de qualité des soins valides. Méthode: Étude rétrospective du lien entre la durée de séjour et la survenue de réadmissions imprévues liées au séjour initial, pour les cholécystectomies simples et les accouchements par voie basse sans complication, à partir des données du programme de médicalisation des systèmes d'information de l'Assistance publique-Hôpitaux de Paris des années 2002 à 2005. Résultats: Pour les deux procédures, la probabilité de réadmission suit une courbe en " J ". Après ajustement sur l'âge, le sexe, les comorbidités associées, l'hôpital et l'année d'admission, la probabilité de réadmission est plus élevée pour les durées de séjour les plus courtes : pour les cholécystectomies, odds ratio : 6,03 [IC95 % : 2,67-13,59] pour les hospitalisations d'un jour versus trois jours ; pour les accouchements, odds ratio : 1,74 [IC95 % : 1,05-2,91] pour les hospitalisations de deux jours versus trois jours. Conclusion: Pour deux pathologies communes, les durées de séjour les plus courtes sont associées à des probabilités de réadmission plus élevées. L'utilisation routinière des données du programme de médicalisation des systèmes d'information peut permettre d'assurer le suivi de la relation entre la réduction de la durée de séjour et les réadmissions. The prospective payment system for the French short-stay hospitals creates a financial incentive to reduce length of stay. The potential impact of the resulting decrease in length of stay on the quality of healthcare is unknown. Readmission rates are valid outcome indicators for some clinical procedures. Methods: Retrospective study of the association between length of stay and unplanned readmissions related to the initial stay, for two procedures: cholecystectomy and vaginal delivery. Data: Administrative diagnosis-related groups database of "Assistance publique-Hopitaux de Paris", a large teaching hospital, for years 2002 to 2005. Results: The risk of readmission according to length of stay, taking age, sex, comorbidity, hospital and year of admission into account, followed a J-shaped curve for both procedures. The probability of readmission was higher for very short stays, with odds ratios and 95% confidence intervals of 6.03 [2.67-13.59] for cholecystectomies (1- versus 3-night stays), and of 1.74 [1.05-2.91] for vaginal deliveries (2- versus 3-night stays). Conclusion: For both procedures, the shortest lengths of stay are associated with a higher readmission probability. Suitable indicators derived from administrative databases would enable monitoring of the association between length of stay and readmissions.