963 resultados para STATISTICAL MODELS
Resumo:
This article examines the different influences that Catholicism and Protestantism exert on economically relevant values. It argues that Catholic theology and practice facilitate personal transactions while Protestantism favors values and types of moral and legal enforcement better adapted for impersonal trade. Protestantism may thus be more conducive to economic growth through anonymous exchange while Catholicism may provide better support for personal contracting. Several components of this hypothesis are confirmed using statistical models with data from the 1998 ISSP international survey on religion. These show that Protestants are more trusting of anonymous counter parties, develop more reliable institutions for legal enforcement and are more willing to spend resources on monitoring and punishing other members of the community. Catholicism is more protective of the family and small-group relationships, and provides more tolerant and less motivating beliefs. Relatively smaller and less consistent differences appear in terms of worldly personal success and incentives.
Resumo:
OBJECTIVE To estimate the prevalence of self-reported constipation and associated factors in the general population of a Brazilian city. METHOD Secondary analysis of an epidemiological study, population-based, cross-sectional study, about bowel habits of Brazilian population. A total of 2,162 individuals were interviewed using two instruments: sociodemographic data and the adapted and validated Brazilian version of the "Bowel Function in the Community" tool. RESULTS There was a prevalence of 25.2% for the self-reported constipation, 37.2% among women and 10.2% among men. Stroke and old age were associated with constipation in the three statistical models used. CONCLUSION The prevalence found showed to be similar to the findings in the literature, although some associated factors obtained here have never been investigated.
Resumo:
The educational system in Spain is undergoing a reorganization. At present, high-school graduates who want to enroll at a public university must take a set of examinations Pruebas de Aptitud para el Acceso a la Universidad (PAAU). A "new formula" (components, weights, type of exam,...) for university admission is been discussed. The present paper summarizes part of the research done by the author in her PhD. The context for this thesis is the evaluation of large-scale and complex systems of assessment. The main objectives were: to achieve a deep knowledge of the entire university admissions process in Spain, to discover the main sources of uncertainty and topromote empirical research in a continual improvement of the entire process. Focusing in the suitable statistical models and strategies which allow to high-light the imperfections of the system and reduce them, the paper develops, among other approaches, some applications of multilevel modeling.
Resumo:
The ATP-binding cassette (ABC) family of proteins comprise a group of membrane transporters involved in the transport of a wide variety of compounds, such as xenobiotics, vitamins, lipids, amino acids, and carbohydrates. Determining their regional expression patterns along the intestinal tract will further characterize their transport functions in the gut. The mRNA expression levels of murine ABC transporters in the duodenum, jejunum, ileum, and colon were examined using the Affymetrix MuU74v2 GeneChip set. Eight ABC transporters (Abcb2, Abcb3, Abcb9, Abcc3, Abcc6, Abcd1, Abcg5, and Abcg8) displayed significant differential gene expression along the intestinal tract, as determined by two statistical models (a global error assessment model and a classic ANOVA, both with a P < 0.01). Concordance with semiquantitative real-time PCR was high. Analyzing the promoters of the differentially expressed ABC transporters did not identify common transcriptional motifs between family members or with other genes; however, the expression profile for Abcb9 was highly correlated with fibulin-1, and both genes share a common complex promoter model involving the NFkappaB, zinc binding protein factor (ZBPF), GC-box factors SP1/GC (SP1F), and early growth response factor (EGRF) transcription binding motifs. The cellular location of another of the differentially expressed ABC transporters, Abcc3, was examined by immunohistochemistry. Staining revealed that the protein is consistently expressed in the basolateral compartment of enterocytes along the anterior-posterior axis of the intestine. Furthermore, the intensity of the staining pattern is concordant with the expression profile. This agrees with previous findings in which the mRNA, protein, and transport function of Abcc3 were increased in the rat distal intestine. These data reveal regional differences in gene expression profiles along the intestinal tract and demonstrate that a complete understanding of intestinal ABC transporter function can only be achieved by examining the physiologically distinct regions of the gut.
Resumo:
The development of statistical models for forensic fingerprint identification purposes has been the subject of increasing research attention in recent years. This can be partly seen as a response to a number of commentators who claim that the scientific basis for fingerprint identification has not been adequately demonstrated. In addition, key forensic identification bodies such as ENFSI [1] and IAI [2] have recently endorsed and acknowledged the potential benefits of using statistical models as an important tool in support of the fingerprint identification process within the ACE-V framework. In this paper, we introduce a new Likelihood Ratio (LR) model based on Support Vector Machines (SVMs) trained with features discovered via morphometric and spatial analyses of corresponding minutiae configurations for both match and close non-match populations often found in AFIS candidate lists. Computed LR values are derived from a probabilistic framework based on SVMs that discover the intrinsic spatial differences of match and close non-match populations. Lastly, experimentation performed on a set of over 120,000 publicly available fingerprint images (mostly sourced from the National Institute of Standards and Technology (NIST) datasets) and a distortion set of approximately 40,000 images, is presented, illustrating that the proposed LR model is reliably guiding towards the right proposition in the identification assessment of match and close non-match populations. Results further indicate that the proposed model is a promising tool for fingerprint practitioners to use for analysing the spatial consistency of corresponding minutiae configurations.
Resumo:
Drift is an important issue that impairs the reliability of gas sensing systems. Sensor aging, memory effects and environmental disturbances produce shifts in sensor responses that make initial statistical models for gas or odor recognition useless after a relatively short period (typically few weeks). Frequent recalibrations are needed to preserve system accuracy. However, when recalibrations involve numerous samples they become expensive and laborious. An interesting and lower cost alternative is drift counteraction by signal processing techniques. Orthogonal Signal Correction (OSC) is proposed for drift compensation in chemical sensor arrays. The performance of OSC is also compared with Component Correction (CC). A simple classification algorithm has been employed for assessing the performance of the algorithms on a dataset composed by measurements of three analytes using an array of seventeen conductive polymer gas sensors over a ten month period.
Resumo:
The relationships between nutrient contents and indices of the Diagnosis and Recommendation Integrated System (DRIS) are a useful basis to determine appropriate ranges for the interpretation of leaf nutrient contents. The purpose of this study was to establish Beaufils ranges from statistical models of the relationship between foliar concentrations and DRIS indices, generated by two systems of DRIS norms - the F value and natural logarithm transformation - and assess the nutritional status of cotton plants, based on these Beaufils ranges. Yield data from plots (average acreage 100 ha) and foliar concentrations of macro and micronutrients of cotton (Gossypium hirsutum r. latifolium) plants, in the growing season 2004/2005, were stored in a database. The criterion to define the reference population consisted of plots with above-average yields + 0.5 standard deviation (over 4,575 kg ha-1 seed cotton yield). The best-fitting statistical model of the relationship between foliar nutrient concentrations and DRIS indices was linear, with R² > 0.8090, p < 0.01, except for N, with R² = 0.5987, p < 0.01. The two criteria were effective to diagnose the plant nutritional status. The diagnoses were not random, but based on the effectiveness of the chi-square-tested method. The agreement between the methods to assess the nutritional status was 92.59-100 %, except for S, with 74.07 % agreement.
Resumo:
Many of the most interesting questions ecologists ask lead to analyses of spatial data. Yet, perhaps confused by the large number of statistical models and fitting methods available, many ecologists seem to believe this is best left to specialists. Here, we describe the issues that need consideration when analysing spatial data and illustrate these using simulation studies. Our comparative analysis involves using methods including generalized least squares, spatial filters, wavelet revised models, conditional autoregressive models and generalized additive mixed models to estimate regression coefficients from synthetic but realistic data sets, including some which violate standard regression assumptions. We assess the performance of each method using two measures and using statistical error rates for model selection. Methods that performed well included generalized least squares family of models and a Bayesian implementation of the conditional auto-regressive model. Ordinary least squares also performed adequately in the absence of model selection, but had poorly controlled Type I error rates and so did not show the improvements in performance under model selection when using the above methods. Removing large-scale spatial trends in the response led to poor performance. These are empirical results; hence extrapolation of these findings to other situations should be performed cautiously. Nevertheless, our simulation-based approach provides much stronger evidence for comparative analysis than assessments based on single or small numbers of data sets, and should be considered a necessary foundation for statements of this type in future.
Resumo:
Remote sensing using airborne imaging spectroscopy (AIS) is known to retrieve fundamental optical properties of ecosystems. However, the value of these properties for predicting plant species distribution remains unclear. Here, we assess whether such data can add value to topographic variables for predicting plant distributions in French and Swiss alpine grasslands. We fitted statistical models with high spectral and spatial resolution reflectance data and tested four optical indices sensitive to leaf chlorophyll content, leaf water content and leaf area index. We found moderate added-value of AIS data for predicting alpine plant species distribution. Contrary to expectations, differences between species distribution models (SDMs) were not linked to their local abundance or phylogenetic/functional similarity. Moreover, spectral signatures of species were found to be partly site-specific. We discuss current limits of AIS-based SDMs, highlighting issues of scale and informational content of AIS data.
Resumo:
This paper presents multiple kernel learning (MKL) regression as an exploratory spatial data analysis and modelling tool. The MKL approach is introduced as an extension of support vector regression, where MKL uses dedicated kernels to divide a given task into sub-problems and to treat them separately in an effective way. It provides better interpretability to non-linear robust kernel regression at the cost of a more complex numerical optimization. In particular, we investigate the use of MKL as a tool that allows us to avoid using ad-hoc topographic indices as covariables in statistical models in complex terrains. Instead, MKL learns these relationships from the data in a non-parametric fashion. A study on data simulated from real terrain features confirms the ability of MKL to enhance the interpretability of data-driven models and to aid feature selection without degrading predictive performances. Here we examine the stability of the MKL algorithm with respect to the number of training data samples and to the presence of noise. The results of a real case study are also presented, where MKL is able to exploit a large set of terrain features computed at multiple spatial scales, when predicting mean wind speed in an Alpine region.
Resumo:
No consensus exists on whether acyclovir prophylaxis should be given for varicella-zoster virus (VZV) prophylaxis after hematopoietic cell transplantation because of the concern of "rebound" VZV disease after discontinuation of prophylaxis. To determine whether rebound VZV disease is an important clinical problem and whether prolonging prophylaxis beyond 1 year is beneficial, we examined 3 sequential cohorts receiving acyclovir from day of transplantation until engraftment for prevention of herpes simplex virus reactivation (n = 932); acyclovir or valacyclovir 1 year (n = 1117); or acyclovir/valacyclovir for at least 1 year or longer if patients remained on immunosuppressive drugs (n = 586). In multivariable statistical models, prophylaxis given for 1 year significantly reduced VZV disease (P < .001) without evidence of rebound VZV disease. Continuation of prophylaxis beyond 1 year in allogeneic recipients who remained on immunosuppressive drugs led to a further reduction in VZV disease (P = .01) but VZV disease developed in 6.1% during the second year while receiving this strategy. In conclusion, acyclovir/valacyclovir prophylaxis given for 1 year led to a persistent benefit after drug discontinuation and no evidence of a rebound effect. To effectively prevent VZV disease in long-term hematopoietic cell transplantation survivors, additional approaches such as vaccination will probably be required.
Resumo:
This report documents an extensive field program carried out to identify the relationships between soil engineering properties, as measured by various in situ devices, and the results of machine compaction monitoring using prototype compaction monitoring technology developed by Caterpillar Inc. Primary research tasks for this study include the following: (1) experimental testing and statistical analyses to evaluate machine power in terms of the engineering properties of the compacted soil (e.g., density, strength, stiffness) and (2) recommendations for using the compaction monitoring technology in practice. The compaction monitoring technology includes sensors that monitor the power consumption used to move the compaction machine, an on-board computer and display screen, and a GPS system to map the spatial location of the machine. In situ soil density, strength, and stiffness data characterized the soil at various stages of compaction. For each test strip or test area, in situ soil properties were compared directly to machine power values to establish statistical relationships. Statistical models were developed to predict soil density, strength, and stiffness from the machine power values. Field data for multiple test strips were evaluated. The R2 correlation coefficient was generally used to assess the quality of the regressions. Strong correlations were observed between averaged machine power and field measurement data. The relationships are based on the compaction model derived from laboratory data. Correlation coefficients (R2) were consistently higher for thicker lifts than for thin lifts, indicating that the depth influencing machine power response exceeds the representative lift thickness encountered under field conditions. Caterpillar Inc. compaction monitoring technology also identified localized areas of an earthwork project with weak or poorly compacted soil. The soil properties at these locations were verified using in situ test devices. This report also documents the steps required to implement the compaction monitoring technology evaluated.
Resumo:
BACKGROUND: Pseudogenes have long been considered as nonfunctional genomic sequences. However, recent evidence suggests that many of them might have some form of biological activity, and the possibility of functionality has increased interest in their accurate annotation and integration with functional genomics data. RESULTS: As part of the GENCODE annotation of the human genome, we present the first genome-wide pseudogene assignment for protein-coding genes, based on both large-scale manual annotation and in silico pipelines. A key aspect of this coupled approach is that it allows us to identify pseudogenes in an unbiased fashion as well as untangle complex events through manual evaluation. We integrate the pseudogene annotations with the extensive ENCODE functional genomics information. In particular, we determine the expression level, transcription-factor and RNA polymerase II binding, and chromatin marks associated with each pseudogene. Based on their distribution, we develop simple statistical models for each type of activity, which we validate with large-scale RT-PCR-Seq experiments. Finally, we compare our pseudogenes with conservation and variation data from primate alignments and the 1000 Genomes project, producing lists of pseudogenes potentially under selection. CONCLUSIONS: At one extreme, some pseudogenes possess conventional characteristics of functionality; these may represent genes that have recently died. On the other hand, we find interesting patterns of partial activity, which may suggest that dead genes are being resurrected as functioning non-coding RNAs. The activity data of each pseudogene are stored in an associated resource, psiDR, which will be useful for the initial identification of potentially functional pseudogenes.
Resumo:
Intraclass correlation (ICC) is an established tool to assess inter-rater reliability. In a seminal paper published in 1979, Shrout and Fleiss considered three statistical models for inter-rater reliability data with a balanced design. In their first two models, an infinite population of raters was considered, whereas in their third model, the raters in the sample were considered to be the whole population of raters. In the present paper, we show that the two distinct estimates of ICC developed for the first two models can both be applied to the third model and we discuss their different interpretations in this context.
Resumo:
Background: Bone health is a concern when treating early stage breast cancer patients with adjuvant aromatase inhibitors. Early detection of patients (pts) at risk of osteoporosis and fractures may be helpful for starting preventive therapies and selecting the most appropriate endocrine therapy schedule. We present statistical models describing the evolution of lumbar and hip bone mineral density (BMD) in pts treated with tamoxifen (T), letrozole (L) and sequences of T and L. Methods: Available dual-energy x-ray absorptiometry exams (DXA) of pts treated in trial BIG 1-98 were retrospectively collected from Swiss centers. Treatment arms: A) T for 5 years, B) L for 5 years, C) 2 years of T followed by 3 years of L and, D) 2 years of L followed by 3 years of T. Pts without DXA were used as a control for detecting selection biases. Patients randomized to arm A were subsequently allowed an unplanned switch from T to L. Allowing for variations between DXA machines and centres, two repeated measures models, using a covariance structure that allow for different times between DXA, were used to estimate changes in hip and lumbar BMD (g/cm2) from trial randomization. Prospectively defined covariates, considered as fixed effects in the multivariable models in an intention to treat analysis, at the time of trial randomization were: age, height, weight, hysterectomy, race, known osteoporosis, tobacco use, prior bone fracture, prior hormone replacement therapy (HRT), bisphosphonate use and previous neo-/adjuvant chemotherapy (ChT). Similarly, the T-scores for lumbar and hip BMD measurements were modeled using a per-protocol approach (allowing for treatment switch in arm A), specifically studying the effect of each therapy upon T-score percentage. Results: A total of 247 out of 546 pts had between 1 and 5 DXA; a total of 576 DXA were collected. Number of DXA measurements per arm were; arm A 133, B 137, C 141 and D 135. The median follow-up time was 5.8 years. Significant factors positively correlated with lumbar and hip BMD in the multivariate analysis were weight, previous HRT use, neo-/adjuvant ChT, hysterectomy and height. Significant negatively correlated factors in the models were osteoporosis, treatment arm (B/C/D vs. A), time since endocrine therapy start, age and smoking (current vs. never).Modeling the T-score percentage, differences from T to L were -4.199% (p = 0.036) and -4.907% (p = 0.025) for the hip and lumbar measurements respectively, before any treatment switch occurred. Conclusions: Our statistical models describe the lumbar and hip BMD evolution for pts treated with L and/or T. The results of both localisations confirm that, contrary to expectation, the sequential schedules do not seem less detrimental for the BMD than L monotherapy. The estimated difference in BMD T-score percent is at least 4% from T to L.