15 resultados para Generalised Additive Model
em Université de Lausanne, Switzerland
Resumo:
An important statistical development of the last 30 years has been the advance in regression analysis provided by generalized linear models (GLMs) and generalized additive models (GAMs). Here we introduce a series of papers prepared within the framework of an international workshop entitled: Advances in GLMs/GAMs modeling: from species distribution to environmental management, held in Riederalp, Switzerland, 6-11 August 2001.We first discuss some general uses of statistical models in ecology, as well as provide a short review of several key examples of the use of GLMs and GAMs in ecological modeling efforts. We next present an overview of GLMs and GAMs, and discuss some of their related statistics used for predictor selection, model diagnostics, and evaluation. Included is a discussion of several new approaches applicable to GLMs and GAMs, such as ridge regression, an alternative to stepwise selection of predictors, and methods for the identification of interactions by a combined use of regression trees and several other approaches. We close with an overview of the papers and how we feel they advance our understanding of their application to ecological modeling.
Resumo:
Prediction of species' distributions is central to diverse applications in ecology, evolution and conservation science. There is increasing electronic access to vast sets of occurrence records in museums and herbaria, yet little effective guidance on how best to use this information in the context of numerous approaches for modelling distributions. To meet this need, we compared 16 modelling methods over 226 species from 6 regions of the world, creating the most comprehensive set of model comparisons to date. We used presence-only data to fit models, and independent presence-absence data to evaluate the predictions. Along with well-established modelling methods such as generalised additive models and GARP and BIOCLIM, we explored methods that either have been developed recently or have rarely been applied to modelling species' distributions. These include machine-learning methods and community models, both of which have features that may make them particularly well suited to noisy or sparse information, as is typical of species' occurrence data. Presence-only data were effective for modelling species' distributions for many species and regions. The novel methods consistently outperformed more established methods. The results of our analysis are promising for the use of data from museums and herbaria, especially as methods suited to the noise inherent in such data improve.
Resumo:
BACKGROUND: The FTO gene harbors the strongest known susceptibility locus for obesity. While many individual studies have suggested that physical activity (PA) may attenuate the effect of FTO on obesity risk, other studies have not been able to confirm this interaction. To confirm or refute unambiguously whether PA attenuates the association of FTO with obesity risk, we meta-analyzed data from 45 studies of adults (n = 218,166) and nine studies of children and adolescents (n = 19,268). METHODS AND FINDINGS: All studies identified to have data on the FTO rs9939609 variant (or any proxy [r(2)>0.8]) and PA were invited to participate, regardless of ethnicity or age of the participants. PA was standardized by categorizing it into a dichotomous variable (physically inactive versus active) in each study. Overall, 25% of adults and 13% of children were categorized as inactive. Interaction analyses were performed within each study by including the FTO×PA interaction term in an additive model, adjusting for age and sex. Subsequently, random effects meta-analysis was used to pool the interaction terms. In adults, the minor (A-) allele of rs9939609 increased the odds of obesity by 1.23-fold/allele (95% CI 1.20-1.26), but PA attenuated this effect (p(interaction) = 0.001). More specifically, the minor allele of rs9939609 increased the odds of obesity less in the physically active group (odds ratio = 1.22/allele, 95% CI 1.19-1.25) than in the inactive group (odds ratio = 1.30/allele, 95% CI 1.24-1.36). No such interaction was found in children and adolescents. CONCLUSIONS: The association of the FTO risk allele with the odds of obesity is attenuated by 27% in physically active adults, highlighting the importance of PA in particular in those genetically predisposed to obesity.
Resumo:
Background Early age at first delivery has been identified as a risk factor for high-risk HPV-type infection and cervical cancer development. Methods A cross-sectional study was carried out in a large public maternity hospital in Sao Paulo, Brazil. During June 2006 to February 2007, 301 women aged 15-24 years who gave birth to their first child were recruited between 43 and 60 days after delivery. Detection of HPV DNA in cervical specimens was performed using a standardised PCR protocol with PGMY09/11 primers. The association of selected factors with HPV infection was assessed by using a Generalised Linear Model. Results HPV DNA was detected in 58.5% (95% CI 52.7% to 64.0%) of the enrolled young women. The most common types of HPV found were: HPV16, HPV51, HPV52, HPV58 and HPV71. The overall prevalence of HPV types targeted by the HPV prophylactic vaccines was: HPV 16-12.0%, HPV 18-2.3% and HPV 6 and 11 4.3%. In the multivariate analysis, only age (inversely, p for trend=0.02) and smoking habits were independently associated with HPV infection. Conclusions The findings show that these young primiparous women had high cervical HPV prevalence, suggesting that this is a high-risk group for cervical cancer development. Nevertheless, 17.3% were positive for any of the four HPV types included in HPV vaccines (HPV6, 11, 16 or 18), with 13.3% positive for HPV 16 or 18 and only 1.0% having both vaccine related-oncogenic HPV types. Thus, young primiparous women could benefit from catch-up HPV vaccination programmes.
Resumo:
Current models of brain organization include multisensory interactions at early processing stages and within low-level, including primary, cortices. Embracing this model with regard to auditory-visual (AV) interactions in humans remains problematic. Controversy surrounds the application of an additive model to the analysis of event-related potentials (ERPs), and conventional ERP analysis methods have yielded discordant latencies of effects and permitted limited neurophysiologic interpretability. While hemodynamic imaging and transcranial magnetic stimulation studies provide general support for the above model, the precise timing, superadditive/subadditive directionality, topographic stability, and sources remain unresolved. We recorded ERPs in humans to attended, but task-irrelevant stimuli that did not require an overt motor response, thereby circumventing paradigmatic caveats. We applied novel ERP signal analysis methods to provide details concerning the likely bases of AV interactions. First, nonlinear interactions occur at 60-95 ms after stimulus and are the consequence of topographic, rather than pure strength, modulations in the ERP. AV stimuli engage distinct configurations of intracranial generators, rather than simply modulating the amplitude of unisensory responses. Second, source estimations (and statistical analyses thereof) identified primary visual, primary auditory, and posterior superior temporal regions as mediating these effects. Finally, scalar values of current densities in all of these regions exhibited functionally coupled, subadditive nonlinear effects, a pattern increasingly consistent with the mounting evidence in nonhuman primates. In these ways, we demonstrate how neurophysiologic bases of multisensory interactions can be noninvasively identified in humans, allowing for a synthesis across imaging methods on the one hand and species on the other.
Resumo:
Accumulating evidence suggests that polymorphisms in Toll-like receptors (TLRs) influence the pathogenesis of mycobacterial infections, including leprosy, a disease whose manifestations depend on host immune responses. Polymorphisms in TLR2 are associated with an increased risk of reversal reaction, but not susceptibility to leprosy itself. We examined whether polymorphisms in TLR4 are associated with susceptibility to leprosy in a cohort of 441 Ethiopian leprosy patients and 197 healthy controls. We found that two single nucleotide polymorphisms (SNPs) in TLR4 (896G>A [D299G] and 1196C>T [T399I]) were associated with a protective effect against the disease. The 896GG, GA and AA genotypes were found in 91.7, 7.8 and 0.5% of leprosy cases versus 79.9, 19.1 and 1.0% of controls, respectively (odds ratio [OR] = 0.34, 95% confidence interval [CI] 0.20-0.57, P < 0.001, additive model). Similarly, the 1196CC, CT and TT genotypes were found in 98.1, 1.9 and 0% of leprosy cases versus 91.8, 7.7 and 0.5% of controls, respectively (OR = 0.16, 95% CI 0.06--.40, P < 0.001, dominant model). We found that Mycobacterium leprae stimulation of monocytes partially inhibited their subsequent response to lipopolysaccharide (LPS) stimulation. Our data suggest that TLR4 polymorphisms are associated with susceptibility to leprosy and that this effect may be mediated at the cellular level by the modulation of TLR4 signalling by M. leprae.
Resumo:
Uromodulin is expressed exclusively in the thick ascending limb and is the most abundant protein excreted in normal urine. Variants in UMOD, which encodes uromodulin, are associated with renal function, and urinary uromodulin levels may be a biomarker for kidney disease. However, the genetic factors regulating uromodulin excretion are unknown. We conducted a meta-analysis of urinary uromodulin levels to identify associated common genetic variants in the general population. We included 10,884 individuals of European descent from three genetic isolates and three urban cohorts. Each study measured uromodulin indexed to creatinine and conducted linear regression analysis of approximately 2.5 million single nucleotide polymorphisms using an additive model. We also tested whether variants in genes expressed in the thick ascending limb associate with uromodulin levels. rs12917707, located near UMOD and previously associated with renal function and CKD, had the strongest association with urinary uromodulin levels (P<0.001). In all cohorts, carriers of a G allele of this variant had higher uromodulin levels than noncarriers did (geometric means 10.24, 14.05, and 17.67 μg/g creatinine for zero, one, or two copies of the G allele). rs12446492 in the adjacent gene PDILT (protein disulfide isomerase-like, testis expressed) also reached genome-wide significance (P<0.001). Regarding genes expressed in the thick ascending limb, variants in KCNJ1, SORL1, and CAB39 associated with urinary uromodulin levels. These data indicate that common variants in the UMOD promoter region may influence urinary uromodulin levels. They also provide insights into uromodulin biology and the association of UMOD variants with renal function.
Resumo:
The incidence of hepatocellular carcinoma (HCC) is increasing in Western countries. Although several clinical factors have been identified, many individuals never develop HCC, suggesting a genetic susceptibility. However, to date, only a few single-nucleotide polymorphisms have been reproducibly shown to be linked to HCC onset. A variant (rs738409 C>G, encoding for p.I148M) in the PNPLA3 gene is associated with liver damage in chronic liver diseases. Interestingly, several studies have reported that the minor rs738409[G] allele is more represented in HCC cases in chronic hepatitis C (CHC) and alcoholic liver disease (ALD). However, a significant association with HCC related to CHC has not been consistently observed, and the strength of the association between rs738409 and HCC remains unclear. We performed a meta-analysis of individual participant data including 2,503 European patients with cirrhosis to assess the association between rs738409 and HCC, particularly in ALD and CHC. We found that rs738409 was strongly associated with overall HCC (odds ratio [OR] per G allele, additive model=1.77; 95% confidence interval [CI]: 1.42-2.19; P=2.78 × 10(-7) ). This association was more pronounced in ALD (OR=2.20; 95% CI: 1.80-2.67; P=4.71 × 10(-15) ) than in CHC patients (OR=1.55; 95% CI: 1.03-2.34; P=3.52 × 10(-2) ). After adjustment for age, sex, and body mass index, the variant remained strongly associated with HCC. Conclusion: Overall, these results suggest that rs738409 exerts a marked influence on hepatocarcinogenesis in patients with cirrhosis of European descent and provide a strong argument for performing further mechanistic studies to better understand the role of PNPLA3 in HCC development.
Resumo:
La biologie de la conservation est communément associée à la protection de petites populations menacées d?extinction. Pourtant, il peut également être nécessaire de soumettre à gestion des populations surabondantes ou susceptibles d?une trop grande expansion, dans le but de prévenir les effets néfastes de la surpopulation. Du fait des différences tant quantitatives que qualitatives entre protection des petites populations et contrôle des grandes, il est nécessaire de disposer de modèles et de méthodes distinctes. L?objectif de ce travail a été de développer des modèles prédictifs de la dynamique des grandes populations, ainsi que des logiciels permettant de calculer les paramètres de ces modèles et de tester des scénarios de gestion. Le cas du Bouquetin des Alpes (Capra ibex ibex) - en forte expansion en Suisse depuis sa réintroduction au début du XXème siècle - servit d?exemple. Cette tâche fut accomplie en trois étapes : En premier lieu, un modèle de dynamique locale, spécifique au Bouquetin, fut développé : le modèle sous-jacent - structuré en classes d?âge et de sexe - est basé sur une matrice de Leslie à laquelle ont été ajoutées la densité-dépendance, la stochasticité environnementale et la chasse de régulation. Ce modèle fut implémenté dans un logiciel d?aide à la gestion - nommé SIM-Ibex - permettant la maintenance de données de recensements, l?estimation automatisée des paramètres, ainsi que l?ajustement et la simulation de stratégies de régulation. Mais la dynamique d?une population est influencée non seulement par des facteurs démographiques, mais aussi par la dispersion et la colonisation de nouveaux espaces. Il est donc nécessaire de pouvoir modéliser tant la qualité de l?habitat que les obstacles à la dispersion. Une collection de logiciels - nommée Biomapper - fut donc développée. Son module central est basé sur l?Analyse Factorielle de la Niche Ecologique (ENFA) dont le principe est de calculer des facteurs de marginalité et de spécialisation de la niche écologique à partir de prédicteurs environnementaux et de données d?observation de l?espèce. Tous les modules de Biomapper sont liés aux Systèmes d?Information Géographiques (SIG) ; ils couvrent toutes les opérations d?importation des données, préparation des prédicteurs, ENFA et calcul de la carte de qualité d?habitat, validation et traitement des résultats ; un module permet également de cartographier les barrières et les corridors de dispersion. Le domaine d?application de l?ENFA fut exploré par le biais d?une distribution d?espèce virtuelle. La comparaison à une méthode couramment utilisée pour construire des cartes de qualité d?habitat, le Modèle Linéaire Généralisé (GLM), montra qu?elle était particulièrement adaptée pour les espèces cryptiques ou en cours d?expansion. Les informations sur la démographie et le paysage furent finalement fusionnées en un modèle global. Une approche basée sur un automate cellulaire fut choisie, tant pour satisfaire aux contraintes du réalisme de la modélisation du paysage qu?à celles imposées par les grandes populations : la zone d?étude est modélisée par un pavage de cellules hexagonales, chacune caractérisée par des propriétés - une capacité de soutien et six taux d?imperméabilité quantifiant les échanges entre cellules adjacentes - et une variable, la densité de la population. Cette dernière varie en fonction de la reproduction et de la survie locale, ainsi que de la dispersion, sous l?influence de la densité-dépendance et de la stochasticité. Un logiciel - nommé HexaSpace - fut développé pour accomplir deux fonctions : 1° Calibrer l?automate sur la base de modèles de dynamique (par ex. calculés par SIM-Ibex) et d?une carte de qualité d?habitat (par ex. calculée par Biomapper). 2° Faire tourner des simulations. Il permet d?étudier l?expansion d?une espèce envahisseuse dans un paysage complexe composé de zones de qualité diverses et comportant des obstacles à la dispersion. Ce modèle fut appliqué à l?histoire de la réintroduction du Bouquetin dans les Alpes bernoises (Suisse). SIM-Ibex est actuellement utilisé par les gestionnaires de la faune et par les inspecteurs du gouvernement pour préparer et contrôler les plans de tir. Biomapper a été appliqué à plusieurs espèces (tant végétales qu?animales) à travers le Monde. De même, même si HexaSpace fut initialement conçu pour des espèces animales terrestres, il pourrait aisément être étndu à la propagation de plantes ou à la dispersion d?animaux volants. Ces logiciels étant conçus pour, à partir de données brutes, construire un modèle réaliste complexe, et du fait qu?ils sont dotés d?une interface d?utilisation intuitive, ils sont susceptibles de nombreuses applications en biologie de la conservation. En outre, ces approches peuvent également s?appliquer à des questions théoriques dans les domaines de l?écologie des populations et du paysage.<br/><br/>Conservation biology is commonly associated to small and endangered population protection. Nevertheless, large or potentially large populations may also need human management to prevent negative effects of overpopulation. As there are both qualitative and quantitative differences between small population protection and large population controlling, distinct methods and models are needed. The aim of this work was to develop theoretical models to predict large population dynamics, as well as computer tools to assess the parameters of these models and to test management scenarios. The alpine Ibex (Capra ibex ibex) - which experienced a spectacular increase since its reintroduction in Switzerland at the beginning of the 20th century - was used as paradigm species. This task was achieved in three steps: A local population dynamics model was first developed specifically for Ibex: the underlying age- and sex-structured model is based on a Leslie matrix approach with addition of density-dependence, environmental stochasticity and culling. This model was implemented into a management-support software - named SIM-Ibex - allowing census data maintenance, parameter automated assessment and culling strategies tuning and simulating. However population dynamics is driven not only by demographic factors, but also by dispersal and colonisation of new areas. Habitat suitability and obstacles modelling had therefore to be addressed. Thus, a software package - named Biomapper - was developed. Its central module is based on the Ecological Niche Factor Analysis (ENFA) whose principle is to compute niche marginality and specialisation factors from a set of environmental predictors and species presence data. All Biomapper modules are linked to Geographic Information Systems (GIS); they cover all operations of data importation, predictor preparation, ENFA and habitat suitability map computation, results validation and further processing; a module also allows mapping of dispersal barriers and corridors. ENFA application domain was then explored by means of a simulated species distribution. It was compared to a common habitat suitability assessing method, the Generalised Linear Model (GLM), and was proven better suited for spreading or cryptic species. Demography and landscape informations were finally merged into a global model. To cope with landscape realism and technical constraints of large population modelling, a cellular automaton approach was chosen: the study area is modelled by a lattice of hexagonal cells, each one characterised by a few fixed properties - a carrying capacity and six impermeability rates quantifying exchanges between adjacent cells - and one variable, population density. The later varies according to local reproduction/survival and dispersal dynamics, modified by density-dependence and stochasticity. A software - named HexaSpace - was developed, which achieves two functions: 1° Calibrating the automaton on the base of local population dynamics models (e.g., computed by SIM-Ibex) and a habitat suitability map (e.g. computed by Biomapper). 2° Running simulations. It allows studying the spreading of an invading species across a complex landscape made of variously suitable areas and dispersal barriers. This model was applied to the history of Ibex reintroduction in Bernese Alps (Switzerland). SIM-Ibex is now used by governmental wildlife managers to prepare and verify culling plans. Biomapper has been applied to several species (both plants and animals) all around the World. In the same way, whilst HexaSpace was originally designed for terrestrial animal species, it could be easily extended to model plant propagation or flying animals dispersal. As these softwares were designed to proceed from low-level data to build a complex realistic model and as they benefit from an intuitive user-interface, they may have many conservation applications. Moreover, theoretical questions in the fields of population and landscape ecology might also be addressed by these approaches.
Resumo:
BACKGROUND: Several distributions of country-specific blood pressure (BP) percentiles by sex, age, and height for children and adolescents have been established worldwide. However, there are no globally unified BP references for defining elevated BP in children and adolescents, which limits international comparisons of the prevalence of pediatric elevated BP. We aimed to establish international BP references for children and adolescents by using 7 nationally representative data sets (China, India, Iran, Korea, Poland, Tunisia, and the United States). METHODS AND RESULTS: Data on BP for 52 636 nonoverweight children and adolescents aged 6 to 19 years were obtained from 7 large nationally representative cross-sectional surveys in China, India, Iran, Korea, Poland, Tunisia, and the United States. BP values were obtained with certified mercury sphygmomanometers in all 7 countries by using standard procedures for BP measurement. Smoothed BP percentiles (50th, 90th, 95th, and 99th) by age and height were estimated by using the Generalized Additive Model for Location Scale and Shape model. BP values were similar between males and females until the age of 13 years and were higher in males than females thereafter. In comparison with the BP levels of the 90th and 95th percentiles of the US Fourth Report at median height, systolic BP of the corresponding percentiles of these international references was lower, whereas diastolic BP was similar. CONCLUSIONS: These international BP references will be a useful tool for international comparison of the prevalence of elevated BP in children and adolescents and may help to identify hypertensive youths in diverse populations.
Resumo:
We investigated the role of the number of loci coding for a neutral trait on the release of additive variance for this trait after population bottlenecks. Different bottleneck sizes and durations were tested for various matrices of genotypic values, with initial conditions covering the allele frequency space. We used three different types of matrices. First, we extended Cheverud and Routman's model by defining matrices of "pure" epistasis for three and four independent loci; second, we used genotypic values drawn randomly from uniform, normal, and exponential distributions; and third we used two models of simple metabolic pathways leading to physiological epistasis. For all these matrices of genotypic values except the dominant metabolic pathway, we find that, as the number of loci increases from two to three and four, an increase in the release of additive variance is occurring. The amount of additive variance released for a given set of genotypic values is a function of the inbreeding coefficient, independently of the size and duration of the bottleneck. The level of inbreeding necessary to achieve maximum release in additive variance increases with the number of loci. We find that additive-by-additive epistasis is the type of epistasis most easily converted into additive variance. For a wide range of models, our results show that epistasis, rather than dominance, plays a significant role in the increase of additive variance following bottlenecks.
Resumo:
Gene-on-gene regulations are key components of every living organism. Dynamical abstract models of genetic regulatory networks help explain the genome's evolvability and robustness. These properties can be attributed to the structural topology of the graph formed by genes, as vertices, and regulatory interactions, as edges. Moreover, the actual gene interaction of each gene is believed to play a key role in the stability of the structure. With advances in biology, some effort was deployed to develop update functions in Boolean models that include recent knowledge. We combine real-life gene interaction networks with novel update functions in a Boolean model. We use two sub-networks of biological organisms, the yeast cell-cycle and the mouse embryonic stem cell, as topological support for our system. On these structures, we substitute the original random update functions by a novel threshold-based dynamic function in which the promoting and repressing effect of each interaction is considered. We use a third real-life regulatory network, along with its inferred Boolean update functions to validate the proposed update function. Results of this validation hint to increased biological plausibility of the threshold-based function. To investigate the dynamical behavior of this new model, we visualized the phase transition between order and chaos into the critical regime using Derrida plots. We complement the qualitative nature of Derrida plots with an alternative measure, the criticality distance, that also allows to discriminate between regimes in a quantitative way. Simulation on both real-life genetic regulatory networks show that there exists a set of parameters that allows the systems to operate in the critical region. This new model includes experimentally derived biological information and recent discoveries, which makes it potentially useful to guide experimental research. The update function confers additional realism to the model, while reducing the complexity and solution space, thus making it easier to investigate.
Resumo:
Anthropomorphic model observers are mathe- matical algorithms which are applied to images with the ultimate goal of predicting human signal detection and classification accuracy across varieties of backgrounds, image acquisitions and display conditions. A limitation of current channelized model observers is their inability to handle irregularly-shaped signals, which are common in clinical images, without a high number of directional channels. Here, we derive a new linear model observer based on convolution channels which we refer to as the "Filtered Channel observer" (FCO), as an extension of the channelized Hotelling observer (CHO) and the nonprewhitening with an eye filter (NPWE) observer. In analogy to the CHO, this linear model observer can take the form of a single template with an external noise term. To compare with human observers, we tested signals with irregular and asymmetrical shapes spanning the size of lesions down to those of microcalfications in 4-AFC breast tomosynthesis detection tasks, with three different contrasts for each case. Whereas humans uniformly outperformed conventional CHOs, the FCO observer outperformed humans for every signal with only one exception. Additive internal noise in the models allowed us to degrade model performance and match human performance. We could not match all the human performances with a model with a single internal noise component for all signal shape, size and contrast conditions. This suggests that either the internal noise might vary across signals or that the model cannot entirely capture the human detection strategy. However, the FCO model offers an efficient way to apprehend human observer performance for a non-symmetric signal.
Resumo:
Evaluation of image quality (IQ) in Computed Tomography (CT) is important to ensure that diagnostic questions are correctly answered, whilst keeping radiation dose to the patient as low as is reasonably possible. The assessment of individual aspects of IQ is already a key component of routine quality control of medical x-ray devices. These values together with standard dose indicators can be used to give rise to 'figures of merit' (FOM) to characterise the dose efficiency of the CT scanners operating in certain modes. The demand for clinically relevant IQ characterisation has naturally increased with the development of CT technology (detectors efficiency, image reconstruction and processing), resulting in the adaptation and evolution of assessment methods. The purpose of this review is to present the spectrum of various methods that have been used to characterise image quality in CT: from objective measurements of physical parameters to clinically task-based approaches (i.e. model observer (MO) approach) including pure human observer approach. When combined together with a dose indicator, a generalised dose efficiency index can be explored in a framework of system and patient dose optimisation. We will focus on the IQ methodologies that are required for dealing with standard reconstruction, but also for iterative reconstruction algorithms. With this concept the previously used FOM will be presented with a proposal to update them in order to make them relevant and up to date with technological progress. The MO that objectively assesses IQ for clinically relevant tasks represents the most promising method in terms of radiologist sensitivity performance and therefore of most relevance in the clinical environment.
Resumo:
We consider a spectrally-negative Markov additive process as a model of a risk process in a random environment. Following recent interest in alternative ruin concepts, we assume that ruin occurs when an independent Poissonian observer sees the process as negative, where the observation rate may depend on the state of the environment. Using an approximation argument and spectral theory, we establish an explicit formula for the resulting survival probabilities in this general setting. We also discuss an efficient evaluation of the involved quantities and provide a numerical illustration.