940 resultados para maximum likelihood analysis
Resumo:
In this work, kriging with covariates is used to model and map the spatial distribution of salinity measurements gathered by an autonomous underwater vehicle in a sea outfall monitoring campaign aiming to distinguish the effluent plume from the receiving waters and characterize its spatial variability in the vicinity of the discharge. Four different geostatistical linear models for salinity were assumed, where the distance to diffuser, the west-east positioning, and the south-north positioning were used as covariates. Sample variograms were fitted by the Mat`ern models using weighted least squares and maximum likelihood estimation methods as a way to detect eventual discrepancies. Typically, the maximum likelihood method estimated very low ranges which have limited the kriging process. So, at least for these data sets, weighted least squares showed to be the most appropriate estimation method for variogram fitting. The kriged maps show clearly the spatial variation of salinity, and it is possible to identify the effluent plume in the area studied. The results obtained show some guidelines for sewage monitoring if a geostatistical analysis of the data is in mind. It is important to treat properly the existence of anomalous values and to adopt a sampling strategy that includes transects parallel and perpendicular to the effluent dispersion.
Resumo:
Dissertation submitted in partial fulfillment of the requirements for the Degree of Master of Science in Geospatial Technologies.
Resumo:
Dissertation submitted in partial fulfillment of the requirements for the Degree of Master of Science in Geospatial Technologies.
Resumo:
This Letter presents a search at the LHC for s-channel single top-quark production in proton-proton collisions at a centre-of-mass energy of 8 TeV. The analyzed data set was recorded by the ATLAS detector and corresponds to an integrated luminosity of 20.3 fb−1. Selected events contain one charged lepton, large missing transverse momentum and exactly two b-tagged jets. A multivariate event classifier based on boosted decision trees is developed to discriminate s-channel single top-quark events from the main background contributions. The signal extraction is based on a binned maximum-likelihood fit of the output classifier distribution. The analysis leads to an upper limit on the s-channel single top-quark production cross-section of 14.6 pb at the 95% confidence level. The fit gives a cross-section of σs=5.0±4.3 pb, consistent with the Standard Model expectation.
Resumo:
Extreme value models are widely used in different areas. The Birnbaum–Saunders distribution is receiving considerable attention due to its physical arguments and its good properties. We propose a methodology based on extreme value Birnbaum–Saunders regression models, which includes model formulation, estimation, inference and checking. We further conduct a simulation study for evaluating its performance. A statistical analysis with real-world extreme value environmental data using the methodology is provided as illustration.
Resumo:
Pressures on the Brazilian Amazon forest have been accentuated by agricultural activities practiced by families encouraged to settle in this region in the 1970s by the colonization program of the government. The aims of this study were to analyze the temporal and spatial evolution of land cover and land use (LCLU) in the lower Tapajós region, in the state of Pará. We contrast 11 watersheds that are generally representative of the colonization dynamics in the region. For this purpose, Landsat satellite images from three different years, 1986, 2001, and 2009, were analyzed with Geographic Information Systems. Individual images were subject to an unsupervised classification using the Maximum Likelihood Classification algorithm available on GRASS. The classes retained for the representation of LCLU in this study were: (1) slightly altered old-growth forest, (2) succession forest, (3) crop land and pasture, and (4) bare soil. The analysis and observation of general trends in eleven watersheds shows that LCLU is changing very rapidly. The average deforestation of old-growth forest in all the watersheds was estimated at more than 30% for the period of 1986 to 2009. The local-scale analysis of watersheds reveals the complexity of LCLU, notably in relation to large changes in the temporal and spatial evolution of watersheds. Proximity to the sprawling city of Itaituba is related to the highest rate of deforestation in two watersheds. The opening of roads such as the Transamazonian highway is associated to the second highest rate of deforestation in three watersheds.
Resumo:
Here we focus on factor analysis from a best practices point of view, by investigating the factor structure of neuropsychological tests and using the results obtained to illustrate on choosing a reasonable solution. The sample (n=1051 individuals) was randomly divided into two groups: one for exploratory factor analysis (EFA) and principal component analysis (PCA), to investigate the number of factors underlying the neurocognitive variables; the second to test the "best fit" model via confirmatory factor analysis (CFA). For the exploratory step, three extraction (maximum likelihood, principal axis factoring and principal components) and two rotation (orthogonal and oblique) methods were used. The analysis methodology allowed exploring how different cognitive/psychological tests correlated/discriminated between dimensions, indicating that to capture latent structures in similar sample sizes and measures, with approximately normal data distribution, reflective models with oblimin rotation might prove the most adequate.
Resumo:
C4 photosynthesis is an adaptation derived from the more common C3 photosynthetic pathway that confers a higher productivity under warm temperature and low atmospheric CO2 concentration [1, 2]. C4 evolution has been seen as a consequence of past atmospheric CO2 decline, such as the abrupt CO2 fall 32-25 million years ago (Mya) [3-6]. This relationship has never been tested rigorously, mainly because of a lack of accurate estimates of divergence times for the different C4 lineages [3]. In this study, we inferred a large phylogenetic tree for the grass family and estimated, through Bayesian molecular dating, the ages of the 17 to 18 independent grass C4 lineages. The first transition from C3 to C4 photosynthesis occurred in the Chloridoideae subfamily, 32.0-25.0 Mya. The link between CO2 decrease and transition to C4 photosynthesis was tested by a novel maximum likelihood approach. We showed that the model incorporating the atmospheric CO2 levels was significantly better than the null model, supporting the importance of CO2 decline on C4 photosynthesis evolvability. This finding is relevant for understanding the origin of C4 photosynthesis in grasses, which is one of the most successful ecological and evolutionary innovations in plant history.
Resumo:
Spatial heterogeneity, spatial dependence and spatial scale constitute key features of spatial analysis of housing markets. However, the common practice of modelling spatial dependence as being generated by spatial interactions through a known spatial weights matrix is often not satisfactory. While existing estimators of spatial weights matrices are based on repeat sales or panel data, this paper takes this approach to a cross-section setting. Specifically, based on an a priori definition of housing submarkets and the assumption of a multifactor model, we develop maximum likelihood methodology to estimate hedonic models that facilitate understanding of both spatial heterogeneity and spatial interactions. The methodology, based on statistical orthogonal factor analysis, is applied to the urban housing market of Aveiro, Portugal at two different spatial scales.
Resumo:
This paper is inspired by articles in the last decade or so that have argued for more attention to theory, and to empirical analysis, within the well-known, and long-lasting, contingency framework for explaining the organisational form of the firm. Its contribution is to extend contingency analysis in three ways: (a) by empirically testing it, using explicit econometric modelling (rather than case study evidence) involving estimation by ordered probit analysis; (b) by extending its scope from large firms to SMEs; (c) by extending its applications from Western economic contexts, to an emerging economy context, using field work evidence from China. It calibrates organizational form in a new way, as an ordinal dependent variable, and also utilises new measures of familiar contingency factors from the literature (i.e. Environment, Strategy, Size and Technology) as the independent variables. An ordered probit model of contingency was constructed, and estimated by maximum likelihood, using a cross section of 83 private Chinese firms. The probit was found to be a good fit to the data, and displayed significant coefficients with plausible interpretations for key variables under all the four categories of contingency analysis, namely Environment, Strategy, Size and Technology. Thus we have generalised the contingency model, in terms of specification, interpretation and applications area.
Resumo:
It has been argued that by truncating the sample space of the negative binomial and of the inverse Gaussian-Poisson mixture models at zero, one is allowed to extend the parameter space of the model. Here that is proved to be the case for the more general three parameter Tweedie-Poisson mixture model. It is also proved that the distributions in the extended part of the parameter space are not the zero truncation of mixed poisson distributions and that, other than for the negative binomial, they are not mixtures of zero truncated Poisson distributions either. By extending the parameter space one can improve the fit when the frequency of one is larger and the right tail is heavier than is allowed by the unextended model. Considering the extended model also allows one to use the basic maximum likelihood based inference tools when parameter estimates fall in the extended part of the parameter space, and hence when the m.l.e. does not exist under the unextended model. This extended truncated Tweedie-Poisson model is proved to be useful in the analysis of words and species frequency count data.
Resumo:
BACKGROUND: We estimated the heritability of three measures of glomerular filtration rate (GFR) in hypertensive families of African descent in the Seychelles (Indian Ocean). METHODS: Families with at least two hypertensive siblings and an average of two normotensive siblings were identified through a national hypertension register. Using the ASSOC program in SAGE (Statistical Analysis in Genetic Epidemiology), the age- and gender-adjusted narrow sense heritability of GFR was estimated by maximum likelihood assuming multivariate normality after power transformation. ASSOC can calculate the additive polygenic component of the variance of a trait from pedigree data in the presence of other familial correlations. The effects of body mass index (BMI), blood pressure, natriuresis, along with sodium to potassium ratio in urine and diabetes, were also tested as covariates. RESULTS: Inulin clearance, 24-hour creatinine clearance, and GFR based on the Cockcroft-Gault formula were available for 348 persons from 66 pedigrees. The age- and gender-adjusted correlations (+/- SE) were 0.51 (+/- 0.04) between inulin clearance and creatinine clearance, 0.53 (+/- 0.04) between inulin clearance and Cockcroft-Gault formula and 0.66 (+/- 0.03) between creatinine clearance and Cockcroft-Gault formula. The age- and gender-adjusted heritabilities (+/- SE) of GFR were 0.41 (+/- 0.10) for inulin clearance, 0.52 (+/- 0.13) for creatinine clearance, and 0.82 (+/- 0.09) for Cockcroft-Gault formula. Adjustment for BMI slightly lowered the correlations and heritabilities for all measurements whereas adjustment for blood pressure had virtually no effect. CONCLUSION: The significant heritability estimates of GFR in our sample of families of African descent confirm the familial aggregation of this trait and justify further analyses aimed at discovering genetic determinants of GFR.
Resumo:
There is considerable variation in the level of fecal egg excretion during Schistosoma mansoni infections. Within a single endemic area, the distribution of egg counts is typically overdispersed, with the majority of eggs excreted coming from a minority of residents. The purpose of this study was to quantify the influence of genetic factors on patterns of fecal egg excretion in a rural study sample in Brazil. Individual fecal egg excretions, expressed in eggs per gram of feces, were determined by the Kato-Katz method on stool samples collected on three different days. Detailed genealogic information was gathered at the time of sampling, which allowed assignment of 461 individuals to 14 pedigrees containing between 3 and 422 individuals. Using a maximum likelihood variance decomposition approach, we performed quantitative genetic analyses to determine if genetic factors could partially account for the observed pattern of fecal egg excretion. The quantitative genetic analysis indicated that between 21-37% of the variation in S. mansoni egg counts was attributable to additive genetic factors and that shared environment, as assessed by common household, accounted for a further 12-21% of the observed variation. A maximum likelihood heritability (h²) estimate of 0.44 ± 0.14 (mean ± SE) was found for the 9,604 second- and higher-degree pairwise relationships in the study sample, which is consistent with the upper limit (37%) of the genetic factor determined in the variance decomposition analysis. These analyses point to the significant influence of additive host genes on the pattern of S. mansoni fecal egg excretion in this endemic area.
Resumo:
Bipolar disorder has a genetic component, but the mode of inheritance remains unclear. A previous genome scan conducted in 70 European families led to detect eight regions linked to bipolar disease. Here, we present an investigation of whether the phenotypic heterogeneity of the disorder corresponds to genetic heterogeneity in these regions using additional markers and an extended sample of families. The MLS statistic was used for linkage analyses. The predivided sample test and the maximum likelihood binomial methods were used to test genetic homogeneity between early-onset bipolar type I (cut-off of 22 years) and other types of the disorder (later onset of bipolar type I and early-onset bipolar type II), using a total of 138 independent bipolar-affected sib-pairs. Analysis of the extended sample of families supports linkage in four regions (2q14, 3p14, 16p23, and 20p12) of the eight regions of linkage suggested by our previous genome scan. Heterogeneity testing revealed genetic heterogeneity between early and late-onset bipolar type I in the 2q14 region (P = 0.0001). Only the early form of the bipolar disorder but not the late form appeared to be linked to this region. This region may therefore include a genetic factor either specifically involved in the early-onset bipolar type I or only influencing the age at onset (AAO). Our findings illustrate that stratification according to AAO may be valuable for the identification of genetic vulnerability polymorphisms.
Resumo:
Plasmodium falciparum is the parasite responsible for the most acute form of malaria in humans. Recently, the serine repeat antigen (SERA) in P. falciparum has attracted attention as a potential vaccine and drug target, and it has been shown to be a member of a large gene family. To clarify the relationships among the numerous P. falciparum SERAs and to identify orthologs to SERA5 and SERA6 in Plasmodium species affecting rodents, gene trees were inferred from nucleotide and amino acid sequence data for 33 putative SERA homologs in seven different species. (A distance method for nucleotide sequences that is specifically designed to accommodate differing GC content yielded results that were largely compatible with the amino acid tree. Standard-distance and maximum-likelihood methods for nucleotide sequences, on the other hand, yielded gene trees that differed in important respects.) To infer the pattern of duplication, speciation, and gene loss events in the SERA gene family history, the resulting gene trees were then "reconciled" with two competing Plasmodium species tree topologies that have been identified by previous phylogenetic studies. Parsimony of reconciliation was used as a criterion for selecting a gene tree/species tree pair and provided (1) support for one of the two species trees and for the core topology of the amino acid-derived gene tree, (2) a basis for critiquing fine detail in a poorly resolved region of the gene tree, (3) a set of predicted "missing genes" in some species, (4) clarification of the relationship among the P. falciparum SERA, and (5) some information about SERA5 and SERA6 orthologs in the rodent malaria parasites. Parsimony of reconciliation and a second criterion--implied mutational pattern at two key active sites in the SERA proteins-were also seen to be useful supplements to standard "bootstrap" analysis for inferred topologies.