923 resultados para model selection in binary regression


Relevância:

100.00% 100.00%

Publicador:

Resumo:

Atmosphere only and ocean only variational data assimilation (DA) schemes are able to use window lengths that are optimal for the error growth rate, non-linearity and observation density of the respective systems. Typical window lengths are 6-12 hours for the atmosphere and 2-10 days for the ocean. However, in the implementation of coupled DA schemes it has been necessary to match the window length of the ocean to that of the atmosphere, which may potentially sacrifice the accuracy of the ocean analysis in order to provide a more balanced coupled state. This paper investigates how extending the window length in the presence of model error affects both the analysis of the coupled state and the initialized forecast when using coupled DA with differing degrees of coupling. Results are illustrated using an idealized single column model of the coupled atmosphere-ocean system. It is found that the analysis error from an uncoupled DA scheme can be smaller than that from a coupled analysis at the initial time, due to faster error growth in the coupled system. However, this does not necessarily lead to a more accurate forecast due to imbalances in the coupled state. Instead coupled DA is more able to update the initial state to reduce the impact of the model error on the accuracy of the forecast. The effect of model error is potentially most detrimental in the weakly coupled formulation due to the inconsistency between the coupled model used in the outer loop and uncoupled models used in the inner loop.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The phylogenetic placement of Kuhlmanniodendron Fiaschi & Groppo (Achariaceae) within Malpighiales was investigated with rbcL sequence data. This genus was recently created to accommodate Carpotroche apterocarpa Kuhlm., a poorly known species from the rainforests of Espirito Santo, Brazil. One rbcL sequence was obtained from Kuhlmanniodendron and analyzed with 73 additional sequences from Malpighiales, and 8 from two closer orders, Oxalidales and Celastrales, all of which were available at Genbank. Phylogenetic analyses were carried out with maximum parsimony and Bayesian inference; bootstrap analyses were used in maximum parsimony to evaluate branch support. The results confirmed the placement of Kuhlmanniodendron together with Camptostylus, Lindackeria, Xylotheca, and Caloncoba in a strongly supported clade (posterior probability = 0.99) that corresponds with the tribe Lindackerieae of Achariaceae (Malpighiales). Kuhlmanniodendron also does not appear to be closely related to Oncoba (Salicaceae), an African genus with similar floral and fruit morphology that has been traditionally placed among cyanogenic Flacourtiaceae (now Achariaceae). A picrosodic paper test was performed in herbarium dry leaves, and the presence of cyanogenic glycosides, a class of compounds usually found in Achariaceae, was detected. Pollen morphology and wood anatomy of Kuhlmanniodendron were also investigated, but both pollen (3-colporate and microreticulate) and wood, with solitary to multiple vessels, scalariform perforation plates and other features, do not seem to be useful to distinguish this genus from other members of the Achariaceae and are rather common among the eudicotyledons as a whole. However, perforated ray cells with scalariform plates, an uncommon wood character, present in Kuhlmanniodendron are similar to those found in Kiggelaria africana (Pangieae, Achariaceae), but the occurrence of such cells is not mapped among the angiosperms, and it is not clear how homoplastic this character could be.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

It is known that large fragment sizes and high connectivity levels are key components for maintaining species in fragments; however, their relative effects are poorly understood, especially in tropical areas. In order to test these effects, we built models for explaining understory birds occurrence in a fragmented Atlantic Rain Forest landscape with intermediate habitat cover (3%). Data from over 9000 mist-net hours from 17 fragments differing in size (2-175 ha) and connectivity (considering corridor linkages and distance to nearby fragments) were ranked under a model selection approach. A total 1293 individuals of 62 species were recorded. Species richness, abundance and compositional variation were mainly affected by connectivity indices that consider the capacity of species to use corridors and/or to cross short distances up to 30 m through the matrix. Bird functional groups were differently affected by area and connectivity: while terrestrial insectivores, omnivores and frugivores were affected by both area and connectivity, the other groups (understory insectivores, nectarivores, and others) were affected only by connectivity. In the studied landscape, well connected fragments can sustain an elevated number of species and individuals. Connectivity gives the opportunity for individuals to use multiple fragments, reducing the influence of fragment size. While preserving large fragments is a conservation target worldwide and should continue to be, our results indicated that connectivity between fragments can enhance the area functionally connected and is beneficial to all functional groups and therefore should be a conservation priority. (C) 2008 Elsevier Ltd. All rights reserved.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Time-lagged responses of biological variables to landscape modifications are widely recognized, but rarely considered in ecological studies. In order to test for the existence of time-lags in the response of trees, small mammals, birds and frogs to changes in fragment area and connectivity, we studied a fragmented and highly dynamic landscape in the Atlantic forest region. We also investigated the biological correlates associated with differential responses among taxonomic groups. Species richness and abundance for four taxonomic groups were measured in 21 secondary forest fragments during the same period (2000-2002), following a standardized protocol. Data analyses were based on power regressions and model selection procedures. The model inputs included present (2000) and past (1962, 1981) fragment areas and connectivity, as well as observed changes in these parameters. Although past landscape structure was particularly relevant for trees, all taxonomic groups (except small mammals) were affected by landscape dynamics, exhibiting a time-lagged response. Furthermore, fragment area was more important for species groups with lower dispersal capacity, while species with higher dispersal ability had stronger responses to connectivity measures. Although these secondary forest fragments still maintain a large fraction of their original biodiversity, the delay in biological response combined with high rates of deforestation and fast forest regeneration imply in a reduction in the average age of the forest. This also indicates that future species losses are likely, especially those that are more strictly-forest dwellers. Conservation actions should be implemented to reduce species extinction, to maintain old-growth forests and to favour the regeneration process. Our results demonstrate that landscape history can strongly affect the present distribution pattern of species in fragmented landscapes, and should be considered in conservation planning. (C) 2009 Elsevier Ltd. All rights reserved.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Clustering is a difficult task: there is no single cluster definition and the data can have more than one underlying structure. Pareto-based multi-objective genetic algorithms (e.g., MOCK Multi-Objective Clustering with automatic K-determination and MOCLE-Multi-Objective Clustering Ensemble) were proposed to tackle these problems. However, the output of such algorithms can often contains a high number of partitions, becoming difficult for an expert to manually analyze all of them. In order to deal with this problem, we present two selection strategies, which are based on the corrected Rand, to choose a subset of solutions. To test them, they are applied to the set of solutions produced by MOCK and MOCLE in the context of several datasets. The study was also extended to select a reduced set of partitions from the initial population of MOCLE. These analysis show that both versions of selection strategy proposed are very effective. They can significantly reduce the number of solutions and, at the same time, keep the quality and the diversity of the partitions in the original set of solutions. (C) 2010 Elsevier B.V. All rights reserved.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

In this paper, we compare the performance of two statistical approaches for the analysis of data obtained from the social research area. In the first approach, we use normal models with joint regression modelling for the mean and for the variance heterogeneity. In the second approach, we use hierarchical models. In the first case, individual and social variables are included in the regression modelling for the mean and for the variance, as explanatory variables, while in the second case, the variance at level 1 of the hierarchical model depends on the individuals (age of the individuals), and in the level 2 of the hierarchical model, the variance is assumed to change according to socioeconomic stratum. Applying these methodologies, we analyze a Colombian tallness data set to find differences that can be explained by socioeconomic conditions. We also present some theoretical and empirical results concerning the two models. From this comparative study, we conclude that it is better to jointly modelling the mean and variance heterogeneity in all cases. We also observe that the convergence of the Gibbs sampling chain used in the Markov Chain Monte Carlo method for the jointly modeling the mean and variance heterogeneity is quickly achieved.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

In this paper we deal with a Bayesian analysis for right-censored survival data suitable for populations with a cure rate. We consider a cure rate model based on the negative binomial distribution, encompassing as a special case the promotion time cure model. Bayesian analysis is based on Markov chain Monte Carlo (MCMC) methods. We also present some discussion on model selection and an illustration with a real dataset.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Skew-normal distribution is a class of distributions that includes the normal distributions as a special case. In this paper, we explore the use of Markov Chain Monte Carlo (MCMC) methods to develop a Bayesian analysis in a multivariate, null intercept, measurement error model [R. Aoki, H. Bolfarine, J.A. Achcar, and D. Leao Pinto Jr, Bayesian analysis of a multivariate null intercept error-in -variables regression model, J. Biopharm. Stat. 13(4) (2003b), pp. 763-771] where the unobserved value of the covariate (latent variable) follows a skew-normal distribution. The results and methods are applied to a real dental clinical trial presented in [A. Hadgu and G. Koch, Application of generalized estimating equations to a dental randomized clinical trial, J. Biopharm. Stat. 9 (1999), pp. 161-178].

Relevância:

100.00% 100.00%

Publicador:

Resumo:

P>In the context of either Bayesian or classical sensitivity analyses of over-parametrized models for incomplete categorical data, it is well known that prior-dependence on posterior inferences of nonidentifiable parameters or that too parsimonious over-parametrized models may lead to erroneous conclusions. Nevertheless, some authors either pay no attention to which parameters are nonidentifiable or do not appropriately account for possible prior-dependence. We review the literature on this topic and consider simple examples to emphasize that in both inferential frameworks, the subjective components can influence results in nontrivial ways, irrespectively of the sample size. Specifically, we show that prior distributions commonly regarded as slightly informative or noninformative may actually be too informative for nonidentifiable parameters, and that the choice of over-parametrized models may drastically impact the results, suggesting that a careful examination of their effects should be considered before drawing conclusions.Resume Que ce soit dans un cadre Bayesien ou classique, il est bien connu que la surparametrisation, dans les modeles pour donnees categorielles incompletes, peut conduire a des conclusions erronees. Cependant, certains auteurs persistent a negliger les problemes lies a la presence de parametres non identifies. Nous passons en revue la litterature dans ce domaine, et considerons quelques exemples surparametres simples dans lesquels les elements subjectifs influencent de facon non negligeable les resultats, independamment de la taille des echantillons. Plus precisement, nous montrons comment des a priori consideres comme peu ou non-informatifs peuvent se reveler extremement informatifs en ce qui concerne les parametres non identifies, et que le recours a des modeles surparametres peut avoir sur les conclusions finales un impact considerable. Ceci suggere un examen tres attentif de l`impact potentiel des a priori.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

In this article, we introduce a semi-parametric Bayesian approach based on Dirichlet process priors for the discrete calibration problem in binomial regression models. An interesting topic is the dosimetry problem related to the dose-response model. A hierarchical formulation is provided so that a Markov chain Monte Carlo approach is developed. The methodology is applied to simulated and real data.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The thermo-solvatochrornic behaviors of 2,6-diphenyl-4-(2,4,6-triphenylpyridinium-1-yl) phenolate, RB; 2,6-dichloro-4-(2,4,6-triphenyloyridinium-1-yl) phenolate, WB; 2,6-dibromo-4-[(E)-2-(1-methylpyridinium-4-yl)ethenyl] phenolate, MePMBr(2); 2,6-dibromo-4-[(E)-2-(1-n-octylpyridinium-4-yl)ethenyl] phenolate, OcPMBr(2), have been investigated in binary mixtures of the ionic liquid, IL, 1-(1-butyl)-3-methylimidazolium tetrafluorborate, [BuMeIm][BF(4)], and water (W), in the temperature range from 10 to 60 degrees C. Plots of the empirical solvent polarities, ET (probe) in kcal mol(-1), versus the mole fraction of water in the binary mixture, chi(w) showed nonlinear, i.e., nonideal behavior. Solvation by these IL-W mixtures shows the following similarities to that by aqueous aliphatic alcohols: The same solvation model can be conveniently employed to treat the data obtained; it is based on the presence in the system-bulk medium and probe solvation shell of IL, W, and the ""complex"" solvent 1:1 IL-W. The origin of the nonideal solvation behavior appears to be the same, preferential solvation of the probe, in particular by the complex solvent. The strength of association of the IL-W complex, and the polarity of the IL are situated between the corresponding values of aqueous methanol and aqueous ethanol. Temperature increase causes a gradual desolvation of all probes employed. A difference between solvation by IL-W and aqueous alcohols is that probe-solvent hydrophobic interactions appear to play a minor role in case of the former mixture, probably because solvation is dominated by hydrogen-bonding and Coulombic interactions between the ions of the IL and the zwitterionic probes.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The Raman band assigned to the nu(C=O)mode in N,N-dimethylformamide (at ca. 1660 cm(-1)) was used as a probe to study a group of ionic liquids 1-alkyl-3-methylimidazolium bromide ([C(n)Mlm]Br) with different alkyl groups (n = 2, 4, 6, 8 and 10 carbons) in binary equimolar binary mixtures with dimethylformamide. Due to the high electric dipole moment of the group C=O, there is a substantial coupling between adjacent molecules in the solution, and the corresponding Raman band involves both vibrational and reorientational modes. Different chain lengths of the ILs lead to different extents of the uncoupling of adjacent molecules of dimethylformamide, resulting in different shifts for this band in the mixtures. Information about the organization of ionic liquids in solution was obtained and a model of aggregation for these systems is proposed. (C) 2010 Elsevier B.V. All rights reserved.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

BACKGROUND: Canalization is defined as the stability of a genotype against minor variations in both environment and genetics. Genetic variation in degree of canalization causes heterogeneity of within-family variance. The aims of this study are twofold: (1) quantify genetic heterogeneity of (within-family) residual variance in Atlantic salmon and (2) test whether the observed heterogeneity of (within-family) residual variance can be explained by simple scaling effects. RESULTS: Analysis of body weight in Atlantic salmon using a double hierarchical generalized linear model (DHGLM) revealed substantial heterogeneity of within-family variance. The 95% prediction interval for within-family variance ranged from ~0.4 to 1.2 kg2, implying that the within-family variance of the most extreme high families is expected to be approximately three times larger than the extreme low families. For cross-sectional data, DHGLM with an animal mean sub-model resulted in severe bias, while a corresponding sire-dam model was appropriate. Heterogeneity of variance was not sensitive to Box-Cox transformations of phenotypes, which implies that heterogeneity of variance exists beyond what would be expected from simple scaling effects. CONCLUSIONS: Substantial heterogeneity of within-family variance was found for body weight in Atlantic salmon. A tendency towards higher variance with higher means (scaling effects) was observed, but heterogeneity of within-family variance existed beyond what could be explained by simple scaling effects. For cross-sectional data, using the animal mean sub-model in the DHGLM resulted in biased estimates of variance components, which differed substantially both from a standard linear mean animal model and a sire-dam DHGLM model. Although genetic differences in canalization were observed, selection for increased canalization is difficult, because there is limited individual information for the variance sub-model, especially when based on cross-sectional data. Furthermore, potential macro-environmental changes (diet, climatic region, etc.) may make genetic heterogeneity of variance a less stable trait over time and space.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Sweden, together with Norway, Finland and Denmark, have created a multi-national electricity market called NordPool. In this market, producers and retailers of electricity can buy and sell electricity, and the retailers then offers this electricity to end consumers such as households and industries. Previous studies have shown that pricing at the NordPool market is functioning quite well, but no other study has to my knowledge studied if pricing in the retail market to consumers in Sweden is well functioning. If the market is well functioning, with competition and low transaction costs when changing electricity retailer, we would expect that a homogeneous good such as electricity would be sold at the approximately same price, and that price changes would be highly correlated, in this market. Thus, the aim of this study is to test whether the price of Vattenfall, the largest energy firm in the Swedish market, is highly correlated to the price of other firms in the Swedish retail market for electricity. Descriptive statistics indicate that the price offered by Vattenfall is quite similar to the price of other firms in the market. In addition, regression analysis show that the correlation between the price of Vattenfall and other firms is as high as 0.98.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The predominant knowledge-based approach to automated model construction, compositional modelling, employs a set of models of particular functional components. Its inference mechanism takes a scenario describing the constituent interacting components of a system and translates it into a useful mathematical model. This paper presents a novel compositional modelling approach aimed at building model repositories. It furthers the field in two respects. Firstly, it expands the application domain of compositional modelling to systems that can not be easily described in terms of interacting functional components, such as ecological systems. Secondly, it enables the incorporation of user preferences into the model selection process. These features are achieved by casting the compositional modelling problem as an activity-based dynamic preference constraint satisfaction problem, where the dynamic constraints describe the restrictions imposed over the composition of partial models and the preferences correspond to those of the user of the automated modeller. In addition, the preference levels are represented through the use of symbolic values that differ in orders of magnitude.