945 resultados para Bayesian statistic
Resumo:
Marinussaurus curupira, a new genus and species of Gymnophthalmidae lizard is described from Iranduba, state of Amazonas, Brazil. The genus is characterized by an elongate body; short and stout pentadactyl limbs; all digits clawed; single frontonasal; two prefrontals; absence of frontoparietals; interparietal and parietals forming a straight posterior margin, with interparietal shorter than parietals; distinctive ear opening and eyelid; few temporals; three pairs of chin shields; nasal divided; a distinct collar; smooth, mainly hexagonal, dorsal scales; smooth quadrangular ventral scales; two precloacal and three femoral pores on each side in males; pores between three or four scales. Parsimony (PAR) and partitioned Bayesian (BA) phylogenetic analyses with morphological and molecular data recovered the new genus as a member of the Ecpleopodini radiation of the Cercosaurinae. A close relationship of the new genus with Arthrosaura is postulated.
Resumo:
Background: Mites (Acari) have traditionally been treated as monophyletic, albeit composed of two major lineages: Acariformes and Parasitiformes. Yet recent studies based on morphology, molecular data, or combinations thereof, have increasingly drawn their monophyly into question. Furthermore, the usually basal (molecular) position of one or both mite lineages among the chelicerates is in conflict to their morphology, and to the widely accepted view that mites are close relatives of Ricinulei. Results: The phylogenetic position of the acariform mites is examined through employing SSU, partial LSU sequences, and morphology from 91 chelicerate extant terminals (forty Acariformes). In a static homology framework, molecular sequences were aligned using their secondary structure as guide, whereby regions of ambiguous alignment were discarded, and pre-aligned sequences analyzed under parsimony and different mixed models in a Bayesian inference. Parsimony and Bayesian analyses led to trees largely congruent concerning infraordinal, well-supported branches, but with low support for inter-ordinal relationships. An exception is Solifugae + Acariformes (P. P = 100%, J. = 0.91). In a dynamic homology framework, two analyses were run: a standard POY analysis and an analysis constrained by secondary structure. Both analyses led to largely congruent trees; supporting a (Palpigradi (Solifugae Acariformes)) clade and Ricinulei as sister group of Tetrapulmonata with the topology (Ricinulei (Amblypygi (Uropygi Araneae))). Combined analysis with two different morphological data matrices were run in order to evaluate the impact of constraining the analysis on the recovered topology when employing secondary structure as a guide for homology establishment. The constrained combined analysis yielded two topologies similar to the exclusively molecular analysis for both morphological matrices, except for the recovery of Pedipalpi instead of the (Uropygi Araneae) clade. The standard (direct optimization) POY analysis, however, led to the recovery of trees differing in the absence of the otherwise well-supported group Solifugae + Acariformes. Conclusions: Previous studies combining ribosomal sequences and morphology often recovered topologies similar to purely morphological analyses of Chelicerata. The apparent stability of certain clades not recovered here, like Haplocnemata and Acari, is regarded as a byproduct of the way the molecular homology was previously established using the instrumentalist approach implemented in POY. Constraining the analysis by a priori homology assessment is defended here as a way of maintaining the severity of the test when adding new data to the analysis. Although the strength of the method advocated here is keeping phylogenetic information from regions usually discarded in an exclusively static homology framework; it still has the inconvenience of being uninformative on the effect of alignment ambiguity on resampling methods of clade support estimation. Finally, putative morphological apomorphies of Solifugae + Acariformes are the reduction of the proximal cheliceral podomere, medial abutting of the leg coxae, loss of sperm nuclear membrane, and presence of differentiated germinative and secretory regions in the testis delivering their products into a common lumen.
Resumo:
A new genus and species of microteiid lizard is described based on a series of specimens obtained at Parque Nacional do Caparao (20 degrees 28'S, 41 degrees 49'W), southeastern Brazil, along the division line between the States of Minas Gerais and Espirito Santo. The new lizard occurs in isolated high-altitude, open, rocky habitats above the altitudinal lit-nits of the Atlantic forest. It is characterized by the presence of prefrontals, frontoparietals, parietals, interparietal, and occipital scales; ear opening and eyelid distinct; three pairs of genials; absence of collar; lanceolate and mucronate dorsal scales; six regular transverse and longitudinal series of smooth ventrals that are longer than wide, with the lateral ones narrower. Maximum parsimony (MP) and partitioned Bayesian (PBA) phylogenetic analyses based on morphological and molecular characters with all known genera of Gymnophthalminae (except for Scriptosaura) Plus Rhachisaurus recovered this new lizard in a clade having Colobodactylus and Heterodactylus as its closest relatives. Both analyses recovered the monophyly of Gymnophthalminae and Gymnophthalmini. The monophyly of the Heterodactylini received moderate support in MP analyses but was not recovered in PBA. To eliminate classification controversy between these results, the present concept of Heterodactylini is restricted to accommodate the new genus, Colobodactylus and Heterodactylus, and a new tribe Iphisiini is proposed to allocate Alexandresaurus, Iphisa, Colobosaura, Acratosaura, and Stenolepis. Current phylogenetic knowledge of Gymnophthalminae suggests that fossoriality and increase of body elongation arose as adaptive responses to avoid extreme surface temperatures, either cold or hot, depending on circumstances.
Resumo:
Background: Progress towards the development of a malaria vaccine against Plasmodium vivax, the most widely distributed human malaria parasite, will require a better understanding of the immune responses that confer clinical protection to patients in regions where malaria is endemic. Methods: Glutathione S-transferase (GST) and GST-fusion proteins representing the N-terminus of the merozoite surface protein 1 of P. vivax, PvMSP1-N, and the C-terminus, PvMSP1-C, were covalently coupled to BioPlex carboxylated beads. Recombinant proteins and coupled beads were used, respectively, in ELISA and Bioplex assays using immune sera of P. vivax patients from Brazil and PNG to determine IgG and subclass responses. Concordances between the two methods in the seropositivity responses were evaluated using the Kappa statistic and the Spearman's rank correlation. Results: The results using this methodology were compared with the classical microtitre enzyme-linked immnosorbent assay ( ELISA), showing that the assay was sensitive, reproducible and had good concordance with ELISA; yet, further research into different statistical analyses seems desirable before claiming conclusive results exclusively based on multiplex assays. As expected, results demonstrated that PvMSP1 was immunogenic in natural infections of patients from different endemic regions of Brazil and Papua New Guinea ( PNG), and that age correlated only with antibodies against the C-terminus part of the molecule. Furthermore, the IgG subclass profiles were different in these endemic regions having IgG3 predominantly recognizing PvMSP1 in Brazil and IgG1 predominantly recognizing PvMSP1 in PNG. Conclusions: This study validates the use of the multiplex assay to measure naturally-acquired IgG antibodies against the merozoite surface protein 1 of P. vivax.
Resumo:
Background: Plasmodium vivax malaria is a major public health challenge in Latin America, Asia and Oceania, with 130-435 million clinical cases per year worldwide. Invasion of host blood cells by P. vivax mainly depends on a type I membrane protein called Duffy binding protein (PvDBP). The erythrocyte-binding motif of PvDBP is a 170 amino-acid stretch located in its cysteine-rich region II (PvDBP(II)), which is the most variable segment of the protein. Methods: To test whether diversifying natural selection has shaped the nucleotide diversity of PvDBP(II) in Brazilian populations, this region was sequenced in 122 isolates from six different geographic areas. A Bayesian method was applied to test for the action of natural selection under a population genetic model that incorporates recombination. The analysis was integrated with a structural model of PvDBP(II), and T-and B-cell epitopes were localized on the 3-D structure. Results: The results suggest that: (i) recombination plays an important role in determining the haplotype structure of PvDBP(II), and (ii) PvDBP(II) appears to contain neutrally evolving codons as well as codons evolving under natural selection. Diversifying selection preferentially acts on sites identified as epitopes, particularly on amino acid residues 417, 419, and 424, which show strong linkage disequilibrium. Conclusions: This study shows that some polymorphisms of PvDBP(II) are present near the erythrocyte-binding domain and might serve to elude antibodies that inhibit cell invasion. Therefore, these polymorphisms should be taken into account when designing vaccines aimed at eliciting antibodies to inhibit erythrocyte invasion.
Resumo:
Gaussianity and statistical isotropy of the Universe are modern cosmology's minimal set of hypotheses. In this work we introduce a new statistical test to detect observational deviations from this minimal set. By defining the temperature correlation function over the whole celestial sphere, we are able to independently quantify both angular and planar dependence (modulations) of the CMB temperature power spectrum over different slices of this sphere. Given that planar dependence leads to further modulations of the usual angular power spectrum C(l), this test can potentially reveal richer structures in the morphology of the primordial temperature field. We have also constructed an unbiased estimator for this angular-planar power spectrum which naturally generalizes the estimator for the usual C(l)'s. With the help of a chi-square analysis, we have used this estimator to search for observational deviations of statistical isotropy in WMAP's 5 year release data set (ILC5), where we found only slight anomalies on the angular scales l = 7 and l = 8. Since this angular-planar statistic is model-independent, it is ideal to employ in searches of statistical anisotropy (e.g., contaminations from the galactic plane) and to characterize non-Gaussianities.
Resumo:
Online music databases have increased significantly as a consequence of the rapid growth of the Internet and digital audio, requiring the development of faster and more efficient tools for music content analysis. Musical genres are widely used to organize music collections. In this paper, the problem of automatic single and multi-label music genre classification is addressed by exploring rhythm-based features obtained from a respective complex network representation. A Markov model is built in order to analyse the temporal sequence of rhythmic notation events. Feature analysis is performed by using two multi-variate statistical approaches: principal components analysis (unsupervised) and linear discriminant analysis (supervised). Similarly, two classifiers are applied in order to identify the category of rhythms: parametric Bayesian classifier under the Gaussian hypothesis (supervised) and agglomerative hierarchical clustering (unsupervised). Qualitative results obtained by using the kappa coefficient and the obtained clusters corroborated the effectiveness of the proposed method.
Resumo:
This paper presents a description of nuclear magnetic resonance (NMR) of quadrupolar systems using the Holstein-Primakoff (HP) formalism and its analogy with a Bose-Einstein condensate (BEC) system. Two nuclear spin systems constituted of quadrupolar nuclei I=3/2 ((23)Na) and I=7/2 ((133)Cs) in lyotropic liquid crystals were used for experimental demonstrations. Specifically, we derived the conditions necessary for accomplishing the analogy, executed the proper experiments, and compared with quantum mechanical prediction for a Bose system. The NMR description in the HP representation could be applied in the future as a workbench for BEC-like systems, where the statistical properties may be obtained using the intermediate statistic, first established by Gentile. The description can be applied for any quadrupolar systems, including new developed solid-state NMR GaAS nanodevices.
Resumo:
Context tree models have been introduced by Rissanen in [25] as a parsimonious generalization of Markov models. Since then, they have been widely used in applied probability and statistics. The present paper investigates non-asymptotic properties of two popular procedures of context tree estimation: Rissanen's algorithm Context and penalized maximum likelihood. First showing how they are related, we prove finite horizon bounds for the probability of over- and under-estimation. Concerning overestimation, no boundedness or loss-of-memory conditions are required: the proof relies on new deviation inequalities for empirical probabilities of independent interest. The under-estimation properties rely on classical hypotheses for processes of infinite memory. These results improve on and generalize the bounds obtained in Duarte et al. (2006) [12], Galves et al. (2008) [18], Galves and Leonardi (2008) [17], Leonardi (2010) [22], refining asymptotic results of Buhlmann and Wyner (1999) [4] and Csiszar and Talata (2006) [9]. (C) 2011 Elsevier B.V. All rights reserved.
Resumo:
Background: The aim of this study was to estimate the prevalence of fibromyalgia, as well as to assess the major symptoms of this syndrome in an adult, low socioeconomic status population assisted by the primary health care system in a city in Brazil. Methods: We cross-sectionally sampled individuals assisted by the public primary health care system (n = 768, 35-60 years old). Participants were interviewed by phone and screened about pain. They were then invited to be clinically assessed (304 accepted). Pain was estimated using a Visual Analogue Scale (VAS). Fibromyalgia was assessed using the Fibromyalgia Impact Questionnaire (FIQ), as well as screening for tender points using dolorimetry. Statistical analyses included Bayesian Statistics and the Kruskal-Wallis Anova test (significance level = 5%). Results: From the phone-interview screening, we divided participants (n = 768) in three groups: No Pain (NP) (n = 185); Regional Pain (RP) (n = 388) and Widespread Pain (WP) (n = 106). Among those participating in the clinical assessments, (304 subjects), the prevalence of fibromyalgia was 4.4% (95% confidence interval [2.6%; 6.3%]). Symptoms of pain (VAS and FIQ), feeling well, job ability, fatigue, morning tiredness, stiffness, anxiety and depression were statically different among the groups. In multivariate analyses we found that individuals with FM and WP had significantly higher impairment than those with RP and NP. FM and WP were similarly disabling. Similarly, RP was no significantly different than NP. Conclusion: Fibromyalgia is prevalent in the low socioeconomic status population assisted by the public primary health care system. Prevalence was similar to other studies (4.4%) in a more diverse socioeconomic population. Individuals with FM and WP have significant impact in their well being.
Resumo:
Efficient automatic protein classification is of central importance in genomic annotation. As an independent way to check the reliability of the classification, we propose a statistical approach to test if two sets of protein domain sequences coming from two families of the Pfam database are significantly different. We model protein sequences as realizations of Variable Length Markov Chains (VLMC) and we use the context trees as a signature of each protein family. Our approach is based on a Kolmogorov-Smirnov-type goodness-of-fit test proposed by Balding et at. [Limit theorems for sequences of random trees (2008), DOI: 10.1007/s11749-008-0092-z]. The test statistic is a supremum over the space of trees of a function of the two samples; its computation grows, in principle, exponentially fast with the maximal number of nodes of the potential trees. We show how to transform this problem into a max-flow over a related graph which can be solved using a Ford-Fulkerson algorithm in polynomial time on that number. We apply the test to 10 randomly chosen protein domain families from the seed of Pfam-A database (high quality, manually curated families). The test shows that the distributions of context trees coming from different families are significantly different. We emphasize that this is a novel mathematical approach to validate the automatic clustering of sequences in any context. We also study the performance of the test via simulations on Galton-Watson related processes.
Resumo:
Background: There are several studies in the literature depicting measurement error in gene expression data and also, several others about regulatory network models. However, only a little fraction describes a combination of measurement error in mathematical regulatory networks and shows how to identify these networks under different rates of noise. Results: This article investigates the effects of measurement error on the estimation of the parameters in regulatory networks. Simulation studies indicate that, in both time series (dependent) and non-time series (independent) data, the measurement error strongly affects the estimated parameters of the regulatory network models, biasing them as predicted by the theory. Moreover, when testing the parameters of the regulatory network models, p-values computed by ignoring the measurement error are not reliable, since the rate of false positives are not controlled under the null hypothesis. In order to overcome these problems, we present an improved version of the Ordinary Least Square estimator in independent (regression models) and dependent (autoregressive models) data when the variables are subject to noises. Moreover, measurement error estimation procedures for microarrays are also described. Simulation results also show that both corrected methods perform better than the standard ones (i.e., ignoring measurement error). The proposed methodologies are illustrated using microarray data from lung cancer patients and mouse liver time series data. Conclusions: Measurement error dangerously affects the identification of regulatory network models, thus, they must be reduced or taken into account in order to avoid erroneous conclusions. This could be one of the reasons for high biological false positive rates identified in actual regulatory network models.
Resumo:
Morphological and molecular analyses have proven to be complementary tools of taxonomic information for the redescription of the ctenostome bryozoans Amathia brasiliensis Busk, 1886 and Amathia distans Busk, 1886. The two species, originally described from material collected by the `Challenger` expedition but synonymized by later authors, now have their status fixed by means of the selection of lectotypes, morphological observations and analyses of DNA sequences described here. The morphological characters allowing the identification of living and/or preserved specimens are (1) A. brasiliensis: whitish-pale pigment spots in the frontal surface of stolons and zooids, and a wide stolon with biserial zooid clusters growing in clockwise and anti-clockwise spirals along it, the spirality direction being maintained from maternal to daughter stolons; and (2) A. distans: bright yellow pigment spots in stolonal and zooidal surfaces including lophophores, and a slender stolon, thickly cuticularized, with biserial zooid clusters growing in clockwise and anti-clockwise spirals along it and the spirality direction not maintained from maternal to daughter stolons. Pairwise comparisons of DNA sequences of the mitochondrial genes cytochrome c oxidase subunit I and large ribosomal RNA subunit revealed deep genetic divergence between A. brasiliensis and A. distans. Finally, analyses of those sequences within a Bayesian phylogenetic context recovered their genealogical species status.
Resumo:
Stream discharge-concentration relationships are indicators of terrestrial ecosystem function. Throughout the Amazon and Cerrado regions of Brazil rapid changes in land use and land cover may be altering these hydrochemical relationships. The current analysis focuses on factors controlling the discharge-calcium (Ca) concentration relationship since previous research in these regions has demonstrated both positive and negative slopes in linear log(10)discharge-log(10)Ca concentration regressions. The objective of the current study was to evaluate factors controlling stream discharge-Ca concentration relationships including year, season, stream order, vegetation cover, land use, and soil classification. It was hypothesized that land use and soil class are the most critical attributes controlling discharge-Ca concentration relationships. A multilevel, linear regression approach was utilized with data from 28 streams throughout Brazil. These streams come from three distinct regions and varied broadly in watershed size (< 1 to > 10(6) ha) and discharge (10(-5.7)-10(3.2) m(3) s(-1)). Linear regressions of log(10)Ca versus log(10)discharge in 13 streams have a preponderance of negative slopes with only two streams having significant positive slopes. An ANOVA decomposition suggests the effect of discharge on Ca concentration is large but variable. Vegetation cover, which incorporates aspects of land use, explains the largest proportion of the variance in the effect of discharge on Ca followed by season and year. In contrast, stream order, land use, and soil class explain most of the variation in stream Ca concentration. In the current data set, soil class, which is related to lithology, has an important effect on Ca concentration but land use, likely through its effect on runoff concentration and hydrology, has a greater effect on discharge-concentration relationships.
Resumo:
A simultaneous optimization strategy based on a neuro-genetic approach is proposed for selection of laser induced breakdown spectroscopy operational conditions for the simultaneous determination of macronutrients (Ca, Mg and P), micro-nutrients (B, Cu, Fe, Mn and Zn), Al and Si in plant samples. A laser induced breakdown spectroscopy system equipped with a 10 Hz Q-switched Nd:YAG laser (12 ns, 532 nm, 140 mJ) and an Echelle spectrometer with intensified coupled-charge device was used. Integration time gate, delay time, amplification gain and number of pulses were optimized. Pellets of spinach leaves (NIST 1570a) were employed as laboratory samples. In order to find a model that could correlate laser induced breakdown spectroscopy operational conditions with compromised high peak areas of all elements simultaneously, a Bayesian Regularized Artificial Neural Network approach was employed. Subsequently, a genetic algorithm was applied to find optimal conditions for the neural network model, in an approach called neuro-genetic, A single laser induced breakdown spectroscopy working condition that maximizes peak areas of all elements simultaneously, was obtained with the following optimized parameters: 9.0 mu s integration time gate, 1.1 mu s delay time, 225 (a.u.) amplification gain and 30 accumulated laser pulses. The proposed approach is a useful and a suitable tool for the optimization process of such a complex analytical problem. (C) 2009 Elsevier B.V. All rights reserved.