15 resultados para Statistical Language Model

em Scielo Saúde Pública - SP


Relevância:

80.00% 80.00%

Publicador:

Resumo:

The pipeline for macro- and microarray analyses (PMmA) is a set of scripts with a web interface developed to analyze DNA array data generated by array image quantification software. PMmA is designed for use with single- or double-color array data and to work as a pipeline in five classes (data format, normalization, data analysis, clustering, and array maps). It can also be used as a plugin in the BioArray Software Environment, an open-source database for array analysis, or used in a local version of the web service. All scripts in PMmA were developed in the PERL programming language and statistical analysis functions were implemented in the R statistical language. Consequently, our package is a platform-independent software. Our algorithms can correctly select almost 90% of the differentially expressed genes, showing a superior performance compared to other methods of analysis. The pipeline software has been applied to 1536 expressed sequence tags macroarray public data of sugarcane exposed to cold for 3 to 48 h. PMmA identified thirty cold-responsive genes previously unidentified in this public dataset. Fourteen genes were up-regulated, two had a variable expression and the other fourteen were down-regulated in the treatments. These new findings certainly were a consequence of using a superior statistical analysis approach, since the original study did not take into account the dependence of data variability on the average signal intensity of each gene. The web interface, supplementary information, and the package source code are available, free, to non-commercial users at http://ipe.cbmeg.unicamp.br/pub/PMmA.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

We explore a DNA statistical model to obtain information about the behavior of the thermodynamics quantities. Special attention is given to the thermal denaturation of this macromolecule.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

INTRODUCTION: The evolution of virulence in host-parasite relationships has been the subject of several publications. In the case of HIV virulence, some authors suggest that the evolution of HIV virulence correlates with the rate of acquisition of new sexual partners. In contrast some other authors argue that the level of HIV virulence is independent of the sexual activity of the host population. METHODS: Provide a mathematical model for the study of the potential influence of human sexual behaviour on the evolution of virulence of HIV is provided. RESULTS: The results indicated that, when the probability of acquisition of infection is a function both of the sexual activity and of the virulence level of HIV strains, the evolution of HIV virulence correlates positively with the rate of acquisition of new sexual partners. CONCLUSION: It is concluded that in the case of a host population with a low (high) rate of exchange of sexual partners the evolution of HIV virulence is such that the less (more) virulent strain prevails.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

OBJECTIVE: To assess the receptive vocabulary of children aged between two years and six months and five years and eleven months who were attending childcare centers and kindergarten schools. METHODS: An analytical cross-sectional study was carried out in the municipality of Embu, Southeastern Brazil. The Peabody Picture Vocabulary Test and analysis of factors associated with children's performance were applied. The sample consisted of 201 children of both genders, aged between two and six years. Statistical analysis was performed using multivariate analysis and logistic regression model. The dependent variable analyzed was test performance and the independent variables were child's age, mother's level of education and family socio-demographic characteristics. RESULTS: It was observed that 44.3% of the children had performances in the test that were below what would be expected for their age. The factors associated with the best performances in the test were child's age (OR=2.4; 95% CI: 1.6-3.5) and mother's education level (OR= 3.2; 95% CI: 1.3-7.4). CONCLUSIONS: Mother's education level is important for child's language development. Settings such as childcare and kindergarten schools are protective factors for child development in families of low income and education.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The increasing number of pertussis cases reported on the last twenty years and the existence of new acellular vaccines reinforce the need of research for experimental models to assure the quality of available pertussis vaccines. In this study, allotments of whole-cell and acellular pertussis vaccines were tested through the Intranasal Challenge Model (INM) using conventional NIH mice. The results have been compared to those achieved by the "Gold standard" Intracerebral Challenge Model (ICM). In contrast to ICM, INM results did not show intralaboratorial variations. Statistical analysis by Anova and Ancova tests revealed that the INM presented reproducibility and allowed identification and separation of different products, including three-component and four-component accellular pertussis vaccines. INM revealed differences between pertussis vaccines. INM provides lower distress to the mice allowing the reduction of mice number including the possibility of using conventional mice (less expensive) under non-aseptic environment. Thus, INM may be used as an alternative method of verifying the consistence of allotment production, including acellular pertussis vaccines.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Statistical models allow the representation of data sets and the estimation and/or prediction of the behavior of a given variable through its interaction with the other variables involved in a phenomenon. Among other different statistical models, are the autoregressive state-space models (ARSS) and the linear regression models (LR), which allow the quantification of the relationships among soil-plant-atmosphere system variables. To compare the quality of the ARSS and LR models for the modeling of the relationships between soybean yield and soil physical properties, Akaike's Information Criterion, which provides a coefficient for the selection of the best model, was used in this study. The data sets were sampled in a Rhodic Acrudox soil, along a spatial transect with 84 points spaced 3 m apart. At each sampling point, soybean samples were collected for yield quantification. At the same site, soil penetration resistance was also measured and soil samples were collected to measure soil bulk density in the 0-0.10 m and 0.10-0.20 m layers. Results showed autocorrelation and a cross correlation structure of soybean yield and soil penetration resistance data. Soil bulk density data, however, were only autocorrelated in the 0-0.10 m layer and not cross correlated with soybean yield. The results showed the higher efficiency of the autoregressive space-state models in relation to the equivalent simple and multiple linear regression models using Akaike's Information Criterion. The resulting values were comparatively lower than the values obtained by the regression models, for all combinations of explanatory variables.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Soil organic matter (SOM) plays an important role in carbon (C) cycle and soil quality. Considering the complexity of factors that control SOM cycling and the long time it usually takes to observe changes in SOM stocks, modeling constitutes a very important tool to understand SOM cycling in forest soils. The following hypotheses were tested: (i) soil organic carbon (SOC) stocks would be higher after several rotations of eucalyptus than in low-productivity pastures; (ii) SOC values simulated by the Century model would describe the data better than the mean of observations. So, the aims of the current study were: (i) to evaluate the SOM dynamics using the Century model to simulate the changes of C stocks for two eucalyptus chronosequences in the Rio Doce Valley, Minas Gerais State, Brazil; and (ii) to compare the C stocks simulated by Century with the C stocks measured in soils of different Orders and regions of the Rio Doce Valley growing eucalyptus. In Belo Oriente (BO), short-rotation eucalyptus plantations had been cultivated for 4.0; 13.0, 22.0, 32.0 and 34.0 years, at a lower elevation and in a warmer climate, while in Virginópolis (VG), these time periods were 8.0, 19.0 and 33.0 years, at a higher elevation and in a milder climate. Soil samples were collected from the 0-20 cm layer to estimate C stocks. Results indicate that the C stocks simulated by the Century model decreased after 37 years of poorly managed pastures in areas previously covered by native forest in the regions of BO and VG. The substitution of poorly managed pastures by eucalyptus in the early 1970´s led to an average increase of C of 0.28 and 0.42 t ha-1 year-1 in BO and VG, respectively. The measured C stocks under eucalyptus in distinct soil Orders and independent regions with variable edapho-climate conditions were not far from the values estimated by the Century model (root mean square error - RMSE = 20.9; model efficiency - EF = 0.29) despite the opposite result obtained with the statistical procedure to test the identity of analytical methods. Only for lower soil C stocks, the model over-estimated the C stock in the 0-20 cm layer. Thus, the Century model is highly promising to detect changes in C stocks in distinct soil orders under eucalyptus, as well as to indicate the impact of harvest residue management on SOM in future rotations.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Soil properties have an enormous impact on economic and environmental aspects of agricultural production. Quantitative relationships between soil properties and the factors that influence their variability are the basis of digital soil mapping. The predictive models of soil properties evaluated in this work are statistical (multiple linear regression-MLR) and geostatistical (ordinary kriging and co-kriging). The study was conducted in the municipality of Bom Jardim, RJ, using a soil database with 208 sampling points. Predictive models were evaluated for sand, silt and clay fractions, pH in water and organic carbon at six depths according to the specifications of the consortium of digital soil mapping at the global level (GlobalSoilMap). Continuous covariates and categorical predictors were used and their contributions to the model assessed. Only the environmental covariates elevation, aspect, stream power index (SPI), soil wetness index (SWI), normalized difference vegetation index (NDVI), and b3/b2 band ratio were significantly correlated with soil properties. The predictive models had a mean coefficient of determination of 0.21. Best results were obtained with the geostatistical predictive models, where the highest coefficient of determination 0.43 was associated with sand properties between 60 to 100 cm deep. The use of a sparse data set of soil properties for digital mapping can explain only part of the spatial variation of these properties. The results may be related to the sampling density and the quantity and quality of the environmental covariates and predictive models used.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The objective of this study was to evaluate the efficiency of spatial statistical analysis in the selection of genotypes in a plant breeding program and, particularly, to demonstrate the benefits of the approach when experimental observations are not spatially independent. The basic material of this study was a yield trial of soybean lines, with five check varieties (of fixed effect) and 110 test lines (of random effects), in an augmented block design. The spatial analysis used a random field linear model (RFML), with a covariance function estimated from the residuals of the analysis considering independent errors. Results showed a residual autocorrelation of significant magnitude and extension (range), which allowed a better discrimination among genotypes (increase of the power of statistical tests, reduction in the standard errors of estimates and predictors, and a greater amplitude of predictor values) when the spatial analysis was applied. Furthermore, the spatial analysis led to a different ranking of the genetic materials, in comparison with the non-spatial analysis, and a selection less influenced by local variation effects was obtained.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Data was analyzed on development of the solanaceen fruit crop Cape gooseberry to evaluate how well a classical thermal time model could describe node appearance in different environments. The data used in the analysis were obtained from experiments conducted in Colombia in open fields and greenhouse condition at two locations with different climate. An empirical, non linear segmented model was used to estimate the base temperature and to parameterize the model for simulation of node appearance vs. time. The base temperature (Tb) used to calculate the thermal time (TT, ºCd) for node appearance was estimated to be 6.29 ºC. The slope of the first linear segment was 0.023 nodes per TT and 0.008 for the second linear segment. The time at which the slope of node apperance changed was 1039.5 ºCd after transplanting, determined from a statistical analysis of model for the first segment. When these coefficients were used to predict node appearance at all locations, the model successfully fit the observed data (RSME=2.1), especially for the first segment where node appearance was more homogeneous than the second segment. More nodes were produced by plants grown under greenhouse conditions and minimum and maximum rates of node appearance rates were also higher.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The Annonaceae includes cultivated species of economic interest and represents an important source of information for better understanding the evolution of tropical rainforests. In phylogenetic analyses of DNA sequence data that are used to address evolutionary questions, it is imperative to use appropriate statistical models. Annonaceae are cases in point: Two sister clades, the subfamilies Annonoideae and Malmeoideae, contain the majority of Annonaceae species diversity. The Annonoideae generally show a greater degree of sequence divergence compared to the Malmeoideae, resulting in stark differences in branch lengths in phylogenetic trees. Uncertainty in how to interpret and analyse these differences has led to inconsistent results when estimating the ages of clades in Annonaceae using molecular dating techniques. We ask whether these differences may be attributed to inappropriate modelling assumptions in the phylogenetic analyses. Specifically, we test for (clade-specific) differences in rates of non-synonymous and synonymous substitutions. A high ratio of nonsynonymous to synonymous substitutions may lead to similarity of DNA sequences due to convergence instead of common ancestry, and as a result confound phylogenetic analyses. We use a dataset of three chloroplast genes (rbcL, matK, ndhF) for 129 species representative of the family. We find that differences in branch lengths between major clades are not attributable to different rates of non-synonymous and synonymous substitutions. The differences in evolutionary rate between the major clades of Annonaceae pose a challenge for current molecular dating techniques that should be seen as a warning for the interpretation of such results in other organisms.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

A study about the spatial variability of data of soil resistance to penetration (RSP) was conducted at layers 0.0-0.1 m, 0.1-0.2 m and 0.2-0.3 m depth, using the statistical methods in univariate forms, i.e., using traditional geostatistics, forming thematic maps by ordinary kriging for each layer of the study. It was analyzed the RSP in layer 0.2-0.3 m depth through a spatial linear model (SLM), which considered the layers 0.0-0.1 m and 0.1-0.2 m in depth as covariable, obtaining an estimation model and a thematic map by universal kriging. The thematic maps of the RSP at layer 0.2-0.3 m depth, constructed by both methods, were compared using measures of accuracy obtained from the construction of the matrix of errors and confusion matrix. There are similarities between the thematic maps. All maps showed that the RSP is higher in the north region.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Reliable predictions of remaining lives of civil or mechanical structures subjected to fatigue damage are very difficult to be made. In general, fatigue damage is extremely sensitive to the random variations of material mechanical properties, environment and loading. These variations may induce large dispersions when the structural fatigue life has to be predicted. Wirsching (1970) mentions dispersions of the order of 30 to 70 % of the mean calculated life. The presented paper introduces a model to estimate the fatigue damage dispersion based on known statistical distributions of the fatigue parameters (material properties and loading). The model is developed by expanding into Taylor series the set of equations that describe fatigue damage for crack initiation.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

An interesting fact about language cognition is that stimulation involving incongruence in the merge operation between verb and complement has often been related to a negative event-related potential (ERP) of augmented amplitude and latency of ca. 400 ms - the N400. Using an automatic ERP latency and amplitude estimator to facilitate the recognition of waves with a low signal-to-noise ratio, the objective of the present study was to study the N400 statistically in 24 volunteers. Stimulation consisted of 80 experimental sentences (40 congruous and 40 incongruous), generated in Brazilian Portuguese, involving two distinct local verb-argument combinations (nominal object and pronominal object series). For each volunteer, the EEG was simultaneously acquired at 20 derivations, topographically localized according to the 10-20 International System. A computerized routine for automatic N400-peak marking (based on the ascendant zero-cross of the first waveform derivative) was applied to the estimated individual ERP waveform for congruous and incongruous sentences in both series for all ERP topographic derivations. Peak-to-peak N400 amplitude was significantly augmented (P < 0.05; one-sided Wilcoxon signed-rank test) due to incongruence in derivations F3, T3, C3, Cz, T5, P3, Pz, and P4 for nominal object series and in P3, Pz and P4 for pronominal object series. The results also indicated high inter-individual variability in ERP waveforms, suggesting that the usual procedure of grand averaging might not be considered a generally adequate approach. Hence, signal processing statistical techniques should be applied in neurolinguistic ERP studies allowing waveform analysis with low signal-to-noise ratio.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Mass transfer kinetics in osmotic dehydration is usually modeled by Fick's law, empirical models and probabilistic models. The aim of this study was to determine the applicability of Peleg model to investigate the mass transfer during osmotic dehydration of mackerel (Scomber japonicus) slices at different temperatures. Osmotic dehydration was performed on mackerel slices by cooking-infusion in solutions with glycerol and salt (a w = 0.64) at different temperatures: 50, 70, and 90 ºC. Peleg rate constant (K1) (h(g/gdm)-1) varied with temperature variation from 0.761 to 0.396 for water loss, from 5.260 to 2.947 for salt gain, and from 0.854 to 0.566 for glycerol intake. In all cases, it followed the Arrhenius relationship (R²>0.86). The Ea (kJ / mol) values obtained were 16.14; 14.21, and 10.12 for water, salt, and glycerol, respectively. The statistical parameters that qualify the goodness of fit (R²>0.91 and RMSE<0.086) indicate promising applicability of Peleg model.