62 resultados para Data distribution

em Biblioteca Digital da Produção Intelectual da Universidade de São Paulo


Relevância:

70.00% 70.00%

Publicador:

Resumo:

Background: This paper addresses the prediction of the free energy of binding of a drug candidate with enzyme InhA associated with Mycobacterium tuberculosis. This problem is found within rational drug design, where interactions between drug candidates and target proteins are verified through molecular docking simulations. In this application, it is important not only to correctly predict the free energy of binding, but also to provide a comprehensible model that could be validated by a domain specialist. Decision-tree induction algorithms have been successfully used in drug-design related applications, specially considering that decision trees are simple to understand, interpret, and validate. There are several decision-tree induction algorithms available for general-use, but each one has a bias that makes it more suitable for a particular data distribution. In this article, we propose and investigate the automatic design of decision-tree induction algorithms tailored to particular drug-enzyme binding data sets. We investigate the performance of our new method for evaluating binding conformations of different drug candidates to InhA, and we analyze our findings with respect to decision tree accuracy, comprehensibility, and biological relevance. Results: The empirical analysis indicates that our method is capable of automatically generating decision-tree induction algorithms that significantly outperform the traditional C4.5 algorithm with respect to both accuracy and comprehensibility. In addition, we provide the biological interpretation of the rules generated by our approach, reinforcing the importance of comprehensible predictive models in this particular bioinformatics application. Conclusions: We conclude that automatically designing a decision-tree algorithm tailored to molecular docking data is a promising alternative for the prediction of the free energy from the binding of a drug candidate with a flexible-receptor.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Background: Caesarean section rates in Brazil have been steadily increasing. In 2009, for the first time, the number of children born by this type of procedure was greater than the number of vaginal births. Caesarean section is associated with a series of adverse effects on the women and newborn, and recent evidence suggests that the increasing rates of prematurity and low birth weight in Brazil are associated to the increasing rates of Caesarean section and labour induction. Methods: Nationwide hospital-based cohort study of postnatal women and their offspring with follow-up at 45 to 60 days after birth. The sample was stratified by geographic macro-region, type of the municipality and by type of hospital governance. The number of postnatal women sampled was 23,940, distributed in 191 municipalities throughout Brazil. Two electronic questionnaires were applied to the postnatal women, one baseline face-to-face and one follow-up telephone interview. Two other questionnaires were filled with information on patients' medical records and to assess hospital facilities. The primary outcome was the percentage of Caesarean sections (total, elective and according to Robson's groups). Secondary outcomes were: post-partum pain; breastfeeding initiation; severe/near miss maternal morbidity; reasons for maternal mortality; prematurity; low birth weight; use of oxygen use after birth and mechanical ventilation; admission to neonatal ICU; stillbirths; neonatal mortality; readmission in hospital; use of surfactant; asphyxia; severe/near miss neonatal morbidity. The association between variables were investigated using bivariate, stratified and multivariate model analyses. Statistical tests were applied according to data distribution and homogeneity of variances of groups to be compared. All analyses were taken into consideration for the complex sample design. Discussion: This study, for the first time, depicts a national panorama of labour and birth outcomes in Brazil. Regardless of the socioeconomic level, demand for Caesarean section appears to be based on the belief that the quality of obstetric care is closely associated to the technology used in labour and birth. Within this context, it was justified to conduct a nationwide study to understand the reasons that lead pregnant women to submit to Caesarean sections and to verify any association between this type of birth and it's consequences on postnatal health.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Abstract Background Regardless the regulatory function of microRNAs (miRNA), their differential expression pattern has been used to define miRNA signatures and to disclose disease biomarkers. To address the question of whether patients presenting the different types of diabetes mellitus could be distinguished on the basis of their miRNA and mRNA expression profiling, we obtained peripheral blood mononuclear cell (PBMC) RNAs from 7 type 1 (T1D), 7 type 2 (T2D), and 6 gestational diabetes (GDM) patients, which were hybridized to Agilent miRNA and mRNA microarrays. Data quantification and quality control were obtained using the Feature Extraction software, and data distribution was normalized using quantile function implemented in the Aroma light package. Differentially expressed miRNAs/mRNAs were identified using Rank products, comparing T1DxGDM, T2DxGDM and T1DxT2D. Hierarchical clustering was performed using the average linkage criterion with Pearson uncentered distance as metrics. Results The use of the same microarrays platform permitted the identification of sets of shared or specific miRNAs/mRNA interaction for each type of diabetes. Nine miRNAs (hsa-miR-126, hsa-miR-1307, hsa-miR-142-3p, hsa-miR-142-5p, hsa-miR-144, hsa-miR-199a-5p, hsa-miR-27a, hsa-miR-29b, and hsa-miR-342-3p) were shared among T1D, T2D and GDM, and additional specific miRNAs were identified for T1D (20 miRNAs), T2D (14) and GDM (19) patients. ROC curves allowed the identification of specific and relevant (greater AUC values) miRNAs for each type of diabetes, including: i) hsa-miR-1274a, hsa-miR-1274b and hsa-let-7f for T1D; ii) hsa-miR-222, hsa-miR-30e and hsa-miR-140-3p for T2D, and iii) hsa-miR-181a and hsa-miR-1268 for GDM. Many of these miRNAs targeted mRNAs associated with diabetes pathogenesis. Conclusions These results indicate that PBMC can be used as reporter cells to characterize the miRNA expression profiling disclosed by the different diabetes mellitus manifestations. Shared miRNAs may characterize diabetes as a metabolic and inflammatory disorder, whereas specific miRNAs may represent biological markers for each type of diabetes, deserving further attention.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

The Amazonian lowlands include large patches of open vegetation which contrast sharply with the rainforest, and the origin of these patches has been debated. This study focuses on a large area of open vegetation in northern Brazil, where d13C and, in some instances, C/N analyses of the organic matter preserved in late Quaternary sediments were used to achieve floristic reconstructions over time. The main goal was to determine when the modern open vegetation started to develop in this area. The variability in d13C data derived from nine cores ranges from -32.2 to -19.6 parts per thousand, but with nearly 60% of data above -26.5 parts per thousand. The most enriched values were detected only in ecotone and open vegetated areas. The development of open vegetation communities was asynchronous, varying between estimated ages of 6400 and 3000 cal a BP. This suggests that the origin of the studied patches of open vegetation might be linked to sedimentary dynamics of a late Quaternary megafan system. As sedimentation ended, this vegetation type became established over the megafan surface. In addition, the data presented here show that the presence of C4 plants must be used carefully as a proxy to interpret dry paleoclimatic episodes in Amazonian areas. Copyright (c) 2012 John Wiley & Sons, Ltd.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

The beta-Birnbaum-Saunders (Cordeiro and Lemonte, 2011) and Birnbaum-Saunders (Birnbaum and Saunders, 1969a) distributions have been used quite effectively to model failure times for materials subject to fatigue and lifetime data. We define the log-beta-Birnbaum-Saunders distribution by the logarithm of the beta-Birnbaum-Saunders distribution. Explicit expressions for its generating function and moments are derived. We propose a new log-beta-Birnbaum-Saunders regression model that can be applied to censored data and be used more effectively in survival analysis. We obtain the maximum likelihood estimates of the model parameters for censored data and investigate influence diagnostics. The new location-scale regression model is modified for the possibility that long-term survivors may be presented in the data. Its usefulness is illustrated by means of two real data sets. (C) 2011 Elsevier B.V. All rights reserved.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

This research reports liquid liquid equilibrium data for the system lard (swine fat), cis-9-octadecenoic acid (oleic acid), ethanol, and water at 318.2 K, as well as their correlation with the nonrandom two-liquid (NRTL) and universal quasichemical activity coefficient (UNIQUAC) thermodynamic equations, which have provided global deviations of 0.41 % and 0.53 %, respectively. Additional equilibrium experiments were also performed to obtain cholesterol partition (or distribution) coefficients to verify the availability of the use of ethanol plus water to reduce the cholesterol content in lard. The partition experiments were performed with concentrations of free fatty acids (commercial oleic acid) that varied from (0 to 20) mass % and of water in the solvent that varied from (0 to 18) mass %. The percentage of free fatty acids initially present in lard had a slight effect on the distribution of cholesterol between the phases. Furthermore, the distribution coefficients decreased by adding water in the ethanol; specifically, it resulted in a diminution of the capability of the solvent to remove the cholesterol.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This paper establishes the spawning habitat of the Brazilian sardine Sardinella brasiliensis and investigates the spatial variability of egg density and its relation with oceanographic conditions in the shelf of the south-east Brazil Bight (SBB). The spawning habitats of S. brasiliensis have been defined in terms of spatial models of egg density, temperature-salinity plots, quotient (Q) analysis and remote sensing data. Quotient curves (Q(C)) were constructed using the geographic distribution of egg density, temperature and salinity from samples collected during nine survey cruises between 1976 and 1993. The interannual sea surface temperature (SST) variability was determined using principal component analysis on the SST anomalies (SSTA) estimated from remote sensing data over the period between 1985 and 2007. The spatial pattern of egg occurrences in the SBB indicated that the largest concentration occurred between Paranagua and Sao Sebastiao. Spawning habitat expanded and contracted during the years, fluctuating around Paranagua. In January 1978 and January 1993, eggs were found nearly everywhere along the inner shelf of the SBB, while in January 1988 and 1991 spawning had contracted to their southernmost position. The SSTA maps for the spawning periods showed that in the case of habitat expansion (1993 only) anomalies over the SBB were zero or slightly negative, whereas for the contraction period anomalies were all positive. Sardinella brasiliensis is capable of exploring suitable spawning sites provided by the entrainment of the colder and less-saline South Atlantic Central Water onto the shelf by means of both coastal wind-driven (to the north-east of the SBB) and meander-induced (to the south-west of the SBB) upwelling.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Abundance and composition of marine benthic communities have been relatively well studied in the SE Brazilian coast, but little is known on patterns controlling the distribution of their planktonic larval stages. A survey of larval abundance in the continental margin, using a Multi-Plankton Sampler, was conducted in a cross-shelf transect off Cabo Frio (23 degrees S and 42 degrees W) during a costal upwelling event. Hydrographic conditions were monitored through discrete CDT casts. Chlorophyll-a in the top 100 m of the water column was determined and changes in surface chlorophyll-a was estimated using SeaWiFS images. Based on the larval abundances and the meso-scale hydrodynamics scenario, our results suggest two different processes affecting larval distributions. High larval densities were found nearshore due to the upwelling event associated with high chlorophyll a and strong along shore current. On the continental slope, high larval abundance was associated with a clockwise rotating meander, which may have entrapped larvae from a region located further north (Cabo de Sao Tome, 22 degrees S and 41 degrees W). In mid-shelf areas, our data suggests that vertical migration may likely occur as a response to avoid offshore transport by upwelling plumes and/or cyclonic meanders. The hydrodynamic scenario observed in the study area has two distinct yet extremely important consequences: larval retention on food-rich upwelling areas and the broadening of the tropical domain to southernmost subtropical areas. (C) 2009 Elsevier B.V. All rights reserved.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The aim of this study was to analyze the distribution and abundance of the fish fauna of Palmas bay on Anchieta Island in southeastern Brazil. Specimens were caught in the summer and winter of 1992, using an otter trawl at three locations in the bay. The specimens were caught in both the nighttime and daytime. Data on the water temperature and salinity were recorded for the characterization of the predominant water mass in the region, and sediment samples were taken for granulometric analysis. A total of 7 656 specimens (79 species), with a total weight of approximately 300 kg, were recorded. The most abundant species were Eucinostomus argenteus, Ctenosciaena gracilicirrhus, Haemulon steindachneri, Eucinostomus gula and Diapterus rhombeus, which together accounted for more than 73% of the sample. In general, the ecological indices showed no differences in the composition of species for the abiotic variables analyzed. The multivariate analysis showed that the variations in the distribution of the fish fauna were mainly associated with intra-annual differences in temperature and salinity, resulting from the presence of South Atlantic Central Water (SACW) in the area during the summer. The analysis also showed an association with the type of bottom and a lesser association with respect to the night/day periods.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This investigation attempts to determine which environmental parameters of the bottom water and sediment control recent foraminifera fauna at Ezcurra Inlet (King George Island, Antarctica), using data collected during four summers (2002/03, 2003/04, 2004/05 and 2006/07). The study revealed that Ezcurra Inlet contain typical Antarctic foraminifera fauna with three distinct assemblages and few differences in environmental parameters. The species Bolivina pseudopunctata, Fursenkoina fusiformis, Portatrochammina antarctica, and Adercotryma glomerata were abundant in the samples. An elevated abundance, richness and diversity were common at the entrance of the inlet at depths greater than 55 m, where the inlet was characterized by low temperatures and muddy sand. In the inner part of the inlet (depth 30-55 m), richness and diversity were low and the most significant species were Cassidulinoides parkerianus, C. porrectus, and Psammosphaera fusca. Shallow waters showed low values of richness and abundance and high temperatures coupled with coarser sediment. In areas with high suspended matter concentrations and pH values associated with low salinity the most representative species were Hippocrepinella hirudinea and Hemisphaerammina bradyi.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Foraminiferal data were obtained from 66 samples of box cores on the southeastern Brazilian upper margin (between 23.8A degrees-25.9A degrees S and 42.8A degrees-46.13A degrees W) to evaluate the benthic foraminiferal fauna distribution and its relation to some selected abiotic parameters. We focused on areas with different primary production regimes on the southern Brazilian margin, which is generally considered as an oligotrophic region. The total density (D), richness (R), mean diversity (H) over bar`, average living depth (ALD(X) ) and percentages of specimens of different microhabitats (epifauna, shallow infauna, intermediate infauna and deep infauna) were analyzed. The dominant species identified were Uvigerina spp., Globocassidulina subglobosa, Bulimina marginata, Adercotryma wrighti, Islandiella norcrossi, Rhizammina spp. and Brizalina sp.. We also established a set of mathematical functions for analyzing the vertical foraminiferal distribution patterns, providing a quantitative tool that allows correlating the microfaunal density distributions with abiotic factors. In general, the cores that fit with pure exponential decaying functions were related to the oligotrophic conditions prevalent on the Brazilian margin and to the flow of the Brazilian Current (BC). Different foraminiferal responses were identified in cores located in higher productivity zones, such as the northern and the southern region of the study area, where high percentages of infauna were encountered in these cores, and the functions used to fit these profiles differ appreciably from a pure exponential function, as a response of the significant living fauna in deeper layers of the sediment. One of the main factors supporting the different foraminiferal assemblage responses may be related to the differences in primary productivity of the water column and, consequently, in the estimated carbon flux to the sea floor. Nevertheless, also bottom water velocities, substrate type and water depth need to be considered.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

In this paper we introduce an extension of the Lindley distribution which offers a more flexible model for lifetime data. Several statistical properties of the distribution are explored, such as the density, (reversed) failure rate, (reversed) mean residual lifetime, moments, order statistics, Bonferroni and Lorenz curves. Estimation using the maximum likelihood and inference of a random sample from the distribution are investigated. A real data application illustrates the performance of the distribution. (C) 2011 The Korean Statistical Society. Published by Elsevier B.V. All rights reserved.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

In this paper we propose a hybrid hazard regression model with threshold stress which includes the proportional hazards and the accelerated failure time models as particular cases. To express the behavior of lifetimes the generalized-gamma distribution is assumed and an inverse power law model with a threshold stress is considered. For parameter estimation we develop a sampling-based posterior inference procedure based on Markov Chain Monte Carlo techniques. We assume proper but vague priors for the parameters of interest. A simulation study investigates the frequentist properties of the proposed estimators obtained under the assumption of vague priors. Further, some discussions on model selection criteria are given. The methodology is illustrated on simulated and real lifetime data set.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The Conway-Maxwell Poisson (COMP) distribution as an extension of the Poisson distribution is a popular model for analyzing counting data. For the first time, we introduce a new three parameter distribution, so-called the exponential-Conway-Maxwell Poisson (ECOMP) distribution, that contains as sub-models the exponential-geometric and exponential-Poisson distributions proposed by Adamidis and Loukas (Stat Probab Lett 39:35-42, 1998) and KuAY (Comput Stat Data Anal 51:4497-4509, 2007), respectively. The new density function can be expressed as a mixture of exponential density functions. Expansions for moments, moment generating function and some statistical measures are provided. The density function of the order statistics can also be expressed as a mixture of exponential densities. We derive two formulae for the moments of order statistics. The elements of the observed information matrix are provided. Two applications illustrate the usefulness of the new distribution to analyze positive data.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The design of a network is a solution to several engineering and science problems. Several network design problems are known to be NP-hard, and population-based metaheuristics like evolutionary algorithms (EAs) have been largely investigated for such problems. Such optimization methods simultaneously generate a large number of potential solutions to investigate the search space in breadth and, consequently, to avoid local optima. Obtaining a potential solution usually involves the construction and maintenance of several spanning trees, or more generally, spanning forests. To efficiently explore the search space, special data structures have been developed to provide operations that manipulate a set of spanning trees (population). For a tree with n nodes, the most efficient data structures available in the literature require time O(n) to generate a new spanning tree that modifies an existing one and to store the new solution. We propose a new data structure, called node-depth-degree representation (NDDR), and we demonstrate that using this encoding, generating a new spanning forest requires average time O(root n). Experiments with an EA based on NDDR applied to large-scale instances of the degree-constrained minimum spanning tree problem have shown that the implementation adds small constants and lower order terms to the theoretical bound.