926 results for Genomic data integration
Abstract:
A four-parameter extension of the generalized gamma distribution capable of modelling a bathtub-shaped hazard rate function is defined and studied. The beauty and importance of this distribution lies in its ability to model monotone and non-monotone failure rate functions, which are quite common in lifetime data analysis and reliability. The new distribution has a number of well-known lifetime special sub-models, such as the exponentiated Weibull, exponentiated generalized half-normal, exponentiated gamma and generalized Rayleigh, among others. We derive two infinite sum representations for its moments. We calculate the density of the order statistics and two expansions for their moments. The method of maximum likelihood is used for estimating the model parameters and the observed information matrix is obtained. Finally, a real data set from the medical area is analysed.
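As a rough illustration of a bathtub-shaped hazard, the sketch below evaluates the hazard rate of the exponentiated Weibull distribution, one of the special sub-models named in the abstract. It is not the four-parameter distribution itself, whose density the abstract does not give. The function name and the parameter values (shape 3, scale 1, power 0.2) are assumptions chosen only so that the hazard first falls and then rises.

```python
import math


def exp_weibull_hazard(t, shape, scale, power):
    """Hazard h(t) = f(t) / (1 - F(t)) for the exponentiated Weibull,
    whose CDF is F(t) = (1 - exp(-(t / scale) ** shape)) ** power."""
    z = (t / scale) ** shape
    base = -math.expm1(-z)  # Weibull CDF, 1 - exp(-z), computed stably
    weibull_pdf = (shape / scale) * (t / scale) ** (shape - 1) * math.exp(-z)
    pdf = power * base ** (power - 1) * weibull_pdf
    survival = 1.0 - base ** power
    return pdf / survival


if __name__ == "__main__":
    # Hypothetical parameters, not taken from the paper.
    for t in (0.01, 0.1, 0.5, 1.0, 2.0):
        print(t, exp_weibull_hazard(t, shape=3.0, scale=1.0, power=0.2))
```

With these values the hazard is high near zero, dips at intermediate times, and climbs again for large t, which is the non-monotone pattern the abstract refers to.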
Abstract:
Joint generalized linear models and double generalized linear models (DGLMs) were designed to model outcomes for which the variability can be explained using factors and/or covariates. When such factors operate, the usual normal regression models, which inherently exhibit constant variance, will under-represent variation in the data and hence may lead to erroneous inferences. For count and proportion data, such noise factors can generate a so-called overdispersion effect, and the use of binomial and Poisson models underestimates the variability and, consequently, incorrectly indicates significant effects. In this manuscript, we propose a DGLM from a Bayesian perspective, focusing on the case of proportion data, where the overdispersion can be modeled using a random effect that depends on some noise factors. The posterior joint density function was sampled using Markov chain Monte Carlo algorithms, allowing inferences over the model parameters. An application to a data set on apple tissue culture is presented, for which it is shown that the Bayesian approach is quite feasible, even when limited prior information is available, thereby generating valuable insight for the researcher about the experimental results.
Abstract:
Grass reference evapotranspiration (ETo) is an important agrometeorological parameter for climatological and hydrological studies, as well as for irrigation planning and management. There are several methods to estimate ETo, but their performance in different environments is diverse, since all of them have some empirical background. The FAO Penman-Monteith (FAO PM) method has been considered a universal standard to estimate ETo for more than a decade. This method considers many parameters related to the evapotranspiration process: net radiation (Rn), air temperature (T), vapor pressure deficit (Δe), and wind speed (U); and has presented very good results when compared to data from lysimeters populated with short grass or alfalfa. In some conditions, the use of the FAO PM method is restricted by the lack of input variables. In these cases, when data are missing, the option is to calculate ETo by the FAO PM method using estimated input variables, as recommended by FAO Irrigation and Drainage Paper 56. Based on that, the objective of this study was to evaluate the performance of the FAO PM method to estimate ETo when Rn, Δe, and U data are missing, in Southern Ontario, Canada. Other alternative methods were also tested for the region: Priestley-Taylor, Hargreaves, and Thornthwaite. Data from 12 locations across Southern Ontario, Canada, were used to compare ETo estimated by the FAO PM method with a complete data set and with missing data. The alternative ETo equations were also tested and calibrated for each location. When relative humidity (RH) and U data were missing, the FAO PM method was still a very good option for estimating ETo for Southern Ontario, with RMSE smaller than 0.53 mm day⁻¹. For these cases, U data were replaced by the normal values for the region and Δe was estimated from temperature data. The Priestley-Taylor method was also a good option for estimating ETo when U and Δe data were missing, mainly when calibrated locally (RMSE = 0.40 mm day⁻¹). When Rn was missing, the FAO PM method was not good enough for estimating ETo, with RMSE increasing to 0.79 mm day⁻¹. When only T data were available, adjusted Hargreaves and modified Thornthwaite methods were better options to estimate ETo than the FAO PM method, since RMSEs from these methods, respectively 0.79 and 0.83 mm day⁻¹, were significantly smaller than that obtained by FAO PM (RMSE = 1.12 mm day⁻¹). © 2009 Elsevier B.V. All rights reserved.
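The comparison above rests on two simple calculations, sketched here under stated assumptions. `rmse` is the usual root-mean-square error between two ETo series. `hargreaves_eto` is the standard uncalibrated Hargreaves-Samani equation, not the locally adjusted version evaluated in the study, whose coefficients the abstract does not report. The function names and the input values are hypothetical.

```python
import math


def rmse(estimated, observed):
    """Root-mean-square error between two equally long series (mm/day)."""
    if len(estimated) != len(observed) or not estimated:
        raise ValueError("series must be non-empty and of equal length")
    squared = [(e - o) ** 2 for e, o in zip(estimated, observed)]
    return math.sqrt(sum(squared) / len(squared))


def hargreaves_eto(t_max, t_min, ra_mm):
    """Uncalibrated Hargreaves-Samani ETo in mm/day.

    t_max, t_min: daily maximum and minimum air temperature (deg C).
    ra_mm: extraterrestrial radiation expressed as equivalent
           evaporation (mm/day).
    """
    t_mean = (t_max + t_min) / 2.0
    return 0.0023 * (t_mean + 17.8) * math.sqrt(t_max - t_min) * ra_mm


if __name__ == "__main__":
    # Hypothetical inputs for a single summer day.
    print(hargreaves_eto(t_max=25.0, t_min=15.0, ra_mm=16.0))
    # Hypothetical estimated and reference series.
    print(rmse([3.1, 4.2, 5.0], [3.0, 4.5, 4.6]))
```

A local calibration of the kind described in the abstract would typically replace the 0.0023 coefficient with a fitted value for each location.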
Abstract:
This article presents a statistical model of agricultural yield data based on a set of hierarchical Bayesian models that allows joint modeling of temporal and spatial autocorrelation. This method captures a comprehensive range of the various uncertainties involved in predicting crop insurance premium rates as opposed to the more traditional ad hoc, two-stage methods that are typically based on independent estimation and prediction. A panel data set of county-average yield data was analyzed for 290 counties in the State of Parana (Brazil) for the period of 1990 through 2002. Posterior predictive criteria are used to evaluate different model specifications. This article provides substantial improvements in the statistical and actuarial methods often applied to the calculation of insurance premium rates. These improvements are especially relevant to situations where data are limited.
Abstract:
The Arabidopsis thylakoid FtsH protease complex is composed of FtsH1/FtsH5 (type A) and FtsH2/FtsH8 (type B) subunits. Type A and type B subunits display a high degree of sequence identity throughout their mature domains, but no similarity in their amino-terminal targeting peptide regions. In chloroplast import assays, FtsH2 and FtsH5 were imported and subsequently integrated into thylakoids by a two-step processing mechanism that resulted in an amino-proximal lumenal domain, a single transmembrane anchor, and a carboxyl-proximal stromal domain. FtsH2 integration into washed thylakoids was entirely dependent on the proton gradient, whereas FtsH5 integration was dependent on NTPs, suggesting their integration by Tat and Sec pathways, respectively. This finding was corroborated by in organello competition and by antibody inhibition experiments. A series of constructs was made in order to understand the molecular basis for the different integration pathways. The amino-proximal domains through the transmembrane anchors were sufficient for proper integration, as demonstrated with carboxyl-truncated versions of FtsH2 and FtsH5. The mature FtsH2 protein was found to be incompatible with the Sec machinery, as determined with targeting peptide-swapping experiments. Incompatibility does not appear to be determined by any specific element in the FtsH2 domain, as no single domain was incompatible with Sec transport. This suggests an incompatible structure that requires the intact FtsH2. That the highly homologous type A and type B subunits of the same multimeric complex use different integration pathways is a striking example of the notion that membrane insertion pathways have evolved to accommodate structural features of their respective substrates.
Abstract:
Expressed sequence tag-derived markers have great potential for use in functional map construction and QTL tagging. In the present work, sugarcane genomic probes and expressed sequence tags with homology to genes, mostly involved in carbohydrate metabolism, were used in RFLP assays to identify putative QTLs as well as their epistatic interactions for fiber content, cane yield, Pol and tonnes of sugar per hectare, at two crop cycles in a progeny derived from a bi-parental cross of sugarcane elite materials. A hundred and twenty marker-trait associations were found, of which 26 were found at both crop cycles and 32 only at first ratoon cane. A sucrose synthase-derived marker was associated with a putative QTL having a high negative effect on cane yield and also with a QTL having a positive effect on Pol at both crop cycles. Fifty digenic epistatic marker interactions were identified for the four traits evaluated. Of these, only two were observed at both crop cycles.
Abstract:
Despite its importance to agriculture, the genetic basis of heterosis is still not well understood. The main competing hypotheses include dominance, overdominance, and epistasis. NC design III is an experimental design that has been used for estimating the average degree of dominance of quantitative trait loci (QTL) and also for studying heterosis. In this study, we first develop a multiple-interval mapping (MIM) model for design III that provides a platform to estimate the number, genomic positions, augmented additive and dominance effects, and epistatic interactions of QTL. The model can be used for parents with any generation of selfing. We apply the method to two data sets, one for maize and one for rice. Our results show that heterosis in maize is mainly due to dominant gene action, although overdominance of individual QTL could not completely be ruled out due to the mapping resolution and limitations of NC design III. For rice, the estimated QTL dominant effects could not explain the observed heterosis. There is evidence that additive × additive epistatic effects of QTL could be the main cause for the heterosis in rice. The difference in the genetic basis of heterosis seems to be related to open or self pollination of the two species. The MIM model for NC design III is implemented in Windows QTL Cartographer, a freely distributed software package.
Abstract:
Several aspects of photoperception and light signal transduction have been elucidated by studies with model plants. However, the information available for economically important crops, such as Fabaceae species, is scarce. To incorporate the existing genomic tools into a strategy to advance soybean research, we have investigated publicly available expressed sequence tag (EST) databases to identify Glycine max sequences related to genes involved in light-regulated developmental control in model plants. Approximately 38,000 sequences from open-access databases were investigated, and all bona fide and putative photoreceptor gene families were found in soybean sequence databases. We have identified G. max orthologs for several families of transcriptional regulators and cytoplasmic proteins mediating photoreceptor-induced responses, although some important Arabidopsis phytochrome-signaling components are absent. Moreover, soybean and Arabidopsis gene-family homologs appear to have undergone a distinct expansion process in some cases. We propose a working model of light perception, signal transduction and response-eliciting in G. max, based on the identified key components from Arabidopsis. These results demonstrate the power of comparative genomics between model systems and crop species to elucidate several aspects of plant physiology and metabolism.
Abstract:
The genetic linkage map for the common bean (Phaseolus vulgaris L.) is a valuable tool for breeding programs. Breeders provide new cultivars that meet the requirements of farmers and consumers, such as seed color, seed size, maturity, and growth habit. A genetic study was conducted to examine the genetics behind certain qualitative traits. Growth habit is usually described as a recessive trait inherited by a single gene, and there is no consensus about the position of the locus. The aim of this study was to develop a new genetic linkage map using genic and genomic microsatellite markers and three morphological traits: growth habit, flower color, and pod tip shape. A mapping population consisting of 380 recombinant F10 lines was generated from IAC-UNA x CAL143. A total of 871 microsatellites were screened for polymorphisms among the parents, and a linkage map was obtained with 198 mapped microsatellites. The total map length was 1865.9 cM, and the average distance between markers was 9.4 cM. Flower color and pod tip shape were mapped and segregated at Mendelian ratios, as expected. The segregation ratio and linkage data analyses indicated that the determinacy growth habit was inherited as two independent and dominant genes, and a genetic model is proposed for this trait.
Abstract:
We derive an analytic expression for the matric flux potential (M) for van Genuchten-Mualem (VGM) type soils which can also be written in terms of a converging infinite series. Considering the first four terms of this series, the accuracy of the approximation was verified by comparing it to values of M estimated by numerical finite difference integration. Using values of the parameters for three soils from different texture classes, the proposed four-term approximation showed an almost perfect match with the numerical solution, except for effective saturations higher than 0.9. Including more terms reduced the discrepancy but also increased the complexity of the equation. The four-term equation can be used for most applications. Cases with special interest in nearly saturated soils should include more terms from the infinite series. A transpiration reduction function for use with the VGM equations is derived by combining the derived expression for M with a root water extraction model. The shape of the resulting reduction function and its dependency on the derivative of the soil hydraulic diffusivity D with respect to the soil water content θ is discussed. Positive and negative values of dD/dθ yield concave and convex or S-shaped reduction functions, respectively. On the basis of three data sets, the hydraulic properties of virtually all soils yield concave reduction curves. Such curves based solely on soil hydraulic properties do not account for the complex interactions between shoot growth, root growth, and water availability.
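The numerical reference solution mentioned above can be sketched as follows. This is a minimal illustration, not the authors' code or their series expansion: it assumes the usual definition of the matric flux potential as the integral of hydraulic conductivity K over suction, from the current suction up to a wilting-point suction, and uses the standard van Genuchten-Mualem conductivity function. The parameter values (alpha, n, k_sat), the 15000 cm wilting suction, and the log-spaced trapezoidal rule are all assumptions made for the example.

```python
import math


def vgm_conductivity(psi, alpha, n, k_sat, l=0.5):
    """van Genuchten-Mualem K at suction psi (cm, positive)."""
    m = 1.0 - 1.0 / n
    se = (1.0 + (alpha * psi) ** n) ** (-m)  # effective saturation
    return k_sat * se ** l * (1.0 - (1.0 - se ** (1.0 / m)) ** m) ** 2


def matric_flux_potential(psi, alpha, n, k_sat, psi_wilt=15000.0, steps=2000):
    """Trapezoidal estimate of M = integral of K from psi to psi_wilt.

    Nodes are log-spaced because K varies over many orders of magnitude.
    With psi in cm and k_sat in cm/day, M is in cm^2/day.
    """
    if psi <= 0.0:
        raise ValueError("psi must be positive")
    if psi >= psi_wilt:
        return 0.0
    ratio = (psi_wilt / psi) ** (1.0 / steps)
    total = 0.0
    left = psi
    k_left = vgm_conductivity(left, alpha, n, k_sat)
    for _ in range(steps):
        right = left * ratio
        k_right = vgm_conductivity(right, alpha, n, k_sat)
        total += 0.5 * (k_left + k_right) * (right - left)
        left, k_left = right, k_right
    return total


if __name__ == "__main__":
    # Hypothetical soil: alpha in 1/cm, k_sat in cm/day.
    for psi in (10.0, 100.0, 1000.0):
        print(psi, matric_flux_potential(psi, alpha=0.02, n=1.5, k_sat=10.0))
```

An analytic or truncated-series expression for M could be checked against this kind of integration by evaluating both over a range of suctions and comparing the results near saturation, where the abstract reports the largest discrepancy.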
Abstract:
The understanding of complex physiological processes requires information from many different areas of knowledge. To meet this interdisciplinary scenario, the ability to integrate and articulate information is required. The difficulty of such an approach arises because, more often than not, information is fragmented throughout undergraduate education in Health Sciences. Shifting from a fragmentary and deep view of many topics to joining them horizontally in a global view is not a trivial task for teachers to implement. To attain that objective, we proposed the course described herein, "Biochemistry of the envenomation response", aimed at integrating previous contents of Health Sciences courses, following international recommendations for an interdisciplinary model. The contents were organized in modules with increasing topic complexity. The full understanding of the envenoming pathophysiology of each module would be attained by the integration of knowledge from different disciplines. An active-learning strategy was employed, focusing on concept map drawing. Evaluation was obtained by a 30-item Likert-type survey answered by ninety students; 84% of the students considered that the number of relations that they were able to establish, as seen in their concept maps, increased throughout the course. Similarly, 98% considered that both the theme and the strategy adopted in the course contributed to developing an interdisciplinary view.
Abstract:
Despite the importance of Eucalyptus spp. in the pulp and paper industry, functional genomic approaches have only recently been applied to understand wood formation in this genus. We attempted to establish a global view of gene expression in the juvenile cambial region of Eucalyptus grandis Hill ex Maiden. The expression profile was obtained from serial analysis of gene expression (SAGE) library data produced from 3- and 6-year-old trees. Fourteen-base expressed sequence tags (ESTs) were searched against public Eucalyptus ESTs and annotated with GenBank. Altogether 43,304 tags were generated producing 3066 unigenes with three or more copies each, 445 with a putative identity, 215 with unknown function and 2406 without an EST match. The expression profile of the juvenile cambial region revealed the presence of highly frequent transcripts related to general metabolism and energy metabolism, cellular processes, transport, structural components and information pathways. We made a quantitative analysis of a large number of genes involved in the biosynthesis of cellulose, pectin, hemicellulose and lignin. Our findings provide insight into the expression of functionally related genes involved in juvenile wood formation in young fast-growing E. grandis trees.
Abstract:
Allele frequency distributions and population data for 12 Y-chromosomal short tandem repeats (STRs) included in the PowerPlex® Y Systems (Promega) were obtained for a sample of 200 healthy unrelated males living in São Paulo State (Southeast of Brazil). A total of 192 haplotypes were identified, of which 184 were unique and 8 were found in 2 individuals. The average gene diversity of the 12 Y-STRs was 0.6746 and the haplotype diversity was 0.9996. Pairwise analysis confirmed that our population is more similar to those of Italy, North Portugal and Spain, and more distant from that of Japan. © 2007 Elsevier Ireland Ltd. All rights reserved.
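The reported haplotype diversity can be checked from the counts given in the abstract. The sketch below assumes the commonly used unbiased estimator attributed to Nei, HD = n / (n - 1) * (1 - sum of squared haplotype frequencies); the abstract does not state which estimator the authors applied, and the function name is illustrative.

```python
def haplotype_diversity(counts):
    """Unbiased haplotype diversity from per-haplotype counts."""
    n = sum(counts)
    if n < 2:
        raise ValueError("at least two sampled individuals are required")
    sum_squared_freq = sum((c / n) ** 2 for c in counts)
    return n / (n - 1) * (1.0 - sum_squared_freq)


if __name__ == "__main__":
    # From the abstract: 184 haplotypes seen once, 8 haplotypes seen twice.
    counts = [1] * 184 + [2] * 8
    print(len(counts), sum(counts))  # 192 haplotypes, 200 individuals
    print(round(haplotype_diversity(counts), 4))  # 0.9996
```

Under this assumption the counts reproduce both the sample size of 200 and the stated diversity of 0.9996.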
Abstract:
The Brazilian Network of Food Data Systems (BRASILFOODS) has maintained the Brazilian Food Composition Database-USP (TBCA-USP) (http://www.fcf.usp.br/tabela) since 1998. Besides the constant compilation, analysis and update work on the database, the network tries to innovate by introducing food information that may help decrease the risk of non-transmissible chronic diseases, such as the profile of carbohydrates and flavonoids in foods. In 2008, individually analyzed carbohydrate data for 112 foods and 41 data points on the glycemic response produced by foods widely consumed in the country were included in the TBCA-USP. A total of 773 data points on the different flavonoid subclasses of 197 Brazilian foods were compiled, and the quality of each was evaluated according to the USDA's data quality evaluation system. In 2007, BRASILFOODS/USP and INFOODS/FAO organized the 7th International Food Data Conference, "Food Composition and Biodiversity". This conference was a unique opportunity for interaction between renowned researchers and participants from several countries, and it allowed the discussion of aspects that may improve the food composition area. During the period, the LATINFOODS Regional Technical Compilation Committee and BRASILFOODS disseminated the Form and Manual for Data Compilation, version 2009, to Latin America, taught a Food Composition Data Compilation course and developed many activities related to data production and compilation. © 2010 Elsevier Inc. All rights reserved.
Abstract:
Methionine is a component of one-carbon metabolism and a precursor of S-adenosylmethionine (SAM), the methyl donor for DNA methylation. When methionine intake is high, an increase in SAM is expected. DNA methyltransferases convert SAM to S-adenosylhomocysteine (SAH). A high intracellular SAH concentration could inhibit the activity of DNA methyltransferases. Therefore, high methionine ingestion could induce DNA damage and change the methylation pattern of tumor suppressor genes. This study investigated the genotoxicity of a methionine-supplemented diet. It also investigated the diet's effects on glutathione levels, SAM and SAH concentrations and the gene methylation pattern of p53. Wistar rats received either a methionine-supplemented diet (2% methionine) or a control diet (0.3% methionine) for six weeks. The methionine-supplemented diet was neither genotoxic nor antigenotoxic to kidney cells, as assessed by the comet assay. However, the methionine-supplemented diet restored the renal glutathione depletion induced by doxorubicin. This fact may be explained by the transsulfuration pathway, which converts methionine to glutathione in the kidney. Methionine supplementation increased the renal concentration of SAH without changing the SAM/SAH ratio. This unchanged profile was also observed for DNA methylation at the promoter region of the p53 gene. Further studies are necessary to elucidate this diet's effects on genomic stability and DNA methylation. © 2011 Elsevier ay. All rights reserved.