979 resultados para Genomic Regions
Resumo:
The EF-hand superfamily of calcium binding proteins includes the S100, calcium binding protein, and troponin subfamilies. This study represents a genome, structure, and expression analysis of the S100 protein family, in mouse, human, and rat. We confirm the high level of conservation between mammalian sequences but show that four members, including S100A12, are present only in the human genome. We describe three new members of the S100 family in the three species and their locations within the S100 genomic clusters and propose a revised nomenclature and phylogenetic relationship between members of the EF-hand superfamily. Two of the three new genes were induced in bone-marrow-derived macrophages activated with bacterial lipopolysaccharide, suggesting a role in inflammation. Normal human and murine tissue distribution profiles indicate that some members of the family are expressed in a specific manner, whereas others are more ubiquitous. Structure-function analysis of the chemotactic properties of murine S100A8 and human S100A12, particularly within the active hinge domain, suggests that the human protein is the functional homolog of the murine protein. Strong similarities between the promoter regions of human S100A12 and murine S100A8 support this possibility. This study provides insights into the possible processes of evolution of the EF-hand protein superfamily. Evolution of the S100 proteins appears to have occurred in a modular fashion, also seen in other protein families such as the C2H2-type zinc-finger family. (C) 2004 Elsevier Inc. All rights reserved.
Resumo:
In Mesoamerica, tropical dry forest is a highly threatened habitat, and species endemic to this environment are under extreme pressure. The tree species, Lonchocarpus costaricensis is endemic to the dry northwest of Costa Rica and southwest Nicaragua. It is a locally important species but, as land has been cleared for agriculture, populations have experienced considerable reduction and fragmentation. To assess current levels and distribution of genetic diversity in the species, a combination of chloroplast-specific (cpDNA) and whole genome DNA markers (amplified fragment length polymorphism, AFLP) were used to fingerprint 121 individual trees in 6 populations. Two cpDNA haplotypes were identified, distributed among populations such that populations at the extremes of the distribution showed lowest diversity. A large number (487) of AFLP markers were obtained and indicated that diversity levels were highest in the two coastal populations (Cobano, Matapalo, H = 0.23, 0.28 respectively). Population differentiation was low overall, F-ST = 0.12, although Matapalo was strongly differentiated from all other populations (F-ST = 0.16-0.22), apart from Cobano (F., = 0.11). Spatial genetic structure was present in both datasets at different scales: cpDNA was structured at a range-wide distribution scale, whilst AFLP data revealed genetic neighbourhoods on a population scale. In general, the habitat degradation of recent times appears not to have yet impacted diversity levels in mature populations. However, although no data on seed or saplings were collected, it seems likely that reproductive mechanisms in the species will have been affected by land clearance. It is recommended that efforts should be made to conserve the extant genetic resource base and further research undertaken to investigate diversity levels in the progeny generation.
Resumo:
Background: Current methods to find significantly under- and over-represented gene ontology (GO) terms in a set of genes consider the genes as equally probable balls in a bag, as may be appropriate for transcripts in micro-array data. However, due to the varying length of genes and intergenic regions, that approach is inappropriate for deciding if any GO terms are correlated with a set of genomic positions. Results: We present an algorithm - GONOME - that can determine which GO terms are significantly associated with a set of genomic positions given a genome annotated with (at least) the starts and ends of genes. We show that certain GO terms may appear to be significantly associated with a set of randomly chosen positions in the human genome if gene lengths are not considered, and that these same terms have been reported as significantly over-represented in a number of recent papers. This apparent over-representation disappears when gene lengths are considered, as GONOME does. For example, we show that, when gene length is taken into account, the term development is not significantly enriched in genes associated with human CpG islands, in contradiction to a previous report. We further demonstrate the efficacy of GONOME by showing that occurrences of the proteosome-associated control element (PACE) upstream activating sequence in the S. cerevisiae genome associate significantly to appropriate GO terms. An extension of this approach yields a whole-genome motif discovery algorithm that allows identification of many other promoter sequences linked to different types of genes, including a large group of previously unknown motifs significantly associated with the terms 'translation' and 'translational elongation'. Conclusion: GONOME is an algorithm that correctly extracts over-represented GO terms from a set of genomic positions. By explicitly considering gene size, GONOME avoids a systematic bias toward GO terms linked to large genes. Inappropriate use of existing algorithms that do not take gene size into account has led to erroneous or suspect conclusions. Reciprocally GONOME may be used to identify new features in genomes that are significantly associated with particular categories of genes.
Resumo:
Alcoholism results in changes in the human brain which reinforce the cycle of craving and dependency, and these changes are manifest in the pattern of expression of mRNA and proteins in key cells and brain areas. Long-term alcohol abuse also results in damage to selected regions of the cortex. We have used cDNA microarrays to show that less than 1% of mRNA transcripts differ signifi cantly between cases and controls in the susceptible area and that the expression profi le of a subset of these transcripts is suffi cient to distinguish alcohol abusers from controls. In addition, we have utilized a 2D gel proteomics based approach to determine the identity of proteins in the superior frontal cortex (SFC) of the human brain that show differential expression in controls and long term alcohol abusers. Overall, 182 proteins differed by the criterion of > 2-fold between case and control samples. Of these, 139 showed signifi cantly lower expression in alcoholics, 35 showed signifi cantly higher expression, and 8 were new or had disappeared. To date 63 proteins have been identifi ed. The expression of one family of proteins, the synucleins, has been further characterized using Real Time PCR and Western Blotting. The expression of alpha-synuclein mRNA was signifi cantly lower in the SFC of alcoholics compared with the same area in controls (P = 0.01) whereas no such difference in expression was found in the motor cortex. The expression of beta- and gamma- synuclein were not signifi cantly different between alcoholics and controls. In contrast, the pattern of alphasynuclein protein expression differs from that of the corresponding RNA transcript. Because of the key role of synaptic proteins in the pathogenesis of alcoholism, we are developing 2-D DIGE based techniques to quantify expression changes in synaptosomes prepared from the SFC of controls and alcoholics.
Resumo:
Deletion of the TP53 gene on chromosome 17p13.1 is the prognostic factor associated with the shortest survival in CLL. We used array-based comparative genomic hybridisation (arrayCGH) to identify additional DNA copy number changes in peripheral blood samples from 74 LRF CLL4 trial patients, 37 with >or=5% and 37 without TP53-deleted cells. ArrayCGH reliably detected deletions on 17p, including the TP53 locus, in cases with >or=50%TP53-deleted cells detected by fluorescence in situ hybridisation, plus seven additional cases with deleted regions on 17p excluding TP53. Losses on chromosomal regions 18p and/or 20p were found exclusively in cases with >or=5%TP53-deleted cells (por=5%TP53-deleted cases (p=0.02). In particular, amplification of 2p and deletion of 6q were both more frequent. Cases with >20%TP53-deleted cells had the worst prognosis in the LRF CLL4 trial.
Resumo:
Multiple myeloma is characterized by genomic alterations frequently involving gains and losses of chromosomes. Single nucleotide polymorphism (SNP)-based mapping arrays allow the identification of copy number changes at the sub-megabase level and the identification of loss of heterozygosity (LOH) due to monosomy and uniparental disomy (UPD). We have found that SNP-based mapping array data and fluorescence in situ hybridization (FISH) copy number data correlated well, making the technique robust as a tool to investigate myeloma genomics. The most frequently identified alterations are located at 1p, 1q, 6q, 8p, 13, and 16q. LOH is found in these large regions and also in smaller regions throughout the genome with a median size of 1 Mb. We have identified that UPD is prevalent in myeloma and occurs through a number of mechanisms including mitotic nondisjunction and mitotic recombination. For the first time in myeloma, integration of mapping and expression data has allowed us to reduce the complexity of standard gene expression data and identify candidate genes important in both the transition from normal to monoclonal gammopathy of unknown significance (MGUS) to myeloma and in different subgroups within myeloma. We have documented these genes, providing a focus for further studies to identify and characterize those that are key in the pathogenesis of myeloma.
Resumo:
Comparative genomic hybridization (CGH) studies have demonstrated a high incidence of chromosomal imbalances in non-Hodgkin's lymphoma. However, the information on the genomic imbalances in Burkitt's Lymphoma (BL) is scanty. Conventional cytogenetics was performed in 34 cases, and long-distance PCR for t(8;14) was performed in 18 cases. A total of 170 changes were present with a median of four changes per case (range 1-22). Gains of chromosomal material (143) were more frequent than amplifications (5) or losses (22). The most frequent aberrations were gains on chromosomes 12q (26%), Xq (22%), 22q (20%), 20q (17%) and 9q (15%). Losses predominantly involved chromosomes 13q (17%) and 4q (9%). High-level amplifications were present in the regions 1q23-31 (three cases), 6p12-p25 and 8p22-p23. Upon comparing BL vs Burkitt's cell leukemia (BCL), the latter had more changes (mean 4.3 +/- 2.2) than BL (mean 2.7 +/- 3.2). In addition, BCL cases showed more frequently gains on 8q, 9q, 14q, 20q, and 20q, 9q, 8q and 14q, as well as losses on 13q and 4q. Concerning outcome, the presence of abnormalities on 1q (ascertained either by cytogenetics or by CGH), and imbalances on 7q (P=0.01) were associated with a short survival.
Resumo:
Background: The present study was undertaken towards the development of SSR markers and assessing genetic relationships among 32 date palm ( Phoenix dactylifera L.) representing common cultivars grown in different geographical regions in Saudi Arabia. Results: Ninety-three novel simple sequence repeat markers were developed and screened for their ability to detect polymorphism in date palm. Around 71% of genomic SSRs were dinucleotide, 25% tri, 3% tetra and 1% penta nucleotide motives. Twenty-two primers generated a total of 91 alleles with a mean of 4.14 alleles per locus and 100% polymorphism percentage. A 0.595 average polymorphic information content and 0.662 primer discrimination power values were recorded. The expected and observed heterozygosities were 0.676 and 0.763 respectively. Pair-wise similarity values ranged from 0.06 to 0.89 and the overall cultivars averaged 0.41. The UPGMA cluster analysis recovered by principal coordinate analysis illustrated that cultivars tend to group according to their class of maturity, region of cultivation, and fruit color. Analysis of molecular variations (AMOVA) revealed that genetic variation among and within cultivars were 27% and 73%, respectively according to geographical distribution of cultivars. Conclusions: The developed microsatellite markers are additional values to date palm characterization tools that can be used by researchers in population genetics, cultivar identification as well as genetic resource exploration and management. The tested cultivars exhibited a significant amount of genetic diversity and could be suitable for successful breeding program. Genomic sequences generated from this study are available at the National Center for Biotechnology Information (NCBI), Sequence Read Archive (Accession numbers. LIBGSS_039019).
Resumo:
Background: Reduced-representation sequencing technology iswidely used in genotyping for its economical and efficient features. A popular way to construct the reduced-representation sequencing libraries is to digest the genomic DNA with restriction enzymes. A key factor of this method is to determine the restriction enzyme(s). But there are few computer programs which can evaluate the usability of restriction enzymes in reduced-representation sequencing. SimRAD is an R package which can simulate the digestion of DNA sequence by restriction enzymes and return enzyme loci number as well as fragment number. But for linkage mapping analysis, enzyme loci distribution is also an important factor to evaluate the enzyme. For phylogenetic studies, comparison of the enzyme performance across multiple genomes is important. It is strongly needed to develop a simulation tool to implement these functions. Results: Here, we introduce a Perl module named RestrictionDigest with more functions and improved performance. It can analyze multiple genomes at one run and generate concise comparison of enzyme performance across the genomes. It can simulate single-enzyme digestion, double-enzyme digestion and size selection process and generate comprehensive information of the simulation including enzyme loci number, fragment number, sequences of the fragments, positions of restriction sites on the genome, the coverage of digested fragments on different genome regions and detailed fragment length distribution. Conclusions: RestrictionDigest is an easy-to-use Perl module with flexible parameter settings.With the help of the information produced by the module, researchers can easily determine the most appropriate enzymes to construct the reduced-representation libraries to meet their experimental requirements.
Integrative genomic, epigenetic and metabolomic characterization of beef from grass-fed Angus steers
Resumo:
Beef constitutes a main component of the American diet and still represent the principal source of protein in many parts of the world. Currently, the meat market is experiencing an important transformation; consumers are increasingly switching from consuming traditional beef to grass-fed beef. People recognized products obtained from grass-fed animals as more natural and healthy. However, the true variations between these two production systems regarding various aspects remain unclear. This dissertation provides information from closely genetically related animals, in order to decrease confounding factors, to explain several confused divergences between grain-fed and grass-fed beef. First, we examined the growth curve, important economic traits and quality carcass characteristics over four consecutive years in grain-fed and grass-fed animals, generating valuable information for management decisions and economic evaluation for grass-fed cattle operations. Second, we performed the first integrated transcriptomic and metabolomic analysis in grass-fed beef, detecting alterations in glucose metabolism, divergences in free fatty acids and carnitine conjugated lipid levels, and altered β-oxidation. Results suggest that grass finished beef could possibly benefit consumer health from having lower total fat content and better lipid profile than grain-fed beef. Regarding animal welfare, grass-fed animals may experience less stress than grain-fed individuals as well. Finally, we contrasted the genome-wide DNA methylation of grass-fed beef against grain-fed beef using the methyl-CpG binding domain sequencing (MBD-Seq) method, identifying 60 differentially methylated regions (DMRs). Most of DMRs were located inside or upstream of genes and displayed increased levels of methylation in grass-fed individuals, implying a global DNA methylation increment in this group. Interestingly, chromosome 14, which has been associated with large effects on ADG, marbling, back fat, ribeye area and hot carcass weight in beef cattle, allocated the largest number of DMRs (12/60). The pathway analysis identified skeletal and muscular system as the preeminent physiological system and function, and recognized carbohydrates metabolism, lipid metabolism and tissue morphology among the highest ranked networks. Therefore, although we recognize some limitations and assume that additional examination is still required, this project provides the first integrative genomic, epigenetic and metabolomics characterization of beef produced under grass-fed regimen.
Resumo:
Background: Copy number variations (CNVs) have been shown to account for substantial portions of observed genomic variation and have been associated with qualitative and quantitative traits and the onset of disease in a number of species. Information from high-resolution studies to detect, characterize and estimate population-specific variant frequencies will facilitate the incorporation of CNVs in genomic studies to identify genes affecting traits of importance. Results: Genome-wide CNVs were detected in high-density single nucleotide polymorphism (SNP) genotyping data from 1,717 Nelore (Bos indicus) cattle, and in NGS data from eight key ancestral bulls. A total of 68,007 and 12,786 distinct CNVs were observed, respectively. Cross-comparisons of results obtained for the eight resequenced animals revealed that 92 % of the CNVs were observed in both datasets, while 62 % of all detected CNVs were observed to overlap with previously validated cattle copy number variant regions (CNVRs). Observed CNVs were used for obtaining breed-specific CNV frequencies and identification of CNVRs, which were subsequently used for gene annotation. A total of 688 of the detected CNVRs were observed to overlap with 286 non-redundant QTLs associated with important production traits in cattle. All of 34 CNVs previously reported to be associated with milk production traits in Holsteins were also observed in Nelore cattle. Comparisons of estimated frequencies of these CNVs in the two breeds revealed 14, 13, 6 and 14 regions in high (>20 %), low (<20 %) and divergent (NEL > HOL, NEL < HOL) frequencies, respectively. Conclusions: Obtained results significantly enriched the bovine CNV map and enabled the identification of variants that are potentially associated with traits under selection in Nelore cattle, particularly in genome regions harboring QTLs affecting production traits.