Digital Library

986 results for Genomic Regions

Yeast artificial chromosome contigs reveal that distal variable-region genes reside at least 3 megabases from the joining regions in the murine immunoglobulin kappa locus.

Relevance:

30.00%

Publisher:

Abstract:

The immunoglobulin kappa gene locus encodes 95% of the light chains of murine antibody molecules and is thought to contain up to 300 variable (V kappa)-region genes generally considered to comprise 20 families. To delineate the locus we have isolated 29 yeast artificial chromosome genomic clones that form two contigs, span > 3.5 megabases, and contain two known non-immunoglobulin kappa markers. Using PCR primers specific for 19 V kappa gene families and Southern analysis, we have refined the genetically defined order of these V kappa gene families. Of these, V kappa 2 maps at least 3.0 Mb from the joining (J kappa) region and appears to be the most distal V kappa gene segment.

Genomic structure of human microtubule-associated protein 2 (MAP-2) and characterization of additional MAP-2 isoforms.

Relevance:

30.00%

Publisher:

Abstract:

We have determined that the gene for human microtubule-associated protein 2 (MAP-2) spans 19 exons, including 6 exons identified in this study, 1-4, 8, and 13; all six of these exons are transcribed. The alternative splicing of coding exons generates a greater diversity of MAP-2 transcripts and isoforms. The first three exons encode alternate 5' untranslated regions that can be spliced to additional untranslated sequences contained in exons 4 and 5. Exons 8 and 13 are transcribed in human fetal spinal cord, adult brain, MSN cells, and rat brain, and each exon maintains an open reading frame with both high and low molecular weight MAP-2 isoforms. Antibodies generated to synthetic peptides of exons 8 and 13 demonstrate that these exons are translated and MAP-2 isoforms containing these exons are generated.

Probing the S100 protein family through genomic and functional analysis

Relevance:

30.00%

Publisher:

Abstract:

The EF-hand superfamily of calcium binding proteins includes the S100, calcium binding protein, and troponin subfamilies. This study represents a genome, structure, and expression analysis of the S100 protein family, in mouse, human, and rat. We confirm the high level of conservation between mammalian sequences but show that four members, including S100A12, are present only in the human genome. We describe three new members of the S100 family in the three species and their locations within the S100 genomic clusters and propose a revised nomenclature and phylogenetic relationship between members of the EF-hand superfamily. Two of the three new genes were induced in bone-marrow-derived macrophages activated with bacterial lipopolysaccharide, suggesting a role in inflammation. Normal human and murine tissue distribution profiles indicate that some members of the family are expressed in a specific manner, whereas others are more ubiquitous. Structure-function analysis of the chemotactic properties of murine S100A8 and human S100A12, particularly within the active hinge domain, suggests that the human protein is the functional homolog of the murine protein. Strong similarities between the promoter regions of human S100A12 and murine S100A8 support this possibility. This study provides insights into the possible processes of evolution of the EF-hand protein superfamily. Evolution of the S100 proteins appears to have occurred in a modular fashion, also seen in other protein families such as the C2H2-type zinc-finger family. (C) 2004 Elsevier Inc. All rights reserved.

Chloroplast and total genomic diversity in the endemic Costa Rican tree Lonchocarpus costaricensis (J.D. Smith) Pittier (Papilionaceae).

Relevance:

30.00%

Publisher:

Abstract:

In Mesoamerica, tropical dry forest is a highly threatened habitat, and species endemic to this environment are under extreme pressure. The tree species Lonchocarpus costaricensis is endemic to the dry northwest of Costa Rica and southwest Nicaragua. It is a locally important species but, as land has been cleared for agriculture, populations have experienced considerable reduction and fragmentation. To assess current levels and distribution of genetic diversity in the species, a combination of chloroplast-specific (cpDNA) and whole genome DNA markers (amplified fragment length polymorphism, AFLP) were used to fingerprint 121 individual trees in 6 populations. Two cpDNA haplotypes were identified, distributed among populations such that populations at the extremes of the distribution showed lowest diversity. A large number (487) of AFLP markers were obtained and indicated that diversity levels were highest in the two coastal populations (Cobano, Matapalo, H = 0.23, 0.28 respectively). Population differentiation was low overall, F-ST = 0.12, although Matapalo was strongly differentiated from all other populations (F-ST = 0.16-0.22), apart from Cobano (F-ST = 0.11). Spatial genetic structure was present in both datasets at different scales: cpDNA was structured at a range-wide distribution scale, whilst AFLP data revealed genetic neighbourhoods on a population scale. In general, the habitat degradation of recent times appears not to have yet impacted diversity levels in mature populations. However, although no data on seed or saplings were collected, it seems likely that reproductive mechanisms in the species will have been affected by land clearance. It is recommended that efforts should be made to conserve the extant genetic resource base and further research undertaken to investigate diversity levels in the progeny generation.

GONOME: measuring correlations between GO terms and genomic positions

Relevance:

30.00%

Publisher:

Abstract:

Background: Current methods to find significantly under- and over-represented gene ontology (GO) terms in a set of genes consider the genes as equally probable balls in a bag, as may be appropriate for transcripts in micro-array data. However, due to the varying length of genes and intergenic regions, that approach is inappropriate for deciding if any GO terms are correlated with a set of genomic positions. Results: We present an algorithm - GONOME - that can determine which GO terms are significantly associated with a set of genomic positions given a genome annotated with (at least) the starts and ends of genes. We show that certain GO terms may appear to be significantly associated with a set of randomly chosen positions in the human genome if gene lengths are not considered, and that these same terms have been reported as significantly over-represented in a number of recent papers. This apparent over-representation disappears when gene lengths are considered, as GONOME does. For example, we show that, when gene length is taken into account, the term development is not significantly enriched in genes associated with human CpG islands, in contradiction to a previous report. We further demonstrate the efficacy of GONOME by showing that occurrences of the proteosome-associated control element (PACE) upstream activating sequence in the S. cerevisiae genome associate significantly to appropriate GO terms. An extension of this approach yields a whole-genome motif discovery algorithm that allows identification of many other promoter sequences linked to different types of genes, including a large group of previously unknown motifs significantly associated with the terms 'translation' and 'translational elongation'. Conclusion: GONOME is an algorithm that correctly extracts over-represented GO terms from a set of genomic positions. By explicitly considering gene size, GONOME avoids a systematic bias toward GO terms linked to large genes. 
Inappropriate use of existing algorithms that do not take gene size into account has led to erroneous or suspect conclusions. Reciprocally GONOME may be used to identify new features in genomes that are significantly associated with particular categories of genes.
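The gene-length bias described in this abstract can be illustrated with a small sketch. This is not the GONOME algorithm itself, whose details the abstract does not give. It assumes a toy annotation of non-overlapping genes, considers only positions that fall inside genes, and uses a plain binomial tail as the test. The function names and the toy data are hypothetical.

```python
from math import comb

def null_probabilities(genes, term):
    """Chance that a hit carries `term` under two null models.

    genes: list of (start, end, set_of_terms), assumed non-overlapping.
    Returns (by_count, by_length): every gene equally likely, versus
    every base within a gene equally likely.
    """
    with_term = [g for g in genes if term in g[2]]
    by_count = len(with_term) / len(genes)
    total_length = sum(end - start for start, end, _ in genes)
    term_length = sum(end - start for start, end, _ in with_term)
    return by_count, term_length / total_length

def binomial_upper_tail(k, n, p):
    """P(X >= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))

# Toy annotation (hypothetical): one long gene carries the term.
genes = [
    (0, 9000, {"development"}),
    (10000, 11000, {"translation"}),
    (12000, 13000, {"translation"}),
    (14000, 15000, {"transport"}),
]
by_count, by_length = null_probabilities(genes, "development")

# Suppose 7 of 10 positions land in a gene annotated with the term.
p_ignoring_length = binomial_upper_tail(7, 10, by_count)
p_using_length = binomial_upper_tail(7, 10, by_length)
```

In this toy case the term looks strongly enriched when genes are treated as equally probable, and unremarkable once gene length sets the null probability, which is the effect the abstract attributes to ignoring gene size.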

Genomic and proteomic analysis of synaptic protein expression in the brains of human alcoholics

Relevance:

30.00%

Publisher:

Abstract:

Alcoholism results in changes in the human brain which reinforce the cycle of craving and dependency, and these changes are manifest in the pattern of expression of mRNA and proteins in key cells and brain areas. Long-term alcohol abuse also results in damage to selected regions of the cortex. We have used cDNA microarrays to show that less than 1% of mRNA transcripts differ significantly between cases and controls in the susceptible area and that the expression profile of a subset of these transcripts is sufficient to distinguish alcohol abusers from controls. In addition, we have utilized a 2D gel proteomics based approach to determine the identity of proteins in the superior frontal cortex (SFC) of the human brain that show differential expression in controls and long term alcohol abusers. Overall, 182 proteins differed by the criterion of > 2-fold between case and control samples. Of these, 139 showed significantly lower expression in alcoholics, 35 showed significantly higher expression, and 8 were new or had disappeared. To date 63 proteins have been identified. The expression of one family of proteins, the synucleins, has been further characterized using Real Time PCR and Western Blotting. The expression of alpha-synuclein mRNA was significantly lower in the SFC of alcoholics compared with the same area in controls (P = 0.01) whereas no such difference in expression was found in the motor cortex. The expression of beta- and gamma-synuclein was not significantly different between alcoholics and controls. In contrast, the pattern of alpha-synuclein protein expression differs from that of the corresponding RNA transcript. Because of the key role of synaptic proteins in the pathogenesis of alcoholism, we are developing 2-D DIGE based techniques to quantify expression changes in synaptosomes prepared from the SFC of controls and alcoholics.

Characterising the TP53-deleted subgroup of chronic lymphocytic leukemia: an analysis of additional cytogenetic abnormalities detected by interphase fluorescence in situ hybridisation and array-based comparative genomic hybridisation.

Relevance:

30.00%

Publisher:

Abstract:

Deletion of the TP53 gene on chromosome 17p13.1 is the prognostic factor associated with the shortest survival in CLL. We used array-based comparative genomic hybridisation (arrayCGH) to identify additional DNA copy number changes in peripheral blood samples from 74 LRF CLL4 trial patients, 37 with ≥5% and 37 without TP53-deleted cells. ArrayCGH reliably detected deletions on 17p, including the TP53 locus, in cases with ≥50% TP53-deleted cells detected by fluorescence in situ hybridisation, plus seven additional cases with deleted regions on 17p excluding TP53. Losses on chromosomal regions 18p and/or 20p were found exclusively in cases with ≥5% TP53-deleted cells (p [...] ≥5% TP53-deleted cases (p=0.02). In particular, amplification of 2p and deletion of 6q were both more frequent. Cases with >20% TP53-deleted cells had the worst prognosis in the LRF CLL4 trial.

Integration of global SNP-based mapping and expression arrays reveals key regions, mechanisms, and genes important in the pathogenesis of multiple myeloma.

Relevance:

30.00%

Publisher:

Abstract:

Multiple myeloma is characterized by genomic alterations frequently involving gains and losses of chromosomes. Single nucleotide polymorphism (SNP)-based mapping arrays allow the identification of copy number changes at the sub-megabase level and the identification of loss of heterozygosity (LOH) due to monosomy and uniparental disomy (UPD). We have found that SNP-based mapping array data and fluorescence in situ hybridization (FISH) copy number data correlated well, making the technique robust as a tool to investigate myeloma genomics. The most frequently identified alterations are located at 1p, 1q, 6q, 8p, 13, and 16q. LOH is found in these large regions and also in smaller regions throughout the genome with a median size of 1 Mb. We have identified that UPD is prevalent in myeloma and occurs through a number of mechanisms including mitotic nondisjunction and mitotic recombination. For the first time in myeloma, integration of mapping and expression data has allowed us to reduce the complexity of standard gene expression data and identify candidate genes important in both the transition from normal to monoclonal gammopathy of unknown significance (MGUS) to myeloma and in different subgroups within myeloma. We have documented these genes, providing a focus for further studies to identify and characterize those that are key in the pathogenesis of myeloma.

Abnormalities on 1q and 7q are associated with poor outcome in sporadic Burkitt's lymphoma. A cytogenetic and comparative genomic hybridization study.

Relevance:

30.00%

Publisher:

Abstract:

Comparative genomic hybridization (CGH) studies have demonstrated a high incidence of chromosomal imbalances in non-Hodgkin's lymphoma. However, the information on the genomic imbalances in Burkitt's Lymphoma (BL) is scanty. Conventional cytogenetics was performed in 34 cases, and long-distance PCR for t(8;14) was performed in 18 cases. A total of 170 changes were present with a median of four changes per case (range 1-22). Gains of chromosomal material (143) were more frequent than amplifications (5) or losses (22). The most frequent aberrations were gains on chromosomes 12q (26%), Xq (22%), 22q (20%), 20q (17%) and 9q (15%). Losses predominantly involved chromosomes 13q (17%) and 4q (9%). High-level amplifications were present in the regions 1q23-31 (three cases), 6p12-p25 and 8p22-p23. Upon comparing BL vs Burkitt's cell leukemia (BCL), the latter had more changes (mean 4.3 +/- 2.2) than BL (mean 2.7 +/- 3.2). In addition, BCL cases showed more frequently gains on 8q, 9q, 14q, 20q, and 20q, 9q, 8q and 14q, as well as losses on 13q and 4q. Concerning outcome, the presence of abnormalities on 1q (ascertained either by cytogenetics or by CGH), and imbalances on 7q (P=0.01) were associated with a short survival.

Development, characterization and use of genomic SSR markers for assessment of genetic diversity in some Saudi date palm (Phoenix dactylifera L.) cultivars

Relevance:

30.00%

Publisher:

Abstract:

Background: The present study was undertaken to develop SSR markers and assess genetic relationships among 32 date palm (Phoenix dactylifera L.) cultivars representing common cultivars grown in different geographical regions in Saudi Arabia. Results: Ninety-three novel simple sequence repeat markers were developed and screened for their ability to detect polymorphism in date palm. Around 71% of genomic SSRs were dinucleotide, 25% tri-, 3% tetra- and 1% pentanucleotide motifs. Twenty-two primers generated a total of 91 alleles with a mean of 4.14 alleles per locus and 100% polymorphism. An average polymorphic information content of 0.595 and a primer discrimination power of 0.662 were recorded. The expected and observed heterozygosities were 0.676 and 0.763 respectively. Pair-wise similarity values ranged from 0.06 to 0.89, and the average over all cultivars was 0.41. The UPGMA cluster analysis recovered by principal coordinate analysis illustrated that cultivars tend to group according to their class of maturity, region of cultivation, and fruit color. Analysis of molecular variance (AMOVA) revealed that genetic variation among and within cultivars was 27% and 73%, respectively, according to geographical distribution of cultivars. Conclusions: The developed microsatellite markers add to the date palm characterization tools that can be used by researchers in population genetics, cultivar identification as well as genetic resource exploration and management. The tested cultivars exhibited a significant amount of genetic diversity and could be suitable for a successful breeding program. Genomic sequences generated from this study are available at the National Center for Biotechnology Information (NCBI), Sequence Read Archive (Accession numbers. LIBGSS_039019).
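The abstract reports expected heterozygosity and polymorphic information content (PIC) without stating how they were computed. As a hedged illustration, the sketch below uses the standard textbook definitions: He = 1 - Σ p_i² and the Botstein et al. form PIC = 1 - Σ p_i² - Σ_{i<j} 2 p_i² p_j². The allele frequencies are invented for the example and are not data from the study.

```python
from itertools import combinations

def expected_heterozygosity(freqs):
    """He = 1 - sum(p_i^2) for the allele frequencies at one locus."""
    return 1.0 - sum(p * p for p in freqs)

def polymorphic_information_content(freqs):
    """PIC = 1 - sum(p_i^2) - sum over i<j of 2 * p_i^2 * p_j^2."""
    cross_term = sum(2.0 * (p * p) * (q * q) for p, q in combinations(freqs, 2))
    return expected_heterozygosity(freqs) - cross_term

# Hypothetical locus with four equally frequent alleles.
allele_freqs = [0.25, 0.25, 0.25, 0.25]
he = expected_heterozygosity(allele_freqs)
pic = polymorphic_information_content(allele_freqs)

# Mean alleles per locus as reported in the abstract: 91 alleles, 22 primers.
mean_alleles = round(91 / 22, 2)
```

For any locus PIC is at most He, so the reported averages (PIC 0.595, expected heterozygosity 0.676) are in the order these definitions predict.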

RestrictionDigest: A powerful Perl module for simulating genomic restriction digests

Relevance:

30.00%

Publisher:

Abstract:

Background: Reduced-representation sequencing technology is widely used in genotyping for its economical and efficient features. A popular way to construct the reduced-representation sequencing libraries is to digest the genomic DNA with restriction enzymes. A key factor of this method is to determine the restriction enzyme(s). But there are few computer programs which can evaluate the usability of restriction enzymes in reduced-representation sequencing. SimRAD is an R package which can simulate the digestion of DNA sequence by restriction enzymes and return enzyme loci number as well as fragment number. But for linkage mapping analysis, enzyme loci distribution is also an important factor to evaluate the enzyme. For phylogenetic studies, comparison of the enzyme performance across multiple genomes is important. It is strongly needed to develop a simulation tool to implement these functions. Results: Here, we introduce a Perl module named RestrictionDigest with more functions and improved performance. It can analyze multiple genomes at one run and generate concise comparison of enzyme performance across the genomes. It can simulate single-enzyme digestion, double-enzyme digestion and size selection process and generate comprehensive information of the simulation including enzyme loci number, fragment number, sequences of the fragments, positions of restriction sites on the genome, the coverage of digested fragments on different genome regions and detailed fragment length distribution. Conclusions: RestrictionDigest is an easy-to-use Perl module with flexible parameter settings. With the help of the information produced by the module, researchers can easily determine the most appropriate enzymes to construct the reduced-representation libraries to meet their experimental requirements.
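To make the idea of an in silico digest concrete, here is a minimal sketch of single-enzyme digestion followed by size selection. It does not use or reproduce the RestrictionDigest (Perl) or SimRAD (R) interfaces. The function names and the toy sequence are hypothetical, and only the forward strand of a linear sequence is handled. The enzyme shown is EcoRI, which recognizes GAATTC and cuts after the first G.

```python
import re

def digest(sequence, site, cut_offset):
    """Cut `sequence` at every occurrence of the recognition `site`.

    The cut falls `cut_offset` bases after the start of each site.
    A lookahead pattern is used so that overlapping sites are all found.
    """
    seq = sequence.upper()
    pattern = f"(?={re.escape(site)})"
    cut_positions = [m.start() + cut_offset for m in re.finditer(pattern, seq)]
    bounds = [0] + cut_positions + [len(seq)]
    return [seq[a:b] for a, b in zip(bounds, bounds[1:])]

def size_select(fragments, min_len, max_len):
    """Keep fragments whose length lies in [min_len, max_len]."""
    return [f for f in fragments if min_len <= len(f) <= max_len]

# Toy sequence with two EcoRI sites (hypothetical data).
toy_genome = "TTGAATTCAAAAGGGGGAATTCCC"
fragments = digest(toy_genome, "GAATTC", 1)
fragment_lengths = [len(f) for f in fragments]
selected = size_select(fragments, 5, 10)
```

A double-enzyme digest could be sketched along the same lines by merging and sorting the cut positions of two enzymes before slicing; the abstract does not describe how the module itself implements this.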

Integrative genomic, epigenetic and metabolomic characterization of beef from grass-fed Angus steers

Relevance:

30.00%

Publisher:

Abstract:

Beef constitutes a main component of the American diet and still represents the principal source of protein in many parts of the world. Currently, the meat market is experiencing an important transformation; consumers are increasingly switching from consuming traditional beef to grass-fed beef. People regard products obtained from grass-fed animals as more natural and healthy. However, the true differences between these two production systems remain unclear in several respects. This dissertation provides information from closely genetically related animals, in order to decrease confounding factors, to explain several poorly understood differences between grain-fed and grass-fed beef. First, we examined the growth curve, important economic traits and quality carcass characteristics over four consecutive years in grain-fed and grass-fed animals, generating valuable information for management decisions and economic evaluation for grass-fed cattle operations. Second, we performed the first integrated transcriptomic and metabolomic analysis in grass-fed beef, detecting alterations in glucose metabolism, divergences in free fatty acids and carnitine conjugated lipid levels, and altered β-oxidation. Results suggest that grass-finished beef could possibly benefit consumer health by having a lower total fat content and a better lipid profile than grain-fed beef. Regarding animal welfare, grass-fed animals may experience less stress than grain-fed individuals as well. Finally, we contrasted the genome-wide DNA methylation of grass-fed beef against grain-fed beef using the methyl-CpG binding domain sequencing (MBD-Seq) method, identifying 60 differentially methylated regions (DMRs). Most of the DMRs were located inside or upstream of genes and displayed increased levels of methylation in grass-fed individuals, implying a global DNA methylation increment in this group. Interestingly, chromosome 14, which has been associated with large effects on ADG, marbling, back fat, ribeye area and hot carcass weight in beef cattle, contained the largest number of DMRs (12/60). The pathway analysis identified skeletal and muscular system as the preeminent physiological system and function, and recognized carbohydrate metabolism, lipid metabolism and tissue morphology among the highest ranked networks. Therefore, although we recognize some limitations and assume that additional examination is still required, this project provides the first integrative genomic, epigenetic and metabolomic characterization of beef produced under a grass-fed regimen.

Genome-wide copy number variation (CNV) detection in Nelore cattle reveals highly frequent variants in genome regions harboring QTLs affecting production traits.

Relevance:

30.00%

Publisher:

Abstract:

Background: Copy number variations (CNVs) have been shown to account for substantial portions of observed genomic variation and have been associated with qualitative and quantitative traits and the onset of disease in a number of species. Information from high-resolution studies to detect, characterize and estimate population-specific variant frequencies will facilitate the incorporation of CNVs in genomic studies to identify genes affecting traits of importance. Results: Genome-wide CNVs were detected in high-density single nucleotide polymorphism (SNP) genotyping data from 1,717 Nelore (Bos indicus) cattle, and in NGS data from eight key ancestral bulls. A total of 68,007 and 12,786 distinct CNVs were observed, respectively. Cross-comparisons of results obtained for the eight resequenced animals revealed that 92 % of the CNVs were observed in both datasets, while 62 % of all detected CNVs were observed to overlap with previously validated cattle copy number variant regions (CNVRs). Observed CNVs were used for obtaining breed-specific CNV frequencies and identification of CNVRs, which were subsequently used for gene annotation. A total of 688 of the detected CNVRs were observed to overlap with 286 non-redundant QTLs associated with important production traits in cattle. All of 34 CNVs previously reported to be associated with milk production traits in Holsteins were also observed in Nelore cattle. Comparisons of estimated frequencies of these CNVs in the two breeds revealed 14, 13, 6 and 14 regions in high (>20 %), low (<20 %) and divergent (NEL > HOL, NEL < HOL) frequencies, respectively. Conclusions: Obtained results significantly enriched the bovine CNV map and enabled the identification of variants that are potentially associated with traits under selection in Nelore cattle, particularly in genome regions harboring QTLs affecting production traits.

Circulation of canine viruses in free-ranging Italian wolves (Canis lupus italicus) from three Italian regions

Relevance:

30.00%

Publisher:

Abstract:

In this study, the duodenum, spleen, tongue, and lungs were sampled from 56 Italian wolves that died between 2017 and 2020. The aim of the study was to evaluate the presence and spread of DNA and RNA viruses in the wolf population examined, relating the virological results to: year of sampling, region of origin, sex, age, season, genetic determination of the species, nutritional conditions, causes of death, matrices examined. In addition, the presence or absence of co-infections was evaluated. Through molecular methods, the presence of genomic DNA of three important DNA viruses was investigated, i.e.: Canine Parvovirus type 2 (CPV-2), Canine Adenovirus type 1 (CAdV-1), Canine Adenovirus type 2 (CAdV-2). Furthermore, the presence of genomic RNA of the important RNA viruses, Canine Enteric Coronavirus (CCoV) and Canine Distemper Virus (CDV), was also investigated. The results showed that the virus with the highest prevalence in the wolf population studied was CPV-2, found in 78.6% of subjects (44/56). The prevalence of CAdV was 17.9% (10/56), in particular CAdV-1 (12.5%, 7/56) and CAdV-2 (5.4%, 3/56). The RT-PCR investigations for the two RNA viruses (CCoV and CDV) gave no positive results in the study population. In this study it was observed that the majority of wolves that tested positive were in good nutritional condition, thus excluding CPV-2, CAdV-1, and CAdV-2 infections as a direct cause of death. Moreover, the prevalence obtained in this study suggests that, during the years studied here, the circulation of CAdV-1 and CAdV-2 in Italian wolves of the three sampled regions was sporadic, consistent with sporadic and short-lived introductions of the virus in these populations. However, the situation for CPV-2 is different, as its circulation suggests a pattern of continuous and lasting endemic exposure over time.

A machine learning based method to detect genomic imbalances exploiting X chromosome exome reads

Relevance:

30.00%

Publisher:

Abstract:

Whole Exome Sequencing (WES) is rapidly becoming the first-tier test in clinics, both thanks to its declining costs and the development of new platforms that help clinicians in the analysis and interpretation of SNV and InDels. However, we still know very little on how CNV detection could increase WES diagnostic yield. A plethora of exome CNV callers have been published over the years, all showing good performances towards specific CNV classes and sizes, suggesting that the combination of multiple tools is needed to obtain an overall good detection performance. Here we present TrainX, a ML-based method for calling heterozygous CNVs in WES data using EXCAVATOR2 Normalized Read Counts. We select males and females’ non pseudo-autosomal chromosome X alignments to construct our dataset and train our model, make predictions on autosomes target regions and use HMM to call CNVs. We compared TrainX against a set of CNV tools differing for the detection method (GATK4 gCNV, ExomeDepth, DECoN, CNVkit and EXCAVATOR2) and found that our algorithm outperformed them in terms of stability, as we identified both deletions and duplications with good scores (0.87 and 0.82 F1-scores respectively) and for sizes reaching the minimum resolution of 2 target regions. We also evaluated the method robustness using a set of WES and SNP array data (n=251), part of the Italian cohort of Epi25 collaborative, and were able to retrieve all clinical CNVs previously identified by the SNP array. TrainX showed good accuracy in detecting heterozygous CNVs of different sizes, making it a promising tool to use in a diagnostic setting.
