990 results for 270202 Genome Structure
Abstract:
A 6008 base pair fragment of the vaccinia virus DNA containing the gene for the precursor of the major core protein 4a, which has been designated P4a, was sequenced. A long open reading frame (ORF) encoding a protein of molecular weight 102,157 started close to the position where the P4a mRNA had been mapped. Analysis of the mRNA by S1 nuclease mapping and primer extension indicated that the 5' end defined by the former method is not the true 5' end. This suggests that the P4a coding region is preceded by leader sequences that are not derived from the immediate vicinity of the gene, similar to what has been reported for another late vaccinia virus mRNA. The sequenced DNA contained several further ORFs on the same or the opposite DNA strand, providing further evidence for the close spacing of protein-coding sequences in the viral genome.
Abstract:
Background: Non-long terminal repeat (non-LTR) retrotransposons have contributed to shaping the structure and function of genomes. In silico and experimental approaches have been used to identify the non-LTR elements of the urochordate Ciona intestinalis. Knowledge of the types and abundance of non-LTR elements in urochordates is a key step in understanding their contribution to the structure and function of vertebrate genomes. Results: Consensus elements phylogenetically related to the I, LINE1, LINE2, LOA and R2 elements of the 14 eukaryotic non-LTR clades are described from C. intestinalis. The ascidian elements showed conservation of both the reverse transcriptase coding sequence and the overall structural organization seen in each clade. The apurinic/apyrimidinic endonuclease and nucleic-acid-binding domains encoded upstream of the reverse transcriptase, and the RNase H and the restriction enzyme-like endonuclease motifs encoded downstream of the reverse transcriptase were identified in the corresponding Ciona families. Conclusions: The genome of C. intestinalis harbors representatives of at least five clades of non-LTR retrotransposons. The copy number per haploid genome of each element is low, less than 100, far below the values reported for vertebrate counterparts but within the range for protostomes. Genomic and sequence analysis shows that the ascidian non-LTR elements are unmethylated and flanked by genomic segments with a gene density lower than average for the genome. The analysis provides valuable data for understanding the evolution of early chordate genomes and enlarges the view on the distribution of the non-LTR retrotransposons in eukaryotes.
Abstract:
HIV-1 sequence diversity is affected by selection pressures arising from host genomic factors. Using paired human and viral data from 1071 individuals, we ran >3000 genome-wide scans, testing for associations between host DNA polymorphisms, HIV-1 sequence variation and plasma viral load (VL), while considering human and viral population structure. We observed significant human SNP associations to a total of 48 HIV-1 amino acid variants (p < 2.4 × 10⁻¹²). All associated SNPs mapped to the HLA class I region. Clinical relevance of host and pathogen variation was assessed using VL results. We identified two critical advantages to the use of viral variation for identifying host factors: (1) association signals are much stronger for HIV-1 sequence variants than VL, reflecting the 'intermediate phenotype' nature of viral variation; (2) association testing can be run without any clinical data. The proposed genome-to-genome approach highlights sites of genomic conflict and is a strategy generally applicable to studies of host-pathogen interaction. DOI: http://dx.doi.org/10.7554/eLife.01123.001.
Abstract:
The species and races of the shrews of the Sorex araneus group exhibit a broad range of chromosomal polymorphisms. European taxa of this group are parapatric and form contact or hybrid zones that span an extraordinary variety of situations, ranging from absolute genetic isolation to almost free gene flow. This variety seems to depend for a large part on the chromosome composition of populations, which are primarily differentiated by various Robertsonian fusions of a subset of acrocentric chromosomes. Previous studies suggested that chromosomal rearrangements play a causative role in the speciation process. In such models, gene flow should be more restricted for markers on chromosomes involved in rearrangements than on chromosomes common in both parent species. In the present study, we address the possibility of such differential gene flow in the context of two genetically very similar but karyotypically different hybrid zones between species of the S. araneus group using microsatellite loci mapped to the chromosome arm level. Interspecific genetic structure across rearranged chromosomes was in general larger than across common chromosomes. However, the difference between the two classes of chromosomes was only significant in the hybrid zone where the complexity of hybrids is expected to be larger. These differences did not distinguish populations within species. Therefore, the rearranged chromosomes appear to affect the reproductive barrier between karyotypic species, although the strength of this effect depends on the complexity of the hybrids produced.
Abstract:
The human genome encodes the blueprint of life, but the function of the vast majority of its nearly three billion bases is unknown. The Encyclopedia of DNA Elements (ENCODE) project has systematically mapped regions of transcription, transcription factor association, chromatin structure and histone modification. These data enabled us to assign biochemical functions for 80% of the genome, in particular outside of the well-studied protein-coding regions. Many discovered candidate regulatory elements are physically associated with one another and with expressed genes, providing new insights into the mechanisms of gene regulation. The newly identified elements also show a statistical correspondence to sequence variants linked to human disease, and can thereby guide interpretation of this variation. Overall, the project provides new insights into the organization and regulation of our genes and genome, and is an expansive resource of functional annotations for biomedical research.
Abstract:
Studies of the structural basis of protein thermostability have produced a confusing picture. Small sets of proteins have been analyzed from a variety of thermophilic species, suggesting different structural features as responsible for protein thermostability. Taking advantage of the recent advances in structural genomics, we have compiled a relatively large protein structure dataset, which was constructed very carefully and selectively; that is, the dataset contains only experimentally determined structures of proteins from one specific organism, the hyperthermophilic bacterium Thermotoga maritima, and those of close homologs from mesophilic bacteria. In contrast to the conclusions of previous studies, our analyses show that oligomerization order, hydrogen bonds, and secondary structure play minor roles in adaptation to hyperthermophily in bacteria. On the other hand, the data exhibit very significant increases in the density of salt bridges and in compactness for proteins from T. maritima. The latter effect can be measured by contact order or solvent accessibility, and network analysis shows a specific increase in highly connected residues in this thermophile. These features account for changes in 96% of the protein pairs studied. Our results provide a clear picture of protein thermostability in one species, and a framework for future studies of thermal adaptation.
Abstract:
Approaches exploiting trait distribution extremes may be used to identify loci associated with common traits, but it is unknown whether these loci are generalizable to the broader population. In a genome-wide search for loci associated with the upper versus the lower 5th percentiles of body mass index, height and waist-to-hip ratio, as well as clinical classes of obesity, including up to 263,407 individuals of European ancestry, we identified 4 new loci (IGFBP4, H6PD, RSRC1 and PPP2R2A) influencing height detected in the distribution tails and 7 new loci (HNF4G, RPTOR, GNAT2, MRPS33P4, ADCY9, HS6ST3 and ZZZ3) for clinical classes of obesity. Further, we find a large overlap in genetic structure and the distribution of variants between traits based on extremes and the general population and little etiological heterogeneity between obesity subgroups.
Abstract:
Gene transfer in eukaryotic cells and organisms suffers from epigenetic effects that result in low or unstable transgene expression and high clonal variability. Use of epigenetic regulators such as matrix attachment regions (MARs) is a promising approach to alleviate such unwanted effects. Dissection of a known MAR allowed the identification of sequence motifs that mediate elevated transgene expression. Bioinformatics analysis implied that these motifs adopt a curved DNA structure that positions nucleosomes and binds specific transcription factors. From these observations, we computed putative MARs from the human genome. Cloning of several predicted MARs indicated that they are much more potent than the previously known element, boosting the expression of recombinant proteins from cultured cells as well as mediating high and sustained expression in mice. Thus we computationally identified potent epigenetic regulators, opening new strategies toward high and stable transgene expression for research, therapeutic production or gene-based therapies.
Abstract:
The major antigen on the envelope of extracellular vaccinia virus particles is a polypeptide with an apparent molecular weight of 37,000 (p37K; G. Hiller and K. Weber, J. Virol. 55:651-659, 1985). The gene encoding p37K was mapped in the vaccinia virus genome by hybrid selection of RNA followed by in vitro translation. p37K was then identified among the in vitro translation products by immunoprecipitation with a monoclonal antibody. The gene is located close to the right-hand end of the HindIII F fragment. The corresponding region of the DNA was sequenced, and an open reading frame encoding a polypeptide of 41,748 daltons was observed. The 5' end of the mRNA, as defined by nuclease S1 analysis, maps within only a few nucleotides of the translation initiation codon. Examination of the DNA sequence around the putative initiation site of transcription revealed a characteristic sequence, TAAATG, which includes the ATG translation initiation codon and which is conserved in all but one late gene so far analyzed. It is therefore likely that this sequence is an important regulatory signal for late gene expression in vaccinia virus.
Abstract:
Helicobacter pylori is an important human pathogen associated with serious gastric diseases. Owing to its medical importance and close relationship with its human host, understanding genomic patterns of global and local adaptation in H. pylori may be of particular significance for both clinical and evolutionary studies. Here we present the first such whole genome analysis of 60 globally distributed strains, from which we inferred worldwide population structure and demographic history and shed light on interesting global and local events of positive selection, with particular emphasis on the evolution of San-associated lineages. Our results indicate a more ancient origin for the association of humans and H. pylori than previously thought. We identify several important perspectives for future clinical research on candidate selected regions that include both previously characterized genes (e.g., transcription elongation factor NusA and tumor necrosis factor alpha-inducing protein Tipα) and hitherto unknown functional genes.
Abstract:
Determining the relative roles of vicariance and selection in restricting gene flow between populations is of central importance to the evolutionary process of population divergence and speciation. Here we use molecular and morphological data to contrast the effect of isolation (by mountains and geographical distance) with that of ecological factors (altitudinal gradients) in promoting differentiation in the wedge-billed woodcreeper, Glyphorynchus spirurus, a tropical forest bird, in Ecuador. Tarsus length and beak size increased relative to body size with altitude on both sides of the Andes, and were correlated with the amount of moss on tree trunks, suggesting the role of selection in driving adaptive divergence. In contrast, molecular data revealed a considerable degree of admixture along these altitudinal gradients, suggesting that adaptive divergence in morphological traits has occurred in the presence of gene flow. As suggested by mitochondrial DNA sequence data, the Andes act as a barrier to gene flow between ancient subspecific lineages. Genome-wide amplified fragment length polymorphism markers reflected more recent patterns of gene flow and revealed fine-scale patterns of population differentiation that were not detectable with mitochondrial DNA, including the differentiation of isolated coastal populations west of the Andes. Our results support the predominant role of geographical isolation in driving genetic differentiation in G. spirurus, yet suggest the role of selection in driving parallel morphological divergence along ecological gradients.
Abstract:
Background: Bipolar disorder is a highly heritable polygenic disorder. Recent enrichment analyses suggest that there may be true risk variants for bipolar disorder in the expression quantitative trait loci (eQTL) in the brain. Aims: We sought to assess the impact of eQTL variants on bipolar disorder risk by combining data from both bipolar disorder genome-wide association studies (GWAS) and brain eQTL. Method: To detect single nucleotide polymorphisms (SNPs) that influence expression levels of genes associated with bipolar disorder, we jointly analysed data from a bipolar disorder GWAS (7481 cases and 9250 controls) and a genome-wide brain (cortical) eQTL (193 healthy controls) using a Bayesian statistical method, with independent follow-up replications. The identified risk SNP was then further tested for association with hippocampal volume (n = 5775) and cognitive performance (n = 342) among healthy individuals. Results: Integrative analysis revealed a significant association between a brain eQTL rs6088662 on chromosome 20q11.22 and bipolar disorder (log Bayes factor = 5.48; bipolar disorder P = 5.85 × 10⁻⁵). Follow-up studies across multiple independent samples confirmed the association of the risk SNP (rs6088662) with gene expression and bipolar disorder susceptibility (P = 3.54 × 10⁻⁸). Further exploratory analysis revealed that rs6088662 is also associated with hippocampal volume and cognitive performance in healthy individuals. Conclusions: Our findings suggest that 20q11.22 is likely a risk region for bipolar disorder; they also highlight the informative value of integrating functional annotation of genetic variants for gene expression in advancing our understanding of the biological basis underlying complex disorders, such as bipolar disorder.
Abstract:
The glycosylation of glycoconjugates and the biosynthesis of polysaccharides depend on nucleotide-sugars, which are the substrates for glycosyltransferases. A large proportion of these enzymes are located within the lumen of the Golgi apparatus as well as the endoplasmic reticulum, while many of the nucleotide-sugars are synthesized in the cytosol. Thus, nucleotide-sugars are translocated from the cytosol to the lumen of the Golgi apparatus and endoplasmic reticulum by multiple spanning domain proteins known as nucleotide-sugar transporters (NSTs). These proteins were first identified biochemically and some of them were cloned by complementation of mutants. Genome and expressed sequence tag sequencing allowed the identification of a number of sequences that may encode NSTs in different organisms. Functional characterization of some of these genes has shown that some are highly substrate-specific, while others can utilize up to three different nucleotide-sugars containing the same nucleotide. Mutations in genes encoding NSTs can lead to changes in development in Drosophila melanogaster or Caenorhabditis elegans, as well as alterations in the infectivity of Leishmania donovani. In humans, the mutation of a GDP-fucose transporter is responsible for an impaired immune response as well as retarded growth. These results suggest that, even though there appear to be a fair number of genes encoding NSTs, they are not functionally redundant and seem to play specific roles in glycosylation.
Abstract:
Affiliation: Henner Brinkmann, Département de biochimie, Faculté de médecine, Université de Montreal
Abstract:
The centromere is the chromosomal region where the kinetochore assembles during mitosis. Unlike certain gene features, the centromeric sequence is neither conserved between species nor sufficient for centromere function. It is therefore well accepted in the literature that the centromere is regulated epigenetically by a histone H3 variant, CENP-A. KNL-2, also known as M18BP1, and its partners Mis18α and Mis18β are proteins essential for the incorporation of newly synthesized CENP-A at centromeres. Experimental evidence shows that KNL-2, which has a DNA-binding domain named Myb, is the most upstream protein for the incorporation of CENP-A at centromeres in G1 phase. However, its function in the process of CENP-A incorporation at centromeres is not well understood, and not all of its binding partners are known. New binding partners of KNL-2 were identified by immunoprecipitation experiments followed by mass spectrometry analysis. A role in the incorporation of newly synthesized CENP-A at centromeres was attributed to MgcRacGAP, one of the 60 proteins identified by the assay. MgcRacGAP, together with the protein ECT-2 (a GEF) and the small GTPase Cdc42, was shown to be required for the stability of CENP-A incorporated at centromeres. These observations led to the identification of a third step at the molecular level in the incorporation of newly synthesized CENP-A in G1 phase: the stabilization of newly incorporated CENP-A at centromeres. This step is important for maintaining centromere identity at each cell division. To characterize the function of KNL-2 during the incorporation of newly synthesized CENP-A at centromeres, a high-resolution microscopy technique coupled with image quantification was used.
The results show that KNL-2 recruitment to the centromere is rapid, occurring about 5 minutes after mitotic exit. In addition, the structure of the Myb domain of KNL-2 from the nematode C. elegans was solved by NMR and shows a helix-turn-helix motif, a structure known for DNA-binding domains of the Myb family. Furthermore, the human (HsMyb) and C. elegans (CeMyb) Myb domains bind DNA in vitro, but no sequence is specifically recognized by these domains. However, it was possible to show that both domains preferentially bind CENP-A-YFP chromatin compared with H2B-GFP chromatin, using a modified SIMPull assay under the TIRF microscope. Thus, the Myb domain of KNL-2 is sufficient to specifically recognize centromeric chromatin. Finally, the element recognized by the Myb domains in vitro has potentially been identified. Indeed, the HsMyb and CeMyb domains were shown to bind single-stranded DNA in vitro. Moreover, the HsMyb and CeMyb domains do not colocalize with CENP-A when expressed in HeLa cells, but rather with PML nuclear bodies, nuclear structures composed of RNA. Therefore, by potentially binding centromeric transcripts, the Myb domains of KNL-2 could specify the incorporation of newly synthesized CENP-A exclusively at centromeric regions.