945 resultados para Genome Sequence
Resumo:
During the first steps of reverse transcription of the retroviral genome, sequences present at the extremities of the RNA are used to reconstitute a host cell PolII promoter. The assembly of the promoter occurs by template switching, which takes advantage of a direct repeat at the ends of the RNA molecule. These steps are catalysed by the viral reverse transcriptase, which carries an intrinsic RNaseH activity that is probably also involved therein. To study the role of the RNaseH activity in this first template-switching event, an in vitro system has been developed based on primer extensions of synthetic RNAs. When an RNA was reverse transcribed with wild-type reverse transcriptase in the presence of a second RNA the 3' part of which was repeated at the 5' end of the first one, extension products could be observed corresponding to a chimeric cDNA comprising both RNA species. This template switching could not be detected when a mutant reverse transcriptase lacking the RNaseH activity was used. The results show that the RNaseH activity is needed to remove the 5' RNA sequences from the cDNA:RNA hybrid thereby enabling its translocation to another RNA containing an appropriate complementary target sequence.
Resumo:
To identify loci for age at menarche, we performed a meta-analysis of 32 genome-wide association studies in 87,802 women of European descent, with replication in up to 14,731 women. In addition to the known loci at LIN28B (P = 5.4 × 10⁻⁶⁰) and 9q31.2 (P = 2.2 × 10⁻³³), we identified 30 new menarche loci (all P < 5 × 10⁻⁸) and found suggestive evidence for a further 10 loci (P < 1.9 × 10⁻⁶). The new loci included four previously associated with body mass index (in or near FTO, SEC16B, TRA2B and TMEM18), three in or near other genes implicated in energy homeostasis (BSX, CRTC1 and MCHR2) and three in or near genes implicated in hormonal regulation (INHBA, PCSK2 and RXRG). Ingenuity and gene-set enrichment pathway analyses identified coenzyme A and fatty acid biosynthesis as biological processes related to menarche timing.
Resumo:
Human immunodeficiency virus type 1 (HIV-1) variants resistant to protease (PR) and reverse transcriptase (RT) inhibitors may display impaired infectivity and replication capacity. The individual contributions of mutated HIV-1 PR and RT to infectivity, replication, RT activity, and protein maturation (herein referred to as "fitness") in recombinant viruses were investigated by separately cloning PR, RT, and PR-RT cassettes from drug-resistant mutant viral isolates into the wild-type NL4-3 background. Both mutant PR and RT contributed to measurable deficits in fitness of viral constructs. In peripheral blood mononuclear cells, replication rates (means +/- standard deviations) of RT recombinants were 72.5% +/- 27.3% and replication rates of PR recombinants were 60.5% +/- 33.6% of the rates of NL4-3. PR mutant deficits were enhanced in CEM T cells, with relative replication rates of PR recombinants decreasing to 15.8% +/- 23.5% of NL4-3 replication rates. Cloning of the cognate RT improved fitness of some PR mutant clones. For a multidrug-resistant virus transmitted through sexual contact, RT constructs displayed a marked infectivity and replication deficit and diminished packaging of Pol proteins (RT content in virions diminished by 56.3% +/- 10.7%, and integrase content diminished by 23.3% +/- 18.4%), a novel mechanism for a decreased-fitness phenotype. Despite the identified impairment of recombinant clones, fitness of two of the three drug-resistant isolates was comparable to that of wild-type, susceptible viruses, suggestive of extensive compensation by genomic regions away from PR and RT. Only limited reversion of mutated positions to wild-type amino acids was observed for the native isolates over 100 viral replication cycles in the absence of drug selective pressure. These data underscore the complex relationship between PR and RT adaptive changes and viral evolution in antiretroviral drug-resistant HIV-1.
Resumo:
Human RNA polymerase (Pol) III-transcribed genes are thought to share a simple termination signal constituted by four or more consecutive thymidine residues in the coding DNA strand, just downstream of the RNA 3'-end sequence. We found that a large set of human tRNA genes (tDNAs) do not display any T(≥4) stretch within 50 bp of 3'-flanking region. In vitro analysis of tDNAs with a distanced T(≥4) revealed the existence of non-canonical terminators resembling degenerate T(≥5) elements, which ensure significant termination but at the same time allow for the production of Pol III read-through pre-tRNAs with unusually long 3' trailers. A panel of such non-canonical signals was found to direct transcription termination of unusual Pol III-synthesized viral pre-miRNA transcripts in gammaherpesvirus 68-infected cells. Genome-wide location analysis revealed that human Pol III tends to trespass into the 3'-flanking regions of tDNAs, as expected from extensive terminator read-through. The widespread occurrence of partial termination suggests that the Pol III primary transcriptome in mammals is unexpectedly enriched in 3'-trailer sequences with the potential to contribute novel functional ncRNAs.
Resumo:
High throughput genome (HTG) and expressed sequence tag (EST) sequences are currently the most abundant nucleotide sequence classes in the public database. The large volume, high degree of fragmentation and lack of gene structure annotations prevent efficient and effective searches of HTG and EST data for protein sequence homologies by standard search methods. Here, we briefly describe three newly developed resources that should make discovery of interesting genes in these sequence classes easier in the future, especially to biologists not having access to a powerful local bioinformatics environment. trEST and trGEN are regularly regenerated databases of hypothetical protein sequences predicted from EST and HTG sequences, respectively. Hits is a web-based data retrieval and analysis system providing access to precomputed matches between protein sequences (including sequences from trEST and trGEN) and patterns and profiles from Prosite and Pfam. The three resources can be accessed via the Hits home page (http://hits. isb-sib.ch).
Resumo:
L’èxit del Projecte Genoma Humà (PGH) l’any 2000 va fer de la “medicina personalitzada” una realitat més propera. Els descobriments del PGH han simplificat les tècniques de seqüenciació de tal manera que actualment qualsevol persona pot aconseguir la seva seqüència d’ADN complerta. La tecnologia de Read Mapping destaca en aquest tipus de tècniques i es caracteritza per manegar una gran quantitat de dades. Hadoop, el framework d’Apache per aplicacions intensives de dades sota el paradigma Map Reduce, resulta un aliat perfecte per aquest tipus de tecnologia i ha sigut l’opció escollida per a realitzar aquest projecte. Durant tot el treball es realitza l’estudi, l’anàlisi i les experimentacions necessàries per aconseguir un Algorisme Genètic innovador que utilitzi tot el potencial de Hadoop.
Resumo:
BACKGROUND: In mammals, ChIP-seq studies of RNA polymerase II (PolII) occupancy have been performed to reveal how recruitment, initiation and pausing of PolII may control transcription rates, but the focus is rarely on obtaining finely resolved profiles that can portray the progression of PolII through sequential promoter states. RESULTS: Here, we analyze PolII binding profiles from high-coverage ChIP-seq on promoters of actively transcribed genes in mouse and humans. We show that the enrichment of PolII near transcription start sites exhibits a stereotypical bimodal structure, with one peak near active transcription start sites and a second peak 110 base pairs downstream from the first. Using an empirical model that reliably quantifies the spatial PolII signal, gene by gene, we show that the first PolII peak allows for refined positioning of transcription start sites, which is corroborated by mRNA sequencing. This bimodal signature is found both in mouse and humans. Analysis of the pausing-related factors NELF and DSIF suggests that the downstream peak reflects widespread pausing at the +1 nucleosome barrier. Several features of the bimodal pattern are correlated with sequence features such as CpG content and TATA boxes, as well as the histone mark H3K4me3. CONCLUSIONS: We thus show how high coverage DNA sequencing experiments can reveal as-yet unnoticed bimodal spatial features of PolII accumulation that are frequent at individual mammalian genes and reminiscent of transcription initiation and pausing. The initiation-pausing hypothesis is corroborated by evidence from run-on sequencing and immunoprecipitation in other cell types and species.
Resumo:
Personalized medicine has a substantial potential to transform the way diseases will be predicted, prevented and treated. The field will greatly benefit from novel DNA sequencing technologies, in particular commoditization of individual whole genome sequencing. This evolution cannot be stopped, and the medical and scientific community, as well as the society at large, have the responsibility to anticipate the expected benefits from this revolution, but also the potential risks associated with it. Massive investments will be needed for the potential of personalized medicine to be realized, and for the field to come to maturity. In particular, a paradigm change in the way clinical research is done is needed. Switzerland and its Western part pro-actively anticipate these changes.
Resumo:
Alcohol consumption is a moderately heritable trait, but the genetic basis in humans is largely unknown, despite its clinical and societal importance. We report a genome-wide association study meta-analysis of ∼2.5 million directly genotyped or imputed SNPs with alcohol consumption (gram per day per kilogram body weight) among 12 population-based samples of European ancestry, comprising 26,316 individuals, with replication genotyping in an additional 21,185 individuals. SNP rs6943555 in autism susceptibility candidate 2 gene (AUTS2) was associated with alcohol consumption at genome-wide significance (P = 4 × 10(-8) to P = 4 × 10(-9)). We found a genotype-specific expression of AUTS2 in 96 human prefrontal cortex samples (P = 0.026) and significant (P < 0.017) differences in expression of AUTS2 in whole-brain extracts of mice selected for differences in voluntary alcohol consumption. Down-regulation of an AUTS2 homolog caused reduced alcohol sensitivity in Drosophila (P < 0.001). Our finding of a regulator of alcohol consumption adds knowledge to our understanding of genetic mechanisms influencing alcohol drinking behavior.
Resumo:
Little is known about the role of the transcription factor peroxisome proliferator-activated receptor (PPAR) beta/delta in liver. Here we set out to better elucidate the function of PPARbeta/delta in liver by comparing the effect of PPARalpha and PPARbeta/delta deletion using whole genome transcriptional profiling and analysis of plasma and liver metabolites. In fed state, the number of genes altered by PPARalpha and PPARbeta/delta deletion was similar, whereas in fasted state the effect of PPARalpha deletion was much more pronounced, consistent with the pattern of gene expression of PPARalpha and PPARbeta/delta. Minor overlap was found between PPARalpha- and PPARbeta/delta-dependent gene regulation in liver. Pathways upregulated by PPARbeta/delta deletion were connected to innate immunity and inflammation. Pathways downregulated by PPARbeta/delta deletion included lipoprotein metabolism and various pathways related to glucose utilization, which correlated with elevated plasma glucose and triglycerides and reduced plasma cholesterol in PPARbeta/delta-/- mice. Downregulated genes that may underlie these metabolic alterations included Pklr, Fbp1, Apoa4, Vldlr, Lipg, and Pcsk9, which may represent novel PPARbeta/delta target genes. In contrast to PPARalpha-/- mice, no changes in plasma free fatty acid, plasma beta-hydroxybutyrate, liver triglycerides, and liver glycogen were observed in PPARbeta/delta-/- mice. Our data indicate that PPARbeta/delta governs glucose utilization and lipoprotein metabolism and has an important anti-inflammatory role in liver. Overall, our analysis reveals divergent roles of PPARalpha and PPARbeta/delta in regulation of gene expression in mouse liver.
Resumo:
Data analysis, presentation and distribution is of utmost importance to a genome project. A public domain software, ACeDB, has been chosen as the common basis for parasite genome databases, and a first release of TcruziDB, the Trypanosoma cruzi genome database, is available by ftp from ftp://iris.dbbm.fiocruz.br/pub/genomedb/TcruziDB as well as versions of the software for different operating systems (ftp://iris.dbbm.fiocruz.br/pub/unixsoft/). Moreover, data originated from the project are available from the WWW server at http://www.dbbm.fiocruz.br. It contains biological and parasitological data on CL Brener, its karyotype, all available T. cruzi sequences from Genbank, data on the EST-sequencing project and on available libraries, a T. cruzi codon table and a listing of activities and participating groups in the genome project, as well as meeting reports. T. cruzi discussion lists (tcruzi-l@iris.dbbm.fiocruz.br and tcgenics@iris.dbbm.fiocruz.br) are being maintained for communication and to promote collaboration in the genome project
Resumo:
By using improved pulsed field gel electrophoresis conditions, the molecular karyotype of the reference clone CL Brener selected for Trypanosoma cruzi genome project was established. A total of 20 uniform chromosomal bands ranging in size from 0.45 to 3.5 Megabase pairs (Mbp) were resolved in a single run. The weighted sum of the chromosomal bands was approximately 87 Mbp. Chromoblots were hybridized with 39 different homologous probes, 13 of which identified single chromosomes. Several markers showed linkage and four different linkage groups were identified, each comprising two markers. Densitometric analysis suggests that most of the chromosomal bands contain two or more chromosomes representing either homologous chromosomes and/or heterologous chromosomes with similar sizes
Resumo:
We have analyzed the compositional properties of coding (protein encoding) and non-coding sequences of Plasmodium falciparum, a unicellular parasite characterized by an extremely AT-rich genome. GC% levels, base and dinucleotide frequencies were studied. We found that among the various factors that contribute to the properties of the sequences analyzed, the most relevant are the compositional constraints which operate on the whole genome
Resumo:
Since the start of the human genome project, a great number of genome projects on other "model" organism have been initiated, some of them already completed. Several initiatives have also been started on parasite genomes, mainly through support from WHO/TDR, involving North-South and South-South collaborations, and great hopes are vested in that these initiatives will lead to new tools for disease control and prevention, as well as to the establishment of genomic research technology in developing countries. The Trypanosoma cruzi genome project, using the clone CL-Brener as starting point, has made considerable progress through the concerted action of more than 20 laboratories, most of them in the South. A brief overview of the current state of the project is given
Resumo:
Early menopause (EM) affects up to 10% of the female population, reducing reproductive lifespan considerably. Currently, it constitutes the leading cause of infertility in the western world, affecting mainly those women who postpone their first pregnancy beyond the age of 30 years. The genetic aetiology of EM is largely unknown in the majority of cases. We have undertaken a meta-analysis of genome-wide association studies (GWASs) in 3493 EM cases and 13 598 controls from 10 independent studies. No novel genetic variants were discovered, but the 17 variants previously associated with normal age at natural menopause as a quantitative trait (QT) were also associated with EM and primary ovarian insufficiency (POI). Thus, EM has a genetic aetiology which overlaps variation in normal age at menopause and is at least partly explained by the additive effects of the same polygenic variants. The combined effect of the common variants captured by the single nucleotide polymorphism arrays was estimated to account for ∼30% of the variance in EM. The association between the combined 17 variants and the risk of EM was greater than the best validated non-genetic risk factor, smoking.