918 resultados para Genome annotation
Resumo:
Objectives: Sequencing and annotation of the genome of Aspergillus fumigatus has dramatically changed our knowledge about the proteins potentially encoded by the fungus. Own analysis have resulted in at least 47 of them contain a signal for secretion. Among those list we want to characterize those enzymes that may have impact on fungal growth outside and particularly inside the host. We thereby want to learn more about their function in general and to identify possible novel drug targets suited to combat invasive aspergillosis. Methods: Four groups of secreted proteases have been chosen for further analysis: 1 Serine-carboxyl proteases (sedolisins). Four of them were expressed in yeast and partly in bacteria. Substrate-specificity studies and kinetics as well as protein characterization of the yeast derived proteases were performed according to standard methods. Enzyme specific polyclonal antibodies were raised in rabbits using the peptides expressed in bacteria. Expression of proteases in A. fumigatus was investigated with these antibodies and gene knockout mutants for each enzyme as a control. All the following mentioned proteases will be investigated accordingly. 2 Two metalloproteases from the M12-family, ADAM-A and ADAM-B. Both proteases are likely membrane associated and may have inherent sheddase function as their counterparts in mammals. 3 One metalloprotease of the M43 family. An orthologue of this protease in Coccidioides posadasii is known to posses immunomodulating activities. 4 One putative endoprotease of the S28-family. An orthologue in Aspergillus niger is known to digest proline-rich proteins. In A. fumigatus this enzyme may facilitate invasion through proline-rich proteins like collagen. Results: All sedolisins expressed in yeast were proteolytically active: Three of them were characterized as tripeptidyl-peptidases whereas one enzyme is an endoprotease. Corresponding knockout mutants did not reveal a specific phenotype. Expression and investigations on all above mentioned proteases as well as generation of corresponding knockout mutants and double knockout mutants for the ADAMs, respectively, is underway. Promising candidates will be investigated in animal studies for reduced virulence. Conclusions : The real existence of so far hypothetical proteases predicted by the genome project was already demonstrated for the sedolisins by a reverse genetic approach (from gene to protein). With the aim of improving basic knowledge on function of other proteases potentially crucial for fungal growth and thus for pathogenesis, other hypothetical enzymes will be investigated. Those enzymes may turn out to be ideal drug targets for antimycotic chemotherapy.
Resumo:
Calcium has a pivotal role in biological functions, and serum calcium levels have been associated with numerous disorders of bone and mineral metabolism, as well as with cardiovascular mortality. Here we report results from a genome-wide association study of serum calcium, integrating data from four independent cohorts including a total of 12,865 individuals of European and Indian Asian descent. Our meta-analysis shows that serum calcium is associated with SNPs in or near the calcium-sensing receptor (CASR) gene on 3q13. The top hit with a p-value of 6.3 x 10(-37) is rs1801725, a missense variant, explaining 1.26% of the variance in serum calcium. This SNP had the strongest association in individuals of European descent, while for individuals of Indian Asian descent the top hit was rs17251221 (p = 1.1 x 10(-21)), a SNP in strong linkage disequilibrium with rs1801725. The strongest locus in CASR was shown to replicate in an independent Icelandic cohort of 4,126 individuals (p = 1.02 x 10(-4)). This genome-wide meta-analysis shows that common CASR variants modulate serum calcium levels in the adult general population, which confirms previous results in some candidate gene studies of the CASR locus. This study highlights the key role of CASR in calcium regulation.
Resumo:
The pubertal height growth spurt is a distinctive feature of childhood growth reflecting both the central onset of puberty and local growth factors. Although little is known about the underlying genetics, growth variability during puberty correlates with adult risks for hormone-dependent cancer and adverse cardiometabolic health. The only gene so far associated with pubertal height growth, LIN28B, pleiotropically influences childhood growth, puberty and cancer progression, pointing to shared underlying mechanisms. To discover genetic loci influencing pubertal height and growth and to place them in context of overall growth and maturation, we performed genome-wide association meta-analyses in 18 737 European samples utilizing longitudinally collected height measurements. We found significant associations (P < 1.67 × 10(-8)) at 10 loci, including LIN28B. Five loci associated with pubertal timing, all impacting multiple aspects of growth. In particular, a novel variant correlated with expression of MAPK3, and associated both with increased prepubertal growth and earlier menarche. Another variant near ADCY3-POMC associated with increased body mass index, reduced pubertal growth and earlier puberty. Whereas epidemiological correlations suggest that early puberty marks a pathway from rapid prepubertal growth to reduced final height and adult obesity, our study shows that individual loci associating with pubertal growth have variable longitudinal growth patterns that may differ from epidemiological observations. Overall, this study uncovers part of the complex genetic architecture linking pubertal height growth, the timing of puberty and childhood obesity and provides new information to pinpoint processes linking these traits.
Resumo:
Members of the genus Sphingomonas are important catalysts for removal of polycyclic aromatic hydrocarbons (PAHs) in soil, but their activity can be affected by various stress factors. This study examines the physiological and genome-wide transcription response of the phenanthrene-degrading Sphingomonas sp. strain LH128 in biofilms to solute stress (invoked by 450 mM NaCl solution), either as an acute (4-h) or a chronic (3-day) exposure. The degree of membrane fatty acid saturation was increased as a response to chronic stress. Oxygen consumption in the biofilms and phenanthrene mineralization activities of biofilm cells were, however, not significantly affected after imposing either acute or chronic stress. This finding was in agreement with the transcriptomic data, since genes involved in PAH degradation were not differentially expressed in stressed conditions compared to nonstressed conditions. The transcriptomic data suggest that LH128 adapts to NaCl stress by (i) increasing the expression of genes coping with osmolytic and ionic stress such as biosynthesis of compatible solutes and regulation of ion homeostasis, (ii) increasing the expression of genes involved in general stress response, (iii) changing the expression of general and specific regulatory functions, and (iv) decreasing the expression of protein synthesis such as proteins involved in motility. Differences in gene expression between cells under acute and chronic stress suggest that LH128 goes through changes in genome-wide expression to fully adapt to NaCl stress, without significantly changing phenanthrene degrading activity.
Resumo:
Nonalcoholic fatty liver disease (NAFLD) clusters in families, but the only known common genetic variants influencing risk are near PNPLA3. We sought to identify additional genetic variants influencing NAFLD using genome-wide association (GWA) analysis of computed tomography (CT) measured hepatic steatosis, a non-invasive measure of NAFLD, in large population based samples. Using variance components methods, we show that CT hepatic steatosis is heritable (∼26%-27%) in family-based Amish, Family Heart, and Framingham Heart Studies (n = 880 to 3,070). By carrying out a fixed-effects meta-analysis of genome-wide association (GWA) results between CT hepatic steatosis and ∼2.4 million imputed or genotyped SNPs in 7,176 individuals from the Old Order Amish, Age, Gene/Environment Susceptibility-Reykjavik study (AGES), Family Heart, and Framingham Heart Studies, we identify variants associated at genome-wide significant levels (p<5×10(-8)) in or near PNPLA3, NCAN, and PPP1R3B. We genotype these and 42 other top CT hepatic steatosis-associated SNPs in 592 subjects with biopsy-proven NAFLD from the NASH Clinical Research Network (NASH CRN). In comparisons with 1,405 healthy controls from the Myocardial Genetics Consortium (MIGen), we observe significant associations with histologic NAFLD at variants in or near NCAN, GCKR, LYPLAL1, and PNPLA3, but not PPP1R3B. Variants at these five loci exhibit distinct patterns of association with serum lipids, as well as glycemic and anthropometric traits. We identify common genetic variants influencing CT-assessed steatosis and risk of NAFLD. Hepatic steatosis associated variants are not uniformly associated with NASH/fibrosis or result in abnormalities in serum lipids or glycemic and anthropometric traits, suggesting genetic heterogeneity in the pathways influencing these traits.
Resumo:
Evolution of proteins after whole-genome duplicationGene and genome duplication are considered major mechanisms in the creation of newfunctions in genomes, or in the refinement of networks by the division of function amongmore genes. In animals, the best demonstrated whole genome duplication occurred at theorigin of Teleost fishes. This makes fishes an ideal model to study the consequences ofgenome duplication, particularly since we have a good sampling of genome sequences,abundant functional information, and a very well studied outgroup: the tetrapodes (includinghuman). More specifically, I studied the consequences of duplication on proteins usingevolutionary models to infer adaptive events. I analysed the influence of positive selection invertebrate genes, by contrasting singleton genes and duplicated genes. The conclusion of theanalyses was threefold: (i) positive selection affects diverse phylogenetic branches anddiverse gene categories during vertebrate evolution; (ii) it concerns only a small proportion ofsites (1%-5%); and (iii) whole genome duplication had no detectable impact on theprevalence of this positive selection.I also studied evolution at the amino acid level with different methods to detect functionalshifts (covarion process and constant-but-different process). As in my previous research, Ifound similar numbers of functional shifts between duplicates and between orthologs.The accepted framework for studies of molecular evolution is that orthologs share the samefunction, whereas the function of paralogs diverges. This framework gives a special place togene duplication in evolution, as the main mechanism for generating novelty. With myprevious results showing that duplication and speciation are not so different, we investigatedthe literature to question the evidence for similar or divergent evolution of gene function afterduplication relative to speciation genes. This led us to propose a more rigorous design offuture studies of gene duplication.Finally, based on my automated protocol, we built a database of positive selection invertebrates' genes, Selectome. This database is freely available on the web and will helpfuture evolutionary as well as biochemical studies.
Resumo:
Taphrina deformans is a fungus responsible for peach leaf curl, an important plant disease. It is phylogenetically assigned to the Taphrinomycotina subphylum, which includes the fission yeast and the mammalian pathogens of the genus Pneumocystis. We describe here the genome of T. deformans in the light of its dual plant-saprophytic/plant-parasitic lifestyle. The 13.3-Mb genome contains few identifiable repeated elements (ca. 1.5%) and a relatively high GC content (49.5%). A total of 5,735 protein-coding genes were identified, among which 83% share similarities with other fungi. Adaptation to the plant host seems reflected in the genome, since the genome carries genes involved in plant cell wall degradation (e.g., cellulases and cutinases), secondary metabolism, the hallmark glyoxylate cycle, detoxification, and sterol biosynthesis, as well as genes involved in the biosynthesis of plant hormones. Genes involved in lipid metabolism may play a role in its virulence. Several locus candidates for putative MAT cassettes and sex-related genes akin to those of Schizosaccharomyces pombe were identified. A mating-type-switching mechanism similar to that found in ascomycetous yeasts could be in effect. Taken together, the findings are consistent with the alternate saprophytic and parasitic-pathogenic lifestyles of T. deformans. IMPORTANCE: Peach leaf curl is an important plant disease which causes significant losses of fruit production. We report here the genome sequence of the causative agent of the disease, the fungus Taphrina deformans. The genome carries characteristic genes that are important for the plant infection process. These include (i) proteases that allow degradation of the plant tissues; (ii) secondary metabolites which are products favoring interaction of the fungus with the environment, including the host; (iii) hormones that are responsible for the symptom of severely distorted leaves on the host; and (iv) drug detoxification enzymes that confer resistance to fungicides. The availability of the genome allows the design of new drug targets as well as the elaboration of specific management strategies to fight the disease.
Resumo:
Aphids are important agricultural pests and also biological models for studies of insect-plant interactions, symbiosis, virus vectoring, and the developmental causes of extreme phenotypic plasticity. Here we present the 464 Mb draft genome assembly of the pea aphid Acyrthosiphon pisum. This first published whole genome sequence of a basal hemimetabolous insect provides an outgroup to the multiple published genomes of holometabolous insects. Pea aphids are host-plant specialists, they can reproduce both sexually and asexually, and they have coevolved with an obligate bacterial symbiont. Here we highlight findings from whole genome analysis that may be related to these unusual biological features. These findings include discovery of extensive gene duplication in more than 2000 gene families as well as loss of evolutionarily conserved genes. Gene family expansions relative to other published genomes include genes involved in chromatin modification, miRNA synthesis, and sugar transport. Gene losses include genes central to the IMD immune pathway, selenoprotein utilization, purine salvage, and the entire urea cycle. The pea aphid genome reveals that only a limited number of genes have been acquired from bacteria; thus the reduced gene count of Buchnera does not reflect gene transfer to the host genome. The inventory of metabolic genes in the pea aphid genome suggests that there is extensive metabolite exchange between the aphid and Buchnera, including sharing of amino acid biosynthesis between the aphid and Buchnera. The pea aphid genome provides a foundation for post-genomic studies of fundamental biological questions and applied agricultural problems.
Resumo:
Determination of the precise composition and variation of microbiota in cystic fibrosis lungs is crucial since chronic inflammation due to microorganisms leads to lung damage and ultimately, death. However, this constitutes a major technical challenge. Culturing of microorganisms does not provide a complete representation of a microbiota, even when using culturomics (high-throughput culture). So far, only PCR-based metagenomics have been investigated. However, these methods are biased towards certain microbial groups, and suffer from uncertain quantification of the different microbial domains. We have explored whole genome sequencing (WGS) using the Illumina high-throughput technology applied directly to DNA extracted from sputa obtained from two cystic fibrosis patients. To detect all microorganism groups, we used four procedures for DNA extraction, each with a different lysis protocol. We avoided biases due to whole DNA amplification thanks to the high efficiency of current Illumina technology. Phylogenomic classification of the reads by three different methods produced similar results. Our results suggest that WGS provides, in a single analysis, a better qualitative and quantitative assessment of microbiota compositions than cultures and PCRs. WGS identified a high quantity of Haemophilus spp. (patient 1) or Staphylococcus spp. plus Streptococcus spp. (patient 2) together with low amounts of anaerobic (Veillonella, Prevotella, Fusobacterium) and aerobic bacteria (Gemella, Moraxella, Granulicatella). WGS suggested that fungal members represented very low proportions of the microbiota, which were detected by cultures and PCRs because of their selectivity. The future increase of reads' sizes and decrease in cost should ensure the usefulness of WGS for the characterisation of microbiota.
Resumo:
BACKGROUND: DNA sequence polymorphisms analysis can provide valuable information on the evolutionary forces shaping nucleotide variation, and provides an insight into the functional significance of genomic regions. The recent ongoing genome projects will radically improve our capabilities to detect specific genomic regions shaped by natural selection. Current available methods and software, however, are unsatisfactory for such genome-wide analysis. RESULTS: We have developed methods for the analysis of DNA sequence polymorphisms at the genome-wide scale. These methods, which have been tested on a coalescent-simulated and actual data files from mouse and human, have been implemented in the VariScan software package version 2.0. Additionally, we have also incorporated a graphical-user interface. The main features of this software are: i) exhaustive population-genetic analyses including those based on the coalescent theory; ii) analysis adapted to the shallow data generated by the high-throughput genome projects; iii) use of genome annotations to conduct a comprehensive analyses separately for different functional regions; iv) identification of relevant genomic regions by the sliding-window and wavelet-multiresolution approaches; v) visualization of the results integrated with current genome annotations in commonly available genome browsers. CONCLUSION: VariScan is a powerful and flexible suite of software for the analysis of DNA polymorphisms. The current version implements new algorithms, methods, and capabilities, providing an important tool for an exhaustive exploratory analysis of genome-wide DNA polymorphism data.
Resumo:
Background: The trithorax group (trxG) genes absent, small or homeotic discs 1 (ash1) and 2 (ash2) were isolated in a screen for mutants with abnormal imaginal discs. Mutations in either gene cause homeotic transformations but Hox genes are not their only targets. Although analysis of double mutants revealed that ash2 and ash1 mutations enhance each other's phenotypes, suggesting they are functionally related, it was shown that these proteins are subunits of distinct complexes.Results: The analysis of wing imaginal disc transcriptomes from ash2 and ash1 mutants showed that they are highly similar. Functional annotation of regulated genes using Gene Ontology allowed identification of severely affected groups of genes that could be correlated to the wing phenotypes observed. Comparison of the differentially expressed genes with those from other genome-wide analyses revealed similarities between ASH2 and Sin3A, suggesting a putative functional relationship. Coimmunoprecipitation studies and immunolocalization on polytene chromosomes demonstrated that ASH2 and Sin3A interact with HCF (host-cell factor). The results of nucleosome western blots and clonal analysis indicated that ASH2 is necessary for trimethylation of the Lys4 on histone 3 (H3K4).Conclusion: The similarity between the transcriptomes of ash2 and ash1 mutants supports a model in which the two genes act together to maintain stable states of transcription. Like in humans, both ASH2 and Sin3A bind HCF. Finally, the reduction of H3K4 trimethylation in ash2 mutants is the first evidence in Drosophila regarding the molecular function of this trxG gene.
Resumo:
Genome-wide association studies have been instrumental in identifying genetic variants associated with complex traits such as human disease or gene expression phenotypes. It has been proposed that extending existing analysis methods by considering interactions between pairs of loci may uncover additional genetic effects. However, the large number of possible two-marker tests presents significant computational and statistical challenges. Although several strategies to detect epistasis effects have been proposed and tested for specific phenotypes, so far there has been no systematic attempt to compare their performance using real data. We made use of thousands of gene expression traits from linkage and eQTL studies, to compare the performance of different strategies. We found that using information from marginal associations between markers and phenotypes to detect epistatic effects yielded a lower false discovery rate (FDR) than a strategy solely using biological annotation in yeast, whereas results from human data were inconclusive. For future studies whose aim is to discover epistatic effects, we recommend incorporating information about marginal associations between SNPs and phenotypes instead of relying solely on biological annotation. Improved methods to discover epistatic effects will result in a more complete understanding of complex genetic effects.
Resumo:
HIV-1 sequence diversity is affected by selection pressures arising from host genomic factors. Using paired human and viral data from 1071 individuals, we ran >3000 genome-wide scans, testing for associations between host DNA polymorphisms, HIV-1 sequence variation and plasma viral load (VL), while considering human and viral population structure. We observed significant human SNP associations to a total of 48 HIV-1 amino acid variants (p<2.4 × 10(-12)). All associated SNPs mapped to the HLA class I region. Clinical relevance of host and pathogen variation was assessed using VL results. We identified two critical advantages to the use of viral variation for identifying host factors: (1) association signals are much stronger for HIV-1 sequence variants than VL, reflecting the 'intermediate phenotype' nature of viral variation; (2) association testing can be run without any clinical data. The proposed genome-to-genome approach highlights sites of genomic conflict and is a strategy generally applicable to studies of host-pathogen interaction. DOI:http://dx.doi.org/10.7554/eLife.01123.001.
Resumo:
The nucleoid-associated proteins Hha and YdgT repress the expression of the toxin α-hemolysin. An Escherichia coli mutant lacking these proteins overexpresses the toxin α-hemolysin encoded in the multicopy recombinant plasmid pANN202-312R. Unexpectedly, we could observe that this mutant generated clones that no further produced hemolysin (Hly-). Generation of Hly- clones was dependent upon the presence in the culture medium of the antibiotic kanamycin (km), a marker of the hha allele (hha::Tn5). Detailed analysis of different Hly- clones evidenced that recombination between partial IS91 sequences that flank the hly operon had occurred. A fluctuation test evidenced that the presence of km in the culture medium was underlying the generation of these clones. A decrease of the km concentration from 25 mg/l to 12.5 mg/l abolished the appearance of Hly- derivatives. We considered as a working hypothesis that, when producing high levels of the toxin (combination of the hha ydgT mutations with the presence of the multicopy hemolytic plasmid pANN202-312R), the concentration of km of 25 mg/l resulted subinhibitory and stimulated the recombination between adjacent IS91 flanking sequences. To further test this hypothesis, we analyzed the effect of subinhibitory km concentrations in the wild type E. coli strain MG1655 harboring the parental low copy number plasmid pHly152. At a km concentration of 5 mg/l, subinhibitory for strain MG1655 (pHly152), generation of Hly- clones could be readily detected. Similar results were also obtained when, instead of km, ampicillin was used. IS91 is flanking several virulence determinants in different enteric bacterial pathogenic strains from E. coli and Shigella. The results presented here evidence that stress generated by exposure to subinhibitory antibiotic concentrations may result in rearrangements of the bacterial genome. Whereas some of these rearrangements may be deleterious, others may generate genotypes with increased virulence, which may resume infection.