916 results for genome analysis


Relevance: 100.00%

Abstract:

HIV-1 sequence diversity is affected by selection pressures arising from host genomic factors. Using paired human and viral data from 1071 individuals, we ran >3000 genome-wide scans, testing for associations between host DNA polymorphisms, HIV-1 sequence variation and plasma viral load (VL), while considering human and viral population structure. We observed significant human SNP associations to a total of 48 HIV-1 amino acid variants (p < 2.4 × 10⁻¹²). All associated SNPs mapped to the HLA class I region. Clinical relevance of host and pathogen variation was assessed using VL results. We identified two critical advantages to the use of viral variation for identifying host factors: (1) association signals are much stronger for HIV-1 sequence variants than VL, reflecting the ‘intermediate phenotype’ nature of viral variation; (2) association testing can be run without any clinical data. The proposed genome-to-genome approach highlights sites of genomic conflict and is a strategy generally applicable to studies of host–pathogen interaction.
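Running thousands of genome-wide scans multiplies the number of statistical tests, so the per-test significance threshold has to be very strict. The following is a minimal sketch of a Bonferroni-style threshold, assuming that a family-wise alpha is divided by the number of variants times the number of scans. The abstract does not state how its threshold was derived; the function name and the numbers below are hypothetical.

```python
def bonferroni_threshold(alpha: float, n_variants: int, n_scans: int) -> float:
    """Per-test p-value threshold after a Bonferroni correction.

    Assumes every variant is tested once in every scan, so the total
    number of tests is n_variants * n_scans.
    """
    if n_variants <= 0 or n_scans <= 0:
        raise ValueError("n_variants and n_scans must be positive")
    return alpha / (n_variants * n_scans)


# Hypothetical inputs for illustration only; not taken from the study.
threshold = bonferroni_threshold(alpha=0.05, n_variants=1_000_000, n_scans=3000)
print(f"per-test threshold: {threshold:.2e}")
```

Bonferroni correction is conservative when tests are correlated, as neighbouring SNPs usually are, so it should be read as a strict upper bound on the required stringency.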

Relevance: 100.00%

Abstract:

BACKGROUND: Enterococcus faecalis has emerged as a major hospital pathogen. To explore its diversity, we sequenced E. faecalis strain OG1RF, which is commonly used for molecular manipulation and virulence studies. RESULTS: The 2,739,625 base pair chromosome of OG1RF was found to contain approximately 232 kilobases unique to this strain compared to V583, the only publicly available sequenced strain. Almost no mobile genetic elements were found in OG1RF. The 64 areas of divergence were classified into three categories. First, OG1RF carries 39 unique regions, including 2 CRISPR loci and a new WxL locus. Second, we found nine replacements where a sequence specific to V583 was substituted by a sequence specific to OG1RF. For example, the iol operon of OG1RF replaces a possible prophage and the vanB transposon in V583. Finally, we found 16 regions that were present in V583 but missing from OG1RF, including the proposed pathogenicity island, several probable prophages, and the cpsCDEFGHIJK capsular polysaccharide operon. OG1RF was more rapidly but less frequently lethal than V583 in the mouse peritonitis model and considerably outcompeted V583 in a murine model of urinary tract infections. CONCLUSION: E. faecalis OG1RF carries a number of unique loci compared to V583, but the almost complete lack of mobile genetic elements demonstrates that this is not a defining feature of the species. Additionally, OG1RF's effects in experimental models suggest that mediators of virulence may be diverse between different E. faecalis strains and that virulence is not dependent on the presence of mobile genetic elements.

Relevance: 100.00%

Abstract:

The genomes of Fusobacterium nucleatum subspecies polymorphum strain ATCC 10953, Rickettsia typhi strain Wilmington, and Francisella tularensis subspecies holarctica strain OSU18 were sequenced, annotated, and analyzed. Each genome was then compared to the sequenced genomes of closely related bacteria. The genome of F. nucleatum ATCC 10953 was compared to two additional F. nucleatum subspecies, subspecies nucleatum and subspecies vincentii. This analysis revealed substantial evidence of horizontal gene transfer along with considerable genetic diversity within the species of F. nucleatum. R. typhi was compared to R. prowazekii and R. conorii. This analysis uncovered a hotspot for chromosomal rearrangements in the Spotted Fever Group but not the Typhus Group Rickettsia and revealed the close genetic relationship between the Typhus Group rickettsial species. F. tularensis OSU18 was compared to two additional F. tularensis strains. These comparisons uncovered significant chromosomal rearrangements between F. tularensis subspecies due to recombination between insertion sequence elements.

Relevance: 100.00%

Abstract:

Infectious Bovine Keratoconjunctivitis (IBK), known as pinkeye, is a common infectious disease affecting the eyes of cattle. It is characterized by excessive tearing, inflammation of the conjunctiva, and ulceration of the cornea. Although pinkeye is non-fatal, it has a marked economic impact on the cattle industry, due to the decreased performance of infected individuals. Genetic effects on susceptibility to IBK have been studied, and Hereford, Jersey, and Holstein breeds were found to be more susceptible to IBK than Bos indicus breeds. The objectives of our study were: 1) to estimate genetic parameters of IBK scored in different categories by using a genomic threshold model, and 2) to detect markers in linkage disequilibrium with quantitative trait loci (QTL) associated with IBK.

Relevance: 100.00%

Abstract:

The recent ability to sequence whole genomes allows ready access to all genetic material. The approaches outlined here allow automated analysis of sequence for the synthesis of optimal primers in an automated multiplex oligonucleotide synthesizer (AMOS). The efficiency is such that all ORFs for an organism can be amplified by PCR. The resulting amplicons can be used directly in the construction of DNA arrays or can be cloned for a large variety of functional analyses. These tools allow a replacement of single-gene analysis with a highly efficient whole-genome analysis.

Relevance: 100.00%

Abstract:

We present a method for discovering conserved sequence motifs from families of aligned protein sequences. The method has been implemented as a computer program called emotif (http://motif.stanford.edu/emotif). Given an aligned set of protein sequences, emotif generates a set of motifs with a wide range of specificities and sensitivities. emotif also can generate motifs that describe possible subfamilies of a protein superfamily. A disjunction of such motifs often can represent the entire superfamily with high specificity and sensitivity. We have used emotif to generate sets of motifs from all 7,000 protein alignments in the blocks and prints databases. The resulting database, called identify (http://motif.stanford.edu/identify), contains more than 50,000 motifs. For each alignment, the database contains several motifs whose probabilities of matching a false positive range from 10⁻¹⁰ to 10⁻⁵. Highly specific motifs are well suited for searching entire proteomes, while generating very few false predictions. identify assigns biological functions to 25–30% of all proteins encoded by the Saccharomyces cerevisiae genome and by several bacterial genomes. In particular, identify assigned functions to 172 proteins of unknown function in the yeast genome.
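The idea that a disjunction of subfamily motifs can cover a whole superfamily can be sketched with ordinary regular expressions. This is only an illustration, assuming motifs can be written as regex patterns with allowed residue groups per position; it is not the emotif program or its file format, and the motifs and sequences below are hypothetical.

```python
import re


def matches_any(motifs, sequence):
    """True if at least one motif (a regex) occurs in the sequence."""
    return any(re.search(motif, sequence) for motif in motifs)


def sensitivity_specificity(motifs, members, non_members):
    """Score a disjunction of motifs against labelled sequences.

    Sensitivity: fraction of family members matched by any motif.
    Specificity: fraction of non-members matched by no motif.
    """
    true_pos = sum(matches_any(motifs, seq) for seq in members)
    true_neg = sum(not matches_any(motifs, seq) for seq in non_members)
    return true_pos / len(members), true_neg / len(non_members)


# Hypothetical motifs and sequences, for illustration only.
motifs = [r"G[KR][ST]T[LIV]", r"C..C.{3}H"]
members = ["MAGKSTLAA", "PPCAACQWEHLL", "TTGRTTVKK"]
non_members = ["MAAAAAA", "GKSAL"]

print(sensitivity_specificity(motifs[:1], members, non_members))  # one motif alone
print(sensitivity_specificity(motifs, members, non_members))      # the disjunction
```

In this toy data the first motif alone misses one member, while the two motifs together match all three without matching either non-member.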

Relevance: 100.00%

Abstract:

Microarrays containing 1046 human cDNAs of unknown sequence were printed on glass with high-speed robotics. These 1.0-cm² DNA "chips" were used to quantitatively monitor differential expression of the cognate human genes using a highly sensitive two-color hybridization assay. Array elements that displayed differential expression patterns under given experimental conditions were characterized by sequencing. The identification of known and novel heat shock and phorbol ester-regulated genes in human T cells demonstrates the sensitivity of the assay. Parallel gene analysis with microarrays provides a rapid and efficient method for large-scale human gene discovery.

Relevance: 100.00%

Abstract:

Translational pausing may occur due to a number of mechanisms, including the presence of non-optimal codons, and it is thought to play a role in the folding of specific polypeptide domains during translation and in the facilitation of signal peptide recognition during sec-dependent protein targeting. In this whole genome analysis of Escherichia coli we have found that non-optimal codons in the signal peptide-encoding sequences of secretory genes are overrepresented relative to the mature portions of these genes; this is in addition to their overrepresentation in the 5'-regions of genes encoding non-secretory proteins. We also find increased non-optimal codon usage at the 3' ends of most E. coli genes, in both non-secretory and secretory sequences. Whereas presumptive translational pausing at the 5' and 3' ends of E. coli messenger RNAs may clearly have a general role in translation, we suggest that it also has a specific role in sec-dependent protein export, possibly in facilitating signal peptide recognition. This finding may have important implications for our understanding of how the majority of non-cytoplasmic proteins are targeted, a process that is essential to all biological cells. © 2004 Elsevier Inc. All rights reserved.
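The comparison described above, non-optimal codon usage in a leading region versus the rest of a gene, can be sketched as follows. This is a minimal illustration, not the authors' pipeline: the set of non-optimal codons is a small placeholder, the coding sequence is made up, and the function names are hypothetical.

```python
def split_codons(cds):
    """Split a coding sequence into codons, ignoring any trailing partial codon."""
    usable = len(cds) - len(cds) % 3
    return [cds[i:i + 3].upper() for i in range(0, usable, 3)]


def non_optimal_fraction(codon_list, non_optimal):
    """Fraction of codons in codon_list that belong to the non-optimal set."""
    if not codon_list:
        return 0.0
    return sum(codon in non_optimal for codon in codon_list) / len(codon_list)


def compare_regions(cds, non_optimal, n_leading):
    """Return (fraction in the first n_leading codons, fraction in the remainder)."""
    all_codons = split_codons(cds)
    return (
        non_optimal_fraction(all_codons[:n_leading], non_optimal),
        non_optimal_fraction(all_codons[n_leading:], non_optimal),
    )


# Placeholder set of non-optimal codons; a real analysis would derive this
# from codon usage in highly expressed genes of the organism.
NON_OPTIMAL = {"AGG", "AGA", "CTA", "ATA"}

# Made-up 10-codon sequence, for illustration only.
cds = "ATGAGGCTAAGAGCTGGTAAAGAACTGTAA"
leading, remainder = compare_regions(cds, NON_OPTIMAL, n_leading=4)
print(leading, remainder)
```

A genome-wide version would aggregate these fractions over many genes and test the difference statistically, which this sketch does not attempt.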

Relevance: 100.00%

Abstract:

This thesis describes two newly sequenced B. longum subsp. longum genomes and subsequent comparative analysis with publicly available B. longum subsp. longum, B. longum subsp. infantis and B. longum subsp. suis genomes (Chapter 2). The acquired data revealed a closed pan-genome for this bifidobacterial species and furthermore facilitated the definition of the B. longum core genome. The comparative analysis also highlights differences in the potential metabolic abilities of all three sub-species. Interestingly, phylogenetic analysis of the B. longum core genome indicated the existence of a novel B. longum subspecies. Characterisation of restriction-modification systems from two B. longum subsp. longum strains is described in Chapter 3. These defence mechanisms limit the uptake of genetic material, which was successfully demonstrated for some of the identified systems. When these systems were by-passed by methylation of DNA prior to the transformation procedure, the resulting transformation efficiency of both B. longum subsp. longum strains was increased to a level that allowed for the generation of mutants via homologous recombination. Arabinoxylan metabolism by B. longum subsp. longum NCIMB 8809 was investigated in Chapter 4 of this thesis. Transcriptome analysis allowed the identification of a number of genes involved in the degradation, uptake and utilisation of arabinoxylan. Biochemical analysis revealed that three of the identified genes encode arabinofuranosidase activity. Phenotypic assessment of a number of insertion mutants in genes identified by the transcriptome analysis revealed the essential role of two of these enzymes in arabinoxylan metabolism, and a third enzyme in the metabolism of debranched arabinan. Furthermore, this investigation revealed that B. longum subsp. longum NCIMB 8809 does not completely degrade arabinoxylan, but utilises the arabinose substitutions only, while leaving the xylan backbone untouched. Finally, Chapter 5 outlines that B. longum subsp. longum NCIMB 8809 is capable of removing ferulic and p-coumaric acid substitutions that originate from arabinoxylan. Analysis of the genome sequence led to the identification of a candidate gene for this activity, which was subsequently cloned and expressed in E. coli. Biochemical analysis revealed that the enzyme, designated here as FaeA, is indeed capable of releasing both ferulic and p-coumaric acid from arabinoxylan. Furthermore, it is shown that a derivative of B. longum subsp. longum NCIMB 8809 carrying an insertion mutation in faeA had lost the ability to release ferulic and p-coumaric acid from arabinoxylan, and that growth of this mutant strain is negatively affected when cultivated on growth-limiting levels of arabinoxylan.

Relevance: 100.00%

Abstract:

The Bifidobacterium longum subsp. longum 35624™ strain (formerly named Bifidobacterium longum subsp. infantis) is a well-described probiotic that has shown clinical efficacy in Irritable Bowel Syndrome clinical trials and induces immunoregulatory effects in mice and in humans. This paper presents (a) the genome sequence of the organism, allowing its assignment to the correct subspecies, longum; (b) a comparative genome assessment with other B. longum strains; and (c) the molecular structure of the 35624 exopolysaccharide (EPS624). Comparative genome analysis of the 35624 strain with other B. longum strains determined that the sub-speciation of the strain is longum and revealed the presence of a 35624-specific gene cluster, predicted to encode the biosynthetic machinery for EPS624. Following isolation and acid treatment of the EPS, its chemical structure was determined using gas and liquid chromatography for sugar constituent and linkage analysis, electrospray and matrix assisted laser desorption ionization mass spectrometry for sequencing and NMR. The EPS consists of a branched hexasaccharide repeating unit containing two galactose and two glucose moieties, galacturonic acid and the unusual sugar 6-deoxy-L-talose. These data demonstrate that the B. longum 35624 strain has specific genetic features, one of which leads to the generation of a characteristic exopolysaccharide.

Relevance: 100.00%

Abstract:

Plant reproduction depends on the concerted activation of many genes to ensure correct communication between pollen and pistil. Here, we queried the whole transcriptome of Arabidopsis (Arabidopsis thaliana) in order to identify genes with specific reproductive functions. We used the Affymetrix ATH1 whole genome array to profile wild-type unpollinated pistils and unfertilized ovules. By comparing the expression profile of pistils at 0.5, 3.5, and 8.0 h after pollination and applying a number of statistical and bioinformatics criteria, we found 1,373 genes differentially regulated during pollen-pistil interactions. Robust clustering analysis grouped these genes in 16 time-course clusters representing distinct patterns of regulation. Coregulation within each cluster suggests the presence of distinct genetic pathways, which might be under the control of specific transcriptional regulators. A total of 78% of the regulated genes were expressed initially in unpollinated pistil and/or ovules, 15% were initially detected in the pollen data sets as enriched or preferentially expressed, and 7% were induced upon pollination. Among those, we found a particular enrichment for unknown transcripts predicted to encode secreted proteins or representing signaling and cell wall-related proteins, which may function by remodeling the extracellular matrix or as extracellular signaling molecules. A strict regulatory control in various metabolic pathways suggests that fine-tuning of the biochemical and physiological cellular environment is crucial for reproductive success. Our study provides a unique and detailed temporal and spatial gene expression profile of in vivo pollen-pistil interactions, providing a framework to better understand the basis of the molecular mechanisms operating during the reproductive process in higher plants.

Relevance: 80.00%

Abstract:

Giant Cell Arteritis (GCA) is the most common vasculitis affecting the elderly. Archived formalin-fixed paraffin-embedded (FFPE) temporal artery biopsy (TAB) specimens potentially represent a valuable resource for large-scale genetic analysis of this disease. FFPE TAB samples were obtained from 12 patients with GCA. Extracted TAB DNA was assessed by real time PCR before restoration using the Illumina HD FFPE Restore Kit. Paired FFPE-blood samples were genotyped on the Illumina OmniExpress FFPE microarray. The FFPE samples that passed stringent quality control measures had a mean genotyping success of >97%. When compared with their matching peripheral blood DNA, the mean discordant heterozygote and homozygote single nucleotide polymorphism calls were 0.0028 and 0.0003, respectively, which is within the accepted tolerance of reproducibility. This work demonstrates that it is possible to successfully obtain high-quality microarray-based genotypes from FFPE TAB samples and that these data are similar to those obtained from peripheral blood.
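A paired-sample discordance check of the kind summarised above can be sketched as below. The abstract does not define how heterozygote and homozygote discordance were computed, so this sketch makes two assumptions: a discordant call is classified by the blood (reference) genotype, and both rates use the number of SNPs called in both samples as the denominator. The function name and SNP identifiers are hypothetical.

```python
def discordance_rates(ffpe_calls, blood_calls):
    """Return (heterozygote discordance, homozygote discordance).

    Genotypes are two-letter strings such as "AG"; None marks a no-call.
    Allele order is ignored, so "AG" and "GA" count as concordant.
    """
    het_discordant = 0
    hom_discordant = 0
    compared = 0
    for snp_id, blood in blood_calls.items():
        ffpe = ffpe_calls.get(snp_id)
        if ffpe is None or blood is None:
            continue  # skip SNPs not called in both samples
        compared += 1
        if sorted(ffpe) == sorted(blood):
            continue  # concordant call
        if blood[0] != blood[1]:
            het_discordant += 1  # reference genotype is heterozygous
        else:
            hom_discordant += 1  # reference genotype is homozygous
    if compared == 0:
        return 0.0, 0.0
    return het_discordant / compared, hom_discordant / compared


# Toy paired calls with hypothetical SNP identifiers.
blood = {"snp1": "AG", "snp2": "AA", "snp3": "CT", "snp4": "GG", "snp5": None}
ffpe = {"snp1": "GA", "snp2": "AG", "snp3": "CC", "snp4": "GG", "snp5": "TT"}
print(discordance_rates(ffpe, blood))
```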

Relevance: 80.00%

Abstract:

Genome-wide association studies (GWASs) have been successful at identifying single-nucleotide polymorphisms (SNPs) highly associated with common traits; however, a great deal of the heritable variation associated with common traits remains unaccounted for within the genome. Genome-wide complex trait analysis (GCTA) is a statistical method that applies a linear mixed model to estimate phenotypic variance of complex traits explained by genome-wide SNPs, including those not associated with the trait in a GWAS. We applied GCTA to 8 cohorts containing 7096 case and 19 455 control individuals of European ancestry in order to examine the missing heritability present in Parkinson's disease (PD). We meta-analyzed our initial results to produce robust heritability estimates for PD types across cohorts. Our results identify 27% (95% CI 17-38, P = 8.08E-08) phenotypic variance associated with all types of PD, 15% (95% CI -0.2 to 33, P = 0.09) phenotypic variance associated with early-onset PD and 31% (95% CI 17-44, P = 1.34E-05) phenotypic variance associated with late-onset PD. This is a substantial increase from the genetic variance identified by top GWAS hits alone (between 3 and 5%) and indicates there are substantially more risk loci to be identified. Our results suggest that although GWASs are a useful tool in identifying the most common variants associated with complex disease, many common variants of small effect remain to be discovered. © Published by Oxford University Press 2012.
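Combining per-cohort heritability estimates into one pooled figure can be sketched with a fixed-effect, inverse-variance weighted meta-analysis. The abstract does not say which meta-analysis model was used, so the fixed-effect choice here is an assumption, and the function name and cohort values are hypothetical.

```python
import math


def fixed_effect_meta(estimates, std_errors):
    """Inverse-variance weighted pooled estimate with a 95% confidence interval.

    Each cohort is weighted by 1 / SE^2, so more precise cohorts count more.
    Returns (pooled estimate, pooled standard error, (ci_low, ci_high)).
    """
    if len(estimates) != len(std_errors) or not estimates:
        raise ValueError("need one standard error per estimate")
    weights = [1.0 / se ** 2 for se in std_errors]
    total_weight = sum(weights)
    pooled = sum(w * est for w, est in zip(weights, estimates)) / total_weight
    pooled_se = math.sqrt(1.0 / total_weight)
    ci = (pooled - 1.96 * pooled_se, pooled + 1.96 * pooled_se)
    return pooled, pooled_se, ci


# Hypothetical per-cohort heritability estimates and standard errors.
h2_estimates = [0.20, 0.40]
h2_std_errors = [0.10, 0.10]
pooled, pooled_se, ci = fixed_effect_meta(h2_estimates, h2_std_errors)
print(round(pooled, 3), round(pooled_se, 3), [round(x, 3) for x in ci])
```

A random-effects model would be the usual alternative when cohorts are expected to differ in their true heritability; that variant is not shown here.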