935 results for Complete Genome Sequence


Relevance:

30.00%

Publisher:

Abstract:

The complete sequence of the 7.07 Mb genome of the biological control agent Pseudomonas fluorescens Pf-5 is now available, providing a new opportunity to advance knowledge of biological control through genomics. P. fluorescens Pf-5 is a rhizosphere bacterium that suppresses seedling emergence diseases and produces a spectrum of antibiotics toxic to plant-pathogenic fungi and oomycetes. In addition to six known secondary metabolites produced by Pf-5, three novel secondary metabolite biosynthesis gene clusters identified in the genome could also contribute to biological control. The genomic sequence provides numerous clues as to mechanisms used by the bacterium to survive in the spermosphere and rhizosphere. These features include broad catabolic and transport capabilities for utilizing seed and root exudates, an expanded collection of efflux systems for defense against environmental stress and microbial competition, and the presence of 45 outer membrane receptors that should allow for the uptake of iron from a wide array of siderophores produced by soil microorganisms. As expected for a bacterium with a large genome that lives in a rapidly changing environment, Pf-5 has an extensive collection of regulatory genes, only some of which have been characterized for their roles in regulation of secondary metabolite production or biological control. Consistent with its commensal lifestyle, Pf-5 appears to lack a number of virulence and pathogenicity factors found in plant pathogens.

Relevance:

30.00%

Publisher:

Abstract:

Recent technological progress has greatly facilitated de novo genome sequencing. However, de novo assemblies consist of many pieces of contiguous sequence (contigs) arranged in thousands of scaffolds instead of small numbers of chromosomes. Confirming and improving the quality of such assemblies is critical for subsequent analysis. We present a method to evaluate genome scaffolding by aligning independently obtained transcriptome sequences to the genome and visually summarizing the alignments using the Cytoscape software. Applying this method to the genome of the red fire ant Solenopsis invicta allowed us to identify inconsistencies in 7%, confirm contig order in 20% and extend 16% of scaffolds. Scripts that generate tables for visualization in Cytoscape from FASTA sequence and scaffolding information files are publicly available at https://github.com/ksanao/TGNet.
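The idea of checking scaffolds against transcript alignments can be illustrated with a minimal Python sketch. It is not taken from the TGNet scripts: the input format (tuples of transcript ID, contig ID and start position on the transcript), the function names and the three labels are assumptions made for illustration, and contig orientation is ignored.

```python
from collections import defaultdict


def contig_links(alignments):
    """Return contig pairs that are joined by a single transcript.

    alignments: iterable of (transcript_id, contig_id, transcript_start)
    tuples, one per aligned transcript segment (hypothetical input format).
    """
    by_transcript = defaultdict(list)
    for transcript_id, contig_id, start in alignments:
        by_transcript[transcript_id].append((start, contig_id))

    links = defaultdict(int)
    for segments in by_transcript.values():
        segments.sort()  # order the segments along the transcript
        contigs = [contig for _, contig in segments]
        for left, right in zip(contigs, contigs[1:]):
            if left != right:
                links[(left, right)] += 1  # count supporting transcripts
    return dict(links)


def classify_links(links, scaffold_order):
    """Label each link relative to the existing scaffolding.

    scaffold_order: dict mapping contig_id -> (scaffold_id, position index).
    """
    report = {}
    for (left, right), support in links.items():
        a, b = scaffold_order.get(left), scaffold_order.get(right)
        if a is None or b is None or a[0] != b[0]:
            label = "extends"       # joins different scaffolds or unplaced contigs
        elif abs(a[1] - b[1]) == 1:
            label = "confirms"      # contigs are neighbours in one scaffold
        else:
            label = "inconsistent"  # same scaffold, but not neighbours
        report[(left, right)] = (label, support)
    return report
```

A table of such labelled links could then be loaded into a network viewer, with contigs as nodes and transcript-supported links as edges.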

Relevance:

30.00%

Publisher:

Abstract:

The molecular diversity of viruses complicates the interpretation of viral genomic and proteomic data. To make sense of viral gene functions, investigators must be familiar with the virus host range, replication cycle and virion structure. Our aim is to provide a comprehensive resource bridging textbook knowledge with genomic and proteomic sequences. The ViralZone web resource (www.expasy.org/viralzone/) provides fact sheets on all known virus families/genera with easy access to sequence data. A selection of reference strains (RefStrain) provides annotated standards to circumvent the exponential increase of virus sequences. Moreover, ViralZone offers a complete set of detailed and accurate virion pictures.

Relevance:

30.00%

Publisher:

Abstract:

Association studies have revealed expression quantitative trait loci (eQTLs) for a large number of genes. However, the causative variants that regulate gene expression levels are generally unknown. We hypothesized that copy-number variation of sequence repeats contributes to the expression variation of some genes. Our laboratory has previously identified that the rare expansion of a repeat c.-174CGGGGCGGGGCG in the promoter region of the CSTB gene causes a silencing of the gene, resulting in progressive myoclonus epilepsy. Here, we genotyped the repeat length and quantified CSTB expression by quantitative real-time polymerase chain reaction in 173 lymphoblastoid cell lines (LCLs) and fibroblast samples from the GenCord collection. The majority of alleles contain either two or three copies of this repeat. Independent analysis revealed that the c.-174CGGGGCGGGGCG repeat length is strongly associated with CSTB expression (P = 3.14 × 10^-11) in LCLs only. Examination of both genotyped and imputed single-nucleotide polymorphisms (SNPs) within 2 Mb of CSTB revealed that the dodecamer repeat represents the strongest cis-eQTL for CSTB in LCLs. We conclude that the common two or three copy variation is likely the causative cis-eQTL for CSTB expression variation. More broadly, we propose that polymorphic tandem repeats may represent the causative variation of a fraction of cis-eQTLs in the genome.
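To make the notion of repeat copy number concrete, the short Python sketch below counts back-to-back copies of the dodecamer in a sequence string. It is an illustration only: the abstract does not describe how the repeat was genotyped, the function name is hypothetical, and only exact, uninterrupted copies are counted.

```python
import re

DODECAMER = "CGGGGCGGGGCG"  # repeat unit quoted in the abstract


def max_tandem_copies(sequence, unit=DODECAMER):
    """Return the largest number of back-to-back exact copies of `unit`."""
    pattern = re.compile(f"(?:{re.escape(unit)})+")
    runs = [len(match.group(0)) // len(unit)
            for match in pattern.finditer(sequence.upper())]
    return max(runs, default=0)  # 0 when the unit never occurs
```

For an allele sequence carrying three consecutive units, the function returns 3; imperfect copies would need an alignment-based approach instead.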

Relevance:

30.00%

Publisher:

Abstract:

The current availability of five complete genomes of different primate species allows the analysis of genetic divergence over the last 40 million years of evolution. We hypothesized that the interspecies differences observed in susceptibility to HIV-1 would be influenced by the long-range selective pressures on host genes associated with HIV-1 pathogenesis. We established a list of human genes (n = 140) proposed to be involved in HIV-1 biology and pathogenesis and a control set of 100 random genes. We retrieved the orthologous genes from the genome of humans and of four nonhuman primates (Pan troglodytes, Pongo pygmaeus abelii, Macaca mulatta, and Callithrix jacchus) and analyzed the nucleotide substitution patterns of this data set using codon-based maximum likelihood procedures. In addition, we evaluated whether the candidate genes have been targets of recent positive selection in humans by analyzing HapMap Phase 2 single-nucleotide polymorphisms genotyped in a region centered on each candidate gene. A total of 1,064 sequences were used for the analyses. Similar median K(A)/K(S) values were estimated for the set of genes involved in HIV-1 pathogenesis and for control genes, 0.19 and 0.15, respectively. However, genes of the innate immunity had median values of 0.37 (P value = 0.0001, compared with control genes), and genes of intrinsic cellular defense had K(A)/K(S) values around or greater than 1.0 (P value = 0.0002). Detailed assessment allowed the identification of residues under positive selection in 13 proteins: AKT1, APOBEC3G, APOBEC3H, CD4, DEFB1, GML, IL4, IL8RA, L-SIGN/CLEC4M, PTPRC/CD45, Tetherin/BST2, TLR7, and TRIM5alpha. A number of those residues are relevant for HIV-1 biology. The set of 140 genes involved in HIV-1 pathogenesis did not show a significant enrichment in signals of recent positive selection in humans (intraspecies selection). However, we identified within or near these genes 24 polymorphisms showing strong signatures of recent positive selection. Interestingly, the DEFB1 gene presented signatures of both interspecies positive selection in primates and intraspecies recent positive selection in humans. The systematic assessment of long-acting selective pressures on primate genomes is a useful tool to extend our understanding of genetic variation influencing contemporary susceptibility to HIV-1.

Relevance:

30.00%

Publisher:

Abstract:

A series of mutations, including 5' and 3' deletions as well as insertions, were introduced into the 5' flanking nucleotide sequence of a vaccinia virus late gene. This DNA has been shown previously to contain all the necessary elements for correct regulation of the gene, which is most probably transcribed by the viral RNA polymerase. To facilitate the assays, the mutated DNA was fused to the chloramphenicol acetyltransferase gene and inserted into the genome of live vaccinia virus. The effects of the mutations on expression of the chimeric gene were studied by both enzyme assays and nuclease S1 analysis. The results showed that 5' deletions up to about 15 bp from the putative initiation site of transcription still yielded high levels of gene expression. However, all mutations that deleted the authentic late mRNA start site abolished promoter activity.

Relevance:

30.00%

Publisher:

Abstract:

BACKGROUND: Complete mitochondrial genome sequences have become important tools for the study of genome architecture, phylogeny, and molecular evolution. Despite the rapid increase in available mitogenomes, the taxonomic sampling often poorly reflects phylogenetic diversity and is often also biased to represent deeper (family-level) evolutionary relationships. RESULTS: We present the first fully sequenced ant (Hymenoptera: Formicidae) mitochondrial genomes. We sampled four mitogenomes from three species of fire ants, genus Solenopsis, which represent various evolutionary depths. Overall, ant mitogenomes appear to be typical of hymenopteran mitogenomes, displaying a general A+T-bias. The Solenopsis mitogenomes are slightly more compact than other hymenopteran mitogenomes (~15.5 kb), retaining all protein coding genes, ribosomal, and transfer RNAs. We also present evidence of recombination between the two conspecific Solenopsis mitogenomes. Finally, we discuss potential ways to improve the estimation of phylogenies using complete mitochondrial genome sequences. CONCLUSIONS: The ant mitogenome presents an important addition to the continued efforts in studying hymenopteran mitogenome architecture, evolution, and phylogenetics. We provide further evidence that the sampling across many taxonomic levels (including conspecifics and congeners) is useful and important to gain detailed insights into mitogenome evolution. We also discuss ways that may help improve the use of mitogenomes in phylogenetic analyses by accounting for non-stationary and non-homogeneous evolution among branches.

Relevance:

30.00%

Publisher:

Abstract:

A repeated DNA element in Xenopus laevis is described that is present in about 7500 copies dispersed throughout the genome. It was first identified in the 5' flanking region of one vitellogenin gene and was therefore named the Vi element. Seven copies are present within the vitellogenin gene region, three of them within introns of the genes A1, A2 and B2, and the other four copies in the gene flanking regions. Four of these copies have been sequenced. The Vi element is bounded by a well-conserved 13 base-pair inverted repeat; in addition, it is flanked by a three base-pair direct repeat that appears to be site-specific. The length of these four copies varies from 112 to 469 base-pairs; however, sequence homology between the different copies is very high. Their structural characteristics suggest that length heterogeneity may have arisen by either unequal recombinations, deletions or tandem duplications. Altogether, the characteristics and properties of the Vi element indicate that it might represent a mobile genetic element. One of the four copies sequenced is inserted close (position -535) to the transcription initiation site of the vitellogenin gene B2 in a region otherwise showing considerable homology with the closely related gene B1. Nevertheless, the presence of the Vi element does not seem to influence significantly the estrogen-controlled expression of gene B2. In addition, three alleles of this gene created by length polymorphism in intron 3 and in the Vi element inserted near the transcription initiation site are described.
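The terminal inverted repeat described above can be checked computationally. The Python sketch below is a minimal illustration, not the authors' procedure: it compares the first 13 bases of a candidate element with the reverse complement of its last 13 bases and counts the mismatches. The function names are hypothetical, and only the bases A, C, G and T are handled.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def terminal_inverted_repeat_mismatches(element, length=13):
    """Count mismatches between the two arms of a terminal inverted repeat.

    The left arm is the first `length` bases; the right arm is the reverse
    complement of the last `length` bases. A perfect inverted repeat gives 0.
    """
    if len(element) < 2 * length:
        raise ValueError("element is shorter than two repeat arms")
    left = element[:length].upper()
    right = reverse_complement(element[-length:])
    return sum(a != b for a, b in zip(left, right))
```

A small mismatch count on a candidate copy would be consistent with a well-conserved inverted repeat; the threshold for "well-conserved" is left to the reader.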

Relevance:

30.00%

Publisher:

Abstract:

Determination of the precise composition and variation of microbiota in cystic fibrosis lungs is crucial since chronic inflammation due to microorganisms leads to lung damage and ultimately, death. However, this constitutes a major technical challenge. Culturing of microorganisms does not provide a complete representation of a microbiota, even when using culturomics (high-throughput culture). So far, only PCR-based metagenomics have been investigated. However, these methods are biased towards certain microbial groups, and suffer from uncertain quantification of the different microbial domains. We have explored whole genome sequencing (WGS) using the Illumina high-throughput technology applied directly to DNA extracted from sputa obtained from two cystic fibrosis patients. To detect all microorganism groups, we used four procedures for DNA extraction, each with a different lysis protocol. We avoided biases due to whole DNA amplification thanks to the high efficiency of current Illumina technology. Phylogenomic classification of the reads by three different methods produced similar results. Our results suggest that WGS provides, in a single analysis, a better qualitative and quantitative assessment of microbiota compositions than cultures and PCRs. WGS identified a high quantity of Haemophilus spp. (patient 1) or Staphylococcus spp. plus Streptococcus spp. (patient 2) together with low amounts of anaerobic (Veillonella, Prevotella, Fusobacterium) and aerobic bacteria (Gemella, Moraxella, Granulicatella). WGS suggested that fungal members represented very low proportions of the microbiota, which were detected by cultures and PCRs because of their selectivity. The future increase of reads' sizes and decrease in cost should ensure the usefulness of WGS for the characterisation of microbiota.

Relevance:

30.00%

Publisher:

Abstract:

BACKGROUND: DNA sequence polymorphism analysis can provide valuable information on the evolutionary forces shaping nucleotide variation, and provides an insight into the functional significance of genomic regions. The recent ongoing genome projects will radically improve our capabilities to detect specific genomic regions shaped by natural selection. Currently available methods and software, however, are unsatisfactory for such genome-wide analysis. RESULTS: We have developed methods for the analysis of DNA sequence polymorphisms at the genome-wide scale. These methods, which have been tested on coalescent-simulated and actual data files from mouse and human, have been implemented in the VariScan software package version 2.0. Additionally, we have also incorporated a graphical user interface. The main features of this software are: i) exhaustive population-genetic analyses including those based on the coalescent theory; ii) analysis adapted to the shallow data generated by the high-throughput genome projects; iii) use of genome annotations to conduct comprehensive analyses separately for different functional regions; iv) identification of relevant genomic regions by the sliding-window and wavelet-multiresolution approaches; v) visualization of the results integrated with current genome annotations in commonly available genome browsers. CONCLUSION: VariScan is a powerful and flexible suite of software for the analysis of DNA polymorphisms. The current version implements new algorithms, methods, and capabilities, providing an important tool for an exhaustive exploratory analysis of genome-wide DNA polymorphism data.
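As an illustration of the sliding-window approach mentioned in feature iv), the Python sketch below computes nucleotide diversity (the average number of pairwise differences per site) in successive windows along an alignment. It is not VariScan's implementation: the function names, the plain list-of-strings input and the handling of windows are assumptions, and gaps or missing data are treated as ordinary characters.

```python
from itertools import combinations


def nucleotide_diversity(alignment):
    """Average pairwise differences per site among aligned sequences."""
    if len(alignment) < 2:
        raise ValueError("need at least two sequences")
    length = len(alignment[0])
    if length == 0 or any(len(seq) != length for seq in alignment):
        raise ValueError("sequences must be non-empty and of equal length")
    total_diff = 0
    n_pairs = 0
    for a, b in combinations(alignment, 2):
        total_diff += sum(x != y for x, y in zip(a, b))
        n_pairs += 1
    return total_diff / (n_pairs * length)


def sliding_window_pi(alignment, window, step):
    """Yield (start, pi) for each full window along the alignment."""
    length = len(alignment[0])
    for start in range(0, length - window + 1, step):
        chunk = [seq[start:start + window] for seq in alignment]
        yield start, nucleotide_diversity(chunk)
```

Windows whose diversity departs strongly from the rest of the alignment would be candidates for closer inspection, which is the general purpose of a sliding-window scan.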

Relevance:

30.00%

Publisher:

Abstract:

Background: Non-long terminal repeat (non-LTR) retrotransposons have contributed to shaping the structure and function of genomes. In silico and experimental approaches have been used to identify the non-LTR elements of the urochordate Ciona intestinalis. Knowledge of the types and abundance of non-LTR elements in urochordates is a key step in understanding their contribution to the structure and function of vertebrate genomes. Results: Consensus elements phylogenetically related to the I, LINE1, LINE2, LOA and R2 elements of the 14 eukaryotic non-LTR clades are described from C. intestinalis. The ascidian elements showed conservation of both the reverse transcriptase coding sequence and the overall structural organization seen in each clade. The apurinic/apyrimidinic endonuclease and nucleic-acid-binding domains encoded upstream of the reverse transcriptase, and the RNase H and the restriction enzyme-like endonuclease motifs encoded downstream of the reverse transcriptase were identified in the corresponding Ciona families. Conclusions: The genome of C. intestinalis harbors representatives of at least five clades of non-LTR retrotransposons. The copy number per haploid genome of each element is low, less than 100, far below the values reported for vertebrate counterparts but within the range for protostomes. Genomic and sequence analysis shows that the ascidian non-LTR elements are unmethylated and flanked by genomic segments with a gene density lower than average for the genome. The analysis provides valuable data for understanding the evolution of early chordate genomes and enlarges the view on the distribution of the non-LTR retrotransposons in eukaryotes.

Relevance:

30.00%

Publisher:

Abstract:

BACKGROUND: Small RNAs (sRNAs) are widespread among bacteria and have diverse regulatory roles. Most of these sRNAs have been discovered by a combination of computational and experimental methods. In Pseudomonas aeruginosa, a ubiquitous Gram-negative bacterium and opportunistic human pathogen, the GacS/GacA two-component system positively controls the transcription of two sRNAs (RsmY, RsmZ), which are crucial for the expression of genes involved in virulence. In the biocontrol bacterium Pseudomonas fluorescens CHA0, three GacA-controlled sRNAs (RsmX, RsmY, RsmZ) regulate the response to oxidative stress and the expression of extracellular products including biocontrol factors. RsmX, RsmY and RsmZ contain multiple unpaired GGA motifs and control the expression of target mRNAs at the translational level, by sequestration of translational repressor proteins of the RsmA family. RESULTS: A combined computational and experimental approach enabled us to identify 14 intergenic regions encoding sRNAs in P. aeruginosa. Eight of these regions encode newly identified sRNAs. The intergenic region 1698 was found to specify a novel GacA-controlled sRNA termed RgsA. GacA regulation appeared to be indirect. In P. fluorescens CHA0, an RgsA homolog was also expressed under positive GacA control. This 120-nt sRNA contained a single GGA motif and, unlike RsmX, RsmY and RsmZ, was unable to derepress translation of the hcnA gene (involved in the biosynthesis of the biocontrol factor hydrogen cyanide), but contributed to the bacterium's resistance to hydrogen peroxide. In both P. aeruginosa and P. fluorescens the stress sigma factor RpoS was essential for RgsA expression. CONCLUSION: The discovery of an additional sRNA expressed under GacA control in two Pseudomonas species highlights the complexity of this global regulatory system and suggests that the mode of action of GacA control may be more elaborate than previously suspected. Our results also confirm that several GGA motifs are required in an sRNA for sequestration of the RsmA protein.

Relevance:

30.00%

Publisher:

Abstract:

Centrifuge is a user-friendly system to simultaneously access Arabidopsis gene annotations and intra- and inter-organism sequence comparison data. The tool allows rapid retrieval of user-selected data for each annotated Arabidopsis gene providing, in any combination, data on the following features: predicted protein properties such as mass, pI, cellular location and transmembrane domains; SWISS-PROT annotations; Interpro domains; Gene Ontology records; verified transcription; BLAST matches to the proteomes of A. thaliana, Oryza sativa (rice), Caenorhabditis elegans, Drosophila melanogaster and Homo sapiens. The tool lends itself particularly well to the rapid analysis of contigs or of tens or hundreds of genes identified by high-throughput gene expression experiments. In these cases, a summary table of principal predicted protein features for all genes is given, followed by more detailed reports for each individual gene. Centrifuge can also be used for single gene analysis or in a word search mode. AVAILABILITY: http://centrifuge.unil.ch/ CONTACT: edward.farmer@unil.ch.

Relevance:

30.00%

Publisher:

Abstract:

We report a new set of nine primer pairs specifically developed for amplification of Brassica plastid SSR markers. The wide utility of these markers is demonstrated for haplotype identification and detection of polymorphism in B. napus, B. nigra, B. oleracea, B. rapa and in related genera Arabidopsis, Camelina, Raphanus and Sinapis. Eleven gene regions (ndhB-rps7 spacer, rbcL-accD spacer, rpl16 intron, rps16 intron, atpB-rbcL spacer, trnE-trnT spacer, trnL intron, trnL-trnF spacer, trnM-atpE spacer, trnR-rpoC2 spacer, ycf3-psaA spacer) were sequenced from a range of Brassica and related genera for SSR detection and primer design. Other sequences were obtained from GenBank/EMBL. Eight out of nine selected SSR loci showed polymorphism when amplified using the new primers, and a combined analysis detected variation within and between Brassica species, with the number of alleles detected per locus ranging from 5 (loci MF-6, MF-1) to 11 (locus MF-7). The combined SSR data were used in a neighbour-joining analysis (SMM, D (DM) distances) to group the samples based on the presence and absence of alleles. The analysis was generally able to separate plastid types into taxon-specific groups. Multi-allelic haplotypes were plotted onto the neighbour-joining tree. A total of 28 haplotypes were detected and these differentiated 22 of the 41 accessions screened from all other accessions. None of these haplotypes was shared by more than one species and some were not characteristic of their predicted type. We interpret our results with respect to taxon differentiation, hybridisation and introgression patterns relating to the 'Triangle of U'.