888 results for Genome Sequence
Abstract:
Diversity in the chloroplast genome of 171 accessions representing the Brassica 'C' (n = 9) genome, including domesticated and wild B. oleracea and nine inter-fertile related wild species, was investigated using six chloroplast SSR (microsatellite) markers. The lack of diversity detected among 105 cultivated and wild accessions of B. oleracea contrasted starkly with that found within its wild relatives. The vast majority of B. oleracea accessions shared a single haplotype, whereas as many as six haplotypes were detected in each of two wild species, B. villosa Biv. and B. cretica Lam. The SSRs proved to be highly polymorphic across haplotypes, with calculated genetic diversity values (H) of 0.23-0.87. In total, 23 different haplotypes were detected in C genome species, with an additional five haplotypes detected in B. rapa L. (A genome, n = 10) and another in B. nigra L. (B genome, n = 8). The low chloroplast diversity of B. oleracea is not suggestive of multiple domestication events. The predominant B. oleracea haplotype was also common in B. incana Ten. and present at low frequencies in B. villosa, B. macrocarpa Guss., B. rupestris Raf. and B. cretica. The chloroplast SSRs reveal a wealth of diversity within wild Brassica species that will facilitate further evolutionary and phylogeographic studies of this important crop genus.
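The abstract reports genetic diversity values (H) per marker but does not state the formula. A minimal sketch, assuming H is Nei's gene diversity, H = 1 - sum(p_i^2), where p_i is the frequency of the i-th allele or haplotype; the function name and the sample data are hypothetical and not taken from the study.

```python
from collections import Counter


def gene_diversity(labels, unbiased=False):
    """Return H = 1 - sum(p_i^2) for a list of allele or haplotype labels.

    Assumption: this is Nei's gene diversity; the abstract does not say
    which estimator the authors used. With unbiased=True the small-sample
    correction n / (n - 1) is applied.
    """
    n = len(labels)
    if n == 0:
        raise ValueError("need at least one observation")
    counts = Counter(labels)
    h = 1.0 - sum((c / n) ** 2 for c in counts.values())
    if unbiased:
        if n < 2:
            raise ValueError("unbiased estimate needs at least two observations")
        h *= n / (n - 1)
    return h


# Hypothetical allele calls at one SSR locus: 8 accessions carry allele "A",
# 2 carry allele "B".
sample = ["A"] * 8 + ["B"] * 2
print(gene_diversity(sample))
```

A locus at which every accession carries the same allele gives H = 0, which matches the intuition behind the low diversity reported for B. oleracea.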
Abstract:
The monophyly of the Peltophorum group, one of nine informal groups recognized by Polhill in the Caesalpinieae, was tested using sequence data from the trnL-F, rbcL, and rps16 regions of the chloroplast genome. Exemplars were included from all 16 genera of the Peltophorum group, and from 15 genera representing seven of the other eight informal groups in the tribe. The data were analyzed separately and in combined analyses using parsimony and Bayesian methods. The analysis method had little effect on the topology of well-supported relationships. The molecular data recovered a generally well-supported phylogeny with many intergeneric relationships resolved. Results show that the Peltophorum group as currently delimited is polyphyletic, but that eight genera plus one undescribed genus form a core Peltophorum group, which is referred to here as the Peltophorum group sensu stricto. These genera are Bussea, Conzattia, Colvillea, Delonix, Heteroflorum (inedit.), Lemuropisum, Parkinsonia, Peltophorum, and Schizolobium. The remaining eight genera of the Peltophorum group s.l. are distributed across the Caesalpinieae. Morphological support for the redelimited Peltophorum group and the other recovered clades was assessed, and no unique synapomorphy was found for the Peltophorum group s.s. A proposal for the reclassification of the Peltophorum group s.l. is presented.
Abstract:
A recently emerging bleeding canker disease, caused by Pseudomonas syringae pathovar aesculi (Pae), is threatening European horse chestnut in northwest Europe. Very little is known about the origin and biology of this new disease. We used the nucleotide sequences of seven commonly used marker genes to investigate the phylogeny of three strains isolated recently from bleeding stem cankers on European horse chestnut in Britain (E-Pae). On the basis of these sequences alone, the E-Pae strains were identical to the Pae type-strain (I-Pae), isolated from leaf spots on Indian horse chestnut in India in 1969. The phylogenetic analyses also showed that Pae belongs to a distinct clade of P. syringae pathovars adapted to woody hosts. We generated genome-wide Illumina sequence data from the three E-Pae strains and one strain of I-Pae. Comparative genomic analyses revealed pathovar-specific genomic regions in Pae potentially implicated in virulence on a tree host, including genes for the catabolism of plant-derived aromatic compounds and enterobactin synthesis. Several gene clusters displayed intra-pathovar variation, including those encoding type IV secretion, a novel fatty acid biosynthesis pathway and a sucrose uptake pathway. Rates of single nucleotide polymorphisms in the four Pae genomes indicate that the three E-Pae strains diverged from each other much more recently than they diverged from I-Pae. The very low genetic diversity among the three geographically distinct E-Pae strains suggests that they originate from a single, recent introduction into Britain, thus highlighting the serious environmental risks posed by the spread of an exotic plant pathogenic bacterium to a new geographic location. The genomic regions in Pae that are absent from other P. syringae pathovars that infect herbaceous hosts may represent candidate genetic adaptations to infection of the woody parts of the tree.
Abstract:
Repeat induced point mutation (RIP), a mechanism causing hypermutation of repetitive DNA sequences in fungi, has been described as a ‘genome defense’ which functions to inactivate mobile elements and inhibit their deleterious effects on genome stability. Here we address the interactions between RIP and transposable elements in the Microbotryum violaceum species complex. Ten strains of M. violaceum, most of which belong to different species of the fungus, were all found to contain intragenomic populations of copia-like retrotransposons. Intragenomic DNA sequence variation among the copia-like elements was analyzed for evidence of RIP. Among species with RIP, there was no significant correlation between the frequency of RIP-induced mutations and inferred transposition rate based on diversity. Two strains of M. violaceum, from two different plant species but belonging to the same fungal lineage, contained copia-like elements with very low diversity, as would result from a high transposition rate, and these were also unique in showing no evidence of the hypermutation patterns indicative of the RIP genome defense. In this species, evidence of RIP was also absent from a Class II helitron-like transposable element. However, unexpectedly the absolute repetitive element load was lower than in other strains.
Abstract:
The genome of Salmonella enterica serovar Enteritidis was shown to possess three IS3-like insertion elements, designated IS1230A, B and C; each was cloned and its deoxynucleotide sequence determined. Mutations in elements IS1230A and B resulted in frameshifts in the open reading frames encoding a putative transposase, which appear to render it inactive. IS1230C was truncated at nucleotide 774 relative to IS1230B and therefore did not possess the 3' terminal inverted repeat. The three IS1230 derivatives were closely related to each other based on nucleotide sequence similarity. IS1230A was located adjacent to the sef operon encoding SEF14 fimbriae, at minute 97 of the genome of S. Enteritidis. IS1230B was located adjacent to the umuDC operon at minute 42.5 on the genome, itself located near one terminus of an 815-kb genome inversion of S. Enteritidis relative to S. Typhimurium. IS1230C was located next to attB, the bacteriophage P22 attachment site, and proB, encoding gamma-glutamyl phosphate reductase. A truncated 3' remnant of IS1230, designated IS1230T, was identified in a clinical isolate of S. Typhimurium DT193 strain 2391. This element was located next to attB, adjacent to which were bacteriophage P22-like sequences. Southern hybridisation of total genomic DNA from eighteen phage types of S. Enteritidis and eighteen definitive types of S. Typhimurium showed similar, if not identical, restriction fragment profiles in the respective serovars when probed with IS1230A.
Abstract:
We have performed microarray hybridization studies on 40 clinical isolates from 12 common serovars within Salmonella enterica subspecies I to identify the conserved chromosomal gene pool. We were able to separate the core invariant portion of the genome by a novel mathematical approach using a decision tree based on genes ranked by increasing variance. All genes within the core component were confirmed using available sequence and microarray information for S. enterica subspecies I strains. The majority of genes within the core component had conserved homologues in Escherichia coli K-12 strain MG1655. However, many genes present in the conserved set which were absent or highly divergent in K-12 had close homologues in pathogenic bacteria such as Shigella flexneri and Pseudomonas aeruginosa. Genes within previously established virulence determinants such as SPI1 to SPI5 were conserved. In addition several genes within SPI6, all of SPI9, and three fimbrial operons (fim, bcf, and stb) were conserved within all S. enterica strains included in this study. Although many phage and insertion sequence elements were missing from the core component, approximately half the pseudogenes present in S. enterica serovar Typhi were conserved. Furthermore, approximately half the genes conserved in the core set encoded hypothetical proteins. Separation of the core and variant gene sets within S. enterica subspecies I has offered fundamental biological insight into the genetic basis of phenotypic similarity and diversity across S. enterica subspecies I and shown how the core genome of these pathogens differs from the closely related E. coli K-12 laboratory strain.
Abstract:
Before the advent of genome-wide association studies (GWASs), hundreds of candidate genes for obesity-susceptibility had been identified through a variety of approaches. We examined whether those obesity candidate genes are enriched for associations with body mass index (BMI) compared with non-candidate genes by using data from a large-scale GWAS. A thorough literature search identified 547 candidate genes for obesity-susceptibility based on evidence from animal studies, Mendelian syndromes, linkage studies, genetic association studies and expression studies. Genomic regions were defined as each gene plus 10 kb of flanking sequence on either side, for both candidate and non-candidate genes. We used summary statistics publicly available from the discovery stage of the genome-wide meta-analysis for BMI performed by the Genetic Investigation of Anthropometric Traits consortium in 123 564 individuals. Hypergeometric, rank tail-strength and gene-set enrichment analysis tests were used to test for the enrichment of association in candidate compared with non-candidate genes. The hypergeometric test of enrichment was not significant at the 5% P-value quantile (P = 0.35), but was nominally significant at the 25% quantile (P = 0.015). The rank tail-strength and gene-set enrichment tests were nominally significant for the full set of genes and borderline significant for the subset without SNPs at P < 10^-7. Taken together, the observed evidence for enrichment suggests that the candidate gene approach retains some value. However, the degree of enrichment is small despite the extensive number of candidate genes and the large sample size. Studies that focus on candidate genes have only slightly increased chances of detecting associations, and are likely to miss many true effects in non-candidate genes, at least for obesity-related traits.
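The hypergeometric enrichment test mentioned above asks whether candidate genes are over-represented among the genes falling in a top P-value quantile. A minimal sketch of the upper-tail probability follows, assuming a one-sided test; the function name and the gene counts in the example are hypothetical and do not reproduce the study's data or its exact procedure.

```python
from math import comb


def hypergeom_upper_tail(population, successes, draws, observed):
    """Return P(X >= observed) for a hypergeometric variable X.

    population: total number of genes
    successes:  number of candidate genes in the population
    draws:      number of genes in the top quantile
    observed:   number of candidate genes seen in the top quantile
    """
    max_k = min(successes, draws)
    total = comb(population, draws)
    tail = sum(
        comb(successes, k) * comb(population - successes, draws - k)
        for k in range(observed, max_k + 1)
    )
    return tail / total


# Hypothetical toy numbers: 20 genes, 5 of them candidates; the top quantile
# holds 4 genes, 2 of which are candidates.
p_value = hypergeom_upper_tail(population=20, successes=5, draws=4, observed=2)
print(round(p_value, 4))
```

A small P-value would indicate that more candidate genes fall in the top quantile than expected by chance; the exact arithmetic with `math.comb` avoids floating-point issues for large gene counts.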
Abstract:
The genome of the soil-dwelling heterotrophic N2-fixing Gram-negative bacterium Azotobacter chroococcum NCIMB 8003 (ATCC 4412) (Ac-8003) has been determined. It consists of 7 circular replicons totalling 5,192,291 bp comprising a circular chromosome of 4,591,803 bp and six plasmids pAcX50a, b, c, d, e, f of 10,435, 13,852, 62,783, 69,713, 132,724, and 311,724 bp respectively. The chromosome has a G+C content of 66.27% and the six plasmids have G+C contents of 58.1, 55.3, 56.7, 59.2, 61.9, and 62.6% respectively. The methylome has also been determined and 5 methylation motifs have been identified. The genome also contains a very high number of transposase/inactivated transposase genes from at least 12 of the 17 recognised insertion sequence families. The Ac-8003 genome has been compared with that of Azotobacter vinelandii ATCC BAA-1303 (Av-DJ), a derivative of strain O and the only other member of the Azotobacteraceae whose genome has been determined so far, which has a single chromosome of 5,365,318 bp and no plasmids. The chromosomes show significant stretches of synteny throughout but also reveal a history of many deletion/insertion events. The Ac-8003 genome encodes 4628 predicted protein-encoding genes of which 568 (12.2%) are plasmid-borne. 3048 (65%) of these show > 85% identity to the 5050 protein-encoding genes identified in Av-DJ, and of these 99 are plasmid-borne. The core biosynthetic and metabolic pathways and macromolecular architectures and machineries of these organisms appear largely conserved, including genes for CO-dehydrogenase, formate dehydrogenase and a soluble NiFe-hydrogenase. The genetic bases for many of the detailed phenotypic differences reported for these organisms have also been identified, and many other potential phenotypic differences have been uncovered. Properties endowed by the plasmids are described, including the presence of an entire aerobic corrin synthesis pathway in pAcX50f and the presence of genes for retro-conjugation in pAcX50c. All these findings are related to the potentially different environmental niches from which these organisms were isolated and to emerging theories about how microbes contribute to their communities.
Abstract:
Non-LTR retrotransposons, also known as long interspersed nuclear elements (LINEs), are transposable elements that encode a reverse transcriptase and insert into genomic locations via RNA intermediates. The sequence analysis of a cDNA library constructed from mRNA of the salivary glands of R. americana showed the presence of putative class I elements. The cDNA clone with homology to a reverse transcriptase was the starting point for the present study. Genomic phage was isolated and sequenced, and the molecular structure of the element was characterized as that of a non-LTR retrotransposable element. Southern blot analysis indicated that this transposable element is represented by repeat sequences in the genome of R. americana. Chromosome tips were consistently positive when this element was used as a probe in in situ hybridization. Real-time RT-PCR showed that this retrotransposon is transcribed at different periods of larval development. Most interestingly, the silencing of this retrotransposon in R. americana by RNA interference resulted in reduced transcript levels and in accelerated larval development.
Abstract:
The genome of the most virulent among 22 Brazilian geographical isolates of Spodoptera frugiperda nucleopolyhedrovirus, isolate 19 (SfMNPV-19), was completely sequenced and shown to comprise 132 565 bp and 141 open reading frames (ORFs). A total of 11 ORFs with no homology to genes in the GenBank database were found. Of those, four had typical baculovirus promoter motifs and polyadenylation sites. Computer-simulated restriction enzyme cleavage patterns of SfMNPV-19 were compared with published physical maps of other SfMNPV isolates. Differences were observed in terms of the restriction profiles and genome size. Comparison of SfMNPV-19 with the sequence of the SfMNPV isolate 3AP2 indicated that they differed due to a 1427 bp deletion, as well as by a series of smaller deletions and point mutations. The majority of genes of SfMNPV-19 were conserved in the closely related Spodoptera exigua NPV (SeMNPV) and Agrotis segetum NPV (AgseMNPV-A), but a few regions experienced major changes and rearrangements. Syntenic maps for the genomes of group II NPVs revealed that gene collinearity was observed only within certain clusters. Analysis of the dynamics of gene gain and loss along the phylogenetic tree of the NPVs showed that group II had only five defining genes and supported the hypothesis that these viruses form ten highly divergent ancient lineages. Crucially, more than 60% of the gene gain events followed a power-law relation to genetic distance among baculoviruses, indicative of temporal organization in the gene accretion process.
Abstract:
Motivation: DNA assembly programs classically perform an all-against-all comparison of reads to identify overlaps, followed by a multiple sequence alignment and generation of a consensus sequence. If the aim is to assemble a particular segment, instead of a whole genome or transcriptome, a target-specific assembly is a more sensible approach. GenSeed is a Perl program that implements a seed-driven recursive assembly consisting of cycles comprising a similarity search, read selection and assembly. The iterative process results in a progressive extension of the original seed sequence. GenSeed was tested and validated on many applications, including the reconstruction of nuclear genes or segments, full-length transcripts, and extrachromosomal genomes. The robustness of the method was confirmed through the use of a variety of DNA and protein seeds, including short sequences derived from SAGE and proteome projects.
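The seed-driven cycle described above (search for reads matching the current sequence, select them, extend, repeat) can be illustrated with a toy sketch. This is not GenSeed's implementation, which is a Perl program built around a similarity search and an assembler; here an exact suffix-prefix overlap check stands in for both steps, and all names, parameters and sequences are hypothetical.

```python
def overlap(a, b, min_len):
    """Length of the longest suffix of a that equals a prefix of b.

    Returns 0 if no such overlap of at least min_len characters exists.
    """
    for size in range(min(len(a), len(b)), min_len - 1, -1):
        if a.endswith(b[:size]):
            return size
    return 0


def extend_seed(seed, reads, min_overlap=4, max_cycles=50):
    """Grow a contig from a seed by repeatedly attaching overlapping reads."""
    contig = seed
    for _ in range(max_cycles):
        grown = False
        for read in reads:
            if read in contig:
                continue  # read is already covered by the contig
            if contig in read:
                contig = read  # read spans the whole contig
                grown = True
                continue
            right = overlap(contig, read, min_overlap)
            if right:
                contig += read[right:]  # extend to the right
                grown = True
                continue
            left = overlap(read, contig, min_overlap)
            if left:
                contig = read[:len(read) - left] + contig  # extend to the left
                grown = True
        if not grown:
            break  # no read extended the contig in this cycle
    return contig


# Hypothetical target region, error-free reads sampled from it, and one
# unrelated read that should be ignored.
target = "ATGGCGTACGTTAGCCGATAGGCTA"
reads = [target[i:i + 10] for i in (0, 5, 10, 15)] + ["GGGGGGGGGG"]
seed = target[8:14]
print(extend_seed(seed, reads) == target)
```

Exact matching keeps the sketch short; a realistic tool has to tolerate sequencing errors and repeats, which is why the approach described in the abstract relies on similarity search and a dedicated assembly step in each cycle.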
Abstract:
Hepatitis C virus (HCV) exhibits considerable genetic diversity, but presents a relatively well conserved 5' noncoding region (5' NCR) among all genotypes. In this study, the structural features and translational efficiency of the HCV 5' NCR sequences were analyzed using the programs RNAfold, RNAshapes and RNApdist and a bicistronic dual luciferase expression system, respectively. RNA structure prediction software indicated that base substitutions could potentially alter the 5' NCR structure. The sequence heterogeneity observed in the 5' NCR led to important changes in translation efficiency in different cell culture lines. Interactions of the viral RNA with cellular trans-acting factors may vary according to the cell type and viral genome polymorphisms, which may account for the translational efficiency observed. J. Med. Virol. 81: 1212-1219, 2009. © 2009 Wiley-Liss, Inc.
Abstract:
Paracoccidioides brasiliensis isolates are not homogeneous in their patterns of pathogenicity in animals and adhesion to epithelial cells. During this investigation, genotypic differences were observed between two samples of P. brasiliensis strain 18 yeast phase (Pb18) previously cultured many times, one taken before (Pb18a) and the other after (Pb18b) animal inoculation. Random amplified polymorphic DNA analysis using the primer OPJ4 distinguished Pb18b from Pb18a by one 308 bp DNA fragment, which after cloning and sequencing was shown to encode a polypeptide sequence homologous to the protein beta-adaptin. It is suggested, by comparison to other micro-organisms, that this protein might play an important role in the virulence of P. brasiliensis. This result demonstrates the influence of in vitro subculturing on the genotype of this organism.
Abstract:
To contribute to our understanding of the genome complexity of sugarcane, we undertook a large-scale expressed sequence tag (EST) program. More than 260,000 cDNA clones were partially sequenced from 26 standard cDNA libraries generated from different sugarcane tissues. After the processing of the sequences, 237,954 high-quality ESTs were identified. These ESTs were assembled into 43,141 putative transcripts. Of the assembled sequences, 35.6% presented no matches with existing sequences in public databases. A global analysis of the whole SUCEST data set indicated that 14,409 assembled sequences (33% of the total) contained at least one cDNA clone with a full-length insert. Annotation of the 43,141 assembled sequences associated almost 50% of the putative identified sugarcane genes with protein metabolism, cellular communication/signal transduction, bioenergetics, and stress responses. Inspection of the translated assembled sequences for conserved protein domains revealed 40,821 amino acid sequences with 1415 Pfam domains. Reassembling the consensus sequences of the 43,141 transcripts revealed a 22% redundancy in the first assembly. This indicated that possibly 33,620 unique genes had been identified and that >90% of the sugarcane expressed genes were tagged.
Abstract:
The imprints of domestication and breed development on the genomes of livestock likely differ from those of companion animals. A deep draft sequence assembly of shotgun reads from a single Hereford female and comparative sequences sampled from six additional breeds were used to develop probes to interrogate 37,470 single-nucleotide polymorphisms (SNPs) in 497 cattle from 19 geographically and biologically diverse breeds. These data show that cattle have undergone a rapid recent decrease in effective population size from a very large ancestral population, possibly due to bottlenecks associated with domestication, selection, and breed formation. Domestication and artificial selection appear to have left detectable signatures of selection within the cattle genome, yet the current levels of diversity within breeds are at least as great as exists within humans.