935 resultados para Complete Genome Sequence


Relevância:

40.00% 40.00%

Publicador:

Resumo:

Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Broad-host-range plasmid pRIO-5, harboring the extended-spectrum beta-lactamase bla(BES-1) gene in Serratia marcescens, was fully sequenced. Analysis of the 12,957-bp sequence of this IncP6-type plasmid revealed that the bla(BES-1) gene was associated with two copies of the insertion sequence IS26. The promoter responsible for the bla(BES-1) expression was hybrid, made of a - 35 box located inside the inverted repeat of IS26 and a - 10 box inside a remnant of an insertion sequence.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Since a genome is a discrete sequence, the elements of which belong to a set of four letters, the question as to whether or not there is an error-correcting code underlying DNA sequences is unavoidable. The most common approach to answering this question is to propose a methodology to verify the existence of such a code. However, none of the methodologies proposed so far, although quite clever, has achieved that goal. In a recent work, we showed that DNA sequences can be identified as codewords in a class of cyclic error-correcting codes known as Hamming codes. In this paper, we show that a complete intron-exon gene, and even a plasmid genome, can be identified as a Hamming code codeword as well. Although this does not constitute a definitive proof that there is an error-correcting code underlying DNA sequences, it is the first evidence in this direction.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

This PhD Thesis is the result of my research activity in the last three years. My main research interest was centered on the evolution of mitochondrial genome (mtDNA), and on its usefulness as a phylogeographic and phylogenetic marker at different taxonomic levels in different taxa of Metazoa. From a methodological standpoint, my main effort was dedicated to the sequencing of complete mitochondrial genomes, and the approach to whole-genome sequencing was based on the application of Long-PCR and shotgun sequences. Moreover, this research project is a part of a bigger sequencing project of mtDNAs in many different Metazoans’ taxa, and I mostly dedicated myself to sequence and analyze mtDNAs in selected taxa of bivalves and hexapods (Insecta). Sequences of bivalve mtDNAs are particularly limited, and my study contributed to extend the sampling. Moreover, I used the bivalve Musculista senhousia as model taxon to investigate the molecular mechanisms and the evolutionary significance of their aberrant mode of mitochondrial inheritance (Doubly Uniparental Inheritance, see below). In Insects, I focused my attention on the Genus Bacillus (Insecta Phasmida). A detailed phylogenetic analysis was performed in order to assess phylogenetic relationships within the genus, and to investigate the placement of Phasmida in the phylogenetic tree of Insecta. The main goal of this part of my study was to add to the taxonomic coverage of sequenced mtDNAs in basal insects, which were only partially analyzed.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Bioinformatics, in the last few decades, has played a fundamental role to give sense to the huge amount of data produced. Obtained the complete sequence of a genome, the major problem of knowing as much as possible of its coding regions, is crucial. Protein sequence annotation is challenging and, due to the size of the problem, only computational approaches can provide a feasible solution. As it has been recently pointed out by the Critical Assessment of Function Annotations (CAFA), most accurate methods are those based on the transfer-by-homology approach and the most incisive contribution is given by cross-genome comparisons. In the present thesis it is described a non-hierarchical sequence clustering method for protein automatic large-scale annotation, called “The Bologna Annotation Resource Plus” (BAR+). The method is based on an all-against-all alignment of more than 13 millions protein sequences characterized by a very stringent metric. BAR+ can safely transfer functional features (Gene Ontology and Pfam terms) inside clusters by means of a statistical validation, even in the case of multi-domain proteins. Within BAR+ clusters it is also possible to transfer the three dimensional structure (when a template is available). This is possible by the way of cluster-specific HMM profiles that can be used to calculate reliable template-to-target alignments even in the case of distantly related proteins (sequence identity < 30%). Other BAR+ based applications have been developed during my doctorate including the prediction of Magnesium binding sites in human proteins, the ABC transporters superfamily classification and the functional prediction (GO terms) of the CAFA targets. Remarkably, in the CAFA assessment, BAR+ placed among the ten most accurate methods. At present, as a web server for the functional and structural protein sequence annotation, BAR+ is freely available at http://bar.biocomp.unibo.it/bar2.0.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

A porcine BAC clone harboring the tightly linked IFNAR1 and IFNGR2 genes was identified by comparative analysis of the publicly available porcine BAC end sequences. The complete 168,835 bp insert sequence of this clone was determined. Sequence comparisons of the genomic sequence with EST sequences from public databases were performed and allowed a detailed annotation of the IFNAR1 and IFNGR2 genes. The analyzed genes showed a conserved genomic organization with their known mammalian orthologs, however the sequence conservation of these genes across species was relatively low. In addition to the IFNAR1 and IFNGR2 genes, which were completely sequenced, the analyzed BAC clone also contained parts of an orphan gene encoding a putative transmembrane protein (TMEM50B). In contrast to the IFNAR1 and IFNGR2 genes the sequence conservation of the TMEM50B gene across different mammalian species was extremely high.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

The gene for agouti signaling protein (ASIP) is centrally involved in the expression of coat color traits in animals. The Mangalitza pig breed is characterized by a black-and-tan phenotype with black dorsal pigmentation and yellow or white ventral pigmentation. We investigated a Mangalitza x Piétrain cross and observed a coat color segregation pattern in the F2 generation that can be explained by virtue of two alleles at the MC1R locus and two alleles at the ASIP locus. Complete linkage of the black-and-tan phenotype to microsatellite alleles at the ASIP locus on SSC 17q21 was observed. Corroborated by the knowledge of similar mouse coat color mutants, it seems therefore conceivable that the black-and-tan pigmentation of Mangalitza pigs is caused by an ASIP allele a(t), which is recessive to the wild-type allele A. Toward positional cloning of the a(t) mutation, a 200-kb genomic BAC/PAC contig of this chromosomal region has been constructed and subsequently sequenced. Full-length ASIP cDNAs obtained by RACE differed in their 5' untranslated regions, whereas they shared a common open reading frame. Comparative sequencing of all ASIP exons and ASIP cDNAs between Mangalitza and Piétrain pigs did not reveal any differences associated with the coat color phenotype. Relative qRT-PCR analyses showed different dorsoventral skin expression intensities of the five ASIP transcripts in black-and-tan Mangalitza. The a(t) mutation is therefore probably a regulatory ASIP mutation that alters its dorsoventral expression pattern.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Genome predictions based on selected genes would be a very welcome approach for taxonomic studies, including DNA-DNA similarity, G+C content and representative phylogeny of bacteria. At present, DNA-DNA hybridizations are still considered the gold standard in species descriptions. However, this method is time-consuming and troublesome, and datasets can vary significantly between experiments as well as between laboratories. For the same reasons, full matrix hybridizations are rarely performed, weakening the significance of the results obtained. The authors established a universal sequencing approach for the three genes recN, rpoA and thdF for the Pasteurellaceae, and determined if the sequences could be used for predicting DNA-DNA relatedness within the family. The sequence-based similarity values calculated using a previously published formula proved most useful for species and genus separation, indicating that this method provides better resolution and no experimental variation compared to hybridization. By this method, cross-comparisons within the family over species and genus borders easily become possible. The three genes also serve as an indicator of the genome G+C content of a species. A mean divergence of around 1 % was observed from the classical method, which in itself has poor reproducibility. Finally, the three genes can be used alone or in combination with already-established 16S rRNA, rpoB and infB gene-sequencing strategies in a multisequence-based phylogeny for the family Pasteurellaceae. It is proposed to use the three sequences as a taxonomic tool, replacing DNA-DNA hybridization.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

The growing knowledge on physiology, cell biology and biochemistry of the reproductive organs has provided many insights into molecular mechanisms that are required for successful reproduction. Research directed at the investigation of reproduction physiology in domestic animals was hampered in the past by a lack of species-specific genomic information. The genome sequences of dog, cattle and horse have become publicly available in 2005, 2006 and 2007 respectively. Although the gene content of mammalian genomes is generally very similar, genes involved in reproduction tend to be less conserved than the average mammalian gene. The availability of genome sequences provides a valuable resource to check whether any protein that may be known from human or mouse research is present in cattle and/or horse as well. Currently there are more than 200 genes known that are involved in the production of fertile sperm cells. Great progress has been made in the understanding of genetic aberrations that lead to male infertility. Additionally, the first genetic mechanisms are being discovered that contribute to the quantitative variation of fertility traits in fertile male animals. Here, I will review some selected aspects of genetic research in male fertility and offer some perspectives for the use of genomic sequence information.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Thirteen spontaneous multiple-antibiotic-resistant (Mar) mutants of Escherichia coli AG100 were isolated on Luria-Bertani (LB) agar in the presence of tetracycline (4 microg/ml). The phenotype was linked to insertion sequence (IS) insertions in marR or acrR or unstable large tandem genomic amplifications which included acrAB and which were bordered by IS3 or IS5 sequences. Five different lon mutations, not related to the Mar phenotype, were also found in 12 of the 13 mutants. Under specific selective conditions, most drug-resistant mutants appearing late on the selective plates evolved from a subpopulation of AG100 with lon mutations. That the lon locus was involved in the evolution to low levels of multidrug resistance was supported by the following findings: (i) AG100 grown in LB broth had an important spontaneous subpopulation (about 3.7x10(-4)) of lon::IS186 mutants, (ii) new lon mutants appeared during the selection on antibiotic-containing agar plates, (iii) lon mutants could slowly grow in the presence of low amounts (about 2x MIC of the wild type) of chloramphenicol or tetracycline, and (iv) a lon mutation conferred a mutator phenotype which increased IS transposition and genome rearrangements. The association between lon mutations and mutations causing the Mar phenotype was dependent on the medium (LB versus MacConkey medium) and the antibiotic used for the selection. A previously reported unstable amplifiable high-level resistance observed after the prolonged growth of Mar mutants in a low concentration of tetracycline or chloramphenicol can be explained by genomic amplification.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

BACKGROUND: The mollicute Mycoplasma conjunctivae is the etiological agent leading to infectious keratoconjunctivitis (IKC) in domestic sheep and wild caprinae. Although this pathogen is relatively benign for domestic animals treated by antibiotics, it can lead wild animals to blindness and death. This is a major cause of death in the protected species in the Alps (e.g., Capra ibex, Rupicapra rupicapra). METHODS: The genome was sequenced using a combined technique of GS-FLX (454) and Sanger sequencing, and annotated by an automatic pipeline that we designed using several tools interconnected via PERL scripts. The resulting annotations are stored in a MySQL database. RESULTS: The annotated sequence is deposited in the EMBL database (FM864216) and uploaded into the mollicutes database MolliGen http://cbi.labri.fr/outils/molligen/ allowing for comparative genomics. CONCLUSION: We show that our automatic pipeline allows for annotating a complete mycoplasma genome and present several examples of analysis in search for biological targets (e.g., pathogenic proteins).

Relevância:

40.00% 40.00%

Publicador:

Resumo:

HIV-1 sequence diversity is affected by selection pressures arising from host genomic factors. Using paired human and viral data from 1071 individuals, we ran >3000 genome-wide scans, testing for associations between host DNA polymorphisms, HIV-1 sequence variation and plasma viral load (VL), while considering human and viral population structure. We observed significant human SNP associations to a total of 48 HIV-1 amino acid variants (p<2.4 × 10−12). All associated SNPs mapped to the HLA class I region. Clinical relevance of host and pathogen variation was assessed using VL results. We identified two critical advantages to the use of viral variation for identifying host factors: (1) association signals are much stronger for HIV-1 sequence variants than VL, reflecting the ‘intermediate phenotype’ nature of viral variation; (2) association testing can be run without any clinical data. The proposed genome-to-genome approach highlights sites of genomic conflict and is a strategy generally applicable to studies of host–pathogen interaction.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

BACKGROUND: Enterococcus faecalis has emerged as a major hospital pathogen. To explore its diversity, we sequenced E. faecalis strain OG1RF, which is commonly used for molecular manipulation and virulence studies. RESULTS: The 2,739,625 base pair chromosome of OG1RF was found to contain approximately 232 kilobases unique to this strain compared to V583, the only publicly available sequenced strain. Almost no mobile genetic elements were found in OG1RF. The 64 areas of divergence were classified into three categories. First, OG1RF carries 39 unique regions, including 2 CRISPR loci and a new WxL locus. Second, we found nine replacements where a sequence specific to V583 was substituted by a sequence specific to OG1RF. For example, the iol operon of OG1RF replaces a possible prophage and the vanB transposon in V583. Finally, we found 16 regions that were present in V583 but missing from OG1RF, including the proposed pathogenicity island, several probable prophages, and the cpsCDEFGHIJK capsular polysaccharide operon. OG1RF was more rapidly but less frequently lethal than V583 in the mouse peritonitis model and considerably outcompeted V583 in a murine model of urinary tract infections. CONCLUSION: E. faecalis OG1RF carries a number of unique loci compared to V583, but the almost complete lack of mobile genetic elements demonstrates that this is not a defining feature of the species. Additionally, OG1RF's effects in experimental models suggest that mediators of virulence may be diverse between different E. faecalis strains and that virulence is not dependent on the presence of mobile genetic elements.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Ultrastructural analysis of the polydnavirus of the braconid wasp Chelonus inanitus revealed that virions consist of one cylindrical nucleocapsid enveloped by a single unit membrane. Nucleocapsids have a constant diameter of 33.7 +/- 1.4 nm and a variable length of between 8 and 46 nm. Spreading of viral DNA showed that the genome consists of circular dsDNA molecules of variable sizes and measurement of the contour lengths indicated sizes of between 7 and 31 kbp. When virions were exposed to osmotic shock conditions to release the DNA, only one circular molecule was released per particle suggesting that the various DNA molecules are singly encapsidated in this bracovirus. The viral genome was seen to consist of at least 10 different segments and the aggregate genome size is in the order of 200 kbp. By partial digestion of viral DNA with HindIII or EcoRI in the presence of ethidium bromide and subsequent ligation with HindIII-cut pSP65 or EcoRI-cut pSP64 and transfection into Escherichia coli, libraries of 103 HindIII and 23 EcoRI clones were obtained. Southern blots revealed that complete and unrearranged segments were cloned with this approach, and restriction maps for five segments were obtained. Part of a 16.8 kbp segment was sequenced, found to be AT-rich (73%) and to contain six copies of a 17 bp repeated sequence. The development of the female reproductive tract in the course of pupal-adult development of the wasp was investigated and seen to be strictly correlated with the pigmentation pattern. By the use of a semiquantitative PCR, replication of viral DNA was observed to initiate at a specific stage of pupal-adult development.