109 resultados para Sequence alignment
em Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (BDPI/USP)
Resumo:
Motivation: DNA assembly programs classically perform an all-against-all comparison of reads to identify overlaps, followed by a multiple sequence alignment and generation of a consensus sequence. If the aim is to assemble a particular segment, instead of a whole genome or transcriptome, a target-specific assembly is a more sensible approach. GenSeed is a Perl program that implements a seed-driven recursive assembly consisting of cycles comprising a similarity search, read selection and assembly. The iterative process results in a progressive extension of the original seed sequence. GenSeed was tested and validated on many applications, including the reconstruction of nuclear genes or segments, full-length transcripts, and extrachromosomal genomes. The robustness of the method was confirmed through the use of a variety of DNA and protein seeds, including short sequences derived from SAGE and proteome projects.
Resumo:
We present the genome sequences of a new clinical isolate of the important human pathogen, Aspergillus fumigatus, A1163, and two closely related but rarely pathogenic species, Neosartorya fischeri NRRL181 and Aspergillus clavatus NRRL1. Comparative genomic analysis of A1163 with the recently sequenced A. fumigatus isolate Af293 has identified core, variable and up to 2% unique genes in each genome. While the core genes are 99.8% identical at the nucleotide level, identity for variable genes can be as low 40%. The most divergent loci appear to contain heterokaryon incompatibility ( het) genes associated with fungal programmed cell death such as developmental regulator rosA. Cross-species comparison has revealed that 8.5%, 13.5% and 12.6%, respectively, of A. fumigatus, N. fischeri and A. clavatus genes are species-specific. These genes are significantly smaller in size than core genes, contain fewer exons and exhibit a subtelomeric bias. Most of them cluster together in 13 chromosomal islands, which are enriched for pseudogenes, transposons and other repetitive elements. At least 20% of A. fumigatus-specific genes appear to be functional and involved in carbohydrate and chitin catabolism, transport, detoxification, secondary metabolism and other functions that may facilitate the adaptation to heterogeneous environments such as soil or a mammalian host. Contrary to what was suggested previously, their origin cannot be attributed to horizontal gene transfer ( HGT), but instead is likely to involve duplication, diversification and differential gene loss (DDL). The role of duplication in the origin of lineage-specific genes is further underlined by the discovery of genomic islands that seem to function as designated ""gene dumps'' and, perhaps, simultaneously, as "" gene factories''.
Resumo:
The freshwater prawn Macrobrachium amazonicum is widely distributed in South America, and occupies habitats with a wide range of salinities. Several investigations have revealed the existence of wide intraspecific variability among different populations, although the understanding of this variability is still fragmentary and incomplete. We compared and characterized inland and coastal populations of M. amazonicum from Brazil, using molecular data (16S and COI mtDNA) to describe the degree of variability, structure, and relationships among them. Genetic divergence rates among populations showed variability at the intraspecific level. All the analyses evidenced significant genetic divergence among populations, structuring them in three groups: I-inland waters of the Amazonian Hydrographic Region (HR); II-Parana/Paraguay HR; and III-coastal systems of northern and northeastern Brazil. Phylogenetic reconstructions revealed that the populations form a single monophyletic clade, which supports their characterization as a single species. Clade I was a sister clade of that formed by clades II and III, which were themselves sister clades. Populations from Sertaozinho/Miguelopolis and Avare, introduced into the state of Sao Paulo, may have originated from natural populations in the states of Mato Grosso do Sul and Para, respectively. Geographical isolation probably contributed to the observed variation, and if this isolation continues. M. amazonicum may undergo speciation within its broad geographical distribution. The sequences obtained here can be used as name-tags for population identification, and the DNA barcodes are useful to identify the origin of specimens used in different freshwater-prawn cultures or introduced populations of unknown origin.
Resumo:
Background: Rhipicephalus sanguineus, known as the brown dog tick, is a common ectoparasite of domestic dogs and can be found worldwide. R. sanguineus is recognized as the primary vector of the etiological agent of canine monocytic ehrlichiosis and canine babesiosis. Here we present the first description of a R. sanguineus salivary gland transcriptome by the production and analysis of 2,034 expressed sequence tags (EST) from two cDNA libraries, one consctructed using mRNA from dissected salivary glands from female ticks fed for 3-5 days (early to mid library, RsSGL1) and the another from ticks fed for 5 days (mid library, RsSGL2), identifying 1,024 clusters of related sequences. Results: Based on sequence similarities to nine different databases, we identified transcripts of genes that were further categorized according to function. The category of putative housekeeping genes contained similar to 56% of the sequences and had on average 2.49 ESTs per cluster, the secreted protein category contained 26.6% of the ESTs and had 2.47 EST's/clusters, while 15.3% of the ESTs, mostly singletons, were not classifiable, and were annotated as ""unknown function"". The secreted category included genes that coded for lipocalins, proteases inhibitors, disintegrins, metalloproteases, immunomodulatory and antiinflammatory proteins, as Evasins and Da-p36, as well as basic-tail and 18.3 kDa proteins, cement proteins, mucins, defensins and antimicrobial peptides. Comparison of the abundance of ESTs from similar contigs of the two salivary gland cDNA libraries allowed the identification of differentially expressed genes, such as genes coding for Evasins and a thrombin inhibitor, which were over expressed in the RsSGL1 (early to mid library) versus RsSGL2 (mid library), indicating their role in inhibition of inflammation at the tick feeding site from the very beginning of the blood meal. Conversely, sequences related to cement (64P), which function has been correlated with tick attachment, was largely expressed in the mid library. Conclusions: Our survey provided an insight into the R. sanguineus sialotranscriptome, which can assist the discovery of new targets for anti-tick vaccines, as well as help to identify pharmacologically active proteins.
Resumo:
Background: Mites (Acari) have traditionally been treated as monophyletic, albeit composed of two major lineages: Acariformes and Parasitiformes. Yet recent studies based on morphology, molecular data, or combinations thereof, have increasingly drawn their monophyly into question. Furthermore, the usually basal (molecular) position of one or both mite lineages among the chelicerates is in conflict to their morphology, and to the widely accepted view that mites are close relatives of Ricinulei. Results: The phylogenetic position of the acariform mites is examined through employing SSU, partial LSU sequences, and morphology from 91 chelicerate extant terminals (forty Acariformes). In a static homology framework, molecular sequences were aligned using their secondary structure as guide, whereby regions of ambiguous alignment were discarded, and pre-aligned sequences analyzed under parsimony and different mixed models in a Bayesian inference. Parsimony and Bayesian analyses led to trees largely congruent concerning infraordinal, well-supported branches, but with low support for inter-ordinal relationships. An exception is Solifugae + Acariformes (P. P = 100%, J. = 0.91). In a dynamic homology framework, two analyses were run: a standard POY analysis and an analysis constrained by secondary structure. Both analyses led to largely congruent trees; supporting a (Palpigradi (Solifugae Acariformes)) clade and Ricinulei as sister group of Tetrapulmonata with the topology (Ricinulei (Amblypygi (Uropygi Araneae))). Combined analysis with two different morphological data matrices were run in order to evaluate the impact of constraining the analysis on the recovered topology when employing secondary structure as a guide for homology establishment. The constrained combined analysis yielded two topologies similar to the exclusively molecular analysis for both morphological matrices, except for the recovery of Pedipalpi instead of the (Uropygi Araneae) clade. The standard (direct optimization) POY analysis, however, led to the recovery of trees differing in the absence of the otherwise well-supported group Solifugae + Acariformes. Conclusions: Previous studies combining ribosomal sequences and morphology often recovered topologies similar to purely morphological analyses of Chelicerata. The apparent stability of certain clades not recovered here, like Haplocnemata and Acari, is regarded as a byproduct of the way the molecular homology was previously established using the instrumentalist approach implemented in POY. Constraining the analysis by a priori homology assessment is defended here as a way of maintaining the severity of the test when adding new data to the analysis. Although the strength of the method advocated here is keeping phylogenetic information from regions usually discarded in an exclusively static homology framework; it still has the inconvenience of being uninformative on the effect of alignment ambiguity on resampling methods of clade support estimation. Finally, putative morphological apomorphies of Solifugae + Acariformes are the reduction of the proximal cheliceral podomere, medial abutting of the leg coxae, loss of sperm nuclear membrane, and presence of differentiated germinative and secretory regions in the testis delivering their products into a common lumen.
Resumo:
The cyanobacterial population in the Cajati waste stabilization pond system (WSP) from Sao Paulo State, Brazil was assessed by cell isolation and direct microscope counting techniques. Ten strains, belonging to five genera (Synechococcus, Merismopedia, Leptolyngbya, Limnothrix, and Nostoc), were isolated and identified by morphological and molecular analyses. Morphological identification of the isolated strains was congruent with their phylogenetic analyses based on 16S rDNA gene sequences. Six cyanobacterial genera (Synechocystis, Aphanocapsa, Merismopedia, Lyngbya, Phormidium, and Pseudanabaena) were identified by direct microscope inspection. Both techniques were complementary, since, of the six genera identified by direct microscopic inspection, only Merismopedia was isolated, and the four other isolated genera were not detected by direct inspection. Direct microscope counting of preserved cells showed that cyanobacteria were the dominant members (> 90%) of the phytoplankton community during both periods evaluated (summer and autumn). ELISA tests specific for hepatotoxicmicrocystins gave positive results for six strains (Synechococcus CENA108, Merismopedia CENA106, Leptolyngbya CENA103, Leptolyngbya CENA112, Limnothrix CENA109, and Limnothrix CENA110), and for wastewater samples collected from raw influent (3.70 mu g microcystins/l) and treated effluent (3.74 mu g microcystins/l) in summer. Our findings indicate that toxic cyanobacteria in WSP systems are of concern, since the treated effluent containing cyanotoxins will be discharged into rivers, irrigation channels, estuaries, or reservoirs, and can affect human and animal health.
Resumo:
The production of hydrogen from soft-drink wastewater in two upflow anaerobic packed-bed reactors was evaluated. The results show that soft-drink wastewater is a good source for hydrogen generation. Data from both reactors indicate that the reactor without medium containing macro- and micronutrients (R2) provided a higher hydrogen yield (3.5 mol H(2) mol(-1) of sucrose) as compared to the reactor (R1) with a nutrient-containing medium (3.3 mol H(2) mol(-1) of sucrose). Reactor R2 continuously produced hydrogen, whereas reactor R1 exhibited a short period of production and produced lower amounts of hydrogen. Better hydrogen production rates and percentages of biogas were also observed for reactor R2, which produced 0.4 L h(-1) L(-1) and 15.8% of H(2), compared to reactor R1, which produced 0.2 L h(-1) L(-1) and 2.6% of H(2). The difference in performance between the reactors was likely due to changes in the metabolic pathway for hydrogen production and decreases in bed porosity as a result of excessive biomass growth in reactor R1. Molecular biological analyses of samples from reactors R1 and R2 indicated the presence of several microorganisms, including Clostridium (91% similarity), Enterobacter (93% similarity) and Klebsiella (97% similarity). Copyright (C) 2011, Hydrogen Energy Publications, LLC. Published by Elsevier Ltd. All rights reserved.
Resumo:
In February 2007, sweet orange trees with characteristic symptoms of huanglongbing (HLB) were encountered in a region of Sao Paulo state (SPs) hitherto free of HLB. These trees tested negative for the three liberibacter species associated with HLB. A polymerase chain reaction (PCR) product from symptomatic fruit columella DNA amplifications with universal primers fDI/rPI was cloned and sequenced. The corresponding agent was found to have highest 16S rDNA sequence identity (99%) with the Pigeon pea witches`-broom phytoplasma of group 16Sr IX. Sequences of PCR products obtained with phytoplasma 16S rDNA primer pairs fU5/rU3, fU5/P7 confirm these result.,;. With two primers D7f2/D7r2 designed based oil the 16S rDNA Sequence of the cloned DNA fragment, positive amplifications were obtained from more than one hundred samples including symptomatic fruits and blotchy mottle leaves. Samples positive for phytoplasmas were negative for liberibacters, except for four samples, which were positive for both the phytoplasma and `Candidatus Liberibacter asiaticus`. The phytoplasma was detected by electron microscopy in the sieve tubes of midribs from symptomatic leaves. These results Show that a phytoplasma of group IX is associated with citrus HLB symptoms ill northern, central, and Southern SPs. This phytoplasma has very probably been transmitted to citrus from an external Source of inoculum, but the Putative insect vector is not yet known.
Resumo:
An inhibitory protein that neutralizes the enzymatic, toxic and pharmacological activities of several phospholipases A(2) from Bothrops venoms was isolated from B. jararacussu snake plasma by affinity chromatography using the immobilized myotoxin BthTX-I on Sepharose gel. Biochemical characterization of this inhibitory protein, denominated alpha BjussuMIP, showed it to be an oligomeric glycoprotein with M-r of 24,000 for the monomeric subunit. Secondary structural analysis by circular dichroism revealed 44% alpha-helix, 18% beta-sheet, 10% beta-turn and 28% random coil structures. Circular dichroism spectroscopy indicated that no significant alterations in the secondary structure of either alpha BjussuMIP or the target protein occur following their interaction. The product from the reaction with reverse transcriptase produced a cDNA fragment of 432 bp that codifies for a mature protein of 144 amino acid residues. The first 21 amino acid residues from the N-terminal and five tryptic peptides were characterized by mass spectrometry of the mature protein and confirmed by the nucleotide sequence. Alignment of alpha BjussuMIP with other snake inhibitors showed a sequence similarity of 73-92% with these alpha PLIs. alpha BjussuMIP was relatively stable within the pH range of 6-12 and temperatures from 0 degrees C to 80 degrees C, even after deglycosylation. The results showed effects against Bothrops phospholipase A(2) activities (enzymatic, edema inducing, myotoxic, cytotoxic and bactericidal), suggesting that alpha BjussuMIP may prove useful in the treatment of snakebite envenomations. (C) 2008 Elsevier Masson SAS. All rights reserved.
Resumo:
Drosophila antonietae is a cactophilic species that is found in the mesophilic forest of the Parana`-Paraguay river basin and in the dunes of the South Atlantic coast of Brazil. Although the genetic structure of the Parana`-Paraguay river basin populations has already been established, the relationship between these populations and those on the Atlantic coast is controversial. In this study, we compared 33 repetitive units of pBuM-2 satellite DNA isolated from individuals from 8 populations of D. antonietae in these geographic regions, including some populations found within a contact zone with the closely related D. serido. The pBuM-2 sequences showed low interpopulational variability. This result was interpreted as a consequence of both gene flow among the populations and unequal crossing over promoting homogenization of the tandem arrays. The results presented here, together with those of previous studies, highlight the use of pBuM-2 for solving taxonomic conflicts within the D. buzzatii species cluster.
Resumo:
The genus Macrobrachium Bate, 1868 is one of the best examples of widespread crustacean genera distributed globally throughout tropical and subtropical waters. Previous investigators have noted the systematic complexity of the group, and have suggested rearrangements within the family Palaemonidae. Our phylogenetic analysis of new mitochondrial DNA sequences of 58 species of Macrobrachium distributed mainly in America support the hypothesis of monophyly of this genus, if Cryphiops Dana, 1852 is accepted as a generic synonym. We concluded that the independent evolution of different types of life cycle (abbreviated larval development-ALD and extended larval development-ELD) must have occurred more than once in the history of the group. Similarly, we also concluded that the current type species of the genus, Macrobrachium americanum Bate, 1868, should not be considered valid, as previously proposed. The synonymy of two members of the `olfersi` species complex (M. birai Lobao, Melo&Fernandes, 1986 and M. holthuisi Genofre&Lobao, 1978) with M. olfersi (Wiegmann, 1836) was confirmed. Similar results were found in comparing M. petronioi Melo, Lobao&Fernandes, 1986 and M. potiuna (Muller, 1880), in which the genetic divergence placed M. petronioi within the level of intraspecific variation of M. potiuna. The taxonomic status of the genus Cryphiops, as well as theories on the origin of Macrobrachium, is also called into question.
Resumo:
The current taxonomy of two poorly known hermit crab species Pagurus forceps H. Milne Edwards, 1836 and Pagurus comptus White, 1847 from temperate Pacific and Atlantic coastlines of South America is based only on adult morphology. Past studies have questioned the separation of these two very similar species, which occur sympatrically. We included specimens morphologically assignable to P. forceps and P. comptus in a phylogenetic analysis, along with other selected anomuran decapods, based on 16S ribosomal gene sequences. Differences between samples putatively assigned to either P. forceps and P. comptus were moderate, with sequence similarity ranging from 98.2 to 99.4% for the fragments analyzed. Our comparison of mitochondrial DNA sequences (16S rRNA) revealed diagnostic differences between the two putative species, suggesting that P. forceps and P. comptus are indeed phylogenetically close but different species, with no genetic justification to support their synonymization. The polyphyly of Pagurus is not corroborated here among the represented Atlantic species, despite obviously complex relationships among the members of the genus.
Resumo:
Lipopeptides produced by Bacillus subtilis are known for their high antifungal activity. The aim of this paper is to show that at high concentration they can damage the surface ultra-structure of bacterial cells. A lipopeptide extract containing iturin and surfactin (5 mg mL-1) was prepared after isolation from B. subtilis (strain OG) by solid phase extraction. Analysis by atomic force microscope (AFM) showed that upon evaporation, lipopeptides form large aggregates (0.1-0.2 mu m2) on the substrates silicon and mica. When the same solution is incubated with fungi and bacteria and the system is allowed to evaporate, dramatic changes are observed on the cells. AFM micrographs show disintegration of the hyphae of Phomopsis phaseoli and the cell walls of Xanthomonas campestris and X. axonopodis. Collapses to fungal and bacterial cells may be a result of formation of pores triggered by micelles and lamellar structures, which are formed above the critical micelar concentration of lipopeptides. As observed for P. phaseoli, the process involves binding, solubilization, and formation of novel structures in which cell wall components are solubilized within lipopeptide vesicles. This is the first report presenting evidences that vesicles of uncharged and negatively charged lipopeptides can alter the morphology of gram-negative bacteria.
Resumo:
Aminoacyl-transfer RNA (tRNA) synthetases (aaRS) are key players in translation and act early in protein synthesis by mediating the attachment of amino acids to their cognate tRNA molecules. In plants, protein synthesis may occur in three subcellular compartments (cytosol, mitochondria, and chloroplasts), which requires multiple versions of the protein to be correctly delivered to its proper destination. The organellar aaRS are nuclear encoded and equipped with targeting information at the N-terminal sequence, which enables them to be specifically translocated to their final location. Most of the aaRS families present organellar proteins that are dual targeted to mitochondria and chloroplasts. Here, we examine the dual targeting behavior of aaRS from an evolutionary perspective. Our results show that Arabidopsis thaliana aaRS sequences are a result of a horizontal gene transfer event from bacteria. However, there is no evident bias indicating one single ancestor (Cyanobacteria or Proteobacteria). The dual-targeted aaRS phylogenetic relationship was characterized into two different categories (paralogs and homologs) depending on the state recovered for both dual-targeted and cytosolic proteins. Taken together, our results suggest that the dual-targeted condition is a gain-of-function derived from gene duplication. Selection may have maintained the original function in at least one of the copies as the additional copies diverged.
Resumo:
Although many mathematical models exist predicting the dynamics of transposable elements (TEs), there is a lack of available empirical data to validate these models and inherent assumptions. Genomes can provide a snapshot of several TE families in a single organism, and these could have their demographics inferred by coalescent analysis, allowing for the testing of theories on TE amplification dynamics. Using the available genomes of the mosquitoes Aedes aegypti and Anopheles gambiae, we indicate that such an approach is feasible. Our analysis follows four steps: (1) mining the two mosquito genomes currently available in search of TE families; (2) fitting, to selected families found in (1), a phylogeny tree under the general time-reversible (GTR) nucleotide substitution model with an uncorrelated lognormal (UCLN) relaxed clock and a nonparametric demographic model; (3) fitting a nonparametric coalescent model to the tree generated in (2); and (4) fitting parametric models motivated by ecological theories to the curve generated in (3).