14 resultados para SEQUENCE ALIGNMENT

em Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (BDPI/USP)


Relevância:

70.00% 70.00%

Publicador:

Resumo:

Motivation: DNA assembly programs classically perform an all-against-all comparison of reads to identify overlaps, followed by a multiple sequence alignment and generation of a consensus sequence. If the aim is to assemble a particular segment, instead of a whole genome or transcriptome, a target-specific assembly is a more sensible approach. GenSeed is a Perl program that implements a seed-driven recursive assembly consisting of cycles comprising a similarity search, read selection and assembly. The iterative process results in a progressive extension of the original seed sequence. GenSeed was tested and validated on many applications, including the reconstruction of nuclear genes or segments, full-length transcripts, and extrachromosomal genomes. The robustness of the method was confirmed through the use of a variety of DNA and protein seeds, including short sequences derived from SAGE and proteome projects.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Eukaryotic genome expansion/retraction caused by LTR-retrotransposon activity is dependent on the expression of full length copies to trigger efficient transposition and recombination-driven events. The Tnt1 family of retrotransposons has served as a model to evaluate the diversity among closely related elements within Solanaceae species and found that members of the family vary mainly in their U3 region of the long terminal repeats (LTRs). Recovery of a full length genomic copy of Retrosol was performed through a PCR-based approach from wild potato, Solanum oplocense. Further characterization focusing on both LTR sequences of the amplified copy allowed estimating an approximate insertion time at 2 million years ago thus supporting the occurrence of transposition cycles after genus divergence. Copy number of Tnt1-like elements in Solanum species were determined through genomic quantitative PCR whereby results sustain that Retrosol in Solanum species is a low copy number retrotransposon (1-4 copies) while Retrolyc1 has an intermediate copy number (38 copies) in S. peruvianum. Comparative analysis of retrotransposon content revealed no correlation between genome size or ploidy level and Retrosol copy number. The tetraploid cultivated potato with a cellular genome size of 1,715 Mbp harbours similar copy number per monoploid genome than other diploid Solanum species (613-884 Mbp). Conversely, S. peruvianum genome (1,125 Mbp) has a higher copy number. These results point towards a lineage specific dynamic flux regarding the history of amplification/activity of Tnt1-like elements in the genome of Solanum species.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The enzymatic activity of thioredoxin reductase enzymes is endowed by at least two redox centers: a flavin and a dithiol/disulfide CXXC motif. The interaction between thioredoxin reductase and thioredoxin is generally species-specific, but the molecular aspects related to this phenomenon remain elusive. Here, we investigated the yeast cytosolic thioredoxin system, which is composed of NADPH, thioredoxin reductase (ScTrxR1), and thioredoxin 1 (ScTrx1) or thioredoxin 2 (ScTrx2). We showed that ScTrxR1 was able to efficiently reduce yeast thioredoxins (mitochondrial and cytosolic) but failed to reduce the human and Escherichia coli thioredoxin counterparts. To gain insights into this specificity, the crystallographic structure of oxidized ScTrxR1 was solved at 2.4 angstrom resolution. The protein topology of the redox centers indicated the necessity of a large structural rearrangement for FAD and thioredoxin reduction using NADPH. Therefore, we modeled a large structural rotation between the two ScTrxR1 domains (based on the previously described crystal structure, PDB code 1F6M). Employing diverse approaches including enzymatic assays, site-directed mutagenesis, amino acid sequence alignment, and structure comparisons, insights were obtained about the features involved in the species-specificity phenomenon, such as complementary electronic parameters between the surfaces of ScTrxR1 and yeast thioredoxin enzymes and loops and residues (such as Ser(72) in ScTrx2). Finally, structural comparisons and amino acid alignments led us to propose a new classification that includes a larger number of enzymes with thioredoxin reductase activity, neglected in the low/high molecular weight classification.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Vegetables are critical for human health as they are a source of multiple vitamins including vitamin E (VTE). In plants, the synthesis of VTE compounds, tocopherol and tocotrienol, derives from precursors of the shikimate and methylerythritol phosphate pathways. Quantitative trait loci (QTL) for alpha-tocopherol content in ripe fruit have previously been determined in an Solanum pennellii tomato introgression line population. In this work, variations of tocopherol isoforms (alpha, beta, gamma, and delta) in ripe fruits of these lines were studied. In parallel all tomato genes structurally associated with VTE biosynthesis were identified and mapped. Previously identified VTE QTL on chromosomes 6 and 9 were confirmed whilst novel ones were identified on chromosomes 7 and 8. Integrated analysis at the metabolic, genetic and genomic levels allowed us to propose 16 candidate loci putatively affecting tocopherol content in tomato. A comparative analysis revealed polymorphisms at nucleotide and amino acid levels between Solanum lycopersicum and S. pennellii candidate alleles. Moreover, evolutionary analyses showed the presence of codons evolving under both neutral and positive selection, which may explain the phenotypic differences between species. These data represent an important step in understanding the genetic determinants of VTE natural variation in tomato fruit and as such in the ability to improve the content of this important nutriceutical.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

With the aim of determining the genetic basis of metabolic regulation in tomato fruit, we constructed a detailed physical map of genomic regions spanning previously described metabolic quantitative trait loci of a Solanum pennellii introgression line population. Two genomic libraries from S. pennellii were screened with 104 colocated markers from five selected genomic regions, and a total of 614 bacterial artificial chromosome (BAC)/cosmids were identified as seed clones. Integration of sequence data with the genetic and physical maps of Solanum lycopersicum facilitated the anchoring of 374 of these BAC/cosmid clones. The analysis of this information resulted in a genome-wide map of a nondomesticated plant species and covers 10% of the physical distance of the selected regions corresponding to approximately 1% of the wild tomato genome. Comparative analyses revealed that S. pennellii and domesticated tomato genomes can be considered as largely colinear. A total of 1,238,705 bp from both BAC/cosmid ends and nine large insert clones were sequenced, annotated, and functionally categorized. The sequence data allowed the evaluation of the level of polymorphism between the wild and cultivated tomato species. An exhaustive microsynteny analysis allowed us to estimate the divergence date of S. pennellii and S. lycopersicum at 2.7 million years ago. The combined results serve as a reference for comparative studies both at the macrosyntenic and microsyntenic levels. They also provide a valuable tool for fine-mapping of quantitative trait loci in tomato. Furthermore, they will contribute to a deeper understanding of the regulatory factors underpinning metabolism and hence defining crop chemical composition.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The toucan genus Ramphastos (Piciformes: Ramphastidae) has been a model in the formulation of Neotropical paleobiogeographic hypotheses. Weckstein (2005) reported on the phylogenetic history of this genus based on three mitochondrial genes, but some relationships were weakly supported and one of the subspecies of R. vitellinus (citreolaemus) was unsampled. This study expands on Weckstein (2005) by adding more DNA sequence data (including a nuclear marker) and more samples, including R v. citreolaemus. Maximum parsimony, maximum likelihood, and Bayesian methods recovered similar trees, with nodes showing high support. A monophyletic R. vitellinus complex was strongly supported as the sister-group to R. brevis. The results also confirmed that the southeastern and northern populations of R. vitellinus ariel are paraphyletic. X v. citreolaemus is sister to the Amazonian subspecies of the vitellinus complex. Using three protein-coding genes (COI, cytochrome-b and ND2) and interval-calibrated nodes under a Bayesian relaxed-clock framework, we infer that ramphastid genera originated in the middle Miocene to early Pliocene, Ramphastos species originated between late Miocene and early Pleistocene, and intra-specific divergences took place throughout the Pleistocene. Parsimony-based reconstruction of ancestral areas indicated that evolution of the four trans-Andean Ramphastos taxa (R. v. citreolaemus, R. a. swainsonii, R. brevis and R. sulfuratus) was associated with four independent dispersals from the cis-Andean region. The last pulse of Andean uplift may have been important for the evolution of R. sulfuratus, whereas the origin of the other trans-Andean Ramphastos taxa is consistent with vicariance due to drying events in the lowland forests north of the Andes. Estimated rates of molecular evolution were higher than the ""standard"" bird rate of 2% substitutions/site/million years for two of the three genes analyzed (cytochrome-b and ND2). (C) 2009 Elsevier Inc. All rights reserved.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

An aspartic endopeptidase was purified in our laboratory from Rhipicephalus (Boophilus) microplus eggs [Logullo, C., Vaz, I.S., Sorgine, M.H., Paiva-Silva, G.O., Faria, F.S., Zingali, R.B., De Lima, M.F., Abreu, L., Oliveira, E.F., Alves, E.W, Masuda, H., Gonzales, J.C., Masuda, A., and Oliveira, P.L., 1998. Isolation of an aspartic proteinase precursor from the egg of a hard tick, Rhipicephalus (Boophilus) microplus. Parasitology 116, 525-532]. Boophilus yolk cathepsin (BYC) was tested as component of a protective vaccine against the tick, inducing a significant immune response in cattle [da Silva, VI., Jr., Logullo, C., Sorgine, M., Velloso, F.F., Rosa de Lima, M.F., Gonzales, J.C., Masuda, H., Oliveira, P.L., and Masuda, A., 1998. Immunization of bovines with an aspartic proteinase precursor isolated from Rhipicephalus (Boophilus) microplus eggs. Vet. Immunol. Immunopathol. 66,331-341]. In this work, BYC was cloned and its primary sequence showed high similarity with other aspartic endopeptidases. In spite of this similarity, BYC sequence shows many important differences in relation to other aspartic peptidases, the most important being the lack of the second catalytic Asp residue, considered to be essential for the catalysis of this class of endopeptidases. When we determined BYC cleavage specificity by LC-MS, we found out that it presents a preference for hydrophobic residues in P1 and P1` in accordance to most aspartic endopeptidases. Also, when analyzed by circular dicroism, BYC presented high beta sheet content, also a characteristic of aspartic endopeptidases. On the other hand, although both native and recombinant BYC are catalytically active, they present a very low specific activity, what seems to indicate that this peptidase will digest its natural substrate, vitellin, very slowly. We speculate that such a slow Vn degradative process might constitute an important strategy to preserve egg protein content to the hatching larvae. (c) 2007 Elsevier Inc. All rights reserved.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Phosphoribosyl pyrophosphate synthetase (PRS-EC:2.7.6.1) is an important enzyme present in several metabolic pathways, thus forming a complex family of isoenzymes. However, plant PRS enzymes have not been extensively investigated. In this study, a sugarcane prs gene has been characterized from the Sugar Cane Expressed Sequence Tag Genome Project. This gene contains a 984-bp open reading frame encoding a 328-amino acid protein. The predicted amino acid sequence has 77% and 78% amino acid sequence identity to Arabidopsis thaliana and Spinacia oleracea PRS4, respectively. The assignment of sugarcane PRS as a phosphate-independent PRS isoenzyme (Class II PRS) is verified following enzyme assay and phylogenetic reconstruction of PRS homologues. To gain further insight into the structural framework of the phosphate independence of sugarcane PRS, a molecular model is described. This model reveals the formation of two conserved domains elucidating the structural features involved in sugarcane PRS phosphate independence. The recombinant PRS retains secondary structure elements and a quaternary arrangement consistent with known PRS homologues, based on circular dichroism measurements.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Selenophosphate synthetase (EC 2.7.9.3), the product of the selD gene, produces the biologically selenium donor compound, monoselenophosphate, from ATP and selenide, for the synthesis of cysteine. The kinetoplastid Leishmania major and Trypanosoma brucei selD genes were cloned and the protein overexpressed and purified to apparent homogeneity. The selD gene in L. major and T brucei respectively 1197 and 1179 bp long encoding proteins of 399 and 393 amino acids with molecular of 42.7 and 43 kDa. The molecular mass of 100 kDa for both (L. major and T brucei) SEWS is consistent dimeric proteins. The kinetoplastid selD complement Escherichia call (WL400) selD deletion it is a functional enzyme and the specific activity of these enzymes was determined. A conserved residue was identified both by multiple sequence alignment as well as by functional and activity assay of the mutant (Cys to Ala) forms of the SELD identifying this residue as essential for catalytic function. (C) 2008 Elsevier B.V. All rights reserved.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The InteGrade middleware intends to exploit the idle time of computing resources in computer laboratories. In this work we investigate the performance of running parallel applications with communication among processors on the InteGrade grid. As costly communication on a grid can be prohibitive, we explore the so-called systolic or wavefront paradigm to design the parallel algorithms in which no global communication is used. To evaluate the InteGrade middleware we considered three parallel algorithms that solve the matrix chain product problem, the 0-1 Knapsack Problem, and the local sequence alignment problem, respectively. We show that these three applications running under the InteGrade middleware and MPI take slightly more time than the same applications running on a cluster with only LAM-MPI support. The results can be considered promising and the time difference between the two is not substantial. The overhead of the InteGrade middleware is acceptable, in view of the benefits obtained to facilitate the use of grid computing by the user. These benefits include job submission, checkpointing, security, job migration, etc. Copyright (C) 2009 John Wiley & Sons, Ltd.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Insect digestive chymotrypsins are present in a large variety of insect orders but their substrate specificity still remains unclear. Ewer insect chymotrypsins from 3 different insect orders (Dictyoptera, Coleoptera and two Lepidoptera) were isolated using affinity chromatography. Enzymes presented molecular masses in the range of 20 to 31 kDa and pH optima in the range of 7.5 to 10.0. Kinetic characterization. using different, colorimetric and fluorescent substrates indicated that insect chymotrypsins differ from, bovine chymotrypsin in their primary specificity toward small substrates (like N-benzoyl-L-Tyr p-nitroanilide) rather than on their preference for large substrates (exemplified by Succynil-Ala-Ala-Pro-Phe P-nitroanilide). Chloromethyl ketones (TPCK, N-alpha-tosyl-L-Phe chloromethyl ketone and Z-GGF-CK, N-carbobenzoxy-Gly-Gly-phe-CK) inactivated all chymotrypsins legated. Inactivation rates follow apparent first-order kinetics with variable second order rates (TPCK, 42 to 130 M(-1)s(-1); Z-GGF-CK, 150 to 450 M(-1)s(-1) that may be remarkably low for S. frugiperda chymotrypsin (TPCK, 6 M(-1)s(-1); Z-GGF-CK, 6.1 M(-1) s(-1)). Homology modelling and sequence alignment showed that. in lepidopteran chymotrypsins, differences in the amino acid residues in the neighborhood of the catalytic His 57 may affect its pKa, value. This is Proposed as the cause of the decrease in His 57 reactivity toward chloromethyl ketones. Such amino acid replacement in the active site is proposed. to be an adaptation to the presence of dietary ketones. (C) 2009 Wiley Periodicals, Inc.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

We describe AMIN (Amidase N-terminal domain), a novel protein domain found specifically in bacterial periplasmic proteins. AMIN domains are widely distributed among peptidoglycan hydrolases and transporter protein families. Based on experimental data, contextual information and phyletic profiles, we suggest that AMIN domains mediate the targeting of periplasmic or extracellular proteins to specific regions of the bacterial envelope.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Spodoptera frugiperda beta-1,3-glucanase (SLam) was purified from larval midgut. It has a molecular mass of 37.5 kDa, an alkaline optimum pH of 9.0, is active against beta-1,3-glucan (laminarin), but cannot hydrolyze yeast beta-1,3-1,6-glucan or other polysaccharides. The enzyme is an endoglucanase with low processivity (0.4), and is not inhibited by high concentrations of substrate. In contrast to other digestive beta-1,3-glucanases from insects, SLam is unable to lyse Saccharomyces cerevisae cells. The cDNA encoding SLam was cloned and sequenced, showing that the protein belongs to glycosyl hydrolase family 16 as other insect glucanases and glucan-binding proteins. Multiple sequence alignment of beta-1,3-glucanases and beta-glucan-binding protein supports the assumption that the beta-1,3-glucanase gene duplicated in the ancestor of mollusks and arthropods. One copy originated the derived beta-1,3-glucanases by the loss of an extended N-terminal region and the beta-glucan-binding proteins by the loss of the catalytic residues. SLam homology modeling suggests that E228 may affect the ionization of the catalytic residues, thus displacing the enzyme pH optimum. SLam antiserum reacts with a single protein in the insect midgut. Immunocytolocalization shows that the enzyme is present in secretory vesicles and glycocalyx from columnar cells. (C) 2010 Elsevier Ltd. All rights reserved.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Both soluble (SfTre1) and membrane-bound (SfTre2) trehalases occur along the midgut of Spodoptera frugiperda larvae. Released SfTre2 was purified as a 67 kDa protein. Its K(m) (1.6 mM) and thermal stability (half life 10 min at 62 degrees C) are different from the previously isolated soluble trehalase (K(m) = 0.47 mM; 100% stable at 62 degrees C). Two cDNAs coding for S. frugiperda trehalases have been cloned using primers based on consensus sequences of trehalases and having as templates a cDNA library prepared from total polyA-containing RNA extracted from midguts. One cDNA codes for a trehalase that has a predicted transmembrane sequence and was defined as SfTre2. The other, after being cloned and expressed, results in a recombinant trehalase with a K(m) value and thermal stability like those of native soluble trehalase. This enzyme was defined as SfTre1 and, after it was used to generate antibodies, it was immunolocalized at the secretory vesicles and at the glycocalyx of columnar cells. Escherichia coli trehalase 3D structure and sequence alignment with SfTre1 support a proposal regarding the residue modulating the pKa value of the proton donor.