987 resultados para sequence identity
Resumo:
The nucleotide sequence of genes 4 and 9, encoding the outer capsid proteins VP4 and VP7 of a serotype 10 tissue culture-adapted strain, 1321, representative of asymptomatic neonatal rotaviruses isolated from neonates in Bangalore, India, were determined. Comparison of nucleotide and deduced amino acid sequences of 1321 VP4 and VP7 with previously published sequences of various serotypes revealed that both genes were highly homologous to the respective genes of serotype 10 bovine rotavirus, B223. The VP4 of 1321 represents a new human P serotype and the 1321 and related strains represent the first description of neonatal rotaviruses that appear to derive both surface proteins from an animal rotavirus.
Large distribution and high sequence identity of a Copia-type retrotransposon in angiosperm families
Resumo:
Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)
Resumo:
Sequence divergence acts as a potent barrier to homologous recombination; much of this barrier derives from an antirecombination activity exerted by mismatch repair proteins. An inverted repeat assay system with recombination substrates ranging in identity from 74% to 100% has been used to define the relationship between sequence divergence and the rate of mitotic crossing-over in yeast. To elucidate the role of the mismatch repair machinery in regulating recombination between mismatched substrates, we performed experiments in both wild-type and mismatch repair defective strains. We find that a single mismatch is sufficient to inhibit recombination between otherwise identical sequences, and that this inhibition is dependent on the mismatch repair system. Additional mismatches have a cumulative negative effect on the recombination rate. With sequence divergence of up to approximately 10%, the inhibitory effect of mismatches results mainly from antirecombination activity of the mismatch repair system. With greater levels of divergence, recombination is inefficient even in the absence of mismatch repair activity. In both wild-type and mismatch repair defective strains, an approximate log-linear relationship is observed between the recombination rate and the level of sequence divergence.
Resumo:
This report documents the error rate in a commercially distributed subset of the IMAGE Consortium mouse cDNA clone collection. After isolation of plasmid DNA from 1189 bacterial stock cultures, only 62.2% were uncontaminated and contained cDNA inserts that had significant sequence identity to published data for the ordered clones. An agarose gel electrophoresis pre-screening strategy identified 361 stock cultures that appeared to contain two or more plasmid species. Isolation of individual colonies from these stocks demonstrated that 7.1% of the original 1189 stocks contained both a correct and an incorrect plasmid. 5.9% of the original 1189 stocks contained multiple, distinct, incorrect plasmids, indicating the likelihood of multiple contaminating events. While only 739 of the stocks purchased contained the desired cDNA clone, agarose gel pre-screening, colony isolation and similarity searching of dbEST allowed for the identification of an additional 420 clones that would have otherwise been discarded. Considering the high error rate in this subset of the IMAGE cDNA clone set, the use of sequence verified clones for cDNA microarray construction is warranted. When this is not possible, pre-screening non-sequence verified clones with agarose gel electrophoresis provides an inexpensive and efficient method to eliminate contaminated clones from the probe set.
Resumo:
Background. A variety of interactions between up to three different movement proteins (MPs), the coat protein (CP) and genomic DNA mediate the inter- and intra-cellular movement of geminiviruses in the genus Begomovirus. Although movement of viruses in the genus Mastrevirus is less well characterized, direct interactions between a single MP and the CP of these viruses is also clearly involved in both intra- and intercellular trafficking of virus genomic DNA. However, it is currently unknown how specific these MP-CP interactions are, nor how disruption of these interactions might impact on virus viability. Results. Using chimaeric genomes of two strains of Maize streak virus (MSV) we adopted a genetic approach to investigate the gross biological effects of interfering with interactions between virus MP and CP homologues derived from genetically distinct MSV isolates. MP and CP genes were reciprocally exchanged, individually and in pairs, between maize (MSV-Kom)- and Setaria sp. (MSV-Set)-adapted isolates sharing 78% genome-wide sequence identity. All chimaeras were infectious in Zea mays c.v. Jubilee and were characterized in terms of symptomatology and infection efficiency. Compared with their parental viruses, all the chimaeras were attenuated in symptom severity, infection efficiency, and the rate at which symptoms appeared. The exchange of individual MP and CP genes resulted in lower infection efficiency and reduced symptom severity in comparison with exchanges of matched MP-CP pairs. Conclusion. Specific interactions between the mastrevirus MP and CP genes themselves and/or their expression products are important determinants of infection efficiency, rate of symptom development and symptom severity. © 2008 van der Walt et al; licensee BioMed Central Ltd.
Resumo:
The complete nucleotide sequence of Subterranean clover mottle virus (SCMoV) genomic RNA has been determined. The SCMoV genome is 4,258 nucleotides in length. It shares most nucleotide and amino acid sequence identity with the genome of Lucerne transient streak virus (LTSV). SCMoV RNA encodes four overlapping open reading frames and has a genome organisation similar to that of Cocksfoot mottle virus (CfMV). ORF1 and ORF4 are predicted to encode single proteins. ORF2 is predicted to encode two proteins that are derived from a -1 translational frameshift between two overlapping reading frames (ORF2a and ORF2b). A search of amino acid databases did not find a significant match for ORF1 and the function of this protein remains unclear. ORF2a contains a motif typical of chymotrypsin-like serine proteases and ORF2b has motifs characteristically present in positive-stranded RNA-dependent RNA polymerases. ORF4 is likely to be expressed from a subgenomic RNA and encodes the viral coat protein. The ORF2a/ORF2b overlapping gene expression strategy used by SCMoV and CfMV is similar to that of the poleroviruses and differ from that of other published sobemoviruses. These results suggest that the sobemoviruses could now be divided into two distinct subgroups based on those that express the RNA-dependent RNA polymerase from a single, in-frame polyprotein, and those that express it via a -1 translational frameshifting mechanism.
Resumo:
Jacalin and artocarpin, the two lectins from jackfruit (Artocarpus integrifolia) seeds, have different physicochemical properties and carbohydrate-binding specificities. However, comparison of the partial amino-acid sequence of artocarpin with the known sequence of jacalin indicates close to 50% sequence identity. Artocarpin crystallizes in two forms, both monoclinic P2(1), with one and two tetramic molecules, respectively, in the asymmetric units of form I (a = 69.9, b = 73.7, c = 60.6 Angstrom and beta = 95.1 degrees) and form II (a = 87.6, b = 72.2, c = 92.6 Angstrom and beta = 101.1 degrees). Both the crystal structures have been solved by the molecular replacement method using the known structure of jacalin as the search model and ope of them partially refined, confirming that the two lectins are indeed homologous.
Resumo:
The first complete genome sequence of capsicum chlorosis virus (CaCV) from Australia was determined using a combination of Illumina HiSeq RNA and Sanger sequencing technologies. Australian CaCV had a tripartite genome structure like other CaCV isolates. The large (L) RNA was 8913 nucleotides (nt) in length and contained a single open reading frame (ORF) of 8634 nt encoding a predicted RNA-dependent RNA polymerase (RdRp) in the viral-complementary (vc) sense. The medium (M) and small (S) RNA segments were 4846 and 3944 nt in length, respectively, each containing two non-overlapping ORFs in ambisense orientation, separated by intergenic regions (IGR). The M segment contained ORFs encoding the predicted non-structural movement protein (NSm; 927 nt) and precursor of glycoproteins (GP; 3366 nt) in the viral sense (v) and vc strand, respectively, separated by a 449-nt IGR. The S segment coded for the predicted nucleocapsid (N) protein (828 nt) and non-structural suppressor of silencing protein (NSs; 1320 nt) in the vc and v strand, respectively. The S RNA contained an IGR of 1663 nt, being the largest IGR of all CaCV isolates sequenced so far. Comparison of the Australian CaCV genome with complete CaCV genome sequences from other geographic regions showed highest sequence identity with a Taiwanese isolate. Genome sequence comparisons and phylogeny of all available CaCV isolates provided evidence for at least two highly diverged groups of CaCV isolates that may warrant re-classification of AIT-Thailand and CP-China isolates as unique tospoviruses, separate from CaCV.
Resumo:
NSP3, an acidic nonstructural protein, encoded by gene 7 has been implicated as the key player in the assembly of the 11 viral plus-strand RNAs into the early replication intermediates during rotavirus morphogenesis. To date, the sequence or NSP3 from only three animal rotaviruses (SA11, SA114F, and bovine UK) has been determined and that from a human strain has not been reported. To determine the genetic diversity among gene 7 alleles from group A rotaviruses, the nucleotide sequence of the NSP3 gene from 13 strains belonging to nine different G serotypes, from both humans and animals, has been determined. Based on the amino acid sequence identity as well as phylogenetic analysis, NSP3 from group A rotaviruses falls into three evolutionarily related groups, i.e., the SA11 group, the Wa group, and the S2 group. The SA 11/SA114F gene appears to have a distant ancestral origin from that of the others and codes for a polypeptide of 315 amino acids (aa) in length. NSP3 from all other group A rotaviruses is only 313 aa in length because of a 2-amino-acid deletion near the carboxy-terminus, While the SA114F gene has the longest 3' untranslated region (UTR) of 132 nucleotides, that from other strains suffered deletions of varying lengths at two positions downstream of the translational termination codon. In spite of the divergence of the nucleotide (nt) sequence in the protein coding region, a stretch of about 80 nt in the 3' UTR is highly conserved in the NSP3 gene from all the strains. This conserved sequence in the 3' UTR might play an important role in the regulation of expression of the NSP3 gene. (C) 1995 Academic Press, Inc.
Resumo:
Growth hormone (GH), prolactin (PRL) and somatolactin (SL) were purified simultaneously under alkaline condition (pH 9.0) from pituitary glands of sea perch (Lateolabrax japonicas) by a two-step procedure involving gel filtration on Sephadex G-100 and reverse-phase high-performance liquid chromatography (rpHPLC). At each step of purification, fractions were monitored by sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE) and by immunoblotting with chum salmon GH. PRL and SL antisera. The yields of sea perch GH, PRL and SL were 4.2, 1.0 and 0.28 mg/g wet tissue, respectively. The molecular weights of 19,200 and 20,370 Da were estimated by SDS-PAGE for sea perch GH and PRL, respectively. Two forms of sea perch SL were found: one (28,400 Da) is probably glycosylated, while the other one (23,200 Da) is believed to be deglycosylated. GH bioactivity was examined by an in vivo assay. Intraperitoneal injection of sea perch GH at a dose of 0.01 and 0.1 mug/g body weight at 7-day intervals resulted in a significant increase in body weight and length of juvenile rainbow trout. The complete sea-perch GH amino acid sequence of 187 residues was determined by sequencing fragments cleaved by chemicals and enzymes. Alignment of sea-perch GH with those of other fish GHs revealed that sea-perch GH is most similar to advanced marine fish, such as tuna, gilthead sea bream, yellowfin porgy, red sea bream, bonito and yellow tail with 98.4, 96.2%, 95.7%, 95.2%, 94.1% and 91% sequence identity, respectively. Sea-perch GH has low identity to Atlantic cod (76.5%), hardtail (73.3%), flounder (68.4%), chum salmon (66.3%), carp (54%) and blue shark (38%). Partial amino-acid sequences of 127 of sea-perch PRL and the N-terminal of 16 amino-acid sequence of sea-perch SL have been determined. The data show that sea-perch PRL has a slightly higher sequence identity with tilapia PRL( 73.2%) than with chum salmon PRL(70%) in this 127 amino-acid sequence. (C) 2001 Elsevier Science B.V. All rights reserved.
Resumo:
The pefA gene which encoded the serotype associated plasmid (SAP) mediated fimbrial major subunit antigen of Salmonella enterica serotype Typhimurium shared genetic identity with 128 of 706 salmonella isolates as demonstrated by dot (colony) hybridization. Seventy-seven of 113 isolates of Typhimurium and individual isolates of serotypes Bovis-morbificans, Cholerae-suis and Enteritidis phage type 9b hybridized pefA strongly, whereas 48 isolates of Enteritidis hybridized pefA weakly and one Enteritidis isolate of phage type 14b failed to hybridize. Individual isolates of 294 serotypes and 247 individual isolates of serotype Dublin did not hybridize pefA. Southern hybridization of plasmids extracted from Enteritidis demonstrated that the pefA gene probe hybridized strongly an atypical SAP of 80 kb in size harboured by one Enteritidis isolate of phage-type 9b, whereas the typical SAP of 58 kb in size harboured by 48 Enteritidis isolates hybridized weakly. One Enteritidis isolate of phage type 14b which failed to hybridize pefA in dot (colony) hybridization experiments was demonstrated to be plasmid free. A cosmid library of Enteritidis phage type 4 expressed in Escherichia coli K12 was screened by hybridization for the presence of pef sequences. Recombinant clones which were deduced to harbour the entire pef operon elaborated a PEF-like fimbrial structure at the cell surface. The PEF-like fimbrial antigen was purified from one cosmid clone and used in western blot experiments with sera from chickens infected with Enteritidis phage-type 4. Seroconversion to the fimbrial antigen was observed which indicated that the Enteritidis PEF-like fimbrial structure was expressed at some stage during infection. Nucleotide sequence analysis demonstrated that the pefA alleles of Typhimurium and Enteritidis phage-type 4 shared 76% DNA nucleotide and 82% deduced amino acid sequence identity.
Resumo:
The nucleotide sequence of a 3 kb region immediately upstream of the sef operon operon of Salmonella enteritidis was determined. A 1230 base pair insertion sequence which shared sequence identity (> 75%) with members of the IS3 family was revealed. This element, designated IS1230, had almost identical (90% identity) terminal inverted repeats to Escherichia coli IS3 but unlike other IS3-like sequences lacked the two characteristic open reading frames which encode the putative transposase. S. enteritidis possessed only one copy of this insertion sequence although Southern hybridisation analysis of restriction digests of genomic DNA revealed another fragment located in a region different from the sef operon which hybridised weakly which suggested the presence of an IS1230 homologue. The distribution of IS1230 and IS1230-like elements was shown to be widespread amongst salmonellas and the patterns of restriction fragments which hybridised differed significantly between Salmonella serotypes and it is suggested that IS1230 has potential for development as a differential diagnostic tool.
Resumo:
BaP1 is a 22.7-kD P-I-type zinc-dependent metalloproteinase isolated from the venom of the snake Bothrops asper, a medically relevant species in Central America. This enzyme exerts multiple tissue-damaging activities, including hemorrhage, myonecrosis, dermonecrosis, blistering, and edema. BaP1 is a single chain of 202 amino acids that shows highest sequence identity with metalloproteinases isolated front the venoms of snakes of the subfamily Crotalinae. It has six Cys residues involved in three disulfide bridges (Cys 117-Cys 197, Cys 159-Cys 181, Cys 157-Cys 164). It has the consensus sequence H(142)E(143)XXH(146)XXGXXH(152), as well as the sequence C164I165M166, which characterize the metzincin superfamily of metalloproteinases. The active-site cleft separates a major subdomain (residues 1-152), comprising four a-helices and a five-stranded beta-sheet, from the minor subdomain, which is formed by a single a-helix and several loops. The catalytic zinc ion is coordinated by the N-epsilon2 nitrogen atoms of His 142, His 146, and His 152, in addition to a solvent water molecule, which in turn is bound to Glu 143. Several conserved residues contribute to the formation of the hydrophobic pocket, and Met 166 serves as a hydrophobic base for the active-site groups. Sequence and structural comparisons of hemorrhagic and nonhemorrhagic P-I metalloproteinases from snake venoms revealed differences in several regions. In particular, the loop comprising residues 153 to 176 has marked structural differences between metalloproteinases with very different hemorrhagic activities. Because this region lies in close proximity to the active-site microenvironment, it may influence the interaction of these enzymes with physiologically relevant substrates in the extracellular matrix.
Resumo:
Intron splicing is one of the most important steps involved in the maturation process of a pre-mRNA. Although the sequence profiles around the splice sites have been studied extensively, the levels of sequence identity between the exonic sequences preceding the donor sites and the intronic sequences preceding the acceptor sites has not been examined as thoroughly. In this study we investigated identity patterns between the last 15 nucleotides of the exonic sequence preceding the 5' splice site and the intronic sequence preceding the 3' splice site in a set of human protein-coding genes that do not exhibit intron retention. We found that almost 60% of consecutive exons and introns in human protein-coding genes share at least two identical nucleotides at their 3' ends and, on average, the sequence identity length is 2.47 nucleotides. Based on our findings we conclude that the 3' ends of exons and introns tend to have longer identical sequences within a gene than when being taken from different genes. Our results hold even if the pairs are non-consecutive in the transcription order. (C) 2012 Elsevier Ltd. All rights reserved.
Resumo:
Bioinformatics, in the last few decades, has played a fundamental role to give sense to the huge amount of data produced. Obtained the complete sequence of a genome, the major problem of knowing as much as possible of its coding regions, is crucial. Protein sequence annotation is challenging and, due to the size of the problem, only computational approaches can provide a feasible solution. As it has been recently pointed out by the Critical Assessment of Function Annotations (CAFA), most accurate methods are those based on the transfer-by-homology approach and the most incisive contribution is given by cross-genome comparisons. In the present thesis it is described a non-hierarchical sequence clustering method for protein automatic large-scale annotation, called “The Bologna Annotation Resource Plus” (BAR+). The method is based on an all-against-all alignment of more than 13 millions protein sequences characterized by a very stringent metric. BAR+ can safely transfer functional features (Gene Ontology and Pfam terms) inside clusters by means of a statistical validation, even in the case of multi-domain proteins. Within BAR+ clusters it is also possible to transfer the three dimensional structure (when a template is available). This is possible by the way of cluster-specific HMM profiles that can be used to calculate reliable template-to-target alignments even in the case of distantly related proteins (sequence identity < 30%). Other BAR+ based applications have been developed during my doctorate including the prediction of Magnesium binding sites in human proteins, the ABC transporters superfamily classification and the functional prediction (GO terms) of the CAFA targets. Remarkably, in the CAFA assessment, BAR+ placed among the ten most accurate methods. At present, as a web server for the functional and structural protein sequence annotation, BAR+ is freely available at http://bar.biocomp.unibo.it/bar2.0.