978 resultados para Nucleotide-sequence Analysis
Resumo:
The nucleotide sequence of a 3 kb region immediately upstream of the sef operon of Salmonella enteritidis was determined. A 1230 base pair insertion sequence which shared sequence identity (> 75%) with members of the IS3 family was revealed. This element, designated IS1230, had almost identical (90% identity) terminal inverted repeats to Escherichia coli IS3 but unlike other IS3-like sequences lacked the two characteristic open reading frames which encode the putative transposase. S. enteritidis possessed only one copy of this insertion sequence although Southern hybridisation analysis of restriction digests of genomic DNA revealed another fragment located in a region different from the sef operon which hybridised weakly which suggested the presence of an IS1230 homologue. The distribution of IS1230 and IS1230-like elements was shown to be widespread amongst salmonellas and the patterns of restriction fragments which hybridised differed significantly between Salmonella serotypes and it is suggested that IS1230 has potential for development as a differential diagnostic tool.
Resumo:
In this study, 222 genome survey sequences were generated for Trypanosoma rangeli strain P07 isolated from an opossum (Didelphis albiventris) in Minas Gerais State, Brazil. T. rangeli sequences were compared by BLASTX (Basic Local Alignment Search Tool X) analysis with the assembled contigs of Leishmania braziliensis, Leishmania infantum, Leishmania major, Trypanosoma brucei, and Trypanosoma cruzi. Results revealed that 82% (182/222) of the sequences were associated with predicted proteins described, whereas 18% (40/222) of the sequences did not show significant identity with sequences deposited in databases, suggesting that they may represent T. rangeli-specific sequences. Among the 182 predicted sequences, 179 (80.6%) had the highest similarity with T. cruzi, 2 (0.9%) with T. brucei, and 1 (0.5%) with L. braziliensis. Computer analysis permitted the identification of members of various gene families described for trypanosomatids in the genome of T. rangeli, such as trans-sialidases, mucin-associated surface proteins, and major surface proteases (MSP or gp63). This is the first report identifying sequences of the MSP family in T. rangeli. Multiple sequence alignments showed that the predicted MSP of T. rangeli presented the typical characteristics of metalloproteases, such as the presence of the HEXXH motif, which corresponds to a region previously associated with the catalytic site of the enzyme, and various cysteine and proline residues, which are conserved among MSPs of different trypanosomatid species. Reverse transcriptase-polymerase chain reaction analysis revealed the presence of MSP transcripts in epimastigote forms of T. rangeli.
Resumo:
Despite many successes of conventional DNA sequencing methods, some DNAs remain difficult or impossible to sequence. Unsequenceable regions occur in the genomes of many biologically important organisms, including the human genome. Such regions range in length from tens to millions of bases, and may contain valuable information such as the sequences of important genes. The authors have recently developed a technique that renders a wide range of problematic DNAs amenable to sequencing. The technique is known as sequence analysis via mutagenesis (SAM). This paper presents a number of algorithms for analysing and interpreting data generated by this technique.
Resumo:
Endoparasitoid wasps produce maternal protein secretions, which are transported into the body of insect hosts at oviposition to regulate host physiology for successful development of their offspring. Venturia canescens calyx fluid contains so-called virus-like particles (VLPs) that are essential for immune evasion of the developing parasitoid inside the host. VLPs consist of four major proteins. In this paper, we describe the isolation and molecular cloning of a gene (vlp2) that is a constituent of VLPs and discuss its possible role in VLP structure and function.
Resumo:
We aimed to study patterns of variation and factors influencing the evolutionary dynamics of a satellite DNA, pBuM, in all seven Drosophila species from the buzzatii cluster (repleta group). We analyzed 117 alpha pBuM-1 (monomer length 190 bp) and 119 composite alpha/beta (370 bp) pBuM-2 repeats and determined the chromosome location and long-range organization on DNA fibers of major sequence variants. Such combined methodologies in the study of satDNAs have been used in very few organisms. In most species, concerted evolution is linked to high copy number of pBuM repeats. Species presenting low-abundance and scattered distributed pBuM repeats did not undergo concerted evolution and maintained part of the ancestral inter-repeat variability. The alpha and alpha/beta repeats colocalized in heterochromatic regions and were distributed on multiple chromosomes, with notable differences between species. High-resolution FISH revealed array sizes of a few kilobases to over 0.7 Mb and mutual arrangements of alpha and alpha/beta repeats along the same DNA fibers, but with considerable changes in the amount of each variant across species. From sequence, chromosomal and phylogenetic data, we could infer that homogenization and amplification events involved both new and ancestral pBuM variants. Altogether, the data on the structure and organization of the pBuM satDNA give insights into genome evolution including mechanisms that contribute to concerted evolution and diversification.
Resumo:
The genetic diversity of three temperate fruit tree phytoplasmas ‘Candidatus Phytoplasma prunorum’, ‘Ca. P. mali’ and ‘Ca. P. pyri’ has been established by multilocus sequence analysis. Among the four genetic loci used, the genes imp and aceF distinguished 30 and 24 genotypes, respectively, and showed the highest variability. Percentage of substitution for imp ranged from 50 to 68% according to species. Percentage of substitution varied between 9 and 12% for aceF, whereas it was between 5 and 6% for pnp and secY. In the case of ‘Ca P. prunorum’ the three most prevalent aceF genotypes were detected in both plants and insect vectors, confirming that the prevalent isolates are propagated by insects. The four isolates known to be hypo-virulent had the same aceF sequence, indicating a possible monophyletic origin. Haplotype network reconstructed by eBURST revealed that among the 34 haplotypes of ‘Ca. P. prunorum’, the four hypo-virulent isolates also grouped together in the same clade. Genotyping of some Spanish and Azerbaijanese ‘Ca. P. pyri’ isolates showed that they shared some alleles with ‘Ca. P. prunorum’, supporting for the first time to our knowledge, the existence of inter-species recombination between these two species.
Resumo:
In this paper, we analysed the haemagglutinin (HA) gene identified by polymerase chain reaction from 90 influenza A H1N1 virus strains that circulated in Brazil from April 2009-June 2010. A World Health Organization sequencing protocol allowed us to identify amino acid mutations in the HA protein at positions S220T (71%), D239G/N/S (20%), Y247H (4.5%), E252K (3.3%), M274V (2.2%), Q310H (26.7%) and E391K (12%). A fatal outcome was associated with the D239G mutation (p < 0.0001). Brazilian HA genetic diversity, in comparison to a reference strain from California, highlights the role of influenza virus surveillance for study of viral evolution, in addition to monitoring the spread of the virus worldwide.
Resumo:
This book gives a general view of sequence analysis, the statistical study of successions of states or events. It includes innovative contributions on life course studies, transitions into and out of employment, contemporaneous and historical careers, and political trajectories. The approach presented in this book is now central to the life-course perspective and the study of social processes more generally. This volume promotes the dialogue between approaches to sequence analysis that developed separately, within traditions contrasted in space and disciplines. It includes the latest developments in sequential concepts, coding, atypical datasets and time patterns, optimal matching and alternative algorithms, survey optimization, and visualization. Field studies include original sequential material related to parenting in 19th-century Belgium, higher education and work in Finland and Italy, family formation before and after German reunification, French Jews persecuted in occupied France, long-term trends in electoral participation, and regime democratization. Overall the book reassesses the classical uses of sequences and it promotes new ways of collecting, formatting, representing and processing them. The introduction provides basic sequential concepts and tools, as well as a history of the method. Chapters are presented in a way that is both accessible to the beginner and informative to the expert.
Resumo:
The biological properties of wild-type A75/17 and cell culture-adapted Onderstepoort canine distemper virus differ markedly. To learn more about the molecular basis for these differences, we have isolated and sequenced the protein-coding regions of the attachment and fusion proteins of wild-type canine distemper virus strain A75/17. In the attachment protein, a total of 57 amino acid differences were observed between the Onderstepoort strain and strain A75/17, and these were distributed evenly over the entire protein. Interestingly, the attachment protein of strain A75/17 contained an extension of three amino acids at the C terminus. Expression studies showed that the attachment protein of strain A75/17 had a higher apparent molecular mass than the attachment protein of the Onderstepoort strain, in both the presence and absence of tunicamycin. In the fusion protein, 60 amino acid differences were observed between the two strains, of which 44 were clustered in the much smaller F2 portion of the molecule. Significantly, the AUG that has been proposed as a translation initiation codon in the Onderstepoort strain is an AUA codon in strain A75/17. Detailed mutation analyses showed that both the first and second AUGs of strain A75/17 are the major translation initiation sites of the fusion protein. Similar analyses demonstrated that, also in the Onderstepoort strain, the first two AUGs are the translation initiation codons which contribute most to the generation of precursor molecules yielding the mature form of the fusion protein.
Resumo:
One major methodological problem in analysis of sequence data is the determination of costs from which distances between sequences are derived. Although this problem is currently not optimally dealt with in the social sciences, it has some similarity with problems that have been solved in bioinformatics for three decades. In this article, the authors propose an optimization of substitution and deletion/insertion costs based on computational methods. The authors provide an empirical way of determining costs for cases, frequent in the social sciences, in which theory does not clearly promote one cost scheme over another. Using three distinct data sets, the authors tested the distances and cluster solutions produced by the new cost scheme in comparison with solutions based on cost schemes associated with other research strategies. The proposed method performs well compared with other cost-setting strategies, while it alleviates the justification problem of cost schemes.