951 results for sequence based alignments
Abstract:
Background: With the decrease of DNA sequencing costs, sequence-based typing methods are rapidly becoming the gold standard for epidemiological surveillance. These methods provide reproducible and comparable results needed for a global scale bacterial population analysis, while retaining their usefulness for local epidemiological surveys. Online databases that collect the generated allelic profiles and associated epidemiological data are available, but this wealth of data remains underused and is frequently poorly annotated, since no user-friendly tool exists to analyze and explore it. Results: PHYLOViZ is platform-independent Java software that allows the integrated analysis of sequence-based typing methods, including SNP data generated from whole genome sequence approaches, and associated epidemiological data. goeBURST and its Minimum Spanning Tree expansion are used for visualizing the possible evolutionary relationships between isolates. The results can be displayed as an annotated graph overlaying the query results of any other epidemiological data available. Conclusions: PHYLOViZ is user-friendly software that allows the combined analysis of multiple data sources for microbial epidemiological and population studies. It is freely available at http://www.phyloviz.net.
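As a rough illustration of how a minimum spanning tree can link isolates by their allelic profiles, the sketch below runs Kruskal's algorithm over pairwise Hamming distances (the number of loci that differ). It is not PHYLOViZ's code and does not reproduce goeBURST's own rules; the profiles, the sequence-type labels and the function names are all hypothetical.

```python
from itertools import combinations


def hamming(a, b):
    """Number of loci at which two allelic profiles differ."""
    return sum(x != y for x, y in zip(a, b))


def minimum_spanning_tree(profiles):
    """Kruskal's algorithm over pairwise Hamming distances.

    profiles: dict mapping a label to a tuple of allele numbers.
    Returns a list of (distance, label_a, label_b) edges.
    """
    parent = {label: label for label in profiles}

    def find(x):
        # Union-find lookup with path halving
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    edges = sorted(
        (hamming(profiles[a], profiles[b]), a, b)
        for a, b in combinations(sorted(profiles), 2)
    )
    tree = []
    for dist, a, b in edges:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:  # edge joins two separate components
            parent[root_a] = root_b
            tree.append((dist, a, b))
    return tree


# Hypothetical 7-locus profiles, for illustration only
profiles = {
    "ST1": (1, 1, 1, 1, 1, 1, 1),
    "ST2": (1, 1, 1, 1, 1, 1, 2),
    "ST3": (1, 1, 1, 1, 1, 3, 2),
    "ST4": (4, 4, 4, 4, 1, 1, 1),
}
tree = minimum_spanning_tree(profiles)
```

Ties between equal-distance edges are broken here only by label order, which is an arbitrary choice made for this sketch.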
Abstract:
BACKGROUND: A growing number of patients with chronic hepatitis B are being treated for extended periods with nucleoside and/or nucleotide analogs. In this context, antiviral resistance represents an increasingly common and complex issue. METHODS: Mutations in the hepatitis B virus (HBV) reverse transcriptase (rt) gene and viral genotypes were determined by direct sequencing of PCR products and alignment with reference sequences deposited in GenBank. RESULTS: Plasma samples from 60 patients with chronic hepatitis B have been analyzed since March 2009. The predominant mutation pattern identified in patients with virological breakthrough was rtM204V/I ± different compensatory mutations, conferring resistance to L-nucleosides (lamivudine, telbivudine, emtricitabine) and predisposing to entecavir resistance (n = 18). Complex mutation patterns with a potential for multidrug resistance were identified in 2 patients. Selection of a fully entecavir-resistant strain was observed in a patient exposed to lamivudine alone. Novel mutations were identified in 1 patient. Wild-type HBV was identified in 9 patients with suspected virological breakthrough, raising concerns about treatment adherence. No preexisting resistance mutations were identified in treatment-naïve patients (n = 13). Viral genome amplification and sequencing failed in 16 patients, of whom only 2 had a documented HBV DNA > 1000 IU/ml. HBV genotypes were D in 28, A in 6, B in 4, C in 3 and E in 3 patients. Results will be updated in August 2010 and therapeutic implications discussed. CONCLUSIONS: With expanding treatment options and a growing number of patients exposed to nucleoside and/or nucleotide analogs, sequence-based HBV antiviral resistance testing is expected to become a cornerstone in the management of chronic hepatitis B.
Abstract:
Early immunological data, obtained by immunodiffusion and immunoelectrophoresis, on the whole-cell antigenicity of kinetoplastid protozoa were retrieved and used to construct a dendrogram of antigenic distances. Remarkably, they supported the same taxonomic conclusions as analyses based on DNA and protein sequence data.
Abstract:
Motivation: The ability of a simple method (MODCHECK) to determine the sequence–structure compatibility of a set of structural models generated by fold recognition is tested in a thorough benchmark analysis. Four Model Quality Assessment Programs (MQAPs) were tested on 188 targets from the latest LiveBench-9 automated structure evaluation experiment. We systematically test and evaluate whether the MQAP methods can successfully detect native-like models. Results: We show that, compared with the other three methods tested, MODCHECK is the most reliable method for consistently performing the best top model selection and for ranking the models. In addition, we show that the choice of model similarity score used to assess a model's similarity to the experimental structure can influence the overall performance of these tools. Although these MQAP methods fail to improve the model selection performance for methods that already incorporate protein three-dimensional (3D) structural information, an improvement is observed for methods that are purely sequence-based, including the best profile–profile methods. This suggests that even the best sequence-based fold recognition methods can still be improved by taking into account the 3D structural information.
Abstract:
Data generated from next generation sequencing (NGS) will soon comprise the majority of information about arbuscular mycorrhizal fungal (AMF) communities. Although these approaches give deeper insight, analysing NGS data involves decisions that can significantly affect results and conclusions. This is particularly true for AMF community studies, because much remains to be known about their basic biology and genetics. During a workshop in 2013, representatives from seven research groups using NGS for AMF community ecology gathered to discuss common challenges and directions for future research. Our goal was to improve the quality and accessibility of NGS data for the AMF research community. Discussions spanned sampling design, sample preservation, sequencing, bioinformatics and data archiving. With concrete examples we demonstrated how different approaches can significantly alter analysis outcomes. Failure to consider the consequences of these decisions may compound bias introduced at each step along the workflow. The products of these discussions have been summarized in this paper in order to serve as a guide for any researcher undertaking NGS sequencing of AMF communities.
Abstract:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Abstract:
Background: Identification of nontuberculous mycobacteria (NTM) based on phenotypic tests is time-consuming, labor-intensive, expensive and often provides erroneous or inconclusive results. In the molecular method referred to as PRA-hsp65, a fragment of the hsp65 gene is amplified by PCR and then analyzed by restriction digest; this rapid approach offers the promise of accurate, cost-effective species identification. The aim of this study was to determine whether species identification of NTM using PRA-hsp65 is sufficiently reliable to serve as the routine methodology in a reference laboratory. Results: A total of 434 NTM isolates were obtained from 5019 cultures submitted to the Institute Adolpho Lutz, Sao Paulo, Brazil, between January 2000 and January 2001. Species identification was performed for all isolates using conventional phenotypic methods and PRA-hsp65. Phenotypic evaluation and PRA-hsp65 were concordant for 321 (74%) isolates. These assignments were presumed to be correct. For the remaining 113 discordant isolates, definitive identification was based on sequencing a 441 bp fragment of hsp65. PRA-hsp65 identified 30 isolates with hsp65 alleles representing 13 previously unreported PRA-hsp65 patterns. Overall, species identification by PRA-hsp65 was significantly more accurate than by phenotypic methods (392 (90.3%) vs. 338 (77.9%), respectively; p < .0001, Fisher's test). Among the 333 isolates representing the most common pathogenic species, PRA-hsp65 provided an incorrect result for only 1.2%. Conclusion: PRA-hsp65 is a rapid and highly reliable method and deserves consideration by any clinical microbiology laboratory charged with performing species identification of NTM.
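The accuracy comparison quoted above (392 of 434 vs. 338 of 434 correct identifications) can be checked with a short calculation. The sketch below is one possible two-sided Fisher's exact test written with the standard library only; it treats the two methods as independent groups, as the quoted test implies, and the summation rule for the two-sided p-value (all tables no more probable than the observed one) is an assumption, since the abstract does not say which variant was used.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]]."""
    row1, row2 = a + b, c + d
    col1 = a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of x in the top-left cell
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_obs = prob(a)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    # Sum every table that is no more probable than the observed one
    return sum(
        p for p in (prob(x) for x in range(lo, hi + 1))
        if p <= p_obs * (1 + 1e-9)
    )


total = 434
correct_pra, correct_phenotype = 392, 338

pct_pra = round(100 * correct_pra / total, 1)              # 90.3
pct_phenotype = round(100 * correct_phenotype / total, 1)  # 77.9
p_value = fisher_exact_two_sided(
    correct_pra, total - correct_pra,
    correct_phenotype, total - correct_phenotype,
)
```

The percentages match the reported 90.3% and 77.9%, and the p-value falls well below the reported threshold of .0001.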
Abstract:
The growing knowledge on physiology, cell biology and biochemistry of the reproductive organs has provided many insights into molecular mechanisms that are required for successful reproduction. Research directed at the investigation of reproduction physiology in domestic animals was hampered in the past by a lack of species-specific genomic information. The genome sequences of dog, cattle and horse became publicly available in 2005, 2006 and 2007, respectively. Although the gene content of mammalian genomes is generally very similar, genes involved in reproduction tend to be less conserved than the average mammalian gene. The availability of genome sequences provides a valuable resource to check whether any protein that may be known from human or mouse research is present in cattle and/or horse as well. Currently, more than 200 genes are known to be involved in the production of fertile sperm cells. Great progress has been made in the understanding of genetic aberrations that lead to male infertility. Additionally, the first genetic mechanisms are being discovered that contribute to the quantitative variation of fertility traits in fertile male animals. Here, I will review some selected aspects of genetic research in male fertility and offer some perspectives for the use of genomic sequence information.
Abstract:
Nucleic acid sequence-based amplification (NASBA) has proved to be an ultrasensitive method for HIV-1 diagnosis in plasma even in the primary HIV infection stage. This technique was combined with fluorescence correlation spectroscopy (FCS), which enables online detection of the HIV-1 RNA molecules amplified by NASBA. A fluorescently labeled DNA probe at nanomolar concentration was introduced into the NASBA reaction mixture and hybridized to a distinct sequence of the amplified RNA molecule. The specific hybridization and extension of this probe during the amplification reaction, resulting in an increase of its diffusion time, was monitored online by FCS. As a consequence, after having reached a critical concentration of 0.1–1 nM (threshold for unaided FCS detection), the number of amplified RNA molecules in the further course of reaction could be determined. Evaluation of the hybridization/extension kinetics allowed an estimation of the initial HIV-1 RNA concentration that was present at the beginning of amplification. The value of the initial HIV-1 RNA number enables discrimination between positive and false-positive samples (caused for instance by carryover contamination). This possibility of discrimination is essential for all diagnostic methods using amplification systems (PCR as well as NASBA). Quantitation of HIV-1 RNA in plasma by combination of NASBA with FCS may also be useful in assessing the efficacy of anti-HIV agents, especially in the early infection stage when standard ELISA antibody tests often display negative results.
Abstract:
Transmission of human immunodeficiency virus 1 (HIV-1) from an infected woman to her offspring during gestation and delivery was found to be influenced by the infant's major histocompatibility complex class II DRB1 alleles. Forty-six HIV-infected infants and 63 seroreverting infants, born with passively acquired anti-HIV antibodies but not becoming detectably infected, were typed by an automated nucleotide-sequence-based technique that uses low-resolution PCR to select either the simpler Taq or the more demanding T7 sequencing chemistry. One or more DR13 alleles, including DRB1*1301, 1302, and 1303, were found in 31.7% of seroreverting infants and 15.2% of those becoming HIV-infected [OR (odds ratio) = 2.6 (95% confidence interval 1.0-6.8); P = 0.048]. This association was influenced by ethnicity, being seen more strongly among the 80 Black and Hispanic children [OR = 4.3 (1.2-16.4); P = 0.023], with the most pronounced effect among Black infants where 7 of 24 seroreverters inherited these alleles with none among 12 HIV-infected infants (Haldane OR = 12.3; P = 0.037). The previously recognized association of DR13 alleles with some situations of long-term nonprogression of HIV suggests that similar mechanisms may regulate both the occurrence of infection and disease progression after infection. Upon examining for residual associations, only the DR2 allele DRB1*1501 was associated with seroreversion in Caucasoid infants (OR = 24; P = 0.004). Among Caucasoids the DRB1*03011 allele was positively associated with the occurrence of HIV infection (P = 0.03).
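The first odds ratio in this abstract can be reproduced approximately from the reported percentages. The sketch below assumes carrier counts back-calculated from those percentages (31.7% of 63 is about 20 seroreverters; 15.2% of 46 is about 7 infected infants) and assumes a Woolf (log-normal) confidence interval; neither the raw counts nor the interval method is stated in the abstract.

```python
from math import exp, log, sqrt


def odds_ratio_ci(a, b, c, d, z=1.96):
    """Odds ratio with a Woolf (log-normal) interval for the table [[a, b], [c, d]]."""
    odds_ratio = (a * d) / (b * c)
    se = sqrt(1 / a + 1 / b + 1 / c + 1 / d)  # standard error of log(OR)
    lower = exp(log(odds_ratio) - z * se)
    upper = exp(log(odds_ratio) + z * se)
    return odds_ratio, lower, upper


# Assumed counts: DR13 carriers / non-carriers among seroreverters (20 / 43)
# and among HIV-infected infants (7 / 39), inferred from the percentages.
odds_ratio, lower, upper = odds_ratio_ci(20, 43, 7, 39)
print(f"OR = {odds_ratio:.1f} (95% CI {lower:.1f}-{upper:.1f})")
# prints "OR = 2.6 (95% CI 1.0-6.8)"
```

With these assumed counts, the rounded result agrees with the values reported in the abstract.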
Abstract:
This article investigates the expression patterns of 160 genes that are expressed during early mouse development. The cDNAs were isolated from 7.5 d postcoitum (dpc) endoderm, a region that comprises visceral endoderm (VE), definitive endoderm, and the node, tissues that are required for the initial steps of axial specification and tissue patterning in the mouse. To avoid examining the same gene more than once, and to exclude potentially ubiquitously expressed housekeeping genes, cDNA sequence was derived from 1978 clones of the Endoderm library. These yielded 1440 distinct cDNAs, of which 123 proved to be novel in the mouse. In situ hybridization analysis was carried out on 160 of the cDNAs, and of these, 29 (18%) proved to have restricted expression patterns.
Abstract:
Protein evolution is an important area of bioinformatics research and drives interest in finding alignment tools that can be used reliably and that accurately model the evolution of a protein family. TM-Align (Zhang and Skolnick, 2005) is considered the ideal tool for such a task in terms of speed and accuracy. In this study, TM-Align was therefore used as a reference point to help identify other alignment tools that are able to describe protein evolution accurately. In parallel, we extended the existing tool for exploring protein secondary structures, Helix Explorer (Marrakchi, 2006), so that it can also be used as a tool for modeling protein evolution.
Abstract:
The number of sequences generated by genome projects has increased exponentially, but gene characterization has not followed at the same rate. Sequencing and analysis of full-length cDNAs is an important step in gene characterization that is now used by several research groups. In this work, we have selected Schistosoma mansoni clones for full-length sequencing, using an algorithm that investigates the presence of the initial methionine in the parasite sequence based on the positions of alignment start between two sequences. BLAST searches to produce such alignments have been performed using parasite expressed sequence tags produced by the Minas Gerais Genome Network against sequences from the Eukaryotic Cluster of Orthologous Groups (KOG) database. This procedure has allowed the selection of clones representing 398 proteins which have not been deposited as S. mansoni complete CDS in any public database. Dedicated sequencing of 96 such clones with reads from both 5' and 3' ends has been performed. These reads have been assembled using PHRAP, resulting in the production of 33 full-length sequences that represent novel S. mansoni proteins. These results should contribute to a more complete view of the biology of this important parasite.
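One way to picture the idea of inferring the initial methionine from alignment start positions is sketched below. It is not the authors' algorithm, whose details are not given here. It assumes an ungapped, plus-strand translated alignment between an EST and an orthologous protein, and simply projects the alignment start back to where residue 1 of the protein would fall in the EST. The function name and the example sequence are hypothetical.

```python
def predicted_start_codon(est, q_start, s_start):
    """Locate the EST codon that would align with residue 1 of the protein.

    est     : EST nucleotide sequence, same strand and frame as the alignment
    q_start : 1-based nucleotide position in the EST where the alignment begins
    s_start : 1-based residue position in the protein where the alignment begins
    Returns (position, has_atg); position is None when the EST is too short
    at its 5' end to contain the projected start codon.
    """
    # Each protein residue upstream of the alignment start costs three nucleotides
    pos = q_start - 3 * (s_start - 1)
    if pos < 1:
        return None, False
    codon = est[pos - 1:pos + 2].upper()
    return pos, codon == "ATG"


# Hypothetical EST: 6 nt of 5' UTR, then ATG GCT AAA GTT CTG
est = "GGCACCATGGCTAAAGTTCTG"
# Suppose the alignment starts at protein residue 3 and EST nucleotide 13
position, has_atg = predicted_start_codon(est, q_start=13, s_start=3)
```

A clone whose EST yields `has_atg == True` would be a candidate for containing the complete 5' end of the coding sequence under these assumptions; a result of `None` suggests a 5'-truncated clone.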
Abstract:
Wurst is a protein threading program with an emphasis on high quality sequence to structure alignments (http://www.zbh.uni-hamburg.de/wurst). Submitted sequences are aligned to each of about 3000 templates with a conventional dynamic programming algorithm, but using a score function with sophisticated structure and sequence terms. The structure terms are a log-odds probability of sequence to structure fragment compatibility, obtained from a Bayesian classification procedure. A simplex optimization was used to optimize the sequence-based terms for the goal of alignment and model quality and to balance the sequence and structural contributions against each other. Both sequence and structural terms operate with sequence profiles.
Abstract:
Xanthomonas axonopodis pv. passiflorae causes bacterial spot in passion fruit. It attacks the purple and yellow passion fruit as well as the sweet passion fruit. The diversity of 87 isolates of pv. passiflorae collected from across 22 fruit orchards in Brazil was evaluated using molecular profiles and statistical procedures, including an unweighted pair-group method with arithmetical averages-based dendrogram, analysis of molecular variance (AMOVA), and an assigning test that provides information on genetic structure at the population level. Isolates from another eight pathovars were included in the molecular analyses and all were shown to have a distinct repetitive sequence-based polymerase chain reaction profile. Amplified fragment length polymorphism technique revealed considerable diversity among isolates of pv. passiflorae, and AMOVA showed that most of the variance (49.4%) was due to differences between localities. Cluster analysis revealed that most genotypic clusters were homogeneous and that variance was associated primarily with geographic origin. The disease adversely affects fruit production and may kill infected plants. A method for rapid diagnosis of the pathogen, even before the disease symptoms become evident, has value for producers. Here, a set of primers (Xapas) was designed by exploiting a single-nucleotide polymorphism between the sequences of the intergenic 16S-23S rRNA spacer region of the pathovars. Xapas was shown to effectively detect all pv. passiflorae isolates and is recommended for disease diagnosis in passion fruit orchards.