9 results for Conserved gene synteny
in the Biblioteca Digital da Produção Intelectual da Universidade de São Paulo
Abstract:
INTRODUCTION: The symptoms of Brazilian borreliosis resemble the clinical manifestations of Lyme disease (LD). However, there are differences between the two in terms of epidemiological and laboratory findings. Primers usually employed to diagnose LD have failed to detect Borrelia strains in Brazil. OBJECTIVE: We aimed to identify the Brazilian Borrelia using a conserved gene that synthesizes the flagellar hook (flgE) of Borrelia burgdorferi sensu lato. METHOD: Three patients presenting with erythema migrans and positive epidemiological histories were recruited for the study. Blood samples were collected, and the DNA was extracted by commercial kits. RESULTS: The gene flgE was amplified from DNA of all selected patients. Upon sequencing, these positive samples revealed 99% homology to B. burgdorferi flgE. CONCLUSION: These results support the existence of borreliosis in Brazil. However, it is unclear whether this borreliosis is caused by a genetically modified B. burgdorferi sensu stricto or by a new species of Borrelia spp.
Abstract:
In silico analyses of Leishmania spp. genome data are a powerful resource to improve the understanding of these pathogens' biology. Trypanosomatids such as Leishmania spp. have their protein-coding genes grouped in long polycistronic units of functionally unrelated genes. The control of gene expression happens by a variety of posttranscriptional mechanisms. The high degree of synteny among Leishmania species is accompanied by highly conserved coding sequences (CDS) and poorly conserved intercoding untranslated sequences. To identify the elements involved in the control of gene expression, we conducted an in silico investigation to find conserved intercoding sequences (CICS) in the genomes of L. major, L. infantum, and L. braziliensis. We used a combination of computational tools, such as Linux-Shell, PERL and R languages, BLAST, MSPcrunch, SSAKE, and Pred-A-Term algorithms to construct a pipeline which was able to: (i) search for conservation in target-regions, (ii) eliminate CICS redundancy and mask repeat elements, (iii) predict the mRNA's extremities, (iv) analyze the distribution of orthologous genes within the generated LeishCICS-clusters, (v) assign GO terms to the LeishCICS-clusters, and (vi) provide statistical support for the gene-enrichment annotation. We associated the LeishCICS-cluster data, generated at the end of the pipeline, with the expression profile of L. donovani genes during promastigote-amastigote differentiation, as previously evaluated by others (GEO accession: GSE21936). A Pearson's correlation coefficient greater than 0.5 was observed for 730 LeishCICS-clusters containing from 2 to 17 genes. The designed computational pipeline is a useful tool and its application identified potential regulatory cis elements and putative regulons in Leishmania. (C) 2012 Elsevier B.V. All rights reserved.
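The correlation step mentioned in this abstract can be illustrated with a minimal Python sketch. The abstract does not say how a single coefficient was obtained per cluster, so the averaging of pairwise Pearson coefficients below is an assumption, and the function names and the toy expression profiles are hypothetical.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson's correlation coefficient between two equal-length profiles."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("profiles must have the same length (at least 2)")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant profile")
    return sxy / sqrt(sxx * syy)


def mean_pairwise_correlation(profiles):
    """Average Pearson coefficient over all gene pairs in one cluster.

    `profiles` maps a gene name to its expression values across time points.
    Averaging over pairs is an assumption made for this sketch only.
    """
    pairs = list(combinations(profiles.values(), 2))
    if not pairs:
        raise ValueError("a cluster needs at least two genes")
    return sum(pearson(a, b) for a, b in pairs) / len(pairs)


# Hypothetical cluster of three genes measured at four time points.
cluster = {
    "geneA": [1.0, 2.0, 3.0, 4.0],
    "geneB": [2.0, 4.0, 6.0, 8.0],
    "geneC": [4.0, 3.0, 2.0, 1.0],
}
score = mean_pairwise_correlation(cluster)
passes_threshold = score > 0.5  # 0.5 is the cutoff quoted in the abstract
```

In this toy cluster geneC runs opposite to the other two genes, so the mean falls below the cutoff and the cluster would not count among those reported as correlated.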
Abstract:
Coccidiosis of the domestic fowl is a worldwide disease caused by seven species of protozoan parasites of the genus Eimeria. The genome of the model species, Eimeria tenella, presents a complexity of 55-60 Mb distributed in 14 chromosomes. Relatively few studies have been undertaken to unravel the complexity of the transcriptome of Eimeria parasites. We report here the generation of more than 45,000 open reading frame expressed sequence tag (ORESTES) cDNA reads of E. tenella, Eimeria maxima and Eimeria acervulina, covering several developmental stages: unsporulated oocysts, sporoblastic oocysts, sporulated oocysts, sporozoites and second generation merozoites. All reads were assembled to constitute gene indices and submitted to a comprehensive functional annotation pipeline. In the case of E. tenella, we also incorporated publicly available ESTs to generate an integrated body of information. Orthology analyses have identified genes conserved across different apicomplexan parasites, as well as genes restricted to the genus Eimeria. Digital expression profiles obtained from ORESTES/EST counts, submitted to clustering analyses, revealed a high conservation pattern across the three Eimeria spp. Distance trees showed that unsporulated and sporoblastic oocysts constitute a distinct clade in all species, with sporulated oocysts forming a more external branch. This latter stage also shows a close relationship with sporozoites, whereas first and second generation merozoites are more closely related to each other than to sporozoites. The profiles were unambiguously associated with the distinct developmental stages and strongly correlated with the order of the stages in the parasite life cycle. Finally, we present The Eimeria Transcript Database (http://www.coccidia.icb.usp.br/eimeriatdb), a website that provides open access to all sequencing data, annotation and comparative analysis.
We expect this repository to represent a useful resource to the Eimeria scientific community, helping to define potential candidates for the development of new strategies to control coccidiosis of the domestic fowl. (C) 2011 Australian Society for Parasitology Inc. Published by Elsevier Ltd. All rights reserved.
Abstract:
The tax gene of human T-lymphotropic virus type 1 (HTLV-1) diverges among isolates according to geographic regions and has been classified into two genotypes: taxA and taxB. In Brazil, taxA is the most prevalent genotype in symptomatic and asymptomatic carriers. Few studies have been conducted in HIV-infected patients. The present study characterized the tax gene (1059 bp) in 13 Brazilian HIV-1/HTLV-1-coinfected patients from the south and southeast regions. The results confirmed the transcontinental HTLV-1 subgroup A of the Cosmopolitan subtype and showed high nucleotide similarity both among Brazilian sequences and in relation to the ATK prototype (99.5% and 99.2%, respectively). Six nucleotide substitutions were highly conserved among isolates, ranging from 76.9% to 100%: C7401T, T7914C, C7920T, C7982T, G8231A, and A8367C. The presence of the Brazilian molecular signature of genotype taxA was confirmed in all of the isolates, and they clustered into two Latin American clusters, which confirms the double introduction of HTLV-1 in Brazil.
Abstract:
The classic approach to gene discovery relies on the construction of linkage maps. We report the first molecular-based linkage map for Drosophila mediopunctata, a neotropical species of the tripunctata group. Eight hundred F2 individuals were genotyped at 49 microsatellite loci, resulting in a map that is approximately 450 centimorgans long. Five linkage groups were detected, and the species' chromosomes were identified through cross-references to BLASTn searches and Muller elements. Strong synteny was observed when compared with the Drosophila melanogaster chromosome arms, but little conservation in the gene order was seen. The incorporation of morphological data corresponding to the number of central abdominal spots on the map was consistent with the expected location of a genomic region responsible for the phenotype on the second chromosome.
Abstract:
The non-classical human leukocyte antigen (HLA) class I genes present a very low rate of variation. So far, only 10 HLA-E alleles encoding three proteins have been described, but only two are frequently found in worldwide populations. Because of their historical background, Brazilians are very suitable for population genetic studies. Therefore, 104 bone marrow donors from Brazil were evaluated for HLA-E exons 1-4. Seven variation sites were found, including two known single nucleotide polymorphisms (SNPs) at positions +424 and +756 and five new SNPs at positions +170 (intron 1), +1294 (intron 3), +1625, +1645 and +1857 (exon 4). Haplotyping analysis showed eight haplotypes: three known ones (E*01:01:01, E*01:03:01 and E*01:03:02:01) and five new HLA-E alleles that carry the new variation sites. The HLA-E*01:01:01 allele was the predominant haplotype (62.50%), followed by E*01:03:02:01 (24.52%). Selective neutrality tests have disclosed an interesting pattern of selective pressures in which balancing selection is probably shaping allele frequency distributions at an SNP at exon 3 (codon 107), whereas sequence diversity at exon 4 and the non-coding regions is facing significant purifying pressure. Even in an admixed population such as the Brazilian one, the HLA-E locus is very conserved, presenting few polymorphic SNPs in the coding region.
Abstract:
SERA5 is regarded as a promising malaria vaccine candidate of the most virulent human malaria parasite Plasmodium falciparum. SERA5 is a 120 kDa abundantly expressed blood-stage protein containing a papain-like protease. Since substantial polymorphism in blood-stage vaccine candidates may potentially limit their efficacy, it is imperative to fully investigate polymorphism of the SERA5 gene (sera5). In this study, we performed evolutionary and population genetic analysis of sera5. The level of inter-species divergence (kS = 0.076) between P. falciparum and Plasmodium reichenowi, a closely related chimpanzee malaria parasite, is comparable to that of housekeeping protein genes. A signature of purifying selection was detected in the proenzyme and enzyme domains. Analysis of 445 near full-length P. falciparum sera5 sequences from nine countries in Africa, Southeast Asia, Oceania and South America revealed extensive variations in the number of octamer repeat (OR) and serine repeat (SR) regions as well as a substantial level of single nucleotide polymorphism (SNP) in non-repeat regions (2562 bp). Remarkably, a 14 amino acid sequence of SERA5 (amino acids 59-72) that is known to be the in vitro target of parasite growth inhibitory antibodies was found to be perfectly conserved in all 445 worldwide isolates of P. falciparum evaluated. Unlike other major vaccine target antigen genes such as merozoite surface protein-1, apical membrane antigen-1 or circumsporozoite protein, no strong evidence for positive selection was detected for SNPs in the non-repeat regions of sera5. A biased geographical distribution was observed in SNPs as well as in the haplotypes of the sera5 OR and SR regions. In Africa, OR- and SR-haplotypes with low frequency (<5%) and SNPs with minor allele frequency (<5%) were abundant and were mostly continent-specific.
Consistently, significant genetic differentiation, assessed by the Wright's fixation index (FST) of inter-population variance in allele frequencies, was detected for SNPs and both OR- and SR-haplotypes among almost all parasite populations. The exception was parasite populations between Tanzania and Ghana, suggesting frequent gene flow in Africa. The present study points to the importance of investigating whether biased geographical distribution for SNPs and repeat variants in the OR and SR regions affect the reactivity of human serum antibodies to variants. (C) 2011 Elsevier Ltd. All rights reserved.
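As a rough illustration of the fixation index mentioned in this abstract, the following Python sketch computes the textbook form FST = (HT - HS) / HT for a single biallelic SNP. It assumes equal weighting of populations and is not necessarily the estimator used by the authors; the function names and allele frequencies are hypothetical.

```python
def expected_heterozygosity(p):
    """Expected heterozygosity 2p(1 - p) at a biallelic site."""
    return 2.0 * p * (1.0 - p)


def fst(allele_freqs):
    """Wright's FST for one biallelic SNP across several populations.

    `allele_freqs` holds the frequency of the same allele in each population.
    Populations are weighted equally, which is an assumption of this sketch.
    """
    if len(allele_freqs) < 2:
        raise ValueError("at least two populations are required")
    if any(p < 0.0 or p > 1.0 for p in allele_freqs):
        raise ValueError("allele frequencies must lie in [0, 1]")
    k = len(allele_freqs)
    p_bar = sum(allele_freqs) / k
    h_total = expected_heterozygosity(p_bar)  # HT: pooled population
    if h_total == 0.0:
        return 0.0  # site is monomorphic everywhere, no differentiation
    h_within = sum(expected_heterozygosity(p) for p in allele_freqs) / k  # HS
    return (h_total - h_within) / h_total


# Hypothetical frequencies of one SNP allele in two parasite populations.
similar = fst([0.30, 0.32])    # close to 0: little differentiation
divergent = fst([0.05, 0.90])  # much larger: strong differentiation
```

Values near zero, as in the first toy pair, are what frequent gene flow between two populations would be expected to produce.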
Abstract:
Background: The mitochondrial DNA of kinetoplastid flagellates is distinctive in the eukaryotic world due to its massive size, complex form and large sequence content. Comprised of catenated maxicircles that contain rRNA and protein-coding genes and thousands of heterogeneous minicircles encoding small guide RNAs, the kinetoplast network has evolved along with an extreme form of mRNA processing in the form of uridine insertion and deletion RNA editing. Many maxicircle-encoded mRNAs cannot be translated without this post-transcriptional sequence modification. Results: We present the complete sequence and annotation of the Trypanosoma cruzi maxicircles for the CL Brener and Esmeraldo strains. Gene order is syntenic with Trypanosoma brucei and Leishmania tarentolae maxicircles. The non-coding components have strain-specific repetitive regions and a variable region that is unique for each strain with the exception of a conserved sequence element that may serve as an origin of replication, but shows no sequence identity with L. tarentolae or T. brucei. Alternative assemblies of the variable region demonstrate intra-strain heterogeneity of the maxicircle population. The extent of mRNA editing required for particular genes approximates that seen in T. brucei. Extensively edited genes were more divergent among the genera than non-edited and rRNA genes. Esmeraldo contains a unique 236-bp deletion that removes the 5'-ends of ND4 and CR4 and the intergenic region. Esmeraldo shows additional insertions and deletions outside of areas edited in other species in ND5, MURF1, and MURF2, while CL Brener has a distinct insertion in MURF2. Conclusion: The CL Brener and Esmeraldo maxicircles represent two of three previously defined maxicircle clades and promise utility as taxonomic markers. Restoration of the disrupted reading frames might be accomplished by strain-specific RNA editing.
Elements in the non-coding region may be important for replication, transcription, and anchoring of the maxicircle within the kinetoplast network.
Abstract:
Background: The structure of regulatory networks remains an open question in our understanding of complex biological systems. Interactions during complete viral life cycles present unique opportunities to understand how host-parasite networks take shape and behave. The Anticarsia gemmatalis multiple nucleopolyhedrovirus (AgMNPV) is a large double-stranded DNA virus, whose genome may encode 152 open reading frames (ORFs). Here we present the analysis of the ordered cascade of the AgMNPV gene expression. Results: We observed an earlier onset of the expression than previously reported for other baculoviruses, especially for genes involved in DNA replication. Most ORFs were expressed at higher levels in a more permissive host cell line. Genes with more than one copy in the genome had distinct expression profiles, which could indicate the acquisition of new functionalities. The transcription gene regulatory network (GRN) for 149 ORFs had a modular topology comprising five communities of highly interconnected nodes that separated key genes that are functionally related on different communities, possibly maximizing redundancy and GRN robustness by compartmentalization of important functions. Core conserved functions showed expression synchronicity, distinct GRN features and significantly less genetic diversity, consistent with evolutionary constraints imposed in key elements of biological systems. This reduced genetic diversity also had a positive correlation with the importance of the gene in our estimated GRN, supporting a relationship between phylogenetic data of baculovirus genes and network features inferred from expression data. We also observed that gene arrangement in overlapping transcripts was conserved among related baculoviruses, suggesting a principle of genome organization.
Conclusions: Albeit with a reduced number of nodes (149), the AgMNPV GRN had a topology and key characteristics similar to those observed in complex cellular organisms, which indicates that modularity may be a general feature of biological gene regulatory networks.