975 resultados para Open Reading Frame
Resumo:
The ribonucleotide reductase gene tandem bnrdE/bnrdF in SPbeta-related prophages of different Bacillus spp. isolates presents different configurations of intervening sequences, comprising one to three of six non-homologous splicing elements. Insertion sites of group I introns and intein DNA are clustered in three relatively short segments encoding functionally important domains of the ribonucleotide reductase. Comparison of the bnrdE homologs reveals mutual exclusion of a group I intron and an intein coding sequence flanking the codon that specifies a conserved cysteine. In vivo splicing was demonstrated for all introns. However, for two of them a part of the mRNA precursor molecules remains unspliced. Intergenic bnrdE-bnrdF regions are unexpectedly long, comprising between 238 and 541 nt. The longest encodes a putative polypeptide related to HNH homing endonucleases.
Resumo:
The Bacillus subtilis strain 168 chromosomal region extending from 109 degrees to 112 degrees has been sequenced. Among the 35 ORFs identified, cotT and rapA were the only genes that had been previously mapped and sequenced. Out of ten ORFs belonging to a single putative transcription unit, seven are probably involved in hexuronate catabolism. Their sequences are homologous to Escherichia coli genes exuT, uidB, uxaA, uxaB, uxaC, uxuA and uxuB, which are all required for the uptake of free D-glucuronate, D-galacturonate and beta-glucuronide, and their transformation into glyceraldehyde 3-phosphate and pyruvate via 2-keto-3-deoxygluconate. The remaining three ORFs encode two dehydrogenases and a transcriptional regulator. The operon is preceded by a putative catabolite-responsive element (CRE), located between a hypothetical promoter and the RBS of the first gene. This element, the longest and the only so far described that is fully symmetrical, consists of a 26 bp palindrome matching the theoretical B. subtilis CRE sequence. The remaining predicted amino acid sequences that share homologies with other proteins comprise: a cytochrome P-450, a glycosyltransferase, an ATP-binding cassette transporter, a protein similar to the formate dehydrogenase alpha-subunit (FdhA), protein similar to NADH dehydrogenases, and three homologues of polypeptides that have undefined functions.
Resumo:
SEN virus (SENV) is a circular, single stranded DNA virus that has been first characterized in the serum of a human immunodeficiency virus type 1 (HIV-1)-infected patient. Eight genotypes of SENV (A-H) have been identified and further recognized as variants of TT virus (TTV) in the family Circoviridae. Here we describe the first genomic characterization of a SENV isolate (5-A) from South America. Using 'universal' primers, able to amplify most, if not all, TTV/SENV genotypes, a segment of > 3 kb was amplified by polymerase chain reaction from the serum of an HIV-1 infected patient. The amplicon was cloned and a 3087-nucleotide sequence was determined, that showed a high (85%) homology with the sequence of the Italian isolate SENV-F. Proteins encoded by open reading frames (ORFs) 1 to 4 consisted of 758, 129, 276, and 267 amino acids, respectively. By phylogenetic analysis, isolate 5-A was classified into TTV genotype 19 (phylogenetic group 3), together with SENV-F and TTV isolate SAa-38.
Resumo:
Trypanosoma cruzi acute infections often go unperceived, but one third of chronically infected individuals die of Chagas disease, showing diverse manifestations affecting the heart, intestines, and nervous systems. A common denominator of pathology in Chagas disease is the minimal rejection unit, whereby parasite-free target host cells are destroyed by immune system mononuclear effectors cells infiltrates. Another key feature stemming from T. cruzi infection is the integration of kDNA minicircles into the vertebrate host genome; horizontal transfer of the parasite DNA can undergo vertical transmission to the progeny of mammals and birds. kDNA integration-induced mutations can enter multiple loci in diverse chromosomes, generating new genes, pseudo genes and knock-outs, and resulting in genomic shuffling and remodeling over time. As a result of the juxtaposition of kDNA insertions with host open reading frames, novel chimeric products may be generated. Germ line transmission of kDNA-mutations determined the appearance of lesions in birds that are indistinguishable from those seen in Chagas disease patients. The production of tissue lesions showing typical minimal rejection units in birds' refractory to T. cruzi infection is consistent with the hypothesis that autoimmunity, likely triggered by integration-induced phenotypic alterations, plays a major role in the pathogenesis of Chagas disease.
Resumo:
As acute nonlymphocytic leukemia (ANLL) with inv(16) (p13q22) or t(16;16)(p13;q22) has been shown to result from the fusion of transcription factor subunit core binding factor (CBFB) to a myosin heavy chain (MYH11), we sought to design methods to detect this rearrangement using reverse transcriptase-polymerase chain reaction (RT-PCR). In all of 27 inv(16)(p13q22) and four t(16;16)(p13;q22) cases tested, a chimeric CBFB-MYH11 transcript coding for an in-frame fusion protein was detected. In a more extensive RT-PCR analysis with different primer pairs, we detected a second new chimeric CBFB-MYH11 transcript in 10 of 11 patients tested. The CBFB-MYH11 reading frame of the second transcript was maintained in one patient but not in the others. We show that the different CBFB-MYH11 transcripts in one patient arise from alternative splicing. Translation of the transcript in which the CBFB-MYH11 reading frame is not maintained leads to a slightly truncated CBFB protein.
Resumo:
The classical minor lymphocyte stimulating (Mls) antigens, which induce a strong primary T cell response in vitro, are closely linked to endogenous copies of mouse mammary tumor viruses (MMTV). Expression of Mls genes leads to clonal deletion of T cell subsets expressing specific T cell receptor (TCR) V beta chains. We describe the isolation and characterization of a new exogenous (infectious) MMTV with biological properties similar to the Mls antigen Mls-1a. In vivo administration of either Mls-1a-expressing B cells or the infectious MMTV (SW) led to an increase of T cells expressing V beta 6 followed by their deletion. Surprisingly, different kinetics of deletion were observed with the exogenous virus depending upon the route of infection. Infection through the mucosa led to a slow deletion of V beta 6+ T cells, whereas deletion was rapid after subcutaneous infection. Sequence analysis of the open reading frames in the 3' long terminal repeat of both this exogenous MMTV (SW) and of Mtv-7 (which is closely linked to Mls-1a) revealed striking similarities, particularly in the COOH terminus, which has been implicated in TCR V beta recognition. The identification of an infectious MMTV with the properties of a strong Mls antigen provides a new, powerful tool to study immunity and tolerance in vivo.
Resumo:
The characterisation of the gene encoding Trypanosoma cruzi CL Brener phosphofructokinase (PFK) and the biochemical properties of the expressed enzyme are reported here. In contradiction with previous reports, the PFK genes of CL Brener and YBM strain T. cruzi were found to be similar to their Leishmania mexicana and Trypanosoma brucei homologs in terms of both kinetic properties and size, with open reading frames encoding polypeptides with a deduced molecular mass of 53,483. The predicted amino acid sequence contains the C-terminal glycosome-targeting tripeptide SKL; this localisation was confirmed by immunofluorescence assays. In sequence comparisons with the genes of other eukaryotes, it was found that, despite being an adenosine triphosphate-dependent enzyme, T. cruzi PFK shows significant sequence similarity with inorganic pyrophosphate-dependent PFKs.
Resumo:
BACKGROUND: Infection with Leishmania parasites causes mainly cutaneous lesions at the site of the sand fly bite. Inflammatory metastatic forms have been reported with Leishmania species such as L. braziliensis, guyanensis and aethiopica. Little is known about the factors underlying such exacerbated clinical presentations. Leishmania RNA virus (LRV) is mainly found within South American Leishmania braziliensis and guyanensis. In a mouse model of L. guyanensis infection, its presence is responsible for an hyper-inflammatory response driven by the recognition of the viral dsRNA genome by the host Toll-like Receptor 3 leading to an exacerbation of the disease. In one instance, LRV was reported outside of South America, namely in the L. major ASKH strain from Turkmenistan, suggesting that LRV appeared before the divergence of Leishmania subgenera. LRV presence inside Leishmania parasites could be one of the factors implicated in disease severity, providing rationale for LRV screening in L. aethiopica. METHODOLOGY/PRINCIPAL FINDINGS: A new LRV member was identified in four L. aethiopica strains (LRV-Lae). Three LRV-Lae genomes were sequenced and compared to L. guyanensis LRV1 and L. major LRV2. LRV-Lae more closely resembled LRV2. Despite their similar genomic organization, a notable difference was observed in the region where the capsid protein and viral polymerase open reading frames overlap, with a unique -1 situation in LRV-Lae. In vitro infection of murine macrophages showed that LRV-Lae induced a TLR3-dependent inflammatory response as previously observed for LRV1. CONCLUSIONS/SIGNIFICANCE: In this study, we report the presence of an immunogenic dsRNA virus in L. aethiopica human isolates. This is the first observation of LRV in Africa, and together with the unique description of LRV2 in Turkmenistan, it confirmed that LRV was present before the divergence of the L. (Leishmania) and (Viannia) subgenera. The potential implication of LRV-Lae on disease severity due to L. aethiopica infections is discussed.
Resumo:
The objective of this study was to detect and identify hepatitis E virus (HEV) strains in liver and bile samples from slaughtered pigs in the state of Paraná, Brazil. Liver and bile samples were collected from 118 asymptomatic adult pigs at a slaughterhouse in a major Brazilian pork production area. The samples were assayed using a nested reverse transcription-polymerase chain reaction protocol with primer sets targeting open reading frames (ORF)1 and 2 of the HEV genome. HEV RNA was detected in two (1.7%) liver samples and one (0.84%) bile sample using both primers sets. The HEV strains were classified as genotype 3b on the basis of their nucleotide sequences. These data suggest that healthy pigs may be a source of HEV infection for consumers of pig liver and slaughterhouse workers in Brazil.
Resumo:
The origin of new genes through gene duplication is fundamental to the evolution of lineage- or species-specific phenotypic traits. In this report, we estimate the number of functional retrogenes on the lineage leading to humans generated by the high rate of retroposition (retroduplication) in primates. Extensive comparative sequencing and expression studies coupled with evolutionary analyses and simulations suggest that a significant proportion of recent retrocopies represent bona fide human genes. We estimate that at least one new retrogene per million years emerged on the human lineage during the past approximately 63 million years of primate evolution. Detailed analysis of a subset of the data shows that the majority of retrogenes are specifically expressed in testis, whereas their parental genes show broad expression patterns. Consistently, most retrogenes evolved functional roles in spermatogenesis. Proteins encoded by X chromosome-derived retrogenes were strongly preserved by purifying selection following the duplication event, supporting the view that they may act as functional autosomal substitutes during X-inactivation of late spermatogenesis genes. Also, some retrogenes acquired a new or more adapted function driven by positive selection. We conclude that retroduplication significantly contributed to the formation of recent human genes and that most new retrogenes were progressively recruited during primate evolution by natural and/or sexual selection to enhance male germline function.
Resumo:
MOTIVATION: Supporting the functionality of recent duplicate gene copies is usually difficult, owing to high sequence similarity between duplicate counterparts and shallow phylogenies, which hamper both the statistical and experimental inference. RESULTS: We developed an integrated evolutionary approach to identify functional duplicate gene copies and other lineage-specific genes. By repeatedly simulating neutral evolution, our method estimates the probability that an ORF was selectively conserved and is therefore likely to represent a bona fide coding region. In parallel, our method tests whether the accumulation of non-synonymous substitutions reveals signatures of selective constraint. We show that our approach has high power to identify functional lineage-specific genes using simulated and real data. For example, a coding region of average length (approximately 1400 bp), restricted to hominoids, can be predicted to be functional in approximately 94-100% of cases. Notably, the method may support functionality for instances where classical selection tests based on the ratio of non-synonymous to synonymous substitutions fail to reveal signatures of selection. Our method is available as an automated tool, ReEVOLVER, which will also be useful to systematically detect functional lineage-specific genes of closely related species on a large scale. AVAILABILITY: ReEVOLVER is available at http://www.unil.ch/cig/page7858.html.
Resumo:
The organization of lin genes and IS6100 was studied in three strains of Sphingomonas paucimobilis (B90A, Sp+, and UT26) which degraded hexachlorocyclohexane (HCH) isomers but which had been isolated at different geographical locations. DNA-DNA hybridization data revealed that most of the lin genes in these strains were associated with IS6100, an insertion sequence classified in the IS6 family and initially found in Mycobacterium fortuitum. Eleven, six, and five copies of IS6100 were detected in B90A, Sp+, and UT26, respectively. IS6100 elements in B90A were sequenced from five, one, and one regions of the genomes of B90A, Sp+, and UT26, respectively, and were found to be identical. DNA-DNA hybridization and DNA sequencing of cosmid clones also revealed that S. paucimobilis B90A contains three and two copies of linX and linA, respectively, compared to only one copy of these genes in strains Sp+ and UT26. Although the copy number and the sequence of the remaining genes of the HCH degradative pathway (linB, linC, linD, and linE) were nearly the same in all strains, there were striking differences in the organization of the linA genes as a result of replacement of portions of DNA sequences by IS6100, which gave them a strange mosaic configuration. Spontaneous deletion of linD and linE from B90A and of linA from Sp+ occurred and was associated either with deletion of a copy of IS6100 or changes in IS6100 profiles. The evidence gathered in this study, coupled with the observation that the G+C contents of the linA genes are lower than that of the remaining DNA sequence of S. paucimobilis, strongly suggests that all these strains acquired the linA gene through horizontal gene transfer mediated by IS6100. The association of IS6100 with the rest of the lin genes further suggests that IS6100 played a role in shaping the current lin gene organization.
Resumo:
Conservation of the function of open reading frames recently identified in fungal genome projects can be assessed by complementation of deletion mutants of putative Saccharomyces cerevisiae orthologs. A parallel complementation assay expressing the homologous wild type S. cerevisiae gene is generally performed as a positive control. However, we and others have found that failure of complementation can occur in this case. We investigated the specific cases of S. cerevisiae TBF1 and TIM54 essential genes. Heterologous complementation with Candida glabrata TBF1 or TIM54 gene was successful using the constitutive promoters TDH3 and TEF. In contrast, homologous complementation with S. cerevisiae TBF1 or TIM54 genes failed using these promoters, and was successful only using the natural promoters of these genes. The reduced growth rate of S. cerevisiae complemented with C. glabrata TBF1 or TIM54 suggested a diminished functionality of the heterologous proteins compared to the homologous proteins. The requirement of the homologous gene for the natural promoter was alleviated for TBF1 when complementation was assayed in the absence of sporulation and germination, and for TIM54 when two regions of the protein presumably responsible for a unique translocation pathway of the TIM54 protein into the mitochondrial membrane were deleted. Our results demonstrate that the use of different promoters may prove necessary to obtain successful complementation, with use of the natural promoter being the best approach for homologous complementation.
Resumo:
Little is known about the relation between the genome organization and gene expression in Leishmania. Bioinformatic analysis can be used to predict genes and find homologies with known proteins. A model was proposed, in which genes are organized into large clusters and transcribed from only one strand, in the form of large polycistronic primary transcripts. To verify the validity of this model, we studied gene expression at the transcriptional, post-transcriptional and translational levels in a unique locus of 34kb located on chr27 and represented by cosmid L979. Sequence analysis revealed 115 ORFs on either DNA strand. Using computer programs developed for Leishmania genes, only nine of these ORFs, localized on the same strand, were predicted to code for proteins, some of which show homologies with known proteins. Additionally, one pseudogene, was identified. We verified the biological relevance of these predictions. mRNAs from nine predicted genes and proteins from seven were detected. Nuclear run-on analyses confirmed that the top strand is transcribed by RNA polymerase II and suggested that there is no polymerase entry site. Low levels of transcription were detected in regions of the bottom strand and stable transcripts were identified for four ORFs on this strand not predicted to be protein-coding. In conclusion, the transcriptional organization of the Leishmania genome is complex, raising the possibility that computer predictions may not be comprehensive.
Resumo:
This report presents systematic empirical annotation of transcript products from 399 annotated protein-coding loci across the 1% of the human genome targeted by the Encyclopedia of DNA elements (ENCODE) pilot project using a combination of 5' rapid amplification of cDNA ends (RACE) and high-density resolution tiling arrays. We identified previously unannotated and often tissue- or cell-line-specific transcribed fragments (RACEfrags), both 5' distal to the annotated 5' terminus and internal to the annotated gene bounds for the vast majority (81.5%) of the tested genes. Half of the distal RACEfrags span large segments of genomic sequences away from the main portion of the coding transcript and often overlap with the upstream-annotated gene(s). Notably, at least 20% of the resultant novel transcripts have changes in their open reading frames (ORFs), most of them fusing ORFs of adjacent transcripts. A significant fraction of distal RACEfrags show expression levels comparable to those of known exons of the same locus, suggesting that they are not part of very minority splice forms. These results have significant implications concerning (1) our current understanding of the architecture of protein-coding genes; (2) our views on locations of regulatory regions in the genome; and (3) the interpretation of sequence polymorphisms mapping to regions hitherto considered to be "noncoding," ultimately relating to the identification of disease-related sequence alterations.