976 results for Open Reading Frames (ORFs)
Abstract:
We analyzed the codon usage bias of eight open reading frames (ORFs) across up to 79 human papillomavirus (HPV) genotypes from three distinct phylogenetic groups. All eight ORFs across HPV genotypes show a strong codon usage bias, amongst degenerately encoded amino acids, toward 18 codons mainly with T at the 3rd position. For all 18 degenerately encoded amino acids, codon preferences amongst human and animal PV ORFs are significantly different from those averaged across mammalian genes. Across the HPV types, the L2 ORFs show the highest codon usage bias (73.2 +/- 1.6%) and the E4 ORFs the lowest (51.1 +/- 0.5%), reflecting a similar bias in codon 3rd position A + T content (L2: 76.1 +/- 4.2%; E4: 58.6 +/- 4.5%). The E4 ORF, uniquely amongst the HPV ORFs, is G + C rich, while the other ORFs are A + T rich. Codon usage bias correlates positively with A + T content at the codon 3rd position in the E2, E6, L1 and L2 ORFs, but negatively in the E4 ORFs. A general conservation of preferred codon usage across human and non-human PV genotypes, whether they originate from the same supergroup or not, together with the observed difference between the preferred codon usage for HPV ORFs and for genes of the cells they infect, suggests that specific codon usage bias and A + T content variation may somehow increase the replicational fitness of HPVs in mammalian epithelial cells, and may have practical implications for gene therapy of HPV infection. (C) 2003 Elsevier B.V. All rights reserved.
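The third-position A + T content mentioned in this abstract can be illustrated with a minimal Python sketch. The function name `third_position_at_content` and the toy sequence are hypothetical and not taken from the study; the sketch assumes an in-frame coding sequence made of unambiguous bases and does not reproduce the authors' own codon usage bias measure, which the abstract does not define.

```python
def third_position_at_content(cds: str) -> float:
    """Return the percentage of codons whose 3rd base is A or T.

    Assumes `cds` starts in frame; trailing bases that do not fill
    a whole codon are ignored. RNA input (U) is treated as T.
    """
    seq = cds.upper().replace("U", "T")
    usable = len(seq) - len(seq) % 3
    if usable == 0:
        raise ValueError("sequence shorter than one codon")
    third_bases = seq[2:usable:3]  # every 3rd base, starting at index 2
    at_count = sum(base in "AT" for base in third_bases)
    return 100.0 * at_count / len(third_bases)


# Toy (hypothetical) sequence; codons: ATG GCT GAA AAG TTA
toy_cds = "ATGGCTGAAAAGTTA"
print(third_position_at_content(toy_cds))
```

Applied per ORF and per genotype, a value of this kind could be compared across ORFs such as L2 and E4, in the spirit of the comparison the abstract reports.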
Abstract:
Pili of pathogenic Neisseria are major virulence factors associated with adhesion, cytotoxicity, twitching motility, autoaggregation, and DNA transformation. Pili are modified posttranslationally by the addition of phosphorylcholine. However, no genes involved in either the biosynthesis or the transfer of phosphorylcholine in Neisseria meningitidis have been identified. In this study, we identified five candidate open reading frames (ORFs) potentially involved in the biosynthesis or transfer of phosphorylcholine to pilin in N. meningitidis. Insertional mutants were constructed for each ORF in N. meningitidis strain C311#3 to determine their effect on phosphorylcholine expression. The effect of the mutant ORFs on the modification by phosphorylcholine was analyzed by Western analysis with phosphorylcholine-specific monoclonal antibody TEPC-15. Analysis of the mutants showed that ORF NMB0415, now defined as pptA (pilin phosphorylcholine transferase A), is involved in the addition of phosphorylcholine to pilin in N. meningitidis. Additionally, the phase variation (high frequency on-off switching of expression) of phosphorylcholine on pilin is due to changes in a homopolymeric guanosine tract in pptA.
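How a length change in a homopolymeric tract can switch expression on or off can be sketched with a toy example. Everything below is hypothetical: the sequence, the tract lengths, and the helper names `build_gene` and `coding_codons` are illustrative only, and the abstract does not state the actual tract length in pptA or which length corresponds to the "on" state. The sketch simply shows that inserting one G shifts the downstream reading frame and can expose a premature stop codon.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def coding_codons(seq: str) -> int:
    """Count codons read from position 0 up to (not including) the first in-frame stop."""
    count = 0
    for i in range(0, len(seq) - 2, 3):
        if seq[i:i + 3] in STOP_CODONS:
            return count
        count += 1
    return count


def build_gene(g_tract_length: int) -> str:
    """Build a toy gene: start codon, a poly-G tract, then a fixed downstream region."""
    return "ATG" + "G" * g_tract_length + "CATGACCAAGTTCGTTAA"


# With 6 Gs the downstream region is read in frame up to its terminal TAA.
print(coding_codons(build_gene(6)))
# With 7 Gs the frame shifts and a TGA stop is reached early.
print(coding_codons(build_gene(7)))
```

In this toy case the in-frame variant encodes eight codons before the stop, while the frameshifted variant is truncated after four.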
Abstract:
Master's dissertation in Bioengineering
Abstract:
SEN virus (SENV) is a circular, single-stranded DNA virus that was first characterized in the serum of a human immunodeficiency virus type 1 (HIV-1)-infected patient. Eight genotypes of SENV (A-H) have been identified and further recognized as variants of TT virus (TTV) in the family Circoviridae. Here we describe the first genomic characterization of a SENV isolate (5-A) from South America. Using 'universal' primers able to amplify most, if not all, TTV/SENV genotypes, a segment of > 3 kb was amplified by polymerase chain reaction from the serum of an HIV-1-infected patient. The amplicon was cloned and a 3087-nucleotide sequence was determined, which showed high (85%) homology with the sequence of the Italian isolate SENV-F. Proteins encoded by open reading frames (ORFs) 1 to 4 consisted of 758, 129, 276, and 267 amino acids, respectively. By phylogenetic analysis, isolate 5-A was classified into TTV genotype 19 (phylogenetic group 3), together with SENV-F and TTV isolate SAa-38.
Abstract:
This report presents systematic empirical annotation of transcript products from 399 annotated protein-coding loci across the 1% of the human genome targeted by the Encyclopedia of DNA elements (ENCODE) pilot project using a combination of 5' rapid amplification of cDNA ends (RACE) and high-density resolution tiling arrays. We identified previously unannotated and often tissue- or cell-line-specific transcribed fragments (RACEfrags), both 5' distal to the annotated 5' terminus and internal to the annotated gene bounds for the vast majority (81.5%) of the tested genes. Half of the distal RACEfrags span large segments of genomic sequences away from the main portion of the coding transcript and often overlap with the upstream-annotated gene(s). Notably, at least 20% of the resultant novel transcripts have changes in their open reading frames (ORFs), most of them fusing ORFs of adjacent transcripts. A significant fraction of distal RACEfrags show expression levels comparable to those of known exons of the same locus, suggesting that they are not part of very minority splice forms. These results have significant implications concerning (1) our current understanding of the architecture of protein-coding genes; (2) our views on locations of regulatory regions in the genome; and (3) the interpretation of sequence polymorphisms mapping to regions hitherto considered to be "noncoding," ultimately relating to the identification of disease-related sequence alterations.
Abstract:
Background: The understanding of whole genome sequences in higher eukaryotes depends to a large degree on the reliable definition of transcription units including exon/intron structures, translated open reading frames (ORFs) and flanking untranslated regions. The best currently available chicken transcript catalog is the Ensembl build based on the mappings of a relatively small number of full-length cDNAs and ESTs to the genome as well as genome-sequence-derived in silico gene predictions. Results: We use Long Serial Analysis of Gene Expression (LongSAGE) in bursal lymphocytes and the DT40 cell line to verify the quality and completeness of the annotated transcripts. 53.6% of the more than 38,000 unique SAGE tags (unitags) match full-length bursal cDNAs, the Ensembl transcript build or the genome sequence. The majority of all matching unitags show single matches to the genome, but no matches to the genome-derived Ensembl transcript build. Nevertheless, most of these tags map close to the 3' boundaries of annotated Ensembl transcripts. Conclusions: These results suggest that rather few genes are missing in the current Ensembl chicken transcript build, but that the 3' ends of many transcripts may not have been accurately predicted. The tags with no match in the transcript sequences can now be used to improve gene predictions, pinpoint the genomic location of entirely missed transcripts and optimize the accuracy of gene finder software.
Abstract:
Pseudomonas sp. strain B13 is a bacterium known to degrade chloroaromatic compounds. The ability to use 3- and 4-chlorocatechol is determined by a self-transferable DNA element, the clc element, which normally resides at two locations in the cell's chromosome. Here we report the complete nucleotide sequence of the clc element, demonstrating the unique catabolic properties while showing its relatedness to genomic islands and integrative and conjugative elements rather than to other known catabolic plasmids. With respect to catabolic functions, the clc element harbored, in addition to the genes for chlorocatechol degradation, a complete functional operon for 2-aminophenol degradation and genes for a putative aromatic compound transport protein and for a multicomponent aromatic ring dioxygenase similar to anthranilate hydroxylase. The genes for catabolic functions were inducible under various conditions, suggesting a network of catabolic pathway induction. For about half of the open reading frames (ORFs) on the clc element, no clear functional prediction could be given, although some indications were found for functions that were similar to plasmid conjugation. The region in which these ORFs were situated displayed a high overall conservation of nucleotide sequence and gene order to genomic regions in other recently completed bacterial genomes or to other genomic islands. Most notably, except for two discrete regions, the clc element was almost 100% identical over the whole length to a chromosomal region in Burkholderia xenovorans LB400. This indicates the dynamic evolution of this type of element and the continued transition between elements with a more pathogenic character and those with catabolic properties.
Abstract:
A large number of gene products that are enriched in the striatum have ill-defined functions, although they may have key roles in age-dependent neurodegenerative diseases affecting the striatum, especially Huntington disease (HD). In the present study, we focused on Abhd11os (called ABHD11-AS1 in human), which is a putative long noncoding RNA (lncRNA) whose expression is enriched in the mouse striatum. We confirm that despite the presence of 2 small open reading frames (ORFs) in its sequence, Abhd11os is not translated into a detectable peptide in living cells. We demonstrate that Abhd11os levels are markedly reduced in different mouse models of HD. We performed in vivo experiments in mice using lentiviral vectors encoding either Abhd11os or a small hairpin RNA targeting Abhd11os. Results show that Abhd11os overexpression produces neuroprotection against an N-terminal fragment of mutant huntingtin, whereas Abhd11os knockdown is protoxic. These novel results indicate that the loss of lncRNA Abhd11os likely contributes to striatal vulnerability in HD. Our study emphasizes that lncRNAs may play crucial roles in neurodegenerative diseases.
Abstract:
Two Azospirillum brasilense open reading frames (ORFs) exhibited homology with the two-component NtrY/NtrX regulatory system from Azorhizobium caulinodans. These A. brasilense ORFs, located downstream to the nifR3ntrBC operon, were isolated, sequenced and characterized. The present study suggests that ORF1 and ORF2 correspond to the A. brasilense ntrY and ntrX genes, respectively. The amino acid sequences of A. brasilense NtrY and NtrX proteins showed high similarity to sensor/kinase and regulatory proteins, respectively. Analysis of lacZ transcriptional fusions by the β-galactosidase assay in Escherichia coli ntrC mutants showed that the NtrY/NtrX proteins failed to activate transcription of the nifA promoter of A. brasilense. The ntrYX operon complemented a nifR3ntrBC deletion mutant of A. brasilense for nitrate-dependent growth, suggesting a possible cross-talk between the NtrY/X and NtrB/C sensor/regulator pairs. Our data support the existence of another two-component regulatory system in A. brasilense, the NtrY/NtrX system, probably involved in the regulation of nitrate assimilation.
Abstract:
Recombinant adenoviruses are currently under intense investigation as potential gene delivery and gene expression vectors with applications in human and veterinary medicine. As part of our efforts to develop a bovine adenovirus type 2 (BAV2)-based vector system, the nucleotide sequence of BAV2 was determined. Sixty-six open reading frames (ORFs) were found with the potential to encode polypeptides that were at least 50 amino acid (aa) residues long. Thirty-one of the BAV2 polypeptide sequences were found to share homology with already identified adenovirus proteins. The arrangement of the genes revealed that the BAV2 genomic organization closely resembles that of well-characterized human adenoviruses. In the course of this study, continuous propagation of BAV2 over many generations in cell culture resulted in the isolation of a BAV2 spontaneous mutant in which the E3 region was deleted. Restriction enzyme, sequencing and PCR analyses produced concordant results that precisely located the deletion and revealed that its size was exactly 1299 bp. The E3-deleted virus was plaque-purified and further propagated in cell culture. It appeared that the replication of such a virus lacking a portion of the E3 region was not affected, at least in cell culture. Attempts to rescue a recombinant BAV2 virus with the bacterial kanamycin resistance gene in the E3 region yielded a candidate, as verified by extensive Southern blotting and PCR analyses. Attempts to purify the recombinant virus were not successful, suggesting that such recombinant BAV2 was helper-dependent. Ten clones containing full-length BAV2 genomes in a pWE15 cosmid vector were constructed. The infectivity of these constructs was tested by using different transfection methods. The BAV2 genomic clones appeared to be infectious only after an extended incubation period. This may be due to limitations of the various transfection methods tested, or to biological differences between virus- and E. coli-derived BAV2 DNA.
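Screening a genome for ORFs above a minimum encoded length, as in the 50-residue cutoff mentioned in this abstract, can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' procedure: the abstract does not say how ORFs were defined, so the sketch assumes ATG-to-stop ORFs, scans all six frames, and counts the initiating methionine in the length. The name `find_orfs` and the toy sequence are hypothetical.

```python
STOPS = {"TAA", "TAG", "TGA"}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an unambiguous DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def find_orfs(seq: str, min_aa: int = 50):
    """Yield (strand, frame, start, end, aa_length) for ATG-to-stop ORFs.

    start and end are 0-based offsets on the strand being read; end is the
    position of the stop codon, which is not counted in aa_length.
    ORFs that run off the end of the sequence without a stop are skipped.
    """
    seq = seq.upper()
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for frame in range(3):
            start = None
            for i in range(frame, len(s) - 2, 3):
                codon = s[i:i + 3]
                if start is None and codon == "ATG":
                    start = i  # first ATG opens the ORF
                elif start is not None and codon in STOPS:
                    aa_len = (i - start) // 3
                    if aa_len >= min_aa:
                        yield strand, frame, start, i, aa_len
                    start = None  # look for the next ORF in this frame


# Toy (hypothetical) sequence: one ORF of Met + 59 Ala codons.
toy = "CC" + "ATG" + "GCT" * 59 + "TAA" + "GG"
for orf in find_orfs(toy, min_aa=50):
    print(orf)
```

Raising `min_aa` above the length of the toy ORF returns nothing, which is the behaviour a length cutoff is meant to provide.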
Abstract:
Adenoviruses are non-enveloped icosahedral-shaped particles which possess a double-stranded DNA genome. Currently, nearly 100 serotypes of adenoviruses have been identified, 48 of which are of human origin. Bovine adenoviruses (BAVs), causing both mild respiratory and/or enteral diseases in cattle, have been reported in many countries all over the world. Currently, nine serotypes of BAVs have been isolated, which have been placed into two subgroups based on a number of characteristics which include complement fixation tests as well as the ability to replicate in various cell lines. Bovine adenovirus type 2 (BAV2), belonging to subgroup I, is able to cause pneumonia as well as pneumonic-like symptoms in calves. In this study, the genome of BAV2 (strain No. 19) was subcloned into the plasmid vector pUC19. In total, 16 plasmids were constructed; three carry internal SalI fragments (spanning 3.1 to 65.2%), and 10 carry internal PstI fragments (spanning 4.9 to 97.4%), of the viral genome. Each of these plasmids was analyzed using twelve restriction endonucleases: BamHI, ClaI, EcoRI, HindIII, KpnI, NotI, NsiI, PstI, PvuI, SalI, XbaI, and XhoI. Terminal end fragments were also cloned and analyzed, subsequent to the removal of the 5' terminal protein, in the form of 2 BamHI B fragments, cloned in opposite orientations (spanning 0 to 18.1%), and one PstI fragment (spanning 97.4 to 100%). These cloned fragments, along with two other plasmids previously constructed carrying internal EcoRI fragments (spanning 20.6 to 90.5%), were then used to construct a detailed physical restriction map using the twelve restriction endonucleases, as well as to estimate the size of the genome for BAV2 (32.5 kbp). The DNA sequences of the early region 1 (E1) and hexon-associated gene (protein IX) have also been determined. The amino acid sequences of four open reading frames (ORFs) have been compared to those of the E1 proteins and protein IX from other Ads.
Abstract:
Recombinant Adenoviruses (Ads) have been shown to have potential applications in three areas: gene therapy, high level protein expression and recombinant vaccines. At least three different locations within the Ad genome can be deleted and subsequently used for the insertion of foreign sequences. These include the Early 3 (E3), Early 1 (E1) and Early 4 (E4) regions. Viral vectors of this type have been well studied in Human Ads 2 and 5; however, one has not yet been constructed for Bovine Adenovirus Type 2 (BAV2). The E3 region is located between 76.6 and 86 m.u. on the r-strand and is transcribed in a rightward direction. The gene products of the Early 3 region (E3) have been shown to be non-essential for viral replication in vitro, but are required for host immunosurveillance. This study represents the cloning and reconstitution of a BAV2 E3 deletion mutant. A deletion of 1800 bp was made within the E3 region of BAV2 and the thymidine kinase gene was subsequently inserted in the deleted area. The plasmid pdlE3-4tk1 (23.4 kbp) was constructed and used to facilitate homologous recombination with the wild type BAV2 to produce a mutant. Southern blotting and hybridization results suggest the presence of a BAV2 E3 deletion mutant with thymidine kinase sequences present. The E4 region of Human Adenovirus types 2 and 5 is located at the extreme right end of the genome (91.3 map units - 99.1 map units) and is transcribed in a leftward direction, giving rise to a complicated set of differentially spliced mRNAs. Essentially there are 7 open reading frames (ORFs) encoding at least 7 polypeptides. The gene products encoded by the E4 region have been shown to be essential for the expression of late viral genes, host cell shutoff and normal viral growth. We have cloned and sequenced the right end segment between 90.5 map units and 100 map units of the BAV2 genome.
The results show several open reading frames which encode polypeptides exhibiting homology to three polypeptides encoded by the E4 region of human adenovirus type 2. These include the 14kDa protein encoded by ORF1, the 34kDa protein encoded by ORF6 and the 13kDa protein encoded by ORF3. The nucleotide sequence, restriction enzyme map, and ORF map of the E4 region could be very useful in future molecular manipulation of this region and could possibly explain the slow growth rate of BAV2 in MDBK cells.
Abstract:
Porcine reproductive and respiratory syndrome (PRRS) is an economically devastating viral disease affecting the swine industry worldwide. The etiological agent, PRRS virus (PRRSV), possesses an RNA viral genome with nine open reading frames (ORFs). The ORF1a and ORF1b replicase-associated genes encode the polyproteins pp1a and pp1ab, respectively. The pp1a is processed into nine non-structural proteins (nsps): nsp1a, nsp1b, and nsp2 to nsp8. Proteolytic cleavage of pp1ab generates products nsp9 to nsp12. The proteolytic pp1a cleavage products process and cleave pp1a and pp1ab into nsp products. The nsp9 to nsp12 are involved in virus genome transcription and replication. The 3' end of the viral genome encodes four minor and three major structural proteins. The GP2a, GP3 and GP4 (encoded by ORF2a, 3 and 4) are glycosylated membrane-associated minor structural proteins. The fourth minor structural protein, the E protein (encoded by ORF2b), is an unglycosylated membrane-associated protein. The viral envelope contains two major structural proteins: a glycosylated major envelope protein GP5 (encoded by ORF5) and an unglycosylated membrane M protein (encoded by ORF6). The third major structural protein is the nucleocapsid N protein (encoded by ORF7). All PRRSV non-structural and structural proteins are essential for virus replication, and PRRSV infectivity is relatively intolerant to subtle changes within the structural proteins. PRRSV virulence is multigenic and resides in both the non-structural and structural viral proteins. This review discusses the molecular characteristics and the biological and immunological functions of the PRRSV structural proteins and nsps and their involvement in the virus pathogenesis.
Abstract:
Background: The tight junction (TJ) is one of the most important structures established during merozoite invasion of host cells, and a large number of proteins stored in Toxoplasma and Plasmodium parasites' apical organelles are involved in forming the TJ. Plasmodium falciparum and Toxoplasma gondii apical membrane antigen 1 (AMA-1) and rhoptry neck proteins (RONs) are the two main TJ components. It has been shown that RON4 plays an essential role during merozoite and sporozoite invasion of target cells. This study has focused on characterizing a novel Plasmodium vivax rhoptry protein, RON4, which is homologous to PfRON4 and PkRON4. Methods: The ron4 gene was re-annotated in the P. vivax genome using various bioinformatics tools and taking PfRON4 and PkRON4 amino acid sequences as templates. Gene synteny, as well as identity and similarity values between open reading frames (ORFs) belonging to the three species, were assessed. The gene transcription of pvron4, and the expression and localization of the encoded protein, were also determined in the VCG-1 strain by molecular and immunological studies. Nucleotide and amino acid sequences obtained for pvron4 in VCG-1 were compared to those from strains coming from different geographical areas. Results: PvRON4 is a 733 amino acid long protein, which is encoded by three exons, having similar transcription and translation patterns to those reported for its homologue, PfRON4. Sequencing PvRON4 from the VCG-1 strain and comparing it to P. vivax strains from different geographical locations has shown two conserved regions separated by a low complexity variable region, possibly acting as a "smokescreen". PvRON4 contains a predicted signal sequence, a coiled-coil α-helical motif, two tandem repeats and six conserved cysteines towards the carboxy terminus and is a soluble protein lacking predicted transmembrane domains or a GPI anchor.
Indirect immunofluorescence assays have shown that PvRON4 is expressed at the apical end of schizonts and co-localizes at the rhoptry neck with PvRON2.