888 resultados para Genome Sequence
Resumo:
A complete reference genome of the Apis mellifera Filamentous virus (AmFV) was determined using Illumina Hiseq sequencing. The AmFV genome is a double stranded DNA molecule of approximately 498,500 nucleotides with a GC content of 50.8%. It encompasses 247 non-overlapping open reading frames (ORFs), equally distributed on both strands, which cover 65% of the genome. While most of the ORFs lacked threshold sequence alignments to reference protein databases, twenty-eight were found to display significant homologies with proteins present in other large double stranded DNA viruses. Remarkably, 13 ORFs had strong similarity with typical baculovirus domains such as PIFs (per os infectivity factor genes: pif-1, pif-2, pif-3 and p74) and BRO (Baculovirus Repeated Open Reading Frame). The putative AmFV DNA polymerase is of type B, but is only distantly related to those of the baculoviruses. The ORFs encoding proteins involved in nucleotide metabolism had the highest percent identity to viral proteins in GenBank. Other notable features include the presence of several collagen-like, chitin-binding, kinesin and pacifastin domains. Due to the large size of the AmFV genome and the inconsistent affiliation with other large double stranded DNA virus families infecting invertebrates, AmFV may belong to a new virus family.
Resumo:
Trypanosomes show an intriguing organization of their mitochondrial DNA into a catenated network, the kinetoplast DNA (kDNA). While more than 30 proteins involved in kDNA replication have been described, only few components of kDNA segregation machinery are currently known. Electron microscopy studies identified a high-order structure, the tripartite attachment complex (TAC), linking the basal body of the flagellum via the mitochondrial membranes to the kDNA. Here we describe TAC102, a novel core component of the TAC, which is essential for proper kDNA segregation during cell division. Loss of TAC102 leads to mitochondrial genome missegregation but has no impact on proper organelle biogenesis and segregation. The protein is present throughout the cell cycle and is assembled into the newly developing TAC only after the pro-basal body has matured indicating a hierarchy in the assembly process. Furthermore, we provide evidence that the TAC is replicated de novo rather than using a semi-conservative mechanism. Lastly, we demonstrate that TAC102 lacks an N-terminal mitochondrial targeting sequence and requires sequences in the C-terminal part of the protein for its proper localization.
Resumo:
The genomes of Fusobacterium nucleatum subspecies polymorphum strain ATCC 10953, Rickettsia typhi strain Wilmington, and Francisella tularensis subspecies holarctica strain OSU18 were sequenced, annotated, and analyzed. Each genome was then compared to the sequenced genomes of closely related bacteria. The genome of F. nucleatum ATCC 10953 was compared to two additional F. nucleatum subspecies, subspecies nucleatum and subspecies vincentii. This analysis revealed substantial evidence of horizontal gene transfer along with considerable genetic diversity within the species of F. nucleatum. R. typhi was compared to R. prowazekii and R. conorii. This analysis uncovered a hotspot for chromosomal rearrangements in the Spotted Fever Group but not the Typhus Group Rickettsia and revealed the close genetic relationship between the Typhus Group rickettsial species. F. tularensis OSU18 was compared to two additional F. tularensis strains. These comparisons uncovered significant chromosomal rearrangements between F. tularensis subspecies due to recombination between insertion sequence elements. ^
Resumo:
The creation, preservation, and degeneration of cis-regulatory elements controlling developmental gene expression are fundamental genome-level evolutionary processes about which little is known. In this study, critical differences in cis-regulatory elements controlling the expression of the sea urchin aboral ectoderm-specific spec genes were identified and explored. In genomes of species within the Strongylocentrotidae family, multiple copies of a repetitive sequence element termed RSR were present, but RSRs were not detected in genomes of species outside Strongylocentrotidae. RSRs are invariably associated with spec genes, and in Strongylocentrotus purpuratus, the spec2a RSR functioned as a transcriptional enhancer displaying greater activity than RSRs from the spec1 or spec2c paralogs. Single base-pair differences at two cis-regulatory elements within the spec2a RSR greatly increased the binding affinities of four transcription factors: SpCCAAT-binding factor at one element and SpOtx, SpGoosecoid, and SpGATA-E at another. The cis-regulatory elements to which SpCCAAT-binding factor, SpOtx, SpGoosecoid, and SpGATA-E bound were recent evolutionary acquisitions that could act either to activate or repress transcription, depending on the cell type. These elements were found in the spec2a RSR ortholog in Strongylocentrotus pallidus but not in the RSR orthologs of Strongylocentrotus droebachiensis or Hemicentrotus pulcherrimus. These results indicate that spec genes exhibit a dynamic pattern of cis-regulatory element evolution while stabilizing selection preserves their aboral ectoderm expression domain. ^
Resumo:
Unique, small sequences (sequence tag sites) have been identified at the 3′ ends of most human genes that serve as landmarks in genome mapping. We investigated whether a single copy gene could be isolated directly from total human DNA by transformation-associated recombination (TAR) cloning in yeast using a short, 3′ unique target. A TAR cloning vector was constructed that, when linearized, contained a small amount (381 bp) of 3′ hypoxanthine phosphoribosyltransferase (HPRT) sequence at one end and an 189-bp Alu repeat at the other end. Transformation with this vector along with human DNA led to selective isolations of the entire HPRT gene as yeast artificial chromosomes (YACs) that extended from the 3′ end sequence to various Alu positions as much as 600 kb upstream. These YACs were retrofitted with a NeoR and a bacterial artificial chromosome (BAC) sequence to transfer the YACs to bacteria and subsequently the BACs to mouse cells by using a Neo selection. Most of the HPRT isolates were functional, demonstrating that TAR cloning retains the functional integrity of the isolated material. Thus, this modified version of TAR cloning, which we refer to as radial TAR cloning, can be used to isolate large segments of the human genome accurately and directly with only a small amount of sequence information.
Resumo:
The recent ability to sequence whole genomes allows ready access to all genetic material. The approaches outlined here allow automated analysis of sequence for the synthesis of optimal primers in an automated multiplex oligonucleotide synthesizer (AMOS). The efficiency is such that all ORFs for an organism can be amplified by PCR. The resulting amplicons can be used directly in the construction of DNA arrays or can be cloned for a large variety of functional analyses. These tools allow a replacement of single-gene analysis with a highly efficient whole-genome analysis.
Resumo:
Cosmids from the 1A3–1A10 region of the complete miniset were individually subcloned by using the vector M13 mp18. Sequences of each cosmid were assembled from about 400 DNA fragments generated from the ends of these phage subclones and merged into one 189-kb contig. About 160 ORFs identified by the CodonUse program were subjected to similarity searches. The biological functions of 80 ORFs could be assigned reliably by using the WIT and Magpie genome investigation tools. Eighty percent of these recognizable ORFs were organized in functional clusters, which simplified assignment decisions and increased the strength of the predictions. A set of 26 genes for cobalamin biosynthesis, genes for polyhydroxyalkanoic acid metabolism, DNA replication and recombination, and DNA gyrase were among those identified. Most of the ORFs lacking significant similarity with reference databases also were grouped. There are two large clusters of these ORFs, one located between 45 and 67 kb of the map, and the other between 150 and 183 kb. Nine of the loosely identified ORFs (of 15) of the first of these clusters match ORFs from phages or transposons. The other cluster also has four ORFs of possible phage origin.
Resumo:
Peer reviewed
Resumo:
The partially overlapping ORF P and ORF O are located within the domains of the herpes simplex virus 1 genome transcribed during latency. Earlier studies have shown that ORF P is repressed by infected cell protein 4 (ICP4), the major viral regulatory protein, binding to its cognate site at the transcription initiation site of ORF P. The ORF P protein binds to p32, a component of the ASF/SF2 alternate splicing factors; in cells infected with a recombinant virus in which ORF P was derepressed there was a significant decrease in the expression of products of key regulatory genes containing introns. We report that (i) the expression of ORF O is repressed during productive infection by the same mechanism as that determining the expression of ORF P; (ii) in cells infected at the nonpermissive temperature for ICP4, ORF O protein is made in significantly lower amounts than the ORF P protein; (iii) the results of insertion of a sequence encoding 20 amino acids between the putative initiator methionine codons of ORF O and ORF P suggest that ORF O initiates at the methionine codon of ORF P and that the synthesis of ORF O results from frameshift or editing of its RNA; and (iv) glutathione S-transferase–ORF O fusion protein bound specifically ICP4 and precluded its binding to its cognate site on DNA in vitro. These and earlier results indicate that ORF P and ORF O together have the capacity to reduce the synthesis or block the expression of regulatory proteins essential for viral replication in productive infection.
Resumo:
A crucial step in exploiting the information inherent in genome sequences is to assign to each protein sequence its three-dimensional fold and biological function. Here we describe fold assignment for the proteins encoded by the small genome of Mycoplasma genitalium. The assignment was carried out by our computer server (http://www.doe-mbi.ucla.edu/people/frsvr/frsvr.html), which assigns folds to amino acid sequences by comparing sequence-derived predictions with known structures. Of the total of 468 protein ORFs, 103 (22%) can be assigned a known protein fold with high confidence, as cross-validated with tests on known structures. Of these sequences, 75 (16%) show enough sequence similarity to proteins of known structure that they can also be detected by traditional sequence–sequence comparison methods. That is, the difference of 28 sequences (6%) are assignable by the sequence–structure method of the server but not by current sequence–sequence methods. Of the remaining 78% of sequences in the genome, 18% belong to membrane proteins and the remaining 60% cannot be assigned either because these sequences correspond to no presently known fold or because of insensitivity of the method. At the current rate of determination of new folds by x-ray and NMR methods, extrapolation suggests that folds will be assigned to most soluble proteins in the next decade.
Resumo:
ETS transcription factors play important roles in hematopoiesis, angiogenesis, and organogenesis during murine development. The ETS genes also have a role in neoplasia, for example in Ewing’s sarcomas and retrovirally induced cancers. The ETS genes encode transcription factors that bind to specific DNA sequences and activate transcription of various cellular and viral genes. To isolate novel ETS target genes, we used two approaches. In the first approach, we isolated genes by the RNA differential display technique. Previously, we have shown that the overexpression of ETS1 and ETS2 genes effects transformation of NIH 3T3 cells and specific transformants produce high levels of the ETS proteins. To isolate ETS1 and ETS2 responsive genes in these transformed cells, we prepared RNA from ETS1, ETS2 transformants, and normal NIH 3T3 cell lines and converted it into cDNA. This cDNA was amplified by PCR and displayed on sequencing gels. The differentially displayed bands were subcloned into plasmid vectors. By Northern blot analysis, several clones showed differential patterns of mRNA expression in the NIH 3T3-, ETS1-, and ETS2-expressing cell lines. Sixteen clones were analyzed by DNA sequence analysis, and 13 of them appeared to be unique because their DNA sequences did not match with any of the known genes present in the gene bank. Three known genes were found to be identical to the CArG box binding factor, phospholipase A2-activating protein, and early growth response 1 (Egr1) genes. In the second approach, to isolate ETS target promoters directly, we performed ETS1 binding with MboI-cleaved genomic DNA in the presence of a specific mAb followed by whole genome PCR. The immune complex-bound ETS binding sites containing DNA fragments were amplified and subcloned into pBluescript and subjected to DNA sequence and computer analysis. We found that, of a large number of clones isolated, 43 represented unique sequences not previously identified. Three clones turned out to contain regulatory sequences derived from human serglycin, preproapolipoprotein C II, and Egr1 genes. The ETS binding sites derived from these three regulatory sequences showed specific binding with recombinant ETS proteins. Of interest, Egr1 was identified by both of these techniques, suggesting strongly that it is indeed an ETS target gene.
Resumo:
A rapidly growing area of genome research is the generation of expressed sequence tags (ESTs) in which large numbers of randomly selected cDNA clones are partially sequenced. The collection of ESTs reflects the level and complexity of gene expression in the sampled tissue. To date, the majority of plant ESTs are from nonwoody plants such as Arabidopsis, Brassica, maize, and rice. Here, we present a large-scale production of ESTs from the wood-forming tissues of two poplars, Populus tremula L. × tremuloides Michx. and Populus trichocarpa ‘Trichobel.’ The 5,692 ESTs analyzed represented a total of 3,719 unique transcripts for the two cDNA libraries. Putative functions could be assigned to 2,245 of these transcripts that corresponded to 820 protein functions. Of specific interest to forest biotechnology are the 4% of ESTs involved in various processes of cell wall formation, such as lignin and cellulose synthesis, 5% similar to developmental regulators and members of known signal transduction pathways, and 2% involved in hormone biosynthesis. An additional 12% of the ESTs showed no significant similarity to any other DNA or protein sequences in existing databases. The absence of these sequences from public databases may indicate a specific role for these proteins in wood formation. The cDNA libraries and the accompanying database are valuable resources for forest research directed toward understanding the genetic control of wood formation and future endeavors to modify wood and fiber properties for industrial use.
Resumo:
Ngrol genes (NgrolB, NgrolC, NgORF13, and NgORF14) that are similar in sequence to genes in the left transferred DNA (TL-DNA) of Agrobacterium rhizogenes have been found in the genome of untransformed plants of Nicotiana glauca. It has been suggested that a bacterial infection resulted in transformation of Ngrol genes early in the evolution of the genus Nicotiana. Although the corresponding four rol genes in TL-DNA provoked hairy-root syndrome in plants, present-day N. glauca and plants transformed with Ngrol genes did not exhibit this phenotype. Sequenced complementation analysis revealed that the NgrolB gene did not induce adventitious roots because it contained two point mutations. Single-base site-directed mutagenesis at these two positions restored the capacity for root induction to the NgrolB gene. When the NgrolB, with these two base substitutions, was positioned under the control of the cauliflower mosaic virus 35S promoter (P35S), transgenic tobacco plants exhibited morphological abnormalities that were not observed in P35s-RirolB plants. In contrast, the activity of the NgrolC gene may have been conserved after an ancient infection by bacteria. Discussed is the effect of the horizontal gene transfer of the Ngrol genes and mutations in the NgrolB gene on the phenotype of ancient plants during the evolution of N. glauca.
Resumo:
We have developed high-density DNA microarrays of yeast ORFs. These microarrays can monitor hybridization to ORFs for applications such as quantitative differential gene expression analysis and screening for sequence polymorphisms. Automated scripts retrieved sequence information from public databases to locate predicted ORFs and select appropriate primers for amplification. The primers were used to amplify yeast ORFs in 96-well plates, and the resulting products were arrayed using an automated micro arraying device. Arrays containing up to 2,479 yeast ORFs were printed on a single slide. The hybridization of fluorescently labeled samples to the array were detected and quantitated with a laser confocal scanning microscope. Applications of the microarrays are shown for genetic and gene expression analysis at the whole genome level.
Resumo:
A multiple protein–DNA complex formed at a human α-globin locus-specific regulatory element, HS-40, confers appropriate developmental expression pattern on human embryonic ζ-globin promoter activity in humans and transgenic mice. We show here that introduction of a 1-bp mutation in an NF-E2/AP1 sequence motif converts HS-40 into an erythroid-specific locus-control region. Cis-linkage with this locus-control region, in contrast to the wild-type HS-40, allows erythroid lineage-specific derepression of the silenced human ζ-globin promoter in fetal and adult transgenic mice. Furthermore, ζ-globin promoter activities in adult mice increase in proportion to the number of integrated DNA fragments even at 19 copies/genome. The mutant HS-40 in conjunction with human ζ-globin promoter thus can be used to direct position-independent and copy number-dependent expression of transgenes in adult erythroid cells. The data also supports a model in which competitive DNA binding of different members of the NF-E2/AP1 transcription factor family modulates the developmental stage specificity of an erythroid enhancer. Feasibility to reswitch on embryonic/fetal globin genes through the manipulation of nuclear factor binding at a single regulatory DNA motif is discussed.