980 results for genome structure
Abstract:
Transcription enhancer factor 1 is essential for cardiac, skeletal, and smooth muscle development and uses its N-terminal TEA domain (TEAD) to bind M-CAT elements. Here, we present the first structure of TEAD and show that it is a three-helix bundle with a homeodomain fold. Structural data reveal how TEAD binds DNA. Using structure-function correlations, we find that the L1 loop is essential for cooperative loading of TEAD molecules onto tandemly duplicated M-CAT sites. Furthermore, using a microarray chip-based assay, we establish that known binding sites of the full-length protein are only a subset of DNA elements recognized by TEAD. Our results provide a model for understanding the regulation of genome-wide gene expression during development by the TEA/ATTS family of transcription factors.
Abstract:
Cytoplasmic polyhedrosis virus (CPV) is unique within the Reoviridae family in having a turreted single-layer capsid contained within polyhedrin inclusion bodies, yet being fully capable of cell entry and endogenous RNA transcription. Biochemical data have shown that the amino-terminal 79 residues of the CPV turret protein (TP) are sufficient to bring CPV or engineered proteins into the polyhedrin matrix for micro-encapsulation. Here we report the three-dimensional structure of CPV at 3.88 Å resolution using single-particle cryo-electron microscopy. Our map clearly shows the turns and deep grooves of alpha-helices, the strand separation in beta-sheets, and densities for loops and many bulky side chains, thus permitting atomic model building from cryo-electron microscopy maps. We observed a helix-to-beta-hairpin conformational change between the two conformational states of the capsid shell protein in the region directly interacting with genomic RNA. We have also discovered a messenger RNA release hole coupled with the mRNA capping machinery unique to CPV. Furthermore, we have identified the polyhedrin-binding domain, a structure that has potential in nanobiotechnology applications.
Abstract:
Tumor necrosis factor receptor p75/80 (TNF-R p75/80) is a 75 kDa type 1 transmembrane protein expressed predominately on cells of hematopoietic lineage. TNF-R p75/80 belongs to the TNF receptor superfamily characterized by cysteine-rich extracellular regions composed of three to six disulfide-linked domains. In the present report, we have characterized, for the first time, the complete gene structure for human TNF-R p75/80, which spans approximately 43 kbp. The gene consists of 10 exons (ranging from 34 bp to 2.5 kbp) and 9 introns (343 bp to 19 kbp). Consensus elements for transcription factors involved in T cell development and activation were noted in the 5′ flanking region including TCF-1, Ikaros, AP-1, CK-2, IL-6RE, ISRE, GAS, NF-κB and SP1, as well as an unusually high GC content and CpG frequency that appears characteristic of some TNF-R family members. The unusual (GATA)n and (GAA)(GGA) repeats found within intron 1 may prove useful for further genome analysis within the 1p36 chromosomal locus. The human TNF-R p75/80 gene structure will permit further assessment of its involvement in normal hematopoietic cell development and function, autoimmune disease, and non-random translocations in hematopoietic malignancies. The region 1.8 kb 5′ of the ATG was able to drive luciferase expression when transfected into cell lines expressing TNF-R p75/80. Further characterization of the 5′-regulatory region will aid in determining factors and signal transduction pathways involved in regulating TNF-R p75/80 expression.
Abstract:
Mitochondria cannot form de novo but require mechanisms allowing their inheritance to daughter cells. In contrast to most other eukaryotes Trypanosoma brucei has a single mitochondrion whose single-unit genome is physically connected to the flagellum. Here we identify a β-barrel mitochondrial outer membrane protein, termed tripartite attachment complex 40 (TAC40), that localizes to this connection. TAC40 is essential for mitochondrial DNA inheritance and belongs to the mitochondrial porin protein family. However, it is not specifically related to any of the three subclasses of mitochondrial porins represented by the metabolite transporter voltage-dependent anion channel (VDAC), the protein translocator of the outer membrane 40 (TOM40), or the fungi-specific MDM10, a component of the endoplasmic reticulum–mitochondria encounter structure (ERMES). MDM10 and TAC40 mediate cellular architecture and participate in transmembrane complexes that are essential for mitochondrial DNA inheritance. In yeast MDM10, in the context of the ERMES, is postulated to connect the mitochondrial genomes to actin filaments, whereas in trypanosomes TAC40 mediates the linkage of the mitochondrial DNA to the basal body of the flagellum. However, TAC40 does not colocalize with trypanosomal orthologs of ERMES components and, unlike MDM10, it regulates neither mitochondrial morphology nor the assembly of the protein translocase. TAC40 therefore defines a novel subclass of mitochondrial porins that is distinct from VDAC, TOM40, and MDM10. However, whereas the architecture of the TAC40-containing complex in trypanosomes and the MDM10-containing ERMES in yeast is very different, both are organized around a β-barrel protein of the mitochondrial porin family that mediates a DNA–cytoskeleton linkage that is essential for mitochondrial DNA inheritance.
Abstract:
TbRRM1 of Trypanosoma brucei is a nucleoprotein that was previously identified in a search for splicing factors in T. brucei. We show that TbRRM1 associates with mRNAs and with the auxiliary splicing factor polypyrimidine tract-binding protein 2, but not with components of the core spliceosome. TbRRM1 also interacts with several retrotransposon hot spot (RHS) proteins and histones. RNA immunoprecipitation of a tagged form of TbRRM1 from procyclic (insect) form trypanosomes identified ca. 1,500 transcripts that were enriched and 3,000 transcripts that were underrepresented compared to cellular mRNA. Enriched transcripts encoded RNA-binding proteins, including TbRRM1 itself, several RHS transcripts, mRNAs with long coding regions, and a high proportion of stage-regulated mRNAs that are more highly expressed in bloodstream forms. Transcripts encoding ribosomal proteins, other factors involved in translation, and procyclic-specific transcripts were underrepresented. Knockdown of TbRRM1 by RNA interference caused widespread changes in mRNA abundance, but these changes did not correlate with the binding of the protein to transcripts, and most splice sites were unchanged, negating a general role for TbRRM1 in splice site selection. When changes in mRNA abundance were mapped across the genome, regions with many downregulated mRNAs were identified. Two regions were analyzed by chromatin immunoprecipitation, both of which exhibited increases in nucleosome occupancy upon TbRRM1 depletion. In addition, subjecting cells to heat shock resulted in translocation of TbRRM1 to the cytoplasm and compaction of chromatin, consistent with a second role for TbRRM1 in modulating chromatin structure. IMPORTANCE: Trypanosoma brucei, the parasite that causes human sleeping sickness, is transmitted by tsetse flies. The parasite progresses through different life cycle stages in its two hosts, altering its pattern of gene expression in the process. 
In trypanosomes, protein-coding genes are organized as polycistronic units that are processed into monocistronic mRNAs. Since genes in the same unit can be regulated independently of each other, it is believed that gene regulation is essentially posttranscriptional. In this study, we investigated the role of a nuclear RNA-binding protein, TbRRM1, in the insect stage of the parasite. We found that TbRRM1 binds nuclear mRNAs and also affects chromatin status. Reduction of nuclear TbRRM1 by RNA interference or heat shock resulted in chromatin compaction. We propose that TbRRM1 regulates RNA polymerase II-driven gene expression both cotranscriptionally, by facilitating transcription and efficient splicing, and posttranscriptionally, via its interaction with nuclear mRNAs.
Abstract:
Classical swine fever virus (CSFV) causes a highly contagious disease in pigs that can range from a severe haemorrhagic fever to a nearly unapparent disease, depending on the virulence of the virus strain. Little is known about the viral molecular determinants of CSFV virulence. The nonstructural protein NS4B is essential for viral replication. However, the roles of CSFV NS4B in viral genome replication and pathogenesis have not yet been elucidated. NS4B of the GPE- vaccine strain and of the highly virulent Eystrup strain differ by a total of seven amino acid residues, two of which are located in the predicted trans-membrane domains of NS4B and were described previously to relate to virulence, and five residues clustering in the N-terminal part. In the present study, we examined the potential role of these five amino acids in modulating genome replication and determining pathogenicity in pigs. A chimeric low virulent GPE- -derived virus carrying the complete Eystrup NS4B showed enhanced pathogenicity in pigs. The in vitro replication efficiency of the NS4B chimeric GPE- replicon was significantly higher than that of the replicon carrying only the two Eystrup-specific amino acids in NS4B. In silico and in vitro data suggest that the N-terminal part of NS4B forms an amphipathic α-helix structure. The N-terminal NS4B with these five amino acid residues is associated with the intracellular membranes. Taken together, this is the first gain-of-function study showing that the N-terminal domain of NS4B can determine CSFV genome replication in cell culture and viral pathogenicity in pigs.
Abstract:
Trypanosomes show an intriguing organization of their mitochondrial DNA into a catenated network, the kinetoplast DNA (kDNA). While more than 30 proteins involved in kDNA replication have been described, only few components of kDNA segregation machinery are currently known. Electron microscopy studies identified a high-order structure, the tripartite attachment complex (TAC), linking the basal body of the flagellum via the mitochondrial membranes to the kDNA. Here we describe TAC102, a novel core component of the TAC, which is essential for proper kDNA segregation during cell division. Loss of TAC102 leads to mitochondrial genome missegregation but has no impact on proper organelle biogenesis and segregation. The protein is present throughout the cell cycle and is assembled into the newly developing TAC only after the pro-basal body has matured indicating a hierarchy in the assembly process. Furthermore, we provide evidence that the TAC is replicated de novo rather than using a semi-conservative mechanism. Lastly, we demonstrate that TAC102 lacks an N-terminal mitochondrial targeting sequence and requires sequences in the C-terminal part of the protein for its proper localization.
Abstract:
The intergenic spacer (IGS) region of the ribosomal DNA was cloned and sequenced in eight species within the Gibberella fujikuroi species complex with anamorphs in the genus Fusarium, a group that includes the most relevant toxigenic species. DNA sequence analyses revealed two categories of repeated elements: long repeats and short repeats of 125 and 8 bp, respectively. Long repeats were present in two copies and were conserved in all the species analyzed, whereas different numbers of short repeat elements were observed, leading to species-specific IGS sequences of different lengths. In Fusarium subglutinans and Fusarium nygamai, these differences seemed to be the result of duplication and deletion events. Here, we propose a model based on unequal crossing over that can explain these processes. The partial IGS sequence of 22 Fusarium proliferatum isolates was also obtained to study variation at the intraspecific level. The results revealed no differences in terms of number or pattern of repeated elements and detected frequent gene conversion events. These results suggest that the homogenization observed at the intraspecific level might not be achieved primarily by unequal crossing-over events but rather by processes associated with recombination such as gene conversion events.
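How unequal crossing over can yield both a duplication and a deletion of short repeats may be easier to see in a toy example. The sketch below is only a generic illustration of the mechanism, not the specific model proposed in the abstract; the function name `unequal_crossover`, the repeat labels, and the breakpoints are hypothetical.

```python
def unequal_crossover(array_a, array_b, break_a, break_b):
    """Swap the tails of two repeat arrays at (possibly different) positions.

    Hypothetical helper: each list element stands for one short repeat unit.
    When break_a != break_b the arrays are misaligned, so one product gains
    repeats (duplication) and the other loses them (deletion).
    """
    recombinant_1 = array_a[:break_a] + array_b[break_b:]
    recombinant_2 = array_b[:break_b] + array_a[break_a:]
    return recombinant_1, recombinant_2


# Two homologous arrays, each carrying five copies of a short repeat.
array_a = [f"a{i}" for i in range(1, 6)]
array_b = [f"b{i}" for i in range(1, 6)]

# Misaligned exchange: break after repeat 4 in A and after repeat 2 in B.
longer, shorter = unequal_crossover(array_a, array_b, 4, 2)
print(len(longer), longer)
print(len(shorter), shorter)
```

The total number of repeat units is conserved across the two products; only their distribution changes, which is why a single event produces one longer and one shorter array.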
Abstract:
Integration of transgenic DNA into the plant genome was investigated in 13 transgenic oat (Avena sativa L.) lines produced using microprojectile bombardment with one or two cotransformed plasmids. In all transformation events, the transgenic DNA integrated into the plant genome consisted of intact transgene copies that were accompanied by multiple, rearranged, and/or truncated transgene fragments. All fragments of transgenic DNA cosegregated, indicating that they were integrated at single gene loci. Analysis of the structure of the transgenic loci indicated that the transgenic DNA was interspersed with host genomic DNA. The number of insertions of transgenic DNA within the transgene loci varied from 2 to 12 among the 13 lines. Restriction endonucleases that do not cleave the introduced plasmids produced restriction fragments ranging from 3.6 to about 60 kb in length hybridizing to a probe comprising the introduced plasmids. Although the size of the interspersing host DNA within the transgene locus is unknown, the sizes of the transgene-hybridizing restriction fragments indicated that the entire transgene locus must be at least 35–280 kb. The observation that all transgenic lines analyzed exhibited genomic interspersion of multiple clustered transgenes suggests a predominating integration mechanism. We propose that transgene integration at multiple clustered DNA replication forks could account for the observed interspersion of transgenic DNA with host genomic DNA within transgenic loci.
Abstract:
The RNA phage Qβ requires an RNA-binding protein, called Qβ host factor or Hfq protein, to replicate its genome. Our previous results suggested that this protein mediates the access of replicase to the 3′-end of the Qβ plus strand RNA. Here we report the results of an evolutionary experiment in which phage Qβ was adapted to an Escherichia coli Q13 host strain with an inactivated host factor (hfq) gene. This strain initially produced phage at a titer ≈10,000-fold lower than the wild-type strain and with minute plaque morphology, but after 12 growth cycles, phage titer and plaque size had evolved to levels near those of the wild-type host. RNAs isolated from adapted Qβ mutants were efficient templates for replicase without host factor in vitro. Electron microscopy showed that mutant RNAs, in contrast to wild-type RNA, efficiently interacted with replicase at the 3′-end in the absence of host factor. The same set of four mutations in the 3′-terminal third of the genome was found in several independently evolved phage clones. One mutation disrupts the base pairing of the 3′-terminal CCC-OH sequence, suggesting that the host factor stimulates activity of the wild-type RNA template by melting out its 3′-end.
Abstract:
A crucial step in exploiting the information inherent in genome sequences is to assign to each protein sequence its three-dimensional fold and biological function. Here we describe fold assignment for the proteins encoded by the small genome of Mycoplasma genitalium. The assignment was carried out by our computer server (http://www.doe-mbi.ucla.edu/people/frsvr/frsvr.html), which assigns folds to amino acid sequences by comparing sequence-derived predictions with known structures. Of the total of 468 protein ORFs, 103 (22%) can be assigned a known protein fold with high confidence, as cross-validated with tests on known structures. Of these sequences, 75 (16%) show enough sequence similarity to proteins of known structure that they can also be detected by traditional sequence–sequence comparison methods. That is, the remaining 28 sequences (6%) are assignable by the sequence–structure method of the server but not by current sequence–sequence methods. Of the remaining 78% of sequences in the genome, 18% belong to membrane proteins and the remaining 60% cannot be assigned either because these sequences correspond to no presently known fold or because of insensitivity of the method. At the current rate of determination of new folds by x-ray and NMR methods, extrapolation suggests that folds will be assigned to most soluble proteins in the next decade.
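The counts and percentages quoted in this abstract can be cross-checked with a few lines of arithmetic. The sketch below uses only the numbers stated above (468 ORFs, 103 assigned, 75 detectable by sequence comparison); the variable and function names are illustrative and are not taken from the authors' server.

```python
# Counts quoted in the abstract above.
TOTAL_ORFS = 468
ASSIGNED_HIGH_CONFIDENCE = 103
DETECTABLE_BY_SEQUENCE = 75


def percent(count, total=TOTAL_ORFS):
    """Return count as a whole-number percentage of total."""
    return round(100 * count / total)


# ORFs assignable only by the sequence-structure method.
structure_only = ASSIGNED_HIGH_CONFIDENCE - DETECTABLE_BY_SEQUENCE

summary = {
    "assigned": percent(ASSIGNED_HIGH_CONFIDENCE),
    "sequence_detectable": percent(DETECTABLE_BY_SEQUENCE),
    "structure_only": percent(structure_only),
    "unassigned": 100 - percent(ASSIGNED_HIGH_CONFIDENCE),
}
print(structure_only, summary)
```

The rounded values agree with the abstract: 22% assigned, 16% detectable by sequence comparison, 6% found only by the sequence-structure method, and 78% left over, which the abstract splits into 18% membrane proteins and 60% unassignable.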
Abstract:
Chromosomal forms of Anopheles gambiae, given the informal designations Bamako, Mopti, and Savannah, have been recognized by the presence or absence of four paracentric inversions on chromosome 2. Studies of karyotype frequencies at sites where the forms occur in sympatry have led to the suggestion that these forms represent species. We conducted a study of the genetic structure of populations of An. gambiae from two villages in Mali, west Africa. Populations at each site were composed of the Bamako and Mopti forms and the sibling species, Anopheles arabiensis. Karyotypes were determined for each individual mosquito and genotypes at 21 microsatellite loci determined. A number of the microsatellites have been physically mapped to polytene chromosomes, making it possible to select loci based on their position relative to the inversions used to define forms. We found that the chromosomal forms differ at all loci on chromosome 2, but there were few differences for loci on other chromosomes. Geographic variation was small. Gene flow appears to vary among different regions within the genome, being lowest on chromosome 2, probably due to hitchhiking with the inversions. We conclude that the majority of observed genetic divergence between chromosomal forms can be explained by forces that need not involve reproductive isolation, although reproductive isolation is not ruled out. We found low levels of gene flow between the sibling species Anopheles gambiae and Anopheles arabiensis, similar to estimates based on observed frequencies of hybrid karyotypes in natural populations.
Abstract:
Submillimolar levels of calcium, similar to the physiological total (bound + free) intranuclear concentration (0.01–1 mM), induced a conformational change within d(TG/AC)n, one of the frequent dinucleotide repeats of the mammalian genome. This change is calcium-specific, because no other tested cation induced it, and it was detected as a concentration-dependent transition from B-DNA to a non-B-DNA conformation expanding from the 3′ end toward the 5′ end of the repeat. Genomic footprinting of various rat brain regions revealed the existence of a similar non-B-DNA conformation within a d(TG/AC)28 repeat of the endogenous enkephalin gene only in the enkephalin-expressing caudate nucleus and not in the nonexpressing thalamus. Binding assays demonstrated that DNA can bind calcium and can compete with calmodulin for calcium.
Abstract:
Hepatitis C virus (HCV) helicase, non-structural protein 3 (NS3), is proposed to aid in HCV genome replication and is considered a target for inhibition of HCV. In order to investigate the substrate requirements for nucleic acid unwinding by NS3, substrates were prepared by annealing a 30mer oligonucleotide to a 15mer. The resulting 15 bp duplex contained a single-stranded DNA overhang of 15 nt referred to as the bound strand. Other substrates were prepared in which the 15mer DNA was replaced by a strand of peptide nucleic acid (PNA). The PNA–DNA substrate was unwound by NS3, but the observed rate of strand separation was at least 25-fold slower than for the equivalent DNA–DNA substrate. Binding of NS3 to the PNA–DNA substrate was similar to the DNA–DNA substrate, due to the fact that NS3 initially binds to the single-stranded overhang, which was identical in each substrate. A PNA–RNA substrate was not unwound by NS3 under similar conditions. In contrast, morpholino–DNA and phosphorothioate–DNA substrates were utilized as efficiently by NS3 as DNA–DNA substrates. These results indicate that the PNA–DNA and PNA–RNA heteroduplexes adopt structures that are unfavorable for unwinding by NS3, suggesting that the unwinding activity of NS3 is sensitive to the structure of the duplex.
Abstract:
DNMT2 is a human protein that displays strong sequence similarities to DNA (cytosine-5)-methyltransferases (m5C MTases) of both prokaryotes and eukaryotes. DNMT2 contains all 10 sequence motifs that are conserved among m5C MTases, including the consensus S-adenosyl-l-methionine-binding motifs and the active site ProCys dipeptide. DNMT2 has close homologs in plants, insects and Schizosaccharomyces pombe, but no related sequence can be found in the genomes of Saccharomyces cerevisiae or Caenorhabditis elegans. The crystal structure of a deletion mutant of DNMT2 complexed with S-adenosyl-l-homocysteine (AdoHcy) has been determined at 1.8 Å resolution. The structure of the large domain that contains the sequence motifs involved in catalysis is remarkably similar to that of M.HhaI, a confirmed bacterial m5C MTase, and the smaller target recognition domains of DNMT2 and M.HhaI are also closely related in overall structure. The small domain of DNMT2 contains three short helices that are not present in M.HhaI. DNMT2 binds AdoHcy in the same conformation as confirmed m5C MTases and, while DNMT2 shares all sequence and structural features with m5C MTases, it has failed to demonstrate detectable transmethylase activity. We show here that homologs of DNMT2, which are present in some organisms that are not known to methylate their genomes, contain a specific target-recognizing sequence motif including an invariant CysPheThr tripeptide. DNMT2 binds DNA to form a denaturant-resistant complex in vitro. While the biological function of DNMT2 is not yet known, the strong binding to DNA suggests that DNMT2 may mark specific sequences in the genome by binding to DNA through the specific target-recognizing motif.