966 resultados para gene structure


Relevância:

30.00% 30.00%

Publicador:

Resumo:

Structure-function analysis of human Integrator subunit 4 Anupama Sataluri Advisor: Eric. J. Wagner, Ph.D. Uridine-rich small nuclear RNAs (U snRNA) are RNA Polymerase-II (RNAPII) transcripts that are ubiquitously expressed and are known to be essential for gene expression. snRNAs play a key role in mRNA splicing and in histone mRNA expression. Inaccurate snRNA biosynthesis can lead to diseases related to defective splicing and histone mRNA expression. Although the 3′ end formation mechanism and processing machinery of other RNAPII transcripts such as mRNA has been well studied, the mechanism of snRNA 3′ end processing has remained a mystery until the recent discovery of the machinery that mediates this process. In 2005, a complex of 14 subunits (the Integrator complex) associated with RNA Polymerase-II was discovered. The 14subunits were annotated Integrator 1-14 based on their size. The subunits of this complex together were found to facilitate 3′ end processing of snRNA. Identification of the Integrator complex propelled research in the direction of understanding the events of snRNA 3’end processing. Recent studies from our lab confirmed that Integrator subunit (IntS) 9 and 11 together perform the endonucleolytic cleavage of the nascent snRNA 3′ end to generate mature snRNA. However, the role of other members of the Integrator complex remains elusive. Current research in our lab is focused on deciphering the role of each subunit within the Integrator complex This work specifically focuses on elucidating the role of human Integrator subunit 4 (IntS4) and understanding how it facilitates the overall function of the complex. IntS4 has structural similarity with a protein called “Symplekin”, which is part of the mRNA 3’end processing machinery. Symplekin has been thoroughly researched in recent years and structure-function correlation studies in the context of mRNA 3’end processing have reported a scaffold function for Symplekin due to the presence of HEAT repeat motifs in its N-terminus. Based upon the structural similarity between IntS4 and Symplekin, we hypothesized that Integrator subunit 4 may be behaving as a Symplekin-like scaffold molecule that facilitates the interaction between other members of the Integrator Complex. To answer this question, the two important goals of this study were to: 1) identify the region of IntS4, which is important for snRNA 3′ end processing and 2) determine binding partners of IntS4 which promote its function as a scaffold. IntS4 structurally consists of a highly conserved N-terminus with 8 HEAT repeats, followed by a nonconserved C- terminus. A series of siRNA resistant N and C-terminus deletion constructs as well as specific point mutants within its N-terminal HEAT repeats were generated for human IntS4 and, utilizing a snRNA transcriptional readthrough GFP-reporter assay, we tested their ability to rescue misprocessing. This assay revealed a possible scaffold like property of IntS4. To probe IntS4 for interaction partners, we performed co-immunoprecipitation on nuclear extracts of IntS4 expressing stable cell lines and identified IntS3 and IntS5 among other Integrator subunits to be binding partners which facilitate the scaffold like function of hIntS4. These findings have established a critical role for IntS4 in snRNA 3′ end processing, identified that both its N and C termini are essential for its function, and mapped putative interaction domains with other Integrator subunits.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Complex diseases such as cancer result from multiple genetic changes and environmental exposures. Due to the rapid development of genotyping and sequencing technologies, we are now able to more accurately assess causal effects of many genetic and environmental factors. Genome-wide association studies have been able to localize many causal genetic variants predisposing to certain diseases. However, these studies only explain a small portion of variations in the heritability of diseases. More advanced statistical models are urgently needed to identify and characterize some additional genetic and environmental factors and their interactions, which will enable us to better understand the causes of complex diseases. In the past decade, thanks to the increasing computational capabilities and novel statistical developments, Bayesian methods have been widely applied in the genetics/genomics researches and demonstrating superiority over some regular approaches in certain research areas. Gene-environment and gene-gene interaction studies are among the areas where Bayesian methods may fully exert its functionalities and advantages. This dissertation focuses on developing new Bayesian statistical methods for data analysis with complex gene-environment and gene-gene interactions, as well as extending some existing methods for gene-environment interactions to other related areas. It includes three sections: (1) Deriving the Bayesian variable selection framework for the hierarchical gene-environment and gene-gene interactions; (2) Developing the Bayesian Natural and Orthogonal Interaction (NOIA) models for gene-environment interactions; and (3) extending the applications of two Bayesian statistical methods which were developed for gene-environment interaction studies, to other related types of studies such as adaptive borrowing historical data. We propose a Bayesian hierarchical mixture model framework that allows us to investigate the genetic and environmental effects, gene by gene interactions (epistasis) and gene by environment interactions in the same model. It is well known that, in many practical situations, there exists a natural hierarchical structure between the main effects and interactions in the linear model. Here we propose a model that incorporates this hierarchical structure into the Bayesian mixture model, such that the irrelevant interaction effects can be removed more efficiently, resulting in more robust, parsimonious and powerful models. We evaluate both of the 'strong hierarchical' and 'weak hierarchical' models, which specify that both or one of the main effects between interacting factors must be present for the interactions to be included in the model. The extensive simulation results show that the proposed strong and weak hierarchical mixture models control the proportion of false positive discoveries and yield a powerful approach to identify the predisposing main effects and interactions in the studies with complex gene-environment and gene-gene interactions. We also compare these two models with the 'independent' model that does not impose this hierarchical constraint and observe their superior performances in most of the considered situations. The proposed models are implemented in the real data analysis of gene and environment interactions in the cases of lung cancer and cutaneous melanoma case-control studies. The Bayesian statistical models enjoy the properties of being allowed to incorporate useful prior information in the modeling process. Moreover, the Bayesian mixture model outperforms the multivariate logistic model in terms of the performances on the parameter estimation and variable selection in most cases. Our proposed models hold the hierarchical constraints, that further improve the Bayesian mixture model by reducing the proportion of false positive findings among the identified interactions and successfully identifying the reported associations. This is practically appealing for the study of investigating the causal factors from a moderate number of candidate genetic and environmental factors along with a relatively large number of interactions. The natural and orthogonal interaction (NOIA) models of genetic effects have previously been developed to provide an analysis framework, by which the estimates of effects for a quantitative trait are statistically orthogonal regardless of the existence of Hardy-Weinberg Equilibrium (HWE) within loci. Ma et al. (2012) recently developed a NOIA model for the gene-environment interaction studies and have shown the advantages of using the model for detecting the true main effects and interactions, compared with the usual functional model. In this project, we propose a novel Bayesian statistical model that combines the Bayesian hierarchical mixture model with the NOIA statistical model and the usual functional model. The proposed Bayesian NOIA model demonstrates more power at detecting the non-null effects with higher marginal posterior probabilities. Also, we review two Bayesian statistical models (Bayesian empirical shrinkage-type estimator and Bayesian model averaging), which were developed for the gene-environment interaction studies. Inspired by these Bayesian models, we develop two novel statistical methods that are able to handle the related problems such as borrowing data from historical studies. The proposed methods are analogous to the methods for the gene-environment interactions on behalf of the success on balancing the statistical efficiency and bias in a unified model. By extensive simulation studies, we compare the operating characteristics of the proposed models with the existing models including the hierarchical meta-analysis model. The results show that the proposed approaches adaptively borrow the historical data in a data-driven way. These novel models may have a broad range of statistical applications in both of genetic/genomic and clinical studies.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Phosphatidylserine decarboxylase of E. coli, a cytoplasmic membrane protein, catalyzes the formation of phosphatidylethanolamine, the principal phospholipid of the organism. The activity of the enzyme is dependent on a covalently bound pyruvate (Satre and Kennedy (1978) J. Biol. Chem. 253, 479-483). This study shows that the enzyme consists of two nonidentical subunits, $\alpha$ (Mr = 7,332) and $\beta$ (Mr = 28,579), with the pyruvate prosthetic group in amide linkage to the amino-terminus of the $\alpha$ subunit. Partial protein sequence and DNA sequence analysis reveal that the two subunits are derived from a proenzyme ($\pi$ subunit, Mr = 35,893) through a post-translational event. During the conversion of the proenzyme to the $\alpha$ and $\beta$ subunits, the peptide bond between Gly253-Ser254 is cleaved, and Ser254 is converted to the pyruvate prosthetic group at the amino-terminus of the $\alpha$ subunit (Li and Dowhan (1988) J. Biol. Chem. 263, 11516-11522).^ The proenzyme cannot be detected in cells carrying either single or multiple copies of the gene (psd), but can be observed in a T7 RNA polymerase/promoter and transcription-translation system. The cleavage of the wild-type proenzyme occurs rapidly with a half-time on the order of 2 min. Changing of the Ser254 to cysteine (S254C) or threonine (S254T) slows the cleavage rate dramatically and results in mutants with a half-time for processing of around 2-4 h. Change of the Ser254 to alanine (S254A) blocks the cleavage of the proenzyme. The reduced processing rate with the mutations of the proenzyme is consistent with less of the functional enzyme being made. Mutants S254C and S254T produce $\sim$15% and $\sim$1%, respectively, of the activity of the wild-type allele, but can still complement a temperature-sensitive mutant of the psd locus. Neither detectable activity nor complementation is observed by mutant S254A. These results are consistent with the hydroxyl-group of the Ser254 playing a critical role in the cleavage of the peptide bond Gly253-Ser254 of the pro-phosphatidylserine decarboxylase, and support the mechanism proposed by Snell and co-workers (Recsei and Snell (1984) Annu. Rev. Biochem. 53, 357-387) for the formation of the prosthetic group of pyruvate-dependent decarboxylases. ^

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The effect of DNA cytosine methylation on H-ras promoter activity was assessed using a transient expression system employing the plasmid H-rasCAT (NaeI H-ras promoter linked to the chloramphenicol acetyltransferase (CAT) gene). This 551 bp promoter is 80% GC rich, enriched with 168 CpG dinucleotides, and contains six functional GC box elements which represent major DNA methylation target sites. Prokaryotic methyltransferases HhaI (CGm$\sp5$CG) and HpaII (Cm$\sp5$CGG) alone or in combination with a human placental methyltransferase (HP MTase) were used to introduce methyl groups at different CpG sites within the promoter. To test for functional promoter activity, the methylated plasmids were introduced into CV-1 cells and CAT activity assessed 48 h post-transfection. Methylation at specific HhaI and HpaII sites reduced CAT expression by 70%, whereas more extensive methylation at generalized CpG sites with HP MTase inactivated the promoter $>$95%. The inhibition of H-ras promoter activity was not attributable to methylation-induced differences in DNA uptake or stability in the cell, topological form of the plasmid, or methylation effects in nonpromoter regions. We also observed that DNA cytosine methylation of a 360 bp promoter fragment by HP MTase induced a local change in DNA conformation. Using three independent methodologies (nitrocellulose filter binding assays, gel mobility shifts, and Southwestern blots), we determined that this change in promoter conformation affected the interaction of nuclear proteins with cis-regulatory sequences residing in the promoter region. The results provide evidence to suggest that DNA methylation may regulate gene expression by inducing changes in local promoter conformation which in turn alters the interactions between DNA and protein factors required for transcription. The results provide supportive evidence for the hypothesis of Cedar and Riggs, who postulated that DNA methylation may regulate gene expression by altering the binding affinities of proteins for DNA. ^

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Cardiovascular disease (CVD) is the leading cause of death in the United States. One manifestation of CVD known to increase mortality is an enlarged, or hypertrophic heart. Hypertrophic cardiomyocytes adapt to increased contractile demand at the genetic level with a re-emergence of the fetal gene program and a downregulation of fatty acid oxidation genes with concomitant increased reliance on glucose-based metabolism. To understand the transcriptional regulatory pathways that implement hypertrophic directives we analyzed the upstream promoter region of the muscle specific isoform of the nuclear-encoded mitochondrial gene, carnitine palmitoyltransferase-1β (CPT-1β) in cultured rat neonatal cardiac myocytes. This enzyme catalyzes the rate-limiting step of fatty acid entry into β-oxidation and is downregulated in cardiac hypertrophy and failure, making it an attractive model for the study of hypertrophic gene regulation and metabolic adaptations. We demonstrate that the muscle-enriched transcription factors GATA-4 and SRF synergistically activate CPT-1β; moreover, DNA binding to cognate sites and intact protein structure are required. This mechanism coordinates upregulation of energy generating processes with activation of the energy consuming contractile promoter for cardiac α-actin. We hypothesized that fatty acid or glucose responsive transcription factors may also regulate CPT-1β. Oleate weakly stimulates CPT-1β activity; in contrast, the glucose responsive Upstream Stimulatory Factors (USF) dramatically depresses the CPT-1β reporter. USF regulates CPT-1β through a novel physical interaction with the cofactor PGC-1 and abrogation of MEF2A/PGC-1 synergistic stimulation. In this way, USF can inversely regulate metabolic gene programs and may play a role in the shift of metabolic substrate preference seen in hypertrophy. Failing hearts have elevated expression of the nuclear hormone receptor COUP-TF. We report that COUP-TF significantly suppresses reporter transcription independent of DNA binding and specific interactions with GATA-4, Nkx2.5 or USF. In summary, CPT-1β transcriptional regulation integrates mitochondrial gene expression with two essential cardiac functions: contraction and metabolic substrate oxidation. ^

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Lichens are symbioses between fungi (mycobionts) and photoautotrophic green algae or cyanobacteria (photobionts). Many lichens occupy large distributional ranges covering several climatic zones. So far, little is known about the large-scale phylogeography of lichen photobionts and their role in shaping the distributional ranges of lichens. We studied south polar, temperate and north polar populations of the widely distributed fruticose lichen Cetraria aculeata. Based on the DNA sequences from three loci for each symbiont, we compared the genetic structure of mycobionts and photobionts. Phylogenetic reconstructions and Bayesian clustering methods divided the mycobiont and photobiont data sets into three groups. An AMOVA shows that the genetic variance of the photobiont is best explained by differentiation between temperate and polar regions and that of the mycobiont by an interaction of climatic and geographical factors. By partialling out the relative contribution of climate, geography and codispersal, we found that the most relevant factors shaping the genetic structure of the photobiont are climate and a history of codispersal. Mycobionts in the temperate region are consistently associated with a specific photobiont lineage. We therefore conclude that a photobiont switch in the past enabled C. aculeata to colonize temperate as well as polar habitats. Rare photobiont switches may increase the geographical range and ecological niche of lichen mycobionts by associating them with locally adapted photobionts in climatically different regions and, together with isolation by distance, may lead to genetic isolation between populations and thus drive the evolution of lichens.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Measures of agro-ecosystems genetic variability are essential to sustain scientific-based actions and policies tending to protect the ecosystem services they provide. To build the genetic variability datum it is necessary to deal with a large number and different types of variables. Molecular marker data is highly dimensional by nature, and frequently additional types of information are obtained, as morphological and physiological traits. This way, genetic variability studies are usually associated with the measurement of several traits on each entity. Multivariate methods are aimed at finding proximities between entities characterized by multiple traits by summarizing information in few synthetic variables. In this work we discuss and illustrate several multivariate methods used for different purposes to build the datum of genetic variability. We include methods applied in studies for exploring the spatial structure of genetic variability and the association of genetic data to other sources of information. Multivariate techniques allow the pursuit of the genetic variability datum, as a unifying notion that merges concepts of type, abundance and distribution of variability at gene level.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The intergenic spacer (IGS) region of the ribosomal DNA was cloned and sequenced in eight species within the Gibberella fujikuroi species complex with anamorphs in the genus Fusarium , a group that includes the most relevant toxigenic species. DNA sequence analyses revealed two categories of repeated elements: long repeats and short repeats of 125 and 8 bp, respectively. Long repeats were present in two copies and were conserved in all the species analyzed, whereas different numbers of short repeat elements were observed, leading to species-specific IGS sequences with different length. In Fusarium subglutinans and Fusarium nygamai , these differences seemed to be the result of duplication and deletion events. Here, we propose a model based on unequal crossing over that can explain these processes. The partial IGS sequence of 22 Fusarium proliferatum isolates was also obtained to study variation at the intraspecific level. The results revealed no differences in terms of number or pattern of repeated elements and detected frequent gene conversion events. These results suggest that the homogenization observed at the intraspecific level might not be achieved primarily by unequal crossing-over events but rather by processes associated with recombination such as gene conversion events.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Synapsins are a family of neuron-specific synaptic vesicle-associated phosphoproteins that have been implicated in synaptogenesis and in the modulation of neurotransmitter release. In mammals, distinct genes for synapsins I and II have been identified, each of which gives rise to two alternatively spliced isoforms. We have now cloned and characterized a third member of the synapsin gene family, synapsin III, from human DNA. Synapsin III gives rise to at least one protein isoform, designated synapsin IIIa, in several mammalian species. Synapsin IIIa is associated with synaptic vesicles, and its expression appears to be neuron-specific. The primary structure of synapsin IIIa conforms to the domain model previously described for the synapsin family, with domains A, C, and E exhibiting the highest degree of conservation. Synapsin IIIa contains a novel domain, termed domain J, located between domains C and E. The similarities among synapsins I, II, and III in domain organization, neuron-specific expression, and subcellular localization suggest a possible role for synapsin III in the regulation of neurotransmitter release and synaptogenesis. The human synapsin III gene is located on chromosome 22q12–13, which has been identified as a possible schizophrenia susceptibility locus. On the basis of this localization and the well established neurobiological roles of the synapsins, synapsin III represents a candidate gene for schizophrenia.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The three-dimensional structure of Corynebacterium 2,5-diketo-d-gluconic acid reductase A (2,5-DKGR A; EC 1.1.1.-), in complex with cofactor NADPH, has been solved by using x-ray crystallographic data to 2.1-Å resolution. This enzyme catalyzes stereospecific reduction of 2,5-diketo-d-gluconate (2,5-DKG) to 2-keto-l-gulonate. Thus the three-dimensional structure has now been solved for a prokaryotic example of the aldo–keto reductase superfamily. The details of the binding of the NADPH cofactor help to explain why 2,5-DKGR exhibits lower binding affinity for cofactor than the related human aldose reductase does. Furthermore, changes in the local loop structure near the cofactor suggest that 2,5-DKGR will not exhibit the biphasic cofactor binding characteristics observed in aldose reductase. Although the crystal structure does not include substrate, the two ordered water molecules present within the substrate-binding pocket are postulated to provide positional landmarks for the substrate 5-keto and 4-hydroxyl groups. The structural basis for several previously described active-site mutants of 2,5-DKGR A is also proposed. Recent research efforts have described a novel approach to the synthesis of l-ascorbate (vitamin C) by using a genetically engineered microorganism that is capable of synthesizing 2,5-DKG from glucose and subsequently is transformed with the gene for 2,5-DKGR. These modifications create a microorganism capable of direct production of 2-keto-l-gulonate from d-glucose, and the gulonate can subsequently be converted into vitamin C. In economic terms, vitamin C is the single most important specialty chemical manufactured in the world. Understanding the structural determinants of specificity, catalysis, and stability for 2,5-DKGR A is of substantial commercial interest.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Familial multiple system tauopathy with presenile dementia (MSTD) is a neurodegenerative disease with an abundant filamentous tau protein pathology. It belongs to the group of familial frontotemporal dementias with Parkinsonism linked to chromosome 17 (FTDP-17), a major class of inherited dementing disorders whose genetic basis is unknown. We now report a G to A transition in the intron following exon 10 of the gene for microtubule-associated protein tau in familial MSTD. The mutation is located at the 3′ neighboring nucleotide of the GT splice-donor site and disrupts a predicted stem-loop structure. We also report an abnormal preponderance of soluble tau protein isoforms with four microtubule-binding repeats over isoforms with three repeats in familial MSTD. This most likely accounts for our previous finding that sarkosyl-insoluble tau protein extracted from the filamentous deposits in familial MSTD consists only of tau isoforms with four repeats. These findings reveal that a departure from the normal ratio of four-repeat to three-repeat tau isoforms leads to the formation of abnormal tau filaments. The results show that dysregulation of tau protein production can cause neurodegeneration and imply that the FTDP-17 gene is the tau gene. This work has major implications for Alzheimer’s disease and other tauopathies.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The BTB domain (also known as the POZ domain) is an evolutionarily conserved protein–protein interaction motif found at the N terminus of 5–10% of C2H2-type zinc-finger transcription factors, as well as in some actin-associated proteins bearing the kelch motif. Many BTB proteins are transcriptional regulators that mediate gene expression through the control of chromatin conformation. In the human promyelocytic leukemia zinc finger (PLZF) protein, the BTB domain has transcriptional repression activity, directs the protein to a nuclear punctate pattern, and interacts with components of the histone deacetylase complex. The association of the PLZF BTB domain with the histone deacetylase complex provides a mechanism of linking the transcription factor with enzymatic activities that regulate chromatin conformation. The crystal structure of the BTB domain of PLZF was determined at 1.9 Å resolution and reveals a tightly intertwined dimer with an extensive hydrophobic interface. Approximately one-quarter of the monomer surface area is involved in the dimer intermolecular contact. These features are typical of obligate homodimers, and we expect the full-length PLZF protein to exist as a branched transcription factor with two C-terminal DNA-binding regions. A surface-exposed groove lined with conserved amino acids is formed at the dimer interface, suggestive of a peptide-binding site. This groove may represent the site of interaction of the PLZF BTB domain with nuclear corepressors or other nuclear proteins.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The structure of complexes made from DNA and suitable lipids (lipoplex, Lx) was examined by cryo-electron microscopy (cryoEM). We observed a distinct concentric ring-like pattern with striated shells when using plasmid DNA. These spherical multilamellar particles have a mean diameter of 254 nm with repetitive spacing of 7.5 nm with striation of 5.3 nm width. Small angle x-ray scattering revealed repetitive ordering of 6.9 nm, suggesting a lamellar structure containing at least 12 layers. This concentric and lamellar structure with different packing regimes also was observed by cryoEM when using linear double-stranded DNA, single-stranded DNA, and oligodeoxynucleotides. DNA chains could be visualized in DNA/lipid complexes. Such specific supramolecular organization is the result of thermodynamic forces, which cause compaction to occur through concentric winding of DNA in a liquid crystalline phase. CryoEM examination of T4 phage DNA packed either in T4 capsides or in lipidic particles showed similar patterns. Small angle x-ray scattering suggested an hexagonal phase in Lx-T4 DNA. Our results indicate that both lamellar and hexagonal phases may coexist in the same Lx preparation or particle and that transition between both phases may depend on equilibrium influenced by type and length of the DNA used.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Fractionation of the abundant small ribonucleoproteins (RNPs) of the trypanosomatid Leptomonas collosoma revealed the existence of a group of unidentified small RNPs that were shown to fractionate differently than the well-characterized trans-spliceosomal RNPs. One of these RNAs, an 80-nt RNA, did not possess a trimethylguanosine (TMG) cap structure but did possess a 5′ phosphate terminus and an invariant consensus U5 snRNA loop 1. The gene coding for the RNA was cloned, and the coding region showed 55% sequence identity to the recently described U5 homologue of Trypanosoma brucei [Dungan, J. D., Watkins, K. P. & Agabian, N. (1996) EMBO J. 15, 4016–4029]. The L. collosoma U5 homologue exists in multiple forms of RNP complexes, a 10S monoparticle, and two subgroups of 18S particles that either contain or lack the U4 and U6 small nuclear RNAs, suggesting the existence of a U4/U6⋅U5 tri-small nuclear RNP complex. In contrast to T. brucei U5 RNA (62 nt), the L. collosoma homologue is longer (80 nt) and possesses a second stem–loop. Like the trypanosome U3, U6, and 7SL RNA genes, a tRNA gene coding for tRNACys was found 98 nt upstream to the U5 gene. A potential for base pair interaction between U5 and SL RNA in the 5′ splice site region (positions −1 and +1) and downstream from it is proposed. The presence of a U5-like RNA in trypanosomes suggests that the most essential small nuclear RNPs are ubiquitous for both cis- and trans-splicing, yet even among the trypanosomatids the U5 RNA is highly divergent.