916 results for Genome-specific Sequence


Relevance:

30.00% 30.00%

Publisher:

Abstract:

Viral genomes are encapsidated within protective protein shells. This encapsidation can be achieved either by a co-condensation reaction of the nucleic acid and coat proteins, or by first forming empty viral particles which are subsequently packaged with nucleic acid, the latter mechanism being typical for many dsDNA bacteriophages. Bacteriophage PRD1 is an icosahedral, non-tailed dsDNA virus that has an internal lipid membrane, the hallmark of the Tectiviridae family. Although PRD1 has been known to assemble empty particles into which the genome is subsequently packaged, the mechanism for this has been unknown, and there has been no evidence for a separate packaging vertex, similar to the portal structures used for packaging in the tailed bacteriophages and herpesviruses. In this study, a unique DNA packaging vertex was identified for PRD1, containing the packaging ATPase P9, packaging factor P6 and two small membrane proteins, P20 and P22, extending the packaging vertex to the internal membrane. Lack of the small membrane protein P20 was shown to totally abolish packaging, making it an essential part of the PRD1 packaging mechanism. The minor capsid protein P6 was shown to be an important packaging factor, its absence leading to greatly reduced packaging efficiency. An in vitro DNA packaging system consisting of recombinant packaging ATPase P9, empty procapsids and mutant PRD1 DNA with a LacZ insert was developed for the analysis of PRD1 packaging, the first such system for a virus containing an internal membrane. A new tectiviral sequence, a linear plasmid called pBClin15, was identified in Bacillus cereus, providing material for sequence analysis of the tectiviruses. Analysis of PRD1 P9 and other putative tectiviral ATPase sequences revealed several conserved sequence motifs, among them a new tectiviral packaging ATPase motif. Mutagenesis studies on PRD1 P9 were used to confirm the significance of the motifs.
P9-type putative ATPase sequences carrying a similar sequence motif were identified in several other membrane containing dsDNA viruses of bacterial, archaeal and eukaryotic hosts, suggesting that these viruses may have similar packaging mechanisms. Interestingly, almost the same set of viruses that were found to have similar putative packaging ATPases had earlier been found to share similar coat protein folds and capsid structures, and a common origin for these viruses had been suggested. The finding in this study of similar packaging proteins further supports the idea that these viruses are descendants of a common ancestor.

Relevance:

30.00% 30.00%

Publisher:

Abstract:

Regulated transcription controls the diversity, developmental pathways and spatial organization of the hundreds of cell types that make up a mammal. Using single-molecule cDNA sequencing, we mapped transcription start sites (TSSs) and their usage in human and mouse primary cells, cell lines and tissues to produce a comprehensive overview of mammalian gene expression across the human body. We find that few genes are truly 'housekeeping', whereas many mammalian promoters are composite entities composed of several closely separated TSSs, with independent cell-type-specific expression profiles. TSSs specific to different cell types evolve at different rates, whereas promoters of broadly expressed genes are the most conserved. Promoter-based expression analysis reveals key transcription factors defining cell states and links them to binding-site motifs. The functions of identified novel transcripts can be predicted by coexpression and sample ontology enrichment analyses. The functional annotation of the mammalian genome 5 (FANTOM5) project provides comprehensive expression profiles and functional annotation of mammalian cell-type-specific transcriptomes with wide applications in biomedical research.

Relevance:

30.00% 30.00%

Publisher:

Abstract:

Autoimmune diseases are a major health problem. Usually autoimmune disorders are multifactorial and their pathogenesis involves a combination of predisposing variations in the genome and other factors such as environmental triggers. APECED (autoimmune polyendocrinopathy-candidiasis-ectodermal dystrophy) is a rare, recessively inherited autoimmune disease caused by mutations in a single gene. Patients with APECED suffer from several organ-specific autoimmune disorders, often affecting the endocrine glands. The defective gene, AIRE, codes for a transcriptional regulator. The AIRE (autoimmune regulator) protein controls the expression of hundreds of genes, representing a substantial subset of tissue-specific antigens which are presented to developing T cells in the thymus, and has proven to be a key molecule in the establishment of immunological tolerance. However, the molecular mechanisms by which AIRE mediates its functions are still largely obscure. The aim of this thesis has been to elucidate the functions of AIRE by studying the molecular interactions it is involved in, using different cultured cell models. A potential molecular mechanism for the exceptional dominant inheritance of APECED in one family, carrying a glycine 228 to tryptophan (G228W) mutation, was described in this thesis. It was shown that the AIRE polypeptide with the G228W mutation has a dominant negative effect by binding the wild-type AIRE and inhibiting its transactivation capacity in vitro. The data also emphasize the importance of homomultimerization of AIRE in vivo. Furthermore, two novel protein families interacting with AIRE were identified. The importin alpha molecules regulate the nuclear import of AIRE by binding to the nuclear localization signal of AIRE, delineated as a classical monopartite signal sequence. The interaction of AIRE with PIAS E3 SUMO ligases indicates a link to the sumoylation pathway, which plays an important role in the regulation of nuclear architecture.
It was shown that AIRE is not a target for SUMO modification but enhances the localization of SUMO1 and PIAS1 proteins to nuclear bodies. Additional support for the suggestion that AIRE would preferably up-regulate genes with tissue-specific expression pattern and down-regulate housekeeping genes was obtained from transactivation studies performed with two models: human insulin and cystatin B promoters. Furthermore, AIRE and PIAS activate the insulin promoter concurrently in a transactivation assay, indicating that their interaction is biologically relevant. Identification of novel interaction partners for AIRE provides us information about the molecular pathways involved in the establishment of immunological tolerance and deepens our understanding of the role played by AIRE not only in APECED but possibly also in several other autoimmune diseases.

Relevance:

30.00% 30.00%

Publisher:

Abstract:

Evolutionary genetics incorporates traditional population genetics and studies of the origins of genetic variation by mutation and recombination, and the molecular evolution of genomes. Among the primary forces that have the potential to affect genetic variation within and among populations, including those that may lead to adaptation and speciation, are genetic drift, gene flow, mutation and natural selection. The main challenge in understanding the genetic basis of evolutionary change is to distinguish the adaptive selection forces that give rise to existing DNA sequence variants and to identify the nucleotide differences responsible for the observed phenotypic variation. To understand the effects of the various forces, interpretation of gene sequence variation has been the principal basis of many evolutionary genetic studies. The main aim of this thesis was to assess different forms of teleost gene sequence polymorphisms in evolutionary genetic studies of Atlantic salmon (Salmo salar) and other species. Firstly, the extent to which Darwinian adaptive evolution affected coding regions of the growth hormone (GH) gene during teleost evolution was investigated based on sequence data available in public databases. Secondly, a target gene approach was used to identify within-population variation in the growth hormone 1 (GH1) gene in salmon. Then, a new strategy for single nucleotide polymorphism (SNP) discovery in salmonid fishes was introduced, and, finally, the usefulness of a limited number of SNP markers as molecular tools in several applications of population genetics in Atlantic salmon was assessed. This thesis showed that the gene sequences in databases can be utilized to perform comparative studies of molecular evolution, and some putative evidence of Darwinian selection during teleost GH evolution was presented.
In addition, existing sequence data were exploited to investigate GH1 gene variation within Atlantic salmon populations throughout the species' range. Purifying selection is suggested to be the predominant evolutionary force controlling the genetic variation of this gene in salmon, and some support for gene flow between continents was also observed. The novel approach to SNP discovery in species with duplicated genome fragments introduced here proved to be an effective method, and this may have several applications in evolutionary genetics with different species, for example when developing gene-targeted markers to investigate quantitative genetic variation. The thesis also demonstrated that only a few SNPs produced signals highly similar to those of microsatellite markers in some of the population genetic analyses. This may have useful applications when estimating genetic diversity in genes having a potential role in ecological and conservation issues, or when using hard biological samples in genetic studies, as SNPs can be applied to relatively highly degraded DNA.

Relevance:

30.00% 30.00%

Publisher:

Abstract:

Filamentous fungi of the subphylum Pezizomycotina are well known as protein and secondary metabolite producers. Various industries take advantage of these capabilities. However, the molecular biology of yeasts, i.e. Saccharomycotina and especially that of Saccharomyces cerevisiae, the baker's yeast, is much better known. In an effort to explain fungal phenotypes through their genotypes, we have compared the protein-coding gene contents of Pezizomycotina and Saccharomycotina. Only protein families related to biomass degradation and secondary metabolism seem to have expanded recently in Pezizomycotina. Of the protein families clearly diverged between Pezizomycotina and Saccharomycotina, those related to mitochondrial functions emerge as the most prominent. However, the primary metabolism as described in S. cerevisiae is largely conserved in all fungi. Apart from the known secondary metabolism, Pezizomycotina have pathways that could link secondary metabolism to primary metabolism and a wealth of undescribed enzymes. Previous studies of individual Pezizomycotina genomes have shown that regardless of the difference in production efficiency and diversity of secreted proteins, the content of the known secretion machinery genes in Pezizomycotina and Saccharomycotina appears very similar. Genome-wide analysis of gene products is therefore needed to better understand the efficient secretion of Pezizomycotina. We have developed methods applicable to transcriptome analysis of non-sequenced organisms. TRAC (Transcriptional profiling with the aid of affinity capture) has been previously developed at VTT for fast, focused transcription analysis. We introduce a version of TRAC that allows more powerful signal amplification and multiplexing. We also present computational optimisations of transcriptome analysis of non-sequenced organisms and of TRAC analysis in general. Trichoderma reesei is one of the most commonly used Pezizomycotina in the protein production industry.
In order to understand its secretion system better and find clues for improvement of its industrial performance, we have analysed its transcriptomic response to protein secretion stress conditions. In comparison to S. cerevisiae, the response of T. reesei appears different, but still impacts on the same cellular functions. We also discovered in T. reesei interesting similarities to mammalian protein secretion stress response. Together these findings highlight targets for more detailed studies.

Relevance:

30.00% 30.00%

Publisher:

Abstract:

In recent years, the identification of sequence patterns has received considerable attention as a way to better understand their significance for genomic organization and evolutionary processes. To this end, an algorithm has been derived to identify all similar sequence repeats present in a protein sequence. The proposed algorithm is useful for correlating the three-dimensional structures of similar sequence repeats available in the Protein Data Bank with the same sequence repeats present in other databases such as SWISS-PROT, PIR and genome databases.

Relevance:

30.00% 30.00%

Publisher:

Abstract:

The rapid increase in genome sequence information has necessitated the annotation of functional elements, particularly those occurring in the non-coding regions, in their genomic context. The promoter region is the key regulatory region, which enables the gene to be transcribed or repressed, but it is difficult to determine experimentally. Hence, in silico identification of promoters is crucial in order to guide experimental work and to pinpoint the key region that controls the transcription initiation of a gene. In this analysis, we demonstrate that while promoter regions are in general less stable than the flanking regions, their average free energy varies depending on the GC composition of the flanking genomic sequence. We have therefore obtained a set of free energy threshold values for genomic DNA with varying GC content and used them as generic criteria for predicting promoter regions in several microbial genomes, using an in-house developed tool, 'PromPredict'. On applying it to predict promoter regions corresponding to the 1144 and 612 experimentally validated TSSs in E. coli (50.8% GC) and B. subtilis (43.5% GC), sensitivity values of 99% and 95% and precision values of 58% and 60%, respectively, were achieved. For the limited data set of 81 TSSs available for M. tuberculosis (65.6% GC), a sensitivity of 100% and a precision of 49% were obtained.
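
The sensitivity and precision figures quoted in this abstract can be read against the usual definitions, sketched below. This is only an illustration under stated assumptions: the abstract does not say how predictions were matched to TSSs, the function names are hypothetical, and the counts in the demo are invented purely to show the arithmetic. The GC fraction helper reflects the abstract's point that thresholds depend on GC content; the actual free energy thresholds used by PromPredict are not reproduced here.

```python
def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases in a DNA sequence."""
    seq = sequence.upper()
    if not seq:
        raise ValueError("sequence must not be empty")
    return (seq.count("G") + seq.count("C")) / len(seq)


def sensitivity(true_positives: int, false_negatives: int) -> float:
    """Fraction of known TSSs that were covered by a predicted promoter."""
    return true_positives / (true_positives + false_negatives)


def precision(true_positives: int, false_positives: int) -> float:
    """Fraction of predicted promoters that correspond to a known TSS."""
    return true_positives / (true_positives + false_positives)


if __name__ == "__main__":
    # Hypothetical counts, chosen only to illustrate the arithmetic.
    tp, fn, fp = 90, 10, 60
    print(f"GC fraction: {gc_fraction('ATGCGC'):.2f}")
    print(f"sensitivity: {sensitivity(tp, fn):.2f}")
    print(f"precision:   {precision(tp, fp):.2f}")
```

High sensitivity combined with moderate precision, as reported above, means that almost every known TSS is covered while a sizeable share of predictions has no validated TSS nearby.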

Relevance:

30.00% 30.00%

Publisher:

Abstract:

Aiming to identify novel genetic variants and to confirm previously identified genetic variants associated with bone mineral density (BMD), we conducted a three-stage genome-wide association (GWA) meta-analysis in 27 061 study subjects. Stage 1 meta-analyzed seven GWA samples and 11 140 subjects for BMDs at the lumbar spine, hip and femoral neck, followed by a Stage 2 in silico replication of 33 SNPs in 9258 subjects, and by a Stage 3 de novo validation of three SNPs in 6663 subjects. Combining evidence from all the stages, we have identified two novel loci that have not been reported previously at the genome-wide significance (GWS; 5.0 × 10^-8) level: 14q24.2 (rs227425, P-value 3.98 × 10^-13, SMOC1) in the combined sample of males and females and 21q22.13 (rs170183, P-value 4.15 × 10^-9, CLDN14) in the female-specific sample. The two newly identified SNPs were also significant in the GEnetic Factors for OSteoporosis consortium (GEFOS, n = 32 960) summary results. We have also independently confirmed 13 previously reported loci at the GWS level: 1p36.12 (ZBTB40), 1p31.3 (GPR177), 4p16.3 (FGFRL1), 4q22.1 (MEPE), 5q14.3 (MEF2C), 6q25.1 (C6orf97, ESR1), 7q21.3 (FLJ42280, SHFM1), 7q31.31 (FAM3C, WNT16), 8q24.12 (TNFRSF11B), 11p15.3 (SOX6), 11q13.4 (LRP5), 13q14.11 (AKAP11) and 16q24 (FOXL1). Gene expression analysis in osteogenic cells implied potential functional association of the two candidate genes (SMOC1 and CLDN14) in bone metabolism. Our findings independently confirm previously identified biological pathways underlying bone metabolism and contribute to the discovery of novel pathways, thus providing valuable insights into the intervention and treatment of osteoporosis. © The Author 2013. Published by Oxford University Press.

Relevance:

30.00% 30.00%

Publisher:

Abstract:

The current explosion of DNA sequence information has generated increasing evidence for the claim that noncoding repetitive DNA sequences present within and around different genes could play an important role in genetic control processes, although the precise role and mechanism by which these sequences function are poorly understood. Several of the simple repetitive sequences which occur in a large number of loci throughout the human and other eukaryotic genomes satisfy the sequence criteria for forming non-B DNA structures in vitro. We have summarized some of the features of three different types of simple repeats that highlight the importance of repetitive DNA in the control of gene expression and chromatin organization. (i) (TG/CA)n repeats are widespread and conserved in many loci. These sequences are associated with nucleosomes of varying linker length and may play a role in chromatin organization. These Z-potential sequences can help absorb superhelical stress during transcription and aid in recombination. (ii) Human telomeric repeat (TTAGGG)n adopts a novel quadruplex structure and exhibits unusual chromatin organization. This unusual structural motif could explain chromosome pairing and stability. (iii) Intragenic amplification of (CTG)n/(CAG)n trinucleotide repeat, which is now known to be associated with several genetic disorders, could down-regulate gene expression in vivo. The overall implications of these findings vis-à-vis repetitive sequences in the genome are summarized.
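
The three repeat classes named in this abstract, (TG/CA)n, (TTAGGG)n and (CTG)n/(CAG)n, are simple tandem repeats, so a regular expression is enough to locate them in a sequence string. The sketch below is an illustration only: the function name, the `min_copies` cutoff and the example sequence are assumptions, and it searches one strand for exact, uninterrupted copies.

```python
import re
from typing import List, Tuple


def find_tandem_repeats(
    sequence: str, unit: str, min_copies: int = 3
) -> List[Tuple[int, int, int]]:
    """Find exact tandem runs of `unit` with at least `min_copies` copies.

    Returns (start, end, copies) tuples using 0-based, end-exclusive
    coordinates on the given strand.
    """
    unit = unit.upper()
    # Builds a pattern such as (?:TG){3,} for unit="TG", min_copies=3.
    pattern = re.compile(f"(?:{re.escape(unit)}){{{min_copies},}}")
    hits = []
    for match in pattern.finditer(sequence.upper()):
        copies = (match.end() - match.start()) // len(unit)
        hits.append((match.start(), match.end(), copies))
    return hits


if __name__ == "__main__":
    # Made-up sequence containing a (TG)4 run and a (TTAGGG)3 run.
    example = "AATGTGTGTGCC" + "TTAGGGTTAGGGTTAGGG" + "AC"
    for unit in ("TG", "TTAGGG", "CTG"):
        print(unit, find_tandem_repeats(example, unit))
```

To cover the complementary strand as well, the same search can be repeated with the reverse complement of the unit (for example CA for TG).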

Relevance:

30.00% 30.00%

Publisher:

Abstract:

Restriction endonucleases (REases) protect bacteria from invading foreign DNA and are endowed with exquisite sequence specificity. REases have originated from ancestral proteins and evolved new sequence specificities by genetic recombination, gene duplication, replication slippage, and transpositional events. They are also speculated to have evolved from nonspecific endonucleases, attaining a high degree of sequence specificity through point mutations. We describe here an example of the generation of an exquisitely site-specific REase from a highly promiscuous one by a single point mutation.

Relevance:

30.00% 30.00%

Publisher:

Abstract:

Confinement and surface-specific interactions can induce structures otherwise unstable at that temperature and pressure. Here we study the groove-specific water dynamics in the nucleic acid sequences poly-AT and poly-GC in long B-DNA duplex chains by large-scale atomistic molecular dynamics simulations, accompanied by thermodynamic analysis. While water dynamics in the major groove remains insensitive to the sequence differences, exactly the opposite is true for the minor groove water. The much slower water dynamics observed in the minor grooves (especially in the AT minor groove) can be attributed to an enhanced tetrahedral ordering (<t_h>) of water. The largest value of <t_h> in the AT minor groove is related to the spine of hydration found in the X-ray structure. The calculated configurational entropy (S_C) of the water molecules is found to be correlated with the self-diffusion coefficient of water in different regions via the Adam-Gibbs relation D = A exp(-B/(T S_C)), and also with <t_h>.
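
The Adam-Gibbs relation mentioned in this abstract, D = A exp(-B/(T S_C)), becomes a straight line when ln D is plotted against 1/(T S_C), with intercept ln A and slope -B. The sketch below shows that linearization with an ordinary least-squares fit. It is a minimal illustration, not the authors' analysis: the function names are hypothetical and the numbers in the demo are synthetic values in arbitrary units.

```python
import math
from typing import Sequence, Tuple


def adam_gibbs_diffusion(a: float, b: float, temperature: float, s_conf: float) -> float:
    """Evaluate D = A * exp(-B / (T * S_C))."""
    return a * math.exp(-b / (temperature * s_conf))


def fit_adam_gibbs(
    temperature: float, s_conf: Sequence[float], diffusion: Sequence[float]
) -> Tuple[float, float]:
    """Estimate (A, B) by least squares on ln D = ln A - B * x, x = 1/(T*S_C)."""
    xs = [1.0 / (temperature * s) for s in s_conf]
    ys = [math.log(d) for d in diffusion]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return math.exp(intercept), -slope


if __name__ == "__main__":
    # Synthetic data generated from known A and B (arbitrary units).
    temperature = 300.0
    entropies = [5.0, 10.0, 15.0, 20.0]
    diffusions = [adam_gibbs_diffusion(2.0, 500.0, temperature, s) for s in entropies]
    a_fit, b_fit = fit_adam_gibbs(temperature, entropies, diffusions)
    print(f"A = {a_fit:.4f}, B = {b_fit:.2f}")
```

A lower configurational entropy gives a larger 1/(T S_C) and hence a smaller D, which matches the slower minor-groove water described above.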

Relevance:

30.00% 30.00%

Publisher:

Abstract:

Horizontal gene transfer (HGT) is known to be a major force in genome evolution. The acquisition of genes from viruses by eukaryotic genomes is a well-studied example of HGT, including rare cases of non-retroviral RNA virus integration. The present study describes the integration of cucumber mosaic virus RNA-1 into the soybean genome. After an initial metatranscriptomic analysis of small RNAs derived from soybean, the de novo assembly resulted in a 3029-nt contig homologous to RNA-1. The integration of this sequence in the soybean genome was confirmed by DNA deep sequencing. The locus where the integration occurred harbors the full RNA-1 sequence followed by the partial sequence of an endogenous mRNA and another sequence of RNA-1 as an inverted repeat, allowing the formation of a hairpin structure. This region recombined into a retrotransposon located inside an exon of a soybean gene. The nucleotide similarity of the integrated sequence to other cucumber mosaic virus sequences indicates that the integration event occurred recently. We describe a rare event of non-retroviral RNA virus integration in soybean that leads to the production of a double-stranded RNA in a fashion similar to that of virus-resistant RNAi plants.

Relevance:

30.00% 30.00%

Publisher:

Abstract:

Background & objectives: Periplasmic copper and zinc superoxide dismutase (Cu,Zn-SOD or SodC) is an important component of the antioxidant shield which protects bacteria from the phagocytic oxidative burst. Cu,Zn-SODs protect Gram-negative bacteria against oxygen damage and have also been shown to contribute to the pathogenicity of these bacterial species. We report the presence of SodC in drug-resistant Salmonella sp. isolated from patients suffering from enteric fever. Further, sodC was amplified and cloned into Escherichia coli, and the nucleotide and amino acid sequences were compared with those of the standard strain Salmonella Typhimurium 14028. Methods: Salmonella enterica serovar Typhi (S. Typhi) and Salmonella enterica serovar Paratyphi (S. Paratyphi) were isolated and identified from blood samples of the patients. The isolates were screened for the presence of Cu,Zn-SOD by PAGE using KCN as an inhibitor of Cu,Zn-SOD. The gene (sodC) was amplified by PCR, cloned and sequenced. The nucleotide and amino acid sequences of sodC were compared using CLUSTAL X. Results: SodC was detected in 35 per cent of the Salmonella isolates. Amplification of the genomic DNA of S. Typhi and S. Paratyphi with sodC-specific primers resulted in 519 and 515 bp amplicons, respectively. A single mutational difference at position 489 was observed between the sodC of S. Typhi and S. Paratyphi, while they differed at 6 positions from the sodC of S. Typhimurium 14028. The SodC amino acid sequences of the two isolates were homologous, but a difference of 3 amino acids was observed relative to the standard strain S. Typhimurium 14028. Interpretation & conclusions: SodC, which is present in pathogenic bacteria, could be a novel candidate phylogenetic marker.

Relevance:

30.00% 30.00%

Publisher:

Abstract:

DNA methyltransferases (MTases) are a group of enzymes that catalyze the methyl group transfer from S-adenosyl-L-methionine in a sequence-specific manner. Orthodox Type II DNA MTases usually recognize palindromic DNA sequences and add a methyl group to the target base (either adenine or cytosine) on both strands. However, there are a number of MTases that recognize asymmetric target sequences and differ in their subunit organization. In a bacterial cell, after each round of replication, the substrate for any MTase is hemimethylated DNA, and it therefore needs only a single methylation event to restore the fully methylated state. This is consistent with the fact that most of the DNA MTases studied exist as monomers in solution. Nevertheless, multiple lines of evidence suggest that some DNA MTases function as dimers. Further, functional analysis of many restriction-modification systems showed the presence of more than one MTase gene or of fused MTase genes. It was proposed that the presence of two MTases responsible for the recognition and methylation of asymmetric sequences would protect the nascent strands generated during DNA replication from the cognate restriction endonuclease. In this review, MTases recognizing asymmetric sequences have been grouped into different subgroups based on their unique properties. Detailed characterization of these unusual MTases would help in better understanding their specific biological roles and mechanisms of action. The rapid progress made in the genome sequencing of bacteria and archaea may accelerate the identification and study of species- and strain-specific MTases of host-adapted bacteria and their roles in pathogenic mechanisms.
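
The distinction this abstract draws between palindromic and asymmetric target sequences can be checked mechanically: a DNA site is palindromic when it equals its own reverse complement, so both strands read the same 5' to 3'. The sketch below illustrates that test. The function names are hypothetical, only the unambiguous bases A, C, G and T are handled, and the two example sites (GAATTC, the EcoRI site, and GGATG, the FokI site) are chosen here for illustration and are not taken from the abstract.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(sequence: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return sequence.upper().translate(_COMPLEMENT)[::-1]


def is_palindromic(site: str) -> bool:
    """True if the site reads the same on both strands in the 5'->3' direction."""
    return site.upper() == reverse_complement(site)


if __name__ == "__main__":
    for site in ("GAATTC", "GGATG"):
        kind = "palindromic" if is_palindromic(site) else "asymmetric"
        print(f"{site}: reverse complement {reverse_complement(site)}, {kind}")
```

For an asymmetric site the two strands present different sequences, which is the situation in which the abstract proposes that two MTases are needed to methylate both strands.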

Relevance:

30.00% 30.00%

Publisher:

Abstract:

The TCP transcription factors control multiple developmental traits in diverse plant species. Members of this family share an ~60-residue-long TCP domain that binds to DNA. The TCP domain is predicted to form a basic helix-loop-helix (bHLH) structure but shares little sequence similarity with canonical bHLH domains. This classifies the TCP domain as a novel class of DNA binding domain specific to the plant kingdom. Little is known about how the TCP domain interacts with its target DNA. We report the biochemical characterization and DNA binding properties of a TCP member in Arabidopsis thaliana, TCP4. We have shown that the 58-residue domain of TCP4 is essential and sufficient for binding to DNA and possesses DNA binding parameters comparable to those of canonical bHLH proteins. Using a yeast-based random mutagenesis screen and site-directed mutants, we identified the residues important for DNA binding and dimer formation. Mutants defective in binding and dimerization failed to rescue the phenotype of an Arabidopsis line lacking the endogenous TCP4 activity. By combining structure prediction, functional characterization of the mutants, and molecular modeling, we suggest a possible DNA binding mechanism for this class of transcription factors.