916 results for Genome-specific Sequence
Abstract:
Soluble chromatin was prepared from rat testes after a brief micrococcal nuclease digestion. After adsorption onto hydroxylapatite at low ionic strength, the histone H1 subtypes were eluted with a shallow salt gradient of 0.3 M NaCl to 0.7 M NaCl. Histone H1t was eluted at 0.4 M NaCl, while histones H1a and H1c were eluted at 0.43 M NaCl and 0.45 M NaCl, respectively. The extreme divergence of the amino acid sequence of the C-terminal half of histone H1t, the major DNA binding domain of histone H1, from the somatic consensus sequence may contribute to the weaker interaction of histone H1t with the rat testis chromatin. Further, histone H1t was not phosphorylated in vivo, in contrast to histones H1a and H1c, as is evident from the observation that histone H1t lacks the SPKK motif recognized by the CDC-2 kinase or the RR/KXS motif recognized by protein kinase A.
Abstract:
The complete amino acid sequences of the two nonidentical subunits of the glucose/mannose-specific lectin from Dolichos lablab (field bean) have been determined by sequential Edman analyses of the intact subunits and of peptides derived by enzymatic and chemical cleavage. Peptides were purified by reverse-phase high-performance liquid chromatography and ion-pair chromatography. The D. lablab lectin is a glycoprotein having two polypeptide chains of 132 and 105 amino acid residues. The amino acid sequence of the D. lablab lectin is compared with those of various lectins of the family Leguminosae. The D. lablab lectin is the only lectin from the tribe Phaseoleae that contains two nonidentical subunits of almost equal size and that shows specificity for glucose/mannose. The lectin shows greater homology to the glucose/mannose-specific lectins, especially concanavalin A. The unique subunit architecture of the D. lablab lectin indicates the presence of new post-translational cleavage sites.
Abstract:
Monoclonal antibodies (MAbs) have been used extensively for identification of sequence-specific epitopes using the ELISA and/or IRMA methods. However, attempts to use MAbs for identification of conformation-specific epitopes have been very few, as such epitopes are considered very labile. We have investigated the stability of conformation-specific epitopes of human chorionic gonadotropin (hCG) using a quantitative solid-phase radioimmunoassay (SPRIA) technique. Several epitopes are stable to mild modification (chemical and proteolytic) conditions, and epitopes show differential stability to these modifications. Based on these observations, a monoclonal antibody (MAb 16) for an α-subunit-specific epitope of hCG has been used to monitor changes at the epitopic site (identified as epitope 16) on modification of hCG, using SPRIA with immobilized MAb 16. Modifications of amino groups, of the hydroxyl group of tyrosine and of the carboxyl groups of Asp/Glu all bring about sufficient changes in the epitope integrity. Peptide bond hydrolysis at lysine residues damages the epitope, but hydrolysis at arginine residues does not. Hydrolysis at tyrosine does not affect the epitope, though modification of the side-chain of tyrosine inactivates the epitope. Destruction of the epitope occurs on reduction of the disulphide bonds. Partial retention of the epitope activity is seen on modification of carboxyl groups or the epsilon-amino groups of lysine. Based on these results, four to six amino acids have been identified as being at the epitopic site, and the data suggest that two peptide segments are brought together by the disulphide bond Cys10-Cys60 to form the epitope.
Abstract:
A clone showing female-specific expression was identified from an embryonic cDNA library of a mealybug, Planococcus lilacinus. In Southern blots this clone (P7) showed hybridization to genomic DNA of females, but not to that of males. However, P7 showed no hybridization to nuclei of either sex, raising the possibility that it was extrachromosomal in origin. In sectioned adult females P7 hybridized to an abdominal organ called the mycetome. The mycetome is formed by mycetocytes, which are polyploid cells originating from the polar bodies and cleavage nuclei that harbour maternally transmitted, intracellular symbionts. Electron microscopy confirmed the presence of symbionts within the mycetocytes. Sequence analysis showed that P7 is a 16S rRNA gene, confirming its prokaryotic origin. P7 transcripts are localized to one pole in young embryos but are found in the pole as well as in the germ band during later stages of development. P7 expression is detectable in young embryos of both sexes, but the absence of P7 in third instar and adult males suggests that this gene, and hence the endosymbionts, are subject to sex-specific elimination. Copyright (C) 1997 Elsevier Science Ltd.
Abstract:
Differential organisation of homologous chromosomes is related to both sex determination and genomic imprinting in coccid insects, the mealybugs. We report here the identification of two middle repetitive sequences that are differentially organised between the two sexes and also within the same diploid nucleus. These two sequences form a part of the male-specific nuclease-resistant chromatin (NRC) fraction of a mealybug, Planococcus lilacinus. To understand the phenomenon of differential organisation, we have analysed the components of NRC by cloning the DNA sequences present and deciphering their primary sequence, nucleosomal organisation, genomic distribution and cytological localisation. Our observations suggest that the middle repetitive sequences within NRC are functionally significant, and we discuss their probable involvement in male-specific chromatin organisation.
Abstract:
In this paper, we report an analysis of the protein sequence length distribution for 13 bacteria, four archaea and one eukaryote whose genomes have been completely sequenced. The frequency distributions of protein sequence length for all 18 organisms are remarkably similar, independent of genome size, and can be described in terms of a lognormal probability distribution function. A simple stochastic model based on multiplicative processes has been proposed to explain the sequence length distribution. The stochastic model supports the random-origin hypothesis of protein sequences in genomes. Distributions of large proteins deviate from the overall lognormal behavior. Their cumulative distribution follows a power law analogous to Pareto's law used to describe the income distribution of the wealthy. The protein sequence length distribution in genomes of organisms has important implications for microbial evolution and applications. (C) 1999 Elsevier Science B.V. All rights reserved.
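The link between a multiplicative process and a lognormal distribution can be illustrated with a small simulation. This is only a sketch under assumed parameters, not the model from the paper: the function names, the starting length of 100 residues, and the uniform growth factor between 0.8 and 1.25 are all hypothetical choices made for illustration. Because each length is a product of many independent factors, its logarithm is a sum of independent terms and is therefore approximately normal.

```python
import math
import random
import statistics


def simulate_lengths(n_proteins, n_steps, start_length=100.0, seed=0):
    """Grow each length by repeated multiplication with a random factor.

    The factor range (0.8 to 1.25) is an arbitrary illustrative assumption.
    """
    rng = random.Random(seed)
    lengths = []
    for _ in range(n_proteins):
        length = start_length
        for _ in range(n_steps):
            length *= rng.uniform(0.8, 1.25)
        lengths.append(length)
    return lengths


def fit_lognormal(lengths):
    """Return (mu, sigma) of log-lengths, the usual lognormal parameters."""
    logs = [math.log(x) for x in lengths]
    return statistics.fmean(logs), statistics.stdev(logs)


def skewness(values):
    """Population skewness; close to zero for a symmetric distribution."""
    mean = statistics.fmean(values)
    sd = statistics.pstdev(values)
    return sum(((v - mean) / sd) ** 3 for v in values) / len(values)


if __name__ == "__main__":
    lengths = simulate_lengths(n_proteins=2000, n_steps=50, seed=1)
    mu, sigma = fit_lognormal(lengths)
    print("mu =", round(mu, 3), "sigma =", round(sigma, 3))
    print("skewness of raw lengths:", round(skewness(lengths), 3))
    print("skewness of log lengths:",
          round(skewness([math.log(x) for x in lengths]), 3))
```

In such a run the raw lengths are right-skewed while the log-lengths are close to symmetric, which is the qualitative signature of lognormal behavior. The sketch does not reproduce the power-law tail reported for large proteins.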
Abstract:
Triplex forming oligonucleotides (TFOs) have the potential to modulate gene expression. While most experiments are directed towards triplex-mediated inhibition of gene expression, the strategy could potentially be used for gene-specific activation. In an attempt to design a strategy for gene-specific activation in vivo that is applicable to a large number of genes, we have designed a TFO-based activator-target system which may be utilized in Saccharomyces cerevisiae or any other system where Gal4 protein is ectopically expressed. The total genome sequence of Saccharomyces cerevisiae and expression profiles were used to select the target genes with upstream poly(pu/py) sequences. We have utilized the paradigm of Gal4 protein and its binding site. We describe here the selection of target genes and the design of the hairpin-TFO, including the targeting sequences containing the polypurine stretch found in the upstream promoter regions of weakly expressed genes. We demonstrate the formation of the hairpin-TFO, its binding to Gal4 protein, its ability to form a triplex with the target duplex in vitro, and the effect of polyethylenimine on complex formation, and we discuss the implications for in vivo transcription activation.
Abstract:
The study of the activity of cloned promoters in slow-growing Mycobacterium tuberculosis during long-term growth in vitro or inside macrophages requires a genome-integration-proficient promoter probe vector that can be stably maintained even without antibiotics and that carries a substrate-independent, easily scorable and highly sensitive reporter gene. To meet this requirement, we constructed pAKMN2, which contains the mycobacterial codon-optimized gfpm2+ gene, coding for GFPm2+, which has the highest fluorescence reported to date; the mycobacteriophage L5 attP-int sequence for genome integration; and a multiple cloning site. pAKMN2 showed stable integration and expression of GFPm2+ from the M. tuberculosis and M. smegmatis genomes. Expression of GFPm2+, driven by the cloned minimal promoters of the M. tuberculosis cell division gene ftsZ (MtftsZ), could be detected in the M. tuberculosis/pAKMN2-promoter integrants growing at exponential phase in defined medium in vitro and inside macrophages. Stable expression from the genome-integrated format even without antibiotic, and high sensitivity of detection by flow cytometry and fluorescence imaging in spite of single-copy integration, make pAKMN2 useful for the study of cloned promoters of any mycobacterial species under long-term in vitro growth or stress conditions, or inside macrophages.
Abstract:
Phosphoinositide-specific phospholipase C (PLC) is involved in Ca2+-mediated signalling events that lead to altered cellular status. Using various sequence-analysis methods, we identified two conserved motifs in known PLC sequences. The identified motifs are located in the C2 domain of plant PLCs and are not found in any other protein. These motifs are specifically found in the Ca2+-binding loops and form adjoining beta strands. Further, we identified certain conserved residues that are highly distinct from the corresponding residues of animal PLCs. The motifs reported here could be used to annotate plant-specific phospholipase C sequences. Furthermore, we demonstrated that the C2 domain alone is capable of targeting PLC to the membrane in response to a Ca2+ signal. We also showed that the binding event results from a change in the hydrophobicity of the C2 domain upon Ca2+ binding. Bioinformatic analyses revealed that all PLCs from Arabidopsis and rice lack a transmembrane domain, myristoylation and GPI-anchor protein modifications. Our bioinformatic study indicates that plant PLCs are located in the cytoplasm, the nucleus and the mitochondria. Our results suggest that there are no distinct isoforms of plant PLCs, as have been proposed to exist in the soluble and membrane-associated fractions. The same isoform could potentially be present in both subcellular fractions, depending on the calcium level of the cytosol. Overall, these data suggest that the C2 domain of PLC plays a vital role in calcium signalling.
Abstract:
The rapidly growing structure databases enhance the probability of finding identical sequences sharing structural similarity. Structure prediction methods are being used extensively to bridge the gap between known protein sequences and solved structures, which is essential to understanding their specific biochemical and cellular functions. In this work, we plan to study the ambiguity in sequence-structure relationships and examine whether sequentially identical peptide fragments adopt similar three-dimensional structures. Fragments of varying lengths (five to ten residues) were used to observe the behavior of sequences and their three-dimensional structures. The STAMP program was used to superpose the three-dimensional structures, and two parameters (the Sequence Structure Similarity Score (Sc) and the Root Mean Square Deviation value) were employed to classify them into three categories: similar, intermediate and dissimilar structures. Furthermore, the same approach was carried out on all the three-dimensional protein structures solved in the two organisms Mycobacterium tuberculosis and Plasmodium falciparum to validate our results.
Abstract:
Takifugu rubripes is a teleost fish widely used in comparative genomics to better understand the human system because of its similarities to human in both the number and the structure of genes. In this work we survey the fugu genome and, using sensitive computational approaches, identify the repertoire of putative protein kinases and classify them into groups and subfamilies. The fugu genome encodes 519 protein kinase-like sequences, and this number of putative protein kinases is closely comparable to that of human. However, in spite of the similarities to human kinases at the group level, there are differences at the subfamily level, as noted in the case of the KIS and DYRK subfamilies, which contribute to differences that are specific to the adaptation of the organism. Also, a unique domain combination of the galectin domain and the YkA domain suggests alternate mechanisms for immune response and binding to lipoproteins. Lastly, an overall similarity with the MAPK pathway of humans suggests its importance for understanding signaling mechanisms in humans. Overall, the fugu serves as a good model organism for understanding the roles of human kinases, as far as kinases such as LRRK and IRAK and their associated pathways are concerned.
Abstract:
Over the past two decades, many ingenious efforts have been made in protein remote homology detection. Because homologous proteins often diversify extensively in sequence, it is challenging to demonstrate such relatedness through entirely sequence-driven searches. Here, we describe a computational method for the generation of 'protein-like' sequences that serves to bridge gaps in protein sequence space. Sequence profile information, as embodied in a position-specific scoring matrix of multiply aligned sequences of bona fide family members, serves as the starting point in this algorithm. The observed amino acid propensity and the selection of a random number dictate the selection of a residue for each position in the sequence. In a systematic manner, and by applying a 'roulette-wheel' selection approach at each position, we generate parent family-like sequences and thus facilitate an enlargement of sequence space around the family. When such sequences are generated for a large number of families, we demonstrate that they expand the utility of natural intermediately related sequences in linking distant proteins. In 91% of the assessed examples, inclusion of designed sequences improved fold coverage by 5-10% over searches made in their absence. Furthermore, with several examples from proteins adopting folds such as TIM, globin, lipocalin and others, we demonstrate that including designed sequences in a database positively sensitized methods such as PSI-BLAST and Cascade PSI-BLAST, and that it is a promising opportunity for enormously improved remote homology recognition using sequence information alone.
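The roulette-wheel step described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes the profile is supplied as one dictionary of non-negative residue propensities per alignment position (real position-specific scoring matrices usually hold log-odds scores that would first need converting), and the names `roulette_pick` and `generate_sequence` are hypothetical.

```python
import random


def roulette_pick(weights, rng):
    """Pick one residue with probability proportional to its weight.

    `weights` maps residue letters to non-negative propensities (assumed).
    """
    total = sum(w for w in weights.values() if w > 0)
    if total <= 0:
        raise ValueError("at least one residue needs a positive weight")
    spin = rng.random() * total      # where the wheel stops
    cumulative = 0.0
    chosen = None
    for residue, weight in weights.items():
        if weight <= 0:
            continue                 # zero-weight residues own no slice
        cumulative += weight
        chosen = residue
        if spin < cumulative:
            break
    return chosen


def generate_sequence(profile, rng):
    """Build one sequence by spinning the wheel once per position."""
    return "".join(roulette_pick(column, rng) for column in profile)


if __name__ == "__main__":
    rng = random.Random(42)
    # Toy three-position profile with made-up propensities.
    toy_profile = [
        {"M": 0.9, "L": 0.1},
        {"K": 0.5, "R": 0.4, "E": 0.1},
        {"L": 0.6, "I": 0.3, "V": 0.1},
    ]
    for _ in range(5):
        print(generate_sequence(toy_profile, rng))
```

Each position is sampled independently here, so any covariation between positions in the original alignment is ignored. That is a simplification of this sketch.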
Abstract:
Chromosomal aberration is considered to be one of the major characteristic features in many cancers. Chromosomal translocation, one type of genomic abnormality, can lead to deregulation of critical genes involved in regulating important physiological functions such as cell proliferation and DNA repair. Although chromosomal translocations were thought to be random events, recent findings suggest that certain regions in the human genome are more susceptible to breakage than others. The possibility of deviation from the usual B-DNA conformation in such fragile regions has been an active area of investigation. This review summarizes the factors that contribute towards the fragility of these regions in the chromosomes, such as DNA sequences and the role of different forms of DNA structures. Proteins responsible for chromosomal fragility, and their mechanism of action are also discussed. The effect of positioning of chromosomes within the nucleus favoring chromosomal translocations and the role of repair mechanisms are also addressed.
Suite of tools for statistical N-gram language modeling for pattern mining in whole genome sequences
Abstract:
Genome sequences contain a number of patterns that have biomedical significance. Repetitive sequences of various kinds are a primary component of most genomic sequence patterns. We extended the suffix-array-based Biological Language Modeling Toolkit to compute n-gram frequencies as well as n-gram language-model-based perplexity in windows over the whole genome sequence to find biologically relevant patterns. We present the suite of tools and their application to the analysis of the whole human genome sequence.
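Windowed n-gram counting and perplexity can be sketched in a few lines. This is not the toolkit's code: it swaps the suffix array for plain dictionary counting, uses a bigram model with add-one smoothing as an assumed choice, expects sequences over the alphabet ACGT only, and all function names are hypothetical. Windows dominated by a repeat that the model has seen often receive a low perplexity, while more varied windows score higher.

```python
import math
from collections import Counter


def count_ngrams(seq, n):
    """Count every overlapping n-gram in the sequence."""
    return Counter(seq[i:i + n] for i in range(len(seq) - n + 1))


def window_perplexity(window, bigrams, alphabet="ACGT"):
    """Perplexity of a window (length >= 2) under an add-one bigram model."""
    # Number of times each symbol occurs as the first letter of a bigram.
    contexts = {a: sum(bigrams[a + b] for b in alphabet) for a in alphabet}
    log_sum = 0.0
    for a, b in zip(window, window[1:]):
        prob = (bigrams[a + b] + 1) / (contexts[a] + len(alphabet))
        log_sum += math.log(prob)
    return math.exp(-log_sum / (len(window) - 1))


def scan(seq, window, step):
    """Return (start, perplexity) per window; the model is trained on seq."""
    bigrams = count_ngrams(seq, 2)
    return [
        (start, window_perplexity(seq[start:start + window], bigrams))
        for start in range(0, len(seq) - window + 1, step)
    ]


if __name__ == "__main__":
    # Toy sequence: an AT repeat followed by a more varied stretch.
    toy = "AT" * 20 + "GGCCGTCAGCTGACCGTTGC"
    for start, ppl in scan(toy, window=20, step=20):
        print(start, round(ppl, 2))
```

A suffix array, as used by the toolkit, would allow n-gram counts for arbitrary n to be looked up without rebuilding a table for each n. The dictionary approach here is only practical for short n and small inputs.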
Abstract:
Staphylococcus aureus is a major human pathogen, first recognized as a leading cause of hospital-acquired infections. Community-associated S. aureus (CA-SA) pose a greater threat due to an increase in the severity of infection and disease among children and healthy adults. CA-SA strains in India are genetically diverse; among them is sequence type (ST) 772, which has now spread to Australia, Europe and Japan. Towards understanding the genetic characteristics of ST772, we obtained draft genome sequences of five relevant clinical isolates and studied the properties of their PVL-carrying prophages, whose presence is a defining hallmark of CA-SA. We show that this is a novel prophage, which carries the structural genes of the hlb-carrying prophage and includes the sea enterotoxin. This architecture probably emerged early within the ST772 lineage, at least in India. The sea gene, unique to ST772 PVL, despite having promoter sequence characteristics typical of low expression, appears to be highly expressed during the early phase of growth in laboratory conditions. We speculate that this might be a consequence of its novel sequence context. The crippled nature of the hlb-converting prophage in ST772 suggests that widespread mobility of the sea enterotoxin might be a selective force behind its 'transfer' to the PVL prophage. Wild type ST772 strains induced strong proliferative responses as well as high cytotoxic activity against neutrophils, likely mediated by superantigen SEA and the PVL toxin, respectively. Both proliferation and cytotoxicity were markedly reduced in a cured ST772 strain, indicating the impact of the phage on virulence. The presence of SEA alongside the genes for the immune system-modulating PVL toxin may contribute to the success and virulence of ST772.