25 resultados para Molecular Sequence Data
em Indian Institute of Science - Bangalore - Índia
Resumo:
The role of lectins in mediating cancer metastasis, apoptosis as well as various other signaling events has been well established in the past few years. Data on various aspects of the role of lectins in cancer is being accumulated at a rapid pace. The data on lectins available in the literature is so diverse, that it becomes difficult and time-consuming, if not impossible to comprehend the advances in various areas and obtain the maximum benefit. Not only do the lectins vary significantly in their individual functional roles, but they are also diverse in their sequences, structures, binding site architectures, quaternary structures, carbohydrate affinities and specificities as well as their potential applications. An organization of these seemingly independent data into a common framework is essential in order to achieve effective use of all the data towards understanding the roles of different lectins in different aspects of cancer and any resulting applications. An integrated knowledge base (CancerLectinDB) together with appropriate analytical tools has therefore been developed for lectins relevant for any aspect of cancer, by collating and integrating diverse data. This database is unique in terms of providing sequence, structural, and functional annotations for lectins from all known sources in cancer and is expected to be a useful addition to the number of glycan related resources now available to the community. The database has been implemented using MySQL on a Linux platform and web-enabled using Perl-CGI and Java tools. Data for individual lectins pertain to taxonomic, biochemical, domain architecture, molecular sequence and structural details as well as carbohydrate specificities. Extensive links have also been provided for relevant bioinformatics resources and analytical tools. Availability of diverse data integrated into a common framework is expected to be of high value for various studies on lectin cancer biology.
Resumo:
The mechanism of translation in eubacteria and organelles is thought to be similar. In eubacteria, the three initiation factors IF1, IF2, and IF3 are vital. Although the homologs of IF2 and IF3 are found in mammalian mitochondria, an IF1 homolog has never been detected. Here, we show that bovine mitochondrial IF2 (IF2mt) complements E. coli containing a deletion of the IF2 gene (E. coli ΔinfB). We find that IF1 is no longer essential in an IF2mt-supported E. coli ΔinfB strain. Furthermore, biochemical and molecular modeling data show that a conserved insertion of 37 amino acids in the IF2mt substitutes for the function of IF1. Deletion of this insertion from IF2mt supports E. coli for the essential function of IF2. However, in this background, IF1 remains essential. These observations provide strong evidence that a single factor (IF2mt) in mammalian mitochondria performs the functions of two eubacterial factors, IF1 and IF2.
Resumo:
Dimethyl sulphoxide complexes of lanthanide and yttrium nitrates of the general formula M(DMSO)n(NO3)3 where M = La, Ce, Pr, Nd, Sm or Gd; n = 4 and M = Y, Ho or Yb; n = 3 have been isolated and characterized. The i.r. data besides excluding the presence of D3h nitrate, reveal co-ordination through the oxygen atom of the dimethyl sulphoxide. The complexes are monomeric in acetonitrile. Molecular conductance data in acetone, acetonitrile, dimethyl formamide and dimethyl sulphoxide suggest a co-ordination number of eight for the lighter lanthanides and seven for yttrium and the heavier lanthanides.
Resumo:
Rare earth perchlorate-antipyrine (ap) complexes of the formula Ln (ClO4)3.6 ap have been prepared and characterised. Infrared and electronic spectra showed the co-ordination through carbonyl oxygen. Conductivity and molecular weight data indicated a co-ordination number of six for these complexes.
Resumo:
Elucidation of the detailed structural features and sequence requirements for iv helices of various lengths could be very important in understanding secondary structure formation in proteins and, hence. in the protein folding mechanism. An algorithm to characterize the geometry of an alpha helix from its C-alpha coordinates has been developed and used to analyze the structures of long cu helices (number of residues greater than or equal to 25) found in globular proteins, the crystal structure coordinates of which are available from the Brookhaven Protein Data Bank, Ail long a helices can be unambiguously characterized as belonging to one of three classes: linear, curved, or kinked, with a majority being curved. Analysis of the sequences of these helices reveals that the long alpha helices have unique sequence characteristics that distinguish them from the short alpha helices in globular proteins, The distribution and statistical propensities of individual amino acids to occur in long alpha heices are different from those found in short alpha helices, with amino acids having longer side chains and/or having a greater number of functional groups occurring more frequently in these helices, The sequences of the long alpha helices can be correlated with their gross structural features, i.e., whether they are curved, linear, or kinked, and in case of the curved helices, with their curvature.
Resumo:
Tetrapeptide sequences of the type Z-Pro-Y-X were obtained from the crystal structure data on 34 globular proteins, and used in an analysis of the positional preferences of the individual amino acid residues in the β-turn conformation. The effect of fixing proline as the second position residue in the tetrapeptide sequence was studied by comparing the data obtained on the positional preferences with the corresponding data obtained by Chou and Fasman using the Z-R-Y-X sequence, where no particular residue was fixed in any of the four positions. While, in general, several amino acid residues having relatively very high or very low preferences for specific positions were found to be common to both the Z-Pro-Y-X and Z-R-Y-X sequences, many significant differences were found between the two sets of data, which are to be attributed to specific interactions arising from the presence of the proline residue.
Resumo:
Based upon a stereochemical guideline, two topologically distinct types of helicalduplexes have been deduced for a polynucleotide duplex with alternating purine pyrimidine sequence (PAPP): (a) right-handed uniform (RU) helix and (b) left-handed zig-zag (LZ) helix. Both structures have trinucleoside diphosphate as the basic unit wherein the purine pyrimidine fragment has a different conformation from the pyrimidine-purine fragment. Thus, RU and LZ helices represent two different classes of sequence-dependent molecular conformations for PAPP. The conformationalf eatures of an RU helix of PAPP in B-form and three LZ-helices for B-, D- and Z-forms are discussed.
Resumo:
The DNA polymorphism among 22 isolates of Sclerospora graminicola, the causal agent of downy mildew disease of pearl millet was assessed using 20 inter simple sequence repeats (ISSR) primers. The objective of the study was to examine the effectiveness of using ISSR markers for unravelling the extent and pattern of genetic diversity in 22 S. graminicola isolates collected from different host cultivars in different states of India. The 19 functional ISSR primers generated 410 polymorphic bands and revealed 89% polymorphism and were able to distinguish all the 22 isolates. Polymorphic bands used to construct an unweighted pair group method of averages (UPGMA) dendrogram based on Jaccard's co-efficient of similarity and principal coordinate analysis resulted in the formation of four major clusters of 22 isolates. The standardized Nei genetic distance among the 22 isolates ranged from 0.0050 to 0.0206. The UPGMA clustering using the standardized genetic distance matrix resulted in the identification of four clusters of the 22 isolates with bootstrap values ranging from 15 to 100. The 3D-scale data supported the UPGMA results, which resulted into four clusters amounting to 70% variation among each other. However, comparing the two methods show that sub clustering by dendrogram and multi dimensional scaling plot is slightly different. All the S. graminicola isolates had distinct ISSR genotypes and cluster analysis origin. The results of ISSR fingerprints revealed significant level of genetic diversity among the isolates and that ISSR markers could be a powerful tool for fingerprinting and diversity analysis in fungal pathogens.
Resumo:
Software packages NUPARM and NUCGEN, are described, which can be used to understand sequence directed structural variations in nucleic acids, by analysis and generation of non-uniform structures. A set of local inter basepair parameters (viz. tilt, roll, twist, shift, slide and rise) have been defined, which use geometry and coordinates of two successive basepairs only and can be used to generate polymeric structures with varying geometries for each of the 16 possible dinucleotide steps. Intra basepair parameters, propeller, buckle, opening and the C6...C8 distance can also be varied, if required, while the sugar phosphate backbone atoms are fixed in some standard conformation ill each of the nucleotides. NUPARM can be used to analyse both DNA and RNA structures, with single as well as double stranded helices. The NUCGEN software generates double helical models with the backbone fixed in B-form DNA, but with appropriate modifications in the input data, it can also generate A-form DNA ar rd RNA duplex structures.
Resumo:
The crystal and molecular structure of the ammonium salt of deoxycytidylyl-(3'-5')-deoxyguanosine has been determined from 0.85 A resolution single crystal X-ray diffraction data. The crystals obtained by acetone diffusion technique at -20 degrees C, are orthorhombic, P212121, a = 12.880(2), b = 17444(2) and c = 27.642(2) A. The structure was solved by high resolution Patterson and Fourier methods and refined to R = 0.136. There are two d(CpG) molecules in the asymmetric unit forming a mini left handed Z-DNA helix. This is in contrast to the earlier reported forms of d(CpG) where the molecules form self base paired duplexes. There are two ammonium ions in the asymmetric unit. The major groove NH+4 ion interacts with N7 of guanines through water bridges besides making H-bonded interactions directly with the phosphate oxygen atoms. A second NH+4 ion is found in the minor groove interacting directly with the phosphate oxygen atoms. Symmetry related molecules pack in such a way that the cytosine base stacks on cytosine and guanine base on guanine. Our structure demonstrates that alternating d(CpG) sequences have the ability to adopt the left handed Z-DNA structure even at the dimer level i.e., in a sequence which is only two base pairs long.
Resumo:
Determining the sequence of amino acid residues in a heteropolymer chain of a protein with a given conformation is a discrete combinatorial problem that is not generally amenable for gradient-based continuous optimization algorithms. In this paper we present a new approach to this problem using continuous models. In this modeling, continuous "state functions" are proposed to designate the type of each residue in the chain. Such a continuous model helps define a continuous sequence space in which a chosen criterion is optimized to find the most appropriate sequence. Searching a continuous sequence space using a deterministic optimization algorithm makes it possible to find the optimal sequences with much less computation than many other approaches. The computational efficiency of this method is further improved by combining it with a graph spectral method, which explicitly takes into account the topology of the desired conformation and also helps make the combined method more robust. The continuous modeling used here appears to have additional advantages in mimicking the folding pathways and in creating the energy landscapes that help find sequences with high stability and kinetic accessibility. To illustrate the new approach, a widely used simplifying assumption is made by considering only two types of residues: hydrophobic (H) and polar (P). Self-avoiding compact lattice models are used to validate the method with known results in the literature and data that can be practically obtained by exhaustive enumeration on a desktop computer. We also present examples of sequence design for the HP models of some real proteins, which are solved in less than five minutes on a single-processor desktop computer Some open issues and future extensions are noted.
Resumo:
The importance and usefulness of local doublet parameters in understanding sequence dependent effects has been described for A- and B-DNA oligonucleotide crystal structures. Each of the two sets of local parameters described by us in the NUPARM algorithm, namely the local doublet parameters, calculated with reference to the mean z-axis, and the local helical parameters, calculated with reference to the local helix axis, is sufficient to describe the oligonucleotide structures, with the local helical parameters giving a slightly magnified picture of the variations in the structures. The values of local doublet parameters calculated by NUPARM algorithm are similar to those calculated by NEWHELIX90 program, only if the oligonucleotide fragment is not too distorted. The mean values obtained using all the available data for B-DNA crystals are not significantly different from those obtained when a limited data set is used, consisting only of structures with a data resolution of better than 2.4 A and without any bound drug molecule. Thus the variation observed in the oligonucleotide crystals appears to be independent of the quality of their crystallinity. No strong correlation is seen between any pair of local doublet parameters but the local helical parameters are interrelated by geometric relationships. An interesting feature that emerges from this analysis is that the local rise along the z-axis is highly correlated with the difference in the buckle values of the two basepairs in the doublet, as suggested earlier for the dodecamer structures (Bansal and Bhattacharyya, in Structure & Methods: DNA & RNA, Vol. 3 (Eds., R.H. Sarma and M.H. Sarma), pp. 139-153 (1990)). In fact the local rise values become almost constant for both A- and B-forms, if a correction is applied for the buckling of the basepairs. In B-DNA the AA, AT, TA and GA basepair sequences generally have a smaller local rise (3.25 A) compared to the other sequences (3.4 A) and this seems to be an intrinsic feature of basepair stacking interaction and not related to any other local doublet parameter. The roll angles in B-DNA oligonucleotides have small values (less than +/- 8 degrees), while mean local twist varies from 24 degrees to 45 degrees. The CA/TG doublet sequences show two types of preferred geometries, one with positive roll, small positive slide and reduced twist and another with negative roll, large positive slide and increased twist.(ABSTRACT TRUNCATED AT 400 WORDS)
Resumo:
Recent experimental studies have shown that the Rec-A mediated homologous recombination reaction involves a triple helical intermediate, in which the third strand base forms hydrogen bonds with both the bases in the major groove of the Watson-Crick duplex. Such 'mixed' hydrogen bonds allow formation of sequence independent triplexes. DNA triple helices involving 'mixed' hydrogen bonds have been studied, using model building, molecular mechanics (MM) and molecular dynamics (MD). Models were built for a tripler comprising all four possible triplets viz., G.C*C, C.G*G, A.T*T and T.A*A. To check the stability of all the 'mixed' hydrogen bonds in such triplexes and the conformational preferences of such tripler structures, MD studies were carried out starting from two structures with 30 degrees and 36 degrees twist between the basepairs. It was observed that though the two triplexes converged towards a similar structure, the various hydrogen bonds between the WC duplex and the third strand showed differential stabilities. An MD simulation with restrained hydrogen bonds showed that the resulting structure was stable and remained close to the starting structure. These studies help us in defining stable hydrogen bond geometries involving the third strand and the WC duplex. It was observed that in the C.G*G triplets the N7 atom of the second strand is always involved in hydrogen bonding. In the G.C*C triplets, either N3 or O2 in the third strand cytosine can interchangeably act as a hydrogen bond acceptor.
Resumo:
DNA triple helices containing two purine strands and one pyrimidine strand (C.G*G and T.A*A) have been studied, using model building followed by energy minimisation, for different orientations of the third strand resulting from variation in the hydrogen bonding between the Watson-Crick duplex and the third strand and the glycosidic torsion angle in the third strand. Our results show that in the C.G*G case the structure with a parallel orientation of the third strand, resulting from Hoogsteen hydrogen bonds between the third strand and the Watson-Crick duplex, is energetically the most favourable while in the T.A*A case the antiparallel orientation of the third strand, resulting from reverse Hoogsteen hydrogen bonds, is energetically the most favourable. These studies when extended to the mixed sequence triplexes, in which the second strand is a mixture of G and A, correspondingly the third strand is a mixture of G and APT, show that though the parallel orientation is still energetically more favourable, the antiparallel orientation becomes energetically comparable with an increasing number of thymines in the third strand. Structurally, for the mixed triplexes containing G and T in the third strand, it is seen that the basepair non-isomorphism between the C.G*G and the T.A*T triplets can be overcome with some changes in the base pair parameters without much distortion of either the backbone or the hydrogen bonds.
Resumo:
Sesbania mosaic virus (SMV) is a plant virus infecting Sesbania grandiflora plants in Andhra Pradesh, India. Amino acid sequence of the tryptic peptides of SMV coat protein were determined using a gas phase sequenator. These sequences showed identical amino acids at 69% of the positions when aligned with the corresponding residues of southern bean mosaic virus (SBMV).Crystals diffracting to better than 3 Å resolution were obtained by precipitating the virus with ammonium sulphate. The crystals belonged to rhombohedral space group R3 with α = 291·4 Å and α = 61·9°. Three-dimensional X-ray diffraction data on these crystals were collected to a resolution of 4·7 Å, using a Siemens-Nicolet area detector system. Self-rotation function studies revealed the icosahedral symmetry of the virus particles, as well as their precise orientation in the unit cell. Cross-rotation function and modelling studies with SBMV showed that it is a valid starting model for SMV structure determination. Low resolution phases computed using a polyalanine model of SBMV were subjected to refinement and extension by real-space electron density averaging and solvent flattening. The final electron density map revealed a polypeptide fold similar to SBMV. The single disulphide bridge of SBMV coat protein is retained in SMV. Four icosahedrally independent cation binding sites have been tentatively identified. Three of these sites, related by a quasi threefold axis, are also found in SBMV. The fourth site is situated on the quasi threefold axis. Aspartic acid residues, which replace Ile218 of SBMV from the quasi threefold-related subunits are suitable ligands to the cation at this site