208 results for Functional Classification Trees
Abstract:
This paper studies the problem of constructing robust classifiers when the training data are plagued with uncertainty. The problem is posed as a Chance-Constrained Program (CCP) which ensures that the uncertain data points are classified correctly with high probability. Unfortunately, such a CCP turns out to be intractable. The key novelty is in employing Bernstein bounding schemes to relax the CCP as a convex second-order cone program whose solution is guaranteed to satisfy the probabilistic constraint. Prior to this work, only Chebyshev-based relaxations were exploited in learning algorithms. Bernstein bounds employ richer partial information and hence can be far less conservative than Chebyshev bounds. Due to this efficient modeling of uncertainty, the resulting classifiers achieve higher classification margins and hence better generalization. Methodologies for classifying uncertain test data points and error measures for evaluating classifiers robust to uncertain data are discussed. Experimental results on synthetic and real-world datasets show that the proposed classifiers are better equipped to handle data uncertainty and outperform the state of the art in many cases.
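For orientation, the Chebyshev-based relaxation that the abstract names as the prior approach can be written out as below. This is a sketch under assumed notation, none of which appears in the abstract: a linear classifier (w, b), an uncertain point X_i with label y_i, mean \mu_i and covariance \Sigma_i, a slack variable \xi_i, and a tolerated violation probability \epsilon. It illustrates the Chebyshev baseline only, not the paper's Bernstein relaxation.

```latex
% Chance constraint: the uncertain point is classified correctly
% (up to slack \xi_i) with probability at least 1 - \epsilon.
\Pr\left\{ y_i \left( w^\top X_i + b \right) \ge 1 - \xi_i \right\} \ge 1 - \epsilon

% Sufficient condition from the one-sided Chebyshev inequality,
% using only the mean \mu_i and covariance \Sigma_i of X_i:
y_i \left( w^\top \mu_i + b \right) \ge 1 - \xi_i
    + \kappa \left\| \Sigma_i^{1/2} w \right\|_2 ,
\qquad \kappa = \sqrt{\frac{1 - \epsilon}{\epsilon}}
```

The second inequality is a second-order cone constraint in (w, b, \xi_i). Because \kappa grows quickly as \epsilon shrinks, this bound can be conservative, which is the motivation the abstract gives for using richer partial information.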
Abstract:
Our ability to infer protein quaternary structure automatically from atom and lattice information is inadequate, especially for weak complexes and heteromeric quaternary structures. Several approaches exist, but they have limited performance. Here, we present a new scheme to infer protein quaternary structure from lattice and protein information, with all-around coverage for strong, weak and very weak affinity homomeric and heteromeric complexes. The scheme combines a naive Bayes classifier and point group symmetry under a Boolean framework to detect quaternary structures in the crystal lattice. It consistently produces >= 90% coverage across diverse benchmarking data sets, including a notably superior 95% coverage for recognition of heteromeric complexes, compared with 53% on the same data set by the current state-of-the-art method. The detailed study of a limited number of prediction-failed cases offers interesting insights into the intriguing nature of protein contacts in the lattice. The findings have implications for accurate inference of quaternary states of proteins, especially weak affinity complexes.
Abstract:
A general analysis of squeezing transformations for two-mode systems is given based on the four-dimensional real symplectic group Sp(4, R). Within the framework of the unitary (metaplectic) representation of this group, a distinction between compact photon-number-conserving and noncompact photon-number-nonconserving squeezing transformations is made. We exploit the U(2) invariant squeezing criterion to divide the set of all squeezing transformations into a two-parameter family of distinct equivalence classes with representative elements chosen for each class. Familiar two-mode squeezing transformations in the literature are recognized in our framework and seen to form a set of measure zero. Examples of squeezed coherent and thermal states are worked out. The need to extend the heterodyne detection scheme to encompass all of U(2) is emphasized, and known experimental situations where all U(2) elements can be reproduced are briefly described.
Abstract:
The influence of the dispersion of uniformly sized mono-functional and bi-functional ("Janus") particles on the ionic conductivity of novel "soggy sand" electrolytes, and its implications for mechanical strength and lithium-ion battery performance, are discussed here.
Abstract:
Monoclonal antibodies (mAbs) to chicken thiamin carrier protein (TCP) have been produced by hybridoma technology to identify the crucial epitopes involved in bioneutralization of the vitamin carrier. Confirmation of the monoclonality of these mAbs (A4C4, F3H6, H8H3, C8C1 and G7H10) was sought by sub-class isotyping; they all belong to IgG1, k type. The epitopes recognized by all five mAbs are conserved in TCP from the chicken to the rat, as assessed by liquid phase RIA and immunoprecipitation of I-125-labelled proteins from pregnant rat serum. Among these mAbs, only C8C1, when used for passive immunization of pregnant rats on three consecutive days (days 10, 11 and 12), resulted in embryonic resorption. These results demonstrate the importance of the epitopic structure specified by the mAb C8C1 on TCP during pregnancy in rats.
Abstract:
EcoP15I DNA methyltransferase recognizes the sequence 5'-CAGCAG-3' and transfers a methyl group to N-6 of the second adenine residue in the recognition sequence. All N-6 adenine methyltransferases contain two highly conserved sequences, FxGxG (motif I), postulated to form part of the S-adenosyl-L-methionine binding site and (D/N/S)PP(Y/F) (motif IV) involved in catalysis. We have altered the second glycine residue in motif I to arginine and serine, and substituted tyrosine in motif IV with tryptophan in EcoP15I DNA methyltransferase, using site-directed mutagenesis. The mutant enzymes were overexpressed, purified and characterized by biochemical methods. The mutations in motif I completely abolished AdoMet binding but left target DNA recognition unaltered. Although the mutation in motif IV resulted in loss of enzyme activity, we observed enhanced crosslinking of S-adenosyl-L-methionine and DNA. This implies that DNA and AdoMet binding sites are close to motif IV. Taken together, these results reinforce the importance of motif I in AdoMet binding and motif IV in catalysis. Additionally, limited proteolysis and UV crosslinking experiments with EcoP15I DNA methyltransferase imply that DNA binds in a cleft formed by two domains in the protein. Methylation protection analysis provides evidence for the fact that EcoP15I DNA MTase makes contacts in the major groove of its substrate DNA. Interestingly, hypermethylation of the guanine residue next to the target adenine residue indicates that the protein probably flips out the target adenine residue. (C) 1996 Academic Press Limited
Abstract:
Molybdenum-cofactor (Moco) biosynthesis is an evolutionarily conserved pathway in almost all kingdoms of life, including humans. Two proteins, MogA and MoeA, catalyze the last step of this pathway in bacteria, whereas a single two-domain protein carries out catalysis in eukaryotes. Here, three crystal structures of the Moco-biosynthesis protein MogA from the two thermophilic organisms Thermus thermophilus (TtMogA; 1.64 angstrom resolution, space group P2(1)) and Aquifex aeolicus (AaMogA; 1.70 angstrom resolution, space group P2(1) and 1.90 angstrom resolution, space group P1) have been determined. The functional roles and the residues involved in oligomerization of the protein molecules have been identified based on a comparative analysis of these structures with those of homologous proteins. Furthermore, functional roles have been proposed for the N- and C-terminal residues. In addition, a possible protein-protein complex of MogA and MoeA has been proposed and the residues involved in protein-protein interactions are discussed. Several invariant water molecules and those present at the subunit interfaces have been identified and their possible structural and/or functional roles are described in brief. In addition, molecular-dynamics and docking studies with several small molecules (including the substrate and the product) have been carried out in order to estimate their binding affinities towards AaMogA and TtMogA. The results obtained are further compared with those obtained for homologous eukaryotic proteins.
Abstract:
Diisopropoxytitanium(III) tetrahydroborate, (i-PrO)2TiBH4, generated in situ in dichloromethane from diisopropoxytitanium dichloride and benzyltriethylammonium borohydride in a 1:2 ratio, selectively reduces aldehydes, ketones, acid chlorides, carboxylic acids, and N-Boc-protected amino acids to the corresponding alcohols in excellent yield under very mild reaction conditions (-78 to 25 degrees C).
Abstract:
In this paper, we look at the problem of scheduling expression trees with reusable registers on delayed load architectures. Reusable registers come into the picture when the compiler has a data-flow analyzer which is able to estimate the extent of use of the registers. Earlier work considered the same problem without allowing for register variables. Subsequently, Venugopal considered non-reusable registers in the tree. We further extend these efforts to consider a much more general form of the tree. We describe an approximate algorithm for the problem. We formally prove that the code schedule produced by this algorithm will, in the worst case, generate one interlock and use just one more register than that used by the optimal schedule. Spilling is minimized. The approximate algorithm is simple and has linear complexity.
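As background for the register counts discussed in this abstract, the sketch below computes the classical Sethi-Ullman (Ershov) label of a binary expression tree on a load/store machine, where every leaf must first be loaded into a register. It is not the approximate algorithm of the paper: it ignores load delays, interlocks and reusable registers. The `Node` class and the function name `ershov` are hypothetical names chosen for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Node:
    """Binary expression tree node; a leaf has no children."""
    op: str
    left: Optional["Node"] = None
    right: Optional["Node"] = None


def ershov(node: Node) -> int:
    """Minimum registers needed to evaluate the tree without spilling,
    assuming every operand must be loaded into a register first."""
    if node.left is None and node.right is None:
        return 1  # a leaf occupies one register once loaded
    l, r = ershov(node.left), ershov(node.right)
    # Evaluate the costlier subtree first and hold its result in one register;
    # when both sides cost the same, one extra register is needed.
    return max(l, r) if l != r else l + 1


def leaf(name: str) -> Node:
    return Node(name)


# (a + b) * (c + d): both subtrees need 2 registers, so the root needs 3.
balanced = Node("*", Node("+", leaf("a"), leaf("b")), Node("+", leaf("c"), leaf("d")))
# a + (b + (c + d)): a right-leaning chain needs only 2 registers.
chain = Node("+", leaf("a"), Node("+", leaf("b"), Node("+", leaf("c"), leaf("d"))))
```

This label is the baseline against which the abstract's bound of one register more than the optimal schedule can be read.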
Abstract:
The evolutionary diversity of the HSP70 gene family at the genetic level has generated complex structural variations leading to altered functional specificity and mode of regulation in different cellular compartments. By utilizing Saccharomyces cerevisiae as a model system for better understanding the global functional cooperativity between Hsp70 paralogs, we have dissected the differences in functional properties at the biochemical level between mitochondrial heat shock protein 70 (mtHsp70) Ssc1 and an uncharacterized Ssc3 paralog. Based on the evolutionary origin of Ssc3 and a high degree of sequence homology with Ssc1, it has been proposed that both have a close functional overlap in the mitochondrial matrix. Surprisingly, our results demonstrate that there is no functional cross-talk between Ssc1 and Ssc3 paralogs. The lack of in vivo functional overlap is due to the altered conformation and significantly lower stability associated with Ssc3. The substrate-binding domain of Ssc3 showed poor affinity toward mitochondrial client proteins and Tim44 due to the open conformation in the ADP-bound state. In addition, the nucleotide-binding domain of Ssc3 showed an altered regulation by the Mge1 co-chaperone due to a high degree of conformational plasticity, which strongly promotes aggregation. Besides, Ssc3 possesses a dysfunctional inter-domain interface, thus rendering it unable to perform functions similar to generic Hsp70s. Moreover, we have identified the critical amino acid sequence of Ssc1 and Ssc3 that can "make or break" mtHsp70 chaperone function. Together, our analysis provides the first evidence to show that the nucleotide-binding domain of mtHsp70s plays a critical role in determining the functional specificity among paralogs and orthologs across kingdoms.
Abstract:
Transcription activator C employs a unique mechanism to activate the mom gene of bacteriophage Mu. The activation process involves facilitating the recruitment of RNA polymerase (RNAP) by altering the topology of the promoter and enhancing promoter clearance by reducing abortive transcription. To understand the basis of this multi-step activation mechanism, we investigated the nature of the physical interaction between C and RNAP during the process. A variety of assays revealed that only DNA-bound C contacts the beta' subunit of RNAP. Consistent with these results, we have also isolated RNAP mutants having mutations in the beta' subunit which were compromised in C-mediated activation. Mutant RNAPs show reduced productive transcription and increased abortive initiation specifically at the C-dependent mom promoter. Positive control (pc) mutants of C, defective in interaction with RNAP, retained the property of recruiting RNAP to the promoter but were unable to enhance promoter clearance. These results strongly suggest that the recruitment of RNAP to the mom promoter does not require physical interaction with C, whereas a contact between the beta' subunit and the activator, and the subsequent allosteric changes in the active site of the enzyme, are essential for the enhancement of promoter clearance.
Abstract:
Three classification techniques, namely K-means Cluster Analysis (KCA), Fuzzy Cluster Analysis (FCA), and Kohonen Neural Networks (KNN), were employed to group 25 microwatersheds of the Kherthal watershed, Rajasthan, into homogeneous groups to form the basis for suitable conservation and management practices. Ten parameters, mainly morphological, are used for the classification: drainage density (D_d), bifurcation ratio (R_b), stream frequency (F_u), length of overland flow (L_o), form factor (R_f), shape factor (B_s), elongation ratio (R_e), circulatory ratio (R_c), compactness coefficient (C_c) and texture ratio (T). The optimal number of groups is chosen based on two cluster validation indices, Davies-Bouldin and Dunn's. Comparative analysis of the clustering techniques revealed that 13 microwatersheds out of 25 (52%) are commonly suggested by KCA, FCA and KNN; 17 out of 25 (68%) are commonly suggested by KCA and FCA, 16 out of 25 (64%) by FCA and KNN, and 15 out of 25 (60%) by KNN and KCA. KNN sensitivity analysis shows that the effect of the number of epochs (1000, 3000, 5000) and learning rates (0.01, 0.1-0.9) on total squared error values is significant, even though no fixed trend is observed. Sensitivity analysis also revealed that, when the number of groups is increased from 5 to 6, 7 and 8, microwatersheds occupy all the groups, although the number in each group differs. (C) 2010 International Association of Hydro-environment Engineering and Research, Asia Pacific Division. Published by Elsevier B.V. All rights reserved.
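The two validation indices named in this abstract can be sketched in a few lines of standard-library Python. This is an illustration of the textbook definitions, not the authors' implementation. The function names and the toy 2-D points are assumptions made here, whereas the study clusters 25 microwatersheds on ten parameters. A lower Davies-Bouldin value and a higher Dunn value both indicate compact, well-separated groups.

```python
from itertools import combinations
from math import dist  # Euclidean distance, Python 3.8+


def centroid(points):
    """Coordinate-wise mean of a list of equal-length tuples."""
    n = len(points)
    return tuple(sum(c) / n for c in zip(*points))


def davies_bouldin(clusters):
    """Davies-Bouldin index for a list of clusters (lower is better)."""
    centers = [centroid(c) for c in clusters]
    # Scatter: mean distance from each member to its own cluster centroid.
    scatter = [sum(dist(p, m) for p in c) / len(c)
               for c, m in zip(clusters, centers)]
    k = len(clusters)
    worst = [
        max((scatter[i] + scatter[j]) / dist(centers[i], centers[j])
            for j in range(k) if j != i)
        for i in range(k)
    ]
    return sum(worst) / k


def dunn(clusters):
    """Dunn index: smallest between-cluster distance divided by the largest
    cluster diameter (higher is better). Assumes at least one cluster
    contains two or more points."""
    min_separation = min(dist(p, q)
                         for a, b in combinations(clusters, 2)
                         for p in a for q in b)
    max_diameter = max(dist(p, q)
                       for c in clusters
                       for p, q in combinations(c, 2))
    return min_separation / max_diameter


# Hypothetical toy data: the same four points grouped sensibly and badly.
good = [[(0, 0), (0, 1)], [(10, 0), (10, 1)]]
bad = [[(0, 0), (10, 0)], [(0, 1), (10, 1)]]
```

To choose the number of groups in this way, one would compute both indices for each candidate partition and prefer the one that the indices jointly favour.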
Abstract:
Occasionally, ribosomes stall on mRNAs prior to the completion of the polypeptide chain. In Escherichia coli and other eubacteria, tmRNA-mediated trans-translation is a major mechanism that recycles the stalled ribosomes. The tmRNA possesses a tRNA-like domain and a short mRNA region encoding a short peptide (ANDENYALAA in E. coli) followed by a termination codon. The first amino acid (Ala) of this peptide, encoded by the resume codon (GCN), is highly conserved in tmRNAs in different species. However, the reasons for the high evolutionary conservation of the resume codon identity have remained unclear. In this study, we show that changing the E. coli tmRNA resume codon to other efficiently translatable codons retains efficient functioning of the tmRNA. However, when the resume codon was replaced with low-usage codons, its function was adversely affected. Interestingly, expression of tRNAs decoding the low-usage codon from plasmid-borne gene copies restored efficient utilization of tmRNA. We discuss why, in E. coli, GCA (Ala) is one of the best codons and why all codons in the short mRNA of the tmRNA are decoded by abundant tRNAs.