208 resultados para Functional Classification Trees
Resumo:
This paper presents a Chance-constraint Programming approach for constructing maximum-margin classifiers which are robust to interval-valued uncertainty in training examples. The methodology ensures that uncertain examples are classified correctly with high probability by employing chance-constraints. The main contribution of the paper is to pose the resultant optimization problem as a Second Order Cone Program by using large deviation inequalities, due to Bernstein. Apart from support and mean of the uncertain examples these Bernstein based relaxations make no further assumptions on the underlying uncertainty. Classifiers built using the proposed approach are less conservative, yield higher margins and hence are expected to generalize better than existing methods. Experimental results on synthetic and real-world datasets show that the proposed classifiers are better equipped to handle interval-valued uncertainty than state-of-the-art.
Resumo:
In the malarial parasite, enzymes of heme-biosynthetic pathway are distributed in different cellular compartments. The site of localization of ferrochelatase in the malarial parasite is crucial, since it will decide the ultimate site of heme synthesis. Earlier results have differed in terms of localization, being the mitochondrion or apicoplast and the functional enzyme has not been cloned, expressed and characterized. The present study reveals that Plasmodium falciparum ferrochelatase (PfFC) gene encodes multiple transcripts of which the one encoding the full length functional protein (PfFC) has been cloned and the recombinant protein over-expressed and purified from E. coli cells. The enzyme shows maximum activity with iron, while zinc is a poor substrate. Immunofluorescence studies with antibodies to functional ferrochelatase reveal that the native enzyme is localized to the mitochondrion of the parasite indicating that this organelle is the ultimate site of heme synthesis.
Resumo:
We propose a novel technique for robust voiced/unvoiced segment detection in noisy speech, based on local polynomial regression. The local polynomial model is well-suited for voiced segments in speech. The unvoiced segments are noise-like and do not exhibit any smooth structure. This property of smoothness is used for devising a new metric called the variance ratio metric, which, after thresholding, indicates the voiced/unvoiced boundaries with 75% accuracy for 0dB global signal-to-noise ratio (SNR). A novelty of our algorithm is that it processes the signal continuously, sample-by-sample rather than frame-by-frame. Simulation results on TIMIT speech database (downsampled to 8kHz) for various SNRs are presented to illustrate the performance of the new algorithm. Results indicate that the algorithm is robust even in high noise levels.
Resumo:
This paper presents two algorithms for smoothing and feature extraction for fingerprint classification. Deutsch's(2) Thinning algorithm (rectangular array) is used for thinning the digitized fingerprint (binary version). A simple algorithm is also suggested for classifying the fingerprints. Experimental results obtained using such algorithms are presented.
Resumo:
A positive cis-acting DNA element in the near 5'-upstream region of the CYP2B1/B2 genes in rat liver was found to play an important role in the transcription of these genes. An oligonucleotide covering -69 to -98 nt mimicked the gel mobility shift pattern given by the fragment -179 to +29 nt, which was earlier found adequate to confer the regulatory features of this gene. Two major complexes were seen, of which the slower and faster moving complexes became intense under uninduced and Phenobarbitone-induced conditions respectively. Minigene cloned DNA plasmid covering -179 to +181 nt in pUC 19 and Bal 31 mutants derived from this parent were transcribed in whole nuclei and cell free transcription extracts and mutants containing only upto -75 nt of the upstream were poorly transcribed. Transcription extracts from phenobarbitone-injected rat liver nuclei were significantly more active than extracts from uninduced rats in transcribing the minigene constructs. Addition of the oligonucleotide (-69 to -98nt) specifically inhibited the transcription of the minigene construct (-179 to +181 nt) in the cell free transcription system. It is therefore, concluded that the region -69 to -98 nt acts as a positive cis-acting element in the transcription of the CYP2B1/B2 genes and in mediating the inductive effects of phenobarbitone.
Resumo:
Plants are sessile organisms that have evolved a variety of mechanisms to maintain their cellular homeostasis under stressful environmental conditions. Survival of plants under abiotic stress conditions requires specialized group of heat shock protein machinery, belonging to Hsp70:J-protein family. These heat shock proteins are most ubiquitous types of chaperone machineries involved in diverse cellular processes including protein folding, translocation across cell membranes, and protein degradation. They play a crucial role in maintaining the protein homeostasis by reestablishing functional native conformations under environmental stress conditions, thus providing protection to the cell. J-proteins are co-chaperones of Hsp70 machine, which play a critical role by stimulating Hsp70s ATPase activity, thereby stabilizing its interaction with client proteins. Using genome-wide analysis of Arabidopsis thaliana, here we have outlined identification and systematic classification of J-protein co-chaperones which are key regulators of Hsp70s function. In comparison with Saccharomyces cerevisiae model system, a comprehensive domain structural organization, cellular localization, and functional diversity of A. thaliana J-proteins have also been summarized. Electronic supplementary material The online version of this article (doi:10.1007/s10142-009-0132-0) contains supplementary material, which is available to authorized users.
Resumo:
Elephants use vocalizations for both long and short distance communication. Whereas the acoustic repertoire of the African elephant (Loxodonta africana) has been extensively studied in its savannah habitat, very little is known about the structure and social context of the vocalizations of the Asian elephant (Elephas maximus), which is mostly found in forests. In this study, the vocal repertoire of wild Asian elephants in southern India was examined. The calls could be classified into four mutually exclusive categories, namely, trumpets, chirps, roars, and rumbles, based on quantitative analyses of their spectral and temporal features. One of the call types, the rumble, exhibited high structural diversity, particularly in the direction and extent of frequency modulation of calls. Juveniles produced three of the four call types, including trumpets, roars, and rumbles, in the context of play and distress. Adults produced trumpets and roars in the context of disturbance, aggression, and play. Chirps were typically produced in situations of confusion and alarm. Rumbles were used for contact calling within and among herds, by matriarchs to assemble the herd, in close-range social interactions, and during disturbance and aggression. Spectral and temporal features of the four call types were similar between Asian and African elephants.
Pi-turns in proteins and peptides: Classification, conformation, occurrence, hydration and sequence.
Resumo:
The i + 5-->i hydrogen bonded turn conformation (pi-turn) with the fifth residue adopting alpha L conformation is frequently found at the C-terminus of helices in proteins and hence is speculated to be a "helix termination signal." An analysis of the occurrence of i + 5-->i hydrogen bonded turn conformation at any general position in proteins (not specifically at the helix C-terminus), using coordinates of 228 protein crystal structures determined by X-ray crystallography to better than 2.5 A resolution is reported in this paper. Of 486 detected pi-turn conformations, 367 have the (i + 4)th residue in alpha L conformation, generally occurring at the C-terminus of alpha-helices, consistent with previous observations. However, a significant number (111) of pi-turn conformations occur with (i + 4)th residue in alpha R conformation also, generally occurring in alpha-helices as distortions either at the terminii or at the middle, a novel finding. These two sets of pi-turn conformations are referred to by the names pi alpha L and pi alpha R-turns, respectively, depending upon whether the (i + 4)th residue adopts alpha L or alpha R conformations. Four pi-turns, named pi alpha L'-turns, were noticed to be mirror images of pi alpha L-turns, and four more pi-turns, which have the (i + 4)th residue in beta conformation and denoted as pi beta-turns, occur as a part of hairpin bend connecting twisted beta-strands. Consecutive pi-turns occur, but only with pi alpha R-turns. The preference for amino acid residues is different in pi alpha L and pi alpha R-turns. However, both show a preference for Pro after the C-termini. Hydrophilic residues are preferred at positions i + 1, i + 2, and i + 3 of pi alpha L-turns, whereas positions i and i + 5 prefer hydrophobic residues. Residue i + 4 in pi alpha L-turns is mainly Gly and less often Asn. Although pi alpha R-turns generally occur as distortions in helices, their amino acid preference is different from that of helices. Poor helix formers, such as His, Tyr, and Asn, also were found to be preferred for pi alpha R-turns, whereas good helix former Ala is not preferred. pi-Turns in peptides provide a picture of the pi-turn at atomic resolution. Only nine peptide-based pi-turns are reported so far, and all of them belong to pi alpha L-turn type with an achiral residue in position i + 4. The results are of importance for structure prediction, modeling, and de novo design of proteins.
Resumo:
The recA locus of pathogenic mycobacteria differs from that of nonpathogenic species because it contains large intervening sequences nested in the RecA homology region that are excised by an unusual protein-splicing reaction. In vivo assays indicated that Mycobacterium tuberculosis recA partially complemented Escherichia coli recA mutants for recombination and mutagenesis. Further, splicing of the 85 kDa precursor to 38 kDa MtRecA protein was necessary for the display of its activity, in vivo. To gain insights into the molecular basis for partial and lack of complementation by MtRecA and 85 kDa proteins, respectively, we purified both of them to homogeneity. MtRecA protein, but not the 85 kDa form, bound stoichiometrically to single-stranded DNA in the presence of ATP. MtRecA protein was cross-linked to 8-azidoadenosine 5'-triphosphate with reduced efficiency, and kinetic analysis of ATPase activity suggested that it is due to decreased affinity for ATP. In contrast, the 85 kDa form was unable to bind ATP, in the presence or absence of ssDNA and, consequently, was entirely devoid of ATPase activity. Molecular modeling studies suggested that the decreased affinity of MtRecA protein for ATP and the reduced efficiency of its hydrolysis might be due to the widening of the cleft which alters the hydrogen bonds and the contact area between the enzyme and the substrate and changes in the disposition of the amino acid residues around the magnesium ion and the gamma-phosphate. The formation of joint molecules promoted by MtRecA protein was stimulated by SSB when the former was added first. The probability of an association between the lack and partial levels of biological activity of RecA protein(s) to that of illegitimate recombination in pathogenic mycobacteria is considered.
Resumo:
The current explosion of DNA sequence information has generated increasing evidence for the claim that noncoding repetitive DNA sequences present within and around different genes could play an important role in genetic control processes, although the precise role and mechanism by which these sequences function are poorly understood. Several of the simple repetitive sequences which occur in a large number of loci throughout the human and other eukaryotic genomes satisfy the sequence criteria for forming non-B DNA structures in vitro. We have summarized some of the features of three different types of simple repeats that highlight the importance of repetitive DNA in the control of gene expression and chromatin organization. (i) (TG/CA)n repeats are widespread and conserved in many loci. These sequences are associated with nucleosomes of varying linker length and may play a role in chromatin organization. These Z-potential sequences can help absorb superhelical stress during transcription and aid in recombination. (ii) Human telomeric repeat (TTAGGG)n adopts a novel quadruplex structure and exhibits unusual chromatin organization. This unusual structural motif could explain chromosome pairing and stability. (iii) Intragenic amplification of (CTG)n/(CAG)n trinucleotide repeat, which is now known to be associated with several genetic disorders, could down-regulate gene expression in vivo. The overall implications of these findings vis-à-vis repetitive sequences in the genome are summarized.
Resumo:
Contraction of an edge e merges its end points into a new single vertex, and each neighbor of one of the end points of e is a neighbor of the new vertex. An edge in a k-connected graph is contractible if its contraction does not result in a graph with lesser connectivity; otherwise the edge is called non-contractible. In this paper, we present results on the structure of contractible edges in k-trees and k-connected partial k-trees. Firstly, we show that an edge e in a k-tree is contractible if and only if e belongs to exactly one (k + 1) clique. We use this characterization to show that the graph formed by contractible edges is a 2-connected graph. We also show that there are at least |V(G)| + k - 2 contractible edges in a k-tree. Secondly, we show that if an edge e in a partial k-tree is contractible then e is contractible in any k-tree which contains the partial k-tree as an edge subgraph. We also construct a class of contraction critical 2k-connected partial 2k-trees.
Resumo:
In the present work we report a rapid microwave irradiation-assisted chemical synthesis technique for the growth of nanoparticles, nanorods, and nanotubes of a variety of metal oxides in the presence of an appropriate surfactant (cationic, anionic, non ionic and polymeric), without the use of any templates. The method is simple, inexpensive, and helps one to prepare nanostructures in quick time, measured in seconds and minutes. This method has been applied successfully to synthesize nanostructures of a variety of binary and ternary metal oxides such as ZnO, CdO, Fe2O3, CuO, Ga2O3, Gd2O3, ZnFe2O4, etc. There is an observed variation in the morphology of the nanostructures with changes in different process parameters, such as microwave power, irradiation time, identity of solvent, type of surfactant, and its concentration.