160 results for Topic identification


Relevance: 20.00%

Abstract:

Many popular models are available for document classification, such as the Naïve Bayes classifier, k-Nearest Neighbors and the Support Vector Machine. In all these cases, the representation is based on the “Bag of Words” model, which does not capture the actual semantic meaning of a word in a particular document. Semantics are better captured by the proximity of words and their occurrence in the document. We propose a new “Bag of Phrases” model to capture this discriminative power of phrases for text classification. We present a novel algorithm to extract phrases from the corpus using the well-known topic model Latent Dirichlet Allocation (LDA) and to integrate them into the vector space model for classification. Experiments show that classifiers perform better with the new Bag of Phrases model than with related representation models.
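
The limitation of word counts described in this abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: the abstract extracts phrases with LDA, whereas this sketch simply treats adjacent word pairs as stand-in phrases. The function names and the two example sentences are assumptions made for illustration only.

```python
from collections import Counter

def bag_of_words(text):
    """Count single lower-cased tokens; word order is discarded."""
    return Counter(text.lower().split())

def bag_of_phrases(text, n=2):
    """Count runs of n adjacent tokens, used here as stand-in 'phrases'.

    Assumption: plain word n-grams replace the LDA-based phrase
    extraction that the abstract describes.
    """
    tokens = text.lower().split()
    return Counter(" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1))

doc_a = "the dog bit the man"
doc_b = "the man bit the dog"

# Same words, different meaning: word counts cannot tell the documents apart,
# while phrase counts can.
print(bag_of_words(doc_a) == bag_of_words(doc_b))      # True
print(bag_of_phrases(doc_a) == bag_of_phrases(doc_b))  # False
```

Either counter can be turned into a vector-space representation by fixing a vocabulary order and reading off the counts.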

Relevance: 20.00%

Abstract:

Host cell remodelling is a hallmark of malaria pathogenesis. It involves protein folding, unfolding and trafficking events, and the participation of chaperones such as Hsp70s and Hsp40s has therefore been widely speculated. Until recently, Hsp40s were thought to be the sole representatives of the parasite chaperones in the exportome. However, based on the re-annotated Plasmodium falciparum genome sequence, a putative candidate for an exported Hsp70 has been reported, which was previously thought to be a pseudogene. We raised a specific antiserum against a C-terminal peptide uniquely present in PfHsp70-x. Immunoblotting and immunofluorescence-based approaches, in combination with sub-cellular fractionation by saponin and streptolysin-O, were used to determine the expression and localization of PfHsp70-x in infected erythrocytes. The re-annotated sequence of PfHsp70-x reveals it to be a functional protein with an endoplasmic reticulum signal peptide. It is maximally expressed at the schizont stage of the intra-erythrocytic life cycle. The majority of the protein localizes to the parasitophorous vacuole, and some of it is exported to the erythrocyte compartment, where it associates with Maurer's clefts. The identification of an exported parasite Hsp70 chaperone shows that the parasite has evolved customized chaperones, which might play crucial roles in trafficking and host cell remodelling.

Relevance: 20.00%

Abstract:

IR spectroscopy has been widely employed to distinguish between different crystal forms such as polymorphs, clathrates, hydrates and co-crystals. IR has been used to monitor co-crystal formation and to detect single synthons. In this work, we have developed a strategy to identify multiple supramolecular synthons in polymorphs and co-crystals with this technique. The identification of multiple synthons in co-crystals with IR is difficult for several reasons. In this paper, a four-step method is used that involves well-assigned IR spectral markers corresponding to bonds in a synthon. IR spectra of three forms of the co-crystal system 4-hydroxybenzoic acid : 4,4'-bipyridine (2 : 1) show clear differences that may be attributed to differences in the synthon combinations existing in the forms (synthon polymorphism). These differences were picked out from the three IR spectra, and the bands were analysed and assigned to synthons. Our method first identifies IR marker bands corresponding to (covalent) bonds in known/model crystals, and the markers are then mapped in known co-crystals having single synthons. Thereafter, the IR markers are queried in known co-crystals with multiple synthons. Finally, they are queried in unknown co-crystals with multiple synthons. In the last part of the study, the N-H stretching absorptions of primary amides that crystallize with the amide dimers linked in a ladder-like chain show two specific absorptions, which are used as marker absorptions, and all variations of this band structure have been used to provide details on the environment around the dimer. The extended dimer can accordingly be easily distinguished from the isolated dimer.

Relevance: 20.00%

Abstract:

Using a Girsanov change of measures, we propose novel variations within a particle-filtering algorithm, as applied to the inverse problem of state and parameter estimation of nonlinear dynamical systems of engineering interest. The variations aim to weakly correct for the linearization or integration errors that almost invariably occur whilst numerically propagating the process dynamics, typically governed by nonlinear stochastic differential equations (SDEs). Specifically, the correction for linearization, provided by the likelihood or the Radon-Nikodym derivative, is incorporated within the evolving flow in two steps. Once the likelihood, an exponential martingale, is split into a product of two factors, the correction owing to the first factor is implemented via rejection sampling in the first step. The second factor, which is directly computable, is accounted for via two different schemes: one employs resampling, and the other uses a gain-weighted innovation term added to the drift field of the process dynamics, thereby overcoming the problem of sample dispersion posed by resampling. The proposed strategies, employed as add-ons to existing particle filters (the bootstrap and auxiliary SIR filters in this work), are found to non-trivially improve the convergence and accuracy of the estimates and also to yield reduced mean square errors of such estimates vis-a-vis those obtained through the parent filtering schemes.
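
For orientation, the parent bootstrap (SIR) filter mentioned in this abstract can be sketched as follows. This is only the baseline scheme, not the Girsanov-based corrections the abstract proposes. The scalar random-walk model, the noise levels, the particle count and the function name are all assumptions chosen to keep the sketch short.

```python
import math
import random

def bootstrap_filter(observations, n_particles=500, process_sigma=1.0,
                     obs_sigma=1.0, seed=0):
    """Bootstrap (SIR) filter for an assumed toy model:
    x_k = x_{k-1} + process noise,  y_k = x_k + observation noise."""
    rng = random.Random(seed)
    particles = [rng.gauss(0.0, 1.0) for _ in range(n_particles)]
    estimates = []
    for y in observations:
        # 1. Propagate every particle through the process model.
        particles = [x + rng.gauss(0.0, process_sigma) for x in particles]
        # 2. Weight by the Gaussian observation likelihood, computed in the
        #    log domain and shifted by the maximum for numerical stability.
        log_w = [-0.5 * ((y - x) / obs_sigma) ** 2 for x in particles]
        top = max(log_w)
        weights = [math.exp(lw - top) for lw in log_w]
        total = sum(weights)
        weights = [w / total for w in weights]
        # 3. Posterior-mean estimate from the weighted ensemble.
        estimates.append(sum(w * x for w, x in zip(weights, particles)))
        # 4. Multinomial resampling in proportion to the weights.
        particles = rng.choices(particles, weights=weights, k=n_particles)
    return estimates

estimates = bootstrap_filter([5.0] * 20)
print(round(estimates[-1], 2))
```

With a constant observation stream the estimate settles near the observed value, which is a quick sanity check rather than a measure of filter quality.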

Relevance: 20.00%

Abstract:

This paper presents a multi-class support vector machine (SVM) approach for locating and diagnosing faults in electric power distribution feeders with the penetration of Distributed Generations (DGs). The proposed approach is based on the three-phase voltage and current measurements that are available at all the sources, i.e. at the substation and at the connection points of the DGs. To illustrate the proposed methodology, a practical distribution feeder emanating from a 132/11 kV grid substation in India, with loads and a suitable number of DGs at different locations, is considered. To show the effectiveness of the proposed methodology, practical situations in distribution systems (DS) are studied, covering all types of faults with a wide range of fault locations, source short circuit (SSC) levels and fault impedances. The proposed fault location scheme is capable of accurately identifying the fault type, the location of the faulted feeder section and the fault impedance. The results demonstrate the feasibility of applying the proposed method in practical smart grid distribution automation (DA) for fault diagnosis.

Relevance: 20.00%

Abstract:

Trypanosoma evansi is the most extensively distributed trypanosome and is responsible for the disease called surra in livestock in many countries, including frequent outbreaks in India. The prevalence of this disease is most commonly reported using standard parasitological detection methods (SPDM); however, antibody ELISA with locally produced whole cell lysate (WCL) antigens is in practice in many countries. In the present investigation, we attempted to identify and purify immunodominant, infection-specific trypanosome antigens from the T. evansi proteome by immunoblot, using serum from experimentally infected equines. Three immunodominant clusters of proteins, i.e. 62-66 kDa, 52-55 kDa and 41-43 kDa, were identified based on their consistent reactivity with sequential sera from donkeys experimentally infected with T. evansi, up to 280 days post infection (dpi). The protein cluster of 62-66 kDa was purified in bulk in native form and comparatively evaluated against the whole cell lysate (WCL) antigen. ELISA and immunoblot showed that the polypeptide of this cluster is 100% sensitive in the detection of early and chronic infection. Further, this protein cluster was also found to be immunoreactive against hyperimmune serum raised against a predominantly 66 kDa exo-antigen, indicating that it is a common immunodominant moiety in the proteome and secretome of T. evansi.

Relevance: 20.00%

Abstract:

C-di-GMP [Bis-(3'-5')-cyclic-dimeric-guanosine monophosphate], a second messenger, is involved in intracellular communication in bacterial species. As a result, several multi-cellular behaviors in both Gram-positive and Gram-negative bacteria are directly linked to the intracellular level of c-di-GMP. The cellular concentration of c-di-GMP is maintained by two opposing activities, diguanylate cyclase (DGC) and phosphodiesterase (PDE-A). In Mycobacterium smegmatis, a single bifunctional protein, MSDGC-1, is responsible for the cellular concentration of c-di-GMP. A better understanding of the regulation of c-di-GMP at the genetic level is necessary to control the function of these two activities. In this work, we have characterized the promoter element present in msdgc-1 along with the +1 transcription start site and identified the sigma factors that regulate the transcription of msdgc-1. Interestingly, msdgc-1 utilizes SigA during the initial phase of growth, whereas near the stationary phase SigB-containing RNA polymerase takes over the expression of msdgc-1. We report here that the promoter activity of msdgc-1 increases during starvation or depletion of a carbon source such as glucose or glycerol. When msdgc-1 is deleted, the number of viable cells in the stationary phase is approximately 10 times higher than that of the wild type. We propose here that msdgc-1 is involved in the regulation of cell population density. (C) 2013 Elsevier B.V. All rights reserved.

Relevance: 20.00%

Abstract:

Traditional taxonomy based on morphology has often failed in accurate species identification owing to the occurrence of cryptic species, which are reproductively isolated but morphologically identical. Molecular data have thus been used to complement morphology in species identification. The sexual advertisement calls in several groups of acoustically communicating animals are species-specific and can thus complement molecular data as non-invasive tools for identification. Several statistical tools and automated identifier algorithms have been used to investigate the efficiency of acoustic signals in species identification. Despite a plethora of such methods, there is a general lack of knowledge regarding the appropriate usage of these methods in specific taxa. In this study, we investigated the performance of two commonly used statistical methods, discriminant function analysis (DFA) and cluster analysis, in identification and classification based on acoustic signals of field cricket species belonging to the subfamily Gryllinae. Using a comparative approach, we evaluated, for both methods, the optimal number of species and calling song characteristics that lead to the most accurate classification and identification. The accuracy of classification using DFA was high and was not affected by the number of taxa used. However, a constraint in using discriminant function analysis is the need for a priori classification of songs. The accuracy of classification using cluster analysis, which does not require a priori knowledge, was maximal for 6-7 taxa and decreased significantly when more than ten taxa were analysed together. We also investigated the efficacy of two novel derived acoustic features in improving the accuracy of identification. Our results show that DFA is a reliable statistical tool for species identification using acoustic signals. Our results also show that cluster analysis of acoustic signals in crickets works effectively for species classification and identification.

Relevance: 20.00%

Abstract:

This study investigates the application of support vector clustering (SVC) for the direct identification of coherent synchronous generators in large interconnected multi-machine power systems. The clustering is based on a coherency measure, which indicates the degree of coherency between any pair of generators. The proposed SVC algorithm processes the coherency measure matrix, which is formulated using the generator rotor measurements, to cluster the coherent generators. The proposed approach is demonstrated on the IEEE 10-generator, 39-bus system and on an equivalent 35-generator, 246-bus system of the practical Indian southern grid. The effects of the number of data samples and of fault locations are also examined to determine the accuracy of the proposed approach. An extended comparison with other clustering techniques is also included to show the effectiveness of the proposed approach in grouping the data into coherent groups of generators. The effectiveness of the coherent clusters obtained with the proposed approach is compared in terms of a set of clustering validity indicators and in terms of a statistical assessment based on the coherency degree of a generator pair.

Relevance: 20.00%

Abstract:

Diaminopropionate ammonia-lyase (DAPAL), a fold-type II pyridoxal 5'-phosphate-dependent enzyme, catalyzes the α,β-elimination of diaminopropionate (DAP) to pyruvate and ammonia. DAPAL was able to utilize both d- and l-DAP as substrates with almost equal efficiency. Mutational analysis of functionally important residues such as Thr385, Asp125 and Asp194 was carried out to understand the mechanism by which the isomers are hydrolyzed. Further, the putative residues involved in the formation of a disulfide bond, Cys271 and Cys299, were also mutated. T385S and T385D sDAPAL were as active as sDAPAL with dl-DAP as substrate, whereas the latter exhibited a threefold increase in catalytic efficiency with d-Ser as substrate. Further analysis of these mutants suggested that DAPAL might follow an anti-E2 mechanism of catalysis that does not involve the formation of a quinonoid intermediate. Of the two mutants of Asp125, D125E showed complete loss of activity with d-DAP as substrate, whereas the reaction with l-DAP was not affected significantly, demonstrating that Asp125 was essential for abstraction of protons from the d-isomer. By contrast, mutational analysis of Asp194 showed that the residue may not be directly involved in proton abstraction from l-DAP. sDAPAL does not form a disulfide bond in solution, although the positions of Cys299 and Cys271 in the modeled structure of sDAPAL favored the formation of a disulfide bond. Further, unlike eDAPAL, sDAPAL could be activated by monovalent cations. Mutation of the cysteine residues showed that Cys271 may be involved in coordinating the monovalent cation, as observed in the case of other fold-type II enzymes.

Relevance: 20.00%

Abstract:

Glioblastoma (GBM) is the most common malignant primary tumor in adults, with dismal patient survival, yet the molecular determinants of patient survival are poorly characterized. Global methylation profiling of GBM samples (our cohort; n = 44) was carried out using high-resolution methylation microarrays. Cox regression analysis identified a 9-gene methylation signature that predicted survival in GBM patients. A risk score derived from the methylation signature predicted survival in univariate analysis in both our cohort and The Cancer Genome Atlas (TCGA) cohort. Multivariate analysis identified the methylation risk score as an independent survival predictor in the TCGA cohort. The methylation risk score stratified the patients into low-risk and high-risk groups with a significant survival difference. Network analysis revealed an association between an activated NF-kappa B pathway and the high-risk group. NF-kappa B inhibition reversed glioma chemoresistance, and RNA interference studies identified interleukin-6 and intercellular adhesion molecule-1 as key NF-kappa B targets in imparting chemoresistance. Promoter hypermethylation of neuronal pentraxin II (NPTX2), a risky methylated gene, was confirmed by bisulfite sequencing in GBMs. GBMs and glioma cell lines had low levels of NPTX2 transcripts, which could be reversed upon methylation inhibitor treatment. NPTX2 overexpression induced apoptosis, inhibited proliferation and anchorage-independent growth, and rendered glioma cells chemosensitive. Furthermore, NPTX2 repressed NF-kappa B activity by inhibiting AKT through a p53-PTEN-dependent pathway, thus explaining the hypermethylation and downregulation of NPTX2 in NF-kappa B-activated high-risk GBMs. Taken together, a 9-gene methylation signature was identified as an independent GBM prognosticator and could be used for GBM risk stratification. Prosurvival NF-kappa B pathway activation characterized high-risk patients with poor prognosis, indicating it to be a therapeutic target. (C) 2013 AACR.

Relevance: 20.00%

Abstract:

Impoverishment of particles, i.e. the discretely simulated sample paths of the process dynamics, poses a major obstacle to employing particle filters for large-dimensional nonlinear system identification. A known route to alleviating this impoverishment, i.e. using an ensemble size that increases exponentially with the system dimension, remains computationally infeasible in most cases of practical importance. In this work, we explore the possibility of using the unscented transformation on Gaussian random variables, as incorporated within a scaled Gaussian sum stochastic filter, as a means of applying nonlinear stochastic filtering theory to higher-dimensional structural system identification problems. As an additional strategy to reconcile the evolving process dynamics with the observation history, the proposed filtering scheme also modifies the process model via the incorporation of gain-weighted innovation terms. The reported numerical work on the identification of structural dynamic models of dimension up to 100 is indicative of the potential of the proposed filter in realizing the stated aim of successfully treating relatively larger-dimensional filtering problems. (C) 2013 Elsevier Ltd. All rights reserved.
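
The unscented transformation named in this abstract can be sketched for a single Gaussian variable. This is a textbook scalar form with three sigma points, not the scaled Gaussian sum filter itself. The spread parameter `kappa = 2`, the test function and the function name are assumptions made for illustration.

```python
import math

def unscented_transform(mean, var, f, kappa=2.0):
    """Propagate a scalar Gaussian N(mean, var) through f using three sigma points.

    Assumption: kappa = 2 so that n + kappa = 3, a common choice for
    Gaussian inputs; the abstract does not state its parameters.
    """
    n = 1                                    # state dimension
    spread = math.sqrt((n + kappa) * var)
    points = [mean, mean + spread, mean - spread]
    weights = [kappa / (n + kappa), 0.5 / (n + kappa), 0.5 / (n + kappa)]
    images = [f(p) for p in points]
    out_mean = sum(w * y for w, y in zip(weights, images))
    out_var = sum(w * (y - out_mean) ** 2 for w, y in zip(weights, images))
    return out_mean, out_var

m, v = unscented_transform(1.0, 1.0, lambda x: x * x)
print(m, v)  # close to 2.0 and 6.0, the exact moments of x**2 for x ~ N(1, 1)
```

Three deterministic points replace a large random ensemble here, which is the property that makes the transformation attractive when ensemble size is the bottleneck.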

Relevance: 20.00%

Abstract:

It is particularly appropriate that the Journal of the Indian Institute of Science is bringing out a commemorative issue to mark the International Year of Crystallography 2014 (IYCr2014). India has had a strong crystallographic tradition, and the earliest work in what may be described as structural crystallography from this country is the work of K. Banerjee on the determination of the crystal structure of naphthalene in 1930. The Indian Institute of Science itself has played no small part in establishing and sustaining the subject of crystallography in this country. A large number of papers in this special issue are written by authors who have either been trained in the Institute or have some kind of professional association with this organization. In this article I will try to capture some unique features that characterize the intersection of the crystallographic and the chemical domains, mostly as they pertain to the Indian contribution to this subject. Crystallography is, of course, as old as chemistry itself, and some would say it is even older. The relationships between chemistry and crystallography go back to much before the discovery of diffraction of X-rays by crystals. The discovery of polymorphism by Mitscherlich in 1822, Haüy’s formulation of the molécule intégrante, and the work of Fedorov and Groth on the identification of crystals from their morphology alone are well-known examples of such relationships. A very early article by Tutton speaks of “crystallo-chemical analysis”. In this article, I shall, however, be dealing with the interplay of chemistry and crystallography only in the post-diffraction era, that is, after 1912. Much has been written and said about chemical crystallography, and even within the context of the present special issue, there is a review of chemical crystallography in India including some futuristic trends. This topic was also reviewed by Nangia in a special publication brought out by the Indian Academy of Sciences in 2009, and by Desiraju in a special publication brought out by the Indian National Science Academy in 2010. A rather detailed account of crystallography in India, in which chemical crystallography was detailed, appeared in 2007 in the newsletter of the International Union of Crystallography (IUCr). Since all these publications are fairly recent, there is little need for me to attempt a comprehensive coverage of chemical crystallography in India in this short review.

Relevance: 20.00%

Abstract:

We propose a novel form of nonlinear stochastic filtering based on an iterative evaluation of a Kalman-like gain matrix computed within a Monte Carlo scheme, as suggested by the form of the parent equation of nonlinear filtering (the Kushner-Stratonovich equation). The scheme retains the simplicity of implementation of an ensemble Kalman filter (EnKF). The numerical results, presently obtained via EnKF-like simulations with or without a reduced-rank unscented transformation, clearly indicate remarkably superior filter convergence and accuracy vis-a-vis most available filtering schemes, as well as eminent applicability of the methods to higher-dimensional dynamic system identification problems of engineering interest. (C) 2013 The Franklin Institute. Published by Elsevier Ltd. All rights reserved.
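
As background for the EnKF that this abstract builds on, a single analysis step can be sketched as follows. It shows the standard stochastic EnKF update with perturbed observations, not the iterative gain evaluation the abstract proposes. The scalar state, the direct observation (H = 1), the noise levels and the function name are assumptions for illustration.

```python
import random
import statistics

def enkf_update(ensemble, y, obs_sigma, rng):
    """Stochastic EnKF analysis step for a scalar state observed directly (H = 1)."""
    prior_var = statistics.variance(ensemble)          # sample forecast variance
    gain = prior_var / (prior_var + obs_sigma ** 2)    # Kalman gain from ensemble statistics
    # Each member is moved toward its own perturbed copy of the observation.
    return [x + gain * (y + rng.gauss(0.0, obs_sigma) - x) for x in ensemble]

rng = random.Random(0)
prior = [rng.gauss(0.0, 2.0) for _ in range(2000)]     # assumed forecast ensemble
posterior = enkf_update(prior, y=3.0, obs_sigma=1.0, rng=rng)
print(statistics.mean(prior), statistics.mean(posterior))
print(statistics.variance(prior), statistics.variance(posterior))
```

The update pulls the ensemble mean toward the observation and shrinks the ensemble spread, with the gain computed from sample statistics rather than from a propagated covariance.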