204 results for Metaphor Identification Procedure (MIP)
Abstract:
We present a new approach to spoken language modeling for language identification (LID) using the Lempel-Ziv-Welch (LZW) algorithm. The LZW technique is applicable to any kind of tokenization of the speech signal. Because the LZW algorithm efficiently extracts variable-length symbol strings from the training data, the LZW codebook captures the essentials of a language effectively. We develop two new deterministic measures for LID based on the LZW algorithm, namely (i) a compression ratio score (LZW-CR) and (ii) a weighted discriminant score (LZW-WDS). To assess these measures, we consider error-free tokenization of speech as well as artificially induced noise in the tokenization. For a six-language LID task on the OGI-TS database with clean tokenization, the new model (LZW-WDS) performs slightly better than the conventional bigram model. For noisy tokenization, which is the more realistic case, LZW-WDS significantly outperforms the bigram technique.
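The idea of scoring a token sequence by how well a language-specific LZW codebook compresses it can be illustrated with a minimal sketch. The abstract does not give the exact definition of LZW-CR, so the score used here (tokens per emitted codeword under a greedy longest-match parse) is an assumption, and the function names `build_lzw_codebook`, `compression_ratio`, and `identify_language` are hypothetical.

```python
def build_lzw_codebook(tokens):
    """Grow an LZW phrase dictionary from a training token sequence."""
    codebook = {(t,) for t in set(tokens)}  # start from the single tokens
    phrase = ()
    for tok in tokens:
        candidate = phrase + (tok,)
        if candidate in codebook:
            phrase = candidate          # keep extending a known phrase
        else:
            codebook.add(candidate)     # learn the new, longer phrase
            phrase = (tok,)
    return codebook


def compression_ratio(tokens, codebook):
    """Assumed score: tokens per codeword when parsing with a fixed codebook."""
    if not tokens:
        return 0.0
    codewords = 0
    i = 0
    while i < len(tokens):
        # Greedy longest match; an unseen single token still costs one codeword.
        length = 1
        while (i + length < len(tokens)
               and tuple(tokens[i:i + length + 1]) in codebook):
            length += 1
        i += length
        codewords += 1
    return len(tokens) / codewords


def identify_language(tokens, codebooks):
    """Pick the language whose codebook compresses the tokens best."""
    return max(codebooks, key=lambda lang: compression_ratio(tokens, codebooks[lang]))


# Toy usage with made-up token streams for two hypothetical languages.
codebooks = {
    "A": build_lzw_codebook(list("abababababab")),
    "B": build_lzw_codebook(list("cdcdcdcdcdcd")),
}
guess = identify_language(list("abab"), codebooks)
```

A higher ratio means the test sequence is covered by longer phrases learned from that language's training data. The weighted discriminant score (LZW-WDS) is not sketched because the abstract does not describe its weighting.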
Abstract:
A hybrid simulation technique for identification and steady-state optimization of a tubular reactor used in ammonia synthesis is presented. The parameter identification program finds the catalyst activity factor and certain heat transfer coefficients that minimize the sum of squared deviations between simulated temperatures and actual temperature measurements obtained from an operating plant. Using the estimated parameter values, the optimization program finds the values of three flows to the reactor that maximize the ammonia yield. Powell's direct method of optimization is used in both cases. The results obtained are compared with the plant data.
Abstract:
We analyze the AlApana of a Carnatic music piece without prior knowledge of the singer or the rAga. AlApana is a means of communicating to the audience the flavor, or bhAva, of the rAga through its permitted notes and phrases. The input to our analysis is a recording of the vocal AlApana along with the accompanying instrument. The AdhAra shadja (base note) of the singer for that AlApana is estimated through a stochastic model of note frequencies. Based on the shadja, we identify the notes (swaras) used in the AlApana using a semi-continuous GMM. Using the probabilities of each note interval, we recognize the swaras of the AlApana. For sampurNa rAgas, we can identify the possible rAga based on the swaras. We achieve correct shadja identification, which is crucial to all further steps, in 88.8% of 55 AlApanas. Among them (48 AlApanas of 7 rAgas), we get 91.5% correct swara identification and 62.13% rAga (R) accuracy.
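The dependence of swara identification on the shadja can be illustrated with a deliberately simplified sketch. It does not use the semi-continuous GMM described above. Instead it assumes twelve equal-tempered note positions and picks the nearest one, which is a stand-in technique chosen only for illustration. The function name `nearest_swara` and the label list are hypothetical.

```python
import math

# Twelve swara positions in one octave, in a common Carnatic naming.
# Treating them as equal-tempered semitones is an assumption of this sketch.
SWARA_POSITIONS = ["S", "R1", "R2", "G2", "G3", "M1",
                   "M2", "P", "D1", "D2", "N2", "N3"]


def nearest_swara(freq_hz, shadja_hz):
    """Label a pitch by its nearest semitone position relative to the shadja."""
    semitones = 12 * math.log2(freq_hz / shadja_hz)
    return SWARA_POSITIONS[round(semitones) % 12]


# A pitch a perfect fifth above a 220 Hz shadja lands on the panchama.
label = nearest_swara(330.0, 220.0)
```

Because every label is computed from the ratio to the shadja, an error in the shadja estimate shifts all recognized swaras, which is why that step is described as crucial.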
Abstract:
Fruit flies of the genus Bactrocera (Diptera: Tephritidae) are major invasive pests of agricultural crops in Asia and Australia. Increased transboundary movement of agricultural produce has resulted in the chance introduction of many invasive species, including Bactrocera, mainly as immature stages. Quick and accurate species diagnosis is therefore important at the port of entry, where morphological identification has a limited role because it requires adult specimens and the availability of a specialist. Unfortunately, when only immature stages are present, lacunae in their taxonomy impede accurate species diagnosis. At this juncture, molecular species diagnostics based on COX-I have become useful, because diagnosis is not limited by developmental stage. Another method of quick and accurate species diagnosis for Bactrocera spp. is based on the development of species-specific markers. This study evaluated the utility of COX-I for the quick and accurate species diagnosis of eggs, larvae, pupae and adults of B. zonata Saunders, B. tau Walker, and B. dorsalis Hendel. Furthermore, the utility of species-specific markers in differentiating B. zonata (500 bp) and B. tau (220 bp) was shown. Phylogenetic relationships among five subgenera, viz., Austrodacus, Bactrocera, Daculus, Notodacus and Zeugodacus, have been resolved employing the 5' region of COX-I (1490-2198), where COX-I sequences for B. dorsalis Hendel, B. tau Walker, B. correcta Bezzi and B. zonata Saunders from India were compared with other NCBI-GenBank accessions. Phylogenetic analysis employing Maximum Parsimony (MP) and a Bayesian phylogenetic approach (BP) showed that the subgenus Bactrocera is monophyletic.
Abstract:
Parallel sub-word recognition (PSWR) is a new model proposed for language identification (LID) that does not need elaborate phonetic labeling of the speech data in a foreign language. The new approach performs a front-end tokenization in terms of sub-word units, which are designed by automatic segmentation, segment clustering and segment HMM modeling. We develop PSWR-based LID in a framework similar to the parallel phone recognition (PPR) approach in the literature. This includes a front-end tokenizer and a back-end language model for each language to be identified. Considering various combinations of the statistical evaluation scores, we find that PSWR can perform as well as PPR, even with broad acoustic sub-word tokenization, making it an efficient alternative to the PPR system.
Abstract:
In this paper we develop methods to compute maps from differential equations. We take two examples: the harmonic oscillator and Duffing's equation. First we convert these equations to a canonical form, which is slightly nontrivial for Duffing's equation. Then we show a method to extend these differential equations. In the second case, symbolic algebra needs to be used. Once the extensions are accomplished, various maps are generated. Poincaré sections are seen as a special case of such generated maps. Other applications are also discussed.
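What a map generated from a differential equation looks like can be shown with a small numerical sketch. This is not the extension method of the paper, which the abstract does not spell out. It swaps in plain fourth-order Runge-Kutta integration of the first-order form to obtain a time-T map, which for a periodically forced system sampled once per forcing period is a stroboscopic Poincaré section. The Duffing form x'' + delta x' + alpha x + beta x^3 = gamma cos(omega t) and all parameter names are assumptions.

```python
import math


def rk4_step(f, t, y, h):
    """One classical Runge-Kutta step for y' = f(t, y), with y a list."""
    k1 = f(t, y)
    k2 = f(t + h / 2, [yi + h / 2 * k for yi, k in zip(y, k1)])
    k3 = f(t + h / 2, [yi + h / 2 * k for yi, k in zip(y, k2)])
    k4 = f(t + h, [yi + h * k for yi, k in zip(y, k3)])
    return [yi + h / 6 * (a + 2 * b + 2 * c + d)
            for yi, a, b, c, d in zip(y, k1, k2, k3, k4)]


def time_T_map(f, y0, T, t0=0.0, steps=2000):
    """Map a state y0 at time t0 to the state at time t0 + T."""
    h = T / steps
    t, y = t0, list(y0)
    for _ in range(steps):
        y = rk4_step(f, t, y, h)
        t += h
    return y


def harmonic(omega):
    """x'' = -omega^2 x written as a first-order system in (x, v)."""
    return lambda t, y: [y[1], -omega ** 2 * y[0]]


def duffing(delta, alpha, beta, gamma, omega):
    """Assumed forced Duffing form, written as a first-order system."""
    return lambda t, y: [y[1],
                         gamma * math.cos(omega * t)
                         - delta * y[1] - alpha * y[0] - beta * y[0] ** 3]


# One full period of the harmonic oscillator should return the initial state.
back_home = time_T_map(harmonic(2.0), [1.0, 0.0], math.pi)

# One forcing period of a Duffing oscillator gives one point of a section.
section_point = time_T_map(duffing(0.2, -1.0, 1.0, 0.3, 1.2),
                           [0.1, 0.0], 2 * math.pi / 1.2)
```

Iterating `time_T_map` on its own output, with `t0` advanced by `T` each time, produces successive points of the section.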
Abstract:
This paper proposes a new approach for solving the state estimation problem. The approach is aimed at producing a robust estimator that rejects bad data, even if they are associated with leverage-point measurements. This is achieved by solving a sequence of Linear Programming (LP) problems. Optimization is carried out via a new algorithm that combines an "upper bound optimization technique" and "an improved algorithm for discrete linear approximation". In this formulation of the LP problem, in addition to the constraints corresponding to the measurement set, constraints corresponding to bounds on the state variables are also included, which makes the LP problem more efficient at rejecting bad data. Results of the proposed estimator on the IEEE 39-bus system and a 24-bus EHV equivalent system of the southern Indian grid are presented for illustration.
Abstract:
We report gas-phase mid-infrared spectra of 1- and 2-methylnaphthalenes at 0.2 cm⁻¹ resolution. Assignments of the observed bands have been made using scaled quantum mechanical (SQM) calculations, in which the force fields rather than the frequencies are scaled to find a close fit between observed and calculated bands. The structures of the molecules have been optimized at the B3LYP level of theory in conjunction with the standard 6-311G** basis set to obtain the harmonic frequencies. Using the force constants in Cartesian coordinates from the Gaussian output, scaled force field calculations are carried out using a modified version of the UMAT program in the QCPE package. Potential energy distributions of the normal modes obtained from such calculations helped us assign the observed bands and identify the unique features of the spectra of 1- and 2-MNs that are important for their isomeric identification.
Abstract:
A decade after the Mycobacterium tuberculosis (Mtb) genome sequence became available, no promising drug has seen the light of day. This not only indicates the challenges in discovering new drugs but also suggests a gap in our current understanding of Mtb biology. We attempt to bridge this gap by carrying out extensive re-annotation and constructing a systems-level protein interaction map of Mtb with the objective of finding novel drug target candidates. Towards this, we combined crowdsourcing and social networking methods through an initiative, 'Connect to Decode' (C2D), to generate the first and largest manually curated interactome of Mtb, termed the 'interactome pathway' (IPW), encompassing a total of 1434 proteins connected through 2575 functional relationships. Interactions leading to gene regulation, signal transduction, metabolism, and structural complex formation have been catalogued. In the process, we have functionally annotated 87% of the Mtb genome in the context of gene products. We further combine IPW with a STRING-based network to report central proteins, which may be assessed as potential drug targets for the development of drugs with the fewest possible side effects. The fact that five of the 17 predicted drug targets are already experimentally validated, either genetically or biochemically, lends credence to our approach.
Abstract:
Acetaminophen is a widely prescribed drug used to relieve pain and fever; however, it is a leading cause of drug-induced liver injury and a burden on public healthcare. In this study, hepatotoxicity in mice after oral dosing of acetaminophen was investigated using liver and sera samples with Fourier Transform Infrared microspectroscopy. The infrared spectra of acetaminophen-treated livers in BALB/ mice show a decrease in glycogen and increases in the amounts of cholesteryl esters and DNA. Rescue experiments using L-methionine demonstrate that the depletion of glycogen and the increase in DNA are abrogated by pre-treatment, but not post-treatment, with L-methionine. This indicates that changes in glycogen and DNA are more sensitive to the rapid depletion of glutathione. Importantly, analysis of sera identified the lowering of glycogen and the increases in DNA and cholesteryl esters earlier than the increase in alanine aminotransferase (ALT), which is routinely used to diagnose liver damage. In addition, these changes are also observed in C57BL/6 and Nos2(-/-) mice. There is no difference in the kinetics of these three molecules between the two strains of mice; the extent of damage is similar and is corroborated by ALT and histological analysis. Quantification of cytokines in sera showed an increase upon acetaminophen (APAP) treatment. Although the levels of Tnf alpha and Ifn gamma in sera are not significantly affected, Nos2(-/-) mice display lower Il6 but higher Il10 levels during this acute model of hepatotoxicity. Overall, this study reinforces the growing potential of Fourier Transform Infrared microspectroscopy as a fast, highly sensitive and label-free technique for non-invasive diagnosis of liver damage. The combination of Fourier Transform Infrared microspectroscopy and cytokine analysis is a powerful tool to identify multiple biomarkers, understand differential host responses and evaluate therapeutic regimens during liver damage and, possibly, other diseases.
Abstract:
Background: Bacteria such as Escherichia coli and Salmonella typhimurium can utilize acetate as the sole source of carbon and energy. Acetate kinase (AckA) and phosphotransacetylase (Pta), key enzymes of the acetate utilization pathway, regulate the flux of metabolites in glycolysis, gluconeogenesis, the TCA cycle, the glyoxylate bypass and fatty acid metabolism. Results: Here we report the kinetic characterization of S. typhimurium AckA (StAckA) and structures of its unliganded (Form-I, 2.70 Å resolution) and citrate-bound (Form-II, 1.90 Å resolution) forms. The enzyme showed broad substrate specificity, with k_cat/K_m in the order acetate > propionate > formate. Further, the K_m for acetyl-phosphate was significantly lower than that for acetate, and the enzyme could catalyze the reverse reaction (i.e. ATP synthesis) more efficiently. ATP and Mg2+ could be substituted by other nucleoside 5'-triphosphates (GTP, UTP and CTP) and divalent cations (Mn2+ and Co2+), respectively. Form-I StAckA represents the first structural report of an unliganded AckA. The StAckA protomer consists of two domains with the characteristic βββαβαβα topology of the ASKHA superfamily of proteins. These domains adopt a conformation intermediate between the open and closed forms of ligand-bound Methanosarcina thermophila AckA (MtAckA). Spectroscopic and structural analyses of StAckA further suggested the occurrence of inter-domain motion upon ligand binding. Unexpectedly, the Form-II StAckA structure showed a drastic change in the conformation of residues 230-300 compared to that of Form-I. Further investigation revealed electron density corresponding to a citrate molecule in a pocket located at the dimeric interface of Form-II StAckA. Interestingly, a similar dimeric interface pocket lined with largely conserved residues could be identified in Form-I StAckA as well as in other enzymes homologous to AckA, suggesting that ligand binding at this pocket may influence the function of these enzymes. Conclusions: The biochemical and structural characterization of StAckA reported here provides insights into the biochemical specificity, overall fold, thermal stability, molecular basis of ligand binding and inter-domain motion in the AckA family of enzymes. Dramatic conformational differences observed between the unliganded and citrate-bound forms of StAckA led to the identification of a putative ligand-binding pocket at the dimeric interface of StAckA, with implications for enzymatic function.
Abstract:
A newly implemented G-matrix Fourier transform (GFT) (4,3)D HC(C)CH experiment is presented in conjunction with (4,3)D HCCH to efficiently identify ¹H/¹³C sugar spin systems in ¹³C-labeled nucleic acids. This experiment enables rapid collection of highly resolved relay 4D HC(C)CH spectral information, that is, shift correlations of ¹³C-¹H groups separated by two carbon bonds. For RNA, (4,3)D HC(C)CH takes advantage of the comparably favorable 1'- and 3'-CH signal dispersion for complete spin system identification including 5'-CH. The (4,3)D HC(C)CH/HCCH-based strategy is exemplified for the 30-nucleotide 3'-untranslated region of the pre-mRNA of human U1A protein.