997 resultados para n-gram model
Resumo:
In this paper a method of copy detection in short Malayalam text passages is proposed. Given two passages one as the source text and another as the copied text it is determined whether the second passage is plagiarized version of the source text. An algorithm for plagiarism detection using the n-gram model for word retrieval is developed and found tri-grams as the best model for comparing the Malayalam text. Based on the probability and the resemblance measures calculated from the n-gram comparison , the text is categorized on a threshold. Texts are compared by variable length n-gram(n={2,3,4}) comparisons. The experiments show that trigram model gives the average acceptable performance with affordable cost in terms of complexity
Resumo:
Existe um problema de representação em processamento de linguagem natural, pois uma vez que o modelo tradicional de bag-of-words representa os documentos e as palavras em uma unica matriz, esta tende a ser completamente esparsa. Para lidar com este problema, surgiram alguns métodos que são capazes de representar as palavras utilizando uma representação distribuída, em um espaço de dimensão menor e mais compacto, inclusive tendo a propriedade de relacionar palavras de forma semântica. Este trabalho tem como objetivo utilizar um conjunto de documentos obtido através do projeto Media Cloud Brasil para aplicar o modelo skip-gram em busca de explorar relações e encontrar padrões que facilitem na compreensão do conteúdo.
Resumo:
Background: In protein sequence classification, identification of the sequence motifs or n-grams that can precisely discriminate between classes is a more interesting scientific question than the classification itself. A number of classification methods aim at accurate classification but fail to explain which sequence features indeed contribute to the accuracy. We hypothesize that sequences in lower denominations (n-grams) can be used to explore the sequence landscape and to identify class-specific motifs that discriminate between classes during classification. Discriminative n-grams are short peptide sequences that are highly frequent in one class but are either minimally present or absent in other classes. In this study, we present a new substitution-based scoring function for identifying discriminative n-grams that are highly specific to a class. Results: We present a scoring function based on discriminative n-grams that can effectively discriminate between classes. The scoring function, initially, harvests the entire set of 4- to 8-grams from the protein sequences of different classes in the dataset. Similar n-grams of the same size are combined to form new n-grams, where the similarity is defined by positive amino acid substitution scores in the BLOSUM62 matrix. Substitution has resulted in a large increase in the number of discriminatory n-grams harvested. Due to the unbalanced nature of the dataset, the frequencies of the n-grams are normalized using a dampening factor, which gives more weightage to the n-grams that appear in fewer classes and vice-versa. After the n-grams are normalized, the scoring function identifies discriminative 4- to 8-grams for each class that are frequent enough to be above a selection threshold. By mapping these discriminative n-grams back to the protein sequences, we obtained contiguous n-grams that represent short class-specific motifs in protein sequences. Our method fared well compared to an existing motif finding method known as Wordspy. We have validated our enriched set of class-specific motifs against the functionally important motifs obtained from the NLSdb, Prosite and ELM databases. We demonstrate that this method is very generic; thus can be widely applied to detect class-specific motifs in many protein sequence classification tasks. Conclusion: The proposed scoring function and methodology is able to identify class-specific motifs using discriminative n-grams derived from the protein sequences. The implementation of amino acid substitution scores for similarity detection, and the dampening factor to normalize the unbalanced datasets have significant effect on the performance of the scoring function. Our multipronged validation tests demonstrate that this method can detect class-specific motifs from a wide variety of protein sequence classes with a potential application to detecting proteome-specific motifs of different organisms.
Resumo:
Thesis (Ph.D.)--University of Washington, 2016-08
Resumo:
This paper introduces a new neurofuzzy model construction and parameter estimation algorithm from observed finite data sets, based on a Takagi and Sugeno (T-S) inference mechanism and a new extended Gram-Schmidt orthogonal decomposition algorithm, for the modeling of a priori unknown dynamical systems in the form of a set of fuzzy rules. The first contribution of the paper is the introduction of a one to one mapping between a fuzzy rule-base and a model matrix feature subspace using the T-S inference mechanism. This link enables the numerical properties associated with a rule-based matrix subspace, the relationships amongst these matrix subspaces, and the correlation between the output vector and a rule-base matrix subspace, to be investigated and extracted as rule-based knowledge to enhance model transparency. The matrix subspace spanned by a fuzzy rule is initially derived as the input regression matrix multiplied by a weighting matrix that consists of the corresponding fuzzy membership functions over the training data set. Model transparency is explored by the derivation of an equivalence between an A-optimality experimental design criterion of the weighting matrix and the average model output sensitivity to the fuzzy rule, so that rule-bases can be effectively measured by their identifiability via the A-optimality experimental design criterion. The A-optimality experimental design criterion of the weighting matrices of fuzzy rules is used to construct an initial model rule-base. An extended Gram-Schmidt algorithm is then developed to estimate the parameter vector for each rule. This new algorithm decomposes the model rule-bases via an orthogonal subspace decomposition approach, so as to enhance model transparency with the capability of interpreting the derived rule-base energy level. This new approach is computationally simpler than the conventional Gram-Schmidt algorithm for resolving high dimensional regression problems, whereby it is computationally desirable to decompose complex models into a few submodels rather than a single model with large number of input variables and the associated curse of dimensionality problem. Numerical examples are included to demonstrate the effectiveness of the proposed new algorithm.
Resumo:
The in vitro post-antibiotic effects (PAEs) of eight different concentrations of linezolid against Gram-positive cocci were investigated and the results analysed using the sigmoid E-max model for mathematically modelling the PAE. Mean maximal linezolid PAEs against strains of Staphylococcus aureus, Staphylococcus epidermidis, Enterococcus faecalis, Enterococcus faecium and Streptococcus pneumoniae were 2.2, 1.8, 2.8, 2.0 and 3.0 h, respectively. Resistance to methicillin (for the staphylococci), vancomycin (for the enterococci) and penicillin (for the pneumococci) had no effect on the duration of the PAE. Results of PAE testing support twice-daily dosing of linezolid in humans.
Resumo:
Twenty specimens of Nectomys squamipes born in captivity, were infected with 500 cercariae by the transcutaneous route. Coprologic examinations were carried out from the 5th to 23rd week after infection. On the 7th, 8th, 12th, 16th, and 23rd weeks the animals were sacrificed and perfused. The oogram was performed in segments of the small intestine (proximal, medial and distal portions) and the large intestine. The average pre-patent period was of 42 days. The average number of eggs varied from 350 on 6th week, to 800 on the 13th. From the 14th week on, the average number of eggs eliminated was lower than 50 per gram of feces. The recovery of worms kept steady on the 7th, 8th, and 12th week (16.85%; 15.45% and 11.95%), decreasing to 7.70% on the 16th week and 8.45% on the 23rd week. The proportion of male/female worms was about the same on the first two weeks, but from the 12th week on, the proportion was: 1,4/1 on the 12th week; 2,5/1 on the 16thweek and 1,8/1 on the 23rd weekThese observations suggest that N. squamipes may used as an experimental model for schistosomiasis mansoni, to wich it develops resistance mechanism, useful for immunity studies.
Resumo:
The cytokine macrophage migration inhibitory factor (MIF) is an important component of the early proinflammatory response of the innate immune system. However, the antimicrobial defense mechanisms mediated by MIF remain fairly mysterious. In the present study, we examined whether MIF controls bacterial uptake and clearance by professional phagocytes, using wild-type and MIF-deficient macrophages. MIF deficiency did not affect bacterial phagocytosis, but it strongly impaired the killing of gram-negative bacteria by macrophages and host defenses against gram-negative bacterial infection, as shown by increased mortality in a Klebsiella pneumonia model. Consistent with MIF's regulatory role of Toll-like 4 expression in macrophages, MIF-deficient cells stimulated with lipopolysaccharide or Escherichia coli exhibited reduced nuclear factor κB activity and tumor necrosis factor (TNF) production. Addition of recombinant MIF or TNF corrected the killing defect of MIF-deficient macrophages. Together, these data show that MIF is a key mediator of host responses against gram-negative bacteria, acting in part via a modulation of bacterial killing by macrophages.
Resumo:
The intestinal anti-inflammatory effects of two probiotics isolated from breast milk, Lactobacillus reuteri and L. fermentum, were evaluated and compared in the trinitrobenzenesulfonic acid (TNBS) model of rat colitis. Colitis was induced in rats by intracolonic administration of 10 mg TNBS dissolved in 50% ethanol (0.25 ml). Either L. reuteri or L. fermentum was daily administered orally (5 x 10(8) colony-forming units suspended in 0.5 ml skimmed milk) to each group of rats (n 10) for 3 weeks, starting 2 weeks before colitis induction. Colonic damage was evaluated histologically and biochemically, and the colonic luminal contents were used for bacterial studies and for SCFA production. Both probiotics showed intestinal anti-inflammatory effects in this model of experimental colitis, as evidenced histologically and by a significant reduction of colonic myeloperoxidase activity (P<0.05). L. fermentum significantly counteracted the colonic glutathione depletion induced by the inflammatory process. In addition, both probiotics lowered colonic TNFalpha levels (P<0.01) and inducible NO synthase expression when compared with non-treated rats; however, the decrease in colonic cyclo-oxygenase-2 expression was only achieved with L.fermentum administration. Finally, the two probiotics induced the growth of Lactobacilli species in comparison with control colitic rats, but the production of SCFA in colonic contents was only increased when L. fermentum was given. In conclusion, L. fermentum can exert beneficial immunomodulatory properties in inflammatory bowel disease, being more effective than L. reuteri, a probiotic with reputed efficacy in promoting beneficial effects on human health.
Resumo:
High-resolution structural information on optimally preserved bacterial cells can be obtained with cryo-electron microscopy of vitreous sections. With the help of this technique, the existence of a periplasmic space between the plasma membrane and the thick peptidoglycan layer of the gram-positive bacteria Bacillus subtilis and Staphylococcus aureus was recently shown. This raises questions about the mode of polymerization of peptidoglycan. In the present study, we report the structure of the cell envelope of three gram-positive bacteria (B. subtilis, Streptococcus gordonii, and Enterococcus gallinarum). In the three cases, a previously undescribed granular layer adjacent to the plasma membrane is found in the periplasmic space. In order to better understand how nascent peptidoglycan is incorporated into the mature peptidoglycan, we investigated cellular regions known to represent the sites of cell wall production. Each of these sites possesses a specific structure. We propose a hypothetic model of peptidoglycan polymerization that accommodates these differences: peptidoglycan precursors could be exported from the cytoplasm to the periplasmic space, where they could diffuse until they would interact with the interface between the granular layer and the thick peptidoglycan layer. They could then polymerize with mature peptidoglycan. We report cytoplasmic structures at the E. gallinarum septum that could be interpreted as cytoskeletal elements driving cell division (FtsZ ring). Although immunoelectron microscopy and fluorescence microscopy studies have demonstrated the septal and cytoplasmic localization of FtsZ, direct visualization of in situ FtsZ filaments has not been obtained in any electron microscopy study of fixed and dehydrated bacteria.
Resumo:
Escherichia coli is commonly involved in infections with a heavy bacterial burden. Piperacillin-tazobactam and carbapenems are among the recommended empirical treatments for health care-associated complicated intra-abdominal infections. In contrast to amoxicillin-clavulanate, both have reduced in vitro activity in the presence of high concentrations of extended-spectrum β-lactamase (ESBL)-producing and non-ESBL-producing E. coli bacteria. Our goal was to compare the efficacy of these antimicrobials against different concentrations of two clinical E. coli strains, one an ESBL-producer and the other a non-ESBL-producer, in a murine sepsis model. An experimental sepsis model {~5.5 log10 CFU/g [low inoculum concentration (LI)] or ~7.5 log(10) CFU/g [high inoculum concentration (HI)]} using E. coli strains ATCC 25922 (non-ESBL producer) and Ec1062 (CTX-M-14 producer), which are susceptible to the three antimicrobials, was used. Amoxicillin-clavulanate (50/12.5 mg/kg given intramuscularly [i.m.]), piperacillin-tazobactam (25/3.125 mg/kg given intraperitoneally [i.p.]), and imipenem (30 mg/kg i.m.) were used. Piperacillin-tazobactam and imipenem reduced spleen ATCC 25922 strain concentrations (-2.53 and -2.14 log10 CFU/g [P < 0.05, respectively]) in the HI versus LI groups, while amoxicillin-clavulanate maintained its efficacy (-1.01 log10 CFU/g [no statistically significant difference]). Regarding the Ec1062 strain, the antimicrobials showed lower efficacy in the HI than in the LI groups: -0.73, -1.89, and -1.62 log10 CFU/g (P < 0.05, for piperacillin-tazobactam, imipenem, and amoxicillin-clavulanate, respectively, although imipenem and amoxicillin-clavulanate were more efficacious than piperacillin-tazobactam). An adapted imipenem treatment (based on the time for which the serum drug concentration remained above the MIC obtained with a HI of the ATCC 25922 strain) improved its efficacy to -1.67 log10 CFU/g (P < 0.05). These results suggest that amoxicillin-clavulanate could be an alternative to imipenem treatment of infections caused by ESBL- and non-ESBL-producing E. coli strains in patients with therapeutic failure with piperacillin-tazobactam.
Resumo:
A mixture of 3 MAbs directed against 3 different CEA epitopes was radiolabelled with 131I and used for the treatment of a human colon carcinoma transplanted s.c. into nude mice. Intact MAbs and F(ab')2 fragments were mixed because it had been shown by autoradiography that these 2 antibody forms can penetrate into different areas of the tumor nodule. Ten days after transplantation of colon tumor T380 a single dose of 600 microCi of 131I MAbs was injected i.v. The tumor grafts were well established (as evidenced by exponential growth in untreated mice) and their size continued to increase up to 6 days after radiolabelled antibody injection. Tumor shrinking was then observed lasting for 4-12 weeks. In a control group injected with 600 microCi of 131I coupled to irrelevant monoclonal IgG, tumor growth was delayed, but no regression was observed. Tumors of mice injected with the corresponding amount of unlabelled antibodies grew like those of untreated mice. Based on measurements of the effective whole-body half-life of injected 131I, the mean radiation dose received by the animals was calculated to be 382 rads for the antibody group and 478 rads for the normal IgG controls. The genetically immunodeficient animals exhibited no increase in mortality, and only limited bone-marrow toxicity was observed. Direct measurement of radioactivity in mice dissected 1, 3 and 7 days after 131I-MAb injection showed that 25, 7.2 and 2.2% of injected dose were recovered per gram of tumor, the mean radiation dose delivered to the tumor being thus more than 5,000 rads. These experiments show that therapeutic doses of radioactivity can be selectively directed to human colon carcinoma by i.v. injection of 131I-labelled anti-CEA MAbs.
Resumo:
The sensor kinase GacS and the response regulator GacA are members of a two-component system that is present in a wide variety of gram-negative bacteria and has been studied mainly in enteric bacteria and fluorescent pseudomonads. The GacS/GacA system controls the production of secondary metabolites and extracellular enzymes involved in pathogenicity to plants and animals, biocontrol of soilborne plant diseases, ecological fitness, or tolerance to stress. A current model proposes that GacS senses a still-unknown signal and activates, via a phosphorelay mechanism, the GacA transcription regulator, which in turn triggers the expression of target genes. The GacS protein belongs to the unorthodox sensor kinases, characterized by an autophosphorylation, a receiver, and an output domain. The periplasmic loop domain of GacS is poorly conserved in diverse bacteria. Thus, a common signal interacting with this domain would be unexpected. Based on a comparison with the transcriptional regulator NarL, a secondary structure can be predicted for the GacA sensor kinases. Certain genes whose expression is regulated by the GacS/GacA system are regulated in parallel by the small RNA binding protein RsmA (CsrA) at a posttranscriptional level. It is suggested that the GacS/GacA system operates a switch between primary and secondary metabolism, with a major involvement of posttranscriptional control mechanisms.
Resumo:
Résumé La structure, ou l'architecture, des êtres vivants définit le cadre dans lequel la physique de la vie s'accomplit. La connaissance de cette structure dans ses moindres détails est un but essentiel de la biologie. Son étude est toutefois entravée par des limitations techniques. Malgré son potentiel théorique, la microscopie électronique n'atteint pas une résolution atomique lorsqu'elle est appliquée ä la matièxe biologique. Cela est dû en grande partie au fait qu'elle contient beaucoup d'eau qui ne résiste pas au vide du microscope. Elle doit donc être déshydratée avant d'être introduite dans un microscope conventionnel. Des artéfacts d'agrégation en découlent inévitablement. La cryo-microscopie électronique des sections vitreuses (CEMOVIS) a ëté développée afin de résoudre cela. Les spécimens sont vitrifiés, c.-à-d. que leur eau est immobilisée sans cristalliser par le froid. Ils sont ensuite coupés en sections ultrafines et celles-ci sont observées à basse température. Les spécimens sont donc observés sous forme hydratée et non fixée; ils sont proches de leur état natif. Durant longtemps, CEMOVIS était très difficile à exécuter mais ce n'est plus le cas. Durant cette thèse, CEMOVIS a été appliqué à différents spécimens. La synapse du système nerveux central a été étudiée. La présence dans la fente synaptique d'une forte densité de molécules organisées de manière périodique a été démontrée. Des particules luminales ont été trouvées dans Ies microtubules cérébraux. Les microtubules ont servi d'objets-test et ont permis de démontrer que des détails moléculaires de l'ordre du nm sont préservés. La compréhension de la structure de l'enveloppe cellulaire des bactéries Grampositives aété améliorée. Nos observations ont abouti à l'élaboration d'un nouveau modèle hypothétique de la synthèse de la paroi. Nous avons aussi focalisé notre attention sur le nucléoïde bactérien et cela a suscité un modèle de la fonction des différents états structuraux du nucléoïde. En conclusion, cette thèse a démontré que CEMOVIS est une excellente méthode poux étudier la structure d'échantillons biologiques à haute résolution. L'étude de la structure de divers aspects des êtres vivants a évoqué des hypothèses quant à la compréhension de leur fonctionnement. Summary The structure, or the architecture, of living beings defines the framework in which the physics of life takes place. Understanding it in its finest details is an essential goal of biology. Its study is however hampered by technical limitations. Despite its theoretical potential, electron microscopy cannot resolve individual atoms in biological matter. This is in great part due to the fact. that it contains a lot of water that cannot stand the vacuum of the microscope. It must therefore be dehydrated before being introduced in a conventional mìcroscope. Aggregation artefacts unavoidably happen. Cryo-electron microscopy of vitreous sections (CEMOVIS) has been developed to solve this problem. Specimens are vitrified, i.e. they are rapidly cooled and their water is immobilised without crystallising by the cold. They are then. sectioned in ultrathin slices, which are observed at low temperatures. Specimens are therefore observed in hydrated and unfixed form; they are close to their native state. For a long time, CEMOVIS was extremely tedious but this is not the case anymore. During this thesis, CEMOVIS was applied to different specimens. Synapse of central nervous system was studied. A high density of periodically-organised molecules was shown in the synaptic cleft. Luminal particles were found in brain microtubules. Microtubules, used as test specimen, permitted to demonstrate that molecular details of the order of nm .are preserved. The understanding of the structure of cell envelope of Gram-positive bacteria was improved. Our observations led to the elaboration of a new hypothetic model of cell wall synthesis. We also focused our attention on bacterial nucleoids and this also gave rise to a functional model of nucleoid structural states. In conclusion, this thesis demonstrated that CEMOVIS is an excellent method for studying the structure of bìologìcal specimens at high resolution. The study of the structure of various aspects of living beings evoked hypothesis for their functioning.
Resumo:
Limited antimicrobial agents are available for the treatment of implant-associated infections caused by fluoroquinolone-resistant Gram-negative bacilli. We compared the activities of fosfomycin, tigecycline, colistin, and gentamicin (alone and in combination) against a CTX-M15-producing strain of Escherichia coli (Bj HDE-1) in vitro and in a foreign-body infection model. The MIC and the minimal bactericidal concentration in logarithmic phase (MBC(log)) and stationary phase (MBC(stat)) were 0.12, 0.12, and 8 μg/ml for fosfomycin, 0.25, 32, and 32 μg/ml for tigecycline, 0.25, 0.5, and 2 μg/ml for colistin, and 2, 8, and 16 μg/ml for gentamicin, respectively. In time-kill studies, colistin showed concentration-dependent activity, but regrowth occurred after 24 h. Fosfomycin demonstrated rapid bactericidal activity at the MIC, and no regrowth occurred. Synergistic activity between fosfomycin and colistin in vitro was observed, with no detectable bacterial counts after 6 h. In animal studies, fosfomycin reduced planktonic counts by 4 log(10) CFU/ml, whereas in combination with colistin, tigecycline, or gentamicin, it reduced counts by >6 log(10) CFU/ml. Fosfomycin was the only single agent which was able to eradicate E. coli biofilms (cure rate, 17% of implanted, infected cages). In combination, colistin plus tigecycline (50%) and fosfomycin plus gentamicin (42%) cured significantly more infected cages than colistin plus gentamicin (33%) or fosfomycin plus tigecycline (25%) (P < 0.05). The combination of fosfomycin plus colistin showed the highest cure rate (67%), which was significantly better than that of fosfomycin alone (P < 0.05). In conclusion, the combination of fosfomycin plus colistin is a promising treatment option for implant-associated infections caused by fluoroquinolone-resistant Gram-negative bacilli.