4 resultados para CMF, molecular cloud, extraction algorithm
Resumo:
The automatic acquisition of lexical associations from corpora is a crucial issue for Natural Language Processing. A lexical association is a recurrent combination of words that co-occur together more often than expected by chance in a given domain. In fact, lexical associations define linguistic phenomena such as idiomes, collocations or compound words. Due to the fact that the sense of a lexical association is not compositionnal, their identification is fundamental for the realization of analysis and synthesis that take into account all the subtilities of the language. In this report, we introduce a new statistically-based architecture that extracts from naturally occurring texts contiguous and non contiguous. For that purpose, three new concepts have been defined : the positional N-gram models, the Mutual Expectation and the GenLocalMaxs algorithm. Thus, the initial text is fisrtly transformed in a set of positionnal N-grams i.e ordered vectors of simple lexical units. Then, an association measure, the Mutual Expectation, evaluates the degree of cohesion of each positional N-grams based on the identification of local maximum values of Mutual Expectation. Great efforts have also been carried out to evaluate our metodology. For that purpose, we have proposed the normalisation of five well-known association measures and shown that both the Mutual Expectation and the GenLocalMaxs algorithm evidence significant improvements comparing to existent metodologies.
Resumo:
Dissertação para obtenção do Grau de Mestre em Engenharia Biomédica
Resumo:
Dissertação para obtenção do Grau de Mestre em Engenharia Biomédica
Resumo:
Mycobacterium avium Complex (MAC) comprises microorganisms that affect a wide range of animals including humans. The most relevant are Mycobacterium avium subspecies hominissuis (Mah) with a high impact on public health affecting mainly immunocompromised individuals and Mycobacterium avium subspecies paratuberculosis (Map) causing paratuberculosis in animals with a high economic impact worldwide. In this work, we characterized 28 human and 67 porcine Mah isolates and evaluated the relationship among them by Multiple-Locus Variable number tandem repeat Analysis (MLVA). We concluded that Mah population presented a high genetic diversity and no correlations were inferred based on geographical origin, host or biological sample. For the first time in Portugal Map strains, from asymptomatic bovine faecal samples were isolated highlighting the need of more reliable and rapid diagnostic methods for Map direct detection. Therefore, we developed an IS900 nested real time PCR with high sensitivity and specificity associated with optimized DNA extraction methodologies for faecal and milk samples. We detected 83% of 155 faecal samples from goats, cattle and sheep, and 26% of 98 milk samples from cattle, positive for Map IS900 nested real time PCR. A novel SNPs (single nucleotide polymorphisms) assay to Map characterization based on a Whole Genome Sequencing analysis was developed to elucidate the genetic relationship between strains. Based on sequential detection of 14 SNPs and on a decision tree we were able to differentiate 14 phylogenetic groups with a higher discriminatory power compared to other typing methods. A pigmented Map strain was isolated and characterized evidencing for the first time to our knowledge the existence of pigmented Type C strains. With this work, we intended to improve the ante mortem direct molecular detection of Map, to conscientiously aware for the existence of Map animal infections widespread in Portugal and to contribute to the improvement of Map and Mah epidemiological studies.