6 resultados para Extraction of BR from Source Code
Resumo:
The extraction of relevant terms from texts is an extensively researched task in Text- Mining. Relevant terms have been applied in areas such as Information Retrieval or document clustering and classification. However, relevance has a rather fuzzy nature since the classification of some terms as relevant or not relevant is not consensual. For instance, while words such as "president" and "republic" are generally considered relevant by human evaluators, and words like "the" and "or" are not, terms such as "read" and "finish" gather no consensus about their semantic and informativeness. Concepts, on the other hand, have a less fuzzy nature. Therefore, instead of deciding on the relevance of a term during the extraction phase, as most extractors do, I propose to first extract, from texts, what I have called generic concepts (all concepts) and postpone the decision about relevance for downstream applications, accordingly to their needs. For instance, a keyword extractor may assume that the most relevant keywords are the most frequent concepts on the documents. Moreover, most statistical extractors are incapable of extracting single-word and multi-word expressions using the same methodology. These factors led to the development of the ConceptExtractor, a statistical and language-independent methodology which is explained in Part I of this thesis. In Part II, I will show that the automatic extraction of concepts has great applicability. For instance, for the extraction of keywords from documents, using the Tf-Idf metric only on concepts yields better results than using Tf-Idf without concepts, specially for multi-words. In addition, since concepts can be semantically related to other concepts, this allows us to build implicit document descriptors. These applications led to published work. Finally, I will present some work that, although not published yet, is briefly discussed in this document.
Resumo:
Thesis submitted to Faculdade de Ciências e Tecnologia of the Universidade Nova de Lisboa, in partial fulfilment of the requirements for the degree of Master in Computer Science
Resumo:
Dissertation presented at Faculdade de Ciências e Tecnologia from Universidade Nova de Lisboa to obtain the degree of Master in Chemical and Biochemical Engineering
Resumo:
Dissertação para obtenção do Grau de Mestre em Engenharia Química e Bioquímica
Resumo:
Phenolic acids are aromatic secondary plant metabolites, widely spread throughout the plant kingdom. Due to their biological and pharmacological properties, they have been playing an important role in phytotherapy and consequently techniques for their separation and purification are in need. This thesis aims at exploring new sustainable separation processes based on ionic liquids (ILs) in the extraction of biologically active phenolic acids. For that purpose, three phenolic acids with similar chemical structures were selected: cinnamic acid, p-coumaric acid and caffeic acid. In the last years, it has been shown that ionic liquids-based aqueous biphasic systems (ABSs) are valid alternatives for the extraction, recovery and purification of biomolecules when compared to conventional ABS or extractions carried out with organic solvents. In particular, cholinium-based ILs represent a clear step towards a greener chemistry, while providing means for the implementation of efficient techniques for the separation and purification of biomolecules. In this work, ABSs were implemented using cholinium carboxylate ILs using either high charge density inorganic salt (K3PO4) or polyethylene glycol (PEG) to promote the phase separation of aqueous solutions containing three different phenolic acids. These systems allow for the evaluation of effect of chemical structure of the anion on the extraction efficiency. Only one imidazolium-based IL was used in order to establish the effect of the cation chemical structure. The selective extraction of one single acid was also researched. Overall, it was observed that phenolic acids display very complex behaviours in aqueous solutions, from dimerization to polymerization and also hetero-association are quite frequent phenomena, depending on the pH conditions. These phenomena greatly hinder the correct quantification of these acids in solution.
Resumo:
Different oil-containing substrates, namely, used cooking oil (UCO), fatty acids-byproduct from biodiesel production (FAB) and olive oil deodorizer distillate (OODD) were tested as inexpensive carbon sources for the production of polyhydroxyalkanoates (PHA) using twelve bacterial strains, in batch experiments. The OODD and FAB were exploited for the first time as alternative substrates for PHA production. Among the tested bacterial strains, Cupriavidus necator and Pseudomonas resinovorans exhibited the most promising results, producing poly-3-hydroxybutyrate, P(3HB), form UCO and OODD and mcl-PHA mainly composed of 3-hydroxyoctanoate (3HO) and 3-hydroxydecanoate (3HD) monomers from OODD, respectively. Afterwards, these bacterial strains were cultivated in bioreactor. C. necator were cultivated in bioreactor using UCO as carbon source. Different feeding strategies were tested for the bioreactor cultivation of C. necator, namely, batch, exponential feeding and DO-stat mode. The highest overall PHA productivity (12.6±0.78 g L-1 day-1) was obtained using DO-stat mode. Apparently, the different feeding regimes had no impact on polymer thermal properties. However, differences in polymer‟s molecular mass distribution were observed. C. necator was also tested in batch and fed-batch modes using a different type of oil-containing substrate, extracted from spent coffee grounds (SCG) by super critical carbon dioxide (sc-CO2). Under fed-batch mode (DO-stat), the overall PHA productivity were 4.7 g L-1 day-1 with a storage yield of 0.77 g g-1. Results showed that SCG can be a bioresource for production of PHA with interesting properties. Furthermore, P. resinovorans was cultivated using OODD as substrate in bioreactor under fed-batch mode (pulse feeding regime). The polymer was highly amorphous, as shown by its low crystallinity of 6±0.2%, with low melting and glass transition temperatures of 36±1.2 and -16±0.8 ºC, respectively. Due to its sticky behavior at room temperature, adhesiveness and mechanical properties were also studied. Its shear bond strength for wood (67±9.4 kPa) and glass (65±7.3 kPa) suggests it may be used for the development of biobased glues. Bioreactor operation and monitoring with oil-containing substrates is very challenging, since this substrate is water immiscible. Thus, near-infrared spectroscopy (NIR) was implemented for online monitoring of the C. necator cultivation with UCO, using a transflectance probe. Partial least squares (PLS) regression was applied to relate NIR spectra with biomass, UCO and PHA concentrations in the broth. The NIR predictions were compared with values obtained by offline reference methods. Prediction errors to these parameters were 1.18 g L-1, 2.37 g L-1 and 1.58 g L-1 for biomass, UCO and PHA, respectively, which indicates the suitability of the NIR spectroscopy method for online monitoring and as a method to assist bioreactor control. UCO and OODD are low cost substrates with potential to be used in PHA batch and fed-batch production. The use of NIR in this bioprocess also opened an opportunity for optimization and control of PHA production process.