986 results for Search based on sketch


Relevance:

100.00% 100.00%

Publisher:

Abstract:

Choosing the right or the best option is often a demanding and challenging task for the user (e.g., a customer in an online retailer) when there are many available alternatives. In fact, the user rarely knows which offering will provide the highest value. To reduce the complexity of the choice process, automated recommender systems generate personalized recommendations. These recommendations take into account the preferences collected from the user in an explicit (e.g., letting users express their opinion about items) or implicit (e.g., studying some behavioral features) way. Such systems are widespread; research indicates that they increase the customers' satisfaction and lead to higher sales. Preference handling is one of the core issues in the design of every recommender system. This kind of system often aims at guiding users in a personalized way to interesting or useful options in a large space of possible options. Therefore, it is important for them to catch and model the user's preferences as accurately as possible. In this thesis, we develop a comparative preference-based user model to represent the user's preferences in conversational recommender systems. This type of user model allows the recommender system to capture several preference nuances from the user's feedback. We show that, when applied to conversational recommender systems, the comparative preference-based model is able to guide the user towards the best option while the system is interacting with her. We empirically test and validate the suitability and the practical computational aspects of the comparative preference-based user model and the related preference relations by comparing them to a sum of weights-based user model and the related preference relations. 
Product configuration, scheduling a meeting and the construction of autonomous agents are among several artificial intelligence tasks that involve a process of constrained optimization, that is, optimization of behavior or options subject to given constraints with regard to a set of preferences. When solving a constrained optimization problem, pruning techniques, such as the branch and bound technique, aim to direct the search towards the best assignments, thus allowing the bounding functions to prune more branches in the search tree. Several constrained optimization problems may exhibit dominance relations. These dominance relations can be particularly useful in constrained optimization problems as they can instigate new ways (rules) of pruning non-optimal solutions. Such pruning methods can achieve dramatic reductions in the search space while looking for optimal solutions. A number of constrained optimization problems can model the user's preferences using the comparative preferences. In this thesis, we develop a set of pruning rules used in the branch and bound technique to efficiently solve this kind of optimization problem. More specifically, we show how to generate newly defined pruning rules from a dominance algorithm that refers to a set of comparative preferences. These rules include pruning approaches (and combinations of them) which can drastically prune the search space. They mainly reduce the number of (expensive) pairwise comparisons performed during the search while guiding constrained optimization algorithms to find optimal solutions. Our experimental results show that the pruning rules that we have developed and their different combinations have varying impact on the performance of the branch and bound technique.
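The idea of pruning a branch-and-bound search with a dominance relation can be illustrated with a minimal sketch. It is not the thesis's rule set: plain Pareto dominance over cost vectors stands in for the comparative-preference dominance algorithm, and the names `dominates`, `branch_and_bound`, `PRICE`, `WEIGHT` and the toy constraint are all hypothetical.

```python
def dominates(a, b):
    """True if cost vector a is no worse than b everywhere and better somewhere (lower is better)."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))


def branch_and_bound(domains, cost, feasible):
    """Depth-first search that keeps only non-dominated complete assignments.

    domains  -- list of value lists, one per variable
    cost     -- cost(i, v) -> tuple of costs for assigning value v to variable i
    feasible -- feasible(partial) -> bool; must stay False once it becomes False
    """
    n = len(domains)
    dims = len(cost(0, domains[0][0]))
    # Optimistic (component-wise minimal) cost of each variable, used as a lower bound.
    optimistic = [tuple(min(cost(i, v)[d] for v in domains[i]) for d in range(dims))
                  for i in range(n)]
    front = []   # (cost vector, assignment) pairs that are currently non-dominated
    pruned = 0

    def add(a, b):
        return tuple(x + y for x, y in zip(a, b))

    def search(i, partial, acc):
        nonlocal front, pruned
        if not feasible(partial):
            return
        bound = acc
        for j in range(i, n):
            bound = add(bound, optimistic[j])
        # Dominance pruning: if a known solution dominates the optimistic bound,
        # it dominates every completion of this partial assignment.
        if any(dominates(c, bound) for c, _ in front):
            pruned += 1
            return
        if i == n:
            front = [(c, s) for c, s in front if not dominates(acc, c)]
            front.append((acc, tuple(partial)))
            return
        for v in domains[i]:
            search(i + 1, partial + [v], add(acc, cost(i, v)))

    search(0, [], (0,) * dims)
    return front, pruned


# Hypothetical toy problem: three components, three options each, two costs to minimize.
PRICE = [[4, 2, 1], [3, 3, 1], [5, 2, 2]]
WEIGHT = [[1, 2, 4], [1, 2, 3], [1, 3, 5]]
DOMAINS = [[0, 1, 2]] * 3


def cost(i, v):
    return (PRICE[i][v], WEIGHT[i][v])


def feasible(partial):
    # Toy constraint: the option indices may sum to at most 4.
    return sum(partial) <= 4


front, pruned = branch_and_bound(DOMAINS, cost, feasible)
for c, s in sorted(front):
    print(c, s)
print("pruned branches:", pruned)
```

Each pruned branch saves all the pairwise dominance checks that its leaves would otherwise have triggered, which is the kind of saving the abstract describes.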

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Paper presented at the Cloud Forward Conference 2015, October 6th-8th, Pisa

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Strasheela provides a means for the composer to create a symbolic score by formally describing it in a rule-based way. The environment defines a rich music representation for complex polyphonic scores. Strasheela enables the user to define expressive compositional rules and then to apply them to the score. Compositional rules can restrict many aspects of the music - including the rhythmic structure, the melodic structure and the harmonic structure - by constraining the parameters (e.g. duration or pitch) of musical events according to some numerical or logical relation. Strasheela combines this expressivity with efficient search strategies.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The phnA gene that encodes the carbon-phosphorus bond cleavage enzyme phosphonoacetate hydrolase is widely distributed in the environment, suggesting that its phosphonate substrate may play a significant role in biogeochemical phosphorus cycling. Surprisingly, however, no biogenic origin for phosphonoacetate has yet been established. To facilitate the search for its natural source we have constructed a whole-cell phosphonoacetate biosensor. The gene encoding the LysR-type transcriptional activator PhnR, which controls expression of the phosphonoacetate degradative operon in Pseudomonas fluorescens 23F, was inserted in the broad-host-range promoter probe vector pPROBE-NT, together with the promoter region of the structural genes. Cells of Escherichia coli DH5α that contained the resultant construct, pPANT3, exhibited phosphonoacetate-dependent green fluorescent protein fluorescence in response to threshold concentrations of as little as 0.5 µM phosphonoacetate, some 100 times lower than the detection limit of currently available non-biological analytical methods; the pPANT3 biosensor construct in Pseudomonas putida KT2440 was less sensitive, although with shorter response times. From a range of other phosphonates and phosphonoacetate analogues tested, only phosphonoacetaldehyde and arsonoacetate induced green fluorescent protein fluorescence in the E. coli DH5α (pPANT3) biosensor, although at much-reduced sensitivities (50 µM phosphonoacetaldehyde and 500 µM arsonoacetate).

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The conventional radial basis function (RBF) network optimization methods, such as orthogonal least squares or the two-stage selection, can produce a sparse network with satisfactory generalization capability. However, the RBF width, as a nonlinear parameter in the network, is not easy to determine. In the aforementioned methods, the width is always pre-determined, either by trial-and-error, or generated randomly. Furthermore, all hidden nodes share the same RBF width. This will inevitably reduce the network performance, and more RBF centres may then be needed to meet a desired modelling specification. In this paper we investigate a new two-stage construction algorithm for RBF networks. It utilizes the particle swarm optimization method to search for the optimal RBF centres and their associated widths. Although the new method needs more computation than conventional approaches, it can greatly reduce the model size and improve model generalization performance. The effectiveness of the proposed technique is confirmed by two numerical simulation examples.
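As a rough illustration of using particle swarm optimization to search for an RBF centre and its width, the sketch below fits a single Gaussian node to toy data. It is not the paper's two-stage construction algorithm: the one-node model, the closed-form output weight, the PSO coefficients and all names (`rbf`, `sse_for`, `pso`) are assumptions made for the example.

```python
import math
import random


def rbf(x, centre, width):
    """Gaussian radial basis function."""
    return math.exp(-((x - centre) ** 2) / (2.0 * width ** 2))


def sse_for(params, xs, ys):
    """Sum of squared errors of a one-node RBF model.

    The linear output weight is solved in closed form, so the swarm only has
    to search over the nonlinear parameters (centre, width).
    """
    centre, width = params
    phi = [rbf(x, centre, width) for x in xs]
    denom = sum(p * p for p in phi)
    weight = sum(p * y for p, y in zip(phi, ys)) / denom if denom > 1e-12 else 0.0
    return sum((y - weight * p) ** 2 for p, y in zip(phi, ys))


def pso(objective, bounds, n_particles=20, n_iter=60,
        inertia=0.7, c1=1.5, c2=1.5, seed=0):
    """Basic global-best PSO with positions clamped to the given bounds."""
    rng = random.Random(seed)
    pos = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(n_particles)]
    vel = [[0.0] * len(bounds) for _ in range(n_particles)]
    pbest = [p[:] for p in pos]
    pbest_val = [objective(p) for p in pos]
    g = min(range(n_particles), key=lambda i: pbest_val[i])
    gbest, gbest_val = pbest[g][:], pbest_val[g]
    history = [gbest_val]
    for _ in range(n_iter):
        for i in range(n_particles):
            for d, (lo, hi) in enumerate(bounds):
                r1, r2 = rng.random(), rng.random()
                vel[i][d] = (inertia * vel[i][d]
                             + c1 * r1 * (pbest[i][d] - pos[i][d])
                             + c2 * r2 * (gbest[d] - pos[i][d]))
                pos[i][d] = min(hi, max(lo, pos[i][d] + vel[i][d]))
            val = objective(pos[i])
            if val < pbest_val[i]:
                pbest[i], pbest_val[i] = pos[i][:], val
                if val < gbest_val:
                    gbest, gbest_val = pos[i][:], val
        history.append(gbest_val)
    return gbest, gbest_val, history


# Toy data generated from a known node (centre 0.8, width 0.5, weight 2.0).
xs = [i / 10 for i in range(-30, 31)]
ys = [2.0 * rbf(x, 0.8, 0.5) for x in xs]
bounds = [(-3.0, 3.0), (0.05, 3.0)]  # (centre range, width range)

best, best_val, history = pso(lambda p: sse_for(p, xs, ys), bounds)
print("centre, width:", best, "SSE:", best_val)
```

Letting each node carry its own width, as the abstract proposes, simply means giving the swarm one (centre, width) pair per node instead of a shared width.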

Relevance:

100.00% 100.00%

Publisher:

Abstract:

It is convenient and effective to solve nonlinear problems with a model that has a linear-in-the-parameters (LITP) structure. However, the nonlinear parameters (e.g. the width of the Gaussian function) of each model term need to be pre-determined either from expert experience or through exhaustive search. An alternative approach is to optimize them by a gradient-based technique (e.g. Newton’s method). Unfortunately, all of these methods still need a lot of computation. Recently, the extreme learning machine (ELM) has shown its advantages in terms of fast learning from data, but the sparsity of the constructed model cannot be guaranteed. This paper proposes a novel algorithm for automatic construction of a nonlinear system model based on the extreme learning machine. This is achieved by effectively integrating the ELM and leave-one-out (LOO) cross validation with our two-stage stepwise construction procedure [1]. The main objective is to improve the compactness and generalization capability of the model constructed by the ELM method. Numerical analysis shows that the proposed algorithm involves only about half of the computation of the orthogonal least squares (OLS) based method. Simulation examples are included to confirm the efficacy and superiority of the proposed technique.
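LOO cross validation is cheap for a linear-in-the-parameters model fitted by ordinary least squares, because the leave-one-out residuals follow from a single fit. The identity below is the standard one for ordinary least squares; the abstract does not give the paper's own formulation, and the symbols are assumptions: $\boldsymbol{\Phi}$ is the $N \times M$ matrix of hidden-node outputs with rows $\boldsymbol{\phi}_i^{\top}$, $\mathbf{y}$ the targets, and $h_{ii}$ the diagonal entries of the hat matrix $\mathbf{H}$.

```latex
\begin{align}
\hat{\boldsymbol{\theta}} &= (\boldsymbol{\Phi}^{\top}\boldsymbol{\Phi})^{-1}\boldsymbol{\Phi}^{\top}\mathbf{y},
&
\mathbf{H} &= \boldsymbol{\Phi}(\boldsymbol{\Phi}^{\top}\boldsymbol{\Phi})^{-1}\boldsymbol{\Phi}^{\top}, \\
e_i &= y_i - \boldsymbol{\phi}_i^{\top}\hat{\boldsymbol{\theta}},
&
e_i^{(-i)} &= \frac{e_i}{1 - h_{ii}}, \\
J_{\mathrm{LOO}} &= \frac{1}{N}\sum_{i=1}^{N}\left(\frac{e_i}{1 - h_{ii}}\right)^{2}.
\end{align}
```

A model term can then be kept or dropped according to whether it lowers $J_{\mathrm{LOO}}$, without refitting the model $N$ times.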

Relevance:

100.00% 100.00%

Publisher:

Abstract:

A rapidly increasing number of Web databases have now become accessible via their HTML form-based query interfaces. Query result pages are dynamically generated in response to user queries, which encode structured data and are displayed for human use. Query result pages usually contain other types of information in addition to query results, e.g., advertisements, navigation bars, etc. The problem of extracting structured data from query result pages is critical for web data integration applications, such as comparison shopping, meta-search engines, etc., and has been intensively studied. A number of approaches have been proposed. As the structures of Web pages become more and more complex, the existing approaches start to fail, and most of them do not remove irrelevant contents which may affect the accuracy of data record extraction. We propose an automated approach for Web data extraction. First, it makes use of visual features and query terms to identify data sections and extracts data records in these sections. We also represent several content and visual features of visual blocks in a data section, and use them to filter out noisy blocks. Second, it measures similarity between data items in different data records based on their visual and content features, and aligns them into different groups so that the data in the same group have the same semantics. The results of our experiments with a large set of Web query result pages in different domains show that our proposed approaches are highly effective.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Using the foraging movements of an insectivorous bat, Myotis mystacinus, we describe temporal switching of foraging behaviour in response to resource availability. These observations conform to predictions of optimized search under the Lévy flight paradigm. However, we suggest that this occurs as a result of a preference behaviour and knowledge of resource distribution. Preferential behaviour and knowledge of a familiar area generate distinct movement patterns as resource availability changes on short temporal scales. The behavioural response of predators to changes in prey fields can elicit different functional responses, which are considered to be central in the development of stable predator-prey communities. Recognizing how the foraging movements of an animal relate to environmental conditions also elucidates the evolution of optimized search and the prevalence of discrete strategies in natural systems. Applying techniques that use changes in the frequency distribution of movements facilitates exploration of the processes that underpin behavioural changes. © 2012 The Author(s) Published by the Royal Society. All rights reserved.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Web databases are now pervasive. Such a database can be accessed via its query interface (usually an HTML query form) only. Extracting Web query interfaces is a critical step in data integration across multiple Web databases, which creates a formal representation of a query form by extracting a set of query conditions in it. This paper presents a novel approach to extracting Web query interfaces. In this approach, a generic set of query condition rules is created to define query conditions that are semantically equivalent to SQL search conditions. Query condition rules represent the semantic roles that labels and form elements play in query conditions, and how they are hierarchically grouped into constructs of query conditions. To group labels and form elements in a query form, we explore both their structural proximity in the hierarchy of structures in the query form, which is captured by a tree of nested tags in the HTML code of the form, and their semantic similarity, which is captured by various short texts used in labels, form elements and their properties. We have implemented the proposed approach and our experimental results show that the approach is highly effective.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The future and inevitable scarcity of fossil resources, together with the unpredictable increase in their prices, has led in recent decades to an impressive rise in initiatives devoted not only to the search for alternative energy sources, but also for chemicals and polymers from renewable sources, in particular plant biomass. Among these, polymers derived from furan monomers constitute a unique class of materials whose structures can, in principle, virtually simulate their counterparts currently derived from fossil resources. The furan ring is a heterocyclic structure with a pronounced dienic character, which makes it a particularly suitable diene for the Diels-Alder (DA) reaction with dienophiles such as maleimide. One of the most relevant aspects of the DA reaction is its temperature-dependent reversibility, which allows the adducts to be easily reverted to their precursors by raising the temperature (retro-DA reaction). In the specific case of the furan-maleimide combination, adduct formation predominates up to about 60 °C, whereas the reverse reaction is dominant above 100 °C. Combining this feature of the DA reaction with the chemistry of furan compounds may open a new route to the preparation of functional macromolecular materials based on renewable sources and with promising applications such as self-healing and recyclability. The main objective of this thesis is the synthesis and characterization of new thermoreversible polymeric materials, applying the DA reaction to complementary monomers with furan-type (the diene, designated A) and maleimide-type (the dienophile, designated B) structures. The first stage of this work involved the synthesis, purification and characterization of new furan and maleimide monomers of the types AA, A3, BB, B3, AB and AB2, each with different spacer groups between the reactive functions.
Subsequently, these monomers were polymerized and depolymerized through DA/retro-DA cycles using different combinations. The formation and dissociation of all DA adducts were followed by both UV and ¹H NMR spectroscopy. The first DA system studied was a model combination of monofunctional reagents (-A + -B), namely furfuryl acetate (FA) and N-methylmaleimide (MM), both commercially available. The aim of this approach was to study the kinetics and equilibrium of DA adduct formation and dissociation and to obtain indications of the most suitable conditions for preparing the corresponding new macromolecular materials. In addition, it was intended to check for the presence or absence of side reactions that could interfere with both the forward and reverse pathways, even after several cycles. UV spectroscopy provided quantitative information on the kinetics of adduct formation through the progressive decrease of the absorbance maximum at 293 nm, corresponding to the maleimide group, at different temperatures (35, 50 and 65 °C). Conversely, the corresponding retro-DA reaction was followed at 90 °C through the increase of the same peak. The reversibility of these systems was successfully verified after a sequence of DA/retro-DA cycles. Additionally, the spectra displayed an isosbestic point, proving that these systems do not involve any side reactions. Since an excess of FA was used, the model DA reactions showed pseudo-first-order kinetic behaviour, with the highest rate constant k (2.1 × 10⁻⁵ dm³ mol⁻¹ s⁻¹) at T = 65 °C. The corresponding activation energy was 39.0 kJ mol⁻¹. The retro-DA reaction followed first-order behaviour, with a rate constant of 1.6 × 10⁻⁶ s⁻¹.
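To give a feel for the time scale implied by a first-order rate constant of this size, the short sketch below converts the reported retro-DA constant (1.6 × 10⁻⁶ s⁻¹ at 90 °C) into a half-life using the standard relation t½ = ln 2 / k. The function names are hypothetical and the calculation assumes ideal first-order decay over the whole time range.

```python
import math


def first_order_half_life(k):
    """Half-life in seconds of a first-order process with rate constant k in s^-1."""
    return math.log(2) / k


def fraction_remaining(k, t):
    """Fraction of the starting species left after t seconds of first-order decay."""
    return math.exp(-k * t)


K_RETRO_DA = 1.6e-6  # s^-1, retro-DA rate constant at 90 °C quoted in the abstract

half_life_days = first_order_half_life(K_RETRO_DA) / 86400
print(f"half-life: {half_life_days:.2f} days")
print(f"adduct left after 1 day: {fraction_remaining(K_RETRO_DA, 86400):.3f}")
```

Under these assumptions the adduct half-life at 90 °C is about five days.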
Following this system by ¹H NMR at 65 °C gave more detailed information on its structural evolution: as the intensity of the peaks assigned to the adduct increased progressively over time, those belonging to the starting reagents decreased proportionally. The "final yield", calculated after 20 days at room temperature, was approximately 70%. The retro-DA reaction was then followed at 90 °C and, as with UV spectroscopy, the reaction was observed to shift towards regeneration of the starting reagents. The feasibility of multiple DA/retro-DA cycles established by UV spectroscopy was likewise confirmed by ¹H NMR. The next step involved the study of a linear polycondensation system based on step growth by the DA reaction between a bisfuran monomer A-A and a bismaleimide monomer B-B, following the same approach as in the model system. The linear polyadduct was obtained from equimolar solutions of the monomers by DA reaction at 65 °C. The progress of this polymerization was followed by UV and ¹H NMR spectroscopy and, more qualitatively, by the increase in the viscosity of the medium. The reaction followed second-order behaviour, with a rate constant of 9.4 × 10⁻⁶ dm³ mol⁻¹ s⁻¹, and an isosbestic point was again observed in the UV data. The NMR spectra showed the expected pattern, namely the progressive increase of the signals associated with the adduct and the corresponding decrease of those of the free furan and maleimide groups. Depolymerization of the polyadduct through the retro-DA reaction was followed at 110 °C using the same techniques. The UV data showed the progressive return of the maleimide group absorption, following first-order kinetic behaviour with a rate constant of 2.5 × 10⁻⁶ s⁻¹, until complete regeneration of both monomers.
The NMR spectra once again provided structural information on the progress of the depolymerization, which was accompanied by a progressive decrease in viscosity. Additionally, to follow the retro-DA reaction, an excess of a monofunctional furan compound, namely 2,5-dimethylfuran (DMFu), was added to the system in order to block the complementary maleimide functions, thus preventing repolymerization upon cooling. The isolated products were then the bisfuran monomer AA, unreacted DMFu and the non-polymerizable bisadduct of BB with DMFu. This result clearly indicated that the polymer had indeed reverted to its monomers during the retro-DA reaction. The third system studied was another linear polymerization, under the same experimental conditions as the previous ones but with a different strategy, in order to circumvent the classic problem of ensuring the exact stoichiometry of the monomers. The monomer structures used incorporate both reactive groups, i.e., A-B type molecules. Premature polymerization of these intrinsically reactive monomers was avoided by protecting the maleimide group in the form of a DA adduct with furan until the furan substituent had been incorporated at the other end. The polycondensation of these monomers was therefore initiated after in situ deprotection of this compound by heating, followed by cooling to the temperature suitable for polymerization. The UV and NMR results suggest that the use of A-B type monomers does indeed offer a better linear system. Next, non-linear DA polycondensation systems were studied, between monomers (one or both) with functionality greater than two, namely systems of the A3+B-B or A-A+B3 type, once again under the same experimental conditions. Since they use complementary monomers containing, on average, more than two functionalities, these systems lead to crosslinked materials.
In these studies, three [maleimide]/[furan] molar ratios were used, namely 1.0, 0.75 and 0.5, in order to study both non-gelling and crosslinking situations. Both systems showed regular behaviour and good recyclability, whether in situations leading to network formation at different degrees of conversion or in situations stopping short of it, as predicted by the Flory-Stockmayer equation. As expected, the use of complementary groups in stoichiometric amounts produced the fastest thickening and almost complete crosslinking; as the relative amount of trifunctional monomer decreased, the reactions stopped before crosslinking, that is, they gave highly viscous media containing highly branched soluble polymers. The retro-DA reactions at 110 °C led to the gradual dissolution of the gel particles (when present), as confirmed by the UV and ¹H NMR spectra, which showed the regeneration of the monomers. As in the A-A+B-B system, the retro-DA reaction was followed by adding an excess of DMFu to the reaction system. As expected, the final products were the furan monomers, the excess DMFu and the maleimide-DMFu trisadduct or bisadduct, which confirms the efficiency of the depolymerization with regeneration of the initial monomers. The last DA polycondensation system involved an asymmetrically substituted AB2-type monomer, capable of giving hyperbranched macromolecular structures that do not crosslink. This preliminary study was carried out under the same experimental conditions as the previous ones and showed behaviour with the expected characteristics.
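The role of the three molar ratios can be illustrated with the textbook Flory-Stockmayer gel-point condition for a trifunctional monomer reacting with a difunctional one, p_A · p_B = 1 / ((f − 1)(g − 1)). The sketch below assumes that the quoted ratios are ratios of reactive groups and that r ≤ 1 refers to the minority groups; neither assumption is confirmed by the abstract, and the function name is hypothetical.

```python
import math


def critical_conversion(r, f, g):
    """Critical conversion of the minority groups at the gel point.

    r    -- ratio of minority to majority reactive groups (0 < r <= 1)
    f, g -- functionalities of the two complementary monomers

    With p_majority = r * p_minority, the Flory-Stockmayer condition
    p_A * p_B = 1 / ((f - 1) * (g - 1)) gives the value returned here.
    A result above 1 means that gelation cannot be reached.
    """
    return 1.0 / math.sqrt(r * (f - 1) * (g - 1))


# Trifunctional + difunctional system at the three ratios quoted in the abstract.
results = {r: critical_conversion(r, f=3, g=2) for r in (1.0, 0.75, 0.5)}
for r, p_c in results.items():
    print(f"r = {r:.2f}: gel point at {p_c:.3f} conversion of the minority groups")
```

Under these assumptions the gel point moves from about 71% conversion at r = 1.0 to about 82% at r = 0.75, and at r = 0.5 it is reached only at complete conversion, which is consistent with the non-gelling behaviour reported for the lower ratios.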

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Sea salt is a natural product obtained from the evaporation of seawater in saltpans due to the combined effect of wind and sunlight. Nowadays, there is a growing interest in the protection and re-valorisation of saltpans, intrinsically associated with the quality of sea salt, which can be evaluated by its physico-chemical properties. These man-made systems can be located in different geographical areas presenting different environmental surroundings. During the crystallization process, organic compounds coming from these surroundings can be incorporated into sea salt crystals, influencing their final composition. The organic matter associated with sea salt arises from three main sources: algae, the surrounding bacterial community, and anthropogenic activity. Based on the hypothesis that sea salt contains associated organic compounds that can be used as markers of the product, including the environment surrounding the saltpans, the aim of this PhD thesis was to identify these compounds. With this purpose, this work comprised: 1) a deep characterisation of the volatile composition of sea salt by headspace solid phase microextraction combined with comprehensive two-dimensional gas chromatography time-of-flight mass spectrometry (HS-SPME/GC×GC–ToFMS), in search of potential sea salt volatile markers; 2) the development of a methodology to isolate the polymeric material potentially present in sea salt, in amounts that allow its characterisation in terms of polysaccharides and protein; and 3) the exploration of the possible presence of triacylglycerides. The high chromatographic resolution and sensitivity of GC×GC–ToFMS enabled the separation and identification of about three times more volatile compounds from sea salt than unidimensional chromatography (GC–qMS). The chromatographic contour plots obtained revealed the complexity of marine salt volatile composition and confirmed the relevance of GC×GC–ToFMS for this type of analysis.
The structured bidimensional chromatographic profile arising from 1D volatility and 2D polarity was demonstrated, allowing more reliable identifications. Results obtained for the analysis of salt from two locations in Aveiro, harvested over three years, suggest the loss of volatile compounds during storage of the salt. From Atlantic Ocean salts of seven different geographical origins, all produced in 2007, it was possible to identify a sub-set of ten compounds present in all salts, namely 6-methyl-5-hepten-2-one, 2,2,6-trimethylcyclohexanone, isophorone, ketoisophorone, β-ionone-5,6-epoxide, dihydroactinidiolide, 6,10,14-trimethyl-2-pentadecanone, 3-hydroxy-2,4,4-trimethylpentyl 2-methylpropanoate, 2,4,4-trimethylpentane-1,3-diyl bis(2-methylpropanoate), and 2-ethyl-1-hexanol. These ten compounds were considered potential volatile markers of sea salt. Seven of these compounds are carotenoid-derived compounds, and the other three may result from the integration of compounds from anthropogenic activity as metabolites of marine organisms. The present PhD work also allowed the isolation and characterisation, for the first time, of polymeric material from sea salt, using 16 Atlantic Ocean salts. A dialysis-based methodology was developed to isolate the polymeric material from sea salt in amounts that allowed its characterisation. The median content of polymeric material isolated from the 16 salts was 144 mg per kg of salt, i.e. 0.014% (w/w). Mid-infrared spectroscopy and thermogravimetry revealed the main occurrence of sulfated polysaccharides, as well as the presence of protein in the polymeric material from sea salt. Sea salt polysaccharides were found to be rich in uronic acid residues (21 mol%), glucose (18), galactose (16), and fucose (13). Sulfate content represented a median of 45 mol%, the median content of sulfated polysaccharides being 461 mg/g of polymeric material, which accounted for 66 mg/kg of dry salt.
Glycosidic linkage composition indicated that the main sugar residues that could carry one or more sulfate groups were fucose and galactose. This allowed us to infer that the polysaccharides from sea salt arise mainly from algae, due to their abundance and composition. The amino acid profile of the polymeric material from the 16 Atlantic Ocean salts showed as main residues, as medians, alanine (25 mol%), leucine (14), and valine (14), which are hydrophobic, the median protein content being 35 mg/g, i.e. 4.9 mg per kg of dry salt. Besides the occurrence of hydrophobic volatile compounds in sea salt, hydrophobic non-volatile compounds were also detected. Triacylglycerides were obtained from sea salt by Soxhlet extraction with n-hexane. Fatty acid composition revealed palmitic acid as the major residue (43 mol%), followed by stearic (13), linolenic (13), oleic (12), and linoleic (9). The median content of sea salt triacylglycerides was 1.5 mg per kg of dry salt. Both protein and triacylglycerides seem to arise from macro and microalgae, phytoplankton and cyanobacteria, due to their abundance and composition. Despite the variability resulting from the environment surrounding the saltpans, this PhD thesis allowed the identification of a characteristic profile of sea salt organic compounds based on volatile compounds, polysaccharides, protein, and triacylglycerides.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The problem of uncertainty propagation in composite laminate structures is studied. An approach based on the optimal design of composite structures to achieve a target reliability level is proposed. Using the Uniform Design Method (UDM), a set of design points is generated over a design domain centred at mean values of random variables, aimed at studying the space variability. The most critical Tsai number, the structural reliability index and the sensitivities are obtained for each UDM design point, using the maximum load obtained from optimal design search. Using the UDM design points as input/output patterns, an Artificial Neural Network (ANN) is developed based on supervised evolutionary learning. Finally, using the developed ANN a Monte Carlo simulation procedure is implemented and the variability of the structural response based on global sensitivity analysis (GSA) is studied. The GSA is based on the first order Sobol indices and relative sensitivities. An appropriate GSA algorithm aiming to obtain Sobol indices is proposed. The most important sources of uncertainty are identified.
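First-order Sobol indices can be estimated by Monte Carlo sampling of any cheap model, which is the role the trained ANN plays in the abstract. The sketch below is not the paper's GSA algorithm: it uses a commonly cited two-matrix estimator on a toy linear function with independent uniform inputs, and the names `first_order_sobol` and `toy_model` are hypothetical.

```python
import random


def first_order_sobol(model, n_inputs, n_samples=50000, seed=0):
    """Monte Carlo estimate of first-order Sobol indices for inputs ~ U(0, 1).

    Uses two independent sample matrices A and B and, for each input i, the
    matrix AB_i (A with column i taken from B):
        V_i ~= mean( f(B) * (f(AB_i) - f(A)) )
    The outputs are centred by the overall mean to reduce estimator variance.
    """
    rng = random.Random(seed)
    A = [[rng.random() for _ in range(n_inputs)] for _ in range(n_samples)]
    B = [[rng.random() for _ in range(n_inputs)] for _ in range(n_samples)]
    fA = [model(a) for a in A]
    fB = [model(b) for b in B]
    both = fA + fB
    mean = sum(both) / len(both)
    var = sum((y - mean) ** 2 for y in both) / (len(both) - 1)
    indices = []
    for i in range(n_inputs):
        fABi = [model(a[:i] + [b[i]] + a[i + 1:]) for a, b in zip(A, B)]
        v_i = sum((yb - mean) * (yab - ya)
                  for yb, yab, ya in zip(fB, fABi, fA)) / n_samples
        indices.append(v_i / var)
    return indices


def toy_model(x):
    # Stand-in for the surrogate: linear, so the exact indices are 0.2, 0.8 and 0.
    return x[0] + 2.0 * x[1] + 0.0 * x[2]


S = first_order_sobol(toy_model, n_inputs=3)
print([round(s, 3) for s in S])
```

For this additive toy model the exact indices are a_i² Var(X_i) / Var(Y), so the estimates can be checked against 0.2, 0.8 and 0.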

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Optimization methods have been used in many areas of knowledge, such as Engineering, Statistics and Chemistry, among others, to solve optimization problems. In many cases it is not possible to use derivative methods, due to the characteristics of the problem to be solved and/or its constraints, for example if the functions involved are non-smooth and/or their derivatives are not known. To solve this type of problem a Java-based API has been implemented, which includes only derivative-free optimization methods, and which can be used to solve both constrained and unconstrained problems. For solving constrained problems, the classic Penalty and Barrier functions were included in the API. In this paper a new approach to Penalty and Barrier functions, based on Fuzzy Logic, is proposed. Two penalty functions, which impose a progressive penalization on solutions that violate the constraints, are discussed. The implemented functions impose a low penalization when the violation of the constraints is low and a heavy penalty when the violation is high. Numerical results, obtained using twenty-eight test problems, comparing the proposed Fuzzy Logic-based functions to six of the classic Penalty and Barrier functions are presented. Considering the achieved results, it can be concluded that the proposed penalty functions, besides being very robust, also have a very good performance.
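The notion of a progressive penalty combined with a derivative-free method can be sketched as follows. This is not the paper's Fuzzy Logic formulation nor its Java API: the linear "high violation" membership ramp, the weights, the threshold and the compass search are all assumptions chosen only to show a penalty that is light for small violations and heavy for large ones.

```python
def progressive_penalty(violation, low_weight=1.0, high_weight=1000.0, threshold=0.1):
    """Penalty that blends a light and a heavy weight by degree of violation.

    `high` acts as a fuzzy-style membership of "the violation is high": it rises
    linearly from 0 to 1 as the violation goes from 0 to `threshold`.
    """
    v = max(0.0, violation)
    high = min(1.0, v / threshold)
    low = 1.0 - high
    return v * (low * low_weight + high * high_weight)


def compass_search(f, x0, step=1.0, tol=1e-6):
    """Derivative-free compass (coordinate pattern) search for a local minimum."""
    x, fx = list(x0), f(x0)
    while step > tol:
        improved = False
        for d in range(len(x)):
            for delta in (step, -step):
                trial = x[:]
                trial[d] += delta
                ft = f(trial)
                if ft < fx:
                    x, fx, improved = trial, ft, True
                    break
        if not improved:
            step /= 2.0  # no better neighbour at this scale: refine the mesh
    return x, fx


def objective(x):
    return (x[0] - 3.0) ** 2


def constraint(x):
    # Constraint written as g(x) <= 0, here x <= 2.
    return x[0] - 2.0


def penalized(x):
    return objective(x) + progressive_penalty(constraint(x))


x_best, f_best = compass_search(penalized, [0.0])
print(x_best, f_best)
```

The unconstrained minimum of the toy objective lies at x = 3, outside the feasible region, so the search settles close to the constraint boundary at x = 2.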

Relevance:

100.00% 100.00%

Publisher:

Abstract:

This research project is a contribution to the global field of information retrieval, specifically, to develop tools to enable information access in digital documents. We recognize the need to provide the user with flexible access to the contents of large, potentially complex digital documents, with means other than a search function or a handful of metadata elements. The goal is to produce a text browsing tool offering a maximum of information based on a fairly superficial linguistic analysis. We are concerned with a type of extensive single-document indexing, and not indexing by a set of keywords (see Klement, 2002, for a clear distinction between the two). The desired browsing tool would not only give at a glance the main topics discussed in the document, but would also present relationships between these topics. It would also give direct access to the text (via hypertext links to specific passages). The present paper, after reviewing previous research on this and similar topics, discusses the methodology and the main characteristics of a prototype we have devised. Experimental results are presented, as well as an analysis of remaining hurdles and potential applications.