914 resultados para search and matching


Relevância:

40.00% 40.00%

Publicador:

Resumo:

Current-day web search engines (e.g., Google) do not crawl and index a significant portion of theWeb and, hence, web users relying on search engines only are unable to discover and access a large amount of information from the non-indexable part of the Web. Specifically, dynamic pages generated based on parameters provided by a user via web search forms (or search interfaces) are not indexed by search engines and cannot be found in searchers’ results. Such search interfaces provide web users with an online access to myriads of databases on the Web. In order to obtain some information from a web database of interest, a user issues his/her query by specifying query terms in a search form and receives the query results, a set of dynamic pages that embed required information from a database. At the same time, issuing a query via an arbitrary search interface is an extremely complex task for any kind of automatic agents including web crawlers, which, at least up to the present day, do not even attempt to pass through web forms on a large scale. In this thesis, our primary and key object of study is a huge portion of the Web (hereafter referred as the deep Web) hidden behind web search interfaces. We concentrate on three classes of problems around the deep Web: characterization of deep Web, finding and classifying deep web resources, and querying web databases. Characterizing deep Web: Though the term deep Web was coined in 2000, which is sufficiently long ago for any web-related concept/technology, we still do not know many important characteristics of the deep Web. Another matter of concern is that surveys of the deep Web existing so far are predominantly based on study of deep web sites in English. One can then expect that findings from these surveys may be biased, especially owing to a steady increase in non-English web content. In this way, surveying of national segments of the deep Web is of interest not only to national communities but to the whole web community as well. In this thesis, we propose two new methods for estimating the main parameters of deep Web. We use the suggested methods to estimate the scale of one specific national segment of the Web and report our findings. We also build and make publicly available a dataset describing more than 200 web databases from the national segment of the Web. Finding deep web resources: The deep Web has been growing at a very fast pace. It has been estimated that there are hundred thousands of deep web sites. Due to the huge volume of information in the deep Web, there has been a significant interest to approaches that allow users and computer applications to leverage this information. Most approaches assumed that search interfaces to web databases of interest are already discovered and known to query systems. However, such assumptions do not hold true mostly because of the large scale of the deep Web – indeed, for any given domain of interest there are too many web databases with relevant content. Thus, the ability to locate search interfaces to web databases becomes a key requirement for any application accessing the deep Web. In this thesis, we describe the architecture of the I-Crawler, a system for finding and classifying search interfaces. Specifically, the I-Crawler is intentionally designed to be used in deepWeb characterization studies and for constructing directories of deep web resources. Unlike almost all other approaches to the deep Web existing so far, the I-Crawler is able to recognize and analyze JavaScript-rich and non-HTML searchable forms. Querying web databases: Retrieving information by filling out web search forms is a typical task for a web user. This is all the more so as interfaces of conventional search engines are also web forms. At present, a user needs to manually provide input values to search interfaces and then extract required data from the pages with results. The manual filling out forms is not feasible and cumbersome in cases of complex queries but such kind of queries are essential for many web searches especially in the area of e-commerce. In this way, the automation of querying and retrieving data behind search interfaces is desirable and essential for such tasks as building domain-independent deep web crawlers and automated web agents, searching for domain-specific information (vertical search engines), and for extraction and integration of information from various deep web resources. We present a data model for representing search interfaces and discuss techniques for extracting field labels, client-side scripts and structured data from HTML pages. We also describe a representation of result pages and discuss how to extract and store results of form queries. Besides, we present a user-friendly and expressive form query language that allows one to retrieve information behind search interfaces and extract useful data from the result pages based on specified conditions. We implement a prototype system for querying web databases and describe its architecture and components design.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

This paper studies the incidence and consequences of the mismatch between formal education and the educational requirements of jobs in Estonia during the years 1997-2003. We fi nd large wage penalties associated with the phenomenon of educational mismatch. Moreover, the incidence and wage penalty of mismatches increase with age. This suggests that structural educational mismatches can occur after fast transition periods. Our results are robust for various methodologies, and more importantly regarding departures from the exogeneity assumptions inherent in the matching estimators used in our analysis

Relevância:

40.00% 40.00%

Publicador:

Resumo:

This dissertation explores the use of internal and external sources of knowledge in modern innovation processes. It builds on a framework that combines theories such as a behavioural theory of the firm, the evolutionary theory of economic change, and modern approaches to strategic management. It follows the recent increase in innovation research focusing on the firm-level examination of innovative activities instead of traditional industry-level determinants. The innovation process is seen as a problem- and slack- driven search process, which can take several directions in terms of organizational boundaries in the pursuit of new knowledge and other resources. It thus draws on recent models of technological change, according to which firms nowadays should build their innovative activities on both internal and external sources of innovation rather than relying solely on internal resources. Four different research questions are addressed, all of which are empirically investigated via a rich dataset covering Finnish innovators collected by Statistics Finland. Firstly, the study examines how the nature of problems shapes the direction of any search for new knowledge. In general it demonstrates that the nature of the problem does affect the direction of the search, although under resource constraints firms tend to use external rather than internal sources of knowledge. At the same time, it shows that those firms that are constrained in terms of finance seem to search both internally and externally. Secondly, the dissertation investigates the relationships between different kinds of internal and external sources of knowledge in an attempt to find out where firms should direct their search in order to exploit the potential of a distributed innovation process. The concept of complementarities is applied in this context. The third research question concerns how the use of external knowledge sources – openness to external knowledge – influences the financial performance of firms. Given the many advantages of openness presented in the current literature, the focus is on how it shapes profitability. The results reveal a curvilinear relationship between profitability and openness (taking an inverted U-shape), the implication being that it pays to be open up to a certain point, but being too open to external sources may be detrimental to financial performance. Finally, the dissertation addresses some challenges in CISbased innovation research that have received relatively little attention in prior studies. The general aim is to underline the fact that comprehensive understanding of the complex process of technological change requires the constant development of methodological approaches (in terms of data and measures, for example). All the empirical analyses included in the dissertation are based on the Finnish CIS (Finnish Innovation Survey 1998-2000).

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Bovine respiratory syncytial virus (BRSV) has been only sporadically identified as a causative agent of respiratory disease in Brazil. This contrasts with frequent reports of clinical and histopathological findings suggestive of BRSV-associated disease. In order to examine a possible involvement of BRSV in cases of calf pneumonia, a retrospective search was performed for BRSV antigens in histological specimens submitted to veterinary diagnostic services from the states of Rio Grande do Sul and Minas Gerais. Ten out of 41 cases examined (24.4%) were positive for BRSV antigens by immunohistochemistry (IPX). Eight of these cases (19.5%) were also positive by indirect immunofluorescence (IFA), and 31 cases (75.6%) were negative in both assays. In the lungs, BRSV antigens were predominantly observed in epithelial cells of bronchioles and less frequently found in alveoli. In one case, antigens were detected only in the epithelium of the alveolar septae. The presence of antigen-positive cells was largely restricted to epithelial cells of these airways. In two cases, positive staining was also observed in cells and cellular debris in the exudate within the pulmonary airways. The clinical cases positive for BRSV antigens were observed mainly in young animals (2 to 12 month-old) from dairy herds. The main microscopic changes included bronchointerstitial pneumonia characterized by thickening of alveolar septae adjacent to airways by mononuclear cell infiltrates, and the presence of alveolar syncytial giant cells. In summary, the results demonstrate the suitability of the immunodetection of viral antigens in routinely fixed tissue specimens as a diagnostic tool for BRSV infection. Moreover, the findings provide further evidence of the importance of BRSV as a respiratory pathogen of young cattle in southeastern and southern Brazil.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

This doctoral dissertation investigates the adult education policy of the European Union (EU) in the framework of the Lisbon agenda 2000–2010, with a particular focus on the changes of policy orientation that occurred during this reference decade. The year 2006 can be considered, in fact, a turning point for the EU policy-making in the adult learning sector: a radical shift from a wide--ranging and comprehensive conception of educating adults towards a vocationally oriented understanding of this field and policy area has been observed, in particular in the second half of the so--called ‘Lisbon decade’. In this light, one of the principal objectives of the mainstream policy set by the Lisbon Strategy, that of fostering all forms of participation of adults in lifelong learning paths, appears to have muted its political background and vision in a very short period of time, reflecting an underlying polarisation and progressive transformation of European policy orientations. Hence, by means of content analysis and process tracing, it is shown that the new target of the EU adult education policy, in this framework, has shifted from citizens to workers, and the competence development model, borrowed from the corporate sector, has been established as the reference for the new policy road maps. This study draws on the theory of governance architectures and applies a post-ontological perspective to discuss whether the above trends are intrinsically due to the nature of the Lisbon Strategy, which encompasses education policies, and to what extent supranational actors and phenomena such as globalisation influence the European governance and decision--making. Moreover, it is shown that the way in which the EU is shaping the upgrading of skills and competences of adult learners is modeled around the needs of the ‘knowledge economy’, thus according a great deal of importance to the ‘new skills for new jobs’ and perhaps not enough to life skills in its broader sense which include, for example, social and civic competences: these are actually often promoted but rarely implemented in depth in the EU policy documents. In this framework, it is conveyed how different EU policy areas are intertwined and interrelated with global phenomena, and it is emphasised how far the building of the EU education systems should play a crucial role in the formation of critical thinking, civic competences and skills for a sustainable democratic citizenship, from which a truly cohesive and inclusive society fundamentally depend, and a model of environmental and cosmopolitan adult education is proposed in order to address the challenges of the new millennium. In conclusion, an appraisal of the EU’s public policy, along with some personal thoughts on how progress might be pursued and actualised, is outlined.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

The usage of digital content, such as video clips and images, has increased dramatically during the last decade. Local image features have been applied increasingly in various image and video retrieval applications. This thesis evaluates local features and applies them to image and video processing tasks. The results of the study show that 1) the performance of different local feature detector and descriptor methods vary significantly in object class matching, 2) local features can be applied in image alignment with superior results against the state-of-the-art, 3) the local feature based shot boundary detection method produces promising results, and 4) the local feature based hierarchical video summarization method shows promising new new research direction. In conclusion, this thesis presents the local features as a powerful tool in many applications and the imminent future work should concentrate on improving the quality of the local features.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Presentation at Open Repositories 2014, Helsinki, Finland, June 9-13, 2014

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Calves born persistently infected with non-cytopathic bovine viral diarrhea virus (ncpBVDV) frequently develop a fatal gastroenteric illness called mucosal disease. Both the original virus (ncpBVDV) and an antigenically identical but cytopathic virus (cpBVDV) can be isolated from animals affected by mucosal disease. Cytopathic BVDVs originate from their ncp counterparts by diverse genetic mechanisms, all leading to the expression of the non-structural polypeptide NS3 as a discrete protein. In contrast, ncpBVDVs express only the large precursor polypeptide, NS2-3, which contains the NS3 sequence within its carboxy-terminal half. We report here the investigation of the mechanism leading to NS3 expression in 41 cpBVDV isolates. An RT-PCR strategy was employed to detect RNA insertions within the NS2-3 gene and/or duplication of the NS3 gene, two common mechanisms of NS3 expression. RT-PCR amplification revealed insertions in the NS2-3 gene of three cp isolates, with the inserts being similar in size to that present in the cpBVDV NADL strain. Sequencing of one such insert revealed a 296-nucleotide sequence with a central core of 270 nucleotides coding for an amino acid sequence highly homologous (98%) to the NADL insert, a sequence corresponding to part of the cellular J-Domain gene. One cpBVDV isolate contained a duplication of the NS3 gene downstream from the original locus. In contrast, no detectable NS2-3 insertions or NS3 gene duplications were observed in the genome of 37 cp isolates. These results demonstrate that processing of NS2-3 without bulk mRNA insertions or NS3 gene duplications seems to be a frequent mechanism leading to NS3 expression and BVDV cytopathology.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

The thesis examines the phenomenon most commonly known as “ayahuasca tourism” – i.e. the practice of westerners traveling to South America and partaking in ceremonies in which a powerful entheogenic brew, ayahuasca, is consumed. While this popular phenomenon has been steadily increasing during the last decades, it has, however, been insufficiently studied by scholars. An important question which has not been properly addressed in earlier studies is how ayahuasca tourism relates to the wider occurrence of travel and how it should be perceived with reference to the theoretical frameworks on the subject of travel. Drawing on theories regarding pilgrimage and tourism, the main purpose of this thesis is to examine the relationship between ayahuasca tourism and the broader spectrum of travel. In particular, the study tests the designations “pilgrimage”, “religious tourism” and “spiritual tourism” with reference to ayahuasca tourism. Utilizing earlier literature as well as ayahuasca tourists‟ reports obtained from an Internet forum as a basis for analysis, I search for a suitable terminology to be used for the phenomenon. The study lays special emphasis on the protagonists‟ motivations, experiences and outcomes in order to take note of various aspects of the wide-ranging occurrence of ayahuasca tourism. Key findings indicate that ayahuasca tourism is best understood as a combination of pilgrimage and tourism. On the basis of the analysis I argue that ayahuasca tourism should be labeled as “pilgrimage” and/or “spiritual tourism”, and the tourists respectively as “pilgrims” and/or “spiritual tourists”. The category of “religious tourism/tourist”, on the other hand, turns out to be an inappropriate designation when describing the phenomenon. In general, through my study I show that the results are consistent with the present trend in the study of travel to perceive pilgrimage and tourism as theoretically similar phenomena. The study of ayahuasca tourism serves thus as living proof of contemporary travel, in which the categories of pilgrimage and tourism are often indistinguishable. I suggest that ayahuasca tourism is by no means exceptional on this point, but can rather be used as an illustration of modern travel forms on a general level. Thus, the present study does not only add to the research of ayahuasca tourism, but also provides additional insights into the study of travel.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

The SEARCH-RIO study prospectively investigated electrocardiogram (ECG)-derived variables in chronic Chagas disease (CCD) as predictors of cardiac death and new onset ventricular tachycardia (VT). Cardiac arrhythmia is a major cause of death in CCD, and electrical markers may play a significant role in risk stratification. One hundred clinically stable outpatients with CCD were enrolled in this study. They initially underwent a 12-lead resting ECG, signal-averaged ECG, and 24-h ambulatory ECG. Abnormal Q-waves, filtered QRS duration, intraventricular electrical transients (IVET), 24-h standard deviation of normal RR intervals (SDNN), and VT were assessed. Echocardiograms assessed left ventricular ejection fraction. Predictors of cardiac death and new onset VT were identified in a Cox proportional hazard model. During a mean follow-up of 95.3 months, 36 patients had adverse events: 22 new onset VT (mean±SD, 18.4±4‰/year) and 20 deaths (26.4±1.8‰/year). In multivariate analysis, only Q-wave (hazard ratio, HR=6.7; P<0.001), VT (HR=5.3; P<0.001), SDNN<100 ms (HR=4.0; P=0.006), and IVET+ (HR=3.0; P=0.04) were independent predictors of the composite endpoint of cardiac death and new onset VT. A prognostic score was developed by weighting points proportional to beta coefficients and summing-up: Q-wave=2; VT=2; SDNN<100 ms=1; IVET+=1. Receiver operating characteristic curve analysis optimized the cutoff value at >1. In 10,000 bootstraps, the C-statistic of this novel score was non-inferior to a previously validated (Rassi) score (0.89±0.03 and 0.80±0.05, respectively; test for non-inferiority: P<0.001). In CCD, surface ECG-derived variables are predictors of cardiac death and new onset VT.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Ontology matching is an important task when data from multiple data sources is integrated. Problems of ontology matching have been studied widely in the researchliterature and many different solutions and approaches have been proposed alsoin commercial software tools. In this survey, well-known approaches of ontologymatching, and its subtype schema matching, are reviewed and compared. The aimof this report is to summarize the knowledge about the state-of-the-art solutionsfrom the research literature, discuss how the methods work on different application domains, and analyze pros and cons of different open source and academic tools inthe commercial world.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

The genetic and environmental risk factors of vascular cognitive impairment are still largely unknown. This thesis aimed to assess the genetic background of two clinically similar familial small vessel diseases (SVD), CADASIL (Cerebral Autosomal Dominant Arteriopathy with Subcortical Infarcts and Leukoencephalopathy) and Swedish hMID (hereditary multi-infarct dementia of Swedish type). In the first study, selected genetic modifiers of CADASIL were studied in a homogenous Finnish CADASIL population of 134 patients, all carrying the p.Arg133Cys mutation in NOTCH3. Apolipoprotein E (APOE) genotypes, angiotensinogen (AGT) p.Met268Thr polymorphism and eight NOTCH3 polymorphisms were studied, but no associations between any particular genetic variant and first-ever stroke or migraine were seen. In the second study, smoking, statin medication and physical activity were suggested to be the most profound environmental differences among the monozygotic twins with CADASIL. Swedish hMID was for long misdiagnosed as CADASIL. In the third study, the CADASIL diagnosis in the Swedish hMID family was ruled out on the basis of genetic, radiological and pathological findings, and Swedish hMID was suggested to represent a novel SVD. In the fourth study, the gene defect of Swedish hMID was then sought using whole exome sequencing paired with a linkage analysis. The strongest candidate for the pathogenic mutation was a 3’UTR variant in the COL4A1 gene, but further studies are needed to confirm its functionality. This study provided new information about the genetic background of two inherited SVDs. Profound knowledge about the pathogenic mutations causing familial SVD is also important for correct diagnosis and treatment options.