Biblioteca Digital

967 resultados para Cartes-Col·lections

The efficiency of corpus-based distributional models for literature-based discovery on large data sets

Relevância:

10.00% 10.00%

Publicador:

Resumo:

This paper evaluates the efficiency of a number of popular corpus-based distributional models in performing discovery on very large document sets, including online collections. Literature-based discovery is the process of identifying previously unknown connections from text, often published literature, that could lead to the development of new techniques or technologies. Literature-based discovery has attracted growing research interest ever since Swanson's serendipitous discovery of the therapeutic effects of fish oil on Raynaud's disease in 1986. The successful application of distributional models in automating the identification of indirect associations underpinning literature-based discovery has been heavily demonstrated in the medical domain. However, we wish to investigate the computational complexity of distributional models for literature-based discovery on much larger document collections, as they may provide computationally tractable solutions to tasks including, predicting future disruptive innovations. In this paper we perform a computational complexity analysis on four successful corpus-based distributional models to evaluate their fit for such tasks. Our results indicate that corpus-based distributional models that store their representations in fixed dimensions provide superior efficiency on literature-based discovery tasks.

Saliva-derived DNA performs well in large-scale, high-density single-nucleotide polymorphism microarray studies

Relevância:

10.00% 10.00%

Publicador:

Resumo:

As of June 2009, 361 genome-wide association studies (GWAS) had been referenced by the HuGE database. GWAS require DNA from many thousands of individuals, relying on suitable DNA collections. We recently performed a multiple sclerosis (MS) GWAS where a substantial component of the cases (24%) had DNA derived from saliva. Genotyping was done on the Illumina genotyping platform using the Infinium Hap370CNV DUO microarray. Additionally, we genotyped 10 individuals in duplicate using both saliva- and blood-derived DNA. The performance of blood- versus saliva-derived DNA was compared using genotyping call rate, which reflects both the quantity and quality of genotyping per sample and the “GCScore,” an Illumina genotyping quality score, which is a measure of DNA quality. We also compared genotype calls and GCScores for the 10 sample pairs. Call rates were assessed for each sample individually. For the GWAS samples, we compared data according to source of DNA and center of origin. We observed high concordance in genotyping quality and quantity between the paired samples and minimal loss of quality and quantity of DNA in the saliva samples in the large GWAS sample, with the blood samples showing greater variation between centers of origin. This large data set highlights the usefulness of saliva DNA for genotyping, especially in high-density single-nucleotide polymorphism microarray studies such as GWAS.

Not losing the plot : creating, collecting and curating qualitative data through a web-based application

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Collecting regular personal reflections from first year teachers in rural and remote schools is challenging as they are busily absorbed in their practice, and separated from each other and the researchers by thousands of kilometres. In response, an innovative web-based solution was designed to both collect data and be a responsive support system for early career teachers as they came to terms with their new professional identities within rural and remote school settings. Using an emailed link to a web-based application named goingok.com, the participants are charting their first year plotlines using a sliding scale from ‘distressed’, ‘ok’ to ‘soaring’ and describing their self-assessment in short descriptive posts. These reflections are visible to the participants as a developing online journal, while the collections of de-identified developing plotlines are visible to the research team, alongside numerical data. This paper explores important aspects of the design process, together with the challenges and opportunities encountered in its implementation. A number of the key considerations for choosing to develop a web application for data collection are initially identified, and the resultant application features and scope are then examined. Examples are then provided about how a responsive software development approach can be part of a supportive feedback loop for participants while being an effective data collection process. Opportunities for further development are also suggested with projected implications for future research.

XML clustering and its application to XML transformation

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The continuous growth of the XML data poses a great concern in the area of XML data management. The need for processing large amounts of XML data brings complications to many applications, such as information retrieval, data integration and many others. One way of simplifying this problem is to break the massive amount of data into smaller groups by application of clustering techniques. However, XML clustering is an intricate task that may involve the processing of both the structure and the content of XML data in order to identify similar XML data. This research presents four clustering methods, two methods utilizing the structure of XML documents and the other two utilizing both the structure and the content. The two structural clustering methods have different data models. One is based on a path model and other is based on a tree model. These methods employ rigid similarity measures which aim to identifying corresponding elements between documents with different or similar underlying structure. The two clustering methods that utilize both the structural and content information vary in terms of how the structure and content similarity are combined. One clustering method calculates the document similarity by using a linear weighting combination strategy of structure and content similarities. The content similarity in this clustering method is based on a semantic kernel. The other method calculates the distance between documents by a non-linear combination of the structure and content of XML documents using a semantic kernel. Empirical analysis shows that the structure-only clustering method based on the tree model is more scalable than the structure-only clustering method based on the path model as the tree similarity measure for the tree model does not need to visit the parents of an element many times. Experimental results also show that the clustering methods perform better with the inclusion of the content information on most test document collections. To further the research, the structural clustering method based on tree model is extended and employed in XML transformation. The results from the experiments show that the proposed transformation process is faster than the traditional transformation system that translates and converts the source XML documents sequentially. Also, the schema matching process of XML transformation produces a better matching result in a shorter time.

Hospital separations in the Northern Territory for varicella-zoster virus related illnesses, 1993-1997

Relevância:

10.00% 10.00%

Publicador:

Resumo:

A varicella-zoster virus (VZV) vaccine is available overseas, and universal immunisation in childhood is recommended in the United States.1 Any decision to introduce the vaccine to Australia must be based on an assessment of potential benefits and harms. While there has been some assessment of VZV significance in populations in southern Australia,2 the impact on the NT population is not known. It is not a notifiable condition and information on morbidity and mortality is limited to a few data collections. These are hospital separation data, deaths registers, and in 1995 the inclusion of VZV congenital and neonatal complications in the Australian Paediatric Surveillance System. Hospital separation data were analysed to assess the importance of VZV as a cause of severe morbidity and mortality in the NT population.

Chinese fashion designers in Shanghai : a new perspective

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Shanghai possesses an apt legacy, once referred to as “Paris of the East”. Municipal aspirations for Shanghai to assume a position among the great fashion cities of the world have been integrated in the recent re-shaping of this modern city into a role model for Chinese creative enterprise yet China is still known primarily as centre of clothing production. Increasingly however, “Made in China” is being replaced by “Created in China” drawing attention to two distinct consumer markets for Chinese designers. Fashion designers who have entered the global fashion system for education or by showing their collections have generally adopted a design aesthetic that aligns with Western markets, allowing little competitive advantage. In contrast, Chinese designers who rest their attention on the domestic Chinese market find a disparate, highly competitive marketplace. The pillars of authenticity that for foreign fashion brands extend far into their cultural and creative histories, often for many decades in the case of Louis Vuitton, Hermes and Christian Dior do not yet exist in China in this era of rapid globalisation. Here, the cultural bedrock allows these same pillars to extend only thirty years or so into the past reaching the moments when Deng Xiaoping granted China’s creative entrepreneurs passage. To this end, interviews with fashion designers in Shanghai have been undertaken during the last twelve months for a PhD dissertation. Production of culture theory has been used to identify working methods, practices of production and the social and cultural milieu necessary for designers to achieve viability. Preliminary findings indicate that some fashion designers have adopted an as-yet unexplored strategy of business and brand development with a distinct Chinese aesthetic at its core, in contrast to the clichéd cultural iconography often viewed by Western viewers as representative of Chinese creativity.

Ephemerality, affect and the "Art Dump"

Relevância:

10.00% 10.00%

Publicador:

Resumo:

This work was composed in relation to the author's research of the popularity of themes of ephemerality and affect in recent global art. This focus correlated with Chicks on Speed's ongoing inquiries into issues of collections and collecting in the artworld, articulated as 'the art dump' by the group. This work was subsequently performed as a contribution to a performance with international multidisciplinary group Chicks on Speed as a part of their residency during MONA FOMA in Tasmania.

Construct design for efficient, effective and high-throughput gene silencing in plants

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Post-transcriptional silencing of plant genes using anti-sense or co-suppression constructs usually results in only a modest proportion of silenced individuals. Recent work has demonstrated the potential for constructs encoding self-complementary 'hairpin' RNA (hpRNA) to efficiently silence genes. In this study we examine design rules for efficient gene silencing, in terms of both the proportion of independent transgenic plants showing silencing, and the degree of silencing. Using hpRNA constructs containing sense/anti-sense arms ranging from 98 to 853 nt gave efficient silencing in a wide range of plant species, and inclusion of an intron in these constructs had a consistently enhancing effect. Intron-containing constructs (ihpRNA) generally gave 90-100% of independent transgenic plants showing silencing. The degree of silencing with these constructs was much greater than that obtained using either co-suppression or anti-sense constructs. We have made a generic vector, pHANNIBAL, that allows a simple, single PCR product from a gene of interest to be easily converted into a highly effective ihpRNA silencing construct. We have also created a high-throughput vector, pHELLSGATE, that should facilitate the cloning of gene libraries or large numbers of defined genes, such as those in EST collections, using an in vitro recombinase system. This system may facilitate the large-scale determination and discovery of plant gene functions in the same way as RNAi is being used to examine gene function in Caenorhabditis elegans.

Chinese fashion designers in Shanghai : identifying a fresh perspective about their role in the world order of fashion

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Shanghai possesses an apt legacy, once referred to as “Paris of the East”. Municipal aspirations for Shanghai to assume a position among the great fashion cities of the world have been integrated in the recent re-shaping of this modern city into a role model for Chinese creative enterprise yet China is still known primarily as centre of clothing production. Increasingly however, “Made in China” is being replaced by “Created in China” drawing attention to two distinct consumer markets for Chinese designers. Fashion designers who have entered the global fashion system for education or by showing their collections have generally adopted a design aesthetic that aligns with Western markets, allowing little competitive advantage. In contrast, Chinese designers who rest their attention on the domestic Chinese market find a disparate, highly competitive marketplace. The pillars of authenticity that for foreign fashion brands extend far into their cultural and creative histories, often for many decades in the case of Louis Vuitton, Hermes and Christian Dior do not yet exist in China in this era of rapid globalisation. Here, the cultural bedrock allows these same pillars to extend only thirty years or so into the past reaching the moments when Deng Xiaoping granted China’s creative entrepreneurs passage. To this end, interviews with fashion designers in Shanghai have been undertaken during the last twelve months for a PhD dissertation. Production of culture theory has been used to identify working methods, practices of production and the social and cultural milieu necessary for designers to achieve viability.

The case of LLACE : challenges, triumphs, and lessons of a community archives

Relevância:

10.00% 10.00%

Publicador:

Resumo:

This article uses the Lavender Library, Archives, and Cultural Exchange of Sacramento, Incorporated, a small queer community archives in Northern California, as a case study for expanding our knowledge of community archives and issues of archival practice. It explores why creating a separate community archives was necessary, the role of community members in founding and maintaining the archives, the development of its collections, and the ongoing challenges community archives face. The article also considers the implications community archives have for professional practice, particularly in the areas of collecting, description, and collaboration.

Efficient top-k retrieval with signatures

Relevância:

10.00% 10.00%

Publicador:

Resumo:

This paper describes a new method of indexing and searching large binary signature collections to efficiently find similar signatures, addressing the scalability problem in signature search. Signatures offer efficient computation with acceptable measure of similarity in numerous applications. However, performing a complete search with a given search argument (a signature) requires a Hamming distance calculation against every signature in the collection. This quickly becomes excessive when dealing with large collections, presenting issues of scalability that limit their applicability. Our method efficiently finds similar signatures in very large collections, trading memory use and precision for greatly improved search speed. Experimental results demonstrate that our approach is capable of finding a set of nearest signatures to a given search argument with a high degree of speed and fidelity.

Overview of INEX 2013

Relevância:

10.00% 10.00%

Publicador:

Resumo:

INEX investigates focused retrieval from structured documents by providing large test collections of structured documents, uniform evaluation measures, and a forum for organizations to compare their results. This paper reports on the INEX 2013 evaluation campaign, which consisted of four activities addressing three themes: searching professional and user generated data (Social Book Search track); searching structured or semantic data (Linked Data track); and focused retrieval (Snippet Retrieval and Tweet Contextualization tracks). INEX 2013 was an exciting year for INEX in which we consolidated the collaboration with (other activities in) CLEF and for the second time ran our workshop as part of the CLEF labs in order to facilitate knowledge transfer between the evaluation forums. This paper gives an overview of all the INEX 2013 tracks, their aims and task, the built test-collections, and gives an initial analysis of the results

A web-based approach to data imputation

Relevância:

10.00% 10.00%

Publicador:

Resumo:

In this paper, we present WebPut, a prototype system that adopts a novel web-based approach to the data imputation problem. Towards this, Webput utilizes the available information in an incomplete database in conjunction with the data consistency principle. Moreover, WebPut extends effective Information Extraction (IE) methods for the purpose of formulating web search queries that are capable of effectively retrieving missing values with high accuracy. WebPut employs a confidence-based scheme that efficiently leverages our suite of data imputation queries to automatically select the most effective imputation query for each missing value. A greedy iterative algorithm is proposed to schedule the imputation order of the different missing values in a database, and in turn the issuing of their corresponding imputation queries, for improving the accuracy and efficiency of WebPut. Moreover, several optimization techniques are also proposed to reduce the cost of estimating the confidence of imputation queries at both the tuple-level and the database-level. Experiments based on several real-world data collections demonstrate not only the effectiveness of WebPut compared to existing approaches, but also the efficiency of our proposed algorithms and optimization techniques.

Knowledge unlatched : encouraging innovation in academic publishing through collective action and Open Access licensing

Relevância:

10.00% 10.00%

Publicador:

Resumo:

This case study examines the way in which Knowledge Unlatched is combining collective action and open access licenses to encourage innovation in markets for specialist academic books. Knowledge Unlatched is a not for profit organisation that has been established to help a global community of libraries coordinate their book purchasing activities more effectively and, in so doing, to ensure that books librarians select for their own collections become available for free for anyone in the world to read. The Knowledge Unlatched model is an attempt to re-coordinate a market in order to facilitate a transition to digitally appropriate publishing models that include open access. It offers librarians an opportunity to facilitate the open access publication of books that their own readers would value access to. It provides publishers with a stable income stream on titles selected by libraries, as well as an ability to continue selling books to a wider market on their own terms. Knowledge Unlatched provides a rich case study for researchers and practitioners interested in understanding how innovations in procurement practices can be used to stimulate more effective, equitable markets for socially valuable products.

Optimization of an integrated model for automatic reduction and expansion of long queries

Relevância:

10.00% 10.00%

Publicador:

Resumo:

A long query provides more useful hints for searching relevant documents, but it is likely to introduce noise which affects retrieval performance. In order to smooth such adverse effect, it is important to reduce noisy terms, introduce and boost additional relevant terms. This paper presents a comprehensive framework, called Aspect Hidden Markov Model (AHMM), which integrates query reduction and expansion, for retrieval with long queries. It optimizes the probability distribution of query terms by utilizing intra-query term dependencies as well as the relationships between query terms and words observed in relevance feedback documents. Empirical evaluation on three large-scale TREC collections demonstrates that our approach, which is automatic, achieves salient improvements over various strong baselines, and also reaches a comparable performance to a state of the art method based on user’s interactive query term reduction and expansion.

«
1
2
...
57
58
59
60
61
62
63
64
65
»