985 resultados para INFORMATION DISCOVERY


Relevância:

30.00% 30.00%

Publicador:

Resumo:

A total of 10446 expressed sequence tags (ESTs) are obtained by a large-scale sequencing of a cDNA library from cephalothorax of adult Fenneropenaeus chinensis. An EST analysis platform was built up based on local computers and bioinformatic techniques were used to annotate these ESTs in order to promptly find possible functional genes, especially for immune related factors. About 4% of the ESTs show similarity to the coding sequences of such factors, including lectin, serine protease, serpin, lysozyme, etc. These ESTs provide a partial profile of the immune system in F. chinensis and useful information for further study on these genes.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

TYPICAL is a package for describing and making automatic inferences about a broad class of SCHEME predicate functions. These functions, called types following popular usage, delineate classes of primitive SCHEME objects, composite data structures, and abstract descriptions. TYPICAL types are generated by an extensible combinator language from either existing types or primitive terminals. These generated types are located in a lattice of predicate subsumption which captures necessary entailment between types; if satisfaction of one type necessarily entail satisfaction of another, the first type is below the second in the lattice. The inferences make by TYPICAL computes the position of the new definition within the lattice and establishes it there. This information is then accessible to both later inferences and other programs (reasoning systems, code analyzers, etc) which may need the information for their own purposes. TYPICAL was developed as a representation language for the discovery program Cyrano; particular examples are given of TYPICAL's application in the Cyrano program.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Classifying novel terrain or objects front sparse, complex data may require the resolution of conflicting information from sensors working at different times, locations, and scales, and from sources with different goals and situations. Information fusion methods can help resolve inconsistencies, as when evidence variously suggests that an object's class is car, truck, or airplane. The methods described here consider a complementary problem, supposing that information from sensors and experts is reliable though inconsistent, as when evidence suggests that an object's class is car, vehicle, and man-made. Underlying relationships among objects are assumed to be unknown to the automated system or the human user. The ARTMAP information fusion system used distributed code representations that exploit the neural network's capacity for one-to-many learning in order to produce self-organizing expert systems that discover hierarchical knowledge structures. The system infers multi-level relationships among groups of output classes, without any supervised labeling of these relationships.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

MOTIVATION: Technological advances that allow routine identification of high-dimensional risk factors have led to high demand for statistical techniques that enable full utilization of these rich sources of information for genetics studies. Variable selection for censored outcome data as well as control of false discoveries (i.e. inclusion of irrelevant variables) in the presence of high-dimensional predictors present serious challenges. This article develops a computationally feasible method based on boosting and stability selection. Specifically, we modified the component-wise gradient boosting to improve the computational feasibility and introduced random permutation in stability selection for controlling false discoveries. RESULTS: We have proposed a high-dimensional variable selection method by incorporating stability selection to control false discovery. Comparisons between the proposed method and the commonly used univariate and Lasso approaches for variable selection reveal that the proposed method yields fewer false discoveries. The proposed method is applied to study the associations of 2339 common single-nucleotide polymorphisms (SNPs) with overall survival among cutaneous melanoma (CM) patients. The results have confirmed that BRCA2 pathway SNPs are likely to be associated with overall survival, as reported by previous literature. Moreover, we have identified several new Fanconi anemia (FA) pathway SNPs that are likely to modulate survival of CM patients. AVAILABILITY AND IMPLEMENTATION: The related source code and documents are freely available at https://sites.google.com/site/bestumich/issues. CONTACT: yili@umich.edu.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This study examines the relation between selection power and selection labor for information retrieval (IR). It is the first part of the development of a labor theoretic approach to IR. Existing models for evaluation of IR systems are reviewed and the distinction of operational from experimental systems partly dissolved. The often covert, but powerful, influence from technology on practice and theory is rendered explicit. Selection power is understood as the human ability to make informed choices between objects or representations of objects and is adopted as the primary value for IR. Selection power is conceived as a property of human consciousness, which can be assisted or frustrated by system design. The concept of selection power is further elucidated, and its value supported, by an example of the discrimination enabled by index descriptions, the discovery of analogous concepts in partly independent scholarly and wider public discourses, and its embodiment in the design and use of systems. Selection power is regarded as produced by selection labor, with the nature of that labor changing with different historical conditions and concurrent information technologies. Selection labor can itself be decomposed into description and search labor. Selection labor and its decomposition into description and search labor will be treated in a subsequent article, in a further development of a labor theoretic approach to information retrieval.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The cysteine protease cathepsin S (CatS) is involved in the pathogenesis of autoimmune disorders, atherosclerosis, and obesity. Therefore, it represents a promising pharmacological target for drug development. We generated ligand-based and structure-based pharmacophore models for noncovalent and covalent CatS inhibitors to perform virtual high-throughput screening of chemical databases in order to discover novel scaffolds for CatS inhibitors. An in vitro evaluation of the resulting 15 structures revealed seven CatS inhibitors with kinetic constants in the low micromolar range. These compounds can be subjected to further chemical modifications to obtain drugs for the treatment of autoimmune disorders and atherosclerosis.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The G-protein-coupled receptor free fatty acid receptor 1 (FFAR1), previously named GPR40, is a possible novel target for the treatment of type 2 diabetes. In an attempt to identify new ligands for this receptor, we performed virtual screening (VS) based on two-dimensional (2D) similarity, three-dimensional (3D) pharmacophore searches, and docking studies by using the structure of known agonists and our model of the ligand binding site, which was validated by mutagenesis. VS of a database of 2.6 million compounds followed by extraction of structural neighbors of functionally confirmed hits resulted in identification of 15 compounds active at FFAR1 either as full agonists, partial agonists, or pure antagonists. Site-directed mutagenesis and docking studies revealed different patterns of ligand-receptor interactions and provided important information on the role of specific amino acids in binding and activation of FFAR1.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This paper identifies and analyses the means of accessing and collecting foreign-based evidence in transnational antitrust cases. It makes an original contribution to the existing scholarship by critically addressing the available mechanisms of judicial cooperation, the possibility of reliance on domestic discovery in transnational context, as well as the existing instruments allowing for cooperation between antitrust agencies. It identifies the shortcomings of the current regulatory framework and points out to the existing good practices in those jurisdictions which provide their antitrust agencies with more leeway in sharing confidential information with foreign counterparts.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Modern cancer research on prognostic and predictive biomarkers demands the integration of established and emerging high-throughput technologies. However, these data are meaningless unless carefully integrated with patient clinical outcome and epidemiological information. Integrated datasets hold the key to discovering new biomarkers and therapeutic targets in cancer. We have developed a novel approach and set of methods for integrating and interrogating phenomic, genomic and clinical data sets to facilitate cancer biomarker discovery and patient stratification. Applied to a known paradigm, the biological and clinical relevance of TP53, PICan was able to recapitulate the known biomarker status and prognostic significance at a DNA, RNA and protein levels.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Learning or writing regular expressions to identify instances of a specific
concept within text documents with a high precision and recall is challenging.
It is relatively easy to improve the precision of an initial regular expression
by identifying false positives covered and tweaking the expression to avoid the
false positives. However, modifying the expression to improve recall is difficult
since false negatives can only be identified by manually analyzing all documents,
in the absence of any tools to identify the missing instances. We focus on partially
automating the discovery of missing instances by soliciting minimal user
feedback. We present a technique to identify good generalizations of a regular
expression that have improved recall while retaining high precision. We empirically
demonstrate the effectiveness of the proposed technique as compared to
existing methods and show results for a variety of tasks such as identification of
dates, phone numbers, product names, and course numbers on real world datasets

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The rapid evolution and proliferation of a world-wide computerized network, the Internet, resulted in an overwhelming and constantly growing amount of publicly available data and information, a fact that was also verified in biomedicine. However, the lack of structure of textual data inhibits its direct processing by computational solutions. Information extraction is the task of text mining that intends to automatically collect information from unstructured text data sources. The goal of the work described in this thesis was to build innovative solutions for biomedical information extraction from scientific literature, through the development of simple software artifacts for developers and biocurators, delivering more accurate, usable and faster results. We started by tackling named entity recognition - a crucial initial task - with the development of Gimli, a machine-learning-based solution that follows an incremental approach to optimize extracted linguistic characteristics for each concept type. Afterwards, Totum was built to harmonize concept names provided by heterogeneous systems, delivering a robust solution with improved performance results. Such approach takes advantage of heterogenous corpora to deliver cross-corpus harmonization that is not constrained to specific characteristics. Since previous solutions do not provide links to knowledge bases, Neji was built to streamline the development of complex and custom solutions for biomedical concept name recognition and normalization. This was achieved through a modular and flexible framework focused on speed and performance, integrating a large amount of processing modules optimized for the biomedical domain. To offer on-demand heterogenous biomedical concept identification, we developed BeCAS, a web application, service and widget. We also tackled relation mining by developing TrigNER, a machine-learning-based solution for biomedical event trigger recognition, which applies an automatic algorithm to obtain the best linguistic features and model parameters for each event type. Finally, in order to assist biocurators, Egas was developed to support rapid, interactive and real-time collaborative curation of biomedical documents, through manual and automatic in-line annotation of concepts and relations. Overall, the research work presented in this thesis contributed to a more accurate update of current biomedical knowledge bases, towards improved hypothesis generation and knowledge discovery.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The purpose of this article is to investigate the involvement of Information and Learning Services staff in the delivery of the Research Training Programme at the University of Worcester, UK with a focus on researcher receptivity. I believe that by constantly reflecting on the development of that part of the programme delivered by ILS and by examining feedback from the sessions, it is possible to improve and increase the level of researcher receptivity. It is hoped that such examination and reflection will be of value and relevance to the IL community since by reflecting on success and failure in a local context and by mapping this reflection to existing research enables librarians to improve the support provided to researchers within their institutions. This article outlines the support given to research students at the University of Worcester in the past, examines the changes leading to present programme delivery and reflects on considerations for future support. The article is underpinned by reference to current research undertaken in international (albeit Western-centric) contexts. I note that the rationale behind changes is embedded in current adult learning and teaching theory. In an increasingly competitive research environment where funding is dependent on a statistically monitored research output, the aim of such support is to integrate any IL contribution into the wider research training programme. Thus resource discovery becomes part of the reflexive research cycle. Implicit in this investigative reflection is the desire of the IL community to constantly strive towards the positive reception of IL into research support programmes which are perceived by researchers as highly valuable to the process and progress of their work.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Tese de doutoramento, Biologia (Biologia Molecular), Universidade de Lisboa, Faculdade de Ciências, 2015

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This thesis entitled “The right to freedom of information in india”.In a democracy, the citizens being the persons to choose their own governors, the right to know from the Government is a pre-condition for a properly evaluated election. Freedom of speech and expression, one of the repositories of self~government, forms the basis for the right to know in a wider scale. The functions which the free speech rights serve in a society also emphasize the need for more openness in the functioning of a democracy.Maintanance of law and order and investigation of crimes are highly important in a country like India, where no risk may be taken on account of the public‘s right to know. The Indian situations relating terrorist activities, riots based on language, region, religion and caste are important in this respect. The right to know of the citizens may be regulated in the interests of secrecy required in these areas.On the basis of the conclusions reached in this study, a draft Bill has been proposed for the passing of an Access to Public Documents Act. This Bill is appended to this Thesis.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The need to structure knowledge is as important now as it ever has been. This paper has tried to study the ISP knowledge portal to explore how knowledge on various resources and topics in photonics and related areas are organized in the knowledge portal of International School of Photonics, CUSAT. The study revealed that ISP knowledge portal is one of the best portals in the filed. It provides a model for building an effective knowledge portal in other fields