129 resultados para Databases - Duplicate tuples


Relevância:

10.00% 10.00%

Publicador:

Resumo:

Abstract Since its creation, the Internet has permeated our daily life. The web is omnipresent for communication, research and organization. This exploitation has resulted in the rapid development of the Internet. Nowadays, the Internet is the biggest container of resources. Information databases such as Wikipedia, Dmoz and the open data available on the net are a great informational potentiality for mankind. The easy and free web access is one of the major feature characterizing the Internet culture. Ten years earlier, the web was completely dominated by English. Today, the web community is no longer only English speaking but it is becoming a genuinely multilingual community. The availability of content is intertwined with the availability of logical organizations (ontologies) for which multilinguality plays a fundamental role. In this work we introduce a very high-level logical organization fully based on semiotic assumptions. We thus present the theoretical foundations as well as the ontology itself, named Linguistic Meta-Model. The most important feature of Linguistic Meta-Model is its ability to support the representation of different knowledge sources developed according to different underlying semiotic theories. This is possible because mast knowledge representation schemata, either formal or informal, can be put into the context of the so-called semiotic triangle. In order to show the main characteristics of Linguistic Meta-Model from a practical paint of view, we developed VIKI (Virtual Intelligence for Knowledge Induction). VIKI is a work-in-progress system aiming at exploiting the Linguistic Meta-Model structure for knowledge expansion. It is a modular system in which each module accomplishes a natural language processing task, from terminology extraction to knowledge retrieval. VIKI is a supporting system to Linguistic Meta-Model and its main task is to give some empirical evidence regarding the use of Linguistic Meta-Model without claiming to be thorough.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Levels of circulating glucose are tightly regulated. To identify new loci influencing glycemic traits, we performed meta-analyses of 21 genome-wide association studies informative for fasting glucose, fasting insulin and indices of beta-cell function (HOMA-B) and insulin resistance (HOMA-IR) in up to 46,186 nondiabetic participants. Follow-up of 25 loci in up to 76,558 additional subjects identified 16 loci associated with fasting glucose and HOMA-B and two loci associated with fasting insulin and HOMA-IR. These include nine loci newly associated with fasting glucose (in or near ADCY5, MADD, ADRA2A, CRY2, FADS1, GLIS3, SLC2A2, PROX1 and C2CD4B) and one influencing fasting insulin and HOMA-IR (near IGF1). We also demonstrated association of ADCY5, PROX1, GCK, GCKR and DGKB-TMEM195 with type 2 diabetes. Within these loci, likely biological candidate genes influence signal transduction, cell proliferation, development, glucose-sensing and circadian regulation. Our results demonstrate that genetic studies of glycemic traits can identify type 2 diabetes risk loci, as well as loci containing gene variants that are associated with a modest elevation in glucose levels but are not associated with overt diabetes.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Objectives -  Treatment of established status epilepticus (SE) requires immediate intravenous anticonvulsant therapy. Currently used first-line drugs may cause potentially hazardous side effects. We aimed to assess the efficacy and safety of intravenous lacosamide (LCM) in SE after failure of standard treatment. Methods -  We retrospectively analyzed 39 patients (21 women, 18 men, median age 62 years) from the hospital databases of five neurological departments in Germany, Austria and Switzerland between September 2008 and January 2010 who were admitted in SE and received at least one dose of intravenous LCM. Results -  Types of SE were generalized convulsive (n = 6), complex partial (n = 17) and simple partial (n = 16). LCM was administered after failure of benzodiazepins or other standard drugs in all but one case. Median bolus dose of LCM was 400 mg (range 200-400 mg), which was administered at 40-80 mg/min in those patients where infusion rate was documented. SE stopped after LCM in 17 patients, while 22 patients needed further anticonvulsant treatment. The success rate in patients receiving LCM as first or second drug was 3/5, as third drug 11/19, and as fourth or later drug 3/15. In five subjects, SE could not be terminated at all. No serious adverse events attributed to LCM were documented. Conclusions -  Intravenous LCM may be an alternative treatment for established SE after failure of standard therapy, or when standard agents are considered unsuitable.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

AimTo identify the bioclimatic niche of the endangered Andean cat (Leopardus jacobita), one of the rarest and least known felids in the world, by developing a species distribution model.LocationSouth America, High Andes and Patagonian steppe. Peru, Bolivia, Chile, Argentina.MethodsWe used 108 Andean cat records to build the models, and 27 to test them, applying the Maxent algorithm to sets of uncorrelated bioclimatic variables from global databases, including elevation. We based our biogeographical interpretations on the examination of the predicted geographic range, the modelled response curves and latitudinal variations in climatic variables associated with the locality data.ResultsSimple bioclimatic models for Andean cats were highly predictive with only 3-4 explanatory variables. The climatic niche of the species was defined by extreme diurnal variations in temperature, cold minimum and moderate maximum temperatures, and aridity, characteristic not only of the Andean highlands but also of the Patagonian steppe. Argentina had the highest representation of suitable climates, and Chile the lowest. The most favourable conditions were centrally located and spanned across international boundaries. Discontinuities in suitable climatic conditions coincided with three biogeographical barriers associated with climatic or topographic transitions.Main conclusionsSimple bioclimatic models can produce useful predictions of suitable climatic conditions for rare species, including major biogeographical constraints. In our study case, these constraints are also known to affect the distribution of other Andean species and the genetic structure of Andean cat populations. We recommend surveys of areas with suitable climates and no Andean cat records, including the corridor connecting two core populations. The inclusion of landscape variables at finer scales, crucially the distribution of Andean cat prey, would contribute to refine our predictions for conservation applications.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The position of a gene in the genome may have important consequences for its function. Therefore, when a new duplicate gene arises, its location may be critical in determining its fate. Our recent work in humans, mouse, and Drosophila provided a test by studying the patterns of duplication in sex chromosome evolution. We revealed a bias in the generation and recruitment of new gene copies involving the X chromosome that has been shaped largely by selection for male germline functions. The gene movement patterns we observed reflect an ongoing process as some of the new genes are very young while others were present before the divergence of humans and mouse. This suggests a continuing redistribution of male-related genes to achieve a more efficient allocation of male functions. This notion should be further tested in organisms employing other sex determination systems or in organisms differing in germline sex chromosome inactivation. It is likely that the selective forces that were detected in these studies are also acting on other types of duplicate genes. As a result, future work elucidating sex chromosome differentiation by other mutational mechanisms will shed light on this important process.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Data mining can be defined as the extraction of previously unknown and potentially useful information from large datasets. The main principle is to devise computer programs that run through databases and automatically seek deterministic patterns. It is applied in different fields of application, e.g., remote sensing, biometry, speech recognition, but has seldom been applied to forensic case data. The intrinsic difficulty related to the use of such data lies in its heterogeneity, which comes from the many different sources of information. The aim of this study is to highlight potential uses of pattern recognition that would provide relevant results from a criminal intelligence point of view. The role of data mining within a global crime analysis methodology is to detect all types of structures in a dataset. Once filtered and interpreted, those structures can point to previously unseen criminal activities. The interpretation of patterns for intelligence purposes is the final stage of the process. It allows the researcher to validate the whole methodology and to refine each step if necessary. An application to cutting agents found in illicit drug seizures was performed. A combinatorial approach was done, using the presence and the absence of products. Methods coming from the graph theory field were used to extract patterns in data constituted by links between products and place and date of seizure. A data mining process completed using graphing techniques is called ``graph mining''. Patterns were detected that had to be interpreted and compared with preliminary knowledge to establish their relevancy. The illicit drug profiling process is actually an intelligence process that uses preliminary illicit drug classes to classify new samples. Methods proposed in this study could be used \textit{a priori} to compare structures from preliminary and post-detection patterns. This new knowledge of a repeated structure may provide valuable complementary information to profiling and become a source of intelligence.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Abstract : Gene duplication is an essential source of material for the origin of genetic novelty and the evolution of lineage- or species-specific phenotypic traits. The reverse transcription of source gene mRNA followed by the genomic insertion of the resulting cDNA - retroposition - has provided the human genome with a significant number of gene copies during the last ~63 million years (MYA) of primate evolution. We estimated that at least 1 new functional gene (retrogene) per MYA emerged by retroposition in the primate lineage leading to humans. Using a combination of comparative sequencing and evolutionary simulations, we obtained strong evidence of functionality for 7 primate specific retrogenes. Most of these genes are specifically expressed in testis suggesting that retroposition has contributed with genetic raw material necessary for the evolution ofmale-specific functions in primates. We characterized CDC14Bretro (identified in the previous survey) that originated from the retroposition of a cell cycle gene - CDC14B - in the common ancestor of humans and apes. We demonstrate that CDC14Bretro experienced a period of intense positive selection in the African ape ancestor. By virtue of the amino acid substitutions that occurred during this period CDC 14Bretro adapted to a new subcellular compartment in African apes. Further analyses indicate that this subcellular shift reflects the evolution of anew functional role of CDC 14Bretro. Prompted by this result, we used yeast (Saccharomyces cerevisiae) to investigate on a global scale the extent of functional diversification of duplicate genes through the subcellular adaptation of their encoded proteins. We found that duplicate proteins frequently evolved new cellular localization patterns, either by partitioning of ancestral localizations ("sublocalization"), or more frequently by relocalization to previously unoccupied compartments ("neolocalization"). Interestingly, proteins involved in processes with a wider subcellular distribution more frequently evolved new localization patterns suggesting that subcellular localization changes are dependent on progenitor gene functions. Relocated proteins adapted to their new subcellular environments and evolved new functional roles through changes of their physio-chemical properties, expression levels, and interaction partners. Our work suggests an important role of subcellular adaptation for the emergence of new gene functions.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The InterPro database (http://www.ebi.ac.uk/interpro/) is a freely available resource that can be used to classify sequences into protein families and to predict the presence of important domains and sites. Central to the InterPro database are predictive models, known as signatures, from a range of different protein family databases that have different biological focuses and use different methodological approaches to classify protein families and domains. InterPro integrates these signatures, capitalizing on the respective strengths of the individual databases, to produce a powerful protein classification resource. Here, we report on the status of InterPro as it enters its 15th year of operation, and give an overview of new developments with the database and its associated Web interfaces and software. In particular, the new domain architecture search tool is described and the process of mapping of Gene Ontology terms to InterPro is outlined. We also discuss the challenges faced by the resource given the explosive growth in sequence data in recent years. InterPro (version 48.0) contains 36 766 member database signatures integrated into 26 238 InterPro entries, an increase of over 3993 entries (5081 signatures), since 2012.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Genome-scale metabolic network reconstructions are now routinely used in the study of metabolic pathways, their evolution and design. The development of such reconstructions involves the integration of information on reactions and metabolites from the scientific literature as well as public databases and existing genome-scale metabolic models. The reconciliation of discrepancies between data from these sources generally requires significant manual curation, which constitutes a major obstacle in efforts to develop and apply genome-scale metabolic network reconstructions. In this work, we discuss some of the major difficulties encountered in the mapping and reconciliation of metabolic resources and review three recent initiatives that aim to accelerate this process, namely BKM-react, MetRxn and MNXref (presented in this article). Each of these resources provides a pre-compiled reconciliation of many of the most commonly used metabolic resources. By reducing the time required for manual curation of metabolite and reaction discrepancies, these resources aim to accelerate the development and application of high-quality genome-scale metabolic network reconstructions and models.