Biblioteca Digital

999 resultados para data mules

Des données aux connaissances, un chemin difficile: réflexion sur la place du data mining en analyse criminelle

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Le "data mining", ou "fouille de données", est un ensemble de méthodes et de techniques attractif qui a connu une popularité fulgurante ces dernières années, spécialement dans le domaine du marketing. Le développement récent de l'analyse ou du renseignement criminel soulève des problèmatiques auxqwuelles il est tentant de d'appliquer ces méthodes et techniques. Le potentiel et la place du data mining dans le contexte de l'analyse criminelle doivent être mieux définis afin de piloter son application. Cette réflexion est menée dans le cadre du renseignement produit par des systèmes de détection et de suivi systématique de la criminalité répétitive, appelés processus de veille opérationnelle. Leur fonctionnement nécessite l'existence de patterns inscrits dans les données, et justifiés par les approches situationnelles en criminologie. Muni de ce bagage théorique, l'enjeu principal revient à explorer les possibilités de détecter ces patterns au travers des méthodes et techniques de data mining. Afin de répondre à cet objectif, une recherche est actuellement menée au Suisse à travers une approche interdisciplinaire combinant des connaissances forensiques, criminologiques et computationnelles.

Estimating heritability from nuclear family and pedigree data.

Relevância:

20.00% 20.00%

Publicador:

Data Access Agreement template (PDF 247KB)

Relevância:

20.00% 20.00%

Publicador:

Data Protection Manual (MS Word 461KB)

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Departmental Data Protection manual

DHSSPS Data Protection Policy (MS Word 67KB)

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Statement of departmental data protection policy

Paper 3: EUROCAT data quality indicators for population-based registries of congenital anomalies.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The European Surveillance of Congenital Anomalies (EUROCAT) network of population-based congenital anomaly registries is an important source of epidemiologic information on congenital anomalies in Europe covering live births, fetal deaths from 20 weeks gestation, and terminations of pregnancy for fetal anomaly. EUROCAT's policy is to strive for high-quality data, while ensuring consistency and transparency across all member registries. A set of 30 data quality indicators (DQIs) was developed to assess five key elements of data quality: completeness of case ascertainment, accuracy of diagnosis, completeness of information on EUROCAT variables, timeliness of data transmission, and availability of population denominator information. This article describes each of the individual DQIs and presents the output for each registry as well as the EUROCAT (unweighted) average, for 29 full member registries for 2004-2008. This information is also available on the EUROCAT website for previous years. The EUROCAT DQIs allow registries to evaluate their performance in relation to other registries and allows appropriate interpretations to be made of the data collected. The DQIs provide direction for improving data collection and ascertainment, and they allow annual assessment for monitoring continuous improvement. The DQI are constantly reviewed and refined to best document registry procedures and processes regarding data collection, to ensure appropriateness of DQI, and to ensure transparency so that the data collected can make a substantial and useful contribution to epidemiologic research on congenital anomalies.

Uncovering hidden duplicated content in public transcriptomics data.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

As part of the development of the database Bgee (a dataBase for Gene Expression Evolution), we annotate and analyse expression data from different types and different sources, notably Affymetrix data from GEO and ArrayExpress, and RNA-Seq data from SRA. During our quality control procedure, we have identified duplicated content in GEO and ArrayExpress, affecting ∼14% of our data: fully or partially duplicated experiments from independent data submissions, Affymetrix chips reused in several experiments, or reused within an experiment. We present here the procedure that we have established to filter such duplicates from Affymetrix data, and our procedure to identify future potential duplicates in RNA-Seq data. Database URL: http://bgee.unil.ch/

A modular approach for integrative analysis of large-scale gene-expression and drug-response data

Relevância:

20.00% 20.00%

Publicador:

Resumo:

High-throughput technologies are now used to generate more than one type of data from the same biological samples. To properly integrate such data, we propose using co-modules, which describe coherent patterns across paired data sets, and conceive several modular methods for their identification. We first test these methods using in silico data, demonstrating that the integrative scheme of our Ping-Pong Algorithm uncovers drug-gene associations more accurately when considering noisy or complex data. Second, we provide an extensive comparative study using the gene-expression and drug-response data from the NCI-60 cell lines. Using information from the DrugBank and the Connectivity Map databases we show that the Ping-Pong Algorithm predicts drug-gene associations significantly better than other methods. Co-modules provide insights into possible mechanisms of action for a wide range of drugs and suggest new targets for therapy

How to use the standard model with own data?

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In this work discuss the use of the standard model for the calculation of the solvency capital requirement (SCR) when the company aims to use the specific parameters of the model on the basis of the experience of its portfolio. In particular, this analysis focuses on the formula presented in the latest quantitative impact study (2010 CEIOPS) for non-life underwriting premium and reserve risk. One of the keys of the standard model for premium and reserves risk is the correlation matrix between lines of business. In this work we present how the correlation matrix between lines of business could be estimated from a quantitative perspective, as well as the possibility of using a credibility model for the estimation of the matrix of correlation between lines of business that merge qualitative and quantitative perspective.

Worm burdens in outbred and inbred laboratory rats with morphometric data on Syphacia muris (Yamaguti, 1935) Yamaguti, 1941 (Nematoda, Oxyuroidea)

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Syphacia muris worm burdens were evaluated in the rat Rattus norvegicus of the strains Wistar (outbred), Low/M and AM/2/Torr (inbred), maintained conventionally in institutional animal houses in Brazil. Morphometrics and illustration data for S. muris recovered from Brazilian laboratory rats are provided for the first time since its proposition in 1935.

New host and locality records for Tetrameres (Gynaecophila) spirospiculum Pinto & Vicente, 1995 (Nematoda: Tetrameridae), with new morphological data

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We report the finding of Tetrameres spirospiculum Pinto & Vicente, 1995 from Theristicus melanopis melanopis (Threskiornithidae) from Patagonia, Argentina. These constitute new host and locality records. We propose the assignation of this species to the subgenus T. (Gynaecophila) Gubanov, 1950, based on the presence of labia and the absence of cuticular flanges at the anterior end. Some new morphological data are provided, such as the arrangement of cuticular spines and the presence of a pair of somatic papillae at beginning of posterior third of body length. T. (G.) spirospiculum may probably be regarded as specific to birds of the genus Theristicus.

Feasibility of a population-based registration of phenotypic, anatomic and familial data for melanoma

Relevância:

20.00% 20.00%

Publicador:

Sequential Analysis of Developmental Change in Behavior Patterns Using Markov Modeling of Nonstationary Data.

Relevância:

20.00% 20.00%

Publicador:

Is real GDP stationary? Evidence from a panel unit root test with cross-sectional dependence and historical data

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We use historical data that cover more than one century on real GDP for industrial countries and employ the Pesaran panel unit root test that allows for cross-sectional dependence to test for a unit root on real GDP. We find strong evidence against the unit root null. Our results are robust to the chosen group of countries and the sample period. Key words: real GDP stationarity, cross-sectional dependence, CIPS test. JEL Classification: C23, E32

An examination of male and female odds ratios by BMI, cigarette smoking, and alcohol consumption for cancers of the oral cavity, pharynx, and larynx in pooled data from 15 case-control studies.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

BACKGROUND: Greater tobacco smoking and alcohol consumption and lower body mass index (BMI) increase odds ratios (OR) for oral cavity, oropharyngeal, hypopharyngeal, and laryngeal cancers; however, there are no comprehensive sex-specific comparisons of ORs for these factors. METHODS: We analyzed 2,441 oral cavity (925 women and 1,516 men), 2,297 oropharynx (564 women and 1,733 men), 508 hypopharynx (96 women and 412 men), and 1,740 larynx (237 women and 1,503 men) cases from the INHANCE consortium of 15 head and neck cancer case-control studies. Controls numbered from 7,604 to 13,829 subjects, depending on analysis. Analyses fitted linear-exponential excess ORs models. RESULTS: ORs were increased in underweight (<18.5 BMI) relative to normal weight (18.5-24.9) and reduced in overweight and obese categories (>/=25 BMI) for all sites and were homogeneous by sex. ORs by smoking and drinking in women compared with men were significantly greater for oropharyngeal cancer (p < 0.01 for both factors), suggestive for hypopharyngeal cancer (p = 0.05 and p = 0.06, respectively), but homogeneous for oral cavity (p = 0.56 and p = 0.64) and laryngeal (p = 0.18 and p = 0.72) cancers. CONCLUSIONS: The extent that OR modifications of smoking and drinking by sex for oropharyngeal and, possibly, hypopharyngeal cancers represent true associations, or derive from unmeasured confounders or unobserved sex-related disease subtypes (e.g., human papillomavirus-positive oropharyngeal cancer) remains to be clarified.

«
1
2
...
44
45
46
47
48
49
50
...
66
67
»