832 resultados para databases and data mining
Resumo:
In this thesis the evolution of the techno-social systems analysis methods will be reported, through the explanation of the various research experience directly faced. The first case presented is a research based on data mining of a dataset of words association named Human Brain Cloud: validation will be faced and, also through a non-trivial modeling, a better understanding of language properties will be presented. Then, a real complex system experiment will be introduced: the WideNoise experiment in the context of the EveryAware european project. The project and the experiment course will be illustrated and data analysis will be displayed. Then the Experimental Tribe platform for social computation will be introduced . It has been conceived to help researchers in the implementation of web experiments, and aims also to catalyze the cumulative growth of experimental methodologies and the standardization of tools cited above. In the last part, three other research experience which already took place on the Experimental Tribe platform will be discussed in detail, from the design of the experiment to the analysis of the results and, eventually, to the modeling of the systems involved. The experiments are: CityRace, about the measurement of human traffic-facing strategies; laPENSOcosì, aiming to unveil the political opinion structure; AirProbe, implemented again in the EveryAware project framework, which consisted in monitoring air quality opinion shift of a community informed about local air pollution. At the end, the evolution of the technosocial systems investigation methods shall emerge together with the opportunities and the threats offered by this new scientific path.
Resumo:
Nowadays, more and more data is collected in large amounts, such that the need of studying it both efficiently and profitably is arising; we want to acheive new and significant informations that weren't known before the analysis. At this time many graph mining algorithms have been developed, but an algebra that could systematically define how to generalize such operations is missing. In order to propel the development of a such automatic analysis of an algebra, We propose for the first time (to the best of my knowledge) some primitive operators that may be the prelude to the systematical definition of a hypergraph algebra in this regard.
Resumo:
In this work we will discuss about a project started by the Emilia-Romagna Regional Government regarding the manage of the public transport. In particular we will perform a data mining analysis on the data-set of this project. After introducing the Weka software used to make our analysis, we will discover the most useful data mining techniques and algorithms; and we will show how these results can be used to violate the privacy of the same public transport operators. At the end, despite is off topic of this work, we will spend also a few words about how it's possible to prevent this kind of attack.
Resumo:
Aims of the study: To assess the prevalence of Antiepileptic Drug (AED) exposure in pregnant women with or without epilepsy and the comparative risk of terminations of pregnancy (TOPs), spontaneous abortions, stillbirth, major congenital malformations (MCMs) and foetal growth retardation (FGR) following intrauterine AED exposure in the Emilia Romagna region (RER), Northern Italy (4 million inhabitants). Methods: Data were obtained from official regional registries: Certificate of Delivery Assistance, Hospital Discharge Card, reimbursed prescription databases and Registry of Congenital Malformations. We identified all the deliveries, hospitalized abortions and MCMs occurred between January 2009 and December 2011. Results: We identified 145,243 pregnancies: 111,284 deliveries (112,845 live births and 279 stillbirths), 16408 spontaneous abortions and 17551 TOPs. Six hundred and eleven pregnancies (0.42% 95% Cl: 0.39-0.46) were exposed to AEDs. Twenty-one per cent of pregnancies ended in TOP in the AED group vs 12% in the non-exposed (OR:2.24; CI 1.41-3.56). The rate of spontaneous abortions and stillbirth was comparable in the two groups. Three hundred fifty-three babies (0.31%, 95% CI: 0.28-0.35) were exposed to AEDs during the first trimester. The rate of MCMs was 2.3% in the AED group (2.2% in babies exposed to monotherapy and 3.1% in babies exposed to polytherapy) vs 2.0% in the non-exposed. The risk of FGR was 12.7 % in the exposed group compared to 10% in the non-exposed. Discussion and Conclusion: The prevalence of AED exposure in pregnancy in the RER was 0.42%. The rate of MCMs in children exposed to AEDs in utero was almost superimposable to the one of the non-exposed, however polytherapy carried a slightly increased risk . The rate of TOPs was significantly higher in the exposed women. Further studies are needed to clarify whether this high rate reflects a higher rate of MCMs detected prenatally or other more elusive reasons.
Resumo:
Data sets describing the state of the earth's atmosphere are of great importance in the atmospheric sciences. Over the last decades, the quality and sheer amount of the available data increased significantly, resulting in a rising demand for new tools capable of handling and analysing these large, multidimensional sets of atmospheric data. The interdisciplinary work presented in this thesis covers the development and the application of practical software tools and efficient algorithms from the field of computer science, aiming at the goal of enabling atmospheric scientists to analyse and to gain new insights from these large data sets. For this purpose, our tools combine novel techniques with well-established methods from different areas such as scientific visualization and data segmentation. In this thesis, three practical tools are presented. Two of these tools are software systems (Insight and IWAL) for different types of processing and interactive visualization of data, the third tool is an efficient algorithm for data segmentation implemented as part of Insight.Insight is a toolkit for the interactive, three-dimensional visualization and processing of large sets of atmospheric data, originally developed as a testing environment for the novel segmentation algorithm. It provides a dynamic system for combining at runtime data from different sources, a variety of different data processing algorithms, and several visualization techniques. Its modular architecture and flexible scripting support led to additional applications of the software, from which two examples are presented: the usage of Insight as a WMS (web map service) server, and the automatic production of a sequence of images for the visualization of cyclone simulations. The core application of Insight is the provision of the novel segmentation algorithm for the efficient detection and tracking of 3D features in large sets of atmospheric data, as well as for the precise localization of the occurring genesis, lysis, merging and splitting events. Data segmentation usually leads to a significant reduction of the size of the considered data. This enables a practical visualization of the data, statistical analyses of the features and their events, and the manual or automatic detection of interesting situations for subsequent detailed investigation. The concepts of the novel algorithm, its technical realization, and several extensions for avoiding under- and over-segmentation are discussed. As example applications, this thesis covers the setup and the results of the segmentation of upper-tropospheric jet streams and cyclones as full 3D objects. Finally, IWAL is presented, which is a web application for providing an easy interactive access to meteorological data visualizations, primarily aimed at students. As a web application, the needs to retrieve all input data sets and to install and handle complex visualization tools on a local machine are avoided. The main challenge in the provision of customizable visualizations to large numbers of simultaneous users was to find an acceptable trade-off between the available visualization options and the performance of the application. Besides the implementational details, benchmarks and the results of a user survey are presented.
Resumo:
Background Pneumococcal conjugate vaccines (PCV) were first licensed for use with 3 primary doses in infancy and a booster dose. The evidence for the effects of different schedules was examined in this systematic review and meta-analysis. Methods We searched 12 databases and trial registers up to March 2010. We selected randomised controlled trials (RCTs), cohort and case–control studies making direct comparisons between PCV schedules with (2p) or (3p) primary doses, with (+1) or without (+0) a booster dose. We extracted data on clinical, nasopharyngeal carriage and immunological outcomes and used meta-analysis to combine results where appropriate. Results Seropositivity levels (antibody concentration ≥0.35 μg/ml) following 3p and 2p PCV schedules were high for most serotypes (5 RCTs). Differences between schedules were generally small and tended to favour 3p schedules, particularly for serotypes 6B and 23F; between-study heterogeneity was high. Seropositivity levels following 3p+1 and 2p+1 schedules were similar but small differences favouring 3p+1 schedules were seen for serotypes 6B and 23F. We did not identify any RCTs reporting clinical outcomes for these comparisons. In 2 RCTs there was weak evidence of a reduction in carriage of S. pneumoniae serotypes included in the vaccine when 3p+0 schedules were compared to 2p+0 at 6 months of age. Conclusions Most data about the relative effects of different PCV schedules relate to immunological outcomes. Both 3p and 2p schedules result in high levels of seropositivity. The clinical relevance of differences in immunological outcomes between schedules is not known. There is an absence of clinical outcome data from RCTs with direct comparisons of any 2p with any 3p PCV schedule.
Resumo:
Background. Measles control may be more challenging in regions with a high prevalence of HIV infection. HIV-infected children are likely to derive particular benefit from measles vaccines because of an increased risk of severe illness. However, HIV infection can impair vaccine effectiveness and may increase the risk of serious adverse events after receipt of live vaccines. We conducted a systematic review to assess the safety and immunogenicity of measles vaccine in HIV-infected children. Methods. The authors searched 8 databases through 12 February 2009 and reference lists. Study selection and data extraction were conducted in duplicate. Meta-analysis was conducted when appropriate. Results. Thirty-nine studies published from 1987 through 2008 were included. In 19 studies with information about measles vaccine safety, more than half reported no serious adverse events. Among HIV-infected children, 59% (95% confidence intervals [CI], 46–71%) were seropositive after receiving standard-titer measles vaccine at 6 months (1 study), comparable to the proportion of seropositive HIV-infected children vaccinated at 9 (8 studies) and 12 months (10 studies). Among HIV-exposed but uninfected and HIV-unexposed children, the proportion of seropositive children increased with increasing age at vaccination. Fewer HIV-infected children were protected after vaccination at 12 months than HIV-exposed but uninfected children (relative risk, 0.61; 95% CI, .50–.73). Conclusions. Measles vaccines appear to be safe in HIV-infected children, but the evidence is limited. When the burden of measles is high, measles vaccination at 6 months of age is likely to benefit children of HIV-infected women, regardless of the child's HIV infection status.
Resumo:
BACKGROUND: Neonates in a neonatal intensive care unit are exposed to a high number of painful procedures. Since repeated and sustained pain can have consequences for the neurological and behaviour-oriented development of the newborn, the greatest attention needs to be paid to systematic pain management in neonatology. Non-pharmacological treatment methods are being increasingly discussed with regard to pain prevention and relief either alone or in combination with pharmacological treatment. AIMS: To identify effective non-pharmacological interventions with regard to procedural pain in neonates. METHODS: A literature search was conducted via the MedLine, CINAHL, Cochrane Library databases and complemented by a handsearch. The literature search covered the period from 1984 to 2004. Data were extracted according to pre-defined criteria by two independent reviewers and methodological quality was assessed. RESULTS: 13 randomised controlled studies and two meta-analyses were taken into consideration with regard to the question of current nursing practice of non-pharmacological pain management methods. The selected interventions were "non-nutritive sucking", "music", "swaddling", "positioning", "olfactory and multisensorial stimulation", "kangaroo care" and "maternal touch". There is evidence that the methods of "non-nutritive sucking", "swaddling" and "facilitated tucking" do have a pain-alleviating effect on neonates. CONCLUSIONS: Some of the non-pharmacological interventions have an evident favourable effect on pulse rate, respiration and oxygen saturation, on the reduction of motor activity, and on the excitation states after invasive measures. However, unambiguous evidence of this still remains to be presented. Further research should emphasise the use of validated pain assessment instruments for the evaluation of the pain-alleviating effect of non-pharmacological interventions.
Resumo:
BACKGROUND: Chinese herbal medicine (CHM) is increasingly used in the West, but the evidence on its effectiveness is a matter of debate. We compared the characteristics, study quality and results of clinical trials of CHM and conventional medicine. METHODS: Comparative study of placebo-controlled trials of CHM and conventional medicine. Eleven bibliographic databases and searches by hand of 48 Chinese-language journals. Conventional medicine trials matched for condition and type of outcome were randomly selected from the Cochrane Controlled Trials Register (issue 1, 2003). Trials described as double-blind, with adequate generation of allocation sequence and adequate concealment of allocation, were assumed to be of high quality. Data were analysed using funnel plots and multivariable meta-regression models. RESULTS: 136 CHM trials (119 published in Chinese, 17 published in English) and 136 matched conventional medicine trials (125 published in English) were analysed. The quality of Chinese-language CHM trials tended to be lower than that of English-language CHM trials and conventional medicine trials. Three (2%) CHM trials and 10 (7%) conventional medicine trials were of high quality. In all groups, smaller trials showed more beneficial treatment effects than larger trials. CHM trials published in Chinese showed considerably larger effects than CHM trials published in English (adjusted ratio of ORs 0.29, 95% confidence intervals 0.17-0.52). CONCLUSIONS: Biases are present both in placebo-controlled trials of CHM and conventional medicine, but may be most pronounced in CHM trials published in Chinese-language journals. Only few CHM trials of adequate methodology exist and the effectiveness of CHM therefore remains poorly documented.
Resumo:
BACKGROUND: The retention of patients in antiretroviral therapy (ART) programmes is an important issue in resource-limited settings. Loss to follow up can be substantial, but it is unclear what the outcomes are in patients who are lost to programmes. METHODS AND FINDINGS: We searched the PubMed, EMBASE, Latin American and Caribbean Health Sciences Literature (LILACS), Indian Medlars Centre (IndMed) and African Index Medicus (AIM) databases and the abstracts of three conferences for studies that traced patients lost to follow up to ascertain their vital status. Main outcomes were the proportion of patients traced, the proportion found to be alive and the proportion that had died. Where available, we also examined the reasons why some patients could not be traced, why patients found to be alive did not return to the clinic, and the causes of death. We combined mortality data from several studies using random-effects meta-analysis. Seventeen studies were eligible. All were from sub-Saharan Africa, except one study from India, and none were conducted in children. A total of 6420 patients (range 44 to 1343 patients) were included. Patients were traced using telephone calls, home visits and through social networks. Overall the vital status of 4021 patients could be ascertained (63%, range across studies: 45% to 86%); 1602 patients had died. The combined mortality was 40% (95% confidence interval 33%-48%), with substantial heterogeneity between studies (P<0.0001). Mortality in African programmes ranged from 12% to 87% of patients lost to follow-up. Mortality was inversely associated with the rate of loss to follow up in the programme: it declined from around 60% to 20% as the percentage of patients lost to the programme increased from 5% to 50%. Among patients not found, telephone numbers and addresses were frequently incorrect or missing. Common reasons for not returning to the clinic were transfer to another programme, financial problems and improving or deteriorating health. Causes of death were available for 47 deaths: 29 (62%) died of an AIDS defining illness. CONCLUSIONS: In ART programmes in resource-limited settings a substantial minority of adults lost to follow up cannot be traced, and among those traced 20% to 60% had died. Our findings have implications both for patient care and the monitoring and evaluation of programmes.
Resumo:
BACKGROUND: The objective of this study was to systematically review the diagnostic criteria, indications, and need for treatment of ankyloglossia (tongue-tie), as well as the various treatment options for patients in different age groups. METHODS: The MEDLINE databases and the Cochrane Library were searched according to well-defined criteria, resulting in 64 included articles. The evidence regarding the classifications of tongue-tie, epidemiologic data, inheritance, breastfeeding problems, impaired tongue mobility, speech disorders, malocclusion, gingival recessions, therapy, and complications due to surgery was analyzed in detail. RESULTS: Different classifications for ankyloglossia have been proposed but not uniformly accepted. Breastfeeding problems in neonates could be associated with a tongue-tie, but not enough controlled trials have been performed to identify an ideal treatment option. In children and adults with ankyloglossia, limitations in tongue mobility are present, but the individual degree of discomfort, as well as the severity of an associated speech problem, are subjective and difficult to categorize. There is no evidence supporting the development of gingival recessions because of ankyloglossia. Frenotomy, frenectomy, and frenuloplasty are the main surgical treatment options to release/remove an ankyloglossia. Because of the limited evidence available, no specific surgical method can be favored. CONCLUSIONS: The lack of an accepted definition and classification of ankyloglossia makes comparisons between studies almost impossible. Because almost no controlled prospective trials for surgical interventions in patients with tongue-ties are present in the literature, no conclusive suggestions regarding the method of choice can be made. It also remains controversial which tongue-ties need to be surgically removed and which can be left to observation.
Resumo:
We describe the use of log file analysis to investigate whether the use of CSCL applications corresponds to its didactical purposes. Exemplarily we examine the use of the web-based system CommSy as software support for project-oriented university courses. We present two findings: (1) We suggest measures to shape the context of CSCL applications and support their initial and continuous use. (2) We show how log files can be used to analyze how, when and by whom a CSCL system is used and thus help to validate further empirical findings. However, log file analyses can only be interpreted reasonably when additional data concerning the context of use is available.
Resumo:
In recent years, learning analytics (LA) has attracted a great deal of attention in technology-enhanced learning (TEL) research as practitioners, institutions, and researchers are increasingly seeing the potential that LA has to shape the future TEL landscape. Generally, LA deals with the development of methods that harness educational data sets to support the learning process. This paper provides a foundation for future research in LA. It provides a systematic overview on this emerging field and its key concepts through a reference model for LA based on four dimensions, namely data, environments, context (what?), stakeholders (who?), objectives (why?), and methods (how?). It further identifies various challenges and research opportunities in the area of LA in relation to each dimension.