982 resultados para Data Aggregation


Relevância:

30.00% 30.00%

Publicador:

Resumo:

In the medical and healthcare arena, patients‟ data is not just their own personal history but also a valuable large dataset for finding solutions for diseases. While electronic medical records are becoming popular and are used in healthcare work places like hospitals, as well as insurance companies, and by major stakeholders such as physicians and their patients, the accessibility of such information should be dealt with in a way that preserves privacy and security. Thus, finding the best way to keep the data secure has become an important issue in the area of database security. Sensitive medical data should be encrypted in databases. There are many encryption/ decryption techniques and algorithms with regard to preserving privacy and security. Currently their performance is an important factor while the medical data is being managed in databases. Another important factor is that the stakeholders should decide more cost-effective ways to reduce the total cost of ownership. As an alternative, DAS (Data as Service) is a popular outsourcing model to satisfy the cost-effectiveness but it takes a consideration that the encryption/ decryption modules needs to be handled by trustworthy stakeholders. This research project is focusing on the query response times in a DAS model (AES-DAS) and analyses the comparison between the outsourcing model and the in-house model which incorporates Microsoft built-in encryption scheme in a SQL Server. This research project includes building a prototype of medical database schemas. There are 2 types of simulations to carry out the project. The first stage includes 6 databases in order to carry out simulations to measure the performance between plain-text, Microsoft built-in encryption and AES-DAS (Data as Service). Particularly, the AES-DAS incorporates implementations of symmetric key encryption such as AES (Advanced Encryption Standard) and a Bucket indexing processor using Bloom filter. The results are categorised such as character type, numeric type, range queries, range queries using Bucket Index and aggregate queries. The second stage takes the scalability test from 5K to 2560K records. The main result of these simulations is that particularly as an outsourcing model, AES-DAS using the Bucket index shows around 3.32 times faster than a normal AES-DAS under the 70 partitions and 10K record-sized databases. Retrieving Numeric typed data takes shorter time than Character typed data in AES-DAS. The aggregation query response time in AES-DAS is not as consistent as that in MS built-in encryption scheme. The scalability test shows that the DBMS reaches in a certain threshold; the query response time becomes rapidly slower. However, there is more to investigate in order to bring about other outcomes and to construct a secured EMR (Electronic Medical Record) more efficiently from these simulations.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The traditional decomposition of the gender wage gap distinguishes between a component attributable to gender differences in productivity-related characteristics and a residual component that is often taken as a measure of discrimination. This study of data from the 1989 Canadian Labour Market Activity Survey shows that when occupation is treated as a productivity-related characteristic, the proportion of the gender wage gap labeled explained increases with the number of occupational classifications distinguished. However, on the basis of evidence that occupational differences reflect the presence of barriers faced by women attempting to enter male-dominated occupations, the authors conclude that occupation should not be treated as a productivity-related characteristic; and in a decomposition of the gender wage gap that treats occupation as endogenously determined, they find that the level of occupational aggregation has little effect on the size of the "explained" component of the gap.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This thesis was a step forward in improving the stability of power systems by applying new control and modelling techniques. The developed methods use the data obtained from voltage angle measurement devices which are synchronized with GPS signals to stabilize the system and avoid system-wide blackouts in the event of severe faults. New approaches were developed in this research for identifying and estimating reduced dynamic system models using phasor measurement units. The main goal of this research is achieved by integrating the developed methods to obtain a feasible wide-area control system for stabilizing the power systems.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The Bluetooth technology is being increasingly used to track vehicles throughout their trips, within urban networks and across freeway stretches. One important opportunity offered by this type of data is the measurement of Origin-Destination patterns, emerging from the aggregation and clustering of individual trips. In order to obtain accurate estimations, however, a number of issues need to be addressed, through data filtering and correction techniques. These issues mainly stem from the use of the Bluetooth technology amongst drivers, and the physical properties of the Bluetooth sensors themselves. First, not all cars are equipped with discoverable Bluetooth devices and the Bluetooth-enabled vehicles may belong to some small socio-economic groups of users. Second, the Bluetooth datasets include data from various transport modes; such as pedestrian, bicycles, cars, taxi driver, buses and trains. Third, the Bluetooth sensors may fail to detect all of the nearby Bluetooth-enabled vehicles. As a consequence, the exact journey for some vehicles may become a latent pattern that will need to be extracted from the data. Finally, sensors that are in close proximity to each other may have overlapping detection areas, thus making the task of retrieving the correct travelled path even more challenging. The aim of this paper is twofold. We first give a comprehensive overview of the aforementioned issues. Further, we propose a methodology that can be followed, in order to cleanse, correct and aggregate Bluetooth data. We postulate that the methods introduced by this paper are the first crucial steps that need to be followed in order to compute accurate Origin-Destination matrices in urban road networks.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

A commitment in 2010 by the Australian Federal Government to spend $466.7 million dollars on the implementation of personally controlled electronic health records (PCEHR) heralded a shift to a more effective and safer patient centric eHealth system. However, deployment of the PCEHR has met with much criticism, emphasised by poor adoption rates over the first 12 months of operation. An indifferent response by the public and healthcare providers largely sceptical of its utility and safety speaks to the complex sociotechnical drivers and obstacles inherent in the embedding of large (national) scale eHealth projects. With government efforts to inflate consumer and practitioner engagement numbers giving rise to further consumer disillusionment, broader utilitarian opportunities available with the PCEHR are at risk. This paper discusses the implications of establishing the PCEHR as the cornerstone of a holistic eHealth strategy for the aggregation of longitudinal patient information. A viewpoint is offered that the real value in patient data lies not just in the collection of data but in the integration of this information into clinical processes within the framework of a commoditised data-driven approach. Consideration is given to the eHealth-as-a-Service (eHaaS) construct as a disruptive next step for co-ordinated individualised healthcare in the Australian context.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

We describe here the role of muramidases present in clones of metagenomic DNA that result in cell aggregation and biofilm formation by Escherichia coli. The metagenomic clones were obtained from uncultured Lachnospiraceae-affiliated bacteria resident in the foregut microbiome of the Tammar wallaby. One of these fosmid clones (p49C2) was chosen for more detailed studies and a variety of genetic methods were used to delimit the region responsible for the phenotype to an open reading frame of 1425 bp. Comparative sequence analysis with other fosmid clones giving rise to the same phenotype revealed the presence of muramidase homologues with the same modular composition. Phylogenetic analysis of the fosmid sequence data assigned these fosmid inserts to recently identified, but uncultured, phylogroups of Lachnospiraceae believed to be numerically dominant in the foregut microbiome of the Tammar wallaby. The muramidase is a modular protein containing putative N-acetylmuramoyl--alanine amidase and an endo-β-N-acetylglucosaminidase catalytic module, with a similar organization and functional properties to some Staphylococcal autolysins that also confer adhesive properties and biofilm formation. We also show here that the cloned muramidases result in the production of extracellular DNA, which appears to be the key for biofilm formation and autoaggregation. Collectively, these findings suggest that biofilm formation and cell aggregation in gut microbiomes might occur via the concerted action of carbohydrate-active enzymes and the production of extracellular DNA to serve as a biofilm scaffold.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Heterogeneous health data is a critical issue when managing health information for quality decision making processes. In this paper we examine the efficient aggregation of lifestyle information through a data warehousing architecture lens. We present a proof of concept for a clinical data warehouse architecture that enables evidence based decision making processes by integrating and organising disparate data silos in support of healthcare services improvement paradigms.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Smart Card Automated Fare Collection (AFC) data has been extensively exploited to understand passenger behavior, passenger segment, trip purpose and improve transit planning through spatial travel pattern analysis. The literature has been evolving from simple to more sophisticated methods such as from aggregated to individual travel pattern analysis, and from stop-to-stop to flexible stop aggregation. However, the issue of high computing complexity has limited these methods in practical applications. This paper proposes a new algorithm named Weighted Stop Density Based Scanning Algorithm with Noise (WS-DBSCAN) based on the classical Density Based Scanning Algorithm with Noise (DBSCAN) algorithm to detect and update the daily changes in travel pattern. WS-DBSCAN converts the classical quadratic computation complexity DBSCAN to a problem of sub-quadratic complexity. The numerical experiment using the real AFC data in South East Queensland, Australia shows that the algorithm costs only 0.45% in computation time compared to the classical DBSCAN, but provides the same clustering results.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

With the extensive use of rating systems in the web, and their significance in decision making process by users, the need for more accurate aggregation methods has emerged. The Naïve aggregation method, using the simple mean, is not adequate anymore in providing accurate reputation scores for items [6 ], hence, several researches where conducted in order to provide more accurate alternative aggregation methods. Most of the current reputation models do not consider the distribution of ratings across the different possible ratings values. In this paper, we propose a novel reputation model, which generates more accurate reputation scores for items by deploying the normal distribution over ratings. Experiments show promising results for our proposed model over state-of-the-art ones on sparse and dense datasets.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

With the ever increasing amount of eHealth data available from various eHealth systems and sources, Health Big Data Analytics promises enticing benefits such as enabling the discovery of new treatment options and improved decision making. However, concerns over the privacy of information have hindered the aggregation of this information. To address these concerns, we propose the use of Information Accountability protocols to provide patients with the ability to decide how and when their data can be shared and aggregated for use in big data research. In this paper, we discuss the issues surrounding Health Big Data Analytics and propose a consent-based model to address privacy concerns to aid in achieving the promised benefits of Big Data in eHealth.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Many websites presently provide the facility for users to rate items quality based on user opinion. These ratings are used later to produce item reputation scores. The majority of websites apply the mean method to aggregate user ratings. This method is very simple and is not considered as an accurate aggregator. Many methods have been proposed to make aggregators produce more accurate reputation scores. In the majority of proposed methods the authors use extra information about the rating providers or about the context (e.g. time) in which the rating was given. However, this information is not available all the time. In such cases these methods produce reputation scores using the mean method or other alternative simple methods. In this paper, we propose a novel reputation model that generates more accurate item reputation scores based on collected ratings only. Our proposed model embeds statistical data, previously disregarded, of a given rating dataset in order to enhance the accuracy of the generated reputation scores. In more detail, we use the Beta distribution to produce weights for ratings and aggregate ratings using the weighted mean method. Experiments show that the proposed model exhibits performance superior to that of current state-of-the-art models.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Background The potential effect of ginger on platelet aggregation is a widely-cited concern both within the published literature and to clinicians; however, there has been no systematic appraisal of the evidence to date. Methods Using the PRISMA guidelines, we systematically reviewed the results of clinical and observational trials regarding the effect of ginger on platelet aggregation in adults compared to either placebo or baseline data. Studies included in this review stipulated the independent variable was a ginger preparation or isolated ginger compound, and used measures of platelet aggregation as the primary outcome. Results Ten studies were included, comprising eight clinical trials and two observational studies. Of the eight clinical trials, four reported that ginger reduced platelet aggregation, while the remaining four reported no effect. The two observational studies also reported mixed findings. Discussion Many of the studies appraised for this review had moderate risks of bias. Methodology varied considerably between studies, notably the timeframe studied, dose of ginger used, and the characteristics of subjects recruited (e.g. healthy vs. patients with chronic diseases). Conclusion The evidence that ginger affects platelet aggregation and coagulation is equivocal and further study is needed to definitively address this question.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Type 1 diabetes (T1D) is a common, multifactorial disease with strong familial clustering. In Finland, the incidence of T1D among children aged 14 years or under is the highest in the world. The increase in incidence has been approximately 2.4% per year. Although most new T1D cases are sporadic the first-degree relatives are at an increased risk of developing the same disease. This study was designed to examine the familial aggregation of T1D and one of its serious complications, diabetic nephropathy (DN). More specifically the study aimed (1) to determine the concordance rates of T1D in monozygotic (MZ) and dizygotic (DZ) twins and to estimate the relative contributions of genetic and environmental factors to the variability in liability to T1D as well as to study the age at onset of diabetes in twins; (2) to obtain long-term empirical estimates of the risk of T1D among siblings of T1D patients and the factors related to this risk, especially the effect of age at onset of diabetes in the proband and the birth cohort effect; (3) to establish if DN is aggregating in a Finnish population-based cohort of families with multiple cases of T1D, and to assess its magnitude and particularly to find out whether the risk of DN in siblings is varying according to the severity of DN in the proband and/or the age at onset of T1D: (4) to assess the recurrence risk of T1D in the offspring of a Finnish population-based cohort of patients with childhood onset T1D, and to investigate potential sex-related effects in the transmission of T1D from the diabetic parents to their offspring as well as to study whether there is a temporal trend in the incidence. The study population comprised of the Finnish Young Twin Cohort (22,650 twin pairs), a population-based cohort of patients with T1D diagnosed at the age of 17 years or earlier between 1965 and 1979 (n=5,144) and all their siblings (n=10,168) and offspring (n=5,291). A polygenic, multifactorial liability model was fitted to the twin data. Kaplan-Meier analyses were used to provide the cumulative incidence for the development of T1D and DN. Cox s proportional hazards models were fitted to the data. Poisson regression analysis was used to evaluate temporal trends in incidence. Standardized incidence ratios (SIRs) between the first-degree relatives of T1D patients and background population were determined. The twin study showed that the vast majority of affected MZ twin pairs remained discordant. Pairwise concordance for T1D was 27.3% in MZ and 3.8% in DZ twins. The probandwise concordance estimates were 42.9% and 7.4%, respectively. The model with additive genetic and individual environmental effects was the best-fitting liability model to T1D, with 88% of the phenotypic variance due to genetic factors. The second paper showed that the 50-year cumulative incidence of T1D in the siblings of diabetic probands was 6.9%. A young age at diagnosis in the probands considerably increased the risk. If the proband was diagnosed at the age of 0-4, 5-9, 10-14, 15 or more, the corresponding 40-year cumulative risks were 13.2%, 7.8%, 4.7% and 3.4%. The cumulative incidence increased with increasing birth year. However, SIR among children aged 14 years or under was approximately 12 throughout the follow-up. The third paper showed that diabetic siblings of the probands with nephropathy had a 2.3 times higher risk of DN compared with siblings of probands free of nephropathy. The presence of end stage renal disease (ESRD) in the proband increases the risk three-fold for diabetic siblings. Being diagnosed with diabetes during puberty (10-14) or a few years before (5-9) increased the susceptibility for DN in the siblings. The fourth paper revealed that of the offspring of male probands, 7.8% were affected by the age of 20 compared with 5.3% of the offspring of female probands. Offspring of fathers with T1D have 1.7 times greater risk to be affected with T1D than the offspring of mothers with T1D. The excess risk in the offspring of male fathers manifested itself through the higher risk the younger the father was when diagnosed with T1D. Young age at onset of diabetes in fathers increased the risk of T1D greatly in the offspring, but no such pattern was seen in the offspring of diabetic mothers. The SIR among offspring aged 14 years or under remained fairly constant throughout the follow-up, approximately 10. The present study has provided new knowledge on T1D recurrence risk in the first-degree relatives and the risk factors modifying the risk. Twin data demonstrated high genetic liability for T1D and increased heritability. The vast majority of affected MZ twin pairs, however, remain discordant for T1D. This study confirmed the drastic impact of the young age at onset of diabetes in the probands on the increased risk of T1D in the first-degree relatives. The only exception was the absence of this pattern in the offspring of T1D mothers. Both the sibling and the offspring recurrence risk studies revealed dynamic changes in the cumulative incidence of T1D in the first-degree relatives. SIRs among the first-degree relatives of T1D patients seems to remain fairly constant. The study demonstrates that the penetrance of the susceptibility genes for T1D may be low, although strongly influenced by the environmental factors. Presence of familial aggregation of DN was confirmed for the first time in a population-based study. Although the majority of the sibling pairs with T1D were discordant for DN, its presence in one sibling doubles and presence of ESRD triples the risk of DN in the other diabetic sibling. An encouraging observation was that although the proportion of children to be diagnosed with T1D at the age of 4 or under is increasing, they seem to have a decreased risk of DN or at least delayed onset.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

In genetic epidemiology, population-based disease registries are commonly used to collect genotype or other risk factor information concerning affected subjects and their relatives. This work presents two new approaches for the statistical inference of ascertained data: a conditional and full likelihood approaches for the disease with variable age at onset phenotype using familial data obtained from population-based registry of incident cases. The aim is to obtain statistically reliable estimates of the general population parameters. The statistical analysis of familial data with variable age at onset becomes more complicated when some of the study subjects are non-susceptible, that is to say these subjects never get the disease. A statistical model for a variable age at onset with long-term survivors is proposed for studies of familial aggregation, using latent variable approach, as well as for prospective studies of genetic association studies with candidate genes. In addition, we explore the possibility of a genetic explanation of the observed increase in the incidence of Type 1 diabetes (T1D) in Finland in recent decades and the hypothesis of non-Mendelian transmission of T1D associated genes. Both classical and Bayesian statistical inference were used in the modelling and estimation. Despite the fact that this work contains five studies with different statistical models, they all concern data obtained from nationwide registries of T1D and genetics of T1D. In the analyses of T1D data, non-Mendelian transmission of T1D susceptibility alleles was not observed. In addition, non-Mendelian transmission of T1D susceptibility genes did not make a plausible explanation for the increase in T1D incidence in Finland. Instead, the Human Leucocyte Antigen associations with T1D were confirmed in the population-based analysis, which combines T1D registry information, reference sample of healthy subjects and birth cohort information of the Finnish population. Finally, a substantial familial variation in the susceptibility of T1D nephropathy was observed. The presented studies show the benefits of sophisticated statistical modelling to explore risk factors for complex diseases.