12 resultados para correlated information

em Deakin Research Online - Australia


Relevância:

80.00% 80.00%

Publicador:

Resumo:

Privacy preserving on data mining and data release has attracted an increasing research interest over a number of decades. Differential privacy is one influential privacy notion that offers a rigorous and provable privacy guarantee for data mining and data release. Existing studies on differential privacy assume that in a data set, records are sampled independently. However, in real-world applications, records in a data set are rarely independent. The relationships among records are referred to as correlated information and the data set is defined as correlated data set. A differential privacy technique performed on a correlated data set will disclose more information than expected, and this is a serious privacy violation. Although recent research was concerned with this new privacy violation, it still calls for a solid solution for the correlated data set. Moreover, how to decrease the large amount of noise incurred via differential privacy in correlated data set is yet to be explored. To fill the gap, this paper proposes an effective correlated differential privacy solution by defining the correlated sensitivity and designing a correlated data releasing mechanism. With consideration of the correlated levels between records, the proposed correlated sensitivity can significantly decrease the noise compared with traditional global sensitivity. The correlated data releasing mechanism correlated iteration mechanism is designed based on an iterative method to answer a large number of queries. Compared with the traditional method, the proposed correlated differential privacy solution enhances the privacy guarantee for a correlated data set with less accuracy cost. Experimental results show that the proposed solution outperforms traditional differential privacy in terms of mean square error on large group of queries. This also suggests the correlated differential privacy can successfully retain the utility while preserving the privacy.

Relevância:

70.00% 70.00%

Publicador:

Resumo:

Traffic classification has wide applications in network management, from security monitoring to quality of service measurements. Recent research tends to apply machine learning techniques to flow statistical feature based classification methods. The nearest neighbor (NN)-based method has exhibited superior classification performance. It also has several important advantages, such as no requirements of training procedure, no risk of overfitting of parameters, and naturally being able to handle a huge number of classes. However, the performance of NN classifier can be severely affected if the size of training data is small. In this paper, we propose a novel nonparametric approach for traffic classification, which can improve the classification performance effectively by incorporating correlated information into the classification process. We analyze the new classification approach and its performance benefit from both theoretical and empirical perspectives. A large number of experiments are carried out on two real-world traffic data sets to validate the proposed approach. The results show the traffic classification performance can be improved significantly even under the extreme difficult circumstance of very few training samples.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

With the arrival of Big Data Era, properly utilizing the power of big data is becoming increasingly essential for the strength and competitiveness of businesses and organizations. We are facing grand challenges from big data from different perspectives, such as processing, communication, security, and privacy. In this talk, we discuss the big data challenges in network traffic classification and our solutions to the challenges. The significance of the research lies in the fact that each year the network traffic increase exponentially on the current Internet. Traffic classification has wide applications in network management, from security monitoring to quality of service measurements. Recent research tends to apply machine-learning techniques to flow statistical feature based classification methods. In this talk, we propose a series of novel approaches for traffic classification, which can improve the classification performance effectively by incorporating correlated information into the classification process. We analyze the new classification approaches and their performance benefit from both theoretical and empirical perspectives. A large number of experiments are carried out on two real-world traffic datasets to validate the proposed approach. The results show the traffic classification performance can be improved significantly even under the extreme difficult circumstance of very few training samples. Our work has significant impact on security applications.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Use of geographical information systems (GIS) in inland fisheries has hitherto been essentially restricted to site evaluation for aquaculture development and assessment of limnological changes in time and space in individual water bodies. The present GIS study was conducted on the land-use pattern of the catchments of nine reservoirs in Sri Lanka, for which detailed fishery data, viz. yield, fishing intensity, landing size of major constituent species, together with selected limnological data such as conductivity and chlorophyll-a, were available. Potential statistical relationships (linear, curvilinear, exponential and second-order polynomial) of fish yield (FY, in kg ha−1 yr−1) to different land-use patterns, such as forest cover (FC, in km2) and shrub-land (SL, in km2), either singly, or in combination, and/or the ratio of each land type to reservoir area (RA in km2) and reservoir capacity (RC in km3), were explored. Highly significant relationships were evident between FY to the ratio of SL and/or FC+SL to RA and/or RC. Similarly, the above land-use types to RA and RC ratios were significantly related to limnological features of the reservoirs. The relationships of FY to various parameters obtained in this study were much better correlated than those relationships of FY to limnological and biological parameters used in yield prediction in tropical and temperate lacustrine waters previously.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This letter addresses the issue of joint space-time trellis decoding and channel estimation in time-varying fading channels that are spatially and temporally correlated. A recursive space-time receiver which incorporates per-survivor processing (PSP) and Kalman filtering into the Viterbi algorithm is proposed. This approach generalizes existing work to the correlated fading channel case. The channel time-evolution is modeled by a multichannel autoregressive process, and a bank of Kalman filters is used to track the channel variations. Computer simulation results show that a performance close to the maximum likelihood receiver with perfect channel state information (CSI) can be obtained. The effects of the spatial correlation on the performance of a receiver that assumes independent fading channels are examined.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Background: Constraint-based modeling of reconstructed genome-scale metabolic networks has been successfully applied on several microorganisms. In constraint-based modeling, in order to characterize all allowable phenotypes, network-based pathways, such as extreme pathways and elementary flux modes, are defined. However, as the scale of metabolic network rises, the number of extreme pathways and elementary flux modes increases exponentially. Uniform random sampling solves this problem to some extent to study the contents of the available phenotypes. After uniform random sampling, correlated reaction sets can be identified by the dependencies between reactions derived from sample phenotypes. In this paper, we study the relationship between extreme pathways and correlated reaction sets.

Results: Correlated reaction sets are identified for E. coli core, red blood cell and Saccharomyces cerevisiae metabolic networks respectively. All extreme pathways are enumerated for the former two metabolic networks. As for Saccharomyces cerevisiae metabolic network, because of the large scale, we get a set of extreme pathways by sampling the whole extreme pathway space. In most cases, an extreme pathway covers a correlated reaction set in an 'all or none' manner, which means either all reactions in a correlated reaction set or none is used by some extreme pathway. In rare cases, besides the 'all or none' manner, a correlated reaction set may be fully covered by combination of a few extreme pathways with related function, which may bring redundancy and flexibility to improve the survivability of a cell. In a word, extreme pathways show strong complementary relationship on usage of reactions in the same correlated reaction set.

Conclusion: Both extreme pathways and correlated reaction sets are derived from the topology information of metabolic networks. The strong relationship between correlated reaction sets and extreme pathways suggests a possible mechanism: as a controllable unit, an extreme pathway is regulated by its corresponding correlated reaction sets, and a correlated reaction set is further regulated by the organism's regulatory network.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This research identifies how the IT function can create agility in existing information systems. Agility is the capability to quickly sense and respond to environmental perturbations. We contrasted perspectives on agility from a widely used industry framework and that of the IS research literature. Beer’s Viable System Model was a useful meta-level theory to house agility elements from IS research and it introduced cybernetic principles to identify the processes required of the IT function. Indeed, our surveys of 70 organizations confirmed that the applied theory better correlates with reported agility than does existing industry best practice.

The research conducted two quantitative surveys to test the applied theory. The first survey mailed a Likert-type questionnaire to the clients of an Australian IT consultancy. The second survey invited international members of professional interest groups to complete a web-based questionnaire. The responses from the surveys were analyzed using partial-least-squares modeling. The data analysis positively correlated the maturity of IT function processes prescribed by the VSM and the likelihood of agility in existing information systems. We claim our findings generalize to other large organizations in OECD member countries.

The research offers an agility-capability model of the IT function to explain and predict agility in existing information systems. A further contribution is to improve industry ‘best practice’ frameworks by prescribing processes of the IT function to develop in maturity.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This paper presents a novel traffic classification scheme to improve classification performance when few training data arc available. In the proposed scheme, traffic flows are described using the discretized statistical features and flow correlation information is modeled by bag-of-flow (BoF). We solve the BoF-based traffic classification in a classifier combination framework and theoretically analyze the performance benefit. Furthermore, a new BoF-based traffic classification method is proposed to aggregate the naive Bayes (NB) predictions of the correlated flows. We also present an analysis on prediction error sensitivity of the aggregation strategies. Finally, a large number of experiments are carried out on two large-scale real-world traffic datasets to evaluate the proposed scheme. The experimental results show that the proposed scheme can achieve much better classification performance than existing state-of-the-art traffic classification methods.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This paper deals with blind separation of spatially correlated signals mixed by an instantaneous system. Taking advantage of the fact that the source signals are accessible in some man-made systems such as wireless communication systems, we preprocess the source signals in transmitters by a set of properly designed first-order precoders and then the coded signals are transmitted. At the receiving side, information about the precoders are utilized to perform signal separation. Compared with the existing precoder-based methods, the new method only employs the simplest first-order precoders, which reduces the delay in data transmission and is easier to implement in practical applications.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

With the arrival of big data era, the Internet traffic is growing exponentially. A wide variety of applications arise on the Internet and traffic classification is introduced to help people manage the massive applications on the Internet for security monitoring and quality of service purposes. A large number of Machine Learning (ML) algorithms are introduced to deal with traffic classification. A significant challenge to the classification performance comes from imbalanced distribution of data in traffic classification system. In this paper, we proposed an Optimised Distance-based Nearest Neighbor (ODNN), which has the capability of improving the classification performance of imbalanced traffic data. We analyzed the proposed ODNN approach and its performance benefit from both theoretical and empirical perspectives. A large number of experiments were implemented on the real-world traffic dataset. The results show that the performance of “small classes” can be improved significantly even only with small number of training data and the performance of “large classes” remains stable.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

For dioecious animals, reproductive success typically involves an exchange between the sexes of signals that provide information about mate location and quality. Typically, the elaborate, secondary sexual ornaments of males signal their quality, while females may signal their location and receptivity. In theory, the receptor structures that receive the latter signals may also become elaborate or enlarged in a way that ultimately functions to enhance mating success through improved mate location. The large, elaborate antennae of many male moths are one such sensory structure, and eye size may also be important in diurnal moths. Investment in these traits may be costly, resulting in trade-offs among different traits associated with mate location. For polyandrous species, such trade-offs may also include traits associated with paternity success, such as larger testes. Conversely, we would not expect this to be the case for monandrous species, where sperm competition is unlikely. We investigated these ideas by evaluating the relationship between investment in sensory structures (antennae, eye), testis, and a putative warning signal (orange hindwing patch) in field-caught males of the monandrous diurnal painted apple moth Teia anartoides (Lepidoptera: Lymantriidae) in southeastern Australia. As predicted for a monandrous species, we found no evidence that male moths with larger sensory structures had reduced investment in testis size. However, contrary to expectation, investment in sensory structures was correlated: males with relatively larger antennae also had relatively larger eyes. Intriguingly, also, the size of male orange hindwing patches was positively correlated with testis size.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

BACKGROUND: Information seeking is an important coping mechanism for dealing with chronic illness. Despite a growing number of mental health websites, there is little understanding of how patients with bipolar disorder use the Internet to seek information.

METHODS: A 39 question, paper-based, anonymous survey, translated into 12 languages, was completed by 1222 patients in 17 countries as a convenience sample between March 2014 and January 2016. All patients had a diagnosis of bipolar disorder from a psychiatrist. Data were analyzed using descriptive statistics and generalized estimating equations to account for correlated data.

RESULTS: 976 (81 % of 1212 valid responses) of the patients used the Internet, and of these 750 (77 %) looked for information on bipolar disorder. When looking online for information, 89 % used a computer rather than a smartphone, and 79 % started with a general search engine. The primary reasons for searching were drug side effects (51 %), to learn anonymously (43 %), and for help coping (39 %). About 1/3 rated their search skills as expert, and 2/3 as basic or intermediate. 59 % preferred a website on mental illness and 33 % preferred Wikipedia. Only 20 % read or participated in online support groups. Most patients (62 %) searched a couple times a year. Online information seeking helped about 2/3 to cope (41 % of the entire sample). About 2/3 did not discuss Internet findings with their doctor.

CONCLUSION: Online information seeking helps many patients to cope although alternative information sources remain important. Most patients do not discuss Internet findings with their doctor, and concern remains about the quality of online information especially related to prescription drugs. Patients may not rate search skills accurately, and may not understand limitations of online privacy. More patient education about online information searching is needed and physicians should recommend a few high quality websites.