136 resultados para sequential frequent pattern
Resumo:
In molecular biology, it is often desirable to find common properties in large numbers of drug candidates. One family of methods stems from the data mining community, where algorithms to find frequent graphs have received increasing attention over the past years. However, the computational complexity of the underlying problem and the large amount of data to be explored essentially render sequential algorithms useless. In this paper, we present a distributed approach to the frequent subgraph mining problem to discover interesting patterns in molecular compounds. This problem is characterized by a highly irregular search tree, whereby no reliable workload prediction is available. We describe the three main aspects of the proposed distributed algorithm, namely, a dynamic partitioning of the search space, a distribution process based on a peer-to-peer communication framework, and a novel receiverinitiated load balancing algorithm. The effectiveness of the distributed method has been evaluated on the well-known National Cancer Institute’s HIV-screening data set, where we were able to show close-to linear speedup in a network of workstations. The proposed approach also allows for dynamic resource aggregation in a non dedicated computational environment. These features make it suitable for large-scale, multi-domain, heterogeneous environments, such as computational grids.
Resumo:
Previous work has established the value of goal-oriented approaches to requirements engineering. Achieving clarity and agreement about stakeholders’ goals and assumptions is critical for building successful software systems and managing their subsequent evolution. In general, this decision-making process requires stakeholders to understand the implications of decisions outside the domains of their own expertise. Hence it is important to support goal negotiation and decision making with description languages that are both precise and expressive, yet easy to grasp. This paper presents work in progress to develop a pattern language for describing goal refinement graphs. The language has a simple graphical notation, which is supported by a prototype editor tool, and a symbolic notation based on modal logic.
Resumo:
In real world applications sequential algorithms of data mining and data exploration are often unsuitable for datasets with enormous size, high-dimensionality and complex data structure. Grid computing promises unprecedented opportunities for unlimited computing and storage resources. In this context there is the necessity to develop high performance distributed data mining algorithms. However, the computational complexity of the problem and the large amount of data to be explored often make the design of large scale applications particularly challenging. In this paper we present the first distributed formulation of a frequent subgraph mining algorithm for discriminative fragments of molecular compounds. Two distributed approaches have been developed and compared on the well known National Cancer Institute’s HIV-screening dataset. We present experimental results on a small-scale computing environment.
Resumo:
Pattern-recognition receptors (PRRs) detect molecular signatures of microbes and initiate immune responses to infection. Prototypical PRRs such as Toll-like receptors (TLRs) signal via a conserved pathway to induce innate response genes. In contrast, the signaling pathways engaged by other classes of putative PRRs remain ill defined. Here, we demonstrate that the β-glucan receptor Dectin-1, a yeast binding C type lectin known to synergize with TLR2 to induce TNFα and IL-12, can also promote synthesis of IL-2 and IL-10 through phosphorylation of the membrane proximal tyrosine in the cytoplasmic domain and recruitment of Syk kinase. syk−/− dendritic cells (DCs) do not make IL-10 or IL-2 upon yeast stimulation but produce IL-12, indicating that the Dectin-1/Syk and Dectin-1/TLR2 pathways can operate independently. These results identify a novel signaling pathway involved in pattern recognition by C type lectins and suggest a potential role for Syk kinase in regulation of innate immunity.
Resumo:
Oral nutrition supplements (ONS) are routinely prescribed to those with, or at risk of, malnutrition. Previous research identified poor compliance due to taste and sweetness. This paper investigates taste and hedonic liking of ONS, of varying sweetness and metallic levels, over consumption volume; an important consideration as patients are prescribed large volumes of ONS daily. A sequential descriptive profile was developed to determine the perception of sensory attributes over repeat consumption of ONS. Changes in liking of ONS following repeat consumption were characterised by a boredom test. Certain flavour (metallic taste, soya milk flavour) and mouthfeel (mouthdrying, mouthcoating) attributes built up over increased consumption volume (p 0.002). Hedonic liking data from two cohorts, healthy older volunteers (n = 32, median age 73) and patients (n = 28, median age 85), suggested such build-up was disliked. Efforts made to improve the palatability of ONS must take account of the build up of taste and mouthfeel characteristics over increased consumption volume.
Resumo:
Seed set of rice (Oryza sativa L.) is highly sensitive to short episodes of high temperature at anthesis events that are likely to be more frequent in future climates. Breeding for tolerance is therefore an essential component of adaptation to climate variability and change. Experiments were conducted in 2003 and 2004 at optimum (30 degrees C daytime) and high (35 and 38 degrees C) air temperature using parents of some prominent mapping populations (i) to determine whether there were differences in the daily flowering pattern and hence a potential heat avoidance mechanism, and (ii) to identify rice genotypes having true heat tolerance during anthesis, that is, high seed set in spikelets exposed to high temperature. Rice cultivar CG14 (O. glaberrima) reached peak anthesis earlier in the morning (1.5 h after dawn) under both control (30 degrees C) and high (38 degrees C) temperature conditions than O. sativa genotypes (>= 3 h after dawn). Exposure to high temperature (centered on the time of peak anthesis) for 6 h reduced spikelet fertility more than exposure for 2 h, and fertility was lower at 38 degrees C than at 35 degrees C. Genotypic ranking for spikelet fertility at 35 and 38 degrees C was highly correlated in both 2003 and 2004. Fertility was also highly correlated across years, suggesting a consistent and reproducible response of spikelet fertility to temperature. The check cultivar N22 was the most heat tolerant genotype (64-86% fertility at 38 degrees C) and cultivars Azucena and Moroberekan the most susceptible (<8%).
Resumo:
It is accepted that an important source of variation in the response of anoestrous ewes, to the introduction of rams, is the intensity of male stimulation. The aim of this study was to investigate strategies capable of increasing the impact and transmission of the ram stimuli. In Experiment 1, two groups of seven ewes (Bluefaced Leicester male x Swaledale female) were individually penned with one ram and for the next 6 h the rams either remained in the pen or were replaced hourly. Blood samples revealed no difference in the pattern of plasma LH secretion. In Experiment 2, three groups of 16 ewes were either introduced to one ram, individually (H) or in groups of 8 (L), or remained isolated. Ram introduction increased the plasma LH pulsatility (P < 0.001). H ewes displayed more (nine versus six) male-induced LH pulses (pulses occurring within the first 45 min) and more pulses per 8 h intervals than the L group of ewes (1.9 +/- 0.3 versus 1.3 +/- 0.3), but these differences were not significant. It was concluded that (i) frequent replacement of rams within a few hours following ram introduction to ewes does not further improve the response of ewes, especially if the ram:ewe ratio is high; (ii) the characterization of the plasma LH secretion parameters during a period of 6-8 h does not seem to be an effective method to detect small differences in the intensity of stimulation received by the ewes when exposed to rams; (iii) North Country Mule ewes (Bluefaced Leicester male x Swaledale female) in the UK respond to the presence of rams in spring (late oestrous/early anoestrous season) with an elevation in plasma LH secretion. (c) 2005 Elsevier B.V. All rights reserved.
Resumo:
This study sets out to find the best calving pattern for small-scale dairy systems in Michoacan State, central Mexico. Two models were built. First, a linear programming model was constructed to optimize calving pattern and herd structure according to metabolizable energy availability. Second, a Markov chain model was built to investigate three reproductive scenarios (good, average and poor) in order to suggest factors that maintain the calving pattern given by the linear programming model. Though it was not possible to maintain the optimal linear programming pattern, the Markov chain model suggested adopting different reproduction strategies according to period of the year that the cow is expected to calve. Comparing different scenarios, the Markov model indicated the effect of calving interval on calving pattern and herd structure.
Resumo:
In clinical trials, situations often arise where more than one response from each patient is of interest; and it is required that any decision to stop the study be based upon some or all of these measures simultaneously. Theory for the design of sequential experiments with simultaneous bivariate responses is described by Jennison and Turnbull (Jennison, C., Turnbull, B. W. (1993). Group sequential tests for bivariate response: interim analyses of clinical trials with both efficacy and safety endpoints. Biometrics 49:741-752) and Cook and Farewell (Cook, R. J., Farewell, V. T. (1994). Guidelines for monitoring efficacy and toxicity responses in clinical trials. Biometrics 50:1146-1152) in the context of one efficacy and one safety response. These expositions are in terms of normally distributed data with known covariance. The methods proposed require specification of the correlation, ρ between test statistics monitored as part of the sequential test. It can be difficult to quantify ρ and previous authors have suggested simply taking the lowest plausible value, as this will guarantee power. This paper begins with an illustration of the effect that inappropriate specification of ρ can have on the preservation of trial error rates. It is shown that both the type I error and the power can be adversely affected. As a possible solution to this problem, formulas are provided for the calculation of correlation from data collected as part of the trial. An adaptive approach is proposed and evaluated that makes use of these formulas and an example is provided to illustrate the method. Attention is restricted to the bivariate case for ease of computation, although the formulas derived are applicable in the general multivariate case.