229 results for Cluster Counting Algorithm
Abstract:
A new sparse kernel density estimator is introduced. Our main contribution is a recursive algorithm that selects significant kernels one at a time using the minimum integrated square error (MISE) criterion. The proposed approach is simple to implement and the associated computational cost is very low. Numerical examples demonstrate that the proposed approach is effective in constructing sparse kernel density estimators with accuracy competitive with that of existing kernel density estimators.
Abstract:
We have optimised the atmospheric radiation algorithm of the FAMOUS climate model on several hardware platforms. The optimisation involved translating the Fortran code to C and restructuring the algorithm around the computation of a single air column. Instead of the existing MPI-based domain decomposition, we used a task queue and a thread pool to schedule the computation of individual columns on the available processors. Finally, four air columns are packed together in a single data structure and computed simultaneously using Single Instruction Multiple Data operations. The modified algorithm runs more than 50 times faster on the CELL’s Synergistic Processing Elements than on its main PowerPC processing element. On Intel-compatible processors, the new radiation code runs 4 times faster. On the tested graphics processor, using OpenCL, we find a speed-up of more than 2.5 times as compared to the original code on the main CPU. Because the radiation code takes more than 60% of the total CPU time, FAMOUS executes more than twice as fast. Our version of the algorithm returns bit-wise identical results, which demonstrates the robustness of our approach. We estimate that this project required around two and a half man-years of work.
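The scheduling idea described above (independent air columns, grouped four at a time and handed to a pool of worker threads) can be sketched as follows. This is only an illustration of the structure, not the FAMOUS code: `radiate_pack`, `run_columns` and the placeholder "physics" are hypothetical names and stand-ins, and Python threads do not reproduce the SIMD speed-up reported in the abstract.

```python
from concurrent.futures import ThreadPoolExecutor

PACK = 4  # columns processed together, mirroring the four-column packing


def radiate_pack(pack):
    """Hypothetical stand-in for the radiation kernel: one result per column."""
    # Each column is a list of layer values; the "physics" is a placeholder sum.
    return [sum(column) for column in pack]


def run_columns(columns, workers=4):
    # Split the independent columns into packs of up to PACK columns.
    packs = [columns[i:i + PACK] for i in range(0, len(columns), PACK)]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # map() preserves input order, so results line up with the columns.
        results = pool.map(radiate_pack, packs)
        return [value for pack_result in results for value in pack_result]


# Ten toy columns with three layers each.
columns = [[float(i + k) for k in range(3)] for i in range(10)]
fluxes = run_columns(columns)
```

Because each pack is an independent task, the pool can hand work to whichever thread is free, which is the property that replaces a fixed domain decomposition in this sketch.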
Abstract:
Exascale systems are the next frontier in high-performance computing and are expected to deliver a performance of the order of 10^18 operations per second using massive multicore processors. Very large- and extreme-scale parallel systems pose critical algorithmic challenges, especially related to concurrency, locality and the need to avoid global communication patterns. This work investigates a novel protocol for dynamic group communication that can be used to remove the global communication requirement and to reduce the communication cost in parallel formulations of iterative data mining algorithms. The protocol is used to provide a communication-efficient parallel formulation of the k-means algorithm for cluster analysis. The approach is based on a collective communication operation for dynamic groups of processes and exploits non-uniform data distributions. Non-uniform data distributions can be either found in real-world distributed applications or induced by means of multidimensional binary search trees. The analysis of the proposed dynamic group communication protocol has shown that it does not introduce significant communication overhead. The parallel clustering algorithm has also been extended to accommodate an approximation error, which allows a further reduction of the communication costs. The effectiveness of the exact and approximate methods has been tested in a parallel computing system with 64 processors and in simulations with 1024 processing elements.
Abstract:
Global communication requirements and load imbalance of some parallel data mining algorithms are the major obstacles to exploiting the computational power of large-scale systems. This work investigates how non-uniform data distributions can be exploited to remove the global communication requirement and to reduce the communication cost in iterative parallel data mining algorithms. In particular, the analysis focuses on one of the most influential and popular data mining methods, the k-means algorithm for cluster analysis. The straightforward parallel formulation of the k-means algorithm requires a global reduction operation at each iteration step, which hinders its scalability. This work studies a different parallel formulation of the algorithm where the requirement of global communication can be relaxed while still providing the exact solution of the centralised k-means algorithm. The proposed approach exploits a non-uniform data distribution which can be either found in real-world distributed applications or induced by means of multi-dimensional binary search trees. The approach can also be extended to accommodate an approximation error which allows a further reduction of the communication costs.
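For context, the straightforward parallel formulation mentioned above can be sketched in a single process: each partition computes per-cluster sums and counts, and a reduction step adds them up before the centroids are updated. This is a minimal sketch of the baseline only, not the relaxed-communication method the abstract proposes; the function names and the toy data are assumptions made for illustration.

```python
def nearest(point, centroids):
    # Index of the centroid with the smallest squared Euclidean distance.
    return min(range(len(centroids)),
               key=lambda j: sum((p - c) ** 2
                                 for p, c in zip(point, centroids[j])))


def local_stats(points, centroids):
    # Per-process work: coordinate sums and point counts for each cluster.
    dim = len(centroids[0])
    sums = [[0.0] * dim for _ in centroids]
    counts = [0] * len(centroids)
    for point in points:
        j = nearest(point, centroids)
        counts[j] += 1
        for d in range(dim):
            sums[j][d] += point[d]
    return sums, counts


def kmeans_step(partitions, centroids):
    # Stand-in for the global reduction: add the partial statistics
    # produced by every partition.
    dim = len(centroids[0])
    total_sums = [[0.0] * dim for _ in centroids]
    total_counts = [0] * len(centroids)
    for part in partitions:
        sums, counts = local_stats(part, centroids)
        for j in range(len(centroids)):
            total_counts[j] += counts[j]
            for d in range(dim):
                total_sums[j][d] += sums[j][d]
    # A centroid with no assigned points is left unchanged.
    return [[s / total_counts[j] for s in total_sums[j]]
            if total_counts[j] else list(centroids[j])
            for j in range(len(centroids))]


# Two partitions, each holding points that sit near one centroid only.
partitions = [[(0.0, 0.0), (0.0, 2.0)], [(10.0, 10.0), (10.0, 12.0)]]
centroids = [[1.0, 1.0], [9.0, 9.0]]
new_centroids = kmeans_step(partitions, centroids)
```

In this toy data each partition contributes to a single cluster, which is the kind of non-uniform distribution under which most of the summed terms are zero and a full global reduction does more communication than strictly needed.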
Abstract:
Reinforcing the Low Voltage (LV) distribution network will become essential to ensure it remains within its operating constraints as demand on the network increases. The deployment of energy storage in the distribution network provides an alternative to conventional reinforcement. This paper presents a control methodology for energy storage to reduce peak demand in a distribution network based on day-ahead demand forecasts and historical demand data. The control methodology pre-processes the forecast data prior to a planning phase to build in resilience to the inevitable errors between the forecasted and actual demand. The algorithm uses no real-time adjustment and so has an economic advantage over traditional storage control algorithms. Results show that peak demand on a single phase of a feeder can be reduced even when there are differences between the forecasted and the actual demand. In particular, results demonstrate that, when the algorithm is applied to a large number of single-phase demand aggregations, it is possible to identify which of these aggregations are the most suitable candidates for the control methodology.
Abstract:
The mammalian lignan, enterolactone, has been shown to reduce the proliferation of the earlier stages of prostate cancer at physiological concentrations in vitro. However, efficacy in the later stages of the disease occurs at concentrations difficult to achieve through dietary modification. We have therefore investigated what concentration(s) of enterolactone can restrict proliferation in multiple stages of prostate cancer using an in vitro model system of prostate disease. We determined that enterolactone at 20 μM significantly restricted the proliferation of mid and late stage models of prostate disease. These effects were strongly associated with changes in the expression of the DNA licencing genes (GMNN, CDT1, MCM2 and 7), in reduced expression of the miR-106b cluster (miR-106b, miR-93, and miR-25), and in increased expression of the PTEN tumour suppressor gene. We have shown anti-proliferative effects of enterolactone in earlier stages of prostate disease than previously reported and that these effects are mediated, in part, by microRNA-mediated regulation.
Abstract:
The aim of the present study was to investigate whether the saliency effect for word beginnings reported in children with dyslexia (Marshall & van der Lely, 2009) can also be found in typically developing (TD) children. Thirty-four TD Italian children aged 8-10 completed two specifically designed tasks: a production task and a perception task. Both tasks used nonwords containing clusters consisting of plosive plus liquid (e.g. pl). Clusters could be either in a stressed or in an unstressed syllable, and could be either in initial position (first syllable) or in medial position (second syllable). In the production task, the children were asked to repeat the nonwords. In the perception task, the children were asked to discriminate between two nonwords differing in one phoneme belonging to a cluster by reporting whether two repetitions were the same or different. Results from the production task showed that children are more accurate in repeating stressed than unstressed syllables, but there was no difference with respect to position of the cluster. Results from the perception task showed that children performed more accurately when discriminating word-initial contrasts than when discriminating word-medial contrasts, especially if the cluster was unstressed. Implications of this finding for clinical assessments are discussed.
Abstract:
The launch of the Double Star mission has provided the opportunity to monitor events at distinct locations on the dayside magnetopause, in coordination with the quartet of Cluster spacecraft. We present results of two such coordinated studies. In the first, on 6 April 2004, both Cluster and the Double Star TC-1 spacecraft were on outbound transits through the dawn-side magnetosphere. Cluster observed northward moving FTEs with +/- polarity, whereas TC-1 saw -/+ polarity FTEs. The strength, motion and occurrence of the FTE signatures change somewhat according to changes in IMF clock angle. These observations are consistent with ongoing reconnection on the dayside magnetopause, resulting in a series of flux transfer events (FTEs) seen both at Cluster and TC-1. The observed polarity and motion of each FTE signature advocates the existence of an active reconnection region consistently located between the positions of Cluster and TC-1, lying north and south of the reconnection line, respectively. This scenario is supported by the application of a model, designed to track flux tube motion, under the prevailing interplanetary conditions. The results from the model confirm the observational evidence that the low-latitude FTE dynamics are sensitive to changes in convected upstream conditions. In particular, changing the interplanetary magnetic field (IMF) clock angle in the model predicts that TC-1 should miss the resulting FTEs more often than Cluster, as is observed. For the second conjunction, on 4 January 2005, the Cluster and TC-1 spacecraft all exited the dusk-side magnetosphere almost simultaneously, with TC-1 lying almost equatorial and Cluster at northern latitudes at about 4 RE from TC-1. The spacecraft traverse the magnetopause during a strong reversal in the IMF from northward to southward and a number of magnetosheath FTE signatures are subsequently observed.
One coordinated FTE, studied in detail by Pu et al. [this issue], carries an inflowing energetic electron population and shows a motion and orientation which is similar at all spacecraft and consistent with the predictions of the model for the flux tube dynamics, given a near sub-solar reconnection line. This event can be interpreted either as the passage of two parallel flux tubes arising from adjacent x-line positions, or as a crossing of a single flux tube at different positions.
Abstract:
The recent launch of the equatorial spacecraft of the Double Star mission, TC-1, has provided an unprecedented opportunity to monitor the southern hemisphere dayside magnetopause boundary layer in conjunction with northern hemisphere observations by the quartet of Cluster spacecraft. We present first results of one such situation where, on 6 April 2004, both Cluster and the Double Star TC-1 spacecraft were on outbound transits through the dawnside magnetosphere. The observations are consistent with ongoing reconnection on the dayside magnetopause, resulting in a series of flux transfer events (FTEs) seen both at Cluster and TC-1, which appear to lie north and south of the reconnection line, respectively. In fact, the observed polarity and motion of each FTE signature advocates the existence of an active reconnection region consistently located between the positions of Cluster and TC-1, with Cluster observing northward moving FTEs with +/− polarity, whereas TC-1 sees −/+ polarity FTEs. This assertion is further supported by the application of a model designed to track flux tube motion for the prevailing interplanetary conditions. The results from this model show, in addition, that the low-latitude FTE dynamics are sensitive to changes in convected upstream conditions. In particular, changing the interplanetary magnetic field (IMF) clock angle in the model suggests that TC-1 should miss the resulting FTEs more often than Cluster and this is borne out by the observations.
Abstract:
A realistic representation of the North Atlantic tropical cyclone tracks is crucial as it allows, for example, explaining potential changes in US landfalling systems. Here we present a tentative study, which examines the ability of recent climate models to represent North Atlantic tropical cyclone tracks. Tracks from two types of climate models are evaluated: explicit tracks are obtained from tropical cyclones simulated in regional or global climate models with moderate to high horizontal resolution (1° to 0.25°), and downscaled tracks are obtained using a downscaling technique with large-scale environmental fields from a subset of these models. For both configurations, tracks are objectively separated into four groups using a cluster technique, leading to a zonal and a meridional separation of the tracks. The meridional separation largely captures the separation between deep tropical and sub-tropical, hybrid or baroclinic cyclones, while the zonal separation segregates Gulf of Mexico and Cape Verde storms. The properties of the tracks’ seasonality, intensity and power dissipation index in each cluster are documented for both configurations. Our results show that except for the seasonality, the downscaled tracks better capture the observed characteristics of the clusters. We also use three different idealized scenarios to examine the possible future changes of tropical cyclone tracks under 1) warming sea surface temperature, 2) increasing carbon dioxide, and 3) a combination of the two. The response to each scenario is highly variable depending on the simulation considered. Finally, we examine the role of each cluster in these future changes and find no preponderant contribution of any single cluster over the others.
Abstract:
This study monitored the dynamics and diversity of the human faecal 'Atopobium cluster' over a 3-month period using a polyphasic approach. Fresh faecal samples were collected fortnightly from 13 healthy donors (6 males and 7 females) aged between 26 and 61 years. Fluorescence in situ hybridization was used to enumerate total (EUB338mix) and 'Atopobium cluster' (ATO291) bacteria, with counts ranging between 1.12 × 10^11 and 9.95 × 10^11, and 1.03 × 10^9 and 1.16 × 10^11 cells (g dry weight faeces)^-1, respectively. The 'Atopobium cluster' population represented 0.2-22 % of the total bacteria, with proportions donor-dependent. Denaturing gradient gel electrophoresis (DGGE) using 'Atopobium cluster'-specific primers demonstrated faecal populations of these bacteria were relatively stable, with bands identified as Collinsella aerofaciens, Collinsella intestinalis/Collinsella stercoris, Collinsella tanakaei, Coriobacteriaceae sp. PEAV3-3, Eggerthella lenta, Gordonibacter pamelaeae, Olsenella profusa, Olsenella uli and Paraeggerthella hongkongensis in the DGGE profiles of individuals. Colony PCR was used to identify 'Atopobium cluster' bacteria isolated from faeces (n = 224 isolates). 16S rRNA gene sequence analysis of isolates demonstrated Collinsella aerofaciens represented the predominant (88 % of isolates) member of the 'Atopobium cluster' found in human faeces, being found in nine individuals. Eggerthella lenta was identified in three individuals (3.6 % of isolates). Isolates of Collinsella tanakaei, an 'Enorma' sp. and representatives of novel species belonging to the 'Atopobium cluster' were also identified in the study. Phenotypic characterization of the isolates demonstrated their highly saccharolytic nature and heterogeneous phenotypic profiles, and 97 % of the isolates displayed lipase activity.
Abstract:
On 14 January 2001, the four Cluster spacecraft passed through the northern magnetospheric mantle in close conjunction to the EISCAT Svalbard Radar (ESR) and approached the post-noon dayside magnetopause over Greenland between 13:00 and 14:00 UT. During that interval, a sudden reorganisation of the high-latitude dayside convection pattern occurred after 13:20 UT, most likely caused by a direction change of the solar wind magnetic field. The result was an eastward and poleward directed flow-channel, as monitored by the SuperDARN radar network and also by arrays of ground-based magnetometers in Canada, Greenland and Scandinavia. After an initial eastward and later poleward expansion of the flow-channel between 13:20 and 13:40 UT, the four Cluster spacecraft and the field line footprints covered by the eastward looking scan cycle of the Sondre Stromfjord incoherent scatter radar were engulfed by cusp-like precipitation with transient magnetic and electric field signatures. In addition, the EISCAT Svalbard Radar detected strong transient effects of the convection reorganisation, a poleward moving precipitation, and a fast ion flow-channel in association with the auroral structures that suddenly formed to the west and north of the radar. From a detailed analysis of the coordinated Cluster and ground-based data, it was found that this extraordinary transient convection pattern had indeed moved the cusp precipitation from its former pre-noon position into the late post-noon sector, allowing for the first and quite unexpected encounter of the cusp by the Cluster spacecraft. Our findings illustrate the large amplitude of cusp dynamics even in response to moderate solar wind forcing. The global ground-based data prove to be an invaluable tool to monitor the dynamics and width of the affected magnetospheric regions.
Abstract:
We study a series of transient entries into the low-latitude boundary layer (LLBL) of all four Cluster spacecraft during an outbound pass through the mid-afternoon magnetopause ([X(GSM), Y(GSM), Z(GSM)] ≈ [2, 7, 9] R(E)). The events take place during an interval of northward IMF, as seen in the data from the ACE satellite and lagged by a propagation delay of 75 min that is well-defined by two separate studies: (1) the magnetospheric variations prior to the northward turning (Lockwood et al., 2001, this issue) and (2) the field clock angle seen by Cluster after it had emerged into the magnetosheath (Opgenoorth et al., 2001, this issue). With an additional lag of 16.5 min, the transient LLBL events correlate well with swings of the IMF clock angle (in GSM) to near 90°. Most of this additional lag is explained by ground-based observations, which reveal signatures of transient reconnection in the pre-noon sector that then take 10-15 min to propagate eastward to 15 MLT, where they are observed by Cluster. The eastward phase speed of these signatures agrees very well with the motion deduced by the cross-correlation of the signatures seen on the four Cluster spacecraft. The evidence that these events are reconnection pulses includes: transient erosion of the noon 630 nm (cusp/cleft) aurora to lower latitudes; transient and travelling enhancements of the flow into the polar cap, imaged by the AMIE technique; and poleward-moving events moving into the polar cap, seen by the EISCAT Svalbard Radar (ESR). A pass of the DMSP-F15 satellite reveals that the open field lines near noon have been opened for some time: the more recently opened field lines were found closer to dusk where the flow transient and the poleward-moving event intersected the satellite pass.
The events at Cluster have ion and electron characteristics predicted and observed by Lockwood and Hapgood (1998) for a Flux Transfer Event (FTE), with allowance for magnetospheric ion reflection at Alfvénic disturbances in the magnetopause reconnection layer. Like FTEs, the events are about 1 R(E) in their direction of motion and show a rise in the magnetic field strength, but unlike FTEs, in general, they show no pressure excess in their core and hence, no characteristic bipolar signature in the boundary-normal component. However, most of the events were observed when the magnetic field was southward, i.e. on the edge of the interior magnetic cusp, or when the field was parallel to the magnetic equatorial plane. Only when the satellite begins to emerge from the exterior boundary (when the field was northward), do the events start to show a pressure excess in their core and the consequent bipolar signature. We identify the events as the first observations of FTEs at middle altitudes.
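The abstract above relies on time lags between signals, both the propagation delays applied to the IMF data and the cross-correlation of signatures between spacecraft. A generic lag estimate by cross-correlation can be sketched as below. It is not the authors' procedure: `best_lag`, the sample series and the equal-length, evenly sampled inputs are assumptions for illustration.

```python
def best_lag(reference, delayed, max_lag):
    """Return the lag (in samples) at which `delayed` best matches `reference`.

    Both series are assumed to have the same length and sampling interval.
    """
    def mean(xs):
        return sum(xs) / len(xs)

    ref_mean, del_mean = mean(reference), mean(delayed)
    best, best_score = 0, float("-inf")
    for lag in range(max_lag + 1):
        # Pair reference[i] with delayed[i + lag] over the overlapping samples.
        n = len(reference) - lag
        score = sum((reference[i] - ref_mean) * (delayed[i + lag] - del_mean)
                    for i in range(n)) / n
        if score > best_score:
            best, best_score = lag, score
    return best


# A toy pulse and a copy of it delayed by three samples.
signal = [0, 0, 1, 3, 1, 0, 0, 0, 0, 0, 0, 0]
shifted = [0] * 3 + signal[:-3]
lag = best_lag(signal, shifted, max_lag=6)
```

Multiplying the returned sample lag by the sampling interval gives a delay in time units; with known spacecraft separations, such delays are what allow a phase speed to be deduced.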