972 results for cache consistency


Relevance:

20.00% 20.00%

Publisher:

Abstract:

Per-core scratchpad memories (or local stores) allow direct inter-core communication, with latency and energy advantages over coherent cache-based communication, especially as CMP architectures become more distributed. We have designed cache-integrated network interfaces, appropriate for scalable multicores, that combine the best of both worlds – the flexibility of caches and the efficiency of scratchpad memories: on-chip SRAM is configurably shared among caching, scratchpad, and virtualized network interface (NI) functions. This paper presents our architecture, which provides local and remote scratchpad access, either to individual words or to multiword blocks through RDMA copy. Furthermore, we introduce event responses, a technique that enables software-configurable communication and synchronization primitives. We present three event response mechanisms that expose NI functionality to software: multiword transfer initiation, completion notifications for software-selected sets of arbitrary-size transfers, and multi-party synchronization queues. We implemented these mechanisms in a four-core FPGA prototype, and measure the logic overhead over a cache-only design for basic NI functionality to be less than 20%. We also evaluate the on-chip communication performance on the prototype, as well as the performance of synchronization functions through simulation of CMPs with up to 128 cores. We demonstrate efficient synchronization, low-overhead communication, and amortized-overhead bulk transfers, which allow parallelization gains for fine-grain tasks and efficient exploitation of the hardware bandwidth.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

We investigate the computational complexity of testing dominance and consistency in CP-nets. Previously, the complexity of dominance has been determined for restricted classes in which the dependency graph of the CP-net is acyclic. However, there are preferences of interest that define cyclic dependency graphs; these are modeled with general CP-nets. In our main results, we show that both dominance and consistency for general CP-nets are PSPACE-complete. We then consider the concepts of strong dominance, dominance equivalence, and dominance incomparability, as well as several notions of optimality, and identify the complexity of the corresponding decision problems. The reductions used in the proofs are from STRIPS planning, and thus reinforce the earlier established connections between the two areas.
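
To make the two decision problems above concrete, the following is a minimal sketch, not the construction used in the paper. It assumes the usual improving-flip semantics: one outcome dominates another if it can be reached from it by a sequence of single-variable changes, each to a more preferred value given the parents' values, and a CP-net is consistent if no outcome dominates itself. The toy network (`PARENTS`, `CPT`) and all function names are hypothetical. The search is exhaustive and therefore exponential in the number of variables.

```python
from collections import deque
from itertools import product

# Hypothetical toy CP-net over two binary variables A and B (illustration only).
# PARENTS[X] lists the variables that X's preferences depend on.
# CPT[X] maps a tuple of parent values to X's values, most preferred first.
PARENTS = {"A": (), "B": ("A",)}
CPT = {
    "A": {(): (1, 0)},                    # A=1 is preferred unconditionally
    "B": {(1,): (1, 0), (0,): (0, 1)},    # B prefers to match A
}
VARS = tuple(PARENTS)


def improving_flips(outcome):
    """Yield outcomes obtained by changing one variable to a more preferred value."""
    for i, var in enumerate(VARS):
        context = tuple(outcome[VARS.index(p)] for p in PARENTS[var])
        order = CPT[var][context]
        rank = order.index(outcome[i])
        for better_value in order[:rank]:
            yield outcome[:i] + (better_value,) + outcome[i + 1:]


def dominates(better, worse):
    """True if `better` is reachable from `worse` by one or more improving flips."""
    seen = set()
    queue = deque(improving_flips(worse))
    while queue:
        current = queue.popleft()
        if current == better:
            return True
        if current not in seen:
            seen.add(current)
            queue.extend(improving_flips(current))
    return False


def is_consistent():
    """True if no outcome dominates itself (no cycle of improving flips)."""
    return not any(dominates(o, o) for o in product((0, 1), repeat=len(VARS)))
```

In this toy network the dependency graph is acyclic, so the consistency check succeeds. Outcomes are tuples ordered as `VARS`, so `dominates((1, 1), (0, 1))` asks whether A=1, B=1 dominates A=0, B=1.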

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Task-based dataflow programming models and runtimes are emerging as promising candidates for programming multicore and manycore architectures. These programming models dynamically analyze task dependencies at runtime and schedule independent tasks concurrently on the processing elements. In such models, cache locality, which is critical for performance, becomes harder to achieve in the presence of fine-grain tasks and in architectures with many simple cores.

This paper presents a combined hardware-software approach to improve cache locality and offer better performance in terms of execution time and energy in the memory system. We propose the explicit bulk prefetcher (EBP) and epoch-based cache management (ECM) to help runtimes prefetch task data and guide the replacement decisions in caches. The runtime software can use this hardware support to expose its internal knowledge about the tasks to the architecture and achieve more efficient task-based execution. Our combined scheme outperforms HW-only prefetchers and state-of-the-art replacement policies, improves performance by an average of 17%, generates on average 26% fewer L2 misses, and consumes on average 28% less energy in the components of the memory system.
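
The runtime dependency analysis mentioned in this abstract can be illustrated with a small sketch. It assumes each task declares the data it reads and writes, derives read-after-write, write-after-write, and write-after-read dependencies in submission order, and groups tasks into waves that could run concurrently. The function names and the task list are hypothetical. The sketch does not model EBP or ECM. It only shows the kind of per-task data knowledge a runtime holds and could pass on to such hardware.

```python
def build_dependencies(tasks):
    """tasks: list of (name, reads, writes) in submission order.

    Returns {task name: set of names of tasks it must wait for}.
    """
    last_writer = {}          # datum -> task that last wrote it
    readers_since_write = {}  # datum -> tasks that read it since that write
    deps = {}
    for name, reads, writes in tasks:
        preds = set()
        for datum in reads:
            if datum in last_writer:
                preds.add(last_writer[datum])                 # read-after-write
        for datum in writes:
            if datum in last_writer:
                preds.add(last_writer[datum])                 # write-after-write
            preds.update(readers_since_write.get(datum, ()))  # write-after-read
        deps[name] = preds
        for datum in reads:
            readers_since_write.setdefault(datum, set()).add(name)
        for datum in writes:
            last_writer[datum] = name
            readers_since_write[datum] = set()
    return deps


def schedule_waves(deps):
    """Group tasks into successive waves of mutually independent tasks."""
    remaining = {task: set(preds) for task, preds in deps.items()}
    waves = []
    while remaining:
        ready = sorted(task for task, preds in remaining.items() if not preds)
        waves.append(ready)
        for task in ready:
            del remaining[task]
        for preds in remaining.values():
            preds.difference_update(ready)
    return waves


# Hypothetical task graph: t3 consumes the outputs of t1 and t2, and so on.
tasks = [
    ("t1", [], ["x"]),
    ("t2", [], ["y"]),
    ("t3", ["x", "y"], ["z"]),
    ("t4", ["x"], ["w"]),
    ("t5", ["z"], ["x"]),
]
waves = schedule_waves(build_dependencies(tasks))
print(waves)
```

Because the runtime knows each task's read and write sets before the task runs, it knows which data a ready task will touch and which data no pending task still needs.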

Relevance:

20.00% 20.00%

Publisher:

Abstract:

The present study investigated the long-term consistency of individual differences in dairy cattle’s responses in tests of behavioural and hypothalamo–pituitary–adrenocortical (HPA) axis reactivity, as well as the relationship between responsiveness in behavioural tests and the reaction to first milking. Two cohorts of heifer calves, Cohort 1 (N = 25) and Cohort 2 (N = 16), were examined longitudinally from the rearing period until adulthood. Cohort 1 heifers were subjected to open field (OF), novel object (NO), restraint, and response-to-a-human tests at 7 months of age, and were again observed in an OF test during first pregnancy between 22 and 24 months of age. Subsequently, inhibition of milk ejection and stepping and kicking behaviours were recorded in Cohort 1 heifers during their first machine milking. Cohort 2 heifers were individually subjected to OF and NO tests as well as two HPA axis reactivity tests (determining ACTH and/or cortisol response profiles after administration of exogenous CRH and ACTH, respectively) at 6 months of age and during first lactation at approximately 29 months of age. Principal component analysis (PCA) was used to condense correlated response measures (to behavioural tests and to milking) within ages into independent dimensions underlying heifers’ reactivity. Heifers demonstrated consistent individual differences in locomotion and vocalisation during an OF test from rearing to first pregnancy (Cohort 1) or first lactation (Cohort 2). Individual differences in struggling in a restraint test at 7 months of age reliably predicted those in OF locomotion during first pregnancy in Cohort 1 heifers. Cohort 2 animals with high cortisol responses to OF and NO tests and high avoidance of the novel object at 6 months of age also exhibited enhanced cortisol responses to OF and NO tests at 29 months of age.
Measures of HPA axis reactivity, locomotion, vocalisation and adrenocortical and behavioural responses to novelty were largely uncorrelated, supporting the idea that stress responsiveness in dairy cows is mediated by multiple independent underlying traits. Inhibition of milk ejection and stepping and kicking behaviours during first machine milking were not related to earlier struggling during restraint, locomotor responses to OF and NO tests, or the behavioural interaction with a novel object. Heifers with high rates of OF and NO vocalisation and short latencies to first contact with the human at 7 months of age exhibited better milk ejection during first machine milking. This suggests that low underlying sociality might be implicated in the inhibition of milk ejection at the beginning of lactation in heifers.