3 resultados para Escalonamento de threads

em Glasgow Theses Service


Relevância:

10.00% 10.00%

Publicador:

Resumo:

Processors with large numbers of cores are becoming commonplace. In order to utilise the available resources in such systems, the programming paradigm has to move towards increased parallelism. However, increased parallelism does not necessarily lead to better performance. Parallel programming models have to provide not only flexible ways of defining parallel tasks, but also efficient methods to manage the created tasks. Moreover, in a general-purpose system, applications residing in the system compete for the shared resources. Thread and task scheduling in such a multiprogrammed multithreaded environment is a significant challenge. In this thesis, we introduce a new task-based parallel reduction model, called the Glasgow Parallel Reduction Machine (GPRM). Our main objective is to provide high performance while maintaining ease of programming. GPRM supports native parallelism; it provides a modular way of expressing parallel tasks and the communication patterns between them. Compiling a GPRM program results in an Intermediate Representation (IR) containing useful information about tasks, their dependencies, as well as the initial mapping information. This compile-time information helps reduce the overhead of runtime task scheduling and is key to high performance. Generally speaking, the granularity and the number of tasks are major factors in achieving high performance. These factors are even more important in the case of GPRM, as it is highly dependent on tasks, rather than threads. We use three basic benchmarks to provide a detailed comparison of GPRM with Intel OpenMP, Cilk Plus, and Threading Building Blocks (TBB) on the Intel Xeon Phi, and with GNU OpenMP on the Tilera TILEPro64. GPRM shows superior performance in almost all cases, only by controlling the number of tasks. GPRM also provides a low-overhead mechanism, called “Global Sharing”, which improves performance in multiprogramming situations. We use OpenMP, as the most popular model for shared-memory parallel programming as the main GPRM competitor for solving three well-known problems on both platforms: LU factorisation of Sparse Matrices, Image Convolution, and Linked List Processing. We focus on proposing solutions that best fit into the GPRM’s model of execution. GPRM outperforms OpenMP in all cases on the TILEPro64. On the Xeon Phi, our solution for the LU Factorisation results in notable performance improvement for sparse matrices with large numbers of small blocks. We investigate the overhead of GPRM’s task creation and distribution for very short computations using the Image Convolution benchmark. We show that this overhead can be mitigated by combining smaller tasks into larger ones. As a result, GPRM can outperform OpenMP for convolving large 2D matrices on the Xeon Phi. Finally, we demonstrate that our parallel worksharing construct provides an efficient solution for Linked List processing and performs better than OpenMP implementations on the Xeon Phi. The results are very promising, as they verify that our parallel programming framework for manycore processors is flexible and scalable, and can provide high performance without sacrificing productivity.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

This study is concerned with the significance of Jungian and post-Jungian theory to the development of the contemporary Western Goddess Movement, which includes the various self-identified nature-based, Pagan, Goddess Feminism, Goddess Consciousness, Goddess Spirituality, Wicca, and Goddess-centred faith traditions that have seen a combined increase in Western adherents over the past five decades and share a common goal to claim Goddess as an active part of Western consciousness and faith traditions. The Western Goddess Movement has been strongly influenced by Jung’s thought, and by feminist revisions of Jungian Theory, sometimes interpreted idiosyncratically, but presented as a route to personal and spiritual transformation. The analysis examines ways in which women encounter Goddess through a process of Jungian Individuation and traces the development of Jungian and post-Jungian theories by identifying the key thinkers and central ideas that helped to shape the development of the Western Goddess Movement. It does so through a close reading and analysis of five biographical ‘rebirth’ memoirs published between 1981 and 1998: Christine Downing’s (1981) The Goddess: Mythological Images of the Feminine; Jean Shinoda Bolen’s (1994) Crossing to Avalon: A Woman’s Midlife Pilgrimage; Sue Monk Kidd’s (1996) The Dance of the Dissident Daughter: A Woman’s Journey from Christian Tradition to the Sacred Feminine; Margaret Starbird’s (1998) The Goddess in the Gospels: Reclaiming the Sacred Feminine; and Phyllis Curott’s (1998) Book of Shadows: A Modern Woman’s Journey into the Wisdom of Witchcraft and the Magic of the Goddess. These five memoirs reflect the diversity of the faith traditions in the Western Goddess Movement. The enquiry centres upon two parallel and complementary research threads: 1) critically examining the content of the memoirs in order to determine their contribution to the development of the Goddess Movement and 2) charting and sourcing the development of the major Jungian and post-Jungian theories championed in the memoirs in order to evaluate the significance of Jungian and post-Jungian thought in the Movement. The aim of this study was to gain a better understanding of the original research question: what is the significance of Jungian and post-Jungian theory for the development of the Western Goddess Movement? Each memoir is subjected to critical review of its intended audiences, its achievements, its functions and strengths, and its theoretical frameworks. Research results offered more than the experiences of five Western women, it also provided evidence to analyse the significance of Jungian and post-Jungian theory to the development of the Western Goddess Movement. The findings demonstrate the vital contributions of the analytical psychology of Carl Jung, and post-Jungians M Esther Harding, Erich Neumann, Christine Downing, E.C. Whitmont, and Jean Shinoda Bolen; the additional contributions of Sue Monk Kidd, Margaret Starbird, and Phyllis Curott, and exhibit Jungian and post-Jungian pathways to Goddess. Through a variety of approaches to Jungian categories, these memoirs constitute a literature of Individuation for the Western Goddess Movement.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Cache-coherent non uniform memory access (ccNUMA) architecture is a standard design pattern for contemporary multicore processors, and future generations of architectures are likely to be NUMA. NUMA architectures create new challenges for managed runtime systems. Memory-intensive applications use the system’s distributed memory banks to allocate data, and the automatic memory manager collects garbage left in these memory banks. The garbage collector may need to access remote memory banks, which entails access latency overhead and potential bandwidth saturation for the interconnection between memory banks. This dissertation makes five significant contributions to garbage collection on NUMA systems, with a case study implementation using the Hotspot Java Virtual Machine. It empirically studies data locality for a Stop-The-World garbage collector when tracing connected objects in NUMA heaps. First, it identifies a locality richness which exists naturally in connected objects that contain a root object and its reachable set— ‘rooted sub-graphs’. Second, this dissertation leverages the locality characteristic of rooted sub-graphs to develop a new NUMA-aware garbage collection mechanism. A garbage collector thread processes a local root and its reachable set, which is likely to have a large number of objects in the same NUMA node. Third, a garbage collector thread steals references from sibling threads that run on the same NUMA node to improve data locality. This research evaluates the new NUMA-aware garbage collector using seven benchmarks of an established real-world DaCapo benchmark suite. In addition, evaluation involves a widely used SPECjbb benchmark and Neo4J graph database Java benchmark, as well as an artificial benchmark. The results of the NUMA-aware garbage collector on a multi-hop NUMA architecture show an average of 15% performance improvement. Furthermore, this performance gain is shown to be as a result of an improved NUMA memory access in a ccNUMA system. Fourth, the existing Hotspot JVM adaptive policy for configuring the number of garbage collection threads is shown to be suboptimal for current NUMA machines. The policy uses outdated assumptions and it generates a constant thread count. In fact, the Hotspot JVM still uses this policy in the production version. This research shows that the optimal number of garbage collection threads is application-specific and configuring the optimal number of garbage collection threads yields better collection throughput than the default policy. Fifth, this dissertation designs and implements a runtime technique, which involves heuristics from dynamic collection behavior to calculate an optimal number of garbage collector threads for each collection cycle. The results show an average of 21% improvements to the garbage collection performance for DaCapo benchmarks.