40 results for Pipeline


Relevance:

20.00%

Publisher:

Abstract:

The most promising way to maintain reliable data transfer across the rapidly fluctuating channels used by next-generation multiple-input multiple-output communications schemes is to exploit run-time variable modulation and antenna configurations. This demands that the baseband signal processing architectures employed in the communications terminals provide low cost and high performance with run-time reconfigurability. We present a softcore-processor-based solution to this issue, and show for the first time that such programmable architectures can enable real-time data operation for cutting-edge standards such as 802.11n; furthermore, by exploiting deep processing pipelines and interleaved task execution, the cost and performance of these architectures are shown to be on a par with traditional dedicated-circuit-based solutions. We believe this to be the first such programmable architecture to achieve this, and the combination of implementation efficiency and programmability makes this implementation style the most promising approach for hosting such dynamic architectures.

Relevance:

20.00%

Publisher:

Abstract:

Traditional static analysis fails to auto-parallelize programs with a complex control and data flow. Furthermore, thread-level parallelism in such programs is often restricted to pipeline parallelism, which can be hard to discover by a programmer. In this paper we propose a tool that, based on profiling information, helps the programmer to discover parallelism. The programmer hand-picks the code transformations from among the proposed candidates which are then applied by automatic code transformation techniques.

This paper contributes to the literature by presenting a profiling tool for discovering thread-level parallelism. We track dependencies at the whole-data-structure level rather than at the element level or byte level in order to limit the profiling overhead. We perform a thorough analysis of the needs and costs of this technique. Furthermore, we present and validate the belief that programs with complex control and data flow contain significant amounts of exploitable coarse-grain pipeline parallelism in the program’s outer loops. This observation validates our approach to whole-data-structure dependencies. As state-of-the-art compilers focus on loops iterating over data structure members, this observation also explains why our approach finds coarse-grain pipeline parallelism in cases that have remained out of reach for state-of-the-art compilers. In cases where traditional compilation techniques do find parallelism, our approach discovers higher degrees of parallelism, yielding a 40% speedup over traditional compilation techniques. Moreover, we demonstrate real speedups on multiple hardware platforms.

Relevance:

20.00%

Publisher:

Abstract:

In this paper we describe the design of a parallel solution of the inhomogeneous Schrödinger equation, which arises in the construction of continuum orbitals in the R-matrix theory of atomic continuum processes. A prototype system is described which has been programmed in occam2 and implemented on a bi-directional pipeline of transputers. Some timing results for the prototype system are presented, and the development of a full production system is discussed.

Relevance:

20.00%

Publisher:

Abstract:

Ubiquitous parallel computing aims to make parallel programming accessible to a wide variety of programming areas using deterministic and scale-free programming models built on a task abstraction. However, it remains hard to reconcile these attributes with pipeline parallelism, where the number of pipeline stages is typically hard-coded in the program and defines the degree of parallelism.

This paper introduces hyperqueues, a programming abstraction that enables the construction of deterministic and scale-free pipeline parallel programs. Hyperqueues extend the concept of Cilk++ hyperobjects to provide thread-local views on a shared data structure. While hyperobjects are organized around private local views, hyperqueues require shared concurrent views on the underlying data structure. We define the semantics of hyperqueues and describe their implementation in a work-stealing scheduler. We demonstrate scalable performance on pipeline-parallel PARSEC benchmarks and find that hyperqueues provide comparable or up to 30% better performance than POSIX threads and Intel's Threading Building Blocks. The latter are highly tuned to the number of available processing cores, while programs using hyperqueues are scale-free.
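As a point of contrast for the abstract above, the following is a minimal sketch of a conventional pipeline-parallel program, in which the number of stages is fixed by the caller and directly determines the degree of parallelism. It is not an implementation of hyperqueues; the function name `run_pipeline` and the sentinel `_DONE` are hypothetical names chosen for illustration, and the sketch uses only Python's standard `queue` and `threading` modules.

```python
import queue
import threading

_DONE = object()  # hypothetical sentinel marking the end of the stream


def run_pipeline(items, stages):
    """Run each item through `stages` in order, one thread per stage.

    The degree of parallelism equals len(stages): this is the hard-coded
    coupling between stage count and parallelism described in the abstract.
    """
    # One FIFO queue in front of every stage, plus one for the final output.
    queues = [queue.Queue() for _ in range(len(stages) + 1)]

    def worker(fn, inbox, outbox):
        while True:
            item = inbox.get()
            if item is _DONE:
                outbox.put(_DONE)  # forward the end-of-stream marker
                return
            outbox.put(fn(item))

    threads = [
        threading.Thread(target=worker, args=(fn, queues[i], queues[i + 1]))
        for i, fn in enumerate(stages)
    ]
    for t in threads:
        t.start()

    for item in items:
        queues[0].put(item)
    queues[0].put(_DONE)

    # A single thread per stage and FIFO queues keep the output order fixed.
    results = []
    while True:
        item = queues[-1].get()
        if item is _DONE:
            break
        results.append(item)

    for t in threads:
        t.join()
    return results
```

For example, `run_pipeline(range(5), [lambda v: v + 1, lambda v: v * 2])` runs two stage threads; adding cores does not help unless the program is rewritten with more stages.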

Relevance:

10.00%

Publisher:

Abstract:

We introduce a new survey of massive stars in the Galaxy and the Magellanic Clouds using the Fibre Large Array Multi-Element Spectrograph (FLAMES) instrument at the Very Large Telescope (VLT). Here we present observations of 269 Galactic stars with the FLAMES-Giraffe Spectrograph (R ≃ 25 000), in fields centered on the open clusters NGC3293, NGC4755 and NGC6611. These data are supplemented by a further 50 targets observed with the Fibre-Fed Extended Range Optical Spectrograph (FEROS, R = 48 000). Following a description of our scientific motivations and target selection criteria, the data reduction methods are described; of critical importance, the FLAMES reduction pipeline is found to yield spectra that are in excellent agreement with less automated methods. Spectral classifications and radial velocity measurements are presented for each star, with particular attention paid to morphological peculiarities and evidence of binarity. These observations represent a significant increase in the known spectral content of NGC3293 and NGC4755, and will serve as standards against which our subsequent FLAMES observations in the Magellanic Clouds will be compared.

Relevance:

10.00%

Publisher:

Abstract:

The SuperWASP cameras are wide-field imaging systems at the Observatorio del Roque de los Muchachos on the island of La Palma in the Canary Islands, and at the Sutherland Station of the South African Astronomical Observatory. Each instrument has a field of view of some 482 deg² with an angular scale of 13.7″ pixel⁻¹, and is capable of delivering photometry with accuracy better than 1% for objects having V~7.0-11.5. Lower quality data for objects brighter than V~15.0 are stored in the project archive. The systems, while designed to monitor fields with high cadence, are capable of surveying the entire visible sky every 40 minutes. Depending on the observational strategy, the data rate can be up to 100 Gbytes per night. We have produced a robust, largely automatic reduction pipeline and advanced archive, which are used to serve the data products to the consortium members. The main science aim of these systems is to search for bright transiting exoplanet systems suitable for spectroscopic follow-up observations. The first 6-month season of SuperWASP-North observations produced light curves of ~6.7 million objects with 12.9 billion data points.

Relevance:

10.00%

Publisher:

Abstract:

SuperWASP is an ultra-wide field (over 300 sq. degrees) photometric survey project designed to monitor stars between 7 and 15 mag to high precision and with high cadence over long (≥ 2 months) timescales. The primary science goal of this project is the detection of exoplanetary transits, as well as NEOs and optical transients. The resulting photometric catalogue will be made public via a web-based interface. The SuperWASP instrument consists of an array of cameras, each with a 7.8° × 7.8° field of view, guided by a robotic fork mount and sited in a fibreglass enclosure at the Observatorio del Roque de los Muchachos (ORM), La Palma, Canary Islands. In this progress report, we describe the specifications of the instrument, its semi-automated operation and pipeline data reduction.

Relevance:

10.00%

Publisher:

Abstract:

We present the current status of the WASP project, a pair of wide angle photometric telescopes, individually called SuperWASP. SuperWASP-I is located in La Palma, and SuperWASP-II at Sutherland in South Africa. SW-I began operations in April 2004. SW-II is expected to be operational in early 2006. Each SuperWASP instrument consists of up to 8 individual cameras using ultra-wide field lenses backed by high-quality passively cooled CCDs. Each camera covers 7.8 x 7.8 sq degrees of sky, for nearly 500 sq degrees of total sky coverage. One of the current aims of the WASP project is the search for extra-solar planet transits with a focus on brighter stars in the magnitude range ~8 to 13. Additionally, WASP will search for optical transients, track Near-Earth Objects, and study many types of variable stars and extragalactic objects. The collaboration has developed a custom-built reduction pipeline that achieves better than 1 percent photometric precision. We discuss future goals, which include nightly on-mountain reductions that could be used to automatically drive alerts via a small robotic telescope network, and possible roles of the WASP telescopes as providers in such a network. Additional technical details of the telescopes, data reduction, and consortium members and institutions can be found on the web site at: http://www.superwasp.org/. © 2006 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

Relevance:

10.00%

Publisher:

Abstract:

High-speed field-programmable gate array (FPGA) implementations of an adaptive least mean square (LMS) filter, with application in an electronic support measures (ESM) digital receiver, are presented. They employ "fine-grained" pipelining, i.e., pipelining within the processor, which results in an increased output latency when used in the LMS recursive system. Therefore, the major challenge is to maintain a low-latency output whilst increasing the number of pipeline stages in the filter for higher speeds. Using the delayed LMS (DLMS) algorithm, fine-grained pipelined FPGA implementations using both the direct form (DF) and the transposed form (TF) are considered and compared. It is shown that the direct form LMS filter utilizes the FPGA resources more efficiently, thereby allowing a 120 MHz sampling rate.
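The delayed LMS idea mentioned in the abstract can be illustrated with a short software sketch: the coefficient update at time n uses the error and input vector from `delay` samples earlier, which is what lets pipeline registers be inserted in the feedback loop. This is a textbook-style floating-point model written for illustration, not the FPGA design from the paper; the function name `dlms_filter` and the parameters `num_taps`, `mu` and `delay` are assumptions.

```python
from collections import deque


def dlms_filter(x, d, num_taps=4, mu=0.05, delay=2):
    """Delayed LMS: w(n+1) = w(n) + mu * e(n - delay) * x_vec(n - delay).

    x: input samples, d: desired samples. Returns (final weights, errors).
    With delay=0 this reduces to the standard LMS update.
    """
    w = [0.0] * num_taps
    buf = [0.0] * num_taps   # tapped delay line, newest sample first
    pending = deque()        # (input vector, error) pairs awaiting use
    errors = []
    for xn, dn in zip(x, d):
        buf = [xn] + buf[:-1]                      # shift in the new sample
        y = sum(wi * xi for wi, xi in zip(w, buf))  # filter output
        e = dn - y
        errors.append(e)
        pending.append((buf, e))
        if len(pending) > delay:
            # Apply the update that was computed `delay` samples ago.
            old_x, old_e = pending.popleft()
            w = [wi + mu * old_e * xi for wi, xi in zip(w, old_x)]
    return w, errors
```

A larger `delay` models deeper pipelining in the update path; in general it requires a smaller step size `mu` to remain stable, which is the latency and convergence trade-off the abstract refers to.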

Relevance:

10.00%

Publisher:

Abstract:

The WASP project and the infrastructure supporting the SuperWASP Facility are described. As the instrument, reduction pipeline and archive system are now fully operative, we expect the system to have a major impact in the discovery of bright exo-planet candidates as well as in more general variable star projects.

Relevance:

10.00%

Publisher:

Abstract:

In this paper a novel scalable public-key processor architecture is presented that supports modular exponentiation and Elliptic Curve Cryptography over both prime GF(p) and binary GF(2) extension fields. This is achieved by a high performance instruction set that provides a comprehensive range of integer and polynomial basis field arithmetic. The instruction set and associated hardware are generic in nature and do not specifically support any cryptographic algorithms or protocols. Firmware within the device is used to efficiently implement complex and data intensive arithmetic. A firmware library has been developed in order to demonstrate support for numerous exponentiation and ECC approaches, such as different coordinate systems and integer recoding methods. The processor has been developed as a high-performance asymmetric cryptography platform in the form of a scalable Verilog RTL core. Various features of the processor may be scaled, such as the pipeline width and local memory subsystem, in order to suit area, speed and power requirements. The processor is evaluated and compares favourably with previous work in terms of performance while offering an unparalleled degree of flexibility. © 2006 IEEE.
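For readers unfamiliar with the modular exponentiation operation named in the abstract, a minimal sketch of the textbook left-to-right square-and-multiply method follows. It is not the processor's firmware or instruction set, and it ignores the recoding methods and side-channel concerns a real implementation would address; the function name `mod_exp` is hypothetical.

```python
def mod_exp(base, exponent, modulus):
    """Compute (base ** exponent) % modulus by left-to-right square-and-multiply."""
    if modulus <= 0:
        raise ValueError("modulus must be positive")
    if exponent < 0:
        raise ValueError("exponent must be non-negative")
    result = 1 % modulus          # handles modulus == 1
    base %= modulus
    # Scan the exponent bits from most significant to least significant.
    for bit in bin(exponent)[2:]:
        result = (result * result) % modulus    # square for every bit
        if bit == "1":
            result = (result * base) % modulus  # multiply when the bit is set
    return result
```

The result can be checked against Python's built-in three-argument `pow(base, exponent, modulus)`.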