Digital Library

129 results for intel processor

QRD and SVD processor design based on an approximate rotations algorithm

Relevance: 20.00%

Abstract:

A silicon implementation of the Approximate Rotations algorithm capable of carrying the computational load of algorithms such as QRD and SVD, within the real-time realisation of applications such as Adaptive Beamforming, is described. A modification to the original Approximate Rotations algorithm to simplify the method of optimal angle selection is proposed. Analysis shows that fewer iterations of the Approximate Rotations algorithm are required compared with the conventional CORDIC algorithm to achieve similar degrees of accuracy. The silicon design studies undertaken provide direct practical evidence of superior performance with the Approximate Rotations algorithm, requiring approximately 40% of the total computation time of the conventional CORDIC algorithm, for a similar silicon area cost. © 2004 IEEE.

Generic scheduling methods for a linear QR array SoC processor

Relevance: 20.00%

Abstract:

A scheduling method for implementing a generic linear QR array processor architecture is presented. This improves on previous work. It also considerably simplifies the derivation of schedules for a folded linear system, where detailed account has to be taken of processor cell latency. The architecture and scheduling derived provide the basis of a generator for the rapid design of System-on-a-Chip (SoC) cores for QR decomposition.

Achieving Multiprogramming Scalability of Parallel Programs on Intel SMP Platforms: Nanothreading in the Linux Kernel

Relevance: 20.00%

PACMAN: A Performance Counters Manager for Intel Hyperthreaded Processors

Relevance: 20.00%

Facing the Challenges of Multicore Processor Technologies using Autonomic System Software: Keynote Talk

Relevance: 20.00%

Unified Scheduling of Polymorphic Parallelism on the Cell Processor: Abstracts of the 2008 SIAM Conference on Parallel Processing for Scientific Computing, Miniworkshop on the Cell Processor (SIAM PP)

Relevance: 20.00%

Tagged Procedure Calls (TPC): Efficient Runtime Support for Task-Based Parallelism on the Cell Processor

Relevance: 20.00%

IPPro: FPGA based Image Processing Processor

Relevance: 20.00%

Abstract:

The paper presents IPPro, a high-performance, scalable soft-core processor targeted at image processing applications. It is based on the Xilinx DSP48E1 architecture using the Zynq Field Programmable Gate Array and is a scalar 16-bit RISC processor that operates at 526 MHz, giving 526 MIPS of performance. Each IPPro core uses 1 DSP48, 1 Block RAM and 330 Kintex-7 slice registers, making the processor as compact as possible whilst maintaining flexibility and programmability. A key aspect of the approach is reducing the application design time and implementation effort by using multiple IPPro processors in SIMD mode. For different applications, this allows us to exploit different levels of parallelism and mapping for the specified processing architecture with the supported instruction set. In this context, a Traffic Sign Recognition (TSR) algorithm has been prototyped on a Zedboard with the colour and morphology operations accelerated using multiple IPPros. Simulation and experimental results demonstrate that the processing platform achieves speedups of 15 and 33 times for colour filtering and morphology operations respectively, with reduced design effort and time.

THE DESIGN OF A VLSI ARRAY PROCESSOR CHIP FOR COMPUTING THE BASIC ARITHMETIC OPERATIONS

Relevance: 20.00%

A VLSI PROCESSOR FOR HIGH-PERFORMANCE ARITHMETIC COMPUTATIONS

Relevance: 20.00%

Low-power synthesis flow for regular processor design

Relevance: 20.00%

Histogram of oriented gradients front end processing: An FPGA based processor approach

Relevance: 20.00%

Abstract:

The Field Programmable Gate Array (FPGA) implementation of the commonly used Histogram of Oriented Gradients (HOG) algorithm is explored. The HOG algorithm is employed to extract features for object detection. A key focus has been to explore the use of a new FPGA-based processor targeted at image processing. The paper gives details of the mapping and scheduling factors that influence performance and of the stages undertaken to allow the algorithm to be deployed on FPGA hardware, whilst taking into account the specific IPPro architecture features. We show that multi-core IPPro performance can exceed that of state-of-the-art FPGA designs by up to 3.2 times, with reduced design and implementation effort and increased flexibility, all on a low-cost Zynq programmable system.

Power and Energy Implications of the Number of Threads Used on the Intel Xeon Phi

Relevance: 20.00%

Abstract:

Energy consumption has become an important area of research of late. With the advent of new manycore processors, situations have arisen where not all the processors need to be active to reach an optimal relation between performance and energy usage. In this paper a study of the power and energy usage of a series of benchmarks, the PARSEC and the SPLASH-2X Benchmark Suites, on the Intel Xeon Phi for different thread configurations, is presented. To carry out this study, a tool was designed to monitor and record the power usage in real time during execution time and afterwards to compare the r

A floating point CORDIC based SVD processor

Relevance: 20.00%

Abstract:

An SVD processor system is presented in which each processing element is implemented using a simple CORDIC unit. The internal recursive loop within the CORDIC module is exploited, with pipelining being used to multiplex the two independent micro-rotations onto a single CORDIC processor. This leads to a high performance and efficient hardware architecture. In addition, a novel method for scale factor correction is presented which only need be applied once at the end of the computation. This also reduces the computation time. The net result is an SVD architecture based on a conventional CORDIC approach, which combines high performance with high silicon area efficiency.
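
For readers unfamiliar with the conventional CORDIC approach that this abstract builds on, the following is a minimal floating-point sketch of a CORDIC rotation. It is an illustration only, not the architecture described in the paper: the function name `cordic_rotate` and the `iterations` parameter are assumptions made for this example, and the paper's own scale factor correction method is not reproduced. The sketch shows only the general idea that the accumulated gain of the micro-rotations can be divided out once, after the final iteration.

```python
import math


def cordic_rotate(x, y, angle, iterations=24):
    """Rotate the vector (x, y) by `angle` radians with CORDIC micro-rotations.

    Hypothetical helper for illustration. Converges for |angle| up to
    roughly 1.74 rad (the sum of the elementary angles).
    """
    # Elementary rotation angles atan(2^-i), one per micro-rotation.
    angles = [math.atan(2.0 ** -i) for i in range(iterations)]

    # Each micro-rotation stretches the vector by sqrt(1 + 2^-2i);
    # accumulate the total gain so it can be removed once at the end.
    gain = 1.0
    for i in range(iterations):
        gain *= math.sqrt(1.0 + 2.0 ** (-2 * i))

    z = angle  # residual angle still to be rotated through
    for i in range(iterations):
        d = 1.0 if z >= 0 else -1.0  # rotate towards zero residual
        # In hardware the multiplications by 2^-i are plain shifts.
        x, y = x - d * y * 2.0 ** -i, y + d * x * 2.0 ** -i
        z -= d * angles[i]

    # Scale factor correction applied once, after all micro-rotations.
    return x / gain, y / gain
```

Calling `cordic_rotate(1.0, 0.0, math.pi / 6)` should return a pair close to `(cos 30°, sin 30°)`. A fixed-point hardware implementation would replace the floating-point arithmetic with shifts and adds and store the elementary angles in a small lookup table.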
