Biblioteca Digital

949 resultados para Parallel programming (computer)

Lock-free Parallel Dynamic Programming

Relevância:

40.00% 40.00%

Publicador:

Resumo:

We show a method for parallelizing top down dynamic programs in a straightforward way by a careful choice of a lock-free shared hash table implementation and randomization of the order in which the dynamic program computes its subproblems. This generic approach is applied to dynamic programs for knapsack, shortest paths, and RNA structure alignment, as well as to a state-of-the-art solution for minimizing the m��ximum number of open stacks. Experimental results are provided on three different modern multicore architectures which show that this parallelization is effective and reasonably scalable. In particular, we obtain over 10 times speedup for 32 threads on the open stacks problem.

IDRA (IDeal Resource Allocation): Computing ideal speedups in parallel logic programming

Relevância:

40.00% 40.00%

Publicador:

Resumo:

We present a technique to estimate accurate speedups for parallel logic programs with relative independence from characteristics of a given implementation or underlying parallel hardware. The proposed technique is based on gathering accurate data describing one execution at run-time, which is fed to a simulator. Alternative schedulings are then simulated and estimates computed for the corresponding speedups. A tool implementing the aforementioned techniques is presented, and its predictions are compared to the performance of real systems, showing good correlation.

Using attributed variables in the implementation of concurrent and parallel logic programming systems

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Incorporating the possibility of attaching attributes to variables in a logic programming system has been shown to allow the addition of general constraint solving capabilities to it. This approach is very attractive in that by adding a few primitives any logic programming system can be turned into a generic constraint logic programming system in which constraint solving can be user de��ned, and at source level - an extreme example of the "glass box" approach. In this paper we propose a different and novel use for the concept of attributed variables: developing a generic parallel/concurrent (constraint) logic programming system, using the same "glass box" flavor. We arg��e that a system which implements attributed variables and a few additional primitives can be easily customized at source level to implement many of the languages and execution models of parallelism and concurrency currently proposed, in both shared memory and distributed systems. We illustrate this through examples and report on an implementation of our ideas.

B-LOG: A branch and bound methodology for the parallel execution of logic programs

Relevância:

40.00% 40.00%

Publicador:

Resumo:

We propose a computational methodology -"B-LOG"-, which offers the potential for an effective implementation of Logic Programming in a parallel computer. We also propose a weighting scheme to guide the search process through the graph and we apply the concepts of parallel "branch and bound" algorithms in order to perform a "best-first" search using an information theoretic bound. The concept of "session" is used to speed up the search process in a succession of similar queries. Within a session, we strongly modify the bounds in a local database, while bounds kept in a global database are weakly modified to provide a better initial condition for other sessions. We also propose an implementation scheme based on a database machine using "semantic paging", and the "B-LOG processor" based on a scoreboard driven controller.

On the uses of attributed variables in parallel and concurrent logic programming systems

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Incorporating the possibility of attaching attributes to variables in a logic programming system has been shown to allow the addition of general constraint solving capabilities to it. This approach is very attractive in that by adding a few primitives any logic programming system can be turned into a generic constraint logic programming system in which constraint solving can be user defined, and at source level - an extreme example of the "glass box" approach. In this paper we propose a different and novel use for the concept of attributed variables: developing a generic parallel/concurrent (constraint) logic programming system, using the same "glass box" flavor. We arg��e that a system which implements attributed variables and a few additional primitives can be easily customized at source level to implement many of the languages and execution models of parallelism and concurrency currently proposed, in both shared memory and distributed systems. We illustrate this through examples.

A proposal for enhancing the motivation in students of computer programming

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Computer programming is known to be one of the most difficult courses for students in the first year of engineering. They are faced with the challenge of abstract thinking and gaining programming skills for the first time. These skills are acquired by continuous practicing from the start of the course. In order to enhance the motivation and dynamism of the learning and assessment processes, we have proposed the use of three educational resources namely screencasts, self-assessment questionnaires and automated grading of assignments. These resources have been made available in Moodle which is a Learning Management System widely used in education environments and adopted by the Telecommunications Engineering School at the Universidad Polit��cnica de Madrid (UPM). Both teachers and students can enhance the learning and assessment processes through the use of new educational activities such as self-assessment questionnaires and automated grading of assignments. On the other hand, multimedia resources such as screencasts can guide students in complex topics. The resources proposed allow teachers to improve their tutorial actions since they provide immediate feedback and comments to students without the enormous effort of manual correction and evaluation by teachers specially taking into account the large number of students enrolled in the course. In this paper we present the case study where three proposed educational resources were applied. We describe the special features of the course and explain why the use of these resources can both enhance the students? motivation and improve the teaching and learning processes. Our research work was carried out on students attending the "Computer programming" course offered in the first year of a Telecommunications Engineering degree at UPM. This course is mandatory and has more than 450 enrolled students. Our purpose is to encourage the motivation and dynamism of the learning and assessment processes.

Parallel Computer Vision Algorithms for Graphics Processing Units

Relevância:

40.00% 40.00%

Publicador:

Resumo:

La evoluci��n de los tel��fonos m��viles inteligentes, dotados de c��maras digitales, est�� provocando una creciente demanda de aplicaciones cada vez m��s complejas que necesitan algoritmos de visi��n artificial en tiempo real; puesto que el tama��o de las se��ales de v��deo no hace sino aumentar y en cambio el rendimiento de los procesadores de un solo n��cleo se ha estancado, los nuevos algoritmos que se dise��en para visi��n artificial han de ser paralelos para poder ejecutarse en m��ltiples procesadores y ser computacionalmente escalables. Una de las clases de procesadores m��s interesantes en la actualidad se encuentra en las tarjetas gr��ficas (GPU), que son dispositivos que ofrecen un alto grado de paralelismo, un excelente rendimiento num��rico y una creciente versatilidad, lo que los hace interesantes para llevar a cabo computaci��n cient��fica. En esta tesis se exploran dos aplicaciones de visi��n artificial que revisten una gran complejidad computacional y no pueden ser ejecutadas en tiempo real empleando procesadores tradicionales. En cambio, como se demuestra en esta tesis, la paralelizaci��n de las distintas subtareas y su implementaci��n sobre una GPU arrojan los resultados deseados de ejecuci��n con tasas de refresco interactivas. Asimismo, se propone una t��cnica para la evaluaci��n r��pida de funciones de complejidad arbitraria especialmente indicada para su uso en una GPU. En primer lugar se estudia la aplicaci��n de t��cnicas de s��ntesis de im��genes virtuales a partir de ��nicamente dos c��maras lejanas y no paralelas��en contraste con la configuraci��n habitual en TV 3D de c��maras cercanas y paralelas��con informaci��n de color y profundidad. Empleando filtros de mediana modificados para la elaboraci��n de un mapa de profundidad virtual y proyecciones inversas, se comprueba que estas t��cnicas son adecuadas para una libre elecci��n del punto de vista. Adem��s, se demuestra que la codificaci��n de la informaci��n de profundidad con respecto a un sistema de referencia global es sumamente perjudicial y deber��a ser evitada. Por otro lado se propone un sistema de detecci��n de objetos m��viles basado en t��cnicas de estimaci��n de densidad con funciones locales. Este tipo de t��cnicas es muy adecuada para el modelado de escenas complejas con fondos multimodales, pero ha recibido poco uso debido a su gran complejidad computacional. El sistema propuesto, implementado en tiempo real sobre una GPU, incluye propuestas para la estimaci��n din��mica de los anchos de banda de las funciones locales, actualizaci��n selectiva del modelo de fondo, actualizaci��n de la posici��n de las muestras de referencia del modelo de primer plano empleando un filtro de part��culas multirregi��n y selecci��n autom��tica de regiones de inter��s para reducir el coste computacional. Los resultados, evaluados sobre diversas bases de datos y comparados con otros algoritmos del estado del arte, demuestran la gran versatilidad y calidad de la propuesta. Finalmente se propone un m��todo para la aproximaci��n de funciones arbitrarias empleando funciones continuas lineales a tramos, especialmente indicada para su implementaci��n en una GPU mediante el uso de las unidades de filtraje de texturas, normalmente no utilizadas para c��mputo num��rico. La propuesta incluye un riguroso an��lisis matem��tico del error cometido en la aproximaci��n en funci��n del n��mero de muestras empleadas, as�� como un m��todo para la obtenci��n de una partici��n cuasi��ptima del dominio de la funci��n para minimizar el error. ABSTRACT The evolution of smartphones, all equipped with digital cameras, is driving a growing demand for ever more complex applications that need to rely on real-time computer vision algorithms. However, video signals are only increasing in size, whereas the performance of single-core processors has somewhat stagnated in the past few years. Consequently, new computer vision algorithms will need to be parallel to run on multiple processors and be computationally scalable. One of the most promising classes of processors nowadays can be found in graphics processing units (GPU). These are devices offering a high parallelism degree, excellent numerical performance and increasing versatility, which makes them interesting to run scientific computations. In this thesis, we explore two computer vision applications with a high computational complexity that precludes them from running in real time on traditional uniprocessors. However, we show that by parallelizing subtasks and implementing them on a GPU, both applications attain their goals of running at interactive frame rates. In addition, we propose a technique for fast evaluation of arbitrarily complex functions, specially designed for GPU implementation. First, we explore the application of depth-image��based rendering techniques to the unusual configuration of two convergent, wide baseline cameras, in contrast to the usual configuration used in 3D TV, which are narrow baseline, parallel cameras. By using a backward mapping approach with a depth inpainting scheme based on median filters, we show that these techniques are adequate for free viewpoint video applications. In addition, we show that referring depth information to a global reference system is ill-advised and should be avoided. Then, we propose a background subtraction system based on kernel density estimation techniques. These techniques are very adequate for modelling complex scenes featuring multimodal backgrounds, but have not been so popular due to their huge computational and memory complexity. The proposed system, implemented in real time on a GPU, features novel proposals for dynamic kernel bandwidth estimation for the background model, selective update of the background model, update of the position of reference samples of the foreground model using a multi-region particle filter, and automatic selection of regions of interest to reduce computational cost. The results, evaluated on several databases and compared to other state-of-the-art algorithms, demonstrate the high quality and versatility of our proposal. Finally, we propose a general method for the approximation of arbitrarily complex functions using continuous piecewise linear functions, specially formulated for GPU implementation by leveraging their texture filtering units, normally unused for numerical computation. Our proposal features a rigorous mathematical analysis of the approximation error in function of the number of samples, as well as a method to obtain a suboptimal partition of the domain of the function to minimize approximation error.

An hybrid parallel algorithm for solving tridiagonal linear systems versus the Wang��s method in a Cray T3D BSP computer

Relevância:

40.00% 40.00%

Publicador:

Resumo:

In this paper we describe an hybrid algorithm for an even number of processors based on an algorithm for two processors and the Overlapping Partition Method for tridiagonal systems. Moreover, we compare this hybrid method with the Partition Wang��s method in a BSP computer. Finally, we compare the theoretical computation cost of both methods for a Cray T3D computer, using the cost model that BSP model provides.

A semi-virtual memory multi-programming system for the mini-computer PDP 8/3 (ILLIAC-3) /

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Thesis (M.S.)--University of Illinois at Urbana-Champaign.

Compiling serial languages for parallel machines /

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Thesis (M. S.)--University of Illinois at Urbana-Champaign.

Techniques for parallel computer design /

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Vita.

Solving triangular systems on a parallel computer /

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Includes bibliographical references.

Weather analysis on a parallel computer /

Relevância:

40.00% 40.00%

Publicador:

Resumo:

"December 1, 1969."

Analyzing smooth flowcharts : teaching structured programming in a computer-based education environment /

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Vita.

A nonlinear programming algorithm for an array computer /

Relevância:

40.00% 40.00%

Publicador:

Resumo:

"October 22, 1969."

«
1
2
...
5
6
7
8
9
10
11
...
63
64
»