Biblioteca Digital

Toward Effective Scalar Hardware for Highly Vectorizable Applications

**Autoria(s):** S, Vajapeyam; WC, Hsu
Data(s)	01/11/1993
Resumo	The performance of a program will ultimately be limited by its serial (scalar) portion, as pointed out by Amdahl′s Law. Reported studies thus far of instruction-level parallelism have mixed data-parallel program portions with scalar program portions, often leading to contradictory and controversial results. We report an instruction-level behavioral characterization of scalar code containing minimal data-parallelism, extracted from highly vectorized programs of the PERFECT benchmark suite running on a Cray Y-MP system. We classify scalar basic blocks according to their instruction mix, characterize the data dependencies seen in each class, and, as a first step, measure the maximum intrablock instruction-level parallelism available. We observe skewed rather than balanced instruction distributions in scalar code and in individual basic block classes of scalar code; nonuniform distribution of parallelism across instruction classes; and, as expected, limited available intrablock parallelism. We identify frequently occurring data-dependence patterns and discuss new instructions to reduce latency. Toward effective scalar hardware, we study latency-pipelining trade-offs and restricted multiple instruction issue mechanisms.
Formato	application/pdf
Identificador	http://eprints.iisc.ernet.in/35955/1/Toward.pdf S, Vajapeyam and WC, Hsu (1993) Toward Effective Scalar Hardware for Highly Vectorizable Applications. In: Journal of Parallel and Distributed Computing, 19 (3). 147-162 .
Publicador	Elsevier Science
Relação	http://dx.doi.org/10.1006/jpdc.1993.1101 http://eprints.iisc.ernet.in/35955/
Palavras-Chave	#Computer Science & Automation (Formerly, School of Automation) #Supercomputer Education & Research Centre
Tipo	Journal Article PeerReviewed

Acesso ao item digital