Re: How many vector registers are useful?

hyatt@cis.uab.edu (Robert Hyatt)
Wed, 27 Jan 1993 01:57:12 GMT

          From comp.compilers

Related articles
How many vector registers are useful? kirchner@uklira.informatik.uni-kl.de (1993-01-25)
Re: How many vector registers are useful? grunwald@tile.cs.colorado.edu (1993-01-25)
Re: How many vector registers are useful? pmontgom@math.orst.edu (1993-01-26)
Re: How many vector registers are useful? jlg@cochiti.lanl.gov (1993-01-26)
Re: How many vector registers are useful? hyatt@cis.uab.edu (1993-01-27)
Re: How many vector registers are useful? jrbd@craycos.com (1993-01-27)
Re: How many vector registers are useful? hrubin@pop.stat.purdue.edu (1993-01-28)
Re: How many vector registers are useful? sanjay@equalizer.cray.com (1993-01-29)
Re: How many vector registers are useful? shubu@cs.wisc.edu (1993-01-30)
Vector registers moliere!pmk@uunet.UU.NET (1993-01-30)
Re: How many vector registers are useful? kurz@math.uni-frankfurt.de (1993-02-01)
| List of all articles for this month |

Newsgroups: comp.sys.super,comp.arch,comp.compilers
From: hyatt@cis.uab.edu (Robert Hyatt)
Organization: University of Alabama at Birmingham
Date: Wed, 27 Jan 1993 01:57:12 GMT
References: 93-01-174 93-01-195
Keywords: architecture, vector

In article 93-01-195 jlg@cochiti.lanl.gov (J. Giles) writes:
>Vectors and scalars are not really all that different. Unless you have
>overlap between the vectors they really behave just like large scalars.


Good point. If you program in assembler on a Cray, you also use registers
to pre-fetch memory operands before you need 'em. In this regard, re-use
makes things even better, but even if you only do the equivalent of i++ in
C, you can win big if you don't run out of registers.... ie, load i n
clocks before it is needed, then s1 s1+1, then store it and don't use s1
for n clocks. the result is a three clock i++, as fast as it can be done
on a Cray. (n varies by cpu type.... and by memory type. can at least
span the interval 14 clocks to 24 clocks and even up to 50 clocks on some
Cray-2 processors I have used.)


Same holds true for vectors. You don't necessarily have to have high
reuse to win, just enough registers to keep data flowing (a) from memory
(b) to memory and (c) between functional units....
--
!Robert Hyatt Computer and Information Sciences !
!hyatt@cis.uab.edu University of Alabama at Birmingham !
--


Post a followup to this message

Return to the comp.compilers page.
Search the comp.compilers archives again.