Quuxplusone / LLVMBugzillaTest


vectors of i1 and vectors of x86 long double don't work #1870

Open Quuxplusone opened 16 years ago

Quuxplusone commented 16 years ago
Bugzilla Link PR1784
Status REOPENED
Importance P normal
Reported by Duncan Sands (baldrick@free.fr)
Reported on 2007-11-09 02:29:09 -0800
Last modified on 2021-09-12 19:47:18 -0700
Version trunk
Hardware Other Linux
CC anton@korobeynikov.info, cameron@cs.sfu.ca, emvv@mail.ru, hfinkel@anl.gov, leissa@cs.uni-saarland.de, lennart@augustsson.net, llvm-bugs@lists.llvm.org, llvm-dev@redking.me.uk, llvm@sunfishcode.online, meadori@gmail.com, michael.hliao@gmail.com, nadav.rotem@me.com, nobled@dreamwidth.org, paulsson@linux.vnet.ibm.com, preston.gurd@intel.com, stpworld@narod.ru, tpr.ll@botech.co.uk, yilong.guo@intel.com
Fixed by commit(s)
Attachments sizes.patch (3778 bytes, text/plain)
Blocks PR31265, PR3037, PR3352, PR7303
Blocked by
See also PR34405
I noticed that a lot of the vector code assumes that there is
no padding between vector elements.  For example, the size is
assumed to be the primitive size of the element times the number
of elements.  However for x86 long double the primitive size is
80 bits while elements will be spaced by 96/128 bits (depending
on the os) in order to maintain alignment.  There's a similar
problem for vectors of apints.
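For concreteness, the spacing arithmetic can be sketched in C (a hypothetical helper, not LLVM API): rounding the 80-bit primitive size up to the element alignment yields the 96/128-bit spacing mentioned above.

```c
#include <assert.h>

/* Round a bit count up to the next multiple of align_bits.  This models an
 * array-like layout, where each element starts on an alignment boundary. */
static unsigned align_up(unsigned bits, unsigned align_bits) {
    return (bits + align_bits - 1) / align_bits * align_bits;
}
```

With the 80-bit primitive size of x86 long double, `align_up(80, 32)` gives the 96-bit spacing typical of 32-bit Linux, and `align_up(80, 128)` the 128-bit spacing of 64-bit targets.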
Quuxplusone commented 15 years ago

Does gcc support vectors of long double? Maybe it would make sense to just reject all vectors with elements that would require intra-element padding?

Quuxplusone commented 15 years ago
Yes, it looks like gcc supports vectors of long double.  I compiled
the following with it on x86-32 and it compiled (I didn't check whether
the assembly is sensible, or how it lays out the vector).

typedef long double dvec __attribute__ ((__vector_size__ (24)));
void d(dvec *x);
int main() {
  dvec x, y, z;
  d(&x);
  d(&y);
  z = x + y;
  d(&z);
  return 0;
}

Also, we may want to support vectors of boolean as results
of vector comparisons.
Quuxplusone commented 15 years ago
To be more explicit about "There's a similar problem for vectors of apints",
consider a vector <n x i1>.  getPrimitiveSizeInBits reckons this has size
n bits, i.e. that it has been bitpacked.  Yet you can do GEP on them, and
I'm willing to bet that the GEP doesn't address individual bits!  Likewise,
vast parts of codegen assume that vectors are laid out like arrays, which
in this case means that each i1 is in its own byte (also incompatible with
what getPrimitiveSizeInBits returns).

Possible solutions:
(1) bitpack vectors.  This could be done but seems like a lot of work
for not much benefit.  Also, I think some targets support vector compare
results producing a "vector of booleans".  It would be nice if this mapped
in a convenient way to a vector of i1.
(2) lay vectors out like arrays.  So elements of a vector of long double
would be spaced by 12/16 bytes depending on the target.  Vectors of i1
would typically get 1 byte per i1.
(3) byte pack vectors.  Here vectors of long double would have elements
spaced by 10 bytes (unaligned!).  Vectors of i1 would have elements 1
byte apart.
My preference is for (2).  However if GEP is disallowed for vectors,
then other schemes become more feasible.

I think Chris needs to make a policy decision here.
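The element spacing implied by each of the three options can be sketched in C (hypothetical helper, not LLVM API; `prim_bits` is the primitive element size and `align_bits` its ABI alignment, both supplied by the caller):

```c
#include <assert.h>

enum layout { BITPACK, ARRAY_LIKE, BYTEPACK };

/* Bits between the starts of consecutive vector elements under each of
 * the three schemes discussed above. */
static unsigned stride_bits(enum layout l, unsigned prim_bits, unsigned align_bits) {
    switch (l) {
    case BITPACK:    /* (1) elements packed with no padding at all */
        return prim_bits;
    case ARRAY_LIKE: /* (2) elements padded out to their alignment */
        return (prim_bits + align_bits - 1) / align_bits * align_bits;
    case BYTEPACK:   /* (3) elements padded only to a byte boundary */
        return (prim_bits + 7) / 8 * 8;
    }
    return 0;
}
```

Under (2), long double elements are spaced 96 or 128 bits apart depending on alignment; under (3) they sit 80 bits (10 bytes, unaligned) apart, and an i1 occupies a full byte.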
Quuxplusone commented 15 years ago
PS: While LegalizeDAG will happily "codegen" vectors of i1
and long double, the result is often bogus.  This is why
I put asserts in LegalizeTypes in various spots where the
logic is wrong for such vectors.
Quuxplusone commented 15 years ago
Sorry, should have been getBitWidth rather than getPrimitiveSizeInBits:
it is VectorType::getBitWidth that returns the number of elements * the
element size in bits.  Note that codegen does the same for MVT::getSizeInBits
for a vector.  The result is that vectors of i1 are considered to be
bitpacked...
Quuxplusone commented 15 years ago

_Bug 3434 has been marked as a duplicate of this bug._

Quuxplusone commented 14 years ago

_Bug 6204 has been marked as a duplicate of this bug._

Quuxplusone commented 14 years ago

Not to mention vectors of i24, and other types that are not naturally aligned.

Quuxplusone commented 14 years ago

Attached sizes.patch (3778 bytes, text/plain): proposed patch

Quuxplusone commented 14 years ago

I committed the patch, plus several fixes to CodeGen to fix lowering of vectors with element padding, in r97064.

Quuxplusone commented 14 years ago

There are still several issues, see this email:

http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20100222/096825.html

Quuxplusone commented 14 years ago

The decision is: vectors will be bit-packed. GEP on vectors will be disallowed.

Quuxplusone commented 14 years ago

And for the record, r97064 was reverted, following the discussion in the email thread linked above.

Quuxplusone commented 13 years ago
I now think that vectors should be stored in memory in a target-dependent
manner,
which may or may not be the same as how arrays are stored in memory.  This means
fixing all the places that think you can get the size of an array without target
data, not to mention users of
  VectorType::getInteger
  VectorType::getExtendedElementVectorType
  VectorType::getTruncatedElementVectorType
  VectorType::getBitWidth
since these methods are all essentially assuming that vectors are bitpacked.

An additional issue is that bitcast of vectors becomes problematic and should
probably be disallowed.  The issue is that bitcast is only legal if the source
and destination types have the same bitwidth.  But the bitwidth of a vector will
now depend on the target, so the legality of the IR will depend on the target,
which is not currently the case.
Quuxplusone commented 13 years ago
I think it's a mistake to make bitcast be the operation which determines how
everything else must work.

I propose the first step to solving this problem is to redefine the bitcast
operator.  Instead of saying it is equivalent to a store and load in the new
type, I propose it be defined as:

 - A bitcast operator reproduces the bits of its operand in precedence order.

 - For the purposes of bitcasting, the precedence of a bit in a vector value is the
    precedence index of the bit in its element plus the element index of its
    vector element times the precision of the element type.

This is an intuitive definition, and it makes bitcast a fully target-
independent operation that doesn't care about endianness.  Bitcasting an
integer to a vector would always put the most significant bit in the same
place, which is not true under the store+load definition.

On little-endian targets, there's effectively no change.  On big-endian
targets, this would change the byte order of bitcasts between vectors and non-
vectors and between vectors with elements of different sizes.  If front-ends
desire the old behavior, they can emit explicit llvm.bswap operations.

If we accept this definition, then we can move on to consider what makes the
most sense for vectors without being artificially constrained.
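A sketch of the proposed rule in C (hypothetical helper names, not from the proposal itself): bit b of element e has precedence b + e * elem_bits, so a bitcast reproduces bits in precedence order regardless of target endianness.

```c
#include <assert.h>
#include <stdint.h>

/* Precedence index of bit `bit` of element `elem` in a vector whose
 * elements are `elem_bits` wide, per the proposed definition. */
static unsigned precedence(unsigned elem, unsigned bit, unsigned elem_bits) {
    return bit + elem * elem_bits;
}

/* Bitcast <4 x i8> -> i32 under the precedence rule: bit k of the result
 * is the bit with precedence k in the vector, independent of endianness. */
static uint32_t bitcast_v4i8_to_i32(const uint8_t v[4]) {
    uint32_t r = 0;
    for (unsigned e = 0; e < 4; ++e)
        for (unsigned b = 0; b < 8; ++b)
            r |= (uint32_t)((v[e] >> b) & 1) << precedence(e, b, 8);
    return r;
}
```

Element 0 always lands in the low-order bits of the result, which is what makes the operation target-independent.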
Quuxplusone commented 13 years ago
Hi Dan, your suggestion seems fairly reasonable to me.  While there, why not
extend bitcast to first class structs and arrays too?
Quuxplusone commented 13 years ago
(In reply to comment #16)
> While there, why not extend bitcast to first class structs and arrays too?

There's no reason why it would be easier now than later, it would take work to
implement, and it's not needed to fix the main bug here.  And because no one
should need it, imho, but we can discuss that elsewhere.
Quuxplusone commented 13 years ago
The steps needed to implement this are:
  - teach optimizer passes which convert store+load to bitcast to check endianness and either be
     conservative or insert bswap calls
  - teach front-ends to emit llvm.bswap calls around bitcasts on big-endian targets when appropriate
  - teach codegen to convert bswap calls around such bitcasts into no-ops.

The hardest part will be doing the codegen part for PowerPC.
Quuxplusone commented 13 years ago
There's another possibility for big-endian machines: on such a machine, when
storing a vector < 0, 1, 2, 3> store the components in reverse order: i.e. 3
first, 2 at next memory location etc.  Who says that the lowest numbered index
has to be stored first?  Anyway, the result is that bitcast is then still the
same as "store as one type, load out as another" on big-endian machines too.
More generally, when a vector operation refers to element i, turn that into the
processor operation on element num_elements - i.
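The reversed-storage idea can be sketched in C (hypothetical helpers, illustrating index arithmetic only, not actual codegen):

```c
#include <assert.h>

/* On a big-endian target, store logical element i of an n-element vector
 * at storage slot n - 1 - i, as suggested above. */
static unsigned storage_index(unsigned i, unsigned n) {
    return n - 1 - i;
}

/* Store a 4-element vector, reversing element order when big_endian. */
static void store_v4(int dst[4], const int src[4], int big_endian) {
    for (unsigned i = 0; i < 4; ++i)
        dst[big_endian ? storage_index(i, 4) : i] = src[i];
}
```

Storing <0, 1, 2, 3> this way on a big-endian machine puts 3 at the lowest address, so a "store as one type, load as another" bitcast sees the same bit layout as on a little-endian machine.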
Quuxplusone commented 12 years ago

_Bug 11460 has been marked as a duplicate of this bug._

Quuxplusone commented 12 years ago

_Bug 12650 has been marked as a duplicate of this bug._

Quuxplusone commented 11 years ago

_Bug 15131 has been marked as a duplicate of this bug._

Quuxplusone commented 11 years ago

Shall we make this a parent bug for more concrete bugs caused by the same issue but in different components? The issue raised in this bug is quite high-level and covers a lot of code/design. Some bugs have the same root cause, but the fixes would be quite different and probably land in different components. Making this a parent bug would make it easier to track all related/child bugs.

Quuxplusone commented 11 years ago

Making this a parent bug makes sense to me.

Quuxplusone commented 8 years ago
A strong use case for the bitpacked approach is Parabix software.   Parabix
is a framework for high-performance text processing using the concept of
parallel bit streams.   In Parabix, bytestreams <N x i8> are transformed to
parallel arrays of bit streams <N x i1>.   The streams are then processed 256
elements at a time using the 256-bit registers of AVX2, for example.

One significant application of Parabix is the icgrep search application,
which generally achieves GB/s regular expression search even with complex
regular expressions.

We have workarounds, but bitpacked vectors of i1 would be ideal.
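The transform described above can be sketched scalar-wise in C (a simplified illustration, not Parabix code; the real framework does this with SIMD):

```c
#include <assert.h>
#include <stdint.h>

/* Transpose a byte stream into one of its eight parallel bit streams:
 * bit i of the result is bit `b` of input byte i. */
static uint8_t bitstream8(const uint8_t bytes[8], unsigned b) {
    uint8_t out = 0;
    for (unsigned i = 0; i < 8; ++i)
        out |= (uint8_t)(((bytes[i] >> b) & 1) << i);
    return out;
}
```

Applying this for b = 0..7 turns an <8 x i8> bytestream into eight <8 x i1> streams, which is exactly the representation that would benefit from first-class bitpacked i1 vectors.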
Quuxplusone commented 6 years ago
I have just come across this because I am using the SPIR-V frontend
https://github.com/KhronosGroup/SPIRV-LLVM
and I have a case where it generates a GEP into a vector of i1 in memory.

A lot of code assumes that vector of i1 is packed (as already noted here), but
at least GVN and alias analysis cannot cope with the GEP because it is not to a
byte boundary.

Given that this issue has been hanging around for 10 years and does not seem to
have a resolution, should LLVM stop claiming to fully support vector of i1?
I.e. should the spec and the verify pass say that vector of i1 is illegal, or
say that vector of i1 is legal but you can't GEP into it, or something?
Quuxplusone commented 6 years ago

Work is still progressing, although at a glacial pace - Jonas Paulsson has done some work recently for instance.

Quuxplusone commented 6 years ago

Yes, I committed the handling for stores of vectors with non-byte-sized elements, see https://reviews.llvm.org/D42100.

I think Uli and Eli had an interesting discussion there on Jan 30, taking up this very topic that I think you are concerned about.

To me, it seems that although (I think) there is no formal statement anywhere, their discussion seems to indicate the path to follow. I am thinking about exactly how a vector should be stored in memory on big/little endian machines, including non-byte sized elements.

Perhaps the optimization passes you mention can be extended, or at least learn to reject cases they cannot handle?

Quuxplusone commented 6 years ago

I have been thinking about my GEP issue some more.

On my target (which has typical byte-addressable memory and 64 bit pointers), what does a GEP into a vector of i1 mean? If you allow it, it surely means that any pointer to i1 might (or might not) be the result of a GEP into vector of i1, and therefore needs to keep a bit offset as well as the byte address. My target certainly does not support that, and more generally I think LLVM assumes that a pointer to anything (with a particular address space) is always the same size, whatever type it points to.

So my frontend should not be generating GEP into vector of i1, because it is not representable on the target that it is targeting.

I guess that in itself does not make a GEP into vector of i1 illegal IR, as someone may want to implement a target where it does make sense. But it falls into a class of things that you can only use in IR if it makes sense on your target.

If someone did want to implement a target where it makes sense, that probably means bit addressable memory, which means that the assumptions in LLVM that a store size is an integral number of bytes would need to be fixed. But I guess this is unlikely to be needed.

Quuxplusone commented 6 years ago

Here are my thoughts: GEP calculates the pointer address based on the data layout of the specified type. The vector type is explicitly specified in the target's data layout and should not be assumed to be derived from the corresponding scalar type, e.g. <8 x i1> from i1. The other point, not stated very clearly in the LLVM language reference, is that alignment (specified in bits) is always a whole number of bytes. That constraint is only stated for the natural stack alignment, but we should assume it applies to all alignments. In other words, GEP only calculates pointer addresses in units of bytes.

Back to your specific issue: I agree that your frontend should not generate a GEP into a vector of i1. Instead, it should do this in two steps: a vector load followed by extraction of the corresponding bit.
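The two-step extraction can be sketched in C (a hypothetical helper illustrating the byte-addressed replacement for a GEP to a bit offset):

```c
#include <assert.h>
#include <stdint.h>

/* Extract element `i` of a bit-packed <N x i1> stored at `p`:
 * (1) load the byte containing the element, (2) shift and mask the bit. */
static int extract_i1(const uint8_t *p, unsigned i) {
    return (p[i / 8] >> (i % 8)) & 1;
}
```

Both steps use only byte addresses, so no pointer ever needs to carry a sub-byte offset.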