As a Chapel user, I would like the Chapel compiler to generate vectorized code for most forall loops so that my programs can have competitive performance with other programming models.
Note that the LLVM LoopVectorizer could be taught about custom reductions too. It views reductions as a kind of recurrence, so RecurrenceDescriptor is what would need to be updated.
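To make "a kind of recurrence" concrete, here is a minimal C++ sketch of the two loop shapes involved. It is an illustration only, not Chapel compiler output: the function names (sum_reduce, combine_max_abs, max_abs_reduce) are hypothetical, and whether either loop is actually vectorized depends on the LLVM version and optimization flags.

```cpp
#include <cstddef>
#include <vector>

// Standard reduction: the accumulator `acc` is a loop-carried value
// (a recurrence) that is updated with a plain integer add.
long sum_reduce(const std::vector<long>& a) {
    long acc = 0;
    for (std::size_t i = 0; i < a.size(); ++i) {
        acc = acc + a[i];
    }
    return acc;
}

// Hypothetical user-defined combine step: keep the value with the
// larger magnitude. Ties keep the earlier value.
long combine_max_abs(long acc, long x) {
    long abs_x = x < 0 ? -x : x;
    long abs_acc = acc < 0 ? -acc : acc;
    return abs_x > abs_acc ? x : acc;
}

// Custom reduction: the loop has the same recurrence shape as above,
// but the update is the user-defined combine step rather than a
// simple arithmetic operation.
long max_abs_reduce(const std::vector<long>& a) {
    long acc = 0;
    for (std::size_t i = 0; i < a.size(); ++i) {
        acc = combine_max_abs(acc, a[i]);
    }
    return acc;
}
```

Both loops carry a single accumulator from one iteration to the next. The difference is only in the update operation, which is the part a recurrence analysis would have to recognize before the second loop could be treated like the first.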