Add data padding

The trip count %trip_count_wo_peel% remaining after the compiler generates the peeled loop is not a multiple of vector length %vl_x_uf% (vector length * unroll factor). To fix: Do one of the following:

Read More