Listing 1. Loop from a Program with Cache-Unfriendly Behavior for (j = 0; j < COLS; j++) for (i = 0; i < ROWS; i++) a[i][j] = b[i][j] + c[i]