commit | 0f563e5fadbccb10fabd6ac80c256a4321401e22 | [log] [tgz] |
---|---|---|
author | Jonathan Wright <jonathan.wright@arm.com> | Fri May 07 12:25:51 2021 |
committer | Jonathan Wright <jonathan.wright@arm.com> | Sun May 09 19:12:48 2021 |
tree | 8a888aa303277152fbe0fd8f5eb5c6c07722addd | |
parent | f7364c05748b70a1e0fd57849665a9d9f0990803 [diff] |
Optimize Neon reductions in sum_neon.h using ADDV instruction Use the AArch64-only ADDV and ADDLV instructions to accelerate reductions that add across a Neon vector in sum_neon.h. This commit also refactors the inline functions to return a scalar instead of a vector - allowing for optimization of the surrounding code at each call site. Bug: b/181236880 Change-Id: Ieed2a2dd3c74f8a52957bf404141ffc044bd5d79