4808d831dbc4e9ff83fa0efe11207bc135c6d6f5 - webm/libvpx

commit	4808d831dbc4e9ff83fa0efe11207bc135c6d6f5	[log] [tgz]
author	Jonathan Wright <jonathan.wright@arm.com>	Wed May 12 15:05:56 2021
committer	James Zern <jzern@google.com>	Thu May 13 22:41:15 2021
tree	b50075badb21b350dcd9ab058304e8eca6a95ef2
parent	231aa6ae32fca53efc45ffd39e14650346fcb030 [diff]

Optimize remaining mse and sse functions in variance_neon.c

Implement sum of squared difference calculations in vpx_mse16x16_neon
and vpx_get4x4sse_cs_neon using the ABD and UDOT instructions -
instead of widening subtracts followed by a sequence of MLAs.

The existing implementation is retained for use on CPUs that do not
implement the Armv8.4-A UDOT instruction. This commit also updates
the variable names used in the existing implementations to be more
descriptive.

Bug: b/181236880
Change-Id: Id4ad8ea7c808af1ac9bb5f1b63327ab487e4b1c7

vpx_dsp/arm/variance_neon.c[diff]

1 file changed