git.sesse.net Git - x264/commit
Vastly faster SATD/SA8D/Hadamard_AC/SSD/DCT/IDCT
author Holger Lubitz <holger@lubitz.org>
Sat, 7 Mar 2009 02:16:30 +0000 (18:16 -0800)
committer Fiona Glaser <fiona@x264.com>
Sat, 7 Mar 2009 03:07:56 +0000 (19:07 -0800)
commit 54e38917b413e80b474d3ed7ba344e7c489b020c
tree f3f43fe8c307d9849b53a3fc014419aaee9b42be
parent 7501d9505a10d17d8cc238fd87af6330d2c1804c
Heavily optimized for Core 2 and Nehalem, but performance should improve on all modern x86 CPUs.
16x16 SATD: +18% speed on K8 (64-bit), +22% on K10 (32-bit), +42% on Penryn (64-bit), +44% on Nehalem (64-bit), +50% on P4 (32-bit), +98% on Conroe (64-bit)
Similar performance boosts in SATD-like functions (SA8D, hadamard_ac) and somewhat less in DCT/IDCT/SSD.
Overall performance boost is up to ~15% on 64-bit Conroe.
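
For readers unfamiliar with the metric, SATD (sum of absolute transformed differences) can be sketched in scalar C as below. This is only an illustrative reference, not the code from this commit: the function name satd_4x4_ref, the 4x4 block size, and the lack of any normalization of the result are assumptions made for the example. The commit's actual speedups come from hand-written x86 assembly.

```c
#include <stdint.h>
#include <stdlib.h>

/* Hypothetical scalar reference (not x264's implementation):
 * 4x4 SATD = sum of absolute values of the 2-D Hadamard transform
 * of the pixel difference block. The result is left unnormalized. */
static int satd_4x4_ref(const uint8_t *a, int stride_a,
                        const uint8_t *b, int stride_b)
{
    int d[4][4], t[4][4];
    int sum = 0;

    /* Difference between the two 4x4 blocks. */
    for (int y = 0; y < 4; y++)
        for (int x = 0; x < 4; x++)
            d[y][x] = a[y * stride_a + x] - b[y * stride_b + x];

    /* Horizontal 4-point Hadamard transform (two butterfly stages). */
    for (int y = 0; y < 4; y++) {
        int s01 = d[y][0] + d[y][1], d01 = d[y][0] - d[y][1];
        int s23 = d[y][2] + d[y][3], d23 = d[y][2] - d[y][3];
        t[y][0] = s01 + s23;
        t[y][1] = s01 - s23;
        t[y][2] = d01 + d23;
        t[y][3] = d01 - d23;
    }

    /* Vertical 4-point Hadamard transform, accumulating absolute values. */
    for (int x = 0; x < 4; x++) {
        int s01 = t[0][x] + t[1][x], d01 = t[0][x] - t[1][x];
        int s23 = t[2][x] + t[3][x], d23 = t[2][x] - t[3][x];
        sum += abs(s01 + s23) + abs(s01 - s23)
             + abs(d01 + d23) + abs(d01 - d23);
    }
    return sum;
}
```

A 16x16 SATD of the kind benchmarked above could then be formed by summing such a function over the sixteen 4x4 sub-blocks. An optimized version would typically process several rows at once in SIMD registers instead.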
common/dct.c
common/pixel.c
common/x86/dct-32.asm
common/x86/dct-64.asm
common/x86/dct-a.asm
common/x86/dct.h
common/x86/pixel-32.asm
common/x86/pixel-a.asm
common/x86/pixel.h
common/x86/x86util.asm
tools/checkasm.c