FFmpeg/libavcodec
Andreas Rheinhardt e1297f3080 avcodec/x86/h264_idct: Use tmp reg in SUMSUB_BA if possible
This allows a paddw to be replaced by a movdqa.
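
The trade can be illustrated with a scalar C model of a single 16-bit lane. This is only a sketch: it assumes SUMSUB_BA leaves the sum in its first operand and the difference (second minus first) in its second, and that the variant without a temporary register is an add/double/subtract sequence. The function names are hypothetical, and the instruction sequences in the comments are assumptions about the macro's shape, not a copy of the FFmpeg source.

```c
#include <stdint.h>

/* Hypothetical per-lane model; not FFmpeg code. */

/* Assumed variant without a temporary: two adds and one subtract. */
static void sumsub_ba_no_tmp(uint16_t *a, uint16_t *b)
{
    *a = (uint16_t)(*a + *b);   /* paddw a, b : a = a + b             */
    *b = (uint16_t)(*b + *b);   /* paddw b, b : b = 2b                */
    *b = (uint16_t)(*b - *a);   /* psubw b, a : b = 2b - (a + b)      */
}

/* Assumed variant with a temporary: the doubling add becomes a copy. */
static void sumsub_ba_tmp(uint16_t *a, uint16_t *b)
{
    uint16_t t = *a;            /* movdqa t, a : save the old a       */
    *a = (uint16_t)(*a + *b);   /* paddw  a, b : a = a + b            */
    *b = (uint16_t)(*b - t);    /* psubw  b, t : b = b - old a        */
}

/* Returns 1 if both variants agree on a strided sample of inputs,
 * including cases where the 16-bit arithmetic wraps around. */
static int sumsub_variants_agree(void)
{
    for (unsigned a = 0; a <= 0xFFFF; a += 251) {
        for (unsigned b = 0; b <= 0xFFFF; b += 241) {
            uint16_t a1 = (uint16_t)a, b1 = (uint16_t)b;
            uint16_t a2 = (uint16_t)a, b2 = (uint16_t)b;
            sumsub_ba_no_tmp(&a1, &b1);
            sumsub_ba_tmp(&a2, &b2);
            if (a1 != a2 || b1 != b2)
                return 0;
        }
    }
    return 1;
}
```

Both variants produce the same wrapped results because 2b - (a + b) and b - a are equal modulo 2^16; the second form merely needs a spare register for the copy, which is why the change applies only "if possible".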

Old benchmarks:
idct8_add4_8bpp_c:                                     664.6 ( 1.00x)
idct8_add4_8bpp_sse2:                                  142.2 ( 4.67x)
idct8_add_8bpp_c:                                      215.5 ( 1.00x)
idct8_add_8bpp_sse2:                                    35.1 ( 6.14x)

New benchmarks:
idct8_add4_8bpp_c:                                     666.9 ( 1.00x)
idct8_add4_8bpp_sse2:                                  135.3 ( 4.93x)
idct8_add_8bpp_c:                                      217.7 ( 1.00x)
idct8_add_8bpp_sse2:                                    34.0 ( 6.41x)

Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>
2026-04-06 11:28:49 +02:00