FFmpeg/libswscale at 2517c328fc314c6ea81516499fa7138e395929de - FFmpeg - Sk Repository

github-mirrors/FFmpeg

mirror of https://mirror.skon.top/https://github.com/FFmpeg/FFmpeg synced 2026-04-20 21:00:41 +08:00

Files

History

Ramiro Polla 2517c328fc swscale/aarch64: add NEON sws_ops backend

This commit pieces together the previous few commits to implement the
NEON backend for sws_ops.

In essence, a tool which runs on the target (sws_ops_aarch64) is used
to enumerate all the functions that the backend needs to implement. The
list it generates is stored in the repository (ops_entries.c).

The list from above is used at build time by a code generator tool
(ops_asmgen) to implement all the sws_ops functions the NEON backend
supports, and generate a lookup function in C to retrieve the assembly
function pointers.

At runtime, the NEON backend fetches the function pointers to the
assembly functions and chains them together in a continuation-passing
style design, similar to the x86 backend.

The following speedup is observed from legacy swscale to NEON:
A520: Overall speedup=3.780x faster, min=0.137x max=91.928x
A720: Overall speedup=4.129x faster, min=0.234x max=92.424x

And the following from the C sws_ops implementation to NEON:
A520: Overall speedup=5.513x faster, min=0.927x max=14.169x
A720: Overall speedup=4.786x faster, min=0.585x max=20.157x

The slowdowns from legacy to NEON are the same for C/x86. Mostly low
bit-depth conversions that did not perform dithering in legacy.

The 0.585x outlier from C to NEON is gbrpf32le -> gbrapf32le, which is
mostly memcpy with the C implementation. All other conversions are
better.

Sponsored-by: Sovereign Tech Fund
Signed-off-by: Ramiro Polla <ramiro.polla@gmail.com>

2026-03-30 11:38:35 +00:00

..

swscale/aarch64: add NEON sws_ops backend

2026-03-30 11:38:35 +00:00

all: fix typos found by codespell

2025-08-03 13:48:47 +02:00

swscale/loongarch: fix LASX YUV2RGB residual for multi-row slices

2026-03-02 13:14:07 +00:00

swscale/swscale_internal: Move altivec parts to ppc/

2026-02-28 09:56:01 +01:00

swscale/range_convert: saturate output instead of limiting input

2024-12-05 21:10:29 +01:00

swscale/aarch64: introduce tool to enumerate sws_ops for NEON backend

2026-03-30 11:38:35 +00:00

swscale/ops: simplify SwsOpList.order_src/dst

2026-03-29 09:39:09 +00:00

swscale/ops_chain: simplify ff_sws_compile_op_tables() with int index

2026-03-29 12:13:40 +02:00

alphablend.c

swscale/alphablend: don't overread alpha plane on subsampled odd size

2025-07-31 11:32:20 +00:00

bayer_template.c

swscale/internal: constify SwsFunc

2024-10-07 19:51:34 +02:00

cms.c

all: fix typos found by codespell

2025-08-03 13:48:47 +02:00

cms.h

swscale/utils: split off format code into new file

2025-03-14 19:50:44 +01:00

csputils.c

swscale/csputils: Remove unused ff_sws_matrix3x3_rmul()

2025-04-03 06:04:57 +02:00

csputils.h

swscale/csputils: Remove unused ff_sws_matrix3x3_rmul()

2025-04-03 06:04:57 +02:00

filters.c

swscale/filters: write new filter LUT generation code

2026-03-28 18:50:13 +01:00

filters.h

swscale/filters: write new filter LUT generation code

2026-03-28 18:50:13 +01:00

format.c

swscale/ops: add min/max to SwsDitherOp

2026-03-29 12:10:38 +02:00

format.h

swscale/format: add helper function to get "default" SwsFormat

2026-03-28 16:48:13 +00:00

gamma.c

swscale: rename SwsContext to SwsInternal

2024-10-24 22:50:00 +02:00

graph.c

swscale/graph: add scaling ops when required

2026-03-28 18:50:14 +01:00

graph.h

swscale/graph: add way to roll back passes

2026-03-28 18:50:13 +01:00

half2float.c

swscale/input: add rgbaf16 input support

2022-08-19 22:09:36 +02:00

hscale_fast_bilinear.c

swscale: rename SwsContext to SwsInternal

2024-10-24 22:50:00 +02:00

hscale.c

swscale/range_convert: fix mpeg ranges in yuv range conversion for non-8-bit pixel formats

2024-12-05 21:10:29 +01:00

input.c

Revert "swscale: add support for 10/12-bit grayscale MSB pixfmts"

2025-11-06 21:46:41 +01:00

libswscale.v

…

log2_tab.c

…

lut3d.c

swscale/lut3d: remove unused function

2025-07-22 19:56:34 +02:00

lut3d.h

swscale/utils: split off format code into new file

2025-03-14 19:50:44 +01:00

Makefile

swscale/aarch64: introduce tool to enumerate sws_ops for NEON backend

2026-03-30 11:38:35 +00:00

ops_backend.c

swscale/ops_chain: simplify ff_sws_compile_op_tables() with int index

2026-03-29 12:13:40 +02:00

ops_backend.h

swscale/ops_backend: add support for SWS_OP_FILTER_V

2026-03-28 18:50:14 +01:00

ops_chain.c

swscale/ops_chain: simplify ff_sws_compile_op_tables() with int index

2026-03-29 12:13:40 +02:00

ops_chain.h

swscale/ops_chain: simplify ff_sws_compile_op_tables() with int index

2026-03-29 12:13:40 +02:00

ops_dispatch.c

swscale/ops: simplify SwsOpList.order_src/dst

2026-03-29 09:39:09 +00:00

ops_dispatch.h

swscale/ops_dispatch: compute input x offset map for SwsOpExec

2026-03-28 18:50:14 +01:00

ops_internal.h

swscale/ops: add helper function to split filter subpasses

2026-03-28 18:50:13 +01:00

ops_memcpy.c

swscale/ops: add filter kernel to SwsReadWriteOp

2026-03-28 18:50:13 +01:00

ops_optimizer.c

swscale/ops_optimizer: check COMP_GARBAGE instead of next->comps.unused

2026-03-29 09:39:09 +00:00

ops_tmpl_common.c

swscale/ops_backend: add support for SWS_OP_FILTER_H

2026-03-28 18:50:14 +01:00

ops_tmpl_float.c

swscale/ops_backend: add support for SWS_OP_FILTER_H

2026-03-28 18:50:14 +01:00

ops_tmpl_int.c

swscale/ops_backend: add support for SWS_OP_FILTER_H

2026-03-28 18:50:14 +01:00

ops.c

swscale/aarch64: add NEON sws_ops backend

2026-03-30 11:38:35 +00:00

ops.h

swscale/ops: add min/max to SwsDitherOp

2026-03-29 12:10:38 +02:00

options.c

swscale: add enum SwsScaler, SwsContext.scaler to replace legacy flags

2026-03-12 22:09:04 +01:00

output.c

swscale/output: fix integer overflows in chroma in yuv2rgba64_X_c_template()

2026-03-13 02:51:19 +01:00

rgb2rgb_template.c

swscale/rgb2rgb: Remove set-but-unused functions

2026-03-01 23:45:11 +00:00

rgb2rgb.c

swscale/rgb2rgb: Remove set-but-unused functions

2026-03-01 23:45:11 +00:00

rgb2rgb.h

swscale/rgb2rgb: Remove set-but-unused functions

2026-03-01 23:45:11 +00:00

slice.c

swscale/slice: fix init of 32 bpc planes

2024-12-16 12:21:55 +01:00

swscale_internal.h

swscale: explicitly track if a context is "legacy" or not

2026-03-06 19:06:33 +01:00

swscale_unscaled.c

swscale/unscaled: fix planarCopyWrapper for float formats with same endianness

2026-03-09 08:22:58 +00:00

swscale.c

swscale/graph: allow setup() to return an error code

2026-03-12 21:02:48 +00:00

swscale.h

swscale/filters: write new filter LUT generation code

2026-03-28 18:50:13 +01:00

swscaleres.rc

…

utils.c

swscale: add enum SwsScaler, SwsContext.scaler to replace legacy flags

2026-03-12 22:09:04 +01:00

version_major.h

libs: bump major version for all libraries

2025-03-28 14:44:34 -03:00

version.c

lib*/version: Use static_assert for static asserts

2024-03-31 00:08:42 +01:00

version.h

swscale: add enum SwsScaler, SwsContext.scaler to replace legacy flags

2026-03-12 22:09:04 +01:00

vscale.c

swscale/internal: group user-facing options together

2024-11-21 12:49:56 +01:00

yuv2rgb.c

swscale/aarch64: add NEON YUV420P/YUV422P/YUVA420P to RGB conversion

2026-03-02 13:14:07 +00:00