FFmpeg

mirror of https://mirror.skon.top/https://github.com/FFmpeg/FFmpeg synced 2026-04-20 21:00:41 +08:00

Author	SHA1	Message	Date
Niklas Haas	cf2d40f65d	swscale/ops: add explicit clear mask to SwsClearOp Instead of implicitly testing for NaN values. This is mostly a straightforward translation, but we need some slight extra boilerplate to ensure the mask is correctly updated when e.g. commuting past a swizzle. Signed-off-by: Niklas Haas <git@haasn.dev>	2026-04-16 23:23:36 +02:00
Kacper Michajłow	369dbbe488	swscale/ops_memcpy: guard exec->in_stride[-1] access When use_loop == true and idx < 0, we would incorrectly check in_stride[idx], which is OOB read. Reorder conditions to avoid that. Signed-off-by: Kacper Michajłow <kasper93@gmail.com>	2026-04-16 18:56:22 +00:00
Niklas Haas	b6755b0158	swscale/ops_memcpy: always use loop on buffers with large padding The overhead of the loop and memcpy call is less than the overhead of possibly spilling into one extra unnecessary cache line. 64 is still a good rule of thumb for L1 cache line size in 2026. I leave it to future code archeologists to find and tweak this constant if it ever becomes unnecessary. Signed-off-by: Niklas Haas <git@haasn.dev>	2026-04-15 14:51:16 +00:00
Niklas Haas	85bef2c2bc	swscale/ops: split SwsConst up into op-specific structs It was a bit clunky, lacked semantic contextual information, and made it harder to reason about the effects of extending this struct. There should be zero runtime overhead as a result of the fact that this is already a big union. I made the changes in this commit by hand, but due to the length and noise level of the commit, I used Opus 4.6 to verify that I did not accidentally introduce any bugs or typos. Signed-off-by: Niklas Haas <git@haasn.dev>	2026-04-02 11:48:15 +00:00
Niklas Haas	bf09910292	swscale/ops: add filter kernel to SwsReadWriteOp This allows reads to directly embed filter kernels. This is because, in practice, a filter needs to be combined with a read anyways. To accomplish this, we define filter ops as their semantic high-level operation types, and then have the optimizer fuse them with the corresponding read/write ops (where possible). Ultimately, something like this will be needed anyways for subsampled formats, and doing it here is just incredibly clean and beneficial compared to each of the several alternative designs I explored. Sponsored-by: Sovereign Tech Fund Signed-off-by: Niklas Haas <git@haasn.dev>	2026-03-28 18:50:13 +01:00
Lynne	ad452205b6	swscale/ops: add SwsOpBackend.hw_format Allows to filter hardware formats. Sponsored-by: Sovereign Tech Fund	2026-02-26 14:10:22 +01:00
Lynne	00907e1244	swscale/ops: realign after adding slice_align This is a separate commit since it makes it easier to see the changes. Sponsored-by: Sovereign Tech Fund	2026-02-26 14:10:21 +01:00
Lynne	9c51aa1824	swscale: add SwsCompiledOp.slice_align Certain backends may not support (or need) slices, since they would handle slicing themselves. Sponsored-by: Sovereign Tech Fund	2026-02-26 14:10:21 +01:00
Niklas Haas	a151b426f9	swscale/ops_memcpy: add 'memcpy' backend for plane->plane copies Provides a generic fast path for any operation list that can be decomposed into a series of memcpy and memset operations. 25% faster than the x86 backend for yuv444p -> yuva444p 33% faster than the x86 backend for gray -> yuvj444p	2025-09-01 19:28:36 +02:00

9 Commits