FFmpeg

mirror of https://mirror.skon.top/https://github.com/FFmpeg/FFmpeg synced 2026-04-21 05:11:59 +08:00

Author	SHA1	Message	Date
Niklas Haas	6c89a30ecd	swscale: add FFFramePool and use it for allocating planes The major consequence of this is that we start allocating buffers per plane, instead of allocating one contiguous buffer. This makes the no-op/refcopy case slightly slower, but doesn't meaningfully affect the rest: yuva444p -> yuva444p, time=157/1000 us (ref=78/1000 us), speedup=0.497x slower Overall speedup=1.016x faster, min=0.983x max=1.092x However, this is a necessary consequence of the desire to allow partial plane allocations / single plane refcopies. This slowdown also does not affect vf_scale, which already uses avfilter/framepool.c (via ff_get_video_buffer). Signed-off-by: Niklas Haas <git@haasn.dev>	2026-04-10 15:12:18 +02:00
Ramiro Polla	a1bfaa0e78	swscale/aarch64: introduce tool to enumerate sws_ops for NEON backend The NEON sws_ops backend will use a build-time code generator for the various operation functions it needs to implement. This build time code generator (ops_asmgen) will need a list of the operations that must be implemented. This commit adds a tool (sws_ops_aarch64) that generates such a list (ops_entries.c). The list is generated by iterating over all possible conversion combinations and collecting the parameters for each NEON assembly function that has to be implemented, defined by a unique set of parameters derived from SwsOp. Whenever swscale evolves, with improved optimization passes, new pixel formats, or improvements to the backend itself, this file (ops_entries.c) should be regenerated by running: $ make sws_ops_entries_aarch64 Sponsored-by: Sovereign Tech Fund Signed-off-by: Ramiro Polla <ramiro.polla@gmail.com>	2026-03-30 11:38:35 +00:00
Niklas Haas	475b11b2e0	swscale/filters: write new filter LUT generation code This is a complete rewrite of the math in swscale/utils.c initFilter(), using floating point math and with a bit more polished UI and internals. I have also included a substantial number of improvements, including a method to numerically compute the true filter support size from the parameters, and a more robust logic for the edge conditions. The upshot of these changes is that the filter weight computation is now much simpler and faster, and with fewer edge cases. I copy/pasted the actual underlying kernel functions from libplacebo, so this math is already quite battle-tested. I made some adjustments to the defaults to align with the existing defaults in libswscale, for backwards compatibility. Note that this commit introduces a lot more filter kernels than what we actually expose; but they are cheap to carry around, don't take up binary space, and will probably save some poor soul from incorrectly reimplementing them in the future. Plus, I have plans to expand the list of functions down the line, so it makes sense to just define them all, even if we don't necessarily use them yet. Sponsored-by: Sovereign Tech Fund Signed-off-by: Niklas Haas <git@haasn.dev>	2026-03-28 18:50:13 +01:00
Niklas Haas	68f3886460	swscale/ops_dispatch: split off compile/dispatch code from ops.c This code is self-contained and logically distinct from the ops-related helpers in ops.c, so it belongs in its own file. Purely cosmetic; no functional change. Sponsored-by: Sovereign Tech Fund Signed-off-by: Niklas Haas <git@haasn.dev>	2026-03-05 23:34:56 +00:00
Niklas Haas	5b39be1f0a	swscale: fix build on --disable-unstable By excluding the Vulkan makefile entirely when --disable-unstable is passed. This also correctly avoids compiling e.g. unused GLSL compilers. Fixes: #22295 See-Also: #22366 Sponsored-by: Sovereign Tech Fund Signed-off-by: Niklas Haas <git@haasn.dev>	2026-03-04 11:53:10 +01:00
Lynne	1d2e616d5f	swscale: add a Vulkan backend for ops.c Sponsored-by: Sovereign Tech Fund	2026-02-26 14:10:22 +01:00
Niklas Haas	016749f2e1	swscale/tests: add new test for generated operation lists This is similar to swscale/tests/swscale.c, but significantly cheaper - it merely prints the generated (optimized) operation list for every format conversion. Mostly useful for my own purposes as a regression test when making changes to the ops optimizer. Note the distinction between this and tests/swscale.c, the latter of which tests the result of applying an operation list for equality. There is an argument to be made that the two tests could be merged, but I think the amount of overlap is small enough to not be worth the amount of differences.	2025-12-08 16:58:53 +00:00
Niklas Haas	4ec2bffe62	configure: allow disabling experimental swscale code In theory we can also expand this to disable e.g. experimental codecs.	2025-09-01 19:28:36 +02:00
Niklas Haas	a151b426f9	swscale/ops_memcpy: add 'memcpy' backend for plane->plane copies Provides a generic fast path for any operation list that can be decomposed into a series of memcpy and memset operations. 25% faster than the x86 backend for yuv444p -> yuva444p 33% faster than the x86 backend for gray -> yuvj444p	2025-09-01 19:28:36 +02:00
Niklas Haas	5aef513fb4	swscale/ops_backend: add reference backend based on C templates This will serve as a reference for the SIMD backends to come. That said, with auto-vectorization enabled, the performance of this is not atrocious. It easily beats the old C code and sometimes even the old SIMD. In theory, we can dramatically speed it up by using GCC vectors instead of arrays, but the performance gains from this are too dependent on exact GCC versions and flags, so in practice it's not a substitute for a SIMD implementation.	2025-09-01 19:28:36 +02:00
Niklas Haas	99d73064f5	swscale/ops_chain: add internal abstraction for kernel linking See doc/swscale-v2.txt for design details.	2025-09-01 19:28:36 +02:00
Niklas Haas	ea9ca3ff35	swscale/optimizer: add high-level ops optimizer This is responsible for taking a "naive" ops list and optimizing it as much as possible. Also includes a small analyzer that generates component metadata for use by the optimizer.	2025-09-01 19:28:36 +02:00
Niklas Haas	16e191c8ef	swscale/ops: introduce new low level framework See doc/swscale-v2.txt for an in-depth introduction to the new approach. This commit merely introduces the ops definitions and boilerplate functions. The subsequent commits will flesh out the underlying implementation.	2025-09-01 19:28:36 +02:00
James Almer	bf22c4cc3e	avutil: only duplicate half2float and float2half in shared builds Signed-off-by: James Almer <jamrial@gmail.com>	2025-03-18 17:21:23 -03:00
Niklas Haas	ae84aa775f	swscale/utils: split off format code into new file utils.c is getting quite crowded, and I need a new place to dump a lot of format handling code for the ongoing rewrite. Rather than bloating this file even more, start splitting format handling helpers off into a new file. This also renames the existing utils.h header, which didn't really contain anything except the SwsFormat definition anyway (the prototypes for what should have been in utils.h are all still in the legacy swscale_internal.h).	2025-03-14 19:50:44 +01:00
Niklas Haas	a57fe519b6	swscale/lut3d: add 3DLUT dispatch system This is a lightweight wrapper around the underlying color management system, whose job it is merely to manage the 3DLUT state and apply them to the frame data. This is where we might add platform-specific optimizations in the future. I also plan on adding support for more pixel formats in the future. In particular, we could support YUV or XYZ input formats directly using only negligible additional code in the 3DLUT setup functions. This would eliminate the major source of slowdown, which is currently the roundtrip to RGBA64.	2024-12-23 12:33:43 +01:00
Niklas Haas	dddf536d3d	swscale/cms: add color management subsystem The underlying color mapping logic was ported as straightforwardly as possible from libplacebo, although the API and glue code has been very heavily refactored / rewritten. In particular, the generalization of gamut mapping methods is replaced by a single ICC intent selection, and constants have been hard-coded. To minimize the amount of overall operations, this gamut mapping LUT now embeds a direct end-to-end transformation to the output color space; something that libplacebo does in shaders, but which is prohibitively expensive in software. In order to preserve compatibility with dynamic tone mapping without severely regressing performance, we add the ability to generate a pair of "split" LUTS, one for encoding the input and output to the perceptual color space, and a third to embed the tone mapping operation. Additionally, this intermediate space could be used for additional subjective effect (e.g. changing saturation or brightness). The big downside of the new approach is that generating a static color mapping LUT is now fairly slow, as the chromaticity lobe peaks have to be recomputed for every single RGB value, since correlated RGB colors are not necessarily aligned in ICh space. Generating a split 3DLUT significantly alleviates this problem because the expensive step is done as part of the IPT input LUT, which can share the same hue peak calculation at least for all input intensities.	2024-12-23 12:33:43 +01:00
Niklas Haas	2e674780b7	swscale/csputils: add internal colorspace math helpers Logic is, for the most part, a straight port of similar logic in libplacebo's colorspace.c, with some general edits and refactors.	2024-12-23 12:33:43 +01:00
Niklas Haas	bf738412e8	swscale/graph: add new high-level scaler dispatch mechanism This interface has been designed from the ground up to serve as a new framework for dispatching various scaling operations at a high level. This will eventually replace the old ad-hoc system of using cascaded contexts, as well as allowing us to plug in more dynamic scaling passes requiring intermediate steps, such as colorspace conversions, etc. The starter implementation merely piggybacks off the existing sws_init() and sws_scale(), functions, though it does bring the immediate improvement of splitting up cascaded functions and pre/post conversion functions into separate filter passes, which allows them to e.g. be executed in parallel even when the main scaler is required to be single threaded. Additionally, a dedicated (multi-threaded) noop memcpy pass substantially improves throughput of that fast path. Follow-up commits will eventually expand this to move all of the scaling decision logic into the graph init function, and also eliminate some of the current special cases. Sponsored-by: Sovereign Tech Fund Signed-off-by: Niklas Haas <git@haasn.dev>	2024-11-25 11:02:16 +01:00
Timo Rothenpieler	aca569aad2	swscale/input: add rgbaf16 input support This is by no means perfect, since at least ddagrab will return scRGB data with values outside of 0.0f to 1.0f for HDR values. Its primary purpose is to be able to work with the format at all.	2022-08-19 22:09:36 +02:00
Timo Rothenpieler	b77fff47d0	configure: always enable gnu_windres if available Use the appropriate Makefile variable to ensure the resource file is only built into shared libraries instead.	2022-08-13 14:42:36 +02:00
Andreas Rheinhardt	f2b79c5b85	lib*/version: Move library version functions into files of their own This avoids having to rebuild big files every time FFMPEG_VERSION changes (which it does with every commit). Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2022-05-10 06:49:32 +02:00
Martin Storsjö	6cd2ac388d	libswscale: Split version.h Signed-off-by: Martin Storsjö <martin@martin.st>	2022-03-16 14:05:26 +02:00
Andreas Rheinhardt	20b0d24c2f	Makefile: Redo duplicating object files in shared builds In case of shared builds, some object files containing tables are currently duplicated into other libraries: log2_tab.c, golomb.c, reverse.c. The check for whether this is duplicated is simply whether CONFIG_SHARED is true. Yet this is crude: E.g. libavdevice includes reverse.c for shared builds, but only needs it for the decklink input device, which given that decklink is not enabled by default will be unused in most libavdevice.so. This commit changes this by making it more explicit about what to duplicate from other libraries. To do this, two new Makefile variables were added: SHLIBOBJS and STLIBOBJS. SHLIBOBJS contains the objects that are duplicated from other libraries in case of shared builds; STLIBOBJS contains stuff that a library has to provide for other libraries in case of static builds. These new variables provide a way to enable/disable with a finer granularity than just whether shared builds are enabled or not. E.g. lavd's Makefile now contains: SHLIBOBJS-$(CONFIG_DECKLINK_INDEV) += reverse.o Another example is provided by the golomb tables. These are provided by lavc for static builds, even if one uses a build configuration that makes only lavf use them. Therefore lavc's Makefile contains STLIBOBJS-$(CONFIG_MXF_MUXER) += golomb.o, whereas lavf's Makefile has a corresponding SHLIBOBJS-$(CONFIG_MXF_MUXER) += golomb_tab.o. E.g. in case the MXF muxer is the only component needing these tables only libavformat.so will contain them for shared builds; currently libavcodec.so does so, too. (There is currently a CONFIG_EXTRA group for golomb. But actually one would need two groups (golomb_avcodec and golomb_avformat) in order to know when and where to include these tables. Therefore this commit uses a Makefile-based approach for this and stops using these groups for the users in libavformat.) 
Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2022-01-04 05:01:04 +01:00
Mark Reid	6bf57c6a2a	libswscale/tests: add floatimg_cmp test changes since v1: - made into fate test - fixed c90 warnings - tests more intermediate formats - tested on BE mips too Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2020-10-02 14:59:52 +02:00
James Almer	6fdd35a312	Merge commit '92db5083077a8b0f8e1050507671b456fd155125' * commit '92db5083077a8b0f8e1050507671b456fd155125': build: Generate pkg-config files from Make and not from configure build: Store library version numbers in .version files Includes cherry-picked commits `8a34f36593` and `ee164727dd` to fix issues. Changes were also made to retain support for raise_major and build_suffix. Reviewed-by: ubitux Merged-by: James Almer <jamrial@gmail.com>	2017-05-04 19:59:30 -03:00
Clément Bœsch	3f17751eeb	Merge commit '11a9320de54759340531177c9f2b1e31e6112cc2' * commit '11a9320de54759340531177c9f2b1e31e6112cc2': build: Move build-system-related helper files to a separate subdirectory "ffbuild" directory name is used instead of "avbuild". Merged-by: Clément Bœsch <u@pkh.me>	2017-05-03 16:49:12 +02:00
Clément Bœsch	08e1376d81	fate: add fate-sws-pixdesc-query Test the pixel format querying within libswscale.	2017-03-20 08:02:30 +01:00
Diego Biurrun	92db508307	build: Generate pkg-config files from Make and not from configure This moves work from the configure to the Make stage where it can be parallelized and ensures that pkgconfig files are updated when library versions change. Bug-Id: 449	2016-12-22 12:30:54 +01:00
Derek Buitenhuis	ca5ec2bf51	Merge commit '01621202aad7e27b2a05c71d9ad7a19dfcbe17ec' * commit '01621202aad7e27b2a05c71d9ad7a19dfcbe17ec': build: miscellaneous cosmetics Merged-by: Derek Buitenhuis <derek.buitenhuis@gmail.com>	2016-05-09 16:25:28 +01:00
Diego Biurrun	01621202aa	build: miscellaneous cosmetics Restore alphabetical order in lists, break overly long lines, do some prettyprinting, add some explanatory section comments, group parts together that belong together logically.	2016-04-07 15:26:08 +02:00
Pedro Arthur	3059562aa1	swscale: re-enable gamma +added gamma conversion to refactored code	2015-09-04 19:00:20 -03:00
Pedro Arthur	62d176de12	swscale: refactor vertical scaler	2015-08-19 10:43:52 -03:00
Pedro Arthur	e0a3173a94	swscale: refactor horizontal scaling + split color conversion from scaling - disabled gamma correction, until it's refactored too Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2015-08-18 01:33:32 +02:00
Michael Niedermayer	d0e0757e9a	swscale: Implement alphablendaway for planar 4:4:4 formats Fixes Ticket4746 Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2015-08-08 13:24:52 +02:00
James Almer	9ffac3d00d	lsws: duplicate ff_log2_tab libswscale uses the table but wasn't duplicating it like the rest of the libs. This should fix compilation failures on msvc/icl after lavu stopped exporting internal functions and tables. Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	2014-08-12 20:52:21 +02:00
Michael Niedermayer	e9f7c7aef9	sws: Move fast bilinear C code into separate file Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	2014-07-19 05:36:26 +02:00
Michael Niedermayer	3d7218d932	Merge commit '449511740f06a4675b0066730fa45cdb764ffafc' * commit '449511740f06a4675b0066730fa45cdb764ffafc': build: handle library dependencies in configure Conflicts: common.mak configure libavdevice/Makefile libavfilter/Makefile Merged-by: Michael Niedermayer <michaelni@gmx.at>	2014-05-13 22:40:32 +02:00
Janne Grunau	449511740f	build: handle library dependencies in configure Instead of setting FFLIBS in each library Makefile configure exports FFLIBS-$library in config.mak.	2014-05-13 20:02:01 +02:00
James Almer	56572787ae	Add Windows resource file support for shared libraries Originally written by James Almer <jamrial@gmail.com> With the following contributions by Timothy Gu <timothygu99@gmail.com> * Use descriptions of libraries from the pkg-config file generation function * Use "FFmpeg Project" as CompanyName (suggested by Alexander Strasser) * Use "FFmpeg" for ProductName as MSDN says "name of the product with which the file is distributed" [1]. * Use FFmpeg's version (N-xxxxx-gxxxxxxx) for ProductVersion per MSDN [1]. * Only build the .rc files when --enable-small is not enabled. [1] http://msdn.microsoft.com/en-us/library/windows/desktop/aa381058.aspx Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	2013-12-05 23:42:07 +01:00
Michael Niedermayer	039e9fe01c	Merge remote-tracking branch 'qatar/master' * qatar/master: (29 commits) lavfi: reclassify showfiltfmts as a TESTPROG graph2dot: fix printf format specifier swscale: yuv2planeX 8bit >=sse2 functions need aligned stack on x86-32. vp8: loopfilter >=sse2 functions need aligned stack on x86-32. amr: remove shift out of the AMR_BIT() macro. dsputilenc: group yasm and inline asm function pointer assignment. mov: use forward declaration of a function instead of a table. Clarify Doxygen comment for FF_API_* #defines. configure: simplify get_version() Create version.h headers for libraries that lack them gitignore: Use full path instead of relative path to specify patterns mpegvideo: remove VLAs Add XTEA encryption support in libavutil Add Blowfish encryption support in libavutil eval: Add the isinf() function and tests for it flacdec: move lpc filter to flacdsp flacdec: split off channel decorrelation as flacdsp avplay: Add an option for not limiting the input buffer size FATE: add a test for WMA cover art. FATE: add a test for apetag cover art ... Conflicts: .gitignore configure ffplay.c libavcodec/Makefile libavcodec/error_resilience.c libavcodec/mpegvideo.c libavcodec/ratecontrol.c libavdevice/avdevice.h libavfilter/Makefile libavfilter/filtfmts.c libavfilter/version.h libavformat/mov.c libavformat/version.h libavutil/Makefile libavutil/avutil.h libavutil/version.h libswscale/swscale.h libswscale/x86/swscale_mmx.c tests/fate/libavutil.mak tests/lavfi-regression.sh tools/graph2dot.c Merged-by: Michael Niedermayer <michaelni@gmx.at>	2012-07-04 21:03:28 +02:00
Diego Biurrun	86ab7b0f2f	Create version.h headers for libraries that lack them	2012-07-04 15:10:06 +02:00
Michael Niedermayer	653d117c29	Merge remote-tracking branch 'qatar/master' * qatar/master: libschroedinger: Switch to function names more in line with Libav style. Move code shared between libdirac and libschroedinger to libschroedinger. lavfi: uninline avfilter_copy_buffer_ref_props(). lavf: add missing '*' in a doxy. h264: Remove a commented-out function pointer typedef. txd: Remove write-only variable in txd_decode_frame(). mmvideo.c: Remove unused variable in mm_decode_pal(). build: cosmetics: Add missing end-of-line backslashes to item lists. build: cosmetics: Split HEADERS/OBJS/PROGS lists into one entry per line. libschroedinger: Move a function to avoid a forward declaration. pthread: warn on high thread counts vf_yadif: fix missing error handling for avfilter_poll_frame() avprobe: allow showing only one container/stream property. lavfi: support audio in avfilter_copy_frame_props(). lavfi: avfilter_merge_formats: handle case where inputs are same lavc: add sample rate and channel layout to AVFrame. zerocodec: check if the previous frame is missing doc: clarify check for NULL pointer style Conflicts: doc/APIchanges doc/developer.texi ffprobe.c libavcodec/Makefile libavcodec/avcodec.h libavcodec/libdirac_libschro.c libavcodec/libdirac_libschro.h libavcodec/mmvideo.c libavcodec/txd.c libavcodec/version.h libavcodec/zerocodec.c libavfilter/Makefile libavfilter/avfilter.c libavfilter/version.h libavformat/Makefile libavutil/Makefile Merged-by: Michael Niedermayer <michaelni@gmx.at>	2012-05-07 22:51:34 +02:00
Diego Biurrun	9eb83a56aa	build: cosmetics: Split HEADERS/OBJS/PROGS lists into one entry per line.	2012-05-07 14:01:32 +02:00
Michael Niedermayer	367d9b2957	Merge remote-tracking branch 'qatar/master' * qatar/master: swscale: K&R formatting cosmetics (part II) tiffdec: Add a malloc check and refactor another. faxcompr: Check malloc results and unify return path configure: escape colons in values written to config.fate ac3dsp: call femms/emms at the end of float_to_fixed24() for 3DNow and SSE matroska: Fix leaking memory allocated for laces. pthread: Fix crash due to fctx->delaying not being cleared. vp3: Assert on invalid filter_limit values. h264: fix 10bit biweight functions after recent x86inc.asm fixes. ffv1: Fix size mismatch in encode_line. movenc: Remove a dead initialization git-howto: Explain how to avoid Windows line endings in git checkouts. build: Move all arch OBJS declarations into arch subdirectory Makefiles. Conflicts: configure libavcodec/vp3.c libavformat/matroskadec.c libavutil/Makefile libswscale/Makefile libswscale/swscale.c libswscale/swscale_internal.h libswscale/utils.c Merged-by: Michael Niedermayer <michaelni@gmx.at>	2012-04-13 21:50:37 +02:00
Michael Niedermayer	ca19862d38	Merge remote-tracking branch 'qatar/master' * qatar/master: libxvid: remove disabled code qdm2: make a table static const qdm2: simplify bitstream reader setup for some subpacket types qdm2: use get_bits_left() build: Consistently handle conditional compilation for all optimization OBJS. avpacket, bfi, bgmc, rawenc: K&R prettyprinting cosmetics msrle: convert MS RLE decoding function to bytestream2. x86inc improvements for 64-bit Conflicts: common.mak libavcodec/avpacket.c libavcodec/bfi.c libavcodec/msrledec.c libavcodec/qdm2.c Merged-by: Michael Niedermayer <michaelni@gmx.at>	2012-04-13 00:39:19 +02:00
Diego Biurrun	baaab6069a	build: Move all arch OBJS declarations into arch subdirectory Makefiles.	2012-04-12 21:30:13 +02:00
Diego Biurrun	7bb3a302fe	build: Consistently handle conditional compilation for all optimization OBJS.	2012-04-12 09:00:49 +02:00
Michael Niedermayer	7e496e1545	Merge remote-tracking branch 'qatar/master' * qatar/master: build: ppc: drop stray leftover backslash build: Only clean the architecture subdirectory we build for. build: drop some unnecessary dependencies from the H.264 parser build: prettyprinting cosmetics libavutil: Remove pointless rational test program. libavutil: Remove broken and pointless lzo test program. lavf doxy: expand AVStream.codec doxy. lavf doxy: improve AVStream.time_base doxy. lavf doxy: add some basic documentation about reading from the demuxer. lavf doxy: document passing options to demuxers. lavf doxy: clarify that an AVPacket contains encoded data. mpegtsenc: allow user triggered PES packet flushing APIchanges: mark the place where 0.7 was cut. APIchanges: mark the place where 0.8 was cut. APIchanges: fill in missing dates and hashes. smacker: convert palette and header reading to bytestream2. alac: convert extradata reading to bytestream2. Conflicts: doc/APIchanges libavcodec/smacker.c libavcodec/x86/Makefile libavfilter/Makefile libavutil/Makefile Merged-by: Michael Niedermayer <michaelni@gmx.at>	2012-03-26 20:52:52 +02:00
Diego Biurrun	e7e19b15c7	build: Only clean the architecture subdirectory we build for. This allows simplifying the Makefiles; it is no longer necessary to register arch subdirectory Makefiles, just putting them in place is enough.	2012-03-26 13:29:03 +02:00

139 Commits