mozjpeg

Author	SHA1	Message	Date
DRC	607b668ff9	MSVC: Eliminate C4996 warnings in API libs The primary purpose of this is to encourage adoption of libjpeg-turbo in downstream Windows projects that forbid the use of "deprecated" functions. libjpeg-turbo's usage of those functions was not actually unsafe, because: - libjpeg-turbo always checks the return value of fopen() and ensures that a NULL filename can never be passed to it. - libjpeg-turbo always checks the return value of getenv() and never passes a NULL argument to it. - The sprintf() calls in format_message() (jerror.c) could never overflow the destination string buffer or leave it unterminated as long as the buffer was at least JMSG_LENGTH_MAX bytes in length, as instructed. (Regardless, this commit replaces those calls with snprintf() calls.) - libjpeg-turbo never uses sscanf() to read strings or multi-byte character arrays. - Because of `b7d6e84d6a`, wrjpgcom explicitly checks the bounds of the source and destination strings before calling strcat() and strcpy(). - libjpeg-turbo always ensures that the destination string is terminated when using strncpy(). (`548490fe5e` made this explicit.) Regarding thread safety: Technically speaking, getenv() is not thread-safe, because the returned pointer may be invalidated if another thread sets the same environment variable between the time that the first thread calls getenv() and the time that that thread uses the return value. In practice, however, this could only occur with libjpeg-turbo if: (1) A multithreaded calling application used the deprecated and undocumented TJFLAG_FORCEMMX/TJFLAG_FORCESSE/TJFLAG_FORCESSE2 flags in the TurboJPEG API or set one of the corresponding environment variables (which are only intended for testing purposes.) Since the TurboJPEG API library only ever passed string constants to putenv(), the only inherent risk (i.e. the only risk introduced by the library and not the calling application) was that the SIMD extensions may have read an incorrect value from one of the aforementioned environment variables. or (2) A multithreaded calling application modified the value of the JPEGMEM environment variable in one thread while another thread was reading the value of that environment variable (in the body of jpeg_create_compress() or jpeg_create_decompress().) Given that the libjpeg API provides a thread-safe way for applications to modify the default memory limit without using the JPEGMEM environment variable, direct modification of that environment variable by calling applications is not supported. Microsoft's implementation of getenv_s() does not claim to be thread-safe either, so this commit uses getenv_s() solely to mollify Visual Studio. New inline functions and macros (GETENV_S() and PUTENV_S) wrap getenv_s()/_putenv_s() when building for Visual Studio and getenv()/setenv() otherwise, but GETENV_S()/PUTENV_S() provide no advantages over getenv()/setenv() other than parameter validation. They are implemented solely for convenience. Technically speaking, strerror() is not thread-safe, because the returned pointer may be invalidated if another thread changes the locale and/or calls strerror() between the time that the first thread calls strerror() and the time that that thread uses the return value. In practice, however, this could only occur with libjpeg-turbo if a multithreaded calling application encountered a file I/O error in tjLoadImage() or tjSaveImage(). Since both of those functions immediately copy the string returned from strerror() into a thread-local buffer, the risk is minimal, and the worst case would involve an incorrect error string being reported to the calling application. Regardless, this commit uses strerror_s() in the TurboJPEG API library when building for Visual Studio. Note that strerror_r() could have been used on Un*x systems, but it would have been necessary to handle both the POSIX and GNU implementations of that function and perform widespread compatibility testing. Such is left as an exercise for another day. Fixes #568	2022-02-23 15:57:01 -06:00
DRC	a3d4aadd0d	Build: Embed version/API/(C) info in MSVC DLLs Based on: `da7a18801a` Closes #576	2022-02-01 13:00:42 -06:00
DRC	172972394a	Eliminate non-ANSI C compatibility macros libjpeg-turbo has never supported non-ANSI C compilers. Per the spec, ANSI C compilers must have locale.h, stddef.h, stdlib.h, memset(), memcpy(), unsigned char, and unsigned short. They must also handle undefined structures.	2022-01-06 11:50:26 -06:00
DRC	2ce32e0fe5	cjpeg: automatically compress PGM-->grayscale JPEG (regression introduced by `aa7459050d`) cjpeg sets cinfo.in_color_space to JCS_RGB as an "arbitrary guess." Since tjLoadImage() never uses JCS_RGB, the PGM reader should treat JCS_RGB the same as JCS_UNKNOWN. Fixes #566	2021-11-30 16:09:21 -06:00
Alex Xu (Hello71)	18edeff4e8	Build: Set CMP0065 NEW (respect ENABLE_EXPORTS) Referring to https://cmake.org/cmake/help/latest/policy/CMP0065.html, CMake 3.3 and earlier automatically added compiler/linker flags such as -rdynamic/-export-dynamic, which caused symbols to be exported from executables. The primary purpose of this is to allow plugins loaded via dlopen() to access symbols from the calling program. libjpeg-turbo does not need this functionality, and enabling it needlessly increases the size of the libjpeg-turbo executables. Setting CMP0065 to NEW when using CMake 3.4 and later prevents CMake from automatically adding the aforementioned compiler/linker flags unless the ENABLE_EXPORTS property is set for a target (or the CMAKE_ENABLE_EXPORTS variable is set, which causes ENABLE_EXPORTS to be set for all targets.) Closes #554	2021-10-03 13:39:38 -05:00
DRC	129f0cb763	Neon/AArch64: Don't put GAS functions in .rodata Regression introduced by `240ba417aa` Closes #546	2021-08-25 12:54:24 -05:00
DRC	84d6306f64	Fix build w/CMake 3.14+ when CMAKE_SYSTEM_NAME=iOS Closes #539	2021-07-27 11:26:51 -05:00
DRC	5135c2e25d	Build: Use PIC for jsimd_none.o in shared libs In theory, all objects that will be included in a Un*x shared library must be built using PIC. In practice, most compilers don't require PIC to be explicitly specified for jsimd_none.o, either because the compiler automatically enables PIC in all cases (Ubuntu) or because the size of the generated object is too small. But some rare compilers do require PIC to be explicitly specified for jsimd_none.o. Fixes #520	2021-05-28 13:09:43 -05:00
DRC	3932190c2e	Fix build w/ non-GCC-compatible Un*x/Arm compilers Regression introduced by `d2c4079959` Closes #519	2021-05-17 13:09:37 -05:00
DRC	4f51f36eb3	Bump version to 2.1.0 to prepare for final release	2021-04-23 11:42:40 -05:00
DRC	2f9e8a1172	OSS-Fuzz integration This commit integrates OSS-Fuzz targets directly into the libjpeg-turbo source tree, thus obsoleting and improving code coverage relative to Google's OSS-Fuzz target for libjpeg-turbo (previously available here: https://github.com/google/oss-fuzz). I hope to eventually create fuzz targets for the BMP, GIF, and PPM readers as well, which would allow for fuzz-testing compression, but since those readers all require an input file, it is unclear how to build an efficient fuzzer around them. It doesn't make sense to fuzz-test compression in isolation, because compression can't accept arbitrary input data.	2021-03-30 20:59:41 -05:00
DRC	e795afc330	SSE2: Fix prog Huff enc err if Sl%32==0 && Al!=0 (regression introduced by `16bd984557`) This implements the same fix for jsimd_encode_mcu_AC_refine_prepare_sse2() that `a81a8c137b` implemented for jsimd_encode_mcu_AC_first_prepare_sse2(). Based on: `1a59587397` `eb176a91d8` Fixes #509 Closes #510	2021-03-25 22:46:14 -05:00
Adrian Bunk	2c01200c5d	Build: Fix incorrect regexes w/ if(...MATCHES...) "arm*" as a regex means 'ar' followed by zero or more 'm' characters, which matches 'parisc' and 'sparc64' as well.	2021-03-15 12:56:53 -05:00
DRC	8a2cad0201	Build: Handle CMAKE_OSX_ARCHITECTURES=(i386\|ppc) We don't officially support i386 or PowerPC Mac builds of libjpeg-turbo anymore, but they still work (bearing in mind that PowerPC builds require GCC v4.0 in Xcode 3.2.6, and i386 builds require Xcode 9.x or earlier.) Referring to #495, apparently MacPorts needs this functionality.	2021-01-21 10:51:49 -06:00
DRC	399aa374bd	Build: Support CMAKE_OSX_ARCHITECTURES ... as long as it contains only a singular value, which must equal "x86_64" or "arm64". Refer to #495	2021-01-19 12:25:11 -06:00
DRC	3e8911aad5	Build: Use correct SIMD exts w/VStudio IDE + Arm64 When configuring a Visual Studio IDE build and passing -A arm64 to CMake, CMAKE_SYSTEM_PROCESSOR will be amd64, so we should set CPU_TYPE based on the value of CMAKE_GENERATOR_PLATFORM rather than the value of CMAKE_SYSTEM_PROCESSOR.	2021-01-11 14:03:42 -06:00
DRC	cfc7e6e58e	Bump revision to 2.0.91 for post-beta fixes	2020-11-25 14:10:55 -06:00
DRC	8cf6f716bc	Bump revision to 2.0.90 to prepare for beta	2020-11-24 21:32:48 -06:00
Jonathan Wright	eb14189caa	Fix Neon SIMD build issues with Visual Studio - Use the _M_ARM and _M_ARM64 macros provided by Visual Studio for compile-time detection of Arm builds, since __arm__ and __aarch64__ are only present in GNU-compatible compilers. - Neon/intrinsics: Use the _CountLeadingZeros() and _CountLeadingZeros64() intrinsics provided by Visual Studio, since __builtin_clz() and __builtin_clzl() are only present in GNU-compatible compilers. - Neon/intrinsics: Since Visual Studio does not support static vector initialization, replace static initialization of Neon vectors with the appropriate intrinsics. Compared to the static initialization approach, this produces identical assembly code with both GCC and Clang. - Neon/intrinsics: Since Visual Studio does not support inline assembly code, provide alternative code paths for Visual Studio whenever inline assembly is used. - Build: Set FLOATTEST appropriately for AArch64 Visual Studio builds (Visual Studio does not emit fused multiply-add [FMA] instructions by default for such builds.) - Neon/intrinsics: Move temporary buffer allocation outside of nested loops. Since Visual Studio configures Arm builds with a relatively small amount of stack memory, attempting to allocate those buffers within the inner loops caused a stack overflow. Closes #461 Closes #475	2020-11-24 21:13:16 -06:00
DRC	292d78e786	Merge branch 'master' into dev	2020-11-16 15:28:02 -06:00
DRC	88bf1d1678	Build: Set FLOATTEST more intelligently The "32bit" vs. "64bit" floating point test results actually have nothing to do with the FPU. That was a fallacious assumption based on the observation that, with multiple CPU types, 32-bit and 64-bit builds produce different floating point test results. It seems that this is, in fact, due to differing compiler behavior-- more specifically, whether fused multiply-add (FMA) instructions are used to combine multiple floating point operations into a single instruction ("floating point expression contraction".) GCC does this by default if the target supports FMA instructions, which PowerPC and AArch64 targets both do. Fixes #468	2020-11-16 15:19:42 -06:00
DRC	8f8305981b	Merge branch 'master' into dev	2020-11-13 15:21:26 -06:00
DRC	33859880e9	Neon: Auto-detect compiler intrinsics completeness This allows the Neon intrinsics code to be built successfully (albeit likely with reduced run-time performance) with Xcode 5.0-6.2 (iOS/AArch64) and Android NDK < r19 (AArch32). Note that Xcode 5.0-6.2 will not build the Armv8 GAS code without gas-preprocessor.pl, and no version of Xcode will build the Armv7 GAS code without gas-preprocessor.pl, so we always use the full Neon intrinsics implementation by default with macOS and iOS builds. Auto-detecting the completeness of the compiler's set of Neon intrinsics also allows us to more intelligently set the default value of NEON_INTRINSICS, based on the values of HAVE_VLD1. This is a reasonable, albeit imperfect, proxy for whether a compiler has a full and optimal set of Neon intrinsics. Specific notes: - 64-bit RGB-to-YCbCr color conversion does not use any of the intrinsics in question, regresses with GCC - 64-bit accurate integer forward DCT uses vld1_s16_x3(), regresses with GCC - 64-bit Huffman encoding uses vld1q_u8_x4(), regresses with GCC - 64-bit YCbCr-to-RGB color conversion does not use any of the intrinsics in question, regresses with GCC - 64-bit accurate integer inverse DCT uses vld1_s16_x3(), regresses with GCC - 64-bit 4x4 inverse DCT uses vld1_s16_x3(). I did not test this algorithm in isolation, so it may in fact regress with GCC, but the regression may be hidden by the speedup from the new SIMD-accelerated upsampling algorithms. - 32-bit RGB-to-YCbCr color conversion: uses vld1_u16_x2(), regresses with GCC - 32-bit accurate integer forward DCT uses vld1_s16_x3(), regression irrelevant because there was no previous implementation - 32-bit accurate integer inverse DCT uses vld1_s16_x3(), regresses with GCC - 32-bit fast integer inverse DCT does not use any of the intrinsics in question, regresses with GCC - 32-bit 4x4 inverse DCT uses vld1_s16_x3(). I did not test this algorithm in isolation, so it may in fact regress with GCC, but the regression may be hidden by the speedup from the new SIMD-accelerated upsampling algorithms. Presumably when GCC includes a full and optimal set of Neon intrinsics, the HAVE_VLD1 tests will pass, and the full Neon intrinsics implementation will be enabled automatically.	2020-11-13 15:16:34 -06:00
DRC	3e9e7c7055	Fix build if WITH_12BIT==1 && WITH_JPEG(7\|8)==1 Fixes #466	2020-11-11 17:54:06 -06:00
Jonathan Wright	ba52a3de32	Neon: Intrinsics impl of h2v1 & h2v2 merged upsamp There was no previous GAS implementation. This commit also reverts `40557b2301` and `7723d7f7d0`. `7723d7f7d0` was only necessary because there was no Neon implementation of merged upsampling/color conversion, and `40557b2301` was only necessary because of `7723d7f7d0`.	2020-11-10 19:09:09 -06:00
Jonathan Wright	2acfb93c94	Neon: Intrinsics impl. of h1v2 fancy upsamling There was no previous GAS implementation.	2020-11-10 19:09:09 -06:00
Jonathan Wright	4f2216b435	Neon: Intrinsics implementation of RGB->YCbCr The previous AArch32 and AArch64 GAS implementations are retained by default when using GCC, in order to avoid a performance regression. The intrinsics implementation can be forced on or off using a new NEON_INTRINSICS CMake variable.	2020-11-10 19:09:05 -06:00
DRC	c7dd191271	Merge branch 'master' into dev	2020-11-08 15:15:02 -06:00
DRC	40557b2301	Build: Fix test failures w/ Arm Neon SIMD exts Regression caused by `a46c111d9f` Because of `7723d7f7d0`, which was introduced in libjpeg-turbo 1.5.1 in response to #81, merged upsampling/ color conversion is disabled on platforms that have SIMD-accelerated YCbCr -> RGB color conversion but not SIMD-accelerated merged upsampling/color conversion. This was intended to improve performance with the Neon SIMD extensions, since those are the only SIMD extensions for which those circumstances apply. Under normal circumstances, the separate "plain" (non-fancy) upsampling and color conversion routines will produce bitwise-identical output to the merged upsampling/color conversion routines, but that is not the case when skipping scanlines starting at an odd-numbered scanline. The modified test introduced in `a46c111d9f` does precisely that in order to validate the fixes introduced in `9120a24743` and `a46c111d9f`. Because of `7723d7f7d0`, the segfault fixed in `9120a24743` and `a46c111d9f` didn't affect the Neon SIMD extensions, so this commit effectively reverts the test modifications in `a46c111d9f` when using those SIMD extensions. We can get rid of this hack, as well as `7723d7f7d0`, once a Neon implementation of merged upsampling/color conversion is available.	2020-11-08 14:57:01 -06:00
DRC	59352195b2	Merge branch 'master' into dev	2020-10-19 21:17:46 -05:00
DRC	f7ca3c5a3d	Build: Improve Arm 32-bit cross-comp./packaging - Set CPU_TYPE=arm if performing a 32-bit build on an AArch64 system. This eliminates the need to use a CMake toolchain file. - Set RPMARCH=armv7hl if building on a 32-bit Arm system with an FPU. - Set RPMARCH=armv7hl and DEBARCH=armhf if performing a 32-bit build using a gnueabihf toolchain. - If performing a 32-bit Arm build, generate a 32-bit supplementary DEB package for AArch64 systems.	2020-10-19 16:25:11 -05:00
DRC	b8200c6601	Build: Add CMake package config files Based on: `d34b89b411` Closes #339 Closes #342	2020-10-15 10:26:54 -05:00
DRC	fe79f56b77	Merge branch 'master' into dev	2020-07-28 15:09:00 -05:00
DRC	a46c111d9f	Further jpeg_skip_scanlines() fixes - Introduce a partial image decompression regression test script that validates the correctness of jpeg_skip_scanlines() and jpeg_crop_scanlines() for a variety of cropping regions and libjpeg settings. This regression test catches the following issues: #182, fixed in `5bc43c7821` #237, fixed in 6e95c08649794f5018608f37250026a45ead2db8 #244, fixed in `398c1e9acc` #441, fully fixed in this commit It does not catch the following issues: #194, fixed in `773040f9d9` #244 (additional segfault), fixed in `9120a24743` - Modify the libjpeg-turbo regression test suite (make test) so that it checks for the issue reported in #441 (segfault in jpeg_skip_scanlines() when used with 4:2:0 merged upsampling/color conversion.) - Fix issues in jpeg_skip_scanlines() that caused incorrect output with h2v2 (4:2:0) merged upsampling/color conversion. The previous commit fixed the segfault reported in #441, but that was a symptom of a larger problem. Because merged 4:2:0 upsampling uses a "spare row" buffer, it is necessary to allow the upsampler to run when skipping rows (fancy 4:2:0 upsampling, which uses context rows, also requires this.) Otherwise, if skipping starts at an odd-numbered row, the output image will be incorrect. - Throw an error if jpeg_skip_scanlines() is called with two-pass color quantization enabled. With two-pass color quantization, the first pass occurs within jpeg_start_decompress(), so subsequent calls to jpeg_skip_scanlines() interfere with the multipass state and prevent the second pass from occurring during subsequent calls to jpeg_read_scanlines().	2020-07-28 12:47:53 -05:00
DRC	7d829bfa30	Bump version to 2.0.6 to prepare for new commits	2020-07-07 10:16:29 -05:00
DRC	aecee25695	Merge branch 'master' into dev	2020-06-19 00:03:51 -05:00
DRC	ae87a95861	TurboJPEG: Make global error handling thread-safe ... on platforms that support thread-local storage. This currently includes all supported platforms except 32-bit macOS. Fixes #396	2020-06-18 23:40:20 -05:00
DRC	70040cb7ee	Merge branch 'master' into dev	2020-06-02 15:05:43 -05:00
Andrew Childs	a2291b252d	Build: Add missing jpegtran-icc test dependency The jpegtran-icc test must run after the cjpeg-rgb-islow test, since the latter generates testout_rgb_islow.jpg.	2020-05-05 00:38:39 -05:00
DRC	ecf5f9a96a	Bump version to 2.0.5; Document previous commit	2020-02-18 10:43:48 -06:00
DRC	00d48d7e8c	Merge branch 'master' into dev	2020-02-17 18:14:10 -06:00
DRC	044c22e12f	Test: Honor CMAKE_CROSSCOMPILING_EMULATOR variable This CMake variable is intended to define a wrapper program for executing cross-compiled executables. However, CTest doesn't use CMAKE_CROSSCOMPILING_EMULATOR, because it isn't obvious which tests should be executed with the wrapper and which tests are scripts that don't need it. This commit manually prepends ${CMAKE_CROSSCOMPILING_EMULATOR} to all unit test command lines that execute a program built by the libjpeg-turbo build system. Thus, one can set CMAKE_CROSSCOMPILING_EMULATOR in a CMake toolchain file to (for instance) "qemu-{architecture} {qemu_arguments}") in order to execute all eligible unit tests using QEMU.	2020-02-17 14:41:43 -06:00
DRC	9a2cf32317	Build: Enable separate iOS pkg/DMG w/ sim support Refer to #406	2020-02-11 13:56:12 -06:00
DRC	163f0b1965	Bump version to 2.0.4 to prepare for new commits	2019-10-22 19:39:38 -05:00
DRC	b4110b65fc	Merge branch 'master' into dev	2019-09-04 18:58:12 -05:00
DRC	ded5a504b4	tjDecodeYUV*: Fix err if TJ inst used for prog dec If the TurboJPEG instance passed to tjDecodeYUV[Planes]() was previously used to decompress a progressive JPEG image, then we need to disable the progressive decompression parameters in the underlying libjpeg instance before calling jinit_master_decompress(). This commit also modifies the build system so that the "tjtest" target will test for this issue, and it corrects a previous oversight in the build system whereby tjbenchtest did not test progressive compression/decompression unless WITH_JAVA was true.	2019-08-15 13:57:36 -05:00
DRC	8ef53b102f	Merge branch 'master' into dev	2019-08-14 22:08:59 -05:00
DRC	a81a8c137b	SSE2 SIMD: Fix prog Huffman enc. error if Sl%16==0 (regression introduced by `5b177b3cab`) The SSE2 implementation of progressive Huffman encoding performed extraneous iterations when the scan length was a multiple of 16. Based on: `bb7f1ef983` Fixes #335 Closes #367	2019-08-14 22:01:30 -05:00
DRC	7fbfe29c65	Merge branch 'master' into dev	2019-07-18 15:18:27 -05:00
DRC	f37b7c1f96	Build: Fix build/install with Xcode IDE Closes #355	2019-07-02 11:28:26 -05:00

1 2 3 4 5

231 Commits