DRC
0d435698f4
AltiVec SIMD implementation of slow integer inverse DCT
...
git-svn-id: svn://svn.code.sf.net/p/libjpeg-turbo/code/trunk@1461 632fc199-4ca6-4c93-a231-07263d6284db
2014-12-20 01:17:39 +00:00
DRC
c7dadd2d0b
AltiVec SIMD implementation of fast integer inverse DCT
...
git-svn-id: svn://svn.code.sf.net/p/libjpeg-turbo/code/trunk@1445 632fc199-4ca6-4c93-a231-07263d6284db
2014-12-18 10:12:29 +00:00
DRC
25e40dc42c
AltiVec SIMD implementation of slow integer forward DCT; Clean up fast integer forward DCT code so that it is easier to see how it derives from the SSE2 code and to make it play more nicely with the slow FDCT code.
...
git-svn-id: svn://svn.code.sf.net/p/libjpeg-turbo/code/trunk@1443 632fc199-4ca6-4c93-a231-07263d6284db
2014-12-17 08:04:39 +00:00
DRC
a2cc95b827
AltiVec SIMD implementation of fast forward DCT
...
git-svn-id: svn://svn.code.sf.net/p/libjpeg-turbo/code/trunk@1405 632fc199-4ca6-4c93-a231-07263d6284db
2014-09-05 06:33:42 +00:00
DRC
b052d67eb1
ARM NEON SIMD support for YCC-to-RGB565 conversion, and optimizations to the existing YCC-to-RGB conversion code:
...
-----
aee36252be .patch
From aee36252be20054afce371a92406fc66ba6627b5 Mon Sep 17 00:00:00 2001
From: Siarhei Siamashka <siarhei.siamashka@gmail.com >
Date: Wed, 13 Aug 2014 03:50:22 +0300
Subject: [PATCH] ARM: Faster NEON yuv->rgb conversion for Krait and Cortex-A15
The older code was developed and tested only on ARM Cortex-A8 and ARM Cortex-A9.
Tuning it for newer ARM processors can introduce some speed-up (up to 20%).
The performance of the inner loop (conversion of 8 pixels) improves from
~27 cycles down to ~22 cycles on Qualcomm Krait 300, and from ~20 cycles
down to ~18 cycles on ARM Cortex-A15.
The performance remains exactly the same on ARM Cortex-A7 (~58 cycles),
ARM Cortex-A8 (~25 cycles) and ARM Cortex-A9 (~30 cycles) processors.
Also use larger indentation in the source code for separating two independent
instruction streams.
-----
a5efdbf22c .patch
From a5efdbf22ce9c1acd4b14a353cec863c2c57557e Mon Sep 17 00:00:00 2001
From: Siarhei Siamashka <siarhei.siamashka@gmail.com >
Date: Wed, 13 Aug 2014 07:23:09 +0300
Subject: [PATCH] ARM: NEON optimized yuv->rgb565 conversion
The performance of the inner loop (conversion of 8 pixels):
* ARM Cortex-A7: ~55 cycles
* ARM Cortex-A8: ~28 cycles
* ARM Cortex-A9: ~32 cycles
* ARM Cortex-A15: ~20 cycles
* Qualcomm Krait: ~24 cycles
Based on the Linaro rgb565 patch from
https://sourceforge.net/p/libjpeg-turbo/patches/24/
but implements better instructions scheduling.
git-svn-id: svn://svn.code.sf.net/p/libjpeg-turbo/code/trunk@1385 632fc199-4ca6-4c93-a231-07263d6284db
2014-08-23 15:47:51 +00:00
DRC
6263c1fc1b
SIMD-accelerated int upsample routine for MIPS DSPr2
2014-05-18 20:04:47 +00:00
DRC
ecfbabdbf3
Fix MIPS build
2014-05-18 19:36:05 +00:00
DRC
f8301c92dd
Get rid of the HAVE_PROTOTYPES configuration option, as well as the related JMETHOD and JPP macros. libjpeg-turbo has never supported compilers that don't handle prototypes. Doing so requires ansi2knr, which isn't even supported in the IJG code anymore.
2014-05-16 10:43:44 +00:00
DRC
2c0b793539
Remove all of the NEED_SHORT_EXTERNAL_NAMES stuff. There is scant information available as to which linkers ever had a 15-character global symbol name limit. AFAICT, it might have been a VMS and/or a.out BSD thing, but none of those platforms have ever been supported by libjpeg-turbo (nor are such systems supported by other open source libraries of this nature.)
2014-05-15 20:30:16 +00:00
DRC
99de998e2c
SIMD-accelerated NULL convert routine for MIPS DSPr2
2014-05-15 18:26:01 +00:00
DRC
c4c3ac6305
SIMD-accelerated h2v2 smooth downsampling routine for MIPS DSPr2
2014-05-14 15:00:10 +00:00
DRC
38bfd451d5
SIMD-accelerated merged upsampling routines for MIPS DSPr2
2014-05-13 18:40:14 +00:00
DRC
7824f70008
SIMD-accelerated slow integer IDCT routine for MIPS DSPr2
2014-05-06 09:53:21 +00:00
DRC
c6c8c7911f
SIMD-accelerated integer convsamp routine for MIPS DSPr2
2013-10-12 21:39:20 +00:00
DRC
3c6b1ba545
SIMD-accelerated floating point quantize and convsamp routines for MIPS DSPr2
2013-10-09 18:39:44 +00:00
DRC
10138c9d35
SIMD-accelerated fast integer inverse DCT routine for MIPS DSPr2
2013-10-08 02:18:59 +00:00
DRC
6addfed58b
SIMD-accelerated fast integer forward DCT routine for MIPS DSPr2
2013-10-08 02:11:21 +00:00
DRC
01f46504ee
SIMD-accelerated slow integer forward DCT and quantize routines for MIPS DSPr2
2013-09-30 18:13:27 +00:00
DRC
198cc7c161
SIMD-accelerated 3/4 and 3/2 decompression scaling for MIPS DSPr2
2013-09-27 17:51:08 +00:00
DRC
f934fc621e
SIMD-accelerated 1/2 and 1/4 decompression scaling for MIPS DSPr2
2013-09-27 17:43:23 +00:00
DRC
154c2dc749
SIMD-optimized RGB-to-grayscale conversion for MIPS DSPr2
2013-09-27 17:39:57 +00:00
DRC
aa5a1808fe
SIMD support for performing upsampling using MIPS DSPr2 instructions
2013-07-27 21:50:02 +00:00
DRC
3f2e3b11f0
SIMD support for performing downsampling using MIPS DSPr2 instructions
2013-07-27 21:48:18 +00:00
DRC
41e3657631
SIMD support for performing fancy upsampling using MIPS DSPr2 instructions
2013-07-27 21:44:14 +00:00
DRC
64da9d6ba8
SIMD support for performing color conversion using MIPS DSPr2 instructions
2013-07-24 21:50:20 +00:00
DRC
dbfa2648d8
Accelerated 4:2:2 upsampling routine for ARM (improves performance ~20-30% when decompressing 4:2:2 JPEGs using fancy upsampling)
2012-02-02 22:32:45 +00:00
DRC
7672bd3ac5
NEON-accelerated slow integer inverse DCT
2011-08-22 13:48:01 +00:00
DRC
00a69f142a
NEON-accelerated quantization
2011-08-17 21:00:59 +00:00
DRC
22b4359e42
ARM NEON-accelerated RGB-to-YCbCr conversion
2011-08-12 19:27:20 +00:00
DRC
ce02d1d62a
Support for accelerated forward DCT using ARM NEON instructions
2011-08-10 23:31:13 +00:00
DRC
e3f7e75525
NEON-optimized 2x2 and 4x4 scaled iDCTs
2011-06-17 21:12:58 +00:00
DRC
99799a6c29
ARM NEON support
2011-05-03 08:47:43 +00:00
DRC
f2cb47a275
Add short names for RGB->grayscale MMX functions
2011-02-27 09:09:54 +00:00
DRC
25299d0d2f
Updated (C)
2011-02-18 20:43:04 +00:00
DRC
439527e0b9
SIMD-accelerated RGB-to-Grayscale color conversion
2011-02-18 11:23:45 +00:00
DRC
d83e1f8900
Clarify that the C wrappers fall under the same license as the rest of the SIMD code
2011-02-02 05:38:34 +00:00
DRC
b0716b6875
Add colorspace extensions to merged upsampling routines
2009-04-05 21:51:25 +00:00
DRC
f7e0f6d5f5
Implement new colorspaces to allow directly compressing from/decompressing to RGB/RGBX/BGR/BGRX/XBGR/XRGB without conversion
2009-04-03 12:00:51 +00:00
Pierre Ossman
093fe3bb51
Add SSE2 SIMD implementation of computationally intensive routines.
2009-03-09 13:34:17 +00:00
Pierre Ossman
205cdc87d4
Add SSE SIMD implementation of computationally intensive routines.
2009-03-09 13:31:56 +00:00
Pierre Ossman
9a4a8bb721
Add 3DNow SIMD implementation of computationally intensive routines.
2009-03-09 13:28:10 +00:00
Pierre Ossman
649fe159ce
Add MMX SIMD implementation of computationally intensive routines.
2009-03-09 13:25:30 +00:00
Pierre Ossman
863946aebd
Implement x86 SIMD framework
...
Add NASM support and stub routine for detecting SIMD extensions.
2009-03-09 13:21:27 +00:00