]>
git.sesse.net Git - ffmpeg/log
Clément Bœsch [Wed, 15 Mar 2017 20:56:47 +0000 (21:56 +0100)]
Merge commit '
f0ccc65bc9ab9ddf1366066395564c71bcc825ee '
* commit '
f0ccc65bc9ab9ddf1366066395564c71bcc825ee ':
ac3dec: Split coupling-specific code from decode_audio_block()
Merged-by: Clément Bœsch <u@pkh.me>
Clément Bœsch [Wed, 15 Mar 2017 20:43:59 +0000 (21:43 +0100)]
Merge commit '
a5ebe5d1217942238c641c83b24ef1106e53934a '
* commit '
a5ebe5d1217942238c641c83b24ef1106e53934a ':
ac3dec: Split spx-specific code from decode_audio_block()
Merged-by: Clément Bœsch <u@pkh.me>
Clément Bœsch [Wed, 15 Mar 2017 20:29:56 +0000 (21:29 +0100)]
Merge commit '
2ac00d2d1d51047c6ce69d5fbe1a08392d142658 '
* commit '
2ac00d2d1d51047c6ce69d5fbe1a08392d142658 ':
mov: Validate the ID number
This commit is a noop as the modified check is not present in FFmpeg.
See
d30870cc730337b7017cc194d696f68ee48f7f1d .
Merged-by: Clément Bœsch <u@pkh.me>
Clément Bœsch [Wed, 15 Mar 2017 20:23:30 +0000 (21:23 +0100)]
Merge commit '
a115eb9e750543f1d8bf951414d291069bf396c2 '
* commit '
a115eb9e750543f1d8bf951414d291069bf396c2 ':
mimic: do not release the newly obsolete reference at the end of decoding
Merged-by: Clément Bœsch <u@pkh.me>
Clément Bœsch [Wed, 15 Mar 2017 20:09:06 +0000 (21:09 +0100)]
Merge commit '
ae90119c6701fa09ff747cca35238e36b2d2ab2f '
* commit '
ae90119c6701fa09ff747cca35238e36b2d2ab2f ':
configure: Simplify license incompatibility check
An extra GPLV3 list is added for libsmbclient as having it in both GPL
and VERSION3 lists would cause a duplicate in the final config list.
Also, for consistency, libnpp is treated the same as the other nonfree
component (libfdk_aac and openssl).
Merged-by: Clément Bœsch <u@pkh.me>
Steven Liu [Tue, 14 Mar 2017 14:53:43 +0000 (22:53 +0800)]
avcodec/videotoolboxenc: add rc_max_bitrate control into videotoolbox
add kVTCompressionPropertyKey_DataRateLimits support by rc_max_bitrate
Reviewed-by: Rick Kern <kernrj@gmail.com>
Signed-off-by: Steven Liu <lq@chinaffmpeg.org>
Clément Bœsch [Wed, 15 Mar 2017 13:25:46 +0000 (14:25 +0100)]
Merge commit '
e46a6fb7732a7caef97a916a4f765ec0f779d195 '
* commit '
e46a6fb7732a7caef97a916a4f765ec0f779d195 ':
avconv: Check that muxing_queue exists before reading from it
Mostly noop. This was fixed in FFmpeg in
7f7c494a3 .
The merge makes the cosmetics match but does not include the weird
av_log().
Merged-by: Clément Bœsch <cboesch@gopro.com>
Clément Bœsch [Wed, 15 Mar 2017 11:49:03 +0000 (12:49 +0100)]
Merge commit '
100fb0ddfda958da70f98feac81f924c02483789 '
* commit '
100fb0ddfda958da70f98feac81f924c02483789 ':
configure: Allow detecting and using LLVM lld-link as linker for windows
Merged-by: Clément Bœsch <cboesch@gopro.com>
Clément Bœsch [Wed, 15 Mar 2017 11:44:35 +0000 (12:44 +0100)]
Merge commit '
79fb0692992c74214c6cf8e81350fc93eeffc5ec '
* commit '
79fb0692992c74214c6cf8e81350fc93eeffc5ec ':
configure: Move defines for controlling MSVCRT headers to the CRT detection section
Merged-by: Clément Bœsch <cboesch@gopro.com>
Clément Bœsch [Wed, 15 Mar 2017 11:39:34 +0000 (12:39 +0100)]
Merge commit '
56af0bc10f49654b5b5f3efe82c69a13bf15fc8b '
* commit '
56af0bc10f49654b5b5f3efe82c69a13bf15fc8b ':
configure: Check for strtoll and redirect to _strtoi64 in the msvcrt block
Also includes _strtoui64 in the check.
Merged-by: Clément Bœsch <cboesch@gopro.com>
Clément Bœsch [Wed, 15 Mar 2017 11:34:51 +0000 (12:34 +0100)]
Merge commit '
48b80f8393d418ad35d73f5a36f5011de1928f3c '
* commit '
48b80f8393d418ad35d73f5a36f5011de1928f3c ':
hpeldsp: Explain why put_no_rnd_pixels_tab is larger than necessary
Merged-by: Clément Bœsch <cboesch@gopro.com>
Clément Bœsch [Wed, 15 Mar 2017 11:32:15 +0000 (12:32 +0100)]
Merge commit '
40ad05bab206c932a32171d45581080c914b06ec '
* commit '
40ad05bab206c932a32171d45581080c914b06ec ':
checkasm: Cast unsigned to signed
Merged-by: Clément Bœsch <cboesch@gopro.com>
Clément Bœsch [Wed, 15 Mar 2017 11:23:18 +0000 (12:23 +0100)]
Merge commit '
b183abfb5b6366b177cf44f244c66156257a6fd6 '
* commit '
b183abfb5b6366b177cf44f244c66156257a6fd6 ':
vpx: Support color range
Decoder chunk not merged as the framework automatically copies avctx
color range to the frame color range. And we already set the avctx field
since
cbcc88c039 .
Merged-by: Clément Bœsch <cboesch@gopro.com>
Matthieu Bouron [Mon, 6 Feb 2017 16:14:57 +0000 (17:14 +0100)]
lavc/h264dec: use OFFSET macro
Steven Liu [Tue, 14 Mar 2017 23:37:11 +0000 (07:37 +0800)]
avformat/hlsenc: fix duration wrong when no pkt duration
when cannot get pkt duration, hlsenc segments duration will
be set to 0, this patch can fix it.
Signed-off-by: Steven Liu <lq@chinaffmpeg.org>
wm4 [Thu, 9 Mar 2017 14:43:30 +0000 (15:43 +0100)]
ffmpeg, ffprobe: don't "merge" side data into packet data by default
Preparation for potentially disabling merged side data by default in the
libs. Do this in particular because it affects fate tests.
The changed tests either reflect added packet side data, or the changed
packet size due to merged side data removal reducing the packet size.
Alexander Strasser [Sat, 11 Mar 2017 14:43:39 +0000 (15:43 +0100)]
lavf/avio: Be more explicit in logging white/black list matches
The current form of the messages indicating matches in the white
or black lists seems to be a bit too much relying on context.
Make the messages more explicit.
Signed-off-by: Alexander Strasser <eclipse7@gmx.net>
Alexander Strasser [Sat, 11 Mar 2017 11:02:32 +0000 (12:02 +0100)]
lavf/avio: Remove unnecessary escaping of ' in string literals
Signed-off-by: Alexander Strasser <eclipse7@gmx.net>
Michael Niedermayer [Mon, 13 Mar 2017 19:45:09 +0000 (20:45 +0100)]
avcodec/mpegaudiodec_template: Make l3_unscale() work with e=0
Fixes undefined behavior
Fixes: 830/clusterfuzz-testcase-6253175327686656
Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/targets/ffmpeg
Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
Michael Niedermayer [Mon, 13 Mar 2017 19:45:08 +0000 (20:45 +0100)]
avcodec/tiff: Check for multiple geo key directories
Fixes memleak
Fixes: 826/clusterfuzz-testcase-5316921379520512
Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/targets/ffmpeg
Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
Michael Niedermayer [Mon, 13 Mar 2017 19:45:07 +0000 (20:45 +0100)]
avcodec/pictordec: Fix runtime error: left shift of
805306368 by 2 places cannot be represented in type 'int'
Fixes: 823/clusterfuzz-testcase-6727060074528768
Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/targets/ffmpeg
Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
Katherine Nagels [Sun, 12 Mar 2017 22:57:11 +0000 (11:57 +1300)]
doc/filters: Add colourspace values for colormatrix filter
Reviewed-by: Michael Niedermayer <michael@niedermayer.cc>
Signed-off-by: Kieran Kunhya <kierank@obe.tv>
Steven Liu [Tue, 14 Mar 2017 10:11:20 +0000 (18:11 +0800)]
avformat/flvenc: flx flvflags no_metadata bug
When use flvflags no_metadata , the FLV header will be cover by write tailer
This commit fix the bug
Signed-off-by: Steven Liu <lq@chinaffmpeg.org>
Steven Liu [Tue, 14 Mar 2017 02:16:00 +0000 (10:16 +0800)]
avformat/hlsenc: fix ticket 6231
check if the hls_flags is byterange_mode and check if should close fd
Signed-off-by: Steven Liu <lq@chinaffmpeg.org>
Michael Niedermayer [Mon, 13 Mar 2017 01:51:15 +0000 (02:51 +0100)]
avcodec/wavpack: Fix runtime error: shift exponent 32 is too large for 32-bit type 'int'
Fixes: 822/clusterfuzz-testcase-4873433189974016
Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/targets/ffmpeg
Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
Michael Niedermayer [Sun, 12 Mar 2017 23:18:04 +0000 (00:18 +0100)]
avcodec/targa: Skip hflip on blank images
Fixes: timeout with 810/clusterfuzz-testcase-5249282825256960
Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/targets/ffmpeg
Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
Michael Niedermayer [Sun, 12 Mar 2017 22:45:54 +0000 (23:45 +0100)]
avcodec/wavpack: Fix runtime error: shift exponent 137 is too large for 32-bit type 'int'
Fixes: 808/clusterfuzz-testcase-4715513349406720
Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/targets/ffmpeg
Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
Michael Niedermayer [Fri, 10 Mar 2017 14:24:52 +0000 (15:24 +0100)]
avcodec/h264_direct: Fix runtime error: signed integer overflow:
2147483647 - -14133 cannot be represented in type 'int'
Fixes: 755/clusterfuzz-testcase-5369072516595712
See: [FFmpeg-devel] [PATCH 1/2] avcodec/h264_direct: Fix runtime error: signed integer overflow:
2147483647 - -14133 cannot be represented in type 'int'
Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/targets/ffmpeg
Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
Paul B Mahol [Sun, 12 Mar 2017 22:13:26 +0000 (23:13 +0100)]
avcodec/xpmdec: there are XPM files with dos line endings
Signed-off-by: Paul B Mahol <onemda@gmail.com>
Paul B Mahol [Sun, 12 Mar 2017 17:59:05 +0000 (18:59 +0100)]
avfilter/vf_lut3d: actually skip lines when encountering DOMAIN_ string
Signed-off-by: Paul B Mahol <onemda@gmail.com>
Steven Liu [Mon, 13 Mar 2017 03:58:34 +0000 (11:58 +0800)]
avformat/hlsenc: second_levels flags process function extract
the SECOND_LEVEL* flags process and name is too long
extract all of them output to funtions, make code clear
Signed-off-by: Steven Liu <lq@chinaffmpeg.org>
Michael Niedermayer [Sun, 12 Mar 2017 02:04:06 +0000 (03:04 +0100)]
avcodec/vp6: clear dimensions on failed resolution change in vp6_parse_header()
Fixes: 807/clusterfuzz-testcase-6470061042696192
Fixes null pointer dereference
Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/targets/ffmpeg
Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
Michael Niedermayer [Sun, 12 Mar 2017 02:04:05 +0000 (03:04 +0100)]
avcodec/rv34: Fix runtime error: signed integer overflow: 36880 * 66288 cannot be represented in type 'int'
Fixes: 768/clusterfuzz-testcase-4807444305805312
Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/targets/ffmpeg
Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
Michael Niedermayer [Sun, 12 Mar 2017 02:04:04 +0000 (03:04 +0100)]
avcodec/mpeg12dec: Fix runtime error: left shift of negative value -1
Fixes: 764/clusterfuzz-testcase-6273034652483584
Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/targets/ffmpeg
Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
Paul B Mahol [Sun, 12 Mar 2017 21:58:49 +0000 (22:58 +0100)]
avcodec/xpmdec: avoid "magic" numbers in function hex_char_to_number()
Signed-off-by: Paul B Mahol <onemda@gmail.com>
Paul B Mahol [Sun, 12 Mar 2017 21:56:58 +0000 (22:56 +0100)]
avcodec/xpmdec: fix colors values which are different for X11
Signed-off-by: Paul B Mahol <onemda@gmail.com>
Paul B Mahol [Sun, 12 Mar 2017 21:51:00 +0000 (22:51 +0100)]
avcodec/xpmdec: do not allow number of colors to be higher than allocated
Signed-off-by: Paul B Mahol <onemda@gmail.com>
Paul B Mahol [Sun, 12 Mar 2017 21:43:49 +0000 (22:43 +0100)]
avcodec/xpmdec: rename yet another function
Signed-off-by: Paul B Mahol <onemda@gmail.com>
Paul B Mahol [Sun, 12 Mar 2017 21:42:05 +0000 (22:42 +0100)]
avcodec/xpmdec: rename convert to hex_char_to_number
Signed-off-by: Paul B Mahol <onemda@gmail.com>
Paul B Mahol [Sun, 12 Mar 2017 21:37:53 +0000 (22:37 +0100)]
avcodec/xpmdec: improve comment for one function and also fix 2 identation issues
Signed-off-by: Paul B Mahol <onemda@gmail.com>
Muhammad Faiz [Sat, 11 Mar 2017 17:24:21 +0000 (00:24 +0700)]
swresample/swresample: do not reset tsf on swr_alloc_set_opts
so tsf option in aresample will have effect
previously tsf/internal_sample_format had no effect
fate is updated
s32p previously used fltp internally
dblp previously used fltp/dblp internally
Reviewed-by: Michael Niedermayer <michael@niedermayer.cc>
Signed-off-by: Muhammad Faiz <mfcc64@gmail.com>
Jun Zhao [Fri, 3 Mar 2017 01:25:53 +0000 (09:25 +0800)]
vf_hwupload: Add missing return value check
Add missing return value checks to suppress build warning and
remove noop ff_formats_unref() calling.
Note: most filters using ff_formats_ref() didn't have a suitable
error handling, it's a potential memory leak issue.
Signed-off-by: Jun Zhao <jun.zhao@intel.com>
Signed-off-by: Mark Thompson <sw@jkqxz.net>
James Almer [Sat, 11 Mar 2017 23:29:40 +0000 (20:29 -0300)]
tests/api-seek: fix memory leak on realloc() failure
Reviewed-by: Michael Niedermayer <michael@niedermayer.cc>
Signed-off-by: James Almer <jamrial@gmail.com>
James Almer [Sun, 12 Mar 2017 13:36:44 +0000 (10:36 -0300)]
test/api-seek: clean up properly on failure
Also propagate better error values.
Reviewed-by: Michael Niedermayer <michael@niedermayer.cc>
Signed-off-by: James Almer <jamrial@gmail.com>
James Almer [Sat, 11 Mar 2017 23:24:53 +0000 (20:24 -0300)]
tests/api-seek: make the crc array uint32_t
Reviewed-by: Michael Niedermayer <michael@niedermayer.cc>
Signed-off-by: James Almer <jamrial@gmail.com>
Paul B Mahol [Sun, 12 Mar 2017 14:07:02 +0000 (15:07 +0100)]
avcodec/xpmdec: skip everything before signature
Signed-off-by: Paul B Mahol <onemda@gmail.com>
Paul B Mahol [Sun, 12 Mar 2017 13:50:33 +0000 (14:50 +0100)]
avcodec/codec_desc: add mime type for X-BitMap
Signed-off-by: Paul B Mahol <onemda@gmail.com>
Paul B Mahol [Sun, 12 Mar 2017 13:49:26 +0000 (14:49 +0100)]
avcodec/codec_desc: add mime type for X-PixMap
Signed-off-by: Paul B Mahol <onemda@gmail.com>
Paul B Mahol [Sun, 12 Mar 2017 11:48:01 +0000 (12:48 +0100)]
avcodec/xpmdec: make convert function more picky about its input
Signed-off-by: Paul B Mahol <onemda@gmail.com>
Paras Chadha [Sat, 11 Mar 2017 21:01:23 +0000 (02:31 +0530)]
avcodec: add XPM decoder and demuxer
Signed-off-by: Paras Chadha <paraschadha18@gmail.com>
Mark Thompson [Sun, 12 Mar 2017 16:55:32 +0000 (16:55 +0000)]
Merge commit '
ad71d3276fef0ee7e791e62bbfe9c4e540047417 '
* commit '
ad71d3276fef0ee7e791e62bbfe9c4e540047417 ':
lavfi: add a QSV deinterlacing filter
Minor fixup for lavfi differences.
Merged-by: Mark Thompson <sw@jkqxz.net>
Mark Thompson [Sun, 12 Mar 2017 15:21:41 +0000 (15:21 +0000)]
Merge commit '
8e07c22e508b349d145b9f142aa3ee8b3ce1d3a4 '
* commit '
8e07c22e508b349d145b9f142aa3ee8b3ce1d3a4 ':
qsvenc: print warnings from encode/init
Merged-by: Mark Thompson <sw@jkqxz.net>
Mark Thompson [Sun, 12 Mar 2017 15:19:52 +0000 (15:19 +0000)]
Merge commit '
0956fd460681e8ccbdae19f135f0d3970bf95c2f '
* commit '
0956fd460681e8ccbdae19f135f0d3970bf95c2f ':
qsvenc: do not re-execute encoding on all positive status codes
Noop, see
fb240a6276fa36fe120aadd67b4ca774e354f22b .
Merged-by: Mark Thompson <sw@jkqxz.net>
Mark Thompson [Sun, 12 Mar 2017 15:19:05 +0000 (15:19 +0000)]
Merge commit '
95414eb2dc63a6f934275b4ed33dedd4369f2c49 '
* commit '
95414eb2dc63a6f934275b4ed33dedd4369f2c49 ':
qsv: print more complete error messages
Merged-by: Mark Thompson <sw@jkqxz.net>
Mark Thompson [Sun, 12 Mar 2017 15:06:07 +0000 (15:06 +0000)]
Merge commit '
d9ec3c60143babe1bb77c268e1d5547d15acd69b '
* commit '
d9ec3c60143babe1bb77c268e1d5547d15acd69b ':
qsvenc: take only the allocated dimensions from the frames context
Merged-by: Mark Thompson <sw@jkqxz.net>
Mark Thompson [Sun, 12 Mar 2017 15:04:45 +0000 (15:04 +0000)]
Merge commit '
37a9015ee84c15fec5247ba8f6577351a25fa8d2 '
* commit '
37a9015ee84c15fec5247ba8f6577351a25fa8d2 ':
qsvenc: add support for p010
Merged-by: Mark Thompson <sw@jkqxz.net>
Anton Khirnov [Sat, 26 Mar 2016 11:39:58 +0000 (12:39 +0100)]
lavfi: add a QSV scaling filter
This merges libav commit
ac7bfd69678f3966e38debdb27f4bde94dc0345c ,
which was previously skipped.
(cherry picked from commit
ac7bfd69678f3966e38debdb27f4bde94dc0345c )
Signed-off-by: Mark Thompson <sw@jkqxz.net>
Mark Thompson [Sun, 12 Mar 2017 14:21:37 +0000 (14:21 +0000)]
Merge commit '
21962261c74aed4df00ae8348a5e2d1ecb67c52d '
* commit '
21962261c74aed4df00ae8348a5e2d1ecb67c52d ':
qsv: handle the semi-packed formats in map_fourcc as well
Merged-by: Mark Thompson <sw@jkqxz.net>
Clément Bœsch [Sun, 12 Mar 2017 12:21:01 +0000 (13:21 +0100)]
Merge commit '
f65285aba0df7d46298abe0c945dfee05cbc6028 '
* commit '
f65285aba0df7d46298abe0c945dfee05cbc6028 ':
lavc: set sw_pix_fmt for hwaccel encoding
Merged-by: Clément Bœsch <u@pkh.me>
Clément Bœsch [Sun, 12 Mar 2017 12:20:57 +0000 (13:20 +0100)]
Merge commit '
d59641abfd25a1007bdf4723d952887b1e3619c6 '
* commit '
d59641abfd25a1007bdf4723d952887b1e3619c6 ':
lavc: initialize AVCodecContext.sw_pix_fmt properly
Merged-by: Clément Bœsch <u@pkh.me>
Clément Bœsch [Sun, 12 Mar 2017 12:13:55 +0000 (13:13 +0100)]
Merge commit '
8b7a9729aa162e2bbd571933f1aa40767f1ff47b '
* commit '
8b7a9729aa162e2bbd571933f1aa40767f1ff47b ':
avconv_qsv: use the actual pixel format provided by lavc
This commit is a noop, see
03cef34aa66
Merged-by: Clément Bœsch <u@pkh.me>
Clément Bœsch [Sun, 12 Mar 2017 12:13:05 +0000 (13:13 +0100)]
Merge commit '
6f40181cad8ac04adff7bd10e1e1ab65f22bc1f0 '
* commit '
6f40181cad8ac04adff7bd10e1e1ab65f22bc1f0 ':
avconv_qsv: align the surface size to 32
This commit is a noop, see
03cef34aa66
Merged-by: Clément Bœsch <u@pkh.me>
Clément Bœsch [Sun, 12 Mar 2017 12:08:04 +0000 (13:08 +0100)]
Merge commit '
b0f36a0043d76436cc7ab8ff92ab99c94595d3c0 '
* commit '
b0f36a0043d76436cc7ab8ff92ab99c94595d3c0 ':
avconv: stop using setpts for input framerate forced with -r
Merged-by: Clément Bœsch <u@pkh.me>
Paul B Mahol [Sun, 12 Mar 2017 11:34:55 +0000 (12:34 +0100)]
avcodec/scpr: use correct linesize for prev frame
Signed-off-by: Paul B Mahol <onemda@gmail.com>
Michael Niedermayer [Wed, 8 Mar 2017 20:35:51 +0000 (21:35 +0100)]
avcodec/dca_xll: Fix runtime error: signed integer overflow:
2147286116 +
6298923 cannot be represented in type 'int'
Fixes: 732/clusterfuzz-testcase-4872990070145024
See: [FFmpeg-devel] [PATCH 2/6] avcodec/dca_xll: Fix runtime error: signed integer overflow:
2147286116 +
6298923 cannot be represented in type 'int'
Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/targets/ffmpeg
Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
Michael Niedermayer [Sat, 11 Mar 2017 02:55:39 +0000 (03:55 +0100)]
avcodec/amrwbdec: Fix runtime error: left shift of negative value -1
Fixes: 763/clusterfuzz-testcase-6007567320875008
Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/targets/ffmpeg
Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
Michael Niedermayer [Sat, 11 Mar 2017 02:38:01 +0000 (03:38 +0100)]
avcodec/dca_xll: Fix runtime error: signed integer overflow:
1762028192 +
698372290 cannot be represented in type 'int'
Fixes: 762/clusterfuzz-testcase-5927683747741696
Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/targets/ffmpeg
Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
Michael Niedermayer [Sat, 11 Mar 2017 02:25:41 +0000 (03:25 +0100)]
avcodec/wavpack: Fix runtime error: signed integer overflow: -
2147483648 + -
83886075 cannot be represented in type 'int'
Fixes: 761/clusterfuzz-testcase-5442222252097536
Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/targets/ffmpeg
Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
Muhammad Faiz [Sat, 11 Mar 2017 00:54:30 +0000 (07:54 +0700)]
avfilter/af_firequalizer: add av_restrict on convolution func
slightly improved speed
Reviewed-by: wm4 <nfxjfg@googlemail.com>
Signed-off-by: Muhammad Faiz <mfcc64@gmail.com>
Przemysław Sobala [Fri, 3 Mar 2017 08:38:28 +0000 (09:38 +0100)]
lavf/dashenc: update bitrates on dash_write_trailer
Provides a way to change bandwidth parameter inside DASH manifest after a non-CBR H.264 encoding.
Caller now is able to compute the bitrate by itself, after all packets have been written, and then set that value in AVFormatContext->streams->codecpar->bit_rate before calling av_write_trailer. As a result that value will be set in DASH manifest.
Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
Steven Liu [Sat, 11 Mar 2017 13:11:38 +0000 (21:11 +0800)]
doc/muxers: move hls_flags temp_file to after SECOND LEVEL hls example
the temp_file hls_flags describe text offset is wrong, now move it after example
Signed-off-by: Steven Liu <lq@chinaffmpeg.org>
Martin Storsjö [Sat, 31 Dec 2016 20:27:13 +0000 (22:27 +0200)]
aarch64: vp9itxfm: Reorder iadst16 coeffs
This matches the order they are in the 16 bpp version.
There they are in this order, to make sure we access them in the
same order they are declared, easing loading only half of the
coefficients at a time.
This makes the 8 bpp version match the 16 bpp version better.
This is cherrypicked from libav commit
b8f66c0838b4c645227f23a35b4d54373da4c60a .
Signed-off-by: Martin Storsjö <martin@martin.st>
Martin Storsjö [Sat, 31 Dec 2016 20:27:13 +0000 (22:27 +0200)]
arm: vp9itxfm: Reorder iadst16 coeffs
This matches the order they are in the 16 bpp version.
There they are in this order, to make sure we access them in the
same order they are declared, easing loading only half of the
coefficients at a time.
This makes the 8 bpp version match the 16 bpp version better.
This is cherrypicked from libav commit
08074c092d8c97d71c5986e5325e97ffc956119d .
Signed-off-by: Martin Storsjö <martin@martin.st>
Martin Storsjö [Sat, 31 Dec 2016 12:18:31 +0000 (14:18 +0200)]
aarch64: vp9itxfm: Reorder the idct coefficients for better pairing
All elements are used pairwise, except for the first one.
Previously, the 16th element was unused. Move the unused element
to the second slot, to make the later element pairs not split
across registers.
This simplifies loading only parts of the coefficients,
reducing the difference to the 16 bpp version.
This is cherrypicked from libav commit
09eb88a12e008d10a3f7a6be75d18ad98b368e68 .
Signed-off-by: Martin Storsjö <martin@martin.st>
Martin Storsjö [Sat, 31 Dec 2016 12:05:44 +0000 (14:05 +0200)]
arm: vp9itxfm: Reorder the idct coefficients for better pairing
All elements are used pairwise, except for the first one.
Previously, the 16th element was unused. Move the unused element
to the second slot, to make the later element pairs not split
across registers.
This simplifies loading only parts of the coefficients,
reducing the difference to the 16 bpp version.
This is cherrypicked from libav commit
de06bdfe6c8abd8266d5c6f5c68e4df0060b61fc .
Signed-off-by: Martin Storsjö <martin@martin.st>
Martin Storsjö [Mon, 2 Jan 2017 20:08:41 +0000 (22:08 +0200)]
aarch64: vp9itxfm: Avoid reloading the idct32 coefficients
The idct32x32 function actually pushed d8-d15 onto the stack even
though it didn't clobber them; there are plenty of registers that
can be used to allow keeping all the idct coefficients in registers
without having to reload different subsets of them at different
stages in the transform.
After this, we still can skip pushing d12-d15.
Before:
vp9_inv_dct_dct_32x32_sub32_add_neon: 8128.3
After:
vp9_inv_dct_dct_32x32_sub32_add_neon: 8053.3
This is cherrypicked from libav commit
65aa002d54433154a6924dc13e498bec98451ad0 .
Signed-off-by: Martin Storsjö <martin@martin.st>
Martin Storsjö [Mon, 2 Jan 2017 20:50:38 +0000 (22:50 +0200)]
arm: vp9itxfm: Avoid reloading the idct32 coefficients
The idct32x32 function actually pushed q4-q7 onto the stack even
though it didn't clobber them; there are plenty of registers that
can be used to allow keeping all the idct coefficients in registers
without having to reload different subsets of them at different
stages in the transform.
Since the idct16 core transform avoids clobbering q4-q7 (but clobbers
q2-q3 instead, to avoid needing to back up and restore q4-q7 at all
in the idct16 function), and the lanewise vmul needs a register in
the q0-q3 range, we move the stored coefficients from q2-q3 into q4-q5
while doing idct16.
While keeping these coefficients in registers, we still can skip pushing
q7.
Before: Cortex A7 A8 A9 A53
vp9_inv_dct_dct_32x32_sub32_add_neon: 18553.8 17182.7 14303.3 12089.7
After:
vp9_inv_dct_dct_32x32_sub32_add_neon: 18470.3 16717.7 14173.6 11860.8
This is cherrypicked from libav commit
402546a17233a8815307df9e14ff88cd70424537 .
Signed-off-by: Martin Storsjö <martin@martin.st>
Martin Storsjö [Sat, 14 Jan 2017 11:22:30 +0000 (13:22 +0200)]
arm: vp9lpf: Implement the mix2_44 function with one single filter pass
For this case, with 8 inputs but only changing 4 of them, we can fit
all 16 input pixels into a q register, and still have enough temporary
registers for doing the loop filter.
The wd=8 filters would require too many temporary registers for
processing all 16 pixels at once though.
Before: Cortex A7 A8 A9 A53
vp9_loop_filter_mix2_v_44_16_neon: 289.7 256.2 237.5 181.2
After:
vp9_loop_filter_mix2_v_44_16_neon: 221.2 150.5 177.7 138.0
This is cherrypicked from libav commit
575e31e931e4178e9f1e24407503c9b4ec0ef9ba .
Signed-off-by: Martin Storsjö <martin@martin.st>
Martin Storsjö [Thu, 23 Feb 2017 21:33:58 +0000 (23:33 +0200)]
aarch64: vp9lpf: Use dup+rev16+uzp1 instead of dup+lsr+dup+trn1
This is one cycle faster in total, and three instructions fewer.
Before:
vp9_loop_filter_mix2_v_44_16_neon: 123.2
After:
vp9_loop_filter_mix2_v_44_16_neon: 122.2
This is cherrypicked from libav commit
3bf9c48320f25f3d5557485b0202f22ae60748b0 .
Signed-off-by: Martin Storsjö <martin@martin.st>
Martin Storsjö [Sat, 14 Jan 2017 18:49:19 +0000 (20:49 +0200)]
arm/aarch64: vp9lpf: Keep the comparison to E within 8 bit
The theoretical maximum value of E is 193, so we can just
saturate the addition to 255.
Before: Cortex A7 A8 A9 A53 A53/AArch64
vp9_loop_filter_v_4_8_neon: 143.0 127.7 114.8 88.0 87.7
vp9_loop_filter_v_8_8_neon: 241.0 197.2 173.7 140.0 136.7
vp9_loop_filter_v_16_8_neon: 497.0 419.5 379.7 293.0 275.7
vp9_loop_filter_v_16_16_neon: 965.2 818.7 731.4 579.0 452.0
After:
vp9_loop_filter_v_4_8_neon: 136.0 125.7 112.6 84.0 83.0
vp9_loop_filter_v_8_8_neon: 234.0 195.5 171.5 136.0 133.7
vp9_loop_filter_v_16_8_neon: 490.0 417.5 377.7 289.0 271.0
vp9_loop_filter_v_16_16_neon: 951.2 814.7 732.3 571.0 446.7
This is cherrypicked from libav commit
c582cb8537367721bb399a5d01b652c20142b756 .
Signed-off-by: Martin Storsjö <martin@martin.st>
Martin Storsjö [Thu, 16 Feb 2017 07:18:25 +0000 (09:18 +0200)]
aarch64: Add parentheses around the offset parameter in movrel
This fixes building with clang for linux with PIC enabled.
This is cherrypicked from libav commit
8847eeaa141898850381400000fb2b8a7adc7100 .
Signed-off-by: Martin Storsjö <martin@martin.st>
Martin Storsjö [Wed, 11 Jan 2017 09:58:02 +0000 (11:58 +0200)]
aarch64: vp9lpf: Fix broken indentation/vertical alignment
This is cherrypicked from libav commit
07b5136c481d394992c7e951967df0cfbb346c0b .
Signed-off-by: Martin Storsjö <martin@martin.st>
Martin Storsjö [Tue, 10 Jan 2017 20:08:50 +0000 (22:08 +0200)]
aarch64: vp9lpf: Interleave the start of flat8in into the calculation above
This adds lots of extra .ifs, but speeds it up by a couple cycles,
by avoiding stalls.
This is cherrypicked from libav commit
b0806088d3b27044145b20421da8d39089ae0c6a .
Signed-off-by: Martin Storsjö <martin@martin.st>
Martin Storsjö [Tue, 10 Jan 2017 14:49:13 +0000 (16:49 +0200)]
arm: vp9lpf: Interleave the start of flat8in into the calculation above
This adds lots of extra .ifs, but speeds it up by a couple cycles,
by avoiding stalls.
This is cherrypicked from libav commit
e18c39005ad1dbb178b336f691da1de91afd434e .
Signed-off-by: Martin Storsjö <martin@martin.st>
Martin Storsjö [Fri, 13 Jan 2017 21:42:28 +0000 (23:42 +0200)]
arm: vp9lpf: Use orrs instead of orr+cmp
This is cherrypicked from libav commit
435cd7bc99671bf561193421a50ac6e9d63c4266 .
Signed-off-by: Martin Storsjö <martin@martin.st>
Martin Storsjö [Thu, 12 Jan 2017 14:52:33 +0000 (16:52 +0200)]
arm/aarch64: vp9lpf: Calculate !hev directly
Previously we first calculated hev, and then negated it.
Since we were able to schedule the negation in the middle
of another calculation, we don't see any gain in all cases.
Before: Cortex A7 A8 A9 A53 A53/AArch64
vp9_loop_filter_v_4_8_neon: 147.0 129.0 115.8 89.0 88.7
vp9_loop_filter_v_8_8_neon: 242.0 198.5 174.7 140.0 136.7
vp9_loop_filter_v_16_8_neon: 500.0 419.5 382.7 293.0 275.7
vp9_loop_filter_v_16_16_neon: 971.2 825.5 731.5 579.0 453.0
After:
vp9_loop_filter_v_4_8_neon: 143.0 127.7 114.8 88.0 87.7
vp9_loop_filter_v_8_8_neon: 241.0 197.2 173.7 140.0 136.7
vp9_loop_filter_v_16_8_neon: 497.0 419.5 379.7 293.0 275.7
vp9_loop_filter_v_16_16_neon: 965.2 818.7 731.4 579.0 452.0
This is cherrypicked from libav commit
e1f9de86f454861b69b199ad801adc2ec6c3b220 .
Signed-off-by: Martin Storsjö <martin@martin.st>
Martin Storsjö [Wed, 4 Jan 2017 10:57:56 +0000 (12:57 +0200)]
aarch64: vp9itxfm: Optimize 16x16 and 32x32 idct dc by unrolling
This work is sponsored by, and copyright, Google.
Before: Cortex A53
vp9_inv_dct_dct_16x16_sub1_add_neon: 235.3
vp9_inv_dct_dct_32x32_sub1_add_neon: 555.1
After:
vp9_inv_dct_dct_16x16_sub1_add_neon: 180.2
vp9_inv_dct_dct_32x32_sub1_add_neon: 475.3
This is cherrypicked from libav commit
3fcf788fbbccc4130868e7abe58a88990290f7c1 .
Signed-off-by: Martin Storsjö <martin@martin.st>
Martin Storsjö [Wed, 4 Jan 2017 11:08:51 +0000 (13:08 +0200)]
arm: vp9itxfm: Optimize 16x16 and 32x32 idct dc by unrolling
This work is sponsored by, and copyright, Google.
Before: Cortex A7 A8 A9 A53
vp9_inv_dct_dct_16x16_sub1_add_neon: 273.0 189.5 211.7 235.8
vp9_inv_dct_dct_32x32_sub1_add_neon: 752.0 459.2 862.2 553.9
After:
vp9_inv_dct_dct_16x16_sub1_add_neon: 226.5 145.0 225.1 171.8
vp9_inv_dct_dct_32x32_sub1_add_neon: 721.2 415.7 727.6 475.0
This is cherrypicked from libav commit
a76bf8cf1277ef6feb1580b578f5e6ca327e713c .
Signed-off-by: Martin Storsjö <martin@martin.st>
Martin Storsjö [Sat, 17 Dec 2016 11:14:38 +0000 (13:14 +0200)]
aarch64: vp9mc: Calculate less unused data in the 4 pixel wide horizontal filter
No measured speedup on a Cortex A53, but other cores might benefit.
This is cherrypicked from libav commit
388e0d2515bc6bbc9d0c9af1d230bd16cf945fe7 .
Signed-off-by: Martin Storsjö <martin@martin.st>
Martin Storsjö [Sat, 17 Dec 2016 11:09:50 +0000 (13:09 +0200)]
arm: vp9mc: Calculate less unused data in the 4 pixel wide horizontal filter
Before: Cortex A7 A8 A9 A53
vp9_put_8tap_smooth_4h_neon: 378.1 273.2 340.7 229.5
After:
vp9_put_8tap_smooth_4h_neon: 352.1 222.2 290.5 229.5
This is cherrypicked from libav commit
fea92a4b57d1c328b1de226a5f213a629ee63754 .
Signed-off-by: Martin Storsjö <martin@martin.st>
Martin Storsjö [Fri, 16 Dec 2016 22:55:41 +0000 (00:55 +0200)]
aarch64: vp9mc: Simplify the extmla macro parameters
Fold the field lengths into the macro.
This makes the macro invocations much more readable, when the
lines are shorter.
This also makes it easier to use only half the registers within
the macro.
This is cherrypicked from libav commit
5e0c2158fbc774f87d3ce4b7b950ba4d42c4a7b8 .
Signed-off-by: Martin Storsjö <martin@martin.st>
Martin Storsjö [Tue, 3 Jan 2017 14:11:56 +0000 (16:11 +0200)]
aarch64: vp9itxfm: Fix incorrect vertical alignment
This is cherrypicked from libav commit
0c0b87f12d48d4e7f0d3d13f9345e828a3a5ea32 .
Signed-off-by: Martin Storsjö <martin@martin.st>
Martin Storsjö [Tue, 3 Jan 2017 21:11:51 +0000 (23:11 +0200)]
aarch64: vp9itxfm: Update a comment to refer to a register with a different name
This is cherrypicked from libav commit
8476eb0d3ab1f7a52317b23346646389c08fb57a .
Signed-off-by: Martin Storsjö <martin@martin.st>
Martin Storsjö [Tue, 3 Jan 2017 14:46:17 +0000 (16:46 +0200)]
aarch64: vp9itxfm: Use the right lane sizes in 8x8 for improved readability
This is cherrypicked from libav commit
3dd7827258ddaa2e51085d0c677d6f3b1be3572f .
Signed-off-by: Martin Storsjö <martin@martin.st>
Martin Storsjö [Tue, 3 Jan 2017 12:55:46 +0000 (14:55 +0200)]
aarch64: vp9itxfm: Use a single lane ld1 instead of ld1r where possible
The ld1r is a leftover from the arm version, where this trick is
beneficial on some cores.
Use a single-lane load where we don't need the semantics of ld1r.
This is cherrypicked from libav commit
ed8d293306e12c9b79022d37d39f48825ce7f2fa .
Signed-off-by: Martin Storsjö <martin@martin.st>
Martin Storsjö [Tue, 3 Jan 2017 14:39:41 +0000 (16:39 +0200)]
aarch64: vp9itxfm: Share instructions for loading idct coeffs in the 8x8 function
This is cherrypicked from libav commit
4da4b2b87f08a1331650c7e36eb7d4029a160776 .
Signed-off-by: Martin Storsjö <martin@martin.st>
Martin Storsjö [Tue, 3 Jan 2017 14:38:56 +0000 (16:38 +0200)]
arm: vp9itxfm: Share instructions for loading idct coeffs in the 8x8 function
This is cherrypicked from libav commit
3933b86bb93aca47f29fbd493075b0f110c1e3f5 .
Signed-off-by: Martin Storsjö <martin@martin.st>
Martin Storsjö [Tue, 22 Nov 2016 20:58:35 +0000 (22:58 +0200)]
aarch64: vp9itxfm: Do separate functions for half/quarter idct16 and idct32
This work is sponsored by, and copyright, Google.
This avoids loading and calculating coefficients that we know will
be zero, and avoids filling the temp buffer with zeros in places
where we know the second pass won't read.
This gives a pretty substantial speedup for the smaller subpartitions.
The code size increases from 14740 bytes to 24292 bytes.
The idct16/32_end macros are moved above the individual functions; the
instructions themselves are unchanged, but since new functions are added
at the same place where the code is moved from, the diff looks rather
messy.
Before:
vp9_inv_dct_dct_16x16_sub1_add_neon: 236.7
vp9_inv_dct_dct_16x16_sub2_add_neon: 1051.0
vp9_inv_dct_dct_16x16_sub4_add_neon: 1051.0
vp9_inv_dct_dct_16x16_sub8_add_neon: 1051.0
vp9_inv_dct_dct_16x16_sub12_add_neon: 1387.4
vp9_inv_dct_dct_16x16_sub16_add_neon: 1387.6
vp9_inv_dct_dct_32x32_sub1_add_neon: 554.1
vp9_inv_dct_dct_32x32_sub2_add_neon: 5198.5
vp9_inv_dct_dct_32x32_sub4_add_neon: 5198.6
vp9_inv_dct_dct_32x32_sub8_add_neon: 5196.3
vp9_inv_dct_dct_32x32_sub12_add_neon: 6183.4
vp9_inv_dct_dct_32x32_sub16_add_neon: 6174.3
vp9_inv_dct_dct_32x32_sub20_add_neon: 7151.4
vp9_inv_dct_dct_32x32_sub24_add_neon: 7145.3
vp9_inv_dct_dct_32x32_sub28_add_neon: 8119.3
vp9_inv_dct_dct_32x32_sub32_add_neon: 8118.7
After:
vp9_inv_dct_dct_16x16_sub1_add_neon: 236.7
vp9_inv_dct_dct_16x16_sub2_add_neon: 640.8
vp9_inv_dct_dct_16x16_sub4_add_neon: 639.0
vp9_inv_dct_dct_16x16_sub8_add_neon: 842.0
vp9_inv_dct_dct_16x16_sub12_add_neon: 1388.3
vp9_inv_dct_dct_16x16_sub16_add_neon: 1389.3
vp9_inv_dct_dct_32x32_sub1_add_neon: 554.1
vp9_inv_dct_dct_32x32_sub2_add_neon: 3685.5
vp9_inv_dct_dct_32x32_sub4_add_neon: 3685.1
vp9_inv_dct_dct_32x32_sub8_add_neon: 3684.4
vp9_inv_dct_dct_32x32_sub12_add_neon: 5312.2
vp9_inv_dct_dct_32x32_sub16_add_neon: 5315.4
vp9_inv_dct_dct_32x32_sub20_add_neon: 7154.9
vp9_inv_dct_dct_32x32_sub24_add_neon: 7154.5
vp9_inv_dct_dct_32x32_sub28_add_neon: 8126.6
vp9_inv_dct_dct_32x32_sub32_add_neon: 8127.2
This is cherrypicked from libav commit
a63da4511d0fee66695ff4afd264ba1dbf1e812d .
Signed-off-by: Martin Storsjö <martin@martin.st>
Martin Storsjö [Tue, 22 Nov 2016 09:07:38 +0000 (11:07 +0200)]
arm: vp9itxfm: Do a simpler half/quarter idct16/idct32 when possible
This work is sponsored by, and copyright, Google.
This avoids loading and calculating coefficients that we know will
be zero, and avoids filling the temp buffer with zeros in places
where we know the second pass won't read.
This gives a pretty substantial speedup for the smaller subpartitions.
The code size increases from 12388 bytes to 19784 bytes.
The idct16/32_end macros are moved above the individual functions; the
instructions themselves are unchanged, but since new functions are added
at the same place where the code is moved from, the diff looks rather
messy.
Before: Cortex A7 A8 A9 A53
vp9_inv_dct_dct_16x16_sub1_add_neon: 273.0 189.5 212.0 235.8
vp9_inv_dct_dct_16x16_sub2_add_neon: 2102.1 1521.7 1736.2 1265.8
vp9_inv_dct_dct_16x16_sub4_add_neon: 2104.5 1533.0 1736.6 1265.5
vp9_inv_dct_dct_16x16_sub8_add_neon: 2484.8 1828.7 2014.4 1506.5
vp9_inv_dct_dct_16x16_sub12_add_neon: 2851.2 2117.8 2294.8 1753.2
vp9_inv_dct_dct_16x16_sub16_add_neon: 3239.4 2408.3 2543.5 1994.9
vp9_inv_dct_dct_32x32_sub1_add_neon: 758.3 456.7 864.5 553.9
vp9_inv_dct_dct_32x32_sub2_add_neon: 10776.7 7949.8 8567.7 6819.7
vp9_inv_dct_dct_32x32_sub4_add_neon: 10865.6 8131.5 8589.6 6816.3
vp9_inv_dct_dct_32x32_sub8_add_neon: 12053.9 9271.3 9387.7 7564.0
vp9_inv_dct_dct_32x32_sub12_add_neon: 13328.3 10463.2 10217.0 8321.3
vp9_inv_dct_dct_32x32_sub16_add_neon: 14176.4 11509.5 11018.7 9062.3
vp9_inv_dct_dct_32x32_sub20_add_neon: 15301.5 12999.9 11855.1 9828.2
vp9_inv_dct_dct_32x32_sub24_add_neon: 16482.7 14931.5 12650.1 10575.0
vp9_inv_dct_dct_32x32_sub28_add_neon: 17589.5 15811.9 13482.8 11333.4
vp9_inv_dct_dct_32x32_sub32_add_neon: 18696.2 17049.2 14355.6 12089.7
After:
vp9_inv_dct_dct_16x16_sub1_add_neon: 273.0 189.5 211.7 235.8
vp9_inv_dct_dct_16x16_sub2_add_neon: 1203.5 998.2 1035.3 763.0
vp9_inv_dct_dct_16x16_sub4_add_neon: 1203.5 998.1 1035.5 760.8
vp9_inv_dct_dct_16x16_sub8_add_neon: 1926.1 1610.6 1722.1 1271.7
vp9_inv_dct_dct_16x16_sub12_add_neon: 2873.2 2129.7 2285.1 1757.3
vp9_inv_dct_dct_16x16_sub16_add_neon: 3221.4 2520.3 2557.6 2002.1
vp9_inv_dct_dct_32x32_sub1_add_neon: 753.0 457.5 866.6 554.6
vp9_inv_dct_dct_32x32_sub2_add_neon: 7554.6 5652.4 6048.4 4920.2
vp9_inv_dct_dct_32x32_sub4_add_neon: 7549.9 5685.0 6046.9 4925.7
vp9_inv_dct_dct_32x32_sub8_add_neon: 8336.9 6704.5 6604.0 5478.0
vp9_inv_dct_dct_32x32_sub12_add_neon: 10914.0 9777.2 9240.4 7416.9
vp9_inv_dct_dct_32x32_sub16_add_neon: 11859.2 11223.3 9966.3 8095.1
vp9_inv_dct_dct_32x32_sub20_add_neon: 15237.1 13029.4 11838.3 9829.4
vp9_inv_dct_dct_32x32_sub24_add_neon: 16293.2 14379.8 12644.9 10572.0
vp9_inv_dct_dct_32x32_sub28_add_neon: 17424.3 15734.7 13473.0 11326.9
vp9_inv_dct_dct_32x32_sub32_add_neon: 18531.3 17457.0 14298.6 12080.0
This is cherrypicked from libav commit
5eb5aec475aabc884d083566f902876ecbc072cb .
Signed-off-by: Martin Storsjö <martin@martin.st>
Martin Storsjö [Sun, 5 Feb 2017 20:53:55 +0000 (22:53 +0200)]
aarch64: vp9itxfm: Move the load_add_store macro out from the itxfm16 pass2 function
This allows reusing the macro for a separate implementation of the
pass2 function.
This is cherrypicked from libav commit
79d332ebbde8c0a3e9da094dcfd10abd33ba7378 .
Signed-off-by: Martin Storsjö <martin@martin.st>