~siretart/x264/trunk

Viewing all changes in revision 2932.

  • Committer: Henrik Gramner
  • Author(s): Alexandra Hájková
  • Date: 2018-08-06 21:24:13 UTC
  • Revision ID: git-v1:303c484ec828ed0d8bfe743500e70314d026c3bd
ppc: Optimize quant functions

1) using xxpermdi + merge instead of 2 merges improves quant_8x8
performance by 5%

2) use vec_splats instead of vec_splat

checkasm timings when compiled with gcc:
                  C:            AltiVec:
                                before: after:
quant_2x2_dc:      57            163      46
quant_4x4_dc:     141            162      57

dequant_4x4_cmp:  104            101      45
dequant_4x4_flat: 104            106      46
dequant_8x8_cmp:  412            208     147
dequant_8x8_flat: 414            212     149

expand all expand all

Show diffs side-by-side

added added

removed removed

Lines of Context: