1
This directory contains mpn functions for various HP PA-RISC chips. Code
2
that runs faster on the PA7100 and later implementations, is in the pa7100
5
RELEVANT OPTIMIZATION ISSUES
9
On the PA7000 no memory instructions can issue the two cycles after a store.
10
For the PA7100, this is reduced to one cycle.
12
The PA7100 has a lookup-free cache, so it helps to schedule loads and the
13
dependent instruction really far from each other.
17
1. mpn_mul_1 could be improved to 6.5 cycles/limb on the PA7100, using the
18
instructions bwlow (but some sw pipelining is needed to avoid the
40
2. mpn_addmul_1 could be improved from the current 10 to 7.5 cycles/limb
41
(asymptotically) on the PA7100, using the instructions below. With proper
42
sw pipelining and the unrolling level below, the speed becomes 8