bin/build-release: support building each arch in parallel
If parallel is available, each arch can now be built in parallel.
You can set CIRROS_PARALLEL to one of:
0 or 'true': use parallel's default number of jobs (1 per core)
N: use N jobs, where N is a positive integer
'auto': use parallel with its default number of jobs if parallel is available.
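
A rough sketch of how the values above might be interpreted, not the
actual code in bin/build-release. The function name parallel_mode and the
output words "default", "serial" and "invalid" are hypothetical. Treating
an empty value as a serial build, and falling back to a serial build when
'auto' finds no parallel, are assumptions.

```shell
#!/bin/sh
# Hypothetical helper: map a CIRROS_PARALLEL value to a build mode.
# Prints "default" (parallel's default, 1 job per core), "serial"
# (build one arch at a time), a job count, or "invalid".
parallel_mode() {
    val="$1"
    case "$val" in
        0|true)
            echo "default";;
        auto)
            # Use parallel only if the command is found on PATH.
            if command -v parallel >/dev/null 2>&1; then
                echo "default"
            else
                echo "serial"   # assumption: fall back to serial builds
            fi;;
        "")
            echo "serial";;     # assumption: empty means no parallelism
        *[!0-9]*)
            # Anything containing a non-digit is not a job count.
            echo "invalid"
            return 1;;
        *)
            echo "$val";;       # explicit number of jobs
    esac
}
```

For example, `parallel_mode 4` prints 4, and `parallel_mode true` prints
default.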