Modify bench to specify OMP_NUM_THREADS, needed on ARM architecture
Add script for bench on each implementation.
Tiny modifications for input/output
Add OpenMP version.