Move tfjs to first, average, worst scoring.

Since this test is relatively slow per iteration I reduced the iteration counts to:
non-simd: 15 iterations with worst 2.
simd: 80 iterations with worst 4 (default).

Also, I moved all the benchmark files to a tfjs directory to help organize the wasm directory a bit.
12 files changed