A Tensor Processing Unit Design For Fpga Benchmarking