simulation times from `library_benchmark` may be misleading

Thanks @migueldiascosta for raising this point. Indeed there is this asymmetry between the benchmark between qibo backends (circuit_benchmark) and other libraries (library_benchmark).

I cannot think of why this was intentional, other than that seperating these two times may have required a specialized treatment for each library. With the current structure, this could have been done by adding a to_numpy method in each backend under the libraries/ folder, but I guess we were not very interested at this figure for other libraries during the benchmark.

In the end it is a matter of defining the goal/figure of merit of the benchmark. For this benchmark the goal (expected outcome) for each library was set to be a np.ndarray in order to be fair, otherwise each library may return its own data type with less or more functionality for post-processing. If we are only interested in simulation, then we should indeed discard the transfer time, but we should keep in mind that in practice one may want to print/save to disk/calculate something using the state after simulation so some transfer time may be involved, most likely different depending on library, operation to be done and how efficiently it is implemented.

qiboteam / qibojit-benchmarks

simulation times from `library_benchmark` may be misleading #45