use rllib v2.5.1 because for v2.6 onwards there are architectural changes that affect logging, structure of agent_model.py and how rllib algorithms are used, i.e. impala. Updating to v2.6+ will require some syntax changes which will be done later
optimise how the raw data of each step is fetched from aerospike