lgsvl / simulator

A ROS/ROS2 Multi-robot Simulator for Autonomous Vehicles

Error when restarting simulation using Python API #1522

Closed AIasd closed 3 years ago

AIasd commented 3 years ago

Hi, I am trying to run a simulation multiple times without restarting the Python script. However, it seems that after the first run, the program gets stuck while retrieving self.data.

In particular, on the second call to run_svl_simulation, when execution reaches the line print('dir(sim)', dir(sim)), it correctly prints all the attribute names (including current_scene) of the sim object. But then it gets stuck at the line print('sim.current_scene', sim.current_scene) and stays there forever.

The code snippet looks like the following:

import os
import lgsvl
import time
import psutil
import math

def initialize_simulator_and_dv(map):
    SIMULATOR_HOST = os.environ.get("SIMULATOR_HOST", "127.0.0.1")
    SIMULATOR_PORT = int(os.environ.get("SIMULATOR_PORT", 8181))
    BRIDGE_HOST = os.environ.get("BRIDGE_HOST", "127.0.0.1")
    BRIDGE_PORT = int(os.environ.get("BRIDGE_PORT", 9090))

    sim = lgsvl.Simulator(SIMULATOR_HOST, SIMULATOR_PORT)
    print('dir(sim)', dir(sim))
    print('sim.current_scene', sim.current_scene)
    if sim.current_scene == map:
        sim.reset()
    else:
        sim.load(map, seed=0)

    spawns = sim.get_spawn()
    state = lgsvl.AgentState()
    state.transform = spawns[0]

    ego = sim.add_agent("2e9095fa-c9b9-4f3f-8d7d-65fa2bb03921", lgsvl.AgentType.EGO, state)
    ego.connect_bridge(BRIDGE_HOST, BRIDGE_PORT)

    # Dreamview setup
    dv = lgsvl.dreamview.Connection(sim, ego, BRIDGE_HOST)
    dv.set_hd_map('Borregas Ave')
    dv.set_vehicle('Lincoln2017MKZ_LGSVL')
    modules = [
        'Localization',
        'Perception',
        'Transform',
        'Routing',
        'Prediction',
        'Planning',
        'Camera',
        # 'Traffic Light',
        'Control'
    ]

    start = lgsvl.Transform(position=ego.state.transform.position, rotation=ego.state.transform.rotation)
    destination = lgsvl.Transform(position=lgsvl.Vector(24.970,-2.615,-29.956), rotation=lgsvl.Vector(0.731,104.547,358.660))
    dv.setup_apollo(destination.position.x, destination.position.z, modules, default_timeout=60)

    return sim, ego, start, destination

def run_svl_simulation(map, config):
    sim, ego, start, destination = initialize_simulator_and_dv(map)
    middle_point = lgsvl.Transform(position=(destination.position + start.position) * 0.5, rotation=start.rotation)

    ped_x, ped_z, ped_yaw, ped_speed, ped_trigger_distance, ped_travel_distance = config
    ped_position_offset = lgsvl.Vector(ped_x, 0, ped_z)
    ped_rotation_offset = lgsvl.Vector(0, ped_yaw, 0)
    ped_point = lgsvl.Transform(position=middle_point.position+ped_position_offset, rotation=middle_point.rotation+ped_rotation_offset)

    forward = lgsvl.utils.transform_to_forward(ped_point)

    wp = [
        lgsvl.WalkWaypoint(ped_point.position, 0, ped_trigger_distance),
        lgsvl.WalkWaypoint(ped_point.position + ped_travel_distance * forward, 0, 0) ]
    state = lgsvl.AgentState()
    state.transform = ped_point
    state.velocity = ped_speed * forward

    p = sim.add_agent("Pamela", lgsvl.AgentType.PEDESTRIAN, state)
    p.follow(wp, False)

    duration = 20
    step_time = 1
    step_rate = int(1.0 / step_time)
    steps = duration * step_rate

    for i in range(steps):
        sim.run(time_limit=step_time, time_scale=1)

        state = ego.state
        pos = state.position
        rot = state.rotation
        speed = state.speed * 3.6

if __name__ == '__main__':
    map = "BorregasAve"
    config = [4, 4, 2, 3, 10, 50]
    run_svl_simulation(map, config)
    run_svl_simulation(map, config)

and the traceback after I exit the program via Ctrl-C looks like:

Traceback (most recent call last):
  File "demo_step2.py", line 178, in <module>
    run_svl_simulation(map, config)
  File "demo_step2.py", line 95, in run_svl_simulation
    sim, ego, start, destination = initialize_simulator_and_dv(map)
  File "demo_step2.py", line 38, in initialize_simulator_and_dv
    print('sim.current_scene', sim.current_scene)
  File "/home/zhongzzy9/Documents/self-driving-car/PythonAPI/lgsvl/simulator.py", line 57, in current_scene
    return self.remote.command("simulator/current_scene")
  File "/home/zhongzzy9/Documents/self-driving-car/PythonAPI/lgsvl/remote.py", line 64, in command
    self.cv.wait_for(lambda: self.data is not None)
  File "/home/zhongzzy9/anaconda3/envs/carla99/lib/python3.7/threading.py", line 331, in wait_for
    self.wait(waittime)
  File "/home/zhongzzy9/anaconda3/envs/carla99/lib/python3.7/threading.py", line 296, in wait
    waiter.acquire()

What can be the potential cause for this behavior? Thanks!

AIasd commented 3 years ago

Figured out the error: only one client can be connected to the simulator at a time, so I cannot call lgsvl.Simulator twice. Is there a way to disconnect the current client so I can call lgsvl.Simulator again? Otherwise, I have to keep track of the sim instance and pass it across multiple simulations.
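For anyone hitting the same thing, one workaround is to cache the connection so every run reuses the same Simulator object instead of reconnecting. This is my own sketch, not an official lgsvl API; get_or_create is a hypothetical helper name, and only the commented usage lines are lgsvl-specific:

```python
# Sketch of a connection-reuse helper (hypothetical, not part of lgsvl):
# the simulator accepts only one client at a time, so create the
# Simulator once and hand the cached object to every subsequent run.

_cache = {}

def get_or_create(key, factory):
    """Create the object with factory() on first use, then reuse it."""
    if key not in _cache:
        _cache[key] = factory()
    return _cache[key]

# Intended usage (assumes the imports and constants from the snippet above):
# sim = get_or_create("sim", lambda: lgsvl.Simulator(SIMULATOR_HOST, SIMULATOR_PORT))
# if sim.current_scene == map:
#     sim.reset()
# else:
#     sim.load(map, seed=0)
```

With this, initialize_simulator_and_dv can call get_or_create instead of lgsvl.Simulator directly, and the second run_svl_simulation call reuses the existing connection.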

EricBoiseLGSVL commented 3 years ago

@hadiTab do we have any plans to support this?

AIasd commented 3 years ago

A related question: is there a way to avoid restarting the Apollo modules for every simulation? This step is quite time-consuming when I also enable the perception module.

EricBoiseLGSVL commented 3 years ago

I think you will need to post on Apollo's issues to see if they are aware of the issue or have a solution.

lemketron commented 3 years ago

If you don't need to have perception running, you can use ground truth from the simulator, which will make the Apollo cycle time a little faster. Check out the modular testing docs.

AIasd commented 3 years ago

Hi @lemketron, unfortunately I do want to test the perception module, so it seems unavoidable.

AIasd commented 3 years ago

Hi @EricBoiseLGSVL , yes I probably should open an issue there asking about this question.

lemketron commented 3 years ago

> Hi @lemketron, unfortunately I do want to test the perception module, so it seems unavoidable.

I understand. Unfortunately perception has been unavailable in Apollo for so long that we have focused most of our 6.0/master testing on using "modular testing". In fact even if you enable the modules, it will likely only work if you have a super powerful graphics workstation, and may still require a dedicated machine to run Apollo.

I can run modular testing with Apollo and SVL Simulator on an 8GB RTX 2070 Max-Q in a 15" Razer Blade laptop with a 6-core (12-thread) i7, but there is not enough GPU memory to enable Apollo's perception modules on this one machine. I mention this just so you're aware, in case you're trying to do this on one single machine...

In any case if you're having issues with it then yes, please post the details over on Apollo issues and feel free to link to this issue as well.

AIasd commented 3 years ago

Hi @lemketron, thanks for the comments! I think my machine should probably be fine since it has 2× 2080 Ti (one for SVL and one for Apollo) and a 14-core i9. I currently can run Apollo with perception (although still not that smoothly).

Another performance-related question: is it possible to save camera images and other simulator data (e.g. the ego car's and other cars' locations) in parallel with the simulation process (i.e. sim.run)? Right now I have to run one simulation step (0.1 s), do the saving, and then run the next step (0.1 s). The saving step is really time-consuming and thus makes the simulation very slow. Any ideas or related issues along that line?
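For reference, a generic pattern for overlapping the slow saving with the stepping loop is a background writer thread fed by a queue. This is my own sketch, not simulator-specific; step and save below are stand-ins for the real sim.run stepping code and the expensive saving code:

```python
import queue
import threading

def run_with_async_saving(n_steps, step, save):
    """Step the simulation on the main thread while a background
    thread performs the slow saving work off a queue."""
    q = queue.Queue()

    def writer():
        while True:
            item = q.get()
            if item is None:          # sentinel: simulation finished
                return
            save(item)                # slow disk I/O happens here

    t = threading.Thread(target=writer, daemon=True)
    t.start()
    for i in range(n_steps):
        snapshot = step(i)            # e.g. sim.run(time_limit=0.1) + reading ego.state
        q.put(snapshot)               # hand off; the loop continues immediately
    q.put(None)
    t.join()
```

Note this only helps if the simulator releases the Python thread during saving (i.e. the saving is I/O-bound); it does not replace the cyber_recorder approach recommended below for camera images.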

lemketron commented 3 years ago

Wow, that sounds like a pretty impressive system, congrats!

As for saving images, we recommend using cyber-recorder to capture the images and any other data you wish to save to cyber bags, which you can then post-process. The Python API is for controlling the simulation, not for retrieving sensor data in real time.

And if you are stepping, be sure to check out the clock sensor so Apollo can use simulator time rather than falling behind real time, especially if you're making a lot of Python API calls at each step.

AIasd commented 3 years ago

Hi @lemketron ,

Are you referring to Cyber RT python API listener example?

Also, do you know how I can check whether the clock sensor is being used properly?

AIasd commented 3 years ago

I have figured out the clock sensor part. If I understand correctly, you mean I should create a new sensor (similar to the comfort sensor) that streams the data I want (which requires creating a new *.proto for that purpose) and sends it to Apollo via CyberRT, and then I can create a listener that listens on that channel?

Is there an easier way for my purpose? Say, for example, I want to know the ego car's speed while sim.run is running; what is the simplest way to record this information locally? (It is OK to lose some of this info as long as most of it is recorded.)

EricBoiseLGSVL commented 3 years ago

Camera images can be saved with the ColorCameraSensor; no need to make a new one. If you are just looking to get speed, I would use already existing sensors, because it is a pain to make new proto files.

AIasd commented 3 years ago

Hi @EricBoiseLGSVL, thanks for the reply! In that case, all I need to do is create a CyberRT listener that listens to the ColorCameraSensor channel and stores the data locally while the simulation is running; am I understanding it correctly?

I guess I also want info like the locations of other agents in real time. I might get that from the 3D Ground Truth Sensor? But I also don't want Apollo to use it. Is there a switch I can turn off for that in the config?

lemketron commented 3 years ago

Just change the topic name that the simulator publishes the ground truth on and the Apollo perception won't see it.

Instead of: /apollo/perception/obstacles

Use the default from the docs: /simulator/ground_truth/3d_detections
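For illustration, the relevant entry in the EGO vehicle's sensor configuration might look like the fragment below. Only the Topic value comes from this thread; the exact field names and surrounding structure are from memory and should be checked against the SVL sensor configuration docs:

```json
{
  "type": "3D Ground Truth",
  "name": "3D Ground Truth",
  "params": {
    "Frequency": 10,
    "Topic": "/simulator/ground_truth/3d_detections"
  }
}
```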

AIasd commented 3 years ago

@lemketron I see, thanks a lot!

EricBoiseLGSVL commented 3 years ago

Looks like the original issue was resolved; marking this to close soon. Thanks for the help @lemketron