
Issues with multi-agent settings #597

Open
DongChen06 opened this issue Apr 28, 2024 · 3 comments

@DongChen06 (Contributor)

Dear author, I am implementing a multi-agent setting using highway-v0. I am not able to achieve stable training, and the vehicles can run off the road without the environment terminating. I took a look at the code; in the reward function:

def _rewards(self, action: Action) -> Dict[Text, float]:
    neighbours = self.road.network.all_side_lanes(self.vehicle.lane_index)
    lane = (
        self.vehicle.target_lane_index[2]
        if isinstance(self.vehicle, ControlledVehicle)
        else self.vehicle.lane_index[2]
    )
    # Use forward speed rather than speed, see https://github.com/eleurent/highway-env/issues/268
    forward_speed = self.vehicle.speed * np.cos(self.vehicle.heading)
    scaled_speed = utils.lmap(
        forward_speed, self.config["reward_speed_range"], [0, 1]
    )
    return {
        "collision_reward": float(self.vehicle.crashed),
        "right_lane_reward": lane / max(len(neighbours) - 1, 1),
        "high_speed_reward": np.clip(scaled_speed, 0, 1),
        "on_road_reward": float(self.vehicle.on_road),
    }

and the termination function:

def _is_terminated(self) -> bool:
    """The episode is over if the ego vehicle crashed."""
    return (
        self.vehicle.crashed
        or self.config["offroad_terminal"]
        and not self.vehicle.on_road
    )

It seems only self.vehicle is considered, rather than self.controlled_vehicles. Any thoughts would be appreciated.
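What I would expect instead is something like the sketch below (mock classes for illustration, not the actual highway-env code), where the terminal check is aggregated over self.controlled_vehicles:

```python
# Rough sketch with mock classes (not the actual highway-env code):
# check crash/off-road status for every controlled vehicle instead of
# only self.vehicle.

class MockVehicle:
    def __init__(self, crashed=False, on_road=True):
        self.crashed = crashed
        self.on_road = on_road


class MockMultiAgentEnv:
    def __init__(self, vehicles, offroad_terminal=True):
        self.controlled_vehicles = vehicles
        self.config = {"offroad_terminal": offroad_terminal}

    def _agent_is_terminal(self, vehicle):
        # Per-agent terminal condition, mirroring the single-agent
        # _is_terminated logic above.
        return vehicle.crashed or (
            self.config["offroad_terminal"] and not vehicle.on_road
        )

    def _is_terminated(self):
        # End the episode only when every controlled vehicle is terminal;
        # using any() would end it as soon as a single agent fails.
        return all(self._agent_is_terminal(v) for v in self.controlled_vehicles)
```

Whether the episode should end on the first failure (any) or the last (all) is a design choice; the sketch uses all so surviving agents keep training.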

@hkbharath

As far as I can see, it is necessary to implement a separate multi-agent version of the single-agent highway-env, along with specifying the multi-agent action and observation spaces in the config. IntersectionEnv appears to be implemented with multi-agent support, but the other envs must be extended explicitly for multi-agent scenarios.
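For reference, a config of roughly this shape should work with intersection-v0 (a sketch following its documented multi-agent pattern; the exact keys should be verified against the highway-env docs):

```python
import gymnasium as gym
import highway_env  # noqa: F401  (registers the highway-env environments)

# Sketch of a multi-agent config, following the intersection-v0 pattern.
env = gym.make(
    "intersection-v0",
    config={
        "controlled_vehicles": 2,
        "action": {
            "type": "MultiAgentAction",
            "action_config": {"type": "DiscreteMetaAction"},
        },
        "observation": {
            "type": "MultiAgentObservation",
            "observation_config": {"type": "Kinematics"},
        },
    },
)
obs, info = env.reset()
# obs should be a tuple with one observation per controlled vehicle
```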

@IsYang23

IsYang23 commented Sep 4, 2024

Hey, when trying to run my highway script with multi-agent settings, I run into this error:

File ~.conda\envs\spyder\Lib\site-packages\stable_baselines3\common\base_class.py:180 in __init__
    assert isinstance(self.action_space, supported_action_spaces), (

AssertionError: The algorithm only supports (<class 'gymnasium.spaces.discrete.Discrete'>,) as action spaces but Tuple(Discrete(5), Discrete(5)) was provided

Did you encounter the same error? How did you solve it?
Here is my env config:
config = {
    "action": {
        "type": "MultiAgentAction",
        "action_config": {
            "type": "DiscreteMetaAction",
            "longitudinal": True,
            "lateral": True,
            "target_speeds": [50, 60, 70, 80],
        },
    },
    "observation": {
        "type": "MultiAgentObservation",
        "observation_config": {
            "type": "Kinematics",
            "vehicles_count": 8,
            "features": ["presence", "x", "y", "vx", "vy", "cos_h", "sin_h"],
            "absolute": False,
        },
    },
    "lanes_count": 3,
    "vehicles_count": 10,
    "controlled_vehicles": 2,
    "collision_reward": -1,
    "right_lane_reward": 0,
    "high_speed_reward": 1,
    "lane_change_reward": 0.1,
    "reward_speed_range": [20, 30],
}
# (passed to gym.make together with render_mode="rgb_array")

@hkbharath

This looks like a separate issue. You should check the algorithm that you are using. The RL algorithm (from Stable Baselines3) that you are using seems to support only single-agent action spaces. Either modify the algorithm for multi-agent settings, or use the multi-agent versions of the RL algorithms available in Ray RLlib or other alternatives to train multiple agents.
