Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feature(wrh): add continuous action space in mt for unizero. #271

Closed
Changes from 1 commit
Commits
Show all changes
26 commits
Select commit Hold shift + click to select a range
c1da960
polish(pu): polish readme using colorful icons
puyuan1996 Jul 12, 2024
fdd414c
fix(pu): fix lightzero versions
puyuan1996 Jul 12, 2024
83f10f0
fix(pu): fix empty_keys_values in init_infer
dyyoungg Jul 14, 2024
e11cf12
polish(pu): polish documentation in readme
puyuan1996 Jul 15, 2024
d2f5ba8
Merge branch 'main' of https://github.com/opendilab/LightZero
puyuan1996 Jul 15, 2024
230a4a5
feature(pu): add lightzero sphinx docs (#237)
puyuan1996 Jul 15, 2024
8c691d6
polish(pu): polish documentation structure
puyuan1996 Jul 15, 2024
032825f
feature(wrh): Add Harmony Dream loss balance in MuZero (#242)
ruiheng123 Jul 16, 2024
a44a2bf
polish(pu): polish documentation
puyuan1996 Jul 16, 2024
a0bf161
Merge branch 'main' of https://github.com/opendilab/LightZero
puyuan1996 Jul 16, 2024
00f82fb
polish(pu): add unizero quick start in readme
puyuan1996 Jul 24, 2024
fff7fde
polish(pu): rename model_update_ratio to replay_ratio
puyuan1996 Jul 25, 2024
c004eac
Update requirements.txt
TuTuHuss Jul 30, 2024
f062baa
polish(pu): polish algo_overview
puyuan1996 Aug 1, 2024
4c68ac8
feature(xcy): add Metadrive Env and its SEZ configs (#192)
HarryXuancy Aug 2, 2024
5de52bc
feature(hus): add self-hosted linux(ubuntu) ci runner (#259)
TuTuHuss Aug 2, 2024
3f6cb5a
polish(nyz): add metadrive version info
PaParaZz1 Aug 3, 2024
0064381
fix(pu): fix DownSample for different obs shape (#254)
puyuan1996 Aug 15, 2024
8300a52
feature(pu): add Sampled MuZero/UniZero, DMC env and related configs …
puyuan1996 Aug 18, 2024
9e372fd
fix(pu): fix np.asarray in sampled related buffer/policy
puyuan1996 Aug 19, 2024
0040f46
feature(wrh): add continuous in mt
ruiheng123 Aug 26, 2024
ac238e9
feature(wrh): add continuous in mt
ruiheng123 Aug 26, 2024
9201454
feature(wrh): add continuous in mt
ruiheng123 Aug 26, 2024
0358781
Merge remote-tracking branch 'origin/main' into dev-unizero-multitask…
puyuan1996 Aug 28, 2024
5871aa6
feature(pu): add init version of multitask_sampled_unizero pipeline f…
puyuan1996 Aug 28, 2024
5f95d52
feature(wrh): initial debug for cont
ruiheng123 Aug 29, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Prev Previous commit
Next Next commit
polish(nyz): add metadrive version info
PaParaZz1 committed Aug 3, 2024

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
commit 3f6cb5a6b2b31616b9902efff927e46714d2304f
3 changes: 2 additions & 1 deletion README.md
Original file line number Diff line number Diff line change
@@ -144,7 +144,8 @@ The environments and algorithms currently supported by LightZero are shown in th
| MiniGrid | --- | ✔ | ✔ | ✔ | 🔒 | 🔒 |✔|🔒 |
| Bsuite | --- | ✔ | ✔ | ✔ | 🔒 | 🔒 |✔|🔒 |
| Memory | --- | ✔ | ✔ | ✔ | 🔒 | 🔒 |✔|🔒 |
| SumToThree (billiards) | --- | 🔒 | 🔒 | ✔ | 🔒 | 🔒 |🔒|🔒 |
| SumToThree (billiards) | --- | 🔒 | 🔒 | ✔ | 🔒 | 🔒 |🔒|🔒 |
| MetaDrive | --- | 🔒 | 🔒 | ✔ | 🔒 | 🔒 | 🔒 |🔒 |


<sup>(1): "✔" means that the corresponding item is finished and well-tested.</sup>
5 changes: 5 additions & 0 deletions zoo/metadrive/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,5 @@
# Installation
```shell
pip install metadrive-simulator==0.3.0.1
pip install panda3d-simplepbr==0.9
```
4 changes: 2 additions & 2 deletions zoo/metadrive/env/drive_env.py
Original file line number Diff line number Diff line change
@@ -281,7 +281,7 @@ def reward_function(self, vehicle_id: str):

reward = 0.0
reward += self.config["driving_reward"] * (long_now - long_last) * lateral_factor * positive_road
reward += self.config["speed_reward"] * (vehicle.speed / vehicle.max_speed) * positive_road
reward += self.config["speed_reward"] * (vehicle.speed / vehicle.max_speed_km_h) * positive_road

step_info["step_reward"] = reward

@@ -391,4 +391,4 @@ def get_navigation_len(self, vehicle):
if hasattr(vehicle.navigation, 'u_turn_case'):
if vehicle.navigation.u_turn_case is True:
total_dist += 35
return total_dist
return total_dist