resources/tutorials/example for creating custom reward function? #218

Answered by Mark2000
MicahJSherry asked this question in Q&A
Here's what I would do:

  • Data: Contains the current difference between the actual orbit and the desired orbit (in classical orbital elements, Cartesian coordinates, or whatever representation you prefer). Adding two units of data would just produce the one with the most recent measurement (you may need to track the measurement time in the data to do this).
  • DataStore: get_log_state would return the current OEs/coordinates. compare_log_states would only use the second argument (new_state) to generate an instance of your Data class that contains the difference between the state and the desired state.
  • Rewarder: calculate_reward receives a dictionary with the satellite names and newest data, and needs to return a dictionary with satellite names…
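
To make the three pieces above concrete, here is a minimal sketch in plain Python. It deliberately does not import the library: the classes are stand-ins that only mirror the method names mentioned above (`get_log_state`, `compare_log_states`, `calculate_reward`), so in real code you would subclass the library's own base classes instead. The class names, the `read_state` callable, the toy 2D Cartesian state, and the reward formula (negative norm of the orbit error) are all assumptions made for illustration.

```python
import math
from dataclasses import dataclass


@dataclass
class OrbitErrorData:
    """Hypothetical Data unit: orbit error plus the time it was measured."""
    error: tuple = ()
    time: float = -math.inf  # "no measurement yet" sorts before any real one

    def __add__(self, other):
        # Addition keeps only the most recent measurement.
        return self if self.time >= other.time else other


class OrbitErrorStore:
    """Hypothetical DataStore stand-in for one satellite."""

    def __init__(self, read_state, desired):
        self.read_state = read_state  # assumed callable returning (time, state)
        self.desired = desired        # desired state, same representation

    def get_log_state(self):
        # Return the current OEs/coordinates together with the time.
        return self.read_state()

    def compare_log_states(self, old_state, new_state):
        # Only new_state is used; old_state is ignored on purpose.
        time, state = new_state
        error = tuple(x - d for x, d in zip(state, self.desired))
        return OrbitErrorData(error=error, time=time)


class OrbitErrorRewarder:
    """Hypothetical Rewarder stand-in."""

    def calculate_reward(self, new_data_dict):
        # Assumed reward: negative Euclidean norm of the orbit error.
        return {
            name: -math.sqrt(sum(e * e for e in data.error))
            for name, data in new_data_dict.items()
        }


# Usage with a toy 2D Cartesian state in km (values are made up).
store = OrbitErrorStore(lambda: (10.0, (7003.0, 4.0)), desired=(7000.0, 0.0))
older = OrbitErrorData(error=(1.0, 0.0), time=5.0)
newest = older + store.compare_log_states(None, store.get_log_state())
rewards = OrbitErrorRewarder().calculate_reward({"sat-1": newest})
print(rewards)
```

Storing the time in the data is what makes the "keep the most recent measurement" addition well defined, regardless of the order in which units are summed.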

Answer selected by MicahJSherry