Property is not satisfied in Learning Queries #241
-
Dear UPPAAL Team, I trust this message reaches you in good health. I have been working on a system model using UPPAAL The model primarily comprises two templates: software and hardware. I am encountering an issue with strategy synthesis. While the query Your assistance is greatly appreciated. Best regards. |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
When the In your particular case, you meet the timing-control limitation of the learning algorithm; namely that the learning algorithm only can learn non-lazy controller strategies. In the The are a couple of ways to get around it, the most common is to add a scheduler component (completely in the environments control) that emits "ticks". Each tick moves the These particular restrictions are discussed way back in the early papers of stratego: Uppaal Stratego and On Time with Minimal Expected Cost!. It is an interesting problem to study the extension of the learning algorithms to also deal with continuous and timed action spaces. |
Beta Was this translation helpful? Give feedback.
When the
minE
query fails, it implies that the learning algorithm cannot generate sufficient samples to train that validate the "sample termination objective" (in your case<> software.u4
.In your particular case, you meet the timing-control limitation of the learning algorithm; namely that the learning algorithm only can learn non-lazy controller strategies.
In the
software
component, the control atu1
,u2
andu3
all require the controller to propose the particular delay that should transpire before the edge is used. This is exactly a "lazy" behavior; namely that the controller postpones the use of an edge.The result is that the learning algorithm proposes "do nothing" in e.g.
software.u1
…