SIH-SITL integration tests #24237

mbjd · 2025-01-21T14:25:30Z

Overview

This is an attempt at using SIH-as-SITL for integration tests. All mavsdk integration tests pass, but still not 100% reliably.

This has grown to quite a large collection of changes. Here I summarise the main points. Apologies for the messy nature! If you want help or have any questions, don't hesitate to approach me.

This is mostly a proof of concept. Doubts about the implementation are highlighted with 🤔. Possibilities to proceed:

cherry pick small, self-contained changes from this to individual PRs
re-do this entire PR (or parts of it) from the ground up, learning from this here to arrive at a cleaner implementation
merge this PR directly, and improve the implementation subsequently
discard it entirely in favour of a deeper overhaul of the integration tests. it would be very desirable to integrate them more tightly with PX4, avoiding the unreliable communication layer in between.

Entry point

Tests are launched with the command test/mavsdk_tests/mavsdk_test_runner.py test/mavsdk_tests/configs/sitl_sih.json. The file sets config['simulator'] = sih, which causes mavsdk_test_runner.py to do the rest. Commit: 41c9d1f

The tests are ran as a github workflow thanks to the workflow file. Commit: dd882c2

🤔 hacky implementation with setting env vars in mavsdk_test_runner.py
🤔 bloated yml file with gazebo-related stuff. Not sure if the build system even allows for building tests without gazebo, maybe deeper changes there are needed and it is not worth it to save a couple seconds per CI run.

Sensor Failures

Many tests are about sensor failures. For gazebo, these are handled in SimulatorMavlink::check_failure_injections(), with the sensors models being contained in gazebo. However, in SIH, the sensors are simulated separately by modules such as SensorGpsSim etc. Therefore, we implement the sensor failures within those modules. They are mostly a copy of the logic in SimulatorMavlink::check_failure_injections(). They are triggered from the test suite (example) but can also be manually triggered from the MAVLink console with the failure command.

Commits: 9a801c6, 4056a5d, 75dd0ce, 55a195c, c209d11

🤔 code repetition... both in terms of repeating logic from SimulatorMavlink, but also almost shared logic between the Sensor...Sim modules

For the airspeed sensor, there is also an entirely different way to simulate failure, by setting the parameter SIM_ARSPD_FAIL (see #21126). With the implementation of "proper" failures as used in the test suite this is not necessary anymore but is still possible.

Sensor Implementation

SIH itself also implements an airspeed sensor. Here we switch it off (a2a6c35) in favour of SensorAirspeedSim, where we implement the sensor failures. Notice that SensorAirspeedSim is only started if the airframe file contains param set-default SENS_EN_ARSPDSIM 1 (7576725, 96105ca) and the launch script considers that parameter (55a195c). Be sure to only apply those changes together to avoid conflicting or missing sensor data.

SensorAirspeedSim calculated the airspeed previously as dot([1, 0, 0], velocity), corresponding to the assumption that the pitot tube is mounted in body-forward direction. This is incorrect for tailsitters, in which case body-forward points forward in hovering/tail-sitting pose, but downwards in forward flight, where airspeed measurements are needed. As a simple fix, we now instead calculate airspeed as norm(velocity) (same as in SIH). Strictly speaking this is incorrect, as sideways components of the velocity now also contribute to the airspeed, which with an acutal pitot tube measurement would not happen. But in well-behaved fixed wing flight those should be small enough to ignore, and in line with the modelling accuracy standards for SIH generally. Commit: 7576725

Deterministic Simulation

To exclude the effect of sensor noise, we add a parameter SIH_NOISE_SCALE which when set to 0 disables sensor noise. The simulation is still not entirely deterministic due to timing. Commits: 4666a97, 1470a1c

🤔 This was mostly implemented as a debugging tool. If merging, we could probably drop it unless someone has a strong need.
💡 Very cool would also be to have a deterministic PRNG for reproducibility. Jax does this very nicely in the context of multiple threads but with some restrictions (pure functions i.e. no global state). But again, the advantage is probably not worth the effort.

Timing bugfixes

When running at large speed factors the tests were previously very unreliable. This was due to timing issues: When waiting for some condition or sleeping for a given amount of simulation time, this is done with real-time polling. However, if running with larger and larger speed factor, polling with the same speed (in real-time) becomes more and more inaccurate. In extreme cases this led to timeouts, with tests failing for example because the VTOL transition took too long, even though the log clearly showed that the transition occured faster than the prescribed timeout.

We alleviate this by increasing the polling frequency with the speed factor, so it stays equally fast in simulation time across all speed factors (if the simulation is slower than the set speed factor, polling becomes quicker w.r.t. simulation time. This should not be problematic unless we set extremely high, unrealistic speed factors like 10000000, in which case polling will hog the CPU). Commit: e16bf22

Also, some tests used real-time sleep statements, leading to obvious timing issues. Adapted to sim-time sleep here: 5a03cfb, PR here: #24266

🤔 Probably even sim-time sleep statements in tests are not a great idea and there should be better alternatives such as waiting explicitly until the condition we actually depend on is satisfied (or failing if not).
🤔 Fundamentally, is polling the right appproach to check conditions like these? Every relevant condition will only change once per simulation step. Can we couple the checks to that? Check not every x (real-time) milliseconds, but once at every lockstep simulation step? Or setup some interrupts for the relevant message directly?

Model & parameter changes

It is quite common for airframe files to be self-contradictory because some physical model parameters are redundant (e.g. SIH_T_MAX follows from CA_ROTOR*_CT, SIH_Q_MAX from CA_ROTOR*_KM, SIH_L_ROLL and SIH_L_PITCH from CA_ROTOR*_PX and CA_ROTOR*_PY). I corrected some of the contradictions here (350f160, 96105ca, 6779c70) but most likely I missed other ones. In the long term it would be very cool to have a minimal, non-redundant set of parameters that avoids this issue altogether and removes an opportunity for the user to mess up.

Standard VTOL: Magic number in sih.cpp is now explained as the factor by which the pusher motor is stronger than the MC motors: a2a6c35. Still hardcoded though.

Tailsitter: Channel reversal moved to SIH model to clean up config file: dd882c2

🤔 this will break all other SIH tailsitter airframes. in the PX4 repo atm there is only 1102, but externally people might have additional ones. If merging, think of some way to handle this smoothly.

Test changes

We adapt some test slightly to be more lenient w.r.t. model and tuning variations.

a2a6c35

larger position tolerance for MC offboard control
Higher takeoff altitude for VTOL Loiter with airspeed failure (standard VTOL)
- For some configurations, the "wrong" (= too low) airspeed failure causes quite a significant pitch down (caused by TECS wanting to gain airspeed after reading the lower airspeed), before the failure being detected and corrected for. This pitch down is more severe with lower airspeed, and the SIH model flies at lower airspeed than the gazebo one. We also increase airspeed for this airframe in this commit.

🤔 Passing tests should also be possible to achieve by only changing the model towards the gazebo version which passes tests.... The gazebo models are sometimes very different than the SIH ones we are using here in terms of size/mass, so it's not entirely surprising that tests designed for the former sometimes fail with the latter.

also, fix tailsitter model so actuator signs follow convention

airspeed, baro, and mag off. agp has no failures, and other failure types like stuck etc not implemented at all. doubting this implementation, lots of repetitions and almost-repetitions. maybe we should do it more similar to SimulatorMavlink.cpp? Where simulator_sih handles the sensor failures itself, with one big if mess, but the rest of it nice and lean. failing tests for quadx: - 'Continue on baro stuck during mission (baro height mode)': failed - 'Continue on baro stuck during mission (GPS height mode)': failed - 'Fly straight Multicopter Mission': failed - 'Offboard takeoff and land': failed - 'Offboard position control': failed

this makes SIH artificial sensor noise smaller or larger. useful for deterministic-ish sim. setting it to 0 does not work right now for mysterious reasons.

Previously, DataValidator would automatically reject sensor data as stale if (almost) constant. But if setting SIH_NOISE_SCALE = 0 for deterministic sim, constant sensor data are to be expected. This adds logic to not flag sensor values as stale if noise scale is very small. If the sensor has *actually* gone stale with very low noise scale, this means we cannot detect it anymore.

AFAIK this is not done anywhere else for SIH-as-SITL. Just below for gazebo it is already present. the corresponding check for SIH-on-FC happens at ROMFS/px4fmu_common/init.d/rcS:388 - also no airspeed is enabled anywhere. is that intended? also, handle airspeed failure correctly in SensorAirspeedSim.cpp

previously I removed this assuming that it would add the forward thrust as a resulting force. but it just considers its influence on the wing segment.

the original std::this_thread::sleep_for is with respect to host system time which is different from autopilot time if speed factor != 1, if some component runs slower than real time, or if debugging. tester.sleep_for (which already existed) correctly sleeps w.r.t. px4/simulation time.

maybe this is a band-aid fix, and we actually should make sure the transition can be achieved in the given 5 seconds, as it is very reliably in the corresponding gazebo test case. check: - does SIH_T_MAX make sense? - does SIH_MASS make sense? - and any other model parameters which are suspicious

- higher airspeed targets - higher pusher thrust - higher takeoff attitude (arsp failure test) in the airspeed failure case, the lowered airspeed reading causes the controller to want to speed up until the (wrong) airspeed is at the setpoint, i.e. the real airspeed quite a bit faster. without these fixes, tecs does a nosedive to regain airspeed, but never becomes fast enough (with already maxed out pusher thrust). we pervent this with the first two adaptations, while the last one gives space for the remaining nosedive. fix this permanently by: - ensuring the model makes sense definitely (pusher thrust, mass, aero properties) - noticing the failure faster - adapting tecs so it doesn't nosedive? - stop sending airspeed in sih src/modules/simulation/sensor_airspeed_sim/SensorAirspeedSim.cpp is already doing that. in the sensor failure case (wrong), that implements the failure, but the SIH one not, giving conflicting data. so switch it off - larger acceptance radius quadx position mode is just not that accurate. is it a control/model mismatch problem, or a simulation problem? - wait longer for disarm in test_vtol_rtl this would fail at large speed factors previously. timing bug or acceptable small variation?

SensorAirspeedSim has hardcoded pitot tube direction in body front velocity. for tailsitter, this points DOWN in forward flight, thus the simulated sensor would read 0 in forward flight for tailsitters. fixed by using norm(velocity) instead like in sih.

If sim is running faster, we should also poll faster to keep the same timing resolution. This fixed quite a large number of test failures.

mbjd force-pushed the sih_sitl_testing branch from b5c22e9 to ca166bf Compare January 21, 2025 14:35

mbjd requested review from dagar, Jaeyoung-Lim, sfuhrer, KonradRudin and MaEtUgR January 23, 2025 11:06

mbjd force-pushed the sih_sitl_testing branch 5 times, most recently from 1111095 to f2461db Compare January 30, 2025 08:58

mbjd added 19 commits January 30, 2025 10:39

add svtol airframe for sih-in-sitl

5d72a47

update output docstring

268865d

remove confusing 'SVTOL' abbreviation

8e07b4e

(very wip) sitl-sih integration tests

41c9d1f

add workflow for sih-sitl tests

dd882c2

also, fix tailsitter model so actuator signs follow convention

first sensor failure

9a801c6

implement baro stuck failure

75dd0ce

don't --abort-early

a94928e

make tailsitter airframes less wrong

350f160

model mismatch, more conservative speed factor

6779c70

add noise scale parameter

4666a97

this makes SIH artificial sensor noise smaller or larger. useful for deterministic-ish sim. setting it to 0 does not work right now for mysterious reasons.

clean up comments

44e8c4b

format

dbaf31b

implement additional failure: airspeed wrong

c209d11

specify timeout duration to wait for disarm

5151c90

consider thrust in aero model

3001b2f

previously I removed this assuming that it would add the forward thrust as a resulting force. but it just considers its influence on the wing segment.

mbjd added 2 commits January 30, 2025 10:39

cleanup

ac9145a

mbjd force-pushed the sih_sitl_testing branch from f2461db to ac9145a Compare January 30, 2025 09:41

mbjd added 4 commits January 30, 2025 13:03

attempt at fixing xvert

7576725

SensorAirspeedSim has hardcoded pitot tube direction in body front velocity. for tailsitter, this points DOWN in forward flight, thus the simulated sensor would read 0 in forward flight for tailsitters. fixed by using norm(velocity) instead like in sih.

AutopilotTester: Speed up polling w speed factor

e16bf22

If sim is running faster, we should also poll faster to keep the same timing resolution. This fixed quite a large number of test failures.

mbjd force-pushed the sih_sitl_testing branch from 7202b97 to e16bf22 Compare February 3, 2025 11:12

mbjd mentioned this pull request Feb 4, 2025

mavsdk tests: use tester sleep_for function #24266

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SIH-SITL integration tests #24237

SIH-SITL integration tests #24237

mbjd commented Jan 21, 2025 •

edited

Loading

SIH-SITL integration tests #24237

Are you sure you want to change the base?

SIH-SITL integration tests #24237

Conversation

mbjd commented Jan 21, 2025 • edited Loading

Overview

Entry point

Sensor Failures

Sensor Implementation

Deterministic Simulation

Timing bugfixes

Model & parameter changes

Test changes

mbjd commented Jan 21, 2025 •

edited

Loading