-
Notifications
You must be signed in to change notification settings - Fork 1.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
tb3_simulation remains idle with no movement after long hours of running #1955
Comments
Tried generating backtrace the prefix is not giving any gdb session prompt after a crash. Trying to use @facontidavide 's suggestion of using backward-cpp @SteveMacenski |
WARNING: Be aware that are nodes in the graph that share an exact name, this can have unintended side effects. is this normal @SteveMacenski ? |
This is fixed in 606ca7a - but hasn't been backported yet to Foxy |
@mikeferguson any issues when ported to foxy? Ill check it up. When is the next update planned? Also the changes were done only for waypoint? |
Should be no issues - I'm mainly developing off the foxy branch and did test it there. |
/rviz2 |
No - that seems like a different issue - does rviz2 not start up with a randomized name? |
Now it seems to weird. @mikeferguson |
@rightbot-abhinav This is probably a local issue to you, especially given the other tickets you've been filing. I'd suggest slowing down and thinking things through more before reacting. Changing a ticket from Server A crashing to Server B crashing shows some issue you should think on more than just changing the name. Please find a backtrace, I provide this great tutorial about it https://navigation.ros.org/tutorials/docs/get_backtrace.html. We can't do anything with a crash report that others can't reproduce. I'm going on a hunch that its the same issue you had in your last ticket with DDS issues around message filters. You're also running foxy source on 18.04, which is unsupported. Foxy targets 20.04 and the versions of critical libraries might not be compatible. |
@SteveMacenski thanks for the info. Sure will take this into consideration. |
Could you please confirm if after checking everything (please make sure you are using the latest Foxy sources), and changing back to Fast DDS everything works? |
@JaimeMartin the earlier crashing issue was solved just by changing the middleware to cyclone dds (#1950) but the current issue persists as @SteveMacenski it might be something local to my system/build and not linked to anything else |
Very similar to #1889 Stable initially but leaving it idle for some time causes the below |
I haven't really had time to dig too much more into it. I just added a restart button to my system as a temporary remedy but, it is still an issue for me. |
Do you know if these two messages are close together? Like, does the message filter drop and then you crash? |
@mikeferguson I get a stream of message filter drop and then this occurs like this |
@mikeferguson I think the key line is
Anytime I see RCL errors that's not a good situation. The next line:
I think is the generic exception handling from the laser scan coming into the costmap in the planner server by message filters. My guess is that is actually catching the exception thrown by rcl or something. Then crash. We definitely need a traceback @rightbot-abhinav |
@SteveMacenski I will try to post the traceback. In the meanwhile I made a clone of the system on a supported OS (Focal). The same issue is occurring. Let me know if you want to open the issue |
I'm not sure the best plan here - this isn't something that anyone else can reproduce so there's not a good next step unless we have a backtrace to work from |
@SteveMacenski valid point will add the backtrace. Would be great if you could enlighten on this
|
Please see our tutorial which has instructions for this https://navigation.ros.org/tutorials/docs/get_backtrace.html |
This issue has been resolved. Thanks a ton @daisukes for the fix to the issue ros2/rclcpp#1266. the behavior observed was very similar. The stack has been running overnight without any error's :) A problem still persists for which @Michael-Equi I have been observing that you have the same issues which I was facing would be great if you could integrate the above commits to your code base and test it out ! Thanks @SteveMacenski as you had mentioned rightly it had nothing to do with nav stack and I had removed the plugins Navfn and DWB and yet it occurred. Thanks for your help ! |
What is "this"? That doesn't seem to relate to this project's branches. |
Yeah this is not related to this project's branch. I was wrong to mention it here. It's a fix for the rclcpp repo master branch and has not been ported to the foxy branch. I incorporated the changes and it solved the issue for me. Thanks ! Link: ros2/rclcpp#1266 |
**Bug report
Required Info:**
**Operating System:
Ubuntu 18.04
ROS2 Version:
Foxy source
Version or commit hash:
DDS implementation:
PLATFORM INFORMATION
system : Linux
platform info : Linux-4.9.140-tegra-aarch64-with-Ubuntu-18.04-bionic
release : 4.9.140-tegra
processor : aarch64
RMW MIDDLEWARE
middleware name : rmw_cyclonedds_cpp
ROS 2 INFORMATION
distribution name : foxy
distribution type : ros2
distribution status : active
release platforms : {'ubuntu': ['focal']}
ROS 2 INFORMATION
distribution name : foxy
distribution type : ros2
distribution status : active
release platforms : {'ubuntu': ['focal']}
gazebo version 11 installed from binary
Steps to reproduce issue
ros2 launch nav2_bringup tb3_simulation_launch.py
leave the simulation idle for some time around 30 mins
try to give a navigation waypoint
Expected behavior
nav2bringup should run continuously after a certain idle time without any issues
Actual behavior
ros2 launch nav2_bringup tb3_simulation_launch.py
Since planner server process exits the goalsset are not achieved and it remains idle
Additional information**
[planner_server-9] [ERROR] [1597850629.255140273] [rcl]: Failed to get trigger guard condition in jump callback
[planner_server-9] [INFO] [1597850631.461968287] [global_costmap.global_costmap_rclcpp_node]: Message Filter dropping message: frame 'base_scan' at time 255.201 for reason 'Unknown'
[ERROR] [planner_server-9]: process has died [pid 14670, exit code -7, cmd '/home/warry/navigation2_ws/install/nav2_planner/lib/nav2_planner/planner_server --ros-args -r __node:=planner_server --params-file /tmp/tmpzu66z_32 -r /tf:=tf -r /tf_static:=tf_static'].
The text was updated successfully, but these errors were encountered: