[Bug 2121931] Re: OSD crash in OSD::tick_without_osd_lock() due to race condition
Launchpad Bug Tracker
2121931 at bugs.launchpad.net
Wed Sep 3 10:51:21 UTC 2025
Status changed to 'Confirmed' because the bug affects multiple users.
** Changed in: ceph (Ubuntu Noble)
Status: New => Confirmed
--
You received this bug notification because you are a member of Ubuntu
OpenStack, which is subscribed to ceph in Ubuntu.
https://bugs.launchpad.net/bugs/2121931
Title:
OSD crash in OSD::tick_without_osd_lock() due to race condition
Status in ceph package in Ubuntu:
Confirmed
Status in ceph source package in Noble:
Confirmed
Status in ceph source package in Plucky:
Confirmed
Status in ceph source package in Questing:
Confirmed
Bug description:
OSDs access and modify epoch maps from multiple threads without
holding a lock. This leads to a race condition and results in a crash
due to iterator invalidation.
Typical stack trace looks like:
Jul 05 01:34:19 ps7-ra1-n1 ceph-osd[2083916]: *** Caught signal (Segmentation fault) **
Jul 05 01:34:19 ps7-ra1-n1 ceph-osd[2083916]: in thread 7e821b800640 thread_name:safe_timer
Jul 05 01:34:19 ps7-ra1-n1 ceph-osd[2083916]: ceph version 19.2.0 (16063ff2022298c9300e49a547a16ffda59baf13) squid (stable)
Jul 05 01:34:19 ps7-ra1-n1 ceph-osd[2083916]: 1: /lib/x86_64-linux-gnu/libc.so.6(+0x42520) [0x7e8226442520]
Jul 05 01:34:19 ps7-ra1-n1 ceph-osd[2083916]: 2: (std::_Rb_tree_decrement(std::_Rb_tree_node_base const*)+0xe) [0x7e82268c65ee]
Jul 05 01:34:19 ps7-ra1-n1 ceph-osd[2083916]: 3: (OSD::tick_without_osd_lock()+0x4ac) [0x5e326eb66c6c]
Jul 05 01:34:19 ps7-ra1-n1 ceph-osd[2083916]: 4: (Context::complete(int)+0xd) [0x5e326eb8971d]
Jul 05 01:34:19 ps7-ra1-n1 ceph-osd[2083916]: 5: (CommonSafeTimer<std::mutex>::timer_thread()+0x12d) [0x5e326f1f64ed]
Jul 05 01:34:19 ps7-ra1-n1 ceph-osd[2083916]: 6: (CommonSafeTimerThread<std::mutex>::entry()+0x11) [0x5e326f1f7991]
Jul 05 01:34:19 ps7-ra1-n1 ceph-osd[2083916]: 7: /lib/x86_64-linux-gnu/libc.so.6(+0x94ac3) [0x7e8226494ac3]
Jul 05 01:34:19 ps7-ra1-n1 ceph-osd[2083916]: 8: /lib/x86_64-linux-gnu/libc.so.6(+0x126850) [0x7e8226526850]
Jul 05 01:34:19 ps7-ra1-n1 ceph-osd[2083916]: 2025-07-05T01:34:19.417+0000 7e821b800640 -1 *** Caught signal (Segmentation fault) **
Jul 05 01:34:19 ps7-ra1-n1 ceph-osd[2083916]: in thread 7e821b800640 thread_name:safe_timer
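The std::_Rb_tree_decrement frame in the trace above is consistent with a thread stepping backwards through a std::map (for example via rbegin()) while another thread inserts or erases entries. The following is a minimal sketch of the general remedy, which is to guard every access with one mutex. It is not the upstream patch. EpochTracker, record(), trim(), newest() and stress() are hypothetical names made up for illustration and do not come from the Ceph source.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <map>
#include <mutex>
#include <optional>
#include <thread>

// Hypothetical stand-in for a per-epoch map shared between threads.
// All names are illustrative; none are taken from the Ceph source.
class EpochTracker {
public:
  // Writer path: note that an epoch has been seen.
  void record(uint32_t epoch) {
    std::lock_guard<std::mutex> l(lock_);
    ++counts_[epoch];
  }

  // Writer path: drop every entry older than `oldest_kept`.
  void trim(uint32_t oldest_kept) {
    std::lock_guard<std::mutex> l(lock_);
    counts_.erase(counts_.begin(), counts_.lower_bound(oldest_kept));
  }

  // Reader path (think: a periodic tick): newest epoch, if any.
  // Dereferencing rbegin() decrements an iterator internally, so it
  // must not run concurrently with insert/erase on another thread.
  std::optional<uint32_t> newest() const {
    std::lock_guard<std::mutex> l(lock_);
    if (counts_.empty()) return std::nullopt;
    return counts_.rbegin()->first;
  }

  std::size_t size() const {
    std::lock_guard<std::mutex> l(lock_);
    return counts_.size();
  }

private:
  mutable std::mutex lock_;         // protects counts_
  std::map<uint32_t, int> counts_;  // epoch -> number of times seen
};

// Run two writers and one reader concurrently, then return the newest
// epoch. Without the mutex above, this access pattern is a data race.
inline std::optional<uint32_t> stress(EpochTracker& t, uint32_t n) {
  std::thread writer([&] {
    for (uint32_t e = 1; e <= n; ++e) t.record(e);
  });
  std::thread trimmer([&] {
    for (uint32_t e = 1; e <= n; ++e) t.trim(e > 8 ? e - 8 : 0);
  });
  std::thread ticker([&] {
    for (uint32_t i = 0; i < n; ++i) (void)t.newest();
  });
  writer.join();
  trimmer.join();
  ticker.join();
  assert(t.size() <= n);  // at most one entry per recorded epoch
  return t.newest();
}
```

Calling stress(tracker, 1000) from a small main() exercises the locked paths. Removing the lock_guard lines and building with -fsanitize=thread should make the race visible, although whether a given run actually crashes depends on timing.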
This has been fixed upstream, and a backport to Squid is in progress.
Upstream bug tracker: https://tracker.ceph.com/issues/66819
Upstream patch: https://github.com/ceph/ceph/pull/62916
Bug tracker for Squid: https://tracker.ceph.com/issues/72070
Backport patch for Squid: https://github.com/ceph/ceph/pull/64732
To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/ceph/+bug/2121931/+subscriptions