[Bug 1891567] Re: [SRU] ceph_osd crash in _committed_osd_maps when failed to encode first inc map
Launchpad Bug Tracker
1891567 at bugs.launchpad.net
Wed Aug 19 14:44:42 UTC 2020
This bug was fixed in the package ceph - 15.2.3-0ubuntu3
---------------
ceph (15.2.3-0ubuntu3) groovy; urgency=medium
* d/control: Drop BD on obsolete cython (LP: #1891820).
ceph (15.2.3-0ubuntu2) groovy; urgency=medium
* d/p/fix-crash-in-committed-osd-maps.patch: Fix ceph-osd crash
when processing osd map updates (LP: #1891567).
-- Corey Bryant <corey.bryant at canonical.com> Mon, 17 Aug 2020 13:46:06
-0400
** Changed in: ceph (Ubuntu Groovy)
Status: Fix Committed => Fix Released
--
You received this bug notification because you are a member of Ubuntu
OpenStack, which is subscribed to Ubuntu Cloud Archive.
https://bugs.launchpad.net/bugs/1891567
Title:
[SRU] ceph_osd crash in _committed_osd_maps when failed to encode
first inc map
Status in Ubuntu Cloud Archive:
Triaged
Status in Ubuntu Cloud Archive ussuri series:
Triaged
Status in Ubuntu Cloud Archive victoria series:
Triaged
Status in ceph package in Ubuntu:
Fix Released
Status in ceph source package in Focal:
Triaged
Status in ceph source package in Groovy:
Fix Released
Bug description:
[Impact]
Upstream tracker: issue#46443 [0].
The ceph-osd service can crash when processing osd map updates.
When the osd encounters a CRC error while processing an incremental
map update, it will request a full map update from its peers. In this
code path, an uninitialized variable was recently introduced and that
will get de-referenced causing a crash.
The uninitialized variable was introduced in nautilus 14.2.10, and
octopus 15.2.1.
[Test Case]
# Inject osd_inject_bad_map_crc_probability = 1
sudo ceph daemon osd.{id} config set osd_inject_bad_map_crc_probability 1
# Trigger some osd map updates by restarting a different osd
sudo systemctl restart osd@{diff-id}
[Other Info]
Upstream has released a fix for this issue in Nautilus 14.2.11. The SRU for this point release is being tracked by LP: #1891077
Upstream has merged a fix for this issue in Octopus [1], but there is
no current release target. The ceph packages in focal, groovy, and the
ussuri cloud archive are exposed to this critical regression.
[0] https://tracker.ceph.com/issues/46443
[1] https://github.com/ceph/ceph/pull/36340
To manage notifications about this bug go to:
https://bugs.launchpad.net/cloud-archive/+bug/1891567/+subscriptions
More information about the Ubuntu-openstack-bugs
mailing list