[Bug 1970460] Re: [SRU] Avoid premature onode release

Launchpad Bug Tracker 1970460 at bugs.launchpad.net
Mon May 23 19:18:16 UTC 2022


Status changed to 'Confirmed' because the bug affects multiple users.

** Changed in: ceph (Ubuntu)
       Status: New => Confirmed

-- 
You received this bug notification because you are a member of Ubuntu
OpenStack, which is subscribed to ceph in Ubuntu.
https://bugs.launchpad.net/bugs/1970460

Title:
  [SRU] Avoid premature onode release

Status in ceph package in Ubuntu:
  Confirmed

Bug description:
  [Impact]

   OSDs crash at randomly due to race condition that can occur
   at times.

   This was observed when onode's removal is followed by reading
   and the latter causes object release before the removal is finalized.
   The root cause is an improper 'pinned' state assessment in Onode::get().

  [Test Plan]

   Deploy a ceph cluster and do write some data to the cluster.
   While performing some reads again from the cluster, no crashes
   are seen in any OSDs. The race condition can be mimicked
   by holding one thread (under debugger) while the other one
   continues to update 'nput' counter.

  [Where problems could occur]

   Despite the new atomic counter it might not be cover cases
   and still introduce further data race and/or crashes continue
   to happen.

  [Other Info]

  It's been backported to relevant releases upstream (Octopus, Pacific, and
  Quincy). Octopus 15.2.16 has the fix. So does Quincy 17.2.0. However,
  the latest Pacific release missed out this fix. So SRU is needed for
  Pacific (only).

  Master tracker: https://tracker.ceph.com/issues/53002

  Pacific tracker: https://tracker.ceph.com/issues/53608

  Pacific PR: https://github.com/ceph/ceph/pull/44723

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/ceph/+bug/1970460/+subscriptions




More information about the Ubuntu-openstack-bugs mailing list