SRU request for LP#208551
Stefan Bader
stefan.bader at canonical.com
Wed Sep 10 21:55:51 UTC 2008
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
Colin Ian King wrote:
> https://bugs.launchpad.net/ubuntu/hardy/+source/linux/+bug/208551
>
> SRU justification:
>
> Impact: mdadm, Raid5 get stuck in uninterruptable sleep under heavy I/O
> load. Copying data to a Raid 5 XFS partition results in a permanent lock
> on several processes related to it, getting stuck in the D(+) state.
> Occurs when large quantities of data (10-40 GB) is copied, resulting in
> processes being unkillable, and the system cannot reboot and requires
> power cycling the server.
>
> Fix: The patch from commit 6ed3003c19a96fe18edf8179c4be6fe14abbebbc. The
> fix is to not make any generic_make_request() calls in raid5
> make_request until all waiting has been done. We do this by simply
> setting STRIPE_HANDLE instead of calling handle_stripe(). This causes a
> performance hit, so this patch also only calls raid5_activate_delayed()
> at unplug time, never in raid5. This seems to bring back the
> performance numbers. [quoting the commit message]
>
> Testing: Without the patch, Raid 5 using md on an XFS filesystem locks
> up under heavy data copying - this is repeatable. With the patch, the
> lock up does not occur.
>
> Patch tested in my PPA by Andrew Cholakian
> https://bugs.launchpad.net/ubuntu/hardy/+source/linux/+bug/208551/comments/16
> on 2 64 bit servers.
>
> Patch attached.
>
ACK. (Note: patch also in 2.6.26.y)
- --
When all other means of communication fail, try words!
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.6 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org
iD8DBQFIyEJnP+TjRTJVqvQRAsC/AKDYFbuSXjXnyZc2M9EXp4CGcmLhjQCfRFoD
DNOjyaLabRvI9E8b1ifvuiY=
=xe8x
-----END PGP SIGNATURE-----
More information about the kernel-team
mailing list