[Bug 1540407] Re: multipathd drops paths of a temporarily lost device
ChristianEhrhardt
1540407 at bugs.launchpad.net
Mon Feb 29 10:01:15 UTC 2016
Hi,
I couldn't reproduce this today.
The device stays in:
0:0:0:1074675728 sdb 8:16 active faulty offline
When I re-add it, it comes back online just fine.
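For anyone re-verifying, a small helper like the following can make the path states easier to compare before and after the adapter is removed. It is only a sketch: path_states is a name I made up, and it assumes the path lines look like the ones quoted in this bug (H:C:T:L, device, major:minor, then three state columns).

```shell
# Summarise path states from multipath topology text read on stdin.
# Prints one line per path: "<dev> <dm_state> <checker_state> <online_state>".
# Assumption: path lines contain "H:C:T:L dev maj:min state state state".
path_states() {
    awk '{
        for (i = 1; i + 5 <= NF; i++) {
            if ($i ~ /^[0-9]+:[0-9]+:[0-9]+:[0-9]+$/) {
                # $(i+1) is the device, $(i+2) is major:minor (skipped)
                print $(i + 1), $(i + 3), $(i + 4), $(i + 5)
                break
            }
        }
    }'
}
```

Typical use would be "multipath -ll | path_states" as root, once before detaching the adapter and once after re-attaching it.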
I used the z/VM approach of detaching the FCP adapter, together with the multipath.conf as initially reported.
I configured the devices via chzdev like this:
chzdev zfcp --enable 7200
chzdev zfcp --enable 7300
chzdev zfcp-lun --enable --online   <- warning: I enable ALL of them, you might want just some
We realized one difference from Thorsten's systems: I also had this PPA enabled:
sudo add-apt-repository ppa:xnox/nonvir
From there I installed these packages:
libzfcphbaapi0
zfcp-hbaapi-utils
Versions:
libzfcphbaapi0 2.1.1-0ubuntu1
zfcp-hbaapi-utils 2.1.1-0ubuntu1
multipath-tools-boot 0.5.0-7ubuntu15
Kernel 4.4.0-8-generic
I had a call with Thorsten, and he will re-verify with the latest version on his system.
If his case still fails, he might try it the way I configured it.
Other than that, Thorsten said he would like to see the merge to the
more recent upstream content that Ryan mentioned.
--
You received this bug notification because you are a member of Ubuntu
Foundations Bugs, which is subscribed to multipath-tools in Ubuntu.
https://bugs.launchpad.net/bugs/1540407
Title:
multipathd drops paths of a temporarily lost device
Status in multipath-tools package in Ubuntu:
New
Bug description:
== Comment: #0 - Thorsten Diehl <thorsten.diehl at de.ibm.com> - 2016-02-01 08:57:28 ==
# uname -a
Linux s83lp31 4.4.0-1-generic #15-Ubuntu SMP Thu Jan 21 22:19:04 UTC 2016 s390x s390x s390x GNU/Linux
# dpkg -s multipath-tools|grep ^Version:
Version: 0.5.0-7ubuntu9
# cat /etc/multipath.conf
defaults {
    default_features "1 queue_if_no_path"
    user_friendly_names yes
    path_grouping_policy multibus
    dev_loss_tmo 2147483647
    fast_io_fail_tmo 5
}
blacklist {
    devnode '*'
}
blacklist_exceptions {
    devnode "^sd[a-z]+"
}
---------------------------------------
On a z Systems LPAR with a single LUN, 2 zfcp devices, 2 storage ports, and the following multipath topology:
mpatha (36005076304ffc3e80000000000003050) dm-0 IBM,2107900
size=1.0G features='1 queue_if_no_path' hwhandler='0' wp=rw
`-+- policy='round-robin 0' prio=1 status=active
|- 0:0:0:1079001136 sda 8:0 active ready running
|- 0:0:1:1079001136 sdb 8:16 active ready running
|- 1:0:0:1079001136 sdc 8:32 active ready running
`- 1:0:1:1079001136 sdd 8:48 active ready running
I observed the following:
When I deconfigure one of the two zfcp devices (e.g. via chchp -c 0, or directly on the HMC), multipathd removes the two paths via this device from the path group after 10 seconds. When the zfcp device comes back, it runs through zfcp error recovery and is set up properly, and the mid-layer objects also look fine. However, multipathd does not add the paths to the path group again.
Expected behaviour: multipathd does not remove the paths from the
topology list, but holds them as "failed faulty offline" until the
dev_loss_tmo timeout is reached (which is infinite here).
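To check which timeouts are actually in effect on the FC remote ports, the sysfs attributes can be read directly. The following is only a sketch: show_rport_tmo is a made-up name, and it assumes the rports are exposed under /sys/class/fc_remote_ports with dev_loss_tmo and fast_io_fail_tmo attributes. Whether the values match multipath.conf depends on multipathd having applied them.

```shell
# Print dev_loss_tmo and fast_io_fail_tmo for each FC remote port.
# The optional first argument overrides the sysfs directory (assumed
# default: /sys/class/fc_remote_ports), which also helps with testing.
show_rport_tmo() {
    base="${1:-/sys/class/fc_remote_ports}"
    for rport in "$base"/rport-*; do
        # Skip the unexpanded glob when no rports exist
        [ -d "$rport" ] || continue
        printf '%s dev_loss_tmo=%s fast_io_fail_tmo=%s\n' \
            "$(basename "$rport")" \
            "$(cat "$rport/dev_loss_tmo" 2>/dev/null)" \
            "$(cat "$rport/fast_io_fail_tmo" 2>/dev/null)"
    done
}
```

Running show_rport_tmo without arguments lists one line per remote port.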
I already discussed this with zfcp development, and it most likely
looks like a problem with multipathd, rather than zfcp or the mid-layer.
Easy to reproduce: you need two zfcp devices, one LUN, and preferably
two ports on the storage server (WWPNs). Configure LUN via 2 zfcp
devices * 2 WWPNs = 4 paths.
This can also be reproduced on a z/VM guest. Instead of configuring the
CHPID off, just detach one zfcp device and re-attach it after 30 to 60
seconds. Same problem.
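The LPAR variant of the reproduction (configure the CHPID off, wait, configure it back on) could be scripted roughly as follows. This is a sketch under assumptions: bounce_chpid is a made-up name, the CHPID 0.40 used below is a placeholder, and the CHCHP variable exists only so the command can be replaced by a stub for a dry run.

```shell
# Configure a CHPID off, wait, and configure it back on.
# Usage: bounce_chpid <chpid> [seconds]   (default wait: 30 seconds)
# CHCHP may be set to a stub command for a dry run (assumption, not a
# chchp feature); by default the real chchp from s390-tools is called.
bounce_chpid() {
    chpid="$1"
    wait_s="${2:-30}"
    ${CHCHP:-chchp} -c 0 "$chpid" || return 1
    sleep "$wait_s"
    ${CHCHP:-chchp} -c 1 "$chpid"
}
```

For example, "bounce_chpid 0.40 30" followed by "multipath -ll" shows whether the paths have returned to the path group.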
To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/multipath-tools/+bug/1540407/+subscriptions