[Bug 1163513] [NEW] dbus/udev send NFS client connection out out the wrong interface

Brian Morris 1163513 at bugs.launchpad.net
Tue Apr 2 18:40:41 UTC 2013


Public bug reported:

I apologize for being vague , but I am not certain which package is
responsible for this bug. Both dbus and udev are the most likely
candidates as both contain recent change logs that mention "Interface
00" binding. Both were upgraded during the application of all waiting
system updates for Ubuntu Server 12.04.2 LTS on 03/22/2013, and the
previous patch application before that was mid-December 2012.

After applying all waiting updates on 03/22 we began to notice that MPI
jobs were failing on our HPC clusters. This behavior was traced down to
NFS (which was not updated in this patch cycle) clients improperly
routing traffic. These clients use NFS on a VLAN that is unique to each
cluster, which is only accessible to each node on their eth1 interface.
On the nodes that would still work we noticed that the NFS mount had
bound to the client address (clientaddr) of eth1, and on the nodes that
were no longer functioning they had bound to '0.0.0.0' instead. This
behavior was proven by finding a high count of discarded packets at hop
1, as the upstream router is not aware of each cluster's internal VLAN.

I could not get this behavior to change simply by remounting, but it can
definitely change after a reboot. For one 'up' cycle the NFS mount may
bind to eth1, and on the next it may bind to 0.0.0.0 instead. Manually
configuring each node's NFS mounts in /etc/fstab to use the client
address of eth1 (clientaddr=<eth1's address>) corrects the issue, but
this extra configuration has never been needed in the past, nor can I
find any documentation regarding this change.

Here is the complete list of what was updated in this patch cycle:

dmsetup
apparmor
apport
apt
apt-transport-https
apt-utils
base-files
bind9-host
dnsutils
dpkg
gir1.2-gudev-1.0
gnupg
gpgv
grub-common
grub-pc
grub-pc-bin
grub2-common
initramfs-tools
initramfs-tools-bin
iptables
isc-dhcp-client
isc-dhcp-common
language-pack-en
language-pack-en-base
language-selector-common
libapt-inst1.4
libapt-pkg4.12
libbind9-80
libdbus-glib-1-2
libdevmapper1.02.1
libdns81
libdrm-intel1
libdrm-nouveau1a
libdrm-radeon1
libdrm2
libfreetype6
libgnutls26
libgudev-1.0-0
libisc83
libisccc80
libisccfg82
liblwres80
libmysqlclient18
libnih-dbus1
libnih1
libpciaccess0
libplymouth2
libssl1.0.0
libudev0
libxml2
linux-headers-server
linux-image-server
linux-server
man-db
mountall
mysql-common
openssl
perl
perl-base
perl-modules
plymouth
Plymouth-theme-ubuntu-text
postfix
puppet
puppet-common
python-apport
python-problem-report
sudo
ubuntu-minimal
ubuntu-standard
udev
upstart
x11-common

** Affects: dbus (Ubuntu)
     Importance: Undecided
         Status: New

-- 
You received this bug notification because you are a member of Ubuntu
Foundations Bugs, which is subscribed to dbus in Ubuntu.
https://bugs.launchpad.net/bugs/1163513

Title:
  dbus/udev send NFS client connection out out the wrong interface

Status in “dbus” package in Ubuntu:
  New

Bug description:
  I apologize for being vague , but I am not certain which package is
  responsible for this bug. Both dbus and udev are the most likely
  candidates as both contain recent change logs that mention "Interface
  00" binding. Both were upgraded during the application of all waiting
  system updates for Ubuntu Server 12.04.2 LTS on 03/22/2013, and the
  previous patch application before that was mid-December 2012.

  After applying all waiting updates on 03/22 we began to notice that
  MPI jobs were failing on our HPC clusters. This behavior was traced
  down to NFS (which was not updated in this patch cycle) clients
  improperly routing traffic. These clients use NFS on a VLAN that is
  unique to each cluster, which is only accessible to each node on their
  eth1 interface. On the nodes that would still work we noticed that the
  NFS mount had bound to the client address (clientaddr) of eth1, and on
  the nodes that were no longer functioning they had bound to '0.0.0.0'
  instead. This behavior was proven by finding a high count of discarded
  packets at hop 1, as the upstream router is not aware of each
  cluster's internal VLAN.

  I could not get this behavior to change simply by remounting, but it
  can definitely change after a reboot. For one 'up' cycle the NFS mount
  may bind to eth1, and on the next it may bind to 0.0.0.0 instead.
  Manually configuring each node's NFS mounts in /etc/fstab to use the
  client address of eth1 (clientaddr=<eth1's address>) corrects the
  issue, but this extra configuration has never been needed in the past,
  nor can I find any documentation regarding this change.

  Here is the complete list of what was updated in this patch cycle:

  dmsetup
  apparmor
  apport
  apt
  apt-transport-https
  apt-utils
  base-files
  bind9-host
  dnsutils
  dpkg
  gir1.2-gudev-1.0
  gnupg
  gpgv
  grub-common
  grub-pc
  grub-pc-bin
  grub2-common
  initramfs-tools
  initramfs-tools-bin
  iptables
  isc-dhcp-client
  isc-dhcp-common
  language-pack-en
  language-pack-en-base
  language-selector-common
  libapt-inst1.4
  libapt-pkg4.12
  libbind9-80
  libdbus-glib-1-2
  libdevmapper1.02.1
  libdns81
  libdrm-intel1
  libdrm-nouveau1a
  libdrm-radeon1
  libdrm2
  libfreetype6
  libgnutls26
  libgudev-1.0-0
  libisc83
  libisccc80
  libisccfg82
  liblwres80
  libmysqlclient18
  libnih-dbus1
  libnih1
  libpciaccess0
  libplymouth2
  libssl1.0.0
  libudev0
  libxml2
  linux-headers-server
  linux-image-server
  linux-server
  man-db
  mountall
  mysql-common
  openssl
  perl
  perl-base
  perl-modules
  plymouth
  Plymouth-theme-ubuntu-text
  postfix
  puppet
  puppet-common
  python-apport
  python-problem-report
  sudo
  ubuntu-minimal
  ubuntu-standard
  udev
  upstart
  x11-common

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/dbus/+bug/1163513/+subscriptions




More information about the foundations-bugs mailing list