[Bug 1731595] Re: L3 HA: multiple agents are active at the same time
Brian Murray
brian at ubuntu.com
Thu Nov 30 23:43:31 UTC 2017
Hello Xav, or anyone else affected,
Accepted neutron into artful-proposed. The package will build now and be
available at
https://launchpad.net/ubuntu/+source/neutron/2:11.0.2-0ubuntu1.1 in a
few hours, and then in the -proposed repository.
Please help us by testing this new package. See
https://wiki.ubuntu.com/Testing/EnableProposed for documentation on how
to enable and use -proposed. Your feedback will aid us in getting this
update out to other Ubuntu users.
If this package fixes the bug for you, please add a comment to this bug,
mentioning the version of the package you tested and change the tag from
verification-needed-artful to verification-done-artful. If it does not
fix the bug for you, please add a comment stating that, and change the
tag to verification-failed-artful. In either case, details of your
testing will help us make a better decision.
Further information regarding the verification process can be found at
https://wiki.ubuntu.com/QATeam/PerformingSRUVerification . Thank you in
advance!
** Changed in: neutron (Ubuntu Artful)
Status: Triaged => Fix Committed
** Tags added: verification-needed verification-needed-artful
--
You received this bug notification because you are a member of Ubuntu
OpenStack, which is subscribed to neutron in Ubuntu.
https://bugs.launchpad.net/bugs/1731595
Title:
L3 HA: multiple agents are active at the same time
Status in Ubuntu Cloud Archive:
Triaged
Status in Ubuntu Cloud Archive ocata series:
Triaged
Status in Ubuntu Cloud Archive pike series:
Triaged
Status in Ubuntu Cloud Archive queens series:
Triaged
Status in neutron:
In Progress
Status in neutron package in Ubuntu:
Triaged
Status in neutron source package in Zesty:
Triaged
Status in neutron source package in Artful:
Fix Committed
Status in neutron source package in Bionic:
Triaged
Bug description:
OS: Xenial, Ocata from Ubuntu Cloud Archive
We have three neutron-gateway hosts, with L3 HA enabled and a min of 2, max of 3. There are approx. 400 routers defined.
At some point (we weren't monitoring exactly) a number of the routers
changed from having one active agent and one or more standby agents to
having more than one active agent.
This included each of the 'active' namespaces having the same IP
addresses allocated, and therefore traffic problems reaching
instances.
Removing the routers from all but one agent, and re-adding, resolved
the issue. Restarting one l3 agent also appeared to resolve the
issue, but very slowly, to the point where we needed the system alive
again faster and reverted to removing/re-adding.
At the same time, a number of routers were listed without any agents
active at all. This situation appears to have been resolved by adding
routers to agents, after several minutes of downtime.
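The two failure modes described above (more than one active agent for a router, and no active agent at all) can be checked mechanically once the per-agent HA state of each router is known. The following is only a minimal sketch: the function name, the input shape (router ID mapped to a list of state strings) and the sample router names are assumptions for illustration, and how the states are collected from the deployment is left out.

```python
from collections import Counter


def classify_ha_routers(router_states):
    """Group router IDs by how many agents report them as 'active'.

    router_states is a hypothetical input shape: a dict mapping a router ID
    to the list of HA states reported for it, one entry per hosting agent.
    """
    result = {"ok": [], "multiple_active": [], "no_active": []}
    for router_id, states in sorted(router_states.items()):
        active = Counter(states)["active"]
        if active == 1:
            result["ok"].append(router_id)
        elif active > 1:
            # The split-brain case: the same addresses are live in several namespaces.
            result["multiple_active"].append(router_id)
        else:
            # Covers both "all standby" and "not scheduled to any agent".
            result["no_active"].append(router_id)
    return result


# Made-up sample data, not taken from the affected deployment.
sample = {
    "router-a": ["active", "standby", "standby"],
    "router-b": ["active", "active", "standby"],
    "router-c": ["standby", "standby"],
    "router-d": [],
}
report = classify_ha_routers(sample)
print(report)
```

Running a check like this periodically would show when routers move out of the "ok" group, which is the timing information that was missing in the report above.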
I'm finding it very difficult to find relevant keepalived messages to
indicate what's going on, but what I do notice is that all the agents
have equal priority and are configured as 'backup'.
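To make the observation about state and priority easier to repeat across hosts, a small script can pull those two values out of each keepalived configuration file. This is a naive sketch under stated assumptions: it only looks at the `vrrp_instance`, `state` and `priority` keywords, does no real block parsing, and the instance name, interface name and values in the sample text are invented for illustration rather than copied from the affected systems.

```python
def vrrp_summary(conf_text):
    """Return {instance_name: {"state": ..., "priority": ...}} from keepalived-style text.

    Naive line-based scan: it does not track braces, so it assumes 'state' and
    'priority' only appear directly inside a vrrp_instance block.
    """
    summary = {}
    current = None
    for line in conf_text.splitlines():
        tokens = line.split()
        if not tokens:
            continue
        if tokens[0] == "vrrp_instance" and len(tokens) >= 2:
            current = tokens[1]
            summary[current] = {"state": None, "priority": None}
        elif current and tokens[0] == "state" and len(tokens) >= 2:
            summary[current]["state"] = tokens[1]
        elif current and tokens[0] == "priority" and len(tokens) >= 2:
            summary[current]["priority"] = int(tokens[1])
    return summary


# Hypothetical configuration text; names and numbers are placeholders.
sample_conf = """
vrrp_instance VR_1 {
    state BACKUP
    interface ha-example
    virtual_router_id 1
    priority 50
}
"""
summary = vrrp_summary(sample_conf)
print(summary)
```

Comparing the resulting dictionaries from each gateway host shows at a glance whether every instance really has the same state and priority everywhere.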
I am trying to find a way to reproduce this; it may be that we need to
have a large number of routers configured on a small number of
gateways.
To manage notifications about this bug go to:
https://bugs.launchpad.net/cloud-archive/+bug/1731595/+subscriptions