Thursday, 2022-09-29

jrwrNova Errors like this keep me up at night -- requires a full stack reboot to fix https://hastebin.com/raw/ajefokeseh00:01
*** rlandy is now known as rlandy|out00:07
*** mhen_ is now known as mhen01:12
jrwrNeutron created hundreds of TAP interfaces... I go and try and reboot.. kernel panics -- now I gotta wait until I get back into work to iDrac into them01:26
*** yadnesh|away is now known as yadnesh03:36
*** rlandy|out is now known as rlandy10:28
jrwrERROR neutron.agent.linux.dhcp [req-2af2610f-2c6b-4854-b3f8-762584e210b3 - - - - -] Unable to plug DHCP port for network d4467036-0416-4526-a8d2-8867033571bd. Releasing port.: ovsdbapp.exceptions.TimeoutException:  exceeded timeout 10 seconds, cause: TXN queue is full -- I wonder how I would fix this14:56
*** yadnesh is now known as yadnesh|away16:18
jrwrIf you ever hit a bunch of TAP interfaces desync'd from Neutron and are clogging up OVS/OVN18:15
jrwrcat openvswitch/ovs-vswitchd.log | grep "could not open network" | cut -d' ' -f6 | uniq | xargs -n1 -P1 ovs-vsctl del-port br-int18:15
jrwrcleared out over 8k TAPs in my OVS today18:15
JayFjrwr: Glad you figured some of that out; if you want more guidance about that nova error, you might have more luck emailing openstack-discuss@ 18:18
jrwrIts a Kolla Deployment, but Ill take you up on that18:19
jrwrI've had RabbitMQ desync, I've had Cinder, Neutron, and others all get out of sync and just hardlock deployments18:20
jrwrit /really/ dislikes 1200 VM deployments18:20
JayFI've absolutely used nova past that scale with success, but rabbitmq was consistently the issue I hit scaling it up. 18:23
JayFHopefully someone on the list with more knowledge of kolla+VM-nova will be able to help out :D 18:23
jrwrThe hosts are def beefy (8 hosts, Each with 2TB Ram, 200Gbit/s 256 CPU Cores)18:24
lowercaseI have a much a large deployment than 1200. We don't use ovs and instead opt'd to use LACP bond on every hypervisor which is port channeled into a vpc. then the ip addresses are BGP routed to our vms.18:55
lowercaseand each hypervisor has a couple bridges and vlans that are specific to our deployment, vlans for customer traffic, management, database and rabbit are all segemented.18:56
jrwrWe are doing student competitions -- so its like 30vms and two networks per team (that are the same IP) with Secgroups 18:58
jrwrtimes 4018:58
jrwrgod I wish I could just dumb the traffic on the network18:58
jrwrdump*18:58
lowercasebgp would allow that to be possible.19:05
lowercasethe openstack network range hands out ip addresses in a subnet, that is routable by the entire internal network space.19:06
lowercaseThink floating ips, but instead, the "floating ips" are handed out by neutron and assigned to the vms dynamically.19:07
jrwrfun part is, these networks are isolated, I don't need to to talk anything, Just need NAT for the publics19:22
jrwrand they are all on the same L2 VLAN, so I could get away with a good bit19:26
jrwrOh, and its a Uni, so I don't even control my poor switches19:29
*** lifeless_ is now known as lifeless20:14
*** rlandy is now known as rlandy|out22:10

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!