Friday, 2022-03-18

opendevreview	Andre Aranha proposed openstack/octavia master: Change FIPS jobs to centos-9-stream https://review.opendev.org/c/openstack/octavia/+/831840	11:03
spatel	johnsom hey	17:51
johnsom	spatel Hi	17:51
spatel	what could be wrong here - https://paste.opendev.org/show/bP7JdWsLvWE66gXdTSlu/	17:51
spatel	I am not seeing ip address demo-net=10.0.0.202 in amphora image	17:52
johnsom	spatel For security reasons we isolate tenant traffic in a network namespace. Use "sudo ip netns exec amphora-haproxy ip a"	17:52
spatel	oh!!	17:53
spatel	i can see it now	17:53
spatel	trying to troubleshoot issue.. i have 2 www server behind LB but not able to curl from VIP address	17:55
spatel	i am able to curl from amphora image	17:55
johnsom	What is the output of "openstack loadbalancer status show" ?	17:55
spatel	https://paste.opendev.org/show/bIsSQzgPbbcgKaRPEzie/	17:57
spatel	OFFLINE not sure why	17:57
johnsom	Yeah, that is odd. Can you check that the admin state is up for all of those objects? "openstack loadbalancer show web-lb-1", etc.	17:58
spatel	https://paste.opendev.org/show/b38m46BhNRj25WXPkIfL/	17:59
johnsom	It kind of looks like maybe the pool or listener is disabled	18:00
spatel	hmm	18:00
johnsom	check "openstack loadbalancer listener show listen-port-80" and openstack loadbalancer pool show pool-80" as well	18:01
spatel	https://paste.opendev.org/show/bDsiPKO7dens0kPKJe4T/	18:01
johnsom	Hmm listener looks good	18:02
spatel	everything looks ok for me.. then not sure why its offline	18:03
spatel	i can curl www machine from amphora	18:03
spatel	I have used PING for monitoring is that ok?	18:03
johnsom	It's not great, since this is HTTP, try switching it to an HTTP health monitor	18:03
johnsom	Or TCP if they aren't web servers	18:04
spatel	Let me switch	18:04
spatel	I can't do it without removing pool	18:05
johnsom	PING has a bunch of gotchas with neutron security groups, etc. that an cause it to get odd results.	18:05
johnsom	Yes, you can, just remove the health monitor and add a new one	18:05
spatel	hmm ok	18:05
spatel	i have changed it to HTTP but still same problem	18:07
spatel	operating status: Offline	18:07
johnsom	can you paste an updated "openstack loadbalancer status show"?	18:07
johnsom	Oh, I wonder if your lb-mgmt-net isn't working right. Maybe the heartbeats aren't coming into the controllers. Though the curl to the VIP should still work even if lb-mgmt-net is not getting the heartbeats.	18:08
spatel	https://paste.opendev.org/show/bB2LuweXEmeobeVwab9e/	18:09
spatel	I can ping lb-mgmt-net from worker and manager	18:09
johnsom	if "openstack loadbalancer stats show web-lb-1" doesn't show any metrics (all zeros) it's likely a problem with the lb-mgmt-net	18:10
johnsom	There should be values in "total connections" for the curl calls to the vip	18:10
johnsom	If that is zero, check your octavia.conf file, [health_manager] controller_ip_port_list setting. Make sure the health manager IP and port for your controller(s) is correct there.	18:13
johnsom	Also make sure your health manager process is running	18:14
spatel	hmm	18:15
spatel	oh let me check health_manager	18:15
spatel	give me sec i check all those setting	18:15
johnsom	Sure, NP	18:15
spatel	https://paste.opendev.org/show/bdM8jhqA24LcpYOrj2cg/	18:16
spatel	10.30.1.230 is my host IP address	18:16
spatel	is this correct?	18:17
johnsom	For the o-hm0 interface?	18:17
spatel	its 10.1.0.10	18:17
spatel	i should use that IP means...	18:17
spatel	let me check it and restart service... that could be my problem	18:18
johnsom	Ok, so that is probably the problem. The amps are sending to 10.30.1.230 over the lb-mgmt-net via o-hm0 interfaces on the controllers.	18:18
johnsom	When you change the octavia.conf config, you will need to either failover the loadbalancer or use "openstack loadbalancer amphora configure [--wait] <amphora-id>" to push the updated endpoint to the amp	18:18
spatel	ok, do i need to change on worker config also or not?	18:19
johnsom	Yes	18:19
johnsom	All except the API will need to be updated	18:19
spatel	cool	18:19
spatel	johnsom he amphora ID is nova list ?	18:27
johnsom	no	18:27
spatel	where should i find amphora ID?	18:27
johnsom	openstack loadbalancer amphora list	18:27
spatel	:)	18:28
spatel	johnsom its still showing OFFLINE	18:35
spatel	thinking i can redeploy LB	18:35
johnsom	Try loadbalancer failover	18:35
spatel	one odd thing i can't see 5555 port listen on anywhere on host machine netstat -natp \| grep 5555	18:36
spatel	oh its UDP	18:38
johnsom	The health manager should listen on it on the o-hm0 ip. You did fix the binding address too right?	18:39
spatel	now i can see 5555 bind to 10.1.0.10 (earlier i was looking at TCP but now i found in UDP)	18:39
spatel	udp 0 0 10.1.0.10:5555 0.0.0.0:*	18:40
johnsom	+1	18:40
spatel	redeploying whole LB :) its on the way	18:40
spatel	is this normal ? - https://paste.opendev.org/show/baH8eVseNQZGoQ6tYZSd/	18:46
spatel	assuming amphora isn't ready yet so its just keep trying	18:46
johnsom	Yeah, that is normal. It is waiting for nova to full boot the vm	18:48
spatel	pool member show error	18:48
spatel	https://paste.opendev.org/show/bt6l2Ffj4E1VUOkTNH3z/	18:49
johnsom	Well, you fixed the heartbeat!	18:50
spatel	?	18:50
johnsom	So the web server isn’t responding as the health monitor expects	18:50
spatel	i can curl so port 80 is alive	18:50
spatel	from amphora i can curl web server	18:51
spatel	let me change to PING and see	18:51
johnsom	If you curl / on a web server with verbose on, it doesn’t send a 302 does it?	18:51
spatel	sudo ip netns exec amphora-haproxy curl http://10.0.0.100/	18:53
spatel	it return default apache page	18:53
spatel	i switch back to PING but still error	18:54
spatel	lets debug and see what is going on	18:55
spatel	in my other datacenter i used PING for health and it works without issue	18:55
johnsom	Ok, yeah, that is odd.	18:55
johnsom	Check the haproxy log file too, just to see what it says	18:55
spatel	in which place amphora keep lb config?	18:57
spatel	i think /var/lib/octavia/	18:58
johnsom	Yes	18:58
johnsom	Unlikely to be a config problem	18:58
spatel	https://paste.opendev.org/show/bsxcPsMngHs0en2xOqAP/	19:00
johnsom	That says the ping is not getting a response	19:01
spatel	hmm but i can ping from amphora :)	19:04
spatel	let me debug this...	19:05
spatel	or i can use http and see	19:05
spatel	here is the ping script	19:05
spatel	assuming it will run inside namespace	19:06
johnsom	Yes, the health monitor checks run from inside the namespace	19:07
johnsom	Worst case, you can run tcpdump from inside the namespace and see the packets leaving and the response	19:09
spatel	johnsom i am doing all basic OS level debug,, i will let you know :)	19:22
johnsom	+1	19:23

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!