Wednesday, 2022-04-27

__ministry: eandersson: it loaded duplicate cluster data; each cluster was loaded more than once. I'm running 3 controllers with 32 health-check workers on each node. On one node, the health-check process loaded the same cluster ID up to 29 times. [01:28]
dtruong: __ministry What detection mode are you using for the health policy? We have seen problems with NODE_STATUS_POLL_URL not scaling well to a large number of nodes. [05:01]
eandersson: I suspect that this is a worker bug. Can you try setting the workers to the default of 1 and see if the issue persists? [06:10]
eandersson: We run it with one worker, and I suspect that the CI only uses a single worker. [06:11]
eandersson: Actually the CI does use more than one worker. [06:15]
eandersson: __ministry can you add something like this and see if this gets triggered? https://paste.openstack.org/show/bAi27dkPPHyYdWviIoXm/ [06:33]
opendevreview: Erik Olof Gunnar Andersson proposed openstack/senlin master: [DNM] Testing https://review.opendev.org/c/openstack/senlin/+/839473 [06:39]
eandersson: Also, can you confirm if this is the policy you are using? [07:23]
eandersson: https://github.com/openstack/senlin/blob/master/examples/policies/health_policy_poll.yaml [07:23]
eandersson: Unfortunately none of my theories checked out and I'm not able to reproduce this. [07:38]
__ministry: eandersson: yep. I just use the health policy above as-is. I don't get this error often, but I've seen it a few times, and this time I found the root cause. [08:27]
__ministry: dtruong: I used "NODE_STATUS_POLLING" [08:28]
__ministry: when a cluster was registering its health check on node 1, I saw it automatically run and load on node 2 as well, and the error happened. [08:30]
__ministry: this is the log from when the worker starts registering: https://pastebin.com/EfY3K1WR [08:38]
__ministry: it registers more than once with the policy: https://pastebin.com/a8LpaLF6 [08:40]
__ministry: the logs above are just part of the full log. [08:41]
__ministry: policy healthcheck: https://pastebin.com/xE1rHq8k [08:45]
eandersson: Interesting. I wonder why I wasn't able to reproduce this. Maybe I need to run two separate instances and not just more workers. [20:23]
eandersson: Any ideas dtruong? [20:23]
