Tuesday, 2022-05-03

*** rlandy|rover|bbl is now known as rlandy|rover00:36
rlandy|rovermerged the revert of victoria00:38
*** rlandy|rover is now known as rlandy|out00:39
dasm|ruck|bblnice00:44
*** dasm|ruck|bbl is now known as dasm|ruck|off03:11
*** marios is now known as marios|ruck05:08
marios|ruckmorning 05:08
chandankumarmarios|ruck: Good morning :-)05:15
*** jpena|off is now known as jpena07:34
frenzy_fridayhey marios|ruck Good morning. https://review.rdoproject.org/zuul/builds?job_name=periodic-tripleo-ci-build-containers-centos-9-quay-master looks like quay master is passing again08:30
marios|ruckfrenzy_friday: o/ morning ok good so you merged some fix? 08:30
frenzy_fridaynope, maybe something in quay was down ?08:30
marios|ruckfrenzy_friday: ah i see 08:31
marios|ruckwell fantastic then 08:31
marios|ruck;)08:31
marios|ruckfrenzy_friday: thanks08:31
frenzy_fridaythat was quay trolling us 08:31
marios|ruck:D08:31
marios|ruckcoffee brb08:32
chandankumarmarios|ruck: need any help on ruck rover?10:12
marios|ruckchandankumar: thanks ok for now chasing promotions mostly atm ... gates look ok but lets not focus on that too much ;)10:12
chandankumarmarios|ruck: ok10:13
marios|ruckchandankumar: thanks will ping when i need sthing10:13
chandankumarsure sure10:13
marios|ruckdasm|ruck|off: rlandy|out: o/ question about how network promoted without fs1 did you remove from criteria? https://bugs.launchpad.net/tripleo/+bug/1970899/comments/10 310:14
rlandy|outmarios|ruck: ack10:19
rlandy|outharold's note said we needed newer neutron10:19
rlandy|outmarios|ruck: dasm|ruck|off also added a patch to skip train fs01 and fs03510:20
rlandy|outnot sure how I feel about that10:20
rlandy|outwe should investigate what died there10:20
rlandy|outmarios|ruck: is master any better with newer neutron?10:20
marios|ruckrlandy|out: yeah i blocked it for now trying our luck with 2 hashes first ;) details there https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/42450/1#message-e243ae370c46aa1d9e1ddbcac379bd742eac87d310:21
chandankumardasm|ruck|off: hello, please remove -1 from https://review.rdoproject.org/r/c/config/+/42226 as https://github.com/rdo-infra/rdo-jobs/blob/master/zuul.d/ansible-role-container-registry.yaml#L16 is removed now10:21
marios|ruckrlandy|out: well master integration line today is at least no longer hitting +bug/1970899/ (i wrote in comment #10)10:22
*** rlandy|out is now known as rlandy10:24
rlandymarios|ruck: quick chat re: invoice10:24
rlandyhttps://meet.google.com/wzf-vowr-oxu?pli=1&authuser=010:24
rlandymarios|ruck: ^^10:25
marios|ruckrlandy: sure sec (sorry had temp issue with connection router dropped for a minute)10:27
rlandychandankumar: arxcruz: can you guys join us on https://meet.google.com/wzf-vowr-oxu?pli=1&authuser=010:53
marios|ruckarxcruz: https://logserver.rdoproject.org/openstack-periodic-integration-main/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-9-ovb-3ctlr_1comp-featureset001-master/65b4123/logs/undercloud/var/log/tempest/stestr_results.html.gz10:57
chandankumarpassing https://logserver.rdoproject.org/openstack-periodic-integration-main/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-9-ovb-3ctlr_1comp-featureset001-master/b0fc454/logs/overcloud-novacompute-0/var/log/extra/errors.txt.gz11:03
chandankumarfailure: https://logserver.rdoproject.org/openstack-periodic-integration-main/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-9-ovb-3ctlr_1comp-featureset001-master/65b4123/logs/overcloud-novacompute-0/var/log/extra/errors.txt.gz11:03
chandankumararxcruz: https://opendev.org/openstack/tripleo-quickstart/src/branch/master/config/general_config/featureset001.yml#L14711:07
chandankumarhttps://opendev.org/openstack/tripleo-quickstart/src/branch/master/config/general_config/featureset035.yml#L21711:07
chandankumarrlandy: https://codesearch.opendev.org/?q=tempest_run_concurrency&i=nope&literal=nope&files=&excludeFiles=&repos= most places it is 411:09
rlandymarios|ruck: https://review.rdoproject.org/r/c/testproject/+/36255 - ready for depends-on11:14
marios|ruckrlandy: ack https://review.opendev.org/c/openstack/tripleo-quickstart/+/84028311:18
*** dviroel|out is now known as dviroel11:18
rlandythanks11:19
rlandymarios|ruck: need help with anything else?11:26
marios|ruckrlandy: not currently will let you know thanks main focus is train promotion currently 11:27
rlandyk11:27
rlandyjm1: frenzy_friday, arxcruz, chandankumar, marios|ruck, rcastillo, dasm|ruck|off, dviroel: any topics for today's community call?11:28
rlandyindia has a public holiday today11:29
rlandyI have a clashing meeting11:29
rlandywhich I may be able to skip some of11:29
marios|ruckrlandy: ack we'll join in case there are any guests11:30
rlandymarios|ruck; pls do11:30
rlandywill join if I can11:30
marios|ruckk np11:30
chandankumarrlandy: dviroel marios|ruck https://review.opendev.org/c/openstack/tripleo-ansible/+/839319 please have a look when free, thanks!11:41
marios|ruckchandankumar: k adding to reviews 11:41
reviewbotDo you want me to add your patch to the Review list? Please type something like add to review list <your_patch> so that I can understand. Thanks.11:41
chandankumarI am still working on final change to close out cs9 tripleo-ansible work https://review.opendev.org/c/openstack/tripleo-ansible/+/839688/1111:41
chandankumarrlandy: marios|ruck leaving early today, see ya tomorrow11:42
marios|ruckchandankumar: o/ have a good one mate11:43
dviroelk11:43
rlandyrcastillo: updated your molecule patch for stable/wallaby11:45
rlandyfreeze graph issue11:45
rlandyrcastillo: testing nested-virt here: https://review.rdoproject.org/r/c/testproject/+/4243411:48
rlandydasm|ruck|off: marios|ruck: rhos-17 on rhel-9 missing fs035 to promo -  rerunning that now12:09
marios|ruckthank you rlandy 12:09
* marios|ruck coffee brb 12:09
*** dasm|ruck|off is now known as dasm|ruck12:17
dasm|rucko/12:17
jm1rlandy: prepared a intro for our zuul ci jobs, but can give it next week as well12:17
jm1rlandy: *ci jobs in ansible openstack collection12:17
jm1marios|ruck: ^12:25
marios|rucko/ jm112:25
marios|rucksure sounds good - maybe both today and next week? 12:26
jm1hey hey .)12:26
jm1that might bore half of our team ^^12:26
jm1..next week12:26
marios|ruckjm1: also fine if you want to wait until next week :)12:26
jm1i am fine with both but if india is out today it might make sense to postpone it because in particular chandan asked about it ^^12:27
marios|ruckjm1: sounds like a plan then ;)12:27
marios|ruckship it!12:28
jm1marios|ruck: ack, thx :)12:28
rlandyjm1: let's see who is there12:29
jm1rlandy: ack, will be there12:30
rlandychandankumar: when you are in tomorrow - let's discuss https://review.rdoproject.org/r/c/config/+/4244212:53
rlandyordering there12:53
rlandyI think that's the right way around12:53
rlandyjm1: hey - going to send you a test email - can you just tell me when you receive it?12:56
rlandymarios|ruck: ^^ checking12:56
marios|ruckrlandy: :) thanks12:57
jm1rlandy: none yet ^^12:58
marios|rucknothing reported there https://www.google.com/appsstatus/dashboard/ fwiw12:59
rlandyjm1: frenzy_friday, arxcruz, chandankumar, marios|ruck, rcastillo, dasm|ruck|off, dviroel: community call notes hackmd ready for notes if I can't attend: 13:05
rlandyhttps://hackmd.io/MMg4WDbYSqOQUhU2Kj8zNg13:05
rlandy05/0313:05
marios|ruckthanks rlandy 13:05
*** rlandy is now known as rlandy|mtg13:05
marios|ruckjm1: did you still not receive an email from rlandy|mtg ?13:06
dasm|ruckmarios|ruck: i responded to your review comments on https://review.rdoproject.org/r/q/topic:rr_refactor+is:open13:06
dasm|ruckhmm.. rdoproject is 503-ing me13:06
marios|ruckdasm|ruck: thanks will revisit13:07
marios|ruckdasm|ruck: yes there is software factory upgrade today 13:07
marios|rucktotally forgot 13:07
marios|ruck:/13:07
marios|ruckmust have just started dasm|ruck 13:07
dasm|ruckah13:07
dasm|ruckack13:07
marios|rucki guess zuul will go at some point still there currently 13:08
dasm|ruckrepos are unavailable atm13:08
jm1marios|ruck: nope13:10
marios|ruckjm1: k thank you must be some wider email issue 13:10
jm1marios|ruck, rlandy|mtg: a regular mail to my company mail address? 13:11
marios|ruckjm1: yeah work red hat email13:11
jm1marios|ruck, rlandy|mtg: last mail in my inbox is from 3h ago13:13
jm1marios|ruck, rlandy|mtg: which is not normal13:13
rlandy|mtgjm1: marios|ruck; ack  - seems like an issue or lag13:16
rlandy|mtgthanks for checking13:16
rlandy|mtglast email 6:1913:17
rlandy|mtgconfirmed 3 hours ago13:17
marios|ruckthanks jm1 seems same for me (~3 hour ago last email)13:24
jm1marios|ruck, rlandy|mtg: we just got an emergency alert email13:24
marios|ruckjm1: ehm... where? i mean we don't have email :)13:25
jm1marios|ruck: oh, right. i got it to my personal email.. no idea why13:27
marios|ruckjm1: ack is it isaac alert thing maybe13:27
marios|ruckhope nothing serious happened if so... 13:28
jm1marios|ruck, rlandy|mtg: send you the incident link as pm13:29
marios|ruckthanks jm1 13:29
rlandy|mtgthanks13:30
marios|ruckdasm|ruck: rdo/gerrit back fyi 13:43
dasm|ruckack13:44
dviroelyep, service-now says mail outage - started at 9:1913:47
rlandy|mtghey - sorry missed community call14:02
rlandy|mtganything happen??14:02
*** rlandy|mtg is now known as rlandy14:02
marios|ruckrlandy|mtg: nah we dropped in 5 mins no topics 14:02
rlandymarios|ruck: arxcruz: https://logserver.rdoproject.org/55/36255/67/check/periodic-tripleo-ci-centos-9-ovb-3ctlr_1comp-featureset035-master/498eb76/logs/undercloud/var/log/tempest/stestr_results.html.gz :(14:19
rlandywith reduced workers concurrency14:20
arxcruzrlandy that's a different problem, either there is a firewall rule blocking the connection to the identity service (keystone) or the service did not start14:20
arxcruzit's a bug 14:21
marios|ruckrlandy: i dont think it will help us much, because a lot of the issues are not tempest (a lot are but quite a few arent)...e.g. latest example from the master line https://review.rdoproject.org/r/c/testproject/+/42518/2#message-6725e09951c0779190e83eb4844ba3482eb9fe1f   - only one of those is tempest14:21
arxcruzfor ipv6 14:21
rlandymarios|ruck: re: arxcruz's comment above14:21
rlandycan you confirm that for ipv6?14:21
marios|ruckrlandy: well if we can get some of those consistently yeah lets file a bug but not really seeing that yet 14:22
marios|ruckit seems like different issues each time14:22
marios|rucke.g. fs35 in latest master fails on provision (https://review.rdoproject.org/r/c/testproject/+/42518/2#message-6725e09951c0779190e83eb4844ba3482eb9fe1f )14:22
arxcruzmarios|ruck rlandy urllib3.exceptions.MaxRetryError: HTTPSConnectionPool(host='2001:db8:fd00:1000::5', port=13000): Max retries exceeded with url: /v3/auth/tokens (Caused by NewConnectionError('<urllib3.connection.HTTPSConnection object at 0x7f3e9f60cd90>: Failed to establish a new connection: [Errno 111] Connection refused'))14:23
arxcruzit's not being able to connect 14:24
arxcruzso, all tests fail 14:24
arxcruzmarios|ruck rlandy the other tests marked as passed are actually skipped14:26
arxcruz{0} setUpClass (tempest.api.compute.admin.test_floating_ips_bulk.FloatingIPsBulkAdminTestJSON) ... SKIPPED: nova-network is gone14:26
marios|ruckthanks arxcruz 14:26
rlandyarxcruz: marios|ruck: ok - so that should be a master fs035 bug only14:27
rlandyfs001 and other releases still have a chance14:27
dasm|ruckbrb14:27
arxcruzrlandy marios|ruck seems to be amq server14:28
arxcruz2022-05-03 13:40:24.509 16 ERROR oslo.messaging._drivers.impl_rabbit [-] [ea0597e8-f3e2-49d4-8646-f633653e2527] AMQP server on overcloud-controller-1.internalapi.localdomain:5672 is unreachable: <RecoverableConnectionError: unknown error>. Trying again in 1 seconds.: amqp.exceptions.RecoverableConnectionError: <RecoverableConnectionError: unknown error>14:28
marios|ruckarxcruz: rlandy: k i am not going to file a bug for that until we start seeing it consistently though things are not stable enough to call it right now14:28
rlandyk14:28
marios|ruckrlandy: i've seen fs35 fail on 3 different things today and not 2 of those yet14:28
arxcruzrlandy marios|ruck yes, haproxy is down14:29
arxcruzhttps://logserver.rdoproject.org/55/36255/67/check/periodic-tripleo-ci-centos-9-ovb-3ctlr_1comp-featureset035-master/498eb76/logs/overcloud-controller-0/var/log/containers/haproxy/haproxy.log.txt.gz14:29
rlandydasm|ruck: ^^ pls read back on all this14:29
rlandyre: tempest investigation14:29
arxcruzMay  3 13:15:52 overcloud-controller-0 haproxy[7]: Server swift_proxy_server_be/overcloud-controller-2.storage.localdomain is DOWN, reason: Layer4 connection problem, info: "Connection refused", check duration: 3ms. 0 active and 0 backup servers left. 0 sessions active, 0 requeued, 0 remaining in queue.14:29
marios|ruckfeels like it could be more related to the nodes themselves as discussed earlier14:29
marios|ruckthanks arxcruz  :D14:29
marios|ruckdasm|ruck: lets do a sync in half hour if you available?14:29
arxcruzmarios|ruck so, i can tell the issue is not related to tempest, it's actually related to the installation / network setup 14:29
arxcruzeither there is a firewall rule blocking 14:30
marios|ruckarxcruz: ack14:30
arxcruzor some services weren't installed properly 14:30
arxcruzor the network where the ovb vm's were deployed is not working properly 14:30
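[editor's sketch] The rule of thumb discussed above (hold off on filing a bug until the same failure shows up at least twice) could be tracked with something like the following. This is only an illustration: the function name `repeated_failures`, the threshold, and the short signature strings are assumptions made up for this example, not part of any TripleO CI tooling.

```python
from collections import Counter


def repeated_failures(observations, threshold=2):
    """Return failure signatures seen at least `threshold` times.

    `observations` is an iterable of (job_name, signature) pairs. The
    signature is whatever short label the person triaging chooses.
    """
    counts = Counter(signature for _job, signature in observations)
    return {sig: n for sig, n in counts.items() if n >= threshold}


# Hypothetical triage notes, loosely paraphrasing failures mentioned above.
observations = [
    ("featureset035-master", "tempest: keystone connection refused"),
    ("featureset035-master", "overcloud provision failed"),
    ("featureset035-master", "AMQP server unreachable"),
]

# Every signature is distinct here, so nothing qualifies for a bug yet.
print(repeated_failures(observations))
```

With only distinct signatures the result is empty, which matches the situation described in the channel: many failures, no two alike so far.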
dasm|ruckback14:38
dasm|ruckmarios|ruck: sure14:38
rlandymarios|ruck: so all the fs035 jobs could probably get bug'ed14:48
marios|ruckrlandy: what do you mean 14:50
marios|ruckrlandy: going to sync in 10 with dasm if available? 14:50
rlandyall releases are failing14:50
rlandyack - will join then14:50
marios|ruckrlandy: right but random things...e g. just saw one for train like that: https://logserver.rdoproject.org/62/42462/2/check/periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset035-train/3dabaa0/logs/undercloud/home/zuul/overcloud_deploy.log.txt.gz14:51
marios|ruckrlandy:  FATAL | Check Keystone user assignment to roles status | undercloud | item=cinderv3 | error={"ansible_job_id": "965216880256.483604", "ansible_loop_var": 14:51
marios|ruckrlandy: so what can we file this is madness14:51
marios|rucki mean the sheer number of different things14:51
marios|rucki have seen today 14:51
marios|rucklog files are all starting to blur into one 14:51
rlandyI'd follow fs035 train14:52
marios|ruckrlandy: that's why i think it may be some performance issue i mean 'we need bigger machines' but i have no evidence yet14:52
rlandyit was stable at one point then stopped14:52
rlandymarios|ruck: compare:14:52
rlandyhttps://review.rdoproject.org/zuul/builds?job_name=periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset035-train&result=success14:52
rlandyvs14:52
rlandyhttps://review.rdoproject.org/zuul/builds?job_name=periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset035-train&result=success14:53
marios|ruckrlandy: sure but we cant track/file something if each run is different error14:53
marios|rucki mean once we have 2 the same14:53
marios|rucki'll file it 14:53
marios|ruck:D14:53
marios|ruckrlandy: those links are both train did you mean to point to something else? 14:53
rlandysomewhere on 03/26 we started to die consistently14:53
rlandyno -14:53
rlandyI am talking about two different status links on the same train job14:54
marios|ruck17:52 < rlandy> https://review.rdoproject.org/zuul/builds?job_name=periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset035-train&result=success14:54
marios|ruck17:52 < rlandy> vs14:54
marios|ruck17:53 < rlandy> https://review.rdoproject.org/zuul/builds?job_name=periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset035-train&result=success14:54
marios|ruckrlandy: same links? 14:54
rlandyhttps://review.rdoproject.org/zuul/builds?job_name=periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset035-train&result=failure14:54
rlandysorry - ^^ that14:54
marios|ruckah k looking14:54
marios|ruckrlandy: right14:54
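[editor's sketch] The mix-up above came from pasting the `result=success` link twice. A small helper that derives both links from the job name would avoid that. The URL pattern is copied from the links in this log; the helper name `builds_url` is an assumption for illustration.

```python
from urllib.parse import urlencode

# Base of the builds page, as it appears in the links pasted above.
BUILDS_PAGE = "https://review.rdoproject.org/zuul/builds"


def builds_url(job_name, result):
    """Build a builds-page link filtered by job name and result."""
    query = urlencode({"job_name": job_name, "result": result})
    return f"{BUILDS_PAGE}?{query}"


JOB = "periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset035-train"

# Print the success and failure views side by side for comparison.
for result in ("success", "failure"):
    print(builds_url(JOB, result))
```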
marios|ruckrlandy: dasm|ruck: https://meet.google.com/dqv-rjoq-gmr 14:59
dasm|ruckk14:59
dasm|ruckjoining14:59
rlandyjoining15:00
marios|ruckdasm|ruck: master https://review.rdoproject.org/r/c/testproject/+/4251815:31
rlandymarios|ruck: emails are back15:43
marios|ruckrlandy: thanks, did you get the one i sent to concilium with the timesheet? (so i know not to re-send it tomorrow :))15:44
marios|ruckrlandy: i see the one you sent to me as well now (the one that was missing earlier)15:45
rlandymarios|ruck: yes - I have that one15:46
marios|ruckrlandy: thanks15:46
rlandymarios|ruck: arxcruz: dasm|ruck: same tempest disaster with concurrency set to 2: https://logserver.rdoproject.org/91/42491/1/check/periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset001-train/626ac5d/logs/undercloud/var/log/tempest/stestr_results.html.gz15:49
dasm|ruckk15:49
marios|ruckrlandy: ack 15:49
* marios|ruck going to dream about tempest tonight15:49
rlandytempest.lib.exceptions.IdentityError: Got identity error15:49
dasm|ruckmarios|ruck: it's going to be a bad dream?15:49
dasm|ruckor even nightmare?15:49
rlandyarxcruz: ^^ do we need the same key changes we had for octavia?15:49
marios|ruckright dasm|ruck 15:50
rlandytrain c815:50
rlandyfeatureset001-train15:50
rlandyso no ipv615:50
rlandytempest.lib.exceptions.IdentityError: Got identity error15:52
rlandyhttps://logserver.rdoproject.org/55/36255/67/check/periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset001-wallaby/2eef1a4/logs/undercloud/var/log/tempest/stestr_results.html.gz15:52
rlandysame error in wallaby c815:52
rlandyperiodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset001-wallaby15:52
rlandywe may have legit c8 errors15:53
*** dviroel is now known as dviroel|lunch15:53
rlandymarios|ruck: dasm|ruck: ^^ c8 shows this15:54
marios|ruckrlandy: k will check more tomorrow - may be there are some legit bugs in there, as well ;)15:55
rlandyfs001 and fs03515:55
rlandywaiting to see if arxcruz is still around15:55
rlandyif he can comment on that15:55
marios|ruckack 15:56
* marios|ruck has to go 15:57
marios|ruckwill pickup tomorrow bai 15:57
*** marios|ruck is now known as marios|out15:57
arxcruzrlandy checking 16:00
rlandyarxcruz: can we chat for a few?16:00
arxcruzyes 16:00
rlandyarxcruz: https://meet.google.com/vkq-dxan-ftb?pli=1&authuser=016:00
rlandydasm|ruck: you can join if you like16:01
arxcruzrlandy https://logserver.rdoproject.org/55/36255/67/check/periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset001-wallaby/2eef1a4/logs/overcloud-controller-0/var/log/containers/haproxy/haproxy.log.txt.gz16:03
arxcruzhttps://logserver.rdoproject.org/55/36255/67/check/periodic-tripleo-ci-centos-9-ovb-3ctlr_1comp-featureset035-master/498eb76/logs/undercloud/var/log/tempest/stestr_results.html.gz16:06
arxcruzrlandy ^16:06
rlandyarxcruz: https://logserver.rdoproject.org/55/36255/67/check/periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset035-wallaby/9967e52/logs/undercloud/var/log/tempest/stestr_results.html.gz16:09
arxcruzdasm|ruck we start here: https://logserver.rdoproject.org/55/36255/67/check/periodic-tripleo-ci-centos-9-ovb-3ctlr_1comp-featureset035-master/498eb76/logs/undercloud/var/log/tempest/stestr_results.html.gz16:29
arxcruzdasm|ruck then I came here https://logserver.rdoproject.org/55/36255/67/check/periodic-tripleo-ci-centos-9-ovb-3ctlr_1comp-featureset035-master/498eb76/logs/overcloud-controller-0/var/log/containers/haproxy/haproxy.log.txt.gz16:31
arxcruzand found May  3 13:15:50 overcloud-controller-0 haproxy[7]: Server cinder_be/overcloud-controller-0.internalapi.localdomain is DOWN, reason: Layer4 connection problem, info: "Connection refused", check duration: 0ms. 2 active and 0 backup servers left. 0 sessions active, 0 requeued, 0 remaining in queue.16:31
arxcruzthis is for centos 916:31
arxcruzfor centos 8:16:31
dasm|ruckrlandy: fyi, this is overall pass rate for last 1k tests: https://paste.opendev.org/show/bqMe8dzIzOyTowakeCzm/16:32
arxcruzdasm|ruck first we start here:16:32
dasm|rucknow, i'm looking into when it started failing first time16:32
arxcruzhttps://logserver.rdoproject.org/55/36255/67/check/periodic-tripleo-ci-centos-9-ovb-3ctlr_1comp-featureset035-master/498eb76/logs/undercloud/var/log/tempest/stestr_results.html.gz16:32
dasm|ruckrlandy: in 15 mins i'll need to leave for about 1h. Doc's app.16:33
rlandydasm|ruck: ok - ping me when you leave with what you have and I'll log the bug16:33
arxcruzdasm|ruck sorry, centos 8 we start here:16:33
arxcruzhttps://logserver.rdoproject.org/55/36255/67/check/periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset035-wallaby/9967e52/logs/undercloud/var/log/tempest/stestr_results.html.gz16:33
dasm|ruckrlandy: k16:33
arxcruzthen i check here:https://logserver.rdoproject.org/55/36255/67/check/periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset035-wallaby/9967e52/logs/overcloud-controller-0/var/log/containers/keystone/keystone.log.txt.gz16:35
arxcruzand got this 16:35
arxcruz2022-05-03 14:27:43.142 169 ERROR oslo_db.sqlalchemy.engines [req-d6926730-7334-4db4-9cca-c4760d5dbb71 - - - - -] Database connection was found disconnected; reconnecting: oslo_db.exception.DBConnectionError: (pymysql.err.OperationalError) (2013, 'Lost connection to MySQL server during query')16:35
* rlandy starting bug16:35
arxcruzthen i check this:16:35
arxcruzhttps://logserver.rdoproject.org/55/36255/67/check/periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset035-wallaby/9967e52/logs/overcloud-controller-0/var/log/containers/haproxy/haproxy.log.txt.gz16:35
rlandyarxcruz: will send to you for review16:35
arxcruzand got this May  3 13:24:01 overcloud-controller-0 haproxy[13]: Backup Server mysql/overcloud-controller-0.internalapi.localdomain is DOWN, reason: Layer4 connection problem, info: "Connection refused", check duration: 0ms. 0 active and 2 backup servers left. Running on backup. 0 sessions active, 0 requeued, 0 remaining in queue.16:35
arxcruzrlandy ok 16:35
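[editor's sketch] The haproxy step of the walk-through above (open haproxy.log, look for backends reported DOWN) can be scripted roughly as follows. The regular expression is an assumption fitted to the two log lines quoted in this channel; it is not a general parser for haproxy's log format, and `parse_down` is a name invented for this example.

```python
import re

# Pattern fitted to the "is DOWN" lines quoted above (an assumption, not
# a complete description of what haproxy can log).
DOWN_RE = re.compile(
    r'Server (?P<backend>[^/\s]+)/(?P<server>\S+) is DOWN, '
    r'reason: (?P<reason>[^,]+), info: "(?P<info>[^"]*)"'
    r'.*?(?P<active>\d+) active and (?P<backup>\d+) backup servers left'
)


def parse_down(line):
    """Return a dict describing a DOWN event, or None if the line differs."""
    match = DOWN_RE.search(line)
    if match is None:
        return None
    event = match.groupdict()
    event["active"] = int(event["active"])
    event["backup"] = int(event["backup"])
    return event


# The two lines pasted in the channel, copied verbatim.
CINDER_LINE = (
    'May  3 13:15:50 overcloud-controller-0 haproxy[7]: Server '
    'cinder_be/overcloud-controller-0.internalapi.localdomain is DOWN, '
    'reason: Layer4 connection problem, info: "Connection refused", '
    'check duration: 0ms. 2 active and 0 backup servers left. '
    '0 sessions active, 0 requeued, 0 remaining in queue.'
)
MYSQL_LINE = (
    'May  3 13:24:01 overcloud-controller-0 haproxy[13]: Backup Server '
    'mysql/overcloud-controller-0.internalapi.localdomain is DOWN, '
    'reason: Layer4 connection problem, info: "Connection refused", '
    'check duration: 0ms. 0 active and 2 backup servers left. '
    'Running on backup. 0 sessions active, 0 requeued, 0 remaining in queue.'
)

for line in (CINDER_LINE, MYSQL_LINE):
    event = parse_down(line)
    print(event["backend"], event["info"], event["active"], event["backup"])
```

Running this over a whole haproxy.log and grouping by backend would show at a glance which services lost all active servers, which is the signal used above to conclude the problem sits below tempest.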
dasm|ruckrlandy: fyi, i'm afk. i'll be back in ~1h16:45
rlandyk - bug in progress16:45
rlandyjust looking for c9 logs16:45
dasm|rucki pulled up initial stats. i'll refine them after coming back16:45
dasm|ruckit looks like fs002 is in a good shape16:46
dasm|rucks/good/better 16:46
dasm|ruckthan others16:46
dasm|ruckbut overall, ovb is in a miserable shape16:46
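[editor's sketch] Per-job pass rates like the ones summarised above can be computed from a list of (job, result) pairs. How the real stats in the paste were produced is not shown in this log, so the function below and its sample data are purely hypothetical; the result strings `SUCCESS`, `FAILURE` and `TIMED_OUT` are assumed labels.

```python
from collections import defaultdict


def pass_rates(builds):
    """Map each job name to its fraction of builds with result SUCCESS."""
    totals = defaultdict(lambda: [0, 0])  # job -> [passed, total]
    for job, result in builds:
        totals[job][1] += 1
        if result == "SUCCESS":
            totals[job][0] += 1
    return {job: passed / total for job, (passed, total) in totals.items()}


# Made-up sample data, NOT taken from the paste linked above.
builds = [
    ("featureset002", "SUCCESS"),
    ("featureset002", "SUCCESS"),
    ("featureset002", "FAILURE"),
    ("featureset002", "SUCCESS"),
    ("featureset035", "FAILURE"),
    ("featureset035", "TIMED_OUT"),
    ("featureset035", "SUCCESS"),
    ("featureset035", "FAILURE"),
]

for job, rate in sorted(pass_rates(builds).items()):
    print(f"{job}: {rate:.0%}")
```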
rlandyhttps://bugs.launchpad.net/tripleo/+bug/197146516:48
dasm|ruckk16:48
rlandyarxcruz: dasm|ruck: ^^ we have a start16:48
rlandyediting bug to add stats16:48
dasm|ruckrlandy: you want me to add notes to this bug or create a separate one to track general ovb failures?16:49
dasm|ruckbbl16:49
rlandyadding that info16:49
*** dasm|ruck is now known as dasm|ruck|bbl16:50
arxcruzrlandy can explain that is not a tempest issue, but the fail in the connectivity between the controllers / haproxy 16:50
rlandyarxcruz: yep - adding more info now16:50
rlandyjust wanted to keep description short enough to read16:50
rlandyarxcruz: does https://bugs.launchpad.net/tripleo/+bug/1971465 cover it?16:52
arxcruzrlandy yes 16:53
rlandyarxcruz: dasm|ruck|bbl: going to start by pinging the DF for some help in getting debug direction16:53
*** dviroel|lunch is now known as dviroel16:56
arxcruzok16:58
*** jpena is now known as jpena|off17:00
*** dasm|ruck|bbl is now known as dasm|ruck18:01
* dasm|ruck is back18:01
rlandydasm|ruck: pls see conversation on #tripleo18:06
rlandytrying to add swap to overcloud nodes18:06
rlandydasm|ruck: bug logged at https://bugs.launchpad.net/tripleo/+bug/197146518:07
dasm|ruckk18:11
dasm|ruckrlandy: did you retrigger ovb jobs on https://review.opendev.org/c/openstack/tripleo-quickstart/+/840283 ?18:12
dasm|ruckdo you want me to do so?18:13
rlandydasm|ruck: already done18:17
dasm|ruckk18:17
rlandywaiting for rhos-17 on rhel-9 to complete as well18:17
dasm|ruckack18:19
rlandydasm|ruck: fs039 has been failing as well - maybe an install issue still - you can check into that18:20
dasm|ruckyep18:20
* rlandy back to 16.2 base image work18:20
dasm|ruck> periodic-tripleo-ci-centos-9-ovb-1ctlr_1comp-featureset002-master started passing18:46
dasm|ruckbrb18:59
dasm|ruckback19:06
rlandydviroel: dasm|ruck: pls vote/review so I don't just self merge https://code.engineering.redhat.com/gerrit/c/openstack/tripleo-ci-internal-config/+/40663020:04
rlandyno idea if it's right or wrong before  I merge it :(20:04
rlandyconfig 20:04
dasm|ruckchecking20:04
dviroellooks correct I think20:06
dasm|ruckrlandy: hmm.. to me it looks like this secret isn't even used20:07
dasm|ruckhttps://sf.hosted.upshift.rdu2.redhat.com/codesearch/?q=registry_redhat_io&i=nope&files=&repos=20:07
dasm|ruckbut i might be wrong.20:07
rlandydasm|ruck: I am going to add the usage now20:07
dasm|ruckit would mean, we can merge it, and it won't break anything20:07
dasm|ruckack20:07
rlandybusy coding that change20:07
rlandyack20:07
rcastillolgtm too20:07
rlandyit needs to be merged to reference it20:07
rlandythank you20:07
dasm|ruckk20:08
dasm|ruckdviroel: wanna pull the trigger? rcastillo and myself technically have this superpower too20:08
dviroeldone20:10
rlandythanks20:15
rlandyadding secret usage in next review20:15
dasm|ruckk20:16
rlandyhttps://code.engineering.redhat.com/gerrit/c/openstack/tripleo-ci-internal-config/+/40663120:23
rlandy^^ usage20:23
rlandydasm|ruck: going to w+ the train patch20:57
rlandywill revert20:58
dasm|ruckack20:59
dasm|ruckyours is still running. currently on tempest, rlandy 20:59
rlandyyep I know20:59
rlandywallaby c9 fs001 and fs035 passed20:59
dasm|ruck\o/21:00
rlandynot with swap21:00
rlandyjust generally21:00
dasm|ruckoh21:00
dasm|rucki was checking ovb jobs. in last few hours they seem to be in better shape.21:01
dasm|ruckthey've started passing21:01
dasm|ruckrlandy: cs8 train - disable fs001 & fs035 got merged. I'm gonna kickstart promotion.21:28
* dviroel out o/21:30
*** dviroel is now known as dviroel|out21:30
dasm|ruckrenning21:30
dasm|ruck*running21:30
dasm|ruckdviroel|out: take care o/21:30
rcastillodviroel|out: o/21:30
dviroel|outtks o/21:30
rlandycool22:00
rlandytimed out22:01
rlandydid not solve the issue22:01
rlandydasm|ruck: the promotion runs on its own22:02
rlandyyou don't have to start anything22:02
rlandyhttp://promoter.rdoproject.org/promoter_logs/container-push/22:03
rlandyyou rekicked the line???22:03
dasm|rucki did. a while ago22:08
dasm|ruckdid i break something, rlandy ?22:09
dasm|ruckugh. time out on periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset035-train22:20
dasm|ruckbbl22:21
rlandydasm|ruck: no22:21
rlandybut you restarted the like22:21
rlandyline22:21
rlandythat has nothing to do with the promotion22:21
rlandydasm|ruck: I can explain the difference22:22
rlandydasm|ruck: train promoted - revert test skip22:24
rlandyreverted22:24
rlandywow - we have a long gate22:24
rlandyhttps://logserver.rdoproject.org/55/36255/67/check/periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset035-wallaby/e61020e/logs/undercloud/var/log/tempest/stestr_results.html.gz22:25
rlandyonly one test failed here :)22:25
rlandyputting in a separate patch with swap and no concurrency change22:26
dasm|ruckack22:38
dasm|ruckok, i see cs8 train promoted.22:40
