Thursday, 2021-06-03

*** ysandeep has joined #oooq00:52
ysandeepweshay|ruck, rlandy we promoted tripleo in 16.2? that had legit issue and now everything is hosed :)00:56
ysandeephttps://sf.hosted.upshift.rdu2.redhat.com/logs/openstack-component-tempest/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-rhel-8-standalone-tempest-rhos-16.2/4b94358/logs/undercloud/home/zuul/install_packages.sh.log00:56
ysandeepimho..We should have waited for ansible dist-git changes before we promoted tripleo..00:59
* ysandeep checking where we are on merging that podman patch00:59
ysandeephurrah we finally merged.. https://review.opendev.org/c/openstack/tripleo-quickstart-extras/+/79290401:00
ysandeepchecking if sshnaidm already created reverts for distgits, otherwise i will create those reverts now01:01
rlandyysandeep: we didn't01:08
rlandypromoted tripleo component01:08
rlandyrdo zuul went down01:08
rlandywe need to revert the distgit changes01:08
rlandyysandeep: https://review.rdoproject.org/r/c/openstack/tripleo-ansible-distgit/+/3384001:09
ysandeeprlandy, i have just created the reverts01:10
ysandeephttps://review.rdoproject.org/r/c/openstack/tripleo-ansible-distgit/+/33843 https://review.rdoproject.org/r/c/openstack/tripleo-ansible-distgit/+/33844 and https://review.rdoproject.org/r/c/openstack/tripleo-ansible-distgit/+/3384501:10
ysandeeprlandy, weird.. 16.2 current-tripleo for tripleo looks new to me: http://osp-trunk.hosted.upshift.rdu2.redhat.com/rhel8-osp16-2/component/tripleo/01:11
rlandyysandeep: yep - weshay|ruck promoted the component through as they were cutting beta01:12
rlandybut01:12
rlandytripleo-ci-testing is nehind01:12
rlandybehind01:12
rlandyyou can see my comment above01:12
rlandy<weshay|ruck> I promoted tripleo 16.2 component all the way through01:12
rlandybut somehow one was missed01:13
rlandymaybe it got recreated01:13
rlandywe also need to promote network component01:13
rlandywhich died during rerun due to ^^01:14
ysandeepohh.. zuul went down in internal again :) fun..01:15
rlandyysandeep: it went down everywhere .01:15
rlandyapevec> rlandy, looks like neutron router for the whole infra-rdo project went into bad state, vexxhost is looking into it now01:15
rlandy<apevec> "somehow we're in a state where neutron route l3 ha is active on 2 nodes and standby on one"01:15
rlandy<apevec> sounds familiar01:15
rlandy<apevec> rlandy, neutron router fixed, VMs are reachable01:15
rlandyysandeep: also I have one more fix for container updates for multinode01:17
rlandystandalone is working fine01:17
rlandywill test that new patch tomorrow01:17
ysandeeprlandy, just wondering if tripleo-ci-testing still contains old tripleo.. why its failing in integration line01:17
rlandyysandeep: yep that's a mess01:18
ysandeepmy bad.. i have seen component line result.. https://sf.hosted.upshift.rdu2.redhat.com/logs/openstack-component-tempest/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-rhel-8-standalone-tempest-rhos-16.2/4b94358/logs/undercloud/home/zuul/install_packages.sh.log01:18
rlandyidk how that one hash got missed01:18
rlandythe next run should pick up promoted components01:18
rlandywhich is newer01:18
rlandyysandeep: if you can just get those reverts merged, I can pick this up when I come back on line01:19
rlandydidn't help that zuul died at the same time :(01:19
rlandyysandeep: see you in a few hours01:20
* rlandy out01:20
ysandeeprlandy, yeah.. lets get those revert merged, promote tripleo and network component, and hopefully promotion tomorrow in your morning01:20
*** rlandy has quit IRC01:20
weshay|ruckysandeep, don't worry about 16.2 it promoted...02:19
weshay|ruckthey have the content now.. we're done for a bit02:19
ysandeepweshay|ruck, but 16.2 tripleo had legit issue..02:20
ysandeepweshay|ruck, now everything failing with https://sf.hosted.upshift.rdu2.redhat.com/logs/openstack-component-tempest/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-rhel-8-standalone-tempest-rhos-16.2/4b94358/logs/undercloud/home/zuul/install_packages.sh.log02:20
weshay|ruckysandeep, yes.. I warned jjoyce about the issues02:20
weshay|ruckthey needed to cut the beta..02:20
weshay|ruckthis is just kind of how it goes... we're not the last line of defense.. but when we get to this point of the release...02:21
ysandeepweshay|ruck, ack fine then.. but beta will be hosed then with same issue :)02:21
weshay|ruckI saw the baremetal tests passing02:21
weshay|ruckso.. it's ok02:21
weshay|ruckboth in integration and on the tripleo component hash we promoted..02:21
ysandeepweshay|ruck, want to chat for quick sec02:22
weshay|ruckwe'll see what happens in phase1/2 .. but jjoyce was warned02:22
weshay|rucksure02:22
ysandeepgrabbing my earphone02:22
weshay|ruckmeet.google.com/xrh-oaup-pph02:22
ysandeepsshnaidm, arxcruz|rover fyi.. regarding that msr issue.. The node on which overcloud deployment failed because it's not reachable.. i was able to access that after reboot.. i notice this traceback in cloud-init log.. http://paste.openstack.org/show/806297/02:49
*** frenzy_friday has quit IRC02:54
*** frenzy_friday has joined #oooq02:54
*** ysandeep is now known as ysandeep|afk03:23
*** ysandeep|afk has quit IRC03:54
*** ykarel|away has joined #oooq03:56
*** ykarel|away is now known as ykarel04:14
*** ysandeep|afk has joined #oooq04:30
*** ysandeep|afk is now known as ysandeep|ruck04:35
*** ysandeep__ has joined #oooq04:38
*** soniya29 has joined #oooq04:40
*** ysandeep|ruck has quit IRC04:44
*** soniya29 has quit IRC05:04
*** marios has joined #oooq05:15
*** soniya29 has joined #oooq05:19
*** ysandeep__ is now known as ysandeep_afk05:26
*** soniya29 has quit IRC05:31
*** soniya29 has joined #oooq05:32
*** pojadhav- has joined #oooq05:37
*** pojadhav has quit IRC05:41
*** soniya29 has quit IRC05:43
*** soniya29 has joined #oooq05:43
*** ysandeep_afk is now known as ysandeep|ruck05:51
bhagyashrisysandeep|ruck, hey i want to know do we have promotable hash for ussuri because i want to test one change on new promoter server06:06
ysandeep|ruckbhagyashris, ussuri is in good shape.. only fs02/fs035 are failing with ovb issue.. but that's not ussuri specific.. but i would let weshay|ruck to comment if we can wave that off.. we did yesterday for victoria.. but i don't want to take that call myself :)06:10
*** slaweq has joined #oooq06:16
bhagyashrisysandeep|ruck, ack06:26
bhagyashristhanks you :)06:26
bhagyashrisysandeep|ruck, akahat i am looking into the c7 train promotion06:26
*** amoralej|off has joined #oooq06:28
*** pojadhav- is now known as pojadhav06:29
*** amoralej|off is now known as amoralej06:30
*** slaweq[m] has joined #oooq06:46
*** slaweq has quit IRC06:57
*** slaweq[m] is now known as slaweq07:15
*** jpena|off is now known as jpena07:20
*** tosky has joined #oooq07:20
*** pojadhav has quit IRC07:44
*** pojadhav has joined #oooq07:44
bhagyashrisysandeep|ruck, hey just want to confirm the c7 train images are stored here http://images.rdoproject.org/centos7/train/rdo_trunk/ right?08:10
*** derekh has joined #oooq08:11
ysandeep|ruckbhagyashris, yes afaik08:11
bhagyashrisysandeep|ruck, ok, wondering the promotable hash is not there https://trunk.rdoproject.org/api-centos-train/api/civotes_detail.html?commit_hash=6059d84f17fae8745e10fd5aa37fc3b787159dd1&distro_hash=583c625c52d3a386d4cae90db02a1ddfe62b3a7008:12
ysandeep|ruckbhagyashris, you can check with weshay|ruck if he decided to wave off some jobs..08:16
bhagyashrisysandeep|ruck, ok dropping mail thanks08:17
*** ysandeep|ruck is now known as ysandeep|lunch08:32
*** ykarel is now known as ykarel|lunch08:48
*** pojadhav has quit IRC09:01
*** jbadiapa_ has joined #oooq09:19
arxcruz|roverysandeep|lunch: let me know when you're back09:23
*** jbadiapa has quit IRC09:24
*** strider has joined #oooq09:37
*** ysandeep|lunch is now known as ysandeep|ruck09:41
ysandeep|ruckarxcruz|rover, back o/09:42
*** soniya29 has quit IRC09:45
arxcruz|roverysandeep|ruck: so, how did you get that info from the vm that was failing ?09:47
ysandeep|ruckarxcruz|rover, reboot the failing vm, then you will be able to access the vm.. login to vm via undercloud node..09:48
*** ykarel|lunch is now known as ykarel09:49
ysandeep|ruckarxcruz|rover, wanna tmate? i can show you how..09:49
ysandeep|ruckarxcruz|rover, we can play on env which yatin gave to us.09:50
arxcruz|roverysandeep|ruck: sure09:52
ysandeep|ruckpm09:52
sshnaidmysandeep|ruck, seems like cloud-init doesn't find user "centos" on image, does it exist there?10:09
sshnaidmysandeep|ruck, probably broken by https://gitlab.cee.redhat.com/virt/cloud-init/-/merge_requests/2810:13
sshnaidmmaybe worth to try cloud-init before cloud-init-20.3-6.el810:13
sshnaidmor after 20.3-9, where they reverted - https://gitlab.cee.redhat.com/virt/cloud-init/-/commit/4dde2a9bed58aba13c730bf4a7314b21038d7a3110:15
ysandeep|rucksshnaidm, Issue reproduced in a test patch even when using older cloud-init; details here: https://bugs.launchpad.net/tripleo/+bug/1929745/comments/1010:16
opendevmeetLaunchpad bug 1929745 in tripleo "Unchecked MSR access error - overcloud deploy "timed out waiting for ping module test" [Critical,Triaged]10:16
ysandeep|ruckwe had been using cloud-init 20.3-10.el8 for a long time and 20.3-10.el8_4.2 came recently, but adding the new one to exclude didn't help10:19
sshnaidmysandeep|ruck, so we use the latest: https://gitlab.cee.redhat.com/virt/cloud-init/-/commits/rhel840/master-20.3/10:19
sshnaidmysandeep|ruck, an we try 20.3-5.el8 ?10:20
sshnaidms/an/can/10:20
ysandeep|rucksshnaidm: thanks, I will explore that o/10:21
sshnaidmysandeep|ruck, can you check also - does user centos exist on the image?10:21
ysandeep|rucksshnaidm: cat /etc/passwd | grep -i centos -> returns nothing10:22
sshnaidmysandeep|ruck, I think that's the problem..10:23
ysandeep|rucki ran this on overcloud node10:23
sshnaidmysandeep|ruck, I'd try to add user centos into image and try this image in job10:23
sshnaidmseems like cloud-init is looking for that user and fails, not sure why..10:24
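The check discussed above (whether a `centos` user exists on the image) can be sketched in Python. This is only an illustration: the sample passwd content and the `heat-admin` entry are hypothetical, and unlike `grep -i centos`, which matches any substring case-insensitively, this compares the login field exactly.

```python
def user_exists(passwd_text: str, username: str) -> bool:
    """Return True if `username` has an entry in passwd-formatted text."""
    for line in passwd_text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        # passwd fields are colon-separated; the first field is the login name
        if line.split(":", 1)[0] == username:
            return True
    return False


# Hypothetical passwd content, not taken from the failing overcloud node.
sample = (
    "root:x:0:0:root:/root:/bin/bash\n"
    "heat-admin:x:1000:1000::/home/heat-admin:/bin/bash\n"
)
print(user_exists(sample, "centos"))
```

On a real node the text would come from reading `/etc/passwd`; the function itself makes no assumption about where the text originates.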
sshnaidmysandeep|ruck, can you paste the full cloud-init log? it's in /var/log/cloud-init.log and /var/log/cloud-init-output.log10:25
ysandeep|rucksure 1 mins10:26
ysandeep|rucksshnaidm, http://paste.openstack.org/show/806312/10:32
ysandeep|rucksshnaidm, and http://paste.openstack.org/show/806313/10:33
sshnaidmysandeep|ruck, hmm.. I see network is up10:34
sshnaidmin failed cases I think it wasn't up..10:34
*** soniya29 has joined #oooq10:34
ysandeep|rucksshnaidm, arxcruz|rover cloud-init engineering friend is with us helping in debug10:35
sshnaidmysandeep|ruck, great10:36
sshnaidmysandeep|ruck, look at failed image, it doesn't have network up: https://logserver.rdoproject.org/63/794363/2/openstack-check/tripleo-ci-centos-8-ovb-1ctlr_1comp-featureset001/53b2ebc/logs/baremetal_2_24756_0-console.log10:36
sshnaidmno interfaces table in the log10:36
*** soniya has joined #oooq10:57
*** soniya29 has quit IRC10:57
*** soniya29 has joined #oooq11:15
*** pojadhav has joined #oooq11:18
*** dviroel|out is now known as dviroel11:19
*** soniya has quit IRC11:20
*** jpena is now known as jpena|lunch11:33
ysandeep|rucksshnaidm: after setting password for console, we can cloud-int manually and noticed that intermittently we are losing IP from interface.11:34
ysandeep|ruckwe ran*11:34
sshnaidmysandeep|ruck, yep, it's what we see in logs also, it can't set interfaces up11:35
sshnaidmysandeep|ruck, maybe something with networking in centos stream11:35
ysandeep|rucksshnaidm, diff b/w older logs and current rpm versions https://termbin.com/9r2r111:36
sshnaidmysandeep|ruck, yeah, there are a lot of changes :)11:39
sshnaidmis there anything in logs about losing IP, maybe network manager logs?11:40
sshnaidmprobably need to find logs before this started and right after, to minimize the diff11:40
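Minimizing the diff between a package list from before the breakage and one from right after could look roughly like the sketch below. It assumes each line has the `name.arch  version  @repo` layout seen in the `package-list-installed.txt` output pasted in this channel; the function names are illustrative and the `example-pkg` entry is hypothetical.

```python
def parse_package_list(text):
    """Map 'name.arch' to version for lines shaped like 'name.arch version @repo'."""
    packages = {}
    for line in text.splitlines():
        fields = line.split()
        if len(fields) < 2:
            continue  # skip blank or malformed lines
        packages[fields[0]] = fields[1]
    return packages


def diff_packages(old, new):
    """Return sorted (name, old_version, new_version) tuples for every change.

    A version of None means the package is absent on that side.
    """
    changes = []
    for name in sorted(set(old) | set(new)):
        before, after = old.get(name), new.get(name)
        if before != after:
            changes.append((name, before, after))
    return changes


# cloud-init versions are the ones mentioned in the channel;
# example-pkg is a hypothetical unchanged package.
OLD = """\
cloud-init.noarch      20.3-10.el8        @appstream
example-pkg.x86_64     1.0-1.el8          @baseos
"""
NEW = """\
cloud-init.noarch      20.3-10.el8_4.3    @appstream
example-pkg.x86_64     1.0-1.el8          @baseos
"""

for name, before, after in diff_packages(parse_package_list(OLD), parse_package_list(NEW)):
    print(f"{name}: {before} -> {after}")
```

Only packages whose version differs, or that exist on one side only, are reported, which keeps the list short when the two snapshots are close in time.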
ysandeep|rucksshnaidm, I will ping back in sometime11:42
sshnaidmsure11:42
*** soniya29 has quit IRC11:43
*** soniya29 has joined #oooq11:43
*** pojadhav has quit IRC11:43
*** akahat_ has joined #oooq11:44
*** akahat has joined #oooq11:48
*** akahat_ has quit IRC11:49
*** soniya29 has quit IRC11:51
*** rlandy has joined #oooq11:53
*** akahat_ has joined #oooq11:55
rlandyysandeep|ruck: how are things downstream?11:56
*** akahat_ has quit IRC11:56
ysandeep|ruckrlandy, hosed , we merged tripleo-ansible distgit change, i ran upstream train component so that we get them to promoted-component soon.. https://review.rdoproject.org/r/c/testproject/+/3205412:00
rlandyysandeep|ruck: it looks like that passed12:00
ysandeep|ruck^^ passed.. we should have/ will get new package in downstream soon..12:00
rlandyk12:00
*** ysandeep|ruck is now known as ysandeep|debug_ssn12:01
rlandydid we run the promote to promoted components job?12:01
ysandeep|debug_ssnrlandy, i haven't .. not sure automatic build ran or not ..i didn't get chance to check status after throwing that test patch..12:02
rlandyysandeep|debug_ssn: ok - I'll pick that up now12:02
ysandeep|debug_ssnthanks!12:02
*** pojadhav has joined #oooq12:04
rlandyperiodic-tripleo-centos-8-train-component-tripleo-promote-to-promoted-components12:04
rlandy^^ job is currently queued12:05
*** amoralej is now known as amoralej|lunch12:12
weshay|ruckarxcruz|rover, hey.. so repos on the overcloud images need a full rm -Rf /etc/yum.repos.d/12:13
arxcruz|roverweshay|ruck: you mean because the exclude thing ?12:13
weshay|ruckya12:14
arxcruz|roverweshay|ruck: ok12:15
arxcruz|roverweshay|ruck: can we sync?12:15
arxcruz|roverysandeep|debug_ssn: ^12:15
weshay|ruck2021-06-03 03:54:59.755 |  cloud-init                               noarch  20.3-10.el8_4.2              appstream                1.0 M12:15
arxcruz|roverweshay|ruck: well, we are also hitting the same thing with 20.3-10.el812:15
weshay|ruckmeet.google.com/tqj-hbmi-ymv12:15
akahatneed +1 and +w : https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/3385512:16
ysandeep|debug_ssnarxcruz|rover, I am in a debug session for same issue with yatin, arxcruz|rover can fill weshay|ruck with details what we find.12:17
akahatweshay|ruck, arxcruz|rover zbr marios ^^12:17
*** jpena|lunch is now known as jpena12:23
pojadhavmarios, 0/12:29
pojadhavmarios, I just started with the adding upgrades jobs.. please have a look once you free : https://review.opendev.org/c/openstack/tripleo-ci/+/79457112:29
pojadhavmarios, also one concern is there- standalone upgrades and undercloud upgrades jobs are present atm at : https://github.com/openstack/tripleo-ci/blob/3f9eb85616ac96d29135d930c380afd420527457/zuul.d/upgrades-jobs-templates.yaml. and now we are adding them into periodic. so should we remove them from where they are present now? or we need to keep them as it..?12:35
*** soniya29 has joined #oooq12:35
mariospojadhav: ok added for next reviews will check12:39
mariospojadhav: no you should not move the job definitions into the periodic.yaml template leave them in the upgrades-jobs-templates.yaml please12:40
mariospojadhav: you should only put the jobs into the zuul layout in periodic.yaml12:41
pojadhavmarios, ack.. will do the changes.. thanks for clarification.12:42
mariospojadhav: np12:42
weshay|ruckysandeep|debug_ssn, check out cloud-init-20.3-10.el8_4.3.noarch.rpm 2021-06-02 15:15 1.0M12:51
bhagyashrisakahat, arxcruz|rover marios pojadhav zbr soniya29 rlandy weshay|ruck sshnaidm13:00
bhagyashrisscrum time13:00
bhagyashrisfrenzy_friday, ^13:01
*** soniya has joined #oooq13:06
*** amoralej|lunch is now known as amoralej13:07
soniyabhagyashris, here there is power supply failure, hence i may lose internet connection during the meeting13:11
bhagyashrissoniya, ack13:11
*** soniya29 has quit IRC13:13
mariosweshay|ruck: rlandy: https://review.opendev.org/q/topic:tripleo-get-hash13:15
arxcruz|roverysandeep|debug_ssn: ykarel i'll update https://review.opendev.org/c/openstack/tripleo-ci/+/79458513:19
rlandyhttp://git.app.eng.bos.redhat.com/git/openstack/tripleo-ci-internal-jobs.git/tree/zuul.d/required-projects-overrides.yaml13:19
arxcruz|roverwith a new cloud-init version 4.3 in an image that me and weshay|ruck uploaded right now13:19
ysandeep|debug_ssnack o/13:20
ysandeep|debug_ssni am throwing a test patch also.. to try pinning NetworkManager instead..13:21
arxcruz|roverykarel: can you put a hold on this ?13:21
marioshttps://review.opendev.org/c/openstack/tripleo-ci/+/794194/1#message-b75e024ca7d96cef5baa940270adc8f0152d7aa613:42
ykarelarxcruz|rover, which job and which patch?14:03
arxcruz|roverykarel: https://review.rdoproject.org/r/c/rdo-jobs/+/3396114:03
arxcruz|roverpatchset 314:03
*** ysandeep|debug_ssn is now known as ysandeep14:05
*** ysandeep is now known as ysandeep|ruck14:05
ykarelarxcruz|rover, done14:06
weshay|ruckarxcruz|rover, ysandeep|ruck https://meet.google.com/xnf-tvdh-pmk?authuser=114:17
ysandeep|ruckjoining14:17
arxcruz|roversec, grab coffee14:18
*** ykarel is now known as ykarel|away14:37
*** soniya has quit IRC14:41
weshay|ruckbhagyashris, can you promote.. wallaby w/14:44
weshay|ruck<ysandeep|ruck> weshay|ruck 3e4ca88391cf85cd127b130745319d4514:44
weshay|ruck[08:43:10] <ysandeep|ruck> │ periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset035-wallaby               │14:44
weshay|ruck[08:43:10] <ysandeep|ruck> │ periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset001-wallaby               │14:44
weshay|ruck[08:43:10] <ysandeep|ruck> │ periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp_1supp-featureset039-wallaby14:44
weshay|ruckskip those three jobs14:44
weshay|ruckresults are here: https://trunk.rdoproject.org/api-centos8-wallaby/api/civotes_agg_detail.html?ref_hash=3e4ca88391cf85cd127b130745319d4514:56
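The idea of promoting a hash while skipping specific jobs can be sketched as follows. This is not the promoter's actual logic: the `is_promotable` helper, the `"SUCCESS"`/`"FAILURE"` result strings, and the `some-other-required-job` entry are assumptions for illustration; only the three featureset job names come from the messages above.

```python
def is_promotable(job_results, criteria, skipped=()):
    """Return True if every required job that is not skipped reported SUCCESS.

    job_results: dict mapping job name to a result string (assumed format).
    criteria:    iterable of job names normally required for promotion.
    skipped:     job names temporarily waived from the criteria.
    """
    required = set(criteria) - set(skipped)
    return all(job_results.get(job) == "SUCCESS" for job in required)


OVB_JOBS = [
    "periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset035-wallaby",
    "periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset001-wallaby",
    "periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp_1supp-featureset039-wallaby",
]
# Hypothetical criteria list and results, for illustration only.
criteria = OVB_JOBS + ["some-other-required-job"]
results = {job: "FAILURE" for job in OVB_JOBS}
results["some-other-required-job"] = "SUCCESS"

print(is_promotable(results, criteria))
print(is_promotable(results, criteria, skipped=OVB_JOBS))
```

A job missing from `job_results` counts as not passed, so a hash is never promoted on the strength of absent votes.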
rlandymarios: https://code.engineering.redhat.com/gerrit/c/openstack/tripleo-ci-internal-jobs/+/24500815:00
rlandymay also need an entry in sf-config15:01
rlandychecking15:01
rlandyhttp://git.app.eng.bos.redhat.com/git/openstack/sf-config.git/tree/resources/tripleo-ci-internal.yaml#n21315:01
rlandyhas that - ok15:01
rlandymarios: ^^ ok - so the above review should be all that is needed15:02
rlandyysandeep|ruck: weshay|ruck: if all the rest of the train jobs pass for tripleo component - ok if we remove OVB from criteria?15:07
rlandywaiting on the third attempt for that job15:07
mariosrlandy: sorry dealing something else right now will check in bit15:07
ysandeep|ruckrlandy, yes we will need to wave that off .. ovb in bad condition15:07
rlandyneeded to clean up 16.215:07
rlandyysandeep|ruck: k - will take care of that when the rest of the jobs pass15:08
rlandywill put in a dnm patch and run the promote job15:08
weshay|rucksshnaidm, fyi https://review.rdoproject.org/r/c/rdo-jobs/+/3398215:12
weshay|ruckrlandy, fs002 pass?15:12
sshnaidmweshay|ruck, when previous-current-tripleo-rdo is from?15:14
rlandyweshay|ruck: np fs00215:14
rlandyno15:14
rlandyit's the component line15:14
sshnaidmoh, it's 2021-05-18, seems fine15:14
weshay|ruckarxcruz|rover, https://4d0f809b29b439d33d96-ee10ddbfa4b3a945502b6b74773c0853.ssl.cf1.rackcdn.com/793507/1/gate/tripleo-ci-centos-8-containers-multinode/18e1500/logs/undercloud/var/log/tempest/stestr_results.html15:14
rlandywe need to get the change to promoted-components15:14
rlandyto import downstream15:15
weshay|rucksshnaidm, fyi https://review.opendev.org/c/openstack/openstack-tempest-skiplist/+/79460815:16
sshnaidmweshay|ruck++15:17
*** ykarel|away has quit IRC15:24
rlandyysandeep|ruck: weshay|ruck: lol - nvm - I guess it doesn't take much to promote train tripleo these days: https://github.com/rdo-infra/ci-config/blob/master/ci-scripts/dlrnapi_promoter/config/CentOS-8/component/train.yaml#L8215:27
weshay|ruckarxcruz|rover, https://389a51b11efe9e84b938-e69135b470b32777c2fd508640e5c31f.ssl.cf1.rackcdn.com/792888/2/gate/tripleo-ci-centos-8-containers-multinode/284389a/logs/undercloud/var/log/tempest/stestr_results.html15:35
arxcruz|roverweshay|ruck: adding15:35
weshay|ruckarxcruz|rover, use a new patch15:36
weshay|ruckarxcruz|rover, 608 is about to merge15:37
arxcruz|roverweshay|ruck: yes, just waiting to gate finish, to not have conflict later15:37
weshay|ruckarxcruz|rover++15:37
weshay|ruckwoot.. https://review.opendev.org/c/openstack/openstack-tempest-skiplist/+/794608 merged15:37
weshay|ruckTHANK YOU for not using the tripleo pipeline :)) so awesome15:37
arxcruz|roverweshay|ruck: https://review.opendev.org/c/openstack/openstack-tempest-skiplist/+/79461215:43
weshay|ruckarxcruz|rover, also.. remember http://dashboard-ci.tripleo.org/d/3pUqDadGk/tempest-skipped-tests?orgId=1 ?15:43
weshay|ruckarxcruz|rover, the skip list may be able to be pruned.. not today.. but I'd like to work w/ you on this..15:44
arxcruz|roverit's been a long time...15:44
weshay|ruckand figuring out why wallaby and victoria are not showing up :)15:44
arxcruz|roverweshay|ruck: probably the job name15:45
weshay|ruckarxcruz|rover, jobs run here: https://review.rdoproject.org/zuul/builds?pipeline=openstack-periodic-weekend15:46
weshay|ruckbhagyashris, fyi https://review.rdoproject.org/r/c/rdo-jobs/+/3398415:54
ysandeep|ruckweshay|ruck, sshnaidm arxcruz|rover fyi.. https://review.opendev.org/c/openstack/tripleo-quickstart/+/794636 , testprojecting based on it16:02
sshnaidmysandeep|ruck, thanks16:03
rlandyperiodic-tripleo-ci-centos-8-multinode-1ctlr-featureset010-tripleo-train reporting now16:07
rlandygetting promotion patch ready16:08
*** marios is now known as marios|out16:08
weshay|ruckysandeep|ruck, you have a testproject for NM?16:13
rlandy periodic-tripleo-centos-8-train-component-tripleo-promote-to-promoted-components running16:17
ysandeep|ruckweshay|ruck, https://review.rdoproject.org/r/c/testproject/+/2844616:17
weshay|ruckthanks..16:17
weshay|ruckysandeep|ruck, arxcruz|rover I've organized our three test projects in the google tasks16:17
ysandeep|ruckweshay|ruck, rlandy you need anything before i leave for the day?16:21
weshay|ruckysandeep|ruck, I'm going to promote wallaby16:21
rlandyysandeep|ruck: I don't think so16:22
weshay|ruckysandeep|ruck, /me looks at ur patch16:22
rlandywaiting for promotion16:22
weshay|rucksec16:22
rlandytrain tripleo16:22
rlandythen will pick it up downstream16:22
ysandeep|ruckweshay|ruck, sure16:22
rlandyysandeep|ruck: chat with you your time tomorrow16:23
*** amoralej is now known as amoralej|off16:23
weshay|ruckysandeep|ruck, k.. nothing needed in tripleo-ci to exclude network-manager?16:23
weshay|ruckya.. we're good16:23
weshay|ruckysandeep|ruck, k.. I'll watch it through.. good night :)16:23
ysandeep|ruckweshay|ruck, i don't think if we are building images16:23
weshay|rucksee you in 8 hours lolz16:23
weshay|ruck10 hours16:24
ysandeep|ruckweshay|ruck, sshnaidm one more thing .. i have re-enabled the cloud cleanup script (we disabled it to debug).. it seems to be hitting some timeout trying to reach vexxhost.. openstack token issue is also facing the same problem.. probably some network issue that will clear in some time.. horizon seems to be working16:25
ysandeep|ruckon toolbox16:26
*** jpena is now known as jpena|off16:27
*** ysandeep|ruck is now known as ysandeep|away16:27
*** sshnaidm is now known as sshnaidm|afk16:32
*** marios|out has quit IRC16:40
*** jlarriba has quit IRC16:46
arxcruz|roverweshay|ruck: failure :(16:54
weshay|ruckwhere?16:54
arxcruz|roverhttps://review.rdoproject.org/zuul/stream/2b5e93f00cff4e1db30b5c53b78302a5?logfile=console.log16:54
weshay|ruckarxcruz|rover, https://logserver.rdoproject.org/61/33961/3/check/tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset001-2/080b663/logs/undercloud/home/zuul/build.log.txt.gz16:58
weshay|ruckcloud-init was removed16:58
weshay|ruckweird16:58
weshay|ruckcloud-init.noarch                             20.3-10.el8_4.3                           @appstream16:59
weshay|ruckhttps://logserver.rdoproject.org/61/33961/3/check/tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset001-2/080b663/logs/overcloud-controller-0/var/log/extra/package-list-installed.txt.gz16:59
weshay|ruckarxcruz|rover, so..  on to the Network-Manager theory?16:59
arxcruz|rovercloud-init.noarch                             20.3-10.el8_4.3                           @appstream16:59
arxcruz|roverweshay|ruck: cloud init is latest one, the new one17:00
weshay|ruckyes17:00
weshay|ruckand didn't fix it17:00
weshay|ruckrlandy, we may need to hold back / pin some core networking / os packages and use a dep pipeline17:01
weshay|ruckif NetworkManager is the culprit here... it's just going to get worse in el917:02
arxcruz|roverweshay|ruck: well, it's worse now17:03
arxcruz|roverhttps://logserver.rdoproject.org/61/33961/3/check/tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset001-2/080b663/logs/undercloud/home/zuul/overcloud_deploy.log.txt.gz17:03
arxcruz|rovernow two controllers fail17:03
arxcruz|roverso far we saw only one failing17:03
rlandyweshay|ruck: k - give me a list and I'll set something up17:04
weshay|ruckarxcruz|rover, I've seen two I think previously17:04
weshay|ruckrlandy, not ready yet.. just sowing seeds... we probably will need to close out tripleo-repos prior17:05
rlandyk - trying to clean up train -16.217:05
weshay|ruckarxcruz|rover, is it worth trying again w/ a much older version?17:05
weshay|ruckof cloud-init17:06
weshay|ruckor just go after NM now?17:06
*** derekh has quit IRC17:06
arxcruz|roveryou mean old cloud-init and nm version?17:08
arxcruz|roverat this point, i don't know, i'm out of ideas already17:08
weshay|ruckarxcruz|rover, let's see what happens w/ https://review.rdoproject.org/r/c/testproject/+/2844617:10
weshay|ruckand to some degree https://review.rdoproject.org/r/c/rdo-jobs/+/3398217:10
rlandytrain promoted17:11
rlandypromoted-components17:11
weshay|ruckrlandy, rdo?17:13
rlandyweshay|ruck: ack - waiting for that to hit downstream17:13
rlandythen we go with that17:13
rlandyhttp://osp-trunk.hosted.upshift.rdu2.redhat.com/rhel8-osp16-2/component/tripleo/consistent/commit.yaml17:14
rlandyhasn't hit there yet17:14
rlandyweshay|ruck: once we get that through, we can use double component job17:14
bhagyashrisweshay|ruck, ack17:14
rlandyand promote network and tripleo17:14
rlandyweshay|ruck: do you know how often consistent updates?17:17
weshay|ruckdate stamp17:17
weshay|ruckya17:17
bhagyashrisweshay|ruck, http://10.0.148.74/promoter_logs/container-push/20210603-170952.log17:18
weshay|ruckbhagyashris++17:19
*** ysandeep|away has quit IRC17:31
arxcruz|roverweshay|ruck: lol https://review.opendev.org/c/openstack/tripleo-ci/+/79458517:32
arxcruz|roverbleh, didn't built a new image, it's with cloud-init 4.217:34
weshay|ruckarxcruz|rover, ?17:36
weshay|ruckarxcruz|rover, but you had to_build true17:36
weshay|ruckand the updated cloud-init was installed17:36
weshay|ruckfrenzy_friday, you got a sec?17:42
*** pojadhav has quit IRC17:43
*** pojadhav has joined #oooq17:44
frenzy_fridayweshay|ruck, O/17:44
weshay|ruckfrenzy_friday, we can add a key to sova: regexes: I assume to mark whether or not there is a matching query.yml right?17:45
weshay|ruckharmless extra key for human eyes only17:45
weshay|ruckor is there a better way.. to note in sova-patterns.yml it's converted..17:46
weshay|ruckI guess I can cross reference queries.. because that has it17:46
frenzy_fridaydidnt get that. queries.yml and sova regexes will always be in sync after we shift to the new format right?17:46
frenzy_fridayso all regexes in sova will have an entry in queries.yml17:46
weshay|ruckya.. if you add something to queries.yml it will fail if there isn't a match.. in sova-patterns.yml iirc17:47
frenzy_fridayno, it will fail if you dont add a string against which it will be checked by tox. When you add a query tox will automatically convert it for sova and for ER. So all regexes in queries, sova and er will be in sync17:49
weshay|ruckk17:52
bhagyashrisweshay|ruck, wallaby promoted https://trunk.rdoproject.org/centos8-wallaby/current-tripleo/delorean.repo.md5 , reverted criteria file changes17:59
weshay|ruckbhagyashris, thank you!!!!18:00
bhagyashris:)18:00
* bhagyashris out18:00
weshay|ruckfrenzy_friday, have you had any luck w/ wild cards.. for things like build_name?18:01
weshay|ruck   build_name:"tempest-slow"18:01
weshay|rucklike that?18:01
frenzy_fridayweshay|ruck, In the regex?18:02
weshay|rucky18:02
* weshay|ruck messing w/18:03
weshay|ruckmessage:"Could not resolve host:" AND (tags:"console") AND NOT build_name:"openstack-*"18:03
frenzy_fridaySpecial chars in regex should work18:04
frenzy_fridayfor build_name tag I need to check18:04
weshay|ruckeye.. looking at https://www.elastic.co/guide/en/elasticsearch/reference/current/query-dsl-wildcard-query.html#wildcard-query-field-params18:04
frenzy_friday^ this has a different structure.. with more fields18:06
weshay|ruckfrenzy_friday, sent a screenshot w/ something that is working18:07
weshay|ruckquerystring...18:08
weshay|ruckbut I'm not familiar enough w/ the query format.. on how to do that properly18:08
frenzy_fridayooh, ok. build_name wont work now, I'll add it as a param tomorrow.18:08
frenzy_fridayNOT is missing in out structure as well18:09
frenzy_friday*our18:10
weshay|ruckfrenzy_friday, some of these errors we should filter the name... because we don't want hits from jobs we don't care about18:10
weshay|ruckfrenzy_friday, k18:10
weshay|ruckinfra errors... for the most part.. we should filter in.. build_name:"tripleo*" or AND NOT build_name:"openstack-*"18:11
frenzy_fridayok, I'll add build_name param and check how to define NOT in our queries format.18:12
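The query strings being tried above combine a message match, a tag, and an include or exclude filter on `build_name`. A minimal Python sketch for composing such a string is shown below; the `build_query` helper is hypothetical and is not the queries.yml format or the sova tooling. It only assembles text, so whether the backend honors a wildcard inside a quoted `build_name` value is not confirmed here.

```python
def build_query(message, tags=(), include_build=None, exclude_build=None):
    """Compose a Lucene-style query string from the given pieces."""
    parts = [f'message:"{message}"']
    if tags:
        # Multiple tags are OR-ed together inside one parenthesised group
        parts.append("(" + " OR ".join(f'tags:"{tag}"' for tag in tags) + ")")
    if include_build:
        parts.append(f'build_name:"{include_build}"')
    if exclude_build:
        parts.append(f'NOT build_name:"{exclude_build}"')
    return " AND ".join(parts)


print(build_query("Could not resolve host:", tags=["console"], exclude_build="openstack-*"))
print(build_query("Could not resolve host:", include_build="tripleo*"))
```

The first call reproduces the string pasted in the channel; the second is the "filter in" variant that keeps only builds whose name starts with `tripleo`.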
rlandyweshay|ruck: promoting tripleo to component-ci-testing and trying one standalone job to see if we have the right hash18:18
weshay|ruckfrenzy_friday, and.. we need.. "build_status"18:20
*** amoralej|off has quit IRC18:21
*** amoralej|off has joined #oooq18:30
weshay|ruckrlandy, https://review.opendev.org/c/openstack/ansible-role-collect-logs/+/79466418:32
rlandyweshay|ruck: https://review.opendev.org/c/openstack/ansible-role-collect-logs/+/794664 nit on placement18:34
weshay|ruckrlandy, sorin says it needs to be at the top..18:34
weshay|ruckto ensure it's collected18:34
rlandyokie dokie18:35
weshay|ruckso it should not be alphabetical, but ranked by importance18:35
rlandyrevoted18:35
*** jlarriba has joined #oooq18:59
weshay|ruckrlandy, hey.. want to look at 16.2 for a minute w/ me and chat about it moving forward?19:04
rlandyweshay|ruck: ack19:20
rlandyjust looking at log that ran19:20
rlandyfailed deploy19:20
weshay|ruckrlandy, heh.. old overcloud-images fixes the issue w/ ovb19:21
weshay|ruckarxcruz|rover, ^19:22
weshay|ruckfrom 5/1819:22
weshay|ruckat least we can get a proper rpm diff now19:22
rlandyweshay|ruck: k - let's meet19:24
weshay|ruckk19:24
weshay|ruckmeet.google.com/jbk-tntw-quv19:24
rlandy Not found image: https://docker-registry.upshift.redhat.com/v2/tripleorhos-16-2/openstack-swift-account/manifests/a68f2f7c63f98fb923595bdd144ce819"]19:25
*** amoralej|off has quit IRC19:37
weshay|ruckrlandy, https://review.opendev.org/c/openstack/tripleo-quickstart-extras/+/79290419:38
rlandyhttps://review.opendev.org/c/openstack/tripleo-quickstart/+/79281820:08
rlandyhttps://sf.hosted.upshift.rdu2.redhat.com/zuul/t/tripleo-ci-internal/builds?pipeline=openstack-periodic-integration-rhos-17 - not too bad20:25
rlandyoh - the registry is defined in a diff place21:00
weshay|ruckreally?21:01
rlandyyeah - files gets built from tripleo-ansible or something21:02
rlandyhttps://sf.hosted.upshift.rdu2.redhat.com/logs/openstack-periodic-integration-rhos-17/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-rhel-8-multinode-1ctlr-featureset010-rhos-17/278840f/logs/undercloud/etc/containers/registries.conf21:03
rlandywe made a change on container builds21:04
weshay|ruckrlandy, ya.. that file changed iirc w/ containers_tools 2.0 -> 3.021:05
rlandyhttp://git.app.eng.bos.redhat.com/git/openstack/tripleo-ci-internal-jobs.git/tree/zuul.d/tripleo-build-containers.yaml21:05
rlandyhttp://git.app.eng.bos.redhat.com/git/openstack/tripleo-ci-internal-jobs.git/tree/zuul.d/tripleo-build-containers.yaml#n4721:05
rlandyneed to see that that is matched in settings somewhere21:05
rlandyhttp://git.app.eng.bos.redhat.com/git/tripleo-environments.git/tree/config/release/promotion-testing-hash-rhos-17.yml#n19521:07
rlandyit is - but maybe we switch the order21:07
rlandyinteresting how standalone passes21:17
rlandyoh21:17
rlandyI defined that in standalone config21:17
rlandy2021-06-03 16:46:31.953066 | primary | TASK [os_tempest : Execute tempest tests] **************************************21:22
rlandy2021-06-03 16:46:31.953072 | primary | Thursday 03 June 2021  16:46:31 -0400 (0:00:00.048)       0:50:30.484 *********21:22
rlandy2021-06-03 16:46:37.327022 | primary | fatal: [undercloud]: FAILED! => {21:22
rlandy2021-06-03 16:46:37.327416 | primary |     "changed": false,21:22
rlandy2021-06-03 16:46:37.327458 | primary |     "cmd": "set -e\nif [ -d /openstack/venvs/tempest-untagged/bin ];\nthen\n. /openstack/venvs/tempest-untagged/bin/activate\nfi\ntempest run   --concurrency 2   --blacklist-file /home/zuul/tempest/etc/tempest_blacklist.txt   --whitelist-file /home/zuul/tempest/etc/tempest_whitelist.txt > /var/log/tempest/tempest_run.log\n",21:22
rlandy2021-06-03 16:46:37.327481 | primary |     "delta": "0:00:04.887361",21:22
rlandy2021-06-03 16:46:37.327490 | primary |     "end": "2021-06-03 20:46:37.288135",21:22
rlandy2021-06-03 16:46:37.327507 | primary |     "rc": 1,21:22
rlandy2021-06-03 16:46:37.327515 | primary |     "start": "2021-06-03 20:46:32.400774"21:22
rlandy2021-06-03 16:46:37.327523 | primary | }21:22
rlandyha no tempest tests running on scenario00721:22
weshay|ruckoh no21:31
weshay|ruckprobably because we accidentally skipped21:31
rlandyyeah21:32
rlandydigging through that21:32
weshay|ruckrlandy, fyi.. I pointed arx at http://dashboard-ci.tripleo.org/d/3pUqDadGk/tempest-skipped-tests?orgId=121:32
rlandynice21:32
weshay|ruckafter we get through the ovb fire drill hopefully he can restore some tests21:32
weshay|ruckrlandy, do you know what to change to get the registry right?21:37
* weshay|ruck steps away21:39
rlandyweshay|ruck: yep22:20
rlandyunder test22:20
* rlandy going to warehouse22:20
rlandyback later22:20
*** rlandy is now known as rlandy|bbl22:20
*** yamamoto has quit IRC22:45
*** tosky has quit IRC23:00
