Tuesday, 2019-10-08

*** rlandy has quit IRC00:01
*** slaweq has joined #oooq00:11
*** slaweq has quit IRC00:15
*** saneax has quit IRC00:25
*** dsneddon_ has quit IRC00:42
*** dsneddon_ has joined #oooq00:43
*** slaweq has joined #oooq02:11
*** slaweq has quit IRC02:16
*** rnoriega_ has joined #oooq02:19
*** rfolco has quit IRC03:12
*** ykarel has joined #oooq04:08
*** holser has joined #oooq04:10
*** dsneddon_ has quit IRC04:26
*** aakarsh has joined #oooq04:55
*** dsneddon_ has joined #oooq04:58
*** ykarel is now known as ykarel|away05:16
*** marios has joined #oooq05:20
*** slaweq has joined #oooq06:11
*** slaweq has quit IRC06:15
*** dtantsur|afk is now known as dtantsur06:28
*** ccamacho has quit IRC06:35
*** slaweq has joined #oooq06:35
*** jaosorior has quit IRC06:41
arxcruz|roverykarel|away: i'll rerun the scenario003 so we can have a promotion today, only fs020 is running, all the others passes so far06:48
*** holser has quit IRC06:51
ykarel|awayarxcruz|rover, ack, hopefully stein should also promote today06:56
ykarel|awaytill now running good06:56
*** tesseract has joined #oooq07:10
*** jfrancoa has joined #oooq07:14
*** jtomasek has joined #oooq07:21
*** jfrancoa has quit IRC07:23
*** tosky has joined #oooq07:24
*** jaosorior has joined #oooq07:24
*** ccamacho has joined #oooq07:26
toskyuh, did a train promotion happen?07:27
*** ccamacho has quit IRC07:27
*** ccamacho has joined #oooq07:27
arxcruz|rovertosky: nope, i'll run the two failing jobs again, one was timeout, the other fails on tempest, i'm checking the reason07:38
toskyarxcruz|rover: but I've seen a successfull run of the scenario jobs that was failing07:39
arxcruz|rovertosky: waiting for the logs07:40
*** amoralej|off is now known as amoralej07:42
*** jpena|off is now known as jpena07:47
arxcruz|roverykarel|away: https://review.rdoproject.org/r/#/c/23010/ fyi07:54
ykarel|awayack07:56
*** holser has joined #oooq08:16
arxcruz|roverfingers crossed for stein :D08:18
arxcruz|rovertwo more jobs :D08:18
*** holser has quit IRC08:21
*** holser has joined #oooq08:26
*** jaosorior has quit IRC08:32
*** derekh has joined #oooq08:38
ykarel|awayarxcruz|rover, can u also track failure for train phase1:- https://ci.centos.org/view/rdo/view/promotion-pipeline/job/rdo_trunk-promote-train-current-tripleo/08:47
ykarel|awaythe tripleo job failure in ^^08:48
ykarel|awayCould not find or access '/home/jenkins/workspace/tripleo-quickstart-promote-train-current-tripleo-delorean-minimal/config/release/centosci/train-current-tripleo.yml'08:48
ykarel|awayneed a patch similar to https://review.opendev.org/#/c/650259/08:49
arxcruz|roverykarel|away: fs020 stein is running tempest now08:50
ykarel|awayack for stein, can u check ^^ also08:50
arxcruz|roverykarel|away: working on this patch08:52
ykarel|awayarxcruz|rover, Thanks08:52
arxcruz|roverykarel|away: https://review.opendev.org/#/c/687249/08:57
arxcruz|rovermarios: panda ^08:58
arxcruz|roverplease +2 when you guys have chance :) promotion blocker08:58
mariosarxcruz|rover: ack09:02
arxcruz|roveri mean, please, review first :P09:02
ykarel|awayarxcruz|rover, which file u took reference to prepare that?09:03
arxcruz|roverykarel|away: stein09:03
ykarel|awayshould use config/release/centosci/master-current-tripleo.yml as base and adjust for train09:03
mariosarxcruz|rover: ack i couldn't spot something added comment on commit message09:03
mariosykarel|away: that points to master (config/release/centosci/master-current-tripleo.yml its a link)09:03
mariosykarel|away: arxcruz|rover arx i thought you used master at least thats what i compared it to09:04
ykarel|awaymarios, where?09:04
ykarel|awayi don't see it as link09:04
mariosykarel|away: yeah sorry master is a link but master-current-tripleo isnt09:05
ykarel|awaymarios, yup, that should be used as a reference for preparing train-current-tripleo09:05
arxcruz|roverykarel|away: marios give me a few minutes, i'll fix it and open a bug09:05
ykarel|awayi noticed that when i saw reference to ceph-luminous09:05
ykarel|awayarxcruz|rover, ack09:05
pandammmh what happened to distro_ver ?09:08
mariosykarel|away: arxcruz|rover revoted (ykarel indeed the biggest diff is the ceph stuff)09:08
ykarel|awayyes09:08
arxcruz|roverykarel|away: marios done, bug opened, changed based on master09:12
ykarel|awayThanks looks good now09:13
arxcruz|roverykarel|away: periodic-tripleo-ci-centos-7-scenario003-standalone-master passed, only fs020 now missing :)09:15
arxcruz|roverfor master09:15
arxcruz|roverstill waiting stein09:15
ykarel|awayack cool09:16
*** rfolco has joined #oooq09:32
*** jaosorior has joined #oooq09:44
arxcruz|roverykarel|away: fs020 stein fail, i'll execute again, once i check what fails there09:55
ykarel|awayarxcruz|rover, that test(test_delete_saving_image) fails randomly10:07
arxcruz|roverykarel|away: yeah, i'm rerunning10:08
ykarel|awayack10:08
*** ykarel|away has quit IRC10:09
*** slaweq has quit IRC10:09
*** slaweq_ has joined #oooq10:09
arxcruz|roveryolanda: when weshay|ruck we can decide if we can skip this job to get the promotion also10:10
arxcruz|roveryolanda: sorry, wrong person :)10:11
*** jaosorior has quit IRC10:18
*** amoralej is now known as amoralej|lunch11:12
*** slaweq_ is now known as slaweq11:15
*** chem has quit IRC11:16
*** chem has joined #oooq11:19
*** jpena is now known as jpena|lunch11:41
weshay|ruckarxcruz|rover, howdy11:53
arxcruz|roverweshay|ruck: hey boss11:53
arxcruz|roverwant to sync?11:53
weshay|rucksure.. I have 6min11:54
arxcruz|roverlol11:54
weshay|ruckarxcruz|rover, https://meet.google.com/gfz-ybik-uik11:54
arxcruz|rover bt or meet?11:54
weshay|ruckarxcruz|rover, https://review.rdoproject.org/r/#/c/21672/11:59
weshay|ruckarxcruz|rover, master is promoting12:02
weshay|ruckpromoter Running: env ANSIBLE_LOG_PATH=/home/centos/promoter_logs/container-push/20191008-022432.log RELEASE=master COMMIT_HASH=12a897fb9b218be970090996da0e21a82cedda1a DISTRO_HASH=aba8ec542777d218da0a947f2fa3a801fde3696b FULL_HASH=12a897fb9b218be970090996da0e21a82cedda1a_aba8ec54 PROMOTE_NAME=current-tripleo SCRIPT_ROOT=/home/centos/ci-config/ DISTRO_NAME=rhel DISTRO_VERSION=8 ansible-playbook /home/centos/ci-config/ci-scripts/container-p12:03
weshay|ruckush/container-push.yml12:03
weshay|ruckarxcruz|rover, hrm.. maybe that didn't happen.. I'll make it happen12:08
*** jaosorior has joined #oooq12:08
pandarfolco: https://review.rdoproject.org/r/22994 seesm to be stable, the last 18 job runs passed the point with the "connection reset" problem12:14
weshay|ruck2019-10-08 12:13:45,458 12955 INFO     promoter Running: env ANSIBLE_LOG_PATH=/home/centos/promoter_logs/container-push/20191008-121345.log RELEASE=master COMMIT_HASH=12a897fb9b218be970090996da0e21a82cedda1a DISTRO_HASH=aba8ec542777d218da0a947f2fa3a801fde3696b FULL_HASH=12a897fb9b218be970090996da0e21a82cedda1a_aba8ec54 PROMOTE_NAME=current-tripleo SCRIPT_ROOT=/home/centos/ci-config/ DISTRO_NAME=centos DISTRO_VERSION=7 ansible-playbook12:14
weshay|ruck/home/centos/ci-config/ci-scripts/container-push/container-push.yml12:14
weshay|ruckarxcruz|rover, this is promoting ^12:14
rfolcopanda, I still have to adapt the playbook to run as include_from... probably need to move vars to defaults and leave only the tasks directly there...12:26
rfolco2019-10-08 09:53:49.090857 | TASK [Run tripleo-common scenario tests]12:26
rfolco2019-10-08 09:53:49.140313 | rdo-centos-7 | ok12:26
rfolcopanda, it doesn't seem to be running12:27
*** amoralej|lunch is now known as amoralej12:32
pandarfolco: want to sync ?12:34
rfolcopanda, yes12:34
rfolcopanda, https://meet.google.com/bqx-xwht-wky12:35
*** jpena|lunch is now known as jpena12:37
*** rlandy has joined #oooq12:39
weshay|ruckrfolco, panda couple things to review for promoter https://review.rdoproject.org/r/#/c/22998/ https://review.rdoproject.org/r/#/c/22892/12:40
weshay|ruckrfolco, panda btw.. IMHO let's cap the retrospective at 1.5 hours max.. spend the rest of the time w/ promoter team standing up a new node12:41
rlandypanda: hi12:41
weshay|ruckpanda, is about an hour to stand up a new promoter and see where we are at about right?12:41
*** chem has quit IRC12:41
rlandypanda: tests are passing other than the 'failed attempt' test that is failing in all three12:41
rlandymarios: are we sync'ing today?12:42
rlandypanda: ^^ that failed promotion never shows up in the logs12:42
rlandyneed to check that with you12:42
rlandyother than that the tests pass12:42
mariosrlandy: 15:35 < rfolco> panda, https://meet.google.com/bqx-xwht-wky12:43
*** chem has joined #oooq12:43
rfolcorlandy, come to the party12:43
mariosrlandy: impromptu i crashed their party and now i'm inviting friends12:43
mariosi'm _that_ guy12:44
pandarlandy: https://review.rdoproject.org/r/22994 seesm to be stable, the last 18 job runs passed the point with the "connection reset" problem12:50
weshay|ruckpanda, rfolco re: retrospective day.. is that a reasonable request?12:52
rfolcopanda, you'll have to limit retro cards to 1 or 2 max /person12:52
rfolcoweshay|ruck, ^12:52
weshay|ruckrfolco, essentially I would like to skip the card / board review12:53
weshay|ruckin favor of the promoter stand up12:53
rfolcoweshay|ruck, lgtm12:54
arxcruz|roverweshay|ruck: fs020 master pass12:56
weshay|ruckarxcruz|rover, rock on12:56
weshay|ruckpanda, ?12:59
weshay|ruckpanda, if it fails... that's fine.. if it passes that's great.. need you to respond12:59
*** matbu has quit IRC12:59
*** matbu has joined #oooq13:00
arxcruz|roverweshay|ruck: master should be promoted now13:11
pandaweshay|ruck: yes, I'm preparing some patches to make it work, I have a promoter server running, and I'm checking the things that I can13:12
weshay|ruckarxcruz|rover, ya.. it's uploading containers13:14
arxcruz|roverweshay|ruck: ok, cool13:14
*** aakarsh has quit IRC13:14
arxcruz|roverweshay|ruck: will you skip fs020 on stein, or let it go ?13:14
weshay|ruckpanda, ok.. sounds perfect.. I think we can just include anyone who worked on it... but anyone is welcome13:14
weshay|ruckrfolco, please adjust the retro cal invite to 1.5 hr max, and send an additional invite to the promoter time13:14
rfolcoweshay|ruck, additional time = remaining (1h) or extra ?13:15
weshay|ruckrfolco, I have a potential conflict.. let's start promoter stand up at 3pm utc13:16
weshay|rucklet's skip board review13:17
rfolcoweshay|ruck, retro starts 1pm UTC, we cap it at 1.5h, so 2:30pm UTC we can start promoter standup, isn't it ?13:19
weshay|ruckrfolco, 3pm.. I have prod chain council 2-313:21
rfolcoweshay|ruck, we'd have 2 hours for retro then13:22
weshay|ruckrfolco, /me notes.. keep folks on for 3 straight hours.. may be rough13:23
weshay|ruckrfolco,  ur the tc though, up to u13:23
rfolcoweshay|ruck, ok just confirming what you want to do.... I'll shift retro 30min or 1h then13:23
weshay|ruckrfolco, let's retro for 1 - 1.5 hours.. break.. and pickup the install at 3pm imho13:29
rfolcoweshay|ruck, ok just did that, invite sent.... will send invite for promoter stand up next13:30
rfolcow/ same gmeet13:30
rfolcocommunity call starts now at https://meet.google.com/bqx-xwht-wky13:31
rfolcoping marios, sshnaidm, weshay, panda, rlandy, arxcruz, rfolco, chandankumar, zbr, kopecmartin13:31
rfolcoci community call ^13:31
weshay|ruckon a call.. myself13:33
*** chem has quit IRC13:34
*** chem has joined #oooq13:36
weshay|ruckarxcruz|rover, fyi.. this is how we update the status board fyi https://code.engineering.redhat.com/gerrit/18292213:44
weshay|ruckarxcruz|rover, can you make sure scen010 gets into the master / train pipelines and critieria https://review.rdoproject.org/r/#/c/22867/2/zuul.d/standalone-jobs.yaml13:49
arxcruz|roverweshay|ruck: sure, submiting the patch13:50
*** Vorrtex has joined #oooq13:50
arxcruz|roverweshay|ruck: https://review.rdoproject.org/r/#/c/23019/13:56
*** ykarel|awat has joined #oooq13:56
*** ykarel|awat is now known as ykarel|away13:56
weshay|ruckarxcruz|rover, thanks13:57
rlandypanda: still a no show on the failed attempt - see logs ... https://review.rdoproject.org/r/#/c/22958/13:59
rlandyno trace of the 'skipping' in promotion log13:59
pandarlandy: I see now. TO have the skippin g essage we would need to do something different14:03
rlandypanda: even the py27 does not work14:04
rlandypanda: I am going to comment out the failed_attempt check to see that everything else works14:04
weshay|ruckrfolco, you guys still on a call?14:04
rfolcoweshay|ruck, no we dropped.14:04
rfolcoweshay|ruck, we has updates from mikhal and discussed timing w/ ppc folks14:05
rfolcohad*14:05
pandarlandy: the py27 fails even before, if nothing matches that a search returns None, and you're not handling that case14:06
*** aakarsh has joined #oooq14:06
rlandypanda: it should match14:06
rlandyI guess in the py27, is it checking all hashes?14:07
pandarlandy: but to make the Skipping message show, we would need to run the promotion twice, the sequence would be: 1) inject fixtures with a missing vote, 2) run promoter, 3) inject a fake vote to dlrnapi, 4) rerun promotion. At the end the log should have a "Skipping"14:07
weshay|ruckrfolco, just the folks who participated in the promoter work14:08
weshay|rucknot the whole team14:08
rlandypanda: I think we should get the successful promotion check working first14:08
rlandypanda: wrt py27 test,14:08
rlandylet's talk about what's going on there ...14:08
rfolcoweshay|ruck, ok, I thought you would like to disseminate knowledge14:08
rfolcoweshay|ruck, optional is ok?14:09
rlandypanda: "if nothing matches that a search returns None"14:09
weshay|ruckaye14:09
*** aakarsh|2 has joined #oooq14:10
rlandy^^ it should only check if the hashes match the promotion_candidate or the failed_attempt14:10
*** chem has quit IRC14:11
*** aakarsh has quit IRC14:13
*** chem has joined #oooq14:14
*** ykarel|away is now known as ykarel14:15
weshay|ruckarxcruz|rover, when you have a sec.. spot check this http://dashboard-ci.tripleo.org/d/si1tipHZk/jobs-exploration?orgId=1&fullscreen&panelId=914:16
weshay|ruckwe still have too many 0ns14:16
rlandypanda:  I need to take a few minutes of your time to finish this up ... per last results in https://review.rdoproject.org/r/#/c/22958/14:19
weshay|ruckarxcruz|rover, master promoted, stein starting now14:21
arxcruz|roverweshay|ruck: it's to stand up and glorify!14:24
arxcruz|roverright rfolco14:25
rlandypanda: no where does this file : http://logs.rdoproject.org/58/22958/45/check/tripleo-ci-promotion-staging/cdea9f6/logs/stage-info.yaml have faied_attempt defined14:26
rlandyfailed_attempt14:26
pandarlandy: ready to chat14:29
rlandypanda: https://meet.google.com/dqh-ordn-wiw14:31
*** chem has quit IRC14:36
*** chem has joined #oooq14:38
rfolcoarxcruz|rover, :)14:52
arxcruz|roverweshay|ruck: 2019-10-08 14:49:59,101 24426 ERROR    promoter Command '[u'env', u'ANSIBLE_LOG_PATH=/home/centos/promoter_logs/container-push/20191008-142114.log', u'RELEASE=stein', u'COMMIT_HASH=355abb693dbb43fa429939e494a33e362075f1f8', u'DISTRO_HASH=647b08e47a72f7533142c09074f227628f08f9fa', u'FULL_HASH=355abb693dbb43fa429939e494a33e362075f1f8_647b08e4', u'PROMOTE_NAME=current-tripleo',14:55
arxcruz|roveru'SCRIPT_ROOT=/home/centos/ci-config/', u'DISTRO_NAME=centos', u'DISTRO_VERSION=7', u'ansible-playbook', u'/home/centos/ci-config/ci-scripts/container-push/container-push.yml']' returned non-zero exit status 214:55
arxcruz|roverfail for stein14:55
weshay|ruckarxcruz|rover, ya.. one failed.. not uncommon, /me rekicks14:57
*** aakarsh|2 has quit IRC15:01
*** aakarsh|2 has joined #oooq15:01
*** ykarel is now known as ykarel|away15:01
rfolcopanda, "Could not find or access '/tmp/stage-info.yaml'15:04
rfolcorlandy, ^15:04
rfolcothat file should exist, right?15:04
rlandyack - see the logs on my patch - fixed15:04
rlandyrfolco: ^^15:05
rfolcorlandy, did you rebase on panda's patch for unicorn ?15:05
rlandyyes15:05
rfolcoso I can rebase on yours then15:05
rfolcook15:05
rlandyrfolco: wait15:05
rfolcothx15:05
rfolcowhat?15:05
rlandyjust spoke to panda - removing the one negative test15:05
rlandyputting in a new patch that should pass15:06
rlandygive me a few15:06
rfolcorlandy, ok I am in tc mtg, pls ping when its ready to rebase on15:06
weshay|ruckarxcruz|rover, we need to chat w/ the infra folks about the rhel 8 images being used.. selinux should be permissive out of the image https://bugs.launchpad.net/tripleo/+bug/184728215:07
openstackLaunchpad bug 1847282 in tripleo "rhel 8 tripleo Destination directory /etc/modules-load.d does not exist" [Critical,In progress]15:07
arxcruz|roverweshay|ruck: ack15:12
weshay|ruckarxcruz|rover, https://review.opendev.org/68733015:20
arxcruz|roverweshay|ruck: but it should be on the image right ?15:20
weshay|ruckarxcruz|rover, /me was wondering how other teams would use that.. should we check the distribution or if the command exists?15:20
weshay|ruckor let ignore_errors handle it..15:20
arxcruz|roverweshay|ruck: ignore_errors is better15:21
arxcruz|roverless checking15:21
arxcruz|roverif it fails means, there's no selinux, the collect logs should not fail because of that right ?15:21
weshay|ruckarxcruz|rover, ya.. the collect logs playbook should be ignore_errors15:23
weshay|ruckarxcruz|rover, the playbook is not in the role though.. so another team could pick up the role.. and have a ton of failures15:24
weshay|ruckarxcruz|rover, hrm... https://github.com/openstack/tripleo-quickstart-extras/blob/master/playbooks/collect-logs.yml15:26
weshay|ruckarxcruz|rover, what the duece https://github.com/openstack/tripleo-quickstart-extras/blob/master/playbooks/collect-logs.yml#L2215:26
arxcruz|roverweshay|ruck: workigng on it15:28
weshay|ruckhttp://paste.openstack.org/show/782108/15:28
weshay|ruckPLAY RECAP *********************************************************************15:29
weshay|rucklocalhost                  : ok=41   changed=32   unreachable=0    failed=0    skipped=78   rescued=0    ignored=715:29
weshay|ruckovercloud-controller-0     : ok=59   changed=55   unreachable=0    failed=0    skipped=42   rescued=0    ignored=915:29
weshay|ruckovercloud-controller-1     : ok=59   changed=55   unreachable=0    failed=0    skipped=42   rescued=0    ignored=915:29
weshay|ruckovercloud-controller-2     : ok=59   changed=55   unreachable=0    failed=0    skipped=42   rescued=0    ignored=915:29
weshay|ruckovercloud-novacompute-0    : ok=58   changed=54   unreachable=0    failed=0    skipped=43   rescued=0    ignored=815:29
weshay|ruckundercloud                 : ok=70   changed=57   unreachable=0    failed=0    skipped=57   rescued=0    ignored=715:29
weshay|ruckanyone know where ignore_errors is set on collect logs these days?15:29
arxcruz|roverweshay|ruck: should be in all include tasks no ?15:30
weshay|ruckarxcruz|rover, ah.. https://opendev.org/openstack/ansible-role-collect-logs/src/branch/master/tasks/collect.yml#L315:30
weshay|ruckphew15:30
arxcruz|roverweshay|ruck: hmmm, it's not in the whole file, just https://opendev.org/openstack/ansible-role-collect-logs/src/branch/master/tasks/collect.yml#L3-L2015:31
arxcruz|roverweshay|ruck: we might need to add ignore in the includes as well15:31
weshay|ruckarxcruz|rover, https://review.opendev.org/68733515:35
arxcruz|roveror add a note :)15:35
weshay|ruckarxcruz|rover, https://opendev.org/openstack/ansible-role-collect-logs/src/branch/master/tasks/collect/system.yml#L315:36
weshay|ruckhttps://opendev.org/openstack/ansible-role-collect-logs/src/branch/master/tasks/collect/monitoring.yml#L315:36
weshay|ruckarxcruz|rover, you wrote this bit didn't you?15:36
weshay|rucklolz15:36
weshay|ruckmemory SHOT!15:36
arxcruz|roverweshay|ruck: that's what I was wondering now15:36
arxcruz|roverlol15:36
weshay|ruckarxcruz|rover, ok. next question... where is the node defs.. for the rhel8 ovb stack15:44
weshay|ruckarxcruz|rover, do you know?15:44
weshay|ruckor the dib15:44
arxcruz|rovernope15:44
arxcruz|roveri'm very low on rhel knowledge15:45
arxcruz|roverrfolco: ? ^15:45
weshay|ruckarxcruz|rover, we may need to update our image build scripts15:46
weshay|ruckI think that is the issue actually15:47
*** kopecmartin is now known as kopecmartin|off15:47
weshay|ruckarxcruz|rover, https://github.com/openstack/tripleo-ci/blob/master/roles/oooci-build-images/templates/build-images.sh.j215:48
weshay|ruckarxcruz|rover, we need to add a dib element to disable selinux some where around https://github.com/openstack/tripleo-ci/blob/master/roles/oooci-build-images/tasks/main.yaml#L2415:49
arxcruz|roverweshay|ruck: i don't think so, because the overcloud build image is called to create the image, the line 24 already have the image created15:51
weshay|ruckarxcruz|rover, ya.. we build the images in the promotion pipeline you gooose15:52
weshay|ruck:)15:52
weshay|ruckit uses those scripts15:52
arxcruz|roverweshay|ruck: yes, i understand15:52
arxcruz|roverbut thhe build-images.sh just call the overcloud build image15:53
weshay|ruckhttps://meet.google.com/jtp-kxij-guy15:53
weshay|ruckarxcruz|rover, we need to add https://docs.openstack.org/diskimage-builder/2.6.1/elements/selinux-permissive/README.html15:54
*** marios has quit IRC15:57
weshay|ruckarxcruz|rover, diskimage-builder/diskimage_builder/elements/selinux-permissive15:58
arxcruz|roverweshay|ruck: https://github.com/openstack/tripleo-ci/blob/fedc702d84b8fb7ba7c7b2cc8b77f44f1363b537/roles/oooci-build-images/templates/pathfix_repos.sh.j215:59
weshay|ruckhttps://github.com/openstack/tripleo-ci/blob/master/roles/oooci-build-images/templates/build-images.sh.j2#L716:00
arxcruz|roverweshay|ruck: https://github.com/openstack/tripleo-ci/blob/master/roles/oooci-build-images/tasks/main.yaml#L2416:01
*** dtantsur is now known as dtantsur|afk16:03
*** tesseract has quit IRC16:08
weshay|ruckopenstack overcloud image build --type overcloud-full \16:09
weshay|ruck    --builder-extra-args selinux-permissive16:09
weshay|ruckarxcruz|rover, ^16:09
arxcruz|roverweshay|ruck: rhel_image_source16:10
arxcruz|roverweshay|ruck: https://review.opendev.org/#/c/687345/16:10
weshay|ruckarxcruz|rover, related-bug: https://bugs.launchpad.net/tripleo/+bug/184728216:20
openstackLaunchpad bug 1847282 in tripleo "rhel 8 tripleo Destination directory /etc/modules-load.d does not exist" [Critical,In progress]16:20
rfolcorlandy, panda do you remove stage-info somewhere after your tests run ?16:26
pandarfolco: not anymore16:26
pandarfolco: now it's left for collection16:26
pandarfolco: correction16:26
pandarfolco: we copy it before it's removed16:27
rfolcopanda, Could not find or access '/tmp/stage-info.yaml'16:27
rlandypanda: rfolco: fingers crossed that the current patch will work and we can merge16:27
rlandythat will get you the stage-info you need16:27
pandarlandy: logs ?16:27
rlandygetting16:28
pandarlandy: sorry, I meant rfolco16:29
rlandylol16:29
rfolcohttp://logs.rdoproject.org/88/22988/14/check/tripleo-ci-promotion-staging/2b4b115/job-output.txt16:29
*** rfolco is now known as not_rlandy16:29
*** not_rlandy is now known as rfolco16:29
*** bogdando has quit IRC16:29
rfolcopanda, the idea was to use the playbook exactly like in the molecule test, without any change in the tasks so i can just include tasks_from16:31
pandarfolco: there should be nothing there that deletes /tmp/stage-info.yaml16:33
rlandypanda: rfolco: https://review.rdoproject.org/r/#/c/22958/ - we're green16:34
pandarfolco: "message": "Could not find or access '/tmp/stage-info.yaml' on the Ansible Controller16:34
pandarlandy: you're trying to include it in localhost ?16:34
pandarlandy: \o/16:34
rfolcohahaha16:34
rfolcolol16:35
rfolcono, I'm not16:35
pandarlandy: the images test ir RED16:35
rfolco rdo-centos-716:35
rfolcopanda, ^16:35
rfolcorlandy, looks like marios test is broken now16:36
rlandypanda: that wasn't red before?16:38
pandarlandy: I suggest you remove any reference to failed_attempt16:38
rlandylooking16:38
pandarlandy: it's probably confusing the test, and we are not going to use it anyway.16:39
rlandyfixing16:39
pandarlandy: sorry, I sneaked thos jobs in again, they were not running before, but I was sorried womthing like this would happen.16:41
pandathe stagin environment affects every other jobs , so they should run16:42
rlandypanda: np - updated16:43
pandarlandy: rfolco the real-life scenario works at least to the point when we run the promoter itself. I'm not sure how I can test further without interfering with the other promoter16:44
rlandypanda: one thing at a time16:44
pandarfolco: really weird16:49
rfolcopanda, do I need to include remote_src ?16:49
rfolco[gather stage info] step above just works16:49
rfolcowhy include_vars doesn't ?16:50
rfolcopanda, will test w/         remote_src: yes16:50
pandarfolco: include_vars takes vars only from the ansible controller16:51
*** derekh has quit IRC16:52
pandarfolco: it's wokring with the other test becaus they use localhost16:52
rfolcopanda, ok I am officially declaring war against de-duplication efforts. I'll duplicate the whole thing.16:54
rfolcoto get things done16:55
rfolcomolecule is molecule, zuul is zuul. period.16:56
pandarfolco: if you copy the file locally, you should be fine17:01
pandarfolco: no wait, it's already done, I copy it in /home/{{ promoter_user }}17:01
rlandypanda: so marios test is still failing ... the only relevant change left is https://review.rdoproject.org/r/#/c/22958/54/ci-scripts/dlrnapi_promoter/tests/staging-setup/fixtures/scenario-1.yaml17:03
rlandywhich is required17:03
rlandy assertion: previous_current_tripleo.stat.islnk is defined17:04
rlandycorrect it's not17:04
rlandywould need to change his test17:04
pandarlandy: required ? rlandy https://github.com/rdo-infra/ci-config/blob/master/ci-scripts/infra-setup/roles/promoter/molecule/promote-images/playbook.yml#L4217:07
pandait's checkign for current-tripleo17:08
rlandywell that was what was defined before17:09
rlandythe next lime is dependent on that17:10
rfolcoif remote_src does not work, will get it from promoter_user home dir, got it panda17:11
pandarlandy: it should check tripleo-ci-staging-promoted17:15
rlandypanda: I'll update - let's see17:17
arxcruz|roverweshay|ruck: am i missing something here? https://review.rdoproject.org/r/#/c/22800/1/zuul.d/tripleo.yaml17:22
arxcruz|roverthe patch was merged17:23
arxcruz|roverweshay|ruck: do you want the ovb job ?17:27
weshay|ruckarxcruz|rover, ya.. don't confuse scenario 001 w/ fs00117:28
weshay|ruckarxcruz|rover, we need the ovb job17:28
weshay|ruckthat's the only place where selinux is getting us atm17:28
*** ccamacho has quit IRC17:29
weshay|ruckarxcruz|rover, before you sign off for the day.. I'm not clear what the status of https://trello.com/c/DvgaBFim/1104-cixlp1843259tripleociproa-periodic-rocky-fs020-job-fails-tempest-tests-tempestscenariotestsecuritygroupsbasicopstestsecuritygrou is17:29
weshay|ruckplease update17:29
arxcruz|roverweshay|ruck: same as yesterday, the patch was merged to skip the test17:29
arxcruz|roverwe had a promotion17:29
arxcruz|roverthe root cause isn't fixed yet17:29
*** Vorrtex has quit IRC17:32
*** jpena is now known as jpena|off17:32
weshay|ruckarxcruz|rover, perfect.. thanks17:33
arxcruz|roverweshay|ruck: https://review.rdoproject.org/r/#/c/23022/17:34
weshay|ruckarxcruz|rover, I think that is triggering because of https://github.com/rdo-infra/review.rdoproject.org-config/blob/master/zuul.d/tripleo.yaml#L54617:38
weshay|ruckor https://github.com/rdo-infra/review.rdoproject.org-config/blob/master/zuul.d/tripleo.yaml#L54517:38
weshay|ruckarxcruz|rover, tbh.. not 100% sure17:39
weshay|ruckarxcruz|rover, merged yours17:39
arxcruz|roverok17:39
*** amoralej is now known as amoralej|off17:47
*** chem is now known as chem|eod17:49
rlandypanda: rfolco: https://review.rdoproject.org/r/#/c/22958/ - legit green now17:53
pandarlandy: SHOP IT!17:53
pandaooopp17:54
pandaahah17:54
rlandyI'm always u for shopping17:54
rlandypanda: I need to merge the patches underneath that17:55
rlandypanda: ok to merge https://review.rdoproject.org/r/#/c/22994/17:56
pandarlandy: yep17:56
rlandygoing17:56
pandarfolco: sorry for the wrong suggestion17:57
pandarfolco: /home/zuul/stage-info is still on the host, not the executor17:57
rfolcopanda, include_vars also does not have remote_src looks like17:57
rfolcopanda, hopefullt delegate_to works18:01
pandarfolco: it will not, the easiest way at this point is to fetch the file with the fetch module, then use include_vars on the fetched file18:03
rfolcopanda, ok thx for the suggestion18:04
rlandyoh come on gate18:05
pandarfolco: if that doesn't work, the only other wasy is to cat the file from shell and register the output , then set_fact: stage_info: {{ registered_var | from_yaml }} but the vairables will all be under stage_info18:07
rfolcopanda, ack18:07
rlandypanda: thanks for w+'ing patch - updating cards now with negative test cases required18:18
*** holser has quit IRC18:19
rlandyrfolco: panda: put all the promoter test cards in 'QE' column except https://tree.taiga.io/project/tripleo-ci-board/task/1284?kanban-status=144727518:24
rlandy^^ adding negative test requirements18:24
rlandywe may want to complete that card and move the requirements to another card on the next sprint18:24
rfolcopanda, "msg": "Accessing files from outside the working dir /var/opt/rh/rh-python35/lib/zuul/builds/139b330387624563be7052f1ba44da31/work is prohibited",18:27
rfolcopanda, finding a solution18:28
pandarfolco: that dir is referenced in zuul.work_dir variable18:28
rfolcook will give it a try18:28
pandarfolco: sorry, zuul.executor.work_dir18:29
rfolcopanda, this?18:30
rfolco      args:18:30
rfolco        chdir: "{{ zuul.executor.work_dir }}"18:30
pandarfolco: yes18:30
rfolcothx18:30
*** ksambor has joined #oooq18:56
*** ksambor has quit IRC18:56
*** ykarel|away has quit IRC19:00
mjturekdoes anyone know what this file is? https://github.com/rdo-infra/review.rdoproject.org-config/blob/master/ci-scripts/tripleo-upstream/dlrnapi_venv.sh#L319:01
mjturekthe dlrnapi_venv19:02
weshay|ruckmjturek, dlrnapi creds19:03
weshay|ruckrfolco, ^19:03
mjturekweshay|ruck: can we assume that they are in the cico nodes?19:03
* mjturek is tracing through get-hash.sh in an attempt to catch any gotchas19:04
rfolcomjturek, your jenkins job should be able to load env vars.... DLRNAPI_PASSWORD for ex19:04
weshay|ruckrlandy,  when you have a moment https://review.opendev.org/#/c/687361/19:05
mjturekyep, but this script seems to reference a venv file that I'm not positive exists in cico19:06
mjturekoh shoot nvm19:08
mjturekI'm dumb.19:08
rfolcohmm that one is venv dir19:08
mjturekit's just creating a venv19:08
rfolcoyes19:08
mjtureksorry rfolco should be fine lol19:09
rfolcomy answer was not right, I thought you were asking for shell env vars like dlrnapi_password19:09
rfolcomjturek, np19:09
*** holser has joined #oooq19:34
weshay|ruckrfolco, rlandy https://review.opendev.org/#/c/687361/19:36
weshay|ruckthanks!19:36
rlandyweshay|ruck: anything else while I am here?19:37
weshay|ruckrlandy, no get out of here.. have an easy fast :)19:38
rlandyweshay|ruck: not yet :)19:38
weshay|ruckoh maybe one more then https://review.opendev.org/#/c/687330/19:40
weshay|ruckrfolco, ^19:40
weshay|ruckhttps://openstack.fortnebula.com:13808/v1/AUTH_e8fd161dc34c421a979a9e6421f823e9/zuul_opendev_logs_cb8/687330/2/check/tripleo-ci-centos-7-containers-multinode/cb83085/logs/undercloud/var/log/extra/selinux.txt.gz19:40
weshay|ruckhttp://logs.rdoproject.org/30/687330/2/openstack-check/tripleo-ci-centos-7-ovb-1ctlr_1comp-featureset001/4a52dff/logs/overcloud-controller-0/var/log/extra/selinux.txt.gz19:41
weshay|ruckoh dang.. my follow up patch was never sent19:41
weshay|ruckthat's ok19:41
*** jtomasek has quit IRC19:58
rlandyweshay|ruck: did ovb jobs hit this wget problem? https://sf.hosted.upshift.rdu2.redhat.com/logs/periodic-hourly/code.engineering.redhat.com/openstack/tripleo-ci-internal-jobs/master/periodic-tripleo-ci-centos-7-bm_envA-3ctlr_1comp-featureset001-master/ba65b5c/logs/undercloud/home/zuul/overcloud_image_build.log.txt.gz20:03
rlandy2019-10-08 08:15:55 | /home/zuul/overcloud_image_build_script.sh: line 20: wget: command not found20:03
* weshay|ruck looks20:04
rlandyfixing bm20:04
rlandydoesn't look like it20:05
weshay|ruckrlandy, we're running the image build job internally?20:05
rlandybm is20:05
weshay|ruckwget looks like a legit problem20:05
rlandyapprently20:06
weshay|ruckrlandy, I thought bm built the image like ovb?20:06
rlandyidk how no other job hits this20:06
rlandyweshay|ruck: it should20:06
rlandyunless bm is missing a setting20:06
weshay|ruckoh.. sorry.. that IS a bm job..20:07
* weshay|ruck keeps looking20:07
weshay|ruckrlandy, we'd be smart to change that to curl20:07
rlandyweshay|ruck: ack I just want to know why bm is the only one afflicted20:08
rlandydoesn't make sense20:08
weshay|ruckrlandy, wget is not installed https://sf.hosted.upshift.rdu2.redhat.com/logs/periodic-hourly/code.engineering.redhat.com/openstack/tripleo-ci-internal-jobs/master/periodic-tripleo-ci-centos-7-bm_envA-3ctlr_1comp-featureset001-master/ba65b5c/logs/undercloud/var/log/extra/rpm-list.txt.gz20:09
weshay|ruckso that's a legit fail20:09
weshay|ruckya.. rlandy maybe nodepool installs wget20:09
rlandyhttps://github.com/openstack/tripleo-quickstart-extras/blame/master/roles/build-images/templates/overcloud-image-build.sh.j220:09
* weshay|ruck checks20:09
rlandythat code is 5 months old20:09
rlandyso nothing new there20:09
weshay|ruckrlandy,  http://logs.rdoproject.org/openstack-regular/opendev.org/openstack/tripleo-ci/master/tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001/b2b2a5d/logs/undercloud/var/log/extra/rpm-list.txt.gz20:10
weshay|ruckrlandy, so it's installed on the undercloud by default20:10
rlandywget-1.14-18.el7_6.1.x86_6420:10
weshay|ruckhttp://logs.rdoproject.org/openstack-regular/opendev.org/openstack/tripleo-ci/master/tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001/b2b2a5d/logs/undercloud/var/log/yum.log.txt.gz20:10
rlandyweird20:11
weshay|ruckand it's not installed by us ^20:11
weshay|ruckya.. weird20:11
weshay|rucklet's switch it to curl20:11
rlandyI saw it once before20:11
rlandybut now it's consistent20:11
weshay|ruckrlandy, it could be another package had wget a dep at some point20:11
weshay|ruckand that was removed20:11
weshay|ruckis it just master20:11
rlandyweshay|ruck: request to spent some time downstream this sprint20:11
weshay|ruck?20:11
weshay|ruckgranted20:11
rlandyweshay|ruck: bring up osp 1620:12
rlandyclean up20:12
weshay|ruck+120:12
rlandyadd to cockpit20:12
rlandybe done here going solo20:12
rlandyit's stein I think20:12
rlandychecking20:12
rlandyyep - stein as well20:13
rlandyneed to bring up train here20:13
rlandyanyways ... now I do have to go20:13
weshay|ruckrlandy, https://opendev.org/openstack/tripleo-quickstart-extras/src/branch/master/roles/baremetal-undercloud/packages/defaults/main.yml#L720:14
rlandyweshay|ruck: ok - will pick this up in next sprint ... have an easy fast ... good new year etc.20:14
rlandyyeah - but this is virt undercloud20:14
weshay|ruckthanks :) OH20:14
pandarfolco: drop me an email if I can continue something tomorrow morning20:14
weshay|ruckrlandy,20:15
weshay|ruckk20:15
rlandyyeah?20:15
weshay|ruckaccident ping20:15
rfolcopanda, I am still struggling with loading files that were generated locally in the host controller20:15
weshay|ruckrlandy, :) take care20:15
rfolcopanda, trying to move stage-info to executor root, only from there I can fetch20:16
*** rlandy has quit IRC20:16
rfolcopanda, will update you on where I stop20:16
*** ksambor has joined #oooq20:23
*** ksambor has quit IRC20:23
*** dsneddon_ is now known as dsneddon20:26
*** aakarsh|3 has joined #oooq20:31
*** holser has quit IRC20:33
*** aakarsh|2 has quit IRC20:33
*** aakarsh|3 has quit IRC20:38
*** holser has joined #oooq20:43
*** holser has quit IRC20:45
*** holser has joined #oooq20:47
*** slaweq has quit IRC20:59
*** jbadiapa has quit IRC21:03
*** holser has quit IRC21:18
*** saneax has joined #oooq23:08
*** tosky has quit IRC23:10
*** aakarsh has joined #oooq23:55

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!