Thursday, 2021-07-29

*** ykarel_ is now known as ykarel05:15
*** pojadhav|out is now known as pojadhav05:38
*** marios is now known as marios|ruck05:41
*** amoralej|off is now known as amoralej06:26
zbrsshnaidm: do you have any idea on how to get more infor about the stopped container with podman? the one with no logs? I am afraid that this some kind of ticking bomb for us using the molecule-podman driver as I seen lots of users reporting the same kind of problem. I really want to find a wayt to improve the UX on these failures, making clear what07:04
zbrhappened. If the containers failed to start, create should not report success, and coverage should never start, agree?07:04
sshnaidmzbr, paste "podman inspect container_name"07:05
zbryeah, that is what i did yesterday, any interesting things on https://gist.github.com/ssbarnea/80941778d05dbd9060f6a446a8ca951c ?07:05
zbrthat bash command should create an infinite loop, i doubt is broken considering it was not touched in years07:06
zbrthat is amazing, "state: running" but "podman ps" does not see it, going to try again, just to be sure.07:08
zbrbut how can a container be running, and still have and exitcode=0 ?07:08
sshnaidmzbr, well, the container is running07:12
* zbr trying to reproduce it...07:13
sshnaidmmaybe sudo podman ps?07:13
sshnaidmif you don't see it in ps, something bad happened with your local podman setup07:13
sshnaidmI ran the command just now and see container07:13
zbrnow it started, but gather facts still fails, with interesting error: https://gist.github.com/ssbarnea/57736b3fcd2841b9fbe909d2c6a43c8c07:15
zbrthat is from ansible 2.9.9 (the one on ubuntu)07:16
zbrthat line "/usr/bin/python3.6 && sleep 0" looks weird, it does succeed but we still get setup module failure.07:19
sshnaidmthat's the usual command07:22
sshnaidmand container is still running?07:22
zbryeah07:22
sshnaidmand what happens when you run podman exec centos8 /bin/sh -c "/usr/bin/python3.6 && sleep 0"07:24
zbrreturns 0, no output at all.07:24
zbrcan it have anything to do with the warning above, "Failed to mount container centos8"07:26
sshnaidmno, it's fine07:27
sshnaidmenable -vvvvv07:27
sshnaidm5 v's07:27
sshnaidmas I see you use vfs, maybe it's related..07:28
sshnaidmwith 5 v's you should see rc, stdout, stderr07:28
zbrupdated https://gist.github.com/ssbarnea/57736b3fcd2841b9fbe909d2c6a43c8c with 5v's07:29
zbrit confirms what happens to me, no stdout/err output and exit code 0.07:29
sshnaidmidk, do you have a molecule file to reproduce?07:32
zbryeah, i am now trying to do the same on a totally different system, with fedora 34, just to eliminate some questions.07:33
zbrsshnaidm: https://github.com/openstack/ansible-role-collect-logs/blob/master/roles/collect_logs/molecule/infrared/molecule.yml#L3 -- the only change was to switchi from docker to podman.07:36
zbri get the same behavior on fedora, so is not a broken system or podman.07:36
zbri think found it!07:37
zbralmost sure is https://github.com/openstack/ansible-role-collect-logs/blob/master/roles/collect_logs/molecule/infrared/molecule.yml#L25-L2607:37
zbrthat explains why so many people report this error, as this is a very popular ansible performance optimization07:38
zbrnow I need to find a way to detect if pipelining is enabled at runtime, so I can fail before even trying to gather.07:40
zbransible did mention that is using pipelining but who reads every debug line ;)07:41
sshnaidmyes, we always said pipelining should be disabled in podman connection07:49
sshnaidmidk why to enable it in docker connection as well, tbh07:49
zbrprobably is a leftover from when it was delegated, is not uncommon to see leftovers in molecule files.07:52
zbrnow i am trying to find a ~smart way to detect this at runtime07:53
zbri do not want to force people to disable pipeline but i need to detect if someone sets it wrong for podman containers to prevent that07:53
sshnaidmif [[ pipeline detected ]] then <fail with message> ?07:59
sshnaidmfor podman driver07:59
sshnaidmzbr, any idea how to install additional RPMs in tox jobs?08:02
sshnaidmI need to install collections, but they're not python modules, I can't use requirements file08:02
sshnaidmotherwise linters don't understand "from ansible_collections import ..."08:03
*** ykarel is now known as ykarel|lunch08:24
soniya|rovermarios|ruck, tempest.scenario.test_network_basic_ops.TestNetworkBasicOps.test_connectivity_between_vms_on_different_networks is failing on wallaby, is this a known issue?09:26
marios|rucksoniya|rover: link? 09:28
soniya|roverhttps://logserver.rdoproject.org/openstack-periodic-integration-stable1/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset001-wallaby/384226d/logs/undercloud/var/log/tempest/tempest_run.log.txt.gz09:28
marios|rucksoniya|rover: did you check cix? sounds familiar but not sure09:28
soniya|rovermarios|ruck, on cix we have different test i.e test_port_security_macspoofing_port varies the above one is test_connectivity_between_vms_on_different_networks09:29
marios|rucksoniya|rover: so i don't think we have enough for a bug yet because we only have one example09:30
marios|rucksoniya|rover: it fails twice in the last two runs, https://review.rdoproject.org/zuul/builds?job_name=periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset001-wallaby09:30
soniya|rovermarios|ruck, two failures for today09:31
marios|rucksoniya|rover: the one you point to here is the latest, the one before it is not the sam e09:31
marios|rucksoniya|rover: all sorts failing there https://logserver.rdoproject.org/openstack-periodic-integration-stable1/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset001-wallaby/5c177c6/logs/undercloud/var/log/tempest/stestr_results.html.gz09:31
marios|rucksoniya|rover: so you can run a testproject to get a test of periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset001-wallaby 09:31
marios|rucksoniya|rover: if it reproduces there then we can say we have a bug09:31
marios|rucksoniya|rover: otherwise you can wait for wallaby to run again and see 09:31
marios|rucksoniya|rover: make sense? 09:32
soniya|rovermarios|ruck, okay, thanks :)09:32
*** ykarel|lunch is now known as ykarel09:51
soniya29|rovermarios|ruck, I am going out for half an hour10:54
marios|rucksoniya29|rover: thanks10:58
frenzy_fridayHas anyone faced this: InsecureRequestWarning: Unverified HTTPS request is being made to host 'mirror.kna1.airship-citycloud.opendev.org'. Adding certificate verification is strongly advised. See: https://urllib3.readthedocs.io/en/latest/advanced-usage.html#ssl-warnings ?11:57
frenzy_friday^ https://zuul.opendev.org/t/openstack/build/72738d1c0db8457c90a3c477812b105211:58
marios|ruckfrenzy_friday: not come across that 12:00
chandankumarrlandy: Hello, Does PSI is up now for running testproject?12:02
rlandychandankumar: yep PSI is back12:02
chandankumarrlandy: cool, thanks :-)12:02
rlandymarios|ruck: soniya|rover: hey - saw your notes ... just trying to get downstream in shape12:03
rlandyping if you need me to look upstream12:03
marios|ruckrlandy: ack 12:03
*** chem is now known as Guest269912:12
*** rlandy is now known as rlandy|ruck12:19
*** amoralej is now known as amoralej|lunch12:31
soniya|roverrlandy|ruck, ack12:52
sshnaidmfrenzy_friday, saw something similar with different proxy hosts, but seems like doesn't affect anything?13:00
rlandy|ruckchandankumar: zbr: akahat:  scrum time 13:01
rlandy|ruckhttps://review.rdoproject.org/r/c/rdo-infra/ci-config/+/3433413:07
rlandy|ruckhttps://storage.bhs.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_727/787502/9/check/tox-ansible-test-sanity/72738d1/job-output.txt13:11
chandankumarhttps://review.rdoproject.org/r/c/rdo-infra/ci-config/+/3468813:14
chandankumarhttps://review.rdoproject.org/r/c/rdo-infra/ci-config/+/3427313:14
* zbr still rebooting after an unexpected upgradeā€¦13:16
*** amoralej|lunch is now known as amoralej13:24
marios|ruckakahat: o/ hey can you pleas go make the latest_hashes 100 on the master promoter ? 13:24
marios|ruckakahat: pretty please & cherries? 13:24
marios|ruckakahat: let me check if i can login 13:24
akahatmarios|ruck, okay.13:25
akahatmarios|ruck, are you able to login?13:28
marios|ruckakahat: are you doing that? vim is telling me there is  a sswapfile already 13:29
marios|ruckakahat: Swap file "ci-scripts/dlrnapi_promoter/config_environments/rdo/CentOS-8/.master.yaml.swp13:29
akahatmarios|ruck, yes.. please join tmux session13:29
marios|ruck" already exists!13:29
marios|ruckakahat: ah right thx sec13:29
marios|ruckakahat: thanks :D13:30
rlandy|ruckzbr: <rlandy|ruck> https://storage.bhs.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_727/787502/9/check/tox-ansible-test-sanity/72738d1/job-output.txt13:30
akahatmarios|ruck, restarting service.13:30
akahatmarios|ruck, master is promoting :)13:32
marios|ruckakahat: thank you 13:32
marios|ruckakahat: :D13:32
rlandy|ruckakahat: what number of hashes did you use?13:32
marios|ruckakahat: sorry for fighting you for the cursor on tmux ;)13:32
rlandy|ruckif that changed13:32
marios|ruckrlandy|ruck: 10013:32
rlandy|ruckperfect13:32
akahat10013:32
zbrrlandy|ruck: sanity checks are not expected to pass in initial review, too many changes are needed for a single review. still, i need to fix the infrared issue before considering that ready for review.13:38
rlandy|ruckok13:38
sshnaidmzbr, do you remember how to include "ensure-docker" in job definition? like  roles: zuul: something../ensure-docker13:41
zbrsshnaidm: with some help from codesearch, i supposed is something like https://opendev.org/openstack/cinder/src/branch/master/.zuul.yaml#L15313:44
zbrlikele just `- zuul: zuul-jobs/ensure-docker` ?13:44
sshnaidmoh, forgot "opendev"13:45
zbrclearly we never used it like this, 13:45
sshnaidmI'm trying to use ansible-test with --docker13:45
sshnaidmit should solve problem with packaging13:46
sshnaidmand it doesn't support podman yet13:46
zbri only used it with --venv, w/o containers.13:46
zbrand it worked, mainly underlined a ton of issues within out code13:47
zbrprobably you want to try "tox-ansible", it should make it much easier to run ansible-test.13:47
sshnaidmzbr, the problem is with ansible-test13:48
*** pojadhav- is now known as pojadhav13:50
rlandy|ruckmarios|ruck: you mentioned a few promoter issues in the chat ... anything else needed for promotions?14:26
marios|ruckrlandy|ruck: i think we're good. akahat said master started promoting \o/ got some success with the others too victoria 9d53349f9c48ea9802b5f5f4ed74b243 https://review.rdoproject.org/r/c/testproject/+/34733 train 2155fc11a6d9458255c9c0f28833ea8d https://review.rdoproject.org/r/c/testproject/+/34742 should promote14:35
marios|ruckrlandy|ruck: not sure if we need to increase the fetched hashes to 100 for the other branches too probably should14:35
marios|ruckrlandy|ruck: on the other promoter 14:35
marios|ruckrlandy|ruck: ussuri failed but it should run OK now after we remerged the skip https://review.opendev.org/c/openstack/openstack-tempest-skiplist/+/802874 (https://logserver.rdoproject.org/41/34741/1/check/periodic-tripleo-ci-centos-8-scenario004-standalone-ussuri/7238660/logs/undercloud/var/log/tempest/stestr_results.html.gz) rechecking that 14:36
marios|ruckakahat: around? 14:37
akahatmarios|ruck, yes14:38
akahatmarios|ruck, i'll update it on other promoter as well.14:38
marios|ruckakahat: hey can we do the same (100 hashes) on the other promoter for branches? 14:38
marios|ruckakahat: right14:38
marios|ruckakahat: thank you is there tmux there too? 14:39
marios|ruckakahat: am there 14:40
akahatmarios|ruck, yes. full screen please. :)14:40
marios|ruckakahat: ack 14:40
akahatmarios|ruck, can14:40
marios|ruckakahat: victoria promoting? 14:40
akahatmarios|ruck, can't restart it now.. 14:41
marios|ruckakahat: yup14:41
marios|ruckakahat: thanks for checking!14:41
akahatyes. victoria is promoting14:41
marios|ruckakahat: i am wathcing it on tmux14:41
marios|ruckakahat: ok lets leave till tomorrow14:41
marios|ruckakahat: thank you amol!14:41
akahatmarios|ruck, okay. i"ll update it tomorrow. 14:41
marios|ruckakahat: ack 14:41
marios|ruckrlandy|ruck: ^^ won't switch 100 for other branches now v is promoting, (maybe also T )14:41
marios|ruckrlandy|ruck: bump tomorrow14:41
rlandy|ruckmarios|ruck: ack ok14:43
marios|ruckrlandy|ruck: akahat: nice now i see master running container-push too for 6392edc0eaefb612986de9809a0195ba14:49
marios|ruckfeels_good_man.jpg14:50
akahat\o/14:51
marios|rucktrain too :D14:54
marios|ruckhttps://trunk.rdoproject.org/centos8-train/current-tripleo/delorean.repo.md5 train 2155fc11a6d9458255c9c0f28833ea8d https://review.rdoproject.org/r/c/testproject/+/3474214:54
zbrsshnaidm: got some feedback from ansible core, podman collection must be sure that it has `has_pipelining = False`14:54
sshnaidmhmm.. and how to do it?14:56
zbrhttps://github.com/containers/ansible-podman-collections/blob/ba763e2988c4d3f94940695a5699f26fb94ed31f/plugins/connection/podman.py#L8214:58
zbrapparently buildah has the same setting, not sure if it does suppor it.14:58
*** ykarel is now known as ykarel|away15:50
marios|ruckrlandy|ruck: please add to your reviews https://review.opendev.org/q/topic:upstream-periodic-run-once workflow if happy some have votes but no workflow15:51
marios|ruckzbr: chandankumar: please add to your review queue https://review.opendev.org/q/topic:upstream-periodic-run-once15:52
rlandy|ruckmarios|ruck: ack ... marios - ok by me15:55
rlandy|ruckwant me to vote or w+?15:55
rlandy|ruckboth?15:55
marios|ruckrlandy|ruck: all of the above please15:55
marios|ruckrlandy|ruck: thank you 15:55
marios|ruckchandankumar: zbr: rlandy|ruck: sshnaidm: please add this to your queue when you next have time https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/3475115:57
rlandy|ruckdone15:57
marios|ruckrlandy|ruck: thanks15:58
rlandy|ruckhttps://review.rdoproject.org/zuul/builds?job_name=periodic-tripleo-ci-centos-8-undercloud-upgrade-ussuri15:58
rlandy|ruckresults not great15:58
rlandy|ruckdid something change?15:58
rlandy|ruckone success15:58
marios|ruckrlandy|ruck: yes see https://bugs.launchpad.net/tripleo/+bug/1936825/comments/515:58
marios|ruckrlandy|ruck: the fix merged yesterday and latest periodic run didn't have it, but my testproject did 15:59
rlandy|ruckmarios|ruck: srry missed that15:59
rlandy|ruckvoted15:59
marios|ruckrlandy|ruck: no problem 15:59
marios|ruckrlandy|ruck: thank you 15:59
rlandy|ruckk - getting15:59
rlandy|ruckthere downstream15:59
rlandy|ruckone  promotion on 16.215:59
rlandy|ruckstill working on 1715:59
marios|ruckrlandy|ruck: `nice15:59
rlandy|ruckstill debug issues15:59
marios|ruckrlandy|ruck: so "its back" at least sounds like? 16:00
rlandy|ruckbut want to clear the decks before next program call16:00
marios|ruckrlandy|ruck: i mean psi 16:00
rlandy|ruckcontainer push on 16.2 is broken now though16:00
marios|ruckrlandy|ruck: starting shutdown in a few minutes 16:04
rlandy|ruckmarios|ruck: k - anything on the watch list?16:07
marios|ruckrlandy|ruck: not really looks like the promotion chasing was frutiful ... maybe checkout wallaby but no urgent can do tomorrow it is only 2 days old16:08
marios|ruckrlandy|ruck: nothing then ;)16:08
marios|ruckrlandy|ruck: hope it is a quiet day 16:08
rlandy|ruckok - sure16:08
rlandy|ruckmarios|ruck: hang in there - tomorrow is friday16:08
rlandy|ruckand we get days off!!16:09
marios|ruckrlandy|ruck: yes :D16:09
*** amoralej is now known as amoralej|off16:10
*** marios|ruck is now known as marios|out16:18
rlandy|ruckakahat: hey 17:41
akahatrlandy|ruck, hello17:42
rlandy|ruckhttp://10.0.148.74/promoter_logs/centos8_master.log17:42
rlandy|rucklooks like we have a successful promotion for  6392edc0eaefb612986de9809a0195ba17:42
rlandy|ruckbut that's not the hash in https://trunk.rdoproject.org/centos8-master/current-tripleo/delorean.repo.md517:43
rlandy|ruckis it still busy promoting?17:43
akahatlooking17:43
akahatrlandy|ruck, container promote role failed again for manifest push.17:46
akahathttp://10.0.148.74/promoter_logs/centos8_master_2021-07-29T17:01.log17:46
akahatI'll disable manifest push. And re run it.. it will promote dlrn17:47
rlandy|ruckakahat: thank you17:47
akahatdone.17:52
akahathttps://trunk.rdoproject.org/centos8-master/current-tripleo/delorean.repo.md517:52
rlandy|ruckakahat: thank you, sir17:52
rlandy|ruckakahat: here's another mystery18:14
rlandy|ruckhttps://trunk.rdoproject.org/centos8-victoria/component/tripleo/18:14
rlandy|ruckhttps://trunk.rdoproject.org/centos8-victoria/component/tripleo/current-tripleo/commit.yaml18:15
rlandy|ruckis marked 07/0618:15
rlandy|ruckhttps://trunk.rdoproject.org/centos8-victoria/current-tripleo/ 07/2918:15
rlandy|ruckmaybe we need another promotion18:21
akahatrlandy|ruck, yes. current-tripleo promotion is not happend recently, https://trunk.rdoproject.org/centos8-victoria/component/tripleo/current-tripleo/?C=M;O=D18:24
rlandy|ruckk - rekciking failed jobs18:24
*** sshnaidm is now known as sshnaidm|afk18:30

Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!