Monday, 2018-06-04

*** d0ugal has joined #oooq00:01
hubbotFAILING CHECK JOBS on stable/ocata: gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001-ocata @ https://review.openstack.org/56429100:29
*** weshay has quit IRC01:30
*** weshay has joined #oooq01:36
*** hamzy has quit IRC01:56
*** hamzy has joined #oooq01:56
*** atoth has quit IRC01:57
hubbotAll check jobs are working fine on stable/queens, stable/ocata, stable/ocata, master.02:29
*** EmilienM has quit IRC03:22
*** EmilienM has joined #oooq03:22
*** EmilienM has joined #oooq03:22
*** jaganathan has joined #oooq03:33
*** jaganathan has quit IRC03:34
*** udesale has joined #oooq03:54
*** jaganathan has joined #oooq04:04
*** tcw has quit IRC04:15
*** tcw has joined #oooq04:17
hubbotAll check jobs are working fine on stable/ocata, stable/ocata, master, stable/queens.04:29
*** holser__ has joined #oooq04:42
*** holser__ has quit IRC04:52
*** pgadiya has joined #oooq05:07
*** pgadiya has quit IRC05:07
*** hamzy has quit IRC05:12
*** quiquell|off is now known as quiquell05:33
*** udesale_ has joined #oooq05:35
*** marios has joined #oooq05:36
*** marios has quit IRC05:36
*** udesale has quit IRC05:37
*** marios has joined #oooq05:38
*** links has joined #oooq05:38
*** marios has quit IRC05:49
*** marios has joined #oooq05:49
*** udesale__ has joined #oooq06:16
*** kopecmartin has joined #oooq06:16
*** udesale_ has quit IRC06:18
*** pgadiya has joined #oooq06:25
*** pgadiya has quit IRC06:25
hubbotAll check jobs are working fine on stable/ocata, stable/ocata, master, stable/queens.06:29
*** holser__ has joined #oooq06:37
*** ccamacho has joined #oooq06:40
*** ssbarnea has quit IRC06:43
*** ssbarnea has joined #oooq06:44
*** pgadiya has joined #oooq06:58
*** pgadiya has quit IRC06:58
*** bogdando has joined #oooq06:59
*** zoli|gone is now known as zoli07:01
*** zoli is now known as zoli|wfh07:01
*** zoli|wfh is now known as zoli07:01
*** saneax has joined #oooq07:05
*** jbadiapa has joined #oooq07:06
*** sshnaidm has joined #oooq07:10
*** ratailor has joined #oooq07:14
*** jtomasek has joined #oooq07:15
*** tesseract has joined #oooq07:16
quiquellsshnaidm: Welcome back !07:17
*** quiquell is now known as quiquell|afk07:19
*** yolanda_ has joined #oooq07:20
*** yolanda has quit IRC07:23
*** holser__ has quit IRC07:24
*** saneax has quit IRC07:25
*** saneax has joined #oooq07:26
*** tosky has joined #oooq07:29
*** amoralej|off is now known as amoralej07:31
*** holser__ has joined #oooq07:36
sshnaidmquiquell|afk, I'm  mostly off this week :) in Brno now, but will poke here07:38
*** sshnaidm is now known as sshnaidm|brq07:38
quiquell|afksshnaidm|brq: Ahh ok07:43
*** quiquell|afk is now known as quiquell07:43
quiquellsshnaidm|brq: back to lonely mornings :-(07:44
sshnaidm|brqquiquell, why? where is everybody? :)07:44
*** gkadam has joined #oooq07:44
quiquellsshnaidm|brq: Good one :-)07:45
*** tesseract-RH has joined #oooq07:45
sshnaidm|brqquiquell, thanks for working on grafana, looks amazing07:46
quiquellsshnaidm|brq: I have focused on alarms07:46
quiquellsshnaidm|brq: RDO guys have discover our toy, for alarms they use sensu07:47
*** tesseract has quit IRC07:48
quiquellsshnaidm|brq: you can ask for the alarms to the ruck-rover-alert, in the IRC channel07:48
*** tesseract-RH has quit IRC07:48
*** tesseract has joined #oooq07:49
sshnaidm|brqquiquell, yeah, I thought to use their sensu.. they have it in #rdo-dev07:50
sshnaidm|brqneed to check it with them07:50
quiquellsshnaidm|brq: Yep, don't like grafana alerts too much, you have to hardcode too much07:50
quiquellLet's just use it with thresholds and put the alerts in sensu07:51
sshnaidm|brqagree07:54
quiquellsshnaidm|brq: #rdo-dev is the one at freenode, a don't see too much people there07:55
sshnaidm|brqquiquell, it's only for alerts07:57
sshnaidm|brqquiquell, like tripleo-ci07:57
quiquellsshnaidm|brq: Do it make sense to install a sensu to play around in our ruck-rover sandbox ?07:57
sshnaidm|brqquiquell, seems like that07:58
quiquellwould like to do the grafan witout the constraints of adding alerts07:58
sshnaidm|brqquiquell, worth to check sensu configs on rdo07:58
sshnaidm|brqmaybe it's simple enough just to use it..07:58
quiquellsshnaidm|brq: will take  look07:59
*** holser__ has quit IRC08:04
*** holser__ has joined #oooq08:04
*** jaosorior has joined #oooq08:05
*** gkadam has quit IRC08:05
*** gkadam has joined #oooq08:10
*** jfrancoa has joined #oooq08:28
hubbotAll check jobs are working fine on stable/ocata, stable/ocata, master, stable/queens.08:29
quiquellsshnaidm|brq: Interesting https://rdo.fsn.ponee.io/thread.html/71eba78db3681759d19cc0a7b561726b6cc86632ade5315d484faf6b@%3Cdev.lists.rdoproject.org%3E08:37
quiquellarxcruz|ruck: Do you have the access info of myoung promoter ?09:07
arxcruz|ruckquiquell: yes i do, i think i sent to you by mail no ?09:23
quiquellarxcruz|ruck: Can't access my pub key is not in the server, will wait for myoung09:25
arxcruz|ruckquiquell: gimme your key09:25
quiquellssh-rsa AAAAB3NzaC1yc2EAAAADAQABAAABAQC9dblk/9GGZQmklr0TPcJtgG8c5ikgG3nXj/iAahtIVHjT0jailjvtdspidJnySb5jbJOK6O0654hLaIIqxTxiBu4PwdwrSXbLzk00yZCQNk2+F4aGz3IybMX2DZsPf0ByQ7LC3EcV9q1lLLNVXnzyZMez2+pNuGFNvbvaOpX+5Tgl8lDgcdu05VK8ooWhiFjwkJ3D1+zlszDmJBmwgElHh81SqMtF2SpRB5L4sMvliIjOP59Ie/i21QmBrLzCW1p4I8xPQc5cgDU6Rdn0D8DbbhzoCpRBSw7NQh/9YKxffwmwIlJ5oF7OqSRk2ja9Ktwexnlhq9F//84iFfBGpB7b ellorent@redhat.com09:26
*** jfrancoa has quit IRC09:50
quiquellarxcruz|ruck: Can you join to #tripleo-ci ? Want to show you womething09:57
*** sai- has joined #oooq10:20
*** sai_ has quit IRC10:20
*** udesale__ has quit IRC10:23
saneaxfolks on a rdo cloud ovb deploy, facing the certmonger service start error - https://bugs.launchpad.net/tripleo/+bug/177094410:25
openstackLaunchpad bug 1770944 in tripleo "CI: centos.ci: certmonger service fails while installing undercloud" [Critical,Triaged] - Assigned to Sagi (Sergey) Shnaidman (sshnaidm)10:25
*** hamzy has joined #oooq10:27
hubbotAll check jobs are working fine on stable/queens, stable/ocata, stable/ocata, master.10:29
*** zoli is now known as zoli|lunch10:36
*** tesseract has quit IRC10:41
*** tesseract has joined #oooq10:44
*** holser__ has quit IRC10:59
sshnaidm|brqsaneax, I think it should be resolved with newer certmonger package maybe?11:02
saneaxsshnaidm|brq, I am using certmonger-0.78.4-3.el7_5.1.x86_6411:04
sshnaidm|brqsaneax, try asking in #tripleo , it doesn't seems like oooq problem11:07
sshnaidm|brqsaneax, maybe jaosorior knows more11:07
saneaxsure, thanks sshnaidm|brq11:07
*** quiquell is now known as quiquell|lunch11:25
*** amoralej is now known as amoralej|lunch11:31
*** jbadiapa has quit IRC11:39
*** atoth has joined #oooq11:44
*** zoli|lunch is now known as zoli|wfh11:49
jaosoriorsshnaidm|brq, saneax it's a certmonger issue. arxcruz|ruck tried to submit a patch for it in puppet-certmonger. But it hasn't merged yet11:51
jaosoriorhere it is https://github.com/saltedsignal/puppet-certmonger/pull/2011:51
jaosoriorI tried pinging the maintainer but there's no answer yet11:51
*** rfolco has joined #oooq11:55
*** udesale has joined #oooq11:57
saneaxjaosorior, is it possible to install undercloud on rhel 7.5 with this issue ?11:59
saneaxis there a hack?12:00
arxcruz|rucksaneax: the problem is:12:00
arxcruz|ruckbefore undercloud install, there's an undercloud update/upgrade package12:00
arxcruz|ruckthat brings the latest dbus12:01
arxcruz|ruckthat according developers, need a reboot to work properly12:01
arxcruz|rucki notice a restart into dbus service fix the problem, but they said that's how it's gonna be12:01
arxcruz|rucknot a bug, a feature lol12:01
arxcruz|ruckso, after the rpm upgrade, the dbus service is in a bad state12:01
arxcruz|ruckso certmonger that depends on dbus, fails to start12:01
arxcruz|rucknot only that, any service depending on dbus fails to start12:02
saneaxarxcruz|ruck, thanks for the info12:03
*** jbadiapa has joined #oooq12:07
saneaxarxcruz|ruck, can you point me the specific dbus issue please?12:08
*** trown|outtypewww is now known as trown12:09
weshayarxcruz|ruck, how are you sir? How is the ruck/rovering going.. need anything?12:12
weshaypanda, top of the morning to ya irishman12:14
*** quiquell|lunch is now known as quiquell12:14
weshayhey quiquell :)12:17
quiquellweshay: welcome back !12:17
weshaythank you12:17
weshayrfolco, you have a few minutes today to sync w/ me?12:17
rfolcoweshay, sure. welcome back :)12:18
rfolcoweshay, just tell me what time works best12:18
*** amoralej|lunch is now known as amoralej12:19
arxcruz|ruckweshay: hey boss, i'm good and you? hope you enjoy vacation12:21
weshayrfolco, k.. thanks man. I sent an invite12:21
arxcruz|ruckweshay: everything is green :)12:21
weshayarxcruz|ruck, ya.. all is good here12:21
weshayarxcruz|ruck, I saw :) very nice12:21
arxcruz|ruckall phases, all branches12:21
weshayarxcruz|ruck, obviously something must be broken then :P12:21
chkumar246arxcruz|ruck: kopecmartin any one wants to be QE for this one https://trello.com/c/pLrKDqWt/789-make-python-tempestconf-backward-compatible?12:22
kopecmartinchkumar246, sure, you can add me12:22
arxcruz|rucksaneax: the dbus bug is https://bugzilla.redhat.com/show_bug.cgi?id=156912212:23
openstackbugzilla.redhat.com bug 1569122 in instack-undercloud "Undercloud installation fails with "Execution of '/bin/getcert list' returned 1: Error org.freedesktop.DBus.Error.TimedOut"" [High,New] - Assigned to jslagle12:23
chkumar246arxcruz|ruck: https://review.rdoproject.org/r/#/c/14023/ -> once merged let me know if it breaks any job12:23
chkumar246in tripleo ci12:23
arxcruz|ruckweshay: please, don't say that12:23
saneaxthanks arxcruz|ruck12:23
toskychkumar246: are those the only requirements for 2.0.0? No more refactoring?12:23
toskyjust to set expectation (iirc we discussed it last week)12:23
*** apetrich has quit IRC12:23
arxcruz|ruckchkumar246: can you test it before ?12:24
*** apetrich has joined #oooq12:24
arxcruz|ruckweshay: welcome back boss, when you have time, let me know, i would like to talk with you :)12:24
chkumar246tosky: no we have to complete refactoring as discussed.12:24
chkumar246tosky: rdo trunk should align with tempestconf master, if it's break, we can fix it12:25
toskychkumar246: oki, I asked because the card says "Once the card is done, let's create a new release of python-tempestconf"12:25
weshayarxcruz|ruck, ok.. how about after the ci-escalation mtg12:26
arxcruz|rucksure12:27
*** holser__ has joined #oooq12:27
chkumar246arxcruz|ruck: few jobs will pass for sure, but not sure which will break12:28
hubbotAll check jobs are working fine on stable/queens, stable/ocata, stable/ocata, master.12:29
arxcruz|rucksaneax: are you working on the upgrades job right?12:31
arxcruz|ruckanywhere between the install and upgrade, if you restart the dbus, you problems are solved :)12:31
saneaxarxcruz|ruck, not quite12:32
saneaxthis was completely new deploy with master tag12:33
saneaxbut i will try with restart of dbus12:33
*** rlandy has joined #oooq12:34
chkumar246arxcruz|ruck: keep an eye on rdo tempestconf patches, let me know if you donot get the changes12:35
*** rlandy is now known as rlandy|rover12:35
saneaxyes restart of dbus fixed the certmonger issue arxcruz|ruck12:36
saneaxundercloud deploy is going ahead12:36
saneaxthanks for your help12:37
rlandy|rovermyoung: I'd like to turn the old promoter on and turn your server off12:40
chkumar246arxcruz|ruck: kopecmartin once we unpin master, feel free to get your oooq-extras patches ready for merging12:41
chkumar246master of tempestconf12:42
arxcruz|ruckchkumar246: i'm not confortable with that unpin before a DNM patch testing it...12:47
chkumar246arxcruz|ruck: do we have tripleo experimental job running against rdoinfo? As we cannot test rdoinfo changes against upstream ci.12:48
arxcruz|ruckchkumar246: we can get a dummy patch on python-tempestconf and run the job setting from git, or depends on ?12:49
rlandy|rover!gatestatus12:49
openstackrlandy|rover: Error: "gatestatus" is not a valid command.12:49
chkumar246arxcruz|ruck: it does not work for Depends on: <rdo patch> in upstream patch.12:49
arxcruz|ruckchkumar246: yeah, but we get latest tempestconf, that works with depends-on12:50
chkumar246arxcruz|ruck: got that12:54
quiquelltrown, panda: Do we need this https://github.com/openstack-infra/tripleo-ci/blob/master/toci_quickstart.sh#L98 ?12:58
weshayrlandy|rover, howdy12:59
rlandy|roverweshay: welcome back!!12:59
* rlandy|rover though weshay was out until wednesday??12:59
rlandy|roverarxcruz|ruck; so you want to switch over roles or stay as we are?13:00
rlandy|roverarxcruz|ruck: just logged this ... https://bugs.launchpad.net/tripleo/+bug/1774990 - looking into it13:01
openstackLaunchpad bug 1774990 in tripleo "[queens promotion] RDO phase 2 baremetal env E jobs are failing to deploy the overcloud" [High,Triaged] - Assigned to Ronelle Landy (rlandy)13:01
trownquiquell: we shouldnt if the undercloud upgrade job is using the script13:01
weshayrlandy|rover, can you join my blue for a quick sync ruck/rover13:02
rlandy|roveryep13:03
weshaythanks13:03
quiquelltrown: Going to clean this up, then.13:04
quiquellto have less sh... to debug13:04
quiquellHave a zuul question, does the zuul.d jobs of a Depends-On get executed ?13:20
quiquellmyoung: Are you there ?13:38
myoungquiquell: yup13:39
myoungrlandy|rover: ack13:39
myoungrlandy|rover: ready to flip the switch?13:39
quiquellmyoung: Have add a telegraf to sol, to monitor the promoter13:39
myoung%gatestatus13:39
hubbotAll check jobs are working fine on stable/queens, stable/ocata, stable/ocata, master.13:39
myoungquiquell: ahh, cool.  we are about to turn it off and turn back on the other one it appears13:39
quiquellmyoung: Will add telegraf to the other when you have it.13:40
myoungweshay: welcome back13:43
arxcruz|ruckmyoung: Mr. Young13:47
sshnaidm|brqmyoung, rlandy|rover don't we have downstream gates running?13:47
arxcruz|ruckyou can call me Mr. Old13:47
rlandy|roversshnaidm|brq: rhos012 gates should be running again13:48
myoungsshnaidm|brq: as of last week we had the rhos-12 gates running13:48
rlandy|rovertq was runnin13:48
myoungrhos-13 gates are defined but have a few issues13:48
rlandy|roverwe just enabled tqe today13:48
rlandy|rovermyoung: yep - we are ready to flip the switch - arxcruz|ruck  will work with you on it13:49
sshnaidm|brqis there a patch I can see rhos gates runs on it?13:51
sshnaidm|brq... and pass13:52
weshaymyoung, thanks man13:52
myoungarxcruz|ruck: could you please update https://bugs.launchpad.net/tripleo/+bug/1770860 with details heading into returning to the tripleo-infra instance?  have we root caused what's going on there?13:53
openstackLaunchpad bug 1770860 in tripleo "tracker-bug: network lag in tripleo-infra tenant prevents container promotions" [Critical,Triaged]13:53
quiquellmyoung, arxcruz|ruck: ping when promoter-server is up and running13:53
arxcruz|ruckmyoung: no root cause :(13:54
myoungarxcruz|ruck: is it still experiencing 30 seconds to log in, 3 years to wget a small text file?13:54
myoung:)13:54
myoungarxcruz|ruck: let's chat after scrums...13:55
arxcruz|ruckweshay: ^13:55
arxcruz|ruckok13:55
myoungo/ all - tripleo-ci standup in 513:55
quiquellHave to restart the laptop13:58
*** links has quit IRC14:10
hubbotAll check jobs are working fine on stable/queens, stable/ocata, stable/ocata, master.14:29
*** hamzy has quit IRC14:36
*** saneax has quit IRC14:37
EmilienMrlandy|rover: hi it's me again :D14:40
rlandy|roverEmilienM: hey there - what's up?14:40
EmilienMCI folks: please look https://review.openstack.org/#/c/571529/ (sent on ML yesterday) - I thought someone from CI would review it but it was missed probably, just let me know if any problem14:40
EmilienMrlandy|rover: I need a reproducer again14:41
EmilienMon the same pathc14:41
myoungquiquell: this is what was talking about https://review.rdoproject.org/r/#/c/14027/14:41
EmilienMrlandy|rover: we made progress but now have another failure14:41
EmilienMlet me link the script14:41
EmilienMhttps://logs.rdoproject.org/16/566916/11/openstack-check/gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset035-master/Zb3297f9fb4a44a10b34f7afa1b9e860d/reproducer-quickstart.sh14:42
EmilienMif you can :-)14:42
rlandy|roverEmilienM: ok - I'll set it and ping you with the undercloud ip - we should also look through the errors you get on running a reproducer at some point :)14:43
bogdandoo/ do you know something of 'Failure prepping block device., Code: 500' %%14:43
bogdando^^14:43
bogdandoI have this when reproing that patch14:43
EmilienMrlandy|rover: can you show me how you reproduce? can you share your openrc file (without password)14:43
EmilienMso I can look what's different14:43
rlandy|roveryep14:45
EmilienMrlandy|rover: do we have the same kind of tenant?14:46
quiquellmyoung: Nice the promoter stuff sure we can put something of this in the ruck/rover cockpit14:46
EmilienMrlandy|rover: maybe I have less quotas :P14:46
*** quiquell is now known as quiquell|off14:46
rlandy|roverEmilienM: http://pastebin.test.redhat.com/59950614:46
rlandy|roverlast time you reported an issue with the key being missing14:47
rlandy|roverEmilienM: we all have the same tenants - except weshay14:47
rlandy|roverhe can access the CI quotas14:47
rlandy|roverbut I can't14:47
EmilienMok let me try to reproduce14:48
EmilienMweshay has more quotas? ppfffttt14:48
rlandy|roverEmilienM; setting it up on my tenant as well14:49
EmilienMhe's always on PTO, that's unfair14:49
rlandy|roverhe's back - careful14:49
EmilienMoh hey wes how are you14:50
EmilienMbogdando, rlandy|rover: my stack is deployed14:55
EmilienMand quickstart is running14:55
EmilienM:-O :-O :-O14:56
EmilienMI just used rlandy|rover's openrc14:56
rlandy|roverstep 1 - you'll need the zuul change included14:56
EmilienMhow, I ran with -a14:56
EmilienMI shouldn't do that probably14:56
bogdandowoot woot14:56
rlandy|roveryou can edit tripleo-quickstart in /opt/stack14:57
EmilienMok14:57
rlandy|roverI have one running as well - w/o -a14:57
rlandy|roverI included the change to start14:57
rlandy|roverwill see if that works14:57
EmilienMok I just updated /opt/stack/tripleo-quickstart/config/general_config/featureset035.yml14:58
EmilienMrlandy|rover: thanks again! I guess I can continue alone from here14:58
EmilienMand stop using your time/resources :-)14:58
rlandy|roverEmilienM: cool - in the mean time, I added your key to zuul@38.145.33.10 - I'll watch to see if the change is included there14:59
EmilienMok14:59
weshayrfolco, ping15:01
weshayrfolco, let's chat15:01
*** tesseract-RH has joined #oooq15:01
*** tesseract has quit IRC15:02
myoungchkumar246, kopecmartin, arxcruz|ruck, weshay: tempest squad standup/scrum/sync in 26 min, please update cards if not already done15:04
rfolcoweshay, give me 5 min, lunch15:06
*** sshnaidm|brq has quit IRC15:10
*** jbadiapa has quit IRC15:15
*** hamzy has joined #oooq15:21
myoungarxcruz|ruck, rlandy|rover: are either of you free at 1pm EDT (5pm UDT) for promoter massaging?  Do you want/need to sync on this?  You both have access to cipromo@sol.redacted.com, HTH if you need/want it.15:22
arxcruz|ruckmyoung: i do have access, but i don't know exactly what i need to do15:23
myoungarxcruz|ruck: ack, we have tempest scrum in 6 mins, then bug triage.  after *that* I can assist.  between now and then, if you want to log into the promoter in tripleo-infra, and just do a few tests of pulling containers from docker.io and the RDO registry, curling a few files, determine if it still takes 30-60 sec to connect via ssh, etc...that would be a good starting point15:24
myoungsee if we have basic networking or if we're still in a state of "you can't get there from here"15:25
rlandy|rovermyoung: on meeting15:26
myoungarxcruz|ruck, rlandy|rover: if I had to shoot from the hip and propose something, I  think it would be be a good idea to spin up a new VM on the tenant, give it more than 2 cores (like we have now), use overlay2 FS driver (https://docs.docker.com/storage/storagedriver/overlayfs-driver), and enable verbose logging in the dockerd configuration so we can understand what's going on there.15:27
myoungarxcruz|ruck, rlandy|rover, alternate proposal is to turn it on and see what happens lol.  but I don't think we've addressed any of the root issue(s) so same inputs, likely same outputs?15:28
rlandy|rovermyoung: can chat later - talking with rasca now15:28
myoungrlandy|rover: ack, i don't have cycles till after 1pm EDT anyway15:28
myoung:)15:28
rlandy|rover1 pm is fine15:28
arxcruz|ruckmyoung: okay, let's spin a new vm15:28
arxcruz|ruckrlandy|rover: myoung here's is almost 6pm15:29
myoungchkumar246, kopecmartin, arxcruz|ruck, weshay: tempest squad scrum starts shortly, https://etherpad.openstack.org/p/tripleo-tempest-squad-meeting, https://bluejeans.com/705085945515:29
*** dtrainor has joined #oooq15:31
*** ratailor has quit IRC15:32
myoungweshay / arxcruz|ruck, coming or shoudl we start now?15:32
arxcruz|ruckcomming15:33
*** rfolco_ has joined #oooq15:36
*** marios has quit IRC15:37
*** marios_ has joined #oooq15:37
*** rfolco has quit IRC15:37
*** marios_ is now known as marios15:37
*** bogdando has quit IRC15:38
*** bogdando has joined #oooq15:40
*** bogdando has quit IRC15:46
EmilienMrlandy|rover: for the record, my OS_IDENTITY_VERSION was set on '3' instead of '2', I suspect it was the reason why my stack was failing.16:05
rlandy|roverrasca++16:06
hubbotrlandy|rover: rasca's karma is now 116:06
rlandy|rovernice work on that backport16:07
rlandy|roverEmilienM: happy you found the issue16:07
rascarlandy|rover, thanks for your help there!16:07
rlandy|roverneed to submit the review to include a tqe/tq change16:07
*** gkadam has quit IRC16:08
*** gkadam has joined #oooq16:08
chkumar246myoung: kopecmartin I have closed and updated few bugs related to tempest from that query16:14
chkumar246i think we need to a sprint to get it cleared16:15
*** sshnaidm|brq has joined #oooq16:16
*** sanjay__u has quit IRC16:19
myoungchkumar246: ack16:23
*** chkumar246 is now known as chandankumar16:24
*** trown is now known as trown|lunch16:27
hubbotAll check jobs are working fine on stable/ocata, stable/ocata, master, stable/queens.16:29
*** udesale has quit IRC16:30
*** tesseract-RH has quit IRC16:34
*** holser__ has quit IRC16:38
*** tosky has quit IRC16:47
*** kopecmartin has quit IRC17:00
* EmilienM loves the green on https://dashboards.rdoproject.org/rdo-dev17:19
* EmilienM sends kudos to people here17:19
*** zoli|wfh is now known as zoli|gone17:32
*** zoli|gone is now known as zoli17:32
*** jaganathan has quit IRC17:39
*** amoralej is now known as amoralej|off17:45
*** trown|lunch is now known as trown17:54
EmilienMrlandy|rover: FYI I don't need your env, I've reproduce the env myself and good news fixed the bug I had (FYI, it was https://review.openstack.org/#/c/572151/)18:06
rlandy|roverEmilienM: good to know - thanks18:19
rfolco_myoung, ping18:25
hubbotAll check jobs are working fine on stable/ocata, stable/ocata, master, stable/queens.18:29
myoungrfolco_:  what's up18:34
rfolco_myoung, DoD not clear18:34
rfolco_https://github.com/openstack-infra/tripleo-ci/blob/master/zuul.d/multinode-jobs.yaml#L8318:34
rfolco_upgrade job has fs051, ok, goal achieved18:34
rfolco_but it run on experimental pipeline only18:34
rfolco_myoung, should we move this job to check/gate, or even 3rd party pipelines ?18:35
myoungrfolco_: sorry was multitasking.  which card?18:39
* myoung reloads state/context18:39
rfolco_https://trello.com/c/Ji8RaoHy/776-ci-job-create-keystone-only-full-upgrade-undercloud-overcloud-new-job?menu=filter&filter=label:Sprint%2014%20CI18:39
rfolco_myoung, ^18:39
myoungrfolco_: it was my understanding that for this the DoD was that it was running as nonvoting, triggering on changes to tq/tqe/tu.  if it's running now in experimental only, then we're not done...18:41
myoungrfolco_: i've also been in BJ for the past nearly 5 hours straight so I could have wires crossed...need to drop to get some lunch18:43
rfolco_myoung, :)18:43
rfolco_thanks18:43
*** myoung is now known as myoung|lunch18:43
*** dtrainor has quit IRC18:47
*** dtrainor has joined #oooq18:48
*** quiquell|off has quit IRC19:03
rlandy|roverarxcruz|ruck: looking at https://rhos-dev-jenkins.rhev-ci-vms.eng.rdu2.redhat.com/job/tqe-gate-rhos-12-ci-rhos-ovb-minimal-pacemaker-public-bond/ - looks like the role failure was just one job - the current running jobs are passed thga t19:12
rlandy|roverthat19:12
arxcruz|ruckrlandy|rover: okay, cool :) one problem less :)19:12
rlandy|roverarxcruz|ruck: yeah - we aren't short of problems so one less is better19:13
rlandy|roverrasca: hey - looking at the failures here ... https://review.openstack.org/#/c/572155/19:16
rlandy|roveryou still on line?19:16
rlandy|roverotherwise I will update19:16
*** jaosorior has quit IRC19:19
rlandy|roverhow do I get access to https://registry.rdoproject.org?19:34
rlandy|roverarxcruz|ruck: ^^ do you have access?19:34
arxcruz|ruckrlandy|rover: i don't19:34
rlandy|roverneed to see progress on registry19:35
arxcruz|ruckchandankumar: do you know ?19:35
rlandy|roverI don;t see any updates on docker yet19:35
rlandy|roverI think apevec probably can19:35
rlandy|rovercurrent-tripleo19:36
rlandy|rover374 MB19:36
rlandy|rover14 hours ago on docker19:36
rlandy|roverprocess is still running19:36
arxcruz|ruckrlandy|rover: using the password from the script doesn't work ?19:37
rlandy|roverno - I think you haveto auth against your login19:37
rlandy|roverthere is no opportunity to enter a password19:37
*** holser__ has joined #oooq19:41
*** myoung|lunch is now known as myoung19:57
myoungrlandy|rover: queens promotion is done, and I've killed the promoter on sol19:58
rlandy|rovermyoung: thank you19:58
rlandy|roverwaiting on the master one19:58
myoungaccess to the rdo registery is herre: https://console.registry.rdoproject.org/19:59
rlandy|rovermyoung: I get auth failed19:59
myoungrlandy|rover, arxcruz|ruck, and afaik is controlled / acl'd by membership to https://github.com/orgs/rdo-infra/people19:59
myoungI think we need to have tripleo-ci memebers added to that group, or we need the auth on the RDO side to be able to include us via some other mechanism20:00
rlandy|rovermyoung: thanks - will ask apevec when he is on line20:00
rlandy|rover2018-06-04 17:58:08,765 16603 INFO     promoter Promoting the container images for dlrn hash 3a65b17da7b98c83dbfda432af88bb56d3501de9 on master to current-tripleo20:00
myoungrlandy|rover: what can be done in the meantime is CLI access via either docker commands directly, or the openshift command line, using the creds on the promoter in the secrets file20:00
rlandy|roveris all I follow atm20:01
myoungto track status, on the promoter instance can "sudo docker images | grep 3a65b17da7b98c83dbfda432af88bb56d3501de9 to track it's progress downloading / uploading images to/from rdo and docker.io20:01
myoungi think it would also be helpful to enable verbose(er) dockerd logging as well20:02
* myoung will capture additional ideas in LP rfe's20:03
rlandy|rovernothing there20:19
rlandy|rover"sudo docker images | grep 3a65b17da7b98c83dbfda432af88bb56d3501de9"20:19
rlandy|roverlatest is from 7 days ago20:20
rlandy|roverprocess is still running20:20
*** atoth has quit IRC20:20
rlandy|roverafaict the process is still running but I don't see any pushes going on20:23
rlandy|roverno updates since 018-06-04 17:58:08,765 16603 INFO     promoter Promoting the container images for dlrn hash 3a65b17da7b98c83dbfda432af88bb56d3501de9 on master to current-tripleo20:23
rlandy|roverweshay: ^^20:29
rlandy|roverdo you see something going on that I don't?20:29
weshayrlandy|rover, it may be downloading from the rdo registry20:29
* weshay looks20:29
hubbotAll check jobs are working fine on stable/queens, stable/ocata, stable/ocata, master.20:30
weshayhubbot++20:30
hubbotweshay: hubbot's karma is now 120:30
weshayrlandy|rover, I suspect it's here https://github.com/rdo-infra/ci-config/blob/master/ci-scripts/container-push/container-push.yml#L7820:32
weshaya lot of that can be removed w/ the oc client20:32
rlandy|roverthat would be nice20:32
weshayrlandy|rover, also20:33
weshaydam20:33
rlandy|roverI'll request access to https://registry.rdoproject.org:8443 tomorrow20:33
weshay\/var/lib/docker is at 100%20:33
rlandy|roverI am auth denied20:33
rlandy|roverclean that20:33
weshay?20:34
weshaysudo su -?20:34
rlandy|rover?20:35
rlandy|rovercan we get rid of manual_promotion?20:37
weshaywe can just stop it20:38
weshayrlandy|rover, are you on it?20:38
weshaytrown, myoung have you guys ever had to clean up the local containers on the promoter?20:44
myoungweshay: I have not, are we getting close to full on the disk?20:45
weshaymyoung, 80gb is full20:46
myoungweshay: afaik the promoter should be cleaning up after itself, unless it's failed or killed20:46
* myoung logs in and looks20:46
myoungwe might need to run a prune20:46
weshaymyoung, I did.. nothing came up20:46
myoungomg it's so nice to log in in ~2 sec :)20:47
trownI also thought it cleaned up after itself20:47
myoungoof...we might need to flip back to other promoter and clean this up...it appears we're super low on space and even docker calls are just hanging..."20:52
* myoung looks deeper20:52
myoung/dev/vdb1        80G   80G  173M 100% /var/lib/docker20:52
myoungahh that's why...on tmux already had commands running :)20:53
* myoung is experimenting locally with things here https://lebkowski.name/docker-volumes/20:57
myoungweshay, trown, testing out cleanup options on the other promoter (now not runnign anything)21:00
myoung^^ big hammer running now... "docker rmi $(docker images -a -q)"21:01
myoung(on sol)21:01
myoungweshay: interesting...looks like since we ahve containers from multiple repos (rdo, docker) that have shared layers, we might be running into this (silently) when attempting to remove...21:04
myoungError response from daemon: conflict: unable to delete d5381dcd3b00 (must be forced) - image is referenced in multiple repositories21:04
myoung"docker rmi --force $(docker images -a -q)" just ripped thru very quickly and obliterated 16G of old images21:05
*** trown is now known as trown|outtypewww21:06
myoungweshay: ^^21:06
*** jfrancoa has joined #oooq21:07
*** holser___ has joined #oooq21:07
* myoung watches space free up21:08
*** jfrancoa has quit IRC21:09
*** holser__ has quit IRC21:10
myounglooking better, up to 7g freed up21:12
myoungweshay, rlandy|rover, trown|outtypewww, so I guess https://github.com/rdo-infra/ci-config/blob/master/ci-scripts/container-push/container-push.yml#L157 is not doing what we thought w.r.t. actually freeing up the space for all these layers21:15
myounggot it...this is why21:16
myoungWhen absent an image will be removed. Use the force option to un-tag and remove all images matching the provided name.21:16
myounghttp://docs.ansible.com/ansible/latest/modules/docker_image_module.html :: state flag21:16
* myoung makes a patch21:16
*** holser___ has quit IRC21:17
myoungweshay, rlandy|rover, trown|outtypewww: https://review.rdoproject.org/r/14048 promoter: Use force parameter with removing images21:21
*** holser__ has joined #oooq21:30
*** myoung is now known as myoung|off21:31
*** holser___ has joined #oooq21:38
*** holser__ has quit IRC21:41
*** holser___ has quit IRC21:53
rlandy|roverweshay: going to kick promoter.sh21:57
rlandy|roverwoohoo - some action on runk.registry.rdoproject.org/tripleomaster/centos-binary-cinder-volume                3a65b17da7b98c83dbfda432af88bb56d3501de9_dba0473522:04
rlandy|rover18% /var/lib/docker22:12
rlandy|roverfills up quickly22:12
*** tcw has quit IRC22:14
*** tcw has joined #oooq22:15
hubbotAll check jobs are working fine on stable/queens, stable/ocata, stable/ocata, master.22:30
rlandy|rover3a65b17da7b98c83dbfda432af88bb56d3501de9_dba0473522:35
rlandy|rover278 MB22:35
rlandy|rover7 minutes ago22:35
rlandy|roverupdated22:35
*** sshnaidm|brq has quit IRC22:55
rlandy|roverweshay: still around?23:00
rlandy|roverweshay: https://review.rdoproject.org/r/#/c/14048/ - looks like a good shot to me - thoughts?23:10
weshayrlandy|rover, /me looks23:14
rlandy|roverscript is moving along nicely now23:14
weshayrlandy|rover, ah good23:14
weshayrlandy|rover, that is worth a shot :)23:15
rlandy|roverweshay: merge it?23:15
weshayaye23:15
rlandy|roverweshay: ok - I think I will need to clean up after this run23:15
rlandy|roverbut will watch it23:16
rlandy|roveralready tagging on docker.io23:16
rlandy|roverweshay: looks pretty clean - should we re-enable the cron?23:54
rlandy|roverqueens running now23:55

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!