Wednesday, 2018-07-04

*** rlandy has quit IRC00:04
*** sai- has left #oooq00:17
*** yolanda_ has joined #oooq00:29
*** yolanda__ has quit IRC00:31
*** yolanda__ has joined #oooq00:55
hubbotFAILING CHECK JOBS on stable/queens: tripleo-ci-centos-7-scenario000-multinode-oooq-container-upgrades, tripleo-ci-centos-7-scenario001-multinode-oooq-container @ https://review.openstack.org/567224, stable/ocata: legacy-tripleo-ci-centos-7-ovb-1ctlr_1comp_1ceph-featureset024-ocata @ https://review.openstack.org/564291, master: tripleo-ci-centos-7-scenario001-multinode-oooq-container, legacy-tripleo-ci-centos-7 (1 more message)00:57
*** yolanda_ has quit IRC00:58
*** yolanda_ has joined #oooq01:06
*** yolanda__ has quit IRC01:08
*** yolanda__ has joined #oooq01:23
*** yolanda_ has quit IRC01:26
*** ykarel|away has joined #oooq01:56
*** ykarel|away has quit IRC02:02
hubbotFAILING CHECK JOBS on stable/queens: tripleo-ci-centos-7-scenario000-multinode-oooq-container-upgrades, tripleo-ci-centos-7-scenario001-multinode-oooq-container @ https://review.openstack.org/567224, stable/ocata: legacy-tripleo-ci-centos-7-ovb-1ctlr_1comp_1ceph-featureset024-ocata @ https://review.openstack.org/564291, master: tripleo-ci-centos-7-scenario001-multinode-oooq-container, legacy-tripleo-ci-centos-7 (1 more message)02:57
*** sanjayu__ has joined #oooq03:09
*** jaganathan has joined #oooq03:09
*** skramaja has joined #oooq03:09
*** udesale has joined #oooq03:46
*** ykarel|away has joined #oooq03:47
*** ykarel|away is now known as ykare03:47
*** ykare is now known as ykarel03:47
ykarelis there known issue with promoter, all required jobs passing but there is no promotion in master/queens Tripleo and RDO phase 104:05
ykarelchkumar|ruck, sshnaidm|rover weshay ^^04:05
*** ccamacho has joined #oooq04:33
*** yolanda_ has joined #oooq04:55
*** pgadiya has joined #oooq04:57
*** pgadiya has quit IRC04:57
hubbotFAILING CHECK JOBS on stable/ocata: legacy-tripleo-ci-centos-7-ovb-1ctlr_1comp_1ceph-featureset024-ocata @ https://review.openstack.org/564291, master: tripleo-ci-centos-7-scenario001-multinode-oooq-container, legacy-tripleo-ci-centos-7-container-to-container-upgrades-master @ https://review.openstack.org/560445, stable/queens: tripleo-ci-centos-7-scenario000-multinode-oooq-container-upgrades, tripleo-ci- (1 more message)04:57
*** yolanda__ has quit IRC04:58
*** yolanda__ has joined #oooq05:05
*** yolanda_ has quit IRC05:09
*** yolanda__ has quit IRC05:09
*** ratailor has joined #oooq05:12
chkumar|ruck%gatestatus05:27
hubbotFAILING CHECK JOBS on stable/ocata: legacy-tripleo-ci-centos-7-ovb-1ctlr_1comp_1ceph-featureset024-ocata @ https://review.openstack.org/564291, master: tripleo-ci-centos-7-scenario001-multinode-oooq-container, legacy-tripleo-ci-centos-7-container-to-container-upgrades-master @ https://review.openstack.org/560445, stable/queens: tripleo-ci-centos-7-scenario000-multinode-oooq-container-upgrades, tripleo-ci- (1 more message)05:27
*** chkumar|ruck has quit IRC05:30
*** udesale has quit IRC05:31
*** udesale has joined #oooq05:31
*** chandankumar has joined #oooq05:32
*** chandankumar is now known as chkumar|ruck05:32
*** bogdando has joined #oooq05:39
*** quiquell|off is now known as quiquell|rover05:40
quiquell|roverchkumar|ruck: Good morning05:40
quiquell|roverykarel: Did you say something about promoter ?05:40
ykarelquiquell|rover, yes05:40
ykarelpromoter is not promoting05:41
ykarelcan you check05:41
chkumar|ruckquiquell|rover: Good morning :-)05:41
quiquell|roverchkumar|ruck: Are you ruck already ?05:41
chkumar|ruckquiquell|rover: yup, But I am totally noob.05:42
chkumar|ruckquiquell|rover: I am going through noop jobs failures05:42
*** quiquell|rover is now known as quiquell05:42
quiquellchkumar|ruck: Join #tripleo-ci, we have a toy there05:43
quiquellchkumar|ruck: Found the issue05:48
chkumar|ruckquiquell: what was that05:50
quiquellchkumar|ruck: Missing pip dependency after some changes in promoter05:51
chkumar|ruckquiquell: do we gate promoter script also?05:52
quiquellchkumar|ruck: Now it has some unit tests05:52
quiquellchkumar|ruck: I have to change the test to use the requirements.txt from promoter05:52
chkumar|ruckquiquell: sshnaidm|rover noop job failures https://review.rdoproject.org/etherpad/p/chkumar-ruck-rover-sprint16-notes05:52
quiquellchkumar|ruck: It was using its own05:52
chkumar|ruckon master05:52
quiquellchkumar|ruck: Take a look at this document05:53
quiquellchkumar|ruck: https://docs.google.com/document/d/1lTTOW-UDXTvxkJofGKS7wptmwifh2TVRdmMEcHuKl-o/edit#heading=h.6mividzgm94r05:53
quiquellchkumar|ruck: Not everything is true, and with the RR dashboard it's a less manual process05:54
chkumar|ruckquiquell: sure, going through that05:54
*** skramaja_ has joined #oooq06:00
*** skramaja has quit IRC06:02
*** ykarel_ has joined #oooq06:07
*** ykarel has quit IRC06:10
quiquellchkumar|ruck: https://review.rdoproject.org/r/#/c/14594 and https://review.rdoproject.org/r/#/c/1457406:13
quiquellTo gate promoter06:13
*** jfrancoa has joined #oooq06:15
quiquellchkumar|ruck: Unit tests are passing https://review.rdoproject.org/r/#/c/1457406:18
quiquellWe are good for gating, let's get this merged06:18
*** quiquell is now known as quiquell|bbl06:21
*** yolanda__ has joined #oooq06:28
*** ykarel__ has joined #oooq06:32
*** ykarel_ has quit IRC06:35
*** agopi has quit IRC06:39
*** amoralej|off is now known as amoralej06:44
chkumar|rucksshnaidm|rover: https://thirdparty.logs.rdoproject.org/jenkins-periodic-master-rdo_trunk-featureset020-1ctlr_1comp_64gb-87/undercloud/home/stack/install-undercloud.log.txt.gz and https://thirdparty.logs.rdoproject.org/jenkins-tripleo-quickstart-master-rdo_trunk-baremetal-hp_dl360_envD-single_nic_vlans-103/undercloud/home/stack/install-undercloud.log.txt.gz06:50
chkumar|ruckboth are failing on master promotion with same error06:50
chkumar|ruckssbarnea: I checked the respective heat logs but did not find any traceback06:50
chkumar|rucksshnaidm|rover: ^^06:51
chkumar|ruckssbarnea: sorry06:51
chkumar|rucksshnaidm|rover: it is reporting NO MORE HOSTS LEFT06:51
*** ykarel__ is now known as ykarel06:52
hubbotFAILING CHECK JOBS on stable/ocata: legacy-tripleo-ci-centos-7-ovb-1ctlr_1comp_1ceph-featureset024-ocata @ https://review.openstack.org/564291, master: tripleo-ci-centos-7-scenario001-multinode-oooq-container, legacy-tripleo-ci-centos-7-container-to-container-upgrades-master @ https://review.openstack.org/560445, stable/queens: tripleo-ci-centos-7-scenario000-multinode-oooq-container-upgrades, tripleo-ci- (1 more message)06:57
*** quiquell|bbl is now known as quiquell06:58
ykarelchkumar|ruck, sshnaidm|rover happening after this if ntp_servers is wrong or not reachable:- https://review.openstack.org/#/c/57688807:00
quiquellykarel, chkumar|ruck, sshnaidm|rover: I think the fix for https://bugs.launchpad.net/tripleo/+bug/177964207:07
openstackLaunchpad bug 1779642 in tripleo "http://mirror.centos.org/centos/7/virt/x86_64/kvm-common/repodata/b58d947530c09c958e0d78c2c9029ef1fbb4d341c68ce99e3b0133561ca70421-primary.sqlite.bz2: [Errno 14] HTTP Error 404 - Not Found"" [High,In progress] - Assigned to Quique Llorente (quiquell)07:07
quiquellReduces the time for jobs07:08
quiquellThe fix https://review.openstack.org/#/c/579803/07:08
ykarelquiquell, Good, how much improvement u see07:09
*** udesale_ has joined #oooq07:09
quiquellykarel: 10 minutes or so I think07:09
quiquellykarel: I am comparing07:09
quiquellykarel: This is the testing patch https://review.openstack.org/57981507:10
quiquellykarel: But maybe it was just lucky07:10
*** udesale has quit IRC07:10
ykarelquiquell, Good07:10
quiquellykarel: Let's merge it07:11
*** udesale__ has joined #oooq07:11
quiquellmarios, sshnaidm|rover: +1w https://review.openstack.org/#/c/579803/07:11
ykarelquiquell, yes +107:11
quiquellTo possibly reduce job times07:11
*** ykarel is now known as ykarel|mtg07:11
*** ykarel_ has joined #oooq07:13
*** udesale_ has quit IRC07:14
*** florianf has joined #oooq07:16
*** ykarel|mtg has quit IRC07:16
quiquellchkumar|ruck, sshnaidm|rover: Going to open a ticket for the DELETE_COMPLETE stack at the RDO nodepool07:19
*** yolanda__ is now known as yolanda07:21
mariosquiquell: o/ looking07:24
quiquellmarios: Testing patch https://review.openstack.org/57981507:24
quiquellComparing with the noop change, feels like reducing job times https://review.openstack.org/#/c/560445/07:26
quiquellFor the ones that do the overcloud_prep_containers (but maybe it's just lucky)07:27
mariosquiquell: nice, well lgtm but i've not seen this dockerfile.j2 before just now :) to be honest07:27
*** tesseract has joined #oooq07:28
mariosquiquell: ah i see this tripleo-modify-image is pretty new thing https://review.openstack.org/#/c/570444/ ?07:28
quiquellmarios: Yep, only stevebaker and EmilienM can +3 it07:29
quiquellmarios: That's interesting, around 6 weeks ago we started to feel the timeout pain07:30
quiquellmarios: Just speculation07:30
mariosquiquell: ;)07:30
*** ykarel__ has joined #oooq07:34
*** ykarel_ has quit IRC07:37
*** tosky has joined #oooq07:37
*** gkadam__ has joined #oooq07:59
*** pliu_ has joined #oooq08:07
*** holser_ has joined #oooq08:08
*** kopecmartin has joined #oooq08:24
*** ykarel__ is now known as ykarel08:27
*** pliu_ has quit IRC08:30
*** pliu_ has joined #oooq08:33
*** panda|off is now known as panda08:39
*** Goneri has joined #oooq08:43
quiquellpanda: Good morning08:44
pandaquiquell: top o the morning to ye08:45
*** holser_ has quit IRC08:46
*** jaganathan has quit IRC08:48
quiquellpanda: Gating for promoter unit test https://review.rdoproject.org/r/#/c/1457408:48
pandalucasagomes: morning, scenario007 failing even after the rebase http://logs.openstack.org/53/579653/3/check/tripleo-ci-centos-7-scenario007-multinode-oooq-container/90c84b2/job-output.txt.gz08:48
pandalucasagomes: if you have time to look08:49
pandaquiquell: so all the tox configuration patches merged ?08:50
pandaquiquell: why are you testing also python3 ?08:51
quiquellpanda: kind of for free, now we know that promoter also works for python308:51
quiquellpanda: So we will be able to switch08:51
quiquellpanda: The tox config is in the gates now08:51
*** holser_ has joined #oooq08:54
pandaquiquell: one thing caught my attention in the previous patch, you use the new configparser even for python208:54
pandaquiquell: do you need to install a separate package for that ?08:55
quiquellpanda: Yep, for python3 compatibility08:57
quiquellpanda: ConfigParser doesn't work, we have to use configparser08:57
hubbotFAILING CHECK JOBS on stable/ocata: legacy-tripleo-ci-centos-7-ovb-1ctlr_1comp_1ceph-featureset024-ocata @ https://review.openstack.org/564291, master: tripleo-ci-centos-7-scenario001-multinode-oooq-container, legacy-tripleo-ci-centos-7-container-to-container-upgrades-master @ https://review.openstack.org/560445, stable/queens: tripleo-ci-centos-7-scenario000-multinode-oooq-container-upgrades, tripleo-ci- (1 more message)08:57
pandaquiquell: where do you ensure that we have it installed ?08:59
quiquellrequirements.txt09:00
quiquellpanda: https://review.rdoproject.org/r/#/c/14594/2/ci-scripts/dlrnapi_promoter/requirements.txt09:00
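A minimal sketch of the import discussed above, assuming the lowercase `configparser` module name is used on both interpreters. On Python 3 it is part of the standard library; on Python 2 the name only resolves if the PyPI backport is installed (which is why it would be listed in requirements.txt). The INI content, section and option names are hypothetical and are not taken from the promoter's real config files.

```python
# stdlib on Python 3; on Python 2 this name needs the PyPI "configparser" backport
import configparser

# Hypothetical INI content, for illustration only
SAMPLE_INI = u"""
[main]
release = queens
"""


def read_option(ini_text, section, option):
    """Parse INI text and return a single option value as a string."""
    parser = configparser.ConfigParser()
    parser.read_string(ini_text)
    return parser.get(section, option)


if __name__ == "__main__":
    print(read_option(SAMPLE_INI, "main", "release"))
```

Without the backport, Python 2 would fail this import with `ImportError: No module named configparser`, which is the symptom mentioned later in this log.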
lucasagomespanda, oh right on! I will take a look at it soon-ish09:00
*** matbu has quit IRC09:01
pandalucasagomes: thanks!09:01
pandaquiquell: A depends-on would have helped :)09:02
quiquellpanda: Is the same repo, we can't do a Depends-On, have to use parenting09:02
quiquellpanda: I want to separate the tox jobs and the fix09:03
*** jaganathan has joined #oooq09:04
pandaquiquell: good, my local tests pass09:06
quiquellpanda: nice !!!09:06
*** matbu has joined #oooq09:07
pandamy super secret locally hidden shadowed private dark tests too09:10
quiquellpanda: You mean new unit tests ?09:10
sshnaidm|roverquiquell, I wouldn't worry much about delete complete, it doesn't create any problems and doesn't take resources09:11
pandaquiquell: can't tell you, it's a secret09:11
quiquellsshnaidm|rover: They appear as old stacks09:13
quiquellsshnaidm|rover: In the dashboard09:13
quiquellsshnaidm|rover: Would be nicer if rdocloud.py just push the current stacks and we count and filter at grafana09:16
quiquellsshnaidm|rover: What do you think ?09:16
sshnaidm|roverquiquell, hmm.. not sure grafana could do it same way as a script09:18
quiquellsshnaidm|rover: if we set states and timestamps it can09:18
sshnaidm|roverquiquell, how do you filter by timestamp?09:18
quiquellsshnaidm|rover: Also this way we can explore the problems from grafana09:18
quiquellsshnaidm|rover: where time [>] ....09:19
quiquellsshnaidm|rover: We can even add a timestamp field for it09:19
sshnaidm|roverquiquell, I mean timestamp of stack09:19
quiquellsshnaidm|rover: If we use the creation time as influxdb timestamp09:19
quiquellthe field "time" is the influxdb line timestamp09:20
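A rough sketch of the idea above: emit one InfluxDB line-protocol record per stack and use the stack's creation time as the record timestamp, so that old stacks can be filtered on the `time` column. The measurement name `stacks`, the tag and field layout, and the function name are assumptions made for illustration; this is not the code in rdocloud.py.

```python
from datetime import datetime, timezone


def stack_to_line(name, status, created_at):
    """Build one line-protocol record for a stack.

    created_at is an ISO 8601 UTC string such as '2018-07-04T09:19:00Z'.
    The stack creation time becomes the record timestamp (nanoseconds).
    Note: names containing quotes or spaces would need escaping, omitted here.
    """
    created = datetime.strptime(created_at, "%Y-%m-%dT%H:%M:%SZ")
    created = created.replace(tzinfo=timezone.utc)
    ts_ns = int(created.timestamp()) * 10**9
    # Hypothetical schema: measurement "stacks", tag "status", string field "name"
    return 'stacks,status={} name="{}" {}'.format(status, name, ts_ns)


if __name__ == "__main__":
    print(stack_to_line("example-stack", "DELETE_COMPLETE", "2018-07-04T09:19:00Z"))
```

With records written this way, a dashboard query could in principle select old stacks with an InfluxQL condition along the lines of `WHERE time < now() - 5h`; the 5h threshold is only an example.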
chkumar|ruckquiquell: what about using six instead of configparser dependency/09:20
chkumar|ruck?09:20
quiquellchkumar|ruck: Don't know about six, but promoter is very delicate, in case we want to use it09:20
quiquellchkumar|ruck: Now we can add some unit tests to test it before merge09:21
chkumar|ruckquiquell: https://pythonhosted.org/six/. let me put a patch up for configparser09:21
quiquellchkumar|ruck: Add unit tests to it, if it's not unit tested it's not implemented09:22
chkumar|ruckquiquell: sure09:22
pandachkumar|ruck: what's the advantage of six over using backported configparser ?09:22
sshnaidm|roverquiquell, ok, will do second script, so we can test its results in grafana, although not sure why we need it, scripts are more powerful09:22
chkumar|ruckpanda: with six, we can write py2 and py3 compatible code09:22
chkumar|ruckwithout much modification09:23
chkumar|ruckhttps://pythonhosted.org/six/09:23
quiquellsshnaidm|rover: We open the door to explore the stacks09:23
quiquellsshnaidm|rover: btw, need a +3 at correct release file for periodics https://review.openstack.org/#/c/578793/09:23
pandachkumar|ruck: configparser is already backported and compatible with both. I see six as a general framework, not for this specific case09:23
pandachkumar|ruck: I know, it's used all over openstack09:24
quiquellsshnaidm|rover: Already tested with reproducer and dry run and it's passing the correct one09:24
sshnaidm|roverquiquell, ack09:24
sshnaidm|roverquiquell, in stacks I don't see much to explore, tbh09:24
quiquellsshnaidm|rover: We can see the stuck ones, without accessing the tenant09:25
quiquellsshnaidm|rover: Good to open RDO issues and show09:25
sshnaidm|roverquiquell, yeah, we can see them now too09:26
quiquellsshnaidm|rover: where ?09:27
sshnaidm|roverquiquell, in grafana09:27
quiquellsshnaidm|rover: We get the counts but not the names of it09:27
sshnaidm|roverquiquell, seeing one individual stack won't help much and will overload db09:27
sshnaidm|roverquiquell, I don't think we need so detailed..09:27
quiquellsshnaidm|rover: ok, let's just count, if we need the info we can go back and add it09:28
quiquellsshnaidm|rover: btw, can we remove the STACK_DELETED from the old_stacks ?09:28
sshnaidm|roverquiquell, only by special treatment, but I'd like not to do it, it's a problem, we don't need to forget about it09:30
sshnaidm|roverquiquell, let's increase alarm to >109:30
sshnaidm|roverquiquell, it'll be still on graph, but won't alarm09:30
quiquellsshnaidm|rover: Ok, I have also increased the time window to 30m09:31
quiquellsshnaidm|rover: To remove the fail NoData alerts09:31
sshnaidm|roverok09:31
quiquellsshnaidm|rover: btw, we have to start to think on moving it elsewhere09:33
sshnaidm|roverquiquell, yeah, will try again to talk on #rdo..09:34
quiquellsshnaidm|rover: Maybe for now moving it to tripleo-ci tenant09:35
sshnaidm|roverif not, maybe we can host on openshift, trown|outtypewww will help :)09:35
quiquellsshnaidm|rover: Then using tripleo-ci influxdb09:35
quiquellsshnaidm|rover: And later use rdo grafana with grafyaml (I have doubts about grafyaml)09:35
sshnaidm|roverquiquell, we don't have tripleo-ci influxdb09:35
sshnaidm|roverquiquell, I mean it's in rdo-infra, not in our tenant09:36
quiquellsshnaidm|rover: haha No problem on start playing with openshift09:36
quiquellsshnaidm|rover: But we can access to it from our tenant09:36
quiquellsshnaidm|rover: Moving things step by step09:36
sshnaidm|roverquiquell, agree09:36
quiquellsshnaidm|rover: The easier is influxdb09:36
quiquellsshnaidm|rover: Just a mather of changing telegraf and datasouces at grafana09:36
quiquells/mather/matter/09:36
quiquellpanda: About sprint16, do we have to convert emit_releases_file into a ansible module ?09:45
pandaquiquell: yes, but I'm not sure we can do it in the next sprint09:47
quiquellpanda: isn't sprint16 about converting toci to ansible ?09:48
quiquellpanda: To integrate with zuulv3 '09:48
pandaquiquell: sprint16 is about bringing tripleo CI a step closer to how zuulv3 support09:51
pandas/how/what09:52
pandaquiquell: there's a lot to do to make this possible, and we need to choose a sustainable path09:52
pandaquiquell: so, I can see in the end the python script will become a module, but if we want to do this as next step, I'm not sure.09:54
chkumar|rucksshnaidm|rover: http://38.145.34.55/ if we go to promotion url, for each release there is a <release>.log but for queens it is not there09:55
chkumar|rucksshnaidm|rover: http://38.145.34.55/queens.log09:55
quiquellpanda: So first we do like the low hanging fruit stuff ?09:56
pandaquiquell: first we define a path09:57
pandaquiquell: want to chat ?09:58
*** jfrancoa has quit IRC09:58
quiquellpanda: Think so, or maybe understand where we are now from the previous sprint09:58
quiquellpanda: Like a converted job and similar09:58
pandaquiquell: ok ok, sure sure09:59
pandaquiquell: in my channel in my channel09:59
*** holser_ has quit IRC09:59
sshnaidm|roverchkumar|ruck, hmm... looking10:00
*** jfrancoa has joined #oooq10:01
*** jaganathan has quit IRC10:02
*** holser_ has joined #oooq10:05
sshnaidm|roverquiquell, do we still run promoter in tmux session?10:07
quiquellsshnaidm|rover: Nope, it's systemd now10:08
quiquellsudo service dlrn-promoter start/stop ..10:08
sshnaidm|roverquiquell, was there change in ci-scripts repo?10:08
quiquellsshnaidm|rover: just the requirements.txt10:09
quiquelland refresh10:10
quiquellfrom repo with the unit test and configparser10:10
sshnaidm|roverquiquell, why do we run telegraf on promoter..?10:13
quiquellsshnaidm|rover: To get the alarms in case promoter doesn't run10:13
quiquellsshnaidm|rover: http://38.145.34.131:3000/d/pgdr_WVmk/cockpit?orgId=1&panelId=130&fullscreen&from=1530526455445&to=1530699255445&var-launchpad_tags=alert&var-promotion_names=current-tripleo&var-promotion_names=current-tripleo-rdo&var-promotion_names=current-tripleo-rdo-testing&var-releases=master&var-releases=queens&var-releases=pike&var-releases=ocata10:14
sshnaidm|roverquiquell, can you please fix your commit? https://github.com/rdo-infra/ci-config/commit/677be5713b70112bc97dcb8cb9948cd8cda66c5a10:15
sshnaidm|roverquiquell, "service" should be with colon10:15
*** jaganathan has joined #oooq10:15
*** holser_ has quit IRC10:15
sshnaidm|roverquiquell, we need some ansible lint job there to cover it..10:15
chkumar|rucksshnaidm|rover: quiquell https://review.rdoproject.org/r/#/c/14598/10:17
*** holser_ has joined #oooq10:17
sshnaidm|roverchkumar|ruck, did you talk about ImportError: No module named configparser errors?10:22
quiquellsshnaidm|rover, chkumar|ruck: need a few minutes10:22
chkumar|rucksshnaidm|rover: in the morning ykarel noticed promoter script was not working, quiquell found that it was an import error issue10:23
chkumar|rucksshnaidm|rover: https://review.rdoproject.org/r/#/c/14594/10:23
sshnaidm|roverchkumar|ruck, yeah, also there is different issue with queens10:23
sshnaidm|roverquiquell, did you modify manually ci-scripts/container-push/container-push.yml and ci-scripts/dlrnapi_promoter/requirements.txt on promoter?10:24
quiquellrequirements.txt yes, container-push.yml not10:25
quiquellDon't know who changed container-push.yml10:25
quiquellsshnaidm|rover: the requirements.txt is already merged in master10:27
quiquellyou can checkout that from master10:27
sshnaidm|roverquiquell, please, try to avoid in any way to change files manually10:28
quiquellsshnaidm|rover: Just did with requirements.txt to have queens promotion10:28
sshnaidm|rovernow we don't know who change container file and why10:28
chkumar|rucksshnaidm|rover: quiquell in last 2 months https://github.com/rdo-infra/ci-config/commit/f801e5c6d026944cd295cde9f41d9ae8bf7eec3810:28
quiquellsshnaidm|rover: no idea who did, asked Wes and he didn't know either10:28
chkumar|ruckhttps://github.com/rdo-infra/ci-config/commit/99c64836ea50c1f3c3aad8e96d7fba3978fa1cca10:29
sshnaidm|roverquiquell, in this case you'd better to create a patch in ci-config repo that updates it, we have ansible running every 5 mins there10:29
sshnaidm|roverif we do infrastructure-as-code, but still manually change files, we did nothing10:29
quiquellsshnaidm|rover: agree, will do10:29
sshnaidm|roverquiquell, who was ruck/rover before you?10:30
quiquellrlandy  and arx I think10:30
*** udesale__ has quit IRC10:34
sshnaidm|roverwell, last change was in 2018-07-01 22:31:47..10:34
sshnaidm|roverok, seems like I just checkout it from repo..10:34
sshnaidm|roverpanda, weshay did you make some changes to ci-scripts/container-push/container-push.yml on promoter server recently?10:35
chkumar|rucksshnaidm|rover: what to do with the gate jobs failing with timed out?10:39
sshnaidm|roverchkumar|ruck, well, not much to do right now10:39
chkumar|rucksshnaidm|rover: ack!10:39
sshnaidm|roverchkumar|ruck, I'm working on ara for them, it might help to discover longest parts of deployments10:40
chkumar|ruckarxcruz: hey10:40
sshnaidm|roverchkumar|ruck, so just let's keep an eye and maybe will discover some pattern10:40
chkumar|rucksshnaidm|rover: sure10:40
arxcruzchkumar|ruck: hello10:40
chkumar|ruckarxcruz: you were working on a FS where we will run tempest tests without skip list10:40
chkumar|ruckna10:40
arxcruzchkumar|ruck: yes10:41
arxcruzi think was 5510:41
arxcruzneed to check10:41
chkumar|ruckarxcruz: it is the refstack one10:41
arxcruzor 4810:41
arxcruzchkumar|ruck: just a sec, let me check10:41
chkumar|ruckarxcruz: checking10:41
arxcruzchkumar|ruck: yup, fs4810:43
*** jaganathan has quit IRC10:44
*** quiquell is now known as quiquell|mtg10:50
*** sanjayu__ has quit IRC10:53
arxcruzsshnaidm|rover: is sova data being updated?10:56
sshnaidm|roverarxcruz, not for rdo cloud10:56
arxcruzi see10:56
hubbotFAILING CHECK JOBS on stable/ocata: legacy-tripleo-ci-centos-7-ovb-1ctlr_1comp_1ceph-featureset024-ocata @ https://review.openstack.org/564291, master: tripleo-ci-centos-7-scenario001-multinode-oooq-container @ https://review.openstack.org/560445, stable/queens: tripleo-ci-centos-7-scenario000-multinode-oooq-container-upgrades, tripleo-ci-centos-7-scenario001-multinode-oooq-container @  (1 more message)10:57
*** dtantsur|afk is now known as dtantsur11:10
*** quiquell|mtg is now known as quiquell11:35
pandalucasagomes: any pointers on the failing scenario ? I see that we cant ping the floating ip of the test instance11:36
chkumar|rucksshnaidm|rover: something new came up http://logs.openstack.org/91/564291/15/check/tripleo-ci-centos-7-scenario003-multinode-oooq/f44bc41/logs/undercloud/home/zuul/overcloud_deploy.log.txt.gz#_2018-07-04_09_39_4011:38
chkumar|ruckon stable/ocata noop jobs11:38
*** udesale__ has joined #oooq11:40
*** udesale has joined #oooq11:46
*** udesale__ has quit IRC11:47
*** ratailor has quit IRC11:51
*** quiquell is now known as quiquell|lunch12:03
lucasagomespanda, yeah, I'm wondering if the problem is the same as we saw here: https://mail.openvswitch.org/pipermail/ovs-dev/2018-June/348812.html12:04
lucasagomespanda, I've tested the fix proposed in OVS but it's not merged yet, I wonder if that's the same problem. We don't have an env to troubleshoot do we ?12:04
lucasagomes(the link to the patch with the fix is here https://mail.openvswitch.org/pipermail/ovs-dev/2018-June/348818.html)12:05
pandalucasagomes: we were able to reproduce the problem with the reproducer script12:05
chkumar|ruck%gatestatus12:06
hubbotFAILING CHECK JOBS on stable/ocata: legacy-tripleo-ci-centos-7-ovb-1ctlr_1comp_1ceph-featureset024-ocata @ https://review.openstack.org/564291, master: tripleo-ci-centos-7-3nodes-multinode, tripleo-ci-centos-7-scenario001-multinode-oooq-container, tripleo-ci-centos-7-scenario002-multinode-oooq-container @ https://review.openstack.org/560445, stable/queens: tripleo-ci-centos-7-scenario000-multinode-oooq- (1 more message)12:06
quiquell|lunchsshnaidm|rover: Going to move rdo scripts of RR dashboard to zuulv312:06
*** amoralej is now known as amoralej|lunch12:06
quiquell|lunchall jobs are there now12:06
sshnaidm|roverquiquell|lunch, ack12:06
sshnaidm|roverquiquell|lunch, also promotions?12:06
quiquell|lunchsshnaidm|rover: Sorry about the promoter btw :-/12:07
sshnaidm|roverquiquell|lunch, no worries12:07
quiquell|lunchsshnaidm|rover: periodics are there too I think12:07
quiquell|lunchGoing to lunch now12:07
sshnaidm|roverquiquell|lunch, I don't see promotion jobs there..12:07
lucasagomespanda, right yeah I can try that too. Cause I can't find anything specific in the logs12:08
sshnaidm|roverquiquell|lunch, there is not even an openstack-periodic pipeline, I think Paul didn't move them yet12:08
lucasagomespanda, talking about the logs would be nice as well if we could collect the ovsdb's after the tests runs12:09
pandalucasagomes: are you seeing this in any other place ?12:09
lucasagomespanda, we saw something similar in one of the tempest recently12:09
lucasagomeswhich is what daniel pointed out in that email12:09
lucasagomesin the same thread someone pointed to a patch that addressed that problem (I tested and it worked)12:10
pandalucasagomes: http://logs.openstack.org/53/579653/3/check/tripleo-ci-centos-7-scenario007-multinode-oooq-container/90c84b2/logs/undercloud/var/log/extra/network-bridges.gz12:10
pandalucasagomes: I think we are dumping something here, not sure if we are dumping ovsdb12:10
lucasagomespanda, no it's not there, in networking-ovn we collect the dbs here: http://logs.openstack.org/43/574743/3/check/networking-ovn-tempest-dsvm-ovs-release/998fb10/logs/ovs_dbs/12:11
lucasagomesbut it's just a enhancement12:11
lucasagomesit has nothing to do with the failure12:12
pandalucasagomes: ok, I'll try to reproduce this somewhere that can be reached by you too12:22
pandalucasagomes: doesn't seem easy to test this in a TripleO run12:23
pandathe patch is not even in gerrit :(12:23
*** saneax has joined #oooq12:24
sshnaidm|roverquiquell|lunch, as I understood we stopped to use venv for promoter script? https://github.com/rdo-infra/ci-config/blob/b319c0b98c166ee799f598b93ca439f80a770526/ci-scripts/infra-setup/roles/promoter/tasks/main.yml#L159-L16212:26
pandalucasagomes: can you comment here https://bugs.launchpad.net/tripleo/+bug/1780082 what commands are used to correctly gather ovsdb dump for OVN debugging ?12:28
openstackLaunchpad bug 1780082 in tripleo "quickstart log collection does not collect OVN ovsdb dump" [Medium,Triaged]12:28
*** saneax has quit IRC12:28
*** saneax has joined #oooq12:28
chkumar|rucksshnaidm|rover: weshay can we promote pike by disabling fs20 because of this fix https://review.openstack.org/#/c/579426/ ?12:30
chkumar|ruckwe know this review fixes the tests12:30
ykareland upstream queue is in bad shape currently, so that merge would take time12:31
sshnaidm|roverchkumar|ruck, so if it fixes, we'll see it in next run, right?12:31
chkumar|rucksshnaidm|rover: yup12:31
chkumar|rucksshnaidm|rover: but the patch is blocked on gates due to timed out12:32
sshnaidm|roverchkumar|ruck, well, anyway next run is tonight for pike, so we'll see later, if it's still not merged, maybe we'll do it12:33
sshnaidm|roverchkumar|ruck, does it solve problem for queens too? I see it's blocked by 020 job12:40
chkumar|rucksshnaidm|rover: yup12:41
chkumar|rucksshnaidm|rover: https://review.rdoproject.org/r/#/c/1452112:41
sshnaidm|roverchkumar|ruck, ok, so I'll make a patch to unblock both branches..12:41
chkumar|rucksshnaidm|rover: https://review.rdoproject.org/r/1460712:41
chkumar|ruckfor pike12:41
sshnaidm|roverykarel, chkumar|ruck hmm... but why queens promotion does still want 020 to pass? from tripleo-ci-testing to current-tripleo, missing successful jobs: [u'periodic-ovb-1ctlr_1comp-featureset020']12:43
sshnaidm|roverhttp://38.145.34.55/queens.log12:43
ykareli think we removed it from criteria12:44
ykarelchecking, does promoter have an old copy?12:44
ykarelsshnaidm|rover, that's also duplicated :(12:45
ykarelhttps://github.com/rdo-infra/ci-config/blob/master/ci-scripts/dlrnapi_promoter/config/queens.ini#L1812:45
sshnaidm|roverykarel, oh, I see12:46
mariossshnaidm|rover: updated when you have a chance thanks for review https://review.openstack.org/#/c/578081/9/roles/create-reproducer-script/templates/reproducer-quickstart.sh.j212:46
arxcruzweshay: chkumar|ruck kopecmartin I'm assuming there will not have the sprint plan today right ?12:50
pandaarxcruz: you're missing the PM. your UA is rucking for the first time, your TC is on PTO and the rest of the team will be for the remainder of the sprint12:51
*** quiquell|lunch is now known as quiquell12:54
arxcruzpanda: okay, just wanted to make sure :)12:55
arxcruzmy calendar is a bit crazy with the timezone difference12:55
quiquellsshnaidm|rover: it's using venv12:56
chkumar|rucksshnaidm|rover: ykarel let's promote the pike12:56
ykarelack12:57
sshnaidm|roverquiquell, I don't see it.. where12:57
quiquellsshnaidm|rover: source ~/promoter_venv/bin/activate12:57
chkumar|rucksshnaidm|rover: weshay needs +2 https://review.rdoproject.org/r/1459812:57
quiquellsshnaidm|rover: wait... https://github.com/rdo-infra/ci-config/blob/master/ci-scripts/dlrnapi_promoter/dlrn-promoter.sh#L412:57
hubbotFAILING CHECK JOBS on stable/queens: tripleo-ci-centos-7-scenario000-multinode-oooq-container-upgrades @ https://review.openstack.org/567224, stable/ocata: legacy-tripleo-ci-centos-7-ovb-1ctlr_1comp_1ceph-featureset024-ocata @ https://review.openstack.org/564291, master: tripleo-ci-centos-7-3nodes-multinode, tripleo-ci-centos-7-scenario001-multinode-oooq-container, tripleo-ci-centos-7-scenario002-multinode-oooq- (1 more message)12:57
chkumar|ruckquiquell: I have few more ideas for ruck-rover bot12:58
sshnaidm|roverquiquell, uh, now see it. sorry, need more coffee12:58
chkumar|ruckquiquell: like finding failed jobs and how many times it failed in a day12:58
chkumar|ruckshow me logs for a particular failed jobs12:59
quiquellchkumar|ruck: you have that in the dashboard12:59
sshnaidm|roverchkumar|ruck, it could be done with variables in a graph12:59
sshnaidm|roverquiquell, this bar chart with jobs we need to do with variable "job_name"12:59
quiquellchkumar|ruck: Doing exploration with a IRC bot is not the best way13:00
kopecmartinarxcruz, I have labeled all cards mentioned in the planning etherpad as Sprint 16 Tempest13:00
sshnaidm|roverquiquell, chkumar|ruck then we can see the graph for one (or multiple) job only13:00
quiquellsshnaidm|rover: more than variables, using ad-hoc grafana variable13:00
quiquelllook13:00
arxcruzkopecmartin: ack13:00
quiquellhttp://38.145.34.131:3000/d/2kHMNHvik/exploration?orgId=113:01
kopecmartinarxcruz, we're just missing epic card13:01
chkumar|ruckarxcruz: kopecmartin I am creating the epic card13:01
quiquelllook at the last one13:01
quiquellin the influxdb_filter you can put whatever field you have at your influxdb lines13:01
kopecmartinchkumar|ruck, great then13:01
mariosweshay: panda no call?13:01
sshnaidm|roverquiquell, yeah, but I'm talking about this one: http://38.145.34.131:3000/d/pgdr_WVmk/cockpit?orgId=1&panelId=80&fullscreen&from=1530536511291&to=1530709311291&var-launchpad_tags=alert&var-promotion_names=current-tripleo&var-promotion_names=current-tripleo-rdo&var-promotion_names=current-tripleo-rdo-testing&var-releases=master&var-releases=queens&var-releases=pike&var-releases=ocata13:02
quiquellsshnaidm|rover: We can add the ad-hoc filter to the cockpit too13:02
quiquellIt will apply to all the graphs there13:03
*** amoralej|lunch is now known as amoralej13:03
quiquellsshnaidm|rover: reload the panel you pasted me now13:04
sshnaidm|roverquiquell, mm.. not sure we want to apply to all graphs though.. need to play with it13:04
quiquellsshnaidm|rover: You can now select all the fields you want13:04
quiquellsshnaidm|rover: It's not permanent, just per user13:04
quiquellby default it's not filtered13:04
quiquellad-hoc are very powerful13:04
chkumar|ruckarxcruz: kopecmartin https://trello.com/c/1v1dYRnP/844-closing-python-tempestconf-items-out13:06
sshnaidm|roverquiquell, well, you're right, it's much better13:06
rfolcoJuly 4th today13:06
sshnaidm|roverquiquell, we can see also this job in failed and failed-gates..13:06
quiquellsshnaidm|rover: It's really what you need for exploration13:06
rfolcoI don't think US folks will join13:06
sshnaidm|roverquiquell, cool! great, let's save it!13:06
quiquellsshnaidm|rover: Sure13:06
sshnaidm|roverchkumar|ruck, you should try it ^^13:07
sshnaidm|roverquiquell, maybe also will be useful - to have table with job name and amount of failures, sorted by fails13:08
sshnaidm|roverquiquell, so we can have most problematic jobs..13:08
chkumar|ruckkopecmartin: arxcruz please add the related sprint 16 cards to epic13:08
quiquellsshnaidm|rover: That's why there is an exploration dashboard13:08
sshnaidm|roverquiquell, yep13:08
quiquellsshnaidm|rover: https://review.rdoproject.org/r/1461013:08
chkumar|rucksshnaidm|rover: will i file a bug for ocata failure http://logs.openstack.org/91/564291/15/check/tripleo-ci-centos-7-scenario003-multinode-oooq/f44bc41/logs/undercloud/home/zuul/overcloud_deploy.log.txt.gz#_2018-07-04_09_39_42 ?13:11
sshnaidm|roverchkumar|ruck, the bug is "Timed out waiting for messages ..." line: http://logs.openstack.org/91/564291/15/check/tripleo-ci-centos-7-scenario003-multinode-oooq/f44bc41/logs/undercloud/home/zuul/overcloud_deploy.log.txt.gz#_2018-07-04_09_46_3813:14
sshnaidm|roverquiquell, so, how can we do table of most 10 failing jobs for example?13:16
quiquellLet me check if we can use count function in ad-hoc13:17
quiquellsshnaidm|rover: Trying to do a frequency table13:23
quiquellgive me a min13:23
quiquellsshnaidm|rover: Check exploration dashboard now13:27
quiquellYou have a frequency table13:27
quiquellyou can sort the table and add a filter with passed = False13:27
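The frequency table discussed above (failures per job, sorted by count) can be sketched outside Grafana as well. This is only a minimal illustration: the `results` records and the `failure_frequency` helper are hypothetical, and the field names `job_name` and `passed` are taken from the filters mentioned in the conversation.

```python
from collections import Counter

# Hypothetical sample of job results; in the discussion the real data lives in InfluxDB.
results = [
    {"job_name": "tripleo-ci-centos-7-containers-multinode", "passed": False},
    {"job_name": "tripleo-ci-centos-7-containers-multinode", "passed": False},
    {"job_name": "tripleo-ci-centos-7-containers-multinode", "passed": True},
    {"job_name": "tripleo-ci-centos-7-3nodes-multinode", "passed": False},
    {"job_name": "tripleo-ci-centos-7-3nodes-multinode", "passed": True},
]


def failure_frequency(records, top=10):
    """Count failed runs per job and return the most frequently failing jobs first."""
    counts = Counter(r["job_name"] for r in records if not r["passed"])
    return counts.most_common(top)


for job, failures in failure_frequency(results):
    print(f"{failures:4d}  {job}")
```

Filtering on `passed` before counting is the step that was missing when the table first showed 330 entries for one job: without it, every run is counted, not only the failures.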
sshnaidm|roverquiquell, wow.. is it really 330 failures in last 3 days of containers-multinode??13:32
quiquellsshnaidm|rover: Feels like too much...13:33
quiquellsshnaidm|rover: Going to remove the filter from table so everyone can change it with the global13:34
sshnaidm|roverquiquell, as I see from cockpit graph it should be about 2013:35
sshnaidm|roverquiquell, maybe you need to add "when passed = false" there13:36
quiquellsshnaidm|rover: You can add this to the filter13:36
quiquellthe exploration board is open13:36
quiquellby default it shows you everything13:36
quiquellYou play with the influxdb filter13:37
quiquellsshnaidm|rover: So if I do the following13:37
quiquellhttp://38.145.34.131:3000/d/2kHMNHvik/exploration?orgId=1&var-influxdb_filter=passed%7C%3D%7CFalse&var-influxdb_filter=job_name%7C%3D%7Ctripleo-ci-centos-7-containers-multinode13:37
quiquellI get 3213:37
quiquellWhich feels ok for 7 days13:37
quiquellIt's super powerful this ad-hoc stuff13:38
sshnaidm|roverquiquell, I see.. but maybe it would be fine to have some graphs predefined, just to save time13:39
quiquellsshnaidm|rover: not so much time... what we can do is have queries predefined13:39
sshnaidm|roverquiquell, I think we need to think about a documentation or tutorial :)13:39
quiquellor link13:39
quiquellthe query is in the url, we can have links predefined13:39
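Since the ad-hoc filters travel in the URL, predefined links can be generated rather than typed by hand. The sketch below rebuilds the link pasted earlier in the conversation; the `exploration_link` helper is hypothetical, and the `key|operator|value` encoding of `var-influxdb_filter` is inferred from that pasted URL, not from Grafana documentation.

```python
from urllib.parse import urlencode

# Base URL of the exploration dashboard as pasted in the channel.
BASE = "http://38.145.34.131:3000/d/2kHMNHvik/exploration"


def exploration_link(filters, org_id=1):
    """Build a dashboard link with one var-influxdb_filter parameter per filter.

    Each filter is a (key, operator, value) tuple, joined with '|' as seen
    in the URL shared in the conversation (an assumption about the format).
    """
    params = [("orgId", org_id)]
    for key, op, value in filters:
        params.append(("var-influxdb_filter", f"{key}|{op}|{value}"))
    return BASE + "?" + urlencode(params)


print(exploration_link([
    ("passed", "=", "False"),
    ("job_name", "=", "tripleo-ci-centos-7-containers-multinode"),
]))
```

A small table of such filter tuples would be enough to publish a set of named links without predefining extra graphs.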
quiquellsshnaidm|rover: ack, I think we have to stop adding features, and productify13:40
quiquellsshnaidm|rover: btw, do you know how to make the dashboard appear as selected items in the top left ?13:41
sshnaidm|roverquiquell, yeah, not everybody is aware of all these features13:41
sshnaidm|roverquiquell, not sure I understand13:41
quiquellsshnaidm|rover: me neither, I don't understand myself :-)13:41
quiquellsshnaidm|rover: Changing issues, I have done the ara-report for the undercloud13:42
quiquellsshnaidm|rover: And discovered an issue at ara + ansible.cfg13:42
quiquellsshnaidm|rover: Will comment at your OC ara WIP patch13:42
chkumar|rucksshnaidm|rover: first bug filed https://bugs.launchpad.net/tripleo/+bug/178009113:43
openstackLaunchpad bug 1780091 in tripleo "containerized undercloud deployment failed on periodic jobs" [Undecided,New]13:43
quiquellchkumar|ruck: \o/ !!!13:43
sshnaidm|roverquiquell, currently I'm working on getting all ara undercloud to influxdb13:43
sshnaidm|roverquiquell, but ara with OC doesn't work well :(13:43
quiquellsshnaidm|rover: that's super cool, one influxdb line per task ?13:44
sshnaidm|roverquiquell, no, I think all tasks with their lengths in one line, we need to tie them to the job13:44
sshnaidm|roverquiquell, I think to take 15 longest of kind of13:44
quiquellsshnaidm|rover: Humm sure we don't want to do that at filtering ?13:45
sshnaidm|roverquiquell, and then we'll need to do graph for each of them.. *phew13:45
quiquellsshnaidm|rover: line per task is too much ?13:45
sshnaidm|roverquiquell, well, one job has a few tasks with lengths, why to separate them13:45
quiquellsshnaidm|rover: Depending on the playbook or position they can take longer13:46
quiquellsshnaidm|rover: we can count the frequency of them too13:46
sshnaidm|roverquiquell, frequency?13:46
quiquellif we put really raw data into influxdb, we will be able to count whatever we want13:47
quiquellsshnaidm|rover: The more we groom the data the lest we can do later on13:47
quiquells/lest/less/13:47
sshnaidm|roverquiquell, it will be raw data - job with tasks13:47
quiquellsshnaidm|rover: ack13:47
sshnaidm|roverquiquell, we need to identify in which jobs same tasks accidentally take longer time13:47
quiquellsshnaidm|rover: just one influxdbline per task can do that too and more stuff13:48
sshnaidm|roverquiquell, well, let me finish the tasks extraction part and then we'll see how to parse it in influx already..13:48
quiquellsshnaidm|rover: Yep13:48
quiquellsshnaidm|rover: tuple for the line can be job,playbook,task duration13:49
quiquellFrom there we can do all the counts we want13:49
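The "one influxdb line per task" idea can be sketched with the InfluxDB line protocol, where tags identify the job, playbook and task and the duration is the field. This is a sketch under assumptions: the measurement name `task`, the tag and field names, and the `task_line` helper are all hypothetical and not taken from the actual patch.

```python
def escape_tag(value):
    """Backslash-escape commas, equals signs and spaces, as line protocol requires in tag values."""
    for char in (",", "=", " "):
        value = value.replace(char, "\\" + char)
    return value


def task_line(job, playbook, task, duration, timestamp_ns, measurement="task"):
    """Format one (job, playbook, task, duration) tuple as a single line-protocol entry."""
    tags = ",".join(
        f"{key}={escape_tag(val)}"
        for key, val in (("job", job), ("playbook", playbook), ("task", task))
    )
    # duration is stored as a float field; the timestamp is in nanoseconds
    return f"{measurement},{tags} duration={float(duration)} {timestamp_ns}"


print(task_line(
    "tripleo-ci-centos-7-containers-multinode",  # job name seen in the channel
    "deploy.yml",                                # hypothetical playbook name
    "Install packages",                          # hypothetical task name
    12.5,
    1530709311291000000,
))
```

Keeping job, playbook and task as tags is what allows grouping and counting by any of them later, which is the argument made above for storing raw per-task data.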
quiquellHave to leave earlier today13:49
*** quiquell is now known as quiquell|off13:49
*** agopi has joined #oooq13:50
sshnaidm|roverquiquell|off, sure13:50
*** ykarel is now known as ykarel|away13:55
*** skramaja_ has quit IRC13:58
*** ykarel|away has quit IRC14:01
*** sshnaidm|rover is now known as sshnaidm|afk14:11
*** ykarel|away has joined #oooq14:31
*** ykarel|away is now known as ykarel14:32
*** bogdando has quit IRC14:40
chkumar|rucksshnaidm|afk: now scenario 001 and 002 are timing out on tempest and for 003 still on overcloud deploy14:51
chkumar|rucksshnaidm|afk: on rdocloud node status rdocloud server delete Error state is 6, do we need to do something there?14:57
hubbotFAILING CHECK JOBS on stable/ocata: legacy-tripleo-ci-centos-7-ovb-1ctlr_1comp_1ceph-featureset024-ocata @ https://review.openstack.org/564291, master: tripleo-ci-centos-7-3nodes-multinode, tripleo-ci-centos-7-scenario001-multinode-oooq-container, tripleo-ci-centos-7-scenario002-multinode-oooq-container @ https://review.openstack.org/560445, stable/queens: tripleo-ci-centos-7-scenario000-multinode-oooq- (1 more message)14:58
*** pliu_ has quit IRC15:04
ykarelchkumar|ruck, pike promoted, and queens promotion started15:15
chkumar|ruckweshay: sshnaidm|afk ^^ sweet15:16
*** sshnaidm|afk is now known as sshnaidm|rover15:38
*** udesale_ has joined #oooq15:40
*** udesale has quit IRC15:43
*** sanjay__u has quit IRC15:51
*** yolanda has quit IRC15:54
*** ykarel has quit IRC16:08
*** gkadam__ has quit IRC16:16
amoralejcould i get reviews on https://review.openstack.org/#/c/579888/ ?16:22
amoraleji need it for pike job in rdoinfo gate16:22
*** tesseract has quit IRC16:30
*** ykarel has joined #oooq16:32
*** florianf has quit IRC16:38
*** ykarel has quit IRC16:39
*** udesale_ has quit IRC16:40
*** kopecmartin has quit IRC16:48
*** dtantsur is now known as dtantsur|afk16:56
hubbotFAILING CHECK JOBS on stable/ocata: legacy-tripleo-ci-centos-7-ovb-1ctlr_1comp_1ceph-featureset024-ocata @ https://review.openstack.org/564291, master: tripleo-ci-centos-7-3nodes-multinode, tripleo-ci-centos-7-scenario001-multinode-oooq-container, tripleo-ci-centos-7-scenario002-multinode-oooq-container @ https://review.openstack.org/560445, stable/queens: tripleo-ci-centos-7-scenario000-multinode-oooq- (1 more message)16:58
*** amoralej is now known as amoralej|off17:11
chkumar|ruck%gatestatus17:17
hubbotFAILING CHECK JOBS on stable/ocata: legacy-tripleo-ci-centos-7-ovb-1ctlr_1comp_1ceph-featureset024-ocata @ https://review.openstack.org/564291, master: tripleo-ci-centos-7-3nodes-multinode, tripleo-ci-centos-7-scenario001-multinode-oooq-container, tripleo-ci-centos-7-scenario002-multinode-oooq-container @ https://review.openstack.org/560445, stable/queens: tripleo-ci-centos-7-scenario000-multinode-oooq- (1 more message)17:17
*** agopi has quit IRC18:34
*** jfrancoa has quit IRC18:39
*** agopi has joined #oooq18:53
hubbotFAILING CHECK JOBS on stable/ocata: legacy-tripleo-ci-centos-7-ovb-1ctlr_1comp_1ceph-featureset024-ocata @ https://review.openstack.org/564291, master: tripleo-ci-centos-7-scenario002-multinode-oooq-container, tripleo-ci-centos-7-3nodes-multinode, tripleo-ci-centos-7-scenario001-multinode-oooq-container @ https://review.openstack.org/560445, stable/queens: tripleo-ci-centos-7-scenario000-multinode-oooq- (1 more message)18:58
*** agopi has quit IRC20:03
hubbotFAILING CHECK JOBS on stable/ocata: legacy-tripleo-ci-centos-7-ovb-1ctlr_1comp_1ceph-featureset024-ocata @ https://review.openstack.org/564291, master: tripleo-ci-centos-7-scenario001-multinode-oooq-container, tripleo-ci-centos-7-scenario002-multinode-oooq-container @ https://review.openstack.org/560445, stable/queens: tripleo-ci-centos-7-scenario000-multinode-oooq-container-upgrades @  (1 more message)20:58
*** holser_ has quit IRC21:23
*** Goneri has quit IRC21:35
*** holser_ has joined #oooq21:53
sshnaidm|rover%gatestatus22:28
hubbotFAILING CHECK JOBS on stable/ocata: legacy-tripleo-ci-centos-7-ovb-1ctlr_1comp_1ceph-featureset024-ocata @ https://review.openstack.org/564291, master: tripleo-ci-centos-7-scenario001-multinode-oooq-container, tripleo-ci-centos-7-scenario002-multinode-oooq-container @ https://review.openstack.org/560445, stable/queens: tripleo-ci-centos-7-scenario000-multinode-oooq-container-upgrades @  (1 more message)22:28
*** holser_ has quit IRC22:48
hubbotFAILING CHECK JOBS on stable/ocata: legacy-tripleo-ci-centos-7-ovb-1ctlr_1comp_1ceph-featureset024-ocata @ https://review.openstack.org/564291, master: tripleo-ci-centos-7-scenario001-multinode-oooq-container, tripleo-ci-centos-7-scenario002-multinode-oooq-container @ https://review.openstack.org/560445, stable/queens: tripleo-ci-centos-7-scenario000-multinode-oooq-container-upgrades @  (1 more message)22:58
*** agopi has joined #oooq23:18
*** tosky has quit IRC23:55
*** matbu has quit IRC23:59
