Friday, 2016-05-27

*** sdake_ has joined #openstack-infra00:00
*** ajmiller_ has quit IRC00:00
bkeroI think apache reload will serve old requests with old certs and new requests with new certs. And as long as you don't have long-running transactions or state that matters on something like a LB it's okay00:00
bkerobut I imagine clients can have state that gets pretty grumpy if the server cert changes during a live connection00:01
clarkbya I don't have details on why the new cert broke things today00:01
clarkband it was broken for new connections not existing ones00:01
bkeroSo...whenever my letsencrypt upgrade cert script runs, I need to cat the entire chain into the cert file to be served00:02
fungiapache needs a restart to load new certs, reload won't do it last i checked. but anyway i expect that wasn't the root of the problem00:02
*** zhurong has quit IRC00:02
bkeroOtherwise there's an incomplete chain to my end cert00:02
*** sdake has quit IRC00:02
bkerosuper lazymode script: http://paste.openstack.org/show/505759/00:03
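Editor's sketch of the chain concatenation bkero describes above: the served certificate file holds the end certificate followed by the intermediate chain. This is not the script from the paste link. The function names and any file paths passed in are hypothetical, and the order (end certificate first, then intermediates) is an assumption about what the server expects.

```python
from pathlib import Path


def build_fullchain(leaf_pem: str, chain_pem: str) -> str:
    """Return the end certificate followed by the intermediates as one PEM string.

    Surrounding whitespace is stripped from each input so the PEM blocks are
    separated by exactly one newline and the result ends with a newline.
    """
    parts = [leaf_pem.strip(), chain_pem.strip()]
    return "\n".join(p for p in parts if p) + "\n"


def write_fullchain(leaf_path: str, chain_path: str, out_path: str) -> None:
    """Read both PEM files (hypothetical paths) and write the combined file."""
    leaf = Path(leaf_path).read_text()
    chain = Path(chain_path).read_text()
    Path(out_path).write_text(build_fullchain(leaf, chain))
```

A renewal hook could call write_fullchain() after each certificate refresh and then restart the web server, since fungi notes above that a reload may not pick up new certificates.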
*** earlephilhower has quit IRC00:03
openstackgerritIan Wienand proposed openstack-infra/project-config: Add tracing flag to dib-buildimage-atomic  https://review.openstack.org/32190000:03
madhuvishyzaro: yup! I understand it needs to be +2-ed :) It's currently blocking some of my work at Wikimedia on automating maven jar releases, so was wondering if someone could help it get merged sooner! Thank you :)00:04
fungii wish i'd had an opportunity to point openssl s_client at it while broken, but i missed the excitement so no idea what was wrong with it really00:04
*** nelsnelson has joined #openstack-infra00:11
*** ddieterly is now known as ddieterly[away]00:12
*** banix has joined #openstack-infra00:12
*** dims has quit IRC00:13
*** vhosakot has quit IRC00:14
*** SumitNaiksatam has quit IRC00:14
*** mtanino has joined #openstack-infra00:16
*** denisra has quit IRC00:17
*** Jeffrey4l has joined #openstack-infra00:17
*** _sarob has quit IRC00:18
*** vhosakot has joined #openstack-infra00:20
*** vhosakot has quit IRC00:21
*** ddieterly[away] is now known as ddieterly00:21
openstackgerritSachi King proposed openstack-dev/pbr: Restore warnerrors behavior  https://review.openstack.org/22995100:22
*** xarses has joined #openstack-infra00:22
*** vhosakot has joined #openstack-infra00:22
*** baoli has quit IRC00:22
*** baoli has joined #openstack-infra00:23
*** dimtruck is now known as zz_dimtruck00:24
openstackgerritPaul Belanger proposed openstack-infra/project-config: Add support for xenial-backports  https://review.openstack.org/32190400:24
pabelangerfungi: clarkb: jeblair: xenial-backports are a thing now^00:24
fungiyay progress!00:25
*** matrohon has quit IRC00:25
*** r-mibu has quit IRC00:26
*** r-mibu has joined #openstack-infra00:26
*** pvaneck has quit IRC00:27
*** markvoelker has joined #openstack-infra00:29
*** Qiming has quit IRC00:29
*** bpokorny has quit IRC00:31
*** cody-somerville has joined #openstack-infra00:33
*** nadya has joined #openstack-infra00:34
*** markvoelker has quit IRC00:36
*** zz_dimtruck is now known as dimtruck00:37
*** mixos has joined #openstack-infra00:38
*** nadya has quit IRC00:38
*** ddieterly is now known as ddieterly[away]00:40
*** mtanino has quit IRC00:41
*** shashank_hegde has quit IRC00:42
JayFfungi: another fun email responder gerrit case. Someone subscribed to all of nova is apparently autoresponding directly to me, and it's from the same organization/domain as the other problem from the other day.00:45
fungiJayF: another "i don't work here" autoresponder? let me know the e-mail address and i'll null it out00:47
JayFfungi: not an "i don't work here", it's an "I'm on vacation" responder, but went directly to me. I think the other day you indicated that would be bad behavior by that domain (to send to me instead of to gerrit)00:47
fungiJayF: er, yeah or responding to it at all for a number of reasons00:48
*** vhosakot has quit IRC00:50
*** ddieterly[away] is now known as ddieterly00:54
*** cody-somerville has quit IRC00:56
*** dims has joined #openstack-infra00:56
*** ddieterly is now known as ddieterly[away]00:57
openstackgerritEmilien Macchi proposed openstack-infra/project-config: puppet: move puppet4 jobs into check pipeline  https://review.openstack.org/32183700:58
*** ddieterly[away] is now known as ddieterly01:00
*** gyee has quit IRC01:03
*** asettle has joined #openstack-infra01:10
*** zhurong has joined #openstack-infra01:11
*** esker has joined #openstack-infra01:15
*** Daisy has joined #openstack-infra01:16
*** Daisy_ has joined #openstack-infra01:17
*** asettle has quit IRC01:17
*** esker has quit IRC01:19
*** Daisy has quit IRC01:20
*** rhallisey has quit IRC01:21
*** gomarivera has joined #openstack-infra01:22
*** vhosakot has joined #openstack-infra01:23
*** kzaitsev_mb has quit IRC01:24
*** mixos has quit IRC01:27
*** mixos has joined #openstack-infra01:28
*** baoli has quit IRC01:29
*** vhosakot has quit IRC01:30
*** baoli has joined #openstack-infra01:31
*** gomarivera has quit IRC01:34
*** claudiub has joined #openstack-infra01:35
*** Daisy_ has quit IRC01:35
*** Daisy has joined #openstack-infra01:36
openstackgerritMorgan Fainberg proposed openstack-infra/nodepool: Python 3 Fix: Use six.ByesIO  https://review.openstack.org/32191801:36
openstackgerritMorgan Fainberg proposed openstack-infra/nodepool: Python 3 Fix: cmp -> key function  https://review.openstack.org/32191901:36
*** yanyanhu has joined #openstack-infra01:38
*** Qiming has joined #openstack-infra01:38
*** claudiub|2 has quit IRC01:39
*** yamahata has quit IRC01:42
*** sdake has joined #openstack-infra01:45
*** hichihara has joined #openstack-infra01:45
*** SumitNaiksatam has joined #openstack-infra01:47
*** amitgandhinz has joined #openstack-infra01:47
*** Daisy has quit IRC01:48
*** sdake_ has quit IRC01:48
*** Daisy has joined #openstack-infra01:48
openstackgerritMorgan Fainberg proposed openstack-infra/nodepool: Python 3 fix: Use new-style raise syntax  https://review.openstack.org/32192601:49
openstackgerritMorgan Fainberg proposed openstack-infra/nodepool: Python 3 Fixes: Encode config write in tests  https://review.openstack.org/32192701:49
openstackgerritMorgan Fainberg proposed openstack-infra/nodepool: Python 3 fixes: dict.iteritems  https://review.openstack.org/32192801:49
*** esker has joined #openstack-infra01:51
*** amrith is now known as _amrith_01:51
*** _amrith_ is now known as amrith01:52
*** amitgandhinz has quit IRC01:53
*** Daisy has quit IRC01:53
*** esker has quit IRC01:55
*** Daisy has joined #openstack-infra01:57
*** claudiub|2 has joined #openstack-infra02:01
*** Apoorva has quit IRC02:02
*** amrith is now known as _amrith_02:04
*** claudiub has quit IRC02:04
*** _amrith_ is now known as amrith02:06
*** bpokorny has joined #openstack-infra02:06
*** Daisy has quit IRC02:07
*** Daisy has joined #openstack-infra02:07
*** kushal has quit IRC02:07
*** amrith is now known as _amrith_02:08
*** bpokorny_ has joined #openstack-infra02:08
*** _amrith_ is now known as amrith02:09
*** ddieterly is now known as ddieterly[away]02:10
*** amrith is now known as _amrith_02:11
*** bpokorny has quit IRC02:11
*** _amrith_ is now known as amrith02:11
*** bpokorny_ has quit IRC02:12
*** hparekh has quit IRC02:15
*** tlian has quit IRC02:18
*** nwkarsten has joined #openstack-infra02:19
*** Daisy_ has joined #openstack-infra02:25
*** sdake_ has joined #openstack-infra02:26
*** amrith is now known as _amrith_02:27
*** sdake has quit IRC02:28
*** Sam-I-Am has quit IRC02:28
*** Daisy has quit IRC02:29
*** _amrith_ is now known as amrith02:29
*** Daisy has joined #openstack-infra02:32
*** antonym has quit IRC02:32
*** markvoelker has joined #openstack-infra02:32
*** nwkarsten has quit IRC02:32
*** Madasi has quit IRC02:32
*** Daisy_ has quit IRC02:33
*** cody-somerville has joined #openstack-infra02:34
*** nwkarsten has joined #openstack-infra02:36
*** openstackgerrit has quit IRC02:36
*** hockeynut has quit IRC02:36
*** markvoelker has quit IRC02:36
*** hockeynut has joined #openstack-infra02:37
*** erikwilson has quit IRC02:37
mwhahahaso where'd gerrit go?02:38
*** Madasi has joined #openstack-infra02:38
yanyanhuit's broken?02:39
mwhahahaseems down?02:40
*** erikmwilson has joined #openstack-infra02:40
*** nwkarsten has quit IRC02:40
*** openstackgerrit has joined #openstack-infra02:42
ianwmwhahaha : seems ok here02:42
mwhahahajust came back02:42
ianwheh, i guess jhesketh's fix for http://cacti.openstack.org/ hasn't hit02:43
jheskethianw: yeah, you can still access it at http://cacti.openstack.org/cacti/graph_view.php though02:43
mwhahahaalso i'm getting 500s when i review02:44
jheskethbut if people want to review https://review.openstack.org/#/c/321352/ , that'll solve it02:44
jheskethmwhahaha: any particular reviews?02:44
*** Madasi has quit IRC02:45
mwhahahai reviewed https://review.openstack.org/#/c/312280/ and https://review.openstack.org/#/c/321860/, when i hit submit both 500ed but it seemed to still work02:45
jheskethhmm02:45
*** antonym has joined #openstack-infra02:47
ianwyeah, actually just reviewing that change i got a 50002:48
*** amitgandhinz has joined #openstack-infra02:49
ianwload average, memory usage look about right02:49
jheskethianw: which of the two did you review?02:50
jheskethit looks like yours may not have been saved02:50
*** yuanying has quit IRC02:50
*** Madasi has joined #openstack-infra02:50
ianwhttps://review.openstack.org/#/c/321352/02:50
jheskethoh right02:50
jheskethand that one was saved02:51
jheskethyep, got it too02:51
jheskethso it's possibly all reviews02:51
mwhahahawondering if it was/is a network issue, because it was down-down for me and i did one of those "is it down or just me" things and it was reporting down02:51
fungijava gc chewing up the system again?02:51
jheskethfungi: what's the best way to tell?02:52
fungijavamelody graphs02:52
jheskethyeah I was looking at those and they seem okay02:53
*** amitgandhinz has quit IRC02:54
fungithe garbage collection graph doesn't show it grinding continually?02:55
mesterymwhahaha: I'm also seeing issues doing "git review", getting "unpack failed: error Read-only file system"02:55
*** openstackgerrit has quit IRC02:56
mwhahaha:o02:56
*** mahito has joined #openstack-infra02:56
mesterymwhahaha: http://paste.openstack.org/show/505797/02:56
*** rfolco has quit IRC02:56
*** antonym has quit IRC02:57
ianychoiNeither do I. I cannot do code-review now.02:57
mwhahahacan't load paste.openstack.org now heh02:57
fungi[Fri May 27 02:37:49 2016] end_request: I/O error, dev xvdc, sector 6312642402:57
mesteryfungi: That doesn't look good ;)02:57
ianwthere is this sort of odd square-wave for i/o on several mounted drives -> http://cacti.openstack.org/cacti/graph.php?action=view&local_graph_id=4588&rra_id=all02:57
fungiwe've got block storage errors from cinder02:57
fungii'll offline it and fsck02:57
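Editor's sketch of how kernel log lines like the one fungi pasted above could be scanned for block-device I/O errors. The regular expression is written only against that single quoted line, so treating it as the general format of such messages is an assumption, and the function name is hypothetical.

```python
import re

# Pattern derived from the one kernel message quoted in the log above;
# other kernels or drivers may word this differently.
IO_ERROR = re.compile(
    r"end_request: I/O error, dev (?P<dev>\w+), sector (?P<sector>\d+)"
)


def find_io_errors(lines):
    """Return (device, sector) tuples for every line reporting an I/O error."""
    hits = []
    for line in lines:
        match = IO_ERROR.search(line)
        if match:
            hits.append((match.group("dev"), int(match.group("sector"))))
    return hits
```

Feeding it the kernel log output line by line gives a quick list of affected devices to check before taking a volume offline.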
ianychoidraft comments are fine..02:58
*** Madasi has quit IRC02:58
jheskethfungi: ouch..02:58
jheskethshould we send a status02:58
fungi#status alert Gerrit is going offline briefly to check possible filesystem corruption02:58
openstackstatusfungi: sending alert02:58
*** woodster_ has quit IRC02:58
*** Sukhdev has joined #openstack-infra02:58
*** Daisy_ has joined #openstack-infra02:59
*** mahito has quit IRC02:59
fungiprobably network maintenance or an unplanned outage in rackspace dfw impacting connectivity between the nova host and cinder backend02:59
fungior at least that's the usual cause03:00
ianychoiI see, thanks! Hope that gerrit will recover soon..03:00
-openstackstatus- NOTICE: Gerrit is going offline briefly to check possible filesystem corruption03:00
*** ChanServ changes topic to "Gerrit is going offline briefly to check possible filesystem corruption"03:00
*** jamielennox is now known as jamielennox|away03:01
fungiokay, gerrit is on its way back up now03:01
fungisee if it's any better and then we can #status ok03:01
*** Daisy has quit IRC03:02
fungimwhahaha: mestery: ianychoi: ianw: jhesketh: ^03:03
openstackstatusfungi: finished sending alert03:03
*** hockeynut has quit IRC03:04
*** thorst_ has joined #openstack-infra03:07
*** shashank_hegde has joined #openstack-infra03:07
*** erikmwilson has quit IRC03:07
mwhahahadidn't get a 500 on one review, so that's an improvement :D03:07
*** erikmwilson has joined #openstack-infra03:07
*** sdake has joined #openstack-infra03:07
fungimestery: hopefully your git review command works now if you retry?03:08
*** anteaya has quit IRC03:08
*** hockeynut has joined #openstack-infra03:08
ianwgit.openstack.org[0: 104.130.246.128]: errno=Connection timed out03:09
funginice03:09
ianwmight be unrelated, that's from a dib build i'm doing03:09
fungi"On 26 May 2016, at 21:26 CDT, engineers were alerted to a switching loop occurring in the DFW1 data center. Engineers are engaged and working to resolve the issue. During this time, Customers may be unable to access their Cloud instances hosted within the DFW1 data center."03:10
fungihttps://status.rackspace.com/03:10
*** antonym has joined #openstack-infra03:10
*** ddieterly[away] has quit IRC03:10
*** Madasi has joined #openstack-infra03:11
fungiso i'm going with "unplanned outage" as the cause here ;)03:11
*** sdake_ has quit IRC03:11
* ianw takes an unplanned afternoon tea break03:13
fungithere's also...03:13
fungi"The Rackspace Open Cloud system engineers will perform a priority maintenance to the control infrastructure of our Next Generation Cloud Servers regions during the following dates and times: [...] DFW Region - May 26th from 10:00 PM CDT - May 27th 5:00 AM CDT"03:13
*** SumitNaiksatam has quit IRC03:13
fungithough that one's only supposed to impact api endpoints03:14
fungiso this is more likely the bridge loop impact (somebody got drunk and turned off stp?)03:14
fungi(i used to do that all the time, just for laughs)03:15
*** Daisy_ has quit IRC03:15
*** Daisy has joined #openstack-infra03:16
*** antonym has quit IRC03:16
fungianyway, no new errors reported for gerrit's cinder volume since the fsck and remount03:16
fungii'm going to go with it should be all better now03:17
*** Madasi has quit IRC03:17
fungi#status ok after a quick check, gerrit and its filesystem have been brought back online and should be working again03:18
openstackstatusfungi: sending ok03:18
*** antonym has joined #openstack-infra03:18
*** openstackgerrit has joined #openstack-infra03:18
fungimestery: thanks for saying "Read-only file system" since that helped me zero in on the problem instantly03:19
*** amotoki has quit IRC03:20
*** ChanServ changes topic to "[sprint in progress on #openstack-sprint] Discussion of OpenStack Developer and Community Infrastructure | docs http://docs.openstack.org/infra/ | bugs https://storyboard.openstack.org/ | source https://git.openstack.org/cgit/openstack-infra/ | channel logs http://eavesdrop.openstack.org/irclogs/%23openstack-infra/"03:20
-openstackstatus- NOTICE: after a quick check, gerrit and its filesystem have been brought back online and should be working again03:20
mwhahaha500s again :(03:20
fungino new filesystem errors, so that may just be network still broken in parts of rackspace03:21
mwhahahak03:21
fungiare the 5xx errors intermittent?03:23
openstackstatusfungi: finished sending ok03:23
*** Madasi has joined #openstack-infra03:23
*** Sam-I-Am has joined #openstack-infra03:25
fungioh, you know what? i bet it's also disrupting connectivity to gerrit's trove instance03:25
fungisome of the messages in gerrit's error log (ones that i don't recognize as the usual flood of benign noise in there) imply database socket timeouts03:26
fungianyway, it's well past my bedtime03:27
fungisorry jhesketh to leave you dealing with broken rackspace03:27
jheskethno worries, it's not your fault03:27
jheskethprobably isn't much I can do in regards to the network03:27
fungii don't think there's much we can do 'till this storm blows over03:27
jheskethyeah03:28
jheskeththanks for fixing the filesystem though03:28
jheskethfungi: get some sleep :-)03:28
fungithanks. hopefully they'll have figured out how to use spanning tree by the time i wake up ;)03:28
funginight all!03:29
ianychoi:) Thanks!03:31
jhesketho.03:32
jhesketh*o/03:32
*** yamahata has joined #openstack-infra03:34
mesteryfungi: Thanks for the help! :)03:36
*** thorst_ has quit IRC03:38
*** thorst_ has joined #openstack-infra03:39
*** Douhet has quit IRC03:40
*** bpokorny has joined #openstack-infra03:41
*** Douhet has joined #openstack-infra03:43
*** Daisy has quit IRC03:43
*** Daisy has joined #openstack-infra03:44
*** baoli has quit IRC03:44
*** phschwartz has joined #openstack-infra03:45
*** baoli has joined #openstack-infra03:45
*** Sukhdev has quit IRC03:45
*** links has joined #openstack-infra03:46
*** thorst_ has quit IRC03:48
*** gomarivera has joined #openstack-infra03:48
*** Daisy has quit IRC03:48
*** yuanying has joined #openstack-infra03:48
*** baoli has quit IRC03:50
*** amitgandhinz has joined #openstack-infra03:50
*** baoli has joined #openstack-infra03:51
*** amitgandhinz has quit IRC03:55
*** fawadkhaliq has joined #openstack-infra04:04
*** nadya has joined #openstack-infra04:05
*** sree has joined #openstack-infra04:06
*** fawadkhaliq has quit IRC04:07
*** Daisy has joined #openstack-infra04:08
*** zhurong has quit IRC04:09
*** nadya has quit IRC04:09
*** cody-somerville has quit IRC04:10
*** claudiub|2 has quit IRC04:10
*** Sam-I-Am has quit IRC04:12
*** banix has quit IRC04:12
*** yamamoto has quit IRC04:13
*** zhurong has joined #openstack-infra04:14
*** amotoki has joined #openstack-infra04:14
*** Daisy has quit IRC04:17
*** Sam-I-Am has joined #openstack-infra04:18
*** Daisy has joined #openstack-infra04:18
*** cody-somerville_ has joined #openstack-infra04:18
Qimingjhesketh, still there?04:19
*** yamahata has quit IRC04:20
Qimingthanks for w+1 this patch: https://review.openstack.org/#/c/318453/04:20
jheskethQiming: yep, I'm around04:20
Qimingbut I'm afraid the gate job was not in queue due to the gerrit reboot? could you please help re-approve it? ... not sure if it is necessary04:21
Qimingthanks!04:21
*** armax has quit IRC04:21
jheskethQiming: I've left a recheck which should get it picked up again04:21
Qimingokay, great! thank you.04:22
jheskethbecause of network issues at rackspace though it's likely the system will be under a little bit of turbulence so it may take a bit still04:22
Qimingjhesketh, will keep an eye on the progress04:23
*** kdas__ has joined #openstack-infra04:23
*** psachin has joined #openstack-infra04:24
*** sdake_ has joined #openstack-infra04:25
*** nwkarsten has joined #openstack-infra04:27
*** sdake has quit IRC04:27
*** sree_ has joined #openstack-infra04:28
*** sree_ is now known as Guest3941104:28
*** amotoki has quit IRC04:28
*** sree has quit IRC04:29
*** kdas__ is now known as kushal04:29
*** yamahata has joined #openstack-infra04:29
*** kushal has quit IRC04:29
*** kushal has joined #openstack-infra04:29
*** sdake has joined #openstack-infra04:31
*** markvoelker has joined #openstack-infra04:32
*** Sukhdev has joined #openstack-infra04:33
*** sdake_ has quit IRC04:33
*** Daisy_ has joined #openstack-infra04:34
*** yfried has quit IRC04:34
*** Daisy has quit IRC04:35
*** markvoelker has quit IRC04:37
*** Douhet has quit IRC04:38
*** maishsk has quit IRC04:41
*** nwkarsten has quit IRC04:42
*** gomarivera has quit IRC04:42
*** bpokorny has quit IRC04:44
*** nwkarsten has joined #openstack-infra04:44
*** jamesmcarthur has joined #openstack-infra04:45
*** thorst_ has joined #openstack-infra04:45
*** roxanaghe has joined #openstack-infra04:48
*** jamesmcarthur has quit IRC04:49
*** jaosorior has joined #openstack-infra04:50
*** amitgandhinz has joined #openstack-infra04:51
*** thorst_ has quit IRC04:52
*** amitgandhinz has quit IRC04:56
*** flwang1 has quit IRC04:56
*** yamamot__ has joined #openstack-infra04:57
*** Guest39411 has quit IRC04:58
*** dimtruck is now known as zz_dimtruck04:58
*** gomarivera has joined #openstack-infra04:58
*** hparekh has joined #openstack-infra05:00
*** gomarivera has quit IRC05:03
*** nadya has joined #openstack-infra05:03
*** nwkarsten has quit IRC05:03
*** nwkarsten has joined #openstack-infra05:06
openstackgerritColleen Murphy proposed openstack-infra/puppet-bandersnatch: Fix acceptance tests  https://review.openstack.org/32006805:08
*** maishsk has joined #openstack-infra05:08
*** amotoki has joined #openstack-infra05:11
*** maishsk has quit IRC05:12
*** ilyashakhat has joined #openstack-infra05:14
*** roxanaghe has quit IRC05:14
*** maishsk has joined #openstack-infra05:15
*** zhurong has quit IRC05:17
*** zhurong has joined #openstack-infra05:20
*** armax has joined #openstack-infra05:23
*** armax has quit IRC05:23
*** gildub has joined #openstack-infra05:28
*** baoli has quit IRC05:29
*** sarob has joined #openstack-infra05:31
*** salv-orlando has joined #openstack-infra05:33
openstackgerritMorgan Fainberg proposed openstack-infra/nodepool: Python 3 Fixes: use bytes instead of str  https://review.openstack.org/32195705:34
*** sarob has quit IRC05:35
*** nwkarsten has quit IRC05:35
*** xwizard has quit IRC05:35
*** xwizard has joined #openstack-infra05:36
rakhmerovhi, seems like new jobs don't start05:37
rakhmerovis it being taken care of?05:37
rakhmerovfungi: ^05:38
openstackgerritzhurong proposed openstack-infra/project-config: Add check-requirements for solum  https://review.openstack.org/32195805:38
*** ramishra has quit IRC05:38
*** nwkarsten has joined #openstack-infra05:38
*** ramishra has joined #openstack-infra05:40
zaromadhuvishy: you might want to ping hashar for a review05:44
*** mixos has quit IRC05:45
zarorakhmerov: which ones?  have you tried 'recheck'?05:45
rakhmerovzaro: I sent https://review.openstack.org/#/c/317879/ ~ 20 mins ago and still see 15 check jobs in Zuul05:46
rakhmerovmine didn't start yet05:46
*** bhavik has joined #openstack-infra05:47
openstackgerritIan Wienand proposed openstack/diskimage-builder: Cleanup source-repositories output  https://review.openstack.org/32196105:49
*** binbincong has quit IRC05:49
*** thorst_ has joined #openstack-infra05:50
*** nwkarsten has quit IRC05:50
*** nwkarsten has joined #openstack-infra05:51
*** amitgandhinz has joined #openstack-infra05:51
*** sdake_ has joined #openstack-infra05:53
*** sdake_ has quit IRC05:53
*** sdake_ has joined #openstack-infra05:53
*** nwkarsten has quit IRC05:55
*** nadya has quit IRC05:55
*** amitgandhinz has quit IRC05:56
*** sdake has quit IRC05:56
*** thorst_ has quit IRC05:57
rakhmerovzaro: do I need to let someone else know about it?05:58
rakhmerovdon't know exactly who to ping05:58
*** sdake_ has quit IRC05:59
*** ilyashakhat has quit IRC06:04
*** rcernin has joined #openstack-infra06:05
rakhmerovSergeyLukjanov: hi Sergey, do you happen to know about what I wrote above?06:06
*** ilyashakhat has joined #openstack-infra06:06
*** binbincong has joined #openstack-infra06:10
*** aeng has quit IRC06:10
jaosorioryep, seems that jobs are getting stuck06:10
jaosoriorrechecks won't help06:10
*** YorikSar has quit IRC06:13
*** YorikSar has joined #openstack-infra06:15
*** rcernin has quit IRC06:15
*** ffrank has joined #openstack-infra06:17
*** yfried has joined #openstack-infra06:17
*** binbincong has quit IRC06:18
*** rcernin has joined #openstack-infra06:20
*** salv-orlando has quit IRC06:24
*** ilyashakhat has quit IRC06:24
jheskethRackspace has had some networking trouble so I suspect zuul is stuck in a bad state06:25
jheskethI'll take a look and see if I can get it moving along06:25
*** Sukhdev has quit IRC06:27
*** javeriak has joined #openstack-infra06:27
rakhmerovjhesketh: yes, thanks06:27
*** nadya has joined #openstack-infra06:30
*** binbincong has joined #openstack-infra06:31
*** markvoelker has joined #openstack-infra06:33
jheskethzuul isn't processing its queues, but I'm not sure why... it likely got stuck talking to gerrit when we had to restart it06:34
*** megm has quit IRC06:34
jheskethI think it'll require a restart to fix but that'll lose 4000+ events...06:35
jheskethyolanda: ping06:35
*** mikelk has joined #openstack-infra06:35
*** megm has joined #openstack-infra06:36
*** markvoelker has quit IRC06:38
jheskethI'm going to shut down zuul and hope it writes out its events.. otherwise people will need to recheck their patches06:39
rakhmerovok06:40
*** daemontool has joined #openstack-infra06:44
*** flepied has joined #openstack-infra06:46
*** kushal has quit IRC06:47
*** Daisy has joined #openstack-infra06:47
jheskeththe queue wasn't able to be reloaded... it's back up and running though so I'm going to watch some results before sending a notice for people to recheck missing jobs06:48
jheskethhmm the multinode jobs aren't registered...06:48
*** kushal has joined #openstack-infra06:48
*** ffrank has quit IRC06:49
openstackgerritMartin André proposed openstack-dev/cookiecutter: Add missing license info to requirements.txt  https://review.openstack.org/32197506:50
*** Daisy_ has quit IRC06:51
*** amitgandhinz has joined #openstack-infra06:52
openstackgerritMartin André proposed openstack-dev/cookiecutter: Add missing license info to requirements.txt  https://review.openstack.org/32197506:55
*** thorst_ has joined #openstack-infra06:55
*** amitgandhinz has quit IRC06:57
openstackgerritMerged openstack-infra/project-config: Add Senlin support to rally-gate  https://review.openstack.org/31845307:02
*** thorst_ has quit IRC07:02
*** maishsk has quit IRC07:04
openstackgerritVasyl Saienko proposed openstack-infra/devstack-gate: Allow to pass OS_TEST_TIMEOUT for grenade job  https://review.openstack.org/31666207:05
*** maishsk has joined #openstack-infra07:06
openstackgerritVasyl Saienko proposed openstack-infra/devstack-gate: DO NOT REVIEW  https://review.openstack.org/31549907:06
*** tdasilva has quit IRC07:07
*** ilyashakhat has joined #openstack-infra07:08
*** Daisy has quit IRC07:09
*** Daisy has joined #openstack-infra07:09
*** frickler has quit IRC07:10
*** vincentll has joined #openstack-infra07:10
*** Mmike has quit IRC07:10
*** Daisy has quit IRC07:11
*** Daisy has joined #openstack-infra07:11
*** Daisy has quit IRC07:11
*** Daisy has joined #openstack-infra07:12
jhesketh#status notice zuul required a restart due to network outages. If your change is not listed on http://status.openstack.org/zuul/ and is missing results, please issue a 'recheck'.07:12
openstackstatusjhesketh: sending notice07:12
-openstackstatus- NOTICE: zuul required a restart due to network outages. If your change is not listed on http://status.openstack.org/zuul/ and is missing results, please issue a 'recheck'.07:13
*** ccamacho has quit IRC07:14
openstackstatusjhesketh: finished sending notice07:15
*** ifarkas has joined #openstack-infra07:15
*** ccamacho has joined #openstack-infra07:16
*** Mmike has joined #openstack-infra07:16
openstackgerritMartin André proposed openstack-dev/cookiecutter: Add missing license info to requirements.txt  https://review.openstack.org/32197507:16
*** ilyashakhat has quit IRC07:17
*** Daisy has quit IRC07:17
*** camunoz has quit IRC07:19
openstackgerritMerged openstack-infra/tripleo-ci: Add MysqlInternal endpoint to enable-tls  https://review.openstack.org/32136307:21
*** schang has quit IRC07:24
*** kushal has quit IRC07:24
*** bhavik has quit IRC07:25
*** tdasilva has joined #openstack-infra07:27
*** flepied has quit IRC07:27
*** frickler has joined #openstack-infra07:31
*** schang has joined #openstack-infra07:31
*** daemontool has quit IRC07:32
*** oanson has joined #openstack-infra07:33
*** strigazi has quit IRC07:33
*** bhavik has joined #openstack-infra07:35
*** bauzas is now known as bauwser07:35
*** tesseract has joined #openstack-infra07:36
*** claudiub|2 has joined #openstack-infra07:40
*** amotoki_ has joined #openstack-infra07:41
*** hashar has joined #openstack-infra07:43
*** amotoki has quit IRC07:43
*** amoralej|off is now known as amoralej07:44
*** yamahata has quit IRC07:46
*** arxcruz has joined #openstack-infra07:49
*** salv-orlando has joined #openstack-infra07:50
*** ilyashakhat has joined #openstack-infra07:51
*** amitgandhinz has joined #openstack-infra07:53
*** ilyashakhat has quit IRC07:55
*** gildub has quit IRC07:56
*** ifarkas_ has joined #openstack-infra07:56
*** ifarkas has quit IRC07:56
*** sarob has joined #openstack-infra07:58
*** amitgandhinz has quit IRC07:58
*** sree has joined #openstack-infra07:59
*** zzzeek has quit IRC08:00
*** zzzeek has joined #openstack-infra08:00
*** thorst_ has joined #openstack-infra08:00
*** sarob has quit IRC08:02
*** afazekas|sick is now known as afazekas08:03
*** pilgrimstack has joined #openstack-infra08:03
*** shashank_hegde has quit IRC08:05
*** thorst_ has quit IRC08:07
*** flepied has joined #openstack-infra08:08
*** bhavik has quit IRC08:09
*** jordanP has joined #openstack-infra08:13
*** sree has quit IRC08:13
*** sree has joined #openstack-infra08:19
*** hichihar_ has joined #openstack-infra08:19
*** claudiub|2 has quit IRC08:19
*** pahuang has quit IRC08:20
*** slaweq has quit IRC08:21
*** hichihara has quit IRC08:22
*** sree has quit IRC08:24
*** asettle has joined #openstack-infra08:25
*** slaweq has joined #openstack-infra08:26
*** asettle has quit IRC08:26
*** tosky has joined #openstack-infra08:30
*** tosky has left #openstack-infra08:31
*** tosky has joined #openstack-infra08:31
*** markusry has joined #openstack-infra08:31
*** vincentll has quit IRC08:34
*** YorikSar has quit IRC08:35
*** derekh has joined #openstack-infra08:35
*** YorikSar has joined #openstack-infra08:36
*** zeih has joined #openstack-infra08:42
*** e0ne has joined #openstack-infra08:43
*** arxcruz has quit IRC08:43
*** dmk0202 has joined #openstack-infra08:46
*** strigazi has joined #openstack-infra08:47
*** amrith is now known as _amrith_08:47
*** yuanying has quit IRC08:47
*** pbourke_ has quit IRC08:49
*** pbourke_ has joined #openstack-infra08:49
*** zhurong has quit IRC08:52
*** zhurong has joined #openstack-infra08:53
wznoinskhi all08:53
*** yuanying has joined #openstack-infra08:53
*** yanyanhu has quit IRC08:54
*** amitgandhinz has joined #openstack-infra08:54
*** yanyanhu has joined #openstack-infra08:54
*** yuanying has quit IRC08:54
*** yanyanhu has quit IRC08:55
wznoinskI'm looking for the best way to 'pause' my CI: in case of a site-wide issue where all/most of the jobs are impacted, I'd like to stop running any real jobs and do some testing to find a workaround, and when a solution is found unpause zuul and the jobs... I'm wondering what's the best way to do it?08:55
*** javeriak has quit IRC08:56
*** Qiming has quit IRC08:57
*** amitgandhinz has quit IRC08:59
*** jaosorior is now known as jaosorior_lunch08:59
*** dizquierdo has joined #openstack-infra08:59
rcarrillocruzwznoinsk: http://git.openstack.org/cgit/openstack-infra/system-config/tree/doc/source/zuul.rst#n11109:00
*** flwang1 has joined #openstack-infra09:01
*** YorikSar has quit IRC09:01
*** eezhova has joined #openstack-infra09:01
*** YorikSar has joined #openstack-infra09:03
*** HeOS has joined #openstack-infra09:04
*** thorst_ has joined #openstack-infra09:05
openstackgerritMarkus Zoeller (markus_z) proposed openstack-infra/release-tools: update README for the script to expire old bug reports  https://review.openstack.org/32201909:06
*** amotoki_ has quit IRC09:10
*** yuanying has joined #openstack-infra09:10
*** pbourke_ has quit IRC09:10
*** esikachev has joined #openstack-infra09:10
*** nadya has quit IRC09:12
*** pbourke_ has joined #openstack-infra09:12
*** thorst_ has quit IRC09:12
*** yuanying has quit IRC09:15
wznoinskrcarrillocruz that's not exactly what I was looking for... I know how to restart zuul, I'm more interested in how to keep it reading events while not executing the jobs till I give it a 'green' light that some site-wide issue is now resolved09:15
*** vincentll has joined #openstack-infra09:15
rcarrillocruzwell, that's not just restart zuul, but saving the queues state and reloading them after a zuul restart. If you are looking for a way for zuul to start reading the events stream , I don't think there's a way to do that09:16
rcarrillocruzs/start/stop09:16
wznoinskfor the moment I've put Jenkins into shutdown mode, so it's not executing anything but the jobs are still registered with the gearman server. That does what I wanted, but I would like to still be able to kick off some test jobs from jenkins while resolving the issue manually09:17
*** bhavik has joined #openstack-infra09:17
strigazihi all, I'd like some feedback on https://review.openstack.org/#/c/321026/09:18
*** daemontool has joined #openstack-infra09:18
*** zzzeek has quit IRC09:20
*** amotoki has joined #openstack-infra09:20
rcarrillocruzwznoinsk: zuul is now under some major change, switching from zuul-gearman-jenkins to zuul launching jobs via ansible09:22
rcarrillocruzcheck it out with jeblair  to discuss that use case09:23
rcarrillocruzjhesketh too09:23
*** ilyashakhat has joined #openstack-infra09:24
wznoinskthanks, will do09:25
*** mhickey has joined #openstack-infra09:26
*** amotoki has quit IRC09:27
yolandagood morning09:27
*** electrofelix has joined #openstack-infra09:27
strigaziHi yolanda, I'm Spyros from magnum team. I want to add a non-voting job at the rally gate to test our benchmark scenarios. I think this change needs more work. Can you have a look: https://review.openstack.org/#/c/321026/09:31
*** arxcruz has joined #openstack-infra09:35
yolandastrigazi, sure09:36
*** jlanoux has joined #openstack-infra09:36
strigaziyolanda: thanks09:36
*** salv-orl_ has joined #openstack-infra09:38
*** Guest98278 has quit IRC09:38
openstackgerritMerged openstack-infra/project-config: Add Non-voting job for nodepool py34  https://review.openstack.org/32188509:38
*** gomarivera has joined #openstack-infra09:39
*** Hal has joined #openstack-infra09:40
*** salv-orlando has quit IRC09:40
*** Hal is now known as Guest5378909:40
*** jyuso1 has quit IRC09:41
yolandastrigazi, initially looks good. I see you miss the rally-plot publisher, you don't need it?09:41
*** nadya has joined #openstack-infra09:41
strigaziyolanda: we do, thanks09:42
strigaziyolanda: new patch coming09:43
*** mpaolino has joined #openstack-infra09:43
*** gomarivera has quit IRC09:44
strigaziyolanda: Isn't it included at line 934? https://review.openstack.org/#/c/321026/4/jenkins/jobs/rally.yaml09:45
yolandaoh, last line was cut on my screen!!!09:46
* yolanda switched from computers yesterday09:46
*** Qiming has joined #openstack-infra09:46
strigazi:)09:47
yolandathe change looks good, my screen doesn't :)09:47
strigaziyolanda: thanks09:50
jheskethwznoinsk: so it'd be a little hacky, but I think you should be able to do what you want with some reloads... You could configure a job that will never run with every project as its only job (it'll need to be registered with gearman, but you can do that via telnet or a simple gear client). Then play with your jenkins configuration/jobs however you want, and when you're ready configure the layout.yaml to have all the jobs09:50
jheskethagain and reload09:50
jheskethwznoinsk: zuul will correct the jobs that should be run for a change that is still in the pipeline. So if you've added or taken jobs while it is there it will figure out what to do09:51
jheskethif that makes sense09:51
odyssey4meyolanda is there anyone available to add another review to yours for my stream of patches? https://review.openstack.org/#/q/owner:jesse-pretorius+status:open+project:openstack-infra/project-config09:51
yolandaodyssey4me, any infra core or project-config core could help with that09:52
odyssey4meyolanda unfortunately it seems that everyone's been busy with sprints, so even though I've been asking no-one's managed to get to them09:53
yolandajhesketh seems to be around... or ping pabelanger in a few hours09:53
*** permalac has joined #openstack-infra09:54
*** permalac has quit IRC09:54
jheskethodyssey4me: I can take a look09:54
odyssey4methanks jhesketh09:54
*** permalac has joined #openstack-infra09:54
*** amitgandhinz has joined #openstack-infra09:54
*** jianghuaw has joined #openstack-infra09:55
*** permalac has quit IRC09:55
*** permalac has joined #openstack-infra09:55
jianghuawHi, has anyone met this failure with nodepool image-update: oslo_db.exception.DBConnectionError: (pymysql.err.OperationalError) (2003, "Can't connect to MySQL server on 'logstash.openstack.org' ([Errno 101] Network is unreachable)")09:56
wznoinskjhesketh I had the same idea, hence I pointed my jobs to ubuntu-trusty-dummy instead of ubuntu-trusty (in jjb projects.yaml), only then to learn the jobs don't register when they don't have a slave available during the registration attempt (and I did restart zuul in the meantime)... if I had not restarted zuul, and the jobs registered at the gearman server were still under the old names jobA:ubuntu-trusty, I guess I should be fine? (jobs will not run09:56
wznoinsk - no slave available, but it's fixable by updating jjb projects.yaml -> jenkins slave info) ?09:56
jianghuawmy nodepool ran well until some time earlier today. now it always fails with this error.09:57
*** javeriak has joined #openstack-infra09:58
wznoinskjianghuaw check 'ip -r' for default gateway09:58
wznoinsk'ip r' even09:59
*** ociuhandu has quit IRC09:59
*** amitgandhinz has quit IRC09:59
jheskethjianghuaw: logstash may have changed ip's..09:59
openstackgerritJesse Pretorius (odyssey4me) proposed openstack-infra/project-config: Rename openstack-ansible-ironic to openstack-ansible-os_ironic  https://review.openstack.org/29919209:59
jheskethwznoinsk: I didn't fully follow sorry... It depends what is in your layout... so long as the jobs in layout.yaml are registered with the gearman server at some point then you should be fine10:00
jianghuawoslo_db.exception.DBConnectionError: (pymysql.err.OperationalError) (2003, "Can't connect to MySQL server on 'logstash.openstack.org' ([Errno 101] Network is unreachable)")10:01
jianghuawwznoinsk10:01
jianghuaw@wznoinsk: gateway has no change and it's ok to reach other internal address.10:01
*** sdague has joined #openstack-infra10:01
jianghuawwznoinsk: there is no change with the route and it can reach other internal address.10:02
openstackgerritMerged openstack-infra/project-config: Add api-ref job for Zaqar  https://review.openstack.org/32132410:02
*** redixin has joined #openstack-infra10:03
redixinhiyo. Tell me please where I can find some info about openstack proposal bot? fungi jeblair SergeyLukjanov10:04
wznoinskjhesketh: yeah I think I see the full picture, I don't want to change layout unless I have to (I want to keep my current production layout) so I'll test only pointing jobs to a nonexistent node in jjb/jenkins, hence jobs will be waiting in zuul for jenkins with the proper slave for the job, allowing me to troubleshoot; then I'll change project.yaml and the node used for my jobs to the real slave label and zuul should kick off again, is my10:04
wznoinskthinking correct?10:04
openstackgerritMerged openstack-infra/project-config: Add CloudKitty role to OpenStack-Ansible  https://review.openstack.org/31883610:07
*** zhurong has quit IRC10:07
wznoinskjianghuaw: to check jhesketh's suggestion go to https://toolbox.googleapps.com/apps/dig/#A/logstash.openstack.org and compare it with the output of 'host logstash.openstack.org' on the machine with the problem10:07
jianghuawwznoinsk: thanks. I will try.10:08
wznoinskyou may have the domain resolving to a diff/old ip10:08
*** thorst_ has joined #openstack-infra10:09
jianghuawon the node, I can successfully ping to logstash.openstack.org.10:09
*** javeriak has quit IRC10:10
wznoinskcan you connect to the mysql on it ?10:10
*** ilyashakhat has quit IRC10:10
*** javeriak has joined #openstack-infra10:10
*** _degorenko|afk is now known as degorenko10:11
*** yuanying has joined #openstack-infra10:11
openstackgerritMateusz Matuszkowiak proposed openstack-infra/project-config: Added new repo for fuel-plugin-datera-cinder  https://review.openstack.org/31565110:11
jheskethwznoinsk: that's an interesting question... so I think zuul will only request the job, not a specific node (unless you name the jobs in zuul with a :node-type suffix). So it's the specific node that is registering with gearman as able to do the job.10:11
jianghuawI've restart the image-building, I will check it after the VM's up.10:11
jheskethwznoinsk: so as long as another node previously registered with gearman, I think it'd be okay... but I'm not sure10:11
*** oanson has quit IRC10:12
wznoinskjhesketh: I haven't checked the code but I think zuul will only send to a specific jenkins only if it sees nodepool providing a properly labeled slave to that jenkins10:13
openstackgerritMerged openstack-infra/project-config: Retire openstack-ansible-py_from_git repository  https://review.openstack.org/31932210:14
*** lezbar has quit IRC10:16
jheskethwznoinsk: I don't think that's the case, but I may be wrong...10:17
*** ociuhandu has joined #openstack-infra10:17
jheskethodyssey4me: I've reviewed your patches and there are a couple requiring feedback10:17
*** Na3iL has joined #openstack-infra10:17
odyssey4methanks jhesketh - resolving the resulting merge conflicts and updating the patches10:17
*** thorst_ has quit IRC10:18
openstackgerritKirill Bespalov proposed openstack-infra/project-config: add reno jobs for oslo projects  https://review.openstack.org/32090410:18
openstackgerritKirill Bespalov proposed openstack-infra/project-config: add reno jobs for oslo projects  https://review.openstack.org/32090410:19
openstackgerritKirill Bespalov proposed openstack-infra/project-config: add reno jobs for oslo projects  https://review.openstack.org/32090410:19
wznoinskjhesketh: that was my understanding of gearman making use of multiple jenkins masters available, it sends the job to the jenkins that has everything needed to run the job10:20
*** ilyashakhat has joined #openstack-infra10:20
jheskethI'd have to look at the code sorry10:21
jheskethand haven't got time right now :-(10:21
*** jaosorior_lunch is now known as jaosorior10:22
wznoinskjhesketh what are the time-wise plans regarding the zuul ansible?10:23
openstackgerritJesse Pretorius (odyssey4me) proposed openstack-infra/project-config: Add release announcement jobs for major OSA repos  https://review.openstack.org/31923410:23
jheskethwznoinsk: as soon as it's ready most likely10:23
odyssey4mejhesketh are my changes to https://review.openstack.org/319234 what you meant?10:23
jheskethjeblair has been working really hard on it and made great progress... I'd say it's very close.. hopefully next week, but no idea :-)10:24
wznoinskjhesketh ok, I'll touch base with him later then, thanks10:24
*** sarob has joined #openstack-infra10:25
jheskethodyssey4me: yep, thanks10:26
wznoinskjhesketh btw. I think stopping zuul-merger would have a similar effect to the solution we've discussed above, zuul will wait for merge to complete before it sends the job to jenkins...10:26
jianghuawwznoinsk: yes. connection to mysql on logstash failed: ERROR 2003 (HY000): Can't connect to MySQL server on 'logstash.openstack.org' (101)10:27
jianghuawmaybe the mysql service failed on it?10:27
jheskethwznoinsk: oh yeah, that's probably a much easier solution10:27
wznoinskjianghuaw it looks like it says network unreachable again, if the mysql service were down you'd most likely get connection refused10:29
*** javeriak has quit IRC10:29
*** yamamot__ has quit IRC10:29
jianghuawwznoinsk: but same error code: 101.10:30
*** sarob has quit IRC10:30
openstackgerritJesse Pretorius (odyssey4me) proposed openstack-infra/project-config: Add Sahara role to OpenStack-Ansible  https://review.openstack.org/31793110:30
odyssey4methanks for picking up on that error jhesketh ^ updated10:30
wznoinskjianghuaw: can you telnet logstash.openstack.org 3306 ?10:30
jheskethno worries10:31
wznoinskbtw. are you running the nodepool in some sort of networking isolation? namespace/(docker) container etc?10:32
jianghuawtelnet logstash.openstack.org 330610:32
jianghuawTrying 23.253.230.235...10:32
jianghuawTrying 2001:4800:7817:103:be76:4eff:fe05:f1cc...10:32
jianghuawtelnet: Unable to connect to remote host: Network is unreachable10:32
rcarrillocruzyolanda: mind doing a quick review for https://review.openstack.org/#/c/322053/2/tasks/create_clouds_resources.yml ?10:33
wznoinskthere you go, solve this problem and you're good then ;-) try traceroute maybe http://www.howtogeek.com/134132/how-to-use-traceroute-to-identify-network-problems/10:33
rcarrillocruztrivial, but as it's 'largish' i rather get another core +210:33
yolandasure10:33
rcarrillocruzthx10:34
*** markvoelker has joined #openstack-infra10:34
jianghuawwznoinsk: No, nodepool run in a VM from the RAX cloud.10:34
*** lezbar has joined #openstack-infra10:34
wznoinskI'm surprised you were able to ping that ip tho10:35
jianghuawwznoinsk: but it does work.10:37
jianghuawping  logstash.openstack.org10:37
jianghuawPING logstash.openstack.org (23.253.230.235) 56(84) bytes of data.10:37
jianghuaw64 bytes from logstash.openstack.org (23.253.230.235): icmp_seq=1 ttl=48 time=172 ms10:37
wznoinskok, maybe try disabling ipv6 if you don't use it i.e.: 'sysctl -w net.ipv6.conf.all.disable_ipv6=1'10:38
*** markvoelker has quit IRC10:39
*** kien-ha has joined #openstack-infra10:39
*** mpaolino has quit IRC10:39
jianghuawwznoinsk: got the same error10:40
vponomaryovHello everyone, I need to add "debootstrap" package to ubuntu-trusty, where is correct place to do it?10:41
*** javeriak has joined #openstack-infra10:41
odyssey4mevponomaryov you make use of the file other-requirements.txt in your own repository10:43
wznoinskjianghuaw telnet 23.253.230.235 3306 ?10:43
odyssey4mevponomaryov take a look at http://docs.openstack.org/infra/bindep/ for how it works10:44
vponomaryovodyssey4me:  other-requirements.txt intended to install system packages?10:44
vponomaryovodyssey4me: reading doc, thanks10:45
jianghuawwznoinsk: interesting... I can reach via the ip. - error is "Connection refused".10:46
*** markusry has quit IRC10:46
wznoinskjianghuaw: or use the port your nodepool has configured for mysql on logstash.openstack.org instead of 3306... it looks like the network unreachable error comes back from the ipv6 connection attempt, and it tries ipv6 because the first attempt on ipv4 fails for some reason; it fails for me with refused, hence the mysql service is either down or we try to connect to the wrong port10:46
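The diagnosis above (the IPv4 address answering with "connection refused" while the IPv6 attempt reports "network is unreachable") can be checked one address at a time. A minimal Python sketch, assuming only the host name and port mentioned in the discussion; the probe helper is a hypothetical name used for illustration and is not part of nodepool or any tool named in this log:

```python
import socket


def probe(host, port, timeout=5.0):
    """Try a TCP connect to every address the name resolves to.

    Returns a list of (family_name, address, outcome) tuples, so that an
    IPv4 "connection refused" is not hidden behind a later IPv6
    "network is unreachable".
    """
    results = []
    for family, socktype, proto, _canon, sockaddr in socket.getaddrinfo(
            host, port, type=socket.SOCK_STREAM):
        name = "IPv6" if family == socket.AF_INET6 else "IPv4"
        sock = socket.socket(family, socktype, proto)
        sock.settimeout(timeout)
        try:
            sock.connect(sockaddr)
            outcome = "ok"
        except OSError as exc:
            # Keep the exception class so refused/unreachable/timeout differ.
            outcome = "%s: %s" % (type(exc).__name__, exc)
        finally:
            sock.close()
        results.append((name, sockaddr[0], outcome))
    return results


# Example call (needs network access, so it is left commented out):
# for entry in probe("logstash.openstack.org", 3306):
#     print(entry)
```

Clients that walk the resolved addresses in order usually surface only the last error, which is why the per-address view is the useful part here.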
sdagueyolanda / jhesketh - either of you want to help me land enforcing unit tests - https://review.openstack.org/#/c/321176/ ?10:47
wznoinskjianghuaw or logstash.openstack.org does not accept connections from outside world to their mysql?10:47
*** openstackgerrit has quit IRC10:47
*** openstackgerrit has joined #openstack-infra10:48
*** esikachev has quit IRC10:48
yolandasure10:48
odyssey4mevponomaryov other-requirements.txt is intended to record binary dependencies for your project, and jenkins will install them all on the node prior to executing your job10:50
*** abregman has joined #openstack-infra10:50
vponomaryovodyssey4me: I assumed exactly this after reading doc, thank you very much!10:50
odyssey4mevponomaryov note that if you currently do not have an other-requirements.txt file then your jobs will be using the fallback deps, so you may find that once you populate the file you'll need to add a few more that you never needed to do before10:50
jianghuawwznoinsk: Thanks. I think the problem is on the logstash.openstack.org which is out of my control. Will see if it will recover sometime later.10:51
odyssey4mejhesketh feedback in https://review.openstack.org/31938110:51
*** javeriak has quit IRC10:52
wznoinskjianghuaw it may be a planned change of configuration to disallow these connections (but double check the port you should be using whether it's 3306 or a different one), it may be 'just a failure' too10:53
*** abregman has quit IRC10:53
*** abregman has joined #openstack-infra10:54
openstackgerritDerek Higgins proposed openstack-infra/tripleo-ci: Only pre install packages for master jobs  https://review.openstack.org/32207310:55
*** amitgandhinz has joined #openstack-infra10:55
*** johnchalekson has joined #openstack-infra10:57
*** javeriak has joined #openstack-infra10:59
*** amitgandhinz has quit IRC11:00
openstackgerritMerged openstack-infra/project-config: add python 2.7 tests to os-api-ref  https://review.openstack.org/32117611:01
*** maishsk_ has joined #openstack-infra11:01
*** maishsk has quit IRC11:02
*** maishsk_ is now known as maishsk11:02
*** lezbar__ has joined #openstack-infra11:03
*** lezbar has quit IRC11:04
openstackgerritRicardo Carrillo Cruz proposed openstack-infra/project-config: Enable check/gate tests for ansible-role-cloud-launcher project  https://review.openstack.org/32208111:05
rcarrillocruzyolanda: does ^ look good?11:07
*** johnchalekson has quit IRC11:07
*** kien-ha has quit IRC11:08
*** johnchalekson has joined #openstack-infra11:11
*** _amrith_ is now known as amrith11:13
yolandalet me see11:15
*** thorst_ has joined #openstack-infra11:16
yolandarcarrillocruz, do you need documentation?11:17
*** rfolco has joined #openstack-infra11:18
yolandawondering if docs-on-rtfd is needed11:18
openstackgerritJesse Pretorius (odyssey4me) proposed openstack-infra/project-config: Retire openstack-ansible-py_from_git repository  https://review.openstack.org/31933511:18
*** yamamoto has joined #openstack-infra11:20
*** ddieterly has joined #openstack-infra11:22
*** Na3iL has quit IRC11:23
rcarrillocruznice to have, other roles docs are also published11:23
*** ihrachys has joined #openstack-infra11:24
*** DevBox has joined #openstack-infra11:24
*** EricGonczer_ has joined #openstack-infra11:25
*** thorst_ has quit IRC11:25
*** yamamoto has quit IRC11:26
odyssey4meyolanda please review https://review.openstack.org/317931 when you have a moment11:28
*** lucasagomes is now known as lucas-hungry11:29
openstackgerritMerged openstack-infra/project-config: Add Ops repo to OpenStack-Ansible  https://review.openstack.org/31938111:30
openstackgerrityolanda.robla proposed openstack-infra/shade: Use keystoneauth1.betamax for shade mocks  https://review.openstack.org/29864711:32
*** johnchalekson has quit IRC11:32
*** EricGonczer_ has quit IRC11:33
*** ldnunes has joined #openstack-infra11:33
*** johnchalekson has joined #openstack-infra11:34
*** EricGonczer_ has joined #openstack-infra11:34
*** kzaitsev_mb has joined #openstack-infra11:36
*** Kennan has quit IRC11:36
*** johnchalekson has quit IRC11:36
*** dave-mcnally has quit IRC11:37
*** johnchalekson has joined #openstack-infra11:38
*** Kennan has joined #openstack-infra11:39
yolandaodyssey4me, approved11:40
odyssey4methanks yolanda11:40
*** johnchalekson has quit IRC11:41
*** bhavik has quit IRC11:41
*** kzaitsev_mb has quit IRC11:41
*** johnchalekson has joined #openstack-infra11:41
*** johnchalekson has quit IRC11:42
odyssey4meyolanda jhesketh https://review.openstack.org/319335 is now ready for workflow when you have a moment11:42
*** thorst_ has joined #openstack-infra11:42
*** johnchalekson has joined #openstack-infra11:42
*** johnchalekson has quit IRC11:43
*** johnchalekson has joined #openstack-infra11:43
yolandaapproved11:44
*** dizquierdo has quit IRC11:44
odyssey4methanks yolanda11:45
*** rhallisey has joined #openstack-infra11:46
*** openstackgerrit has quit IRC11:47
*** openstackgerrit has joined #openstack-infra11:48
*** ddieterly is now known as ddieterly[away]11:48
*** ilyashakhat has quit IRC11:49
*** daemontool has quit IRC11:51
*** daemontool has joined #openstack-infra11:51
openstackgerritMerged openstack-infra/project-config: Add Sahara role to OpenStack-Ansible  https://review.openstack.org/31793111:53
*** aysyd has joined #openstack-infra11:54
*** kzaitsev_mb has joined #openstack-infra11:56
*** amitgandhinz has joined #openstack-infra11:56
*** jaosorior has quit IRC12:01
*** amitgandhinz has quit IRC12:01
*** jaosorior has joined #openstack-infra12:01
*** yamamoto has joined #openstack-infra12:01
*** psilvad has joined #openstack-infra12:04
*** esikachev has joined #openstack-infra12:04
*** yolanda has quit IRC12:04
openstackgerritEmilien Macchi proposed openstack-infra/project-config: zuul/layout: add puppet-unit 4.5 jobs  https://review.openstack.org/32212412:04
*** yamamoto has quit IRC12:06
*** yolanda has joined #openstack-infra12:06
*** ilyashakhat has joined #openstack-infra12:07
*** daemontool has quit IRC12:08
*** markvoelker has joined #openstack-infra12:08
*** amrith is now known as _amrith_12:08
*** daemontool has joined #openstack-infra12:08
*** maishsk_ has joined #openstack-infra12:09
*** maishsk has quit IRC12:09
*** maishsk_ is now known as maishsk12:09
*** vgridnev has joined #openstack-infra12:09
*** ddieterly[away] is now known as ddieterly12:12
*** exploreshaifali has joined #openstack-infra12:13
*** salv-orl_ has quit IRC12:13
*** salv-orlando has joined #openstack-infra12:16
EmilienMhello infra, can we get a review on https://review.openstack.org/#/c/322124/ please?12:16
odyssey4mejhesketh yolanda the regex used for the jobs - is that bash, python, perl, ??12:17
*** deadnull_ has joined #openstack-infra12:17
*** daemontool has quit IRC12:18
*** daemontool has joined #openstack-infra12:18
*** sarob has joined #openstack-infra12:19
*** banix has joined #openstack-infra12:20
*** dmellado is now known as dmellado|lunch12:20
*** dmellado|lunch is now known as dmellado12:20
*** trown|outtypewww is now known as trown12:23
haypohi. gate-tempest-dsvm-full failed on my tiny patch for nova http://logs.openstack.org/40/322040/1/check/gate-tempest-dsvm-full/6bdad07/console.html : "devstack-gate/devstack-vm-gate.sh: No such file or directory"12:23
*** sarob has quit IRC12:24
haypois someone aware of this issue? i see also "/tmp/ansible/bin/ansible: No such file or directory" error and ".../logs/reproduce.sh: No such file or directory" error, no idea if it's related12:24
*** markusry has joined #openstack-infra12:25
haypohum, it looks like many files are missing12:25
*** lucas-hungry is now known as lucasagomes12:28
openstackgerritJesse Pretorius (odyssey4me) proposed openstack-infra/project-config: Skip OSA CentOS-7/Xenial role jobs for Liberty/Mitaka  https://review.openstack.org/32213512:29
*** kgiusti has joined #openstack-infra12:31
*** daemontool has quit IRC12:31
*** daemontool has joined #openstack-infra12:32
*** dprince has joined #openstack-infra12:33
*** rodrigods has quit IRC12:34
*** rodrigods has joined #openstack-infra12:34
*** johnchalekson has quit IRC12:34
*** zhurong has joined #openstack-infra12:37
wznoinskhaypo something's completely wrong with that build, try recheck12:37
haypowznoinsk: what about debugging random bugs? :-/12:38
wznoinskit looks like network issue, the VMs are already gone I suppose so troubleshooting would be done on log files only12:39
haypowznoinsk: which log files? are you able to troubleshoot this issue?12:40
haypowhy do we get network issues?12:40
*** yolanda has quit IRC12:41
wznoinskit could be rax related network issue - in your logs: Retrying (Retry(total=4, connect=None, read=None, redirect=None)) after connection broken by 'ReadTimeoutError("HTTPConnectionPool(host='mirror.dfw.rax.openstack.org', port=80): Read timed out.12:41
wznoinskwhereas there's no issue in the other job, which ran on ovh:12:42
*** daemontool has quit IRC12:42
wznoinskhttp://logs.openstack.org/40/322040/1/check/gate-tempest-dsvm-neutron-full/605b6d7/console.html12:42
*** daemontool has joined #openstack-infra12:42
wznoinskDownloading http://mirror.bhs1.ovh.openstack.org/pypi/packages/25/90/a0baec87a353c4c5418ecc974d6cc3663d4404f367ea890f0f25ba968a83/paramiko-1.16.0-py2.py3-none-any.whl (169kB)12:42
*** doug-fish has joined #openstack-infra12:43
jordanPyes network issue, it happens, you should just recheck12:43
jordanPthere's not a lot you can do, network issues happen and will happen again12:43
*** amoralej is now known as amoralej|lunch12:44
*** nwkarsten has joined #openstack-infra12:44
*** banix has quit IRC12:44
wznoinskis logstash broken or am I doing something wrong? http://logstash.openstack.org/12:44
openstackgerritValeriy Ponomaryov proposed openstack-infra/project-config: Add install-distro-packages template to manila-image-elements jobs  https://review.openstack.org/32214312:44
rcarrillocruzwznoinsk: logstash has been in migration process since yesterday12:45
rcarrillocruznot sure if it's done yet12:45
rcarrillocruzclarkb and pabelanger were working on that yesterday12:45
wznoinskok, cheers12:45
*** tlian has joined #openstack-infra12:46
*** zhurong has quit IRC12:46
*** openstackgerrit has quit IRC12:48
*** yolanda has joined #openstack-infra12:48
*** openstackgerrit has joined #openstack-infra12:48
*** zhurong has joined #openstack-infra12:48
*** nwkarsten has quit IRC12:48
openstackgerritMerged openstack-infra/release-tools: update README for the script to expire old bug reports  https://review.openstack.org/32201912:49
*** edmondsw has joined #openstack-infra12:49
openstackgerritIvan Kolodyazhny proposed openstack-infra/devstack-gate: Add python-brick-cinderclient-ext workspace setup  https://review.openstack.org/32184512:50
*** pilgrimstack has quit IRC12:50
*** yamamoto has joined #openstack-infra12:51
openstackgerrityolanda.robla proposed openstack-infra/shade: Use keystoneauth1.betamax for shade mocks  https://review.openstack.org/29864712:51
*** pilgrimstack has joined #openstack-infra12:51
*** coreyob has joined #openstack-infra12:51
*** abregman has quit IRC12:51
wznoinskjianghuaw ^ see above12:53
*** zhurong has quit IRC12:54
openstackgerritEmilien Macchi proposed openstack-infra/project-config: Notify #puppet-openstack with puppet-ceph/stable changes  https://review.openstack.org/32215112:54
*** baoli has joined #openstack-infra12:54
*** zhurong has joined #openstack-infra12:55
*** baoli_ has joined #openstack-infra12:56
*** amitgandhinz has joined #openstack-infra12:57
pabelangerwznoinsk: rcarrillocruz: clarkb: I have restarted jenkins-log-client on logstash.o.o, I believe things will work better now12:59
*** zhurong has quit IRC12:59
fungijhesketh: i guess we ended up needing a zuul restart eventually too? any other persistent impact? do the network issues seem to have subsided?12:59
*** baoli has quit IRC12:59
jheskethfungi: they seem to have subsided... no other issues that I've noticed13:00
*** piet has joined #openstack-infra13:00
jhesketh(had to clean up a few nodepool nodes to get jobs to re-register so they'd be picked up in demand calcs)13:00
fungimakes sense13:00
*** |-paul-| has joined #openstack-infra13:01
fungilooks like rackspace has taken the incident off their status page entirely, so no mention of the resolution time13:01
pabelangerjhesketh: fungi Ya, it looks like we have leaked a lot of ready nodes in nodepool13:01
jheskethfungi: if you missed it, we did lose a bunch of state including ~100 results and ~4000 events13:01
pabelangerI can clean them up if needed13:01
jheskethhopefully people saw the notice and are rechecking13:01
jheskethpabelanger: yeah I didn't clean them all up so if you want to figure out what are stale that might be handy13:02
fungijhesketh: yep, not much we can do about that i guess13:02
*** amitgandhinz has quit IRC13:02
*** matt-borland has joined #openstack-infra13:02
redixinHi all. Does anybody know where the sources of openstack proposal bot are?13:02
redixinfungi: ^13:02
pabelangerjhesketh: sure, let me do that now13:02
fungiredixin: it's not a piece of software, it's just a colloquial name for a bunch of different ci jobs that use a common gerrit account to propose changes for review13:03
*** burgerk has joined #openstack-infra13:03
fungiredixin: so you'll need to be more specific about what you're looking for13:04
*** ddieterly has quit IRC13:04
redixinfungi: i'm trying to make something similar13:05
*** _ari_ has joined #openstack-infra13:05
*** zhurong has joined #openstack-infra13:05
openstackgerritEmilien Macchi proposed openstack-infra/project-config: puppet: move puppet4 jobs into check pipeline  https://review.openstack.org/32183713:05
fungiredixin: the rule of thumb is that most of the time having something propose git commits for review containing autogenerated content is a terrible idea13:06
rcarrillocruzpabelanger: added tests for the launcher, https://review.openstack.org/#/c/322081/ enables them13:06
fungiredixin: we rely on it in a few cases where there is no other option, and even for those we're constantly looking for alternatives so we can stop13:07
*** electrofelix has quit IRC13:07
pabelangerrcarrillocruz: nice!13:07
*** maishsk has quit IRC13:07
fungiredixin: basically adding generated content in a revision control system runs counter to expectations and is better served through some other means of publication13:08
*** bhavik has joined #openstack-infra13:08
*** nwkarsten has joined #openstack-infra13:08
rcarrillocruzthx13:09
*** pilgrimstack has quit IRC13:09
redixinfungi: the second option is to -1 all patches if we have a buggy/vulnerable dependency in requirements.txt13:09
*** hichihar_ has quit IRC13:10
fungiredixin: buggy/vulnerable in ways which don't impact testing?13:10
openstackgerritRodrigo Duarte proposed openstack-infra/project-config: Make keystone functional tests job voting  https://review.openstack.org/32189013:11
fungiredixin: there's been consensus for a long time that our community isn't going to use our coordinated requirements list to communicate security vulnerabilities in our dependencies, if that's what you're suggesting13:11
redixinfungi: it may help to save some time. to have proposed change instead of looking for a problem with new release of whatever-pythonclient13:11
*** _amrith_ is now known as amrith13:11
fungiredixin: to what repo are you considering proposing these updates, and what would trigger that?13:12
redixinfungi: I mean we can have whatever-pythonclient===1.1.1 (known good version) instead of whatever-pythonclient<=1.1.1 (1.1.2 may be broken)13:13
redixin(in requirements.txt)13:13
fungiredixin: that's what upper-constraints.txt is meant to achieve13:13
fungiredixin: how does it not fill the need you're seeing?13:14
redixinfungi: so we can just have upper-constraints instead of requirements.txt?13:15
*** |-paul-| has quit IRC13:15
fungiredixin: no, they're separate mechanisms13:16
*** alaski is now known as lascii13:16
redixinfungi: hmm ill try to google about using upper constraints. thanks a lot13:17
fungiredixin: http://git.openstack.org/cgit/openstack/requirements/tree/README.rst13:18
fungiit's pretty thoroughly documented there13:18
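The distinction drawn above is that requirements.txt states acceptable ranges while upper-constraints.txt pins one known-good version per package; pip can apply such a file with its -c/--constraint option (pip install -c upper-constraints.txt -r requirements.txt). A minimal Python sketch of reading 'name===version' pins, assuming that simple line format; parse_constraints is a hypothetical helper, not part of the requirements repository's tooling, and the package name is the placeholder used in the conversation:

```python
def parse_constraints(text):
    """Map package name -> exact pinned version from 'name===version' lines."""
    pins = {}
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()   # drop comments and blank lines
        if not line or "===" not in line:
            continue
        spec = line.split(";", 1)[0]          # ignore any environment marker
        name, version = spec.split("===", 1)
        pins[name.strip().lower()] = version.strip()
    return pins


# Illustrative input only; the name is the placeholder from the discussion.
sample = "whatever-pythonclient===1.1.1\n"
pins = parse_constraints(sample)
```

With the pins in a dictionary, a job could report which version a given requirement would resolve to under the constraints, without touching requirements.txt itself.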
redixinok thanks13:19
*** nwkarsten has quit IRC13:20
*** _vs has joined #openstack-infra13:20
*** akshai has joined #openstack-infra13:20
fungiredixin: if you're considering altering/augmenting that, i recommend talking to the requirements team in #openstack-requirements or in their weekly meeting http://eavesdrop.openstack.org/#Requirements_Team_Meeting13:21
*** Na3iL has joined #openstack-infra13:21
*** ayoung has joined #openstack-infra13:22
*** akshai has quit IRC13:25
*** xyang1 has joined #openstack-infra13:26
*** asettle has joined #openstack-infra13:27
*** markusry has quit IRC13:27
*** ddieterly has joined #openstack-infra13:29
*** akshai has joined #openstack-infra13:30
pabelangerinfra-root: I have restarted nodepoold, jenkins02 and jenkins06 were not responding to zmq13:30
pabelangerin the process of shutting each down to clean up stale nodes13:30
*** rbradf_not_found is now known as rbradfor13:30
openstackgerritEmilien Macchi proposed openstack-infra/project-config: puppet: move puppet4 jobs into check pipeline  https://review.openstack.org/32183713:30
openstackgerritEmilien Macchi proposed openstack-infra/project-config: puppet: move xenial integrations jobs into gate  https://review.openstack.org/32217713:30
openstackgerritMerged openstack-infra/project-config: Enable check/gate tests for ansible-role-cloud-launcher project  https://review.openstack.org/32208113:31
*** amitgandhinz has joined #openstack-infra13:31
pabelanger#status log nodepoold restarted to address zmq issue with jenkins02 and jenkins0613:32
openstackstatuspabelanger: finished logging13:32
*** amitgandhinz has quit IRC13:32
openstackgerritEmilien Macchi proposed openstack-infra/project-config: puppet: move xenial integrations jobs into gate  https://review.openstack.org/32217713:32
*** whoops has quit IRC13:33
*** amitgandhinz has joined #openstack-infra13:33
*** bknudson has left #openstack-infra13:33
*** markusry has joined #openstack-infra13:35
*** _vs has quit IRC13:35
*** bknudson has joined #openstack-infra13:36
*** _vs has joined #openstack-infra13:37
wznoinskit seems I'm affected by rax outage, http://intel-openstack-ci-logs.ovh/32/321932/1/check/tempest-dsvm-intel-nfv/2db9baf/logs/devstacklog.txt.gz, I can't find how the rax.openstack.org is set as pypi index-url and where... could someone have a look and try to help me?13:38
*** whoops has joined #openstack-infra13:39
*** ayoung has quit IRC13:40
fungipabelanger: thanks, the outage in dfw likely dropped the zmq connections in a less-than-graceful manner13:40
pabelangerfungi: Ya, jenkins02 is still struggling to come back online13:40
pabelangerjust growing ready nodes13:41
*** gomarivera has joined #openstack-infra13:41
*** asettle has quit IRC13:42
*** exploreshaifali has quit IRC13:42
*** redixin has quit IRC13:42
pabelangergoing to take it out of service again, give things a moment to settle before starting it again13:42
fungiwznoinsk: are you maybe installing an unmodified copy of http://git.openstack.org/cgit/openstack-infra/system-config/tree/modules/openstack_project/files/pydistutils.cfg13:42
*** Volundr has joined #openstack-infra13:43
wznoinskfungi quite possible but I see the same problem (trying to use rax as index-url) on a nodepool node that hasn't been 'built' yet, i'm logged in to one in 'ready' state13:44
*** whoops has quit IRC13:44
wznoinskI see pydistutils installs during setup_host but I see it before ... ^, checking elements...13:44
fungiwznoinsk: right, it's probably getting installed by puppet during your image builds13:45
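When tracing which PyPI index a node will use, it can help to read the value straight out of the pydistutils.cfg in question. The following is a minimal Python sketch; the sample file contents and the mirror.example.org URL are illustrative assumptions, not a copy of the system-config file linked above.

```python
import configparser

# Hypothetical file contents for illustration only; the real file may differ.
SAMPLE = """\
[easy_install]
index_url = http://mirror.example.org/pypi/simple
"""


def configured_index(cfg_text):
    """Return the easy_install index URL from pydistutils.cfg-style text, or None."""
    parser = configparser.ConfigParser()
    parser.read_string(cfg_text)
    if not parser.has_section("easy_install"):
        return None
    section = parser["easy_install"]
    # distutils accepts either spelling of the option name.
    return section.get("index_url") or section.get("index-url")


if __name__ == "__main__":
    print(configured_index(SAMPLE))
```

Running the same function over the file in the job user's home directory would show whether an upstream mirror is still configured there.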
*** nwkarsten has joined #openstack-infra13:45
*** javeriak has quit IRC13:46
*** gomarivera has quit IRC13:46
*** Julien-zte has joined #openstack-infra13:46
*** deadnull_ has quit IRC13:46
*** ilyashakhat has quit IRC13:47
*** Goneri has joined #openstack-infra13:47
*** zzzeek has joined #openstack-infra13:48
*** _vs has quit IRC13:48
openstackgerritTristan Cacqueray proposed openstack-infra/nodepool: Make nodepool cmd use logfile  https://review.openstack.org/32218713:48
*** daemontool has quit IRC13:48
*** zzzeek has quit IRC13:49
*** zzzeek has joined #openstack-infra13:49
*** daemontool has joined #openstack-infra13:49
*** dansmith is now known as superdan13:50
*** _vsaienko has joined #openstack-infra13:53
*** zz_dimtruck is now known as dimtruck13:53
*** markusry has quit IRC13:53
*** piet has quit IRC13:55
*** ilyashakhat has joined #openstack-infra13:55
*** itisha has joined #openstack-infra13:55
*** rbrndt has joined #openstack-infra13:56
*** akshai has quit IRC13:56
*** ilyashakhat has quit IRC13:56
pabelangerokay, jenkins02.o.o back online13:56
pabelangerit was in some rough shape, multiple jenkins services running13:56
pabelangerdecided to reboot the server and bring it up fresh13:57
clarkbpabelanger: that can happen if you use service restart. You need to stop, check ps, kill, check ps again, start13:57
*** eezhova has quit IRC13:58
pabelangerclarkb: ack13:58
*** banix has joined #openstack-infra13:58
clarkbtheir init script is of not amazing quality13:58
clarkbin theory it should do that for you13:59
pabelangerI think nodepool.o.o is happy again13:59
rcarrillocruzyeah, jenkins 'restart' is legendary...13:59
openstackgerritMerged openstack-infra/project-config: Retire openstack-ansible-py_from_git repository  https://review.openstack.org/31933514:01
fungiright, i usually stop, wait, kill -1, wait, kill -7... after a bit longer it'll usually die though often need to do both parent and child processes14:02
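The stop, wait, then escalate pattern described above can be sketched in Python for a process the script itself started. This is only an illustration: it uses SIGTERM followed by SIGKILL (an assumption, not the exact signals quoted in the channel), it handles a single process rather than a parent and its children, and the function name is hypothetical.

```python
import subprocess
import sys


def stop_gracefully(proc, grace_seconds=5.0):
    """Ask a child process to exit, then force it if it is still alive.

    Returns the name of the last signal that had to be sent.
    """
    proc.terminate()  # polite request (SIGTERM on POSIX)
    try:
        proc.wait(timeout=grace_seconds)  # the "wait and check" step
        return "SIGTERM"
    except subprocess.TimeoutExpired:
        proc.kill()  # forced stop (SIGKILL on POSIX)
        proc.wait()
        return "SIGKILL"


if __name__ == "__main__":
    child = subprocess.Popen(
        [sys.executable, "-c", "import time; time.sleep(60)"]
    )
    print(stop_gracefully(child), child.returncode)
```

Only once the old process is confirmed gone is it safe to start the service again, which is the step a plain restart can skip.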
openstackgerritTristan Cacqueray proposed openstack-infra/nodepool: Make nodepool cmd use logfile  https://review.openstack.org/32218714:02
*** nadya has quit IRC14:02
*** johnthetubaguy_ has joined #openstack-infra14:02
pabelangerfungi: hopefully not for much longer.14:02
*** daemontool has quit IRC14:03
*** eharney has joined #openstack-infra14:03
*** piet has joined #openstack-infra14:03
*** daemontool has joined #openstack-infra14:04
*** ilyashakhat has joined #openstack-infra14:04
*** _vsaienko has quit IRC14:04
*** eezhova has joined #openstack-infra14:04
*** johnthetubaguy has quit IRC14:04
*** johnthetubaguy_ is now known as johnthetubaguy14:05
*** pilgrimstack has joined #openstack-infra14:05
*** nelsnelson has quit IRC14:05
*** nelsnelson has joined #openstack-infra14:05
fungipabelanger: indeed!14:06
*** xarses has quit IRC14:07
*** bhavik has quit IRC14:08
*** _vs has joined #openstack-infra14:08
openstackgerritEmilien Macchi proposed openstack-infra/project-config: zuul/layout: run puppet unit4 jobs on puppet-ceph again  https://review.openstack.org/32219014:08
*** markusry has joined #openstack-infra14:08
*** jamesmcarthur has joined #openstack-infra14:08
*** Ravikiran_K has joined #openstack-infra14:09
pabelangerthis is going to sound bad, but both OSIC and bluehost look pretty good ATM14:09
pabelangererr14:09
pabelangerbluebox14:09
*** esikachev has quit IRC14:09
*** akaszuba has joined #openstack-infra14:10
tdasilvahello, I have a question about pushing new releases to pypi. I followed the directions here: http://docs.openstack.org/infra/manual/creators.html#give-openstack-permission-to-publish-releases and have pushed a new release tag, but I don't see the new version in pypi14:10
tdasilvathis is the project: https://pypi.python.org/pypi/PyECLib14:10
wznoinskfungi: found it and fixed it in jenkin's home dir, I'll set it to my local pypi mirror soon14:11
wznoinskthanks14:11
*** eezhova has quit IRC14:12
fungitdasilva: let's track it down...14:13
tdasilvafungi: thanks!14:13
*** tonytan4ever has joined #openstack-infra14:13
fungitdasilva: this was the tag you pushed, presumably? http://git.openstack.org/cgit/openstack/pyeclib/tag/?h=v1.2.114:13
tdasilvafungi: I tried running git os-job v1.2.1 but that returned a page with "File Not Found"14:14
tdasilvayes14:14
*** eezhova has joined #openstack-infra14:14
fungitag sha is fc14225584037ee76d2bc611207f00d9ec17a33b so we should have logs at http://logs.openstack.org/fc/fc14225584037ee76d2bc611207f00d9ec17a33b/14:14
fungiand yes, that's a 40414:14
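The log location quoted above appears to be the tag's SHA filed under a directory named after its first two hex characters. A small Python sketch of that mapping follows; the layout is inferred from this single example, and the function name is hypothetical.

```python
def tag_log_url(sha, base="http://logs.openstack.org"):
    """Build the expected log URL for a 40-character tag SHA.

    Assumes a <base>/<first two hex chars>/<full sha>/ layout, inferred
    from the one URL quoted in the channel.
    """
    sha = sha.strip().lower()
    if len(sha) != 40 or any(c not in "0123456789abcdef" for c in sha):
        raise ValueError("expected a 40-character hex SHA, got %r" % sha)
    return "%s/%s/%s/" % (base, sha[:2], sha)


if __name__ == "__main__":
    print(tag_log_url("fc14225584037ee76d2bc611207f00d9ec17a33b"))
```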
fungii'll check zuul's debug log to see what happened between the tag push and the logs not uploading14:14
*** ddieterly is now known as ddieterly[away]14:16
fungithis'll take a sec. zuul makes big debug logs and it's rotated and compressed since you pushed that14:16
openstackgerritEmilien Macchi proposed openstack-infra/project-config: puppet: move puppet4 jobs into check pipeline  https://review.openstack.org/32183714:16
openstackgerritEmilien Macchi proposed openstack-infra/project-config: puppet: move xenial integrations jobs into gate  https://review.openstack.org/32217714:17
*** nwkarsten has quit IRC14:17
*** yamahata has joined #openstack-infra14:17
*** denisra has joined #openstack-infra14:17
fungitdasilva: oh! i see it14:17
fungitdasilva: your tag is not a valid pep-440 version number14:18
*** amoralej|lunch is now known as amoralej14:18
*** openstackgerrit has quit IRC14:18
*** openstackgerrit has joined #openstack-infra14:18
fungitdasilva: to enqueue into the release pipeline, you need to match this regex http://git.openstack.org/cgit/openstack-infra/project-config/tree/zuul/layout.yaml#n11814:18
*** nwkarsten has joined #openstack-infra14:18
fungitdasilva: basically, the "v" at the beginning is the problem14:19
*** esikachev has joined #openstack-infra14:19
*** _vs has quit IRC14:19
*** vdrok is now known as vdrok-afk14:19
fungitdasilva: so if you push a tag named "1.2.1" instead it should work14:20
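A tag can be checked locally before pushing. The Python sketch below uses a stand-in pattern that accepts only dot-separated numbers; it is an assumption for illustration and not the actual regex from layout.yaml, which should be consulted for the authoritative rule (for example, for pre-release suffixes).

```python
import re

# Stand-in pattern for illustration only; NOT the regex from layout.yaml.
RELEASE_TAG = re.compile(r"^[0-9]+(\.[0-9]+)*$")


def looks_like_release_tag(tag):
    """Return True if the tag consists solely of dot-separated integers."""
    return RELEASE_TAG.match(tag) is not None


if __name__ == "__main__":
    for candidate in ("v1.2.1", "1.2.1"):
        print(candidate, looks_like_release_tag(candidate))
```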
*** pilgrimstack has quit IRC14:21
fungipabelanger: when you were cleaning up nodepool, i guess you deleted held nodes too?14:22
*** yamahata has quit IRC14:22
pabelangerfungi: Oh, sorry. possible.  I don't explicitly ignore them14:22
funginot a big deal, just making sure we don't have something else weird going on14:23
pabelangerI can update my shell scripts to do that in the future14:23
fungiprobably a good idea in case anyone is doing something with one that can take a little time14:23
pabelangerYup, again apologies14:23
*** markusry has quit IRC14:24
*** rossella_s has quit IRC14:24
*** salv-orlando has quit IRC14:24
fungino need. i only noticed in this case because i went to clean one up i'd held yesterday and it was already gone14:24
*** rossella_s has joined #openstack-infra14:24
*** woodster_ has joined #openstack-infra14:29
*** kushal has joined #openstack-infra14:29
tdasilvafungi: thank you, will try again!14:30
*** inc0 has joined #openstack-infra14:31
*** zhurong has quit IRC14:32
*** denisra_ has joined #openstack-infra14:32
*** denisra has quit IRC14:33
*** jaosorior has quit IRC14:34
*** ddieterly[away] is now known as ddieterly14:34
*** akaszuba has quit IRC14:35
fungiinfra-root: heads up... according to rackspace this is the list of our volumes which may have been impacted by the network issues in dfw http://paste.openstack.org/show/505900/14:35
*** dmk0202 has quit IRC14:35
*** akaszuba has joined #openstack-infra14:35
*** akshai has joined #openstack-infra14:35
fungii'm going to start checking the corresponding servers for any indication of distress14:36
sigmavirus24Didn't project-config/gerrit/projects.yaml used to have a launchpad key for each particular project?14:36
*** pt_15 has joined #openstack-infra14:36
fungisigmavirus24: it's the "groups" setting14:36
sigmavirus24fungi: thanks14:36
*** esimone has joined #openstack-infra14:36
*** akaszuba has quit IRC14:36
sigmavirus24so the group name is the project name used by jeepyb, then, right?14:37
fungisigmavirus24: for projects doing task tracking that's their corresponding lp project name if it's not the same as the name of the repo. for projects using storyboard for task tracking it's a list of names of project-groups to which they should be added14:37
sigmavirus24Thanks fungi. That clears things up14:37
fungiyep14:37
*** akaszuba has joined #openstack-infra14:37
* sigmavirus24 suspects jeepyb just failed to update some of the bugs I was working on then14:38
sigmavirus24or launchpad failed to apply the updates or whatever14:38
fungijeepyb assumes the lp project name is the same as the repo's short name (the part after the /), but if it's not you can use the groups option to override it14:38
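The naming rule described above can be sketched as follows. This is not jeepyb's code: the function name is hypothetical, and treating the first entry of groups as the Launchpad project name is an assumption made for the example.

```python
def launchpad_project(repo, groups=None):
    """Guess the Launchpad project name for a repository.

    Defaults to the repo's short name (the part after the "/"), unless
    a groups override is supplied.
    """
    if groups:
        # Assumption: the first group names the Launchpad project.
        return groups[0]
    return repo.rsplit("/", 1)[-1]


if __name__ == "__main__":
    print(launchpad_project("openstack/pyeclib"))
    print(launchpad_project("openstack/openstack-ansible-os_ironic",
                            ["openstack-ansible"]))
```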
pabelangerfungi: thanks! Let me know if you find anything on ES02, was considering putting it into shutdown mode, so I can safely remove the volume for the migration (not to repeat our hung detach on graphite.o.o)14:38
fungisigmavirus24: which change was it? i can check and make sure lp permissions look correct14:38
*** xarses has joined #openstack-infra14:39
sigmavirus24fungi: it was a change with the openstack-ansible project from one of the other projects. The change number from yesterday was 32165714:39
*** EricGonczer_ has quit IRC14:40
jeblairfungi: that looks very close to a list of our volumes in dfw :)14:41
*** amrith is now known as _amrith_14:41
fungijeblair: i think it is an exact match :/14:41
*** nwkarsten has quit IRC14:41
jeblairfungi: afs01.dfw vicepa appears to be ro14:42
fungisigmavirus24: you need to add the "OpenStack Infra (hudson-openstack)" account to https://launchpad.net/~openstack-ansible-bugs/+members so that our bug update hook has adequate permission to reassign bugs in projects for which that group is a bug supervisor14:42
jeblairafs02 is rw14:42
fungijeblair: does that mean it switched over?14:42
sigmavirus24fungi: thanks I'll make sure odyssey4me sees that14:42
*** vdrok has joined #openstack-infra14:43
fungisigmavirus24: basically the hook tried to reassign that bug to you and leave a comment on it with a link to your change, but lp denied the api call because of insufficient permission to reassign14:43
sigmavirus24weird14:43
fungisigmavirus24: if the bug had already been assigned to you, the script would only have attempted to leave a comment (which would have worked fine)14:43
sigmavirus24before the subproject split, jeepyb used to work for osa.14:43
* sigmavirus24 nods14:44
sigmavirus24I've pinged the appropriate people14:44
sigmavirus24Thanks fungi14:44
*** nwkarsten has joined #openstack-infra14:44
fungiso either you just didn't ever notice that it has failed in the past when a bug needed reassignment, or the bug supervisor for that project changed at some point14:44
*** _amrith_ is now known as amrith14:44
*** akaszuba has quit IRC14:45
*** akaszuba has joined #openstack-infra14:45
openstackgerritMerged openstack-dev/hacking: Updated from global requirements  https://review.openstack.org/32165814:45
fungijeblair: somehow static.o.o came out of this unscathed. it had 14x the chance to get impacted of review.o.o14:46
jeblairwow14:46
* fungi won't willingly roll those dice again14:46
fungii'm assuming you saw in scrollback i had to take gerrit offline late last night and fsck /home/gerrit2, then remount it rw14:47
jeblairi missed the fsck part14:47
fungithere may be some discontinuities. we also saw frequent 500 errors from it because it looked like it kept losing contact with its trove instance14:47
*** nelsnelson has quit IRC14:47
fungijeblair: zuul.o.o's dmesg shows some segfaults from apache mod_mem_cache btw14:49
fungiat least a few a day going back to the last time it rebooted, presumably much longer14:49
fungijeblair: the fsck for the gerrit volume was a purely prophylactic measure; it didn't report actual corruption14:53
jeblairSetting free inodes count to 125879692 (was 132954091)14:53
jeblairSetting free blocks count to 230175018 (was 466121641)14:53
jeblairvicepa on afs01 reported only that14:53
fungithat's good at least14:53
*** ayoung has joined #openstack-infra14:54
fungiso presumably only an accounting problem with incomplete deletions14:54
*** jlanoux has quit IRC14:54
fungioh, nevermind i read that backwards14:54
fungiso it found unaccounted-for inodes/blocks14:54
openstackgerritMerged openstack-infra/system-config: Migrate elasticsearch to ubuntu-trusty  https://review.openstack.org/32064214:54
jeblairafs is salvaging volumes now...14:55
fungias far as the free count was concerned14:55
jeblair(this is automatic at startup, progress in /var/log/openafs/SalsrvLog)14:55
wznoinskjeblair hi, I think you may be able to help me with my question... this morning I had a site-wide issue for my CI, all jobs failing (as I figured out later on it was the rax outage affecting me), I'm wondering what would be the best way to 'pause' running jobs in the CI... till I troubleshoot the problem and give zuul the green light again...? (so far I figured out two ways: 1. I could point jobs to non-existent slaves in jenkins once I14:56
wznoinsk have all jobs registered with gearman - it will hold off submitting jobs to jenkins till there are slaves to run them, 2. stop zuul-merger causing zuul not to submit a job (as it doesn't have any OVERRIDE_ZUUL_REF to pass on)... I'm wondering is there a preferred/less hacky way to follow in such situations?14:56
openstackgerritVladyslav Drok proposed openstack-infra/project-config: Remove pxe_libvirt experimental job  https://review.openstack.org/32221514:56
jeblairwznoinsk: you can set jenkins in 'shutdown' mode and it won't launch any new jobs14:56
wznoinskjeblair yes, I've excercies that too but ideally I want to run test jobs from within jenkins to have the params same14:57
wznoinsks/excercies/exercised14:57
*** yfried has quit IRC14:57
openstackgerritVladyslav Drok proposed openstack-infra/project-config: Remove pxe_libvirt experimental job  https://review.openstack.org/32221514:57
fungiwznoinsk: maybe disable the gearman plugin in jenkins temporarily?14:57
jeblairyeah, that's a good one14:58
*** akaszuba has quit IRC14:58
wznoinskyeah, gearman should then have no place to send... wouldn't it unregister jobs from gearman server then?14:58
*** jlanoux has joined #openstack-infra14:59
jeblairyes it should14:59
jeblair(but that's fine)14:59
fungino, gearman only removes job registrations if you restart it/zuul14:59
fungiunless i'm misunderstanding14:59
jeblairoh i think we're talking about different things14:59
fungiwhat it shouldn't do is cause zuul to abort the jobs with a NOT_REGISTERED result15:00
wznoinskfungi that was my impression too, when I was changing project.yaml in jbb to use different labeled slave gearman server had the new job:slave and old one job:slave registered15:00
jeblairdisabling the gearman plugin should mean that the workers do not pick up a job from the server.  the server never forgets the names of jobs that have been registered, so there will be no NOT_REGISTERED errors as long as you don't restart zuul15:00
wznoinskkewl, thanks guys, I'll test it next big time15:01
*** jistr is now known as jistr|call15:01
*** isaacb has joined #openstack-infra15:01
openstackgerritEmilien Macchi proposed openstack-infra/project-config: jjb/puppet: fix conditional for xenial jobs  https://review.openstack.org/32221615:01
wznoinskbtw. I'm trying to find a script from one of the third-party CIs to generate DEVSTACK_GATE_TEMPEST_REGEX based on an exclusion list... would anyone have it to hand maybe?15:02
jeblairthe openafs client on the dfw mirror seems unhappy15:02
*** Julien-zte has quit IRC15:03
fungiyep, i was just looking at the logs15:03
jeblairah, i think its afs cache volume died15:03
fungithe logical volume on it seems not impacted though15:03
fungirecovered though?15:04
fungimount shows it still read-write15:04
jeblairtouch: cannot touch ‘/var/cache/openafs/foo’: Read-only file system15:04
fungiargh15:04
fungii was having trouble sifting through all the afs-related kernel errors in dmesg15:04
fungibut yes, now i see that buried in amongst them15:04
openstackgerritJesse Pretorius (odyssey4me) proposed openstack-infra/project-config: Add release announcement jobs for major OSA repos  https://review.openstack.org/31923415:05
wznoinskfound it: https://github.com/a10networks-ci/neutron-thirdparty-ci/blob/master/slave/v1-testcases15:05
jeblairlooks like we're up to using 7.4G of that now... so we might be able to move that to the ephemeral volume to increase resiliency15:05
jeblair(it's a bit much for locating on /)15:05
fungiseems reasonable15:06
fungithere we go... [Fri May 27 02:31:50 2016] end_request: I/O error, dev xvdb, sector 100942424[Fri May 27 02:31:50 2016] end_request: I/O error, dev xvdb, sector 10094242415:06
jeblairpossibly due to the kernel panics, i don't think i can recover this without a reboot15:06
fungithe afs errors just (barely) preceded the block device errors15:07
*** kzaitsev_mb has quit IRC15:07
fungiyep, i'd expect to reboot it anyway15:07
fungi(and looks like you just did)15:07
openstackgerritJesse Pretorius (odyssey4me) proposed openstack-infra/project-config: Rename openstack-ansible-ironic to openstack-ansible-os_ironic  https://review.openstack.org/29919215:07
*** ifarkas_ has quit IRC15:08
jeblairoh, heh, sorry.  i'm pre-breakfast and did not quite connect "fungi is reading the log files" with "fungi is logged into the system"15:08
fungino worries. i was "done" anyway ;)15:08
fungiyep, broken, needs reboot15:08
funginot much else to see there. we know what caused it15:09
notmorganfungi, jeblair: I think one of the git mirrors in git.openstack.org (or two) are unhappy.15:09
funginotmorgan: how recently? rackspace had some terrible network outages overnight in that region15:10
notmorganfungi, jeblair: was getting weird SSL errors, followed by repositories disappearing and reappearing on refreshes of cgit page15:10
notmorganfungi: ... 3 minutes ago?15:10
fungioh, ick15:10
notmorgannotably openstack-infra namespace was missing 5-10 repos on a couple refreshes of cgit (web browsing) and was getting  SSL read: error:00000000:lib(0):func(0):reason(0), errno 10415:11
*** tesseract has quit IRC15:11
notmorganwhen trying to clone.15:11
notmorganhit/miss15:11
notmorganso sometimes it would work.15:11
fungiyeah, sounds like you were sometimes getting load-balanced to a broken server. i'll see if i can find it15:12
notmorganwish i could make it easier to determine which one(s) were broken15:12
clarkbthe backends are directly accessible15:12
*** denisra_ is now known as denisra15:13
fungias for http://paste.openstack.org/show/505900/ i've gotten through eavesdrop, so at this point just need to check on all the elasticsearch servers besides 0415:13
fungialso pabelanger already checked on 02 because he's in the process of replacing it15:13
pabelangerYup15:13
fungihere are some fun kernel messages...15:14
fungi[Fri May 27 15:13:38 2016] xen_netfront: xennet: skb rides the rocket: 20 slots15:14
fungisay wha?!?15:14
notmorganfungi: heh "fun"15:14
fungii'm guessing this is related to connection rate limiting for haproxy15:15
*** jlanoux has quit IRC15:15
*** jlanoux has joined #openstack-infra15:15
*** Jeffrey4l has quit IRC15:16
notmorganls15:16
jeblairNo such file or directory15:16
notmorganjeblair: ++15:16
jeblairbandersnatch just successfully updated, so i think afs rw volumes are happy now15:17
*** amrith is now known as _amrith_15:17
*** nwkarsten has quit IRC15:17
*** armax has joined #openstack-infra15:17
*** Kaiyan has joined #openstack-infra15:17
EmilienMI see a lot of RAX repos timeouts15:18
EmilienMcan't reach http://mirror.dfw.rax.openstack.org/centos/7/updates/x86_64/repodata/repomd.xml15:18
EmilienMlot of jobs are currently failing15:18
jeblairEmilienM: yeah, i'm repairing that mirror right now15:18
*** cody-somerville_ has quit IRC15:18
jeblairshould be just another minute15:18
*** _amrith_ is now known as amrith15:19
*** cody-somerville has joined #openstack-infra15:19
jeblairdone15:20
funginotmorgan: clarkb: looks like git08 is unhappy. systemd is rapidly restarting the git daemon15:20
*** nwkarsten has joined #openstack-infra15:20
jeblairfungi: where do you see that?15:23
fungijeblair: dmesg -T15:23
*** salv-orlando has joined #openstack-infra15:23
*** rcernin has quit IRC15:24
fungii don't see any of the other 7 git servers exhibiting this logging at least15:24
jeblairwow.  i like that's not logged anywhere.15:24
fungiyeah, i'm thinking it's in the systemd journal15:24
*** jistr|call is now known as jistr15:25
openstackgerritMorgan Fainberg proposed openstack-infra/project-config: Add non-voting py34 job for zuul  https://review.openstack.org/32223015:25
jeblairapparently the journal runs from October through mid December15:25
EmilienMjeblair: cool thx15:25
fungii see that15:25
notmorganjeblair: well at least it's not in systemctl-journal only15:26
openstackgerritIsaac Beckman proposed openstack-infra/nodepool: Add log config option to nodepool cmd  https://review.openstack.org/32148015:26
jeblairnotmorgan: i think it is?15:26
notmorganjeblair: erm... systemd...15:26
notmorganjeblair: iirc on my local system i don't even see most of that stuff in dmesg15:26
jeblairoh, you mean the crumbs in dmesg15:26
notmorganjeblair: yeah.15:26
*** Qiming has quit IRC15:26
notmorganjeblair: i am *not* a fan of the systemd journal thing.15:27
jeblairapparently in december it was also starting the git daemon a lot15:27
* notmorgan kindof misses rsyslog15:27
notmorganjeblair: interesting. wonder if it's something with that host, something with the LB sending off traffic to it, etc.15:28
clarkbsudo journalctl -f doesnt follow a current log?15:28
jeblairclarkb: nope, ends December 1615:28
jeblair-- Logs begin at Tue 2015-10-27 06:05:40 UTC, end at Wed 2015-12-16 09:08:31 UTC. --15:28
notmorganoh thats fun.15:28
* notmorgan remembers to vacuum local logs.15:29
jeblairNOW WHO'S STUCK IN THE PAST, SYSTEMD!15:29
fungibwahahahahaha15:29
notmorganjeblair: LOL15:29
wznoinsk+!15:29
*** links has quit IRC15:29
fungii love how systemctl status paginates in more which refuses to render its fancy tree characters15:30
fungihave to |cat to see a proper rendering15:30
*** hongbin has joined #openstack-infra15:31
clarkbI want to say on debuntu at least installing rsyslog sets up journal to rsyslog stuff. But if journald isnt recording it wont write to rsyslog either15:31
*** gomarivera has joined #openstack-infra15:32
jeblairerm, does centos even write the journal to disk?15:32
jeblairi can't find the file to even see what the size is...15:32
openstackgerritMerged openstack-infra/tripleo-ci: Add md5 files to images upload  https://review.openstack.org/32090615:32
jeblairjournalctl --disk-usage15:33
jeblairArchived and active journals take up 368.0M on disk.15:33
jeblair/run/log!15:33
jeblair(i straced that command to find out where the journals were :)15:33
jeblairso it's in a tmpfs15:34
fungii still can't find where the service definition is for the git daemon15:34
*** mixos has joined #openstack-infra15:34
clarkbfungi: its in with all the others, we write our own out though because the centos one is broken15:34
fungiyeah, just can't *find* it15:35
funginot in /etc/systemd, not in /usr/share/systemd, not in /etc/init.d...15:35
clarkbI think it is in /usr/share/systemd15:36
fungiaha, /usr/lib/systemd/system15:36
fungipuppet manifest ftw15:36
clarkbnote the filename has an @ in it because systemd uses symbols in filenames to affect behavior15:36
*** Swami has joined #openstack-infra15:36
*** kzaitsev_mb has joined #openstack-infra15:37
*** salv-orl_ has joined #openstack-infra15:37
fungifor some reason `systemctl status git` and `systemctl status git-daemon` both act like those aren't defined15:38
jeblairapparently it's supposed to rotate automatically15:38
fungieven appending the @ to them15:38
jeblairso basically, no idea why we stopped getting journal entries15:39
clarkbfungi so it does socket activate them I wonder if that causes status to be weird15:39
* jeblair thinks we should nuke 08 and rebuild15:39
fungiclarkb: yeah, the units for them must be dynamically created by socket activation because they show up like git-daemon@16841920-104.239.146.131:29418-104.130.246.128:55211.service15:40
*** salv-orlando has quit IRC15:40
*** vhosakot has joined #openstack-infra15:41
jeblairgit07 journal ends Dec 1515:41
*** arxcruz has quit IRC15:41
*** d34dh0r53 is now known as h0m3r15:41
*** roxanaghe has joined #openstack-infra15:41
*** lezbar__ has quit IRC15:42
*** ddieterly is now known as ddieterly[away]15:43
openstackgerritMorgan Fainberg proposed openstack-infra/zuul: Python 3 Fixes: Use print() not print  https://review.openstack.org/32223815:43
*** jordanP has quit IRC15:43
jeblairthe journals on all 8 servers end either dec 15 or 16, regardless of when they started15:43
*** sigmavirus24 is now known as m3du5a15:43
*** h0m3r is now known as d34dh0r5315:43
jeblair(they start various times sept thru oct)15:44
*** m3du5a is now known as sigmavirus2415:44
notmorganjeblair: that is weird.15:44
openstackgerritPaul Belanger proposed openstack-infra/puppet-elasticsearch: Set permissions on /var/lib/elasticsearch  https://review.openstack.org/32224215:44
fungimaybe we merged a change around then to adjust their logging?15:44
pabelangerclarkb: ^ think that should work15:45
*** hashar is now known as hasharAway15:45
fungithat was, i think, only a few days before our gerrit upgrade though i can't think of anything related to prep for that which might cause it15:45
*** ddieterly[away] is now known as ddieterly15:45
openstackgerritPaul Belanger proposed openstack-infra/puppet-elasticsearch: Set permissions on /var/lib/elasticsearch  https://review.openstack.org/32224215:45
rcarrillocruzclarkb , pabelanger : do our centos7/trusty dib images have /usr/local/bin/env or /usr/bin/env ?15:46
fungijeblair: hah! run `systemctl --failed`15:46
fungisystemd-journald.service     loaded failed failed Journal Service15:46
rcarrillocruzi smell i'm getting test failures on https://review.openstack.org/#/c/322189/2/tests/inventory related to that15:46
rcarrillocruzissue is the shade module can't be found by ansible in the tox venv15:46
*** nwkarsten has quit IRC15:46
openstackgerritLukas Bednar proposed openstack-infra/jenkins-job-builder: Builders: Add ansible-playbook builder  https://review.openstack.org/32224315:46
pabelangerrcarrillocruz: I would think /usr/bin/env15:47
pabelangerbut would need to confirm15:47
pabelanger(can't actually confirm ATM)15:47
rcarrillocruzi'll push a change with /usr/local/bin/env , /usr/bin/env not working in the gate15:47
rcarrillocruzit's not urgent, you have better things to do in the sprint15:48
*** Swami has quit IRC15:48
jeblairfungi: wow.  i don't.  wow.15:48
fungijeblair: we could check its log to see why it... no, wait15:49
*** Swami has joined #openstack-infra15:49
*** nwkarsten has joined #openstack-infra15:49
*** lezbar has joined #openstack-infra15:51
fungii'm going to try starting it and see if it says why it's not able to start15:52
*** vincentll has quit IRC15:52
fungiof course, it fails and suggests checking `journalctl -xe` for the cause15:53
fungiwhich... no. still has nothing since december15:54
fungiof course, i should look in dmesg!15:55
fungi[Fri May 27 15:52:38 2016] systemd-journald[10568]: Failed to get machine id: Permission denied15:55
* fungi smells selinux at work15:55
*** deadnull_ has joined #openstack-infra15:56
fungihttps://bugzilla.redhat.com/show_bug.cgi?id=131200115:57
openstackbugzilla.redhat.com bug 1312001 in systemd "systemd-journal won't start with avc: denied" [Unspecified,Closed: worksforme] - Assigned to systemd-maint15:57
*** ddieterly is now known as ddieterly[away]15:57
fungican someone who speaks redhatese translate that for me?15:58
jeblairtype=AVC msg=audit(1464364360.197:185776433): avc:  denied  { read } for  pid=10582 comm="systemd-journal" name="machine-id" dev="tmpfs" ino=7471 scontext=system_u:system_r:syslogd_t:s0 tcontext=system_u:object_r:var_run_t:s0 tclass=file15:58
jeblairfungi: i think you are correct about the selinux involvement15:58
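The AVC denial quoted above packs the relevant facts into key=value pairs: which process was denied, on which file, and the SELinux types involved. A minimal Python sketch for pulling those out of the line as pasted; the helper names are hypothetical, and the parser is only as general as this one message.

```python
import re

AVC = ('type=AVC msg=audit(1464364360.197:185776433): avc:  denied  { read } '
       'for  pid=10582 comm="systemd-journal" name="machine-id" dev="tmpfs" '
       'ino=7471 scontext=system_u:system_r:syslogd_t:s0 '
       'tcontext=system_u:object_r:var_run_t:s0 tclass=file')


def parse_avc(line):
    """Split an AVC audit line into its key=value fields plus the denied perms."""
    pairs = re.findall(r'(\w+)=("[^"]*"|\S+)', line)
    fields = {key: value.strip('"') for key, value in pairs}
    perms = re.search(r'\{([^}]*)\}', line)
    fields["perms"] = perms.group(1).split() if perms else []
    return fields


def selinux_type(context):
    """Return the type component of a user:role:type:level context string."""
    return context.split(":")[2]


if __name__ == "__main__":
    avc = parse_avc(AVC)
    print(avc["comm"], "denied", avc["perms"], "on", avc["name"])
    print("source type:", selinux_type(avc["scontext"]))
    print("target type:", selinux_type(avc["tcontext"]))
```

Here the source type is syslogd_t and the target file carries var_run_t, which is the label mismatch the channel goes on to discuss.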
fungithere is a "solution" in the reply to that bug15:58
*** liusheng has quit IRC15:59
* jeblair reads bug15:59
ttxodyssey4me: fwiw we don't need openstack-admins as team members in Launchpad. Just as team *owners*, so we can escalate to admin role in case of need.15:59
fungisuggests rerunning restorecon15:59
*** lakshmiS has joined #openstack-infra15:59
*** liusheng has joined #openstack-infra15:59
jeblair-rw-r--r--. root root system_u:object_r:var_run_t:s0   /etc/machine-id15:59
odyssey4mettx ah ok - I'm just doing some housekeeping16:00
ttxodyssey4me: that way we don't have rights on everything and if we escalate for admin reasons we leave a trail16:00
jeblairour machine-id does in fact have the wrong label16:00
ttxodyssey4me: I re-deactivated us16:00
fungijeblair: yeah, just confirmed it myself16:00
odyssey4mettx ok, thanks16:00
*** bpokorny has joined #openstack-infra16:00
*** cody-somerville has quit IRC16:01
clarkbcloud init at fault maybe16:01
ttxodyssey4me: the reason we are added in the first place is that LP automatically adds the team owner as an 'admin' member16:01
fungijeblair: not a fan of the bug resolution there, as it gives us no indication of what happened in mid-december to cause this16:01
fungiclarkb: yeah, there's a possible explanation. i wonder if a reboot will un-fix it again16:01
odyssey4mettx ah ok, makes sense to me now16:02
jeblairoh wow, i just noticed something -- our logs aren't from oct 27 -- dec 1616:02
openstackgerritLukas Bednar proposed openstack-infra/jenkins-job-builder: Builders: Add ansible-playbook builder  https://review.openstack.org/32224316:02
jeblairthey are for oct 27 *and* dec 1616:02
jeblairno other days16:02
*** Guest53789 has quit IRC16:02
jeblairwhere do puppet logs go on this host?16:04
*** psachin has quit IRC16:04
clarkbthey might go to /var/log/messages if they somehow bypass journald16:05
fungiseems they don't16:05
jeblairclarkb: that file only contains a startup line for rsyslogd16:05
fungii expect they go to /var/log/messages _by way of_ journald16:05
jeblairyum.log says a bunch of packages were updated on dec 1616:06
fungipretty sure the chain is log socket -> journald -> rsyslog export -> logfile16:06
jeblairfungi: except i don't think we have an rsyslog export configured16:06
*** bhavik has joined #openstack-infra16:06
fungiahh16:06
fungii fitured it had simply rotated away if journald had been sending nothing to it since december16:07
fungier, figured16:07
jeblairMay 22 03:24:01 git08 rsyslogd: [origin software="rsyslogd" swVersion="7.4.7" x-pid="23926" x-info="http://www.rsyslog.com"] rsyslogd was HUPed16:07
*** trown is now known as trown|lunch16:08
greghaynessounds like what happens if you logrotate under rsyslog and don't hup it16:08
greghayneshaving done that one before16:09
*** isaacb has quit IRC16:10
fungiyeah, i'm assuming with journald sending nothing to rsyslogd, the only thing rsyslogd is going to put in the logs is its own log entries about itself16:10
greghaynesor that16:10
*** esikachev has quit IRC16:10
jeblairwho owns /dev/log?16:10
jeblairdoes that go to systemd?16:10
fungiroot run barter town16:10
fungiouch. if i try to echo something to it, bash: /dev/log: No such device or address16:11
fungibut ls shows it there16:11
jeblairyeah, logger gets ECONNREFUSED16:11
*** _vs has joined #openstack-infra16:12
greghayneson a systemd system it should be journald16:12
fungiwhich makes sense with journald unable to start16:12
*** salv-orl_ has quit IRC16:12
*** oanson has joined #openstack-infra16:13
*** jlanoux has quit IRC16:13
jeblairso yeah, that seems to support the socket->journald->rsyslog->file chain16:13
jeblairokay, so i think the only thing left to do here wrt logging is restorecon, yeah?16:13
jeblairi'm kind of assuming one of those package updates on dec 16 munged the file context16:14
openstackgerritCaleb Boylan proposed openstack-infra/shade: [WIP] Add function to update object metadata  https://review.openstack.org/32187816:14
fungithat's the only working theory i have16:16
melwittlogstash.openstack.org appears blank. is there a known issue about it?16:16
fungiwere you going to run it, or shall i?16:16
jeblairfungi: why don't you?16:17
fungimelwitt: pabelanger is in the middle of replacing the elasticsearch cluster members i think, so that might be having an impact on lookups16:17
fungimelwitt: especially if it's the one kibana is pointed at as the master16:17
openstackgerritCaleb Boylan proposed openstack-infra/shade: Make it easier to give swift objects metadata  https://review.openstack.org/32183516:17
melwittfungi: ah, okay. thanks16:17
fungijeblair: oh, even better!16:18
fungirestorecon set context /etc/machine-id->system_u:object_r:machineid_t:s0 failed:'Read-only file system'16:18
melwittI went there to see if many other jobs are failing to fetch packages that I saw in a recent job http://logs.openstack.org/58/317958/4/check/gate-devstack-bashate/6ad3060/console.html#_2016-05-27_15_14_03_87516:18
*** ddieterly[away] is now known as ddieterly16:19
clarkbmelwitt: pabelanger fungi the kibana instance talks to elasticsearch02.openstack.org when it proxies iirc and that was the first one replaced16:19
*** johnny___ has joined #openstack-infra16:19
clarkbso ya definitely possible that it is related16:19
fungiclarkb: pabelanger: maybe kibana is still trying to connect to the old ip address and needs to re-resolve it?16:19
jeblairmelwitt: the cause of those failures should be corrected now16:19
*** amrith is now known as _amrith_16:20
melwittjeblair: okay. is the "/tmp/ansible/bin/ansible: No such file or directory" also related to that? http://logs.openstack.org/58/317958/4/experimental/gate-tempest-dsvm-cells/eeed3f5/console.html#_2016-05-27_15_14_24_13916:20
pabelangerclarkb: fungi Oh, maybe. I thought I restarted all the firewalls16:20
*** nwkarsten has quit IRC16:21
fungihttps://bugzilla.redhat.com/show_bug.cgi?id=123186916:21
openstackbugzilla.redhat.com bug 1231869 in selinux-policy "rsyslog stops working after restart if SELinux is enabled" [Unspecified,Closed: notabug] - Assigned to mgrepl16:21
*** yamahata has joined #openstack-infra16:21
jeblairmelwitt: i don't know about that16:22
fungitmpfs on /etc/machine-id type tmpfs (ro,relatime,seclabel,mode=755)16:22
pabelangerokay, I've restarted apache on logstash.o.o16:22
*** dtantsur is now known as dtantsur|afk16:22
clarkbpabelanger: ya that could be it since it goes through the proxy there16:23
pabelangerclarkb: Yup, it was16:23
jeblairwow16:23
jeblairthat's a tmpfs16:23
clarkbpabelanger: confirmed working for me now. melwitt you should be able to use it now16:23
fungijeblair: seems it's also unfixable without a reboot16:24
*** nwkarsten has joined #openstack-infra16:24
fungii guess i can try to remount it rw16:24
melwittclarkb: got it, thanks! already on it doing searches :)16:24
*** oanson has quit IRC16:24
jeblairfungi: ok, though i'm leaning toward reboot16:25
fungijeblair: actually, remounting it rw seems to have allowed restorecon to dtrt16:25
jeblairalrighty then16:25
jeblairfungi: maybe remount ro now?16:25
fungii'll remount it ro again in a sec, yep16:25
fungirunning fixfiles -f relabel to see if there's anything else that needs fixing16:26
funginope, that seems to have been it16:26
fungihah, can't remount ro now... mount: /etc/machine-id is busy16:26
*** kzaitsev_mb has quit IRC16:26
*** oanson has joined #openstack-infra16:27
fungianyway, going to try to start journald up again and see if we can get some details on why git-daemon is breaking16:29
*** cindy has joined #openstack-infra16:30
fungistill getting "systemd-journald[16236]: Failed to get machine id: Permission denied"16:30
fungii guess at this point we probably need to just take it out of the pools in haproxy and reboot the server?16:31
cindyHi.  I have a opendaylight CI build error that i don’t understand.  Any ideas?  https://jenkins.opendaylight.org/releng/job/docs-verify-rtd-boron/28/16:31
jeblairdo we need to remove it from the pool?  it doesn't handle that automatically?16:31
fungii thought it didn't do health checks16:32
clarkbit does do health checks16:33
fungicindy: i'm curious why you're asking in here about problems with a job running in opendaylight's ci16:33
clarkbbut if you remove it without telling haproxy any existing connections can have a sad16:33
fungicindy: care to elaborate?16:33
jeblairif it's layer7 lbing, it shouldn't need to (every req is a health check, right?)16:33
*** oanson has quit IRC16:33
melwitthits on "/tmp/ansible/bin/ansible: No such file or directory" http://goo.gl/NQu3rk are many starting today16:33
*** salv-orlando has joined #openstack-infra16:33
clarkbjeblair: its an l3 thing iirc, it just checks a 3 way handshake16:33
cindy@fungi sorry, i’m not sure what room to ask about opendaylight problems, i was surprised to see it16:34
melwittthe share url didn't keep that I used last 7 days16:34
clarkbmelwitt: ya the kibana 3 url sharing is somewhat hacky and doesn't pass through that value16:34
jeblairmelwitt: have you rechecked that job since the mirror was fixed?16:34
fungicindy: opendaylight isn't part of openstack as far as i know, but maybe ask in the monasca channel since this is a third party ci reporting on changes for one of their repositories16:35
*** cody-somerville has joined #openstack-infra16:35
*** _vs has quit IRC16:36
melwittjeblair: not yet. I didn't know if the /tmp/ansible/bin/ansible thing was related to that16:36
*** mhickey has quit IRC16:36
fungicindy: also the error message it's leaving on that change lists contact info for their ci linked at https://wiki.openstack.org/wiki/ThirdPartySystems/OpenDaylight_CI16:36
jeblairmelwitt: i think rechecking may help us find out the answer to that16:36
*** mikelk has quit IRC16:36
melwittjeblair: okay, will do16:36
clarkbjeblair: melwitt if I had to guess the bit that installs ansible is not run with set -e, it fails then we get to a bit that is run with err exit and it errors because no such file or dir16:37
fungiclarkb: so what's the preference? just reboot git08 or admin down it in the haproxy pools and then reboot it?16:37
cindy@fungi interesting, we recently must have added it, i’ll find out why16:37
clarkbfungi: admin down then reboot it is always preferable as that should more gracefully handle existing connections16:37
fungicindy: they only list it as reporting on neutron changes, so maybe they've misconfigured it to start reporting on monasca changes too?16:37
*** 32NAA99AQ has joined #openstack-infra16:37
jeblairfungi: hrm, i restorecon'd and it just changed the context to system_u:object_r:machineid_t:s016:38
jeblairfungi: can you try starting again?16:38
fungijeblair: that seems to have worked16:38
jeblairfungi: i did restorecon /etc/machine-id16:39
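A minimal sketch of the label check discussed above, assuming the old-style `ls -Z` line format pasted earlier in the log. The helper name `selinux_type` is hypothetical and not part of any SELinux tooling; it only pulls the type field out of a context such as `system_u:object_r:var_run_t:s0` so it can be compared with the `machineid_t` type that restorecon reported.

```python
def selinux_type(ls_z_line: str) -> str:
    """Return the type field of the SELinux context in one `ls -Z` output line.

    Hypothetical helper for illustration only. A context has the form
    user:role:type:level, e.g. system_u:object_r:var_run_t:s0.
    """
    for field in ls_z_line.split():
        parts = field.split(":")
        # Assumption: the context is the only field with at least four
        # colon-separated parts whose first part ends in "_u".
        if len(parts) >= 4 and parts[0].endswith("_u"):
            return parts[2]
    raise ValueError("no SELinux context found in line")


# Line copied from the log (the mislabelled file) and the context
# restorecon reported after relabelling.
before = "-rw-r--r--. root root system_u:object_r:var_run_t:s0   /etc/machine-id"
after = "-rw-r--r--. root root system_u:object_r:machineid_t:s0   /etc/machine-id"

EXPECTED_TYPE = "machineid_t"
print(selinux_type(before) == EXPECTED_TYPE)
print(selinux_type(after) == EXPECTED_TYPE)
```

Running the same check again after `fixfiles -f relabel` would have shown the label reverting, which is what the console history pastes revealed.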
*** 32NAA99AQ has quit IRC16:39
jeblair(i have restarted haproxy-statsd so we get data to graphite/grafana)16:40
*** _ari_ is now known as _ari_|afk16:40
fungijeblair: strange... here it is out of my console history http://paste.openstack.org/show/505927/16:40
jeblairfungi: huh, did something set it back?16:41
fungijeblair: oh! fixfiles -f relabel looks like it set it back again16:41
jeblairwow16:41
jeblairwhat is fixfiles?  never used it16:41
cindy@fungi the monasca team doesn’t know why opendaylight is reporting on us.  Should we go to a neutron room you think to change this?16:41
*** thorst_ has quit IRC16:42
clarkbcindy: you should contact the people running the CI and talk to them16:42
clarkbcindy: https://wiki.openstack.org/wiki/ThirdPartySystems/OpenDaylight_CI includes contact info16:42
fungijeblair: yep http://paste.openstack.org/show/505928/16:42
*** _vs has joined #openstack-infra16:42
fungijeblair: fixfiles is _supposed_ to relabel according to configured policy16:43
cindy@clarkb thanks! I just saw the contact link above. Thanks fungi too!16:44
*** thorst_ has joined #openstack-infra16:44
*** asettle has joined #openstack-infra16:44
*** pt_15 has quit IRC16:46
*** sarob has joined #openstack-infra16:46
fungijeblair: anyway, journalctl -xe has some details for us about git-daemon now16:47
*** nwkarsten has quit IRC16:47
*** kzaitsev_mb has joined #openstack-infra16:47
*** dizquierdo has joined #openstack-infra16:48
*** thorst_ has quit IRC16:48
fungijeblair: i'm now thinking the service failures for git-daemon are misleading16:48
pabelangerjeblair: fungi: +1 for fixfiles. Recently started using it over restorecon16:49
fungipabelanger: well, in this case fixfiles seems to set an incorrect label for /etc/machine-id16:49
fungipabelanger: while restorecon sets a working one16:49
*** nwkarsten has joined #openstack-infra16:49
*** asettle has quit IRC16:50
pabelangerfungi: Odd, haven't had that issue before16:50
*** lucasagomes is now known as lucas-dinner16:50
pabelangermy issues with restorecon were running it in a chroot16:50
pabelangerit checks the host system for SELinux, and if not found it silently fails, and returns success16:50
*** _vs has quit IRC16:50
clarkbmtreinish: you around? before I delete the old logstash.o.o can you double check that the subunit2sql things are all working as you expect? you are getting new data into mysql and the mysql proxy is functional16:51
*** ilyashakhat has quit IRC16:52
*** sdake has joined #openstack-infra16:52
fungialso, even with journald running again we're still not getting anything in /var/log/messages16:52
clarkbfungi: I think jeblair is right on centos7 (unlike debuntu) we must not get that configured when we install rsyslog16:52
*** burgerk has quit IRC16:53
sarobi have a small problem with https://review.openstack.org/#/c/320645/ not creating a new irc meeting16:53
fungithough the journal for the systemd-journal service indicates that it's being flooded by messages from /system.slice/system-git\x2ddaemon.slice16:53
tdasilvafungi: I tried to create a new tag for pyeclib '1.2.1' but 'git os-job 1.2.1' still returns with a 404, any ideas?16:54
*** Apoorva has joined #openstack-infra16:54
*** javeriak has joined #openstack-infra16:54
*** csomerville has joined #openstack-infra16:55
jeblairsarob: i'll check on it16:55
*** cloudtrainme has joined #openstack-infra16:56
*** cody-somerville has quit IRC16:56
fungitdasilva: zuul's debug log says there are no jobs configured for it. do you have any release pipeline jobs set up at all?16:56
*** liusheng has quit IRC16:57
fungitdasilva: looks like no... http://git.openstack.org/cgit/openstack-infra/project-config/tree/zuul/layout.yaml#n999616:57
*** liusheng has joined #openstack-infra16:58
*** javeriak_ has joined #openstack-infra16:58
openstackgerritLukas Bednar proposed openstack-infra/jenkins-job-builder: Builders: Add ansible-playbook builder  https://review.openstack.org/32224316:58
tdasilvafungi: ah ok, so I need to add publish-to-pypi and pypy-jobs ?16:58
*** derekh has quit IRC16:58
fungitdasilva: yeah, see http://docs.openstack.org/infra/manual/creators.html#configure-zuul-to-run-jobs16:59
fungitdasilva: also python-jobs or at least something that'll run a tarball job for you16:59
*** javeriak has quit IRC16:59
*** asettle has joined #openstack-infra16:59
*** haypo has left #openstack-infra16:59
fungitdasilva: pypy-jobs is for running your unit tests under the "pypy" interpreter, so that's probably not one you care to add17:00
tdasilvafungi: yeah, i have this patch up for review: https://review.openstack.org/#/c/317672/17:00
openstackgerritIhar Hrachyshka proposed openstack-infra/release-tools: Added lp-tag.py tool that helps adding tags to bugs  https://review.openstack.org/32227017:00
openstackgerritIhar Hrachyshka proposed openstack-infra/release-tools: docs: added missing .py extension to annotate-lp-bugs example  https://review.openstack.org/32227117:00
tdasilvafungi: so I will just update that17:00
clarkbunless you want to make sure you support pypy17:00
fungiright17:00
*** kzaitsev_mb has quit IRC17:00
tdasilvafungi: ok, thanks for the heads up on pypy-jobs17:00
jeblairsarob: the publishing job raced a second one that didn't have that meeting in it.  it should show up the next time a change lands to that repo17:01
fungi(though it's also globally nonvoting at the moment, and the version on trusty doesn't work with some projects' dependencies such as cryptography>=1.0)17:01
tdasilvaclarkb, fungi. I read that too quickly as pypi-jobs17:01
*** jamesmcarthur has quit IRC17:01
openstackgerrityolanda.robla proposed openstack-infra/shade: Add magnum services call to shade  https://review.openstack.org/31358317:01
*** asettle has quit IRC17:02
clarkbfungi: I was thinking about that a bit more. We can switch pypy to xenial across the board, any of the jobs that fail we delete, the rest we restrict to newer than mitaka, then email dev list and say "hey we did this if pypy is important to you let us know and we can add it back in but please get it working"17:03
*** eezhova has quit IRC17:04
clarkbthat should hopefully put us in a good position to have meaningful pypy testing going forward because right now I think it's just a bunch of tests we run that never work17:04
fungiclarkb: sure, seems a fine solution to me17:04
*** jamesmcarthur has joined #openstack-infra17:05
*** gyee has joined #openstack-infra17:05
openstackgerritMichael Krotscheck proposed openstack-infra/puppet-phabricator: Development tools for puppet-phabricator  https://review.openstack.org/31032017:06
*** oanson has joined #openstack-infra17:06
openstackgerritMichael Krotscheck proposed openstack-infra/puppet-phabricator: De-Montyfy Puppet-phabricator  https://review.openstack.org/31031917:06
openstackgerritThiago da Silva proposed openstack-infra/project-config: add python jobs to pyeclib project  https://review.openstack.org/31767217:06
*** trown|lunch is now known as trown17:07
tdasilvafungi, clarkb ^^^17:07
*** _sarob has joined #openstack-infra17:07
clarkband then when 2020 comes around we drop py27 and say use pypy :)17:07
*** e0ne has quit IRC17:07
*** shashank_hegde has joined #openstack-infra17:09
*** e0ne has joined #openstack-infra17:09
*** sarob has quit IRC17:09
*** nwkarsten has quit IRC17:10
*** e0ne has quit IRC17:10
*** tonytan4ever has quit IRC17:10
*** akshai has quit IRC17:10
*** cloudtrainme has quit IRC17:10
*** nwkarsten has joined #openstack-infra17:11
*** oanson has quit IRC17:11
*** Goneri has quit IRC17:12
*** jheroux has joined #openstack-infra17:12
*** eezhova has joined #openstack-infra17:13
*** dguitarbite_ has joined #openstack-infra17:14
fungii'm not getting anywhere on the restarting git processes on git0817:14
fungithe journal doesn't include anything useful, just startup messages17:14
clarkbare we sure those git daemons are non functional? they are being inetd'd basically by systemd using socket activation17:15
fungiactually, the flood of them in dmesg ceased at 16:39 utc17:16
clarkbgit clone git://git08.openstack.org:29418/openstack/neutron seems to work from here17:16
fungiwhich is exactly when i started systemd-journald again17:16
fungispooky17:17
*** csomerville has quit IRC17:17
*** abregman has joined #openstack-infra17:17
*** gyee has quit IRC17:18
notmorganfungi: weird.17:18
*** ilyashakhat has joined #openstack-infra17:18
fungiahh, because it's going into the journal and not being reported back to dmesg i guess? i see some similar starting/started messages looking at journalctl -xe17:19
clarkbdid we change anything on our end to make osic happier? we seem to be using a proper amount of quota there now17:19
*** ayoung has quit IRC17:20
jeblairfungi: is the git daemon an inet daemon, and so perhaps those messages are normal?17:20
*** kzaitsev_mb has joined #openstack-infra17:20
*** gyee has joined #openstack-infra17:20
*** kushal has quit IRC17:20
notmorganjeblair: that seems... odd to make it inet.17:20
clarkbjeblair: basically yes, it uses systemd's similar functionality (this is what the @ in the name means)17:20
notmorganjeblair: i mean, i don't know the best practices today but that seems weird to me.17:21
clarkbnotmorgan: it's actually how you are supposed to do things with systemd now didn't you know?17:21
* notmorgan remembers a big push to move away from inet-like things.17:21
jeblairnotmorgan: i don't think there is another option with the git protocol17:21
notmorganjeblair: ah.17:21
clarkbjeblair: that too17:21
notmorganjeblair: that makes more sense17:21
fungijeblair: possibly, though we've got 4 git-daemon processes on our other servers and only one on git08. also systemctl --failed reports failed git-daemon processes on 08 but not on others17:21
*** mixos has quit IRC17:21
notmorganclarkb: let me just get angry and find a soapbox for more ... reasons about systemd now. :P17:21
*** ociuhandu has quit IRC17:22
fungino, i take that back, we have a varying number. i just got lucky on some spot checks17:22
clarkbnotmorgan: basically you put an @ in the unit file name (says allow me to run many instances of this) then set up the socket activation stuff and you have an inet17:22
fungihowever, systemctl --failed is only reporting failed git-daemon services on 0817:22
jeblairfungi: perhaps the failed ones are just individual spawn instances that have failed for some reason or other?17:23
*** cindy has left #openstack-infra17:23
jeblairthe unit name "git-daemon@16841920-104.239.146.131:29418-104.130.246.128:55211.service" looks very specific ...17:23
fungijeblair: seems probable, but would be good to know what that reason is17:23
notmorganclarkb: i *still* don't like that. it makes my skin crawl [then again i don't like piling tons of things on the same system, so dynamic up/down scaling is less predictable]-- i might be stuck in the past of systemsengineering/admin though :P17:23
*** nwkarsten has quit IRC17:24
clarkbnotmorgan: the argument for it is it allows your system to only do the work necessary rather than having a gazillion daemons all hanging out waiting for connections.17:24
*** salv-orlando has quit IRC17:24
*** nwkarsten has joined #openstack-infra17:24
mtreinishclarkb: I'm around now, what do I need to check?17:24
*** kushal has joined #openstack-infra17:25
fungijeblair: if you `sudo systemctl status -l git-daemon@16841920-104.239.146.131:29418-104.130.246.128:55211` for example, you'll see they're recent17:25
clarkbmtreinish: just double check that the db is being updated properly and the mysql proxy works17:25
clarkbmtreinish: we replaced the instance so before I delete the old one want to make sure the new one is happy17:25
*** shashank_hegde has quit IRC17:25
mtreinishclarkb: nope, can't connect17:25
notmorganclarkb: sure. i also have historically been dealing with environments where it's spike-y enough to justify having daemons lingering around (video games), and spinning up new servers/vms on demand with fixed utilization to handle larger/smaller loads (since you can't share the system resources cleanly)17:25
mtreinishclarkb: http://paste.openstack.org/show/505935/17:25
notmorganclarkb: so different backgrounds ;)17:26
*** ilyashakhat has quit IRC17:26
*** HeOS has quit IRC17:27
*** Goneri has joined #openstack-infra17:27
*** tqtran has joined #openstack-infra17:27
clarkbmtreinish: start-stop-daemon: user 'logstash' not found17:28
clarkbmtreinish: we are using a user that isn't on that host because we don't install logstash there17:28
*** ddieterly is now known as ddieterly[away]17:28
*** nwkarsten has quit IRC17:28
*** gomarivera has quit IRC17:29
mtreinishclarkb: hmm, ok. I guess we should update the puppet setting up simpleproxy17:29
clarkbmtreinish: this is a simple fix, will have a patch soon17:29
mtreinishclarkb: ok17:29
mtreinishclarkb: also looking at openstack-health it doesn't look like there is any data in the db since 2:00am (I think it's utc, but I'm not sure)17:30
*** maestro has joined #openstack-infra17:32
clarkbmtreinish: I have kicked the subunit gearman worker it was hanging out waiting for gearman jobs and not getting any17:32
*** yamamoto has quit IRC17:33
*** yamamoto has joined #openstack-infra17:34
pabelangerfungi: I've manually promoted 315894,3 in the gate queue, the tox-db-legacy_drivers job was hung at 4+ hours and I couldn't see any nodes it was actually using17:34
pabelangerthat should help clear out the integrated queue17:34
*** pfallenop has quit IRC17:35
pabelangersame problem is happening for a few jobs in check17:35
*** ayoung has joined #openstack-infra17:35
*** yamamoto has quit IRC17:37
*** Kaiyan has quit IRC17:39
*** Na3iL has quit IRC17:40
*** ihrachys has quit IRC17:40
openstackgerritClark Boylan proposed openstack-infra/puppet-simpleproxy: Create a simpleproxy user  https://review.openstack.org/32228417:40
mtreinishclarkb: on o-h it looks like subunit2sql just started getting data again17:40
clarkbmtreinish: pabelanger ^ I think that is the fix for the proxy17:41
pabelangerclarkb: where does that run?17:43
pabelangernever see simpleproxy before17:43
*** vdrok has quit IRC17:43
pabelangerseen*17:43
*** vdrok has joined #openstack-infra17:44
clarkbpabelanger: on logstash.openstack.org, it provides read only access to the trove subunit2sql db17:44
pabelangerthanks17:44
clarkbotherwise you have to be on the rax network and know what the instance name/ip is17:44
*** vdrok has quit IRC17:45
*** vdrok has joined #openstack-infra17:45
*** nadya has joined #openstack-infra17:45
*** vdrok has quit IRC17:46
*** roxanaghe has quit IRC17:46
*** pfallenop has joined #openstack-infra17:47
*** ociuhandu has joined #openstack-infra17:47
*** roxanaghe has joined #openstack-infra17:48
mtreinishpabelanger: oh, you missed way back when I was working on getting a proxy setup. Had a lot of fun trying to get mysql proxy to work17:49
*** thorst_ has joined #openstack-infra17:49
mtreinishit turns out you could DOS mysql proxy with telnet17:49
mtreinishso we just went with a tcp proxy17:49
*** links has joined #openstack-infra17:49
*** thorst_ has quit IRC17:50
*** thorst_ has joined #openstack-infra17:50
openstackgerritJohn Trowbridge proposed openstack-infra/tripleo-ci: Change DLRN promote method  https://review.openstack.org/32180117:51
*** sdague has quit IRC17:51
pabelangermtreinish: Oh, neat17:51
clarkbinfra-root crinkle https://review.openstack.org/322284 will get logstash.o.o sorted and we can finish its trusty upgrade17:51
*** _vs has joined #openstack-infra17:51
*** shashank_hegde has joined #openstack-infra17:52
*** twm2016 has joined #openstack-infra17:52
pabelangerinfra-root: jenkins05 looks offline, going to start the recovery process17:52
*** sdake_ has joined #openstack-infra17:52
twm2016Is this channel the place to ask questions related, to gerrit?17:52
mtreinishpabelanger: https://bugs.launchpad.net/ubuntu/+source/mysql-proxy/+bug/140201117:53
openstackLaunchpad bug 1402011 in mysql-proxy (Ubuntu) "telnet crashes mysql-proxy" [Undecided,New]17:53
clarkbtwm2016: if you are looking for review.openstack.org help then yes, but general gerrit questions may be better directed at #gerrit or their google group (we can still try to help though)17:53
crinkleclarkb: i don't think the user resource creates the homedir automatically, is that okay?17:53
mtreinishit doesn't seem to have moved at all, I guess it wasn't a high priority17:53
clarkbcrinkle: yup the package creates that dir17:53
clarkbcrinkle: which is why I chose it, you get dumped into the help docs dir if you ever su to that user17:54
twm2016clarkb: thanks, just asking for a friend :)17:54
crinkleah fungi got it17:54
fungiclarkb: crinkle: my only concern is that it might introduce a bootstrapping ordering issue since the user isn't created until after the service is installed/configured17:55
*** sdake has quit IRC17:55
*** twm2016 has left #openstack-infra17:55
*** vhosakot has quit IRC17:55
clarkbfungi: oh does the service.pp not depend on the init.pp as a whole?17:55
fungiif the simpleproxy package tries to start the service at installation, it will likely fail17:55
openstackgerritEmilien Macchi proposed openstack-infra/project-config: tempest: move puppet jobs from exp to check pipeline  https://review.openstack.org/32117417:55
clarkbif so then yes17:55
EmilienMoomichi: ^17:55
clarkb(I sort of operated under the assumption it did but that wasn't double checked by me)17:55
openstackgerritEmilien Macchi proposed openstack-infra/project-config: puppet: move puppet4 jobs into check pipeline  https://review.openstack.org/32183717:56
*** julim has quit IRC17:56
openstackgerritEmilien Macchi proposed openstack-infra/project-config: puppet: move xenial integrations jobs into gate  https://review.openstack.org/32217717:56
*** vhosakot has joined #openstack-infra17:57
mtreinishcrinkle: any ideas on what I did wrong here: https://review.openstack.org/#/c/321147/ I'm not sure how that change broke the beaker tests17:57
*** bpokorny_ has joined #openstack-infra17:57
fungiclarkb: there's likely a bit of a dependency loop there if you rely on the package to create the homedir, but configure the package to start as a user that depends on the package being installed first17:57
pabelanger#status log jenkins05.o.o back online17:58
openstackstatuspabelanger: finished logging17:58
*** deadnull_ has quit IRC17:58
fungiclarkb: er, configure the service to start, i meant17:58
clarkbfungi: it should be package <- user <- service17:58
crinklemtreinish: looks like its 500ing :(17:58
clarkbwith package and user happening first because they are in the init manifest and service happening after because it is in the service manifest but I don't know that this bit is enforced in the puppet17:59
*** eezhova has quit IRC17:59
pabelangerand cleaning up jenkins06 now17:59
*** _vs has quit IRC17:59
mtreinishcrinkle: oh, I know what it is thanks18:00
mtreinishcrinkle: http://logs.openstack.org/47/321147/2/check/gate-puppet-openstack_health-puppet-beaker-rspec-ubuntu-trusty/b4a3b2e/logs/apache/openstack-health-api-error.txt.gz18:00
crinklemtreinish: neat18:00
clarkbya it only requires the package not the entirety of init.pp18:00
*** bpokorny has quit IRC18:01
EmilienMhey infra folks, I know most of you are busy in sprint but if someone has time, I have some project-config changes to improve our Puppet OpenStack CI, https://goo.gl/Sa8cSx - thanks18:01
mtreinishcrinkle: https://review.openstack.org/#/c/321202/4 should fix it (well by accident I fixed it in there without even thinking about it)18:01
*** EricGonczer_ has joined #openstack-infra18:02
*** Goneri has quit IRC18:02
*** cloudtrainme has joined #openstack-infra18:02
fungiclarkb: oh, i see the service definition is elsewhere18:02
crinklemtreinish: accidentally fixing things is the best18:02
fungiclarkb: so probably not an issue as long as the simpleproxy package doesn't try to start the service automatically at installation (which is somewhat typical for debian packages, but maybe not this one as it's a lot more generic and configuration-dependent)18:03
mtreinishfungi: luckily simpleproxy isn't actually a daemon so that shouldn't be an issue18:04
fungiclarkb: and also, we're feeding it our own initscript, so almost certainly not18:04
mtreinishfungi: we wrote our own initscript for it18:04
fungiyeah, just realized that18:04
clarkbfungi: right18:04
fungiso anyway, lgtm18:04
*** dizquierdo has quit IRC18:05
pabelanger#status log jenkins06.o.o back online18:06
openstackstatuspabelanger: finished logging18:06
*** gomarivera has joined #openstack-infra18:06
*** sdake_ is now known as sdake18:06
*** flepied has quit IRC18:06
*** cloudtrainme has quit IRC18:07
*** Sukhdev_ has joined #openstack-infra18:07
*** EricGonc_ has joined #openstack-infra18:08
notmorganjeblair: so in doing the py3 things for nodepool (easy way to look at all the code), i think I'm going to marshal b'' -> str vs force everything to b''. it seems like it would be more fragile/harder to maintain18:09
notmorganjeblair: unless i am misunderstanding something about gear and a requirement for things to be in b'' form (i know ZK is coming, but i'm getting familiar with the codebase)18:10
*** links has quit IRC18:10
clarkbnotmorgan: gear shuffles bytes around not python strings18:11
notmorganclarkb: ok, so i'll need to be aware that when it drops into gear it needs to be marshalled back to bytes18:11
clarkbso at least at the edges where you submit and receive jobs you will need to encode/decode18:11
*** EricGonczer_ has quit IRC18:11
fungiyeah, dealing with data at the protocol level there18:12
*** jheroux has quit IRC18:12
notmorganclarkb: yeah. thats fine -- easier to not need to remember to make everything b'' in the codebase though18:12
fungiif you do have wrapper functions which are handling the protocol layer communication and do all the encoding/decoding within them, then the rest of the program can just assume strings and not need to care18:13
*** jamesmcarthur has quit IRC18:13
*** degorenko is now known as _degorenko|afk18:15
fungiwhich i guess is another way to describe marshalling18:16
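The wrapper-function idea described above could be sketched roughly as follows. This is an illustration under stated assumptions, not nodepool's or gear's actual code: the names `to_wire` and `from_wire` are hypothetical, and the use of UTF-8 JSON for the payload is an assumption made for the example. The point is only that encoding and decoding happen at the protocol edge, so the rest of the program can work with ordinary strings.

```python
import json


def to_wire(name: str, arguments: dict) -> tuple:
    """Encode a job name and its arguments to bytes just before submission.

    Hypothetical helper: the UTF-8 JSON payload format is an assumption.
    """
    return name.encode("utf-8"), json.dumps(arguments).encode("utf-8")


def from_wire(data: bytes) -> dict:
    """Decode a bytes payload received from the protocol layer back to str-keyed data."""
    return json.loads(data.decode("utf-8"))


# The calling code only ever sees str and dict; bytes exist only at the edge.
wire_name, wire_payload = to_wire("image-build", {"image": "example-image"})
print(type(wire_name).__name__, type(wire_payload).__name__)
print(from_wire(wire_payload))
```

Keeping the conversion in two small functions means a missed `b''` literal elsewhere in the codebase cannot silently mix bytes and text.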
*** mtanino has joined #openstack-infra18:16
*** mtanino has quit IRC18:16
*** kzaitsev_mb has quit IRC18:16
*** piet has quit IRC18:20
*** piet has joined #openstack-infra18:21
fungiokay, i've gotten systemd-journald successfully started again on all the git servers18:21
notmorganfungi: basically thats the plan I'm going with18:23
fungistrangely, the rngd service is reported as failed on all the git servers except git0818:23
*** yamahata has quit IRC18:23
notmorganfungi: woo ^5, glad my "hey these servers are b0rked" comment helped discover a separate issue.18:23
fungiwell, i still haven't gotten to the bottom of the git issues you were seeing18:24
*** yamahata has joined #openstack-infra18:24
notmorganfungi: but hey, not having systemd-journal running is bad, so...18:25
*** nwkarsten has joined #openstack-infra18:25
*** Goneri has joined #openstack-infra18:25
fungiStarting Hardware RNG Entropy Gatherer Daemon... Unable to open file: /dev/tpm0... can't open any entropy source... Maybe RNG device modules are not loaded... rngd.service: main process exited, code=exited, status=1/FAILURE18:26
notmorganfungi: maybe the physical hosts under those VMs aren't exposing it?18:26
fungipossible18:27
*** ddieterly[away] has quit IRC18:28
fungiwell, on git08 i still get "Unable to open file: /dev/tpm0" during startup, but the service is up and running18:29
*** pvaneck has joined #openstack-infra18:29
fungiso presumably it found a different entropy source. maybe lsmod will enlighten me18:29
clarkbhrm we may not have the entropy package thing installed on centos18:30
clarkbwe do on ubuntu18:30
clarkbhaveged?18:30
*** jerryz has joined #openstack-infra18:30
*** kushal has quit IRC18:33
fungithe only kernel module difference between 01 and 08 is that 01 has intel_rapl loaded, so i doubt that's related18:33
fungihaveged isn't installed on either of them18:34
fungithough we should probably do that18:34
*** nwkarsten has quit IRC18:35
*** nwkarsten has joined #openstack-infra18:35
openstackgerritMerged openstack-infra/puppet-simpleproxy: Create a simpleproxy user  https://review.openstack.org/32228418:36
fungicpu flags ftw!18:36
fungion git08, /proc/cpuinfo indicates rdrand is present, while on git01 it is not18:36
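The check fungi ran by hand could be scripted roughly as follows. This is a sketch that assumes the usual Linux `/proc/cpuinfo` layout, with a `flags` line holding space-separated flag names; the function names are illustrative.

```python
def cpu_flags(cpuinfo_text):
    """Return the set of CPU flags from the first 'flags' line of cpuinfo text."""
    for line in cpuinfo_text.splitlines():
        key, sep, value = line.partition(":")
        if sep and key.strip() == "flags":
            return set(value.split())
    return set()


def has_rdrand(path="/proc/cpuinfo"):
    """Report whether the CPU (as exposed to this guest) advertises rdrand."""
    with open(path) as f:
        return "rdrand" in cpu_flags(f.read())
```

Running `has_rdrand()` on each server would show which guests have the instruction exposed by their hypervisor.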
*** yamamoto has joined #openstack-infra18:38
*** piet has quit IRC18:38
jeblairclarkb, fungi, mtreinish, (where is sdague?): http://www.fedmsg.com/en/latest/18:38
*** sdague has joined #openstack-infra18:39
mtreinishjeblair: I think sdague is at home depot18:39
*** piet has joined #openstack-infra18:39
jeblairmaybe i will see him there this weekend18:39
fungiheh18:39
clarkbI should be at home depot18:39
sdagueI just got back18:39
notmorganfungi: yep. ok18:39
jeblair(i prefer lowes when possible; it's not always possible)18:39
*** nwkarsten has quit IRC18:39
jeblairsdague: http://www.fedmsg.com/en/latest/18:40
notmorganfungi: so it's a VM / host issue *shrug*18:40
clarkbjeblair: that looks like it uses zmq18:40
jeblairbummer18:40
clarkbwhich automatically sort of puts it in the bin of probably not a good idea for me18:40
jeblairit's in my don't touch it with a 10ft pole bin18:40
*** e0ne has joined #openstack-infra18:40
sdagueyeh, fedmsg seems neat in concept, I just wish they used a proper bus18:40
sdagueI conceptually want the same thing as fedmsg18:41
jeblairi'll ask em18:41
clarkbbut with proper error handling18:41
*** maestro has quit IRC18:41
fungi(but with more working!)18:41
*** EricGonc_ has quit IRC18:42
sdagueright, the nice thing about mosquitto is there is a ton of stuff to talk mqtt, including arduino code :)18:42
*** ilyashakhat has joined #openstack-infra18:43
*** banix has quit IRC18:43
sdaguehttps://github.com/mqtt/mqtt.github.io/wiki/libraries18:44
*** roxanaghe has quit IRC18:44
clarkb0mq also has a really volatile community18:44
clarkbthere are 2 or 3 forks now that have decided backward compat is impossible and even they have then had similar issues with development18:45
sdagueright, it's also much lower level and you have to build the semantics yourself. vs. a semantic pub / sub18:45
jeblairyeah, i like what i've seen of mqtt18:46
sdaguethe retain and will concepts also let you build proactive status reporting, where you can make a part of your subtree the pub status, so it's easy to know that a publisher went goofy18:46
*** yamamoto has quit IRC18:48
*** ilyashakhat has quit IRC18:49
mtreinishooh, http://status.openstack.org/openstack-health/#/ finally is showing an elastic-recheck hit (well 3 of them)18:50
clarkbmtreinish: did we do a mass cleanup of the bug list yet?18:50
mtreinishclarkb: yeah I did that before we landed the o-h change18:50
mtreinishclarkb: https://review.openstack.org/#/c/315765/18:51
*** _sarob has quit IRC18:51
clarkbnice18:51
*** amoralej is now known as amoralej|off18:52
*** markusry has joined #openstack-infra18:52
*** gomarivera has quit IRC18:53
*** javeriak_ has quit IRC18:55
*** javeriak has joined #openstack-infra18:55
clarkbmtreinish: ok try the mysql proxy now18:56
mtreinishclarkb: it works!18:57
clarkbyay, so from your end we are good ya?18:57
mtreinishclarkb: I think so18:57
*** csomerville has joined #openstack-infra18:57
clarkbgreat, any objections to deleting the old logstash.o.o to complete the trusty update?18:57
mtreinishclarkb: go for it18:58
*** bpokorny_ has quit IRC18:58
*** flepied has joined #openstack-infra18:58
*** Sukhdev_ has quit IRC18:59
*** bpokorny has joined #openstack-infra18:59
*** somerville32 has joined #openstack-infra18:59
*** markusry has quit IRC19:01
*** csomerville has quit IRC19:02
mtreinishclarkb, fungi, jeblair, nibalizer: I'm getting 500s on: http://health.openstack.org/tests/recent/fail if you get a sec can you pull out the stacktrace from the log19:02
nibalizeroh i got it19:02
nibalizeri have my script and everything19:02
nibalizerhttp://paste.openstack.org/show/505946/ boom19:03
*** ayoung has quit IRC19:03
*** ayoung has joined #openstack-infra19:03
mtreinishnibalizer: heh, nice19:03
mtreinishoh, that's more than I was expecting19:03
mtreinishhmm, half of that is elasticsearch errors19:04
*** mtanino has joined #openstack-infra19:04
mtreinishthe other stuff looks like it's related to the change which just landed19:04
clarkbmtreinish: pabelanger is in the process of upgrading the entire cluster right now19:04
clarkbwhich could slow queries and or make hosts unavailable19:04
mtreinishclarkb: yeah I figured that's what the es errors were related too19:05
mtreinishs/too/to19:05
*** sdake_ has joined #openstack-infra19:09
clarkbnibalizer: any chance you are going to be able to work on puppetdb today? I see your name on it19:10
*** sdake has quit IRC19:10
*** inc0 has quit IRC19:11
*** _ari_|afk has quit IRC19:11
nibalizeryes my name is on it19:13
*** e0ne has quit IRC19:13
nibalizeri have an errand to run but can kick it this afternoon19:14
nibalizerif someone else really wants it they can go for it19:14
nibalizerthere is a puppetdb01 that got created but it doesn't work :(19:14
mtreinishnibalizer: https://review.openstack.org/#/c/322304/ should fix it19:14
openstackgerritAntoine Musso proposed openstack/diskimage-builder: dpkg: fake initctl version now parseable by puppet  https://review.openstack.org/32230519:14
*** nadya has quit IRC19:15
*** ddieterly has joined #openstack-infra19:17
*** e0ne has joined #openstack-infra19:18
*** e0ne has quit IRC19:18
*** e0ne has joined #openstack-infra19:18
*** sdake has joined #openstack-infra19:18
*** daemontool has quit IRC19:18
*** eezhova has joined #openstack-infra19:19
*** rbrndt has quit IRC19:19
*** sdake_ has quit IRC19:19
openstackgerritColleen Murphy proposed openstack-infra/puppet-bandersnatch: Fix acceptance tests  https://review.openstack.org/32006819:20
*** e0ne has quit IRC19:20
*** javeriak_ has joined #openstack-infra19:20
*** ilyashakhat has joined #openstack-infra19:20
openstackgerritsebastian marcet proposed openstack-infra/openstackid-resources: Upgrade Laravel and ORM  https://review.openstack.org/32230719:21
openstackgerritColleen Murphy proposed openstack-infra/puppet-bandersnatch: Fix acceptance tests  https://review.openstack.org/32006819:22
*** javeriak has quit IRC19:24
*** e0ne has joined #openstack-infra19:25
*** e0ne has quit IRC19:27
*** gomarivera has joined #openstack-infra19:27
*** burgerk has joined #openstack-infra19:28
*** dimtruck is now known as zz_dimtruck19:28
*** eezhova has quit IRC19:29
*** ddieterly has quit IRC19:31
*** e0ne has joined #openstack-infra19:31
*** chem`` has quit IRC19:31
*** chem`` has joined #openstack-infra19:31
*** burgerk has quit IRC19:33
*** e0ne has quit IRC19:34
*** markusry has joined #openstack-infra19:37
fungiclarkb: it looks like we only install haveged on job nodes and on servers where we install kerberos. i wonder if we should expand it to be installed in our template class or something?19:39
*** salv-orlando has joined #openstack-infra19:39
clarkbprobably a good idea19:39
fungiat least that's what i'm interpreting http://codesearch.openstack.org/?q=haveged to indicate19:40
clarkbIt doesn't reduce our security tremendously right?19:41
*** mixos has joined #openstack-infra19:41
fungishouldn't reduce it at all19:41
*** gomarivera has quit IRC19:41
fungimixing more sources into an entropy pool, as long as the mixing algorithm is cryptographically sound, should never reduce the amount of entropy19:42
fungieven if the additional sources are completely non-entropic19:42
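A toy illustration of fungi's point, using XOR as a deliberately simple stand-in for a mixing function (real entropy pools use cryptographic mixing, which this does not model). It assumes an idealized good source in which every byte value is equally likely, and mixes in a constant, fully predictable source. XOR with a fixed value is a bijection on bytes, so the mixed output is just as uniform as the good source alone.

```python
import math
from collections import Counter


def shannon_entropy_bits(values):
    """Shannon entropy, in bits, of the empirical distribution of values."""
    counts = Counter(values)
    total = sum(counts.values())
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def xor_mix(a, b):
    """Mix two byte values; XOR is a simple stand-in for a real mixer."""
    return a ^ b


# Idealized good source: each of the 256 byte values appears exactly once.
good_source = list(range(256))
# Worst-case extra source: a constant with no entropy at all.
constant_source = 0x5A

mixed = [xor_mix(x, constant_source) for x in good_source]

entropy_before = shannon_entropy_bits(good_source)
entropy_after = shannon_entropy_bits(mixed)
```

Comparing `entropy_before` and `entropy_after` shows that the non-entropic input leaves the per-byte entropy unchanged.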
*** openstack has joined #openstack-infra21:43
*** lascii is now known as alaski21:43
openstackgerritMorgan Fainberg proposed openstack-infra/nodepool: Python 3 Fix: cmp -> key function  https://review.openstack.org/32191921:46
openstackgerritMorgan Fainberg proposed openstack-infra/nodepool: Python 3 fix: Use new-style raise syntax  https://review.openstack.org/32192621:46
openstackgerritMorgan Fainberg proposed openstack-infra/nodepool: Python 3 Fixes: Encode config write in tests  https://review.openstack.org/32192721:46
openstackgerritMorgan Fainberg proposed openstack-infra/nodepool: Python 3 fixes: dict.iteritems  https://review.openstack.org/32192821:46
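For readers unfamiliar with the porting patterns named in the change titles above, here is a generic sketch of each. It is not the content of the nodepool changes themselves; all names and data are made up for illustration.

```python
import functools


def compare_by_age(a, b):
    """Old-style comparison function: negative, zero, or positive result."""
    return (a["age"] > b["age"]) - (a["age"] < b["age"])


nodes = [{"name": "b", "age": 5}, {"name": "a", "age": 2}]

# Python 2 spelling: sorted(nodes, cmp=compare_by_age)
# Python 3 drops cmp=; wrap the old function, or better, use a key function.
by_cmp = sorted(nodes, key=functools.cmp_to_key(compare_by_age))
by_key = sorted(nodes, key=lambda n: n["age"])

# Python 2 spelling: for k, v in labels.iteritems()
labels = {"trusty": 1, "centos7": 2}
pairs = sorted(labels.items())


def fail(msg):
    """Python 2 also allowed `raise ValueError, msg`; Python 3 does not."""
    raise ValueError(msg)


# Files opened in binary mode need bytes, so encode text before writing.
config_bytes = "labels: []\n".encode("utf-8")
```

The key-function form is usually preferable to `cmp_to_key` when the ordering depends on a single attribute.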
*** gordc has quit IRC21:46
*** esker has quit IRC21:47
*** tlian has quit IRC21:47
openstackgerritJames E. Blair proposed openstack-infra/zuul: Ansible launcher: support static workers  https://review.openstack.org/32156921:49
openstackgerritJames E. Blair proposed openstack-infra/zuul: Ansible launcher: some ansible fixes  https://review.openstack.org/32233121:49
*** jamesmcarthur has quit IRC21:50
*** rbradfor is now known as rbradf_not_found21:51
*** amrith is now known as _amrith_21:53
*** johnny___ has quit IRC21:53
openstackgerritJames E. Blair proposed openstack-infra/zuul: Ansible launcher: handle JJB with no macros  https://review.openstack.org/32233221:54
