Thursday, 2019-07-18

*** weifan has quit IRC00:01
*** ociuhandu has joined #openstack-infra00:03
*** jamesmcarthur has quit IRC00:06
*** jamesmcarthur has joined #openstack-infra00:08
*** jamesmcarthur has quit IRC00:09
*** jamesmcarthur has joined #openstack-infra00:09
*** slaweq has joined #openstack-infra00:11
*** slaweq has quit IRC00:16
*** iurygregory has quit IRC00:17
*** ociuhandu has quit IRC00:17
*** jamesmcarthur has quit IRC00:33
*** jamesmcarthur has joined #openstack-infra00:33
*** jamesmcarthur has quit IRC00:39
*** gyee has quit IRC00:46
*** goldyfruit has quit IRC00:50
*** goldyfruit has joined #openstack-infra00:53
*** betherly has joined #openstack-infra00:55
*** betherly has quit IRC01:00
*** jamesmcarthur has joined #openstack-infra01:09
*** apetrich has quit IRC01:12
*** ociuhandu has joined #openstack-infra01:15
*** betherly has joined #openstack-infra01:15
*** jamesmcarthur has quit IRC01:15
*** imacdonn has quit IRC01:16
*** imacdonn has joined #openstack-infra01:17
*** ramishra has joined #openstack-infra01:20
*** betherly has quit IRC01:20
*** ruffian_sheep has joined #openstack-infra01:23
*** ociuhandu has quit IRC01:28
*** whoami-rajat has joined #openstack-infra01:31
*** jamesmcarthur has joined #openstack-infra01:40
*** apetrich has joined #openstack-infra01:44
*** betherly has joined #openstack-infra01:57
*** apetrich has quit IRC01:58
*** betherly has quit IRC02:02
*** goldyfruit has quit IRC02:07
*** goldyfruit has joined #openstack-infra02:08
*** mattw4 has quit IRC02:09
openstackgerritMasayuki Igawa proposed opendev/subunit2sql master: Fix gate error "mysqladmin: command not found"  https://review.opendev.org/67137902:11
*** slaweq has joined #openstack-infra02:11
*** slaweq has quit IRC02:15
*** ociuhandu has joined #openstack-infra02:26
*** yamamoto has joined #openstack-infra02:27
openstackgerritMasayuki Igawa proposed opendev/subunit2sql master: Fix gate error "mysqladmin: command not found"  https://review.opendev.org/67137902:27
fungipabelanger: here's a question which might be right up your alley... http://lists.openstack.org/pipermail/openstack-discuss/2019-July/007865.html02:33
fungi(someone has mistakenly e-mailed an openstack mailing list looking for asterisk help)02:33
openstackgerritMasayuki Igawa proposed opendev/subunit2sql master: Fix gate error "mysqladmin: command not found"  https://review.opendev.org/67137902:37
*** ociuhandu has quit IRC02:38
*** bhavikdbavishi has joined #openstack-infra02:44
*** bhavikdbavishi has quit IRC02:48
*** jamesmcarthur has quit IRC02:54
*** jamesmcarthur has joined #openstack-infra02:55
*** jamesmcarthur has quit IRC02:59
*** yamamoto has quit IRC03:02
openstackgerritMasayuki Igawa proposed opendev/subunit2sql master: Fix gate error "mysqladmin: command not found" and etc  https://review.opendev.org/67137903:02
*** betherly has joined #openstack-infra03:09
*** betherly has quit IRC03:14
*** yamamoto has joined #openstack-infra03:16
*** yamamoto has quit IRC03:16
*** yamamoto has joined #openstack-infra03:17
openstackgerritMasayuki Igawa proposed opendev/subunit2sql master: Fix gate error "mysqladmin: command not found" and etc  https://review.opendev.org/67137903:18
*** yamamoto has quit IRC03:21
*** goldyfruit has quit IRC03:30
*** redrobot has quit IRC03:32
*** rh-jelabarre has quit IRC03:34
*** jamesmcarthur has joined #openstack-infra03:35
*** psachin has joined #openstack-infra03:36
*** ociuhandu has joined #openstack-infra03:36
*** Guest99405 has joined #openstack-infra03:39
*** betherly has joined #openstack-infra03:40
*** bhavikdbavishi has joined #openstack-infra03:44
*** betherly has quit IRC03:45
*** ociuhandu has quit IRC03:50
openstackgerritMasayuki Igawa proposed opendev/subunit2sql master: Add .zuul.yaml and python 3 Train unit tests  https://review.opendev.org/67138903:51
*** slaweq has joined #openstack-infra04:11
*** yamamoto has joined #openstack-infra04:13
*** apetrich has joined #openstack-infra04:16
*** slaweq has quit IRC04:16
lennybfungi, thanks. The outdated https://github.com/openstack/networking-mlnx repo is no longer needed by us. What is the procedure for the infra team to delete/archive it? We would prefer not to have this repo at all, to prevent users from using outdated code.04:19
*** yamamoto_ has joined #openstack-infra04:20
*** yamamoto has quit IRC04:20
*** aaronsheffield has joined #openstack-infra04:21
*** yamamoto has joined #openstack-infra04:21
*** yamamoto_ has quit IRC04:24
*** sgw has joined #openstack-infra04:26
*** ykarel|away has joined #openstack-infra04:37
openstackgerritMerged opendev/subunit2sql master: Fix gate error "mysqladmin: command not found" and etc  https://review.opendev.org/67137904:38
*** udesale has joined #openstack-infra04:39
*** ociuhandu has joined #openstack-infra04:48
*** yamamoto has quit IRC04:52
openstackgerritSimon Westphahl proposed zuul/zuul master: Record and report time for trigger events  https://review.opendev.org/67120404:56
*** yamamoto has joined #openstack-infra04:57
*** auristor has quit IRC04:59
*** ociuhandu has quit IRC05:00
*** auristor has joined #openstack-infra05:03
*** ykarel|away has quit IRC05:04
*** jamesmcarthur has quit IRC05:10
*** slaweq has joined #openstack-infra05:11
*** slaweq has quit IRC05:15
*** yamamoto has quit IRC05:18
*** ykarel|away has joined #openstack-infra05:20
*** ykarel|away is now known as ykarel05:24
*** yamamoto has joined #openstack-infra05:31
*** ccamacho has quit IRC05:33
*** pcaruana has joined #openstack-infra05:44
*** yamamoto has quit IRC05:45
*** yamamoto has joined #openstack-infra05:46
*** yamamoto has quit IRC05:46
*** yamamoto has joined #openstack-infra05:49
*** yamamoto has quit IRC05:49
openstackgerritMerged opendev/subunit2sql master: Add attachments flag to get_test_runs_by_status  https://review.opendev.org/55215205:52
*** ociuhandu has joined #openstack-infra05:57
*** logan- has quit IRC06:00
*** logan_ has joined #openstack-infra06:01
*** logan_ is now known as logan-06:01
AJaegerlennyb: https://docs.openstack.org/infra/manual/drivers.html#retiring-a-project06:01
*** ociuhandu has quit IRC06:02
*** dpawlik has joined #openstack-infra06:04
*** ruffian_sheep14 has joined #openstack-infra06:12
*** igordc has quit IRC06:14
*** ruffian_sheep has quit IRC06:14
*** ruffian_sheep14 is now known as ruffian_sheep06:18
openstackgerritOpenStack Proposal Bot proposed openstack/project-config master: Normalize projects.yaml  https://review.opendev.org/67140706:23
*** yamamoto has joined #openstack-infra06:26
*** slaweq has joined #openstack-infra06:28
*** strigazi has quit IRC06:31
*** raukadah is now known as chandankumar06:35
*** yamamoto has quit IRC06:35
fungiAJaeger: in this case lennyb is asking about just deleting the stale mirror of it on github06:43
funginot retiring the project (it will continue to be maintained in opendev)06:44
*** rcernin has quit IRC06:44
fungilennyb: i'm about to go to sleep (nearly 3am here) but can do it when i wake up06:46
*** joeguo has quit IRC06:47
fungilennyb: if one of the people listed here https://review.opendev.org/#/admin/groups/networking-mlnx-release can e-mail the openstack-infra mailing list requesting deletion of the stale copy on github, that should be sufficient authority (optionally, get one of them to add you to that group and then i'm happy to do it on your authority)06:48
*** odicha has joined #openstack-infra06:49
* fungi disappears for some low-power standby time06:50
*** yamamoto has joined #openstack-infra06:51
openstackgerritMasayuki Igawa proposed openstack/project-config master: Cleanup in-tree removed jobs  https://review.opendev.org/67141206:55
*** kopecmartin|off is now known as kopecmartin06:59
*** ociuhandu has joined #openstack-infra07:00
openstackgerritMerged openstack/project-config master: Normalize projects.yaml  https://review.opendev.org/67140707:03
*** jbadiapa has joined #openstack-infra07:04
*** xek has joined #openstack-infra07:06
*** ginopc has joined #openstack-infra07:07
*** ociuhandu has quit IRC07:08
*** pgaxatte has joined #openstack-infra07:11
*** rpittau|afk is now known as rpittau07:11
*** aaronsheffield has quit IRC07:16
AJaegerfungi: good night - ah, missed the context. Thanks for explaining.07:27
*** ruffian_sheep16 has joined #openstack-infra07:36
*** ccamacho has joined #openstack-infra07:36
*** ruffian_sheep has quit IRC07:36
*** ykarel is now known as ykarel|lunch07:37
*** ruffian_sheep16 is now known as ruffian_sheep07:41
*** ruffian_sheep has quit IRC07:52
*** dchen has quit IRC07:55
*** ralonsoh has joined #openstack-infra07:56
*** dchen has joined #openstack-infra07:56
*** lucasagomes has joined #openstack-infra07:56
*** ociuhandu has joined #openstack-infra08:07
*** pkopec has joined #openstack-infra08:07
*** dchen has quit IRC08:07
*** dtantsur|afk is now known as dtantsur08:09
*** yolanda has joined #openstack-infra08:09
*** yolanda has quit IRC08:10
*** ruffian_sheep has joined #openstack-infra08:10
*** ociuhandu has quit IRC08:11
*** ruffian_sheep has quit IRC08:16
*** ruffian_sheep has joined #openstack-infra08:19
*** derekh has joined #openstack-infra08:20
*** rascasoft has quit IRC08:25
*** rascasoft has joined #openstack-infra08:25
*** sshnaidm|afk is now known as sshnaidm08:27
*** tosky has joined #openstack-infra08:31
*** tkajinam has quit IRC08:40
*** xek has quit IRC08:42
*** xek has joined #openstack-infra08:43
*** iurygregory has joined #openstack-infra08:46
*** xek has quit IRC08:47
*** ruffian_sheep has quit IRC08:48
openstackgerritJoshua Hesketh proposed opendev/glean master: Sync when writing the file  https://review.opendev.org/65223808:48
*** yamamoto has quit IRC08:51
*** xek has joined #openstack-infra08:51
*** ruffian_sheep has joined #openstack-infra08:52
*** xek has quit IRC08:52
*** yamamoto has joined #openstack-infra08:57
*** ykarel|lunch is now known as ykarel09:00
*** arxcruz|ruck is now known as arxcruz09:01
*** apetrich has quit IRC09:06
*** apetrich has joined #openstack-infra09:07
openstackgerritJoshua Hesketh proposed opendev/system-config master: Update zuul status URL's to opendev  https://review.opendev.org/67143509:16
openstackgerritJoshua Hesketh proposed opendev/system-config master: Toggle CI should also hide old zuul comments  https://review.opendev.org/67143609:16
*** yamamoto has quit IRC09:17
*** apetrich has quit IRC09:19
openstackgerritNatal Ngétal proposed openstack/diskimage-builder master: [Configuration] Switch to stestr.  https://review.opendev.org/62941409:19
*** tdasilva_ has quit IRC09:25
*** tdasilva_ has joined #openstack-infra09:26
*** xek has joined #openstack-infra09:26
*** hrw has joined #openstack-infra09:26
hrwmorning09:26
hrwwho should I speak to when it comes to instances in linaro-london?09:27
*** electrofelix has joined #openstack-infra09:28
hrwwe created os.extra.large.infra flavour there to give 16vcpu/16GB ram instances for infra as current 8vcpu ones do not cope with kolla ci requirements09:28
*** apetrich has joined #openstack-infra09:31
*** tdasilva_ has quit IRC09:47
*** tdasilva_ has joined #openstack-infra09:47
mordredhrw: you want to make a patch to update the flavor name here: https://opendev.org/openstack/project-config/src/branch/master/nodepool/nl03.openstack.org.yaml#L401-L42409:49
*** yamamoto has joined #openstack-infra09:50
*** gfidente|afk is now known as gfidente09:52
*** yamamoto has quit IRC09:53
hrwmordred: thanks. will do09:54
openstackgerritMatthieu Huin proposed zuul/zuul master: Zuul CLI: allow access via REST  https://review.opendev.org/63631509:55
openstackgerritMatthieu Huin proposed zuul/zuul master: Use a requests session to simplify auth'd calls  https://review.opendev.org/67051109:56
openstackgerritMarcin Juszkiewicz proposed openstack/project-config master: Linaro London: use new bigger flavour  https://review.opendev.org/67144510:03
openstackgerritKien Nguyen proposed opendev/system-config master: Allow to rsync Centos Software Collections repo  https://review.opendev.org/67144910:07
hrwpatch created, sent, can move to other stuff10:09
mordredhrw: sweet!10:12
hrwmordred: time to add some reviewers ;D10:13
*** ruffian_sheep has quit IRC10:14
*** jamesmcarthur has joined #openstack-infra10:15
*** thiago__ has joined #openstack-infra10:15
*** tdasilva_ has quit IRC10:17
*** jamesmcarthur has quit IRC10:19
*** pkopec has quit IRC10:20
*** pkopec has joined #openstack-infra10:20
*** joeguo has joined #openstack-infra10:22
*** siqbal has joined #openstack-infra10:23
*** whoami-rajat has quit IRC10:30
*** yamamoto has joined #openstack-infra10:32
*** takamatsu has joined #openstack-infra10:36
*** yamamoto has quit IRC10:39
siqbalHi, I'm working on devstack and trying to place CommsPowerManagement (https://github.com/intel/CommsPowerManagement/) within devstack. Which file in devstack would be a suitable place to put this for SST? Any help is appreciated10:52
AJaegersiqbal: for devstack, best use #openstack-qa10:54
*** hrw has left #openstack-infra11:01
openstackgerritM V P Nitesh proposed openstack/diskimage-builder master: efibootmgr is need to create boot entries  https://review.opendev.org/67145611:01
siqbalAJaeger: Thanks, I'll try that channel11:04
*** yamamoto has joined #openstack-infra11:05
openstackgerritMonty Taylor proposed opendev/system-config master: Build a docker image of the existing gerrit 2.13  https://review.opendev.org/67145711:09
openstackgerritMerged openstack/pbr master: Fix parsing on egg names with dashes from git URLs  https://review.opendev.org/64872711:16
*** goldyfruit has joined #openstack-infra11:20
mordredknikolla: I just tried to use an application credential on MOC with the following clouds.yaml: http://paste.openstack.org/show/754541/ and I got Attempted to authenticate with an unsupported method. (HTTP 401) (Request-ID: req-ccced950-9dfb-4a44-81ed-c61af2951287)11:29
mordredknikolla: this is using python-openstackclient that I just installed via pip into a venv11:30
mordredknikolla: (creating the application credential worked great!)11:31
*** rh-jelabarre has joined #openstack-infra11:33
mordredcmurphy: ^^ also just keeping you in the loop, since you typically know everything11:34
*** tesseract has joined #openstack-infra11:38
*** eharney has quit IRC11:40
openstackgerritMonty Taylor proposed opendev/system-config master: Add clouds.yaml entry for MOC control plane project  https://review.opendev.org/67146311:51
mordredinfra-root: ^^ that doesn't work yet (see issue raised with knikolla) but I've put the secret into host_vars and have put information into root secrets11:51
*** rfolco is now known as rfolco|rover12:12
*** dpawlik has quit IRC12:16
mnasermordred, knikolla: i think the issue here is that keystone is not configured with application credentials enabled12:18
mnaserif i remember right, the api for app creds works regardless of whether it's enabled or not12:18
*** joeguo has quit IRC12:19
mnaser'application_credential' has to be inside [auth]/methods12:21
mordredmnaser: awesome - thanks!12:24
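[Editor's note] mnaser's point about `[auth]/methods` can be checked mechanically. The following is a minimal sketch, assuming the keystone configuration is an INI-style file readable by Python's `configparser`; the function name `app_creds_listed` and the sample contents are hypothetical and not taken from the MOC deployment. It only reports whether the method is explicitly listed. An unset option falls back to keystone's built-in default, which this sketch does not model.

```python
import configparser


def app_creds_listed(conf_text: str) -> bool:
    """Return True if 'application_credential' is explicitly listed in [auth]/methods.

    Hypothetical helper: a missing section or option yields False, which only
    means "not explicitly listed", not necessarily "disabled".
    """
    parser = configparser.ConfigParser(interpolation=None)
    parser.read_string(conf_text)
    methods = parser.get("auth", "methods", fallback="")
    return "application_credential" in [m.strip() for m in methods.split(",")]


# Illustrative configuration fragment (assumed values).
SAMPLE = """
[auth]
methods = external,password,token,application_credential
"""

if __name__ == "__main__":
    print(app_creds_listed(SAMPLE))
```

Run against the real file contents, a False result would be consistent with the "unsupported method" 401 that mordred saw, although it would not prove that this is the cause.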
mnaserhttp://lists.openstack.org/pipermail/openstack-infra/2019-July/006422.html12:25
mnaserscary typo in the title12:25
*** eharney has joined #openstack-infra12:26
*** udesale has quit IRC12:26
*** thiago__ has quit IRC12:26
*** thiago__ has joined #openstack-infra12:27
*** udesale has joined #openstack-infra12:27
mordredmnaser: hah12:28
*** thiago__ has quit IRC12:33
*** thiago__ has joined #openstack-infra12:33
*** tdasilva_ has joined #openstack-infra12:35
*** thiago__ has quit IRC12:38
openstackgerritJoshua Hesketh proposed opendev/system-config master: Add some pointers on the OpenDev PPA  https://review.opendev.org/67095212:40
openstackgerritMerged zuul/zuul-jobs master: Add add-launchpad-credentaials test job  https://review.opendev.org/67020712:42
openstackgerritMerged zuul/zuul-jobs master: Add add-sshkey test job  https://review.opendev.org/67020812:44
*** piotrowskim has quit IRC12:47
*** ekultails has joined #openstack-infra12:50
pabelangerfungi: ha12:53
openstackgerritTobias Henkel proposed zuul/zuul master: Evaluate CODEOWNERS settings during canMerge check  https://review.opendev.org/64455712:54
openstackgerritTobias Henkel proposed zuul/zuul master: Evaluate CODEOWNERS settings during canMerge check  https://review.opendev.org/64455712:56
*** gtarnaras has joined #openstack-infra12:58
*** aaronsheffield has joined #openstack-infra12:59
openstackgerritMerged opendev/system-config master: Add letsencrypt documentation  https://review.opendev.org/66960212:59
*** siqbal has quit IRC13:03
*** siqbal has joined #openstack-infra13:03
openstackgerritMerged zuul/zuul master: Record and report time for trigger events  https://review.opendev.org/67120413:05
stephenfinharlowja: You might be interested in http://lists.openstack.org/pipermail/openstack-discuss/2019-July/007878.html13:11
stephenfinharlowja: Assuming you agree, any chance I could get added as core on x/doc8 so I can close out the few open patches there?13:11
* stephenfin is clearly on a crusade to be core of allllll the projects13:12
*** Guest99405 is now known as redrobot13:13
*** dpawlik has joined #openstack-infra13:15
pabelangerfungi: replied13:17
openstackgerritStephen Finucane proposed openstack/project-config master: Ends project gating for Bandit  https://review.opendev.org/67148313:25
*** rfarr has joined #openstack-infra13:26
*** rfarr_ has joined #openstack-infra13:26
*** rfarr_ has quit IRC13:26
openstackgerritMerged zuul/zuul master: Parse nodeset.group.nodes as a list  https://review.opendev.org/67137213:28
mordredstephenfin: watch out - I'll make you core on sdk and then I'll make you be ptl and then I'll ghost13:32
*** siqbal90 has joined #openstack-infra13:32
openstackgerritMerged opendev/system-config master: Streamline documented bup setup process  https://review.opendev.org/66619413:33
stephenfin😨😨😨13:33
*** siqbal has quit IRC13:35
openstackgerritFrancis Deslauriers proposed opendev/git-review master: Add labels on change submission  https://review.opendev.org/66630113:36
*** beekneemech has joined #openstack-infra13:36
*** bnemec has quit IRC13:37
*** tdasilva_ has quit IRC13:41
knikollamordred: mnaser: oh right! I’ll make a note to enable app creds as an authentication method asap.13:41
*** tdasilva_ has joined #openstack-infra13:42
openstackgerritMerged opendev/system-config master: Remove apport package from ubuntu servers  https://review.opendev.org/65214313:43
mordredknikolla: woot!13:45
mordredknikolla: with app creds, if I want to use the cred by name, I need to provide username too, yes?13:46
*** sgw has quit IRC13:47
*** bhavikdbavishi has quit IRC13:50
*** rfarr has quit IRC13:53
openstackgerritMerged zuul/zuul master: Fix GitEventFilter.__init__  https://review.opendev.org/61198913:54
*** whoami-rajat has joined #openstack-infra13:55
mnaseri'd like some help, tripleo+openstack-ansible are looking to work together on sharing ansible code13:59
mnaseri'd like to create a group in gerrit called openstack-ansible-collab13:59
mnaserseeded with tripleo-ansible-core and openstack-ansible-core13:59
mnaserbut.. how13:59
mnaseri can't find a 'list of groups' inside gerrit/acls/openstack/*.config .. i just see things assigned to a group though14:00
*** mattmceuen has left #openstack-infra14:02
*** bhavikdbavishi has joined #openstack-infra14:03
fungimnaser: they're created automatically when they're referenced14:04
mnaseroh.. okay cool14:04
fungiso just update an acl to use that group for some permission, and then one of us can seed the membership for you to get you going14:04
mnaserok great, thanks fungi14:05
fungiany time!14:05
*** sgw has joined #openstack-infra14:05
*** tdasilva_ has quit IRC14:09
*** yolanda has joined #openstack-infra14:09
*** tdasilva_ has joined #openstack-infra14:09
*** ykarel is now known as ykarel|away14:12
*** tdasilva_ is now known as tdasilva14:15
*** ccamacho has quit IRC14:15
*** ykarel|away has quit IRC14:17
*** jamesmcarthur has joined #openstack-infra14:19
*** ccamacho has joined #openstack-infra14:20
*** Lucas_Gray has joined #openstack-infra14:22
*** iurygregory has quit IRC14:23
*** jamesmcarthur has quit IRC14:23
*** jamesmcarthur has joined #openstack-infra14:24
*** Wryhder has joined #openstack-infra14:25
*** Lucas_Gray has quit IRC14:27
*** Wryhder is now known as Lucas_Gray14:27
AJaegermnaser: you can also just add one group to another ACL, so no need to create a new ACL...14:27
fungiyes, one permission can be delegated to multiple groups14:29
fungior a group can contain a mix of other groups and individuals14:29
fungithe latter is a bit more flexible if you want to avoid making more changes to the acl over time14:30
fungihard-coding a list of groups into the acl means more acl patches to change that14:30
*** ricolin_ is now known as ricolin14:34
*** iurygregory has joined #openstack-infra14:40
*** jamesmcarthur has quit IRC14:43
*** priteau has joined #openstack-infra14:48
clarkbgitea06 has not OOM'd since the swapfile was added14:51
clarkbgitea01 however has OOM'd since we added haproxy logging14:52
mordredclarkb: both of those things are good14:52
mordredclarkb: I mean, ooming isn't good - but ooming with logging in place to find the source of the oom is better than the opposite14:53
*** kaiokmo has joined #openstack-infra14:58
clarkbthere is an IP in vexxhost that does a bunch of requests around when it happens14:59
clarkbdoes not appear to be one of ours (grepped system-config which has our inventory and the nodepool launcher logs)14:59
roman_gHello team. This https://review.opendev.org/#/c/668665/ change has 2xCR+2 and WF+1, and also has `Depends-on:` another change which has been merged already. How do I push Gerrit to merge it?15:02
*** kopecmartin is now known as kopecmartin|off15:03
clarkbroman_g: if the projects share a gate queue (that's a bit of config you can set) then zuul will automatically enqueue the child when the parent merges. Otherwise you can either recheck it, which will rerun check then gate then merge, or reapprove it, which should just run gate jobs then merge it15:04
fungirecheck's working now?15:04
fungi(for this case)15:04
roman_gclarkb: got it. Thank you!15:04
clarkbfungi: it always worked?15:04
roman_gfungi: trying15:04
fungii thought it needed a second workflow +1 or the workflow +1 removed and readded15:04
fungibecause of gerrit not emitting an event when zuul replaces the old verify +1 with a new one15:05
clarkbfungi: oh that was fixed ages ago by zaro by having gerrit always emit vote details and not just the delta in the events15:05
fungiahh15:05
*** tdasilva has quit IRC15:05
clarkbfungi: there is actually a change proposed to zuul to undo that behavior change which I have -1'd15:05
fungiheh15:05
*** tdasilva has joined #openstack-infra15:05
mordredclarkb: can you tell what sort of requests? like, is it doing a full clone of nova or something like that? and/or I wonder if putting in a rate limit would help15:06
mordred(if it's just someone ooming us by doing a shotgun of requests all at once, that sounds like something we'd want to be resilient against)15:06
clarkbmordred: no I haven't tracked it back that far yet. First trying to get some general stats on requestors15:09
mordred++15:09
*** eharney_ has joined #openstack-infra15:12
clarkbThe IP making the most requests over that hour block of time is an OVH IP that we do end up using during that block of time (so that implies there is maybe a job inefficiency we can cleanup but that happens after 07:30 when oom was at 07:00 so probably unrelated to the OOM)15:13
*** eharney has quit IRC15:15
clarkbhttp://logs.openstack.org/78/671378/2/check/openstack-tox-docs/ba3dec1/job-output.txt.gz#_2019-07-18_07_48_18_626049 is the potential job inefficiency15:15
openstackgerritTobias Rydberg proposed opendev/irc-meetings master: After transforming publiccloud wg to sig, meeting info changes as well.  https://review.opendev.org/67150915:15
clarkbI've also just now realized I wasn't grepping in the correct place for the vexxhost ip in nodepool (due to rollover)15:15
clarkbso need to double check on that15:15
clarkbthough it is an ipv4 IP and we should have requests from ipv6 from our test nodes in vexxhost so that implies it isn't one of ours15:16
*** eharney_ is now known as eharney15:17
*** Lucas_Gray has quit IRC15:17
clarkbok confirmed does not show up in any of the nodepool logs on nl03 created since july 115:20
clarkbThe top requestor is that job I linked above, then the vexxhost IP that does a lot around the OOM then an addr from NTT15:21
clarkbso I think that means the vexxhost IP is likely the one to look at since the job doesn't overlap with the OOM15:21
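[Editor's note] The "general stats on requestors" that clarkb describes amount to counting connections per client address in the haproxy log. A minimal sketch of that tally follows, assuming syslog-style haproxy TCP log lines in which the client `ip:port` comes right after the `haproxy[pid]:` tag. The sample lines, the frontend name, and the function name `top_requestors` are illustrative assumptions, not data from gitea-lb.

```python
import re
from collections import Counter

# Assumed layout: "... haproxy[PID]: CLIENT_IP:PORT [timestamp] frontend backend/server ..."
# The greedy address group backtracks to the last colon, so IPv6 clients also work.
CLIENT_RE = re.compile(r"haproxy\[\d+\]: (?P<ip>[0-9a-fA-F.:]+):(?P<port>\d+) \[")


def top_requestors(lines, n=3):
    """Count connections per client address and return the n busiest clients."""
    counts = Counter()
    for line in lines:
        match = CLIENT_RE.search(line)
        if match:  # silently skip lines that do not look like connection logs
            counts[match.group("ip")] += 1
    return counts.most_common(n)


# Hypothetical log excerpt using documentation-only address ranges.
TAIL = "balance_git_https balance_git_https/gitea01.opendev.org 1/0/5021 7812 -- 3/3/2/1/0 0/0"
SAMPLE = [
    f"Jul 18 06:58:01 gitea-lb haproxy[1201]: 203.0.113.7:40112 [18/Jul/2019:06:58:01.120] {TAIL}",
    f"Jul 18 06:58:02 gitea-lb haproxy[1201]: 203.0.113.7:40114 [18/Jul/2019:06:58:02.004] {TAIL}",
    f"Jul 18 06:58:03 gitea-lb haproxy[1201]: 2001:db8::1:51522 [18/Jul/2019:06:58:03.310] {TAIL}",
    f"Jul 18 06:58:04 gitea-lb haproxy[1201]: 203.0.113.7:40120 [18/Jul/2019:06:58:04.870] {TAIL}",
    f"Jul 18 06:58:05 gitea-lb haproxy[1201]: 2001:db8::1:51530 [18/Jul/2019:06:58:05.002] {TAIL}",
    f"Jul 18 06:58:06 gitea-lb haproxy[1201]: 198.51.100.9:33001 [18/Jul/2019:06:58:06.441] {TAIL}",
    "Jul 18 06:58:07 gitea-lb haproxy[1201]: Proxy balance_git_https started.",
]

if __name__ == "__main__":
    for address, hits in top_requestors(SAMPLE):
        print(address, hits)
```

Restricting the input to lines around the OOM timestamp, rather than the whole hour, would probably separate the suspicious client from the busy but later CI job more clearly.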
*** ykarel|away has joined #openstack-infra15:23
*** odicha has quit IRC15:25
cmurphyi'm cleaning out my gerrit queue, could i get someone to review https://review.opendev.org/628189 ? i promise the log in the commit message used to prove my point15:25
*** tdasilva_ has joined #openstack-infra15:26
mordredcmurphy: and the puppet output presumably already doesn't show hiera values, so we're asserting it's safe to not no_log?15:26
corvus(i assumed we no_logged it just because it was verbose)15:27
clarkbya the reason we don't log that iirc is because it is very verbose15:27
clarkbnot because it was secrets15:27
cmurphymordred: it's just a file copy so i can't imagine there are secrets15:27
clarkbshould be safe to approve but may increase log verbosity15:27
corvusclarkb: what's the actual request in that log you pasted?15:27
clarkbcorvus: well there are thousands of requests from that IP over that hour according to the haproxy log15:28
*** tdasilva has quit IRC15:28
clarkbcorvus: it isn't just one, but also that happens well after the OOM so I think it is unrelated15:28
mordredclarkb: yeah - I was going to ask that as well - looking through the log I see all the urls, but I haven't found what it's doing with the urls yet15:29
clarkbI'm currently trying to work out how to correlate the haproxy logs to the macaron logs from gitea15:30
clarkbit would be nice if they both logged the ports they were using oh well15:30
corvusoooh i see it.15:31
corvusclarkb, mordred: https://opendev.org/openstack/tempest/src/branch/master/tools/generate-tempest-plugins-list.py15:32
corvusthat stuff is the output from this script15:32
corvuswhich apparently takes 14m to run15:32
clarkbI've got what I think is a very suspicious connection on the haproxy side (based on how long it takes to process and nearness to the OOM) I'll work on getting a paste up with some of these log details15:33
corvusthat's only fetching a raw file (eg https://opendev.org/openstack/tempest/raw/branch/master/setup.cfg) so, honestly, it's about as nice as it could be.  i'm not sure we should frown on it.15:33
clarkbya was mostly just wanting to sanity check that the job shouldn't be relying on zuul's git repos instead15:34
mordredcorvus: yeah - it seems like an operation we should be able to handle15:34
clarkbif it is sanely requesting git repo data without bypassing zuul then that should be fine15:34
*** bhavikdbavishi has quit IRC15:37
clarkbhttp://paste.openstack.org/show/754556/15:45
clarkbI've tried to collect the relevant logs I've found so far there and sanitized IP addrs15:45
clarkb(the IP addr I didn't sanitize is for the load balancer)15:45
*** mattw4 has joined #openstack-infra15:46
clarkbI guess we can check other instances of OOMs for similar git cat file errors just prior?15:46
*** pgaxatte has quit IRC15:47
clarkbthe git cat file error does not show up on the last OOM on gitea06 so probably unrelated15:48
mordredclarkb: anything in indexer?15:49
*** bhavikdbavishi has joined #openstack-infra15:49
clarkbmordred: not sure I understand the question15:49
clarkbthe only error near gitea06's last oom was a user lookup that failed15:49
mordredclarkb: a file-based indexer could easily have concurrency problems - the git cat-file is related to the call "processRepoIndexerOperationQueue"15:49
mordredah - nod15:49
mordredwell - there goes that theory :)15:50
clarkbhaproxy log format is configurable and we can add the source port of the backend connection to the logs (but we'd need to log that data on the gitea side too for it to be useful and I don't know if we can do that)15:51
clarkbas a sanity check the clocks on gitea-lb and gitea01 seem to be in sync15:54
clarkbI think I've managed to line up some of the requests at least15:55
clarkbbut the timestamps seem just off? hard to tell15:56
openstackgerritJames E. Blair proposed opendev/system-config master: WIP: Add gerrit to gitea job  https://review.opendev.org/67116215:57
*** beekneemech has quit IRC15:57
*** bnemec has joined #openstack-infra15:58
clarkbwe can set ACCESS_LOG_TEMPLATE to update the macaron logs15:58
*** mattw4 has quit IRC15:58
clarkbwhich needs to be enabled with ENABLE_ACCESS_LOG15:59
clarkbhowever the macaron context object doesn't seem to expose the remote port (just the remote addr)16:00
*** iurygregory has quit IRC16:01
*** gtarnaras has quit IRC16:01
*** eharney has quit IRC16:02
*** ginopc has quit IRC16:02
*** lucasagomes has quit IRC16:05
*** rpittau is now known as rpittau|afk16:05
clarkbI'm having a hard time matching haproxy logs up with the gitea logs so looking at the OOM logs a bit more closely. There are git processes with 700MB of resident memory, interesting16:06
mordredclarkb: that's a lot of memory for a git operation16:07
corvuswe build that image from debian-testing; i wonder if any git updates have happened since16:07
corvus(maybe we got a bad version of git?)16:07
mordredcorvus: I was just thinking the same thing16:08
clarkb`/usr/lib/git-core/git pack-objects --revs --thin --stdout --progress --delta-base-offset` is currently using > 1GB on gitea01 and is running in nova's repo16:08
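[Editor's note] Spotting the memory-hungry git processes mentioned above can be done by filtering process listings by resident set size. The sketch below assumes text in the shape produced by `ps -eo rss,args` on Linux, where RSS is reported in KiB. The sample listing and the function name `git_memory` are hypothetical, not captured from gitea01.

```python
def git_memory(ps_text, min_mib=0.0):
    """Return (rss_mib, command) pairs for git processes, largest first.

    Expects 'ps -eo rss,args' style text: an RSS column in KiB followed by
    the full command line. Header and malformed lines are skipped.
    """
    rows = []
    for line in ps_text.splitlines():
        parts = line.split(None, 1)
        if len(parts) != 2 or not parts[0].isdigit():
            continue  # header line or something unexpected
        rss_kib, command = int(parts[0]), parts[1].strip()
        executable = command.split()[0].rsplit("/", 1)[-1]
        if executable == "git":
            rss_mib = rss_kib / 1024
            if rss_mib >= min_mib:
                rows.append((rss_mib, command))
    return sorted(rows, reverse=True)


# Hypothetical listing; the numbers are chosen to mirror the sizes discussed in the log.
SAMPLE = """\
    RSS COMMAND
   5120 /usr/sbin/sshd -D
1153024 /usr/lib/git-core/git pack-objects --revs --thin --stdout --progress --delta-base-offset
 716800 /usr/lib/git-core/git cat-file --batch
  20480 /usr/local/bin/gitea web
"""

if __name__ == "__main__":
    for rss_mib, command in git_memory(SAMPLE, min_mib=500):
        print(f"{rss_mib:.0f} MiB  {command}")
```

Matching on the executable's base name, rather than on the substring "git", keeps the gitea server process itself out of the result.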
corvusgit version 2.20.116:08
mordredmaybe it's just that git takes a lot of ram to process nova?16:08
clarkbThis may also explain why it is so slow if looking up nova causes it to OOM then try again16:09
corvusmore specifically debian version 1:2.20.1-216:09
corvuswhich is what's current16:09
mordredcorvus: yeah - or if looking up nova causes it to read all of nova into ram from disk16:10
clarkbWe may be able to try reproducing this with direct nova renders like https://gitea01.opendev.org/openstack/nova/$stuff16:10
clarkbany idea what is packing objects?16:10
clarkbdoes not appear to be our gc cron (based on ps having no gc lines and our cron starting 4 hours ago)16:11
clarkboh wait packing objects happens when you clone nova right?16:11
clarkbor when you fetch things and it needs a new pack16:11
mordreddoes it? so random clones/fetches could cause nova to decide to repack?16:11
clarkbit's possible this was always in OOM territory and swap on old rax setup caused it to mostly work16:12
mordredyeah16:12
corvusi think it's more like the smart http protocol generates dynamic object packs depending on what the remote side is missing16:12
clarkbmordred: ya I think if say my repo locally is out of sync with upstream but has half the objects in upstream's packfile then upstream will make a new packfile of the delta and send that to me16:12
clarkbcorvus: ya16:12
mordredwhat both of you said16:12
corvusmakes it twice as true16:13
mordreddoes that mean it's twice the price?16:13
*** siqbal90 has quit IRC16:13
corvusmordred: yes, special price for you.16:13
clarkbclearly if we just delete the openwrt image in nova everything will be better16:14
clarkbif only we could do that without rewriting all of nova's history16:14
clarkb(I also don't know that will help much)16:14
mordredcorvus: we so should have done that when we did the bzr->git transform - would have been a good time to rewrite the history16:14
mordrednow that I think about it16:14
mordredclarkb: I choose to believe it would at least help something16:15
corvusyeah.  i don't think i knew there was an openwrt image in there then; in fact, i think i only learned about that like a year or two ago.16:15
openstackgerritMerged zuul/nodepool master: Don't pause static pool on single label quota  https://review.opendev.org/66737116:16
mordredwould turning on git lfs support help with the openwrt image? like, maybe push nova into a gitea with lfs enabled and maybe it'll cause the openwrt image to go into LFS with a pointer in the real repo and nobody will need to dereference the LFS unless they actually want those commits? /me has no idea how git lfs works16:16
clarkbI need to go make tea and find some breakfast but I'm beginning to think that nova is the thread we should pull on and it may lead us to better gitea performance for nova too16:16
mordredclarkb: yeah. I was having a similar thought16:17
clarkbmordred: I think you have to add objects with lfs at commit time16:17
clarkbor something, it's not automagic16:17
mordredugh. oh well :)16:17
clarkbother thoughts really quickly for short term help: rebuild all the servers with more disk and swapfiles (https://review.opendev.org/#/c/671102/) and/or scale the servers up on memory and maybe reduce server count16:21
*** siqbal90 has joined #openstack-infra16:24
*** gyee has joined #openstack-infra16:27
openstackgerritMerged opendev/ansible-role-puppet master: Revert "Don't log puppot module sync"  https://review.opendev.org/62818916:37
*** ccamacho has quit IRC16:37
*** sshnaidm is now known as sshnaidm|off16:42
*** ramishra has quit IRC16:49
*** panda is now known as panda|off16:51
*** igordc has joined #openstack-infra16:54
*** chandankumar is now known as raukadah16:56
*** derekh has quit IRC17:00
*** udesale has quit IRC17:00
*** xek has quit IRC17:06
*** mattw4 has joined #openstack-infra17:08
*** eernst has joined #openstack-infra17:11
*** eharney has joined #openstack-infra17:13
*** mattw4 has quit IRC17:14
*** mattw4 has joined #openstack-infra17:14
*** eernst has quit IRC17:15
*** igordc has quit IRC17:15
*** igordc has joined #openstack-infra17:16
*** tosky has quit IRC17:17
*** panda|off has quit IRC17:19
*** panda has joined #openstack-infra17:21
*** armax has quit IRC17:28
openstackgerritMatthew Thode proposed openstack/diskimage-builder master: support alternate portage directories  https://review.opendev.org/67153017:33
prometheanfireinfra may need this for builds ^17:34
prometheanfireI'm currently running a basic systemd build to test17:34
*** tdasilva_ is now known as tdasilva17:45
*** aprice has quit IRC17:45
*** hogepodge has quit IRC17:45
*** petevg has quit IRC17:45
*** clayg has quit IRC17:45
*** seyeongkim has quit IRC17:45
*** guilhermesp has quit IRC17:46
*** aprice has joined #openstack-infra17:47
*** hogepodge has joined #openstack-infra17:47
clarkbI have confirmed via direct clone against gitea06 (using that one since it has swap) that cloning nova seems to take 1.3GB memory or so resident for about 2.5 minutes17:47
clarkbthough that time value is likely related to the bw between my home and vexxhost17:47
*** petevg has joined #openstack-infra17:48
*** clayg has joined #openstack-infra17:48
*** guilhermesp has joined #openstack-infra17:48
*** seyeongkim has joined #openstack-infra17:48
corvusclarkb, mordred: on a related but different tack, i just did a little bit of poking at gitea source code; i suspect (but have not proven) that the lag in rendering nova is due to looking up the last commit info for each item in the directory, with directories themselves having an especially high cost (the only time it gets down to 1-2 seconds is in a directory with no subdirs).  gitea uses go-git17:51
corvusto traverse the git tree (i don't think it shells out for this).  the only thing it caches is the commit count for the repo.  so opportunities here are: 1) dig further into the go-git tree lookups to see if there's room for improvement; 2) look into caching more stuff (but i'm not sure we'd be able to have a large enough cache to keep the initial render time for nova reasonable)17:51
*** ralonsoh has quit IRC17:51
clarkbcorvus: and maybe 3) don't lookup that data? or only do it on files?17:52
clarkbfor memory usage it appears there are git settings we could tweak like core.packedGitWindowSize which limits how much of a pack file is mmap'd into memory at once17:52
clarkbby default it is mmaping the whole thing which seems to be where the memory use is coming from17:53
clarkbthat said if the issue is due to mmaping files we may not suffer a huge penalty to lean on swap (since it's going to disk anyway right?)17:53
clarkbso maybe leaning on swap isn't the end of the world here17:53
corvusclarkb: also mmap pages may not necessarily be resident17:55
*** bhavikdbavishi has quit IRC17:56
clarkbhttps://git-scm.com/docs/git-config/1.9.2#git-config-corebigFileThreshold may also help though I don't think nova has any files that are bigger than that in it17:56
clarkbso we would be tuning that value down17:56
clarkbhttps://git-scm.com/docs/git-config/1.9.2#git-config-corepackedGitWindowSize defaults to 1GB in our 64bit systems which lines up closely to the 1.3GB resident sizes we see17:57
* clarkb checks how big nova's pack files are17:57
mordredclarkb: although those settings would only impact things where gitea is shelling out to git, not necessarily places where it's using go-git17:58
fungi#status log deleted stale networking-mlnx repository mirror from the openstack organization on github at lennyb's request17:58
openstackstatusfungi: finished logging17:58
clarkbnova's largest packfile is 548MB17:58
clarkbmordred: unless go git also respects git config?17:58
mordredclarkb: yeah. it's a good question17:59
clarkbmordred: in this case the memory use we see is from shelled out git processes17:59
mordredthat is an excellent point17:59
clarkbat least the large git processes I've been able to identify are17:59
corvusyeah, the "nova web page is slow" and "gitea is ooming" probs are tangentially related, but not identical18:00
clarkbhttps://git-scm.com/docs/git-config/1.9.2#git-config-packdeltaCacheSize is another option that may play into the OOMing18:00
clarkbworth noting that the cloning doesn't seem particularly slow though and seems more limited by bw than by memory on the gitea node18:03
clarkbwhich does make me wonder if dipping into swap a bit really would be fine18:03
*** psachin has quit IRC18:04
fungiwe had swap partitions on the old cgit servers, right?18:04
clarkbyes18:04
*** armax has joined #openstack-infra18:05
*** weifan has joined #openstack-infra18:06
*** irclogbot_1 has quit IRC18:07
clarkbI'm wary of changing defaults in git's config just to accommodate memory usage because it is using the memory to avoid unnecessary disk writes and aid performance. If we start tweaking that we open ourselves to much poorer performance potentially (and we have to track and set up that config every time we configure a new repo, or maybe we just special-case the big three: manuals, neutron, nova?)18:07
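For readers following the memory discussion, here is a minimal sketch of what per-repository tuning limited to a few large repos could look like. The option names core.packedGitWindowSize and pack.deltaCacheSize are the git settings linked in the log; the values, the repository paths, and the helper names are assumptions made only for illustration, and nothing in the log says such a change was applied.

```python
import subprocess

# Hypothetical values for illustration only; these are not tuned recommendations.
MEMORY_SETTINGS = {
    "core.packedGitWindowSize": "128m",
    "pack.deltaCacheSize": "64m",
}

# Hypothetical on-disk locations of the large repositories.
BIG_REPOS = [
    "/srv/git/openstack/nova.git",
    "/srv/git/openstack/neutron.git",
    "/srv/git/openstack/openstack-manuals.git",
]


def git_config_commands(repo_path, settings):
    """Build one `git config` command per setting for a single repository."""
    return [
        ["git", "-C", repo_path, "config", key, value]
        for key, value in settings.items()
    ]


def apply_settings(repo_paths, settings, dry_run=True):
    """Return the commands for every repo; run them only when dry_run is False."""
    commands = []
    for repo in repo_paths:
        for cmd in git_config_commands(repo, settings):
            commands.append(cmd)
            if not dry_run:
                subprocess.run(cmd, check=True)
    return commands


if __name__ == "__main__":
    # Dry run: print what would be executed without touching any repository.
    for cmd in apply_settings(BIG_REPOS, MEMORY_SETTINGS):
        print(" ".join(cmd))
```

Keeping the settings in repo-local config rather than in global defaults limits the performance risk raised above to the listed repositories, at the cost of having to reapply them whenever one of those repos is recreated.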
*** altlogbot_2 has quit IRC18:07
*** irclogbot_3 has joined #openstack-infra18:08
*** altlogbot_3 has joined #openstack-infra18:08
*** electrofelix has quit IRC18:10
fungi#status log recovered corrupt https://etherpad.openstack.org/p/security-sig-newsletter content and restored it at the old name after moving the corrupt pad to "broken-security-sig-newsletter"18:20
openstackstatusfungi: finished logging18:20
fungigagehugo: ^ lmk if you need anything else18:20
gagehugofungi: ty!18:21
openstackgerritMatthew Thode proposed openstack/diskimage-builder master: support alternate portage directories  https://review.opendev.org/67153018:23
fungigonna go grab some groceries around the corner real quick before the openstack release management meeting starts18:23
fungibrb18:23
*** gfidente has quit IRC18:34
openstackgerritMatthew Thode proposed openstack/diskimage-builder master: support alternate portage directories  https://review.opendev.org/67153018:40
*** weifan has quit IRC18:46
openstackgerritMatt McEuen proposed openstack/project-config master: Remove airship/utils from infrastructure  https://review.opendev.org/67154718:49
openstackgerritMatt McEuen proposed openstack/project-config master: Remove airship/berth from infra  https://review.opendev.org/67155018:58
*** weifan has joined #openstack-infra19:00
*** ricolin_ has joined #openstack-infra19:03
*** weifan has quit IRC19:04
*** weifan has joined #openstack-infra19:05
*** ricolin has quit IRC19:05
*** weifan has quit IRC19:05
*** weifan has joined #openstack-infra19:12
*** weifan has quit IRC19:15
*** siqbal90 has quit IRC19:19
*** _erlon_ has joined #openstack-infra19:23
*** weifan has joined #openstack-infra19:25
*** weifan has quit IRC19:30
*** weifan has joined #openstack-infra19:32
*** e0ne has joined #openstack-infra19:38
*** weifan has quit IRC19:39
*** weifan has joined #openstack-infra19:39
*** weifan has quit IRC19:44
*** whoami-rajat has quit IRC19:44
openstackgerritMatthew Thode proposed openstack/diskimage-builder master: support alternate portage directories  https://review.opendev.org/67153019:48
*** joeguo has joined #openstack-infra19:54
corvusclarkb: yeah, i think we should subscribe to the "be efficient by getting the process done faster" school of thought on this.20:01
*** pcaruana has quit IRC20:01
openstackgerritJames E. Blair proposed opendev/system-config master: WIP: Add gerrit to gitea job  https://review.opendev.org/67116220:02
*** weifan has joined #openstack-infra20:04
*** tesseract has quit IRC20:05
openstackgerritJames E. Blair proposed opendev/system-config master: Build a docker image of the existing gerrit 2.13  https://review.opendev.org/67145720:09
corvusmordred: ^ i made a trivial fix to that20:09
mordredcorvus: I agree with your trivial fix20:10
mordredcorvus: I'm hoping that image helps us move forward and isn't just a waste of time, but it makes sense in my headhole20:12
fungiShrews: i'm starting to think there may be some sort of image leak... noticing image 5675b0f7-510f-422b-bbda-325e5d7d4c27 (centos-7-1562751318) is hanging around in vexxhost-sjc1 but nodepool doesn't seem to think it exists any longer20:12
corvusmordred: totally20:12
fungiShrews: that was uploaded 8 days ago20:12
corvusmordred: in https://review.opendev.org/671463 i think you need to add some fake entries20:14
mordredcorvus: oh! so I do20:15
mordredcorvus: I'm going to mark that WIP for now - MOC needs an update before we can move forward20:15
corvuskk20:16
Shrewsfungi: http://paste.openstack.org/show/754568/20:16
mordredcorvus: I left the WIP with a note about what's IP20:16
fungiopenstack.exceptions.ConflictException: ConflictException: 409: Client Error for url: https://image-sjc1.vexxhost.us/v2/images/f670e6be-953b-4d4b-a931-6cbb5b568410, Conflict20:16
fungium, neat!20:16
Shrewsfungi: it's been repeatedly trying to delete it. why do you think nodepool doesn't think it exists?20:17
fungioh, i grepped image-list for it and didn't see it20:17
fungimaybe i fat-fingered that20:17
Shrewsfungi: possible it doesn't show up in image-list given the state it is in. i forget offhand20:18
Shrewsreal question is... wth does that error mean?20:19
fungiyeah, that's mostly what i was wondering20:19
clarkbI think that means two images have the same id?20:20
clarkbusually you get that when using nes and there is a collision20:20
fungihttps://opendev.org/openstack/openstacksdk/src/branch/master/openstack/exceptions.py#L119-L12120:21
funginot very helpful20:21
clarkbcorvus I need to look at 06's swap usage but maybe we can make a smaller swapfile on the older servers with less disk then they will get updated when replaced20:21
*** joeguo has quit IRC20:21
corvusclarkb: ++20:21
*** joeguo has joined #openstack-infra20:22
Shrewsfungi: oh actually, my grep skills failed me too. i only see a single attempt to delete that image20:22
clarkbhttp://cacti.openstack.org/cacti/graph.php?action=view&local_graph_id=66750&rra_id=all is current 06 swap usage. Based on that even 1GB swapfile would be ~3x max usage there20:23
clarkbhttp://cacti.openstack.org/cacti/graph.php?action=view&local_graph_id=66636&rra_id=all 01 should be able to support a 1 or 2 GB swap file20:23
fungiShrews: okay, so maybe nodepool didn't put that image back in the delete list when it encountered a 409 coming back from the api?20:23
clarkbinfra-root so how about that as the immediate bandaid, I manually add a 1GB swapfile to 01-05,07-0820:23
corvusclarkb: ++20:24
fungiclarkb: sounds safe to me, and a reasonable precaution, yes20:24
Shrewsfungi: maybe. the znode for in zookeeper is gone20:24
Shrewsfor it20:24
clarkband anyone else want to review https://review.opendev.org/#/c/671102/ before it gets approved which will help ensure replacement gitea servers get properly sized swapfiles?20:24
mordredclarkb: ++20:25
mordredclarkb: lgtm20:25
clarkboh wow so my tumbleweed updates from yesterday include warnings when you try to paste shell code because it might get executed (I'm pasting into a vi buffer)20:26
corvusclarkb: what process is producing those warnings?20:26
mordredcorvus: probably systemd20:27
corvusvia dbus20:28
clarkblooks like xfce4-terminal is doing it on input to it20:28
mordredcorvus: running inside of a hidden browser tab20:28
clarkbaccording to xprop20:28
corvusclarkb: oh neat, that's not as terrible as i feared :)20:29
*** eharney has quit IRC20:32
Shrewsfungi: well, it's possible we've identified the one cloud operation we do not retry. i can't recall if this is intentional or not. i'm guessing not20:32
clarkbok all of the other gitea servers now have 1GB swapfiles20:34
clarkb05 is the one with the least amount of disk after that with over 6GB20:34
clarkbthe others are just under 10GB20:34
clarkbI'm going to trigger replication now across all of the giteas to ensure there are no lost refs on any of them20:34
Shrewsfungi: i've copied the log file off to my home directory so i can look more closely at this tomorrow. i have a PT session coming up now20:35
fungiShrews: no worries, just wanted to make sure i wasn't the only one aware of a possible image leak20:35
clarkb01 is replicating now20:36
openstackgerritJames E. Blair proposed opendev/system-config master: WIP: run a gerrit container  https://review.opendev.org/63040620:38
fungiShrews: okay if i delete that image manually, or do you want to keep it around for further inspection?20:38
corvusmordred: ^ that had bitrotted a lot, i'm dragging it into the present20:38
Shrewsfungi: go ahead20:39
fungithanks! done, it deleted cleanly, no errors20:39
fungiso whatever it's problem, it was transient20:40
fungier, its problem20:40
openstackgerritSlawek Kaplonski proposed openstack/project-config master: Update Neutron grafana dashboard  https://review.opendev.org/67156820:40
corvusmordred: i'm confused about the error in http://logs.openstack.org/57/671457/2/check/system-config-build-image-gerrit/8c6fd8d/job-output.txt.gz#_2019-07-18_20_26_09_08829320:43
corvusmordred: it looks like even the current build shouldn't work because there's no "FROM foo as gerrit" line in the dockerfile20:44
petevgfungi: Hi! Is there a cap of half an hour on build time in the gate? And if so, is there any way to get that raised for a project? (I ask because x/microstack times out before the snap finishes building ... https://review.opendev.org/#/c/671358/)20:46
fungipetevg: that's just a default. you can override it with whatever timeout you need20:46
openstackgerritMerged opendev/system-config master: Disable cloud launcher cron job during CI  https://review.opendev.org/67094620:47
openstackgerritMerged opendev/system-config master: Publish .log files as text/plain  https://review.opendev.org/67093420:47
openstackgerritMerged opendev/system-config master: Add mirror-update to run_all.sh  https://review.opendev.org/67092720:47
fungipetevg: an example: https://opendev.org/openstack/nova/src/branch/master/.zuul.yaml#L920:47
fungi(value is in seconds)20:47
petevgfungi: I was just looking for an example. Thank you :-)20:47
fungiyou're welcome!20:48
openstackgerritJames E. Blair proposed opendev/system-config master: Remove targets from docker build  https://review.opendev.org/67157020:48
clarkbfungi: petevg there is a cap of 3 hours though iirc20:49
corvusmordred: ^ if that works, my best guess is that we just tightened up the image build jobs after writing the original version of that.20:49
fungipetevg: just be wary that the longer a job runs, the longer your test results for a change will be delayed. so if you have a job that takes 2-3 hours to complete and it runs for most changes, then developers wait at least that long for aggregate job results20:49
*** dpawlik has quit IRC20:53
openstackgerritMerged opendev/system-config master: mirror-update: update docs for mirror-update.opendev.org  https://review.opendev.org/66883320:54
openstackgerritMerged opendev/system-config master: launch-node.py : add option to skip ipv6 address checks  https://review.opendev.org/66754820:54
openstackgerritMerged opendev/system-config master: Use swapfile if no extra device is present  https://review.opendev.org/67110220:55
*** priteau has quit IRC21:05
*** raissa has joined #openstack-infra21:06
*** raissa has quit IRC21:06
*** raissa has joined #openstack-infra21:07
*** raissa has joined #openstack-infra21:07
*** raissa has quit IRC21:08
*** eharney has joined #openstack-infra21:08
*** pkopec has quit IRC21:10
*** e0ne has quit IRC21:17
*** ekultails has quit IRC21:25
*** igordc has quit IRC21:27
*** igordc has joined #openstack-infra21:28
openstackgerritJames E. Blair proposed opendev/system-config master: WIP: run a gerrit container  https://review.opendev.org/63040621:28
openstackgerritMerged zuul/zuul master: Add "supercedes" pipeline option  https://review.opendev.org/67067021:33
openstackgerritJames E. Blair proposed opendev/system-config master: WIP: Add gerrit to gitea job  https://review.opendev.org/67116221:37
*** beekneemech has joined #openstack-infra21:38
*** beekneemech has quit IRC21:38
openstackgerritJames E. Blair proposed opendev/system-config master: Build a docker image of the existing gerrit 2.13  https://review.opendev.org/67145721:39
openstackgerritMatthew Thode proposed openstack/diskimage-builder master: support alternate portage directories  https://review.opendev.org/67153021:41
mordredcorvus: huh. yeah - that was some of our earliest dockerfileing21:55
openstackgerritMatt McEuen proposed openstack/project-config master: Remove airship/berth from infra  https://review.opendev.org/67155022:00
openstackgerritMatt McEuen proposed openstack/project-config master: Remove airship/utils from infrastructure  https://review.opendev.org/67154722:01
*** weifan has quit IRC22:01
*** weifan has joined #openstack-infra22:02
*** bnemec has quit IRC22:03
*** weifan has quit IRC22:06
*** weifan has joined #openstack-infra22:06
*** slaweq has quit IRC22:09
mnaserwas there an infra issue at some point inside gitea today?22:09
mnaserhttp://logs.openstack.org/05/671105/2/check/openstack-ansible-upgrade-aio_metal-ubuntu-bionic/44da912/job-output.txt.gz#_2019-07-18_20_40_35_36742222:09
mnaserbut the commit does exist https://opendev.org/openstack/ansible-hardening/commit/ae01cbc824c7cf8aa7eac054fde481dacc441b6822:09
mnaser(this is the first time we do upgrade jobs though, so we might be broken, but i fail to see how it would do that)22:10
clarkbmnaser: that's the issue I've been working through the last little while22:10
*** weifan has quit IRC22:10
clarkb(or at least is likely that)22:10
mnaserclarkb: gah, sorry, your name is all white in my irc client so i kinda get blind to it22:10
mnaseri see the text about replication and what not :(22:10
*** weifan has joined #openstack-infra22:10
clarkbbasically what happens is if the gitea server OOMs while gerrit is replicating there is a nonzero chance that oomkiller kills the git process that is handling the gerrit replication22:12
*** weifan has quit IRC22:12
clarkband when that happens the ref doesn't exist on that backend until some other event causes it to rereplicate22:12
fungifor some arbitrary commit22:12
clarkbThe ooming is happening because git can be quite large to handle nova (and probably openstack manuals and neutron)22:12
clarkbso what I've done is added swapfiles to all of the nodes which seems to have fixed it for gitea06 which was my original test node22:12
clarkbunfortunately all of the nodes but 06 are somewhat disk constrained so started with only 1GB swapfiles22:13
clarkbAnd now I'm rereplicating everything to ensure they are caught up22:13
clarkbbut this should hopefully prevent oomkiller from running again22:13
fungido we think there's a chance we have jobs recently starting to clone things from gitea instead of using local copies?22:13
clarkband we'll be stable going forward22:13
fungi(or maybe even not so recently starting)22:13
*** bnemec has joined #openstack-infra22:13
clarkbfungi: I identified at least one job that was hitting opendev.org instead of zuul repos but it wasn't doing full clones just fetching a file at a ref22:14
fungiahh22:14
clarkbthat info is all in haproxy logs now though if we want to uniq -c and then grep the results out of nodepool/zuul22:14
clarkbthe next step which I'm somewhat less enthusiastic about but we've been meaning to do it anyway is to rebuild all of the gitea nodes with more disk22:15
fungii just remember that time we accidentally uploaded images with incomplete git mirrors (back in the old days) and our git farm melted down to slag in moments22:15
clarkbmaybe I can get help with that and get checked on my gitea backend rebuild docs22:15
*** weifan has joined #openstack-infra22:15
fungiyeah, i ought to be able to find time to try out your docs on at least one22:16
fungi(but not tonight)22:16
*** eharney has quit IRC22:16
clarkbya I think I've got a good enough bandaid for the next little while22:16
clarkbseparately corvus thinks he has identified the cause of some of the slowness browsing nova in gitea22:18
clarkbcorvus: ^ that might be worth a response to sean's ml thread?22:18
*** weifan has quit IRC22:20
corvusclarkb: i'd rather not give anyone the incorrect impression that i'm working on that22:21
corvusif there's someone to collaborate with, sure, but i don't think anyone has responded yet?22:22
clarkbya I haven't seen anyone volunteer yet22:22
clarkbthe last gitea backend is replicating now (06)22:36
clarkbin theory we'll avoid these ref not found issues now22:36
*** bnemec has quit IRC22:42
*** bnemec has joined #openstack-infra22:43
*** hwoarang has quit IRC22:44
corvusfingerprint = base64.b64encode(hashlib.sha256(base64.b64decode(public_key.encode('ascii'))).digest()).decode('ascii').replace('=','')22:44
corvusjust, you know, in case you were wondering how to do that in python22:45
*** armax has quit IRC22:45
*** hwoarang has joined #openstack-infra22:45
corvusactually, it's better than that if you don't already start with the b64 public key22:46
corvus        fingerprint = base64.b64encode(hashlib.sha256(base64.b64decode(paramiko.RSAKey.from_private_key_file(ssh_key_file).ssh_key.get_base64().encode('ascii'))).digest()).decode('ascii').replace('=','')22:46
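The one-liners above can be unpacked into a small standard-library helper. This is a sketch assuming the input is the base64 blob from the middle field of an OpenSSH public key line; the function name and the placeholder blob are hypothetical, and loading the key from a private key file (the paramiko variant) is left out.

```python
import base64
import hashlib


def ssh_sha256_fingerprint(public_key_b64: str) -> str:
    """Return the unpadded base64 SHA-256 digest of a base64-encoded key blob."""
    # Decode the base64 text back into the raw key blob bytes.
    key_blob = base64.b64decode(public_key_b64.encode("ascii"))
    # Hash the raw blob, then re-encode the digest as base64 text.
    digest = hashlib.sha256(key_blob).digest()
    # Strip the '=' padding, as the one-liner in the log does.
    return base64.b64encode(digest).decode("ascii").replace("=", "")


if __name__ == "__main__":
    # Hypothetical placeholder blob for demonstration; this is not a real key.
    example_blob = base64.b64encode(b"not a real ssh key blob").decode("ascii")
    print("SHA256:" + ssh_sha256_fingerprint(example_blob))
```

The steps are the same as in the one-liner: decode the base64 key, hash the raw bytes with SHA-256, base64-encode the digest, and drop the padding. The "SHA256:" prefix in the demo is only a display convention and is not part of the returned value.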
mordredcorvus: that makes perfect sense22:57
*** tkajinam has joined #openstack-infra22:58
*** mattw4 has quit IRC23:07
*** weifan has joined #openstack-infra23:08
donnydclarkb: Soooo my equipment was supposed to be here today, and instead of a machine with 7 pcie slots.. i got a box full of someone else's parts... So I have to return those and wait.. again for the right stuff23:09
donnydshould be here the 24th23:09
clarkbwhoops23:09
donnydSo I lowered the quota down to 50 to see if it helps with the timeouts till i get it fixed23:10
donnydso frustrating23:10
*** rcernin has joined #openstack-infra23:15
*** aaronsheffield has quit IRC23:17
*** tdasilva has quit IRC23:19
openstackgerritJames E. Blair proposed zuul/zuul master: Allow projects to borrow ssh keys  https://review.opendev.org/67163723:34
corvusmordred: i'm going to hand https://review.opendev.org/671457 back over to you now -- the builds are running, but they are both failing (probably something you could debug with a local docker build)23:40
openstackgerritJames E. Blair proposed opendev/system-config master: WIP: Add gerrit to gitea job  https://review.opendev.org/67116223:45
*** armax has joined #openstack-infra23:50
openstackgerritJames E. Blair proposed opendev/system-config master: WIP: run a gerrit container  https://review.opendev.org/63040623:50
