Friday, 2019-06-28

*** lseki has quit IRC00:02
*** mriedem_afk is now known as mriedem00:03
openstackgerritMerged opendev/system-config master: Add docs for deploying a new gitea server  https://review.opendev.org/66803000:05
*** dchen has quit IRC00:07
*** dchen has joined #openstack-infra00:11
clarkbgc'ing on gitea06 is done00:12
clarkbthere are a few defunct git processes that we might try to clean up tomorrow if they don't go away00:12
ianwfungi / corvus: thanks for looking at those messages.  to try and sync everyone I've sent -> http://lists.infradead.org/pipermail/linux-afs/2019-June/003078.html00:34
ianwbasically if we know what & how to trace in a useful way, i think we can probably just write a script to capture something helpful00:35
ianwsorry, wrt afs offline messages00:35
*** hamzy has joined #openstack-infra00:38
donnydclarkb I am just about ready to start testing my infra00:42
*** diablo_rojo has quit IRC00:44
auristorianw: abort code is 106 or VOFFLINE00:50
auristorianw: which is what I expect the openafs fileserver to send while the volume is releasing00:52
diablo_rojo_phonmelwitt: I think I have a local WIP patch for the search but I'm happy to walk you through stuff. Generally it's like clarkb said. Give it a second and it will do some typeahead magic to help you tailor your search.00:54
diablo_rojo_phonAnd of course will take whatever patches you've got :)00:54
melwittdiablo_rojo_phon: oh, coolness, thanks for letting me know. I'm totes cool with it being already done by you :)00:57
melwittfor me it's taking several seconds to get anything to appear, so I thought I was doing it wrong00:57
melwittbut I tried it some more, waiting 30s and longer and now I get the behavior that y'all described00:57
diablo_rojo_phonYeah. mordred has volunteered to help with our database indexing in hopes that it speeds up the search a bit.01:06
melwittnoice01:07
diablo_rojo_phonmelwitt: I'll try to get that doc patch up next week or so01:07
diablo_rojo_phonIf you have questions in the meantime, feel free to ping me :)01:07
melwittdiablo_rojo_phon: awesome, thanks much01:07
*** imacdonn has quit IRC01:09
*** imacdonn has joined #openstack-infra01:10
ianwauristor: ok, perhaps a similar case of just turn down the logging?01:11
ianwor is it hard to tell the difference between "releasing and coming back" and actually offline?01:11
auristordo you have FileAuditLog enabled on the fileservers ?01:12
ianwauristor: no, i think i have dejavu of us discussing this before01:12
auristorif so, the requests returning "CODE 106" will be logged in the FileAuditLog01:13
ianwis it going to be a ridiculous amount of logging to manage on busy servers?01:13
auristoryou can then see whether the kafs client is trying to read from each fileserver and if it's getting 106 (VOFFLINE) from each01:14
ianwis this something openafs on the client side likely masks for us by retrying or something?01:16
auristorYou can make the FileAuditLog be a named pipe and then have a reader process to filter the requests you want to keep such as only those that fail01:16
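
The reader process auristor describes could look roughly like the sketch below. It is only an illustration: the exact FileAuditLog line layout is an assumption here (all that is taken from the discussion is that entries carry a "CODE <n>" field, with 106 meaning VOFFLINE), and the names `failed_entries` and `follow_pipe` are hypothetical.

```python
import re

# Assumption: every audit entry is one line containing a "CODE <n>" field,
# where 0 means success and anything else (e.g. 106, VOFFLINE) is a failure.
CODE_RE = re.compile(r"\bCODE (-?\d+)\b")


def failed_entries(lines):
    """Yield only the audit entries whose CODE field is present and non-zero."""
    for line in lines:
        match = CODE_RE.search(line)
        if match and int(match.group(1)) != 0:
            yield line.rstrip("\n")


def follow_pipe(path, out):
    """Read audit entries from a named pipe and write the failures to `out`."""
    # Opening a FIFO for reading blocks until a writer has it open.
    with open(path, "r", errors="replace") as pipe:
        for entry in failed_entries(pipe):
            out.write(entry + "\n")
            out.flush()
```

Calling `follow_pipe` with the path of the pipe and `sys.stdout` (or an open log file) would keep only the failing requests, so the bulk of the successful RPC entries never reaches the disk.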
auristorThe openafs client does have an algorithm for retrying another fileserver if the fileserver returns VSALVAGE, VOFFLINE, VNOVOL and VMOVED.   The algorithm is somewhat broken but it works in many cases.01:18
*** dchen has quit IRC01:22
ianwinteresting.  we don't see this on every vos release; i guess i could correlate from apache logs and update logs if requests for AFS files actually came in for the vos release period in times we did not see any error01:22
*** dchen has joined #openstack-infra01:22
auristorBut looking at the "vos release" output that was included in http://lists.infradead.org/pipermail/linux-afs/2019-June/003078.html the RO on afs02 is taken offline and updated.  During that update the afs02 copy is available.  Then the afs01 RO is taken offline so it can be used as the source for updating the afs02 RO which is being updated in place.01:23
auristorIf there was a 3rd fileserver, that third copy would be available during the update of afs02 and then afs02 would be put in service while afs03 is updated.  Then afs03 and afs01 would be put in service01:25
auristorI will talk with dhowells tomorrow to review the failover and retry algorithms in kafs01:26
*** bhavikdbavishi has joined #openstack-infra01:26
ianwgreat, thanks, i thought a mail thread might be better for some more prolonged debugging.01:28
auristordo you have an estimate for how many RPCs each of the fileservers processes?01:28
*** bhavikdbavishi1 has joined #openstack-infra01:29
*** bhavikdbavishi has quit IRC01:31
*** bhavikdbavishi1 is now known as bhavikdbavishi01:31
ianwis this from rxdebug helpful : Free packets: 1541/9103, packet reclaims: 905, calls: 1667017653, used FDs: 6301:32
ianwuptime of 108 days ...01:32
auristorcalls is the number of rpcs but it's a 32-bit integer that rolls over01:33
ianwahh, in that case i don't know :)  it is at 1667019421 now01:33
ianwso 1768 in what, 2 minutes?01:34
ianwsay ~14/s01:34
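
The arithmetic above can be sketched as follows, under the assumption that the rxdebug "calls" value is an unsigned 32-bit counter, so the difference between two samples is taken modulo 2**32. The function names are illustrative, and the modulo trick only covers a single wrap between samples.

```python
COUNTER_MODULUS = 2 ** 32  # "calls" is described as a 32-bit counter


def calls_between(first, second, modulus=COUNTER_MODULUS):
    """Number of calls between two samples, tolerating one counter wrap."""
    return (second - first) % modulus


def calls_per_second(first, second, elapsed_seconds):
    """Average call rate between two samples taken elapsed_seconds apart."""
    return calls_between(first, second) / elapsed_seconds


# The two samples quoted in the channel, roughly two minutes apart.
delta = calls_between(1667017653, 1667019421)
rate = calls_per_second(1667017653, 1667019421, 120)
print(delta, round(rate, 1))  # prints: 1768 14.7
```

Sampling at a fixed interval and feeding the deltas to a graphing tool avoids relying on the absolute counter value, which says nothing once it has wrapped.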
auristorcan you query every minute and graph the stats in a dashboard?01:36
auristorthe reason I'm asking is to estimate the amount of space required for audit logging.  Assume there will be one entry per rpc with an average length of 128 characters01:37
auristorassuming there was no rollover the audit data log for 100 days would be about 200GB01:39
auristorbut I think that counter has wrapped01:44
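
The 200GB figure follows from the stated assumption of one audit entry of about 128 characters per RPC, applied to the "calls" value ianw pasted. A minimal sketch of that estimate, with hypothetical helper names and decimal gigabytes assumed:

```python
def audit_log_bytes(rpc_count, bytes_per_entry=128):
    """Estimated audit log size: one entry of bytes_per_entry per RPC."""
    return rpc_count * bytes_per_entry


def to_gigabytes(byte_count):
    """Convert bytes to decimal gigabytes (10**9 bytes)."""
    return byte_count / 1e9


# "calls" value reported by rxdebug; valid only if the counter never wrapped.
total = audit_log_bytes(1667017653)
print(round(to_gigabytes(total)))  # prints: 213
```

If the counter has wrapped, as suspected, the true RPC count and therefore the log size would be larger, which is why a rate measured over a known interval is a safer input.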
auristordo you have xstat_fs_test from the openafs source tree?01:45
*** dchen has quit IRC01:45
*** dchen has joined #openstack-infra01:46
auristor"xstat_fs_test <fileserver> 2 -once" will return per RPC statistics01:46
auristorLinus has pulled the afs fixes into his source tree01:47
*** gouthamr has quit IRC01:49
ianwauristor: xstat_fs_test -> http://paste.openstack.org/show/753534/01:51
*** dmellado has quit IRC01:51
auristorThose stats are useful to dashboard01:51
*** gouthamr has joined #openstack-infra01:52
*** bhavikdbavishi has quit IRC01:53
*** dchen has quit IRC01:53
*** sgw has joined #openstack-infra01:54
*** dchen has joined #openstack-infra01:54
*** dmellado has joined #openstack-infra01:54
*** mriedem has quit IRC01:54
*** rajinir has quit IRC01:55
*** apetrich has quit IRC01:58
auristorbtw, the -cb parameter on afs01.dfw is too small based upon the xstat_fs_test collection set 3 output01:58
*** happyhemant has quit IRC01:59
auristorraise it to 200000001:59
ianwso after watching the stats for a while 1577.727273/minute ? ... https://docs.google.com/a/redhat.com/spreadsheets/d/e/2PACX-1vS6x3IpFlNTo_el355S0ooNDDjk-MjHm-5TnHrci43_HarqsDiDj15WZ5OuNs3iDnrRx70sy2AxWPal/pubhtml02:00
*** gouthamr has quit IRC02:02
*** dmellado has quit IRC02:02
*** dmellado has joined #openstack-infra02:05
ianwauristor: which stat was that based on?  i'll update our docs before doing it02:05
*** gouthamr has joined #openstack-infra02:06
auristorIf any GSS counters are non-zero it means the fileserver is breaking callback promises before they expire due to lack of space02:06
*** gouthamr has quit IRC02:07
ianwwith about twice as much data now, average calls / minute sitting at 184602:07
openstackgerritIan Wienand proposed opendev/system-config master: Raise callbacks for AFS server  https://review.opendev.org/66807802:14
*** dmellado has quit IRC02:14
*** gouthamr has joined #openstack-infra02:14
ianwauristor: ^ look about right?02:14
openstackgerritIan Wienand proposed opendev/system-config master: Raise callbacks for AFS server  https://review.opendev.org/66807802:15
*** dchen has quit IRC02:15
*** dchen has joined #openstack-infra02:16
*** gouthamr has quit IRC02:17
*** dmellado has joined #openstack-infra02:17
*** jesusaur has quit IRC02:18
*** gouthamr has joined #openstack-infra02:19
*** ricolin has joined #openstack-infra02:20
auristorhave a +102:21
auristorI can't access the spreadsheet you posted02:28
*** gouthamr has quit IRC02:29
ianwyeah, how silly i can't seem to get it to share with anyone !redhat.com ... it's one of these new fangled google work accounts02:31
*** gouthamr has joined #openstack-infra02:35
*** dmellado has quit IRC02:35
ianwauristor: raw numbers -> https://ethercalc.openstack.org/1sp6l7r0xb4b02:35
*** dmellado has joined #openstack-infra02:38
*** gouthamr has quit IRC02:42
*** dmellado has quit IRC02:42
*** dmellado has joined #openstack-infra02:45
ianw#status log afs01/02.dfw & afs01.ord restarted with greater -cb values: see https://review.opendev.org/66807802:46
openstackstatusianw: finished logging02:46
*** zhangfei has joined #openstack-infra02:49
*** gouthamr has joined #openstack-infra02:49
*** dmellado has quit IRC02:52
*** dmellado has joined #openstack-infra02:54
*** gouthamr has quit IRC02:54
*** gouthamr has joined #openstack-infra02:58
fungisee, i knew we decided to run a collaborative open source spreadsheet application for a good reason ;)03:01
*** dmellado has quit IRC03:02
*** dmellado has joined #openstack-infra03:04
*** whoami-rajat has joined #openstack-infra03:05
*** gouthamr has quit IRC03:05
*** tonyb has quit IRC03:05
*** gouthamr has joined #openstack-infra03:10
*** dmellado has quit IRC03:11
*** dmellado has joined #openstack-infra03:13
openstackgerritMerged opendev/system-config master: Raise callbacks for AFS server  https://review.opendev.org/66807803:13
*** zhurong has joined #openstack-infra03:22
*** gouthamr has quit IRC03:27
*** dmellado has quit IRC03:29
*** gouthamr has joined #openstack-infra03:31
*** dmellado has joined #openstack-infra03:32
*** bhavikdbavishi has joined #openstack-infra03:32
*** gouthamr has quit IRC03:33
*** xinranwang has joined #openstack-infra03:33
*** dmellado has quit IRC03:34
*** tonyb has joined #openstack-infra03:36
*** raukadah is now known as chandankumar03:37
*** gouthamr has joined #openstack-infra03:37
*** dmellado has joined #openstack-infra03:39
*** psachin has joined #openstack-infra03:40
openstackgerritIan Wienand proposed opendev/system-config master: [dnm] move rsync updates to new mirror host  https://review.opendev.org/66778203:44
*** gouthamr has quit IRC03:49
*** dmellado has quit IRC03:51
*** dmellado has joined #openstack-infra03:55
*** gouthamr has joined #openstack-infra03:58
*** gouthamr has quit IRC04:00
openstackgerritIan Wienand proposed opendev/system-config master: [dnm] move rsync updates to new mirror host  https://review.opendev.org/66778204:00
*** rh-jelabarre has quit IRC04:00
*** ykarel|away has joined #openstack-infra04:02
*** gouthamr has joined #openstack-infra04:06
*** ykarel|away is now known as ykarel04:08
*** dmellado has quit IRC04:09
*** gouthamr has quit IRC04:10
*** dmellado has joined #openstack-infra04:13
*** dmellado has quit IRC04:15
*** ramishra has joined #openstack-infra04:15
*** gouthamr has joined #openstack-infra04:16
*** gouthamr has quit IRC04:17
*** dmellado has joined #openstack-infra04:20
*** gouthamr has joined #openstack-infra04:22
*** udesale has joined #openstack-infra04:24
*** gouthamr has quit IRC04:25
openstackgerritIan Wienand proposed opendev/system-config master: [dnm] move rsync updates to new mirror host  https://review.opendev.org/66778204:29
*** gouthamr has joined #openstack-infra04:30
*** gouthamr has quit IRC04:35
*** pcaruana has joined #openstack-infra04:36
*** dmellado has quit IRC04:37
openstackgerritIan Wienand proposed opendev/system-config master: [dnm] move rsync updates to new mirror host  https://review.opendev.org/66778204:39
*** dmellado has joined #openstack-infra04:41
*** gouthamr has joined #openstack-infra04:41
*** gouthamr has quit IRC04:50
*** virendra-sharma has joined #openstack-infra04:52
*** dmellado has quit IRC04:56
*** gouthamr has joined #openstack-infra04:57
*** dmellado has joined #openstack-infra05:00
*** dchen has quit IRC05:00
*** dchen has joined #openstack-infra05:05
*** gouthamr has quit IRC05:06
*** dmellado has quit IRC05:11
*** gouthamr has joined #openstack-infra05:12
*** dmellado has joined #openstack-infra05:13
*** hwoarang has quit IRC05:16
*** hwoarang has joined #openstack-infra05:19
*** gouthamr has quit IRC05:19
*** gouthamr has joined #openstack-infra05:24
*** gouthamr has quit IRC05:31
*** kjackal has joined #openstack-infra05:32
*** dmellado has quit IRC05:35
*** gouthamr has joined #openstack-infra05:37
*** dmellado has joined #openstack-infra05:38
openstackgerritIan Wienand proposed opendev/system-config master: Move rsync mirror updates to new opendev.org mirror-update host  https://review.opendev.org/66778205:39
openstackgerritIan Wienand proposed opendev/system-config master: Move rsync mirror updates to new opendev.org mirror-update host  https://review.opendev.org/66778205:41
*** xinranwang has quit IRC05:43
*** gouthamr has quit IRC05:44
*** jbadiapa has quit IRC05:46
*** dmellado has quit IRC05:48
*** dmellado has joined #openstack-infra05:51
*** gouthamr has joined #openstack-infra05:51
*** gouthamr has quit IRC05:52
ianwauristor / fungi : ok, calling it on the rx stats tracking, etc.  notes updated in https://etherpad.openstack.org/p/opendev-mirror-afs05:58
*** dmellado has quit IRC05:58
*** gouthamr has joined #openstack-infra05:58
*** jtomasek has joined #openstack-infra05:58
ianwi'll put it on my todo to track some of those more interesting stats; i think afsmon could be expanded06:00
*** slaweq has joined #openstack-infra06:00
*** dmellado has joined #openstack-infra06:01
auristorok06:03
*** gouthamr has quit IRC06:03
*** yamamoto has joined #openstack-infra06:06
*** yamamoto_ has joined #openstack-infra06:06
*** dmellado has quit IRC06:08
*** gouthamr has joined #openstack-infra06:08
*** lpetrut has joined #openstack-infra06:09
*** lpetrut has quit IRC06:09
*** lpetrut has joined #openstack-infra06:10
*** yamamoto has quit IRC06:10
*** dmellado has joined #openstack-infra06:11
*** gouthamr has quit IRC06:11
*** gouthamr has joined #openstack-infra06:16
*** gouthamr has quit IRC06:16
*** dpawlik has joined #openstack-infra06:18
*** gouthamr has joined #openstack-infra06:22
*** kopecmartin|off is now known as kopecmartin06:24
openstackgerritMerged openstack/diskimage-builder master: Enable nodepool testing for opensuse 15.1  https://review.opendev.org/66753706:27
*** dmellado has quit IRC06:27
*** gouthamr has quit IRC06:29
*** dmellado has joined #openstack-infra06:30
openstackgerritMerged openstack/diskimage-builder master: debootstrap: make default network interface names configurable  https://review.opendev.org/66493606:30
*** pgaxatte has joined #openstack-infra06:34
*** gouthamr has joined #openstack-infra06:36
*** dmellado has quit IRC06:44
openstackgerritIan Wienand proposed opendev/system-config master: Move rsync mirror updates to new opendev.org mirror-update host  https://review.opendev.org/66778206:44
*** gouthamr has quit IRC06:46
*** dmellado has joined #openstack-infra06:47
*** dchen has quit IRC06:51
*** dchen has joined #openstack-infra06:52
*** ccamacho has joined #openstack-infra06:53
*** dchen has quit IRC06:57
ianwfungi: sent a note about the oom in the logs too @ https://www.redhat.com/archives/linux-cachefs/2019-June/msg00011.html .  i feel like i discussed that before.  probably a second tier problem to the hard-crashes, but just wanted a reference so we can come back to it06:57
*** dmellado has quit IRC06:57
*** gouthamr has joined #openstack-infra06:58
*** dmellado has joined #openstack-infra07:00
*** jbadiapa has joined #openstack-infra07:04
*** gouthamr has quit IRC07:04
*** dmellado has quit IRC07:05
*** dmellado has joined #openstack-infra07:06
*** gouthamr has joined #openstack-infra07:12
*** dmellado has quit IRC07:13
yoctozepto> <ianw> fungi: thanks for looking in on iad ... there's also another [Thu Jun 27 15:53:34 2019] kAFS: afs_dir_check_page(32f): bad magic 1/2 is 0000 which i've never seen before07:13
yoctozeptooh, so finally it has revealed itself - it's bad magic behind all of this, knew it since childhood07:13
yoctozepto;D07:14
*** gouthamr has quit IRC07:15
*** dmellado has joined #openstack-infra07:16
*** rcernin has quit IRC07:16
*** tesseract has joined #openstack-infra07:17
ianwmagic is one thing the whole thing is not short of! :)07:18
yoctozeptoianw: agreed!07:19
*** dmellado has quit IRC07:21
*** dmellado has joined #openstack-infra07:21
*** dmellado has quit IRC07:21
*** gouthamr has joined #openstack-infra07:23
*** dmellado has joined #openstack-infra07:24
*** tosky has joined #openstack-infra07:25
*** bhavikdbavishi has quit IRC07:28
*** dmellado has quit IRC07:30
*** tesseract has quit IRC07:30
*** gouthamr has quit IRC07:30
*** tesseract has joined #openstack-infra07:32
*** witek has joined #openstack-infra07:34
*** dmellado has joined #openstack-infra07:34
*** jpich has joined #openstack-infra07:37
*** gouthamr has joined #openstack-infra07:39
*** dmellado has quit IRC07:39
*** dmellado has joined #openstack-infra07:44
*** apetrich has joined #openstack-infra07:45
*** gouthamr has quit IRC07:51
*** dmellado has quit IRC07:51
*** dmellado has joined #openstack-infra07:54
*** dmellado has quit IRC07:54
*** dmellado has joined #openstack-infra07:55
*** lpetrut has quit IRC07:56
*** gouthamr has joined #openstack-infra07:57
*** gouthamr has quit IRC07:59
*** ralonsoh has joined #openstack-infra08:01
*** dmellado has quit IRC08:03
*** gouthamr has joined #openstack-infra08:04
*** Emine has joined #openstack-infra08:08
*** psachin has quit IRC08:08
*** dmellado has joined #openstack-infra08:08
*** gouthamr has quit IRC08:09
*** lucasagomes has joined #openstack-infra08:09
*** Lucas_Gray has joined #openstack-infra08:11
*** AJaeger_ has joined #openstack-infra08:12
*** gouthamr has joined #openstack-infra08:14
*** xek has joined #openstack-infra08:14
*** AJaeger has quit IRC08:15
*** gouthamr has quit IRC08:17
*** dmellado has quit IRC08:19
*** dmellado has joined #openstack-infra08:22
*** dmellado has quit IRC08:23
*** gouthamr has joined #openstack-infra08:23
*** dmellado has joined #openstack-infra08:24
*** psachin has joined #openstack-infra08:24
*** gouthamr has quit IRC08:30
*** dmellado has quit IRC08:33
*** whoami-rajat has quit IRC08:34
*** dmellado has joined #openstack-infra08:34
*** dmellado has quit IRC08:37
*** dmellado has joined #openstack-infra08:39
*** gouthamr has joined #openstack-infra08:40
*** dmellado has quit IRC08:46
*** gouthamr has quit IRC08:47
*** dmellado has joined #openstack-infra08:50
*** dmellado has quit IRC08:50
*** dmellado has joined #openstack-infra08:51
*** pkopec has joined #openstack-infra08:52
*** gouthamr has joined #openstack-infra08:53
*** dmellado has quit IRC08:54
*** gouthamr has quit IRC08:56
*** dmellado has joined #openstack-infra08:57
*** jaosorior has quit IRC08:57
*** jaosorior has joined #openstack-infra09:00
*** bhavikdbavishi has joined #openstack-infra09:01
*** gouthamr has joined #openstack-infra09:02
*** dmellado has quit IRC09:02
*** bstinson has quit IRC09:05
*** gouthamr has quit IRC09:05
*** dmellado has joined #openstack-infra09:05
*** ykarel has quit IRC09:05
*** ykarel has joined #openstack-infra09:06
*** jbadiapa has quit IRC09:09
*** jbadiapa has joined #openstack-infra09:10
*** dmellado has quit IRC09:12
*** dmellado has joined #openstack-infra09:14
*** gouthamr has joined #openstack-infra09:14
*** psachin has quit IRC09:16
*** gouthamr has quit IRC09:19
*** pkopec has quit IRC09:20
*** ricolin has quit IRC09:21
*** bstinson has joined #openstack-infra09:21
*** pkopec has joined #openstack-infra09:21
*** gouthamr has joined #openstack-infra09:23
*** happyhemant has joined #openstack-infra09:23
*** gouthamr has quit IRC09:25
*** trident has quit IRC09:27
*** trident has joined #openstack-infra09:29
*** dmellado has quit IRC09:29
*** gouthamr has joined #openstack-infra09:32
*** dmellado has joined #openstack-infra09:32
*** dmellado has quit IRC09:32
*** pkopec has quit IRC09:33
*** dmellado has joined #openstack-infra09:35
*** pkopec has joined #openstack-infra09:35
*** gouthamr has quit IRC09:37
*** ykarel is now known as ykarel|lunch09:40
*** gouthamr has joined #openstack-infra09:42
*** gouthamr has quit IRC09:43
*** pkopec has quit IRC09:44
*** dmellado has quit IRC09:44
*** dmellado has joined #openstack-infra09:47
*** jaosorior has quit IRC09:49
*** dmellado has quit IRC09:54
*** gouthamr has joined #openstack-infra09:54
*** pkopec has joined #openstack-infra09:55
*** dmellado has joined #openstack-infra09:56
*** emine__ has joined #openstack-infra09:57
*** AJaeger_ is now known as AJaeger09:57
*** xek has quit IRC09:59
*** Emine has quit IRC10:00
*** gouthamr has quit IRC10:00
*** dmellado has quit IRC10:02
*** ykarel|lunch is now known as ykarel|meeting10:03
*** dmellado has joined #openstack-infra10:06
*** gouthamr has joined #openstack-infra10:06
*** ociuhandu has joined #openstack-infra10:07
*** virendra-sharma has quit IRC10:07
*** kjackal has quit IRC10:09
*** kjackal has joined #openstack-infra10:10
*** dmellado has quit IRC10:11
*** gfidente has joined #openstack-infra10:12
*** ociuhandu has quit IRC10:14
*** dmellado has joined #openstack-infra10:15
*** ociuhandu has joined #openstack-infra10:17
*** gouthamr has quit IRC10:18
*** dmellado has quit IRC10:22
*** dmellado has joined #openstack-infra10:24
*** gouthamr has joined #openstack-infra10:26
*** ccamacho has quit IRC10:30
*** ccamacho has joined #openstack-infra10:31
*** gouthamr has quit IRC10:35
*** jpena|off is now known as jpena10:40
*** gouthamr has joined #openstack-infra10:41
*** gouthamr has quit IRC10:43
*** dmellado has quit IRC10:44
*** dmellado has joined #openstack-infra10:49
*** whoami-rajat has joined #openstack-infra10:51
*** gouthamr has joined #openstack-infra10:52
*** bhavikdbavishi has quit IRC10:56
*** gouthamr has quit IRC10:57
*** jpena is now known as jpena|lunch10:58
*** yamamoto_ has quit IRC11:02
*** dmellado has quit IRC11:04
*** ykarel|meeting has quit IRC11:05
*** dmellado has joined #openstack-infra11:05
*** ykarel|meeting has joined #openstack-infra11:05
*** gouthamr has joined #openstack-infra11:06
yoctozeptoany ask.o.o moderators lurking in here? I would like to ask how long it would take for my answer to get through https://ask.openstack.org/en/answers/122773/revisions/ thanks in advance11:07
*** ccamacho has quit IRC11:11
*** yamamoto has joined #openstack-infra11:13
*** kjackal has quit IRC11:24
*** goldyfruit has quit IRC11:33
*** kjackal has joined #openstack-infra11:33
*** yikun has quit IRC11:33
*** Lucas_Gray has quit IRC11:38
*** Lucas_Gray has joined #openstack-infra11:41
fungiyoctozepto: would you like to be a moderator?11:43
*** EmilienM is now known as EvilienM12:02
*** rh-jelabarre has joined #openstack-infra12:03
openstackgerritMarcin Juszkiewicz proposed opendev/system-config master: do backports for Debian 'buster'  https://review.opendev.org/66815712:05
*** ykarel|meeting is now known as ykarel12:05
*** ykarel is now known as ykarel|afk12:13
*** dpawlik has quit IRC12:15
*** dpawlik has joined #openstack-infra12:20
*** udesale has quit IRC12:22
*** udesale has joined #openstack-infra12:23
*** jpena|lunch is now known as jpena12:23
*** iurygregory has joined #openstack-infra12:24
*** electrofelix has joined #openstack-infra12:29
*** tdasilva has joined #openstack-infra12:35
*** rlandy has joined #openstack-infra12:37
*** jbadiapa has quit IRC12:37
slaweqfungi: hi12:39
slaweqfungi:sorry for bothering You again but I have one more question related to tobiko12:39
slaweqfungi: can You also remove docs from https://docs.openstack.org/tobiko/latest/ ?12:39
*** pcaruana has quit IRC12:40
*** goldyfruit has joined #openstack-infra12:40
*** pcaruana has joined #openstack-infra12:41
*** ekultails has joined #openstack-infra12:44
*** lseki has joined #openstack-infra12:48
openstackgerritSean McGinnis proposed openstack/project-config master: Retire the release-schedule-generator project  https://review.opendev.org/66804512:49
*** rfarr_ has joined #openstack-infra12:51
*** iurygregory has quit IRC12:52
*** rfarr__ has joined #openstack-infra12:53
*** eharney has joined #openstack-infra12:53
*** rfarr_ has quit IRC12:56
*** pkopec has quit IRC13:02
*** pkopec has joined #openstack-infra13:03
openstackgerritFlavio Percoco proposed zuul/nodepool master: Allow nodepool for using in-cluster configs  https://review.opendev.org/66817213:11
fungi#status log deleted /afs/.openstack.org/docs/tobiko at slaweq's request as a member of https://review.opendev.org/#/admin/groups/tobiko-core13:13
openstackstatusfungi: finished logging13:13
slaweqfungi: thx a lot13:13
fungino problem13:13
*** lpetrut has joined #openstack-infra13:23
*** mriedem has joined #openstack-infra13:24
*** pkopec has quit IRC13:24
*** sthussey has joined #openstack-infra13:25
*** pkopec has joined #openstack-infra13:25
*** yamamoto has quit IRC13:31
*** aaronsheffield has joined #openstack-infra13:31
yoctozepto> <fungi> yoctozepto: would you like to be a moderator?13:36
yoctozeptoare you a work-seller or something? ;D13:36
yoctozeptowill think about it, for now I just wanted the answer to pass :D13:37
fungiyoctozepto: sure, and no just reminding that the way things get moderated more quickly is by having more moderators ;)13:39
* fungi is technically a moderator by way of being an administrator of the service for purposes of basic troubleshooting, but not really an active moderator in practice13:39
*** jbadiapa has joined #openstack-infra13:42
yoctozeptofungi: tricky13:42
fungiyoctozepto: the other sneaky thing about that service is that if you answer enough questions it makes you a moderator automatically... be careful! ;)13:45
fungi(moderator privs are automatically granted once a certain karma threshold is reached, i forget what number exactly)13:45
yoctozeptofungi: then we may skip some steps I guess ;D13:49
yoctozeptoif it's going to get me anyway13:49
yoctozeptoI thought there is a chirograph for this13:49
openstackgerritMerged opendev/yaml2ical master: Add DTSTAMP and UID values to meeting instances  https://review.opendev.org/66796113:52
*** trident has quit IRC13:56
*** iurygregory has joined #openstack-infra13:58
*** trident has joined #openstack-infra13:58
fungiyoctozepto: done. if nothing else, your future answers won't go straight into the moderation queue, but the other mods probably appreciate if you check whether there's any pending items in the queue from time to time (drop-down from your username at the top when you're logged in, though it won't show the link to the queue if there's nothing waiting)14:03
yoctozeptofungi: k, thanks, think I will be looking out for kolla stuff, doing some good PR ;D14:04
*** yamamoto has joined #openstack-infra14:05
fungiyeah, really "moderating" just means approving anything which looks like an actual question/answer and occasionally discarding things which look like spam14:07
fungidoesn't mean you have to answer anything you approve14:08
*** ykarel|afk is now known as ykarel14:08
yoctozeptofungi: I didn't even dream of it14:09
*** dpawlik has quit IRC14:10
*** liuyulong has joined #openstack-infra14:10
yoctozeptook, the link's not there so we seem clean14:10
*** lpetrut has quit IRC14:11
*** jbadiapa has quit IRC14:12
*** jbadiapa has joined #openstack-infra14:12
*** yamamoto has quit IRC14:12
dirkinfra-root: is it normal that the fedora mirroring is broken? http://grafana.openstack.org/d/ACtl1JSmz/afs?orgId=1&from=now-90d&to=now&fullscreen&panelId=2314:23
*** mattw4 has joined #openstack-infra14:30
donnydclarkb will be available today to do some testing with my infra?14:35
donnydwill *you*14:35
corvusdirk: i don't think those two lines are supposed to be colinear, but they do seem to be for much of the time.14:36
dirkcorvus: well, http://grafana.openstack.org/d/ACtl1JSmz/afs?orgId=1&from=now-90d&to=now&fullscreen&panelId=15 is pretty telling14:36
corvusurg.  i broke my local afs client setup testing the kafs stuff, and haven't rebooted back into openafs...14:37
corvusi guess i can use mirror-update14:37
clarkbdonnyd: yes, my day should start in about an hour14:37
donnydok thanks14:37
corvusdirk: mirror.fedora.readonly     800000000  799973784  100%<<       33%    <<WARNING14:38
corvushow big should it be? :)14:38
corvus(also, wow, it had a 100GB jump in 1 day)14:39
corvusshould we make it 1tb?14:39
clarkba good chunk of that is the atomic images, which I think we may not need to mirror in its entirety and instead choose the one or two images our users need (but that is more work than bumping volume size)14:40
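
To put numbers on the quota line corvus pasted, the sketch below parses it and computes how full the volume is now and how full it would be at a 1tb quota. It assumes the first three columns are volume name, quota, and used space in kilobytes (the usual `fs listquota` layout) and that "1tb" means 1000000000 KB; the helper names are made up for illustration.

```python
def parse_quota_line(line):
    """Return (volume name, quota in KB, used KB) from a quota listing line."""
    fields = line.split()
    return fields[0], int(fields[1]), int(fields[2])


def percent_used(quota_kb, used_kb):
    """Share of the quota that is in use, as a percentage."""
    return 100.0 * used_kb / quota_kb


line = "mirror.fedora.readonly     800000000  799973784  100%<<       33%    <<WARNING"
name, quota_kb, used_kb = parse_quota_line(line)

print(name, quota_kb - used_kb)  # KB still free under the current quota
print(round(percent_used(1000000000, used_kb), 1))  # usage at a 1tb quota
```

Under those assumptions the volume has only a few tens of megabytes of headroom left, and a 1tb quota would put it at roughly 80% used.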
*** lpetrut has joined #openstack-infra14:46
*** xek has joined #openstack-infra14:48
*** panda has quit IRC14:48
*** panda has joined #openstack-infra14:48
fungijust a heads up, i'm disappearing in a bit to run some pre-travel errands but should return in time for the storyboard bug scrub at 16:15z14:48
fungii'll be in and out over the rest of the day so may be slow to answer14:49
*** pgaxatte has quit IRC14:51
*** pkopec has quit IRC14:53
*** pkopec has joined #openstack-infra14:57
*** mattw4 has quit IRC15:04
*** lpetrut has quit IRC15:05
*** yamamoto has joined #openstack-infra15:08
*** bobh has joined #openstack-infra15:09
donnydclarkb what do I need to get setup first?15:10
*** whoami-rajat has quit IRC15:11
*** rajinir has joined #openstack-infra15:12
clarkbdonnyd: if we can get an openstackci tenant/project/user to run the mirror in and a openstackzuul tenant/project/user for the test nodes that is probably step zero. Then we can work on quotas and spinning up some initial instances15:15
donnydDoes that name need to be specific15:15
*** igordc has joined #openstack-infra15:17
clarkbno, those names are what we typically call them but it will go in a clouds.yaml so the two names can be more arbitrary15:17
*** whoami-rajat has joined #openstack-infra15:19
*** jistr is now known as jistr|off15:20
donnydok15:22
donnydWhat's next15:23
openstackgerritPaul Belanger proposed opendev/system-config master: Retire pabelanger as infra-root  https://review.opendev.org/66819215:23
clarkbusually once that is in place we spin up a mirror and a test node and do some sanity checking that stuff like nova unittests and devstack work15:24
clarkbthat may require adding or tweaking nova flavors15:25
clarkbthen once sanity checks look good we update the nodepool config to build images in that cloud then to launch test nodes15:26
*** xek has quit IRC15:26
*** ykarel is now known as ykarel|away15:26
pabelangerinfra-root: I think the time has come for me to step down from day to day duties as infra-root^, I sent a message to ML too. I still plan to be here, just that I don't have the bandwidth any more for day to day infra-root duties :(15:26
clarkbdonnyd: if we get account details shared I can start on spinning up a mirror node15:28
donnydsure can15:28
*** jpena is now known as jpena|off15:31
*** jpena|off has quit IRC15:31
*** bhavikdbavishi has joined #openstack-infra15:35
*** chandankumar is now known as raukadah15:40
*** jistr|off is now known as jistr15:42
corvuspabelanger: i, uh, have left a friendly -1 with a suggestion on your removal request :)15:50
*** ramishra has quit IRC15:52
*** zhangfei has quit IRC16:01
*** emine__ has quit IRC16:04
*** mattw4 has joined #openstack-infra16:08
clarkbheh I've just realized I said openstackci and openstackzuul instead of opendevci and opendevzuul. As mentioned before the names don't matter so much because they are hidden behind logical clouds.yaml config. Oh well16:10
clarkbalso TIL openstackclient doesn't default to the one cloud you have defined in a clouds.yaml if there is only one cloud specified16:10
*** mattw4 has quit IRC16:12
pabelangercorvus: thanks! replied with some thoughts16:14
*** mattw4 has joined #openstack-infra16:15
*** efried is now known as fried_rolls16:15
*** ociuhandu has quit IRC16:16
*** yamamoto has quit IRC16:16
*** yamamoto has joined #openstack-infra16:17
clarkbinfra-root I'm going to take the secrets lock and add donnyd's cloud details16:18
*** mriedem is now known as mriedem_away16:18
corvuspabelanger: ah, excellent point.  i'm okay with the status quo of access to all servers, but maybe shell access to the zuul/nodepool cluster only would be appropriate if we think that's better.16:18
*** lucasagomes has quit IRC16:19
*** Lucas_Gray has quit IRC16:19
corvusmaybe we can get a quorum of infra-root together next week and mull over it some more16:20
corvus(since it seems we're sort of light today)16:20
pabelangerYup, no rush for removal, could even bring it up at meeting next week16:20
*** udesale has quit IRC16:26
*** jpich has quit IRC16:28
*** jtomasek has quit IRC16:35
clarkbinfra-root I've updated the secrets file with fortnebula cloud and contact details16:35
clarkbthe passwords were changed from those that were given to me16:35
clarkbI'll get a change up for clouds.yaml updates shortly16:35
*** diablo_rojo has joined #openstack-infra16:36
*** iurygregory has quit IRC16:38
*** cmurphy is now known as cmorpheus16:41
corvusclarkb, donnyd: \o/ thanks!16:41
*** kmalloc is now known as needscoffee16:47
AJaegerconfig-core, could you review https://review.opendev.org/#/c/667900/ and https://review.opendev.org/531829 , please? These are some cleanups...16:50
*** gfidente has quit IRC16:52
openstackgerritJeff Liu proposed zuul/zuul-operator master: Add playbooks folder and zuul job that installs k8s  https://review.opendev.org/66802916:54
openstackgerritClark Boylan proposed opendev/system-config master: Add clouds.yaml entries for fortnebula cloud  https://review.opendev.org/66820616:57
clarkbdonnyd: corvus ^ I've not yet updated hiera/host vars on bridge but maybe give that a quick look to make sure I haven't missed anything obvious16:58
clarkbfungi: ^ you too if around16:58
clarkbdonnyd: I've realized that we may want to sync up with ianw on the plan for the mirror node as he is trying to transition us to bionic mirror nodes. Given that maybe I'll start sanity checking with a test node and no mirror node to run nova unittests and devstack16:59
clarkbin either case getting the clouds.yaml in place will drastically simplify the work as I'll be able to use our normal tooling16:59
clarkbI need to step out for a bit but I'll be back to add entries to hiera/host vars and maybe approve that change if it looks good to others17:03
donnydok thanks17:07
fungiokay, back and catching up on highlights (actually got back just in time for the sb bug scrub but have been focusing on that until now)17:09
*** ralonsoh has quit IRC17:18
*** ricolin has joined #openstack-infra17:19
openstackgerritJames E. Blair proposed zuul/zuul master: WIP: Add extra tenant config  https://review.opendev.org/66821417:21
openstackgerritAdam Coldrick proposed opendev/storyboard-webclient master: Render user-submitted links with nofollow attribute  https://review.opendev.org/64832217:23
*** e0ne has joined #openstack-infra17:26
*** mriedem_away is now known as mriedem17:27
*** electrofelix has quit IRC17:30
openstackgerritKendall Nelson proposed opendev/storyboard master: Link development.rst to contributing.rst  https://review.opendev.org/64596017:30
donnydI am still hammering on getting ipv6 functioning correctly17:31
*** ykarel|away has quit IRC17:34
fungiproblems with neutron config or network gear?17:36
*** davecore has joined #openstack-infra17:37
openstackgerritKendall Nelson proposed opendev/storyboard master: Link development.rst to contributing.rst  https://review.opendev.org/64596017:37
*** witek has quit IRC17:40
openstackgerritKendall Nelson proposed opendev/storyboard master: Add StoryBoard Story Tags to Docs  https://review.opendev.org/66748217:42
*** ricolin has quit IRC17:45
*** ykarel|away has joined #openstack-infra17:49
*** whoami-rajat has quit IRC17:51
*** e0ne has quit IRC18:12
*** e0ne has joined #openstack-infra18:13
*** tesseract has quit IRC18:13
*** e0ne has quit IRC18:16
clarkbtaking the hiera/hostfiles lock now18:16
clarkber maybe I'll fix my change's -1 first18:16
openstackgerritClark Boylan proposed opendev/system-config master: Add clouds.yaml entries for fortnebula cloud  https://review.opendev.org/66820618:17
clarkbok hiera/host vars lock released18:22
clarkbthe data is up to date and 668206 should be ready for approval18:22
*** e0ne has joined #openstack-infra18:26
*** e0ne has quit IRC18:28
mnaserare there any issues with ovh-gra118:29
mnaserhttp://logs.openstack.org/03/668003/2/gate/openstack-tox-docs/bec3243/job-output.txt.gz18:30
mnaserssh: connect to host 213.32.77.161 port 22: No route to host\r\nrsync: connection unexpectedly closed (0 bytes received so far) [Receiver]\nrsync error: unexplained error (code 255) at io.c(226) [Receiver=3.1.1]18:30
clarkbmnaser: normally that is a side effect of zuul memory issues causing zookeeper client disconnects which results in cleanup of the test nodes. But zuul scheduler memory looks fine http://cacti.openstack.org/cacti/graph.php?action=view&local_graph_id=64792&rra_id=all18:32
clarkbpossible that something else has caused zookeeper network connectivity issues. Let me see if grep shows anything in the zuul logs18:32
mnaserlooks like the fetch-output worked fine until the rsync18:32
mnaserso within 3 seconds it disappeared there18:32
clarkbgrep -i kazoo doesn't show anything in the zuul scheduler log18:34
clarkbcorvus: ^ am I grepping for the wrong thing or does that mean we have to consider a different failure mode here?18:34
* clarkb pulls up nodepool logs for that node18:36
clarkb2019-06-28 18:15:04,076 DEBUG nodepool.DeletedNodeWorker: Marking for deletion unlocked node 0008509452 (state: used, allocated_to: 200-0004128157)18:37
clarkbthat seems to happen after the rsync failure so I think that implies this is a different failure mode18:38
clarkbmnaser: any idea if that is a persistent issue? if not maybe it was a short network hiccup in that region or between regions?18:39
clarkbI guess we can keep an eye on it and if the problem continues or gets worse we can loop in ovh18:39
mnaserclarkb: it was the first one but i usually speak up early :P18:39
clarkbya that particular failure type is often tied to a global zuul memory issue, just doesn't seem to be this time18:39
clarkbso hard to pin down without more data (and more failures == more data :( )18:40
mnaseri'll recheck and if i see others i'll loop back18:40
openstackgerritMerged openstack/project-config master: Remove release-openstack-python-without-pypi  https://review.opendev.org/53182918:41
clarkbAJaeger: ^ theres one done18:41
clarkbcorvus: https://review.opendev.org/#/c/668206/ zuul is happy there now and hiera/hostvars are in place if you want to reack that18:42
fungilogan-: network connectivity still looks pretty consistently broken, if you've missed the earlier mentions: http://cacti.openstack.org/cacti/graph.php?action=view&local_graph_id=6493418:44
fungitoday it seems it's been unreachable >90% of the time18:45
logan-yep fungi, thanks. I saw, just haven't had a chance to track it down yet18:45
fungino worries, just wanted to make sure you're aware!18:45
logan-yeah its crazy, really weird18:45
*** pkopec has quit IRC18:45
*** pkopec has joined #openstack-infra18:48
*** ykarel|away has quit IRC18:52
AJaegerthanks, clarkb18:53
*** EvilienM is now known as EmilienM19:05
*** tdasilva has quit IRC19:06
corvusclarkb: hah, of course we have 2 copies of that :)19:12
*** whoami-rajat has joined #openstack-infra19:14
*** yamamoto has quit IRC19:14
*** diablo_rojo has quit IRC19:23
openstackgerritMerged opendev/system-config master: Add clouds.yaml entries for fortnebula cloud  https://review.opendev.org/66820619:31
clarkbcorvus: thanks19:31
clarkbonce that is in place on the nodepool nodes I think my next step is to upload a current bionic image, then boot a test node19:32
clarkbdonnyd: ^ let me know if you think that will conflict with your network debugging19:32
clarkbgitea06 looked a lot happier today too (load average wise)19:32
donnydno, you can push forward19:33
donnydI have it about figured out19:33
*** dklyle has quit IRC19:36
donnydlogan - Do you think you can give me a hand with T/S ipv6 issues19:40
donnydWhen i setup slaac from the openstack router, the instance gets no ip address19:42
dustinccan anyone here help me understand how to request CI rechecks? is there a doc somewhere? (google failed me)19:48
dustincspecifically for https://review.opendev.org/#/c/659690/19:48
clarkbdustinc: if you leave a comment of 'recheck' on the top level of the change it will rerun the tests19:48
clarkbI'm guessing we document that in the manual /me looks19:49
*** fried_rolls is now known as efried19:49
dustincis there a way to get it to just rerun specific tests?19:49
donnydlogan NM, its working with stateless dhcpv619:50
clarkbdustinc: no, we've found in the past that people have been able to recheck changes that are broken and break testing enough times that they sneak through, so we want to avoid being able to lock in results19:50
dustincthat makes sense. thanks for your helkp19:50
dustinchelp*19:50
*** yamamoto has joined #openstack-infra19:50
clarkbdustinc: https://docs.openstack.org/infra/manual/developers.html#automated-testing list entry 619:51
clarkbthat might be more hidden than it should be :/19:51
*** mattw4 has quit IRC19:51
dustinconce I toggled CI comments on the review I noticed that there was a link in the results comment..oops.. :)19:51
dustinc(to that exact doc)19:52
*** diablo_rojo has joined #openstack-infra19:52
logan-donnyd just saw your message since it didn't trigger my nick highlight, but glad you got it working :)19:57
openstackgerritJames E. Blair proposed zuul/zuul master: Add extra tenant config  https://review.opendev.org/66821420:00
*** yamamoto has quit IRC20:01
*** mattw4 has joined #openstack-infra20:02
openstackgerritJames E. Blair proposed zuul/zuul master: Add extra-config-paths tenant config option  https://review.opendev.org/66821420:02
corvusclarkb: you might express your concern about that being more hidden than it should be with a review of this change: https://review.opendev.org/65578320:08
clarkbfungi: ^ did you muster enough js fu to make the change you talked about there?20:09
corvus(fwiw, any of those ideas would be fine with me; though i would be perfectly comfortable with renaming the button for now and if anyone later feels like contributing an improvement to only show the last, great)20:11
clarkbya I think renaming the button would be sufficient20:13
clarkbshould we just push a quick patch for that?20:13
clarkbthough rereading this change I think this will break the summary table for Zuul?20:15
clarkbis that a desired outcome?20:15
*** diablo_rojo has quit IRC20:16
corvusnope20:16
clarkbthe ciRegex is used to find CI comments and that is then used to both build the summary table and hide comments I think20:17
clarkb(my js isn't great)20:17
corvusin that case, given what would be needed to fix it properly, implementing the 'show last' suggestion might not be much harder20:18
donnydok, it should all be working now20:18
*** dklyle has joined #openstack-infra20:18
clarkbdonnyd: cool, we just missed the cutoff for the previous round of ansible on the host with the images so will be about 15 minutes before I can start an image upload but will do so then20:19
donnydif logan- has a few minutes to review my ipv6 setup that would be great20:20
logan-donnyd: sure20:20
*** slaweq has quit IRC20:23
clarkbcorvus: fungi: I'm deciphering js now to be more sure of that concern20:24
*** bhavikdbavishi has quit IRC20:25
clarkbcorvus: fungi left a comment on the change with my findings20:28
*** diablo_rojo has joined #openstack-infra20:32
fungiclarkb: no, i haven't had time to look into how to go about trying to match on all but the last comment for a given account20:38
openstackgerritJames E. Blair proposed openstack/project-config master: Adjust jobs running on zuul-jobs  https://review.opendev.org/66824820:38
clarkbfungi: want to check my comment for the new concern to see if I've read js properly?20:38
fungii think you're right, yeah20:38
openstackgerritJames E. Blair proposed openstack/openstack-zuul-jobs master: Remove upload-git-mirror test job  https://review.opendev.org/66824920:39
corvusAJaeger, clarkb: ^ those 2 changes are my agreement with AJaeger in #zuul  :)20:40
clarkblooking20:41
openstackgerritMerged zuul/zuul-jobs master: Split job definitions into multiple files  https://review.opendev.org/66804020:46
openstackgerritMerged zuul/zuul-jobs master: Add a simple test of the install-nodejs role  https://review.opendev.org/66804320:46
clarkbcurrent ubuntu bionic image on nb02 is being uploaded to fortnebula cloud now20:50
clarkbonce that is in I can test booting an instance manually then running something like nova unittests on it20:50
openstackgerritMerged zuul/zuul-jobs master: Move upload-git-mirror test job in-repo  https://review.opendev.org/66805020:55
*** mattmceuen has joined #openstack-infra20:55
mattmceuenHi folks, happy Friday!  Whenever someone gets a minute, I'd appreciate if I could be added as a seed member of the airship-docs-core gerrit group20:56
corvusmattmceuen: on it20:56
mattmceuenty corvus!20:56
corvusmattmceuen: att.com address?20:57
mattmceuenyep that's fine -- mm9745@att.com20:57
corvusmattmceuen: er, the one gerrit has for you is different, but it's the only matt mceuen with an att.com address, so i probably got the right account.  you should be gtg.  :)20:58
mattmceuencorvus: I have all the buttons enabled I need!  appreciate your help sir!20:58
corvusmattmceuen: np20:58
clarkbspeaking of buttons https://arstechnica.com/science/2019/06/behind-the-scenes-at-nasas-newly-restored-historic-apollo-mission-control/21:00
clarkbNow I need an excuse to go to houston21:00
mattmceuenoh wow!21:00
donnydthat is pretty awesome21:00
donnydThe only part I dislike about raw images is upload time...21:03
clarkbdonnyd: indeed21:03
fungii'm going to build my next workstation stand to look like a bank of those consoles21:03
fungior maybe an arcade, i can't decide21:03
openstackgerritMerged openstack/project-config master: Adjust jobs running on zuul-jobs  https://review.opendev.org/66824821:04
*** kjackal has quit IRC21:05
clarkbdonnyd: note that if you end up fiddling with our images too, that they require config-drive on the instance and root is what ends up configured with ssh keys21:06
clarkb(we don't use cloud-init, we use glean instead which is far simpler but comes with a few of those restrictions)21:06
donnydI wish i had time to fiddle... too many irons, not enough fire21:07
clarkbno worries21:07
clarkbupload is done. Figuring out a boot next21:08
clarkboh I remember what I was forgetting now21:08
clarkbcloud launcher config21:08
*** diablo_rojo has quit IRC21:10
openstackgerritClark Boylan proposed opendev/system-config master: Add fornebula cloud to cloud launcher  https://review.opendev.org/66825121:11
clarkbI'll manually upload my key now too so I don't have to wait for that21:11
*** slaweq has joined #openstack-infra21:11
donnydclarkb: thanks for sticking with this all day. Much appreciated21:12
*** kopecmartin has quit IRC21:13
clarkbclarkb-test1 is booting now21:15
clarkbdonnyd: and thank you for the generous offer of test resources21:15
*** slaweq has quit IRC21:15
*** pcaruana has quit IRC21:16
*** rfolco is now known as rfolco|off21:16
donnyddo you think local mirrors would help speed things up a bit (.deb  / .rpm)21:18
fungithat's basically what our static instance there will be21:19
donnydAh I C21:19
fungiwe stick one in every provider region where we run jobs, to improve response time, even out connectivity hiccups, and reduce pressure on the external network21:19
donnydyea that makes lots of sense21:20
*** dpawlik has joined #openstack-infra21:20
clarkbreduces pressure on the ipv4 nat too (if only github and docker hub could ipv6)21:20
fungigenerally a mix of apache proxy caches with some cache-busting overrides to make the caches more effective and also some afs-backed mirrors we build up via rsync or reprepro21:21
donnydthere are only two tenants (currently) in this "cloud"21:21
fungisince afs also does client-side caching21:21
clarkbfwiw I got into the instance just fine over ipv6 and am installing deps for nova unittests21:21
clarkbso far everything looks great21:21
donnydnice21:22
*** dpawlik has quit IRC21:22
donnydAny tuning I can do to ceph for this particular workload?21:22
donnydi tuned as best i could for IOPS21:23
fungimnaser may have suggestions... i know our vexxhost nodes are on ceph-backed volumes21:23
clarkbNot sure about ceph. For kvm you can tell it to not wait on writeback confirmation for data writes since we only run ephemeral workload there21:23
clarkbI don't know if kvm + ceph changes that21:23
clarkbhttps://doc.opensuse.org/documentation/leap/virtualization/html/book.virt/cha.cachemodes.html#cachemodes.descr we are happy with unsafe or writeback21:24
clarkbI think if both tempest and nova come back as happy the only hurdle left will be spinning up a mirror node and I want to sync with ianw on that to make sure we set that up in a way that makes sense for him as he has been working on updating those recently21:29
clarkbgood chance we could have this in production on monday assuming things go smoothly21:29
clarkbI'll have to sort out what the run tempest easy mode method is now that d-g is mostly retired21:31
clarkbthough I guess we have d-g running stable branch tests? that might be sufficient21:31
clarkbRan: 16748 tests in 501.0000 sec.21:34
donnydnot sure if that is a good or bad number21:34
funginow to figure out if that's fast or not, yeah ;)21:34
clarkbhttp://logs.openstack.org/15/651315/7/check/openstack-tox-py27/39f3565/job-output.txt.gz#_2019-06-28_03_56_17_96558721:35
clarkbseems to be within the same range (note this run ran more tests than that run for some reason)21:35
clarkbhttp://logs.openstack.org/15/651315/6/check/openstack-tox-py27/9131333/job-output.txt.gz#_2019-06-15_16_02_52_11039221:36
clarkbso ya I think we are in the ballpark21:36
fungii like that stestr is reporting to 4 decimal places when it is clearly only summarizing to second granularity21:36
clarkbtime to boot a second node to try a tempest run21:37
donnydI am hopeful that when my disks finally finish syncing that this thing will get a little faster...21:38
donnydseems like the raid sync is taking forever....21:38
*** happyhemant has quit IRC21:39
clarkbas an interesting datapoint the mysql tests take about twice as long but the rbd etc tests seem to be about the same21:39
clarkbso ya disk performance could explain that21:39
clarkb(and the writeback thing might be a way to improve that if it remains a problem)21:40
donnydits already set21:40
donnydits on network=writeback21:40
donnydso it should be using the rbd cache21:40
donnydbut horizon is painfully slow and the api's are not exactly snappy21:43
donnydso I am hopeful the latency is in the sync21:44
donnydif not... I am going to have to get some other disks for the controllers21:44
clarkbI wouldn't worry about it right now. We do have variable performance across clouds so we should do a wider check before deciding that is significantly slower21:45
donnydWell I am a little bummed out, I just replaced these controllers with much newer servers and they do not seem to be any faster... if anything they are slower21:46
clarkbah if you have your own numbers I'll definitely defer to those :)21:48
clarkbI have a devstack-gate reproduce.sh running now21:48
clarkbthere is a reasonable chance I didn't edit its env vars enough to make it work without a zuul21:49
clarkbbut we'll see21:49
donnydwhat kind of opstools are people running these days?22:00
donnydlooked at https://wiki.openstack.org/wiki/Osops22:01
*** bobh has quit IRC22:01
donnydjust not sure if its current22:01
clarkbaround here we have a legacy set of puppet4 that is being transitioned to ansible with docker (and eventually we'll likely deploy into k8s but gitea was the plan for that and we discovered gitea can't run share everything yet)22:02
*** pkopec has quit IRC22:04
donnydwell I am going to go get some dinner. I will be back later22:07
*** diablo_rojo has joined #openstack-infra22:09
*** rlandy has quit IRC22:10
*** slaweq has joined #openstack-infra22:11
*** slaweq has quit IRC22:16
*** raissa has joined #openstack-infra22:19
openstackgerritKendall Nelson proposed opendev/storyboard master: Link development.rst to contributing.rst  https://review.opendev.org/64596022:24
*** auristor has quit IRC22:26
*** rfarr__ has quit IRC22:26
*** factor has joined #openstack-infra22:31
clarkbdevstack spinup seemed to be about twice that of the source for my reproduce.sh. That could be related to lack of local caches and such. Will keep fiddling with it (but first to get tempest results)22:34
*** rajinir has quit IRC22:42
*** goldyfruit has quit IRC22:47
*** diablo_rojo has quit IRC22:53
*** roman_g has quit IRC22:53
*** roman_g has joined #openstack-infra22:53
roman_gfungi> i'll see if i have some notes on how we've authorized them to do that in the past22:59
roman_gfungi: reminding :)22:59
roman_gfungi: need to contact Freenode team, ask them to OP someone from openstack-infra team, and then get channel registered to openstack group23:00
roman_gThank you!23:00
clarkbroman_g: we wanted to discuss with hogepodge when he is back from china about the correct way to set that up23:01
clarkbbut then corvus should be able to request that from freenode for us23:02
roman_gclarkb: oh, that's good. When and whom to contact later?23:07
clarkbroman_g: at this point maybe the best thing to do is start up an email thread? I think mostly we want to ensure that whoever has the keys for that new freenode namespace knows they have it and that they know to pass it on if necessary. And I think we were looking for hogepodge's feedback from the foundation side of things regarding that23:08
roman_gnamespace openstack exists already23:10
clarkbroman_g: correct, but #airship isn't in that namespace23:10
roman_gyes, we just want to include it there23:10
clarkbmeaning you want it to be part of the openstack namespace?23:11
*** mattw4 has quit IRC23:11
clarkbcorvus: ^ do you know if freenode relies on the unique prefix for that to happen or do they track it some other way?23:11
roman_gwell. I basically want channel to be registered & controlled by openstack infra and openstack IRC bots/services to work on it23:12
clarkbroman_g: yup that's the plan, in addition to that freenode has the concept of groups of channels that all have a super admin basically23:13
clarkband the thought was that #airship likely wants its own set of those23:13
clarkbthe bots and services can run there either way23:13
*** whoami-rajat has quit IRC23:13
*** Lucas_Gray has joined #openstack-infra23:14
roman_gwe are fine not having own super admin23:14
corvusclarkb: i believe it's by prefix; i think they can add a prefix to a group registration, or, perhaps nearly as easily create a new group registration23:16
clarkbok I think my preference would be for airship to be its own group if only to simplify management of the channels (people won't have to wonder or follow some hint thread to know who to talk to)23:18
corvusclarkb, roman_g: i lean toward a new group registration regardless (regardless of who actually ends up as an admin) just because it will make things easier to manage in the future, and we won't have to untangle airship from openstack23:18
clarkb++23:18
clarkband we can run that by hogepodge to make sure we aren't missing anything silly23:18
roman_gunderstood. thank you.23:19
*** tosky has quit IRC23:26
fungiyes, #opendev is already in its own group separate from #openstack, and the idea is that #zuul will probably ask for one of its own if it grows a need for additional channels at some point23:29
fungiroman_g: on other fronts, did you get a chance to try tumbleweed nodes again? we got the rsync mirror for it readded23:29
openstackgerritMerged opendev/storyboard master: Add StoryBoard Story Tags to Docs  https://review.opendev.org/66748223:33
*** aaronsheffield has quit IRC23:33
clarkbhrm tempest timed out23:34
clarkbI'm going to restart it manually then figure out dinner and come back to this later (maybe monday)23:34
*** sthussey has quit IRC23:34
roman_gfungi: tumbleweed works perfectly. Thank you!23:35
*** yamamoto has joined #openstack-infra23:36
fungiexcellent!23:36
roman_gI think you are one of the most responsive teams I've worked with. I appreciate your great help a lot.23:37
fungiwe try, thanks for being patient with us when we're busy or otherwise occupied!23:38
*** Wryhder has joined #openstack-infra23:39
clarkbok tempest is running without a timeout now23:39
donnydclarkb: do you know why it was timing out?23:40
*** Lucas_Gray has quit IRC23:40
*** Wryhder is now known as Lucas_Gray23:40
donnydi also may have to adjust my edge FW rules23:40
clarkbdonnyd: devstack-gate has a timeout built into it23:41
clarkbso that jobs stop within a reasonable amount of time if things go sideways. It hit that timeout23:41
clarkbthat implies things are slower than we want but without full logs we probably can't debug (hence rerunning to generate better data)23:41
donnydi will need to look into why... this thing has no load, 40G networking and many more fairly high performance parts..23:42
donnydI will look more into performance this weekend23:44
clarkbya the networking is quite quick. Did notice that. I think it may have to do with disk as database things seemed slow23:45
*** goldyfruit has joined #openstack-infra23:45
donnydwhen you say DB things seem slow.. do you mean API requests or do you mean DB(ish) things on the actual instance23:46
clarkbjudging by the nova unittest for applying database migrations: the actual db ish things on the instance23:46
*** ekultails has quit IRC23:46
donnydI have other storage I can use that will speed things up a lot... but will require a rebuild... and there is no redundancy23:47
clarkbdonnyd: as a step zero you might want to double check that the instance images are served from where you expect them to be from?23:47
clarkbone thing I realized is I didn't boot these from volume and you had mentioned ceph so is it possible they booted on local disk?23:48
donnydno23:48
donnydI can see the workload coming in on  Ceph23:48
donnydat abysmal speeds23:51
donnydjust makes no sense.. all my fio testing showed between 10k and 15K IOPS at 4K block sizes23:53
donnydplenty for most db operations23:53
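For a sense of scale, the fio figures above can be converted to sequential-equivalent throughput. This is only a rough sketch, assuming "4K" means 4 KiB (4096-byte) blocks; the function name is illustrative and the numbers say nothing about latency, which is what database workloads tend to feel first.

```python
def throughput_mib_per_s(iops, block_size_bytes=4096):
    """Convert an IOPS figure at a fixed block size into MiB/s."""
    return iops * block_size_bytes / (1024 * 1024)


if __name__ == "__main__":
    # The 10k and 15k IOPS bounds quoted from the fio testing.
    for iops in (10_000, 15_000):
        print(f"{iops} IOPS at 4 KiB = {throughput_mib_per_s(iops):.1f} MiB/s")
```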
*** auristor has joined #openstack-infra23:54
