Friday, 2019-08-16

*** igordc has quit IRC00:11
*** igordc has joined #zuul00:11
ianwShrews: ahh, sorry i'll loop back on that00:17
openstackgerritJames E. Blair proposed zuul/zuul master: WIP: super hacky demo of logfile under the manifest  https://review.opendev.org/67684300:31
corvuszuul-maint: high-priority reviews of "topic:fix-zuul-logs" will help folks inconvenienced by the switch to the build page and therefore they will like us more and maybe buy us beer00:34
*** igordc has quit IRC00:44
*** igordc has joined #zuul00:45
*** spsurya has joined #zuul01:14
*** bhavikdbavishi has quit IRC01:38
openstackgerritTristan Cacqueray proposed zuul/zuul master: web: highlight selected line  https://review.opendev.org/67684902:18
openstackgerritTristan Cacqueray proposed zuul/zuul master: web: highlight selected line  https://review.opendev.org/67684903:18
*** bhavikdbavishi has joined #zuul03:19
*** raukadah is now known as chkumar|ruck03:24
*** igordc has quit IRC04:11
*** bjackman_ has joined #zuul04:13
*** bjackman_ has quit IRC04:30
*** bjackman_ has joined #zuul04:37
fungii can review changes until my flight starts boarding05:00
fungiwill take a look05:00
openstackgerritIan Wienand proposed zuul/nodepool master: Add a dib-cmd option for diskimages  https://review.opendev.org/67219606:03
*** yolanda has joined #zuul06:55
*** saneax has joined #zuul07:48
ofososDid the location of job-output.txt change recently?10:50
*** bhavikdbavishi has quit IRC11:02
*** bhavikdbavishi has joined #zuul11:03
*** bhavikdbavishi has quit IRC11:22
*** mgoddard has quit IRC11:52
*** mgoddard has joined #zuul11:59
openstackgerritMark Meyer proposed zuul/zuul master: Rework a cache invalidation issue  https://review.opendev.org/67442512:28
*** rlandy has joined #zuul12:33
*** rlandy is now known as rlandy|rover12:33
openstackgerritTristan Cacqueray proposed zuul/zuul master: web: highlight selected line  https://review.opendev.org/67684912:35
openstackgerritTristan Cacqueray proposed zuul/zuul master: web: do not scroll into view more than once  https://review.opendev.org/67692412:35
tristanCcorvus: mordred: i'm working on a new implementation for line selection, without using anchors12:37
mordredneat12:37
openstackgerritTristan Cacqueray proposed zuul/zuul master: web: logfile highlight selected line  https://review.opendev.org/67684912:57
openstackgerritTristan Cacqueray proposed zuul/zuul master: web: logfile do not scroll into view more than once  https://review.opendev.org/67692412:57
openstackgerritTristan Cacqueray proposed zuul/zuul master: web: logfile do not use anchor for line selection  https://review.opendev.org/67692812:57
openstackgerritTristan Cacqueray proposed zuul/zuul master: web: logfile scroll into view a bit more  https://review.opendev.org/67692912:57
tristanChttps://review.opendev.org/676928 should gives a better result12:57
*** bjackman_ has quit IRC12:58
openstackgerritTristan Cacqueray proposed zuul/zuul master: web: logfile scroll into view a bit more  https://review.opendev.org/67692913:20
openstackgerritTristan Cacqueray proposed zuul/zuul master: web: logfile do not use anchor for line selection  https://review.opendev.org/67692813:20
openstackgerritTristan Cacqueray proposed zuul/zuul master: web: logfile support multi line selection through shift-click  https://review.opendev.org/67693713:20
tristanCmordred: here is a multi line selector implementation (that supports only one range)13:21
mordredtristanC: I'm fine with only one range :)13:21
tristanCwell highlighting multiple range shouldn't be hard, but making a non-confusing ui for the selection is rather difficult :)13:26
openstackgerritDavid Shrewsbury proposed zuul/nodepool master: DNM: testing openshift job  https://review.opendev.org/67694313:34
SpamapSsimplicity > corner case coverage13:35
corvusmordred, SpamapS: would you please reconsider your -1s on 676818?  i realize that in some clases when you click the line, it still doesn't appear in the right place, however, it works better than the current situation in that it works on page load, and we can improve it later.13:39
corvusnormally i like to merge perfect code, but since we already merged imperfect code, i'd like to merge more imperect code to make it less imperfect13:40
* Shrews almost spits out coffee13:40
openstackgerritAndreas Jaeger proposed zuul/zuul-jobs master: Remove sphinx_output_dir  https://review.opendev.org/67694513:41
AJaegercorvus, good morning. promote of openstack-manuals works and we can remove sphinx_output_dir again ^13:41
AJaegerbetter remove it before anybody uses it ;)13:41
*** jeliu_ has joined #zuul13:44
openstackgerritTristan Cacqueray proposed zuul/zuul master: manager: check if parent_layout exists before looking for errors  https://review.opendev.org/67694713:47
openstackgerritTristan Cacqueray proposed zuul/zuul master: github: properly exist when failing to get a pull request  https://review.opendev.org/67694813:50
tristanCcorvus: we setup a new monitoring system to detect stacktrace in zuul/nodepool logs yesterday, here are a couple of changes to prevent those, not sure that's correct though ^13:52
tristanCfwiw, here is the script we are now running: https://softwarefactory-project.io/cgit/software-factory/sf-ops/tree/scripts/monitor-traceback.py13:52
tristanCShrews: also, got this NotEmptyError in nodepool-launcher: http://paste.openstack.org/show/757967/13:54
ShrewstristanC: yeah, deleting znodes is racey because of how locks work13:56
ShrewstristanC: basically, you can have a znode locked, then try to delete it. Because the lock is part of the znode itself, once the lock data is deleted, another thread can attempt to lock the znode before it disappears, causing new lock data to appear.13:57
tristanCShrews: then shouldn't this be excepted and logged as error instead of exception?13:57
Shrews(if that makes sense)13:57
ShrewstristanC: maybe?13:58
corvustristanC: i can't figure out what 676924 does14:01
tristanCcorvus: not sure it's still useful, but it basically prevent the scroll from happening when browsing the file14:03
openstackgerritTristan Cacqueray proposed zuul/nodepool master: zk: except NotEmptyError in deleteRawNode  https://review.opendev.org/67695314:03
corvustristanC: it isn't working for me14:03
corvusi can't see any behavior change compared to the previous patch14:03
tristanCcorvus: it's because that patch still uses <a href="#">, next one removes the link14:04
tristanCcorvus: 676924 should be on top of 676928 to be effective14:04
corvustristanC: okay, that makes sense14:04
corvustristanC: one really minor note, if you click on a line visible when you're at the top, it will scroll. but this is still an improvement i think14:06
corvus(and yeah, scrolling when you click an anchor is standard behavior, but i also think it makes things difficult when it's something like a line in a logfile versus a section header -- it's hard for your eyes to follow the jump, so i think not scrolling on line selection is the way to go :)14:06
tristanCcorvus: we can use a state variable to remember if the initial scrollinto view already happened14:07
ShrewstristanC: i left a comment on that nodepool change14:10
openstackgerritTristan Cacqueray proposed zuul/zuul master: web: logfile remember if initial scroll was performed  https://review.opendev.org/67695414:11
tristanCcorvus: like so ^14:11
ShrewstristanC: also, nodepool-functional-openshift job seems consistently broken now for some reason14:14
corvustristanC: sweet, i have +2d the entire stack14:14
tristanCShrews: perhaps it's time to merge https://review.opendev.org/#/c/672785/14:15
corvushow did a gating job break?14:15
ShrewstristanC: shouldn't we care that it is broken with an older version of openshift?14:16
openstackgerritTristan Cacqueray proposed zuul/nodepool master: zk: except NotEmptyError in deleteRawNode  https://review.opendev.org/67695314:16
Shrewshttps://zuul.opendev.org/t/zuul/build/49a3d363d94e4008b3bb8b6876f1a3bc/log/job-output.txt#70614:17
Shrewsseems it can't recognize that the service is started14:17
tristanCShrews: corvus: the previous version may have been updated and now assume the node has correct firewall configurations14:17
tristanCShrews: i can extract the firewall task from 672785, but i think we should bump to 3.11 now as it is the version that feature the operator framework14:18
corvustristanC, Shrews: we also added a clear-firewall rule we could run before install-openshift if we want14:19
tristanCcorvus: it seems like we only needs to authorize 172.16.0.0/12, which seems safer to do on public network14:20
corvusk14:20
corvusanyway, if it looks like it's just the firewall that's the issue, that seems like a satisfactory explanation to me, and we can upgrade to fix it14:21
corvusmordred, SpamapS: also, all your concerns on 676818 are addressed in tristanC's followups, which all have a +2 from me14:23
corvusthe shortest path to happiness is to +3 676818 and the rest of the fix-zuul-logs stack14:23
corvustristanC: can you take a look at https://review.opendev.org/676843 and let me know if you like the idea?14:24
corvustristanC: i thought of it when i was switching between different logfiles looking for an error14:24
mordredcorvus: yeah- I agree - I've +2'd the stack too - waiting to see the previews for the +A just to be sure14:25
tristanCcorvus: i'll give it a try, iiuc, it is fetching the file content each time a select occurs right?14:27
corvustristanC: yes, i agree we should make your suggested change to keep the file content around if we do this.  mostly i'm wondering if we like the ui enough to continue working on this change14:28
corvus(many things are wrong with the change, mostly i'm just wondering -- is putting the file content below the tree a good idea?)14:29
tristanCcorvus: i'm terrible at designing such things... but i would put it in the tree if there was way to inline a panel without the tree indentation14:30
tristanCso that, when you look at tox logs for example, you can open each numbered file in order and get a complete view of what tox did14:31
corvustristanC: i feel like tox is the exception though, most of these files are hundreds or thousands (or tens of thousands) of lines long14:32
openstackgerritMerged zuul/zuul-jobs master: install-openshift: bump version to 3.11.0  https://review.opendev.org/67278514:32
corvusso that might push the rest of the tree way down14:32
tristanCcorvus: we also limit the height of the inlined panel14:34
tristanCcan* also14:34
corvustristanC: if we put part of the tree above, and part below, that can make it difficult to switch between logs14:36
tristanChere is an example of how it may looks like: https://softwarefactory-project.io/logs/65/15565/12/check/sf-ci-functional-allinone/45668f1/report.html  (though the panel do not respect window heights)14:36
corvusi guess you're saying that's a way to put all the logs on the page14:36
corvushrm, i'm not sure i like the double scrollbars14:38
corvusto browse a file, you have to line up the interior box with the viewport, then you can scroll within it14:38
tristanCcorvus: then we shouldn't inline the log in the tree14:38
corvus(i do agree that works really well for shorter files)14:39
tristanCperhaps we could get a similar result by stacking the file clicked (sorted by clicking order) under the tree14:39
tristanCalso, we can count line return, if there is not too many (e.g. less than ten), then we can still inline the content in the tree14:40
corvusi asked efried in #openstack-infra for thoughts too -- he had some feedback there14:41
corvusmordred: https://cb7d8e118e66e1fa69a1-f54717b5728ca51ec481953b9301c7c6.ssl.cf2.rackcdn.com/676954/1/check/zuul-build-dashboard/376fe86/npm/html/ is the preview for the end of the stack14:44
corvustristanC: followup question on 67694714:53
tristanCcorvus: replied15:05
corvustristanC: had the scheduler been running for a while?  and was that near a tenant reconfiguration event?15:07
tristanCcorvus: we restarted the scheduler at 2019-08-15 20:12:49 UTC; 18h ago15:08
tristanCcorvus: and there has been a full-reconfigure command at 2019-08-15 21:14:16,384,   about 3 minutes before the exception15:09
mordredtristanC, corvus: stack looks great! want me to push it in - or want to wait for efried?15:11
corvusmordred: go for it15:12
tristanCmordred: corvus: glad you like it15:12
mordredk. stack is +A15:13
tristanCpabelanger: tobiash corvus: regarding executor zone, iiuc this feature doesn't isolate merge job right? e.g. does a zone executor register merge gearman jobs and will perform merge task, even for un-related change?15:16
tristanCwe are looking into spawning an executor on a slow network, but it seems like it will need access to all of our connections15:17
corvustristanC: correct15:17
tristanCgit* connections15:17
tristanCcorvus: then would it be possible to implement merge zone, or disable the registering of merge gearman jobs for zoned executor?15:17
corvustristanC: i think the second idea would make the most sense15:18
corvus(also easier)15:18
dmsimardI wrongly assumed that zuul executors only merged their own things :)15:19
corvusthe zones are really supposed to be about the executor communicating with the test nodes.  so there really isn't a locality to tie them to changes or git sources.15:19
pabelangertopic:distributed-executors is work tobiash has been doing for improvements but yah, merges still happen15:19
corvusso given what you describe, i think just not doing more merges than necessary is the way to go15:20
corvusdmsimard: there are lots of things mergers do other than just preparing repos for running jobs; it's a lot of work, so if the executors have spare cycles, they pitch in15:20
corvusbut that's predicated on the idea that they are fast.  if the're off on a slow network, then they're not helping.  so better to just not have them participate in that case.15:20
pabelangerif we still had infracloud, that was something we likely wanted to implement there. It had slowish network too15:21
pabelangerdmsimard: on disk repos also help, we did that for ansible/ansible just down on alot of traffic pushing data to the node15:22
pabelangers/just/cut15:22
pabelangerhttps://opendev.org/zuul/zuul-jobs/src/branch/master/roles/mirror-workspace-git-repos15:23
openstackgerritTristan Cacqueray proposed zuul/zuul master: executor: add merge_jobs options to disable gearman merge jobs  https://review.opendev.org/67697415:27
openstackgerritJames E. Blair proposed zuul/zuul master: Collect more information from quickstart failures  https://review.opendev.org/67697615:31
corvusthe stack failed due to a quickstart error; ^ hopefully that will help us find the problem15:31
*** noorul has joined #zuul15:32
corvusi've reapproved it15:33
tristanCpabelanger: would it be possible to set the job zone without a nodepool pool/label? Or does the zone feature implies running a launcher too?15:34
clarkbtristanC: corvus ^ comments on your two most recent changes there15:36
tristanCbecause the second issue is that this zone would use https://wiki.centos.org/QaWiki/CI/Duffy, which doesn't have a nodepool driver, but we have ansible role to create the instance15:36
pabelangertristanC: yes, currently depends on having nodepool15:37
clarkbya as mentioned before the zoning has to do with accessibility and nodepool has to get things like ssh host keys15:37
pabelangeryah, was setup to solve FIP issue really15:38
tristanCi guess we can write a duffy driver, but that would be more difficult, and we would have to run another service there (with zookeeper lack of authentication, that's a bit of a problem...)15:38
clarkbsince the assumption is that you can't talk to the nodes unless in that zone then nodepool and executor are both going to live there15:38
clarkbzk has authentication...15:38
tristanCclarkb: well, it has authentication, but iiuc it doesn't enforce it15:38
clarkbiirc shrews tested it and it works fine15:38
pabelangerdid we add zone info to instance data? or just into zk directky15:39
pabelangerdirectly*15:39
clarkbtristanC: you can connect without auth but you can't access data behind auth without authing15:39
clarkbbasically tress (and subtrees) end up protected by acls (and authentication) and until you access that data it doesn't care about your auth state. This means you can start the tcp connection but you can't read the or write the data unless auth'd15:40
tristanCclarkb: can we put an acl on / or /zookeeper ?15:40
clarkbtristanC: yes I believe that is exaclty what shrews had tested15:40
tristanCthen i stand corrected, didn't realized it was possible to lock down the zk tree acl15:41
Shrewswell, i didn't test an acl on /zookeeper, just that the kazoo auth code worked15:42
clarkbShrews: you did test it on subtrees though iirc15:42
Shrewsi can't recall. i want to say "yes"15:42
corvusclarkb: q on 67697615:43
clarkbcorvus: responded. Hopefully that makes more sense15:44
clarkbit is early and I am probably not making sense :)15:44
corvusclarkb: nope that makes perfect sense. i just didn't see that i had left an extra line in there :)15:45
pabelangerwe've also thought about the idea of a zuul-executor on pem for partner for testing this, but haven't decided to do that yet or not.  Idea being, to reduce the size of zuul a partner would run15:45
Shrewsclarkb: tristanC: fwiw, i think this was my test script: http://paste.openstack.org/show/757970/15:45
tristanCShrews: clarkb: iirc, the concern was that malicious access could stress the cluster by creating on /, or maybe manipulate /nodepool15:46
tristanCand iirc it was considered a serious issue if anonymous access was possible to a zk cluster15:47
clarkbtristanC: with an acl on / no one should be able to access or create in that tree15:47
clarkbit is possible that you could cause problems creating too many tcp sessions15:47
clarkbbut that is true of any server you can talk to15:47
openstackgerritJames E. Blair proposed zuul/zuul master: Collect more information from quickstart failures  https://review.opendev.org/67697615:48
ShrewstristanC: i think that script addresses that concern. the last line troubles me as i do not recall why that didn't work, or why it wasn't a concern15:49
Shrewsor if it was a concern, why i forgot about it15:49
Shrewsmaybe my comment is outdated15:50
ofososnoorul: ready?15:50
Shrewsi guess subtrees are not addressed in that sample code15:51
ShrewstristanC: oh, i misread your comment. yeah, i don't think there's a way to prevent anonymous connections15:51
*** noorul has quit IRC15:51
Shrewsi'm not aware of one, at least15:52
clarkblooks like zookeeper merged sasl auth support ~23 days ago15:56
clarkbnext release will manage it at the connection level looks like15:56
tristanCShrews: clarkb: alright, thank you very much for the feedback. Then it seems like running a nodepool-launcher in that network is the right thing to do.16:02
*** noorul has joined #zuul16:08
noorulofosos: hi16:08
ofososnoorul: hi! all good?16:14
noorulofosos: hi16:16
ofososnoorul: hi16:17
noorulofosos: I could bring up zuul using docker-compose16:17
ofososcool16:17
noorulofosos: Issue was that it was waiting for gerrit to start16:17
openstackgerritMerged zuul/zuul master: JS: account for header when scrolling to line  https://review.opendev.org/67681816:17
noorulWhat is the next step?16:17
ofososSo, did you already have a look at the docs?16:17
noorulwhich doc?16:18
ofososWe need a "Zuul" user in bitbucket, then we need to configure the driver with that user.16:18
noorulI did that16:18
noorulhttps://zuul-ci.org/docs/zuul/admin/quick-start.html16:18
noorulIt has example for gerrit16:18
ofososhttps://021706778c3f130fc2cd-793c55b6fd3f2546ffa515e5af6fce40.ssl.cf2.rackcdn.com/674425/5/check/zuul-tox-docs/429bfef/docs/16:19
noorulIs there an example for Bitbucket zuul-config16:19
ofososhttps://021706778c3f130fc2cd-793c55b6fd3f2546ffa515e5af6fce40.ssl.cf2.rackcdn.com/674425/5/check/zuul-tox-docs/429bfef/docs/admin/drivers/bitbucket.html this one16:20
ofososThis contains the info you need for the bitbucket connection16:20
noorulSo as first step I need to have a zuul-config repo in bitbucket16:22
ofososYou can create a connection like this for bitbucket, after that you can decide if you want to have zuul-config in gerrit or in bitbucket. For simplicity, let's keep it in Gerrit (we have everythin in Bitbucket).16:23
noorulI don't have gerrit16:23
noorulI created a zuul-config repo16:23
ofososNot necessarily, the connection needs to be configured before we can pull from zuul-config. We also need a main.yaml with the tenant configuration.16:23
noorulHow is that done?16:24
noorulIs there a complete step by step documentation ?16:24
ofososOk, then in your /etc/zuul on the scheduler you should find two files. One is the config with the connections and one is the tenant config (usually main.yaml).16:24
ofososRemove the gerrit connection and add the bitbucket config with the credentials from the zuul user.16:25
noorulDone!16:25
ofososDo you already have a project in Bitbucket?16:25
ofososWhat's it named?16:25
noorulYes demo project16:25
ofososok, go to the tenant config and configure a config project, it should be `demo/zuul-config`16:26
ofosos`demo/zuul-config` that's the path16:26
ofososhttps://zuul-ci.org/docs/zuul/admin/tenants.html16:27
ofososThat's the link to the tenant config.16:27
noorulsource should be bitbucket right?16:27
ofososAfter that, start/restart the scheduler.16:27
ofososYes, if that is the name of the source you configured.16:27
ofososAFK for 7 minutes16:28
noorul- tenant:16:28
noorul    name: demo16:28
noorul    source:16:28
noorul      bitbucket:16:28
noorul        config-projects:16:28
noorul          - zuul-config16:28
noorul16:28
noorulIs that fine?16:28
*** yolanda has quit IRC16:30
corvusnoorul, ofosos: if you want, you can use etherpad to sketch out config files like that.  see this url: https://etherpad.openstack.org/p/rWL36RmF6W16:30
corvusand yeah, that looks right to me16:31
*** hwangbo has joined #zuul16:34
openstackgerritMerged zuul/zuul master: JS: Break log viewer out of the panel  https://review.opendev.org/67682716:36
noorulofosos: Added zuul.conf there. Can you review?16:36
*** yolanda has joined #zuul16:37
noorulIs opendev and mysql connections required?16:37
ofososNo, use demo/zuul-config instead of plain zuul-config16:37
noorulI meant /etc/zuul/zuul.conf16:38
noorulNot the pipeline configuration16:38
noorulofosos: Can you take a look at https://etherpad.openstack.org/p/rWL36RmF6W ?16:39
noorulFirst one is main.yaml16:39
noorulsecond one is zuul.conf16:39
ofososI corrected some stuff16:43
noorulWhat is server for?16:43
ofososThe driver has no sshkey option, please provide that in /root/.ssh/id_* on the executor.16:43
ofososServer is the API endpoint16:43
noorulThen it should be http right?16:43
ofososYou don't need server16:44
ofososBase URL will suffice for the API and cloneurl will suffice for GIT access16:44
noorulYou corrected that to add ssh://..16:44
noorulok, I see that it is removed now16:44
corvus(we should add sshkey support to the driver)16:45
ofososcorvus: I think we should add the ability of zuul to upload it's own key to bitbucket :)16:45
ofososI.e. attach the access key to a repo.16:45
ofososI think that's possible API wise. If noorul joins our party, it should be swiftly done ;)16:46
ofososnoorul: I think with the config you're now ready to spin up the processes.16:47
noorulofosos: started16:47
*** rlandy|rover is now known as rlandy|rover|brb16:47
corvusofosos: yeah, better bootstrapping sounds good -- but different connections will may different keys, so we can't rely on ~/.ssh/id_rsa being there for a particular connection -- so the other drivers allow you to specify a key16:47
corvuss/will may/may have/16:47
corvusanyway, just a note for us to come back to later; i'll leave it as a review comment16:48
corvus(don't want to derail the bootstrapping party)16:48
noorulThe driver has no sshkey option, please provide that in16:48
noorul         /root/.ssh/id_* on the executor.16:48
noorulofosos: Can you explain that a bit?16:48
ofososIt uses the default ssh identity of the user the executor runs as.16:49
noorulofosos: Are you saying that I should have zuul users private key at /root/.ssh/ folder?16:50
ofososYes, I am :)16:50
ofososBut in the first step we don't need it right away. First we have to check if the scheduler connects to the Bitbucket correctly. After that we'll grant access to the key in the zuul-config repo and restart once more.16:51
ofososAlso, the zuul-config repo will need "write" permssions to be granted to the zuul user.16:52
noorulI see16:52
noorulWhat is the next step?16:52
ofososThere are three things you have to do for every repo: put it in the tenant config, grant access to the access key in the repo and add the zuul user to the repo with "write" level permissions.16:53
ofososIf you've done that, start up the scheduler and look at the output.16:53
openstackgerritMerged zuul/zuul master: JS: add line numbers to log file  https://review.opendev.org/67683016:53
noorulofosos: Can you help to add one repo example in etherpad?16:54
AJaegercorvus: want to take https://review.opendev.org/676945 (remove sphinx_output_dir from zuul-jobs) before it gets used? I don't need it anymore...16:55
AJaegerOr do we need to deprecate that properly?16:55
ofososnoorul: I added one16:56
noorulok16:56
ofososYou still have to create the repo16:56
ofososBut,... you will have to push the sample code from the zuul-config repo that is included with gerrit into zuul-config. That'll make testing easier16:56
noorulI created16:56
ofososWhat do you get from the scheduler logs?16:57
ofososThe executor should check out two repos: zuul-config and test16:57
ofososAnd the scheduler should be running a loop with 60s delay and looking into those repos.16:57
noorulI am getting some other error related to alembic16:59
noorulI forgot the openstack pastie service16:59
noorulI could paste the error there16:59
*** panda has quit IRC17:00
noorulI am getting http://paste.openstack.org/show/757976/17:00
*** mattw4 has joined #zuul17:01
noorulMay be I should remove mysql driver?17:02
noorulfrom the config?17:02
*** panda has joined #zuul17:02
noorulDid anyone see that error before?17:04
noorulI am using docker-compose17:04
*** igordc has joined #zuul17:04
corvusnoorul: my guess is that you have a database leftover from running docker-compose on the current code but you're now running it on older code17:05
noorulI see17:05
corvusnoorul: you may need to stop all the containers, delete the volume used by the mariadb container, then restart17:06
*** mgoddard has quit IRC17:07
noorulcorvus: I am not sure where the mariadb volume is located as I don't see any volumes entry in mariadb section17:08
*** igordc has quit IRC17:08
*** mgoddard has joined #zuul17:11
*** rlandy|rover|brb is now known as rlandy|rover17:11
noorulcorvus: https://opendev.org/zuul/zuul/src/branch/master/doc/source/admin/examples/docker-compose.yaml#L3417:12
openstackgerritMerged zuul/zuul master: web: logfile highlight selected line  https://review.opendev.org/67684917:13
ofososYou can run without the mysql driver, but with reduced functionality (no historic build info).17:15
noorulI would like to understand what is going on17:16
corvusnoorul: yeah, it's a volume that's specified in the container image itself, so it doesn't show up in docker-compose.  you can see all tho volumes with docker volume list17:16
openstackgerritMerged zuul/zuul-jobs master: Remove sphinx_output_dir  https://review.opendev.org/67694517:17
corvusah17:17
noorulThere are too many, not sure which one belongs to mariadb17:17
corvusnoorul: "docker inspect examples_mysql_1" should tell you which volume17:17
corvusnoorul: look under the "Mounts" section17:18
corvusnoorul: another easy way to fix this might be to just delete the mysql container and let docker-compose recreate it17:18
corvusso just: docker rm examples_mysql_117:19
*** mattw4 has quit IRC17:20
*** mgoddard has quit IRC17:22
*** mgoddard has joined #zuul17:23
noorulcorvus: Thank you17:25
noorulcorvus: I pruned using docker system prune17:25
noorulofosos: http://paste.openstack.org/show/757978/17:25
noorulofosos: That scheduler log now17:25
*** igordc has joined #zuul17:26
corvus++17:27
ofososThe last three lines seem alien to me, but maybe they're benign17:27
ofososnoorul: Have another look, the bitbucket driver takes a minute to do something.17:28
noorulofosos: There was a typo. Fixed it17:28
ofososnoorul: can you also paste the executor log?17:28
noorulhttp://paste.openstack.org/show/757979/17:28
corvusthe keypair stuff happens on first boot17:29
ofososGive it a minute and post again.17:29
ofososAlso, I would be interested in the executor log.17:29
noorulexecutor log: http://paste.openstack.org/show/757980/17:29
ofososBut that now look good, the driver has initialized.17:29
noorulok, then what next?17:30
ofososWait, post the scheduler log again. The watcher thread should have run by now.17:31
ofososWhat's in the `zuul-config` repo?17:31
noorulNo change17:32
noorulzuul-config has nothing17:32
noorulIt is empty now17:32
noorulShall I add the example pipeline from bitbucket doc?17:32
ofososYes please17:33
ofososJust create it on master, we need that in the repo initially and then we need to do `zuul-scheduler full-reconfigure` to make it pick up the changes (no pipelines yet).17:33
ofosos(Issue the command in the scheduler container)17:34
openstackgerritJames E. Blair proposed zuul/zuul master: WIP: super hacky demo of logfile under the manifest  https://review.opendev.org/67684317:37
noorulofosos: Did you mean not to add pipeline.yaml now?17:37
ofososYes17:40
noorulok, I just pushed README.md17:40
noorulto master branch17:41
noorulShall I run zuul-scheduler full-reconfigure17:41
ofososTry it17:41
ofososThere may still lurk some error, since there was no car job in your log17:42
ofososCat17:42
ofososNot car17:42
noorulhttp://paste.openstack.org/17:42
noorulhttp://paste.openstack.org/show/758063/17:43
ofososCan you post the executor logs?17:45
ofososIt's now waiting for the cat job, so we need to check that that ran correctly17:46
*** igordc has quit IRC17:49
noorulLooks like ssh key issue http://paste.openstack.org/show/758134/17:49
*** igordc has joined #zuul17:50
*** mattw4 has joined #zuul17:50
ofososThe logs are a bit weird, because the bitbucket watcher is not running17:50
ofososYes, please put the Zuul ssh key into the repository under repo config / access keys17:51
ofososThat might do it. When I bork the ssh key (which happens fairly often), we usually run into this error. After I fix the ssh key it's ok.17:54
noorulActually executor is not having ssh private key under /root/.ssh17:55
noorulssh key mounting is not working17:58
*** armstrongs has joined #zuul18:11
noorulofosos: I have new error18:12
noorulofosos: http://paste.openstack.org/show/758136/18:12
noorulcorvus: ^^18:13
noorulcorvus: Any idea?18:13
ofososCan you try using a name for the ssh server?18:15
pabelangertobiash: corvus: mordred: clarkb: I think http://paste.openstack.org/show/758138/ is going to be the next issue to solve with github, we are seeing a high rate of MERGER_FAILURE, from what looks like maybe the event getting to zuul faster then the git refs being updated on github side18:15
noorulofosos: where?18:16
ofososIn the Zuul config, in the driver18:18
noorulYou mean instead of IP are you asking to give FQDN?18:19
ofososYes18:19
pabelangerhttps://dashboard.zuul.ansible.com/t/ansible/buildsets?result=MERGER_FAILURE is example18:20
pabelangerbut hard to say time frame18:20
*** armstrongs has quit IRC18:20
noorulofosos: Are you using sshkey property?18:21
ofososNo18:22
*** chkumar|ruck is now known as raukadah18:23
clarkbpabelanger: pull request came in, zuul gets event for pull, then tries to fetch the head of the pull ref and fails because it isn't there yet?18:24
openstackgerritMerged zuul/zuul master: web: logfile do not scroll into view more than once  https://review.opendev.org/67692418:24
ofososBut we're using a named Bitbucket server.18:24
ofososI think the error indicates that paramiko will not put the ssh host id into it's known hosts file.18:24
ofososWhat you can do is try to ssh to port 7999 from the executor and have the ssh command put it there.18:25
ofososNo, I'm wrong. Please try sshing to this IP and port. I think read timeout indicates some other problem.18:26
openstackgerritTristan Cacqueray proposed zuul/zuul master: gerrit: ensure patchset numbre is a number  https://review.opendev.org/67700718:29
ofososI think if the key was wrong we'd get a different error. Maybe there's something wrong with container networking. Can you ssh from the zuul executor container to the Bitbucket server?18:29
clarkbpabelanger: I wonder if there are other events that signify the ref is available (like gerrit's replication completed events)18:29
noorulofosos: Cloning from ssh://10.29.12.160:7990/demo/zuul-config.git18:29
clarkbpabelanger: otherwise we may just have to do a stronger backoff with our retries18:29
ofososYes18:30
noorulofosos: Shouldn't this be ssh://git@10.29.12.160:7999/demo/zuul-config.git18:30
pabelangerclarkb: yah, I think that is right, not there18:30
pabelangerI need to check if other events are there18:30
pabelangerclarkb: or we have another merger try18:31
ofososYou're right. That's the error, enter ssh://git@10.... into the driver config.18:31
ofososAnd restart18:31
ofososOk, I'm switching places. Responses will now a bit more sluggish.18:31
openstackgerritMerged zuul/zuul master: web: logfile scroll into view a bit more  https://review.opendev.org/67692918:43
noorulofosos: now scheduler log says bad status line http://paste.openstack.org/show/758141/18:47
noorulDid anyone see ^^ this before ?18:51
openstackgerritMerged zuul/zuul master: web: logfile do not use anchor for line selection  https://review.opendev.org/67692819:00
noorulofosos: hi19:01
noorulofosos: leaving now19:02
noorulofosos: When can we meet again?19:02
noorulofosos: let me know19:02
*** noorul has quit IRC19:02
SpamapScorvus:ACK I'll peek now19:23
SpamapSAh I see that's already been handled. :)19:23
corvusSpamapS: yep, tldr all your wishes should have been granted :)19:26
SpamapSIndeed. Is that going to be in a 3.10.2 eventually?19:30
SpamapSI was starting to upgrade to 3.10.1 last night but figured I might want to wait until fix-zuul-logs is .. well.. fixed. ;)19:31
jeliu_mordred: Hey Monty, are you familiar with setting up percona db cluster using percona operator? I'm trying to do it manually first on minikube (https://www.percona.com/doc/kubernetes-operator-for-pxc/minikube.html) and then incorporate it into the zuul-operator but I was having some issues because the cluster states were "running" but not "ready" (https://docs.google.com/document/d/1rveUpciVirBrK6hqeXiH61JVgZuMDpDw108uZrf6bt0/edit?usp=sharing)19:33
pabelanger+1 3.10.2, I confirmed we also had deep link issue, but only after trying to reproduce the issue19:33
clarkbjeliu_: there was stuff for that in our k8s gitea deployment19:39
clarkblet me see if I can find links19:39
clarkbjeliu_: https://opendev.org/opendev/system-config/src/branch/master/kubernetes/percona-xtradb-cluster19:40
corvusclarkb: i don't think we used the operator, did we?19:40
corvusjeliu_: you might want to paste that log into http://paste.openstack.org/ so folks can read it without logging in19:41
clarkboh no it may have all veen ansible19:41
corvusclarkb: yeah, i think maybe it was converted from helm charts or something?19:41
corvusstill, there could be a clue there in to what's going wrong, it's just not a 1:119:42
corvusie, you could use that as a reference for "here is one way to set up a pxc cluster, what is the pxc operator doing differently?"19:42
corvus(pxc == percona xtradb cluster)19:42
clarkbya seems to just be a statefulset of 3 pxc containers19:43
clarkband ansible around that19:43
*** hwangbo has quit IRC20:06
openstackgerritMerged zuul/zuul master: web: logfile support multi line selection through shift-click  https://review.opendev.org/67693720:10
jeliu_clarkb, corvus: thanks, and will the pxc-playbook run successfully on my local computer?20:10
SpamapSrandom question: how's the Zuul Kubernetes Operator effort coming along?20:22
openstackgerritMerged zuul/zuul master: web: logfile remember if initial scroll was performed  https://review.opendev.org/67695420:24
corvusSpamapS: jeliu_ is trying to get the pxc operator to work but it's failing; if you have a second to take a look at his error, that could help.20:32
corvusjeliu_: it would really be great if you pasted your error into the public pastebin istead of google docs :)20:33
*** mattw4 has quit IRC20:34
corvusjeliu_: yeah, i think it should run with a local k8s20:35
*** mattw4 has joined #zuul20:35
jeliu_Logs for trying to Install the Percona Operator and Create a Cluster Resource: http://paste.openstack.org/show/758146/20:39
jeliu_^not the prettiest logs to look at20:40
corvuserrors rarely are :)20:42
corvusjeliu_: that looks great, thanks20:43
corvusjeliu_: something just occurred to me -- it may also be helpful to get the logs for the cluster member that *did* start20:43
corvusit still may have information which could indicate why the others didn't20:43
*** yolanda__ has joined #zuul20:45
*** yolanda has quit IRC20:46
*** rfolco has quit IRC20:48
openstackgerritMerged zuul/zuul master: Collect more information from quickstart failures  https://review.opendev.org/67697620:50
openstackgerritTristan Cacqueray proposed zuul/zuul master: web: extract pure functions from the TaskOutput component  https://review.opendev.org/67546020:50
*** mattw4 has quit IRC20:53
*** mattw4 has joined #zuul20:53
*** pcaruana has quit IRC20:57
openstackgerritTristan Cacqueray proposed zuul/zuul master: web: console mark syntax error task as failed  https://review.opendev.org/67703021:00
*** igordc has quit IRC21:01
*** igordc has joined #zuul21:01
tristanCcorvus: please have a look at 677030, the console is showing syntax error as OK. I tried to added a test but i think the change is missing legitimate success result21:01
tristanCcorvus: it's from: https://softwarefactory-project.io/zuul/t/local/build/8b1f38432ba747379c3ecb8e5035d81a/console21:02
corvustristanC: mordred is working on the solution to that21:02
corvustristanC: https://review.opendev.org/67672321:02
tristanCalright, that's good to know21:03
corvusturns out we're missing important info in the json file :)21:03
tristanCcorvus: would it be possible to land the js tests soon ( https://review.opendev.org/675460 )21:03
corvustristanC: i'll look soon, running to a meeting now21:04
*** armstrongs has joined #zuul21:10
armstrongshey im seeing a situation where i have pushed the latest version of a container to a docker registry and nodepool is scheduling an older version of the same tag. Does nodepool cache the images and how do i refresh this?21:12
clarkbarmstrongs: I want to say k8s and/or docker behave like git in this case?21:14
clarkblocal versions arent moved unless explicitly told to do so21:14
clarkbhttps://github.com/kubernetes/kubernetes/issues/3366421:15
armstrongsok so not nodepool thanks :)21:16
tristanCarmstrongs: the nodepool kubernetes provides uses "IfNotPresent" by default for the image-pull attribute21:27
*** mgoddard has quit IRC21:29
openstackgerritTristan Cacqueray proposed zuul/nodepool master: kubernetes: add missing image-pull documentation  https://review.opendev.org/67703621:29
*** mgoddard has joined #zuul21:32
*** armstrongs has quit IRC21:36
corvustristanC: i think we can merge 677030 as a workaround until mordred finishes the work on the callback -- but can you use that to look at a bunch of logfiles and make sure it still does the right thing?  i know there are some test cases, but since it's changing the default return value, i'm not sure if they're enough coverage.21:40
*** rlandy|rover has quit IRC21:41
*** spsurya has quit IRC22:23
*** jeliu_ has quit IRC22:45
*** igordc has quit IRC22:46
clarkbcorvus: tristanC I left a question on 67703023:28

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!