*** CaptTofu has joined #tripleo | 00:05 | |
openstackgerrit | Chris Krelle proposed a change to openstack/tripleo-incubator: Add Ironic bits to _undercloud https://review.openstack.org/72969 | 00:07 |
---|---|---|
*** rpodolyaka has quit IRC | 00:11 | |
*** rpodolyaka has joined #tripleo | 00:12 | |
*** tserong has joined #tripleo | 00:41 | |
*** CaptTofu has quit IRC | 00:50 | |
*** CaptTofu has joined #tripleo | 00:50 | |
*** CaptTofu has quit IRC | 00:55 | |
*** rbrady-afk is now known as rbrady | 00:56 | |
rbrady | lifeless, SpamapS: do you have any time to review my 4 cinder/nfs patches this afternoon? | 00:57 |
*** rpodolyaka has quit IRC | 01:00 | |
*** vkozhukalov has quit IRC | 01:05 | |
*** blamar has joined #tripleo | 01:08 | |
*** cwolferh has quit IRC | 01:10 | |
*** rbrady is now known as rbrady-afk | 01:16 | |
*** openstack has joined #tripleo | 01:22 | |
*** coolsvap has quit IRC | 01:44 | |
*** CaptTofu has joined #tripleo | 01:58 | |
*** rpodolyaka has joined #tripleo | 02:01 | |
*** sdake has quit IRC | 02:02 | |
*** CaptTofu has quit IRC | 02:03 | |
*** sdake_ has quit IRC | 02:03 | |
*** rpodolyaka has quit IRC | 02:05 | |
*** tzumainn has quit IRC | 02:07 | |
*** blamar has quit IRC | 02:10 | |
lifeless | rbrady-afk: I'm travelling and jprovazn's rabbit patches are my top of queue | 02:16 |
*** rbrady-afk is now known as rbrady | 02:16 | |
rbrady | lifeless: alright | 02:16 |
rbrady | lifeless: safe travels and thanks for the help | 02:17 |
lifeless | rbrady: np; will look after I get through jan's work | 02:18 |
lifeless | thanks | 02:18 |
*** rbrady is now known as rbrady-afk | 02:18 | |
*** sdake has joined #tripleo | 02:19 | |
*** sdake has quit IRC | 02:19 | |
*** sdake has joined #tripleo | 02:19 | |
*** sdake has quit IRC | 02:36 | |
*** tserong has quit IRC | 02:41 | |
*** tserong has joined #tripleo | 02:41 | |
*** tserong has joined #tripleo | 02:41 | |
*** newell has quit IRC | 02:43 | |
*** sdake has joined #tripleo | 02:47 | |
*** mestery has quit IRC | 03:16 | |
*** ramishra has joined #tripleo | 03:39 | |
*** CaptTofu has joined #tripleo | 03:46 | |
*** CaptTofu has quit IRC | 03:51 | |
killer_prince | I have a doubt. when image is generated using dib, i see that the kernel and ramdisks are updated in the image. and old copies of kernel and ramdisk is left on the image. is there a reason why its not cleaned up? | 04:00 |
killer_prince | is it a good idea to cleanup the image by deleting old kernel and ramdisk when generating image using dib.. | 04:12 |
killer_prince | or is there an element already to do this cleanup..? | 04:16 |
jammyjam | killer_prince: it sounds like you think that leaving the old copies of the kernel around is going to be a problem | 04:28 |
jammyjam | normally the first reason it would bother me is wasting disk space - nothing worse than a box with a full /boot that can't do anything any more. Second reason it would bother me is doubt over whether the kernel that's coming up next time is the same as what's running now | 04:29 |
jammyjam | I don't think either of those are going to be a problem in this case? | 04:29 |
jammyjam | so that just leaves tidiness, right? | 04:29 |
jammyjam | I think tidiness alone is sufficient reason to want to only have current kernel/ramdisk on the image. There's no need to have a "last known good" image that I can think of, not when we're building a master that will be tested before being deployed | 04:30 |
killer_prince | jammyjam: I am writing an element that will install bootloader on disk when instance is cretaed. so the way grub installer works is that it picks up the kernel and ramdisk with latest time stamps as default boot entry. and this results in some cases where instead of picking up generic kernel, it picks up virtual kernel. | 04:32 |
killer_prince | jammyjam: and there is where my problem starts. when i boot such image on baremetal, it will not detect all devices and even result in kernel panic.. | 04:33 |
killer_prince | so wanted to check if there is a reason to keep those kernels on the disk.. | 04:34 |
jammyjam | This is the project to enable the nodes to boot from bare metal if pxe-boot fails? | 04:34 |
killer_prince | yes.. | 04:34 |
jammyjam | It sounds to me as though there's two possible solutions here - either make sure that your intended kernel/ramdisk are the only ones around, or start tinkering with the grub config to be more explicit about what it should pick up | 04:35 |
killer_prince | yup.. i want to keep it simple and avoid future maintainance if grub config format changes.. so was looking for the former solution.. | 04:37 |
killer_prince | jammyjam: what is your opinion on this.. | 04:39 |
jammyjam | my considered opinion, after thinking about it for 20 minutes? | 04:40 |
killer_prince | jammyjam: take your time.. | 04:41 |
jammyjam | my understanding is that the baremetal boot-off-disk is not intended for regular production use, it's only there as a fallback in case pxeboot wasn't available right then - it's only intended to get the box back up so that it's contactable and we can figure out what's wrong with it and reboot once pxe is available | 04:42 |
jammyjam | is that right? | 04:42 |
jammyjam | If so, you probably (A) don't want it to boot up from a virtual kernel, and (B) don't want your fallback system to fall back to an older kernel. Cleaning them up sounds like the right thing to do to me. | 04:44 |
killer_prince | partially correct. however i have seen customers who would like to use PXE only for deployment and not for production use.. they simply insist that they want the instances to boot from the disk rather than pxe.. (not sure why).. | 04:44 |
*** ramishra has quit IRC | 04:44 | |
jammyjam | I don't know if there's an existing element though | 04:44 |
*** weshay has quit IRC | 04:45 | |
jammyjam | I can think of a few possibilities - maybe they think booting from disk will be faster and more robust? Maybe the machines get moved to another part of the network after boot and can't reach the PXE server any more? | 04:45 |
killer_prince | may be.. so my next doubt would be shall i write a seperate element to do this cleanup or shall i include this in my element itself.. | 04:45 |
jammyjam | Maybe they have security auditors who are worried that doing PXE boot every time means you can't be sure what code is running on the machine? | 04:46 |
jammyjam | That's a question I don't have an answer for :) Either way we can move it later.. | 04:46 |
killer_prince | they had some security concerns and some scalability concerns.. | 04:46 |
killer_prince | okay... then i guess i iwll include them in my elemnt for now, if needed we can rework later to a seperate element.. | 04:47 |
*** jammyjam is now known as tchaypo | 05:18 | |
*** noslzzp has quit IRC | 05:33 | |
*** CaptTofu has joined #tripleo | 05:35 | |
*** CaptTofu has quit IRC | 05:39 | |
*** sdake_ has joined #tripleo | 06:01 | |
*** sdake_ has quit IRC | 06:01 | |
*** sdake_ has joined #tripleo | 06:01 | |
*** tzumainn has joined #tripleo | 06:02 | |
* SpamapS finally returns bleh | 06:31 | |
SpamapS | I think there are plenty of valid reasons to want to boot from disk later on. It just increases the testing burden to test both scenarios. | 06:36 |
SpamapS | killer_prince: ^^ | 06:36 |
killer_prince | SpamapS: you mean testing the element or testing instances.. | 06:38 |
SpamapS | killer_prince: I mean testing the image. | 06:38 |
killer_prince | SpamapS: well my approach is that when this element is used, it will continue to boot from PXE. but it will fall back to localdisk only when pxe fails.. | 06:40 |
killer_prince | SpamapS: and so for testing all we need to do is remove pxe config from tftp and it should come up from localdisk. | 06:41 |
killer_prince | SpamapS: and the boot loader will get installed upon first pxe boot of instance.. | 06:41 |
SpamapS | killer_prince: "it will fall back" is a bios problem.. not a user space image controllable problem.. AFAIK | 06:42 |
SpamapS | killer_prince: which is one reason why it is harder to test | 06:42 |
killer_prince | SpamapS: well thats true. but once it falls back, disk should have bootloader configured properly so that it can continue booting from localdisk. | 06:47 |
killer_prince | SpamapS: i agree, its hard to test. but that should not be the reason not to have it there.. | 06:47 |
killer_prince | my opinion.. | 06:47 |
SpamapS | killer_prince: it is not a reason to not do it. It is a reason to require more resources to get it done. ;) | 06:49 |
killer_prince | SpamapS: aha.. that way | 06:51 |
*** ramishra has joined #tripleo | 06:57 | |
*** akuznetsov has quit IRC | 07:19 | |
*** CaptTofu has joined #tripleo | 07:23 | |
*** CaptTofu has quit IRC | 07:28 | |
*** GheRivero has quit IRC | 07:34 | |
*** jtomasek has quit IRC | 07:46 | |
*** killer_prince is now known as lazy_prince | 07:55 | |
*** GheRivero has joined #tripleo | 08:35 | |
*** nijaba_ has joined #tripleo | 08:49 | |
*** jeblair_ has joined #tripleo | 08:52 | |
*** Slower_ has joined #tripleo | 08:52 | |
*** dtantsur_ has joined #tripleo | 08:54 | |
*** jeblair has quit IRC | 08:56 | |
*** jrist has quit IRC | 08:56 | |
*** Slower has quit IRC | 08:56 | |
*** dtantsur has quit IRC | 08:56 | |
*** nijaba has quit IRC | 08:56 | |
*** greghaynes has quit IRC | 08:56 | |
*** dtantsur_ is now known as dtantsur | 08:56 | |
*** ramishra has quit IRC | 08:59 | |
*** jrist has joined #tripleo | 09:01 | |
*** greghaynes has joined #tripleo | 09:02 | |
*** CaptTofu has joined #tripleo | 09:11 | |
*** rcarrillocruz1 has joined #tripleo | 09:11 | |
*** rcarrillocruz has quit IRC | 09:14 | |
*** rpodolyaka has joined #tripleo | 09:14 | |
*** rcarrillocruz has joined #tripleo | 09:15 | |
*** CaptTofu has quit IRC | 09:16 | |
*** ramishra has joined #tripleo | 09:16 | |
*** rcarrillocruz1 has quit IRC | 09:17 | |
*** vkozhukalov has joined #tripleo | 09:29 | |
*** ramishra has quit IRC | 09:34 | |
*** ramishra has joined #tripleo | 09:49 | |
*** rcarrillocruz1 has joined #tripleo | 09:52 | |
*** rpodolyaka has quit IRC | 09:53 | |
*** rcarrillocruz has quit IRC | 09:55 | |
*** vkozhukalov has quit IRC | 10:00 | |
*** ramishra has quit IRC | 10:11 | |
*** CaptTofu has joined #tripleo | 10:51 | |
*** akuznetsov has joined #tripleo | 11:07 | |
*** akuznetsov has quit IRC | 11:15 | |
*** martyntaylor has joined #tripleo | 11:18 | |
*** d0ugal has joined #tripleo | 11:18 | |
*** d0ugal has joined #tripleo | 11:18 | |
*** martyntaylor has left #tripleo | 11:32 | |
*** rcarrillocruz1 has quit IRC | 12:12 | |
*** rcarrillocruz1 has joined #tripleo | 12:15 | |
*** CaptTofu has quit IRC | 12:29 | |
*** killer_p- has joined #tripleo | 14:46 | |
*** killer_p- is now known as killer_prince | 14:46 | |
*** lazy_prince has quit IRC | 14:46 | |
*** tzumainn has quit IRC | 15:00 | |
*** jeblair_ is now known as corvus | 15:40 | |
*** ramishra has joined #tripleo | 16:22 | |
*** ramishra has quit IRC | 16:26 | |
*** bauzas has joined #tripleo | 17:14 | |
lifeless | morning | 17:15 |
*** jang1 has joined #tripleo | 17:28 | |
*** akuznetsov has joined #tripleo | 17:40 | |
*** petertoft has joined #tripleo | 17:47 | |
mordred | morning lifeless | 18:00 |
mordred | lifeless: how was the flight back? | 18:00 |
lifeless | mordred: it was ok; I managed to finangle an upgrade (200 miles spent as a bid; elite inflates that by 50% = win | 18:04 |
mordred | lifeless: woot | 18:06 |
* lifeless disables the CRDA udev rule | 18:07 | |
lifeless | hmm, not that one | 18:09 |
lifeless | more fiddling needed | 18:09 |
lifeless | but I am getting closer to tracking this EPIC FAIL down | 18:09 |
lifeless | AP roams | 18:11 |
lifeless | regulatory domain gets updated to world | 18:11 |
lifeless | then back to nz | 18:11 |
lifeless | and the card goes 'fuckit, I'm confused' | 18:11 |
lifeless | because the channel that was in use gets blocked by the temporary world domain | 18:12 |
lifeless | *I* think | 18:12 |
lifeless | and the card drops off the ssid | 18:12 |
lifeless | I'm happy because to have a working theory now, since this has been pissing me off for years :) | 18:14 |
lifeless | bah, english, do I speak it? | 18:15 |
lifeless | or possibly, its merely changing crda that glitches things. more investigation needed. | 18:19 |
lifeless | ok, ttg | 18:29 |
*** jang1 has quit IRC | 18:33 | |
*** akuznets_ has joined #tripleo | 19:53 | |
*** akuznetsov has quit IRC | 19:55 | |
SpamapS | lifeless: regarding the mv and space allocation, I believe the problem is that the loopdev/ext4 is allocating space faster than the mv is freeing it, which is why we haven't seen the out of space until we got to the point where the tmpfs was already 75%+ full before we started the mv. | 20:12 |
*** hashar has joined #tripleo | 20:18 | |
SpamapS | lifeless: is there any reason we build and upload the 'user' image in devtest instead of just uploading saucy? | 20:19 |
*** akuznets_ has quit IRC | 21:10 | |
*** hashar has quit IRC | 21:10 | |
*** akuznetsov has joined #tripleo | 21:17 | |
*** hashar has joined #tripleo | 21:18 | |
*** andreaf has joined #tripleo | 21:23 | |
*** CaptTofu has joined #tripleo | 21:30 | |
*** andreaf has quit IRC | 21:31 | |
*** petertoft has quit IRC | 21:43 | |
*** steels has joined #tripleo | 21:49 | |
SpamapS | we should have the "waiting for the overcloud stack to be ready" tail os-collect-config logs on all the nodes | 21:55 |
SpamapS | or at least spit out the heat event list | 21:55 |
* SpamapS goes to do saturday things | 21:55 | |
*** hashar has quit IRC | 21:56 | |
*** CaptTofu has quit IRC | 22:21 | |
*** CaptTofu has joined #tripleo | 22:22 | |
*** d0ugal has quit IRC | 22:23 | |
*** CaptTofu has quit IRC | 22:26 | |
lifeless | SpamapS: we could upload cirros for that matter | 22:35 |
lifeless | SpamapS: logstash :) | 22:35 |
mordred | lifeless: ++ | 23:20 |
mordred | lifeless: poke jesusaurus about that - he's on the hook on the hp cloud side for all things logstash | 23:21 |
*** CaptTofu has joined #tripleo | 23:23 | |
jesusaurus | <_< | 23:24 |
jesusaurus | >_> | 23:25 |
*** CaptTofu has quit IRC | 23:28 | |
* mordred throws bunnyrabbit at jesusaurus | 23:32 | |
SpamapS | 2014-03-08 22:06:28.974 | Build timed out (after 145 minutes). Marking the build as failed. | 23:41 |
SpamapS | things were working otherwise | 23:41 |
SpamapS | hm we need the same thing as seed_logs.tgz but for overcloud | 23:42 |
mordred | SpamapS: BUILD FASTER | 23:42 |
SpamapS | much vrooom | 23:43 |
SpamapS | mordred: local pypi/apt cache will help a bit | 23:43 |
SpamapS | mordred: and wheels | 23:43 |
mordred | word | 23:43 |
SpamapS | and cloud image for that matter | 23:43 |
SpamapS | mordred: https://etherpad.openstack.org/p/tripleo-parallel-test FYI | 23:44 |
SpamapS | but anyway.. | 23:44 |
SpamapS | I think so we can start approving changes we should just allow 20 more minutes | 23:44 |
mordred | SpamapS: I don't believe in testing things parallel | 23:44 |
SpamapS | mordred: it should be tripleo-tests-are-zomg-slow | 23:45 |
SpamapS | its just a bitch fest list of all the things we should be doing :) | 23:45 |
*** bauzas has quit IRC | 23:46 | |
SpamapS | anyway... | 23:46 |
SpamapS | much weekend | 23:46 |
SpamapS | so sun | 23:46 |
SpamapS | very outside | 23:46 |
*** rcarrillocruz has joined #tripleo | 23:48 | |
*** rcarrillocruz1 has quit IRC | 23:49 |
Generated by irclog2html.py 2.14.0 by Marius Gedminas - find it at mg.pov.lt!