Thursday, 2019-04-04

openstackgerritIan Wienand proposed openstack/diskimage-builder master: [WIP] add buster tests  https://review.openstack.org/64949702:10
*** hwoarang has quit IRC03:01
*** hwoarang has joined #openstack-dib03:06
openstackgerritIan Wienand proposed openstack/diskimage-builder master: debian-minimal buster support  https://review.openstack.org/64949605:10
openstackgerritIan Wienand proposed openstack/diskimage-builder master: [WIP] add buster tests  https://review.openstack.org/64949705:10
*** hwoarang has quit IRC06:39
*** hwoarang has joined #openstack-dib06:41
cgoncalvesianw, hi! do you happen to be still around?09:38
cgoncalvesianw, I proposed a work-around to https://bugzilla.redhat.com/show_bug.cgi?id=1666612 in https://review.openstack.org/#/c/643749/09:39
openstackbugzilla.redhat.com bug 1666612 in systemd "Rules "uname -p" and "systemd-detect-virt" kill the system boot time on large systems" [High,On_qa] - Assigned to jsynacek09:39
cgoncalvesapparently the work-around is not helping much, as you can see in https://review.openstack.org/#/c/643752/09:40
cgoncalvesthe centos7 job passed, twice, but barely hit the job time out. the tempest jobs still take a long time. the job is expected to run in ~1h30 like in ubuntu jobs09:41
cgoncalvesneither with or without the work-around I see the expected output from the journalctl example09:43
cgoncalvesso I am wondering why... I ran a centos7 VM with and without nested virtualization enabled09:43
*** mjturek has joined #openstack-dib13:53
*** altlogbot_2 has quit IRC13:54
*** Vorrtex has joined #openstack-dib15:58
VorrtexHey, I come to you guys with a small problem.  I've been working on building an octavia amphora on power with a centos base, and I'm seeing an error that I don't quite understand.  Any help you can provide would be *fantastic*.  https://gist.github.com/tvardema/5bb00d0d327b6a026edbddd0b5953d3d16:02
VorrtexAs someone who is almost completely unfamiliar with the process here, I'm very confused why DIB would look for a compressed image tarball as the base, and then immediately re-pack the tarball, then complain about the tarred tar being not a qcow2 image?  Which seems "accurate" to me?  I'm 90% sure I'm missing an argument somewhere that I should use.16:03
clarkbVorrtex: This is from memory but I think it takes the images and converts it to a file stream (tarball basically) so that it can write it out onto its own image file (which allows it to do partitioning and the like)16:09
clarkbI want to say this may break if the upstream image has multiple partitions itself?16:10
clarkbjohnsom: may have more insight on this as it relates to octavia16:10
johnsomThat is a pure DIB error. The image is xz compressed and DIB expects it to be a raw qcow16:11
johnsomclarkb He is setting file:///root/CentOS-7-ppc64le-GenericCloud.qcow2.xz as the file path, but DIB wants just a qcow216:12
clarkboh I see. So it needs to be uncompressed first16:12
Vorrtexclarkb the problem is I'm not telling it to look for CentOS-7-ppc64le-GenericCloud.qcow2.xz specifically, that's what DIB is looking for by default16:24
VorrtexI tried changing my base image's name to that just to see if it would work regardless, but it fails with a separate error.16:25
clarkbhttps://git.openstack.org/cgit/openstack/diskimage-builder/tree/diskimage_builder/elements/centos7/root.d/10-centos7-cloud-image is the source of the filename16:29
johnsomVorrtex Did you unxz it? it's in xz compression format. You can check by running "file <filename>"16:29
clarkblooks like the cloud images url doesn't work either (centos isn't publishing ppc images anymore?)16:29
johnsomHe is overriding the image location with a command line export16:30
clarkbI don't see that in the paste?16:30
clarkbbut ya DIB_LOCAL_IMAGE is how you should set that I think16:30
johnsomIt's the "file://" URL that points me that way16:31
johnsomPlus DIB doesn't know anything about xz compressed files as far as I know16:32
clarkbthe extract-image script seems to at least try16:33
* clarkb gets a link16:33
*** yolanda has quit IRC16:33
clarkbhttps://git.openstack.org/cgit/openstack/diskimage-builder/tree/diskimage_builder/elements/redhat-common/bin/extract-image#n5316:33
Vorrtexjohnsom I compressed it because the error message said it was looking for a .xz file.  Like I mentioned above, I also took the original image and changed its name to have ".xz" just to try it out, and it fails anyway16:34
clarkbVorrtex: where can we find the original source file?16:35
VorrtexIts on a vm that I have in redhat internal servers.16:35
VorrtexHad to build it ourselves16:35
clarkbqcow2 has had incompatible format changes in the past, it is theoretically possible the qcow2 is too new for your version of qcow2.16:35
clarkber too new for qemu-img16:36
clarkbVorrtex: are you able to boot that image or otherwise verify it is a valid qcow2 image?16:36
clarkbother debugging steps would be to run through the commands at https://git.openstack.org/cgit/openstack/diskimage-builder/tree/diskimage_builder/elements/redhat-common/bin/extract-image#n53 to do the uncompression and convert yourself just so that we can narrow down where it might be failing16:38
Vorrtexclarkb the command run to create the image was "qemu-img create -f qcow2 clouimg.qcow2 60G", followed by a virt-install using this location:  http://mirror.centos.org/altarch/7/os/ppc64le/16:39
clarkbya I would start by double checking that image is actually happy with qemu-img then run through the steps that dib is taking to see if we can get any more info out of it16:41
clarkb(assuming something doesn't pop up before hand)16:41
clarkb`qemu-img check`16:41
Vorrtex"No errors were found on the image"16:42
Vorrtexfollowed by some extra output16:42
clarkbok then run through the steps dib is taking at https://git.openstack.org/cgit/openstack/diskimage-builder/tree/diskimage_builder/elements/redhat-common/bin/extract-image#n5316:43
clarkbyou know I wonder if you are running out of disk?16:43
clarkband the tmp image is getting truncated? (just an idea I have no evidence of that)16:43
Vorrtex32GB available on the VM16:43
clarkbbut your image is 60GB16:43
Vorrtexthe image file isn't that big.16:44
VorrtexI guess it could be that bad, right?16:44
clarkbVorrtex: if you read the dib script it is converting it to raw16:44
clarkbthe raw image will be that big16:44
Vorrtexalright, then I'll have to rebuild the image smaller... johnsom how "small" can the amp image be?16:45
johnsomThat is up to your image. Centos is usually pretty fat and requires at least 3GB16:45
VorrtexI'll make it 5 GB then.  Just to see what's what.16:46
clarkbyou can probably just resize it16:46
clarkbqemu-img resize16:47
clarkb(not sure if that supports shrinking)16:47
VorrtexOh?  Hmm... I try that.16:47
clarkbsometimes filesystems get bad about that16:47
clarkband i think centos defaults to xfs now which doesn't shrink16:47
clarkbbut worth a try if it is a quick fix :)16:47
VorrtexThis seems to have worked:  qemu-img resize --shrink clouimg.qcow2 -25G16:49
VorrtexRIP, my image was 30 GB, not the one generated for the base image... had to shrink it more (bad maths).  Retrying again, just in case.17:13
Vorrtexalright, size shouldn't be a problem at all.  Still complaining about the format.17:25
VorrtexI'll run through the steps of that script now.17:25
Vorrtexclarkb I'm not sure if I got farther or not, but I stopped trying to fuss with the compressed image, and am trying to use this other environment variable:  DIB_LOCAL_IMAGE="/root/clouimg.qcow2"18:11
VorrtexImage is valid, for sure, but it failes with:18:11
Vorrtexdevice-mapper: reload ioctl on loop1p2  failed: Invalid argument18:11
Vorrtexfails*** rather18:12
clarkbcould be that the loopbacks are confused? an easyt way to clear them out is a reboot but you can manually manage with losetup iirc18:12
VorrtexI'll try a reboot real quick18:12
Vorrtexclarkb same error.18:16
clarkbIm not sure then. Are you able to paste the list of commands and output?18:27
Vorrtexlist of commands of what?18:27
VorrtexI'm not even through the script just yet (got back from lunch, sorry)18:28
VorrtexI have an environment variable set (DIB_LOCAL_IMAGE, mentioned above) and the error is :18:29
Vorrtex2019-04-04 18:28:52.719 | device-mapper: reload ioctl on loop1p2  failed: Invalid argument                             │./diskimage_builder/elements/redhat-common/bin/extract-image:            echo "Working in $WORKING"18:29
Vorrtex2019-04-04 18:28:52.724 | create/reload failed on loop1p218:29
Vorrtexlol, it grabbed my other screen... dammit18:29
Vorrtexlemme get a better copy.18:29
Vorrtexdevice-mapper: reload ioctl on loop1p2  failed: Invalid argument18:29
Vorrtexcreate/reload failed on loop1p218:29
VorrtexI'm having trouble even finding where in that script I would start on device-mapper, since that doesn't seem to be in there as failure text.18:30
VorrtexI believe I see the output from this line: https://git.openstack.org/cgit/openstack/diskimage-builder/tree/diskimage_builder/elements/redhat-common/bin/extract-image#n6718:32
Vorrtexbut can't seem to find exactly what fails next, really.18:32
VorrtexAlright, looks like I might just have to walk this more slowly than I thought.  Will report back in a little bit.18:34
clarkbok. Also ianw should be coming on line soon and may be more help18:34
Vorrtexclarkb the erring line is https://git.openstack.org/cgit/openstack/diskimage-builder/tree/diskimage_builder/elements/redhat-common/bin/extract-image#n7118:52
clarkbVorrtex: fails on the kpartx?18:55
clarkbcan you paste that and the output without the awk?18:55
VorrtexLemme see.   (I was just using the script, and commenting out bits of it.18:55
Vorrtex)18:55
Vorrtexhttps://gist.github.com/tvardema/5bb00d0d327b6a026edbddd0b5953d3d18:58
VorrtexIf it "helps" at all, it keeps incrementing my loops... first run is loop0, and now I'm on loop4.18:58
Vorrtexclarkb ^^19:05
clarkbVorrtex: does /dev/loop4p1 exist?19:08
Vorrtexnope.  How could it, nothing created it.19:10
VorrtexIt goes from "make /dev/loop#" exist, and then assumes "/dev/loop#p1" or whatever exists...19:11
clarkbwell kpartx is trying to add it if I am reading that correctly19:13
clarkband was curious if it succeeded despite the error19:13
Vorrtexsudo losetup -f isn't apparently creating "loop#p1" or whatever, it just makes a new "loop#".19:13
clarkbya then kpartx reads the partition tablr and creates those subdevices19:14
VorrtexYou mean when it does the awk portion for line 71?19:14
Vorrtexso technically line 72?  because I didn't read that right myself lol19:16
clarkbno kpartx itself19:18
VorrtexAlright, well, I'm obviously not familiar with how that's supposed to work, but its definitely *not* working as written.  Any ideas, or is this still us waiting for maybe ianw to help out?19:20
clarkbI think the next thing maybe cehcking the kernel loh for more error info19:21
Vorrtexchecking the host machine's kernel log or is there a log associated with DIB?19:24
clarkbwould be the kernel log on the machine19:25
clarkbioctl is in the kernel so hopefully it logged more i fo19:25
VorrtexHmm:  device-mapper: table: 253:8: loop4 too small for target: start=10240, len=125818880, dev_size=1048576019:26
VorrtexI don't really know what that means.19:26
clarkbpossible the shrink didnt update the partition table properly19:29
VorrtexAlright, building new image... lol thanks for the help so far, maybe a build from scratch will remove all these issues :(19:29
VorrtexI'm also going to fix that cloud image name... cuz it hurts me.19:30
Vorrtexrebuilt the image, restarted the host machine, and it seems to have gone much farther.... Looks like it failed on a repofile, though20:22
VorrtexThanks again for the help, I'll be looking into it more tomorrow or early next week.20:23
Vorrtexclarkb ^^20:23
ianwRan: 15 tests in 2499.0931 sec.20:32
ianwRan: 15 tests in 4967.0000 sec.20:32
ianwhttp://logs.openstack.org/52/643752/1/check/octavia-v2-dsvm-scenario-centos-7/56ef7da/job-output.txt.gz20:32
ianwhttp://logs.openstack.org/88/634988/6/check/octavia-v2-dsvm-scenario-ubuntu-bionic/c1db89d/job-output.txt.gz20:33
ianwdoes seem like it's fairly apples-to-apples much slower20:33
clarkbwhat is the difference between the two?20:34
johnsomianw I am out of the loop, but bionic usually runs faster than centos20:34
johnsomConfused by those results20:34
johnsomAh, they are backward, ok20:34
ianwsorry, the longer run is centos yes; this was in response to a cgoncalves question above20:35
johnsomIt's roughly double the run time on centos.  Not just tests either, even the boot time is slower20:36
ianwhrm, it would be nice if the time was in the testr_results.html20:36
ianwyeah, when we last deep dived into this we debugged the entire thing to find it was the same thing as reported @ https://bugzilla.redhat.com/show_bug.cgi?id=166661220:36
openstackbugzilla.redhat.com bug 1666612 in systemd "Rules "uname -p" and "systemd-detect-virt" kill the system boot time on large systems" [High,On_qa] - Assigned to jsynacek20:36
johnsomIt's also pretty hard to judge using the zuul gates as instance provider performance varies greatly.20:37
ianwit ended up forking uname for like every single udev rule20:37
cgoncalveshi Ian :)20:38
johnsomThe issue cgoncalves is trying to track down is why the centos based job is double the run time of the ubuntu job. It's so far out it ends up timing out the centos jobs regularly.20:38
cgoncalvesyeah. I believe I applied the workaround suggested but still it doesn't reduce the times20:39
cgoncalvesI also didn't get the journalctl messages that confirm the bug so I wonder if it's a different issue20:40
ianwyeah, fix applied at http://logs.openstack.org/52/643752/1/check/octavia-v2-dsvm-scenario-centos-7/56ef7da/controller/logs/dib-build/amphora-x64-haproxy.qcow2_log.txt.gz#_2019-03-17_19_41_05_34820:41
ianwit ran for ~40 seconds which seems consistent with rebuilding the initramfs20:41
*** Vorrtex has quit IRC20:46
*** mjturek has quit IRC21:08

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!