Tuesday, 2018-09-25

*** zigo has quit IRC00:05
*** Bhujay has joined #openstack-sahara02:29
*** dave-mccowan has quit IRC02:29
*** Bhujay has quit IRC02:30
*** Bhujay has joined #openstack-sahara02:30
*** dave-mccowan has joined #openstack-sahara02:49
*** Bhujay has quit IRC03:41
*** rcernin has quit IRC04:24
*** Bhujay has joined #openstack-sahara04:37
*** rcernin has joined #openstack-sahara04:38
*** rcernin_ has joined #openstack-sahara05:17
*** rcernin has quit IRC05:19
*** pcaruana has joined #openstack-sahara05:41
*** Bhujay has quit IRC06:00
*** Bhujay has joined #openstack-sahara06:14
*** Bhujay has quit IRC06:15
*** Bhujay has joined #openstack-sahara06:16
*** Bhujay has quit IRC06:32
*** Bhujay has joined #openstack-sahara06:33
*** rcernin_ has quit IRC07:05
*** tosky has joined #openstack-sahara07:54
openstackgerritLuigi Toscano proposed openstack/sahara master: Force the format of ssh key to PEM, at least for now  https://review.openstack.org/60502810:00
*** Bhujay has quit IRC10:31
*** Bhujay has joined #openstack-sahara10:32
*** rcernin_ has joined #openstack-sahara10:43
*** rcernin_ has quit IRC10:52
*** pcaruana has quit IRC11:15
*** Bhujay has quit IRC12:31
*** Bhujay has joined #openstack-sahara12:32
*** Bhujay has quit IRC12:33
*** Bhujay has joined #openstack-sahara12:33
openstackgerritLuigi Toscano proposed openstack/sahara master: Force the format of ssh key to PEM, at least for now  https://review.openstack.org/60502812:37
openstackgerritbhujay kumar proposed openstack/sahara master: adds unit test for ssh_remote.replace_remote_line  https://review.openstack.org/60508514:13
tosky\o/14:22
Bhujayhope the approach is right14:26
*** Bhujay has quit IRC15:03
*** dave-mccowan has quit IRC15:14
*** dave-mccowan has joined #openstack-sahara15:38
openstackgerritweizj proposed openstack/openstack-ansible-os_sahara master: Update messaging notification configuration  https://review.openstack.org/60511115:40
*** wlmbasson has quit IRC15:52
openstackgerritweizj proposed openstack/openstack-ansible-os_sahara master: Update messaging notification configuration  https://review.openstack.org/60511115:58
*** pcaruana has joined #openstack-sahara16:07
*** dave-mccowan has quit IRC16:13
toskytellesnobrega (and jeremyfreudberg, and any other core :) https://review.openstack.org/#/c/605028/ fixes an issue with openssl 1.1, which blocks the newer distributions (like Fedora 28)16:27
toskyI think I explained why I went for that solution instead of trying to use the new format (tl;dr paramiko does not support the new format)16:27
toskybut please try to keep an eye on it16:27
tellesnobregatosky, saw it earlier, was waiting on tests16:29
toskyoki :)16:31
toskyI wrote it down so the other lurking people know about it16:31
tellesnobregamakes sense16:46
tellesnobregalooks good16:46
tellesnobregatosky, are you still around?18:22
*** guimaluf has joined #openstack-sahara18:59
guimalufhi folks, I'm using sahara on pike version and I just can't delete any cluster. Try to find something on web but could not find nothing. Any path to follow?19:00
tellesnobregaguimaluf, we have a bug about cluster deletion19:01
tellesnobregathis is due to some bad communication with other services19:01
tellesnobregacommon approach now is to force delete heat stack19:01
guimaluftellesnobrega, anything I can do about it or contribute to fix?19:02
tellesnobregaand than trying to delete to delete the cluster again19:02
guimalufbut there is no heat stack created19:02
guimalufall clusters triggered Error on creation19:02
tellesnobregaguimaluf, we have fixes for newer versions19:02
tellesnobreganot sure we backported to pike19:02
tellesnobregaI see19:02
guimaluftellesnobrega, thank you for the info! :)19:03
tellesnobregaon that case, since there is no stack created is the worst case19:03
guimaluftellesnobrega, I've tried to remove direct on DB, but there is a foreignkey restriction19:04
tellesnobregayou would have to delete it directly on the DB19:04
tellesnobregayou can remove that reference as well19:04
tellesnobregait won't break anything19:04
guimalufthank you :)19:04
guimalufthere is no deleted field on clusters table... so I should delete from clusters; right?19:05
tellesnobregayes19:06
tellesnobregaand this will be reject due to foreign key19:06
tellesnobregabut it shows the issue and you can delete that as well19:06
tellesnobregaand than come back to delete from clusters19:06
guimaluftellesnobrega, I'm not fluent on database management ehhehe the best practice here would be `drop from node_groups; drop from clusters;` ?19:08
tellesnobregatry deleting from clusters first19:08
tellesnobregaand print show me the error message19:08
toskytellesnobrega: yep?19:08
guimalufERROR 1451 (23000): Cannot delete or update a parent row: a foreign key constraint fails (`sahara`.`node_groups`, CONSTRAINT `node_groups_ibfk_1` FOREIGN KEY (`cluster_id`) REFERENCES `clusters` (`id`))19:09
tellesnobregayou should delete from node groups and than clusters19:10
tellesnobregatosky, about the tests on sahara side19:10
guimalufI have no working cluster :) so I'll cleanup everything19:10
tellesnobregathat works19:11
tellesnobregaI'm down to 5 failing tests19:12
tellesnobregaand those 5 are fixed with the same fix19:12
guimaluftellesnobrega, thank you :)19:12
tellesnobregabut I'm not sure how to do it19:12
tellesnobregaguimaluf, no problem :)19:13
tellesnobregaanytime19:13
guimalufit works... I hope next time should be more complicated, since I want a working cluster xD19:13
tellesnobregaguimaluf, what happened that the cluster didn't start?19:13
guimalufI was getting a random KeyError19:14
toskyguimaluf: that exact KeyError and the error message would help a lot in finding out if the issue is still there19:15
toskyis it the last version of the stable/pike branch, or some specific tag?19:15
guimaluftosky, I'm pasting on paste.openstack :)19:15
guimaluftosky, stable/pike19:15
guimalufhttp://paste.openstack.org/show/730742/19:16
guimalufI don't know if this is image related or if is my setup19:16
toskyprobably configuration-related19:22
toskyare you sure you have the tip of stable/pike?19:22
toskybecause that error reminds me of this fix: https://review.openstack.org/#/c/502101/19:23
toskywhich is not in 7.0.0 (the first release of pike), but it's in 7.0.119:23
tosky(and newer releases)19:23
tellesnobregatosky, it does looks familiar to that19:24
toskyhttps://storyboard.openstack.org/#!/story/171568119:24
guimaluftosky, 1:7.0.0-0ubuntu119:25
guimalufsahara is already the newest version (1:7.0.0-0ubuntu1~cloud0).19:26
toskyguimaluf: that's definitely not the last version; please check if the repository that you used provides a new er version19:26
guimaluftosky, I'm using ubuntu cloud archive19:26
toskyit's the newest *packaged* version in that repository19:26
guimalufyeah, :/19:26
toskyI suggest to ping the maintainer of that archive to update their packages19:26
toskywe do release newer versions for a reason...19:26
guimaluf"<tosky> we do release newer versions for a reason..." that's the point! hahahah19:27
toskyyou can manually apply that patch to your system, of course, at least you should be able to proceed19:28
toskyor change the configuration to add default values to those configuration keys (so that you don't need to patch the code)19:28
tellesnobregatosky, https://github.com/tellesnobrega/sahara/tree/split-plugins19:29
tellesnobregaif you have the time can you take a look at this and see the failing tests19:29
guimaluftosky, thank you for the hint! I'll apply the patch! :)19:30
Gaasmannhave you heard of an issue where sahara keeps nc processes? I had a controller with 500 processes like that19:59
Gaasmannsudo sahara-rootwrap /etc/sahara/rootwrap.conf ip netns exec qdhcp-7e5c352e-7e71-412c-bb44-b2b846fe8f95 nc 10.42.0.5 2219:59
Gaasmanneven if the related clusters and instances were deleted19:59
* tosky points Gaasmann to tellesnobrega and tries to hide20:00
Gaasmann:-)20:00
tellesnobregaGaasmann, looks like you have netns configured for sahara20:01
GaasmannI think so yes20:01
tellesnobregaI'm not sure why they are kept, but those processes created for each instance or proxy instance that sahara needs to communicate20:02
tellesnobregaGaasmann, we can take a look to try and make a better management of these processes20:04
Gaasmannmakes sense as sahara connects to instances with ssh. But I don't know why they remain. I had exactly 500 of them, like there is a maximum or something20:05
tellesnobregacould be system20:06
tellesnobregaI will have to check20:06
Gaasmannok thanks20:07
tellesnobregano problem, thanks for bringing it up20:07
Gaasmannbtw, for ambari, should I try to build centos or centos7 images? I can't get a running cluster with centos7 :-(20:08
tellesnobregaqueens?20:08
Gaasmannpike :-/20:08
tellesnobregalet me check20:09
tellesnobregacentos7 would be the best20:09
tellesnobregawhat issue are you seeing?20:09
Gaasmannfor for HDP 2.5, it's the exact same thing than I had with Ubuntu20:10
Gaasmannfor the others, I don't know, the cluster is being configured and then a timeout occured but I can't get the action that timed out20:11
tellesnobrega:(20:11
tellesnobregaThe timeout for ambari is a common issue, we have a new patch that allows increasing the timeout20:12
toskycentos 6 images were deprecated (and probably not even working) even in pike (centos 7 was the way already)20:12
tellesnobregahttps://review.openstack.org/#/c/593598/20:13
toskytellesnobrega: should we start backporting it?20:14
tellesnobregatosky, yes20:14
toskytellesnobrega: technically it was marked as "feature" in the release notes, but it's more a fix for slow systems20:16
tellesnobregait is a fix, we are basically exposing a config option20:17
tellesnobregawe can backport it20:17
openstackgerritLuigi Toscano proposed openstack/sahara stable/rocky: Add template param for ambari pkg install timeout  https://review.openstack.org/60522620:18
guimalufshould I use images within here http://sahara-files.mirantis.com/images/upstream/ to use sahara? or there is a better image repo?20:30
guimalufI would like to get a sahara-spark with spark 2.x20:31
tellesnobregaguimaluf, the best practice is for you to generate your images20:32
tellesnobregabut I think for pike the images there would work (I believe)20:32
guimaluftellesnobrega, should I use sahara elements or the newer image-gen?20:32
tellesnobregaguimaluf, newer image gen is for queens forward, for pike you should use sahara-image-elements20:33
guimaluftellesnobrega, ok! :) nice :) thanks again xD20:33
toskyguimaluf: we removed all references to those images from our documentation, so no, please create your own images20:34
guimaluftosky, no problem... there is a safe setup for creating my images? I did generate an spark image but I'm getting an error when creating the cluster20:35
guimalufError during command execution: "mv temp-file-aced7642-e5af-4887-a0a8-600342b396d7 /opt/hadoop/etc/hadoop/core-site.xml"20:35
guimalufmaybe I forgot something20:35
toskymake sure to use the last version of sahara-image-elements for the pike branch20:36
toskyand run it on a separate, fresh system (a small VM/instance is more then enough)20:36
*** pcaruana has quit IRC20:43
toskytellesnobrega: those exceptions: I suspect that the "results" of the fake plugin for validation do not match the results expected by the test20:58
tellesnobregatosky, that is what I think as well20:59
tellesnobreganot really sure how to properly fix, because some of those tests are interesting but not with fake plugin20:59
toskycouldn't the fake plugin be changed to simulate some more complex validation rules?21:03
*** rcernin has joined #openstack-sahara23:07
*** tosky has quit IRC23:11
*** dave-mccowan has joined #openstack-sahara23:19

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!