Friday, 2017-07-07

*** esikachev has joined #openstack-sahara00:00
*** esikachev has quit IRC00:05
*** abalutoiu has quit IRC00:17
*** esikachev has joined #openstack-sahara01:00
*** esikachev has quit IRC01:05
openstackgerritzhongshengping proposed openstack/puppet-sahara master: Remove deprecated keystone authtoken signing_dir option  https://review.openstack.org/48137102:00
*** esikachev has joined #openstack-sahara02:01
*** esikachev has quit IRC02:06
*** lucasxu has joined #openstack-sahara02:53
*** lucasxu has quit IRC02:59
*** esikachev has joined #openstack-sahara03:02
*** esikachev has quit IRC03:07
*** ukaynar has joined #openstack-sahara03:08
*** chlong_ has quit IRC03:08
*** chlong_ has joined #openstack-sahara03:09
*** chlong_ has quit IRC03:17
*** chlong_ has joined #openstack-sahara03:18
*** ukaynar has quit IRC03:55
*** esikachev has joined #openstack-sahara04:03
*** esikachev has quit IRC04:08
*** ukaynar has joined #openstack-sahara04:10
*** ukaynar has quit IRC04:26
*** ukaynar has joined #openstack-sahara04:26
*** ukaynar has quit IRC04:26
*** links has joined #openstack-sahara04:28
*** esikachev has joined #openstack-sahara05:04
*** Poornima has joined #openstack-sahara05:30
*** rcernin has joined #openstack-sahara05:46
*** pgadiya has joined #openstack-sahara05:50
*** Poornima has quit IRC06:21
*** aolwas28 has quit IRC06:24
*** rickflare has quit IRC06:37
*** htruta has quit IRC06:38
*** aolwas28 has joined #openstack-sahara06:39
*** rickflare has joined #openstack-sahara06:40
*** htruta has joined #openstack-sahara06:42
*** tesseract has joined #openstack-sahara07:17
openstackgerritLuigi Toscano proposed openstack/sahara-image-elements master: Get files from tarballs.o.o if possible (extjs, policy)  https://review.openstack.org/48147707:42
*** anshul has joined #openstack-sahara07:49
*** abalutoiu has joined #openstack-sahara08:09
*** mgoddard_ has joined #openstack-sahara08:16
*** tosky has joined #openstack-sahara09:12
*** openstackgerrit has quit IRC09:48
*** mgoddard_ has quit IRC10:19
*** tellesnobrega has quit IRC10:43
*** mgoddard_ has joined #openstack-sahara10:51
*** esikachev has quit IRC10:59
*** tellesnobrega has joined #openstack-sahara11:25
*** esikachev has joined #openstack-sahara11:47
*** pcaruana has joined #openstack-sahara11:49
*** pgadiya has quit IRC12:08
*** fazalkhan has joined #openstack-sahara12:10
*** tellesnobrega has quit IRC12:12
fazalkhanTrying to deploy CDH 5.7.0 on OSP 10 using Sahara, cluster provisioning is failing at Plugin: Configure cluster (Configure Services). In the sahara log, I am getting12:13
fazalkhanERROR sahara.service.ops [req-63639760-e72c-4407-8e5c-37ba8deb01f1 bad7eae8d0304f9d97987774b2da3ca9 cd860a89a6f54ec2a39c300d2b81cf37 - - -] [instance: none, cluster: e30d432a-7ff0-41d6-9671-4d88f1a9fb7e] Error during operating on cluster (reason: <urlopen error [Errno 111] ECONNREFUSED>)12:13
fazalkhanTraceback is going to urllib2.py line 431, in open  -- line 449 in _open -- line 409 in _call_chain ---- line 1244 in http_open --- line 1214 do_open --- raise URLError(err).12:18
fazalkhanCan anybody help or point me to something? Thanks.12:18
toskyyou should paste a bit more from the logs, and possibly setting them to debug level12:22
tosky(and using a pastebin of course)12:22
*** esikachev has quit IRC12:22
*** tellesnobrega has joined #openstack-sahara12:25
*** esikachev has joined #openstack-sahara12:26
*** esikachev has quit IRC12:39
*** chlong_ has quit IRC12:43
*** esikachev has joined #openstack-sahara12:43
fazalkhan@tosky Here is the current paste: https://pastebin.com/3b43YYt212:46
toskyfazalkhan: that's the stacktrace of the operation; what I was asking was for more details before12:48
toskyfazalkhan: did the instances spawned by sahara through nova reach the ACTIVE state?12:48
fazalkhanYes, all instances are up, volumes created, attached and mounted. I can access all of them.12:49
*** esikachev has quit IRC12:50
*** jeremyfreudberg has joined #openstack-sahara12:52
toskywere the node group and cluster templates configured to access the instances through the public network?12:53
fazalkhanAll instances have been auto assigned floating IPs of public network and using auto-genrated security groups.12:55
toskyso please paste more from the logs12:57
jeremyfreudbergfazalkhan, tosky, if provisioning fails at "Configuring step", then I would think it passed the "wait for instances accessible" step, so it's probably not related to network. (and if auto sec group, then that rules out sec group issue). so my guess is related to a specific CDH process not coming up properly12:57
jeremyfreudbergso, yes, more logs :)12:58
fazalkhan@tosky and @jeremyfreudberg You mean Sahara-engine and api logs with debugging set to true?12:59
*** ukaynar has joined #openstack-sahara12:59
toskyyes12:59
jeremyfreudbergfazalkhan give us as much context as you can now, but ideally with debug would be the best way to diagnose12:59
*** lucasxu has joined #openstack-sahara12:59
*** esikachev has joined #openstack-sahara13:00
fazalkhanokay. thanks. I will get back in ~10 minutes with the new logs.13:00
*** links has quit IRC13:06
*** esikachev has quit IRC13:22
jeremyfreudbergtosky, it's a one line change to start publishing oozie on tarballs.o.o again (some time ago we did do it), do you want me to push that?13:44
toskyjeremyfreudberg: yep!13:44
toskythe infra team would complain if there is something wrong13:44
toskyand the oozie tarballs are not so big13:44
jeremyfreudbergyep13:44
*** openstackgerrit has joined #openstack-sahara13:46
openstackgerritJeremy Freudberg proposed openstack/sahara-extra master: Start building oozie again  https://review.openstack.org/48163813:46
toskyjeremyfreudberg: I wonder... why did we disable it?13:47
toskyI totally forgot13:47
*** ukaynar has quit IRC13:47
jeremyfreudbergtosky, according to some commit message from vitaly, we had only been building the same oozie 4.2.0 for a very long time, since we only needed that for Vanilla 2.7.1 (all other Vanillas had been removed by that time)13:48
jeremyfreudbergalthough, it still might be a  good point that we don't actually need to build a new oozie each time, we could just put the binaries there once13:48
toskyjeremyfreudberg: did you check if the generated build matches the value used by sahara-image-elements?13:48
jeremyfreudberggood point too, I'll check soon13:49
fazalkhan@tosky @jeremyfreudberg here is the complete sahara-engine log: https://paste.ee/p/uC60L13:50
jeremyfreudbergthanks fazalkhan, I will take a look13:50
fazalkhanthank you guys. really appreciate your help.13:50
jeremyfreudberglooks like cloudera manager (port 7180) is our problem13:52
toskyfazalkhan: which flavor did you use for the host with cloudera manager?13:52
toskyalso, did you assign other services to the host where cloudera manager is assigned?13:53
openstackgerritJeremy Freudberg proposed openstack/sahara-extra master: Start building oozie again  https://review.openstack.org/48163813:53
toskybecause cloudera manager, from my experience, should be in a separate node with 8 GiB, if no more, or RAM13:54
toskyof RAM*13:54
fazalkhanclouder manager host is using m1.medium flavor and only has cloudera_manager process assigned to it.13:57
tosky4 GiB of RAM13:58
toskytoo small13:58
toskyat least 8 GiB13:58
fazalkhan@tosky okay, I am trying again with 8GB for cloudera manager host.13:58
*** chlong_ has joined #openstack-sahara14:08
*** esikachev has joined #openstack-sahara14:19
*** esikachev has quit IRC14:23
fazalkhan@jeremyfreudberg On the cloudera-manager host, I have manually started the cloudera-scm-server but it is still not listening on port 7180. the httpd server starts listening on 80 and 443.14:25
toskydid you start it with the 8 GiB flavor?14:27
fazalkhanno I did that previously with the 4GB flavor. the deployment with 8GB flavor is still in progress.14:28
toskyI would suggest to postpone the testing until the 8GB deployment ends (or exits)14:29
toskyif cloudera manager was not started was because there was not enough memory - did you check its logs?14:30
fazalkhan@tosky thanks a lot sir. It has gone past the step where it was failing.14:35
fazalkhan@tosky it has now failed at cluster event> Plugin: start cluster (First run cluster) and logs say: Error during operating on cluster (reason: Failed to Provision Hadoop Cluster: Failed to format NameNode.14:41
openstackgerritJeremy Freudberg proposed openstack/sahara-extra master: Start building oozie again  https://review.openstack.org/48163814:43
jeremyfreudbergfazalkhan, some debug log context around that error?14:43
fazalkhan@jeremyfreudberg Here sahara-enigne.log https://paste.ee/p/6Tm5L14:53
fazalkhanI am using Local storage on compute nodes (LVM) and assigning 4x volumes of 1TB each to each worker instance. it is failing to format those volumes.14:54
*** anshul has quit IRC14:55
*** ukaynar has joined #openstack-sahara14:56
jeremyfreudbergfazalkhan, I admit I do not see much solution off the top of my head, but I guess try with ephemeral instead of cinder if you can15:03
jeremyfreudbergsorry for the extra steps15:03
*** lucasxu has quit IRC15:04
*** rcernin has quit IRC15:06
*** tellesnobrega has quit IRC15:07
fazalkhanno problem @jeremyfreudberg. unfortunately, i have to use cinder. It is a permission related error - on the namenode instance, as "cloud-user", I cannot execute "hadoop namenode -format"  but with sudo it gets executed successfully.15:08
toskyuhm, which images are you using?15:09
*** sudipto has joined #openstack-sahara15:10
*** sudipto_ has joined #openstack-sahara15:10
fazalkhan@tosky I am using sahara-newton-cloudera-5.7.0-centos7.qcow215:12
toskywe are going OT, but just to be sure, are you using OSP10 or RDO/Newton?15:13
fazalkhanIt is OSP1015:13
toskythen I tested the RHEL7-based image, rebuilt using sahara-image-elements15:14
toskythe instructions are part of the documentation of OSP1015:14
toskythat said, the centos7 image should work as well15:15
toskymaybe still too low memory?15:20
toskyhttps://bugs.launchpad.net/sahara/+bug/168235915:20
openstackLaunchpad bug 1682359 in Sahara "Creation of CDH Cluster fails if flavor is too small" [Undecided,Confirmed]15:20
*** esikachev has joined #openstack-sahara15:20
fazalkhanOkay, I will check the cloudera logs and will try with 16GB in the next attempt.15:22
*** esikachev has quit IRC15:25
openstackgerritJeremy Freudberg proposed openstack/sahara-extra master: Start building oozie again  https://review.openstack.org/48163815:40
*** abalutoiu has quit IRC15:42
*** anshul has joined #openstack-sahara15:48
*** mgoddard_ has quit IRC15:58
openstackgerritJeremy Freudberg proposed openstack/sahara-extra master: Start building oozie again  https://review.openstack.org/48163816:05
jeremyfreudbergtosky, I think either my brain is broken, or there is something I don't know about16:14
jeremyfreudbergregarding the artifacts publishing job16:14
jeremyfreudbergsays file doesn't exist when it does, if you have some time you could take a look. (but I am afk for now)16:15
*** jeremyfreudberg has quit IRC16:15
*** anshul has quit IRC16:19
*** esikachev has joined #openstack-sahara16:21
*** esikachev has quit IRC16:25
fazalkhan@tosky It failed again with the error "Failed to format Namenode" with 16GB memory assigned to Cloudera Manager host. So, its probably a permission related error.16:31
toskyfazalkhan: try to rebuild the RHEL image according the OSP instructions16:31
fazalkhan@tosky you said that you tested with RHEL-7 based image, did it work fine with CDH 5.7?16:31
fazalkhanokay.16:31
toskyI did, using the RHEL image built using sahara-image-elements16:31
fazalkhanawesome. Thanks.16:32
toskybut I couldn't try with a 1 TB attached disk :)16:32
toskyjust to be sure, while rebuilding the image, could you please try to attach smaller disks, just to be sure?16:32
fazalkhanI will let you know of my experiment. :)16:32
fazalkhansmaller as in? 200GB disks for all instances in the cluster?16:33
toskyand even no disk, just to be sure16:33
toskytry smaller than 100GB just to be sure16:33
fazalkhanoh okay. I can try that. I will try with no disks or 100GB first with RHEL-based image and then go on to increase the disk size.16:34
toskyI was suggesting to test no disk and/or small disk also with the current image, while rebuilding the new image16:35
toskybut whatever you can test, it's fine - it's your time16:35
*** esikachev has joined #openstack-sahara16:40
*** tosky has quit IRC17:14
*** dave-mccowan has joined #openstack-sahara17:24
*** links has joined #openstack-sahara17:25
*** links has quit IRC17:29
*** links has joined #openstack-sahara17:30
*** tosky has joined #openstack-sahara17:32
*** sudipto has quit IRC17:35
*** sudipto_ has quit IRC17:35
*** links has quit IRC17:36
*** ukaynar has quit IRC17:41
openstackgerritTelles Mota Vidal Nóbrega proposed openstack/sahara master: Image generation for Ambari Plugin  https://review.openstack.org/44871418:09
*** tellesnobrega has joined #openstack-sahara18:14
tellesnobregatosky, are you still around?18:16
tellesnobregafazalkhan, hey, I wasn't around when you were with your problem, did you get it to work?18:17
*** tesseract has quit IRC18:21
toskytellesnobrega: hi18:36
tellesnobregadid you guys figure out what was wrong with the cdh env?18:37
toskythe issue reported by fazalkhan ?18:38
*** chlong_ has quit IRC18:40
tellesnobregayes18:48
*** ukaynar has joined #openstack-sahara18:55
toskynot completely19:14
*** ukaynar has quit IRC19:20
*** ukaynar has joined #openstack-sahara19:21
*** fazalkhan has quit IRC19:29
*** ukaynar has quit IRC19:53
*** ukaynar has joined #openstack-sahara20:27
*** ukaynar has quit IRC20:38
*** ukaynar has joined #openstack-sahara20:39
*** abalutoiu has joined #openstack-sahara20:41
*** abalutoiu_ has joined #openstack-sahara20:43
*** abalutoiu has quit IRC20:46
*** abalutoiu has joined #openstack-sahara20:48
*** abalutoiu_ has quit IRC20:49
openstackgerritTelles Mota Vidal Nóbrega proposed openstack/sahara master: Image generation for Ambari Plugin  https://review.openstack.org/44871420:52
*** ukaynar has quit IRC20:53
*** abalutoiu has quit IRC20:54
*** abalutoiu has joined #openstack-sahara20:55
*** abalutoiu_ has joined #openstack-sahara20:57
*** abalutoiu has quit IRC21:00
*** abalutoiu has joined #openstack-sahara21:07
*** abalutoiu_ has quit IRC21:09
*** abalutoiu has quit IRC21:21
*** dave-mccowan has quit IRC21:41
*** esikachev has quit IRC21:46
*** openstackgerrit has quit IRC21:47
*** openstackstatus has quit IRC21:56
*** openstack has joined #openstack-sahara21:59
*** esikachev has joined #openstack-sahara22:42
*** esikachev has quit IRC22:47
*** tosky has quit IRC22:53
*** abalutoiu has joined #openstack-sahara23:23
*** esikachev has joined #openstack-sahara23:43
*** abalutoiu has quit IRC23:45
*** abalutoiu has joined #openstack-sahara23:47
*** esikachev has quit IRC23:48
*** abalutoiu has quit IRC23:48
*** abalutoiu has joined #openstack-sahara23:49
*** abalutoiu_ has joined #openstack-sahara23:52
*** abalutoiu has quit IRC23:54
*** abalutoiu__ has joined #openstack-sahara23:56
*** abalutoiu has joined #openstack-sahara23:57
*** abalutoiu_ has quit IRC23:59

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!