Wednesday, 2017-08-02

*** shuyingya has joined #openstack-sahara00:07
*** shuyingya has quit IRC00:11
*** tosky has quit IRC00:15
*** shuyingya has joined #openstack-sahara00:43
*** shuyingya has quit IRC00:50
*** shuyingya has joined #openstack-sahara01:13
*** tuanluong has joined #openstack-sahara01:38
*** https_GK1wmSU has joined #openstack-sahara02:10
*** https_GK1wmSU has left #openstack-sahara02:13
*** dave-mcc_ has joined #openstack-sahara02:41
*** dave-mccowan has quit IRC02:42
*** dave-mccowan has joined #openstack-sahara03:12
*** dave-mcc_ has quit IRC03:14
*** esikachev has joined #openstack-sahara03:23
*** esikachev has quit IRC03:27
*** dave-mccowan has quit IRC03:33
*** links has joined #openstack-sahara03:48
*** esikachev has joined #openstack-sahara04:44
*** esikachev has quit IRC04:59
*** Poornima has joined #openstack-sahara05:12
*** ssmith has joined #openstack-sahara05:16
*** esikachev has joined #openstack-sahara05:42
*** anshulbehl has joined #openstack-sahara05:42
*** pgadiya has joined #openstack-sahara05:48
*** hoonetorg has quit IRC06:02
*** hoonetorg has joined #openstack-sahara06:15
*** pcaruana has joined #openstack-sahara06:34
*** rcernin has joined #openstack-sahara06:46
*** shuyingya has quit IRC07:12
*** shuyingya has joined #openstack-sahara07:12
*** esikachev has quit IRC07:31
*** tesseract has joined #openstack-sahara07:48
*** ssmith has quit IRC08:00
*** esikachev has joined #openstack-sahara08:09
*** zemuvier has joined #openstack-sahara08:10
openstackgerritAlina Nesterova proposed openstack/sahara-ci-config master: Remove README.md  https://review.openstack.org/48842008:20
*** pgadiya has quit IRC08:25
openstackgerritAlina Nesterova proposed openstack/sahara-ci-config master: Remove README.md  https://review.openstack.org/48842008:28
*** openstackgerrit has quit IRC08:33
*** pgadiya has joined #openstack-sahara08:41
*** abalutoiu has joined #openstack-sahara09:11
*** esikachev has quit IRC09:42
*** tosky has joined #openstack-sahara09:48
*** shuyingya has quit IRC09:53
*** anshulbehl has quit IRC10:06
*** tuanluong has quit IRC10:07
*** anshulbehl has joined #openstack-sahara10:18
*** esikachev has joined #openstack-sahara10:26
*** shuyingya has joined #openstack-sahara10:38
*** Poornima has quit IRC10:48
*** esikachev has quit IRC10:50
*** anshulbehl has quit IRC10:54
*** esikachev has joined #openstack-sahara10:54
*** Poornima has joined #openstack-sahara10:54
*** pgadiya has quit IRC10:56
*** anshulbehl has joined #openstack-sahara11:07
*** pgadiya has joined #openstack-sahara11:08
*** esikachev has quit IRC11:28
*** abalutoiu has quit IRC11:38
*** Poornima_K has joined #openstack-sahara11:50
*** Poornima has quit IRC11:51
*** Poornima_K has quit IRC12:00
*** Poornima has joined #openstack-sahara12:00
*** openstackgerrit has joined #openstack-sahara12:15
openstackgerritAlina Nesterova proposed openstack/sahara-ci-config master: [wip] Add playbook to run Apache2  https://review.openstack.org/49000212:15
*** Poornima has quit IRC12:23
*** shuyingya has quit IRC12:35
*** abalutoiu has joined #openstack-sahara12:49
*** jeremyfreudberg has joined #openstack-sahara12:58
*** shuyingya has joined #openstack-sahara12:58
*** shuyingya has quit IRC13:03
*** shuyingya has joined #openstack-sahara13:03
*** lucasxu has joined #openstack-sahara13:12
*** aolwas has quit IRC13:21
tellesnobregajeremyfreudberg, remember the error with spark.py and swift?13:21
jeremyfreudbergtellesnobrega, i remember discussing it, yes13:22
tellesnobregagoes down to java.lang.RuntimeException: java.lang.ClassNotFoundException: Class org.apache.hadoop.fs.swift.snative.SwiftNativeFileSystem not found13:22
*** aolwas has joined #openstack-sahara13:22
jeremyfreudbergso, hadoop-swift jar is not in classpath13:23
tellesnobregaapparently yes, but looking into /opt/spark/conf there is a spark-defaults.conf file13:24
tellesnobregawhich has spark.executor.extraClassPath /usr/lib/hadoop-mapreduce/hadoop-openstack.jar13:24
jeremyfreudbergthat is true13:24
jeremyfreudbergi'm in a spark cluster now seeing that13:25
tellesnobregawhich has the class13:25
tellesnobregafor some reason it is not on the classpath13:25
jeremyfreudbergcan you go into the job execution's folder in /tmp/spark-edp/....... and look at launch_command.log, too?13:26
tellesnobregayes13:26
tellesnobregawhat would you like to see?13:27
tellesnobrega/opt/spark/bin/spark-submit --class main --master spark://spark21-spark21-master-0:7077 --deploy-mode client /tmp/wcount.py swift://bigsea.sahara/text.py13:27
tellesnobregathat is the command13:27
jeremyfreudbergthat's what i wanted to see13:28
*** pgadiya has quit IRC13:29
jeremyfreudbergi think you have edp.spark.adapt_for_swift (this is a checkbox in dashboard ui, otherwise it's a config value) as false13:29
tellesnobregait is false13:29
jeremyfreudbergwith that true, it adds --driver-class-path /usr/lib/hadoop-mapreduce/hadoop-openstack.jar as an argument to spark submit13:29
jeremyfreudbergBUT13:29
jeremyfreudbergit also uses the SparkWrapper jar13:29
jeremyfreudbergand some other stuff13:30
jeremyfreudbergwhich i thought was glitchy with pyspark13:30
jeremyfreudbergwhich is why we should try it out13:30
tellesnobregasure13:31
tellesnobregai got a different error now13:35
tellesnobregaUsage: wordcount <file>13:35
jeremyfreudbergthe input args are not getting there13:35
tellesnobregait is13:36
*** abalutoiu has quit IRC13:36
tellesnobregabut there is some stuff in between13:36
tellesnobregaspark.xml and main13:36
*** abalutoiu has joined #openstack-sahara13:36
jeremyfreudbergright, in this edp.spark.adapt_for_swift=true case, the last arguments are input and output, and before that should be mainclass13:38
jeremyfreudbergthe main class of the wordcount jar, i mean13:38
jeremyfreudberg--deploy-mode client <file name> spark.xml <main class> <input> <output>13:39
jeremyfreudbergand i don't mean wordcount jar13:39
jeremyfreudbergit's py13:40
jeremyfreudbergmain class we determined was "main", but possibly dummy13:40
tellesnobregastill failing with wordcount <file>13:42
tellesnobrega--deploy-mode client /tmp/wcount.py spark.xml main swift://bigsea.sahara/text.py swift://bigsea.sahara/output13:43
jeremyfreudberghmm13:44
jeremyfreudbergtrying something13:47
tellesnobregaok13:49
jeremyfreudbergwell, my finding is that when running with edp.spark.adapt_for_swift=true, a lot of stuff makes it into sys.argv of the python script that wouldn't if you were running with adapt=false13:55
*** abalutoiu_ has joined #openstack-sahara13:56
jeremyfreudbergconfirmed with meteos codebase as well, see https://github.com/openstack/meteos/blob/master/meteos/engine/drivers/generic.py#L191 they have two relevant args, but https://github.com/openstack/meteos/blob/master/meteos/cluster/binary/meteos-script-1.6.0.py#L648 shows that they start way down in sys.argv[3], with some other stuff before it13:57
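An editorial sketch of the argument shift described above. It assumes that the wrapper-related arguments (`spark.xml` and the main class, as in the command quoted earlier in the log) arrive at the front of the script's `sys.argv`, so a Python job has to skip them before reading its own input and output. The helper name `split_job_args` and the default of two leading wrapper arguments are assumptions made for illustration and are not part of Sahara.

```python
def split_job_args(argv, wrapper_arg_count=2):
    """Separate wrapper-injected arguments from the job's own arguments.

    argv[0] is the script path. The next `wrapper_arg_count` entries are
    assumed to be wrapper arguments (hypothetical default: spark.xml and
    the main class). Everything after that belongs to the job itself.
    """
    first_job_arg = 1 + wrapper_arg_count
    wrapper_args = argv[1:first_job_arg]
    job_args = argv[first_job_arg:]
    return wrapper_args, job_args


# Argument list taken from the command quoted in the log.
argv = [
    "/tmp/wcount.py",
    "spark.xml",
    "main",
    "swift://bigsea.sahara/text.py",
    "swift://bigsea.sahara/output",
]
wrapper_args, job_args = split_job_args(argv)
input_path, output_path = job_args
print(input_path)
```

With this layout the input path sits at index 3 of `sys.argv`, which is consistent with the offset observed in the meteos script.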
-openstackstatus- NOTICE: We have disabled infracloud-vanilla due to the compute host running mirror.regionone.infracloud-vanilla.o.o being offline. Please recheck your failed jobs to schedule them to another cloud.13:58
*** abalutoiu has quit IRC13:58
*** abalutoiu__ has joined #openstack-sahara13:59
*** zemuvier has quit IRC14:02
tellesnobregajeremyfreudberg, I'm seeing something similar here14:02
*** abalutoiu_ has quit IRC14:03
tellesnobregaremoved a lot of stuff and it kinda worked14:03
tellesnobregafailed on swift credentials14:03
jeremyfreudbergremoved stuff from the spark-submit command? or what?14:03
tellesnobregayes14:09
tellesnobregacleaned up the command14:09
tellesnobregaa bit14:09
tellesnobregafor some reason it is not reading the spark.xml14:10
tellesnobregait is passed --files14:10
jeremyfreudbergah yes, finally, that's the issue i was trying to remember!14:11
tellesnobregabut it fails saying that the config is missing fs.swift.service.sahara.username14:12
*** esikachev has joined #openstack-sahara14:15
tellesnobregajeremyfreudberg, https://spark.apache.org/docs/2.1.0/storage-openstack-swift.html14:16
jeremyfreudbergtellesnobrega, so the configs are the right thing, but they can't be accessed by pyspark job. you remember every time we discuss this and I say, "hmm... I think there's some weird issue between SparkWrapper jar and pyspark jobs, but I can't remember what it is", this is the issue... it can't read spark.xml because the way that SparkWrapper injects those fs.swift.* configs at runtime makes them accessible only to JAR-file jobs14:16
jeremyfreudbergin the next release, we can invent an edp.spark.adapt_for_swift_pyspark that injects the configs in a compatible way14:17
tellesnobregaby pyspark you mean spark-submit?14:18
jeremyfreudbergi mean spark job written in python14:18
tellesnobregaok14:18
tellesnobregabecause pyspark used to be something in spark that isn't supported anymore14:18
jeremyfreudbergoh, i see14:19
tellesnobregacan you try running without the sparkwrapper?14:19
jeremyfreudbergwithout the spark wrapper (and the extra args that get put there because of it) it should mostly work, since that's the same as edp.spark.adapt_for_swift=false, but you'll still need --driver-class-path and some way for the swift username and password to get in14:21
tellesnobregayes14:21
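An editorial sketch of the wrapper-free command discussed above: the plain spark-submit invocation from the log, plus `--driver-class-path` pointing at the hadoop-openstack jar. The function name `build_submit_command` is hypothetical, and `--class` is left out on the assumption that it is not needed for a Python script. This covers only the classpath half of the problem; the Swift credentials still have to reach the job some other way.

```python
import shlex

# Jar path as quoted from spark-defaults.conf earlier in the log.
HADOOP_OPENSTACK_JAR = "/usr/lib/hadoop-mapreduce/hadoop-openstack.jar"


def build_submit_command(master, script, job_args, driver_class_path=None):
    """Build a spark-submit argument list without the SparkWrapper extras.

    Options must come before the application file; everything after the
    script path is handed to the script as its own arguments.
    """
    cmd = [
        "/opt/spark/bin/spark-submit",
        "--master", master,
        "--deploy-mode", "client",
    ]
    if driver_class_path:
        cmd += ["--driver-class-path", driver_class_path]
    cmd.append(script)
    cmd.extend(job_args)
    return cmd


cmd = build_submit_command(
    master="spark://spark21-spark21-master-0:7077",
    script="/tmp/wcount.py",
    job_args=["swift://bigsea.sahara/text.py"],
    driver_class_path=HADOOP_OPENSTACK_JAR,
)
# Print a shell-safe rendering of the command for inspection.
print(" ".join(shlex.quote(part) for part in cmd))
```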
jeremyfreudbergat some point i had written something like this in a job:14:22
jeremyfreudbergsc = SparkContext(appName="PythonWordCount")14:22
jeremyfreudbergsc._jsc.hadoopConfiguration().set('fs.swift.service.sahara.username', sys.argv[3])14:22
jeremyfreudbergand i recall that working14:22
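An editorial sketch expanding the two lines above into a fuller shape, under stated assumptions: the username arrives in `sys.argv[3]` as in the snippet, the password is assumed to follow in `sys.argv[4]`, and `fs.swift.service.sahara.password` is assumed to be the matching key for the password. The helper names are hypothetical. `sc._jsc` is a private PySpark attribute, so this is a workaround rather than a supported interface.

```python
SWIFT_PREFIX = "fs.swift.service.sahara."


def swift_settings(username, password):
    """Return the Hadoop configuration entries for the Swift credentials."""
    return {
        SWIFT_PREFIX + "username": username,
        SWIFT_PREFIX + "password": password,  # key name is an assumption
    }


def apply_swift_settings(sc, settings):
    """Copy the settings into the Hadoop configuration behind `sc`."""
    hadoop_conf = sc._jsc.hadoopConfiguration()
    for key, value in settings.items():
        hadoop_conf.set(key, value)


def make_context(argv):
    """Create a SparkContext and inject credentials taken from argv."""
    # Imported lazily so the helpers above can be used without pyspark.
    from pyspark import SparkContext

    sc = SparkContext(appName="PythonWordCount")
    apply_swift_settings(sc, swift_settings(argv[3], argv[4]))
    return sc
```

A job would call `make_context(sys.argv)` once at startup and then read `swift://` paths as usual. Passing credentials as arguments keeps them out of the script, but they still appear in the launch command.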
tellesnobregayes, from that link I sent you the right way to run it is by putting the configuration into the job itself14:23
tellesnobregaconsidering the risk of having sensitive information in a file somewhere14:24
tellesnobregaI'm not sure what would be the best set up for us here. I get that having the config done on our side makes things easier for the user14:26
tellesnobregabut we need to do this right, if it doesn't work then we have nothing14:27
jeremyfreudbergwell, i'm thinking about a few things14:27
jeremyfreudbergright now, it basically works if you tell the user "put the credentials in the script, or read them from args"14:27
jeremyfreudbergbut that's annoying/bad because it means the python spark jobs are different for no reason apparent to user14:28
jeremyfreudbergand you can't use proxy user temp credentials14:28
tellesnobregaare you sure that works? do you still have the job you wrote?14:28
jeremyfreudbergi can test it later, but it works on meteos14:28
tellesnobregacool, I will try to add that to my wcount.py test as well14:28
jeremyfreudbergbut what i was saying regarding proxy users, is that sahara assumes the credentials can be read from edp configs in fs.swift.service.sahara.*14:29
jeremyfreudbergbut a python spark job can't access these configs, which eventually end up in spark.xml, as we saw, at least in the current state14:29
tellesnobregayes14:30
tellesnobregaI assume that this is what the wrapper does, reads this config before sending the job up14:30
tellesnobregaor am I wrong? my java is a little rusty14:30
jeremyfreudbergthat's more or less right, it reads the config and puts it "somewhere", then invokes the main method of the user's actual job after config has been properly loaded14:31
jeremyfreudbergso, let's remember that the point of EDP is to provide a generic interface for jobs that should behave more or less the same regardless of underlying services14:31
jeremyfreudbergso we are kind of failing that goal now regarding the python spark situation14:32
tellesnobregatrue14:32
jeremyfreudbergideally we could write some new kind of wrapper specifically for python spark jobs, which would preserve the user experience and make edp feel generic as it should be14:32
tellesnobregathat is exactly my point14:33
tellesnobregawe are completely failing it right now14:33
jeremyfreudbergso, we should definitely investigate if that's possible to resolve14:33
tellesnobregaI'm trying a basic word count and it doesn't work14:33
tellesnobregayes14:33
*** Poornima has joined #openstack-sahara14:35
*** Poornima has quit IRC14:35
*** Poornima has joined #openstack-sahara14:37
jeremyfreudbergin any case, i'll add it to etherpad, now that we actually have a clear view of the limitations14:42
tellesnobregayes14:42
jeremyfreudbergexact solution will take some time to flesh out14:42
tellesnobregathanks14:42
*** links has quit IRC15:01
jeremyfreudberghey tellesnobrega, do you know how i can get added to the launchpad group for sahara, so i can set bug importance (hi/medium/etc)?15:08
* tellesnobrega checks15:08
*** esikachev has quit IRC15:14
tellesnobregajeremyfreudberg, can you check if you can add yourself to the team?15:14
jeremyfreudbergtellesnobrega, not sure if I'm even looking at the right team, sahara drivers, sahara bug team15:15
jeremyfreudberg?15:15
tellesnobregasahara drivers should be the one15:16
tellesnobregaI'm not an administrator of the group15:17
*** esikachev has joined #openstack-sahara15:17
jeremyfreudbergtellesnobrega, try adding me to sahara bug team, if you are member of drivers team then you are admin there15:19
jeremyfreudberghttps://launchpad.net/~sahara-bugs/+members#active15:19
tellesnobregaok15:20
jeremyfreudbergif it can't be done, it's not the end of the world, but it would be nice15:20
tellesnobregathat I could do15:21
*** esikachev has quit IRC15:22
jeremyfreudbergtellesnobrega, thanks, that works, i have full control over bugs now15:22
tellesnobregaawesome15:22
jeremyfreudbergyou should add tosky and esikachev too15:22
tellesnobregawill do15:28
*** tomtomtom has quit IRC15:32
*** tomtomtom has joined #openstack-sahara15:33
*** esikachev has joined #openstack-sahara15:33
toskyuhm, talking about the usage of the main gates for more complex jobs... https://review.openstack.org/#/c/489773/15:36
tosky(see '[trove] Can we move some non-voting broken jobs to the experimental queue?' on openstack-dev)15:36
*** abalutoiu__ has quit IRC15:38
elmikojeremyfreudberg esikachev tosky congrats to you all on the cores =)15:41
esikachevthanks)15:42
* elmiko just saw the "paperwork" go through15:42
*** anshulbehl has quit IRC15:42
*** jeremyfreudberg has quit IRC15:44
openstackgerritShu Yingya proposed openstack/sahara-dashboard master: Fix the compatibility after django 1.11 support  https://review.openstack.org/49006415:49
*** shuyingya has quit IRC15:51
*** shuyingya has joined #openstack-sahara15:51
*** jeremyfreudberg has joined #openstack-sahara15:54
jeremyfreudbergelmiko, thanks :)15:55
*** shuyingya has quit IRC15:56
jeremyfreudbergtosky, looking15:56
tellesnobregatosky, I don't see how that affects us, maybe I wasn't around during a previous discussion, is this regarding adding our jobs to the openstack ci?15:57
*** shuyingya has joined #openstack-sahara15:57
toskytellesnobrega: it does not affect us; I was just pointing out the number of jobs triggered15:57
tellesnobregaoh, I see15:57
tellesnobregaI guess that if we push we can get some of ours there15:57
toskyexactly; I was worrying too much maybe15:58
tellesnobregamaybe15:58
*** links has joined #openstack-sahara15:58
*** esikachev has quit IRC16:02
*** shuyingya has quit IRC16:04
*** shuyingya has joined #openstack-sahara16:04
*** Poornima has quit IRC16:05
*** rcernin has quit IRC16:08
*** shuyingya has quit IRC16:09
*** pcaruana has quit IRC16:14
*** hoonetorg has quit IRC16:30
toskyopenstack-dev: [all] Rollout of Zuul v3 at the PTG16:35
* tosky rejoices AND screams at the same time16:35
jeremyfreudberglol16:36
*** hoonetorg has joined #openstack-sahara16:42
*** jeremyfreudberg has quit IRC16:43
*** shuyingya has joined #openstack-sahara16:48
*** shuyingya has quit IRC16:53
*** esikachev has joined #openstack-sahara16:59
*** esikachev has quit IRC17:03
*** shuyingya has joined #openstack-sahara17:06
*** tesseract has quit IRC17:07
openstackgerritShu Yingya proposed openstack/sahara-dashboard master: Fix the compatibility after django 1.11 support  https://review.openstack.org/49006417:10
*** shuyingya has quit IRC17:11
*** esikachev has joined #openstack-sahara17:34
*** tosky has quit IRC17:36
*** esikachev has quit IRC17:38
*** jeremyfreudberg has joined #openstack-sahara17:41
*** esikachev has joined #openstack-sahara17:47
openstackgerritMerged openstack/python-saharaclient stable/pike: Update .gitreview for stable/pike  https://review.openstack.org/48880617:53
*** links has quit IRC18:31
*** esikachev has quit IRC18:32
*** esikachev has joined #openstack-sahara18:33
*** esikachev has quit IRC18:39
*** shuyingya has joined #openstack-sahara19:21
*** shuyingya has quit IRC19:25
tellesnobregajeremyfreudberg, about tosky's question if we need to backport the reno patch19:56
tellesnobregathe answer is no19:56
jeremyfreudbergtellesnobrega, cool, although it still seems strange to me, what if we wanted a release note to be included for pike-eol release, as an example. still, i'll trust your answer19:58
*** hoonetorg has quit IRC20:03
*** hoonetorg has joined #openstack-sahara20:20
tellesnobregajeremyfreudberg, when you have time can you check that python job for me?20:20
jeremyfreudbergtellesnobrega, sure20:21
*** lucasxu has quit IRC21:00
dhellmannjeremyfreudberg : the job that builds release notes for our CI always runs against the master branch, and reno scans the other branches based on the instructions in the rst files. So the rst file doesn't have to exist on the stable branch for our publishing to work. Now if someone wants to package that file, you would have to backport it.21:12
jeremyfreudbergok, dhellmann, thanks for stepping in to clarify. that actually makes a lot of sense21:13
dhellmannalways happy to help :-)21:13
*** shuyingya has joined #openstack-sahara21:20
*** shuyingya has quit IRC21:25
*** jeremyfreudberg has quit IRC21:38
*** shuyingya has joined #openstack-sahara22:29
*** shuyingya has quit IRC22:33
*** https_GK1wmSU has joined #openstack-sahara22:57
*** https_GK1wmSU has left #openstack-sahara22:59
*** shuyingya has joined #openstack-sahara23:18
*** shuyingya has quit IRC23:23
