Tuesday, 2017-09-05

jmlowe_https://jetstream-cloud.org/archive/publications.php00:44
jmlowe_masber ^^^^00:46
masberjmlowe_, you have vms flavors with 44 cores?01:02
masberdo you use NUMA?01:02
jmlowe_yes01:02
jmlowe_44 hyperthreaded cores01:02
jmlowe_24 real cores01:02
jmlowe_2 socket dell m630 blades01:04
masberwhy not using ironic for those use cases where the tenant needs a whole machine?01:07
jmlowe_we don't use openvswitch or sdn01:08
jmlowe_we also wanted the flexibility to live migrate and take maintenance that way01:08
jmlowe_linpack gets %97 of bare metal for our purposes that was close enough01:09
jmlowe_we aren't exactly doing hpc, we are targeting users that don't currently use nationally funded resources01:10
jmlowe_%97 of users eligible to use NSF funded resources don't01:11
masberok01:11
jmlowe_so the tradeoffs we made were performance for uptime01:11
masberyou don't use sdn means neutron talks to your physical switches and routers to configure the networking?01:12
jmlowe_all vxlan01:12
jmlowe_linuxbridge, so encapsulated tenant networks and neutron doesn't talk to any switches01:13
*** b1airo has joined #scientific-wg04:22
*** b1airo has quit IRC04:56
*** simon-AS559 has joined #scientific-wg06:44
*** simon-AS559 has quit IRC06:49
*** simon-AS559 has joined #scientific-wg06:49
*** simon-AS559 has quit IRC07:11
*** simon-AS559 has joined #scientific-wg07:11
*** priteau has joined #scientific-wg08:24
*** simon-AS559 has quit IRC09:27
*** simon-AS559 has joined #scientific-wg09:29
*** priteau has quit IRC09:51
*** simon-AS559 has quit IRC11:03
*** priteau has joined #scientific-wg11:26
*** b1airo has joined #scientific-wg11:28
*** rbudden has joined #scientific-wg13:00
*** simon-AS559 has joined #scientific-wg13:02
*** b1airo has quit IRC13:28
*** simon-AS559 has quit IRC13:32
*** trandles has joined #scientific-wg13:50
bolligjmlowe_: looking forward to your writeup on luminous. blynch has some data on our rebuilds that might be of interest. we have dual NVMe and 6 SSDs per server, and our rebuilds have been pretty quick. I’ll remind him to bring the data and notes to the SC BoF.14:16
*** simon-AS559 has joined #scientific-wg14:17
bolligmasber: jmlowe_: we run panasas on our HPCs, but opted to run a separate Ceph cluster for OpenStack. a) it is significantly cheaper and scales well; b) we partition it for dual purpose (block and object); c) it reserves the parallel I/O performance for nodes on the HPC that can actually use it (panasas requires a kernel mod we don’t have on hypervisors); d) ceph as a separate island allows us to satisfy use-cases like controlled-access data which14:23
bolligdon’t fit well on global namespace filesystems.14:23
bolligmasber: if you’re interested in scheduler integrations, you might look at Adapative Computing’s MOAB. They have/had a cloud add-on that provisions compute nodes to alleviate queue pressure; scale-in is included. We opted out of that feature due to cost. Also, at one point I hacked together a few scripts to report openstack resource availability to the PBS scheduler, and matching prologue and epilogue scripts to manage VM provisioning for indivi14:28
bolligpipeline jobs. I can share more details of that, but it would be operating in an unsupported mode.14:28
trandlesjmlowe_: it's official now   https://www.openstack.org/summit/berlin-201817:25
jmlowe_bollig masber: I saw an early demo of the moab openstack integration, it was pretty slick if you want to move the partition back and forth between traditional batch and cloud as needed18:52
jmlowe_Whew, just finished recovering the last pg, took roughly 6 days18:53
*** priteau has quit IRC19:15
*** priteau has joined #scientific-wg19:37
*** priteau has quit IRC19:41
*** priteau has joined #scientific-wg19:59
*** simon-AS559 has quit IRC20:45
*** oneswig has joined #scientific-wg20:56
*** martial has joined #scientific-wg21:00
*** b1airo has joined #scientific-wg21:06
*** StefanPaetowJisc has joined #scientific-wg21:23
priteauThere was no time for AOB in the meeting, I would like to share something else: I am happy to announce that Chameleon is being renewed for another three years! https://ci.uchicago.edu/blog/cloud-computing-testbed-chameleon-renewed-second-phase22:04
jmlowe_excellent, congratultions22:04
martialcongratulations Pierre :)22:04
oneswigBravo priteau, looking forward to Chameleon's Second Age!22:05
martialI should also introduce a new federation effort backed by NIST and IEEE22:05
priteauLooking forward to continue contributing to OpenStack :-)22:05
martialtitled P2302 ... I know catchy :)22:05
b1airogreat news priteau22:05
jmlowe_I just heard from Jeff Adams a few minutes ago, we now have 4 speakers for SC'17 booth talks22:06
oneswigmartial: got a link?22:06
martialoneswig: lookibg22:06
martialYou can find more details at http://collaborate.nist.gov/twiki-cloud-computing/bin/view/CloudComputing/FederatedCloudPWGFC and http://sites.ieee.org/sagroups-2302/22:06
martialwe can discuss this more next week22:07
oneswigSounds good to me22:07
martialfor now not much happening there ... the kick off meeting was last week22:07
*** oneswig has quit IRC22:08
StefanPaetowJiscSorry folks, as much as there was a suggestion I should try to get to Sydney to bring up macOS support for Moonshot, I've been so snowed under here it's not possible. Next OS summit, honest22:08
StefanPaetowJisc:-/22:08
martialwait and see:)22:10
jmlowe_Please excuse my ignorance, what's Moonshot?22:10
jmlowe_damn, we had a meeting today didn't we?22:11
jmlowe_nothing like a vomiting 3yr old to reoder your priorities22:11
*** b1airo has quit IRC22:13
*** priteau has quit IRC22:15
StefanPaetowJiscMoonshot is a GSSAPI mechanism for federated non-web authentication :-)22:16
StefanPaetowJiscIf you're used to the Grid Computing world, you'll know GSI-SSH. It does something similar, but just with grid certs22:16
StefanPaetowJischttps://wiki.moonshot.ja.net - macOS was a priority for us in the end when we bet on the wrong horse (Windows instead of macOS).22:17
StefanPaetowJisc:-/22:17
*** StefanPaetowJi-1 has joined #scientific-wg22:19
*** StefanPaetowJisc has quit IRC22:19
*** StefanPaetowJi-1 is now known as StefanPaetowJisc22:19
trandlesGRRRR, sorry I missed the meeting today22:21
*** StefanPaetowJisc has quit IRC22:22
trandlesjmlowe_: quick link ;)    http://eavesdrop.openstack.org/meetings/scientific_wg/2017/scientific_wg.2017-09-05-21.00.log.html22:25
jmlowe_ah, interesting, I'll have to check it out22:26
jmlowe_My contribution for the meeting would have been using senlin to backfill with osg instances, using webhooks to scale up and down as needed22:32
martialmike: save it for next time :)22:35
*** martial has quit IRC22:35
*** b1airo has joined #scientific-wg23:30
*** b1airo has quit IRC23:47
*** b1airo has joined #scientific-wg23:50

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!