Thursday, 2019-08-29

*** s_shogo has joined #openstack-cyborg01:15
*** s_shogo has quit IRC01:36
*** s_shogo has joined #openstack-cyborg01:59
*** openstackgerrit has joined #openstack-cyborg02:06
openstackgerritchenker proposed openstack/cyborg master: P4: Fix pep8 error in cyborg/api  https://review.opendev.org/67917202:06
openstackgerritXinran WANG proposed openstack/cyborg master: Fill "driver_name" field in Deployable object  https://review.opendev.org/67795202:20
*** openstackgerrit has quit IRC02:37
*** openstackgerrit has joined #openstack-cyborg02:46
openstackgerritYumengBao proposed openstack/cyborg master: enable branch selection in devstack installation  https://review.opendev.org/66930302:46
*** shaohe_feng has joined #openstack-cyborg02:56
openstackgerritchenker proposed openstack/cyborg master: P5: Fix pep8 error in cyborg/accelerator  https://review.opendev.org/67917502:59
*** Coco_gao_ has joined #openstack-cyborg03:04
Coco_gao_Hi all03:04
*** Sundar has joined #openstack-cyborg03:04
Coco_gao_Hi Sundar03:04
Coco_gao_Good evening03:04
SundarHi Coco_gao_03:04
shaohe_fengGood evening.03:05
Sundar#startmeeting openstack-cyborg03:05
openstackMeeting started Thu Aug 29 03:05:12 2019 UTC and is due to finish in 60 minutes.  The chair is Sundar. Information about MeetBot at http://wiki.debian.org/MeetBot.03:05
openstackUseful Commands: #action #agreed #help #info #idea #link #topic #startvote.03:05
*** openstack changes topic to " (Meeting topic: openstack-cyborg)"03:05
openstackThe meeting name has been set to 'openstack_cyborg'03:05
Coco_gao_#info Coco_gao_03:05
SundarHi all03:05
shaohe_fengmorning Coco_gao_03:05
s_shogoHi all03:05
Sundar#topic Attendance03:05
Coco_gao_morning shaohe03:05
*** openstack changes topic to "Attendance (Meeting topic: openstack-cyborg)"03:05
*** chenke has joined #openstack-cyborg03:05
Sundar#info SUndar03:05
s_shogo#info s_shogo03:05
*** Yumeng has joined #openstack-cyborg03:05
chenkeHi~03:05
SundarHi all03:05
Coco_gao_Hi chenke03:05
chenke#info chenke03:05
shaohe_feng#info shaohe_feng03:06
Yumeng#info Yumeng03:06
yikun#info yikun03:06
SundarAgenda: https://wiki.openstack.org/wiki/Meetings/CyborgTeamMeeting#Agenda03:06
chenkeHi Coco_gao_03:06
SundarPython 3: Since OpenStack Train release has some Python 3 goals, due by Milestone 3, and it seems that we are close to fixing Py3 issues for Cyborg,03:07
SundarI have requested s_shogo to make Python 3 tests as a voting job in Zuul.03:08
SundarAny objections or comments?03:08
*** wangzhh has joined #openstack-cyborg03:08
*** chunxiu has joined #openstack-cyborg03:08
SundarI'll take the silence as agreement. ;)  There were requests for fixing Python 3 in the cyborg client too. Luckily, it has taken only 1 patch so far, so we don't need to spend much time on it.03:10
chenke+103:10
s_shogoI'll do the py3 work in cyborg client, too.03:11
chenkegood job03:11
chenkeI have modified the tox.ini default env to support py36 and py3703:12
Coco_gao_thank you03:12
Sundars_shogo: The catch is, the current client is for v1 API code and not based on the openstacksdk method. Bringing it to v2 is more important, right?03:12
Coco_gao_s_shaogo03:12
Coco_gao_s_shogo03:12
wangzhhCool.03:12
shaohe_fenghttps://review.opendev.org/#/c/673228/03:12
shaohe_fengthis is a python3 issue fix for client03:13
SundarBut somebody else proposed a patch and it got merged.03:13
s_shogoSundar: I think so, My openstackSDK patch is made for the v2 Deployable API, now.03:13
s_shogoAnd the P5-P9 patches don't include the code migrating the "Deployable" API from v1 to v2.03:14
Sundars_shogo: Great. Please add device profiles, as that is more important IMHO. Operators need to create device profiles to use Cyborg, but doing that with curl is not easy03:15
Coco_gao_agree, Sundar03:15
SundarAs 2nd priority, I'd say devices -- that will give an inventory of accelerator devices in the cluster03:15
SundarIMHO, when devices are asked for, we can return the components like deployables and attributes, so the client gets a full picture03:16
shaohe_fengyes, the client is more friendly than curl03:16
s_shogoRegarding the client, the deadline for openstackSDK commits seems to be near, so I would like to start committing to it before the APIv2 patches merge.03:17
SundarYes, makes sense03:17
SundarThanks, s_shogo!03:17
SundarThe main thing that is holding me back is that I am testing P5-P9 with the notification and Placement report patches. Plus, Nova code changes have created a merge conflict for me.03:18
SundarOnce those are resolved, hope we can merge the P5-P9 patches03:18
SundarAny other comments on the client, anybody?03:19
shaohe_fengyes, async job depends on P5-P903:19
s_shogoMy assumption is that the python-cyborg client and openstacksdk work could be completed before the Train release,03:19
shaohe_fenggreat03:19
s_shogobut I'm worried about whether my test code is sufficient, so please review it in the following patches and help if necessary.03:19
Sundarshaohe_feng: Agreed. I'll expedite as much as I can.03:19
Sundars_shogo: Agreed, we'll help for sure03:20
s_shogoThanks , Sundar03:20
shaohe_fengmaybe the test code can be added later.03:20
Sundarwangzhh: Thanks for proposing the RBAC patch. I had some concerns/questions in the patch. Please take a look.03:20
shaohe_fengfirst let's get the client working.03:20
Coco_gao_s_shogo, thank you . We will review the code.03:21
wangzhhYep. I have updated my code. May commit after meeting.03:21
SundarThanks, wangzhh03:21
s_shogoshaohe_feng : OK, I'll do that preferentially.03:22
Sundarshaohe_feng: Part of the issue is that some Nova developers want to test Cyborg code with Nova code in their env. Also, we need to show tempest working end-to-end.03:22
SundarAnybody else trying out the Placement report? With GPUs, AI chip, etc.?03:23
Coco_gao_What's the remaining work for tempest?03:23
shaohe_fengyes, tempest can eliminate their concerns03:23
SundarCoco_gao_: It is mostly to get the patches to work together, I think03:24
SundarXinran's patches look good IMO. Trying to make sure they work with P5-P903:24
YumengI have tried the Placement report With GPUs03:24
SundarYumeng: Good to know03:25
Sundar#topic Nova functional tests03:25
*** openstack changes topic to "Nova functional tests (Meeting topic: openstack-cyborg)"03:25
SundarThere was talk at the PTG that we should propose functional tests for Nova, which mock the Cyborg API in a test fixture, and use that to test Nova patches03:26
SundarThey seem to cover a few more scenarios than unit tests and tempest03:27
Coco_gao_mock cyborg API's return?03:27
chenkeI agree we need to import functional test for nova.03:27
SundarCoco_gao_: Yes03:28
SundarWe have an entry in the Storyboard too. I have not seen any comments of late, but there is concern that it may come up at the last moment03:28
SundarSince there is lots of stuff in Nova runway, it can be tough to get a 2nd look if this issue comes up03:29
SundarDo we have any volunteers for writing Nova functional tests? I'll help as much as I can03:29
SundarPlease think it over and LMK if you can.03:31
Sundarshaohe_feng: Do you want to bring up the discussion about ARQ states and transitions, as followup? Or is it settled?03:32
shaohe_fengyes03:33
shaohe_fengone thing is: who deletes the ARQ03:33
shaohe_fengwhen the delete API tags the state as delete_pending?03:34
SundarThere is Nova code to delete the ARQ in some error cases and when VM is terminated03:34
shaohe_fengmaybe it is still in the bind process03:35
shaohe_fengshould the bind process delete it when it finds the state is delete_pending?03:35
SundarYes. In that case, IMHO, it is best to let the bind complete and the traits get updated in Placement, and then unbind/delete the ARQ03:36
SundarIf we try to interrupt FPGA programming, bad things can happen03:36
shaohe_fengwe will not add any rollback for bind in this release. just go through the whole process, even when deleting.03:36
SundarAgreed03:36
shaohe_fengOK.03:37
Coco_gao_OK03:37
shaohe_fengany state transition should be a transaction.03:37
SundarYes, db transaction03:38
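The point above, that every ARQ state transition should be a db transaction, can be sketched as a compare-and-swap update. This is only an illustration and not Cyborg's code: the `arqs` table, the `transition` helper, and every state name except `delete_pending` are hypothetical, and sqlite3 stands in for the real database layer.

```python
import sqlite3

# Hypothetical transition table; only "delete_pending" is named in the discussion.
ALLOWED = {
    ("initial", "bind_started"),
    ("bind_started", "bound"),
    ("bind_started", "delete_pending"),
    ("bound", "delete_pending"),
    ("delete_pending", "deleted"),
}


def transition(conn, arq_uuid, old, new):
    """Move an ARQ from `old` to `new` atomically; return True on success."""
    if (old, new) not in ALLOWED:
        raise ValueError(f"illegal transition {old} -> {new}")
    with conn:  # commits on success, rolls back if an exception is raised
        cur = conn.execute(
            "UPDATE arqs SET state = ? WHERE uuid = ? AND state = ?",
            (new, arq_uuid, old),
        )
    # rowcount is 0 when another worker changed the state first
    return cur.rowcount == 1


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE arqs (uuid TEXT PRIMARY KEY, state TEXT NOT NULL)")
conn.execute("INSERT INTO arqs VALUES ('arq-1', 'initial')")
conn.commit()
```

With this shape, a bind worker whose final update returns False learns that the state changed underneath it (for example to `delete_pending`) and can finish the bind before handing the ARQ over for unbind and delete.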
*** xinranwang has joined #openstack-cyborg03:38
shaohe_fengseems there is a state machine in oslo lib03:39
SundarAny other issue, shaohe_feng?03:39
shaohe_fengwe will not introduce it in this release03:39
SundarOk by me. What are the benefits of using that?03:40
shaohe_fengbecause I need time to read up on it.03:40
shaohe_fengdo not look into it at present.03:40
Sundarok03:40
shaohe_fengmaybe after the whole flow code is finished03:40
shaohe_fengwe can look at the pros and cons03:41
SundarSure. We'll trust your judgement on this :)03:41
shaohe_fenganother thing: should the async job time out?03:41
SundarOn a different note, I am seeing this issue for allocating attach handles: https://opendev.org/openstack/cyborg/src/branch/master/cyborg/db/sqlalchemy/api.py#L269 The in_use field does not get written to db03:42
shaohe_fengbut there's still a problem.03:42
SundarThe timeout should correspond to default Nova timeout03:42
shaohe_fengmaybe it is in programming or other critical job03:43
SundarThe programming typically takes a few seconds, so default of 300 seconds (I think) is good enough03:43
shaohe_fenga timeout can be a disaster03:43
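The timeout concern above can be sketched with the standard library. This assumes a thread-pool based async job; `wait_for_bind`, `fake_program`, and the 300-second constant are illustrative names and values taken from the discussion, not Cyborg code. The waiter gives up after the timeout, but the job itself is not interrupted, which is the risk raised above for a programming step that is still running.

```python
import time
from concurrent.futures import ThreadPoolExecutor, TimeoutError as FutureTimeout

BIND_TIMEOUT_SECONDS = 300  # assumed default, as suggested in the discussion


def wait_for_bind(future, timeout=BIND_TIMEOUT_SECONDS):
    """Return ("done", result) or ("timed_out", None) without cancelling the job."""
    try:
        return "done", future.result(timeout=timeout)
    except FutureTimeout:
        # A running thread cannot be interrupted, so the job keeps going.
        return "timed_out", None


def fake_program(seconds):
    """Stand-in for a device programming step."""
    time.sleep(seconds)
    return "programmed"


executor = ThreadPoolExecutor(max_workers=2)
```

A caller that gets "timed_out" still has to decide what to do with the ARQ once the job eventually finishes, since the device may end up programmed after the waiter has given up.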
shaohe_fenganother thing03:44
shaohe_fengcurrently the bind process is specific to FPGA03:44
SundarUmm, bind if for all accelerators. Only programming is for FPGA. the bind means the ARQ is associated with a host and deployable in Cyborg's db, and the device is ready to use03:45
Sundar*is for03:45
shaohe_fengthere should be a good extension point for other kinds03:45
shaohe_fengI mean:03:46
shaohe_feng1.  get the resource type.03:46
shaohe_fengevery resource type should have its own extended bind action03:46
shaohe_fengfor FPGA it is program.03:46
shaohe_fengfor others it may be env setup, not sure.03:47
shaohe_feng2. every resource should have its own placement report.03:47
shaohe_fengthe report info may be different03:48
shaohe_fengso the code should be:03:48
shaohe_fengtype, num = arq.group_get_resource()03:49
shaohe_fengfor n in range(num):03:49
shaohe_fengaction = get_accelerator_action(type) # fpga is program03:50
shaohe_fengaction()03:50
shaohe_fengsomething like this03:50
shaohe_fengand this code should be split out of the arq object file03:50
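The pseudocode above can be made runnable as a small registry that maps a resource type to its bind action. This is a sketch under assumptions: `BIND_ACTIONS`, `bind_action`, `program_fpga`, `no_op`, and `run_bind_actions` are hypothetical names, and the resource type and count are passed in directly instead of being read from the ARQ group.

```python
# Registry of per-resource-type bind actions (hypothetical names throughout).
BIND_ACTIONS = {}


def bind_action(resource_type):
    """Decorator that registers a bind action for one resource type."""
    def register(func):
        BIND_ACTIONS[resource_type] = func
        return func
    return register


@bind_action("FPGA")
def program_fpga(index):
    # For FPGA the extra bind action is programming.
    return f"programmed FPGA #{index}"


def no_op(index):
    # Fallback for resource types that need no preparation.
    return f"nothing to prepare for device #{index}"


def run_bind_actions(resource_type, num):
    """Run the registered action once per requested device."""
    action = BIND_ACTIONS.get(resource_type, no_op)
    return [action(n) for n in range(num)]
```

Keeping the registry in its own module would also address the request to move this logic out of the ARQ object file.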
SundarIn general, the process should be generic for all accelerators. The current code looks at the device profile request group to see if it has function_id or bitstream_id entries, which are specific to FPGA, to decide if programming is needed03:51
shaohe_fengwe may add other specs in03:52
SundarAFAIK, for non-FPGA devices in this release, there is nothing required to prepare the device, right?03:52
shaohe_fengdevice profiles for different accelerators03:53
shaohe_fengsuch as HDDL03:53
shaohe_fengwe can add03:53
shaohe_feng"accel:affinity": true03:54
SundarOk03:54
shaohe_fengwhich means we need 4 accelerators on one card03:54
SundarWe had an idea of a generic prepare_device API in the driver, which gets a dictionary as a parameter, where the dictionary values depend on the device type.03:54
shaohe_fengyes, different devices may take different actions during bind.03:55
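The generic prepare_device idea mentioned above could look roughly like the following. Only the method name `prepare_device`, the dictionary parameter, and the `bitstream_id` key come from the discussion; the class names and return values are hypothetical, and this is not the actual Cyborg driver interface.

```python
from abc import ABC, abstractmethod


class AcceleratorDriver(ABC):
    """Hypothetical base class: one prepare_device hook for every device type."""

    @abstractmethod
    def prepare_device(self, params: dict) -> str:
        """Prepare the device; the keys in `params` depend on the device type."""


class FpgaDriver(AcceleratorDriver):
    def prepare_device(self, params: dict) -> str:
        bitstream = params.get("bitstream_id")
        if bitstream is None:
            return "no programming requested"
        return f"programming bitstream {bitstream}"


class GenericDriver(AcceleratorDriver):
    def prepare_device(self, params: dict) -> str:
        # Nothing to prepare for this device type.
        return "ready"
```

The bind flow can then call `driver.prepare_device(params)` for every accelerator without knowing which device type it is handling.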
SundarQuick process check: Since we have only few minutes left, should we continue this via email, copying all of us and openstack-ML? What do you all think?03:56
shaohe_fengalso another thing: where do we init the ThreadPoolExecutor?03:56
shaohe_fengin the arq object file?03:56
shaohe_fengseems not good.03:56
shaohe_fengOK.03:57
SundarAll, please look at this issue for allocating attach handles: https://opendev.org/openstack/cyborg/src/branch/master/cyborg/db/sqlalchemy/api.py#L269 The in_use field does not get written to db03:57
SundarAll, we are seeing good review activity of late. Thank you all, and please keep it up. We are literally 2 weeks from the milestone. :)03:58
Sundar#topic AoB03:58
*** openstack changes topic to "AoB (Meeting topic: openstack-cyborg)"03:58
Sundarshaohe_feng: if you prefer, I can initiate an email thread for the good points that you brought up. Good?03:59
shaohe_fengOK03:59
SundarAnything else, folks?03:59
shaohe_fengdid you check that in_use is in the arguments of the update function?04:00
SundarYes04:01
shaohe_fengand does your DB really have the in_use field?04:01
shaohe_fengdirectly use mysql command.04:01
SundarOh yes. The ref.update has it, but it doesn't get written to db. Use mysql cmd from Python code?04:02
shaohe_fengno04:02
shaohe_fengsuch as:04:02
shaohe_fengmysql  -uroot -ppass cyborg04:02
SundarYes, update command works from CLI04:02
SundarWe'll follow up on this too by email.04:03
shaohe_fengdesc haddler;04:03
shaohe_fengOK.04:03
SundarThanks, everybody. Happy coding and reviewing :). Have a good day. Bye.04:03
Sundar#endmeeting04:03
*** openstack changes topic to "Pending patches (Meeting topic: openstack-cyborg)"04:03
openstackMeeting ended Thu Aug 29 04:03:58 2019 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)04:04
openstackMinutes:        http://eavesdrop.openstack.org/meetings/openstack_cyborg/2019/openstack_cyborg.2019-08-29-03.05.html04:04
openstackMinutes (text): http://eavesdrop.openstack.org/meetings/openstack_cyborg/2019/openstack_cyborg.2019-08-29-03.05.txt04:04
openstackLog:            http://eavesdrop.openstack.org/meetings/openstack_cyborg/2019/openstack_cyborg.2019-08-29-03.05.log.html04:04
shaohe_fengyou can update the field manully by mysql CLI,  right?04:04
shaohe_fengmanually04:04
SundarHi shaohe_feng: Yes04:07
shaohe_fengcan I log in to your env?04:07
SundarSure, I'll send you the info separately04:07
shaohe_fengOK. thanks.04:07
*** chunxiu has quit IRC04:34
*** s_shogo has quit IRC04:34
openstackgerritchenker proposed openstack/cyborg master: Fix pep8 error in cyborg/*.py and add Forbidden class  https://review.opendev.org/67904206:03
openstackgerritYumengBao proposed openstack/cyborg master: enable branch selection in devstack installation  https://review.opendev.org/66930306:07
*** xinranwang has quit IRC06:08
openstackgerritchenker proposed openstack/cyborg master: P3: Fix pep8 error in cyborg/common and cyborg/conductor  https://review.opendev.org/67906206:24
openstackgerritchenker proposed openstack/cyborg master: P4: Fix pep8 error in cyborg/api  https://review.opendev.org/67917206:27
*** Coco_gao_ has quit IRC06:34
openstackgerritYumengBao proposed openstack/cyborg master: enable branch selection in devstack installation  https://review.opendev.org/66930306:36
openstackgerritchenker proposed openstack/cyborg master: Fix pep8 error in cyborg/agent and cyborg/db  https://review.opendev.org/67919307:01
openstackgerritchenker proposed openstack/cyborg master: P4: Fix pep8 error in cyborg/api  https://review.opendev.org/67917207:07
openstackgerritchenker proposed openstack/cyborg master: Fix pep8 error in cyborg/*.py and add Forbidden class  https://review.opendev.org/67904207:21
openstackgerritchenker proposed openstack/cyborg master: P2: Fix pep8 error in cyborg/conf and cyborg/cmd  https://review.opendev.org/67904507:21
openstackgerritchenker proposed openstack/cyborg master: P3: Fix pep8 error in cyborg/common and cyborg/conductor  https://review.opendev.org/67906207:21
openstackgerritchenker proposed openstack/cyborg master: P4: Fix pep8 error in cyborg/api  https://review.opendev.org/67917207:21
*** shaohe_feng has quit IRC07:23
openstackgerritchenker proposed openstack/cyborg master: P5: Fix pep8 error in cyborg/accelerator  https://review.opendev.org/67917507:26
openstackgerritchenker proposed openstack/cyborg master: P6: Fix pep8 error in cyborg/agent and cyborg/db  https://review.opendev.org/67919307:26
*** Sundar has quit IRC07:49
chenkeHi, all. When you have time, please help me review the pep8 related commit. I think this should be merged as soon as possible, which will facilitate the code in cyborg. Thanks.07:52
*** wangzhh has quit IRC07:58
*** chenke has quit IRC09:59
*** Yumeng has quit IRC11:18
openstackgerritchenker proposed openstack/cyborg master: P6: Fix pep8 error in cyborg/agent and cyborg/db  https://review.opendev.org/67919312:39
openstackgerritchenker proposed openstack/cyborg master: P6: Fix pep8 error in cyborg/agent and cyborg/db  https://review.opendev.org/67919312:41
*** efried is now known as efried_afk13:47
openstackgerritMerged openstack/cyborg master: enable branch selection in devstack installation  https://review.opendev.org/66930314:06
openstackgerritMerged openstack/cyborg master: Fill "driver_name" field in Deployable object  https://review.opendev.org/67795214:36
*** efried_afk is now known as efried15:13
openstackgerritShaoHe Feng proposed openstack/cyborg master: bug fix: update in DB instead of in cache  https://review.opendev.org/67931418:00
openstackgerritShaoHe Feng proposed openstack/cyborg master: bug fix: update in DB instead of in cache  https://review.opendev.org/67931418:40
openstackgerritMerged openstack/cyborg master: Move to releases.openstack.org  https://review.opendev.org/66477421:13
