Friday, 2018-08-10

*** pmannidi has joined #openstack-kuryr00:01
*** hongbin has joined #openstack-kuryr01:00
*** hongbin has quit IRC03:19
*** rh-jelabarre has quit IRC03:28
*** janki has joined #openstack-kuryr04:25
openstackgerritPeng Liu proposed openstack/kuryr-tempest-plugin master: NPWG multi-vif driver tempest tests  https://review.openstack.org/590643 04:37
*** janki has quit IRC05:19
*** janki has joined #openstack-kuryr05:28
*** janki has quit IRC06:41
openstackgerritDaniel Mellado proposed openstack/kuryr-kubernetes master: Add a new gate for network policies  https://review.openstack.org/590304 06:41
*** pcaruana has joined #openstack-kuryr06:43
openstackgerritPeng Liu proposed openstack/kuryr-kubernetes master: Move function get_subnet to kuryr_kubernetes.utils  https://review.openstack.org/590662 06:45
ltomasbocelebdor[m], dmellado: do you know if there is any issue with containerized openshift on the gates?07:03
dmelladonot that I'm aware ltomasbo07:03
dmelladowhat's going on?07:03
ltomasboit's taking ages for me to make this patch pass them! https://review.openstack.org/#/c/580680 07:03
ltomasbothe 2 that timed out07:04
ltomasboI'm checking the logs, and it seems the kuryr-cni was not ready07:04
dmelladoI had some timeouts on octavia yesterday07:04
ltomasboand I see there is no crds/annotations being made07:04
dmelladoin fact I'm still waiting for 1 patch to get merged07:04
ltomasbo Kuryrnet CRD could not be added. Rolling back network resources created for the namespace.: K8sClientException: <Response07:04
dmelladohmm weird07:05
dmelladolet's check https://review.openstack.org/#/c/590304/ 07:05
dmelladoit's set to trigger experimental gates07:05
ltomasbodmellado, btw, I'll be posting one soon this morning regarding namespace and pool deletion race07:05
ltomasbodmellado, great! let me know the outcome!07:05
dmelladoltomasbo: depending on how fast you are it might even cut in rc107:06
ltomasboperhaps those gates are marked as non-voting for a reason...07:06
ltomasbodmellado, I'm stacking to test if it fixes the issue07:06
ltomasboif it does, it will be in a couple of hours I guess07:06
dmelladoheh, let me know if you can get it07:06
dmelladobut be warned that I'll be going off early today07:06
dmelladolike in 3 hours or so07:06
ltomasboohh, ok07:07
ltomasbowe'll see if I'm fast enough xD07:07
ltomasbodmellado, celebdor[m]: look what I'm seeing: Aug 09 17:33:42.760263 ubuntu-xenial-limestone-regionone-0001248938 openshift[6215]: 2018-08-09 17:33:42.751 23 ERROR kuryr_kubernetes.cni.api [-] Looks like 127.0.0.1:50036 cannot be reached. Is kuryr-daemon running?: ConnectionError: HTTPConnectionPool(host='127.0.0.1', port=50036): Max retries exceeded with url: /addNetwork (Caused by NewConnectionError('<urllib3.connection.HTTPConn07:07
ltomasboection object at 0x7f02389ea750>: Failed to establish a new connection: [Errno 111] Connection refused',))07:07
dmelladohmm that's weird07:08
ltomasbothat is on the openshift-node07:08
dmelladothat's only on that experimental gate, isn't it?07:08
ltomasboyep, and that should be independent of the namespace isolation (I suppose)07:08
dmelladoltomasbo: let's check if that duplicates on my patch07:08
dmelladoif so, we'll open a bug for this07:08
openstackgerritMerged openstack/kuryr-kubernetes master: Change Pod annotations format to o.vo  https://review.openstack.org/584377 07:20
*** janki has joined #openstack-kuryr07:27
*** pmannidi has quit IRC07:42
openstackgerritLuis Tomas Bolivar proposed openstack/kuryr-kubernetes master: Ensure OpenShift gate uses the namespace subnet/sg drivers  https://review.openstack.org/580680 07:46
*** garyloug has joined #openstack-kuryr08:06
*** celebdor has joined #openstack-kuryr08:27
pliuHi, I am working on a patch for testing the NPWG feature with the tempest plugin, and hit a problem when using the method exec_command_in_pod.08:31
celebdorpliu: which problem?08:31
*** janki has quit IRC08:32
pliusince kuryr-kubernetes cannot work with kube-dns, the kube-api cannot get ip of pod from dns. like you cannot run 'kubectl exec' with kuryr08:32
pliuI saw other tempest cases also use this method like test_namespace.py, I wonder how it works.08:34
dmelladodamn, I'm having issues cutting rc108:36
dmelladodamn 'paperworks' xD08:36
celebdorpliu: did you try with devstack openshift support?08:36
celebdorI added dns support there08:36
pliuso the kube-dns can run on openshift base deployment?08:38
celebdorpliu: kube-dns is run internally by openshift node when you choose openshift devstack08:38
pliugood to know that. I will try it with openshift. Thanks08:39
celebdorI didn't have time to integrate kube-dns for kubernetes deployment08:39
celebdordmellado: we have flakiness again in getting answers from different backends08:39
celebdorwhy oh why can't we get reliable round robin from Octavia?08:40
*** janki has joined #openstack-kuryr08:43
ltomasbocelebdor, https://review.openstack.org/#/c/580680/ this one is finally working on containerized openshift octavia09:25
ltomasbocelebdor, I'm re-checking the experimental gate as the serial one failed due to the yum install issue that happens every now and then09:26
celebdorltomasbo: what's the new apigroups for?09:26
ltomasbootherwise openshift cannot push objects into the kuryrnet crds that are used for the namespaces09:27
ltomasbocelebdor, ^^09:27
ltomasboit was being rejected (403 exception)09:27
celebdorright!09:28
celebdorgood catch09:28
ltomasbocelebdor, well, it took me a few retries...09:28
ltomasbocelebdor, it was driving me crazy!!09:29
celebdorshouldn't the second apigroup be enough?09:29
celebdorI would think the first one is only to create new crd from specs09:29
celebdornot to create/update/delete crd instances09:29
celebdorltomasbo: ^^09:29
ltomasboprobably09:30
ltomasbothe crd was being created without that actually (as it was created with kubectl apply)09:30
ltomasbocelebdor, do you want me to give it a try without that?09:30
celebdorltomasbo: yes, please09:31
ltomasbosure!09:31
openstackgerritLuis Tomas Bolivar proposed openstack/kuryr-kubernetes master: Ensure OpenShift gate uses the namespace subnet/sg drivers  https://review.openstack.org/580680 09:32
ltomasbodone09:32
celebdorltomasbo: dmellado: do you know why the hell do we use curl in https://github.com/openstack/kuryr-tempest-plugin/blob/master/kuryr_tempest_plugin/tests/scenario/test_service.py#L45-L47 09:32
ltomasbolet's see if it likes it09:32
celebdorinstead of just using requests?09:32
celebdorI don't quite get it09:32
dmelladocelebdor: before going off09:32
dmelladoI'd say because of gcheresh09:32
dmelladoxD09:33
dmelladog'weekend y'all09:33
celebdordo you see anything wrong with changing it to requests?09:33
dmelladonope09:33
celebdorok09:33
celebdorso then I'm changing it to parallel requests09:34
celebdorusing a threadpool09:34
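
A minimal sketch of the idea celebdor describes here: issue the HTTP requests from a thread pool instead of spawning curl, then count how many distinct replies come back. This is not the code from the proposed patch. The function names `fetch_body` and `count_backends`, the parameters, and the assumption that each backend returns a distinguishable body are all hypothetical, and `requests` is a third-party package.

```python
from collections import Counter
from multiprocessing.pool import ThreadPool


def fetch_body(url):
    """Fetch one response body (assumes the third-party requests package)."""
    import requests  # imported lazily so the sketch loads without it
    return requests.get(url, timeout=10).text


def count_backends(url, attempts=10, workers=5, fetch=fetch_body):
    """Issue `attempts` GETs in parallel and count the distinct replies.

    `fetch` is injectable so the function can be exercised without a network.
    """
    pool = ThreadPool(processes=workers)
    try:
        # map() blocks until every request has finished
        bodies = pool.map(fetch, [url] * attempts)
    finally:
        pool.close()
        pool.join()
    return Counter(bodies)
```

A test could then assert that the number of keys in the returned counter matches the number of pods behind the service, which is where the uneven round robin mentioned earlier would show up.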
openstackgerritLuis Tomas Bolivar proposed openstack/kuryr-kubernetes master: Ensure delete_network_pools include all the ports  https://review.openstack.org/590739 10:25
ltomasbocelebdor, ^^ this fixes the race for deleting namespaces and ports10:32
openstackgerritAntoni Segura Puimedon proposed openstack/kuryr-tempest-plugin master: Use ThreadPool and requests instead of subprocess  https://review.openstack.org/590740 10:32
openstackgerritAntoni Segura Puimedon proposed openstack/kuryr-tempest-plugin master: Use ThreadPool and requests instead of subprocess  https://review.openstack.org/590740 10:37
celebdorltomasbo: ^^ getting rid of curl10:37
celebdorltomasbo: can you explain how that works?10:42
ltomasbocelebdor, sure!10:54
openstackgerritMerged openstack/kuryr-kubernetes master: drop deprecated oadm  https://review.openstack.org/588303 10:55
ltomasbocelebdor, basically it ensures that _trigger_return_to_pool is not executed at the same time in the background (periodic task cleaning up ports to be reused) and by delete_network_pools10:55
openstackgerritMerged openstack/kuryr-kubernetes master: Allow musl based distros to resolve using search  https://review.openstack.org/580701 10:56
celebdorltomasbo: at the same time as what?11:00
ltomasbocelebdor, so, _trigger_return_to_pool is the function that puts back the ports on the available_ports_pool dict11:00
ltomasbocelebdor, this is executed (in a different thread) every 15 seconds (by default)11:01
ltomasbocelebdor, and this function is also called from delete_network_pools11:01
celebdoroh11:02
ltomasbocelebdor, otherwise, delete_network_pools will delete the ports (belonging to the network) on available_ports_pool and then _trigger_return_to_pool will put some other ports of that network back to available_ports_pool11:02
ltomasbothat will lead to ports on that network not being deleted11:02
ltomasboand therefore, the subnet/network/sg cannot be deleted if it is in use11:02
ltomasboso, calling trigger_return_to_pool ensures that all the ports belonging to deleted pods are put back on the available_ports_pool11:03
ltomasboand locking trigger_return_to_pool so that it cannot be executed by different threads at the same time will ensure this is not f*cked up by executing the same function by the periodic time11:04
ltomasbotask11:04
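
The race ltomasbo explains above can be sketched roughly as follows. This is a toy model, not kuryr-kubernetes code: the `PortPool` class, its attributes, and the helper `_return_ports_locked` are hypothetical, and holding a single `threading.Lock` across both steps of the delete is a simplification that may differ from what the patch does. The point is only that the periodic return-to-pool step and the network pool deletion cannot interleave.

```python
import threading
from collections import defaultdict


class PortPool:
    """Toy stand-in for a port pool driver (hypothetical, for illustration)."""

    def __init__(self):
        self._lock = threading.Lock()
        self.available = defaultdict(list)  # network id -> reusable port ids
        self.recyclable = {}                # port id -> network id
        self.deleted = []                   # port ids removed with a network

    def _trigger_return_to_pool(self):
        # Called by the periodic task; serialized with delete_network_pools.
        with self._lock:
            self._return_ports_locked()

    def _return_ports_locked(self):
        # Caller must hold self._lock.
        for port_id, net_id in list(self.recyclable.items()):
            self.available[net_id].append(port_id)
            del self.recyclable[port_id]

    def delete_network_pools(self, net_id):
        with self._lock:
            # First return every recyclable port, so none reappears later...
            self._return_ports_locked()
            # ...then delete all ports pooled for this network.
            for port_id in self.available.pop(net_id, []):
                self.deleted.append(port_id)
```

With this ordering, a port freed by a deleted pod is already back in the pool when the network's ports are removed, so no port is left behind to block the subnet/network/sg deletion.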
celebdorltomasbo: thanks. Now go and make the commit message include this much more in depth explanation11:05
celebdor:-)11:05
ltomasboman...11:06
ltomasboI have it on the bug description11:06
celebdorok, let me look at it again11:06
ltomasboor you want more detailed info there too?11:06
celebdorcause I read both earlier and I missed part11:06
celebdorok, ok11:07
celebdorI buy it as it is11:07
ltomasboif not clear, I can try to extend it11:07
ltomasbono problem, I will re-spin it anyway to fix the typo11:08
celebdorno, it's okay, just the typo11:08
ltomasbook!11:08
celebdorltomasbo: I'm curious, why did you use separate lock names for baremetal and nested?11:08
celebdoris it for environments with both?11:09
ltomasboyep11:09
ltomasboI believe there is no need to block both at the same time11:09
ltomasboas they will be touching different pool keys11:09
ltomasbo(perhaps I'm wrong)11:09
celebdorok11:11
openstackgerritLuis Tomas Bolivar proposed openstack/kuryr-kubernetes master: Ensure delete_network_pools include all the ports  https://review.openstack.org/590739 11:15
openstackgerritAlexey Perevalov proposed openstack/kuryr-kubernetes master: Separate helper function for pod  https://review.openstack.org/590245 11:23
*** rh-jelabarre has joined #openstack-kuryr11:26
ltomasbocelebdor, in my explanation I forgot to mention the second part of the problem (deleting 2 namespaces with pods)11:31
celebdorltomasbo: add it to the bug11:32
ltomasboI'll do11:32
ltomasbocelebdor, seems removing the crd thing was not good: https://review.openstack.org/#/c/580680/21 12:10
openstackgerritAntoni Segura Puimedon proposed openstack/kuryr-tempest-plugin master: Use ThreadPool and requests instead of subprocess  https://review.openstack.org/590740 12:11
celebdorltomasbo: for which gate?12:11
ltomasbocontainerized openshift with octavia12:11
ltomasbothe 2 experimental ones that have failed12:11
celebdorltomasbo: why didn't it fail with the other openshift one?12:12
ltomasbono containerized12:12
ltomasbothen it does not need all the sa things12:12
celebdorthe controller always uses a serviceaccount, doesn't it?12:12
ltomasboonly if it is running on top of openshift/kubernetes, right?12:13
ltomasboif it is a devstack@XXX service it does not need it, right?12:13
celebdorltomasbo: how does it perform the operations on k8s otherwise?12:14
celebdorI don't think we use the admin kubeconfig ever12:14
celebdorlet me check12:14
ltomasbonop, I only have these problems for openshift gates12:14
ltomasboand only for the ones containerized12:15
celebdormeh12:30
celebdorthis is a bug12:30
celebdorwe should be using service accounts and cluster role bindings in both cases :/12:32
celebdorltomasbo: did you find the permission denied message?12:35
*** janki has quit IRC12:35
ltomasbonop12:35
ltomasboI'm rechecking just in case12:35
ltomasboI didn't find any error12:35
ltomasboseems like a timeout issue12:36
*** jistr is now known as jistr|call13:46
ltomasbocelebdor, so it seems the failures are unrelated, recheck changed the type of gate failing... https://review.openstack.org/#/c/580680 13:54
ltomasboso I guess you were right and the customresourcedefinition api was not needed13:55
ltomasbogoing to recheck for the last time!13:56
*** jistr|call is now known as jistr14:00
*** hongbin_ has joined #openstack-kuryr14:03
celebdor:-)14:21
*** AlexeyPerevalov has quit IRC14:37
openstackgerritAntoni Segura Puimedon proposed openstack/kuryr-tempest-plugin master: Use ThreadPool and requests instead of subprocess  https://review.openstack.org/590740 14:46
*** celebdor has quit IRC16:44
*** celebdor has joined #openstack-kuryr16:45
*** aojea has joined #openstack-kuryr16:46
*** celebdor has quit IRC16:49
*** aojea has quit IRC16:52
*** pcaruana has quit IRC19:19
*** aojea has joined #openstack-kuryr21:56
*** aojea has quit IRC22:27
*** hongbin_ has quit IRC23:00