Tuesday, 2020-09-01

*** cshen has joined #openstack-ansible00:09
*** cshen has quit IRC00:14
*** rh-jelabarre has quit IRC01:40
*** rh-jelabarre has joined #openstack-ansible01:40
*** cshen has joined #openstack-ansible02:10
*** cshen has quit IRC02:14
*** rh-jelabarre has quit IRC02:28
*** rh-jelabarre has joined #openstack-ansible02:47
*** rh-jelabarre has quit IRC02:52
*** gyee has quit IRC03:13
*** cshen has joined #openstack-ansible04:10
*** spatel has joined #openstack-ansible04:13
*** cshen has quit IRC04:14
*** evrardjp has quit IRC04:33
*** evrardjp has joined #openstack-ansible04:33
*** sakharkar has quit IRC04:52
*** djhankb has quit IRC05:49
*** djhankb has joined #openstack-ansible05:50
noonedeadpunkmornings!06:09
noonedeadpunkcan I have extra vote on soma patches? https://review.opendev.org/#/q/owner:noonedeadpunk%2540ya.ru+label:Verified+status:open06:10
*** cshen has joined #openstack-ansible06:10
noonedeadpunkalso would be great to have https://review.opendev.org/#/c/742105/ merged06:25
openstackgerritDmitriy Rabotyagov (noonedeadpunk) proposed openstack/openstack-ansible-tests master: Make focal and centos 8 functional tests voting  https://review.opendev.org/74918606:34
*** akahat is now known as akahat|rover07:05
*** andrewbonney has joined #openstack-ansible07:13
*** tosky has joined #openstack-ansible07:21
*** djhankb has quit IRC08:01
*** djhankb has joined #openstack-ansible08:02
*** sshnaidm|bbl is now known as sshnaidm08:36
openstackgerritMerged openstack/openstack-ansible-lxc_container_create master: Remove obosletd lxc_container_interface variable  https://review.opendev.org/73893408:38
*** SecOpsNinja has joined #openstack-ansible08:54
openstackgerritMerged openstack/openstack-ansible-os_cloudkitty master: Use the utility host for db setup tasks  https://review.opendev.org/74718008:57
*** viks____ has joined #openstack-ansible09:02
openstackgerritMerged openstack/openstack-ansible-ceph_client master: Fix ceph-client deployment for Ubuntu Focal  https://review.opendev.org/74812810:02
openstackgerritMerged openstack/openstack-ansible-plugins master: Replace deprecated imp module with importlib  https://review.opendev.org/74884010:19
*** mgagne has quit IRC10:39
openstackgerritJonathan Rosser proposed openstack/openstack-ansible-ceph_client stable/ussuri: Fix ceph-client deployment for Ubuntu Focal  https://review.opendev.org/74924211:07
openstackgerritDaniel Meloy proposed openstack/openstack-ansible-haproxy_server master: Add haproxy_backend_only flag to service template  https://review.opendev.org/74739111:15
openstackgerritDmitriy Rabotyagov (noonedeadpunk) proposed openstack/openstack-ansible-os_ironic master: [DNM] test patch  https://review.opendev.org/72706711:37
mgariepygood morning11:41
noonedeadpunko/11:41
mgariepywhat's up ?11:41
noonedeadpunkI guess we're mosntly in nternal stuff nowadays:)11:42
noonedeadpunkand I unlearned how to type11:42
noonedeadpunkbut generally things seem pretty nice:)11:43
mgariepyend of summer is hard for everybody.11:43
noonedeadpunkhow was your vacation?:)11:43
mgariepyway too short11:43
noonedeadpunkI head really big one this year, but it still feels like too short :) So I don't know if enough time of rest actually exist :p11:44
mgariepyhaha ;) yep.11:44
mgariepyit's easy to get used to be on vacation.11:45
noonedeadpunkjust there's so many things you always put on the shelf because of the lack of the time....11:45
mgariepyfor ovn tests, i'd wait to see the ovn tests succeed for a while before we make it in the gate job.11:48
*** chenhaw has quit IRC12:00
*** oyrogerg has quit IRC12:02
*** rh-jelabarre has joined #openstack-ansible12:03
*** rh-jelabarre has quit IRC12:04
*** rh-jelabarre has joined #openstack-ansible12:04
openstackgerritMerged openstack/openstack-ansible-ceph_client master: Remove trailing '/' from ceph_apt_repo_url  https://review.opendev.org/73697712:07
openstackgerritMerged openstack/openstack-ansible-os_cloudkitty master: Add CentOS 8 and Ubuntu Focal support  https://review.opendev.org/74847612:16
jrosserhmmm theres centos-7 stuff failing all over patches on mater12:23
jrosser*master12:23
openstackgerritMerged openstack/openstack-ansible-tests master: Make focal and centos 8 functional tests voting  https://review.opendev.org/74918612:24
jrosserwierd nova-compute stack traces https://zuul.opendev.org/t/openstack/build/a038834887194c4ab6f94dec06d5ab91/log/logs/host/nova-compute.service.journal-10-42-23.log.txt#272912:25
jrosserand another the same error https://zuul.opendev.org/t/openstack/build/ee7cb39658044384ad335e0edcbc75e7/log/logs/host/nova-compute.service.journal-11-57-52.log.txt#274712:32
*** dave-mccowan has joined #openstack-ansible12:33
openstackgerritOpenStack Proposal Bot proposed openstack/openstack-ansible-os_barbican master: Updated from OpenStack Ansible Tests  https://review.opendev.org/74925812:34
openstackgerritOpenStack Proposal Bot proposed openstack/openstack-ansible-os_blazar master: Updated from OpenStack Ansible Tests  https://review.opendev.org/74925912:35
openstackgerritOpenStack Proposal Bot proposed openstack/openstack-ansible-os_cinder master: Updated from OpenStack Ansible Tests  https://review.opendev.org/74926012:35
openstackgerritOpenStack Proposal Bot proposed openstack/openstack-ansible-os_cloudkitty master: Updated from OpenStack Ansible Tests  https://review.opendev.org/74926112:35
openstackgerritOpenStack Proposal Bot proposed openstack/openstack-ansible-os_glance master: Updated from OpenStack Ansible Tests  https://review.opendev.org/74926312:36
openstackgerritOpenStack Proposal Bot proposed openstack/openstack-ansible-os_heat master: Updated from OpenStack Ansible Tests  https://review.opendev.org/74926412:36
jrosser^ these are all wrong12:37
openstackgerritOpenStack Proposal Bot proposed openstack/openstack-ansible-os_ironic master: Updated from OpenStack Ansible Tests  https://review.opendev.org/74926512:37
openstackgerritOpenStack Proposal Bot proposed openstack/openstack-ansible-os_keystone master: Updated from OpenStack Ansible Tests  https://review.opendev.org/74926612:37
openstackgerritOpenStack Proposal Bot proposed openstack/openstack-ansible-os_magnum master: Updated from OpenStack Ansible Tests  https://review.opendev.org/74585112:38
openstackgerritOpenStack Proposal Bot proposed openstack/openstack-ansible-os_manila master: Updated from OpenStack Ansible Tests  https://review.opendev.org/74926712:38
*** dave-mccowan has quit IRC12:38
openstackgerritOpenStack Proposal Bot proposed openstack/openstack-ansible-os_masakari master: Updated from OpenStack Ansible Tests  https://review.opendev.org/74926812:38
openstackgerritOpenStack Proposal Bot proposed openstack/openstack-ansible-os_mistral master: Updated from OpenStack Ansible Tests  https://review.opendev.org/74926912:38
openstackgerritOpenStack Proposal Bot proposed openstack/openstack-ansible-os_neutron master: Updated from OpenStack Ansible Tests  https://review.opendev.org/74927012:40
openstackgerritOpenStack Proposal Bot proposed openstack/openstack-ansible-os_nova master: Updated from OpenStack Ansible Tests  https://review.opendev.org/74927112:40
openstackgerritOpenStack Proposal Bot proposed openstack/openstack-ansible-os_octavia master: Updated from OpenStack Ansible Tests  https://review.opendev.org/74927212:40
openstackgerritOpenStack Proposal Bot proposed openstack/openstack-ansible-os_placement master: Updated from OpenStack Ansible Tests  https://review.opendev.org/74927312:41
openstackgerritOpenStack Proposal Bot proposed openstack/openstack-ansible-os_sahara master: Updated from OpenStack Ansible Tests  https://review.opendev.org/74927412:41
openstackgerritOpenStack Proposal Bot proposed openstack/openstack-ansible-os_tacker master: Updated from OpenStack Ansible Tests  https://review.opendev.org/74927512:42
openstackgerritOpenStack Proposal Bot proposed openstack/openstack-ansible-os_trove master: Updated from OpenStack Ansible Tests  https://review.opendev.org/74927612:42
openstackgerritOpenStack Proposal Bot proposed openstack/openstack-ansible-os_zun master: Updated from OpenStack Ansible Tests  https://review.opendev.org/74927712:43
*** spatel has joined #openstack-ansible12:58
noonedeadpunkNOT MERGING THIS:)13:02
jrossernoooooooo13:02
noonedeadpunkjrosser: I think it's time to merge your patch13:02
jrosserthat would certainly fix this proposal bot stuff13:03
noonedeadpunkbut I think we need to abandon these though....13:03
jrosserits this? https://review.opendev.org/#/c/671454/13:04
jrosserafaik it keeps re-proposing on top of whatever outstanding changes there already are13:04
jrosseror something......13:04
noonedeadpunkbut then no changes will be made in patch... and I'm afraid that may break git review13:05
noonedeadpunkor bot script13:05
noonedeadpunkyeah, that's it13:05
*** chenhaw has joined #openstack-ansible13:05
spatelnoonedeadpunk: good morning13:09
spateli have noticed senlin repo got created13:09
spatellets me know where you want me to upload senlin playbook and i will do that.. i have few more question which i ask you in 1 hour because right now i am driving :)13:10
noonedeadpunkI'm pretty sure it did not yet13:11
noonedeadpunkWe need this https://review.opendev.org/#/c/748683/ to get repo created13:11
*** spatel has quit IRC13:11
noonedeadpunkAnd we need mnaser's vote on it at the moment:)13:12
mnaserill give you a +2 instead of a +1 :)13:12
noonedeadpunkThat totally works a well :D13:14
*** chenhaw has quit IRC13:18
*** cshen has quit IRC13:21
openstackgerritErik Berg proposed openstack/openstack-ansible-ceph_client stable/ussuri: Remove trailing '/' from ceph_apt_repo_url  https://review.opendev.org/74928213:22
SecOpsNinjaim trying to enable magnum but im always getting SSL handshake failure in haproxy when magnum tryies to comunicate with keystone. any ideas qhat i could be doing wrong? i was thinking of enabling openstack_service_adminuri_proto and openstack_service_internaluri_proto for https but i think i will get more errors (and i would probably need to enalbe ssl in all modules that is going  to give m13:29
SecOpsNinja0ore problems)13:29
jrosserSecOpsNinja: you should not need to put SSL on the internal endpoint for magnum13:29
jrossercan you describe what the issue is?13:30
SecOpsNinjath e problem is for what im seging in magnum.conf it alwasu used the public endpoitn fof keystone13:30
jrosserare you seeing an issue with the magnum container contacting keystone public endpoint?13:31
jrosseror are you seeing an issue with the heat agent / cloud-init stuff in a magnum VM contacting the keystone public endpoint?13:32
SecOpsNinjajrosser,  when i try to use openstack coe cluster list --insecure i always get SSL exception connecting to https:// (HTTP 500) and followiung the logs the haproxy is the one that delisvers that error with  "keystone_service-front-1/1: SSL handshake failure"13:32
SecOpsNinjathe problems is comunicating with keystone from what i seasing in haproxy logs13:32
SecOpsNinjaand in os_magnum ansible roles the magnum.conf.j2 alsways uses {{keystone_service_publicurl }} that ius https13:34
SecOpsNinjai normaly have problems in utility container when comunicating with openstack client and i need to add the --insecure flag13:35
SecOpsNinjabut i think its because we openstack-asnbile is useing and self signed certificate13:35
*** pcaruana has quit IRC13:36
SecOpsNinjaand if i use curl to got to keystone public endpoitn it works correctly so i supose its in the way that magnum connects with the keystone endpoint but cant understand what im missing13:36
*** pcaruana has joined #openstack-ansible13:36
*** chenhaw has joined #openstack-ansible13:37
jrosserSecOpsNinja: you can put --debug on the openstack cli to see exactly what is being called13:38
SecOpsNinjajrosser,  hum ok that is easier to troublesshoot. ok from what im seasing is magnum endpoint returnign 500 in GET /CLUSTER and the reason is SSL exception connecting to https://x.x.x.x:5000/v3/auth/tokens: HTTPSConnectionPool(host='x.x.x.x', port=5000): Max retries exceeded with url: /v3/auth/tokens (Caused by SSLError(SSLError(\"bad handshake: Error([('SSL routines', 'tls_process_server_cer13:43
SecOpsNinjatificate', 'certificate verify failed'13:43
jrosserSecOpsNinja: what version of openstack is this?13:44
SecOpsNinja21.0.013:44
SecOpsNinjajrosser,  this what im getting http://paste.openstack.org/show/797328/13:49
jrosserSecOpsNinja: do any of the other coe commands work, like show a template?13:51
SecOpsNinjajrosser, no because all comunication that magnum tries to make with keyston allways return keystone_service-front-1/1: SSL handshake failure13:54
*** cshen has joined #openstack-ansible13:54
SecOpsNinjaand locking to haproxy andn the SSL handshake failure where in keystone_service-front-1/1 eather from magnum or another comunicação from infra host13:57
*** cshen has quit IRC13:58
*** cshen has joined #openstack-ansible14:26
*** spatel has joined #openstack-ansible14:30
spatelnoonedeadpunk: i am here now14:30
spatelso what is this patch about https://review.opendev.org/#/c/748683/14:32
*** sshnaidm has quit IRC14:34
noonedeadpunkit's creating repo for os_senlin14:45
*** mgariepy has quit IRC14:48
spatelnoonedeadpunk: just trying to learn process, 1. create repo, 2. upload playbooks on those repo?14:51
noonedeadpunkyep)14:51
*** mgariepy has joined #openstack-ansible14:51
*** sshnaidm has joined #openstack-ansible14:53
jrosserSecOpsNinja: i don't have anything totally helpful for you right now as I don't have a magnum deployment to investigate with14:54
jrosserbut trying to reproduce this in an AIO would be useful14:55
SecOpsNinjai have capture the magnum tls comunication with haproxy and see if i can find what is causing this...14:55
jrosseri think this comes down to deciding if it is 1) error in magnum.conf 2) bug in magnum14:56
SecOpsNinjabecause from what i saw in haproxy documentation they all say that SSL handshake error is a problem in the ssl comunication and need to check client and server settings to see what is causing it...14:56
SecOpsNinjaregarding magnum.conf the only way is for me to disable https/ssl for keystone public domain and see if it resolves because i can't find anymore settings available in magnum role14:57
jrosseri am not sure the haproxy log is really too useful14:57
jrosserthe magnum api log will be the place that error is generated14:57
noonedeadpunkspatel: https://opendev.org/openstack/openstack-ansible-os_senlin created:)14:57
SecOpsNinjathe magnum log reports the error returned by haproxy ssl handshake error14:58
SecOpsNinjathat is what is reported by openstack client14:58
jrosserthe error comes from magnum14:58
SecOpsNinjait comes from magnum after not being able to comunicate with keystone by its public endpoint... let me put the log of magnum here14:59
SecOpsNinjajrosser,  this is what the magnum api returns http://paste.openstack.org/show/797340/15:01
spatelnoonedeadpunk: +115:02
spateljrosser: look like this is same magnum issue i was talking about last week, they don't have valid cert so they failed to connect.15:02
SecOpsNinjajrosser,  regarding that log my conclusion is that an SSL problem regarding mangnum client and haproxy15:03
jrosserspatel: your issue was from the heat agent in the VM wasnt it?15:03
jrosserspatel: this is from magnum api -> keystone15:03
spatelmagnum and heat work together so i have to disable cert validation on both15:03
spatelclient---ssl--->magnum ---ssl---> heat15:04
SecOpsNinjain wireshark one interesting part is that magnum send and tlsv1.2 package to keystone public endpoint saying level:fatal: Description: Unknown CA15:04
jrosserthis is all getting a bit circular :)15:04
openstackgerritMerged openstack/openstack-ansible-plugins master: Use templated jobs instead of setting independent set  https://review.opendev.org/74884715:04
jrosserideally the magnum api would talk to keystone on the internal endpoint15:05
jrosserthen this problem would not happen15:05
SecOpsNinjayep in os magnum is using the publuc endpoint for keystone and authtoken15:06
spateljrosser: Yes15:06
jrosserSecOpsNinja: the trouble is that there are mixed concerns here15:06
spateljrosser: majority time to solve this issue i use my company valid certificate so i never this this issue.15:06
jrossermagnum api is communicating with things that are external to the cloud, and internal15:07
spateljrosser: this is only AIO problem if you running openstack in production i am assuming people go with valid certs15:07
jrosserno it is more of a problem than that15:07
spatelhmm.15:07
SecOpsNinjaatm we are using openstack in a testeb behind a VPN so no need for valid certificate15:07
jrosserthere are several people deploy OSA with mgmt networks that cannot talk to the external endpoint15:07
jrosseri am one of those15:08
jrosserso if a service (like magnum) chooses the wrong endpoint, it's game over15:08
jrosserif you NAT the mgmt network to the external network and allow traffic with whatever firewall rules then all this internal->external traffic is magically working15:09
*** waxfire has quit IRC15:09
*** waxfire has joined #openstack-ansible15:10
spateljrosser: what do you mean deploy OSA with mgmt network?15:10
SecOpsNinjabut my external endpoint is a vip in themgmt network and we aren't using AIO15:10
jrosserSecOpsNinja: ok thats no problem, but that doesnt make it correct for the magnum service to be trying to connect to the external endpoint15:11
spatelI have lab machines with single NIC and i just run deploy script of AIO. (is there any other method to avoid this SSL disaster?)15:12
SecOpsNinjayep but that the problem its in there https://github.com/openstack/openstack-ansible-os_magnum/blob/master/templates/magnum.conf.j2 where you use the public url to keystone forn auth and auth token15:13
openstackgerritMerged openstack/openstack-ansible-tests master: Add _oslodb_login_host to db_setup tasks.  https://review.opendev.org/67145415:13
SecOpsNinjabecause the majority is configured with the variable keystone_service_publicurl15:13
SecOpsNinjaso it i change this to internaluri it probably works?15:14
spatelSecOpsNinja: yes it should work then many time i change openstack endpoint --set  to internal to avoid this issue.15:15
*** zerozephyrum has joined #openstack-ansible15:17
SecOpsNinjaspatel,  i didn't understand tthe openstack endpoint set part. how do you avoid the ansible configurations?15:19
*** spatel has quit IRC15:21
openstackgerritOpenStack Proposal Bot proposed openstack/openstack-ansible-os_aodh master: Updated from OpenStack Ansible Tests  https://review.opendev.org/74933515:23
openstackgerritOpenStack Proposal Bot proposed openstack/openstack-ansible-os_designate master: Updated from OpenStack Ansible Tests  https://review.opendev.org/74933615:23
openstackgerritOpenStack Proposal Bot proposed openstack/openstack-ansible-os_gnocchi master: Updated from OpenStack Ansible Tests  https://review.opendev.org/74933715:23
openstackgerritOpenStack Proposal Bot proposed openstack/openstack-ansible-os_magnum master: Updated from OpenStack Ansible Tests  https://review.opendev.org/74585115:24
*** cshen has quit IRC15:24
openstackgerritOpenStack Proposal Bot proposed openstack/openstack-ansible-os_murano master: Updated from OpenStack Ansible Tests  https://review.opendev.org/74585615:24
openstackgerritOpenStack Proposal Bot proposed openstack/openstack-ansible-os_panko master: Updated from OpenStack Ansible Tests  https://review.opendev.org/74933815:24
openstackgerritOpenStack Proposal Bot proposed openstack/openstack-ansible-os_rally master: Updated from OpenStack Ansible Tests  https://review.opendev.org/74933915:24
SecOpsNinjajrosser,  so to confirm atm the magnum.conf should be pointing to the internaluri instead of the public right?  regarding the ssl the problem seems to be in the selfsigned certificate wirgth? so i can try to make the change to internaluri and if it works i can try to make a patch but ofc this would resolve the SSL problem is all enable https and ssl in internal and admin uri15:25
jrosseri keep trying to explain that its 'complicated'15:26
jrosserthe magnum api needs to talk to keystone/heat/whatever in order to function, and that must happen over the internal endpoint15:27
jrossermagnum must also respond correctly when you use openstack cli from the utility container15:27
jrosserand also when one of your users interacts with the magnum API externally, who cannot contact the internal endpoints15:27
SecOpsNinjajrosser,  sorry for all the questions but if i understand correctly, all the internal comunication betwwen each openstack should be using the internal one, and only external clients to the openstack should be using the public uri right? so whats the problem? or its something that the openstack componenets need to access that  is only exposed in public uris and not on the internal uris?15:34
*** waxfire has quit IRC15:34
*** waxfire has joined #openstack-ansible15:35
SecOpsNinjabecause i dont have my openstack instalation public available i can try to disable the https/ssl in all endpoints but i would like to  use the https in publci uri to detect all this possible errors in configurations15:36
jrosseryou are seeing the magnum service try to use keystone15:38
jrosserand it connects to the external endpoint and fails due to the untrusted cert15:38
jrosserideally when magnum *itself* contacts keystone it would use the internal endpoint so there would not be a problem15:39
*** spatel_ has joined #openstack-ansible15:41
spatel_Damn IRC chat kicking me out from channel15:41
SecOpsNinjaso what do you recomend to try bypass this issue? patch os_mangum role to use the internaluri, for the configuration of the public endpoint to use a valid cert or disable the ssl in keystone + heat + magnun (i understand that this change would aply only to the https endpoints that this moment is only the public one)15:49
*** sshnaidm is now known as sshnaidm|bbl15:50
jrosserlike i said earlier....15:51
jrosserjrosser> i think this comes down to deciding if it is 1) error in magnum.conf 2) bug in magnum15:51
jrosserand i don't know which of those this is15:51
jrosserthere are some big deployments using OSA with magnum so i'm suspecting that with a valid certificate things are more straightforward15:52
jrosserhaving the internal and external VIP on the same network also introduces a degree of "everything can talk to everything" which may not be representative of a production deployment15:55
jrosserit would be good to make your testbed be structurally as similar to your intended production deployment as possible in this respect in order to catch issues early15:56
noonedeadpunk#startmeeting openstack_ansible_meeting16:00
openstackMeeting started Tue Sep  1 16:00:44 2020 UTC and is due to finish in 60 minutes.  The chair is noonedeadpunk. Information about MeetBot at http://wiki.debian.org/MeetBot.16:00
openstackUseful Commands: #action #agreed #help #info #idea #link #topic #startvote.16:00
*** openstack changes topic to " (Meeting topic: openstack_ansible_meeting)"16:00
openstackThe meeting name has been set to 'openstack_ansible_meeting'16:00
noonedeadpunk#topic office hours16:00
*** openstack changes topic to "office hours (Meeting topic: openstack_ansible_meeting)"16:00
noonedeadpunkSo, the main thing that needs to be announced, is that we have os_senlin repo https://opendev.org/openstack/openstack-ansible-os_senlin16:01
jrosserspatel_: push a patch for senlin!16:02
noonedeadpunkAnd spatel_ has previously volunteered to get the role for senlin project16:02
noonedeadpunkyeah!16:02
jrosseri was wondering about all these failing jobs for centos-7 on master16:02
noonedeadpunkI guess it needs also https://review.opendev.org/#/c/748693/16:02
noonedeadpunkoh, yeah, that's also great question....16:03
jrossertwo possible things we can do there - investigate with nova/oslo becasue it's the same error in lots of jobs16:03
jrosseror we drop the centos-7 jobs16:03
noonedeadpunkI also noticed that centos8 functional for galera fails, which means that probably cluster would be broken16:04
noonedeadpunkand we're good only for aio deployments...16:04
noonedeadpunkworth placing just noop task though...16:05
noonedeadpunkas maybe it's related to 10.5 only...16:06
spatel_noonedeadpunk: fyi, i am running centos8 with 3 controller nodes and my 3 node galera cluster (10.4) working fine.16:06
noonedeadpunkbtw returning to galera, I think we need to decide the desteny of galera_client role16:06
noonedeadpunkspatel_: ok, good to know)16:07
noonedeadpunkthan may be really related to 10.5 which is quite another question16:07
openstackgerritDmitriy Rabotyagov (noonedeadpunk) proposed openstack/openstack-ansible-galera_server master: [DNM] noop  https://review.opendev.org/74935416:07
openstackgerritDmitriy Rabotyagov (noonedeadpunk) proposed openstack/openstack-ansible-os_cloudkitty stable/ussuri: Add CentOS 8 and Ubuntu Focal support  https://review.opendev.org/74935516:08
noonedeadpunkjrosser: we can also add this single test to the blacklist as well16:12
noonedeadpunkbut tbh worth dropping centos716:12
noonedeadpunkI think we've almost finished cherry-picking everything to U16:13
admin0my aio stops at this error: os_ceilometer : Initialize Gnocchi database by creating ceilometer resources  .. is that normal ?16:13
admin0its an AIO16:13
noonedeadpunkadmin0: not really....16:13
jrossernoonedeadpunk: what do you think of this https://zuul.opendev.org/t/openstack/build/86acf3990e094afaa2648c894a67e71d/log/logs/host/nova-compute.service.journal-14-17-08.log.txt#2781-278316:19
jrosserit is outside the venv, is that to be expected?16:19
noonedeadpunklibvirt may generate this....16:22
noonedeadpunkwe symlink bunch of things into venv....16:22
noonedeadpunkso such request may be kind of valid...16:23
noonedeadpunkhm, but we don't symlink for centos7...16:23
noonedeadpunkwe do for 8 and ubuntu16:24
openstackgerritDmitriy Rabotyagov (noonedeadpunk) proposed openstack/openstack-ansible master: Decrease amount of jobs and update distros  https://review.opendev.org/74688116:25
noonedeadpunkI've updated ^ to remove centos-7 jobs16:25
noonedeadpunkbut not sure I can just drop jobs....16:27
openstackgerritDmitriy Rabotyagov (noonedeadpunk) proposed openstack/openstack-ansible master: Decrease amount of jobs and update distros  https://review.opendev.org/74688116:27
noonedeadpunkthey can be used somewhere directly....16:27
jrosseryes, need to do this http://codesearch.openstack.org/?q=openstack-ansible-deploy-aio_metal-centos-716:29
jrosserthen a bunch of patches to undo all those16:29
jrosserfor each job we want to delete worth a check with codesearch16:29
noonedeadpunksince we need quick fix reight now, let's drop them from template to unblock some things, and then will leave jobs removal for cleanup16:30
jrosseri have some patches for the remainder of removing centos-7 and suse as well16:31
jrossershould rebase those and get them in order to merge16:31
noonedeadpunkYeah, I guess it's about time to start this16:32
jrosseri am kinda concerned about all this magnum endpoint stuff keeps coming up16:32
noonedeadpunk(despite being pretty loaded now because of some incidents16:32
jrosserseems to be recurring, cost us may weeks of debugging here16:32
jrosser*many weeks16:33
noonedeadpunkI was facing some issue but it was due to missing passing X-Forwarded-For in haproxy endpoint16:35
noonedeadpunkAs heat unlike all other services really rely on the output of endpoint root (ie 172.29.238.100:5000) so if he see there http, it will ignore keystone's endpoint and will ask for http16:37
noonedeadpunkand answer there depends on proto passed16:37
noonedeadpunks/X-Forwarded-For/X-Forwarded-Proto/16:37
*** gyee has joined #openstack-ansible16:38
*** sshnaidm|bbl is now known as sshnaidm16:46
noonedeadpunksorry need to leave, so will end meeting 10 mins earlier16:50
noonedeadpunk#endmeeting16:50
*** openstack changes topic to "Launchpad: https://launchpad.net/openstack-ansible || Weekly Meetings: https://wiki.openstack.org/wiki/Meetings/openstack-ansible || Review Dashboard: https://bit.ly/2SAcGAn"16:50
openstackMeeting ended Tue Sep  1 16:50:07 2020 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)16:50
openstackMinutes:        http://eavesdrop.openstack.org/meetings/openstack_ansible_meeting/2020/openstack_ansible_meeting.2020-09-01-16.00.html16:50
openstackMinutes (text): http://eavesdrop.openstack.org/meetings/openstack_ansible_meeting/2020/openstack_ansible_meeting.2020-09-01-16.00.txt16:50
openstackLog:            http://eavesdrop.openstack.org/meetings/openstack_ansible_meeting/2020/openstack_ansible_meeting.2020-09-01-16.00.log.html16:50
*** sshnaidm is now known as sshnaidm|afk17:11
openstackgerritSatish Patel proposed openstack/openstack-ansible-os_senlin master: First commit of os_senlin role  https://review.opendev.org/74936517:12
spatel_noonedeadpunk: jrosser here i pushed os_senlin - https://review.opendev.org/#/c/749365/17:13
noonedeadpunkspatel_: you should drop releasenotes at least xd17:16
spatel_whole director?17:16
spatel_let me empty them17:17
noonedeadpunkum not really directory, just older branches17:17
noonedeadpunkjust leave unreleased17:17
noonedeadpunk(and drop references in https://review.opendev.org/#/c/749365/1/releasenotes/source/index.rst)17:18
spatel_ok17:18
*** cshen has joined #openstack-ansible17:19
openstackgerritSatish Patel proposed openstack/openstack-ansible-os_senlin master: Removing older releasenote.  https://review.opendev.org/74936717:22
spatel_hold on, let me delete that patch and re-submit, i messed up something and git is angry on me.17:23
*** cshen has quit IRC17:23
mgariepyspatel_, git commit --amend17:24
spatel_i missed that first time but then i did again so i created new patch  https://review.opendev.org/74936717:24
spatel_is that normal ?17:24
jrosserif you miss the --amend first time it will create a new patch17:25
jrosserthere are as many patches as you have things in git log17:25
spatel_jrosser: look like it did create new patch (do you think i should delete and re-commit fresh one?)17:26
spatel_let me do fresh one anyway this is new project so what can go wrong.17:27
jrosseryou should fix up the first one17:27
mgariepyyou added a patch on top of your first one and when you hit review, it will create a new one with the parent set as the other patch.17:27
jrosserthis would be a superb time to learn about git rebase -i17:27
jrosserif you do 'git rebase -i'17:28
spatel_let me do that17:28
jrosserthen in the editor put 'squash' against the new patch17:28
jrosserquit the editor17:28
jrossershould be done17:28
-spatel_- [spatel@os-osa openstack-ansible-os_senlin]$ git rebase -i17:28
-spatel_- Successfully rebased and updated refs/heads/master.17:28
jrosseroh sorry17:28
jrossergit rebase -i <sha of first commit>17:29
spatel_jrosser: how do i find sha of first commit?17:30
jrossergit log17:30
-spatel_- [spatel@os-osa openstack-ansible-os_senlin]$ git rebase -i fd0afb07a08c3d9c777ab6ace7271958e2e63f7a17:31
-spatel_- Successfully rebased and updated refs/heads/master.17:31
spatel_do i need to do git review?17:32
mgariepyjrosser, have you played with the new focal auto-install preseed stuff  ?17:32
jrosserspatel_: you need to check that you've squashed the two commits together17:32
mgariepyit's kinda neat, it's a bit easier to debug than the d-i preseed17:32
jrosserand that you've got the change-id of the original still17:32
jrossermgariepy: hmm no, not looked at that yet17:33
mgariepythe downside it the way you need to boot the servers, it needs to download the whole 900M iso17:33
spatel_jrosser: "git rebase -i fd0afb07a08c3d9c777ab6ace7271958e2e63f7a" put me in editor. do i need to do anything there? (sorry i am not very good at git except commit/review :(017:34
jrosseron the line of the commit you want to squash into the previous one, change the first word to "squash"17:35
jrosserthen quit the editor17:35
spatel_it has these two line17:39
-spatel_- pick a9383aa First commit of os_senlin role17:39
-spatel_- pick b0fbe3d Removing older releasenote.17:39
jrosserchange 'pick' to 'squash' on the second line17:39
jrosserthen quit the editor17:39
spatel_ok17:39
jrosserit should then show you both commit messages in an editor17:40
jrosserdelete the bits that are not needed17:40
spatel_oh crap i just save file and exit out17:40
jrosserso you now have just one commit but a bad commit message?17:41
spatel_This is what it did - http://paste.openstack.org/show/797346/17:42
SecOpsNinjaone question its considered safe to explose the public endpoints to the internet? i think to resolve the magnum solution the easier solution would to enable the public endpoint vip to the internet and see it if resovles. regarding the internal vip it shouldn't be in the same br-mgmt?17:42
jrosserspatel_: that looks fine as now the releasenotes are deleted17:43
mgariepyspatel_, on a quick glance it's seems fine.17:44
jrosseri think you have merged the two commits together17:44
jrosserjust check that the commit message is good17:44
jrosserand if not, fix it with git commit --amend17:44
*** djhankb has quit IRC17:44
*** dirk has quit IRC17:44
*** alvinstarr has quit IRC17:44
jrosserSecOpsNinja: a public cloud would expose the public https endpoint to the internet, otherwise users could not use the cloud17:45
jrosserbut it is up to you to decide how you architect that, there is no fixed way that should be17:45
jrosserthe openstack management network (br-mgmt) should not be accessible from the internet17:46
*** djhankb has joined #openstack-ansible17:46
*** dirk has joined #openstack-ansible17:46
*** alvinstarr has joined #openstack-ansible17:46
*** djhankb has quit IRC17:46
spatel_jrosser: you are saying commit message showing two commit so fix that?17:46
SecOpsNinjaok i was cjhecking another solution and that was in this https://docs.openstack.org/magnum/latest/configuration/sample-config.html  to put verify_ca = true to disable the validadtion of the self sifgned of jkeystone17:46
*** djhankb has joined #openstack-ansible17:47
jrosserspatel_: i don't know what you did :) git rebase -i will (i think) show you both commit messages and you choose the bits that you want in the editor17:47
openstackgerritSatish Patel proposed openstack/openstack-ansible-os_senlin master: First commit of os_senlin role  https://review.opendev.org/74936517:47
SecOpsNinjayep the br-mgmt its inst acesseble from the internet. i only put the external vip and internal vip there for easy acess but that is acessed but firewall rules and vpn acesss17:47
*** noonedeadpunk has quit IRC17:48
jrosserSecOpsNinja: verify_ca = 'Indicates whether the cluster nodes validate the Certificate Authority'17:48
spatel_jrosser: even i am also confused, now this is what i am seeing - https://review.opendev.org/#/c/749365/17:48
jrosserthis is the VM created by heat on behalf of magnum17:48
jrossernot the magnum service itself17:49
jrosserspatel_: it looks ok17:49
spatel_can i delete? - https://review.opendev.org/#/c/749367/117:49
SecOpsNinja:(17:49
jrosserspatel_: yes17:49
SecOpsNinjajrosser,  but atm i think the problem in the comunication of manugum with opestack api and i think that verify_ca = false would resolve.  ofc the public endpoint of its service would be served by haproxy cert (that in the future can be a valid one)17:51
SecOpsNinjaeven if i create a private CA e would need to fill the openstack_ca_file opetion, right?17:52
spatel_jrosser: done, and thank you for helping me out :)17:52
jrosserspatel_: git rebase -i is really cool17:52
jrosserit means you can make many small commits as you go17:52
jrosserkind of without too much worry, fix typo here, bug there, reformat blah blah17:52
jrosserthen when you are done you can reorder and merge all the little commits together into something clean and ready to push17:53
jrosserjust changing the order of the lines in the editor that git rebase -i puts up re-orders your local git history17:53
jrosserSecOpsNinja: again i think that openstack_ca_file is for the benefit of the VM created by heat for magnum17:54
jrosserthat is when you need to insert the specific CA into the magnum worker VM so that they can trust the external endpoint17:55
SecOpsNinjaim testing that atm17:55
jrosserto make magnum itself trust a private CA you need to insert it into the system CA store and adjust the magnum systemd unit to set REQUESTS_CA_BUNDLE environment variable to point to that rather than the one from certifi17:56
SecOpsNinjaok so the solution for what im seeing it expose the public endpoint or disable https if i dont want to expose.17:58
SecOpsNinjaagain thanks for the help jrosser   and spatel_ i will try tomorow to exposoe it17:59
jrosseror you can keep it private with a genuine certificate17:59
SecOpsNinjawhen you say genuine certificate is using a private CA and generating certificates for each of the public domains, right?18:01
jrosseryou can buy a certificate18:04
jrosserso long as your DNS points to the right thing you are good18:04
jrosserbut this is a long way from OSA stuff18:04
jrosserbut also private CA is possible, i have a cloud like that but it is quite involved to get everything right18:05
SecOpsNinjayep im seing that....18:05
SecOpsNinjathe cheaper solution would be to use lets encrypt18:06
jrosseryes, that is supported18:06
SecOpsNinjaok tomorow i wil test it. gain thanks for your help in troubleshotting this18:07
*** SecOpsNinja has left #openstack-ansible18:19
*** d34dh0r53 has joined #openstack-ansible18:23
spatel_jrosser: we also need to patch /opt/openstack-ansible to add senlin inventory and secrets. do you want me to cut that patch?18:24
jrosserspatel_: yes that would be good18:31
spatel_jrosser: doing it18:38
spatel_jrosser: what branch/topic i should be using for this?18:39
jrosseryou can make one up, osa-senlin for example18:39
spatel_ok18:39
*** d34dh0r53 has quit IRC18:49
openstackgerritMerged openstack/openstack-ansible-haproxy_server master: Add haproxy_backend_only flag to service template  https://review.opendev.org/74739118:51
*** cshen has joined #openstack-ansible19:00
jrosseradmin0: i managed to boot an nvidia vgpu on an ubuntu compute node :)19:04
*** cshen has quit IRC19:04
mgariepyjrosser, gamers on or a real hpc one ?19:06
jrossera T419:06
mgariepygamers one are kinda tricky.19:06
mgariepyin passthrough?19:07
jrosserit's just "RHEL ONLY" all over the documentation19:07
jrosserso i was using a T4 as a way of validating that i could do vgpu/ubuntu before getting something really £££££19:07
mgariepyor in ""vgpu"" split stuff ?19:07
mgariepyok.19:07
jrossernot passthrough, actual vgpu19:07
mgariepythe issue with that is the driver on host and in vm need to be kinda sync.19:08
jrosseri've just booted a vm with 1/16th of the T4 in it19:09
*** jbadiapa has quit IRC19:10
mgariepythat's nice tho. i have some colleage that are doing it on osa+centos719:12
mgariepyhow tricky was it ? do you have some notes on your process ?19:12
jrosserhttp://paste.openstack.org/show/797348/19:13
jrosseri registered for an eval licence for vgpu on the nvidia site, and dowloaded the RHEL+KVM installer19:14
jrosserwhich is a big zip and you need NVIDIA-Linux-x86_64-440.107-vgpu-kvm.run and NVIDIA-Linux-x86_64-440.107-grid.run from it19:15
jrosserfor ubuntu the only fix i needed to do was create /usr/lib64 directory and make a /etc/ld.so.conf.d/nvidia.conf pointing to that dir19:16
jrosserrun the installers and the i had to manually copy the .so from the unpacked installer into /usr/lib64, run ldconfig and it's working19:17
jrosserso just a minor libraries directory difference beteen rhel<>debuntu19:17
jrossertheres a change it'll work directly with the installer if the /usr/lib64 dir is created beforehand, but i've only got this on one test node19:18
jrosserthen this for nova https://docs.openstack.org/nova/latest/admin/virtual-gpu.html19:19
*** openstackgerrit has quit IRC19:21
mgariepynice.19:23
*** openstackgerrit has joined #openstack-ansible19:23
openstackgerritSatish Patel proposed openstack/openstack-ansible master: Adding os_senlin role support to osa.  https://review.opendev.org/74937919:23
mgariepythe migration / upgrade of driver nightmare would not be super fun tho.19:24
jrosserno, but although an A100 GPU is gigantically expensive, you can carve it 7 ways19:25
jrosserand each of those 1/7ths is 2x the performance of a T419:25
jrosserso for the equivalent GPU performance you are winning if you have a lot of T4 shaped workloads19:26
mgariepyyep indeed.19:28
*** viks____ has quit IRC19:28
*** renich has joined #openstack-ansible19:52
renichgood day, everyone! o/19:52
admin0jrosser, how many hours you spent on making this work :) ?19:52
renichI was wondering, is there a way to install only a subset of the components with the openstack ansible playbook? I'd like to install horizon, keystone and swift only.19:53
jrosseradmin0: half of today :)19:53
jrosserI downloaded the driver this morning19:53
admin0renich, cat setup-openstack.yml and you can see that all it does is call individual playbooks19:53
admin0that you can run in its own19:53
admin0for example: root@/opt/openstack-ansible/playbooks# openstack-ansible os-keystone-install.yml19:54
renichthanks admin019:55
jrosserrenich: in openstack_user_config only assign hosts for the set of services you need19:56
renichjrosser: OK, thanks! ;D19:56
jrosserthat will empty out the ansible groups for the rest19:56
jrosserand make sure you only get the endpoints you want in haproxy19:56
jrosseradmin0: did your gpu node work out with a different OS ?19:59
admin0as per cloudnull notes, he used diff controllers for the os he supports, so i am in the process to nuke 1 controller and set it up as centos19:59
admin0but with your news, i might drop that idea and retry with ubuntu20:00
jrosserthere’s a flag on the vgpu-kvm installer to unpack but not install20:01
jrosserthat’s how I got the .so out to put in the right place20:01
*** spatel_ has quit IRC20:16
*** andrewbonney has quit IRC20:18
*** BlackFX has joined #openstack-ansible20:25
*** spatel has joined #openstack-ansible20:28
*** spatel has quit IRC20:38
BlackFXHi, I am running through the process of setting up with openstack-ansible, however it is failing while setting up the LXCs. It looks like the veths are not being created in the containers, no IPs are added to /etc/hosts and then it fails trying to gather facts from the containers with an unreachable error. Anyone have any pointers?20:44
admin0yeah .. i spent a few days like that BlackFX .. turned out i had an issue with my lxc-dnsmasq process20:45
admin0check that one out20:45
BlackFXthanks20:45
admin0is it not being created, or being created, but no ip address ?20:45
BlackFXnot being created20:46
BlackFXlxc-dnsmasq seems to be running happily20:46
admin0then issue would like somewhere else .. what is the error message ?20:46
BlackFXfatal: [infra1_cinder_api_container-3a7b835f]: UNREACHABLE! => {"changed": false, "msg": "Data could not be sent to remote host \"infra1_cinder_api_container-3a7b835f\". Make sure this host can be reached over ssh: ssh: connect to host infra1_cinder_api_container-3a7b835f port 22: Connection timed out\r\n", "unreachable": true}20:47
BlackFXI get that for every container in the gathering facts step20:48
admin0lxc-ls -f .. are any containers created ?20:48
admin0ip link inside the containers to check if they show any net interface20:48
admin0and are you by any chance on master branch ?20:48
BlackFXthey are created and if I lxc-attach the only interface in them is loopback20:49
BlackFXyes using master20:50
openstackgerritMerged openstack/openstack-ansible-ceph_client stable/ussuri: Fix ceph-client deployment for Ubuntu Focal  https://review.opendev.org/74924220:51
prometheanfirethis the right way to define a non-default nova AZ?  https://gist.github.com/prometheanfire/d4983eebcc6644589d4472debf96335020:51
prometheanfireper host vars in ansible, yay20:51
BlackFXis there a better branch to use than master?20:53
prometheanfirewhatever is stable20:53
prometheanfirein this case stable/ussuri iirc20:53
BlackFXthanks, trying that now20:55
jrosserBlackFX: have you built an all-in-one yet?20:58
BlackFXan all-in-one?20:58
redkriegHi all, I'm currently preparing to upgrade my infra from rocky to stein and I'm getting an error that /etc/ceilometer is not empty, so ansible refuses to convert it to a symlink for os_ceilometer : Create ceilometer dir.  I didn't see anything in a search for the error: http://paste.openstack.org/show/797354/20:59
prometheanfireI think I need to add it within host vars21:00
prometheanfireyep, found it in https://docs.openstack.org/openstack-ansible/queens/reference/configuration/using-overrides.html#overriding-conf-files21:00
*** cshen has joined #openstack-ansible21:00
BlackFXStill the same using stable/ussuri21:03
*** cshen has quit IRC21:04
jrosserBlackFX: this is an all-in-one https://docs.openstack.org/openstack-ansible/latest/user/aio/quickstart.html21:11
jrosserit puts an entire openstack into a single VM as a test/development environment21:11
BlackFXOh right, no haven't done that - we are trying to deploy to 7 machines21:12
jrosserthere are several reasons i would recommend that as a first step - the config is autogenerated and widely understood by folk here, it is the exact same thing that we use to validate openstack-ansible in CI dozens of times a day, and maybe for you best thing is you would end up with a miniature environment you could compare against your 7 nodes to look for differences / broken things21:13
BlackFXOkay - I will build one21:20
jrossertheres quite a lot of detail on that page21:22
jrosserbut simplistically its 8core / 8G ram / 60G disk, clone the repo, checkout the branch/tag you want, scripts/bootstrap-ansible.sh, scriipts/bootstrap-aio.sh, then playbooks21:24
jrosserthe rest described there is very much optional things21:24
*** zerozephyrum has quit IRC21:28
*** cshen has joined #openstack-ansible21:48
*** cshen has quit IRC21:53
*** tosky has quit IRC22:43
*** renich has quit IRC23:07
*** renich has joined #openstack-ansible23:24
*** djhankb has quit IRC23:27
*** djhankb has joined #openstack-ansible23:27
*** cshen has joined #openstack-ansible23:49
*** cshen has quit IRC23:54

Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!