Thursday, 2020-02-27

*** flwang has joined #openstack-containers00:21
flwangbrtknr: around?00:21
*** ramishra has joined #openstack-containers01:18
*** ramishra has quit IRC01:26
openstackgerritXinliang Liu proposed openstack/magnum master: Prevent scripts from exiting when there is no error  https://review.opendev.org/70048501:32
*** ramishra has joined #openstack-containers01:48
openstackgerritMerged openstack/magnum stable/train: [bug] Fix regression when use_podman=false  https://review.opendev.org/70978202:26
openstackgerritMerged openstack/magnum stable/train: [k8s] Make metrics-server work without DNS  https://review.opendev.org/70978102:29
openstackgerritMerged openstack/magnum stable/train: Fix api-cert-manager=true blocking cluster creation  https://review.opendev.org/70977802:29
openstackgerritMerged openstack/magnum stable/train: k8s_coreos Set REQUESTS_CA for heat-agent  https://review.opendev.org/70977702:29
openstackgerritMerged openstack/magnum stable/train: Fix Field `health_status_reason[api]' cannot be None`  https://review.opendev.org/70977602:29
openstackgerritMerged openstack/magnum stable/train: Fix the load balancer description regex pattern for deleting cluster  https://review.opendev.org/70977502:29
*** iokiwi has quit IRC03:59
*** iokiwi has joined #openstack-containers03:59
*** udesale has joined #openstack-containers04:35
*** ykarel|away is now known as ykarel05:08
*** rcernin has quit IRC05:33
*** rcernin has joined #openstack-containers05:33
brtknrflwang: hi05:53
*** rcernin has quit IRC06:24
*** AJaeger has left #openstack-containers07:24
*** ykarel is now known as ykarel|lunch07:35
*** pcaruana has joined #openstack-containers07:42
*** udesale has quit IRC08:31
*** udesale has joined #openstack-containers08:32
*** ykarel|lunch is now known as ykarel09:30
cosmicsoundgood day10:35
*** pcaruana has quit IRC10:41
*** pcaruana has joined #openstack-containers10:55
brtknrcosmicsound: hi11:10
brtknrstrigazi: i dont understand the api versioning11:10
cosmicsoundhey brtknr11:11
brtknr cosmicsound did you get anywhere?11:12
cosmicsoundbrtknr , i got here:  https://mdb.uhlhost.net/uploads/32e6d3c9ef5de88f/image.png11:19
cosmicsoundthis is last step that eventually will fail11:19
cosmicsoundgrepped all kolla logs and searched for magnum11:19
cosmicsoundgot this11:19
cosmicsoundhttp://paste.openstack.org/show/790049/11:19
cosmicsoundthis was on yesterdays deployment, today i start from scratch again11:20
cosmicsoundkube)deploy_cluster is last step that fails in end with time out, again not much info to debug on it11:20
cosmicsoundkube_11:21
cosmicsoundwill get heat-agent logs from this new deployment11:23
cosmicsoundpity this is always empty: openstack software deployment output show --all --long11:24
cosmicsoundlatest http://paste.openstack.org/show/790064/ cloud-init.log11:36
cosmicsoundlatest http://paste.openstack.org/show/790065/ cloud-init-output.log11:37
cosmicsoundthe network errors are weird, because the machines got local and public ip11:41
*** ivve has joined #openstack-containers11:53
*** udesale_ has joined #openstack-containers12:19
*** udesale_ has quit IRC12:21
*** udesale_ has joined #openstack-containers12:21
*** udesale has quit IRC12:22
*** mgariepy has joined #openstack-containers12:32
*** mgariepy has quit IRC12:38
*** alti_17 has joined #openstack-containers12:44
*** iokiwi has quit IRC12:47
*** iokiwi has joined #openstack-containers12:48
alti_17Hello, FYI, for those who are using fedora coreos 31 stable (in our case 31.20200127): yesterday the Stable stream was updated to 31.20200223 https://getfedora.org/en/coreos/download/ . If you start cluster provisioning using the older image, it often fails with errors related to the SoftwareDeployment stage; it looks like this is caused by the coreos zincati12:52
alti_17https://docs.fedoraproject.org/en-US/fedora-coreos/auto-updates/ auto-updates feature. It starts an update and restart of the OS during heat script execution and interrupts it12:52
*** mgariepy has joined #openstack-containers12:52
*** alti_17 has quit IRC12:53
*** alti_17 has joined #openstack-containers12:54
*** udesale_ has quit IRC13:05
*** udesale_ has joined #openstack-containers13:05
*** markguz_ has joined #openstack-containers13:06
markguz_Hi I'm trying to deploy a 1master 3node cluster on fedora coreos using the latest version of magnum. (git master)13:10
markguz_however consistently 1 out of 3 of the minions fails to be configured by heat.  I see this error in the logs "Command failed, will not cache new data. Command 'os-refresh-config' died with <Signals.SIGTERM: 15>"13:11
markguz_and then the heat-container-agent dies13:11
markguz_i can't find any more detailed logging inside the vm.. anyone got any hints as to where to look?13:12
brtknrmarkguz_: try inside /var/log/heat-config/13:12
markguz_brtknr: no such folder13:13
brtknrmarkguz_: so heat container is failing for a different reason13:13
brtknrmarkguz_: so when you have 1 master 1 worker its successful?13:13
brtknrmarkguz_: which image version are you using?13:14
markguz_brtknr: i checked on the nodes that did configure and they don't have that folder either13:14
markguz_fedora-coreos-31.20200127.3.013:14
brtknrmarkguz_: are you using heat container agent ussuri-dev13:14
markguz_brtknr: yes13:14
brtknrmarkguz_: so when you have 1 master 1 worker its successful?13:15
markguz_brtknr: i will try that13:15
alti_17This is no longer latest version13:16
alti_17fedora-coreos-31.20200127.3.013:16
markguz_alti_17: it was yesterday?13:16
markguz_that's when i downloaded it13:16
markguz_there was no newer version available13:16
markguz_did they release another?13:17
alti_17Yes, 1 Day ago new one was released13:17
alti_1731.20200210.3.0 stable13:17
markguz_do y'all have a verified version that you know works?13:17
alti_17And it caused issues for me. Because of  https://docs.fedoraproject.org/en-US/fedora-coreos/auto-updates/ zincati auto updates.. it updates and restarts the vm during heat script execution, which causes errors for me.13:18
alti_17I'm testing right now, looks like new version works. but haven't verified it yet13:18
markguz_alti_17: YES! i saw those zincati things13:19
alti_17I think magnum driver doesn't expect that vm will be rebooted by zincati. It also causes issues for already provisioned clusters, we can't scale them now13:20
markguz_alti_17: does that mean coreos is broken for the moment?13:23
markguz_maybe zincati can be disabled in the user data before config starts..13:25
alti_17I have just created 2 clusters in a row using very latest version of coreos. I will start looking on how to disable/control coreos/zincati update behavior, but meanwhile maybe someone from Magnum maintainers will share some thoughts about it, maybe there are some solutions which we missed13:29
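One way the idea raised above (disabling Zincati before configuration starts) could be sketched. This assumes the drop-in file and the `[updates] enabled = false` setting that, as far as I understand the Fedora CoreOS auto-updates documentation linked in this log, turn automatic updates off. The Python below only builds an Ignition-style fragment that would write that drop-in. The function name and the choice of Ignition spec version 3.0.0 are illustrative assumptions, and how such a fragment would be merged into the user data that Magnum generates is not covered here.

```python
import json
from urllib.parse import quote

# Drop-in path and body for turning off Zincati auto-updates
# (assumed from the Fedora CoreOS auto-updates documentation).
ZINCATI_DROPIN_PATH = "/etc/zincati/config.d/90-disable-auto-updates.toml"
ZINCATI_DROPIN_BODY = "[updates]\nenabled = false\n"


def build_ignition_fragment():
    """Return an Ignition-style dict that writes the Zincati drop-in file.

    Hypothetical helper for illustration; not part of Magnum or Zincati.
    """
    return {
        "ignition": {"version": "3.0.0"},  # assumed spec version
        "storage": {
            "files": [
                {
                    "path": ZINCATI_DROPIN_PATH,
                    "mode": 0o644,  # serialized as decimal 420 in JSON
                    # File body carried inline as a percent-encoded data URL.
                    "contents": {"source": "data:," + quote(ZINCATI_DROPIN_BODY)},
                }
            ]
        },
    }


if __name__ == "__main__":
    print(json.dumps(build_ignition_fragment(), indent=2))
```

Whether this avoids the mid-deployment reboot described in this log would still need to be checked on a real cluster; the sketch only shows the shape of the config.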
markguz_i'm just about to try it.  was the zincati feature recently added to coreos?13:30
markguz_or perhaps using atomic 2913:33
alti_17Just for history this is journalctl log from node vm when you are using not latest coreos. Reboot triggered. Master nodes not being rebooted. No nodes being rebooted if you use latest coreos13:36
alti_17Feb 27 11:03:56 alti17-2xw75kfskmyd-node-0 audit[1]: SERVICE_STOP pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=systemd-tmpfiles-se>13:36
alti_17Persistent Storage...13:36
alti_17BOOT_IMAGE=(hd0,gpt1)/ostree/fedora-coreos-19190477fad0e60d605a623b86e06bb92aa318b6b79f78696b06f68f262ad5d6/vmlinuz-5.4.17->13:36
*** lpetrut has joined #openstack-containers13:36
markguz_hmm. ok i got the image uploaded just about to try spinning a cluster13:36
markguz_i hope they disabled auto rebooting.. that seems like a dumb thing to do with something like coreos.13:38
markguz_it's dumb thing to do on anything that is designed to run services...13:39
alti_17This is, by the way, coreos "killer feature"  https://getfedora.org/en/coreos/  "Fedora CoreOS is an automatically-updating" maybe we just don't know how to operate it properly13:41
*** waverider has joined #openstack-containers13:44
markguz_auto updating... maybe.. auto rebooting.. definitely not13:49
*** alti_17 has quit IRC13:50
*** alti_17 has joined #openstack-containers13:57
*** pcaruana has quit IRC13:59
*** alti_17 has quit IRC14:07
*** alti_17 has joined #openstack-containers14:07
*** ykarel is now known as ykarel|away14:08
*** mgariepy has quit IRC14:14
*** mgariepy has joined #openstack-containers14:55
*** udesale_ has quit IRC14:57
*** pcaruana has joined #openstack-containers15:01
*** alti_17 has quit IRC16:08
*** markguz_ has quit IRC16:31
*** waverider has quit IRC16:40
*** lpetrut has quit IRC16:40
*** jmlowe has quit IRC16:54
cosmicsoundwhats the cli you guys use to build the latest fedora-coreos image?16:58
cosmicsoundwhenever i build it is not visible in magnum at template creation16:59
*** alti_17 has joined #openstack-containers17:04
alti_17cosmicsound something like this openstack image create \17:05
alti_17fedora-coreos-31.20200127.3.017:05
alti_17You might have missed os_distro or public key17:06
cosmicsoundwhat is the good os_distro?17:06
cosmicsoundfedora-coreos?17:07
cosmicsounddo i need also to specify a default username? like on fedora-atomic17:07
cosmicsoundi used this: openstack image create "fedora-coreos-31.20200127.3.0"   --file fedora-coreos-latest.qcow2   --disk-format qcow2   --container-format=bare   --min-disk 10   --min-ram 4096   --public   --protected   --property hw_scsi_model=virtio-scsi   --property hw_disk_bus=scsi   --property hw_qemu_guest_agent=yes   --property17:08
cosmicsoundos_distro=fedora-coreos   --property os_admin_user=fedora   --property os_version="31.20200127.3.0"17:08
cosmicsoundhttps://mdb.uhlhost.net/uploads/330b518a122d9356/image.png in the end is only the fedora atomic visible17:09
alti_17Which version of openstack/magnum do you have?17:11
cosmicsoundtrain/2.5.017:11
cosmicsoundsorry17:13
cosmicsound2.17.017:13
cosmicsoundI upgraded from 2.1517:13
cosmicsoundopenstack 5.0.017:14
cosmicsoundTrain release17:14
alti_17Sorry, a bit confused, magnum train has 9.* version https://docs.openstack.org/releasenotes/magnum/train.html17:17
cosmicsoundthats the cli version my bad17:17
alti_17fedora-coreos is supported starting from the 9.1.0 tag17:18
cosmicsoundhmm i am also confused to find real magnum version17:18
cosmicsound:D not the cli version17:18
*** alti_17 has quit IRC17:44
cosmicsoundmagnum-conductor --version17:51
cosmicsound9.2.017:51
cosmicsoundthis is the version i run17:51
cosmicsoundstill my image is not visible17:51
cosmicsoundi'll try to go to latest image version17:52
*** jmlowe has joined #openstack-containers18:01
*** jmlowe has quit IRC18:05
*** jmlowe has joined #openstack-containers18:05
*** jmlowe has quit IRC18:27
*** jmlowe has joined #openstack-containers18:30
*** jmlowe has quit IRC18:34
*** jmlowe has joined #openstack-containers18:35
*** jmlowe has quit IRC18:36
*** jmlowe has joined #openstack-containers18:56
*** jmlowe has quit IRC18:59
-openstackstatus- NOTICE: Memory pressure on zuul.opendev.org is causing connection timeouts resulting in POST_FAILURE and RETRY_LIMIT results for some jobs since around 06:00 UTC today; we will be restarting the scheduler shortly to relieve the problem, and will follow up with another notice once running changes are reenqueued.19:10
*** jmlowe has joined #openstack-containers19:31
*** pcaruana has quit IRC19:38
-openstackstatus- NOTICE: The scheduler for zuul.opendev.org has been restarted; any changes which were in queues at the time of the restart have been reenqueued automatically, but any changes whose jobs failed with a RETRY_LIMIT, POST_FAILURE or NODE_FAILURE build result in the past 14 hours should be manually rechecked for fresh results19:44
*** jmlowe has quit IRC20:07
*** jmlowe has joined #openstack-containers20:30
*** rcernin has joined #openstack-containers21:44
*** ivve has quit IRC22:57
