Friday, 2023-06-09

opendevreviewrenliang proposed openstack/kolla master: Fixed an issue with zun-cin-daemon building images in aarch64.  https://review.opendev.org/c/openstack/kolla/+/885729 08:34
basileusHi, hope everyone is doing great! After a bit of help from this IRC channel I'm back with one very simple question: is there any resource / tutorial for fully dismantling a kolla-ansible environment, by any chance? I want to retry installing openstack correctly this time with Veth and Vbridges, but I was wondering if there is a .sh script to uninstall / revert all changes? Thanks in advance! 08:52
basileusFor information, I deployed it from a virtual environment; I don't know if that affects dismantling the environment08:55
mmalchukbasileus kolla-ansible has a few, take a look in the tools/ directory08:58
mmalchukthere you can find the cleanup-containers, cleanup-host and cleanup-images scripts08:58
mmalchukmaybe they will help you08:59
basileusI saw these scripts in there ! I will skim through them and see what is achievable, I wanted to reach a "semi" clean slate without having to reinstall the entire OS, thanks ! 09:00
mmalchukbut if you want a tool that can really help you - try the kayobe project09:00
mmalchuktry kayobe (https://doc.openstack.org/kayobe)09:02
mmalchukhttps://docs.openstack.org/kayobe/latest/ 09:02
mmalchukyou can do:09:02
mmalchukkayobe overcloud service deploy09:03
mmalchukthen09:03
mmalchukkayobe overcloud service destroy09:03
mmalchukand repeat it)09:03
basileusOh that seems... neater than kolla 09:03
mmalchukkayobe uses kolla-ansible jfyi09:03
mmalchukkayobe in this case some kind of wrapper with cli09:04
basileusI'm assuming, just like my previous install I'll need 2 Veth including an empty one for Neutron? 09:04
mmalchukdon't remember. and with kayobe you can do any experiment you want and repeat deploy and redeploy09:05
basileusawesome ! Let me skim through the cleanup script and try to run it in a virtual environment ! Thanks a lot 09:05
mmalchukalso, you can start to learn Kayobe from this: https://github.com/stackhpc/a-universe-from-nothing 09:06
mmalchukthen dig into the documentation, etc.09:07
basileusAy Ay captain, just had a question, how come some projects use CentOS while other more or less don't recommend it whatsoever? 09:08
mmalchukI chose Ubuntu. it is more stable for me. and this is my choice only.09:09
mmalchukIf you prefer RedHat based distros - have a look at RockyLinux09:10
mmalchukit is well supported by Kolla/Kolla-ansible/Kayobe09:10
mmalchukboth09:10
mmalchukUbuntu and RockyLinux09:10
basileusYeah I originally went for CentOS and quickly swapped back to ubuntu at that point, too many issues during install09:11
mmalchukjfyi: https://docs.openstack.org/kolla-ansible/latest/user/support-matrix.html 09:11
basileusThank you so much ! Time to pull my hair out a bit ! 09:13
hrwbasileus: "kolla-ansible destroy"09:53
fricklerbut I also strongly suggest to do a fresh OS deploy for a re-installation. you'll want to automate that part anyway to reduce work and error rates09:56
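A minimal sketch of the teardown hrw mentions, assuming a kolla-ansible release from this period (2023.1). The inventory path is a placeholder and not something from the discussion. `destroy` refuses to run without the explicit confirmation flag, so the command is only composed and printed here for review before it is run by hand.

```shell
# Assumption: INVENTORY points at the inventory used for the original deploy.
INVENTORY="${INVENTORY:-./all-in-one}"

# destroy removes the Kolla containers and volumes on the hosts in the
# inventory; it will not proceed without the confirmation flag.
DESTROY_CMD="kolla-ansible -i ${INVENTORY} destroy --yes-i-really-really-mean-it"

# Print only; copy and run it yourself once you are sure.
echo "${DESTROY_CMD}"
```

This leaves the host OS in place, which is the "semi clean slate" asked about; a fresh OS install, as frickler suggests, is still the more reproducible route.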
basileusI see, and what would be the best OS version for that ? Would 20.04 Ubuntu server LTS be best or should I stick to 22.04? 09:58
frickler22.04 is required for latest openstack, why start with something old?10:06
mmalchukfrickler did you fix your internet?10:18
fricklergood enough to do IRC at least10:48
mmalchukmay be able to merge docs too?10:49
fricklermaybe, but I also dont see the urgency11:52
mmalchukonly because of no urgency and no rush today I ask... next week there can be problems and urgency to fix other issues maybe12:00
mmalchukbut if you're busy, then ok12:01
spatelI am running kolla-build and it starts building images, but it randomly gets stuck somewhere and doesn't move further.. the only option left is ctrl+c 14:00
spatelHow does it work with CI jobs?  Because of that I am building images one by one.. instead of building all in a single shot 14:00
spatelFor example, This is stuck since last night - https://paste.opendev.org/show/bf1rRhg9V3BCy4KmHPRg/ 14:01
mmalchukstuck - is not an error14:14
mmalchukwhat is the problem?14:14
opendevreviewJuan Pablo Suazo proposed openstack/kolla-ansible master: Configures the tap-as-a-service neutron plugin  https://review.opendev.org/c/openstack/kolla-ansible/+/885417 14:14
spatelmmalchuk I have no idea what is the problem because its just not moving 14:39
spatelIf i ctrl+c and do again then in second run it works14:39
spatelI have 100G internet link so i don't think its internet related issue :)14:39
mmalchukyou can enter the intermediate container and execute the last command interactively14:40
mmalchukand maybe you'll see an error14:40
spatelHmm that is a good way to test.. How do I find the intermediate container? I believe it must show up in the docker ps command 14:41
mmalchukin the log you provided you can find the line about it14:42
mmalchukbut you showed only part... there is more about keystone-ssh, but we need the nova-compute container logs14:43
spatelOh something like this - Removing intermediate container 7cddcee1dc66 14:43
mmalchukyep14:43
mmalchukand lines with an arrow14:43
spatelI see, you are saying find intermediate container ID and get into container with exec -it mode and run last command by hand etc.. 14:43
spatel---> fb570654e598 14:43
mmalchukyep14:44
spatelPerfect! good to know that14:44
mmalchukthis is successfully created layer (intermediate container)14:44
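The approach discussed above can be sketched roughly as follows. It assumes the classic (non-BuildKit) Docker builder, where each `---> <id>` line names an image layer that can be started directly. The sample log is made up for illustration and only reuses the two IDs quoted in the chat. Because the intermediate container itself is removed after each step, the sketch starts a fresh container from the last good layer with `docker run` rather than using `docker exec`.

```shell
# Hypothetical build-log excerpt; a real one comes from the kolla-build output.
cat > /tmp/kolla-build-sample.log <<'EOF'
Step 5/12 : RUN apt-get update
 ---> Running in 7cddcee1dc66
Removing intermediate container 7cddcee1dc66
 ---> fb570654e598
Step 6/12 : RUN apt-get -y install some-package
 ---> Running in 0123456789ab
EOF

# Keep only lines that end in "---> <12 hex chars>" (committed layers),
# skipping "---> Running in ..." lines, and take the last one.
last_layer=$(sed -n 's/.*---> \([0-9a-f]\{12\}\)$/\1/p' /tmp/kolla-build-sample.log | tail -n 1)

echo "last good layer: ${last_layer}"
# Start a shell in that layer and re-run the stuck step by hand:
echo "docker run --rm -it ${last_layer} /bin/bash"
```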
spatelMay be nova-compute container got stuck.. 14:45
spatelI realized building image one-by-one is better way to see things.. 14:46
mmalchukif you show the tail of the log - yes nova-compute14:46
spatelI have built all images with tag 2023.1 but now when I am running deploy I get an error saying - docker-reg:4000/kolla/fluentd:2023.1-ubuntu-jammy not found14:47
spateldo i need to use tag 2023.1-ubuntu-jammy ? 14:47
mmalchukyou can always control the build with config or command-line options14:47
mmalchukthere is one useful option for you - threads14:48
mmalchukset it to 1, but you also need to control images already built, for example - skip_existing, or even control retries - retries14:49
mmalchukalso, to be more verbose, there is good reason to enable debug - debug14:50
spatelmmalchuk I see - The number of threads to use while building. (Note: setting to one will allow real time logging)14:50
mmalchukyep14:50
spatelI always use --debug 14:50
mmalchukthreads = 1, retries = 0, skip_existing = True and debug = True - my choice for troubleshooting)14:51
spatel+1 14:51
mmalchukalso format = none to remove unneeded info in the tail of the logfile14:52
mmalchukformat = none14:52
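The troubleshooting settings listed above could be collected in a separate config file along these lines. The option names are the ones named in the discussion; the file name and the image name are placeholders, and `kolla-build --help` remains the reference for the installed version.

```shell
# Hypothetical file name; keep it apart from the regular kolla-build.conf.
CONF="${CONF:-./kolla-build-debug.conf}"

cat > "${CONF}" <<'EOF'
[DEFAULT]
# one build at a time, so log output appears in real time
threads = 1
# fail immediately instead of retrying
retries = 0
# do not rebuild images that are already present
skip_existing = true
debug = true
# no summary dump at the end of the log
format = none
EOF

# Then build a single image with it, for example:
echo "kolla-build --config-file ${CONF} nova-compute"
```

As noted later in the log, this setup is meant for debugging only, since a single-threaded build takes considerably longer.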
spatelWhy don't we document these option for best practice suggestion ?14:53
spatelIts handy for people like me :)14:54
mmalchukalso, as I can see, you have some problems with ubuntu repos (mirrors) - last two lines with different hosts for one file14:54
spatelYes, its always stuck on some random mirror fetch14:55
mmalchukit tries to download file number 243, but does not succeed from mirrors.cmich.edu, then tries from mirrors.advancedhosters.com14:55
mmalchukthis is very strange14:55
mmalchukhttp://mirrors.cmich.edu/ is online and has the ubuntu repo14:56
mmalchukmaybe you have network issues?14:56
mmalchukit's a good idea to use a locally reachable ubuntu mirror14:57
spatelmaybe a regional issue, I am on the US east coast so not sure if something is going on there14:57
spatel+1 Yes.. I like that idea 14:57
spatelThis is my kolla-build.conf file - https://paste.opendev.org/show/bnI9OFmryOaaFbw3ugXN/ 14:58
mmalchukyou can even run your own mirror, it takes only about 2TB)14:58
mmalchukor use a caching proxy to work around slow network issues14:58
spatelhaha! Not a bad idea, in that case how do i inject local mirror during build? 14:59
mmalchukabout the documentation, you always can run kolla-build --help14:59
spatelI will try to poke and figure out.. 14:59
spatelWe don't build images everyday but again its good to have it local. 15:00
mmalchukyou always can use override mechanisms - described in the documentation15:00
spatelI am getting this error during deploy command - docker-reg:4000/kolla/fluentd:2023.1-ubuntu-jammy not found 15:01
spatelmy tag is 2023.1  so where this -ubuntu-jammy extra suffix coming from?15:01
spatelIn global.yml - openstack_release: "2023.1"15:02
mmalchuk# Docker image tag used by default.15:05
mmalchukopenstack_tag: "{{ openstack_release }}-{{ kolla_base_distro }}-{{ kolla_base_distro_version }}{{ openstack_tag_suffix }}"15:05
mmalchukkolla-ansible/ansible/group_vars/all.yml 15:05
spatelshould I use openstack_tag: 2023.1 ?15:08
mmalchukreasonable if you build images with this tag. but this is not default behaviour15:10
spatelGot it :)15:12
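To make the origin of the suffix concrete, here is the same template expanded in shell. The release value comes from the chat and the distro values from the error message; the empty tag suffix is an assumption.

```shell
openstack_release="2023.1"
kolla_base_distro="ubuntu"
kolla_base_distro_version="jammy"
openstack_tag_suffix=""          # assumption: no suffix on this host

# Same composition as the openstack_tag default in group_vars/all.yml
openstack_tag="${openstack_release}-${kolla_base_distro}-${kolla_base_distro_version}${openstack_tag_suffix}"

echo "kolla-ansible will look for: kolla/fluentd:${openstack_tag}"
```

So either the images are built with the matching tag (for example `kolla-build --tag 2023.1-ubuntu-jammy`), or `openstack_tag: "2023.1"` is set in globals.yml to match what was actually built.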
*** hrww is now known as hrw15:55
spatelmmalchuk does this look ok to you? - https://paste.opendev.org/show/bQyDbzxqA7ZY6yiGsm5l/ 16:48
spatelbuild.py version is 15.1.1 and kolla version 16.x.x 16:48
mmalchuknot really, better to have the binary at the same major version as the code, or newer16:50
spatelmmalchuk I did checkout of 16.x.x tag doesn't it include ./build.py binary? 17:11
mmalchukdon't know what you did)17:11
mmalchukto get the binary you should use pip17:11
mmalchukeven if you build from the source17:12
mmalchukhttps://docs.openstack.org/kolla/latest/admin/image-building.html 17:13
mmalchukdid you see build.py usage here?17:13
spatelThat is pip way to install kolla, I did git clone https:// way 17:13
mmalchukno. pip install binary into system dir or into a virtualenv (better)17:14
spatelhttps://paste.opendev.org/show/bXLJAyII51oWGqdZsKLc/ 17:15
spatelyou are suggesting to do python3 -m pip install kolla==16.0.0 ?17:17
mmalchukok. what next?17:17
fricklerdo you have an mtu < 1500 on your build host? that could affect downloads in docker containers unless you tell docker to use the lower mtu for networking, too17:17
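A rough way to check what frickler describes, assuming a Linux build host. The interface name is a guess to adjust. `mtu` is a documented key of Docker's `/etc/docker/daemon.json`, but the sketch only prints the fragment rather than editing the file.

```shell
# Print a daemon.json fragment when the given MTU is below the 1500 default.
docker_mtu_snippet() {
    if [ "$1" -lt 1500 ]; then
        printf '{ "mtu": %s }\n' "$1"
    fi
}

IFACE="${IFACE:-eth0}"   # assumption: the build host's uplink interface
# Fall back to 1500 when the interface cannot be read.
host_mtu=$(cat "/sys/class/net/${IFACE}/mtu" 2>/dev/null || echo 1500)

echo "host MTU on ${IFACE}: ${host_mtu}"
docker_mtu_snippet "${host_mtu}"
```

If a fragment is printed, merging it into daemon.json and restarting Docker makes the default bridge, and therefore the build containers, use the lower MTU.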
mmalchukit depends. but yes. pip install17:18
mmalchukif you plan to install from pypi - pip install kolla==<version> (version is optional)17:19
mmalchukif you plan to build from source - pip install path_to_source/17:19
mmalchukbut create the virtualenv first (preferred way)17:20
spatelI was reading this doc, look at top section - https://hlyani.github.io/notes/openstack/kolla_image_build.html 17:20
spatelHow did they install kolla? 17:20
spatelI did same way and my binary versions are different as i show you earlier 17:21
spatelhttps://paste.opendev.org/show/bXLJAyII51oWGqdZsKLc/ 17:21
mmalchukJapanese? Queens? this is outdated and in most places incorrect documentation17:22
spatelThat is example.. I know its old 17:22
spatelJust trying to use same method for new release 17:22
mmalchukdon't know how they (Japanese?) do anything. this is not official documentation!17:23
mmalchukplease don't do it this way17:23
spatelOk.. let me try python3 -m pip install kolla==16.0.0 17:23
mmalchukwhat you need:17:23
mmalchuk1. source dir - git clone ...17:24
mmalchuk2. virtualenv dir: python3 -m venv .... or virtualenv .... depends on OS17:24
mmalchuk3. pip install path_to_source/17:25
mmalchuk4. use kolla-build binary from the virtualenv path17:25
mmalchukthats all17:25
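The four steps above can be sketched as a script. It is dry-run by default (commands are printed, not executed) since cloning and installing need network access. The repository URL, the 16.0.0 tag (the version mentioned in the chat) and the directory names are assumptions to adapt.

```shell
# Print each command; set DRY_RUN=0 to execute it for real.
run() {
    if [ "${DRY_RUN:-1}" = "1" ]; then
        echo "+ $*"
    else
        "$@"
    fi
}

SRC_DIR="${SRC_DIR:-./kolla}"          # assumption: where the source goes
VENV_DIR="${VENV_DIR:-./kolla-venv}"   # assumption: where the virtualenv goes

# 1. source dir
run git clone --branch 16.0.0 https://opendev.org/openstack/kolla "${SRC_DIR}"
# 2. virtualenv dir
run python3 -m venv "${VENV_DIR}"
# 3. install kolla from the source tree into the virtualenv
run "${VENV_DIR}/bin/pip" install "${SRC_DIR}"
# 4. use the kolla-build binary from the virtualenv and confirm its version
run "${VENV_DIR}/bin/kolla-build" --version
```

Calling the binaries through the virtualenv path avoids picking up an older system-wide kolla-build, which is the version mismatch seen earlier in the log.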
spatelwhat pip install path_to_source would be? 17:26
mmalchukpip - python installer, install - an option, path_to_source - path to kolla source code17:27
mmalchukif you do 'cd kolla' after 'git clone' - then use 'pip install .' for example17:28
spatelohh! 17:29
spatelLet me try 17:29
mmalchukhave you created the virtualenv beforehand?17:30
spatelYes, I did (I use kolla-ansible to run from venv) 17:30
mmalchukcool. then proceed with 'pip install .'17:30
spatelI did install kolla on multiple place on production but never use own images. I always pull images from public repo and push them to local mirror 17:30
spatelThis time thinking to use own images to run kolla and that is where I am playing right now 17:31
spatelI am planning to install kolla on 600 node cluster so better use own images. 17:32
spatelHope kolla support on that scale 17:32
mmalchukimho it's a bad idea to use images from the internet in production. the better way is to build your own and control everything.17:32
spatelmmalchuk 100% with you.. In past I deployed 10 to 20 node cluster and it was small environment so I didn't bother to build images.17:33
spatelThis time its very large scale deployment so make it right 17:33
mmalchukno matter of size... build takes several minutes17:34
spatelYep17:34
spatelMy plan is to use 3x node for rabbit/DB and 3x for api to support 600 compute 17:35
mmalchukwhat is the reason to split bus/db from api?17:36
spatelJust to have dedicated CPU/memory for rabbitMQ and DB 17:36
spatelPutting everything on 3 node would be too much work 17:37
spatelDB doesn't take lots of CPU/memory but rabbitMQ crush thing very fast17:37
mmalchukby 'Probability theory' 1 of 6 nodes will fail sooner than 1 of 3)17:38
spatelanything is possible when it comes to fail 17:39
mmalchukalso in your case there will be big latency from api to backend because of network17:39
spatelwith 600 nodes the control plane will be very chatty 17:39
spatelI was thinking to use virtual machine for API layer to reduce locks or bottleneck 17:40
mmalchukthe virtualisation layer adds even more latency17:40
spatelI meant multiple VMs for api instead of just 3 17:40
spatelhorizontal scale 17:41
mmalchukdon't reinvent the bicycle)17:41
mmalchukplease read the РФ пгшву17:41
mmalchukHA guide17:41
spatelI know what you saying but sometime its not about CPU power but number of request they handle 17:42
spatelРФ пгшву ?17:42
spatelDo you have link ?17:42
mmalchukmy keyboard switches... sorry. I say 'please read the HA guide'17:43
spatelhaha :)17:43
mmalchukhttps://docs.openstack.org/ha-guide/ 17:43
spatelI did see that and many other openstack summit scaling videos to learn how to scale 17:45
spatelI am running 3 large cloud already in production with openstack-ansible each cloud has 300 compute nodes with 3x controller nodes. 17:46
mmalchukok then) let's get back to kolla17:46
spatelThis is the first time I am pushing to do 600 to 800 compute nodes on a single control plane 17:46
spatelusing kolla 17:46
spatelI am just little worried to use 3x node that is why decided to give dedicated nodes to rabbitmq 17:47
mmalchukhow is your 'pip install .' ?17:47
spatelIts all good :) and images started building 17:47
mmalchukkolla-build --version ?17:48
mmalchuksame as checkout?)17:48
spatelits showing 16.0.0 :)17:48
spatelYesss 17:48
mmalchukcool17:48
mmalchukI'm proud of you)17:48
spatelI am using your options --thread 1 --skip-existing --cache --format none etc.. and not a single image stuck yet :)17:49
spatelYou should be proud :) 17:49
mmalchuknota bene - this way it takes much more time to build17:49
mmalchukthis way is only for debug17:49
spatelI like slow and steady vs troubleshooting 17:50
mmalchuknice17:50
spatelcan you share your kolla-build.conf file if possible.. I am curious to match with my one 17:51
mmalchukalso with skip_existing and cache - in case of temporary failure and repeated build you also need more space for docker17:51
spatelIf its safe to share 17:51
spatelThis time i use cache just to save download time during failure 17:52
mmalchukhttps://paste.openstack.org/show/bNsIeRFLYvj5UQlPJGce/ 17:54
mmalchuknote - this is template only. because I use Kayobe and full config created from this template dynamically17:54
mmalchukand this is not from production, but from my dev lab17:56
spateloh okay17:56
spatelis this your local image? registry-openstack.cloud.local/openstack/infra/ubuntu17:56
mmalchukyep. the local docker registry17:57
mmalchukwith the fixed version of ubuntu from the internet.17:57
mmalchukeven if they have tag 20.0417:58
mmalchukit updated from the internet when needed17:58
spatelI know what you saying because if you build image after few days then may be its 20.04.10017:59
mmalchuklol)17:59
spatelI should push ubuntu image to local repo then 17:59
spatelYou don't put tag in build file? 18:00
mmalchukyep)18:00
mmalchukno18:00
mmalchukbecause it is common. the uncommon build options are passed on the command line when the build starts18:01
mmalchukI can build xena or zed, depend of need18:02
spatelHmmm18:02
mmalchukalso I can build only to test and not push18:02
mmalchukso, some options are in the config, some are passed externally18:02
mmalchukdevops way)18:03
spatelWe have artifactory so plan is to push images there 18:13
mmalchuknot cheap18:20
mmalchukwhy not opensource solution?18:21
spatelWe are big company and we already have all those tools in production :) 18:49
mmalchukso why not the VMWare?)18:55
spatelwe are big but not that big to use vmware. Our all tooling already working for openstack so why go for VMware?  We have multiple business and each business has different budget to run. 19:02
opendevreviewJuan Pablo Suazo proposed openstack/kolla-ansible master: Adds support for Huawei backends  https://review.opendev.org/c/openstack/kolla-ansible/+/869252 20:28
