Friday, 2022-02-25

noonedeadpunkmornings! can we merge https://review.opendev.org/c/openstack/ansible-role-python_venv_build/+/830570 cherry-picks ?11:34
*** frenzyfriday|rover is now known as frenzyfriday12:49
mgariepymorning anyone here have configured vga passthrough with nvlink between 2 gpus ?14:03
opendevreviewMerged openstack/ansible-role-python_venv_build stable/wallaby: Slurp constraints.txt separately for each host in a batch  https://review.opendev.org/c/openstack/ansible-role-python_venv_build/+/83057014:14
noonedeadpunknah, I'm not, we ordered gpu without nvlink at end as info about nvlink was a bit confusing for me at least14:19
mgariepyyep, in my case i do have 4 gpus 2 with nvlink and 2 without..14:19
mgariepyreally not sure how it should work for the nvlink as i'm not 100% sure nova has something to tag the 2 that are nvlink linked.14:20
noonedeadpunkI was actually looking at nvlink in terms of vgpu and how splitting and scheduling would work then...14:20
noonedeadpunkI guess with nvlink it would represented as single pci device, or?14:21
mgariepyin my case the gpus are gaming ones.. 14:21
mgariepycheap for research :) haha14:21
mgariepyi guess i'll create a flavor with all the gpus in it...14:21
mgariepyi wasn't invloved in the purchase on that one. i only manage openstack for them.14:22
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible stable/wallaby: Bump SHAs for Wallaby  https://review.opendev.org/c/openstack/openstack-ansible/+/83040614:35
JamesGiboNot going to help with your NVlink issue, but something to be aware of is that mdev's for GPUs are not recreated on reboot https://bugs.launchpad.net/nova/+bug/1900800?comments=all14:50
mgariepythanks for the inpu JamesGibo 14:54
noonedeadpunkoh, yes, there's nvidia-smi that needs to be launched for mdev14:55
*** dviroel is now known as dviroel|lunch14:57
mgariepyin my case i'm not splitting gpus.15:07
opendevreviewMerged openstack/ansible-role-python_venv_build stable/victoria: Slurp constraints.txt separately for each host in a batch  https://review.opendev.org/c/openstack/ansible-role-python_venv_build/+/83057115:12
* damiandabrowski[m] leaving for a vacation, will be back on March 7th15:25
mgariepytake care damiandabrowski[m] 15:36
damiandabrowski[m]thank You!15:51
*** dviroel|lunch is now known as dviroel16:04
spatelmgariepy here is my GPU passthrough blog - https://satishdotpatel.github.io/gpu-passthrough-for-openstack/17:04
mgariepyspatel, yep i already have that do you have nvlink between gpus ?17:08
mgariepybut anyway i'll have all the gpus inside a single vm.17:09
spatelno i don't have that config.. 17:09
spateldoes it required license ?17:09
spatelhow does nvlink work? they communicate over ethernet?17:10
spatelor this is link between two GPU card within single system (like NUMA)17:11
mgariepylink between 2 gpus17:12
mgariepyi don't have fancy hardware that requires licence :)17:12
mgariepyonly doing passthrought 1:1 17:12
spatellink between 2 GPU in two different system or single?17:12
mgariepya single17:12
mgariepyit's a phisical link between 2 gpus17:13
spatelI have two GPU card in single system but they work independently 17:13
spatelWhat are the advantage to link two GPU? vs two independent card  ?17:14
spatelIf there are significant advantage then i may link them :) 17:15
spatelhttps://www.tomshardware.com/news/nvidia-nvlink-boosts-performance,28989.html17:17
noonedeadpunkI thought they act as single computing core tbh...17:45
noonedeadpunklike 2 socket motherboard with cpu...17:45
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible stable/victoria: Bump SHAs for Victoria  https://review.opendev.org/c/openstack/openstack-ansible/+/83041618:03
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible stable/pike: EOL Pike branch  https://review.opendev.org/c/openstack/openstack-ansible/+/82695618:05
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible stable/xena: Add test of used SHAs  https://review.opendev.org/c/openstack/openstack-ansible/+/83103118:06
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible stable/wallaby: Add test of used SHAs  https://review.opendev.org/c/openstack/openstack-ansible/+/83103218:06
mgariepyi think it's just faster to push data over the nvlink between the gpus  thant go through the pci lanes.18:20
spatelI saw doc. PCI support 16Gbps vs nvlink support 150Gbps with Tesla v100S card  18:40
lowercasenoonedeadpunk: After testing some more i've hit a wall with the fluent-logger + oslo.log. I've posted a question in #openstack-oslo to see if anyone has some guidance. 18:40
opendevreviewMerged openstack/ansible-role-python_venv_build stable/xena: Slurp constraints.txt separately for each host in a batch  https://review.opendev.org/c/openstack/ansible-role-python_venv_build/+/83036918:42
spatelwhat are we trying to achieve with fluent? 18:47
lowercaseThis: https://docs.openstack.org/oslo.log/latest/admin/example_nova.html18:48
lowercaseEnable a fluent logger within openstack to log directly to fluent annndddd hopefully with fluent_formatter it will be automatically in fluent parsable form.18:49
lowercasecause I try to say no to regex when i can lol18:49
noonedeadpunkdamn... sorry I don't really have much time to dig into this18:51
noonedeadpunkif to say any18:52
spatelsorry, may be i am new to fluent so asking odd question. You are saying openstack will send logs to fluent and then it will automatically formate logs.. ?18:54
lowercaseThat's the goal.18:55
lowercaseWith one minor difference. Openstack -> creates logger -> formats logs -> sends to fluent. 18:55
spatelin current deployment of OSA use journalctl 18:56
spatelfluent will keep logs in simple text file? 18:56
lowercaseBy pushing the burden onto openstack oslo, I don't need special regex formatters in a fluent config to parse openstack logs. They are already ready to go, and future proof and the burden is on the openstack oslo team to keep the log format compliant.18:57
lowercaseOSA pushes all openstack application logs to journalctl, yes. What fluent decides to push the logs is up to you. My fluentd pushes the logs to Loki.18:58
spatelkolla-ansible use text file and which is very handy sometime.. 18:59
lowercaseCorrect, but what I gain with journalctl is a lot of extra metadata that is not available by tailing a text log file.19:00
lowercaseAlso, I would not be required to maintain the regex matching that is provided in kolla-ansible.19:01
spatelI like clear text file :) because easy to handle. journal is totally overhead for sure. 19:15
opendevreviewMerged openstack/openstack-ansible-os_zun master: Update Zun api-paste  https://review.opendev.org/c/openstack/openstack-ansible-os_zun/+/82284719:20
opendevreviewJonathan Rosser proposed openstack/ansible-role-pki master: Refactor conditional generation of CA and certificates  https://review.opendev.org/c/openstack/ansible-role-pki/+/83079419:24
opendevreviewJonathan Rosser proposed openstack/openstack-ansible-os_keystone master: Migrate ssl certificate generation to the PKI role  https://review.opendev.org/c/openstack/openstack-ansible-os_keystone/+/83017919:26
spatelhow long k8s cluster create using magnum ? 19:48
spatelmy cluster stuck in CREATE_IN_PROGRESS last 10 min and more 19:49
spatel1 master 2 worker node 19:49
admin1more than 10 mins spatel19:59
admin1sometimes it might last 15-20mins and then it errors out :19:59
spatelhmm now its 20 min 19:59
admin1you can try to ssh and then ps a x19:59
admin1and journalctl and see if its acutally working 19:59
spatelits been long time it stuck here - https://paste.opendev.org/show/bPAZRp1lLHuUylhSoqSA/20:00
admin1i will help you tomorrow :) 20:03
admin1coz i have to run now :) 20:03
admin1have a great weekend .. 20:03
spatelnp.. 20:03
spatelhave a good weekend 20:03
*** dviroel is now known as dviroel|afk20:21
spatelI am using fedora-coreos-35.20220213.2.0-openstack.x86_64.qcow2 image and some how my ssh-key not working.. look like cloud-init doesn't working in that image21:44
spatelsame ssh key working on all other image but not with fedora-coreos21:44

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!