Tuesday, 2019-12-17

clarkbya seems like a reasonable compromise for staying on top of arm64 compat while managing the limited resources we have00:02
ianwfor the static server, should we have one cert with multiple hostnames, or do a separate cert for each?  does it matter?00:03
clarkbwe are going to LE them right? I half expect we'd do a single cert for that?00:05
clarkbI guess the LE hostvars input is a list so could split them up?00:06
clarkbI guess I don't have a strong opinion00:06
*** mattw4 has quit IRC00:07
ianwyeah, we can issue different keys for governance.openstack.org/security.openstack.org, or put them in the same with SAN00:08
clarkbfungi: ^ any opinions?00:08
*** smarcet has quit IRC00:15
clarkbOk I signed up for a geoguessr pro account so that I can make a shared challenge00:19
clarkbmaybe tomorrow after the meeting we can all play a round of "whose geography/travel knowledge is best"00:19
clarkbIt generates a url which I can share and then I think you can play without an account? I guess we'll see tomorrow00:20
openstackgerritMerged opendev/system-config master: install-ansible: Pre-install build deps for aarch64  https://review.opendev.org/69118700:22
openstackgerritMerged opendev/system-config master: Add arm64 mirror test  https://review.opendev.org/69079800:22
ianw"00:37
ianwRegarding which approach is “better” - I’d generally recommend keeping subdomains of the same domain on the same certificate, and different domains on different certificates, to minimize the risk of hitting the rate limits."00:37
clarkbThat makes sense00:37
ianwrandom internet comment ... https://community.letsencrypt.org/t/san-vs-sni-alternate-domain-visibility/53443#00:37
ianwbut that was ~ what i was thinking00:37
*** rlandy|bbl is now known as rlandy00:58
openstackgerritClark Boylan proposed zuul/zuul-jobs master: DNM test all the things on ansible 2.9 with default python  https://review.opendev.org/69834401:01
fungiianw: clarkb: sorry for the delay, but honestly whichever model is simpler is fine. the certs are served from the same server either way so whether they share a key doesn't have any security implications01:13
ianwfungi: i think separate certs ... but i can't understand why testing i'm getting "AH02032: Hostname localhost provided via SNI and hostname security.openstack.org provided via HTTP have no compatible SSL setup"01:13
fungiand yeah, i wouldn't be surprised if a cross-domain cert gets complicated with letsencrypt dns validation01:14
ianwbut i don't get the same doing the same thing for governance.openstack.org .. and they seem to be the same01:14
*** goldyfruit has quit IRC01:14
fungithat sounds like a configuration issue01:14
fungi"Hostname localhost provided via SNI" suggests you're testing against localhost?01:15
ianwyeah; https://review.opendev.org/#/c/697587/11/testinfra/test_static.py01:15
ianwthe first works, the second doesn't ... weird01:15
fungii suspect wget is stepping on your toes there01:16
fungiit's probably trying to do sni for "localhost" independent of the http header you're injecting01:16
*** rosmaita has left #openstack-infra01:20
fungiremember sni happens during (really before) ssl/tls handshake, so by the time you're passing http headers it's too late01:20
ianwhrm, but why doesn't it happen on the first request?01:20
ianwmaybe it's reusing something01:20
fungiis the first request maybe winding up as the default vhost?01:20
fungis/as/at/01:20
ianwhrm ... possibly01:20
fungihave you tried overrides in /etc/hosts? or does that create different problems?01:22
ianwi might have to ... i think curl has that built into command-line01:22
ianwthis worked for the mirror testing, but as you say, it might be because that's become the one-and-only vhost01:22
*** yamamoto has joined #openstack-infra01:25
ianwcurl --insecure  https://security.openstack.org --resolve security.openstack.org:443:127.0.0.1 might work better i think01:27
*** goldyfruit has joined #openstack-infra01:29
*** gyee has quit IRC01:29
fungiseems reasonable01:30
*** larainema has joined #openstack-infra01:30
fungigood to know curl has a --resolve01:31
ianwyeah and that wget is a bit of a trap01:32
*** smarcet has joined #openstack-infra01:32
fungii'm definitely not finding any similar feature in wget01:33
openstackgerritIan Wienand proposed opendev/system-config master: Add roles for a basic static server  https://review.opendev.org/69758701:34
*** goldyfruit has quit IRC01:43
openstackgerritMerged opendev/puppet-ptgbot master: Deploy new logo.png and motd.js files  https://review.opendev.org/69432701:47
*** jamesmcarthur has joined #openstack-infra01:53
*** Lucas_Gray has joined #openstack-infra02:01
*** jamesmcarthur has quit IRC02:02
*** rlandy has quit IRC02:08
ianwnice that works02:10
*** bnemec has quit IRC02:10
*** bnemec has joined #openstack-infra02:12
*** yamamoto has quit IRC02:19
*** Goneri has quit IRC02:21
fungiexcellent!02:21
*** jamesmcarthur has joined #openstack-infra02:38
*** yamamoto has joined #openstack-infra02:38
*** rfolco|bbl has quit IRC02:46
*** jamesmcarthur has quit IRC02:54
openstackgerritAntony Messerli proposed openstack/diskimage-builder master: Adds support for Fedora 31  https://review.opendev.org/69771302:56
*** ricolin has joined #openstack-infra03:03
*** apetrich has quit IRC03:09
*** psachin has joined #openstack-infra03:37
*** ykarel|away has joined #openstack-infra03:46
*** jamesmcarthur has joined #openstack-infra03:54
*** goldyfruit has joined #openstack-infra04:01
*** Lucas_Gray has quit IRC04:17
*** jamesmcarthur has quit IRC04:26
*** ykarel|away has quit IRC04:32
*** yamamoto has quit IRC04:34
*** yamamoto has joined #openstack-infra04:40
*** ykarel|away has joined #openstack-infra04:52
*** ykarel|away is now known as ykarel04:53
*** goldyfruit has quit IRC05:00
*** rh-jelabarre has quit IRC05:00
*** factor has quit IRC05:05
*** ramishra has joined #openstack-infra05:10
openstackgerritIan Wienand proposed openstack/diskimage-builder master: Use olso to parse size strings  https://review.opendev.org/50357405:20
*** yamamoto has quit IRC05:20
*** raukadah is now known as chkumar|rover05:33
*** factor has joined #openstack-infra05:34
*** slaweq has joined #openstack-infra05:36
*** yamamoto has joined #openstack-infra05:41
*** slaweq has quit IRC05:43
openstackgerritIan Wienand proposed openstack/project-config master: Update nodepool dib stats  https://review.opendev.org/63858305:52
*** yamamoto has quit IRC05:53
*** yamamoto has joined #openstack-infra05:54
*** yamamoto has quit IRC06:07
*** ricolin_ has joined #openstack-infra06:10
openstackgerritMerged openstack/project-config master: Update nodepool dib stats  https://review.opendev.org/63858306:10
*** ricolin has quit IRC06:13
openstackgerritMerged openstack/diskimage-builder master: Install rng-tools in Red Hat family distro images  https://review.opendev.org/69718306:25
openstackgerritMerged zuul/nodepool master: Dockerfile: install sudo for nodepool-builder  https://review.opendev.org/69470906:26
openstackgerritMerged zuul/nodepool master: Dockerfile: add DEBUG environment flag  https://review.opendev.org/69484506:27
*** jamesmcarthur has joined #openstack-infra06:28
*** jamesmcarthur has quit IRC06:33
*** hwoarang has quit IRC06:36
*** hwoarang has joined #openstack-infra06:37
openstackgerritAndreas Jaeger proposed openstack/openstack-zuul-jobs master: DNM: Remove openstack-python-jobs-trusty  https://review.opendev.org/69934806:42
*** yamamoto has joined #openstack-infra06:47
*** lpetrut has joined #openstack-infra06:48
*** yamamoto has quit IRC06:58
*** kozhukalov has joined #openstack-infra06:59
*** ricolin_ is now known as ricolin07:00
AJaegerfungi, do you know whether we still need to run legacy-storyboard-js-integration-ubuntu-trusty and legacy-storyboard-js-integration jobs on storyboard and storyboard-webclient? Those are experimental. Can they get removed?07:06
*** pkopec has joined #openstack-infra07:09
diablo_rojo_phonAJaeger: I want to way they can get removed, but I would definitely wait for fungi's answer.07:09
diablo_rojo_phon*say07:09
AJaegerthanks, diablo_rojo_phon. In that case let me push a change and wait for fungi to review ;)07:12
openstackgerritAndreas Jaeger proposed openstack/project-config master: Remove experimental legacy-storyboard-integration jobs  https://review.opendev.org/69935307:14
AJaegerfungi, diablo_rojo_phon ^ - I'll WIP until fungi confirms.07:14
diablo_rojo_phonHopefully I'm not wrong lol07:20
AJaegerit's a tiny change, so no hazzle if you are ;)07:35
AJaegerhazzle? Trouble? Argh, my English is rusty ;(07:35
*** ykarel is now known as ykarel|lunch07:41
openstackgerritAndreas Jaeger proposed openstack/project-config master: Remove legacy-group-based-policy trusty jobs  https://review.opendev.org/69936007:47
*** pcaruana has joined #openstack-infra07:52
*** lpetrut has quit IRC07:54
openstackgerritAndreas Jaeger proposed openstack/project-config master: Remove trusty fuel jobs  https://review.opendev.org/69936207:54
AJaegerfungi, clarkb, ianw, you discussed trusty jobs, above are a few changes to remove most of them.07:55
AJaegerIt still leaves a few in: legacy-puppet-openstack-infra-spec-helper-unit-ubuntu-trusty, project-config-bindep-fallback-ubuntu-trusty, publish-wheel-mirror-ubuntu-trusty, legacy-gearman-plugin-maven-build-ubuntu-trusty, legacy-group-based-policy-dsvm-functional-ubuntu-trusty, legacy-zmq-event-publisher-maven-build-ubuntu-trusty.07:56
AJaegerWhich of these are obsolete as well?07:56
AJaegerthere's also  legacy-logstash-filters-ubuntu-trusty07:58
*** slaweq has joined #openstack-infra07:59
*** pgaxatte has joined #openstack-infra07:59
*** dchen has quit IRC08:03
openstackgerritAndreas Jaeger proposed opendev/storyboard master: Remove project stanza from zuul.yaml  https://review.opendev.org/69936308:04
*** rpittau|afk is now known as rpittau08:05
*** yamamoto has joined #openstack-infra08:08
openstackgerritAndreas Jaeger proposed openstack/openstack-zuul-jobs master: DNM: Remove openstack-python-jobs-trusty  https://review.opendev.org/69934808:14
*** tkajinam has quit IRC08:18
*** tosky has joined #openstack-infra08:20
*** tesseract has joined #openstack-infra08:25
*** ykarel|lunch is now known as ykarel08:31
*** hashar has joined #openstack-infra08:36
*** yamamoto has quit IRC08:41
*** yamamoto has joined #openstack-infra08:46
*** ralonsoh has joined #openstack-infra08:46
*** apetrich has joined #openstack-infra08:47
*** jpena|off is now known as jpena08:49
*** lucasagomes has joined #openstack-infra08:54
*** priteau has joined #openstack-infra08:57
*** dklyle_ has joined #openstack-infra09:01
*** priteau has quit IRC09:02
*** kevinz_ has joined #openstack-infra09:02
*** dtantsur has joined #openstack-infra09:02
*** clayg_ has joined #openstack-infra09:03
*** priteau has joined #openstack-infra09:03
*** ildikov_ has joined #openstack-infra09:03
*** rpioso_ has joined #openstack-infra09:03
*** fresta_ has joined #openstack-infra09:03
*** davecore_ has joined #openstack-infra09:03
*** knikolla_ has joined #openstack-infra09:03
*** coreycb_ has joined #openstack-infra09:03
*** ykarel_ has joined #openstack-infra09:03
*** petevg_ has joined #openstack-infra09:03
*** tosky_ has joined #openstack-infra09:04
*** cyberpear_ has joined #openstack-infra09:04
*** zxiiro_ has joined #openstack-infra09:04
*** jistr_ has joined #openstack-infra09:05
*** benj_- has joined #openstack-infra09:05
*** Anticime1 has joined #openstack-infra09:05
*** iokiwi3 has joined #openstack-infra09:06
*** StevenK_ has joined #openstack-infra09:06
*** lpetrut has joined #openstack-infra09:06
*** ktsuyuzaki has joined #openstack-infra09:09
*** amotoki_ has joined #openstack-infra09:09
*** tesseract has quit IRC09:10
*** tosky has quit IRC09:10
*** kozhukalov has quit IRC09:10
*** ykarel has quit IRC09:10
*** psachin has quit IRC09:10
*** iokiwi has quit IRC09:10
*** rcernin has quit IRC09:10
*** szaher has quit IRC09:10
*** osmanlicilegi has quit IRC09:10
*** tbarron has quit IRC09:10
*** zer0c00l has quit IRC09:10
*** jistr has quit IRC09:10
*** benj_ has quit IRC09:10
*** AJaeger has quit IRC09:10
*** david-lyle has quit IRC09:10
*** SotK has quit IRC09:10
*** kambiz has quit IRC09:10
*** jamesdenton has quit IRC09:10
*** andreykurilin has quit IRC09:10
*** fnordahl has quit IRC09:10
*** StevenK has quit IRC09:10
*** cmorpheus has quit IRC09:10
*** dtantsur|afk has quit IRC09:10
*** kevinz has quit IRC09:10
*** markmcclain has quit IRC09:10
*** zxiiro has quit IRC09:10
*** admcleod has quit IRC09:10
*** calebb has quit IRC09:10
*** yankcrime has quit IRC09:10
*** radez has quit IRC09:10
*** nicholas has quit IRC09:10
*** arif-ali has quit IRC09:10
*** rakhmerov has quit IRC09:10
*** fresta has quit IRC09:10
*** rpioso has quit IRC09:10
*** kota_ has quit IRC09:10
*** cyberpear has quit IRC09:10
*** petevg has quit IRC09:10
*** SergeyLukjanov has quit IRC09:10
*** bradm has quit IRC09:10
*** amotoki has quit IRC09:10
*** zaro has quit IRC09:10
*** Anticimex has quit IRC09:10
*** coreycb has quit IRC09:10
*** clayg has quit IRC09:10
*** davecore has quit IRC09:10
*** knikolla has quit IRC09:10
*** ildikov has quit IRC09:10
*** kevinz_ is now known as kevinz09:12
*** clayg_ is now known as clayg09:12
*** petevg_ is now known as petevg09:12
*** rpioso_ is now known as rpioso09:12
*** szaher has joined #openstack-infra09:12
*** zxiiro_ is now known as zxiiro09:12
*** davecore_ is now known as davecore09:12
*** knikolla_ is now known as knikolla09:12
*** coreycb_ is now known as coreycb09:12
*** ildikov_ is now known as ildikov09:12
*** cyberpear_ is now known as cyberpear09:12
*** iokiwi3 is now known as iokiwi09:12
*** benj_- is now known as benj_09:12
*** admcleod has joined #openstack-infra09:12
*** SotK has joined #openstack-infra09:12
*** yamamoto has quit IRC09:13
*** AJaeger has joined #openstack-infra09:16
*** zaro has joined #openstack-infra09:17
*** tesseract has joined #openstack-infra09:17
*** rcernin has joined #openstack-infra09:18
*** yamamoto has joined #openstack-infra09:18
*** tosky_ is now known as tosky09:19
*** ykarel_ is now known as ykarel09:20
*** yamamoto has quit IRC09:23
*** derekh has joined #openstack-infra09:33
*** sshnaidm|afk is now known as sshnaidm09:34
*** arif-ali has joined #openstack-infra10:09
*** yankcrime has joined #openstack-infra10:15
*** hashar has quit IRC10:24
*** priteau has quit IRC10:38
*** ociuhandu has joined #openstack-infra10:49
*** kozhukalov has joined #openstack-infra10:56
*** yamamoto has joined #openstack-infra11:02
*** yamamoto has quit IRC11:03
*** yamamoto has joined #openstack-infra11:06
*** ijw has joined #openstack-infra11:07
*** ijw has quit IRC11:12
*** lucasagomes has quit IRC11:24
*** ociuhandu has quit IRC11:27
*** ociuhandu has joined #openstack-infra11:28
*** yamamoto has quit IRC11:31
openstackgerritAndreas Jaeger proposed openstack/openstack-zuul-jobs master: Remove openstack-python-jobs-trusty template  https://review.opendev.org/69934811:32
*** ociuhandu has quit IRC11:32
*** ociuhandu has joined #openstack-infra11:33
*** osmanlicilegi has joined #openstack-infra11:37
*** ianychoi has quit IRC11:38
*** ociuhandu has quit IRC11:39
*** ianychoi has joined #openstack-infra11:40
*** surpatil has joined #openstack-infra11:41
*** SurajPatil has joined #openstack-infra11:41
*** yamamoto has joined #openstack-infra11:41
*** yamamoto has quit IRC11:45
*** smarcet has quit IRC11:52
*** yamamoto has joined #openstack-infra11:54
*** ociuhandu has joined #openstack-infra11:55
*** yamamoto has quit IRC11:56
*** rfolco|bbl has joined #openstack-infra11:57
ShrewsAJaeger: I like this new word "hazzle" that you have invented.11:59
*** amotoki_ is now known as amotoki12:12
openstackgerritMerged zuul/nodepool master: Also build sibling container images  https://review.opendev.org/69739312:19
*** rfolco|bbl is now known as rfolco12:27
*** rosmaita has joined #openstack-infra12:35
*** yamamoto has joined #openstack-infra12:37
*** tesseract has quit IRC12:38
*** tesseract has joined #openstack-infra12:38
*** jpena is now known as jpena|lunch12:38
*** lucasagomes has joined #openstack-infra12:39
*** goldyfruit has joined #openstack-infra12:41
*** hwoarang has quit IRC12:53
*** hwoarang has joined #openstack-infra12:53
*** gfidente has joined #openstack-infra12:54
*** surpatil has quit IRC12:54
*** SurajPatil has quit IRC12:55
*** surpatil has joined #openstack-infra12:55
*** SurajPatil has joined #openstack-infra12:55
*** jamesmcarthur has joined #openstack-infra12:56
*** sshnaidm is now known as sshnaidm|afk12:58
*** rlandy has joined #openstack-infra12:59
*** SurajPatil has quit IRC13:00
*** SurajPatil has joined #openstack-infra13:01
*** surpatil has quit IRC13:01
*** cmurphy has joined #openstack-infra13:02
*** surpatil has joined #openstack-infra13:02
*** jamesmcarthur has quit IRC13:02
*** jamesmcarthur has joined #openstack-infra13:08
*** lbragstad_ is now known as lbragstad13:08
*** rh-jelabarre has joined #openstack-infra13:11
*** udesale has joined #openstack-infra13:11
*** ykarel is now known as ykarel|afk13:12
*** udesale has quit IRC13:12
*** udesale has joined #openstack-infra13:13
openstackgerritMonty Taylor proposed opendev/system-config master: Use explicit image paths  https://review.opendev.org/69051113:13
openstackgerritMonty Taylor proposed opendev/system-config master: Update pip3 role to install from get-pip.py  https://review.opendev.org/69076613:13
openstackgerritMonty Taylor proposed opendev/system-config master: Add service playbook and test run for prod gerrit  https://review.opendev.org/69117113:13
openstackgerritMonty Taylor proposed opendev/system-config master: Add launchpadlib credentials to gerrit ansible  https://review.opendev.org/69117213:13
openstackgerritMonty Taylor proposed opendev/system-config master: Add replication config to gerrit ansible  https://review.opendev.org/69117313:13
openstackgerritMonty Taylor proposed opendev/system-config master: Plumb through storyboard hiera data  https://review.opendev.org/69177713:13
openstackgerritMonty Taylor proposed opendev/system-config master: Clean up review comments  https://review.opendev.org/69200313:13
openstackgerritMonty Taylor proposed opendev/system-config master: Plumb through secure.config contents  https://review.opendev.org/69180013:13
openstackgerritMonty Taylor proposed opendev/system-config master: Update bazel to version 1.2.0  https://review.opendev.org/69940613:13
mordredShrews: do you see how good I am at being out?13:13
Shrewsmordred: it's about what I expected, tbh13:15
*** ykarel|afk has quit IRC13:17
*** smarcet has joined #openstack-infra13:27
*** ociuhandu has quit IRC13:28
*** ociuhandu has joined #openstack-infra13:28
AJaegerfungi, thanks for confirming the storyboard jobs!13:30
*** jamesmcarthur has quit IRC13:31
mordredfungi: got a sec and feel like +Aing a python2 remove patch? https://review.opendev.org/#/c/699132/13:31
AJaegermordred: want to +2A https://review.opendev.org/699363 and https://review.opendev.org/699353, please?13:33
mordred++13:34
AJaegerthanks13:34
*** ociuhandu has quit IRC13:34
openstackgerritAndreas Jaeger proposed openstack/openstack-zuul-jobs master: Remove legacy-storyboard-js-integration jobs  https://review.opendev.org/69941213:36
AJaegermordred, fungi, and followup cleanup, please ^13:36
openstackgerritMonty Taylor proposed opendev/storyboard master: Add a dedicated "lpimport" tox testenv  https://review.opendev.org/64906513:37
*** udesale has quit IRC13:39
*** jpena|lunch is now known as jpena13:41
*** jamesmcarthur has joined #openstack-infra13:42
mordredfungi: thanks!13:46
mordredAJaeger: I assume you'll recheck that one when appropriate13:46
openstackgerritMerged openstack/project-config master: Remove experimental legacy-storyboard-integration jobs  https://review.opendev.org/69935313:47
AJaegermordred: just done - thanks13:48
*** yamamoto has quit IRC13:49
*** yamamoto has joined #openstack-infra13:51
*** yamamoto has quit IRC13:51
*** mriedem has joined #openstack-infra13:57
openstackgerritMerged opendev/storyboard master: Remove project stanza from zuul.yaml  https://review.opendev.org/69936313:58
*** smarcet has quit IRC14:00
*** dpawlik has joined #openstack-infra14:00
*** ociuhandu has joined #openstack-infra14:01
*** smarcet has joined #openstack-infra14:01
*** sshnaidm|afk is now known as sshnaidm14:02
*** liuyulong has joined #openstack-infra14:03
*** tkajinam has joined #openstack-infra14:04
*** smarcet has quit IRC14:05
*** surpatil has quit IRC14:12
*** SurajPatil has quit IRC14:13
*** smarcet has joined #openstack-infra14:14
*** ociuhandu has quit IRC14:21
openstackgerritMerged openstack/openstack-zuul-jobs master: Remove legacy-storyboard-js-integration jobs  https://review.opendev.org/69941214:22
*** smarcet has quit IRC14:27
*** smarcet has joined #openstack-infra14:31
rm_worktrying to run devstack on cent7 and it's exploding during a package install step because debootstrap isn't found (obviously)... am i missing something obvious in my config? pretty sure this used to work before <_<14:36
*** factor has quit IRC14:39
*** eharney has joined #openstack-infra14:42
mnaserrm_work: i think debootstrap is in epel14:42
mnaserso i would add it and you'll probably be ok14:42
*** addyess has quit IRC14:42
*** smarcet has quit IRC14:43
*** panda has quit IRC14:44
*** panda has joined #openstack-infra14:44
*** tkajinam has quit IRC14:47
*** ykarel|afk has joined #openstack-infra14:47
*** addyess has joined #openstack-infra14:47
*** smarcet has joined #openstack-infra14:48
rm_workah ok14:50
rm_workdidn't think you could do that on an RPM distro :D14:50
*** ykarel|afk is now known as ykarel14:51
mnaser:)14:51
*** yamamoto has joined #openstack-infra14:54
*** yamamoto has quit IRC14:54
fungijust like you an use rpm on a debian derivative14:54
fungier, you can14:54
*** yamamoto has joined #openstack-infra14:55
*** yamamoto has quit IRC15:00
*** smarcet has quit IRC15:01
*** ykarel has quit IRC15:03
*** ykarel has joined #openstack-infra15:03
*** smarcet has joined #openstack-infra15:05
rm_workblegh, we have a custom internal epel repo and it's missing debootstrap, that's why this is failing lol15:06
*** kozhukalov has quit IRC15:07
fungiinfra-root: heads up, we got another report of an exposed portmapper socket on a job node in inap, their scanner saw it on 198.72.124.78 at 2019-12-12T00:00:45.215319 utc15:08
*** piotrowskim has joined #openstack-infra15:09
fungii'm a bit strapped for time trying to prepare to head out of town tomorrow, does anyone have a few minutes to track it back to a build and correlate the findings with my analysis from a week ago? http://eavesdrop.openstack.org/irclogs/%23openstack-infra/%23openstack-infra.2019-12-10.log.html#t2019-12-10T15:21:5315:09
*** lourot has quit IRC15:10
*** zzehring has joined #openstack-infra15:12
*** ociuhandu has joined #openstack-infra15:14
corvusi'll track it back to a build15:16
*** lourot has joined #openstack-infra15:16
*** ociuhandu has quit IRC15:18
openstackgerritAlbin Vass proposed zuul/nodepool master: 'keys' must be defined for host-key-checking: false  https://review.opendev.org/69802915:19
corvusfungi, mgagne: it looks like we did not have that IP at that time.  here are the log entries for the node before and the node after: http://paste.openstack.org/show/787677/15:23
*** jaosorior has joined #openstack-infra15:26
*** smarcet has quit IRC15:36
*** ociuhandu has joined #openstack-infra15:36
*** eharney has quit IRC15:43
*** michael-beaver has joined #openstack-infra15:45
*** ociuhandu has quit IRC15:46
*** jamesmcarthur has quit IRC15:47
*** jamesmcarthur has joined #openstack-infra15:48
*** chkumar|rover is now known as raukadah15:50
*** jamesmcarthur has quit IRC15:53
*** smarcet has joined #openstack-infra15:55
*** jamesmcarthur has joined #openstack-infra15:59
*** lucasagomes has quit IRC16:00
fungithanks corvus! i've definitely seen that happen with their scans before as well, they don't seem to strictly correlate time in their nova logs (or make mistakes when doing so). maybe they assume servers which show up in their scans are long-lived and so just look to see which tenant had that ip address most recently or currently?16:01
*** ykarel is now known as ykarel|away16:04
openstackgerritClark Boylan proposed zuul/zuul-jobs master: DNM test all the things on ansible 2.9 with default python  https://review.opendev.org/69834416:04
*** hwoarang has quit IRC16:06
*** ociuhandu has joined #openstack-infra16:07
*** larainema has quit IRC16:09
*** hwoarang has joined #openstack-infra16:11
*** ociuhandu has quit IRC16:12
AJaegerconfig-core, working on trusty removal: https://review.opendev.org/699348 removes a unused template; https://review.opendev.org/699362 and https://review.opendev.org/699360 remove some jobs on repos/branches that are not used. Reviews welcome!16:19
openstackgerritClark Boylan proposed zuul/zuul-jobs master: Use present for package state instead of installed  https://review.opendev.org/69945016:24
fungijust discovered in #openstack-nova, devstack-gate defaults to installing ansible 2.516:27
*** ociuhandu has joined #openstack-infra16:27
clarkbfungi: ya, because everytime ansible would update it would break the gate16:28
fungiwhich is no longer compatible with ara, so grenade jobs are blowing up because grenade still uses devstack-gate16:28
clarkbso we pinned it16:28
clarkbah I guess we need to pin ara too then16:28
fungiyeah, mriedem ^ that's another option16:28
clarkbwe can also bump the pin for ansible up16:29
clarkband that chnge should be self testing16:29
clarkb2.7 is likely a safe ish jump16:29
*** gyee has joined #openstack-infra16:29
clarkb2.8 and 2.9 less safe16:29
mriedemlooks like we do pin ara https://github.com/openstack/devstack-gate/blob/13972f3a9b40b03121b7e8c6270cae36cf947beb/devstack-vm-gate-wrap.sh#L49916:30
clarkbhuh did ara < 1.0.0 get releases that stopped supporting old ansible?16:31
clarkbI think we should start by pushing a change to try ansible 2.716:31
clarkbthat I expect will be mostly safe16:31
mriedemhttps://pypi.org/project/ara/#history16:32
*** ociuhandu has quit IRC16:32
openstackgerritAntony Messerli proposed openstack/diskimage-builder master: Adds support for Fedora 31  https://review.opendev.org/69771316:32
mriedemyeah ara 0.16.6 dropped support for ansible < 2.616:36
clarkbbut that was back in november?16:36
mriedemyup16:36
mriedemnot sure why it's showing up now16:36
clarkbweird. But ya I think lets try ansible 2.7 and if that fails we can pin ara < 0.16.616:37
mriedemhttps://github.com/ansible-community/ara/commit/872ba818fbc5267b1f769d5485fd3ab318235aea fwiw16:37
openstackgerritMerged zuul/nodepool master: Add container-with-siblings functional test  https://review.opendev.org/69346416:37
clarkbmriedem: do you want to push that change or should I/16:37
mriedemi can push it16:37
mriedemjust making sure we have ansible 2.7 on xenial nodes and we do:16:40
mriedem2019-12-06 01:06:16.927660 | Ansible Version: 2.7.1416:40
*** pgaxatte has quit IRC16:40
mriedem^ is from a stable/pike grenade job16:40
*** eharney has joined #openstack-infra16:43
*** ricolin has quit IRC16:44
*** lpetrut has quit IRC16:44
*** jpena is now known as jpena|brb16:45
mriedemweird - looking at an ocata tempest job that's using xenial it is using ansible 2.8.7, but maybe because it's using the uCA16:47
mriedem*UCA16:47
*** jamesmcarthur has quit IRC16:48
*** jamesmcarthur has joined #openstack-infra16:48
openstackgerritMatt Riedemann proposed openstack/devstack-gate master: Bump ANSIBLE_VERSION default to 2.7.14  https://review.opendev.org/69946316:51
clarkbmriedem: there are two layers of ansible here too. The zuul ansible (whihc should be 2.8.7) and the ansibel that d-g runs under the zuul ansible which is currently 2.5 or so16:52
*** jamesmcarthur has quit IRC16:53
*** jamesmcarthur has joined #openstack-infra16:53
*** ociuhandu has joined #openstack-infra16:53
*** efried_pto has quit IRC16:54
*** ociuhandu has quit IRC16:55
*** ociuhandu has joined #openstack-infra16:55
mriedemit's also funny that this appears to be intermittent16:55
mriedemi have a nova change on master that just got into the gate16:55
clarkbperhaps the ara compatibility depends on ansible's run state which depends on factors that aren't consistent?16:56
*** jamesmcarthur has quit IRC16:56
clarkbsome runs it "just works" by chance?16:56
mriedemyeah idk, looking at logstash it's multiple branches/jobs/providers16:59
*** ykarel|away has quit IRC16:59
*** tesseract has quit IRC17:00
mriedemso, it might not be the ansible thing17:02
mriedemhttps://zuul.opendev.org/t/openstack/build/ce17ff84afb04b339cb98917c3625ced/log/logs/devstack-gate-setup-host.txt#184717:02
mriedemin this case it was a network failure17:02
openstackgerritMerged zuul/zuul master: Remove support for ansible 2.5  https://review.opendev.org/65043117:06
mriedemclarkb: why are we even installing ara anymore? looking at a tempest job on master it doesn't use ara - i thought the zuul console stuff replaced ara?17:06
dmsimardara 0.x /probably/ still works with 2.5 but it's untested17:06
clarkbmriedem: its for the nested ansible run. You are correct that the zuul level ansible doesn't use ara anymore17:07
mriedemand ara from a grenade/devstack-gate job is busted: https://storage.gra.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_ed7/638047/56/check/nova-grenade-multinode/ed76ba4/logs/ara/index.html.gz17:07
mriedemok so maybe ^ is just the result of the issue we're trying to fix17:07
dmsimardmriedem: I'm not sure gzipping the reports work without a web server in front to transparently rewrite URLs and decompress things17:07
dmsimarduncompressed reports should just work17:08
clarkbdmsimard: the file encodings should be set properly in swift17:08
clarkbso your browser should decompress them17:08
*** smarcet has left #openstack-infra17:08
dmsimardclarkb: sure, but the links to the css/js files are not the compressed versions17:08
clarkbah17:09
dmsimardthe logserver had a rule to try the gzipped version first and fall back to uncompressed17:09
clarkbthen the fix would be to stop having an explicit compression step in the job and let the upload manage it17:09
clarkbwe compress on upload if not already compressed, but keep the filename and set the encoding17:09
dmsimardfor ara-report specifically, there is a toggle for that: https://opendev.org/zuul/zuul-jobs/src/branch/master/roles/ara-report/tasks/main.yaml#L6117:10
dmsimardif there is something else that goes and compress everything, it would override that though17:10
mriedembtw, if there is a goal to drop devstack-gate usage grenade jobs are probably the big remaining user of d-g and https://review.opendev.org/#/c/548936/102 is the thing holding that back17:14
mriedemi haven't been following it but it's huge and scary17:15
clarkbmriedem: yup. tosky has been working on grenade jobs without d-g17:15
mgagnecorvus, fungi: IP addresses aren't shared at inap. This means no other costumer can use them beside your account.17:18
clarkbmaybe the timestamp is wrong then?17:19
toskyclarkb, mriedem the jobs seems to be working; there is just one comment by Sean Mooney about a possible improvement17:19
toskyand I hold back waiting for more comments17:19
toskyI guess I need to start working again on them17:19
clarkbtosky: we might want to land that during the holiday quiet period17:19
clarkbworst case it breaks and we can revert?17:20
*** Goneri has joined #openstack-infra17:20
clarkbbut may give us a low volume time to burn it in a bit17:20
toskyif you want to comment: https://review.opendev.org/#/q/topic:grenade_zuulv3+status:open17:20
clarkbtosky: I'll try to make time for it after the infra meeting today17:20
toskythanks17:21
toskyif that improvement is the only request that comes out, I will try to implement it as soon as possible17:21
toskybasically everything seemed to be working, apart from a multinode grenade job which failed while running their tests I think after the upgrade17:22
toskybut not sure it was really related to my changes17:22
toskyaaaanyway, there is a long history of comments :)17:22
openstackgerritMerged zuul/zuul-jobs master: Use present for package state instead of installed  https://review.opendev.org/69945017:22
*** jpena|brb is now known as jpena17:23
toskyuh, weird (the author)17:23
*** ramishra has quit IRC17:23
toskylet me restore it17:23
fungimgagne: what are the chances future reports could include a nova instance name or uuid? i suppose dereferencing ip addresses in real time during the scan is nontrivial?17:23
AJaegermriedem: there's a goal to use "zuul v3 native jobs" and that implies getting rid of devstack-gate. Community goal for V release, did not make it for U17:23
fungimainly because of concerns grenade wouldn't be updated in time for ussuri17:24
mgagnefungi: little to no chance. The scanning system isn't linked to OpenStack. It is performed by a different department which scans all our IP space, including non-openstack systems.17:24
*** rpittau is now known as rpittau|afk17:25
fungimgagne: no worries, figured it didn't hurt to ask. i can understand that wouldn't be easy. i also wonder if that address could belong to a rogue vm nova has lost track of?17:25
mgagnefungi: is it the same IP as last time? I didn't check.17:26
mgagnefungi: if it's rogue, you should be able to ping it right now.17:26
fungiit's not the same ip address as a week prior, no, i just checked that myself too17:27
fungithat ip address is pingable at the moment17:27
fungi198.72.124.7817:27
fungibut it may have a running node i'll check17:27
fungiyeah, up just over an hour17:28
fungiLSB Version:    :core-4.1-amd64:core-4.1-noarch17:29
fungido we have coreos nodes?17:30
fungiis that what that means?17:30
mgagnefungi: I'll try to ask for more details17:30
mgagnefungi: about the timestamp17:30
fungithanks17:30
toskymriedem, clarkb : fixed the attribution of one review, rebase all the stack17:31
toskyrebased*17:31
*** jamesmcarthur has joined #openstack-infra17:31
fungiyeah, at the moment 198.72.124.78 is in use booted as a centos-7 job node17:32
fungiso definitely explains why it's pingable17:32
clarkbfungi: jobs can replace their root disk and reboot17:32
fungiyeah, i think this is an octavia job which has done just that17:33
*** jamesmcarthur has quit IRC17:33
fungijudging from the ansible processes i see running17:33
*** jamesmcarthur has joined #openstack-infra17:34
johnsomfungi What did we do now? lol17:35
funginothing!17:35
johnsomOh, ok, something I can help with then?17:35
fungii was just logging into a node to check that it was doing stuff, and the motd and lsb_release output confused me17:35
fungiit's not a problem17:36
clarkbunless coreos opens a bunch of servies to the internet without a firewall17:36
johnsomWe don't have any coreos jobs (we don't support it)17:36
fungiclarkb: yeah, i don't think the running build is in any way related to the notification from a week ago17:37
johnsomI looked at it about a year ago and it seemed like it was going to be a bunch of work to get the image size down17:37
fungi(and i did check `netstat -lnu` for 111/udp just out of curiosity while i was in there)17:37
fungiaha, it's a tripleo job17:38
*** mattw4 has joined #openstack-infra17:38
fungii saw "ansible-playbook --tags build,standalone,octavia" and immediately though it could be an octavia job17:39
fungis/though/thought/17:39
fungibut it's just a job using octavia (one of many, i'm sure)17:39
johnsomYep, they are around... lol17:40
fungianyway, i did not mean to summon you, was just trying to work out what jobs we have running on coreos since we don't have any coreos nodes, that's all ;)17:42
johnsomNo worries17:42
clarkbfungi: corvus we could add a cleanup task that checked for port 111 use17:43
clarkbthen skim the zuul logs after a week17:43
fungiwell, tons of jobs run on images with a portmapper listening, the trick is working out which ones expose it through iptables17:44
clarkbnc $public_ip 111 ?17:44
fungidoes nc do udp?17:44
clarkbyes17:44
fungineat!17:44
corvusclarkb, fungi, mgagne: it seems to me that the most likely explanation is that the scanning process performs the customer lookup long after it identifies a host with an open port, and we're just getting mis-routed reports.  do we really need to take any further action?17:45
clarkbthough I think you need respionses for it to be useful?17:45
funginc -u per the manpage17:45
clarkbcorvus: mgagne says the IPs are per tenant so that IP will only ever be used by us17:45
fungicorvus: mgagne says we have dedicated ip addresses17:45
mgagnecorvus: This would mean the scanner got the IP wrong. The IPs are dedicated to your account.17:45
corvusoh, interesting17:46
corvuswell, it either got the ip or the time wrong17:46
fungiso still possible they were wrong about the timezone it's using, yes17:46
mgagnecorvus: I asked more details about the time to the other department.17:46
fungiwell, or somehow nodepool isn't successfully deleting nodes sometimes and isn't telling us?17:46
corvusit's only as good as openstack17:47
fungiindeed... so, flawless right? ;)17:47
fungii suppose someone with access to the nova logs may also be able to tell us whether there was an instance in our project/tenant matching that ip address and time17:49
corvusclarkb: then i believe your idea sounds pretty good, though nc may not be installed...17:50
mgagnefungi: I don't think the IPs are logged in Nova. Maybe Neutron port ID which is now deleted.17:50
clarkbcorvus: ya maybe even just have the executor open a socket?17:50
clarkbI'm not sure what the best implementation is there but having it run from executor to test node would catch open ports17:51
corvusyeah, that sounds better17:51
corvusand wrap it in a 'when: nodepool.provider==inap' sort of thing17:51
corvusor i guess we could just do 'when: nodepool.public_ipv4'17:51
clarkbya catching it in another cloud might be quicker17:52
corvusyeah that17:52
clarkbsincei n theory this should happen everywhere we schedule those job(s)17:52
Shrewsfungi: if nodepool had a persistent problem with deleting nodes, it would show up in a diminishing quota, i would think17:52
fungiyeah, agreed17:53
*** michael-beaver has quit IRC17:55
fungireading rfc 1057 for some clue about how to tickle a response out of it17:55
fungiappendix a indicates it's a binary protocol17:57
*** lbragstad_ has joined #openstack-infra17:58
fungipart of the challenge with the netcat idea is that with udp you're not going to be able to determine the service is listening without sending something to it which elicits a response17:58
fungithere's no handshake to be able to say "there's something listening there which i've established a connection with"17:59
clarkbfungi: right we need a resposne to confirm it is there17:59
fungiwhich, my initial skimming of rfc 1057 appendix a suggests we'll need to send it an 8-bit packed structure18:00
*** lbragstad has quit IRC18:00
johnsomYou might also consider rpcinfo18:00
*** derekh has quit IRC18:00
fungioh! right you are ;)18:00
fungiusing an actual client which speaks the protocol may make more sense18:00
fungijohnsom with the obvious answers18:01
corvuswe'll need to install that on the executors, but should be fine18:01
fungiyeah, would need the rpcbind package18:02
fungiwhich will additionally pull in libtirpc118:02
fungiso not especially onerous18:02
fungiand syntax should just be `rpcinfo W.X.Y.Z`18:03
johnsomI think it's rpcinfo -p <ip>18:03
*** jaosorior has quit IRC18:04
fungi-p tells it to use rpcbind v2 (portmapper) protocol18:04
fungibut yeah18:04
johnsomAh, yeah, sorry, a bit rusty on the old portmap stuff18:05
fungisame, i ceased running nfs servers some years ago18:05
*** ociuhandu has quit IRC18:05
fungii think by default modern rpcbind tries version 4 of the protocol if you don't specify18:05
openstackgerritJames E. Blair proposed opendev/system-config master: Add rpcbind to executors  https://review.opendev.org/69947418:06
corvusmay as well get that started18:07
mgagnefungi, corvus: I found timestamp of IP usage: http://paste.openstack.org/show/FZKSHVmW9iGnhL5Kzgrb/18:08
mgagnebased on port.create.end event in Neutron.18:08
fungithanks! so that's creation time and instance uuid?18:11
mgagnethe created_at field as found in the event for the neutron port.18:12
mgagneand device_id found in Neutron port, so this should be the instance UUID.18:12
corvusthat doesn't have the 22:25 event i found?18:12
mgagneit seems not. I'll check again to make sure18:13
fungii also don't see that uuid appearing in our nodepool launcher debug log for that timeframe18:13
fungioh, wait, here we go18:14
fungi2019-12-11 00:35:19,329 DEBUG nodepool.NodeLauncher: [node: 0013326094] Waiting for server 57205139-e4de-473e-9299-6052f6e558a9 for node id: 001332609418:14
fungi2019-12-11 01:12:03,456 INFO nodepool.DeletedNodeWorker: Deleting used instance 57205139-e4de-473e-9299-6052f6e558a9 from inap-mtl0118:14
corvuswhat's the idea here?  do we want to gather a set of builds within a window around the time and try to cross that with a second report to narrow it down?18:15
fungi2019-12-11 00:40:34,995 DEBUG nodepool.NodeLauncher: [node: 0013326094] Node 0013326094 is running [region: mtl01, az: nova, ip: 198.72.124.78 ipv4: 198.72.124.78, ipv6: , hostid: 198627a086ccf9500de7782a9fdb952c2599f13005bf30b2f02bcb9f]18:15
*** rlandy has quit IRC18:15
mgagnemaybe trying to correlate with a job name?18:15
fungiohh18:15
fungiwrong date18:16
corvusif so, what's the window size we want?  are we thinking it's a EST/UTC difference? or...?18:16
*** rlandy has joined #openstack-infra18:16
mgagneI asked again the other department, they said: "The timestamp is detection time and is UTC"18:16
fungii was looking at 2019-12-11 and thinking 2019-12-12, sorry18:16
mgagnewell, maybe VM got created ~3-4 hours ago before being used and/or detected.18:17
fungimgagne: so no instances created on 2019-12-12 in that timeframe?18:17
mgagnenothing before 6am18:17
corvusokay, so we have a scanning process that definitively identifies an ip+time, and a nodepool process that definitively says that ip address was not in use at that time.18:18
fungiyeah, the examples there end at 18:02 utc18:18
corvusit seems the neutron logs are backing up the nodepool logs.18:18
mgagnebefore 12:00:18:18
fungiwe don't have a neutron log entry confirming deletion of the port for 619ae072-a88d-411a-9c12-4b0f564219f7 i guess18:19
corvuswell, that's not the last use anyway.  3e7996a7-8e14-49f8-a13a-f0a7620bc17e is18:19
fungiso it's theoretically possible that instance was created at 2019-12-11T18:02:27Z and still running 6+ hours later when the scan claims to have hit it18:19
mgagnea240db8a-dc9f-4379-8d4e-f35fc9a1ea05 2019-12-12T07:00:48Z18:19
mgagneb9bc23fb-13ae-42cb-9ab9-4d3457bf5450 2019-12-12T09:40:06Z18:19
mgagnea14b014d-c190-4f86-9a28-491d5170de81 2019-12-12T11:39:23Z18:19
fungicorvus: right, and that's not included18:20
fungiwhich suggests there's still some missing data18:20
corvusif we want to examine the idea that a node existed for longer than nodepool thinks, then we should look at 3e7996a7-8e14-49f8-a13a-f0a7620bc17e18:25
corvuswhich is the usage immediately before the report time18:25
corvusideally, mgagne will find the missing neutron logs that confirm that was the immediate prior use.  but if not, we should also look at 619ae072-a88d-411a-9c12-4b0f564219f718:26
corvusi will find the build for 3e7....18:26
clarkbI'll admit to having gotten a bit lost in the discussion. Let me know if there is anything I can do to help.18:29
mgagne3e7996a7-8e14-49f8-a13a-f0a7620bc17e had 198.72.124.78 with port created_at 2019-12-11T22:25:47Z18:29
mgagneSo it seems the timestamp I'm used to filter event isn't right. It's a unix timestamp.18:29
corvusgreat, that means nodepool and neutron agree about the node immediately prior18:29
corvusi'm still tracking down that build (it will take a minute, it appears to be an incomplete build)18:29
corvusalso, amusingly, i have managed to open the maximum number of X windows on my workstation18:30
clarkbthats a thing?18:31
corvusapparently!  i couldn't open any more windows until i closed some.  maybe it's an xfce thing?18:32
clarkbTIL18:32
*** jpena is now known as jpena|off18:33
corvushttps://storage.gra.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_c3b/698145/4/gate/cross-osc-tox-docs/c3b9dba/18:33
corvusthat job seems particularly unlikely to do anything weird with port 11118:33
mgagnecorvus: somehow the port.create.end event is missing for the 3e7996a7-8e14-49f8-a13a-f0a7620bc17e instance. That's why I missed it.18:34
corvusmgagne: could that indicate some kind of error in neutron?  perhaps one where the ip is assigned to something without correct auditing?18:35
mgagnethe auditing is done by stacktach. that's where I'm looking at.18:36
mgagneI'll grep for something else, the IP looks to be in the compute.instance.create.end payload too.18:36
corvuson the inap mirror, i see no requests from 198.72.124.78 on 2019-12-12 before 06:04:2518:38
*** pcaruana has quit IRC18:38
mgagneI think I have a better query now: http://paste.openstack.org/show/P8WtmyvKpdaABlXCXOgr/18:40
corvusand the last request before that is 22:32:0618:40
corvuswhich does correspond with the pip install at https://storage.gra.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_c3b/698145/4/gate/cross-osc-tox-docs/c3b9dba/job-output.txt18:41
mgagneso last one is 3e7996a7-8e14-49f8-a13a-f0a7620bc17e18:41
mgagnewhich got deleted at "2019-12-11T22:35:29.000000". So this makes no sense.18:42
corvusso nodepool and neutron agree that 3e7996a7-8e14-49f8-a13a-f0a7620bc17e is the most recent instance with that ip before the reporting time.  the mirror and build logs confirm that the expected node did have that ip and was using it during the time that nodepool expected that node to have the ip.  nodepool says nothing had it for 6 hours after that (the report time is during that window).  and the18:43
corvusmirror confirms that nothing was accessing it from that ip during that window (so that's a confirmation external to nodepool that no rogue job had started after 3e7996a7-8e14-49f8-a13a-f0a7620bc17e)18:43
fungicool, this sounds like actionable feedback for the security department in that case18:44
fungithanks for investigating!18:45
*** panda has quit IRC18:49
*** panda has joined #openstack-infra18:49
openstackgerritClark Boylan proposed zuul/zuul-jobs master: Fix ansible use of filters and tests  https://review.opendev.org/69947818:50
*** jamesmcarthur has quit IRC18:51
mgagnecorvus, fungi: I checked and there is no rogue instance with that IP address. Only one is a legit instance referenced in Nova.18:54
*** dave-mccowan has joined #openstack-infra18:54
fungiyeah, the mystery deepens18:54
fungistarting to seem increasingly likely they reported the wrong address or wrong timestamp18:55
fungi(or that the timestamp is uselessly inaccurate in some way at least)18:55
fungihere is the exact text of what they sent us: http://paste.openstack.org/show/787687/18:57
fungier, sorry, http://paste.openstack.org/show/787688/ is slightly cleaner, the previous one had a linewrap added by my mua18:58
*** jaosorior has joined #openstack-infra19:00
*** ralonsoh has quit IRC19:04
*** pcaruana has joined #openstack-infra19:18
*** hashar has joined #openstack-infra19:31
*** eharney has quit IRC19:45
*** smarcet has joined #openstack-infra19:45
*** ianw is now known as ianw_pto19:50
ianw_ptofyi i'm on PTO until Jan 13 now ... never that far from a computer but won't be actively pushing on stuff19:51
fungii'll be away from the computer starting tomorrow, until new year's eve19:52
ianw_ptosounds fun!19:53
fungithat's the general idea ;)19:54
*** jamesmcarthur has joined #openstack-infra19:55
*** pcaruana has quit IRC19:59
AJaegerfungi, ianw_pto, enjoy your vacations!20:10
*** jamesmcarthur has quit IRC20:11
*** jamesmcarthur has joined #openstack-infra20:12
fungithanks!20:16
clarkbbefore everyone disappears we should fix the mysql backups20:18
clarkbfungi: were you working on that or should someone else take a look?20:18
mordredclarkb: from meeting scrollback - yes, just bumping the gitea tag number in the dockerfile should be all that's needed20:18
openstackgerritMonty Taylor proposed opendev/system-config master: Bump gitea version to 1.10.1  https://review.opendev.org/69949020:19
mordredclarkb: ^^ there ya go20:19
clarkbmordred: no template delta?20:19
*** jamesmcarthur has quit IRC20:19
clarkbI'm going to eat lunch then start looking at reviews20:19
mordredclarkb: will go check20:19
mordredclarkb: https://review.opendev.org/#/c/699406 is needed for the gerrit stack20:20
mordred(and the one right after it)20:20
mordredhttps://review.opendev.org/#/c/69051120:20
mordredthe rest of the stack is reviewed and green20:21
mordred(I'll keep poking at getting that in and moving forward over the slow period)20:21
clarkbboth of those changes lgtm20:21
mordredclarkb: also - no template changes in 1.10.120:24
*** Lucas_Gray has joined #openstack-infra20:25
clarkbcool20:25
clarkbnow that is really curious20:27
clarkbmysqlclient on etherpad is working find20:27
clarkb*fine20:27
clarkbit has no ssl config but does not require it20:27
clarkbmysql --version reports it has the same mysqlclient version as review.o.o20:27
clarkbmordred: ^ can you help decipher that and the way to fix db backups on review.o.o?20:27
mordreduhm. sure!20:28
mordredwhere are we seeing issues with ssl?20:28
clarkbmordred: on review.o.o if you try to connect to the gerrit db using mysql client and the root user my.cnf it fails on ssl. iF you run mysql --skip-ssl it works20:29
clarkbmordred: doing the same on etherpad.o.o works fine without the --skip-ssl and it has the same mysqlclient version20:29
clarkbI'm guessing there is some piece of config somewhere we hve on review.o.o and not on etherpad or vice versa20:29
clarkb/etc/mysql/conf.d/client.conf exists on review.o.o but not etherpad. However it doesn't seem to say anything about ssl on review.o.o20:29
mordredI don't see any ssl config on review.o.o20:30
mordredyeah20:30
mordredthat said:20:30
mordredssl                               TRUE20:30
clarkbwhere do you see that?20:30
mordredlooking more20:30
clarkbkk20:30
mordredmysql --help20:30
clarkbthanks20:30
clarkbI'm going to eat and can help dig more after20:30
mordredclarkb: etherpad and review seem identical20:33
mordredboth show SSL TRUE with --no-defaults - so I think that's a red herring20:34
mordredmaybe the db for reviewdb is advertising ssl for some reason20:34
*** jamesmcarthur has joined #openstack-infra20:34
mordredand since ssl is true, the client is trying to use it - but there's no certs so it can't?20:34
clarkboh server + client setup affecting it?20:35
clarkbI guess if we set ssl off in yhe cnf that shoulf make it just work?20:35
mordredlemme try20:35
mordredyes20:35
clarkbok so update puppet-mysql_backup with that is our fix I think20:36
mordredclarkb: [client]\nssl=false fixes it20:36
openstackgerritMonty Taylor proposed opendev/puppet-mysql_backup master: Turn off ssl in my.cnf  https://review.opendev.org/69949420:37
mordredclarkb: ^^20:37
fungiclarkb: i doubt i have time to dig deeper into the mysql behavior change on review.o.o, but adding --skip-ssl to mysqldump in our cronjob should be sufficient?20:38
fungioh, or what mordred suggested now that i'm caught up on scrollback20:40
clarkbfungi: I think the my.cnf is better do that all client commands work20:40
fungithanks!20:40
*** pcaruana has joined #openstack-infra20:42
*** eharney has joined #openstack-infra20:42
*** jaosorior has quit IRC20:45
*** gfidente has quit IRC20:48
openstackgerritMerged zuul/nodepool master: Dockerfile: install nodepool-builder dependencies  https://review.opendev.org/69330620:50
*** smarcet has quit IRC20:54
*** pkopec has quit IRC20:55
openstackgerritMerged zuul/nodepool master: Add a container-with-releases functional test  https://review.opendev.org/69881820:58
*** armax has joined #openstack-infra20:59
*** kozhukalov has joined #openstack-infra21:02
mordredcorvus, fungi: have a sec for 2 quick patches? https://review.opendev.org/#/c/699406 and https://review.opendev.org/#/c/690511/ would be nice to land21:02
corvusdone21:03
*** kopecmartin is now known as kopecmartin|off21:04
*** smarcet has joined #openstack-infra21:05
*** jaosorior has joined #openstack-infra21:09
mgagnecorvus, fungi: so I checked with abuse/security department. Actual time is 2019-12-11 18:43:56 UTC. They will review the time conversion mapping settings.21:09
corvusmgagne: progress!  i'll look that up21:10
mgagnecorvus, fungi: So this would match a1f4fbbc-4d8a-4f17-8848-53e15da23819 @ 2019-12-11 18:28:17+00:0021:10
corvusmgagne: that does agree with nodepool; looking up the build now21:12
mordredcorvus: thanks!21:12
corvushere's the build: http://zuul.opendev.org/t/openstack/build/3d1da15f262f47c0b1d15a89904dc84921:13
corvusironic-tempest-ipa-partition-pxe_ipmitool-tinyipa21:13
mriedemclarkb: donnyd: another grenade job failed networking with fortnebula https://zuul.opendev.org/t/openstack/build/79107b8c3bac4fdba4d21a059311e9c3/log/logs/devstack-gate-setup-host.txt#340821:13
corvusclarkb, fungi: ^^ see convo with mgagne21:13
*** smarcet has quit IRC21:14
*** jaosorior has quit IRC21:15
mriedemhttp://logstash.openstack.org/#dashboard/file/logstash.json?query=message%3A%5C%22%7C%20localhost%20%7C%20FAILED%20%7C%20network_sanity_check%20%3A%20Perform%20ping%20check%20%7C%20rc%3D1%5C%22%20AND%20tags%3A%5C%22console%5C%22&from=7d - far and away FN for those failures21:15
mriedemlooks like mostly jobs that use devstack-gate, i don't know if there is something about how it does network setup checking that is different in the zuulv3 jobs21:16
fungicorvus: mgagne: thanks! in that case we probably need to forget the analysis i performed a week ago, as it was based on incorrect time assumptions and so likely selected an unrelated build (though maybe we still have logs from then and could redo the analysis?)21:18
fungii need to go meet some folks for dinner, but will hopefully be back in an hour or so21:18
mgagnefungi: abuse/security told me that a misconfiguration made it so now() was used instead of original report timestamp.21:18
*** smarcet has joined #openstack-infra21:18
clarkbmriedem: we know we had a network issue to the mirror early monday iirc21:20
clarkbhowever that timestamp is today21:20
mriedemyeah a spike in this failure since yesterday21:20
clarkbthe host is reachable via ipv4 externally21:20
clarkbdonnyd: ^ any chance there are still ipv6 route issues?21:20
donnydpossible21:20
clarkbfungi: corvus: and we need to work with ironic to figure out why their job is exposing 111?21:21
openstackgerritMatt Riedemann proposed opendev/elastic-recheck master: Add query for network_sanity_check ping check fail bug 1856760  https://review.opendev.org/69950321:21
openstackbug 1856760 in OpenStack-Gate "icmp_seq=1 Destination unreachable: Address unreachable causing built failures on fortnebula nodes" [Undecided,New] https://launchpad.net/bugs/185676021:21
mriedemthere are a handful of rax and 1 inap hit on that but 40 on FN21:21
clarkbhttps://f9c248a400bd30174240-0e9efd411d5f516ecd1b5a61c03e35b7.ssl.cf1.rackcdn.com/697585/6/check/ironic-tempest-ipa-partition-pxe_ipmitool-tinyipa/3d1da15/controller/logs/iptables.txt.gz is what iptables looks like at the end of the job I think21:22
donnydmaybe FN is just so awesome that I get all the jobs ;-)21:22
donnydLMAO21:22
donnydI reset the GW21:23
donnydthat was what i had to do to fix the CI project21:23
donnydmriedem: can you recheck please21:23
clarkbthat iptables ruleset doesn't appear to allow external udp to port 11121:24
mriedemdonnyd: done21:24
clarkbI see the host dropping other udp packets in syslog (different dpt)21:26
clarkbthat implies the firewall is generally working21:27
*** jamesmcarthur has quit IRC21:28
donnydi can reach all of the instances inbound21:28
*** kozhukalov has quit IRC21:29
mgagnefungi, corvus: time was AM, not PM21:30
mgagne2019-12-11 06:43:5621:30
clarkbok so different job? that would make sense as the identified job seems clean21:30
mgagnea59f143c-9006-4e92-8c05-e82f4864f03a @ 2019-12-11 06:00:45+00:0021:30
corvusok, i'll look that one up21:31
corvusthat also agrees with nodepool; node existed from 6:00 to 8:1821:32
corvushttp://zuul.opendev.org/t/openstack/build/6fc5285fdb76484b894f0d288facdbb221:35
corvusopenstack-helm-multinode-temp-ubuntu21:35
corvusthat's a 5-node job; the ip in question is the "primary" host21:35
clarkbI wonder if they use our "turn off the firewall" role for k8s21:35
*** Lucas_Gray has quit IRC21:37
corvuslooks like they have their own21:37
corvusthe result is this: http://zuul.opendev.org/t/openstack/build/6fc5285fdb76484b894f0d288facdbb2/console21:37
corvuser21:37
corvushttp://zuul.opendev.org/t/openstack/build/6fc5285fdb76484b894f0d288facdbb2/console#2/1/7/primary21:37
clarkbthat does look like an empty ruleset21:38
clarkbhttp://zuul.opendev.org/t/openstack/build/6fc5285fdb76484b894f0d288facdbb2/console#2/1/6/primary does the rule update I think21:39
clarkbnow I guess we have to decide if disabling the firewall is a valid tactic for dealing with k8s21:40
*** pcaruana has quit IRC21:41
clarkbdoes k8s do anything with low ports?21:42
clarkbdns maybe?21:42
clarkbpossible that we can open 1024 and above21:42
clarkbbut leave the low ports firewalled off21:42
openstackgerritMerged zuul/zuul-jobs master: Fix ansible use of filters and tests  https://review.opendev.org/69947821:44
clarkbwe could resurrect the idea of relying on security groups21:45
clarkbthat is probably going to be most zuul user friendly21:46
clarkbbut will need careful application21:46
*** rcernin has quit IRC21:46
corvusor modify that job to add a reject rule for 11121:46
clarkbcorvus: ya but its going to be a similar issue with dns resolvers and other services that can be exploited for reflection attacks21:47
clarkbtrying to do it port by port seems like we will always be behind where we want to be21:47
clarkbI guess its largely just udp services though21:49
clarkbmaybe we can accept tcp?21:49
clarkband block udp21:49
*** smarcet has joined #openstack-infra21:51
openstackgerritMerged opendev/system-config master: Update bazel to version 1.2.0  https://review.opendev.org/69940621:52
openstackgerritMerged opendev/elastic-recheck master: Add query for network_sanity_check ping check fail bug 1856760  https://review.opendev.org/69950321:52
openstackbug 1856760 in OpenStack-Gate "icmp_seq=1 Destination unreachable: Address unreachable causing built failures on fortnebula nodes" [Undecided,New] https://launchpad.net/bugs/185676021:52
clarkbmnaser: ^ you may have thoughts on that since you are doing a fair but of zuul + k8s intersection work too iirc21:52
* mnaser reads21:52
*** dpawlik has quit IRC21:53
*** ijw has joined #openstack-infra21:54
*** stevebaker_ is now known as stevebaker21:54
mnaserI don’t know why they aren’t blocking that port, there’s no need to unblock it21:54
mnaserThe cluster DNS generally runs in the overlay network so that should affect things21:54
clarkbmnaser: well they are unblocking everything. And I think corvus decided this was a reasonable action for zuul's k8s jobs too21:55
*** rascasoft has joined #openstack-infra21:55
clarkbmnaser: hrm if things run in overlays I guess its just the cluster networking and not the hosted service networking we'd need to worry about?21:55
clarkbapi access, the overlays themselves, etc21:55
mnaserI guess depending on the overlay that you’re using all you might realistically need to open are high number ports like 30k and above if you’ll be exposing NodePort services21:56
mnaserAnd obviously technically Kubernetes can let you run containers that use the host networking so that would entirely skip the overlay21:56
clarkbour multinode networking does open all ports between nodeset members though so we must be missing something21:57
mnaserWhat overlay is being used?  I’m on mobile so I can dig deeper right now21:57
corvusclarkb: that is not a "multinode" job21:57
corvusit's just a job with multiple nodes21:57
clarkbcorvus: ah21:57
clarkbmnaser: re overlay I have no idea its an osh job21:57
clarkbmnaser: for zuul its whatever minikube deploys by default21:57
*** diablo_rojo has quit IRC21:58
mnaserah well my guess is “it’s the easiest way to avoid figuring out why it ain’t working” was the case for osh21:58
corvus(ie, it does not inherit from the job "multinode")21:58
clarkbso ya they probably couldn't get nodeset nodes to talk to each other to make a cluster. Then disabled the firewall and now it works :/21:59
clarkbthough there likely is additional firewall tweaking necessary on top of what multinode firewall setup would give you22:00
*** ijw has quit IRC22:04
*** panda has quit IRC22:07
*** panda has joined #openstack-infra22:10
clarkbtosky: left a couple notes on https://review.opendev.org/#/c/548936 I don't think anything is a hard -1 though the hosts: all may need to be updated if things are reliable that way22:11
clarkbtosky: for the grenade.sh vs devstack.sh I think I'm mostly looking for a bit more of the reason why that change is made.22:11
clarkbcorvus: mnaser fungi mgagne maybe the thing to do is start a thread on openstack-discuss about this issue and see if people that know k8s can suggest an appropriate solution?22:13
toskyclarkb: uhm, I'm not sure I get the second comment; is that about letting devstack roles do the devstack part?22:15
toskyrun-grenade only runs grenade.sh, excluding the installation part which is already done, and it should do it as it was done before22:16
openstackgerritClark Boylan proposed opendev/puppet-mysql_backup master: Turn off ssl in my.cnf  https://review.opendev.org/69949422:17
clarkbmordred: fungi corvus ^ that fixes the puppet linter errors22:17
toskythe point of splitting was already there since the first change done by Andrea, and I guess it was to reduce the duplication22:17
toskythe ansible roles in devstack.git already know how to deploy22:17
clarkbtosky: well in the old d-g driver it runs grenade.sh and it does the installation and the upgrade22:17
clarkbtosky: this change runs stack.sh for the installation then grenade.sh only for the upgrade22:18
toskyyes, exactly22:18
clarkband ya I guess that is probably why? we arleady have a devstack role for the upgrade?22:18
clarkber installation22:18
toskythat was my understanding when I continue Andrea's work22:18
clarkbgot it22:18
clarkbcorvus: if you think that would be a good next step for the port 111 thing I can write that email to openstack-discuss22:19
toskyas there will be other reviews, I will add some notes22:20
toskywhat is the record for the review with the highest amount of changesets? :)22:20
corvusclarkb: ++22:20
*** rh-jelabarre has quit IRC22:26
*** dave-mccowan has quit IRC22:30
*** diablo_rojo has joined #openstack-infra22:35
*** slaweq has quit IRC22:37
sshnaidmI have an error in one of jobs, FYI: ERROR: Could not install packages due to an EnvironmentError: HTTPSConnectionPool(host='opendev.org', port=443): Max retries exceeded with url: /openstack/requirements/raw/branch/master/upper-constraints.txt (Caused by NewConnectionError('<pip._vendor.urllib3.connection.VerifiedHTTPSConnection object at 0x7f3a8e21c110>: Failed to establish a new connection: [Errno 101] Network is unreachable',))22:40
sshnaidmhttps://43a0ea1d907a03afa010-dc7530dd32f75128221ba69b2574d743.ssl.cf5.rackcdn.com/699314/1/check/tripleo-ci-centos-7-scenario001-standalone/5c69614/job-output.txt22:40
*** rcernin has joined #openstack-infra22:42
clarkbsshnaidm: any idea why it is trying to reach a remote resource in the first place?22:43
clarkbthe command shows it is using a local constraints file22:43
sshnaidmclarkb, no idea.. maybe because of -U ?22:44
clarkbpackages can't provide their own constraints though, they have to be provided directly to the command. This is odd22:45
clarkbsshnaidm: that said network is unreachable implies to me a layer 3 or below problem22:46
clarkblike you don't have a route to the opendev.org ip addresses22:46
sshnaidmI hope it's just a sort of glitch, will keep eye on it22:48
clarkbsshnaidm: do you capture any networking info for the host at the end of the job22:49
sshnaidmclarkb, oh, yeah :) https://43a0ea1d907a03afa010-dc7530dd32f75128221ba69b2574d743.ssl.cf5.rackcdn.com/699314/1/check/tripleo-ci-centos-7-scenario001-standalone/5c69614/logs/undercloud/var/log/extra/network.txt.gz22:50
sshnaidmhttps://43a0ea1d907a03afa010-dc7530dd32f75128221ba69b2574d743.ssl.cf5.rackcdn.com/699314/1/check/tripleo-ci-centos-7-scenario001-standalone/5c69614/logs/undercloud/var/log/extra/netstat.txt.gz22:50
clarkbsshnaidm: https://43a0ea1d907a03afa010-dc7530dd32f75128221ba69b2574d743.ssl.cf5.rackcdn.com/699314/1/check/tripleo-ci-centos-7-scenario001-standalone/5c69614/zuul-info/zuul-info.primary.txt is the data from the beginning of the job22:50
sshnaidmclarkb, in the beginning all was good, traceroute to opendev.org worked22:51
clarkbyup22:52
clarkband the routes at the end look similar (just additions for the br-ex network)22:52
clarkbI wonder if it tried to do ipv6 for some reason22:52
clarkbor maybe the firewall dropped the packets22:52
openstackgerritMerged opendev/system-config master: Use explicit image paths  https://review.opendev.org/69051122:52
clarkbsyslog doesn't log any dropped port 443 packets22:54
clarkbI know in the past there have been dns issues that looked like other problems22:54
clarkbI wonder if this could actually be a name resolution problem?22:54
clarkbmostly it looks like it isn't even really trying to make the tcp connection. its just deciding very quickly it can't do it for some reason22:55
clarkbrather than sending a bunch of SYNs looking for a friend22:55
clarkbbut there was no ACK to find22:55
sshnaidmwe have unbound log: https://43a0ea1d907a03afa010-dc7530dd32f75128221ba69b2574d743.ssl.cf5.rackcdn.com/699314/1/check/tripleo-ci-centos-7-scenario001-standalone/5c69614/logs/undercloud/var/lib/unbound/unbound.log.txt.gz23:00
sshnaidmseems like it resolves23:01
sshnaidmhmm.. but AAAA23:01
sshnaidmand with A: 38.108.68.12423:02
*** armax has quit IRC23:07
*** dklyle_ is now known as dklyle23:12
*** tkajinam has joined #openstack-infra23:13
openstackgerritClark Boylan proposed openstack/devstack-gate master: Bump ANSIBLE_VERSION default to 2.7.14  https://review.opendev.org/69946323:15
clarkbmriedem: ^ I think that edit is needed for the linter job23:15
openstackgerritKendall Nelson proposed openstack/cookiecutter master: Update CONTRIBUTING.rst template  https://review.opendev.org/69600123:16
openstackgerritKendall Nelson proposed openstack/cookiecutter master: Update CONTRIBUTING.rst template  https://review.opendev.org/69600123:17
clarkbsshnaidm: thinking a bit more we might want to sort out where the remote constraints file comes from as that may give us a hint to why it is failing23:18
clarkbbasically pip is acting in an unexpected manner and working backward from the known unexpected thing may help23:18
sshnaidmyeah, possibly23:19
*** sgw has quit IRC23:19
*** hashar has quit IRC23:20
clarkbhttps://opendev.org/openstack/ansible-role-python_venv_build/src/branch/master/tasks/python_venv_install.yml#L100-L121 this is the task that is failing right?23:21
clarkbI wonder if you can have transitive constraints and that is how this happens23:22
clarkbone of those local files lists the remote23:22
clarkbyes that is what happens23:24
clarkbvia https://opendev.org/openstack/openstack-ansible/src/branch/master/playbooks/utility-install.yml#L25 and https://opendev.org/openstack/openstack-ansible/src/branch/master/playbooks/utility-install.yml#L126-L12723:25
clarkbthats not totally confusing at all23:25
clarkbalso in a testing environment its wrong to go to the network for that when the repo is available locally23:25
openstackgerritMerged opendev/puppet-mysql_backup master: Turn off ssl in my.cnf  https://review.opendev.org/69949423:25
clarkbbut at least we undersatnd that bit now23:25
fungiand i guess nothing is passing a requirements_git_url value in to that playbook23:27
*** dchen has joined #openstack-infra23:28
clarkbwell and it should just be a file path (though I guess you can express that as a url too)23:28
fungiright, it could just be the path to the file in the checked-out copy of openstack/requirements23:29
fungiwhich would also make it possible to successfully depends-on proposed changes to the constraints file too23:29
clarkbgitea03 seems sad, but its universally sad and not just against that single url23:30
clarkbit was just restarted23:30
clarkblooks like due to a mariadb update23:30
*** ociuhandu has joined #openstack-infra23:30
clarkbthat file loads fine from it now and all of the others23:31
clarkb(the speed at which it fails really makes me suspect local networking though)23:31
*** ociuhandu has quit IRC23:35
*** smarcet has quit IRC23:48

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!