Host Namespace Sockets (i.e. such as Wireguard) can cause FQDN Proxy Port Collisions about cilium HOT 23 OPEN

tommyp1ckles commented on September 26, 2024 1

Host Namespace Sockets (i.e. such as Wireguard) can cause FQDN Proxy Port Collisions

from cilium.

Comments (23)

gandro commented on September 26, 2024 3

So this is not necessarily a WireGuard issue. Here is an example of opening a UDP server on the host network namespace which also can cause conflicts with transparent proxy mode:

root@kind-worker:/home/cilium# python3
Python 3.10.12 (main, Nov 20 2023, 15:14:05) [GCC 11.4.0] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> from socket import *
>>> 
>>> serverSocket = socket(AF_INET, SOCK_DGRAM)
>>> 
>>> serverSocket.bind(('', 12000))
>>>

Verifying that the server is running:

root@kind-worker:/home/cilium# ss -ulpn | grep 12000
UNCONN 0      0            0.0.0.0:12000      0.0.0.0:*    users:(("python3",pid=1049,fd=3))

Running a dig with client port 12000 (in the pod namespace):

$ kubectl -n cilium-test exec client-59c486cb54-9f74n -- dig -b '10.244.1.163#12000' google.com
;; Warning: ID mismatch: expected ID 38409, got 23676

Error in the cilium pod:

$ kubectl -n kube-system logs cilium-96j2c | grep level=error
level=error msg="Cannot forward proxied DNS lookup" DNSRequestID=38409 dnsName=google.com. endpointID=1079 error="failed to dial connection to 10.244.2.203:53: dial udp 10.244.1.163:12000->10.244.2.203:53: bind: address already in use" identity=19156 ipAddr="10.244.1.163:12000" subsys=fqdn/dnsproxy
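The collision above can be reproduced in isolation. The following is a minimal sketch, not Cilium's proxy code: it assumes Linux, uses the loopback address instead of a pod IP, and the helper name try_bind_collision is made up for illustration. One UDP socket binds the wildcard address on a free port (like the Python server above), then a second socket tries to bind a specific address on that same port, which is roughly what the DNS proxy attempts when it reuses the client's source address and port.

```python
import errno
import socket


def try_bind_collision():
    """Return the errno raised when a second UDP socket binds an in-use port.

    Hypothetical helper for illustration; returns None if no error occurs.
    """
    first = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
    second = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
    try:
        # Bind the wildcard address; port 0 lets the kernel pick a free port.
        first.bind(("", 0))
        port = first.getsockname()[1]
        try:
            # Binding a specific address on the same port conflicts with the
            # wildcard bind because neither socket sets SO_REUSEADDR/REUSEPORT.
            second.bind(("127.0.0.1", port))
        except OSError as exc:
            return exc.errno
        return None
    finally:
        first.close()
        second.close()


if __name__ == "__main__":
    result = try_bind_collision()
    print(errno.errorcode.get(result, "no collision"))
```

On Linux this should report EADDRINUSE, the same "address already in use" condition shown in the Cilium log line.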

from cilium.

kamikaze commented on September 26, 2024 1

got the same problem. is there any way to solve it?

Do you have the problem with WireGuard or with another host network socket?

There is no solution yet. We have discussed various solutions (e.g. for WireGuard changing the default port and providing a seamless migration path), but so far no easy fix has been found (besides turning off dnsproxy-enable-transparent-mode, which does have security implications for encryption though)

WireGuard

from cilium.

pengfeiopen commented on September 26, 2024 1

got the same problem
 level=error msg="Cannot forward proxied DNS lookup" DNSRequestID=41915 dnsName=xxxx.s3.cn-north-1.amazonaws.com.cn. endpointID=3109 error="failed to dial connection to 10.180.7.197:53: dial udp 10.180.8.200:8472->10.180.7.197:53: bind: address already in use" identity=7250 ipAddr="10.180.8.200:8472" subsys=fqdn/dnsproxy
Thank you! What is interesting about your log message is that the client pod (10.180.8.200) is using a source port outside the default ephemeral range (Linux usually uses source ports 32768–60999). Do you have any information about which software is selecting the source port for your client pod?

I set net.ipv4.ip_local_port_range = 1024 65535. Is port 8472 the one Cilium uses for VXLAN (UDP)?

from cilium.

WoodyWoodsta commented on September 26, 2024 1

Will give reserved ports a go.

Out of curiosity, does your client-side application not perform retries for failed DNS lookups? I would have expected clients to retry after a port collision, since DNS is not a reliable protocol after all

That's true, and it's not a service we own unfortunately. They have a very brittle HA implementation which appeared to completely fail as a result of not being able to look pods up.

from cilium.

openJT commented on September 26, 2024 1

It's empty

from cilium.

gandro commented on September 26, 2024

As discussed offline, this might not be a WireGuard problem. It seems that any port in the ephemeral range that is in use in the host network namespace might cause conflicts.

from cilium.

kamikaze commented on September 26, 2024

got the same problem. is there any way to solve it?

from cilium.

gandro commented on September 26, 2024

got the same problem. is there any way to solve it?

Do you have the problem with WireGuard or with another host network socket?

There is no solution yet. We have discussed various solutions (e.g. for WireGuard changing the default port and providing a seamless migration path), but so far no easy fix has been found (besides turning off dnsproxy-enable-transparent-mode, which does have security implications for encryption though)

from cilium.

pengfeiopen commented on September 26, 2024

got the same problem

 level=error msg="Cannot forward proxied DNS lookup" DNSRequestID=41915 dnsName=xxxx.s3.cn-north-1.amazonaws.com.cn. endpointID=3109 error="failed to dial connection to 10.180.7.197:53: dial udp 10.180.8.200:8472->10.180.7.197:53: bind: address already in use" identity=7250 ipAddr="10.180.8.200:8472" subsys=fqdn/dnsproxy

from cilium.

gandro commented on September 26, 2024

got the same problem

 level=error msg="Cannot forward proxied DNS lookup" DNSRequestID=41915 dnsName=xxxx.s3.cn-north-1.amazonaws.com.cn. endpointID=3109 error="failed to dial connection to 10.180.7.197:53: dial udp 10.180.8.200:8472->10.180.7.197:53: bind: address already in use" identity=7250 ipAddr="10.180.8.200:8472" subsys=fqdn/dnsproxy

Thank you! What is interesting about your log message is that the client pod (10.180.8.200) is using a source port outside the default ephemeral range (Linux usually uses source ports 32768–60999). Do you have any information about which software is selecting the source port for your client pod?

from cilium.

gandro commented on September 26, 2024

Thank you, yes, that explains it. Port 8472 is indeed also used by Cilium VXLAN, which of course can collide if you widen the ephemeral range by setting the port range to 1024-65535.
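To check whether a given port can be picked as an ephemeral source port, the range setting can be parsed directly. This is a minimal sketch: the function names are hypothetical, and it assumes the two-number whitespace-separated format that /proc/sys/net/ipv4/ip_local_port_range uses. It has to be run inside the pod's network namespace to reflect what the client actually sees.

```python
def parse_port_range(text):
    """Parse an ip_local_port_range value such as '32768\t60999'."""
    low, high = (int(part) for part in text.split())
    return low, high


def in_ephemeral_range(port, text):
    """Return True if the port lies inside the given ephemeral port range."""
    low, high = parse_port_range(text)
    return low <= port <= high


if __name__ == "__main__":
    # Assumption: Linux exposes the current range at this procfs path.
    path = "/proc/sys/net/ipv4/ip_local_port_range"
    try:
        with open(path) as handle:
            current = handle.read()
    except OSError:
        current = "32768 60999"  # Linux default mentioned in the thread
    for port in (8472, 51871):  # VXLAN and WireGuard ports from the thread
        print(port, in_ephemeral_range(port, current))
```

With the default range, 8472 falls outside and 51871 falls inside; with 1024 65535, both become candidates for ephemeral source ports.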

from cilium.

WoodyWoodsta commented on September 26, 2024

I'm hitting this problem with Wireguard enabled. I get about 5 collisions per hour (not acceptable as it can temporarily take entire services down).

I'm assuming a dirty workaround at the moment is to set the ip_local_port_range to 51872 65535, but then there is only a very small port range that the cluster can use?

from cilium.

gandro commented on September 26, 2024

I'm assuming a dirty workaround at the moment is to set the ip_local_port_range to 51872 65535, but then there is only a very small port range that the cluster can use?

Have you tried using net.ipv4.ip_local_reserved_ports? This might also be something we could implement on the Cilium CNI side to fix at least the WireGuard problem.

I get about 5 collisions per hour (not acceptable as it can temporarily take entire services down).

Out of curiosity, does your client-side application not perform retries for failed DNS lookups? I would have expected clients to retry after a port collision, since DNS is not a reliable protocol after all
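For clients that do not retry on their own, a retry wrapper is one possible mitigation on the application side. The sketch below is an assumption about how such a client could behave, not something taken from the affected service: resolve_with_retries and its parameters are hypothetical names. Each new lookup normally opens a new socket with a different ephemeral source port, so a retry is unlikely to hit the same colliding port.

```python
import socket
import time


def resolve_with_retries(resolve, name, attempts=3, delay=0.1):
    """Call resolve(name), retrying on OSError with a growing delay.

    resolve is any callable taking a hostname, e.g. socket.gethostbyname.
    """
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    for attempt in range(1, attempts + 1):
        try:
            return resolve(name)
        except OSError:
            # socket.gaierror is a subclass of OSError, so DNS failures
            # land here; re-raise once the last attempt has failed.
            if attempt == attempts:
                raise
            time.sleep(delay * attempt)


if __name__ == "__main__":
    try:
        print(resolve_with_retries(socket.gethostbyname, "localhost"))
    except OSError as exc:
        print("lookup failed:", exc)
```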

from cilium.

gandro commented on September 26, 2024

I've created a PR which will instruct Cilium to set net.ipv4.ip_local_reserved_ports for the WireGuard and VXLAN port by default: #32128

Hopefully this should improve the situation for most users. We still ought to think about a better solution long-term though.

from cilium.

WoodyWoodsta commented on September 26, 2024

FYI, I've set 51871 to be a reserved port, but I'm still seeing the binding failures on that port.

from cilium.

gandro commented on September 26, 2024

FYI, I've set 51871 to be a reserved port, but I'm still seeing the binding failures on that port.

Interesting, in my local testing net.ipv4.ip_local_reserved_ports did indeed prevent dig from using that port. The important thing is that the setting needs to be set in the pod network namespace, not the namespace of the host (since the setting is not inherited)

from cilium.

WoodyWoodsta commented on September 26, 2024

FYI, I've set 51871 to be a reserved port, but I'm still seeing the binding failures on that port.

Interesting, in my local testing net.ipv4.ip_local_reserved_ports did indeed prevent dig from using that port. The important thing is that the setting needs to be set in the pod network namespace, not the namespace of the host (since the setting is not inherited)

That'll be my issue - apologies. I had a feeling that was the case.

from cilium.

pengfeiopen commented on September 26, 2024

I updated Cilium to 1.14.11, but the problem still happens.
This is what I get when I run kubectl exec -it cilium-xxx and then sysctl net.ipv4.ip_local_reserved_ports:

root@ike-012:/home/cilium# sysctl net.ipv4.ip_local_reserved_ports 
net.ipv4.ip_local_reserved_ports = 30000-32767

30000-32767 is for NodePort

from cilium.

gandro commented on September 26, 2024

Please check the ip_local_reserved_ports inside the pod namespace, not the host namespace. What's the exact problem that you observe (which ports are affected? what's the error message)?

Edit: Also, because Cilium can't access the pod namespace, the workaround with ip_local_reserved_ports will only be set for new pods (i.e. pods created after Cilium has been updated). You will have to restart affected pods for the workaround to take effect.
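A quick way to verify the workaround from inside a client pod is to parse the reserved-ports value and test the port in question. This is a minimal sketch with hypothetical function names; it assumes the comma-separated list of ports and ranges that the kernel uses for ip_local_reserved_ports (for example 30000-32767 as shown above).

```python
def parse_reserved_ports(text):
    """Parse an ip_local_reserved_ports value into (start, end) tuples."""
    ranges = []
    for item in text.strip().split(","):
        item = item.strip()
        if not item:
            continue  # an empty value means no ports are reserved
        start, _, end = item.partition("-")
        ranges.append((int(start), int(end or start)))
    return ranges


def is_reserved(port, text):
    """Return True if the port is covered by any reserved entry."""
    return any(low <= port <= high for low, high in parse_reserved_ports(text))


if __name__ == "__main__":
    # Run this inside the pod network namespace, not on the host.
    path = "/proc/sys/net/ipv4/ip_local_reserved_ports"
    try:
        with open(path) as handle:
            current = handle.read()
    except OSError:
        current = ""
    for port in (8472, 51871):  # VXLAN and WireGuard ports from the thread
        print(port, "reserved" if is_reserved(port, current) else "not reserved")
```

If 51871 shows up as not reserved in a pod that was created after the Cilium upgrade, that would be worth reporting together with the output.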

from cilium.

openJT commented on September 26, 2024

Hello, I just updated to 1.15.5, my understanding was that the WireGuard port (51871) would/should be excluded by default. I still get errors like: bind: address already in use" identity=34960 ipAddr="10.0.2.92:51871" subsys=fqdn/dnsproxy. Fresh install. Searching documentation I cannot find how to exclude the port.

from cilium.

gandro commented on September 26, 2024

Hello, I just updated to 1.15.5, my understanding was that the WireGuard port (51871) would/should be excluded by default. I still get errors like: bind: address already in use" identity=34960 ipAddr="10.0.2.92:51871" subsys=fqdn/dnsproxy. Fresh install. Searching documentation I cannot find how to exclude the port.

Hi, the port should be automatically excluded without any configuration needed. Are you able to share a sysdump?

from cilium.

openJT commented on September 26, 2024

Sure, thank you.
cilium-sysdump-20240522-090917.zip

from cilium.

gandro commented on September 26, 2024

Sure, thank you. cilium-sysdump-20240522-090917.zip

Thanks! The sysdump itself looks alright. Unfortunately we don't dump the sysctl of the pods, the feature could probably use some additional logging for troubleshooting.

Could you share the output of cat /proc/sys/net/ipv4/ip_local_reserved_ports from one of your client pods (e.g. uptime-kuma-986f65945-bcjdx)?

from cilium.
