8.7 KiB
Introduction
punchd
(short for "hole punching daemon") provides TCP and UDP hole punching as a service to applications. This allows two applications on different hosts, both behind NAT and / or firewalls, to establish a direct communication channel. The motivation is to help peer-to-peer applications improve their connectivity.
How does it work?
The idea is to try out a sequence of well known hole punching techniques until one of them succeeds. punchd
implements the following techniques:
- tcp-syni (TCP hole punching based on SYN injection), DOI: 10.1109/NCA.2011.66
- tcp-nutss (TCP hole punching, the non-spoofing approach described in section 4.2.2 of the paper), DOI: 10.1145/1016707.1016715
- udp (simple UDP hole punching)
The assumption of those techniques is that there is a side-channel (i.e. a rendezvous server) between the two hosts. A peer that wants to be available for hole punching needs to constantly be connected to the rendezvous server so it can be notified by other peers about new hole punching attempts. In addition it needs to provide an endpoint (public IP address and port) to other peers before those can initiate the hole punching. See the example applications for a naive rendezvous server implementation.
How can I use punchd?
Applications can communicate with punchd
through a unix domain socket. An application can call one of punchd
's API functions to either start a hole punching attempt (initiate
) or react to an attempt started by another peer (respond
). After calling an API function punchd
will report back a status (either ok
, progress
or error
). After a successful hole punching attempt, punchd
will pass a socket to the application which can be used immediately to communicate with the other peer. The full punchd
API is described in section The punchd API
.
The figure below shows the workflow of a successful hole punching attempt between two peers A (initiator) and B (responder).
+--------------------------+
| Rendezvous server (RS) |
+--------------------------+
Λ |
| 4
3 |
| V
+----------------------------+ +----------------------------+
| Application at peer A (AA) | | Application at peer B (AB) |
+----------------------------+ +----------------------------+
| Λ Λ | Λ
1 | | 5 |
| 2 6 | 7
V | | V |
+-------------------------+ +-------------------------+
| punchd at peer A (PA) | | punchd at peer B (PB) |
+-------------------------+ +-------------------------+
initiate
:AA
asksPA
to initiate the hole punching toAB
progress
:PA
reports progressAA
sends notification about the hole punching attempt toRS
RS
forwards notification toAB
respond
:AB
asksPB
to respond to the hole punching attemptok
:PA
reports success and passes a connected socket toAA
ok
:PB
reports success and passes a connected socket toAB
Installation
No packages exist yet, so installation has to be done manually to the desired location after running nimble install
. Starting punchd
is as simple as
$ sudo ./punchd
The punchd API
Applications can exchange messages with punchd
through a unix domain socket. By default this is /tmp/punchd.socket
. The message format is described in the next section. After that an example message exchange is given. Finally we need to understand how to receive a successfully connected socket from punchd
.
The message format
A punchd
message is a string with a maximum length of 500 bytes followed by
a line feed character ('\n'). The string consists of fields (positional
arguments) separated by '|'.
An application can send two types of messages (requests) to punchd
:
"initiate" messages (ask punchd
to start a new hole punching attempt) or
"respond" messages (ask punchd
to react to the hole punching attempt started
by another peer). These messages have the following formats:
initiate|ID|TECHNIQUE|IP_FROM|PORTS_FROM|IP_TO|PORTS_TO
respond|ID|TECHNIQUE|IP_FROM|PORTS_FROM|IP_TO|PORTS_TO|EXTRA_ARGS
The fields have the following semantics:
ID
: the message ID (random string)TECHNIQUE
: the hole punching technique, one out of "tcp-syni", "tcp-nutss" and "udp"IP_FROM
: the public IP address of the initiating peerPORTS_FROM
: a list of previously used source ports of the initiating peerIP_TO
: the public IP address of the responding peerPORTS_TO
: a list of previously used source ports of the responding peerEXTRA_ARGS
: technique-specific arguments
For each request (identified by a message ID) punchd
will report back a
status (containing the same message ID). The status can be "ok" (hole punching
was successful), "progress" (hole punching is in progress, the other peer has to
be notified) or "error" (the hole punching failed). These status messages have
the following formats:
ok|ID
progress|ID|TECHNIQUE|IP_FROM|PORTS_FROM|IP_TO|PORTS_TO|EXTRA_ARGS
error|ID|ERROR_MSG
The fields in the "progress" message are the same that can be found in the
"respond" message described above. This is because the application is expected
to forward them to the other peer through the rendezvous server so the other
peer can call "respond" on its punchd
instance.
Example
An example session between two applications AA and AB on two different peers (AA
has the endpoint 1.2.3.4:1234
; AB has the endpoint 5.6.7.8:5678
) and
their respective punchd
instances (PA and PB) might look like this:
AA -> PA: initiate|645|tcp-syni|1.2.3.4|1234|5.6.7.8|5678,5679,5680
AA <- PA: progress|645|tcp-syni|1.2.3.4|1234|5.6.7.8|5678,5679,5680|344567,644456,345365
--- AA sends notification to AB ---
AB -> PB: respond|538|tcp-syni|1.2.3.4|1234|5.6.7.8|5678,5679,5680|344567,644456,345365
AB <- PB: ok|538
AA <- PA: ok|645
Receiving sockets using sendmsg/recvmsg
When punchd
successfully has punched a hole, i.e. has a socket that is
connected to the other peer, it will report back status "ok" to the application.
But how can the application now use the socket? The answer is: punchd
has to
pass the corresponding file descriptor to the application's process. On Unix
systems that is possible using the system calls sendmsg
(on punchd
's
side) and recvmsg
(on the application's side). This allows punchd
passing a control message, a.k.a. ancillary message, of type SCM_RIGHTS
along with the actual contents of the status message. This control message will
contain the file descriptor.
The mechanism of passing file descriptors to other processes is described in the man pages unix(7), cmsg(3) and recv(2). The example applications can be used as a reference too. After receiving the file descriptor the application can immediately use it to send data to the other peer.
FAQ
-
Q: Why does
punchd
only offer a weird socket API and not REST? A:punchd
needs to pass a file descriptor to the application. On Unix systems that is only possible using thesendmsg
system call with an ancillary message of typeSCM_RIGHTS
on a unix domain socket. See the unix(7) man page. -
Q: Why does
punchd
need root permissions? A: Most hole punching techniques require elevated privileges because they need raw sockets (e.g. for capturing TCP sequence numbers) or they need to create firewall rules (some techniques require sending out low-TTL packets; the resulting ICMP time-exceeded responses have to be filtered out or else TCP connections will fail). On linuxpunchd
can run as a non-root user though ifpunchd
is started with theCAP_NET_RAW
andCAP_NET_ADMIN
capabilities. -
Q: Which one is the best hole punching technique? A: No hole punching technique will work for all environments. In fact there are environments where no hole punching will be possible at all. The goal of this project is to find a good sequence of hole punching techniques to be tried one after the other. A lot of research still has to be done. See the example applications for some proposed sequences.