Hole Punching

http://www.brynosaurus.com/pub/net/p2pnat/

Peer-to-Peer Communication Across Network Address Translators

Bryan Ford
Massachusetts Institute of Technology
baford (at) mit.edu

Pyda Srisuresh
Caymas Systems, Inc.
srisuresh (at) yahoo.com

Dan Kegel
dank (at) kegel.com

J’fais des trous, des petits trous$\dots$
toujours des petits trous

– S. Gainsbourg

Abstract:

Network Address Translation (NAT) causes well-known difficulties for peer-to-peer (P2P) communication, since the peers involved may not be reachable at any globally valid IP address. Several NAT traversal techniques are known, but their documentation is slim, and data about their robustness or relative merits is slimmer. This paper documents and analyzes one of the simplest but most robust and practical NAT traversal techniques, commonly known as “hole punching.” Hole punching is moderately well-understood for UDP communication, but we show how it can be reliably used to set up peer-to-peer TCP streams as well. After gathering data on the reliability of this technique on a wide variety of deployed NATs, we find that about 82% of the NATs tested support hole punching for UDP, and about 64% support hole punching for TCP streams. As NAT vendors become increasingly conscious of the needs of important P2P applications such as Voice over IP and online gaming protocols, support for hole punching is likely to increase in the future.

1 Introduction

The combined pressures of tremendous growth and massive security challenges have forced the Internet to evolve in ways that make life difficult for many applications. The Internet’s original uniform address architecture, in which every node has a globally unique IP address and can communicate directly with every other node, has been replaced with a new de facto Internet address architecture, consisting of a global address realm and many private address realms interconnected by Network Address Translators (NAT). In this new address architecture, illustrated in Figure 1, only nodes in the “main,” global address realm can be easily contacted from anywhere in the network, because only they have unique, globally routable IP addresses. Nodes on private networks can connect to other nodes on the same private network, and they can usually open TCP or UDP connections to “well-known” nodes in the global address realm. NATs on the path allocate temporary public endpoints for outgoing connections, and translate the addresses and port numbers in packets comprising those sessions, while generally blocking all incoming traffic unless otherwise specifically configured.

Figure 1: Public and private IP address domains
\begin{figure}\centerline{\epsfig{file=twicenat.eps, scale=0.40}}\end{figure}

The Internet’s new de facto address architecture is suitable for client/server communication in the typical case when the client is on a private network and the server is in the global address realm. The architecture makes it difficult for two nodes on different private networks to contact each other directly, however, which is often important to the “peer-to-peer” communication protocols used in applications such as teleconferencing and online gaming. We clearly need a way to make such protocols function smoothly in the presence of NAT.

One of the most effective methods of establishing peer-to-peer communication between hosts on different private networks is known as “hole punching.” This technique is widely used already in UDP-based applications, but essentially the same technique also works for TCP. Contrary to what its name may suggest, hole punching does not compromise the security of a private network. Instead, hole punching enables applications to function within the the default security policy of most NATs, effectively signaling to NATs on the path that peer-to-peer communication sessions are “solicited” and thus should be accepted. This paper documents hole punching for both UDP and TCP, and details the crucial aspects of both application and NAT behavior that make hole punching work.

Unfortunately, no traversal technique works with all existing NATs, because NAT behavior is not standardized. This paper presents some experimental results evaluating hole punching support in current NATs. Our data is derived from results submitted by users throughout the Internet by running our “NAT Check” tool over a wide variety of NATs by different vendors. While the data points were gathered from a “self-selecting” user community and may not be representative of the true distribution of NAT implementations deployed on the Internet, the results are nevertheless generally encouraging.

While evaluating basic hole punching, we also point out variations that can make hole punching work on a wider variety of existing NATs at the cost of greater complexity. Our primary focus, however, is on developing the simplest hole punching technique that works cleanly and robustly in the presence of “well-behaved” NATs in any reasonable network topology. We deliberately avoid excessively clever tricks that may increase compatibility with some existing “broken” NATs in the short term, but which only work some of the time and may cause additional unpredictability and network brittleness in the long term.

Although the larger address space of IPv6 [3] may eventually reduce the need for NAT, in the short term IPv6 is increasing the demand for NAT, because NAT itself provides the easiest way to achieve interoperability between IPv4 and IPv6 address domains [24]. Further, the anonymity and inaccessibility of hosts on private networks has widely perceived security and privacy benefits. Firewalls are unlikely to go away even when there are enough IP addresses: IPv6 firewalls will still commonly block unsolicited incoming traffic by default, making hole punching useful even to IPv6 applications.

The rest of this paper is organized as follows. Section 2 introduces basic terminology and NAT traversal concepts. Section 3 details hole punching for UDP, and Section 4 introduces hole punching for TCP. Section 5 summarizes important properties a NAT must have in order to enable hole punching. Section 6 presents our experimental results on hole punching support in popular NATs, Section 7 discusses related work, and Section 8 concludes.

2 General Concepts

This section introduces basic NAT terminology used throughout the paper, and then outlines general NAT traversal techniques that apply equally to TCP and UDP.

2.1 NAT Terminology

This paper adopts the NAT terminology and taxonomy defined in RFC 2663 [21], as well as additional terms defined more recently in RFC 3489 [19].

Of particular importance is the notion of session. A session endpoint for TCP or UDP is an (IP address, port number) pair, and a particular session is uniquely identified by its two session endpoints. From the perspective of one of the hosts involved, a session is effectively identified by the 4-tuple (local IP, local port, remote IP, remote port). The direction of a session is normally the flow direction of the packet that initiates the session: the initial SYN packet for TCP, or the first user datagram for UDP.

Of the various flavors of NAT, the most common type is traditional or outbound NAT, which provides an asymmetric bridge between a private network and a public network. Outbound NAT by default allows only outbound sessions to traverse the NAT: incoming packets are dropped unless the NAT identifies them as being part of an existing session initiated from within the private network. Outbound NAT conflicts with peer-to-peer protocols because when both peers desiring to communicate are “behind” (on the private network side of) two different NATs, whichever peer tries to initiate a session, the other peer’s NAT rejects it. NAT traversal entails making P2P sessions look like “outbound” sessions to both NATs.

Outbound NAT has two sub-varieties: Basic NAT, which only translates IP addresses, and Network Address/Port Translation (NAPT), which translates entire session endpoints. NAPT, the more general variety, has also become the most common because it enables the hosts on a private network to share the use of a single public IP address. Throughout this paper we assume NAPT, though the principles and techniques we discuss apply equally well (if sometimes trivially) to Basic NAT.

2.2 Relaying

The most reliable–but least efficient–method of P2P communication across NAT is simply to make the communication look to the network like standard client/server communication, through relaying. Suppose two client hosts $A$ and $B$ have each initiated TCP or UDP connections to a well-known server $S$, at $S$‘s global IP address 18.181.0.31 and port number 1234. As shown in Figure 2, the clients reside on separate private networks, and their respective NATs prevent either client from directly initiating a connection to the other. Instead of attempting a direct connection, the two clients can simply use the server $S$ to relay messages between them. For example, to send a message to client $B$, client $A$ simply sends the message to server $S$ along its already-established client/server connection, and server $S$forwards the message on to client $B$ using its existing client/server connection with $B$.

Figure 2: NAT Traversal by Relaying
\begin{figure}\centerline{\epsfig{file=relay.eps, scale=0.40}}\end{figure}

Relaying always works as long as both clients can connect to the server. Its disadvantages are that it consumes the server’s processing power and network bandwidth, and communication latency between the peering clients is likely increased even if the server is well-connected. Nevertheless, since there is no more efficient technique that works reliably on all existing NATs, relaying is a useful fall-back strategy if maximum robustness is desired. The TURN protocol [18] defines a method of implementing relaying in a relatively secure fashion.

2.3 Connection Reversal

Some P2P applications use a straightforward but limited technique, known as connection reversal, to enable communication when both hosts have connections to a well-known rendezvous server $S$ and only one of the peers is behind a NAT, as shown in Figure 3. If $A$ wants to initiate a connection to $B$, then a direct connection attempt works automatically, because $B$ is not behind a NAT and $A$‘s NAT interprets the connection as an outgoing session. If $B$ wants to initiate a connection to $A$, however, any direct connection attempt to $A$ is blocked by $A$‘s NAT. $B$ can instead relay a connection request to $A$ through a well-known server $S$, asking $A$ to attempt a “reverse” connection back to $B$. Despite the obvious limitations of this technique, the central idea of using a well-known rendezvous server as an intermediary to help set up direct peer-to-peer connections is fundamental to the more general hole punching techniques described next.

Figure 3: NAT Traversal by Connection Reversal
\begin{figure}\centerline{\epsfig{file=reversal.eps, scale=0.40}}\end{figure}

3 UDP Hole Punching

UDP hole punching enables two clients to set up a direct peer-to-peer UDP session with the help of a well-known rendezvous server, even if the clients are both behind NATs. This technique was mentioned in section 5.1 of RFC 3027 [10], documented more thoroughly elsewhere on the Web [13], and used in recent experimental Internet protocols [17,11]. Various proprietary protocols, such as those for on-line gaming, also use UDP hole punching.

3.1 The Rendezvous Server

Hole punching assumes that the two clients, $A$ and $B$, already have active UDP sessions with a rendezvous server $S$. When a client registers with $S$, the server records two endpoints for that client: the (IP address, UDP port) pair that the client believes itself to be using to talk with $S$, and the (IP address, UDP port) pair that the server observes the client to be using to talk with it. We refer to the first pair as the client’s private endpoint and the second as the client’s public endpoint. The server might obtain the client’s private endpoint from the client itself in a field in the body of the client’s registration message, and obtain the client’s public endpoint from the source IP address and source UDP port fields in the IP and UDP headers of that registration message. If the client is not behind a NAT, then its private and public endpoints should be identical.

A few poorly behaved NATs are known to scan the body of UDP datagrams for 4-byte fields that look like IP addresses, and translate them as they would the IP address fields in the IP header. To be robust against such behavior, applications may wish to obfuscate IP addresses in messages bodies slightly, for example by transmitting the one’s complement of the IP address instead of the IP address itself. Of course, if the application is encrypting its messages, then this behavior is not likely to be a problem.

3.2 Establishing Peer-to-Peer Sessions

Suppose client $A$ wants to establish a UDP session directly with client $B$. Hole punching proceeds as follows:

  1. $A$ initially does not know how to reach $B$, so $A$ asks $S$ for help establishing a UDP session with $B$.
  2. $S$ replies to $A$ with a message containing $B$‘s public and private endpoints. At the same time, $S$ uses its UDP session with $B$ to send $B$ a connection request message containing $A$‘s public and private endpoints. Once these messages are received, $A$ and $B$ know each other’s public and private endpoints.
  3. When $A$ receives $B$‘s public and private endpoints from $S$$A$ starts sending UDP packets to both of these endpoints, and subsequently “locks in” whichever endpoint first elicits a valid response from $B$. Similarly, when $B$ receives $A$‘s public and private endpoints in the forwarded connection request, $B$ starts sending UDP packets to $A$ at each of $A$‘s known endpoints, locking in the first endpoint that works. The order and timing of these messages are not critical as long as they are asynchronous.

We now consider how UDP hole punching handles each of three specific network scenarios. In the first situation, representing the “easy” case, the two clients actually reside behind the same NAT, on one private network. In the second, most common case, the clients reside behind different NATs. In the third scenario, the clients each reside behind two levels of NAT: a common “first-level” NAT deployed by an ISP for example, and distinct “second-level” NATs such as consumer NAT routers for home networks.

It is in general difficult or impossible for the application itself to determine the exact physical layout of the network, and thus which of these scenarios (or the many other possible ones) actually applies at a given time. Protocols such as STUN [19] can provide some information about the NATs present on a communication path, but this information may not always be complete or reliable, especially when multiple levels of NAT are involved. Nevertheless, hole punching works automatically in all of these scenarios without the application having to know the specific network organization, as long as the NATs involved behave in a reasonable fashion. (“Reasonable” behavior for NATs will be described later in Section 5.)

3.3 Peers Behind a Common NAT

First consider the simple scenario in which the two clients (probably unknowingly) happen to reside behind the same NAT, and are therefore located in the same private IP address realm, as shown in Figure 4. Client $A$ has established a UDP session with server $S$, to which the common NAT has assigned its own public port number 62000. Client $B$ has similarly established a session with $S$, to which the NAT has assigned public port number 62005.

Figure 4: UDP Hole Punching, Peers Behind a Common NAT
\begin{figure*}\centerline{\epsfig{file=samenat.eps, scale=0.34}}\end{figure*}

Suppose that client $A$ uses the hole punching technique outlined above to establish a UDP session with $B$, using server $S$ as an introducer. Client $A$ sends $S$ a message requesting a connection to $B$$S$responds to $A$ with $B$‘s public and private endpoints, and also forwards $A$‘s public and private endpoints to $B$. Both clients then attempt to send UDP datagrams to each other directly at each of these endpoints. The messages directed to the public endpoints may or may not reach their destination, depending on whether or not the NAT supports hairpin translation as described below in Section 3.5. The messages directed at the private endpoints do reach their destinations, however, and since this direct route through the private network is likely to be faster than an indirect route through the NAT anyway, the clients are most likely to select the private endpoints for subsequent regular communication.

By assuming that NATs support hairpin translation, the application might dispense with the complexity of trying private as well as public endpoints, at the cost of making local communication behind a common NAT unnecessarily pass through the NAT. As our results in Section 6 show, however, hairpin translation is still much less common among existing NATs than are other “P2P-friendly” NAT behaviors. For now, therefore, applications may benefit substantially by using both public and private endpoints.

3.4 Peers Behind Different NATs

Suppose clients $A$ and $B$ have private IP addresses behind different NATs, as shown in Figure 5$A$ and $B$ have each initiated UDP communication sessions from their local port 4321 to port 1234 on server $S$. In handling these outbound sessions, NAT $A$ has assigned port 62000 at its own public IP address, 155.99.25.11, for the use of $A$‘s session with $S$, and NAT $B$ has assigned port 31000 at its IP address, 138.76.29.7, to $B$‘s session with $S$.

Figure 5: UDP Hole Punching, Peers Behind Different NATs
\begin{figure*}\centerline{\epsfig{file=diffnat.eps, scale=0.34}}\end{figure*}

In $A$‘s registration message to $S$$A$ reports its private endpoint to $S$ as 10.0.0.1:4321, where 10.0.0.1 is $A$‘s IP address on its own private network. $S$ records $A$‘s reported private endpoint, along with $A$‘s public endpoint as observed by $S$ itself. $A$‘s public endpoint in this case is 155.99.25.11:62000, the temporary endpoint assigned to the session by the NAT. Similarly, when client $B$ registers, $S$records $B$‘s private endpoint as 10.1.1.3:4321 and $B$‘s public endpoint as 138.76.29.7:31000.

Now client $A$ follows the hole punching procedure described above to establish a UDP communication session directly with $B$. First, $A$ sends a request message to $S$ asking for help connecting with $B$. In response, $S$ sends $B$‘s public and private endpoints to $A$, and sends $A$‘s public and private endpoints to $B$$A$ and $B$ each start trying to send UDP datagrams directly to each of these endpoints.

Since $A$ and $B$ are on different private networks and their respective private IP addresses are not globally routable, the messages sent to these endpoints will reach either the wrong host or no host at all. Because many NATs also act as DHCP servers, handing out IP addresses in a fairly deterministic way from a private address pool usually determined by the NAT vendor by default, it is quite likely in practice that $A$‘s messages directed at $B$‘s private endpoint will reach some (incorrect) host on $A$‘s private network that happens to have the same private IP address as $B$ does. Applications must therefore authenticate all messages in some way to filter out such stray traffic robustly. The messages might include application-specific names or cryptographic tokens, for example, or at least a random nonce pre-arranged through $S$.

Now consider $A$‘s first message sent to $B$‘s public endpoint, as shown in Figure 5. As this outbound message passes through $A$‘s NAT, this NAT notices that this is the first UDP packet in a new outgoing session. The new session’s source endpoint (10.0.0.1:4321) is the same as that of the existing session between $A$ and $S$, but its destination endpoint is different. If NAT $A$ is well-behaved, it preserves the identity of $A$‘s private endpoint, consistently translating all outbound sessions from private source endpoint 10.0.0.1:4321 to the corresponding public source endpoint 155.99.25.11:62000. $A$‘s first outgoing message to $B$‘s public endpoint thus, in effect, “punches a hole” in $A$‘s NAT for a new UDP session identified by the endpoints (10.0.0.1:4321, 138.76.29.7:31000) on $A$‘s private network, and by the endpoints (155.99.25.11:62000, 138.76.29.7:31000) on the main Internet.

If $A$‘s message to $B$‘s public endpoint reaches $B$‘s NAT before $B$‘s first message to $A$ has crossed $B$‘s own NAT, then $B$‘s NAT may interpret $A$‘s inbound message as unsolicited incoming traffic and drop it. $B$‘s first message to $A$‘s public address, however, similarly opens a hole in $B$‘s NAT, for a new UDP session identified by the endpoints (10.1.1.3:4321, 155.99.25.11:62000) on $B$‘s private network, and by the endpoints (138.76.29.7:31000, 155.99.25.11:62000) on the Internet. Once the first messages from $A$ and $B$ have crossed their respective NATs, holes are open in each direction and UDP communication can proceed normally. Once the clients have verified that the public endpoints work, they can stop sending messages to the alternative private endpoints.

3.5 Peers Behind Multiple Levels of NAT

In some topologies involving multiple NAT devices, two clients cannot establish an “optimal” P2P route between them without specific knowledge of the topology. Consider a final scenario, depicted in Figure 6. Suppose NAT $C$ is a large industrial NAT deployed by an internet service provider (ISP) to multiplex many customers onto a few public IP addresses, and NATs $A$ and $B$ are small consumer NAT routers deployed independently by two of the ISP’s customers to multiplex their private home networks onto their respective ISP-provided IP addresses. Only server $S$ and NAT $C$ have globally routable IP addresses; the “public” IP addresses used by NAT $A$ and NAT $B$ are actually private to the ISP’s address realm, while client $A$‘s and $B$‘s addresses in turn are private to the addressing realms of NAT $A$ and NAT $B$, respectively. Each client initiates an outgoing connection to server $S$ as before, causing NATs $A$ and $B$ each to create a single public/private translation, and causing NAT $C$ to establish a public/private translation for each session.

Figure 6: UDP Hole Punching, Peers Behind Multiple Levels of NAT
\begin{figure*}\centerline{\epsfig{file=multinat.eps, scale=0.34}}\end{figure*}

Now suppose $A$ and $B$ attempt to establish a direct peer-to-peer UDP connection via hole punching. The optimal routing strategy would be for client $A$ to send messages to client $B$‘s “semi-public” endpoint at NAT $B$, 10.0.1.2:55000 in the ISP’s addressing realm, and for client $B$ to send messages to $A$‘s “semi-public” endpoint at NAT $B$, namely 10.0.1.1:45000. Unfortunately, $A$ and $B$ have no way to learn these addresses, because server $S$ only sees the truly global public endpoints of the clients, 155.99.25.11:62000 and 155.99.25.11:62005 respectively. Even if $A$ and $B$ had some way to learn these addresses, there is still no guarantee that they would be usable, because the address assignments in the ISP’s private address realm might conflict with unrelated address assignments in the clients’ private realms. (NAT $A$‘s IP address in NAT $C$‘s realm might just as easily have been 10.1.1.3, for example, the same as client $B$‘s private address in NAT $B$‘s realm.)

The clients therefore have no choice but to use their global public addresses as seen by $S$ for their P2P communication, and rely on NAT $C$ providing hairpin or loopback translation. When $A$ sends a UDP datagram to $B$‘s global endpoint, 155.99.25.11:62005, NAT $A$ first translates the datagram’s source endpoint from 10.0.0.1:4321 to 10.0.1.1:45000. The datagram now reaches NAT $C$, which recognizes that the datagram’s destination address is one of NAT $C$‘s own translated public endpoints. If NAT $C$ is well-behaved, it then translates both the source and destination addresses in the datagram and “loops” the datagram back onto the private network, now with a source endpoint of 155.99.25.11:62000 and a destination endpoint of 10.0.1.2:55000. NAT $B$ finally translates the datagram’s destination address as the datagram enters $B$‘s private network, and the datagram reaches $B$. The path back to $A$ works similarly. Many NATs do not yet support hairpin translation, but it is becoming more common as NAT vendors become aware of this issue.

3.6 UDP Idle Timeouts

Since the UDP transport protocol provides NATs with no reliable, application-independent way to determine the lifetime of a session crossing the NAT, most NATs simply associate an idle timer with UDP translations, closing the hole if no traffic has used it for some time period. There is unfortunately no standard value for this timer: some NATs have timeouts as short as 20 seconds. If the application needs to keep an idle UDP session active after establishing the session via hole punching, the application must send periodic keep-alive packets to ensure that the relevant translation state in the NATs does not disappear.

Unfortunately, many NATs associate UDP idle timers with individual UDP sessions defined by a particular pair of endpoints, so sending keep-alives on one session will not keep other sessions active even if all the sessions originate from the same private endpoint. Instead of sending keep-alives on many different P2P sessions, applications can avoid excessive keep-alive traffic by detecting when a UDP session no longer works, and re-running the original hole punching procedure again “on demand.”

4 TCP Hole Punching

Establishing peer-to-peer TCP connections between hosts behind NATs is slightly more complex than for UDP, but TCP hole punching is remarkably similar at the protocol level. Since it is not as well-understood, it is currently supported by fewer existing NATs. When the NATs involved do support it, however, TCP hole punching is just as fast and reliable as UDP hole punching. Peer-to-peer TCP communication across well-behaved NATs may in fact be more robust than UDP communication, because unlike UDP, the TCP protocol’s state machine gives NATs on the path a standard way to determine the precise lifetime of a particular TCP session.

4.1 Sockets and TCP Port Reuse

The main practical challenge to applications wishing to implement TCP hole punching is not a protocol issue but an application programming interface (API) issue. Because the standard Berkeley sockets API was designed around the client/server paradigm, the API allows a TCP stream socket to be used to initiate an outgoing connection via connect(), or to listen for incoming connections via listen() andaccept()but not both. Further, TCP sockets usually have a one-to-one correspondence to TCP port numbers on the local host: after the application binds one socket to a particular local TCP port, attempts to bind a second socket to the same TCP port fail.

For TCP hole punching to work, however, we need to use a single local TCP port to listen for incoming TCP connections and to initiate multiple outgoing TCP connections concurrently. Fortunately, all major operating systems support a special TCP socket option, commonly named SO_REUSEADDR, which allows the application to bind multiple sockets to the same local endpoint as long as this option is set on all of the sockets involved. BSD systems have introduced a SO_REUSEPORT option that controls port reuse separately from address reuse; on such systems both of these options must be set.

4.2 Opening Peer-to-Peer TCP Streams

Suppose that client $A$ wishes to set up a TCP connection with client $B$. We assume as usual that both $A$ and $B$ already have active TCP connections with a well-known rendezvous server $S$. The server records each registered client’s public and private endpoints, just as for UDP. At the protocol level, TCP hole punching works almost exactly as for UDP:

  1. Client $A$ uses its active TCP session with $S$ to ask $S$ for help connecting to $B$.
  2. $S$ replies to $A$ with $B$‘s public and private TCP endpoints, and at the same time sends $A$‘s public and private endpoints to $B$.
  3. From the same local TCP ports that $A$ and $B$ used to register with $S$$A$ and $B$ each asynchronously make outgoing connection attempts to the other’s public and private endpoints as reported by $S$, while simultaneously listening for incoming connections on their respective local TCP ports.
  4. $A$ and $B$ wait for outgoing connection attempts to succeed, and/or for incoming connections to appear. If one of the outgoing connection attempts fails due to a network error such as “connection reset” or “host unreachable,” the host simply re-tries that connection attempt after a short delay (e.g., one second), up to an application-defind maximum timeout period.
  5. When a TCP connection is made, the hosts authenticate each other to verify that they connected to the intended host. If authentication fails, the clients close that connection and continue waiting for others to succeed. The clients use the first successfully authenticated TCP stream resulting from this process.

Unlike with UDP, where each client only needs one socket to communicate with both $S$ and any number of peers simultaneously, with TCP each client application must manage several sockets bound to a single local TCP port on that client node, as shown in Figure 7. Each client needs a stream socket representing its connection to $S$, a listen socket on which to accept incoming connections from peers, and at least two additional stream sockets with which to initiate outgoing connections to the other peer’s public and private TCP endpoints.

Figure 7: Sockets versus Ports for TCP Hole Punching
\begin{figure}\centerline{\epsfig{file=tcpsocks.eps, scale=0.35}}\end{figure}

Consider the common-case scenario in which the clients $A$ and $B$ are behind different NATs, as shown in Figure 5, and assume that the port numbers shown in the figure are now for TCP rather than UDP ports. The outgoing connection attempts $A$ and $B$ make to each other’s private endpoints either fail or connect to the wrong host. As with UDP, it is important that TCP applications authenticate their peer-to-peer sessions, due of the likelihood of mistakenly connecting to a random host on the local network that happens to have the same private IP address as the desired host on a remote private network.

The clients’ outgoing connection attempts to each other’s public endpoints, however, cause the respective NATs to open up new “holes” enabling direct TCP communication between $A$ and $B$. If the NATs are well-behaved, then a new peer-to-peer TCP stream automatically forms between them. If $A$‘s first SYN packet to $B$ reaches $B$‘s NAT before $B$‘s first SYN packet to $A$ reaches $B$‘s NAT, for example, then $B$‘s NAT may interpret $A$‘s SYN as an unsolicited incoming connection attempt and drop it. $B$‘s first SYN packet to $A$ should subsequently get through, however, because $A$‘s NAT sees this SYN as being part of the outbound session to $B$ that $A$‘s first SYN had already initiated.

4.3 Behavior Observed by the Application

What the client applications observe to happen with their sockets during TCP hole punching depends on the timing and the TCP implementations involved. Suppose that $A$‘s first outbound SYN packet to $B$‘s public endpoint is dropped by NAT $B$, but $B$‘s first subsequent SYN packet to $A$‘s public endpoint gets through to $A$ before $A$‘s TCP retransmits its SYN. Depending on the operating system involved, one of two things may happen:

  • $A$‘s TCP implementation notices that the session endpoints for the incoming SYN match those of an outbound session $A$ was attempting to initiate. $A$‘s TCP stack therefore associates this new session with the socket that the local application on $A$ was using to connect() to $B$‘s public endpoint. The application’s asynchronous connect() call succeeds, and nothing happens with the application’s listen socket.Since the received SYN packet did not include an ACK for $A$‘s previous outbound SYN, $A$‘s TCP replies to $B$‘s public endpoint with a SYN-ACK packet, the SYN part being merely a replay of $A$‘s original outbound SYN, using the same sequence number. Once $B$‘s TCP receives $A$‘s SYN-ACK, it responds with its own ACK for $A$‘s SYN, and the TCP session enters the connected state on both ends.
  • Alternatively, $A$‘s TCP implementation might instead notice that $A$ has an active listen socket on that port waiting for incoming connection attempts. Since $B$‘s SYN looks like an incoming connection attempt, $A$‘s TCP creates a new stream socket with which to associate the new TCP session, and hands this new socket to the application via the application’s next accept() call on its listen socket.$A$‘s TCP then responds to $B$ with a SYN-ACK as above, and TCP connection setup proceeds as usual for client/server-style connections.Since $A$‘s prior outbound connect() attempt to $B$ used a combination of source and destination endpoints that is now in use by another socket, namely the one just returned to the application viaaccept()$A$‘s asynchronous connect() attempt must fail at some point, typically with an “address in use” error. The application nevertheless has the working peer-to-peer stream socket it needs to communicate with $B$, so it ignores this failure.

The first behavior above appears to be usual for BSD-based operating systems, whereas the second behavior appears more common under Linux and Windows.

4.4 Simultaneous TCP Open

Suppose that the timing of the various connection attempts during the hole punching process works out so that the initial outgoing SYN packets from both clients traverse their respective local NATs, opening new outbound TCP sessions in each NAT, before reaching the remote NAT. In this “lucky” case, the NATs do not reject either of the initial SYN packets, and the SYNs cross on the wire between the two NATs. In this case, the clients observe an event known as a simultaneous TCP open: each peer’s TCP receives a “raw” SYN while waiting for a SYN-ACK. Each peer’s TCP responds with a SYN-ACK, whose SYN part essentially “replays” the peer’s previous outgoing SYN, and whose ACK part acknowledges the SYN received from the other peer.

What the respective applications observe in this case again depends on the behavior of the TCP implementations involved, as described in the previous section. If both clients implement the second behavior above, it may be that all of the asynchronous connect() calls made by the application ultimately fail, but the application running on each client nevertheless receives a new, working peer-to-peer TCP stream socket via accept()–as if this TCP stream had magically “created itself” on the wire and was merely passively accepted at the endpoints! As long as the application does not care whether it ultimately receives its peer-to-peer TCP sockets via connect() or accept(), the process results in a working stream on any TCP implementation that properly implements the standard TCP state machine specified in RFC 793 [23].

Each of the alternative network organization scenarios discussed in Section 3 for UDP works in exactly the same way for TCP. For example, TCP hole punching works in multi-level NAT scenarios such as the one in Figure 6 as long as the NATs involved are well-behaved.

4.5 Sequential Hole Punching

In a variant of the above TCP hole punching procedure implemented by the NatTrav library [4], the clients attempt connections to each other sequentially rather than in parallel. For example: (1) $A$ informs $B$via $S$ of its desire to communicate, without simultaneously listening on its local port; (2) $B$ makes a connect() attempt to $A$, which opens a hole in $B$‘s NAT but then fails due to a timeout or RST from $A$‘s NAT or a RST from $A$ itself; (3) $B$ closes its connection to $S$ and does a listen() on its local port; (4) $S$ in turn closes its connection with $A$, signaling $A$ to attempt a connect() directly to $B$.

This sequential procedure may be particularly useful on Windows hosts prior to XP Service Pack 2, which did not correctly implement simultaneous TCP open, or on sockets APIs that do not support theSO_REUSEADDR functionality. The sequential procedure is more timing-dependent, however, and may be slower in the common case and less robust in unusual situations. In step (2), for example, $B$ must allow its “doomed-to-fail” connect() attempt enough time to ensure that at least one SYN packet traverses all NATs on its side of the network. Too little delay risks a lost SYN derailing the process, whereas too much delay increases the total time required for hole punching. The sequential hole punching procedure also effectively “consumes” both clients’ connections to the server $S$, requiring the clients to open fresh connections to $S$ for each new P2P connection to be forged. The parallel hole punching procedure, in contrast, typically completes as soon as both clients make their outgoing connect()attempts, and allows each client to retain and re-use a single connection to $S$ indefinitely.

5 Properties of P2P-Friendly NATs

This section describes the key behavioral properties NATs must have in order for the hole punching techniques described above to work properly. Not all current NAT implementations satisfy these properties, but many do, and NATs are gradually becoming more “P2P-friendly” as NAT vendors recognize the demand for peer-to-peer protocols such as voice over IP and on-line gaming.

This section is not meant to be a complete or definitive specification for how NATs “should” behave; we provide it merely for information about the most commonly observed behaviors that enable or break P2P hole punching. The IETF has started a new working group, BEHAVE, to define official “best current practices” for NAT behavior. The BEHAVE group’s initial drafts include the considerations outlined in this section and others; NAT vendors should of course follow the IETF working group directly as official behavioral standards are formulated.

5.1 Consistent Endpoint Translation

The hole punching techniques described here only work automatically if the NAT consistently maps a given TCP or UDP source endpoint on the private network to a single corresponding public endpoint controlled by the NAT. A NAT that behaves in this way is referred to as a cone NAT in RFC 3489 [19] and elsewhere, because the NAT “focuses” all sessions originating from a single private endpoint through the same public endpoint on the NAT.

Consider again the scenario in Figure 5, for example. When client $A$ initially contacted the well-known server $S$, NAT $A$ chose to use port 62000 at its own public IP address, 155.99.25.11, as a temporary public endpoint to representing $A$‘s private endpoint 10.0.0.1:4321. When $A$ later attempts to establish a peer-to-peer session with $B$ by sending a message from the same local private endpoint to $B$‘s public endpoint, $A$ depends on NAT $A$ preserving the identity of this private endpoint, and re-using the existing public endpoint of 155.99.25.11:62000, because that is the public endpoint for $A$ to which $B$ will be sending its corresponding messages.

A NAT that is only designed to support client/server protocols will not necessarily preserve the identities of private endpoints in this way. Such a NAT is a symmetric NAT in RFC 3489 terminology. For example, after the NAT assigns the public endpoint 155.99.25.11:62000 to client $A$‘s session with server $S$, the NAT might assign a different public endpoint, such as 155.99.25.11:62001, to the P2P session that $A$ tries to initiate with $B$. In this case, the hole punching process fails to provide connectivity, because the subsequent incoming messages from $B$ reach NAT $A$ at the wrong port number.

Many symmetric NATs allocate port numbers for successive sessions in a fairly predictable way. Exploiting this fact, variants of hole punching algorithms [9,1] can be made to work “much of the time” even over symmetric NATs by first probing the NAT’s behavior using a protocol such as STUN [19], and using the resulting information to “predict” the public port number the NAT will assign to a new session. Such prediction techniques amount to chasing a moving target, however, and many things can go wrong along the way. The predicted port number might already be in use causing the NAT to jump to another port number, for example, or another client behind the same NAT might initiate an unrelated session at the wrong time so as to allocate the predicted port number. While port number prediction can be a useful trick for achieving maximum compatibility with badly-behaved existing NATs, it does not represent a robust long-term solution. Since symmetric NAT provides no greater security than a cone NAT with per-session traffic filtering, symmetric NAT is becoming less common as NAT vendors adapt their algorithms to support P2P protocols.

5.2 Handling Unsolicited TCP Connections

When a NAT receives a SYN packet on its public side for what appears to be an unsolicited incoming connection attempt, it is important that the NAT just silently drop the SYN packet. Some NATs instead actively reject such incoming connections by sending back a TCP RST packet or even an ICMP error report, which interferes with the TCP hole punching process. Such behavior is not necessarily fatal, as long as the applications re-try outgoing connection attempts as specified in step 4 of the process described in Section 4.2, but the resulting transient errors can make hole punching take longer.

5.3 Leaving Payloads Alone

A few existing NATs are known to scan “blindly” through packet payloads for 4-byte values that look like IP addresses, and translate them as they would the IP address in the packet header, without knowing anything about the application protocol in use. This bad behavior fortunately appears to be uncommon, and applications can easily protect themselves against it by obfuscating IP addresses they send in messages, for example by sending the bitwise complement of the desired IP address.

5.4 Hairpin Translation

Some multi-level NAT situations require hairpin translation support in order for either TCP or UDP hole punching to work, as described in Section 3.5. The scenario shown in Figure 6, for example, depends on NAT $C$ providing hairpin translation. Support for hairpin translation is unfortunately rare in current NATs, but fortunately so are the network scenarios that require it. Multi-level NAT is becoming more common as IPv4 address space depletion continues, however, so support for hairpin translation is important in future NAT implementations.

6 Evaluation of Existing NATs

To evaluate the robustness of the TCP and UDP hole punching techniques described in this paper on a variety of existing NATs, we implemented and distributed a test program called NAT Check [16], and solicited data from Internet users about their NATs.

NAT Check’s primary purpose is to test NATs for the two behavioral properties most crucial to reliable UDP and TCP hole punching: namely, consistent identity-preserving endpoint translation (Section 5.1), and silently dropping unsolicited incoming TCP SYNs instead of rejecting them with RSTs or ICMP errors (Section 5.2). In addition, NAT Check separately tests whether the NAT supports hairpin translation (Section 5.4), and whether the NAT filters unsolicited incoming traffic at all. This last property does not affect hole punching, but provides a useful indication the NAT’s firewall policy.

NAT Check makes no attempt to test every relevant facet of NAT behavior individually: a wide variety of subtle behavioral differences are known, some of which are difficult to test reliably [12]. Instead, NAT Check merely attempts to answer the question, “how commonly can the proposed hole punching techniques be expected to work on deployed NATs, under typical network conditions?”

6.1 Test Method

NAT Check consists of a client program to be run on a machine behind the NAT to be tested, and three well-known servers at different global IP addresses. The client cooperates with the three servers to check the NAT behavior relevant to both TCP and UDP hole punching. The client program is small and relatively portable, currently running on Windows, Linux, BSD, and Mac OS X. The machines hosting the well-known servers all run FreeBSD.

6.1.1 UDP Test

To test the NAT’s behavior for UDP, the client opens a socket and binds it to a local UDP port, then successively sends “ping”-like requests to servers 1 and 2, as shown in Figure 8. These servers each respond to the client’s pings with a reply that includes the client’s public UDP endpoint: the client’s own IP address and UDP port number as observed by the server. If the two servers report the same public endpoint for the client, NAT Check assumes that the NAT properly preserves the identity of the client’s private endpoint, satisfying the primary precondition for reliable UDP hole punching.

Figure 8: NAT Check Test Method for UDP
\begin{figure}\centerline{\epsfig{file=udptest.eps, scale=0.40}}\end{figure}

When server 2 receives a UDP request from the client, besides replying directly to the client it also forwards the request to server 3, which in turn replies to the client from its own IP address. If the NAT’s firewall properly filters “unsolicited” incoming traffic on a per-session basis, then the client never sees these replies from server 3, even though they are directed at the same public port as the replies from servers 1 and 2.

To test the NAT for hairpin translation support, the client simply opens a second UDP socket at a different local port and uses it to send messages to the public endpoint representing the client’s first UDP socket, as reported by server 2. If these messages reach the client’s first private endpoint, then the NAT supports hairpin translation.

6.1.2 TCP Test

The TCP test follows a similar pattern as for UDP. The client uses a single local TCP port to initiate outbound sessions to servers 1 and 2, and checks whether the public endpoints reported by servers 1 and 2 are the same, the first precondition for reliable TCP hole punching.

The NAT’s response to unsolicited incoming connection attempts also impacts the speed and reliability of TCP hole punching, however, so NAT Check also tests this behavior. When server 2 receives the client’s request, instead of immediately replying to the client, it forwards a request to server 3 and waits for server 3 to respond with a “go-ahead” signal. When server 3 receives this forwarded request, it attempts to initiate an inbound connection to the client’s public TCP endpoint. Server 3 waits up to five seconds for this connection to succeed or fail, and if the connection attempt is still “in progress” after five seconds, server 3 responds to server 2 with the “go-ahead” signal and continues waiting for up to 20 seconds. Once the client finally receives server 2’s reply (which server 2 delayed waiting for server 3’s “go-ahead” signal), the client attempts an outbound connection to server 3, effectively causing a simultaneous TCP open with server 3.

What happens during this test depends on the NAT’s behavior as follows. If the NAT properly just drops server 3’s “unsolicited” incoming SYN packets, then nothing happens on the client’s listen socket during the five second period before server 2 replies to the client. When the client finally initiates its own connection to server 3, opening a hole through the NAT, the attempt succeeds immediately. If on the other hand the NAT does not drop server 3’s unsolicited incoming SYNs but allows them through (which is fine for hole punching but not ideal for security), then the client receives an incoming TCP connection on its listen socket before receiving server 2’s reply. Finally, if the NAT actively rejects server 3’s unsolicited incoming SYNs by sending back TCP RST packets, then server 3 gives up and the client’s subsequent attempt to connect to server 3 fails.

To test hairpin translation for TCP, the client simply uses a secondary local TCP port to attempt a connection to the public endpoint corresponding to its primary TCP port, in the same way as for UDP.

6.2 Test Results

The NAT Check data we gathered consists of 380 reported data points covering a variety of NAT router hardware from 68 vendors, as well as the NAT functionality built into different versions of eight popular operating systems. Only 335 of the total data points include results for UDP hairpin translation, and only 286 data points include results for TCP, because we implemented these features in later versions of NAT Check after we had already started gathering results. The data is summarized by NAT vendor in Table 1; the table only individually lists vendors for which at least five data points were available. The variations in the test results for a given vendor can be accounted for by a variety of factors, such as different NAT devices or product lines sold by the same vendor, different software or firmware versions of the same NAT implementation, different configurations, and probably occasional NAT Check testing or reporting errors.

Table 1: User Reports of NAT Support for UDP and TCP Hole Punching
UDP TCP
Hole Hole
Punching Hairpin Punching Hairpin
NAT Hardware
Linksys 45/46 (98%) 5/42 (12%) 33/38 (87%) 3/38 (8%)
Netgear 31/37 (84%) 3/35 (9%) 19/30 (63%) 0/30 (0%)
D-Link 16/21 (76%) 11/21 (52%) 9/19 (47%) 2/19 (11%)
Draytek 2/17 (12%) 3/12 (25%) 2/7 (29%) 0/7 (0%)
Belkin 14/14 (100%) 1/14 (7%) 11/11 (100%) 0/11 (0%)
Cisco 12/12 (100%) 3/9 (33%) 6/7 (86%) 2/7 (29%)
SMC 12/12 (100%) 3/10 (30%) 8/9 (89%) 2/9 (22%)
ZyXEL 7/9 (78%) 1/8 (13%) 0/7 (0%) 0/7 (0%)
3Com 7/7 (100%) 1/7 (14%) 5/6 (83%) 0/6 (0%)
OS-based NAT
Windows 31/33 (94%) 11/32 (34%) 16/31 (52%) 28/31 (90%)
Linux 26/32 (81%) 3/25 (12%) 16/24 (67%) 2/24 (8%)
FreeBSD 7/9 (78%) 3/6 (50%) 2/3 (67%) 1/1 (100%)
All Vendors 310/380 (82%) 80/335 (24%) 184/286 (64%) 37/286 (13%)

Out of the 380 reported data points for UDP, in 310 cases (82%) the NAT consistently translated the client’s private endpoint, indicating basic compatibility with UDP hole punching. Support for hairpin translation is much less common, however: of the 335 data points that include UDP hairpin translation results, only 80 (24%) show hairpin translation support.

Out of the 286 data points for TCP, 184 (64%) show compatibility with TCP hole punching: the NAT consistently translates the client’s private TCP endpoint, and does not send back RST packets in response to unsolicited incoming connection attempts. Hairpin translation support is again much less common: only 37 (13%) of the reports showed hairpin support for TCP.

Since these reports were generated by a “self-selecting” community of volunteers, they do not constitute a random sample and thus do not necessarily represent the true distribution of the NATs in common use. The results are nevertheless encouraging: it appears that the majority of commonly-deployed NATs already support UDP and TCP hole punching at least in single-level NAT scenarios.

6.3 Testing Limitations

There are a few limitations in NAT Check’s current testing protocol that may cause misleading results in some cases. First, we only learned recently that a few NAT implementations blindly translate IP addresses they find in unknown application payloads, and the NAT Check protocol currently does not protect itself from this behavior by obfuscating the IP addresses it transmits.

Second, NAT Check’s current hairpin translation checking may yield unnecessarily pessimistic results because it does not use the full, two-way hole punching procedure for this test. NAT Check currently assumes that a NAT supporting hairpin translation does not filter “incoming” hairpin connections arriving from the private network in the way it would filter incoming connections arriving at the public side of the NAT, because such filtering is unnecessary for security. We later realized, however, that a NAT might simplistically treat any traffic directed at the NAT’s public ports as “untrusted” regardless of its origin. We do not yet know which behavior is more common.

Finally, NAT implementations exist that consistently translate the client’s private endpoint as long as only one client behind the NAT is using a particular private port number, but switch to symmetric NAT or even worse behaviors if two or more clients with different IP addresses on the private network try to communicate through the NAT from the same private port number. NAT Check could only detect this behavior by requiring the user to run it on two or more client hosts behind the NAT at the same time. Doing so would make NAT Check much more difficult to use, however, and impossible for users who only have one usable machine behind the NAT. Nevertheless, we plan to implement this testing functionality as an option in a future version of NAT Check.

6.4 Corroboration of Results

Despite testing difficulties such as those above, our results are generally corroborated by those of a large ISP, who recently found that of the top three consumer NAT router vendors, representing 86% of the NATs observed on their network, all three vendors currently produce NATs compatible with UDP hole punching [25]. Additional independent results recently obtained using the UDP-oriented STUN protocol [12], and STUNT, a TCP-enabled extension [8,9], also appear consistent with our results. These latter studies provide more information on each NAT by testing a wider variety of behaviors individually, instead of just testing for basic hole punching compatibility as NAT Check does. Since these more extensive tests require multiple cooperating clients behind the NAT and thus are more difficult to run, however, these results are so far available on a more limited variety of NATs.

7 Related Work

UDP hole punching was first explored and publicly documented by Dan Kegel [13], and is by now well-known in peer-to-peer application communities. Important aspects of UDP hole punching have also been indirectly documented in the specifications of several experimental protocols, such as STUN [19], ICE [17], and Teredo [11]. We know of no existing published work that thoroughly analyzes hole punching, however, or that points out the hairpin translation issue for multi-level NAT (Section 3.5).

We also know of no prior work that develops TCP hole punching in the symmetric fashion described here. Even the existence of the crucial SO_REUSEADDR/SO_REUSEPORT options in the Berkeley sockets API appears to be little-known among P2P application developers. NatTrav [4] implements a similar but asymmetric TCP hole punching procedure outlined earlier in Section 4.5. NUTSS [9] and NATBLASTER [1] implement more complex TCP hole punching tricks that can work around some of the bad NAT behaviors mentioned in Section 5, but they require the rendezvous server to spoof source IP addresses, and they also require the client applications to have access to “raw” sockets, usually available only at root or administrator privilege levels.

Protocols such as SOCKS [14], UPnP [26], and MIDCOM [22] allow applications to traverse a NAT through explicit cooperation with the NAT. These protocols are not widely or consistently supported by NAT vendors or applications, however, and do not appear to address the increasingly important multi-level NAT scenarios. Explicit control of a NAT further requires the application to locate the NAT and perhaps authenticate itself, which typically involves explicit user configuration. When hole punching works, in contrast, it works with no user intervention.

Recent proposals such as HIP [15] and FARA [2] extend the Internet’s basic architecture by decoupling a host’s identity from its location [20]. IPNL [7], UIP [5,6], and DOA [27] propose schemes for routing across NATs in such an architecture. While such extensions are probably needed in the long term, hole punching enables applications to work over the existing network infrastructure immediately with no protocol stack upgrades, and leaves the notion of “host identity” for applications to define.

8 Conclusion

Hole punching is a general-purpose technique for establishing peer-to-peer connections in the presence of NAT. As long as the NATs involved meet certain behavioral requirements, hole punching works consistently and robustly for both TCP and UDP communication, and can be implemented by ordinary applications with no special privileges or specific network topology information. Hole punching fully preserves the transparency that is one of the most important hallmarks and attractions of NAT, and works even with multiple levels of NAT–though certain corner case situations require hairpin translation, a NAT feature not yet widely implemented.

Acknowledgments

The authors wish to thank Dave Andersen for his crucial support in gathering the results presented in Section 6. We also wish to thank Henrik Nordstrom, Christian Huitema, Justin Uberti, Mema Roussopoulos, and the anonymous USENIX reviewers for valuable feedback on early drafts of this paper. Finally, we wish to thank the many volunteers who took the time to run NAT Check on their systems and submit the results.

Bibliography

1
Andrew Biggadike, Daniel Ferullo, Geoffrey Wilson, and Adrian Perrig.
NATBLASTER: Establishing TCP connections between hosts behind NATs.
In ACM SIGCOMM Asia Workshop, Beijing, China, April 2005.
2
David Clark, Robert Braden, Aaron Falk, and Venkata Pingali.
FARA: Reorganizing the addressing architecture.
In ACM SIGCOMM FDNA Workshop, August 2003.
3
S. Deering and R. Hinden.
Internet protocol, version 6 (IPv6) specification, December 1998.
RFC 2460.
4
Jeffrey L. Eppinger.
TCP connections for P2P apps: A software approach to solving the NAT problem.
Technical Report CMU-ISRI-05-104, Carnegie Mellon University, January 2005.
5
Bryan Ford.
Scalable Internet routing on topology-independent node identities.
Technical Report MIT-LCS-TR-926, MIT Laboratory for Computer Science, October 2003.
6
Bryan Ford.
Unmanaged internet protocol: Taming the edge network management crisis.
In Second Workshop on Hot Topics in Networks, Cambridge, MA, November 2003.
7
Paul Francis and Ramakrishna Gummadi.
IPNL: A NAT-extended Internet architecture.
In ACM SIGCOMM, August 2002.
8
Saikat Guha and Paul Francis.
Simple traversal of UDP through NATs and TCP too (STUNT).
http://nutss.gforge.cis.cornell.edu/.
9
Saikat Guha, Yutaka Takeday, and Paul Francis.
NUTSS: A SIP-based approach to UDP and TCP network connectivity.
In SIGCOMM 2004 Workshops, August 2004.
10
M. Holdrege and P. Srisuresh.
Protocol complications with the IP network address translator, January 2001.
RFC 3027.
11
C. Huitema.
Teredo: Tunneling IPv6 over UDP through NATs, March 2004.
Internet-Draft (Work in Progress).
12
C. Jennings.
NAT classification results using STUN, October 2004.
Internet-Draft (Work in Progress).
13
Dan Kegel.
NAT and peer-to-peer networking, July 1999.
http://www.alumni.caltech.edu/~dank/peer-nat.html.
14
M. Leech et al.
SOCKS protocol, March 1996.
RFC 1928.
15
R. Moskowitz and P. Nikander.
Host identity protocol architecture, April 2003.
Internet-Draft (Work in Progress).
16
NAT check.
http://midcom-p2p.sourceforge.net/.
17
J. Rosenberg.
Interactive connectivity establishment (ICE), October 2003.
Internet-Draft (Work in Progress).
18
J. Rosenberg, C. Huitema, and R. Mahy.
Traversal using relay NAT (TURN), October 2003.
Internet-Draft (Work in Progress).
19
J. Rosenberg, J. Weinberger, C. Huitema, and R. Mahy.
STUN – simple traversal of user datagram protocol (UDP) through network address translators (NATs), March 2003.
RFC 3489.
20
J. Saltzer.
On the naming and binding of network destinations.
In P. Ravasio et al., editor, Local Computer Networks, pages 311-317. North-Holland, Amsterdam, 1982.
RFC 1498.
21
P. Srisuresh and M. Holdrege.
IP network address translator (NAT) terminology and considerations, August 1999.
RFC 2663.
22
P. Srisuresh, J. Kuthan, J. Rosenberg, A. Molitor, and A. Rayhan.
Middlebox communication architecture and framework, August 2002.
RFC 3303.
23
Transmission control protocol, September 1981.
RFC 793.
24
G. Tsirtsis and P. Srisuresh.
Network address translation – protocol translation (NAT-PT), February 2000.
RFC 2766.
25
Justin Uberti.
E-mail on IETF MIDCOM mailing list, February 2004.
Message-ID: <402CEB11.1060906@aol.com>.
26
UPnP Forum.
Internet gateway device (IGD) standardized device control protocol, November 2001.
http://www.upnp.org/.
27
Michael Walfish, Jeremy Stribling, Maxwell Krohn, Hari Balakrishnan, Robert Morris, and Scott Shenker.
Middleboxes no longer considered harmful.
In USENIX Symposium on Operating Systems Design and Implementation, San Francisco, CA, December 2004.

[Architecture] 孰輕孰重?Web Service vs. REST APIs

[Architecture] 孰輕孰重?Web Service vs. REST APIs

Web Service,這個從 2001 年開始被微軟炒起來的分散式元件概念,隨著 Visual Studio 2002 (.NET Framework 1.0) 的推出得到了軟體龍頭微軟的背書,一個原本要從 SOAP 和 XML 開始打造的可重覆使用 (reusable) 式軟體元件就在 .NET Framework (System.Web.Services 命名空間) 的精美包裝下,讓開發人員只需要使用 [WebMethod] 就能夠寫出自己的 Web Service 元件,一切看似完美的表象之下,卻包裝著殘酷的事實:它很肥,又很複雜 (PS: 約耳的抽象滲漏法則)。

Source: http://msdn.microsoft.com/en-us/library/x05s00wz(v=vs.80).aspx

Web Service 的核心是植基於 SOAP (Simple Object Access Protocol) 的協定之上,SOAP 的結構由 XML 所組成,所以不會 XML 根本看不懂 SOAP,而想靈活的操控 Web Service 又不能不會 SOAP,像是利用 SOAP header 來做使用者認證,或是在使用 Web Service 上傳檔案時顯示進度列等 (使用 Web Service 傳檔案又是另一個故事了),這些都是要對 SOAP 有一部份的了解後,才會知道要怎麼做的一些案例,撇開這些不談,就算是一個簡單的 Hello World Web Service 好了,光是 SOAP 訊息的通訊,解譯,執行 Web Service 方法,編譯,回傳再於用戶端解譯,這些工作所需要的網路傳輸量和時間就很可觀了,在這兩年輕量化 Web 介面與 AJAX 的興起,Web Service 傳輸量重的問題已經浮上檯面,雖然有些 JavaScript Library 可以直接呼叫 Web Service,然而它卻已經不是輕量級 Web 服務的開發介面首選。

而 REST API (Represental State Transfer) 則是相對於 Web Service 的另外一面,REST API 藉由 URL 與 HTTP 動詞 (GET, POST, PUT, DELETE) 來決定服務端要執行的動作,與 Web Service 最大的不同,就是它不必死綁著 SOAP 協定,服務端只要在 HTTP 動詞以及 URL 規範好,通常服務端不需要做太多額外的處理,同時透過 HTTP Header 的資訊,服務端可以做更多的控制 (ex: OAuth 就是使用 HTTP Header 傳輸 Authorization 標頭),需要傳輸的資料就會在無形中最小化,尤其適合 mashup application 或 AJAX application 使用,看現在 Google, Facebook, Twitter 與微軟等主流廠商廣為應用即可得之端睨,同時 REST API 配合 JSON 或是少量的 XML,亦可以達到組織資料的目的,而且它能夠跨越到手機等行動應用上,這點是 Web Service 較難做到的。

然而 Web Service 仍有存在的必要,因為 Web Service 發展較久,基礎建設十分完備,包含來自 WS-I (Web Service Interoperability) 組織的規範制訂,安全性的規範完備,且有許多大型應用程式會依賴 Web Service 進行資料交換,所以 Web Service 十分適合在企業應用程式間交換資料或做流程操作,SOAP 與 XML 可以確保格式的正確性與完整性,同時 XML-Sig 規格可以確保訊息的安全性。REST API 就十分適合前端應用程式與服務之間的資料交換以及與使用者互動的部份,它也可以有效的降低開發人員的學習門檻 (當然,基本的 HTTP, Script 與 AJAX 還是要會)。

image


資安專欄

http://www.twisc.ncku.edu.tw/epaper/ePaper-023/column.html

雙面惡魔(Evil Twin)

駭客使用兩張網卡,一張連接HotSpot的無線基地台,另一張則將SSID設成與HotSpot相同,假扮成HotSpot無線基地台。使用者使用無線網路連線時,如果搜尋到駭客所架設的HotSpot,並與之連接,使用者則很容易被竊取帳號密碼。而且假扮的無線基地台不容易追蹤,因為駭客只需將電腦或裝置關閉,走人即可。

資料來源:
http://en.wikipedia.org/wiki/Evil_twin_(wireless_networks)

後門程式(Backdoor)

腦中的後門指的是繞過或規避存取電腦時所需要的認證。通常是程式設計師在設計程式時,為了方便修改程式而預留的通道,或者是電腦遭到植入程式而造成的。駭客會利用此通道對電腦竊取重要資料或遠端遙控電腦進行惡意行為。

資料來源:
http://en.wikipedia.org/wiki/Backdoor_(computing)

Backscatter analysis of DoS Attack

Backscatter本意是用來指涉反向散射的電磁波, 粒子或者訊號, 此概念借用於網路安全的領域當中, 對於阻斷服務攻擊, 攻擊者經常修改封包當中的來源位址並送給網路中想要攻擊的對象, 被攻擊的機器無法分辨此來源位址的真偽而根據協定內容將回應封包傳送給此來源位址, 此回應的封包即稱為backscatter. 藉著分析這些回應封包的來源位址可定序阻斷服務攻擊的特徵值, 此之謂backscatter analysis.

資料來源:
http://en.wikipedia.org/wiki/Backscatter#Backscatter_from_denial-of-service_attacks

http://www.caida.org/publications/animations/passive_monitoring/backscatter.mpg

CAIDA

CAIDA全名為Cooperative Association for Internet Data Analysis, 此機構位於加州大學的San Diego分校的超級電腦中心, 提供產官學界一協同研究的平台以研發維護強固並持續成長的Internet基礎架構. CAIDA 的主要目標包括:
 1.鼓勵創造Internet傳輸的量測尺度, 與IETF/IPPM等業界及消費者組織協同合作以
保證此量測尺度的使用以及全球性的可接受度。
 2.發展對於傳輸資料研發分析的協同環境, 包括取得, 分析以及分享。
 3.育成以下領域的先進方法論及技術: 傳輸效能, 流量特徵值, 模擬, 分析, 視覺
化呈現. 對特定的未來領域探索涉及下一代routing protocol的演化. CAIDA 容易
為人所了解的發佈成果有IPv4 及 IPv6 skitter chart。

資料來源:
http://www.caida.org/home/

API HOOK

API HOOK是一種對系統kernel進行監聽的應用,針對目標可能是的kernel API的入口或出口,以便監聽後並跳至自己所設計的程式,最後會再回到原來的程式繼續執行。API HOOK除了惡意程式應用以外,也常用於智慧財產權保護的軟體上。

資料來源:
http://anti-hacker.blogspot.com/2008/02/hook-api.html

E-Mail spoofing

E-Mail spoofing是一種利用變更E-Mail格式的表頭來達到欺騙收件者的行為。目前這種攻擊常會夾帶木馬或者其他惡意程式以讓受騙的收件者運行該程式而受到系統與隱私上的威脅。

資料來源:
http://en.wikipedia.org/wiki/E-mail_spoofing

Fuzz testing

Fuzz testing或者fuzzing是一種高度自動化軟體檢測技術。提供受測試軟體隨機產生的資料,藉此觀察軟體是否因此發生瑕疵(如當機)。目前Fuzz testing最大詬病之處在於:它通常只能夠找到很簡單的錯誤。程式碼檢測問題本身是一個相當複雜的問題,而人們卻期許Fuzz testing能夠在很短時間內找出錯誤。畢竟Fuzz testing無法瞭解程式碼的真正涵義,我們不難想像fuzzer可能大部分的時間一直不斷作毫無意義的測試。舉例來說,對於一僅接受符合checksum的輸入才可以正確執行的受測軟體,fuzzer卻花費大量的時間重複檢測各種不符合正確checksum的input,受測軟體當然無法正確執行。不過不可否認地,Fuzz testing通常提供軟體檢測人員更加全面性的檢測,同時也替軟體開法人員或者品管人員省下大量人工檢測的勞力與時間。

資料來源:
http://en.wikipedia.org/wiki/Fuzz_testing

Botnet

殭屍電腦(稱為Bot或 Zombie)為一台遭受駭客植入惡意程式的電腦,而成為可被駭客遠端遙控的受害主機。Botnet則是指一群被惡意程式感染的電腦主機,駭客只需透過Internet 即可取得這群主機資源控制權,進行特定的攻擊。散播的方式除了透過網路病毒或蠕蟲感染植入 (Unpatched O.S.或Application),也會利用社交工程誘騙使用者遭受植入或感染 (MSN、Email附件、瀏覽惡意網站等)。其目的主要是為了進行特定攻擊,如DOS/DDOS、被控制大量發送垃圾郵件、竊取機密與私人資料、當成跳板進行犯法行為。

資料來源:
http://en.wikipedia.org/wiki/Botnet

同代加密(Homomorphic Encryption)

一般而言,密文對不持有金鑰的人而言是均勻分佈的亂數字串,無法進行有意義的計算。在某些特殊狀況下,加密方會希望能不經加密金鑰而直接在密文上運算,產生與明文相對應的效果。
例如密文上的*運算對應明文上的+運算:
If c=a+b, then E(c)=E(a)+E(b)
具備這樣特性的加密法E()稱為同代加密。

資料來源:
http://en.wikipedia.org/wiki/Homomorphic_encryption

Javascript綁架(JavaScript hijacking

這種攻擊法來自於AJAX和JSON規範的弱點,利用Javascript中的< script >標籤綁架登入網站的受害者cookie,攻擊者就可取得受害者在網站的權限。 Javascript綁架完全不需要對被攻擊網站值入惡意程式碼,只需要架設惡意網站,並等使用者在開啟並登入被攻擊目標網頁後在開啟惡意網站。最常被攻擊的範例就屬網頁email服務。

資料來源:
http://www.fortifysoftware.com/advisory.jsp

程式隱匿技術 (Code Obfuscation Technique)

程式開發者刻意地隱藏程式的行為。這樣的方式可使得駭客或一般程式設計師在撰寫系統時,可利用這個概念來避免使用者知道或推導程式實際執行的流程及動作。而達到隱匿目的的方式有幾種:加殼(Pack)、加密(Encrypt)以及編碼(Encode)等方式。程式隱匿技術的目的是為了使用者無法了解系統在實際上執行了那些動作以及讓逆向工程(reverse engineering)不易推導成功。這樣的目的通常用來保護軟體內容及相關的智慧財產權保護等。但是若駭客利用這樣的概念在惡意程式上,則是為要躲避防毒軟體的偵測。因此當使用者使用了這類的程式,則無法辨別程式背後的目的是好還是壞。程式指令最終仍然需要進行脫殼(Unpack)、解密(Decrypt)或解碼(Decode)等動作才能加以載入主記憶體內並交由CPU來執行,因此只要有足夠的時間,逆向工程仍然可以推導程式執行的流程及動作。所以程式隱匿技術與逆向工程之間無形中成了競爭關係。這樣的關係就像是防毒軟體不斷地嘗試找出惡意程式的存在,而惡意程式為了達到目的,就嘗試地在實際的程式中加上了”外表” 。事實上,程式隱匿技術的概念已經廣泛地被使用在軟體的撰寫中,尤其是駭客所撰寫的惡意程式上。因此程式檢測人員在檢測程式是否存在惡意行為時,首先即是先判斷程式是否有試圖隱藏其行為的動作,使用的方法可以考慮利用靜態分析隱匿特徵、偵測自身修改程式碼等方式來加快程式檢測的處理時間。

資料來源:
http://www.i-security.tw/learn/learn_az.asp?Code=C

http://www.dgbas.gov.tw/public/Data/762815412271.pdf

http://blog.yam.com/nephowind/article/13461242

社交工程攻擊 (Social Engineering Attacks)

社交工程攻擊是一種利用人性弱點的攻擊手法,用來哄騙他人的個人資料(帳號密碼等),進而危害到系統的安全。同時這種攻擊的手法也是非技術性的。例如釣魚者發一個看似來自於真實來源的電子郵件,使得那些深信不疑的人進入偽造的網站,並輸入個人敏感資料,讓釣魚者達到目的。其他常見的手法還包括了偽裝成技術支援人員,向用戶索取密碼,再俟機登入到特定的系統裏。

資料來源:
http://www.i-security.tw/learn/learn_az.asp?Code=C

http://www.dgbas.gov.tw/public/Data/762815412271.pdf

http://blog.yam.com/nephowind/article/13461242

網址嫁接攻擊 (Pharming Attack)

網址嫁接攻擊是攻擊者意圖將使用者導引到偽造的網站。攻擊者主要是經由篡改DNS伺服器上的網址與IP位址對應記錄,使攻擊行動中的目標網站之網址對應至攻擊者架設的偽造網站之IP位址,造成使用者不知不覺地拜訪了偽造網站,因而有洩漏個人資料的危險。

資料來源:
http://en.wikipedia.org/wiki/Pharming

相同來源政策 (Same Origin Policy)

相同來源政策是網頁瀏覽器的安全性機制,目的是防止不同來源的物件 (web object) 互相存取資源,這裡的物件指的是Javascript,CSS files,和HTTP cookies等。而最典型的存取行為就是Javascript存取或參考其他物件。相同來源政策只允許被認定為相同來源的物件互相存取資源,而其定義是兩個物件必須來自同樣的網域名稱 (domain name)、經由同樣的連接埠 (port) 和同樣的應用層協定 (application layer protocol)。例如在 http://www.example.com/dir/page.html的Javascript可以存取在 http://www.example.com/dir/page.html 的物件,但是不能存取 https://www.example.com/dir2/other.htmlhttp://en.example.com/dir2/other.html 的物件,前者是由於應用層協定不同 (HTTP v.s. HTTPS),後者則是網域名稱不同。

資料來源:
http://en.wikipedia.org/wiki/Same_origin_policy

Fast-flux Service Networks

Fast-flux 意指在網域名稱(Domain Name)內的IP位址會快速轉換,目的是要躲避基於IP位址黑名單之阻擋方式,並透過正常的負載平衡的機制, 如內容傳遞網路(Content Delivery Network, CDN),來建構出一個惡意的代理伺服網路,並主要用來幫助網路犯罪者隱藏他們的內容伺服器,包含釣魚網頁、 惡意程式之下載等,最著名就是MySpace之受害案例。

資料來源:
http://en.wikipedia.org/wiki/Fast_flux

入侵偵測系統 (Intrusion Detection System, IDS)

入侵偵測系統主要是去監控某一環境,如內部網路或網頁伺服器,去進行可疑行為之分析及偵測,主要是利用規則方式或統計上異常行為進行偵測;但有別於「合法才授權」的防火牆,入侵偵測系統更能從合法的行為但夾帶攻擊意圖之行為進行偵測。

資料來源:
http://en.wikipedia.org/wiki/Intrusion_detection_system

電腦鑑識(computer forensics)

電腦鑑識包含了保留、紀錄、讀取、鑑別儲存在電腦裡的資料文件。當資安或犯罪事件發生時, 便需利用電腦鑑識來還原事件的發生過程或擷取已儲存在電腦中的證據。電腦鑑識技術可透過讀取 電腦裝置內的資料進一步提供相關證據 。

資料來源:
http://www.i-security.tw/

網頁隱藏式惡意連結(Malicious URL injection)

又稱為「網頁惡意掛馬」或「網頁掛馬」。指駭客竄改所攻擊的網頁,植入惡意連結,或者是設立惡意網站, 透過宣傳手法,利用一般人的好奇心吸引觀眾到站瀏覽,使用者只要連上網站,就會轉落入駭客預先設計好的陷阱, 被植入木馬程式。進而被竊取電腦中的資料或機密造成損失。

資料來源:
http://www.i-security.tw/

中間人攻擊(Man-in-the-Middle Attack)

指駭客扮演中間人角色,冒充伺服器接收您傳送的訊息,再冒充你把訊息傳給真正的服務器, 因此可以在通訊兩端不知情的情況下竊取或更改傳遞的訊息。

資料來源:
http://www.i-security.tw/

欺騙(spoofing)

spoofing中文意指欺騙,是一種技術用以取得非經授權的存取電腦,其手法主要以偽裝發送訊息、 郵件的位址或偽裝成為已被授權的使用者,在未經許可下進入安全的電腦系統,以進行任何不法的行為。 Spoofing多為垃圾郵件發送者、駭客或網路詐騙者所使用,藉以達到犯罪行為而不被發現。

資料來源:
http://www.i-security.tw/


convert webpage to an image? RSS

http://forums.asp.net/p/1244725/2286975.aspx