How to get the size of available TCP data?
How to get the size of available TCP data?
Problem
I have a use case where I need to Peek
at exactly the first TCP packet, whatever length it may be.
Peek
Snippet
I would have expected this to work:
conn, err := sock.Accept()
if nil != err {
panic(err)
}
// plenty of time for the first packet to arrive
time.Sleep(2500 * 1000000)
bufConn := bufio.NewReader(conn)
n := bufConn.Buffered()
fmt.Fprintf(os.Stdout, "Size of Buffered Data %dn", n)
However, even though I am positive that the data has arrived it still shows that 0 bytes are buffered.
Full Test Application
Here's a full test program:
package main
import (
"bufio"
"fmt"
"net"
"os"
"strconv"
"time"
)
func main () {
addr := ":" + strconv.Itoa(4080)
sock, err := net.Listen("tcp", addr)
if nil != err {
panic(err)
}
conn, err := sock.Accept()
if nil != err {
panic(err)
}
bufConn := bufio.NewReader(conn)
var n int
for {
n = bufConn.Buffered()
fmt.Fprintf(os.Stdout, "Size of Buffered Data %dn", n)
if 0 != n {
break
}
time.Sleep(2500 * 1000000)
}
first, err := bufConn.Peek(n)
if nil != err {
panic(err)
}
fmt.Fprintf(os.Stdout, "[Message] %sn", first)
}
Testing
And how I've been testing:
telnet localhost 4080
Hello, World!
This works equally well:
echo "Hello, World!" | nc localhost -p 4080
However, if I call Peek(14)
directly the data is obviously there.
Peek(14)
Why?
I'm dealing with an application-specific use case - magic byte detection when multiplexing multiple protocols over a single port.
In theory packet sizes are unreliable, but in practice a small hello packet of a few bytes will not be made smaller by any routers in the path and the application will not send more data until it receives the handshake response.
I'm supporting exactly one protocol that requires the server to send its hello packet first, which means that if after a wait of 250ms no packet has been received, the server will assume that this special protocol is being used and send its hello.
Hence, it will be best if I can know if data exists in the underlying buffer without doing any Read()
or Peek()
beforehand.
Read()
Peek()
2 Answers
2
I have a use case where I need to Peek at exactly the first TCP packet, whatever length it may be.
TCP is a streaming protocol and not a datagram protocol like UDP. This means packets are irrelevant from the perspective of TCP. They only exist temporarily on the wire.
Any data the application sends will be put into the continuous send buffer and then packetized by the operating system for transport. This means multiple writes by the application can result in a single packet, a single write into multiple packets etc. If data are lost during transport (i.e. no ACK) the senders OS can even do a retransmit with differently sized packets.
Similar packets received on the wire will be reassembled inside the OS kernel and will be put into the continuous read buffer. All packet boundaries which might have existed on the wire will be lost when doing this. Therefore no way exist for the application to find out where the packet boundary was.
n = bufConn.Buffered()
bufConn
is not the OS socket buffer. bufConn.Buffered()
will only see the data which are read from the underlying socket into the Go process but which are not yet retrieved by the application logic using bufConn.Read()
: if you try to read a single byte with bufConn.Read()
it will actually try to read more bytes from the underlying socket, return the single byte you've requested and keep the rest in the bufConn
buffer for later reads. This is done to provide a more efficient interface for the application logic. If you don't want this don't use buffered I/O.
bufConn
bufConn.Buffered()
bufConn.Read()
bufConn.Read()
bufConn
@EJP The sleep is my test code is just a way to ensure that data has been received. I understand that I would use channels and select in real code.
– CoolAJ86
Jul 24 at 17:07
@Steffen-Ullrich In general what you're describing is true, however, I'm dealing with an application-specific use case. In theory packet sizes are unreliable, but in practice a small hello packet of a few bytes will not be made smaller by any routers in the path and the application will not send more data until it receives the handshake response. The reason I need this functionality is for magic byte detection when multiplexing multiple protocols over a single port.
– CoolAJ86
Jul 24 at 17:26
@CoolAJ86: in this case the question is answered by the part which explains that
bufConn.Buffered()
returns only how many data are buffered which is not the same as how many data are able to read. And if you need it for magic byte detection - why don't you just read the magic byte instead of checking how many data are possible to read?– Steffen Ullrich
Jul 24 at 17:28
bufConn.Buffered()
@SteffenUllrich Mostly because I'm used to writing code in Node.js and, except when for very particular circumstances when pausing the network stream, there's a data event for each tcp packet. Aside from that, different protocols have different handshake styles and different "magic bytes" (SSH, TLS-SNI, HTTP, PROXY, proprietary protocols, etc) . In some cases the client sends the first packet, in other cases the server does (in which case only 1 such protocol could be supported). In some cases the magic bytes are the first 3 bytes, in other cases they require reading a length.
– CoolAJ86
Jul 24 at 17:39
Update: Can't be done with net.Conn
Actually, it is not possible to "Peek" at a net.Conn
without reading. However net.Conn
can be wrapped and that wrapper can be passed around anywhere net.Conn
is accepted.
net.Conn
net.Conn
net.Conn
See
Workable Half-Solution
The ideal solution would be to be able to Peek immediately on the first try. While searching around I did find some custom go TCP libraries... but I'm not feeling adventurous enough to try that yet.
Building off of what @SteffenUllrich said, it turns out that buffConn.Peek(1)
will cause the buffer to be filled with the available data. After that buffConn.Buffered()
returns the expected number of bytes and it's possible to proceed with buffConn.Peek(n)
:
buffConn.Peek(1)
buffConn.Buffered()
buffConn.Peek(n)
// Cause the bufConn with the available data
firstByte, err = bufConn.Peek(1)
if nil != err {
panic(err)
}
// Check the size now
n = bufConn.Buffered()
fmt.Fprintf(os.Stdout, "Size of Buffered Data %dn", n)
// Peek the full amount of available data
firstPacket, err = bufConn.Peek(n)
if nil != err {
panic(err)
}
I thought I had tried this earlier and saw the buffer only filled with 1 byte, but reading the answer above caused me to create a specific test case to be sure, and it worked.
The Downside
This still requires a Read()
/Peek()
before knowing the size of the data.
Read()
Peek()
This means that for my particular case where a single protocol is supported which requires the server to send the first hello packet, I have to store state about the connection somewhere else such that if enough time has passed (say 250ms) without any data being received I know to now skip detection of the first packet Peek when it comes in.
After the
Peek
there might still be data in the operating systems socket buffer. Peek
will not read more data than fit into the buffer of the bufConn
object. This might not be relevant in your specific use case with small data though.– Steffen Ullrich
Jul 24 at 17:53
Peek
Peek
bufConn
By clicking "Post Your Answer", you acknowledge that you have read our updated terms of service, privacy policy and cookie policy, and that your continued use of the website is subject to these policies.
... and therefore sleeps in blocking mode networking code are just literally a waste of time.
– EJP
Jul 23 at 5:41