AVB Network Latency

All devices in an AVB network share the same time. This allows the sending device (talker) to specify the precise point in time at which its audio samples should be played out at every receiver (listener). The talker does this by adding an offset to the current time and transmitting the resulting timestamp along with the audio payload. This timestamp is called the "presentation time" and has nanosecond precision. For comparison, a single sample at 48 kHz has a duration of over 20800 ns.
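As a minimal sketch of this arithmetic (not the implementation of any particular AVB device), the following Python snippet computes a presentation time from a clock reading and an offset. The function names are hypothetical, and time.time_ns() only stands in for the synchronized network clock that a real talker would read.

```python
import time

NS_PER_MS = 1_000_000
NS_PER_S = 1_000_000_000


def presentation_time_ns(now_ns: int, offset_ns: int) -> int:
    """Presentation time = current (shared) time + offset, in nanoseconds."""
    return now_ns + offset_ns


def sample_duration_ns(sample_rate_hz: int) -> float:
    """Duration of one audio sample in nanoseconds."""
    return NS_PER_S / sample_rate_hz


# Assumption: the local clock stands in for the shared network time.
now_ns = time.time_ns()
stamp_ns = presentation_time_ns(now_ns, 2 * NS_PER_MS)

print(stamp_ns - now_ns)                  # 2000000 (the 2 ms offset)
print(round(sample_duration_ns(48_000)))  # 20833
```

The second value shows why nanosecond timestamps are far finer than a single sample period.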

The receiver compares the incoming presentation time of each frame to the current time and buffers the samples until the presentation time has arrived.
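The buffering behavior can be illustrated with a small Python sketch. The ListenerBuffer class and its methods are hypothetical names chosen for this example; an actual listener does this in hardware or firmware against the synchronized network clock.

```python
import heapq


class ListenerBuffer:
    """Holds samples until their presentation time has arrived."""

    def __init__(self):
        self._heap = []  # entries: (presentation_time_ns, sequence, sample)
        self._seq = 0    # tie-breaker that keeps arrival order stable

    def push(self, presentation_time_ns, sample):
        heapq.heappush(self._heap, (presentation_time_ns, self._seq, sample))
        self._seq += 1

    def pop_due(self, now_ns):
        """Return all samples whose presentation time is <= now_ns."""
        due = []
        while self._heap and self._heap[0][0] <= now_ns:
            due.append(heapq.heappop(self._heap)[2])
        return due


buf = ListenerBuffer()
buf.push(2_000_000, "sample A")
buf.push(2_020_833, "sample B")

print(buf.pop_due(1_500_000))  # [] (nothing is due yet)
print(buf.pop_due(2_000_000))  # ['sample A']
print(buf.pop_due(2_100_000))  # ['sample B']
```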

The offset (maximum transit time) is specified by the AVB standard as 2 ms for class A traffic, which is enough time for the signal to pass through a very large network under full load with over ten Gigabit switches ('hops') along the way. By default, certified AVB products will use this offset, even though it results in a significantly higher latency than necessary in most cases. In smaller Gigabit networks, the offset can be adjusted to lower values, such as 0.3 ms, 0.6 ms or 1 ms. It can be adjusted for each stream individually.

If the digital outputs of listener devices should be phase-aligned to the talker, it is necessary to choose an offset that is a whole multiple of the sample length (1 divided by the sampling rate). Using the web interface, the offset can be conveniently set in samples for each stream to ensure phase alignment between talker and listeners across the network.
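The relationship between an offset in samples and an offset in time can be sketched as follows. This is an illustrative calculation only; the function names are hypothetical and do not correspond to the web interface or any device API.

```python
from fractions import Fraction

NS_PER_S = 1_000_000_000


def offset_from_samples(samples: int, sample_rate_hz: int) -> Fraction:
    """Exact offset in seconds for a whole number of samples."""
    return Fraction(samples, sample_rate_hz)


def is_sample_aligned(offset_ns: int, sample_rate_hz: int) -> bool:
    """True if offset_ns is a whole multiple of the sample length."""
    return (offset_ns * sample_rate_hz) % NS_PER_S == 0


print(float(offset_from_samples(48, 48_000) * 1000))  # 1.0 (ms)
print(is_sample_aligned(1_000_000, 48_000))  # True: exactly 48 samples
print(is_sample_aligned(300_000, 48_000))    # False: 14.4 samples
```

An offset of exactly 0.3 ms is not a whole number of samples at 48 kHz, which is why entering the offset in samples (for example 15 samples, or 0.3125 ms) is the safer way to keep outputs phase-aligned.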

Table 1. Recommended presentation time offset (estimated maximum transfer time) depending on network size, in samples, for Gigabit networks

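| Rate (Hz)   | 1 switch (~0.3 ms) | ≤3 switches (~0.6 ms) | ≤6 switches (~1 ms) | Default (~2 ms) |
|-------------|--------------------|-----------------------|---------------------|-----------------|
| 44100       | 14                 | 27                    | 44                  | 88              |
| 88200 (x2)  | 28                 | 54                    | 88                  | 176             |
| 176400 (x4) | 56                 | 108                   | 176                 | 352             |
| 48000       | 15                 | 30                    | 48                  | 96              |
| 96000 (x2)  | 30                 | 60                    | 96                  | 192             |
| 192000 (x4) | 60                 | 120                   | 192                 | 384             |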

If the chosen offset is too low, the audio stream may experience drop-outs or distortion. However, the maximum transit time to a specific listener is known, and an ATDECC controller can show a warning if the intended presentation time is earlier than the calculated maximum transit time for that stream.
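The comparison behind such a warning can be sketched in a few lines of Python. This is not an ATDECC API; offset_warning is a hypothetical helper, and the transit times used below are made-up example values rather than figures reported by a real controller.

```python
def offset_warning(offset_ns, max_transit_ns):
    """Return a warning string if the offset is shorter than the
    maximum transit time for the stream, otherwise None."""
    if offset_ns < max_transit_ns:
        return (
            f"warning: offset {offset_ns} ns is below the "
            f"maximum transit time {max_transit_ns} ns"
        )
    return None


# Hypothetical example values, in nanoseconds.
print(offset_warning(300_000, 625_000))    # prints a warning string
print(offset_warning(1_000_000, 625_000))  # None (offset is sufficient)
```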

As a rule of thumb, 0.125 ms of offset should be allowed for each device (talker, switch, listener) along the path.

In AVB networks, the latency is always specified by the talker and guaranteed by the switches along the way and by the listener. This behavior is plug and play and does not require any user interaction or monitoring for the entire duration of the stream.
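The rule of thumb of 0.125 ms per device can be turned into a quick estimate. The sketch below assumes a single path with one talker, one listener and a given number of switches in between; the function name is hypothetical.

```python
PER_DEVICE_MS = 0.125  # rule-of-thumb allowance per device on the path


def estimated_transit_ms(switches: int) -> float:
    """Estimate the offset for a path of talker + switches + listener."""
    devices = switches + 2  # assumption: exactly one talker and one listener
    return devices * PER_DEVICE_MS


for switches in (1, 3, 6):
    print(switches, estimated_transit_ms(switches))
# 1 0.375
# 3 0.625
# 6 1.0
```

These estimates land close to the approximate 0.3 ms, 0.6 ms and 1 ms columns of Table 1.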