Left: Fostex NetCIRA ES6300 – active speaker receiver which receives audio data converted to EtherSound protocol; right: Fostex NetCIRA ES-2PRO – EtherSound to analog audio converter
EtherSound is anaudio-over-Ethernet technology foraudio engineering andbroadcast engineering applications. EtherSound is developed and licensed byDigigram.[3] EtherSound is intended by the developer to be compliant withIEEE 802.3 Ethernet standards.[4] Just as the IEEE defines rates such as100 Megabit andGigabit Ethernet standards, EtherSound has been developed as both ES-100 (for use on dedicated 100 Megabit Ethernet networks or within a Gigabit network as a VLAN) and ES-Giga (for use on dedicated Gigabit Ethernet networks). The two versions of EtherSound are not compatible.
While Ethersound is compliant with the IEEE 802.3physical layer standards, logically it uses atoken passing scheme for transporting audio data which prevents all of its features from being used on a standard Ethernet network. On a standard network, it is only able to distribute audio and control data one way. It is not designed to share EthernetLANs with typical office operations data or Internet traffic such as email. It supports two-way communications only when wired in a daisy chain topology.[5] For this reason Ethersound is best used in applications suitable to a daisy chain network topology or in live sound applications that benefit from its low point-to-point latency.
Low latency is important for many users ofaudio over Ethernet technologies.[6][note 2] EtherSound can deliver up to 64 channels of 48 kHz, 24-bitPCM audio data with a network latency of 125microseconds.[2]If A/D and D/A conversions are included, this latency is about 1.5 milliseconds, the major part ofthis latency being caused by the converters. Each device in adaisy-chain network adds 1.4 microseconds of latency.[1]EtherSound's network latency is stable and deterministic; The delay between any two devices on an EtherSound network can be calculated.
^Audio may only be passed unidirectionally through ahub orswitch.
^One of the most critical applications involves picking up sound from a vocalist's or instrumentalist's microphone on a live performance stage, mixing that signal with those received from other microphones (and performers) and delivering the mix to the performer viain-ear monitors. Latency in this application is particularly annoying to vocalists. This is because a singer hears his or her own voice throughbone conduction as well as through the outer ear. If the sound from the in-ear monitor lags the bone-conduction sound by more than a few milliseconds,phase shifts andcomb filtering will become audible.