Understanding QoS Data and Terminology

Question

Accepted Answer

Please reference the definitions below regarding terms used with QoS Monitoring and troubleshooting call issues. 
Note: If you would like more information on how QoS Monitor works or how to use QoS Monitor, please visit our How QoS Monitor Works and How to Use QoS Monitor articles. 
QoS (Quality of Service) - The ability of a network (including applications, hosts, and infrastructure devices) to deliver traffic with minimum delay and maximum availability. 
MOS (Mean Opinion Score) - A measurement of the subjective quality of human speech, represented as a rating index (4.5 being the highest possible score). MOS is derived by taking the average of numerical scores given by juries to rate quality and using it as a quantitative indicator of system performance. 
Packet Loss - A term used to indicate the loss of data packets during transmission over a computer network. This may happen on account of high network latency or on account of overloading of switches or routers that are unable to process or route all the incoming data. 
In the example image below, five packets are being sent, yet only packets 1,2, 4, and 5 are received, resulting in Packet Loss. Audibly, Packet Loss can be audibly described as parts of audio cutting in and out. Packet loss is one of the most common QoS issues that can occur. It’s important to keep in mind each voice packet contains 20ms of audio, meaning losing 1 or 2 packets may not be noticeable. However, the severity of the issue may depend on the codec being used, the frequency of packet loss, packet loss concealment, and other potential factors.  
Delay/Lag - These are terms used to indicate the extra time taken by a packet of data to travel from the source computer to the destination computer and back again. The lag may be caused by poor networking or by inefficient or excessive processing. 
In the example image below, there are five packets being sent with 120ms of delay, and though each packet is being received with 120ms of delay, they are received in sequential order.  It’s important to keep in mind there is always going to be some delay from one end to the other (Even on LAN). Delay is not as common of an issue as it used to be, and it’s not as easy to detect by ear alone unless it is severe.

Jitter - A term used to indicate a momentary fluctuation in the transmission signal. This happens in computing when a data packet arrives either ahead or behind a standard clock cycle. In telecommunication, it may result from an abrupt variation in signal characteristics, such as the interval between successive pulses. In simple terms, when Jitter occurs, packets are received out of their intended order. Audibly, Jitter can sometimes sound like a “robot voice.”
In the example image below, five packets are being sent in sequential order; however, some packets have a larger delay than others, and the result is that the packets are received out of their intended order.

PDV (Packet Delay Variation) - The difference in end-to-end one-way delay between selected packets in a flow with any lost packets being ignored. The effect is sometimes incorrectly referred to as ‘packet jitter.’ PDV is typically caused by network congestion that causes packets to be sent in a delayed fashion. Audibly, PDV will sound very similar to packet loss.
In the example image below, five packets are being sent in sequential order; however, the receiving party gets packet 1 at the intended time, but packets 2, 3, and 4 are received at the exact same time, and packet 5 is received immediately after.

Jitter Buffer – A buffer used to counter jitter introduced by queuing in packet-switched networks to ensure continuous playout of an audio or video media stream transmitted over the network. Similar to how YouTube videos buffer and load content up until it can be played, a Jitter Buffer stores up enough audio until there is enough to get consistent playback. By default, a phone/endpoint will store 70ms-120ms of audio. In the case where we have either Jitter or Packet Delay Variation (PDV), a Jitter Buffer will re-order the packets correctly and then dispatch them to the receiving end.
In the example image below, five packets are sent in sequential order; however, as they arrive out of order, the Jitter Buffer takes the allotted 120ms to re-order the packets correctly and then sends them to the receiving end in the correct order.

RTP (Real-time Transport Protocol) - A network protocol for delivering audio and video over IP networks. RTP is used in communication and entertainment systems that involve streaming media, such as telephony, and video teleconference applications, including television services, and web-based push-to-talk features.
RTCP (Real-time Transport Control Protocol) - A sister protocol of the Real-time Transport Protocol (RTP). RTCP provides out-of-band statistics and control information for an RTP session. It partners with RTP in the delivery and packaging of multimedia data but does not transport any media data itself.
RTCP-XR (Real-time Transport Control Protocol – Extended Reports) - An extension to regular RTCP, but includes a VoIP Metrics Block, which gives more specific detail about VoIP Quality. 
Post Dial Delay (PDD) - Post Dial Delay is experienced by the customer originating the call from the time the final digit is dialed to the point at which they hear ringtone or other in-band information. Where the originating network is required to play an announcement before completing the call, then this definition of PDD excludes the duration of such announcements. 
ACD - The average call duration is a measurement that reflects the average length of telephone calls. 
ASR (Answer Seizure Ratio) - ASR is a measure of network quality defined in ITU SG2 Recommendation E.411. It is calculated by taking the number of successfully answered calls and dividing it by the total number of calls attempted (seizures). Since busy signals and other rejections by the called number count as call failures, the calculated ASR value can vary depending on user behavior.
PLC (Packet Loss Concealment) -  PLC is a technique to mask the effects of packet loss in VoIP communications. Because the voice signal is sent as packets on a VoIP network, they may travel different routes to get to the destination. At the receiver, a packet might arrive very late, corrupt, or simply might not arrive. One of the cases in which the last situation could happen is where a packet is rejected by a server that has a full buffer and cannot accept any more data. In a VoIP connection, error-control techniques such as ARQ are not feasible, and the receiver should be able to cope with packet loss. 
PLC Techniques 
Zero Insertion: The lost speech frames are replaced with zero 
Waveform Substitution: The missing gap is reconstructed by repeating a portion of the already received speech. The simplest form of this would be to repeat the last received frame. Other techniques account for fundamental frequency, gap duration, etc. Waveform substitution methods are popular because of their simplicity to understand and implement. An example of such an algorithm is proposed in ITU recommendation G.711 Appendix I. 
Model-based Methods: An increasing number of algorithms that take advantage of speech models of interpolating and extrapolating speech gaps are being introduced and developed. 
MOS Score Percentiles 
Some values are expressed as %95 or %99, which is the 95th percentile, respectively 99th percentile. For example, if the MOS score %95 is 3, it tells that at least 5% of all calls have a MOS score of 3 or worse. It is better to pay attention to %95 or %99 rather than average or min/max values because average/min/max does not tell well that 5% of all calls are bad. 
As an example of how the percentile is calculated for MOS score, let's say we have 100 calls where only the last five calls have MOS scores: 3.1, 2.5, 3.2, 1.0, 2.9. 
Order all MOS calls by the best MOS score to the lowest (e.g., 4.5, 4.5, ..., 3.2, 3.1, 2.9, 2.5, 1.0) 
Remove the top 95% of all calls (in this example: 3.2, 3.1, 2.9, 2.5, 1.0) 
Take the first number from the left of the remaining 5%, which in this example, is 3.2.
In this example, the MOS score in the 95th percentile is 3.2. The average MOS score is 4.4, Min is 1.0, and Max is 4.5. As you can see, the average/min/max is not that useful, but the %95 percentile says that we have a problem with 5% of all calls.