Movatterモバイル変換


[0]ホーム

URL:


RFC 8698NADAFebruary 2020
Zhu, et al.Experimental[Page]
Stream:
Internet Engineering Task Force (IETF)
RFC:
8698
Category:
Experimental
Published:
ISSN:
2070-1721
Authors:
X. Zhu
Cisco Systems
R. Pan
Intel Corporation
M. Ramalho
AcousticComms
S. Mena
Cisco Systems

RFC 8698

Network-Assisted Dynamic Adaptation (NADA): A Unified Congestion Control Scheme for Real-Time Media

Abstract

This document describes Network-Assisted Dynamic Adaptation (NADA), a novel congestion control scheme for interactive real-time media applications such as video conferencing. In the proposed scheme, the sender regulates its sending rate, based on either implicit or explicit congestion signaling, in a unified approach. The scheme can benefit from Explicit Congestion Notification (ECN) markings from network nodes. It also maintains consistent sender behavior in the absence of such markings by reacting to queuing delays and packet losses instead.

Status of This Memo

This document is not an Internet Standards Track specification; it is published for examination, experimental implementation, and evaluation.

This document defines an Experimental Protocol for the Internet community. This document is a product of the Internet Engineering Task Force (IETF). It represents the consensus of the IETF community. It has received public review and has been approved for publication by the Internet Engineering Steering Group (IESG). Not all documents approved by the IESG are candidates for any level of Internet Standard; see Section 2 of RFC 7841.

Information about the current status of this document, any errata, and how to provide feedback on it may be obtained athttps://www.rfc-editor.org/info/rfc8698.

Copyright Notice

Copyright (c) 2020 IETF Trust and the persons identified as the document authors. All rights reserved.

This document is subject to BCP 78 and the IETF Trust's Legal Provisions Relating to IETF Documents (https://trustee.ietf.org/license-info) in effect on the date of publication of this document. Please review these documents carefully, as they describe your rights and restrictions with respect to this document. Code Components extracted from this document must include Simplified BSD License text as described in Section 4.e of the Trust Legal Provisions and are provided without warranty as described in the Simplified BSD License.

Table of Contents

1.Introduction

Interactive real-time media applications introduce a unique set of challenges for congestion control. Unlike TCP, the mechanism used for real-time media needs to adapt quickly to instantaneous bandwidth changes, accommodate fluctuations in the output of video encoder rate control, and cause low queuing delay over the network. An ideal scheme should also make effective use of all types of congestion signals, including packet loss, queuing delay, and explicit congestion notification (ECN)[RFC3168] markings. The requirements for the congestion control algorithm are outlined in[RMCAT-CC]. The requirements highlight that the desired congestion control scheme should 1) avoid flow starvation and attain a reasonable fair share of bandwidth when competing against other flows, 2) adapt quickly, and 3) operate in a stable manner.

This document describes an experimental congestion control scheme called Network-Assisted Dynamic Adaptation (NADA). The design of NADA benefits from explicit congestion control signals (e.g., ECN markings) from the network, yet also operates when only implicit congestion indicators (delay and/or loss) are available. Such a unified sender behavior distinguishes NADA from other congestion control schemes for real-time media. In addition, its core congestion control algorithm is designed to guarantee stability for path round-trip times (RTTs) below a prescribed bound (e.g., 250 ms with default parameter choices). It further supports weighted bandwidth sharing among competing video flows with different priorities. The signaling mechanism consists of standard Real-time Transport Protocol (RTP) timestamp[RFC3550] and Real-time Transport Control Protocol (RTCP) feedback reports. The definition of the desired RTCP feedback message is described in detail in[RTCP-FEEDBACK] so as to support the successful operation of several congestion control schemes for real-time interactive media.

2.Terminology

The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "NOT RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in BCP 14[RFC2119][RFC8174] when, and only when, they appear in all capitals, as shown here.

3.System Overview

Figure 1 shows the end-to-end system for real-time media transport that NADA operates in. Note that there also exist network nodes along the reverse (potentially uncongested) path that the RTCP feedback reports traverse. Those network nodes are not shown in the figure for the sake of brevity.

  +---------+  r_vin  +--------+        +--------+     +----------+  |  Media  |<--------|  RTP   |        |Network |     |   RTP    |  | Encoder |========>| Sender |=======>|  Node  |====>| Receiver |    +---------+  r_vout +--------+ r_send +--------+     +----------+                          /|\                                |                           |                                 |                           +---------------------------------+                                 RTCP Feedback Report
Figure 1:System Overview
Media encoder with rate control capabilities:
Encodes raw media (audio and video) frames into a compressed bitstreamthat is later packetized into RTP packets. As discussed in[RFC8593], theactual output rate from the encoder r_vout may fluctuate around the targetr_vin. Furthermore, it is possible that the encoder can only react to bit ratechanges at rather coarse time intervals, e.g., once every 0.5 seconds.
RTP sender:
Responsible for calculating the NADA reference rate based on networkcongestion indicators (delay, loss, or ECN marking reports from the receiver),for updating the video encoder with a new target rate r_vin and forregulating the actual sending rate r_send accordingly. The RTP sender alsogenerates a sending timestamp for each outgoing packet.
RTP receiver:
Responsible for measuring and estimating end-to-end delay (based on sendertimestamp), packet loss (based on RTP sequence number), ECN marking ratios(based on[RFC6679]), and receiving rate (r_recv) of theflow. It calculatesthe aggregated congestion signal (x_curr) that accounts for queuing delay, ECNmarkings, and packet losses. The receiver also determines the mode for senderrate adaptation (rmode) based on whether the flow has encountered any standingnon-zero congestion. The receiver sends periodic RTCP reports back to thesender, containing values of x_curr, rmode, and r_recv.
Network node with several modes of operation:
The system can work with the default behavior of a simple drop-tailqueue. It can also benefit from advanced Active Queue Management (AQM)features such as Proportional Integral Controller Enhanced(PIE) [RFC8033], Flow Queue Controlling Queue Delay(FQ-CoDel) [RFC8290], ECNmarking based onRandom Early Detection (RED) [RFC7567],and Pre-Congestion Notification (PCN) marking using atoken bucket algorithm[RFC6660]. Note that network nodeoperation is out of scope for the design of NADA.

4.Core Congestion Control Algorithm

Like TCP-Friendly Rate Control (TFRC)[FLOYD-CCR00][RFC5348], NADA is a rate-based congestion control algorithm. In its simplest form, the sender reacts to the collection of network congestion indicators in the form of an aggregated congestion signal and operates in one of two modes:

Accelerated ramp up:
When the bottleneck is deemed to be underutilized, the rate increasesmultiplicatively with respect to the rate of previously successfultransmissions. The rate increase multiplier (gamma) is calculated based onthe observed round-trip time and target feedback interval, so as to limitself-inflicted queuing delay.
Gradual rate update:
In the presence of a non-zero aggregate congestion signal, the sendingrateis adjusted in reaction to both its value (x_curr) and its change in value(x_diff).

This section introduces the list of mathematical notations and describes the core congestion control algorithm at the sender and receiver, respectively. Additional details on recommended practical implementations are described in Sections5.1 and5.2.

4.1.Mathematical Notations

This section summarizes the list of variables and parameters used in the NADA algorithm.Table 2 also includes the default values for choosing the algorithm parameters to represent either a typical setting in practical applications or a setting based on theoretical and simulation studies. SeeSection 6.3 for some of the discussions on the impact of parameter values. Additional studies in real-world settings suggested inSection 8 could gather further insight on how to choose and adapt these parameter values in practical deployment.

Table 1:List of Variables
NotationVariable Name
t_currCurrent timestamp
t_lastLast time sending/receiving a feedback message
deltaObserved interval between current and previous feedback reports: delta = t_curr-t_last
r_refReference rate based on network congestion
r_sendSending rate
r_recvReceiving rate
r_vinTarget rate for video encoder
r_voutOutput rate from video encoder
d_baseEstimated baseline delay
d_fwdMeasured and filtered one-way delay
d_queueEstimated queuing delay
d_tildeEquivalent delay after non-linear warping
p_markEstimated packet ECN marking ratio
p_lossEstimated packet loss ratio
x_currAggregate congestion signal
x_prevPrevious value of aggregate congestion signal
x_diffChange in aggregate congestion signal w.r.t. its previous value: x_diff = x_curr - x_prev
rmodeRate update mode: (0 = accelerated ramp up; 1 = gradual update)
gammaRate increase multiplier in accelerated ramp-up mode
loss_intMeasured average loss interval in packet count
loss_expThreshold value for setting the last observed packet loss to expiration
rttEstimated round-trip time at sender
buffer_lenRate-shaping buffer occupancy measured in bytes
Table 2:List of Algorithm Parameters and Their Default Values
NotationParameter NameDefault Value
PRIOWeight of priority of the flow1.0
RMINMinimum rate of application supported by media encoder150 Kbps
RMAXMaximum rate of application supported by media encoder1.5 Mbps
XREFReference congestion level10 ms
KAPPAScaling parameter for gradual rate update calculation0.5
ETAScaling parameter for gradual rate update calculation2.0
TAUUpper bound of RTT in gradual rate update calculation500 ms
DELTATarget feedback interval100 ms
LOGWINObservation window in time for calculating packet summary statistics at receiver500 ms
QEPSThreshold for determining queuing delay buildup at receiver10 ms
DFILTBound on filtering delay120 ms
GAMMA_MAXUpper bound on rate increase ratio for accelerated ramp up0.5
QBOUNDUpper bound on self-inflicted queuing delay during ramp up50 ms
MULTILOSSMultiplier for self-scaling the expiration threshold of the last observed loss (loss_exp) based on measured average loss interval (loss_int)7.0
QTHDelay threshold for invoking non-linear warping50 ms
LAMBDAScaling parameter in the exponent of non-linear warping0.5
PLRREFReference packet loss ratio0.01
PMRREFReference packet marking ratio0.01
DLOSSReference delay penalty for loss when packet loss ratio is at PLRREF10 ms
DMARKReference delay penalty for ECN marking when packet marking is at PMRREF2 ms
FPSFrame rate of incoming video30
BETA_SScaling parameter for modulating outgoing sending rate0.1
BETA_VScaling parameter for modulating video encoder target rate0.1
ALPHASmoothing factor in exponential smoothing of packet loss and marking ratios0.1

4.2.Receiver-Side Algorithm

The receiver-side algorithm can be outlined as below:

  • On initialization:
    • set d_base = +INFINITY
    • set p_loss = 0
    • set p_mark = 0
    • set r_recv = 0
    • set both t_last and t_curr as current time in milliseconds
  • On receiving a media packet:
    • obtain current timestamp t_curr from system clock
    • obtain from packet header sending time stamp t_sent
    • obtain one-way delay measurement: d_fwd = t_curr - t_sent
    • update baseline delay: d_base = min(d_base, d_fwd)
    • update queuing delay: d_queue = d_fwd - d_base
    • update packet loss ratio estimate p_loss
    • update packet marking ratio estimate p_mark
    • update measurement of receiving rate r_recv
  • On time to send a new feedback report (t_curr - t_last > DELTA):
    • calculate non-linear warping of delay d_tilde if packet loss exists
    • calculate current aggregate congestion signal x_curr
    • determine mode of rate adaptation for sender: rmode
    • send feedback containing values of: rmode, x_curr, and r_recv
    • update t_last = t_curr

In order for a delay-based flow to hold its ground when competing against loss-based flows (e.g., loss-based TCP), it is important to distinguish between different levels of observed queuing delay. For instance, over wired connections, a moderate queuing delay value on the order of tens of milliseconds is likely self-inflicted or induced by other delay-based flows, whereas a high queuing delay value of several hundreds of milliseconds may indicate the presence of a loss-based flow that does not refrain from increased delay.

If the last observed packet loss is within the expiration window of loss_exp (measured in terms of packet counts), theestimated queuing delay follows a non-linear warping:

           / d_queue,                   if d_queue < QTH           |d_tilde = <                                           (1)           |                  (d_queue-QTH)             \ QTH exp(-LAMBDA ---------------) , otherwise                                 QTH

In Equation (1), the queuing delay value is unchanged when it is belowthe first threshold QTH; otherwise, it is scaled down followinga non-linear curve. This non-linear warping is inspired bythe delay-adaptive congestion window backoff policy in[BUDZISZ-AIMD-CC] so as to "gradually nudge"the controller to operate based on loss-induced congestionsignals when competing against loss-based flows. The exact formof the non-linear function has been simplified with respect to[BUDZISZ-AIMD-CC]. The value of the thresholdQTH should be carefully tuned for different operational environmentsso as to avoid potential risks of prematurely discounting the congestionsignal level. Typically, a higher value of QTH is required in anoisier environment (e.g., over wireless connections or where thevideo stream encounters many time-varying background competing traffic)so as to stay robust against occasional non-congestion-induced delayspikes. Additional insights on how this value can be tuned or auto-tunedshould be gathered from carrying out experimental studies in differentreal-world deployment scenarios.

The value of loss_exp is configured to self-scale with the average packet loss interval loss_int with a multiplier MULTILOSS:

 loss_exp = MULTILOSS *loss_int.

Estimation of the average loss interval loss_int, in turn, followsSection 5.4 of "TCP Friendly Rate Control (TFRC): Protocol Specification" [RFC5348].

In practice, it is recommended to linearly interpolate between thewarped (d_tilde) and non-warped (d_queue) values of the queuing delayduring the transitional period lasting for the duration of loss_int.

The aggregate congestion signal is:

                         / p_mark \^2        / p_loss \^2 x_curr = d_tilde + DMARK*|--------|  + DLOSS*|--------|   (2)                          \ PMRREF /          \ PLRREF /

Here, DMARK is prescribed a reference delay penalty associated with ECN markings at the reference marking ratio of PMRREF; DLOSS is prescribed a reference delay penalty associated with packet losses at the reference packet loss ratio of PLRREF. The value of DLOSS and DMARK does not depend on configurations at the network node. Since ECN-enabled active queue management schemes typically mark a packet before dropping it, the value of DLOSSSHOULD be higher than that of DMARK. Furthermore, the values of DLOSS and DMARK need to be set consistently across all NADA flows sharing the same bottleneck link so that they can compete fairly.

In the absence of packet marking and losses, the value of x_curr reduces to the observed queuing delay d_queue. In that case, the NADA algorithm operates in the regime of delay-based adaptation.

Given observed per-packet delay and loss information, the receiver is also in a good position to determine whether or not the network is underutilized and then recommend the corresponding rate adaptation mode for the sender. The criteria for operating in accelerated ramp-up mode are:

  • No recent packet losses within the observation window LOGWIN; and
  • No buildup of queuing delay: d_fwd-d_base < QEPS for all previous delay samples within the observation window LOGWIN.

Otherwise, the algorithm operates in graduate update mode.

4.3.Sender-Side Algorithm

The sender-side algorithm is outlined as follows:

  • On initialization:
    • set r_ref = RMIN
    • set rtt = 0
    • set x_prev = 0
    • set t_last and t_curr as current system clock time
  • On receiving feedback report:
    • obtain current timestamp from system clock: t_curr
    • obtain values of rmode, x_curr, and r_recv from feedback report
    • update estimation of rtt
    • measure feedback interval: delta = t_curr - t_last
    • if rmode == 0:
      • update r_ref following accelerated ramp-up rules
    • else:
      • update r_ref following gradual update rules
    • clip rate r_ref within the range of minimum rate (RMIN) and maximum rate(RMAX).
    • set x_prev = x_curr
    • set t_last = t_curr

In accelerated ramp-up mode, the rate r_ref is updated as follows:

                                QBOUND    gamma = min(GAMMA_MAX, ------------------)       (3)                            rtt+DELTA+DFILT                            r_ref = max(r_ref, (1+gamma) r_recv)                            (4)

The rate increase multiplier gamma is calculated as a function of the upper bound of self-inflicted queuing delay (QBOUND), round-trip time (rtt), and target feedback interval (DELTA); it is bound on the filtering delay for calculating d_queue (DFILT). It has a maximum value of GAMMA_MAX. The rationale behind Equations (3)-(4) is that the longer it takes for the sender to observe self-inflicted queuing delay buildup, the more conservative the sender should be in increasing its rate and, hence, the smaller the rate increase multiplier.

In gradual update mode, the rate r_ref is updated as:

    x_offset = x_curr - PRIO*XREF*RMAX/r_ref          (5)    x_diff   = x_curr - x_prev                        (6)                            delta    x_offset    r_ref = r_ref - KAPPA*-------*------------*r_ref                            TAU       TAU                                x_diff                  - KAPPA*ETA*---------*r_ref         (7)                                 TAU

The rate changes in proportion to the previous rate decision.It is affected by two terms: the offset of the aggregate congestionsignal from its value at equilibrium (x_offset) and its change(x_diff). The calculation of x_offset depends on the maximum rateof the flow (RMAX), its weight of priority (PRIO), as wellas a reference congestion signal (XREF). The value ofXREF is chosen so that the maximum rate of RMAX can be achievedwhen the observed congestion signal level is below PRIO*XREF.

At equilibrium, the aggregated congestion signal stabilizes atx_curr = PRIO*XREF*RMAX/r_ref. This ensures that when multipleflows share the same bottleneck and observe a common value ofx_curr, their rates at equilibrium will be proportional to theirrespective priority levels (PRIO) and the range between minimumand maximum rate. Values of the minimum rate (RMIN) and maximum rate (RMAX) will be provided by the media codec,for instance, as outlined by[RMCAT-CC-RTP]. In the absence of such information, the NADA sender willchoose a default value of 0 for RMIN and 3 Mbps for RMAX.

As mentioned in the sender-side algorithm, the final rateis always clipped within the dynamic range specified by theapplication:

    r_ref = min(r_ref, RMAX)                         (8)    r_ref = max(r_ref, RMIN)                         (9)

The above operations ignore many practical issues such as clocksynchronization between sender and receiver, the filtering of noise indelay measurements, and base delay expiration. These will be addressedinSection 5.

5.Practical Implementation of NADA

5.1.Receiver-Side Operation

The receiver continuously monitors end-to-end per-packetstatistics in terms of delay, loss, and/or ECN marking ratios.It then aggregates all forms of congestion indicators into theform of an equivalent delay and periodically reports this backto the sender. In addition, the receiver tracks the receivingrate of the flow and includes that in the feedback message.

5.1.1.Estimation of One-Way Delay and Queuing Delay

The delay estimation process in NADA follows an approach similar to that ofearlierdelay-based congestion control schemes, such as Low Extra Delay BackgroundTransport (LEDBAT)[RFC6817]. Forexperimental implementations, instead of relying on RTP timestamps and thetransmission time offset RTP header extension[RFC5450], the NADA sender can generate its own timestamp based onthe local system clock and embed that information in the transport packetheader. The NADA receiver estimates the forward delay as having a constantbase delay component plus a time-varying queuing delay component. The basedelay is estimated as the minimum value of one-way delay observed over arelatively long period (e.g., tens of minutes), whereas the individualqueuing delay value is taken to be the difference between one-way delay andbase delay. By re-estimating the base delay periodically, one can avoid thepotential issue of base delay expiration, whereby an earlier measured basedelay value is no longer valid due to underlying route changes or a cumulativetiming difference introduced by the clock-rate skew between sender andreceiver. All delay estimations are based on sender timestamps with arecommended granularity of 100 microseconds or finer.

The individual sample values of queuing delay should be furtherfiltered against various non-congestion-induced noise, such asspikes due to a processing "hiccup" at the network nodes. Therefore,in addition to calculating the value of queuing delay usingd_queue = d_fwd - d_base, as expressed inSection 5.1, the current implementation further employs a minimum filter witha window size of 15 samples over per-packet queuing delay values.

5.1.2.Estimation of Packet Loss/Marking Ratio

The receiver detects packet losses via gaps in theRTP sequence numbers of received packets. For interactivereal-time media applications with stringent latencyconstraints (e.g., video conferencing), the receiver avoidsthe packet reordering delay by treating out-of-order packetsas losses. The instantaneous packet loss ratio p_inst is estimatedas the ratio between the number of missing packets overthe number of total transmitted packets within therecent observation window LOGWIN. The packet loss ratiop_loss is obtained after exponential smoothing:

                p_loss = ALPHA*p_inst + (1-ALPHA)*p_loss        (10)

The filtered result is reported back to the sender asthe observed packet loss ratio p_loss.

The estimation of the packet marking ratio p_mark follows the same procedureas above. It is assumed that ECN marking information at the IP headercan be passed to the receiving endpoint, e.g., by following the mechanismdescribed in[RFC6679].

5.1.3.Estimation of Receiving Rate

It is fairly straightforward to estimate the receiving rate r_recv. NADAmaintains a recent observation window with a time span of LOGWIN and simplydivides the total size of packets arriving during that window over the timespan. The receiving rate (r_recv) can be either calculated at the sender sidebased on the per-packet feedback from the receiver or included as part of thefeedback report.

5.2.Sender-Side Operation

Figure 2 provides a detailedview of the NADA sender. Upon receipt of an RTCP feedbackreport from the receiver, the NADA sender calculates thereference rate r_ref as specified inSection 4.3.It further adjusts both the target rate for the live videoencoder r_vin and the sending rate r_send over the networkbased on the updated value of r_ref and rate-shaping bufferoccupancy buffer_len.

The NADA sender behavior stays the same in the presenceof all types of congestion indicators: delay, loss, andECN marking. This unified approach allows a gracefultransition of the scheme as the network shifts dynamicallybetween light and heavy congestion levels.

                   +----------------+                                |  Calculate     | <---- RTCP report                   | Reference Rate |                    -----------------+                           | r_ref              +------------+-------------+              |                          |             \|/                        \|/      +-----------------+           +---------------+      | Calculate Video |           |   Calculate   |      |  Target Rate    |           | Sending Rate  |      +-----------------+           +---------------+          |        /|\                 /|\      |       r_vin |         |                   |       |         \|/        +-------------------+       |             +----------+          | buffer_len        |  r_send      |  Video   | r_vout  -----------+        \|/      |  Encoder |-------->   |||||||||=================>        +----------+         -----------+    RTP packets           Rate-Shaping Buffer
Figure 2:NADA Sender Structure

5.2.1.Rate-Shaping Buffer

The operation of the live video encoder is out of the scopeof the design for the congestion control scheme in NADA.Instead, its behavior is treated as a black box.

A rate-shaping buffer is employed to absorb any instantaneousmismatch between the encoder rate output r_vout and the regulated sendingrate r_send. Its current level of occupancy is measured in bytesand is denoted as buffer_len.

A large rate-shaping buffer contributes to higherend-to-end delay, which may harm the performance ofreal-time media communications. Therefore, the senderhas a strong incentive to prevent the rate-shapingbuffer from building up. The mechanisms adopted are:

  • To deplete the rate-shaping buffer faster by increasing the sending rate r_send; and
  • To limit incoming packets of the rate-shaping buffer by reducing the video encoder target rate r_vin.

5.2.2.Adjusting Video Target Rate and Sending Rate

If the level of occupancy in the rate-shaping buffer is accessibleat the sender, such information can be leveraged to further adjustthe target rate of the live video encoder r_vin as well as theactual sending rate r_send. The purpose of such adjustments is tomitigate the additional latencies introduced by the rate-shapingbuffer. The amount of rate adjustment can be calculated as follows:

    r_diff_v = min(0.05*r_ref, BETA_V*8*buffer_len*FPS)     (11)     r_diff_s = min(0.05*r_ref, BETA_S*8*buffer_len*FPS)     (12)     r_vin  = max(RMIN, r_ref - r_diff_v)                    (13)     r_send = min(RMAX, r_ref + r_diff_s)                    (14)

In Equations (11) and (12), the amount of adjustment is calculatedas proportional to the size of the rate-shaping buffer but isbounded by 5% of the reference rate r_ref calculated from networkcongestion feedback alone. This ensures that the adjustmentintroduced by the rate-shaping buffer will not counteract with the corecongestion control process. Equations (13) and (14) indicatethe influence of the rate-shaping buffer. A largerate-shaping buffer nudges the encoder target rate slightlybelow (and the sending rate slightly above) the referencerate r_ref. The final video target rate (r_vin) and sendingrate (r_send) are further bounded within the original range of[RMIN, RMAX].

Intuitively, the amount of extra rate offset needed to completelydrain the rate-shaping buffer within the duration of a singlevideo frame is given by 8*buffer_len*FPS, where FPS standsfor the reference frame rate of the video. The scaling parametersBETA_V and BETA_S can be tuned to balance between the competinggoals of maintaining a small rate-shaping buffer and deviatingfrom the reference rate point. Empirical observations show thatthe rate-shaping buffer for a responsive live video encoder typicallystays empty and only occasionally holds a large frame (e.g., whenan intra-frame is produced) in transit. Therefore, the rate adjustmentintroduced by this mechanism is expected to be minor. For instance,a rate-shaping buffer of 2000 bytes will lead to a rate adjustmentof 48 Kbps given the recommended scaling parameters of BETA_V = 0.1and BETA_S = 0.1, and the reference frame rate of FPS = 30.

5.3.Feedback Message Requirements

The following list of information is required forNADA congestion control to function properly:

Recommended rate adaptation mode (rmode):
A 1-bit flag indicating whether the sender should operate in acceleratedramp-up mode (rmode=0) or gradual update mode (rmode=1).
Aggregated congestion signal (x_curr):
The most recently updated value, calculated by the receiver according toSection 4.2. This informationcan be expressed with a unit of 100 microseconds (i.e., 1/10 of a millisecond)in 15 bits. This allows a maximum value of x_curr at approximately 3.27seconds.
Receiving rate (r_recv):
The most recently measured receiving rate according toSection 5.1.3. This information isexpressed with a unit of bits per second (bps) in 32 bits (unsigned int). Thisallows a maximum rate of approximately 4.3 Gbps, approximately 1000 times thestreaming rate of a typical high-definition (HD) video conferencing sessiontoday. This field can be expanded further by a few more bytes if an evenhigher rate needs to be specified.

The above list of information can be accommodated by 48 bits, or 6 bytes, in total. They can be either included in the feedback report from the receiver or, in the case where all receiver-side calculations are moved to the sender, derived from per-packet information from the feedback message as defined in[RTCP-FEEDBACK]. Choosing the feedback message interval DELTA is discussed inSection 6.3. A target feedback interval of DELTA = 100 ms is recommended.

6.Discussions and Further Investigations

This section discusses the various design choicesmade by NADA, potential alternative variants of itsimplementation, and guidelines on how the key algorithmparameters can be chosen.Section 8recommends additional experimental setups to further explore these topics.

6.1.Choice of Delay Metrics

The current design works with relative one-way delay (OWD) as the mainindication of congestion. The value of the relative OWD is obtained bymaintaining the minimum value of observed OWD over a relatively long timehorizon and subtracting that out from the observed absolute OWD value. Such anapproach cancels out the fixed difference between the sender and receiverclocks. It has been widely adopted by other delay-based congestion controlapproaches such as[RFC6817]. As discussed in[RFC6817], the time horizon for tracking theminimum OWD needs to be chosen with care; it must be long enough for anopportunity to observe the minimum OWD with zero standing queue along thepath,and it must be sufficiently short enough to timely reflect "true" changes inminimum OWD introduced by route changes and other rare events andto mitigate the cumulative impact of clock rate skew over time.

The potential drawback in relying on relative OWD as the congestionsignal is that when multiple flows share the same bottleneck, theflow arriving late at the network experiencing a non-empty queue maymistakenly consider the standing queuing delay as part of the fixedpath propagation delay. This will lead to slightly unfair bandwidthsharing among the flows.

Alternatively, one could move the per-packet statistical handlingto the sender instead and use relative round-trip time (RTT) in lieuof relative OWD, assuming that per-packet acknowledgments are available.The main drawback of an RTT-based approach is the noise in the measured delayin the reverse direction.

Note that the choice of either delay metric (relative OWD vs. RTT) involves nochange in the proposed rate adaptation algorithm. Therefore, comparing thepros and cons regarding which delay metric to adopt can be kept as anorthogonal direction of investigation.

6.2.Method for Delay, Loss, and Marking Ratio Estimation

Like other delay-based congestion control schemes, performance of NADA depends on the accuracy of its delay measurement and estimation module.Appendix A of [RFC6817] provides an extensive discussion on this aspect.

The current recommended practice of applying minimum filter with a window size of 15 samples suffices in guarding against processing delay outliers observed in wired connections. For wireless connections with a higher packet delay variation (PDV), more sophisticated techniques on denoising, outlier rejection, and trend analysis may be needed.

More sophisticated methods in packet loss ratio calculation,such as that adopted by[FLOYD-CCR00],will likely be beneficial. These alternatives are part ofthe experiments this document proposes.

6.3.Impact of Parameter Values

In the gradual rate update mode, the parameter TAU indicates the upper bound of round-trip time (RTT) in the feedback control loop. Typically, the observed feedback interval delta is close to the target feedback interval DELTA, and the relative ratio of delta/TAU versus ETA dictates the relative strength of influence from the aggregate congestion signal offset term (x_offset) versus its recent change (x_diff), respectively. These two terms are analogous to the integral and proportional terms in a proportional-integral (PI) controller. The recommended choice of TAU = 500 ms, DELTA = 100 ms, and ETA = 2.0 corresponds to a relative ratio of 1:10 between the gains of the integral and proportional terms. Consequently, the rate adaptation is mostly driven by the change in the congestion signal with a long-term shift towards its equilibrium value driven by the offset term. Finally, the scaling parameter KAPPA determines the overall speed of the adaptation and needs to strike a balance between responsiveness and stability.

The choice of the target feedback interval DELTA needs to strike the rightbalance between timely feedback and low RTCP feedback message counts. A targetfeedback interval of DELTA = 100 ms is recommended, corresponding to afeedbackbandwidth of 16 Kbps with 200 bytes per feedback message -- approximately 1.6%overhead for a 1 Mbps flow. Furthermore, both simulation studies andfrequency-domain analysis in[IETF-95] haveestablished that a feedback interval below 250 ms (i.e., more frequently than4feedback messages per second) will not break up the feedback control loop ofNADA congestion control.

In calculating the non-linear warping of delay in Equation (1), the current design uses fixed values of QTH for determiningwhether to perform the non-linear warping. Its value should becarefully tuned for different operational environments (e.g.,over wired vs. wireless connections) so as to avoid the potentialrisk of prematurely discounting the congestion signal level.It is possible to adapt its value based on past observed patternsof queuing delay in the presence of packet losses. It needs to benoted that the non-linear warping mechanism may lead to multipleNADA streams stuck in loss-based mode when competing againsteach other.

In calculating the aggregate congestion signal x_curr, thechoice of DMARK and DLOSS influence the steady-state packetloss/marking ratio experienced by the flow at a givenavailable bandwidth. Higher values of DMARK and DLOSS resultin lower steady-state loss/marking ratios but are more susceptible to the impact of individual packet loss/markingevents. While the value of DMARK and DLOSS are fixed andpredetermined in the current design, this document also encouragesfurther explorations of a scheme for automaticallytuning these values based on desired bandwidth sharing behaviorin the presence of other competing loss-based flows (e.g.,loss-based TCP).

6.4.Sender-Based vs. Receiver-Based Calculation

In the current design, the aggregated congestionsignal x_curr is calculated at the receiver, keepingthe sender operation completely independent of theform of actual network congestion indications (delay, loss, or marking) in use.

Alternatively, one can shift receiver-side calculationsto the sender, whereby the receiver simply reports on per-packetinformation via periodic feedback messages as defined in[RTCP-FEEDBACK]. Such an approach enables interoperability amongst senders operatingon different congestion control schemes but requires slightlyhigher overhead in the feedback messages. See additional discussionsin[RTCP-FEEDBACK]regarding the desired format of the feedback messages and therecommended feedback intervals.

6.5.Incremental Deployment

One nice property of NADA is the consistent video endpointbehavior irrespective of network node variations. This facilitatesgradual, incremental adoption of the scheme.

Initially, the proposed congestion control mechanism canbe implemented without any explicit support from the network andrelies solely on observed relative one-way delay measurementsand packet loss ratios as implicit congestion signals.

When ECN is enabled at the network nodes with RED-based marking,the receiver can fold its observations of ECN markings into thecalculation of the equivalent delay. The sender can react to theseexplicit congestion signals without any modification.

Ultimately, networks equipped with proactive marking based on the level oftoken bucket metering can reap the additional benefits ofzero standing queues and lower end-to-end delay and workseamlessly with existing senders and receivers.

7.Reference Implementations

The NADA scheme has been implemented in both ns-2[NS-2]and ns-3[NS-3] simulation platforms. Theimplementationin ns-2 hosts the calculations as described inSection 4.2 at the receiverside,whereas the implementation in ns-3 hosts these receiver-side calculationsat the sender for the sake of interoperability. Extensive ns-2 simulationevaluations of an earlier draft version of this document are recorded in[ZHU-PV13]. An open-source implementation of NADA as part of an ns-3 module isavailable at[NS3-RMCAT].Evaluation results of this document based on ns-3 are presentedin[IETF-90] and[IETF-91]for wired test cases as documented in[RMCAT-EVAL-TEST].Evaluation results of NADA over Wi-Fi-based test cases as defined in[WIRELESS-TESTS] arepresented in[IETF-93]. These simulation-basedevaluations have shown that NADA flows can obtain their fair share ofbandwidth when competing against each other. They typically adapt fastin reaction to the arrival and departure of other flows and can sustaina reasonable throughput when competing against loss-based TCP flows.

[IETF-90] describes the implementation andevaluation of NADA in a lab setting. Preliminary evaluationresults of NADA in single-flow and multi-flow test scenariosare presented in[IETF-91].

A reference implementation of NADA has been carried out bymodifying the WebRTC module embedded in the Mozilla open-sourcebrowser. Presentations from[IETF-103]and[IETF-105] document real-world evaluationsof the modified browser driven by NADA. The experimental settinginvolves remote connections with endpoints over either home or enterprisewireless networks. These evaluations validate the effectiveness ofNADA flows in recovering quickly from throughput drops caused byintermittent delay spikes over the last-hop wireless connections.

8.Suggested Experiments

NADA has been extensively evaluated under various test scenarios, includingthe collection of test cases specified by[RMCAT-EVAL-TEST] and the subset ofWi-Fi-based test cases in[WIRELESS-TESTS]. Additional evaluations have been carried out tocharacterize how NADA interacts with various AQMschemes such as RED, Controlling Queue Delay (CoDel), and ProportionalIntegral Controller Enhanced (PIE). Most of these evaluations have beencarried out in simulators. A few key test cases have been evaluated in labenvironments with implementations embedded in video conferencing clients. Itis strongly recommended to carry out implementation and experimentation ofNADA in real-world settings. Such exercises will provide insights on how tochoose or automatically adapt the values of the key algorithm parameters (seelist inTable 2) as discussed inSection 6.

Additional experiments are suggested for the following scenarios, preferably over real-world networks:

9.IANA Considerations

This document has no IANA actions.

10.Security Considerations

The rate adaptation mechanism in NADA relies on feedback from the receiver. As such, it is vulnerable to attacks where feedback messages are hijacked, replaced, or intentionally injected with misleading information resulting in denial of service, similar to those that can affect TCP. Therefore, it isRECOMMENDED that the RTCP feedback message is at least integrity checked. In addition,[RTCP-FEEDBACK] discusses the potential risk of a receiver providing misleading congestion feedback information and the mechanisms for mitigating such risks.

The modification of the sending rate based on the sender-side rate-shaping buffer may lead to temporary excessive congestion over the network in the presence of an unresponsive video encoder. However, this effect can be mitigated by limiting the amount of rate modification introduced by the rate-shaping buffer, bounding the size of the rate-shaping buffer at the sender, and maintaining a maximum allowed sending rate by NADA.

11.References

11.1.Normative References

[RFC2119]
Bradner, S.,"Key words for use in RFCs to Indicate Requirement Levels",BCP 14,RFC 2119,DOI 10.17487/RFC2119,,<https://www.rfc-editor.org/info/rfc2119>.
[RFC3168]
Ramakrishnan, K., Floyd, S., and D. Black,"The Addition of Explicit Congestion Notification (ECN) to IP",RFC 3168,DOI 10.17487/RFC3168,,<https://www.rfc-editor.org/info/rfc3168>.
[RFC3550]
Schulzrinne, H., Casner, S., Frederick, R., and V. Jacobson,"RTP: A Transport Protocol for Real-Time Applications",STD 64,RFC 3550,DOI 10.17487/RFC3550,,<https://www.rfc-editor.org/info/rfc3550>.
[RFC5348]
Floyd, S., Handley, M., Padhye, J., and J. Widmer,"TCP Friendly Rate Control (TFRC): Protocol Specification",RFC 5348,DOI 10.17487/RFC5348,,<https://www.rfc-editor.org/info/rfc5348>.
[RFC6679]
Westerlund, M., Johansson, I., Perkins, C., O'Hanlon, P., and K. Carlberg,"Explicit Congestion Notification (ECN) for RTP over UDP",RFC 6679,DOI 10.17487/RFC6679,,<https://www.rfc-editor.org/info/rfc6679>.
[RFC8174]
Leiba, B.,"Ambiguity of Uppercase vs Lowercase in RFC 2119 Key Words",BCP 14,RFC 8174,DOI 10.17487/RFC8174,,<https://www.rfc-editor.org/info/rfc8174>.

11.2.Informative References

[BUDZISZ-AIMD-CC]
Budzisz, L., Stanojevic, R., Schlote, A., Baker, F., and R. Shorten,"On the Fair Coexistence of Loss- and Delay-Based TCP",IEEE/ACM Transactions on Networking, vol. 19, no. 6, pp. 1811-1824,DOI 10.1109/TNET.2011.2159736,,<https://doi.org/10.1109/TNET.2011.2159736>.
[FLOYD-CCR00]
Floyd, S., Handley, M., Padhye, J., and J. Widmer,"Equation-based congestion control for unicast applications",ACM SIGCOMM Computer Communications Review, vol. 30, no. 4, pp. 43-56,DOI 10.1145/347057.347397,,<https://doi.org/10.1145/347057.347397>.
[IETF-103]
Zhu, X., Pan, R., Ramalho, M., Mena, S., Jones, P., Fu, J., and S. D'Aronco,"NADA Implementation in Mozilla Browser",IETF 103,,<https://datatracker.ietf.org/meeting/103/materials/slides-103-rmcat-nada-implementation-in-mozilla-browser-00>.
[IETF-105]
Zhu, X., Pan, R., Ramalho, M., Mena, S., Jones, P., Fu, J., and S. D'Aronco,"NADA Implementation in Mozilla Browser and Draft Update",IETF 105,,<https://datatracker.ietf.org/meeting/105/materials/slides-105-rmcat-nada-update-02.pdf>.
[IETF-90]
Zhu, X., Ramalho, M., Ganzhorn, C., Jones, P., and R. Pan,"NADA Update: Algorithm, Implementation, and Test Case Evaluation Results",IETF 90,,<https://tools.ietf.org/agenda/90/slides/slides-90-rmcat-6.pdf>.
[IETF-91]
Zhu, X., Pan, R., Ramalho, M., Mena, S., Ganzhorn, C., Jones, P., and S. D'Aronco,"NADA Algorithm Update and Test Case Evaluations",IETF 91,,<https://www.ietf.org/proceedings/interim/2014/11/09/rmcat/slides/slides-interim-2014-rmcat-1-2.pdf>.
[IETF-93]
Zhu, X., Pan, R., Ramalho, M., Mena, S., Ganzhorn, C., Jones, P., D'Aronco, S., and J. Fu,"Updates on NADA",IETF 93,,<https://www.ietf.org/proceedings/93/slides/slides-93-rmcat-0.pdf>.
[IETF-95]
Zhu, X., Pan, R., Ramalho, M., Mena, S., Jones, P., Fu, J., D'Aronco, S., and C. Ganzhorn,"Updates on NADA: Stability Analysis and Impact of Feedback Intervals",IETF 95,,<https://www.ietf.org/proceedings/95/slides/slides-95-rmcat-5.pdf>.
[NS-2]
"ns-2",,<http://nsnam.sourceforge.net/wiki/index.php/Main_Page>.
[NS-3]
"ns-3 Network Simulator",<https://www.nsnam.org/>.
[NS3-RMCAT]
Fu, J., Mena, S., and X. Zhu,"Simulator of IETF RMCAT congestion control protocols",,<https://github.com/cisco/ns3-rmcat>.
[RFC5450]
Singer, D. and H. Desineni,"Transmission Time Offsets in RTP Streams",RFC 5450,DOI 10.17487/RFC5450,,<https://www.rfc-editor.org/info/rfc5450>.
[RFC6660]
Briscoe, B., Moncaster, T., and M. Menth,"Encoding Three Pre-Congestion Notification (PCN) States in the IP Header Using a Single Diffserv Codepoint (DSCP)",RFC 6660,DOI 10.17487/RFC6660,,<https://www.rfc-editor.org/info/rfc6660>.
[RFC6817]
Shalunov, S., Hazel, G., Iyengar, J., and M. Kuehlewind,"Low Extra Delay Background Transport (LEDBAT)",RFC 6817,DOI 10.17487/RFC6817,,<https://www.rfc-editor.org/info/rfc6817>.
[RFC7567]
Baker, F., Ed. and G. Fairhurst, Ed.,"IETF Recommendations Regarding Active Queue Management",BCP 197,RFC 7567,DOI 10.17487/RFC7567,,<https://www.rfc-editor.org/info/rfc7567>.
[RFC8033]
Pan, R., Natarajan, P., Baker, F., and G. White,"Proportional Integral Controller Enhanced (PIE): A Lightweight Control Scheme to Address the Bufferbloat Problem",RFC 8033,DOI 10.17487/RFC8033,,<https://www.rfc-editor.org/info/rfc8033>.
[RFC8290]
Hoeiland-Joergensen, T., McKenney, P., Taht, D., Gettys, J., and E. Dumazet,"The Flow Queue CoDel Packet Scheduler and Active Queue Management Algorithm",RFC 8290,DOI 10.17487/RFC8290,,<https://www.rfc-editor.org/info/rfc8290>.
[RFC8593]
Zhu, X., Mena, S., and Z. Sarker,"Video Traffic Models for RTP Congestion Control Evaluations",RFC 8593,DOI 10.17487/RFC8593,,<https://www.rfc-editor.org/info/rfc8593>.
[RMCAT-CC]
Jesup, R. and Z. Sarker,"Congestion Control Requirements for Interactive Real-Time Media",Work in Progress,Internet-Draft, draft-ietf-rmcat-cc-requirements-09,,<https://tools.ietf.org/html/draft-ietf-rmcat-cc-requirements-09>.
[RMCAT-CC-RTP]
Zanaty, M., Singh, V., Nandakumar, S., and Z. Sarker,"Congestion Control and Codec interactions in RTP Applications",Work in Progress,Internet-Draft, draft-ietf-rmcat-cc-codec-interactions-02,,<https://tools.ietf.org/html/draft-ietf-rmcat-cc-codec-interactions-02>.
[RMCAT-EVAL-TEST]
Sarker, Z., Singh, V., Zhu, X., and M. Ramalho,"Test Cases for Evaluating RMCAT Proposals",Work in Progress,Internet-Draft, draft-ietf-rmcat-eval-test-10,,<https://tools.ietf.org/html/draft-ietf-rmcat-eval-test-10>.
[RTCP-FEEDBACK]
Sarker, Z., Perkins, C., Singh, V., and M. Ramalho,"RTP Control Protocol (RTCP) Feedback for Congestion Control",Work in Progress,Internet-Draft, draft-ietf-avtcore-cc-feedback-message-05,,<https://tools.ietf.org/html/draft-ietf-avtcore-cc-feedback-message-05>.
[WIRELESS-TESTS]
Sarker, Z., Johansson, I., Zhu, X., Fu, J., Tan, W., and M. Ramalho,"Evaluation Test Cases for Interactive Real-Time Media over Wireless Networks",Work in Progress,Internet-Draft, draft-ietf-rmcat-wireless-tests-08,,<https://tools.ietf.org/html/draft-ietf-rmcat-wireless-tests-08>.
[ZHU-PV13]
Zhu, X. and R. Pan,"NADA: A Unified Congestion Control Scheme for Low-Latency Interactive Video",Proc. IEEE International Packet Video Workshop, San Jose, CA, USA,DOI 10.1109/PV.2013.6691448,,<https://doi.org/10.1109/PV.2013.6691448>.

Appendix A.Network Node Operations

NADA can work with different network queue managementschemes and does not assume any specific network node operation.As an example, this appendix describes three variants of queuemanagement behavior at the network node, leading to eitherimplicit or explicit congestion signals. It needs to beacknowledged that NADA has not yet been tested with non-probabilisticECN marking behaviors.

In all three flavors described below, the network queueoperates with the simple First In, First Out (FIFO) principle.There is no need to maintain per-flow state. The systemcan scale easily with a large number of video flows andat high link capacity.

A.1.Default Behavior of Drop-Tail Queues

In a conventional network with drop-tail or RED queues,congestion is inferred from the estimation of end-to-enddelay and/or packet loss. Packet drops at the queue aredetected at the receiver and contribute to the calculationof the aggregated congestion signal x_curr. No specialaction is required at the network node.

A.2.RED-Based ECN Marking

In this mode, the network node randomly marksthe ECN field in the IP packet header followingtheRandom Early Detection(RED) algorithm [RFC7567]. Calculation of the markingprobability involves the following steps on packet arrival:

  1. update smoothed queue size q_avg as:

       q_avg = w*q + (1-w)*q_avg
  2. calculate marking probability p as:

            / 0,                    if q < q_lo        |        |        q_avg - q_lo    p= <  p_max*--------------, if q_lo <= q < q_hi        |         q_hi - q_lo        |        \ p = 1,                if q >= q_hi

Here, q_lo and q_hi correspond to the lowand high thresholds of queue occupancy.The maximum marking probability is p_max.

The ECN marking events will contributeto the calculation of an equivalent delayx_curr at the receiver. No changes are requiredat the sender.

A.3.Random Early Marking with Virtual Queues

Advanced network nodes may support random early markingbased on a token bucket algorithm originally designed forPre-Congestion Notification(PCN) [RFC6660]. The early congestion notification (ECN) bit in theIP header of packets is marked randomly.The marking probability is calculated based on atoken bucket algorithm originally designed forPCN [RFC6660].The target link utilization is set as 90%; the markingprobability is designed to grow linearly with the tokenbucket size when it varies between 1/3 and 2/3 of thefull token bucket limit.

Calculation of the marking probability involvesthe following steps upon packet arrival:

  1. meter packet against token bucket (r,b)

  2. update token level b_tk

  3. calculate the marking probability as:

             / 0,                     if b-b_tk < b_lo          |         |          b-b_tk-b_lo    p = <  p_max* --------------, if b_lo <= b-b_tk < b_hi         |           b_hi-b_lo         |         \ 1,                     if b-b_tk >= b_hi

Here, the token bucket lower and upper limits are denoted byb_lo and b_hi, respectively. The parameter b indicates the sizeof the token bucket. The parameter r is chosen to be belowcapacity, resulting in slight underutilization of the link.The maximum marking probability is p_max.

The ECN marking events will contribute to the calculationof an equivalent delay x_curr at the receiver. No changes arerequired at the sender. The virtual queuing mechanism fromthe PCN-based marking algorithm will lead to additionalbenefits such as zero standing queues.

Acknowledgments

The authors would like to thankRandell Jesup,Luca De Cicco,Piers O'Hanlon,Ingemar Johansson,Stefan Holmer,Cesar Ilharco Magalhaes,Safiqul Islam,Michael Welzl,Mirja Kühlewind,Karen Elisabeth Egede Nielsen,Julius Flohr,Roland Bless,Andreas Smas, andMartin Stiemerling for their valuablereview comments and helpful input to this specification.

Contributors

The following individuals contributed to the implementation and evaluation of the proposed scheme and, therefore, helped to validate and substantially improve this specification.

Paul E. Jones <paulej@packetizer.com> of Cisco Systems implemented an early version of the NADA congestion control scheme and helped with its lab-based testbed evaluations.

Jiantao Fu <jianfu@cisco.com> of Cisco Systems helped with the implementation and extensive evaluation of NADA both in Mozilla web browsers and in earlier simulation-based evaluation efforts.

Stefano D'Aronco <stefano.daronco@geod.baug.ethz.ch> of ETH Zurich (previously at Ecole Polytechnique Federale de Lausanne when contributing to this work) helped with the implementation and evaluation of an early version of NADA in[NS-3].

Charles Ganzhorn <charles.ganzhorn@gmail.com> contributed to the testbed-based evaluation of NADA during an early stage of its development.

Authors' Addresses

Xiaoqing Zhu
Cisco Systems
12515 Research Blvd., Building 4
Austin,TX78759
United States of America
Email:xiaoqzhu@cisco.com
Rong Pan
Intel Corporation
2200 Mission College Blvd
Santa Clara,CA95054
United States of America
Email:rong.pan@intel.com
Michael A. Ramalho
AcousticComms Consulting
6310 Watercrest Way Unit 203
Lakewood Ranch,FL34202-5211
United States of America
Phone:+1 732 832 9723
Email:mar42@cornell.edu
URI:http://ramalho.webhop.info/
Sergio Mena
Cisco Systems
EPFL, Quartier de l'Innovation, Batiment E
1015Ecublens
Switzerland
Email:semena@cisco.com

[8]ページ先頭

©2009-2025 Movatter.jp