RTP was developed by the Audio-Video Transport Working Group of the Internet Engineering Task Force (IETF) and first published in 1996 as RFC 1889, which was then superseded by RFC 3550 in 2003.[2]
RTP is designed for end-to-end, real-time transfer of streaming media. The protocol provides facilities for jitter compensation and detection of packet loss and out-of-order delivery, which are common, especially during UDP transmissions on an IP network. RTP allows data transfer to multiple destinations through IP multicast.[3] RTP is regarded as the primary standard for audio/video transport in IP networks and is used with an associated profile and payload format.[4] The design of RTP is based on the architectural principle known as application-layer framing, where protocol functions are implemented in the application as opposed to the operating system's protocol stack.
Real-time multimedia streaming applications require timely delivery of information and often can tolerate some packet loss to achieve this goal. For example, loss of a packet in an audio application may result in loss of a fraction of a second of audio data, which can be made unnoticeable with suitable error concealment algorithms.[5] The Transmission Control Protocol (TCP), although standardized for RTP use,[6] is not normally used in RTP applications because TCP favors reliability over timeliness. Instead, the majority of RTP implementations are built on the User Datagram Protocol (UDP).[5] Other transport protocols specifically designed for multimedia sessions are SCTP[7] and DCCP,[8] although, as of 2012, they were not in widespread use.[9]
RTP was developed by the Audio/Video Transport working group of the IETF standards organization. RTP is used in conjunction with other protocols such as H.323 and RTSP.[4] The RTP specification describes two protocols: RTP and RTCP. RTP is used for the transfer of multimedia data, and RTCP is used to periodically send control information and QoS parameters.[10]
The data transfer protocol, RTP, carries real-time data. Information provided by this protocol includes timestamps (for synchronization), sequence numbers (for packet loss and reordering detection) and the payload format, which indicates the encoded format of the data.[11] The control protocol, RTCP, is used for quality of service (QoS) feedback and synchronization between the media streams. The bandwidth of RTCP traffic is small compared to that of RTP, typically around 5%.[11][12]
An RTP session is established for each multimedia stream. Audio and video streams may use separate RTP sessions, enabling a receiver to selectively receive components of a particular stream.[14] The RTP and RTCP design is independent of the transport protocol. Applications most typically use UDP with port numbers in the unprivileged range (1024 to 65535).[15] The Stream Control Transmission Protocol (SCTP) and the Datagram Congestion Control Protocol (DCCP) may be used when a reliable transport protocol is desired. The RTP specification recommends even port numbers for RTP and the use of the next odd port number for the associated RTCP session.[16]: 68  A single port can be used for RTP and RTCP in applications that multiplex the protocols.[17]
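The even/odd port-pairing convention can be sketched as follows. This is a minimal illustration only: the function name, the port range searched, and the wildcard bind address are assumptions for the example, not part of any specification.

```python
import socket

def open_rtp_rtcp_pair(start=5004, end=5060):
    """Bind an even UDP port for RTP and the next odd port for RTCP.

    The even/odd pairing follows the RTP specification's recommendation;
    the port range here is an arbitrary choice for illustration.
    """
    for rtp_port in range(start, end, 2):  # step 2: try even ports only
        rtp = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
        rtcp = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
        try:
            rtp.bind(("0.0.0.0", rtp_port))
            rtcp.bind(("0.0.0.0", rtp_port + 1))  # RTCP on the next odd port
            return rtp, rtcp
        except OSError:  # pair unavailable; try the next even port
            rtp.close()
            rtcp.close()
    raise RuntimeError("no free RTP/RTCP port pair in range")

rtp_sock, rtcp_sock = open_rtp_rtcp_pair()
```

Applications that multiplex RTP and RTCP on a single port would skip the second bind entirely.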
RTP is designed to carry a multitude of multimedia formats, which permits the development of new formats without revising the RTP standard. To this end, the information required by a specific application of the protocol is not included in the generic RTP header. For each class of application (e.g., audio, video), RTP defines a profile and associated payload formats.[10] Every instantiation of RTP in a particular application requires profile and payload format specifications.[18]: 71
The profile defines the codecs used to encode the payload data and their mapping to payload format codes in the protocol field Payload Type (PT) of the RTP header. Each profile is accompanied by several payload format specifications, each of which describes the transport of particular encoded data.[4] Examples of audio payload formats are G.711, G.723, G.726, G.729, GSM, QCELP, MP3, and DTMF, and examples of video payloads are H.261, H.263, H.264, H.265 and MPEG-1/MPEG-2.[19] The mapping of MPEG-4 audio/video streams to RTP packets is specified in RFC 3016, and H.263 video payloads are described in RFC 2429.[20]
Examples of RTP profiles include:
The RTP profile for audio and video conferences with minimal control (RFC 3551) defines a set of static payload type assignments, and a dynamic mechanism for mapping between a payload format and a PT value using the Session Description Protocol (SDP).
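A hypothetical SDP media description illustrates both mechanisms: PT 0 is statically assigned to PCMU (G.711 μ-law) in RFC 3551, while 97 is drawn from the dynamic range (96 to 127) and bound to a codec by an `a=rtpmap` line. The port number is arbitrary.

```
m=audio 49170 RTP/AVP 0 97
a=rtpmap:0 PCMU/8000
a=rtpmap:97 iLBC/8000
```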
RTP packets are created at the application layer and handed to the transport layer for delivery. Each unit of RTP media data created by an application begins with the RTP packet header.
The RTP header has a minimum size of 12 bytes. After the header, optional header extensions may be present. This is followed by the RTP payload, the format of which is determined by the particular class of application.[23] The fields in the header are as follows:
Version: 2 bits
Indicates the version of the protocol. The current version is 2.[24]
Padding (P): 1 bit
Used to indicate if there are extra padding bytes at the end of the RTP packet. Padding may be used to fill up a block of certain size, for example, as required by an encryption algorithm. The last byte of the padding contains the number of padding bytes that were added (including itself).[16]: 12 [24]
Extension (X): 1 bit
Indicates the presence of an extension header between the header and payload data. The extension header is application or profile specific.[24]
CSRC Count (CC): 4 bits
Contains the number of CSRC identifiers (defined below) that follow the SSRC (also defined below).[16]: 12
Marker (M): 1 bit
Signaling used at the application level in a profile-specific manner. If it is set, it means that the current data has some special relevance for the application.[16]: 13
Payload Type (PT): 7 bits
Indicates the format of the payload and thus determines its interpretation by the application. Values are profile specific and may be dynamically assigned.[25]
Sequence number: 16 bits
Incremented by one for each RTP data packet sent; used by the receiver to detect packet loss and to restore packet order.
Timestamp: 32 bits
Used by the receiver to play back the received samples at appropriate time and interval. When several media streams are present, the timestamps may be independent in each stream.[a] The granularity of the timing is application specific. For example, an audio application that samples data once every 125 μs (8 kHz, a common sample rate in digital telephony) would use that value as its clock resolution. Video streams typically use a 90 kHz clock. The clock granularity is one of the details that is specified in the RTP profile for an application.[26]
SSRC: 32 bits
Synchronization Source Identifier uniquely identifies the source of a stream. The synchronization sources within the same RTP session will be unique.[16]: 15
CSRC: Variable (CSRC Count × 32 bits)
Contributing Source IDs enumerate contributing sources to a stream that has been generated from multiple sources.[16]: 15
Header Extension: Variable; Exists when X=1
When the extension bit (X) is set, this optional field contains:
Profile-specific Extension Header ID: 16 bits
a profile-specific identifier
Extension Header Length: 16 bits
indicates the length of the extension in 32-bit units, excluding the 32 bits of the extension header.
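The field layout above can be exercised with a short parser. This is a minimal sketch, not a complete implementation: it decodes the fixed 12-byte header, any CSRC list, the optional extension, and the padding rule, but leaves payload format decoding to the application.

```python
import struct

def parse_rtp_header(packet: bytes) -> dict:
    """Parse an RTP packet per the fixed header layout (minimal sketch)."""
    if len(packet) < 12:
        raise ValueError("packet shorter than the fixed 12-byte RTP header")
    b0, b1, seq, ts, ssrc = struct.unpack("!BBHII", packet[:12])
    version = b0 >> 6          # Version: 2 bits (current version is 2)
    padding = (b0 >> 5) & 1    # P: padding bytes present at end of packet
    extension = (b0 >> 4) & 1  # X: extension header follows fixed header
    cc = b0 & 0x0F             # CC: number of CSRC identifiers
    marker = b1 >> 7           # M: profile-specific marker bit
    pt = b1 & 0x7F             # PT: payload type
    csrcs = [struct.unpack("!I", packet[12 + 4 * i:16 + 4 * i])[0]
             for i in range(cc)]
    offset = 12 + 4 * cc
    if extension:
        ext_id, ext_len = struct.unpack("!HH", packet[offset:offset + 4])
        # length is in 32-bit units, excluding the 4-byte extension header
        offset += 4 + 4 * ext_len
    payload = packet[offset:]
    if padding:
        # last byte holds the padding count, including itself
        payload = payload[:-payload[-1]]
    return {"version": version, "marker": marker, "payload_type": pt,
            "sequence": seq, "timestamp": ts, "ssrc": ssrc,
            "csrcs": csrcs, "payload": payload}
```

For example, parsing the bytes `b"\x80\x60" + struct.pack("!HII", 1234, 3000, 0xDEADBEEF) + b"audio"` yields version 2, payload type 96, sequence number 1234, timestamp 3000, and the payload `b"audio"`.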
A functional multimedia application requires other protocols and standards used in conjunction with RTP. Protocols such as SIP, Jingle, RTSP, H.225 and H.245 are used for session initiation, control and termination. Other standards, such as H.264, MPEG and H.263, are used for encoding the payload data as specified by the applicable RTP profile.[27]
An RTP sender captures the multimedia data, then encodes, frames and transmits it as RTP packets with appropriate timestamps and increasing sequence numbers. The sender sets the payload type field in accordance with connection negotiation and the RTP profile in use. The RTP receiver detects missing packets and may reorder packets. It decodes the media data in the packets according to the payload type and presents the stream to its user.[27]
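The sender's bookkeeping can be sketched as follows, assuming 20 ms frames of G.711 μ-law audio (PT 0) on the 8 kHz clock discussed above, so the timestamp advances by 8000 × 0.020 = 160 ticks per packet. The class name and frame size are assumptions for this illustration.

```python
import random
import struct

SAMPLE_RATE = 8000  # 8 kHz audio clock, as in digital telephony
FRAME_MS = 20       # one packet per 20 ms frame (assumed for this sketch)
TS_INCREMENT = SAMPLE_RATE * FRAME_MS // 1000  # 160 ticks per packet

class RtpSender:
    """Minimal RTP packetizer sketch: fixed 12-byte header only, no CSRCs."""

    def __init__(self, payload_type=0):
        self.payload_type = payload_type
        self.seq = random.randint(0, 0xFFFF)            # random initial values,
        self.timestamp = random.randint(0, 0xFFFFFFFF)  # as the spec suggests
        self.ssrc = random.randint(0, 0xFFFFFFFF)       # random source id

    def packetize(self, frame: bytes) -> bytes:
        header = struct.pack("!BBHII",
                             2 << 6,                   # V=2, P=0, X=0, CC=0
                             self.payload_type & 0x7F, # M=0, PT in low 7 bits
                             self.seq, self.timestamp, self.ssrc)
        self.seq = (self.seq + 1) & 0xFFFF                       # 16-bit wrap
        self.timestamp = (self.timestamp + TS_INCREMENT) & 0xFFFFFFFF
        return header + frame
```

A video sender would differ mainly in the clock: at 90 kHz and 30 frames per second, the timestamp would advance by 3000 ticks per frame.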