This is a reissue application of U.S. Pat. No. 8,515,265, which was filed as U.S. patent application Ser. No. 12/945,194 on Nov. 12, 2010 and issued on Aug. 20, 2013, and which claims priority from Korean Patent Application No. 10-2010-0103697, filed on Oct. 22, 2010 in the Korean Intellectual Property Office, the disclosure of which is incorporated herein by reference in its entirety.
CROSS-REFERENCE TO RELATED PATENT APPLICATIONSThis application claims priorities from U.S. Provisional Application No. 61/260,906, filed on Nov. 13, 2009, U.S. Provisional Application No. 61/267,131, filed on Dec. 7, 2009, U.S. Provisional Application No. 61/303,778, filed on Feb. 12, 2010, U.S. Provisional Application No. 61/307,093, filed on Feb. 23, 2010, U.S. Provisional Application No. 61/310,104, filed on Mar. 3, 2010, U.S. Provisional Application No. 61/318,916, filed on Mar. 30, 2010, U.S. Provisional Application No. 61/282,860, filed on Apr. 12, 2010, U.S. Provisional Application No. 61/351,434, filed on Jun. 4, 2010, U.S. Provisional Application No. 61/362,805, filed on Jul. 9, 2010, in the U.S. Patents and Trademark Office, and Korean Patent Application No. 10-2010-0103697, filed on Oct. 22, 2010 in the Korean Intellectual Property Office, the disclosures of which are incorporated herein in their entireties by reference.
BACKGROUND1. Field
Apparatuses and methods consistent with exemplary embodiments relate to a method and apparatus for providing a trick play service, and more particularly, to a method and apparatus for providing a trick play service in a hypertext transfer protocol (HTTP) adaptive streaming (HAS) architecture for adaptively streaming media data according to fluctuation of a streaming environment.
2. Description of the Related Art
Examples of a method of transmitting media data through a network include a downloading method and a streaming method. In the streaming method, a server transmits media data in real time and a client reproduces the received media data in real time. In the downloading method, media data is reproduced by the client after completely receiving the media data from the server.
According to the streaming method, the media data is transmitted, received, and played in real time through a logical channel set between the server and the client.
SUMMARYOne or more exemplary embodiments provide a method and apparatus for providing a trick play service in a hypertext transfer protocol (HTTP) adaptive streaming (HAS) architecture for adaptively streaming media data according to fluctuation of a streaming environment, and a computer readable recording medium having recorded thereon a computer program for executing the method.
According to an aspect of an exemplary embodiment, there is provided a method of providing a trick play service at a server, the method including: generating a media presentation description (MPD) file including information about at least one piece of trick play data; transmitting the MPD file to a client; and transmitting the at least one piece of trick play data to the client in response to a request by the client based on the MPD file.
The MPD file may include type information identifying that the at least one piece of trick play data is data for trick play.
A number of the at least one piece of trick play data may be determined based on a maximum trick play speed, and the at least one piece of trick play data may include one or more segments generated by encoding media content at a predetermined bit rate into frames corresponding to a predetermined trick play speed and dividing the encoded frames based on time.
The MPD file may include information about the predetermined trick play speed, and uniform resource locators (URLs) of the one or more segments divided and generated based on time to be played exclusively at the predetermined trick play speed.
The MPD file may include information about trick play data corresponding to a 2× trick play speed, which physically exists in the server, and information about at least one piece of trick play data corresponding to play speeds other than 2×, which virtually exists in the server, and the method may further include extracting the at least one piece of trick play data corresponding to play speeds other than 2× from the trick play data corresponding to a 2× trick play speed upon a request of the client based on the MPD file.
The extracting the at least one piece of trick play data corresponding to play speeds other than 2× may be performed by using a common gateway interface (CGI) program based on an index file including locations and sizes of frames.
The number of the at least one piece of trick play data may be determined based on a maximum depth of trick levels, the at least one piece of trick play data may include one or more segments generated by encoding media content at a predetermined bit rate into frames corresponding to a predetermined trick level and dividing the encoded frames based on time, the predetermined trick level may correspond to a hierarchical structure and may be one of at least one trick level based on the maximum depth of trick levels, the maximum depth of trick levels may be determined based on a maximum trick play speed, and the frames included in the predetermined trick level may not repeatedly exist in another trick level.
The MPD file may include information about the predetermined trick level, and URLs of the one or more segments divided and generated based on time and corresponding to the predetermined trick level, and the predetermined trick level may be described to include information about at least one trick play speed using the one or more segments divided and generated based on time.
The MPD file may further include information about at least one of a frame rate, a frame type, and the maximum trick play speed, the frame rate may indicate the number of frames to be played per second at the client, and the frame type may indicate whether the at least one piece of trick play data includes only intra-frames, or intra- and inter-frames.
According to an aspect of another exemplary embodiment, there is provided a method of providing a trick play service at a client, the method including: receiving a media presentation description (MPD) file including information about at least one piece of trick play data; and receiving the at least one piece of trick play data from a server based on the MPD file.
The MPD file may include type information identifying that the at least one piece of trick play data is data for trick play.
A number of the at least one piece of trick play data may be determined based on a maximum trick play speed, and the at least one piece of trick play data may include one or more segments generated by encoding media content at a predetermined bit rate into frames corresponding to a predetermined trick play speed and dividing the encoded frames based on time.
The MPD file may include information about the predetermined trick play speed, and uniform resource locators (URLs) of the one or more segments divided and generated based on time and are to be played exclusively at the predetermined trick play speed.
The MPD file may include information about trick play data corresponding to a 2× trick play speed, which physically exists in the server, and information about at least one piece of trick play data corresponding to play speeds other than 2×, which virtually exists in the server, and the receiving the at least one piece of trick play data from the server may include receiving, from the server, the at least one piece of trick play data corresponding to play speeds other than 2×, which is extracted at the server from the trick play data corresponding to a 2× trick play speed in response to the request of the client based on the MPD file.
The at least one piece of trick play data corresponding to play speeds other than 2× may be extracted at the server from the trick play data corresponding to a 2× trick play speed by using a common gateway interface (CGI) program based on an index file including locations and sizes of frames
The number of the at least one piece of trick play data may be determined based on a maximum depth of trick levels, the at least one piece of trick play data may include one or more segments generated by encoding media content at a predetermined bit rate into frames corresponding to a predetermined trick level and dividing the encoded frames based on time, the predetermined trick level may correspond to a hierarchical structure and may be one of at least one trick level based on the maximum depth of trick levels, the maximum depth of trick levels may be determined based on a maximum trick play speed, and the frames included in the predetermined trick level may not repeatedly exist in another trick level.
The MPD file may include information about the predetermined trick level, and URLs of the one or more segments divided and generated based on time and corresponding to the predetermined trick level, and the predetermined trick level may be described to include information about at least one trick play speed using the plurality of segments divided and generated based on time.
The receiving the at least one piece of trick play data from the server may include receiving, from the server, the at least one piece of trick play data corresponding to each trick level in order to support a predetermined trick play speed based on a request of the client.
The method may further include realigning the at least one piece of trick play data in an order of play time.
The MPD file may further include information about at least one of a frame rate, a frame type, and the maximum trick play speed, the frame rate may indicate the number of frames to be played per second by the client, and the frame type may indicate whether the at least one piece of trick play data includes only intra-frames, or intra- and inter-frames.
The method may further include varying the frame rate into the number of frames per second corresponding to the predetermined trick play speed.
The method may further include playing the at least one piece of trick play data based on the frame rate.
According to an aspect of another exemplary embodiment, there is provided a computer readable recording medium having recorded thereon a computer program for executing the above method.
According to an aspect of another exemplary embodiment, there is provided a server including: an information generation unit which generates a media presentation description (MPD) file including information about at least one piece of trick play data; an information transmission unit which transmits the MPD file to a client; and a trick play data transmission unit which transmits the at least one piece of trick play data to the client in response to a request by the client based on the MPD file.
According to an aspect of another exemplary embodiment, there is provided a client including: an information reception unit which receives a media presentation description (MPD) file including information about at least one piece of trick play data; and a trick play data reception unit which receives the at least one piece of trick play data from the server based on the MPD file.
According to an aspect of another exemplary embodiment, there is provided a method of providing a trick play service at an encoder, the method including: generating at least one piece of trick play data from a media content according to a predetermined generating method that corresponds to information included in a media presentation description (MPD) file that is transmitted to a client and based on which the client requests the at least one piece of trick play data.
According to an aspect of another exemplary embodiment, there is provided a system including: an encoder which generates at least one piece of trick play data from a media content; and a server which includes: an information generation unit which generates a media presentation description (MPD) file including information about the at least one piece of trick play data, an information transmission unit which transmits the MPD file to a client, and a trick play data transmission unit which transmits the at least one piece of trick play data to the client in response to a request by the client based on the MPD file.
BRIEF DESCRIPTION OF THE DRAWINGSThe above and/or other aspects will become more apparent by describing in detail exemplary embodiments with reference to the attached drawings in which:
FIG. 1 is a block diagram of a system for providing a trick play service, according to an exemplary embodiment;
FIG. 2 is a flowchart of a method of providing a trick play service, according to an exemplary embodiment;
FIG. 3 is a diagram showing trick play data according to an exemplary embodiment;
FIG. 4 is a diagram showing a media presentation description (MPD) including type information for identifying trick play data, according to an exemplary embodiment;
FIG. 5 is a structural diagram of trick play data including intra (I)-frames for trick play in units of multiples of two, according to an exemplary embodiment;
FIG. 6 is a diagram for describing a method of providing a trick play service by using multiple streams, according to an exemplary embodiment;
FIGS. 7 and 8 are diagrams showing MPDs of a method of providing a trick play service by using multiple streams, according to exemplary embodiments;
FIG. 9 is a diagram for describing a method of providing a trick play service by using a frame range query, according to an exemplary embodiment;
FIG. 10 is a structural diagram of an MP4 file for performing a method of providing a trick play service by using a frame range query, according to an exemplary embodiment;
FIG. 11 is a diagram for describing a method of providing a trick play service by using virtual streams, according to an exemplary embodiment;
FIG. 12A is a diagram for describing a method of providing a trick play service by using multiple streams having a hierarchical structure and including intra-frames, according to an exemplary embodiment;
FIG. 12B is a diagram for describing a method of providing a trick play service by using multiple streams having a hierarchical structure and including intra- and inter-frames, according to an exemplary embodiment;
FIG. 13 is a diagram for describing a method of providing a trick play service by using multiple streams having a hierarchical structure at a server, according to an exemplary embodiment;
FIG. 14 is a diagram for describing a method of providing a trick play service by using multiple streams having a hierarchical structure at a client, according to an exemplary embodiment;
FIGS. 15 and 16A are diagrams showing MPDs of a method of providing a trick play service by using multiple streams having a hierarchical structure, according to an exemplary embodiment;
FIG. 16B is a diagram showing an MPD of a method of providing a trick play service by using multiple streams having a hierarchical structure for identifying a trick level and a frame rate, according to an exemplary embodiment;
FIG. 17 is a structural diagram of a transport stream (TS) packet for detecting an I-frame from a Moving Picture Experts Group (MPEG) TS, according to an exemplary embodiment;
FIG. 18 is a diagram for describing a method of forming a TS packet for detecting an I-frame from an MPEG TS, according to an exemplary embodiment;
FIG. 19 is a flowchart of a method of detecting an I-frame from an MPEG TS, according to an exemplary embodiment;
FIG. 20 is a structural diagram of an MP4 file for detecting an I-frame from an MPEG TS, according to an exemplary embodiment;
FIG. 21 is a conceptual diagram for describing a method of providing a trick play service by varying a frame rate, according to an exemplary embodiment;
FIG. 22 is a diagram for describing a method of providing a trick play service by varying a frame rate at a server and a client, according to an exemplary embodiment;
FIG. 23 is a schema of an MPD of a method of providing a trick play service by varying a frame rate, according to an exemplary embodiment;
FIG. 24 is a diagram showing an MPD of a method of providing a trick play service by varying a frame rate, according to an exemplary embodiment;
FIG. 25 is a block diagram of a server according to an exemplary embodiment; and
FIG. 26 is a block diagram of a client according to an exemplary embodiment.
DETAILED DESCRIPTION OF EXEMPLARY EMBODIMENTSHereinafter, exemplary embodiments will be described in detail with reference to the attached drawings. In the drawings, like reference numerals denote like elements and the sizes or thicknesses of elements may be exaggerated for clarity of explanation. Expressions such as “at least one of,” when preceding a list of elements, modify the entire list of elements and do not modify the individual elements of the list.
FIG. 1 is a block diagram of asystem100 for providing a trick play service, according to an exemplary embodiment. Referring toFIG. 1, thesystem100 includes anencoder110, aserver120, and aclient130.
Theencoder110 generates trick play data by encoding input media content into frames at a predetermined bit rate. When the trick play data is generated, theencoder110 may encode the media content by using only intra-frames, or intra- and inter-frames. The intra (I)-frames are frames encoded by using information of only corresponding frames. The inter-frames are frames encoded by using information of corresponding frames and other frames, and include predicted (P)-frames and bidirectional (B)-frames. Theencoder110 encodes the trick play data at a play start point by using the I-frames.
The trick play data may be encoded at a low bit rate, though it is understood that the trick play data may be encoded by using any bit rate. The trick play data indicates a trick play track or a trick play stream. The trick play data provides at least one of “fast forward” and “fast rewind” operability. Theencoder110 may be included in or may be physically separated from theserver120.
Theencoder110 may generate at least one piece of trick play data by using the same media content, and the number of generated pieces of trick play data may vary according to a respective method of providing a trick play service.
For example, in a method of providing a trick play service by using multiple streams according to an exemplary embodiment, the number of pieces of the trick play data may be determined based on a maximum trick play speed. In this case, if the maximum trick play speed is 16× and the trick play service is provided in units of multiples of two, the number of pieces of the trick play data is 4 and the trick play data includes a piece of trick play data corresponding to a 2× trick play speed, a piece of trick play data corresponding to a 4× trick play speed, a piece of trick play data corresponding to a 8× trick play speed, and a piece of trick play data corresponding to a 16×trick play speed. Hereinafter, multiple streams indicate at least one piece of trick play data.
In a method of providing a trick play service by using a frame range query according to an exemplary embodiment, the trick play data may be one piece of trick play data corresponding to a 2× trick play speed. Upon a request of theclient130 that receives from theserver120 an index file including locations and sizes of frames, trick play data corresponding to play speeds other than 2× are extracted from the trick play data corresponding to the 2× trick play speed.
In a method of providing a trick play service by using virtual streams according to an exemplary embodiment, the trick play data may be one piece of trick play data corresponding to a 2× trick play speed. In response to a request by theclient130, trick play data corresponding to play speeds other than 2× are extracted from the trick play data corresponding to the 2× trick play speed by using a common gateway interface (CGI) program of theserver120 based on an index file including locations and sizes of frames. Hereinafter, virtual streams indicate at least one piece of trick play data corresponding to play speeds other than 2×.
In a method of providing a trick play service by using multiple streams having a hierarchical structure according to an exemplary embodiment, the number of pieces of the trick play data may be determined based on a maximum depth of trick levels. The maximum depth of trick levels is determined based on a maximum trick play speed. For example, if the maximum trick play speed is 16× and the trick play service is provided in units of multiples of two, the maximum depth of trick levels is 4 and the number of pieces of the trick play data is 4. The four pieces of the trick play data correspond to trick levels TL1, TL2, TL3, and TL4. Hereinafter, multiple streams having a hierarchical structure indicate at least one piece of trick play data corresponding to each trick level.
In a method of providing a trick play service by varying a frame rate according to an exemplary embodiment, the trick play data may be one piece of trick play data corresponding to a 2× trick play speed. Play speeds other than 2× may be supported by varying a frame rate of the trick play data corresponding to a 2× trick play speed at theclient130.
The above five exemplary methods of providing a trick play service will be described in detail below with reference toFIGS. 5 through 24.
Theserver120 receives at least one piece of trick play data from theencoder110 and, in this case, information about the trick play data. The information about the trick play data may be described as a media presentation description (MPD) file, though it is understood that another exemplary embodiment is not limited thereto, and any method may be used to describe the information about the trick play data. The information about the trick play data may include, for example, at least one of a bit rate, a type, an identifier, a uniform resource locator (URL) template of the trick play data, etc., and will be described in detail below with reference toFIG. 4.
Theclient130 receives from theserver120 the MPD file including the information about the trick play data and requests theserver120 for at least one piece of trick play data based on the MPD file.
In the method of providing a trick play service by using multiple streams, theclient130 requests a piece of trick play data corresponding to a desired trick play speed from among at least one piece of trick play data.
In the method of providing a trick play service by using a frame range query, theclient130 receives an index file with reference to a URL of the index file, which may be included in the MPD file, and requests trick play data including frames corresponding to a desired trick play speed based on the index file.
In the method of providing a trick play service by using virtual streams, theclient130 requests a piece of virtual trick play data corresponding to a desired trick play speed from among at least one piece of virtual trick play data.
In the method of providing a trick play service by using multiple streams having a hierarchical structure, theclient130 requests at least one piece of trick play data corresponding to each trick level in order to support a desired trick play speed. The number of trick levels and the number of pieces of the trick play data corresponding to the trick levels and for supporting the desired trick play speed will be described in detail below with reference toFIGS. 12A through 16B.
In the method of providing a trick play service by varying a frame rate, theclient130 requests for a piece of trick play data corresponding to a default play speed (e.g., 2×). Theclient130 may support play speeds other than the default play speed by varying a frame rate.
If theclient130 requests theserver120 to transmit at least one piece of trick play data, theserver120 transmits the requested trick play data to theclient130.
The MPD file and the trick play data may be requested and transmitted by using a hypertext transfer protocol (HTTP), though it is understood that another exemplary embodiment is not limited thereto, and another protocol may be used.
The trick play data may encode media content at a predetermined bit rate into frames corresponding to a predetermined trick play speed, and may include at least one of a plurality of segments divided and generated based on time. That is, the trick play data generated as a result of encoding performed by theencoder110 may include at least one segment divided based on time. Theserver120 may divide the media content into a plurality of segments to separately transmit the segments rather than encoding the media content into one stream to transmit the stream continuously. The media content may be divided in units of a predetermined time, such as 10 seconds or 20 seconds, and thus may be generated as at least one segment. The time used for division may be set based on a group of pictures (GOP). Media data corresponding to pictures of one or more GOPs may be set as one segment.
Since the trick play data is divided based on time, the trick play service may be provided more efficiently. For example, when streaming is started, theserver120 transmits a segment corresponding to a time from 0 seconds to 20 seconds of 2× trick play data. Then, if theclient130requests 4× trick play data after 20 seconds, theserver120 may transmit a segment corresponding to a time from 20 seconds to 40 seconds of 4× trick play data. Since the trick play data is divided into a plurality of segments based on time, even while trick play streaming is performed, segments of different trick play data may be transmitted upon a request of theclient130.
According to an exemplary embodiment, the trick play service may be provided in an HTTP adaptive streaming (HAS) architecture for adaptively streaming media data according to fluctuation of a streaming environment. Since theserver120 provides trick play data separately from normal speed play data, theclient130 may efficiently change play speeds between a normal speed play mode and a trick play mode.
In the method of providing a trick play service by using multiple streams, since theserver120 provides trick play data corresponding to various trick play speeds, theclient130 may efficiently change trick play speeds.
In the method of providing a trick play service by using a frame range query and a method of providing a trick play service by using virtual streams, since theserver120 retains only trick play data corresponding to a 2× trick play speed, a memory space may be saved and the trick play service may be provided at various trick play speeds.
In the method of providing a trick play service by using multiple streams having a hierarchical structure, since frames included in a predetermined trick level do not repeatedly exist in another trick level, a memory space may be saved and the trick play service may be provided at various trick play speeds. Also, since trick play is performed gradually from a high trick level to a low trick level in consideration of a trick play environment such as a network bandwidth, the trick play service may be provided adaptively to the trick play environment.
In the method of providing a trick play service by varying a frame rate, since theserver120 provides to theclient130 information about a frame rate and a frame type as well as information about a maximum trick play speed, theclient130 may be provided with the information about the frame rate, and the frame type without parsing the entire trick play data transmitted from theserver120, and theclient130 may efficiently provide the trick play service at various trick play speeds by simply varying the provided frame rate.
FIG. 2 is a flowchart of a method of providing a trick play service, according to an exemplary embodiment.
Referring toFIG. 2, inoperation230, aserver210 generates an MPD file including information about at least one piece of trick play data. The information about the trick play data may include, for example, at least one of a bit rate, a type, an identifier, a URL template of the trick play data, etc.
Inoperation240, theclient220 requests theserver210 for the MPD file including the information about the trick play data and receives the requested MPD file from theserver210.
Inoperation250, theclient220 requests theserver210 to transmit at least one piece of the trick play data. Theclient220 selects at least one piece of the trick play data corresponding to desired trick play speeds with reference to the MPD file including the information about the trick play data, requests theserver210 for the selected trick play data, and receives the requested trick play data from theserver210.
The MPD file and the trick play data may be requested and transmitted by using an HTTP, though it is understood that another exemplary embodiment is not limited thereto, and another protocol may be used.
FIG. 3 is a diagram showing trick play data according to an exemplary embodiment.
Referring toFIG. 3, theserver120 may include a plurality of pieces ofmedia data310 and320 generated by encoding a media content at a plurality of different bit rates. As shown, theserver120 also includes at least one piece oftrick play data330. For example, “TrackN” may indicate thetrick play data330. Also, thetrick play data330 may include at least one segment generated by dividing thetrick play data330 based on time. In the present exemplary embodiment, “SliceN-1.as”, “SliceN-2.as”, and “SliceN-3.as” indicate segments of thetrick play data330.
Also, theserver120 may includeinformation340 used by theclient130 to access thetrick play data330. For example, theinformation340 may include a “MainMeta.xml” file as information about thetrick play data330 and a “HeadN.ref” file as header information of thetrick play data330. InFIG. 3, “Head1.ref” may indicate a header file of “Track1” and “Head2.ref” may indicate a header file of “Track2.”
The “MainMeta.xml” file is an MPD file. The MPD file may be obtained by theclient130 based on a content access descriptor (CAD) file including information about the media content according to the open IPTV forum (OIPF) standard, though it is understood that another exemplary embodiment is not limited thereto. For example, according to another exemplary embodiment, theclient130 obtains the MPD file by directly requesting theserver120 without reference to the CAD file. Also, it is understood that the “Head1.ref” and “Head2.ref” files may be omitted, for example, where headers are included in thetrick play data330.
Theserver120 may include at least one piece of thetrick play data330 and the number of pieces of thetrick play data330 varies according to a respective method of providing a trick play service.
FIG. 4 is a diagram showing an MPD including type information for identifying trick play data, according to an exemplary embodiment.
Referring toFIG. 4, the MPD includes a template tag about a URL of the trick play data, a tag for defining the location of a header, and track tags for defining a plurality of pieces of media data and at least one piece of the trick play data.
A “URLTemplate” tag defines a common segment of URL information of the trick play data. For example, if “http://asexample.com/vod/movies/18888/Tracks/Seg{TrackID}-{SegmentID}.as” is a URL template, the URL of the trick play data may be defined by substituting an identifier of at least one trick play data and an identifier of at least one segment of the trick play data with “TrackID” and “SegmentID”.
A “RefDataURL” tag defines a URL of a header of the trick play data.
A plurality of “Track” tags are used to define a plurality of pieces of media data generated by encoding media content to have different bit rate qualities, and at least one piece of the trick play data. Each “Track” tag includes an “ID” attribute, a “Type” attribute, a “BitRate” attribute, a “StartTime” attribute, a “SegmentDuration” attribute, a “SegmentStartID” attribute, and a “SegmentCount” attribute.
Hereinafter, each attribute will be described based on a “Track” tag for defining the trick play data.
The “ID” attribute defines a name of the trick play data and may be an identifier of the trick play data. The “Type” attribute defines a type of the trick play data. With respect to the trick play data, information for identifying the trick play data from among audio data, video data, audio/video data, and the trick play data may be defined as the “Type” attribute. The information for identifying the trick play data may be described by using various types of information such as “I-Frame” and “Trick Play.”
The “Bitrate” attribute defines a bit rate of the trick play data, the “StartTime” attribute defines a time stamp for specifying a start time of the trick play data, the “SegmentDuration” attribute defines a duration of segments included in the trick play data, and the “SegmentStartID” attribute defines a number of a segment that initially starts and defines an identifier of at least one segment included in the trick play data.
The “SegmentConunt” attribute defines a total number of segments included in the trick play data.
Although not shown inFIG. 4, a “Segment” tag is a sub tag of the “Track” tag. If the trick play data includes at least one segment generated by encoding media content at a predetermined bit rate and dividing the encoded media content based on time, each segment may be defined.
In the “Segment” tag, an “IntNum” attribute defines a number of the corresponding segment, and a “StartTime” tag defines a start time of the corresponding segment. Furthermore, a “Duration” tag defines a duration of the corresponding segment, and a “url” tag defines a URL of the corresponding segment.
It is understood that in another exemplary embodiment, the “Segment” tag may be omitted, for example, if information about at least one segment included in the trick play data may be inferred from other attributes of the “Track” tag. In this case, the “Segment” tag may not be included in an MPD if the information about at least one segment included in the trick play data may be inferred from the “StartTime”, “SegmentStartID”, “SegmentDuration”, and “SegmentCount” attributes of the “Track” tag. Also, a “url” attribute of the “Segment” tag may be omitted, for example, if a predetermined template is defined in the “URLTemplate” tag, and URLs of segments are inferred by substituting an identifier of the trick play data and an identifier of at least one segment included in the trick play data with the defined predetermined template.
FIG. 5 is a structural diagram of trick play data including I-frames for trick play in units of multiples of two, according to an exemplary embodiment.
Referring toFIG. 5, the trick play data is formed of I-frames in order to allow the trick play data to be decoded by using only the trick play data. If trick play is performed in units of multiples of two, the trick play data for various trick plays includes I-frames corresponding to desired trick play speeds while the number of frames (or a frame rate) is reduced by half.
For example, firsttrick play data510 corresponding to a 2× trick play speed is formed by extracting a frame as an I-frame every 2 seconds, and a decoder trick-plays a frame corresponding to a 2× trick play speed every 2 seconds. Secondtrick play data520 corresponding to a 4× trick play speed may be formed by reducing a frame rate of thetrick play data510 corresponding to a 2× trick play speed by half. Thirdtrick play data530 corresponding to a 8× trick play speed may be formed by reducing a frame rate of thetrick play data520 corresponding to a 4× trick play speed by half. Fourthtrick play data540 corresponding to a 16× trick play speed may be formed by reducing a frame rate of thetrick play data530 corresponding to a 8× trick play speed by half.
Also, trick play data corresponding to a trick play speed of a multiple of a decimal (e.g., 2.5×) may be formed by adjusting the frame rate.
Meanwhile, although not shown inFIG. 5, the trick play data may also be encoded by using inter-frames in addition to I-frames. The inter-frames may include at least one of B-frames and P-frames. In this case, the trick play data at a play start time may be encoded by using at least one I-frame.
FIG. 6 is a diagram for describing a method of providing a trick play service by using multiple streams, according to an exemplary embodiment.
In the present exemplary embodiment, at least one piece of trick play data supports different trick play speeds. Referring toFIG. 6, “Stream_segment0_2×.ts, Stream_segment1_2×.ts, . . . ” are 2× trick play data, “Stream_segment0_4×.ts, Stream_segment1_4×.ts, . . . ” are 4× trick play data, “Stream_segment0_8×.ts, Stream_segment1_8×.ts, . . . ” are 8× trick play data, and “Stream_segment0_16×.ts, Stream_segment1_16×.ts, . . . ” are 16× trick play data. In this case, the number of pieces of the trick play data is determined based on a maximum trick play speed. Accordingly, inFIG. 6, the number of pieces of the trick play data is 4.
An MPD file includes information about the trick play data. Theclient130 requests theserver120 for one piece of trick play data corresponding to a desired trick play speed. Since the one piece of the trick play data includes at least one segment divided and generated based on time, theclient130 receives from theserver120 at least one segment of the corresponding trick play data according to the flow of time.
FIGS. 7 and 8 are diagrams showing MPDs of a method of providing a trick play service by using multiple streams, according to exemplary embodiments.
In the present exemplary embodiment, a trick play speed corresponding to trick play data may be defined by an “AlternatePlayoutRate” attribute, though it is understood that the name of the attribute may vary.
Referring toFIGS. 7 and 8, “<AlternatePlayoutRate>2</AlternatePlayoutRate>” indicates that the corresponding trick play data is 2× trick play data. “<AlternatePlayoutRate>4</AlternatePlayoutRate>” indicates that the corresponding trick play data is 4× trick play data.
FIG. 9 is a diagram for describing a method of providing a trick play service by using a frame range query, according to an exemplary embodiment.
Referring toFIG. 9, in the method of providing a trick play service by using a frame range query, theserver120 includes one piece of trick play data corresponding to a 2× trick play speed, and an index file including locations and sizes of frames. Theclient130 may receive from theserver120 the index file with reference to a URL of the index file, which is included in an MPD file.
Theclient130 receives the index file from theserver120 and requests the trick play data including frames corresponding to a desired trick play speed by using the index file. Theclient130 requests the trick play data including frames corresponding to the desired trick play speed by transmitting to theserver120 an HTTP range query (or an HTTP range request) including locations and sizes of frames to be requested. The location of the frame is described in a “Content Range” field of an HTTP and the size of the frame is described in a “Content Length” field of the HTTP.
Theserver120 forms a frame as trick play data based on the HTTP range query including the locations and sizes of frames, and transmits the corresponding trick play data to theclient130 by using an HTTP range response (or an HTTP partial response).
While in the present exemplary embodiment, the frame is identified and transmitted by using an HTTP, it is understood another exemplary embodiment is not limited thereto, and another protocol may be used.
FIG. 10 is a structural diagram of an MP4 file for performing a method of providing a trick play service by using a frame range query, according to an exemplary embodiment.
An MP4 file is a file of a Moving Picture Experts Group (MPEG)-4part14 video compression coding standard of the International Organization for Standardization/International Electro-technical Commission Joint Technical Committee 1 (ISO/IEC JTC 1), and is also referred to as an MP4 container. A default extension of the MP4 file is “.mp4.”
Referring toFIG. 10, each piece of trick play data in the MP4 file corresponds to a track of the MP4 file. A “trak” box of each track includes metadata of the trick play data. Theserver120 may include one piece of trick play data corresponding to a 2× trick play speed together with media data corresponding to a normal play speed. Each segment divided and generated based on time includes a “moof” box and an “mdat” box. The “moof” box includes metadata of a segment and the “mdat” box includes media content corresponding to the segment.
Theclient130 describes location information of frames corresponding to a desired trick play speed by using a “Trak” box or a “Traf” box of the MP4 file, and requests for trick play data including the frames corresponding to a desired trick play speed.
The frame may be identified and transmitted by using an HTTP, though it is understood that another exemplary embodiment is not limited thereto and another protocol may be used.
FIG. 11 is a diagram for describing a method of providing a trick play service by using virtual streams, according to an exemplary embodiment.
Referring toFIG. 11, in the method of providing a trick play service by using virtual streams, theserver120 includes one piece of trick play data corresponding to a 2× trick play speed, and an index file including locations and sizes of frames. Also, an MPD file may include information about the trick play data corresponding to a 2× trick play speed, which physically exists in theserver120, and information about at least one piece of trick play data corresponding to play speeds other than 2×, which virtually exists in theserver120.
Theclient130 requests for one piece of the virtual trick play data corresponding to a desired trick play speed (e.g., Trick_segment0_4×.as) from among the at least one piece of virtual trick play data. Theserver120 extracts trick play data corresponding to play speeds other than 2× from the trick play data corresponding to a 2× trick play speed by using a CGI program of theserver120 based on the index file including the locations and sizes of frames. Theserver120 transmits the extracted trick play data to theclient130.
In the present exemplary embodiment, desired frames are extracted by using a CGI program based on an index file included in theserver120, though it is understood that another exemplary embodiment is not limited thereto, and another program may be used.
FIG. 12A is a diagram for describing a method of providing a trick play service by using multiple streams having a hierarchical structure and including intra-frames, according to an exemplary embodiment.
Referring toFIG. 12A, in the method of providing a trick play service by using multiple streams having a hierarchical structure, the number of pieces of trick play data is determined based on a maximum depth of trick levels (or a maximum number of trick levels). The maximum depth of trick levels is determined based on a maximum trick play speed. Inexemplary Equation 1, Lmaxis defined as the maximum depth of trick levels, and Rmaxis defined as the maximum trick play speed:
Lmax=log2(Rmax)  <Equation 1>
For example, if the maximum trick play speed is 16× and the trick play service is provided in units of multiples of two, the maximum depth of trick levels is 4 and the number of pieces of the trick play data is 4. The four pieces of the trick play data correspond to trick levels TL1, TL2, TL3, and TL4.
The trick play data corresponding to each trick level corresponds to each trick play speed. However, frames included in the trick play data corresponding to each trick level do not repeatedly exist in another trick level.
For example, if the maximum trick play speed is 16×, the trick play service is provided in units of multiples of two, and the trick play data is encoded by using I-frames, the trick play data corresponding to the trick level TL4 includes I-frames I0, I8, and I16 corresponding to a 16× trick play speed, and the trick play data corresponding to the trick level TL3 includes I-frames I4 and I12 corresponding to a 8× trick play speed other than the I-frames corresponding to the 16× trick play speed. The trick play data corresponding to the trick level TL2 includes 1-frames12,16,110, and114 corresponding to a 4× trick play speed other than the I-frames corresponding to the 8× trick play speed. The trick play data corresponding to the trick level TL1 includes I-frames I1, I3, I5, I7, I9, I11, I13, and I15 corresponding to a 2× trick play speed other than the I-frames corresponding to the 4× trick play speed.
Inexemplary Equation 2, E1,ndefines a frame index of each trick level, n=0, 1, 2, . . . N1, and N1indicates a total number of frames of the trick play data.
Theclient130 requests for at least one piece of trick play data corresponding to each trick level in order to support a desired trick play speed.
For example, theclient130 requests for trick play data corresponding to the trick level TL4 in order to support a 16× trick play speed, requests for a plurality of pieces of trick play data corresponding to the trick levels TL3 and TL4 in order to support a 8× trick play speed, requests for a plurality of pieces of trick play data corresponding to the trick levels TL2, TL3, and TL4 in order to support a 4× trick play speed, and requests for a plurality of pieces of trick play data corresponding to the trick levels TL1, TL2, TL3, and TL4 in order to support a 2× trick play speed.
Inexemplary Equation 3, frame_index(x) defines indices of all I-frames for supporting an x trick play speed:
For example, if the trick play data is encoded by using I-frames, all I-frames for supporting a 8× trick play speed have indices corresponding to the I-frames I0, I4, I8, I12, I16, etc. in the trick levels TL3 and TL4.
FIG. 12B is a diagram for describing a method of providing a trick play service by using multiple streams having a hierarchical structure and including intra- and inter-frames, according to an exemplary embodiment.
Theencoder110 may encode media content by using intra- and inter-frames to generate trick play data. The intra (I)-frames are frames encoded by using information of only corresponding frames. The inter-frames are frames encoded by using information of corresponding frames and other frames and include P-frames and B-frames. Theencoder110 encodes the trick play data at a play start point by using only I-frames. Referring toFIG. 12B, the trick play data includes I-frames, P-frames, and B-frames.
In the method of providing a trick play service by using multiple streams having a hierarchical structure according to an exemplary embodiment, the number of pieces of the trick play data is determined based on a maximum depth of trick levels (or a maximum number of trick levels). The maximum depth of trick levels is determined based on a maximum trick play speed. For example, in the above-describedexemplary Equation 1, Lmaxis defined as the maximum depth of trick levels, and Rmaxis defined as the maximum trick play speed.
InFIG. 12B, if the maximum trick play speed is 16× and the trick play service is provided in units of multiples of two, the maximum depth of trick levels is 4 and the number of pieces of the trick play data is 4. The four pieces of the trick play data correspond to trick levels TL1, TL2, TL3, and TL4.
The trick play data corresponding to each trick level corresponds to each trick play speed. However, frames included in the trick play data corresponding to each trick level do not repeatedly exist in another trick level.
InFIG. 12B, if the maximum trick play speed is 16×, the trick play service is provided in units of multiples of two, and the trick play data is encoded by using I-frames, P-frames, and B-frames, the trick play data corresponding to the trick level TL4 includes I-frames I0, I8, and I16 corresponding to a 16× trick play speed, and the trick play data corresponding to the trick level TL3 includes P-frames P4 and P12 corresponding to a 8× trick play speed other than the frames corresponding to the 16× trick play speed. The trick play data corresponding to the trick level TL2 includes B-frames B2, B6, B10, and B14 corresponding to a 4× trick play speed other than the frames corresponding to the 8× trick play speed. The trick play data corresponding to the trick level TL1 includes B-frames B1, B3, B5, B7, B9, B11, B13, and B15 corresponding to a 2× trick play speed other than the frames corresponding to the 4× trick play speed.
In the above-describedexemplary Equation 2, E1,ndefines a frame index of each trick level, n=0, 1, 2, . . . N1, and N1indicates a total number of frames of the trick play data.
Theclient130 requests at least one piece of trick play data corresponding to each trick level in order to support a desired trick play speed.
For example, theclient130 requests trick play data corresponding to the trick level TL4 in order to support a 16× trick play speed, requests a plurality of pieces of trick play data corresponding to the trick levels TL3 and TL4 in order to support a 8× trick play speed, requests a plurality of pieces of trick play data corresponding to the trick levels TL2, TL3, and TL4 in order to support a 4× trick play speed, and requests a plurality of pieces of trick play data corresponding to the trick levels TL1, TL2, TL3, and TL4 in order to support a 2× trick play speed.
In the above-describedexemplary Equation 3, frame_index(x) defines indices of all frames for supporting an x trick play speed.
If the trick play data is encoded by using I-frames, P-frames, and B-frames as illustrated inFIG. 12B, frames for supporting a 8× trick play speed have indices corresponding to the frames I0, P4, I8, P12, I16, etc., in the trick levels TL3 and TL4.
FIG. 13 is a diagram for describing a method of providing a trick play service by using multiple streams having a hierarchical structure at aserver120, according to an exemplary embodiment
In the present exemplary embodiment, theserver120 includes trick play data corresponding to each trick level. The number of pieces of the trick play data is determined based on a maximum depth of trick levels (or a maximum number of trick levels). As described above with reference toFIG. 12A, the maximum depth of trick levels is determined based on a maximum trick play speed according toexemplary Equation 1.
For example, referring toFIG. 13, if the maximum trick play speed is 16×, the trick play service is provided in units of multiples of two, and the trick play data is encoded by using I-frames, asegment 0 includes a Stream_segment0_16×.ts file including an I-frame I0 corresponding to a trick level TL4, a Stream_segment0_8×.ts file including an I-frame14 corresponding to a trick level TL3, a Stream_segment0_4×.ts file including an I-frame I2 corresponding to a trick level TL2, and a Stream_segment0_2×.ts file including I-frames I1 and I3 corresponding to a trick level TL1.
Also, asegment 1 includes a Stream_segment1_16×.ts file including an I-frame I8 corresponding to the trick level TL4, a Stream_segment1_4×.ts file including an I-frame I6 corresponding to the trick level TL2, and a Stream_segment1_2×.ts file including I-frames I5, I7, and I9 corresponding to the trick level TL1. Since no I-frame corresponds to the trick level TL3, a Stream_segment1_8×.ts file does not exist.
Furthermore, asegment 2 includes a Stream_segment2_8×.ts file including an I-frame112 corresponding to the trick level TL3, a Stream_segment2_4×.ts file including I-frames I10 and I14 corresponding to the trick level TL2, and a Stream_segment2_2×.ts file including I-frames I11 and I13 corresponding to the trick level TL1. Since no I-frame corresponds to the trick level TL4, a Stream_segment2_16×.ts file does not exist.
FIG. 14 is a diagram for describing a method of providing a trick play service by using multiple streams having a hierarchical structure at aclient130, according to an exemplary embodiment.
In the present exemplary embodiment, theclient130 requests at least one piece of trick play data corresponding to each trick level in order to support a desired trick play speed.
For example, referring toFIG. 14, theclient130 requests trick play data corresponding to a trick level TL4 in order to support a 16× trick play speed. Theclient130 requests the Stream_segment0_16×.ts file, the Stream_segment1_16×.ts file, etc.
Theclient130 requests a plurality of pieces of trick play data corresponding to trick levels TL2, TL3, and TL4 in order to support a 4× trick play speed. Theclient130 requests theserver120 for files corresponding to each segment in the trick levels TL2, TL3, and TL4 according to a flow of time. For example, the Stream_segment0_16×.ts, Stream_segment0_8×.ts, and stream_segment 0_4×.ts files are requested in thesegment 0, and the Stream_segment1_16×.ts, Stream_segment1_8×.ts, and stream_segment 1_4×.ts files are requested in thesegment 1.
Theclient130 realigns at least one piece of trick play data corresponding to each trick level in an order of reproduction time. That is, theclient130 realigns I-frames included in the received segment files in order.
FIGS. 15 and 16A are diagrams showing MPDs of a method of providing a trick play service by using multiple streams having a hierarchical structure, according to an exemplary embodiment.
In the present exemplary embodiment, a trick level corresponding to trick play data may be defined by at least one “AlternatePlayoutRate” attribute, though it is understood that the name of the attribute may vary.
Referring toFIGS. 15 and 16A, a trick level TL4 is defined by using four “AlternatePlayoutRate” attributes such as “<AlternatePlayoutRate>2 </AlternatePlayoutRate>, <AlternatePlayoutRate>4</AlternatePlayoutRate>, <AlternatePlayoutRate>8</AlternatePlayoutRate>, and <AlternatePlayoutRate>16</AlternatePlayoutRate>”. A trick level TL2 is defined by using two “AlternatePlayoutRate” attributes such as “<AlternatePlayoutRate>2</AlternatePlayoutRate> and <AlternatePlayoutRate>4</AlternatePlayoutRate>”.
While in the present exemplary embodiment, a trick level is identified by using at least one “AlternatePlayoutRate” attribute, it is understood that another exemplary embodiment is not limited thereto and another identification method may be used.
FIG. 16B is a diagram showing an MPD of a method of providing a trick play service by using multiple streams having a hierarchical structure for identifying a trick level and a frame rate, according to an exemplary embodiment.
Anencoder110 according to an exemplary embodiment may encode media content by using intra- and inter-frames to generate trick play data. The intra (I)-frames are frames encoded by using information of only corresponding frames. The inter-frames are frames encoded by using information of corresponding frames and other frames and include P-frames and B-frames. Furthermore, theencoder110 encodes the trick play data at a play start point by using only I-frames. Referring toFIG. 16B, the trick play data includes I-frames, P-frames, and B-frames.
In the method of providing a trick play service by using multiple streams having a hierarchical structure, the number of pieces of the trick play data is determined based on a maximum number of trick levels. In the present exemplary embodiment, the maximum number of trick levels is 4 and the number of pieces of the trick play data is 4. The four pieces of the trick play data correspond to trick levels TL1, TL2, TL3, and TL4. A trick level of the trick play data may be defined by using a “tricklevel” attribute, though it is understood that the name of the attribute may vary.
The trick play data corresponding to each trick level defines a trick play speed by using a frame rate. The trick play speed of the trick play data may be defined by using a “frame rate” attribute, though it is understood that the name of the attribute may vary. Frames included in the trick play data corresponding to each trick level do not repeatedly exist in another trick level.
In the present exemplary embodiment, theencoder110 defines the trick play data including 30 frames per second (fps) by using four trick levels TL1, TL2, TL3, and TL4.
The trick play data corresponding to the trick level TL4 includes frames corresponding to a trick play speed of 3.75 frames per second (fps), and the trick play data corresponding to the trick level TL3 includes frames corresponding to a trick play speed of 7.5 fps other than the frames corresponding to the trick play speed of 3.75 fps. The trick play data corresponding to the trick level TL2 includes frames corresponding to a trick play speed of 15 fps other than the frames corresponding to the trick play speed of 7.5 fps. The trick play data corresponding to the trick level TL1 includes frames corresponding to a trick play speed of 30 fps other than the frames corresponding to the trick play speed of 15 fps. The frames of the trick play data are stored in theserver120 in an order from the trick level TL4 to the trick level TL1, and a desired trick level may be accessed by using index information indicating the trick level.
Theclient130 requests at least one piece of trick play data corresponding to each trick level in order to support a desired trick play speed.
For example, the client,130 requests trick play data corresponding to the trick level TL4 in order to support a trick play speed of 3.75 fps, requests a plurality of pieces of trick play data corresponding to the trick levels TL3 and TL4 in order to support a trick play speed of 7.5 fps, requests a plurality of pieces of trick play data corresponding to the trick levels TL2, TL3, and TL4 in order to support a trick play speed of 15 fps, and requests a plurality of pieces of trick play data corresponding to the trick levels TL1, TL2, TL3, and TL4 in order to support a trick play speed of 30 fps.
According to another exemplary embodiment, for example, location information of trick play data corresponding to each trick level may be added in the form of a box of an MP4 file by using information about a trick level, and a frame rate, though it is understood that the name of the box may differ.
FIG. 17 is a structural diagram of a transport stream (TS) packet for detecting an I-frame from an MPEG TS, according to an exemplary embodiment.
Referring toFIG. 17, an “Adaptation field” is a portion of a TS header and is an optional field for inputting TS-related additional information. The “Adaptation field” has a plurality of parameters and includes a “private-data-byte” field that may be arbitrarily used by a user. A “transport-private-data-length” parameter indicates the size of the “private-data-byte” field included in the “Adaptation field.” The “private-data-byte” field is a space for storing data arbitrarily defined by the user. Aclient130 according to an exemplary embodiment may calculate a start point of a subsequent I-frame in the MPEG TS by using the “transport-private-data-length” parameter and the “private-data-byte” field.
The start point of a subsequent I-frame in the MPEG TS may be calculated when theclient130 realigns I-frames included in received segment files in order according to a method of providing a trick play service by using multiple streams having a hierarchical structure, though it is understood that another exemplary embodiment is not limited thereto, and the start point of a subsequent I-frame may be calculated for another purpose.
FIG. 18 is a diagram for describing a method of forming a TS packet for detecting an I-frame from an MPEG TS, according to an exemplary embodiment.
Referring toFIG. 18, an “Adaptation field” includes a “private-data-byte” field for inputting “private data.” Anencoder110 according to an exemplary embodiment defines the length of the “private-data-byte” field and inputs the length as a “transport-private-data-length” parameter. Theencoder110 records the “private data” in the “private-data-byte” field by the “transport-private-data-length.” The “private-data-byte” field has a numerical value in the form of an “unsigned integer.” The value of the “private-data-byte” field is an offset value regarding a start point of a TS packet having a subsequent I-frame with respect to a current TS packet. If a plurality of I-frames is included in one TS, the “Adaptation field” exists at a start point of each I-frame.
FIG. 19 is a flowchart of a method of detecting an I-frame from an MPEG TS, according to an exemplary embodiment.
Referring toFIG. 19, inoperation1910, aclient130 downloads a trick play segment from aserver120.
Inoperation1920, if a subsequent I-frame is to be detected from the trick play segment, an “Adaptation field” of the MPEG TS is parsed.
Inoperation1930, an offset value of the subsequent I-frame is extracted by using a “private-data-byte” field of the “Adaptation field.” For example, if the offset value is 2462, 0x99E obtained by changing the value 2462 into a 16-bit value is calculated. Since the size of the “unsigned integer” is 4 bytes, the value of a “transport-private-data-length” parameter is registered as 4. 0x99E is converted into “0x0 00x00 0x09 0x9E,” i.e., a 4-byte integer. The converted value is input to the “private-data-byte” field. Moreover, according to a method of extracting the offset value from the “private-data-byte” field, if the private-data-byte is declared as pdb[4], the offset value may be calculated as (int) (pdb[3]<<24|pdb[2]<<16|pdb[1]<<8|pdb[0]).
Inoperation1940, a TS file, i.e., a segment file, is divided by the offset value of the subsequent I-frame.
Inoperation1950, it is determined whether the segment file is the last frame. If the segment file is not the last frame, the method returns tooperation1930 and a subsequent I-frame is extracted. If the segment file is the last frame, the method returns tooperation1910 and theclient130 downloads a subsequent trick play segment from theserver120.
FIG. 20 is a structural diagram of an MP4 file for detecting an I-frame from an MPEG TS, according to an exemplary embodiment.
Referring toFIG. 20, in the MP4 file, each piece of trick play data corresponds to a track of the MP4 file. A “trak” box of each track includes metadata of the trick play data. The number of pieces of the trick play data may be determined based on a maximum trick play speed according toexemplary Equation 1 as described above with reference toFIG. 11.
Aserver120 according to an exemplary embodiment includes at least one piece of trick play data corresponding to each trick level, together with normal speed play data. The trick play data corresponding to each trick level includes at least one segment divided and generated based on time. Each segment includes a “moof” box and an “mdat” box. The “moof” box includes metadata of a segment and the “mdat” box includes media content corresponding to the segment.
Location information of an I-frame corresponding to a desired trick play speed may be obtained by using a “Trak” box or a “Traf” box of the MP4 file.
When I-frames included in received segment files are realigned in order, aclient130 according to an exemplary embodiment may obtain the location information of a subsequent I-frame by using the “Trak” box or the “Traf” box.
FIG. 21 is a conceptual diagram for describing a method of providing a trick play service by varying a frame rate, according to an exemplary embodiment.
In the method of providing a trick play service by varying a frame rate, aserver120 includes one piece of trick play data corresponding to a default play speed. For example, if a maximum trick play speed is 16× and the trick play service is provided in units of multiples of two, theserver120 includes one piece of trick play data corresponding to a 2× trick play speed. Referring toFIG. 21, theserver120 includes one piece of trick play data corresponding to a 2× trick play speed, which is formed by extracting one frame in every unit time of two seconds. Play speeds other than 2× may be supported by varying a frame rate of the trick play data corresponding to a 2× trick play speed at theclient130.
Theclient130 may support 2× trick play by playing one frame in a unit time, may support 4× trick play by playing two frames in the unit time, may support 8× trick play by playing four frames in the unit time, and may support 16× trick play by playing eight frames in the unit time.
Furthermore, the method of providing a trick play service by varying a frame rate may be used together with at least one of the above-described methods of providing a trick play service by using multiple streams, method of providing a trick play service by using a frame range query, method of providing a trick play service by using virtual streams, and method of providing a trick play service by using multiple streams having a hierarchical structure, so as to support a variable trick play speed. For example, where the method of providing a trick play service by using multiple streams and the method of providing a trick play service by varying a frame rate are used together, aclient130 may receive from theserver120 one piece of trick play data corresponding to a certain trick play speed (e.g., 8×), and then may support a variable trick play speed that is different from the certain trick play speed by varying a frame rate.
FIG. 22 is a diagram for describing a method of providing a trick play service by varying a frame rate at aserver120 and aclient130 illustrated inFIG. 1, according to an exemplary embodiment.
Referring toFIG. 22, theserver120 includes a piece of trick play data corresponding to a default play speed, and an MPD file including information about the trick play data. For example, if a maximum trick play speed is 16× and the trick play service is provided in units of multiples of two, theserver120 includes a piece of trick play data corresponding to a 2× trick play speed.
The MPD file includes information about a frame rate, a frame type, and the maximum trick play speed. The frame rate indicates the number of frames to be played per second at theclient130. The frame type indicates whether the trick play data includes only intra-frames, or intra- and inter-frames. InFIG. 22, an “alternatePlayoutRate” attribute describes the maximum trick play speed as 16, a “type” attribute identifies that the trick play data includes only I-frames, and a “frameRate” attribute is described as “1 fps.” A schema of the MPD file will be described in detail below with reference toFIGS. 23 and 24.
Theclient130 requests a piece of trick play data corresponding to a 2× trick play speed based on the MPD file including information about the trick play data.
If theclient130desires 2× trick play, one frame corresponding to 2× is played in every unit time based on the frame rate described in the MPD file. Theclient130 may support play speeds other than 2× by varying the frame rate.
For example, if theclient130desires 4× trick play, the frame rate described in the MPD file is varied from “1 fps” to “2 fps” and two frames corresponding to 2× are played in every unit time.
If theclient130desires 8× trick play, the frame rate described in the MPD file is varied from “1 fps” to “4 fps” and four frames corresponding to 2× are played in every unit time.
If theclient130 desires 16× trick play, the frame rate described in the MPD file is varied from “1 fps” to “8 fps” and eight frames corresponding to 2× are played in every unit time.
FIG. 23 is a schema of an MPD of a method of providing a trick play service by varying a frame rate, according to an exemplary embodiment.
Referring toFIG. 23, in the method of providing a trick play service by varying a frame rate, an MPD includes a “TrickMode” tag. The “TrickMode” tag includes an “alternatePlayoutRate” attribute, a “type” attribute, and a “frameRate” attribute.
The “alternatePlayoutRate” attribute defines a maximum trick play speed. The “type” attribute defines whether trick play data includes only intra-frames, or intra- and inter-frames. The “frameRate” attribute defines the number of frames to be played per second at aclient130.
FIG. 24 is a diagram showing an MPD of a method of providing a trick play service by varying a frame rate, according to an exemplary embodiment.
Referring toFIG. 24, an “alternatePlayoutRate” attribute has avalue 16 and indicates that a maximum trick play speed is 16×. A “type” attribute has a value “Intra” from among values “Intra” and “Intra or Inter”, and indicates that trick play data includes only I-frames. A “frameRate” attribute has avalue 1 and indicates that the number of frames to be played per second is one.
FIG. 25 is a block diagram of aserver2500 according to an exemplary embodiment.
Referring toFIG. 25, theserver2500 includes aninformation generation unit2510, aninformation transmission unit2520, and a trick playdata transmission unit2530.
Theinformation generation unit2510 generates an MPD file including information about at least one piece of trick play data. The MPD file includes type information for identifying that the trick play data is data for trick play.
In a method of providing a trick play service by using multiple streams according to an exemplary embodiment, the number of pieces of trick play data is determined based on a maximum trick play speed, and the trick play data includes at least one of a plurality of segments generated by encoding media content at a predetermined bit rate into frames corresponding to a predetermined trick play speed and dividing the encoded frames based on time. In this case, the MPD file includes information about the predetermined trick play speed, and URLs of the segments that are divided and generated based on time and are to be played at the predetermined trick play speed.
In a method of providing a trick play service by using virtual streams according to an exemplary embodiment, the MPD file includes information about trick play data corresponding to a 2× trick play speed, which physically exists in theserver2500, and information about at least one piece of trick play data corresponding to play speeds other than 2×, which virtually exists in theserver2500. In this case, theserver2500 may further include an extraction unit (not shown) which extracts the trick play data corresponding to play speeds other than 2× from the trick play data corresponding to a 2× trick play speed upon a request of a client based on the MPD file. The extraction unit may be realized by using a CGI program based on an index file including locations and sizes of frames.
In a method of providing a trick play service by using a frame range query according to an exemplary embodiment, at least one piece of trick play data includes at least one of a plurality of segments generated by encoding media content at a predetermined bit rate into frames corresponding to a predetermined trick play speed (e.g., 2×) and dividing the encoded frames based on time. In this case, theinformation generation unit2510 may further generate an index file including locations and sizes of frames. Furthermore, in this case, the MPD file includes information about trick play data corresponding to a predetermined trick play speed, and a URL of the index file. Theserver2500 may further include an extraction unit (not shown) which extracts the trick play data corresponding to play speeds other than 2× from the trick play data corresponding to a 2× trick play speed upon a request of the client that received the MPD file and the index file. The extraction unit may be realized by using an HTTP server capable of processing an HTTP range response.
In a method of providing a trick play service by using multiple streams having a hierarchical structure according to an exemplary embodiment, the number of pieces of trick play data is determined based on a maximum trick play speed, and the trick play data includes at least one of a plurality of segments generated by encoding media content at a predetermined bit rate into frames corresponding to a predetermined trick level and dividing the encoded frames based on time. The predetermined trick level forms a hierarchical structure and is one of at least one trick level based on a maximum depth of trick levels. The maximum depth of trick levels is determined based on the maximum trick play speed. Moreover, frames included in the predetermined trick level do not repeatedly exist in another trick level. In this case, the MPD file includes information about the predetermined trick level, and URLs of a plurality of segments divided and generated based on time and corresponding to the predetermined trick level. The predetermined trick level is described to include information about at least one trick play speed using a plurality of segments divided and generated based on time.
In a method of providing a trick play service by varying a frame rate according to an exemplary embodiment, at least one piece of trick play data includes at least one of a plurality of segment generated by encoding media content at a predetermined bit rate into frames corresponding to a predetermined trick play speed, and dividing the encoded frames based on time. In this case, the MPD file includes information about at least one of a frame rate, a frame type, and a maximum trick play speed. The frame rate indicates the number of frames to be played per second at the client. The frame type indicates whether the trick play data includes only intra-frames, or intra- and inter-frames.
Theinformation transmission unit2520 transmits the MPD file to the client.
In a method of providing a trick play service by using a frame range query according to an exemplary embodiment, theinformation transmission unit2520 may further transmit the index file to the client.
The trick playdata transmission unit2530 transmits the trick play data to the client upon a request of the client based on the MPD file.
FIG. 26 is a block diagram of aclient2600 according to an exemplary embodiment.
Referring toFIG. 26, theclient2600 includes aninformation reception unit2610 and a trick playdata reception unit2620.
Theinformation reception unit2610 receives from a server an MPD file including information about at least one piece of trick play data.
The trick playdata reception unit2620 receives the trick play data from the server based on the MPD file.
In a method of providing a trick play service by using multiple streams having a hierarchical structure according to an exemplary embodiment, the trick playdata reception unit2620 receives a plurality of segments divided and generated based on time and corresponding to at least one trick level in order to support a predetermined trick play speed based on the request of the client. In this case, theclient2600 may further include a realignment unit (not shown) which realigns the trick play data in an order of play time.
In a method of providing a trick play service by varying a frame rate according to an exemplary embodiment, theclient2600 may further include a play unit (not shown) which reproduces the trick play data based on a frame rate.
In a method of providing a trick play service by using a frame range query according to an exemplary embodiment, theinformation reception unit2610 may further receive an index file from the server with reference to a URL of the index file, which is included in the MPD file. In this case, the trick playdata reception unit2620 may receive from the server the trick play data including I-frames corresponding to a desired trick play speed based on the index file. The trick playdata reception unit2620 may be realized by using an HTTP client capable of processing an HTTP range query.
While exemplary embodiments have been particularly shown and described above, it will be understood by those of ordinary skill in the art that various changes in form and details may be made therein without departing from the spirit and scope of the present inventive concept as defined by the following claims.
For example, theserver2500 and theclient2600 can include a bus coupled to units of each of the devices shown inFIGS. 25 and 26, and at least one processor connected to the bus. In addition, a memory coupled to at least one processor for performing commands as described above can be included and connected to the bus to store the commands and received messages or generated messages.
An exemplary embodiment can also be embodied as computer readable codes on a computer readable recording medium. The computer readable recording medium is any data storage device that can store data which can be thereafter read by a computer system. Examples of the computer readable recording medium include read-only memory (ROM), random-access memory (RAM), CD-ROMs, magnetic tapes, floppy disks and, optical data storage devices. The computer readable recording medium can also be distributed over network coupled computer systems so that the computer readable code is stored and executed in a distributed fashion.