Coding mode (M^*), but as the lagrangian values J that gets off and troop and calculate_MMake S be made as the set of coding mode K, wherein the lagrangian values of Ji Suaning satisfies condition:

S = {k | \frac{J^{*}}{J_{k}} | &GreaterEqual; ϵ} - - - (3)

Wherein general Shillong in distress (" ε ") is the error amount of selecting, J^*Be the J of the minimum of all patterns.Ifcoding mode 0 is the element of S set, then system selectscoding mode 0 as the coding mode that will be used to the encoded pixels module, otherwise system select withCorresponding codes pattern M^*(generate the coding mode M of minimum J value^*).

Above-mentioned step utilized with benchmark (nonstandardized technique) H.264 encoder compare novel assembly.Especially, the present invention uses Huber cost function calculated distortion, the Lagrange's multiplier of modification and trooping of lagrangian values.

The Huber cost function belongs to robust M estimator classification.The key property of these functions is abilities that they reduce the outlier influence.More particularly, if any outlier is present in the picture element module, then the Huber cost function is lower than (the quadratic power ground) of mean square error function to their weighting (linearly), and making successively may be identical with the coding mode of neighboring macro-blocks to the selected coding mode of picture element module.

The lagrangian multiplier of revising must be slower as the function of quantization parameter Q, thereby the degree height that the degree that the distortion components of lagrangian values J is paid attention to is paid attention to than bit rate composition R.(in the document, " lambda " or " λ " expression is used in coding mode and determines Lagrange's multiplier in the process.The multiplier that is used in the motion vector selection process is different).

At last, trooping of the lagrangian values of describing in the past supported coding mode 0.Therefore, system of the present invention allows to utilize respectively for the Direct Model of B picture element module and P picture element module or the skip mode more picture element module of encoding.

Experimental result

Vidclip " is visited Egypt (Discovering Egypt) " by coming from, 9 kinds of color video montages of " wafing " and " Britain patient " constitute to be used in video measurement collection in the experiment.The particular characteristics of these video sequences is as described in Table 1.

Table 1: cycle tests

(slightly write ch and Og and represent chapters and sections and oppositely flicker (glance) respectively)

Sequence number	The video sequence title	Frame size	Frame number	Type
					1	Visit Egypt, ch.1	704×464	58	Distant taking the photograph
2	Waft ch.11	720×480	44	Og
					3	Visit Egypt, ch.1	704×464	630	Distant taking the photograph
4	Visit Egypt, ch.2	704×464	148	Zoom
					5	Visit Egypt, ch.3	704×464	196	Lifting (Boom)
6	Visit Egypt, ch.6	704×464	298	Distant taking the photograph
					7	The Britain patient, ch.2	720×352	97	Veining
8	The Britain patient, ch.6	720×352	196	Og
					9	The Britain patient, ch.8	720×352	151	Og

Frame of video is represented with yuv format, equals per second 23.976 frames (fps) for all video sequence video frame rates.The visual quality of the bit rate R of the video sequence of utilization compression and the video sequence of decoding is come the effect of the method for the present invention's recommendation is assessed.Visual inspection and Y-PSNR (PSNR) value by video sequence are assessed the latter.

The assembly of the novelty in the coding method of the present invention that the Direct Model Enhancement Method of recommending is partly described replenishes the influence of speed and distortion mutually according to them.Method of the present invention makes overall bit rate reduce and the minimizing of slight Y-PSNR.Two experiments that utilization is described in following textual portions are assessed system of the present invention.

The fixed quantisation parameter of all sequences

To all video sequences, first tests selected quantization parameter is identical, and equals Q, Q+1, Q+3 respectively for I frame, P frame and B frame.Described in table 2, when utilizing coding method of the present invention, the minimizing of bit rate can be 9%, wherein the about 0.12dB of loss of Y-PSNR (PSNR).With comparing of the method coding that utilizes benchmark, there is not visible distortion in the video sequence that utilizes coding method of the present invention to encode.

Table 2: utilize identical quantization parameter Q to use bit rate (BR) [k bps] and the Y-PSNR (PSNR) [dB] of video sequence of the method for pedestal method and recommendation to all sequences

The highest quantization parameter of each sequence

For the further validity of assessment coding method of the present invention, design and carried out second experiment.When bit rate R and Y-PSNR value all reduced, general argument was that several different methods such as pre-filtering video sequence, the value that increases quantizer Q etc. can generate identical result.The purpose of this experiment is to show that method of the present invention can further reduce bit rate when these methods can not further be used under the situation of the quality that does not unacceptably weaken video.

At first, to the video sequence of each test, when distortion becomes visible, utilize pedestal method to reduce bit rate as far as possible by the value that increases quantization parameter, up to Q_Max+ 1.Next, system utilizes Q_Max(distortion is sightless maximum also) and pedestal method coding and decoding video sequence generate the bit rate and Y-PSNR (PSNR) value that are included in the table 3.For each sequence, Q_MaxValue is different, and for I frame, P frame and B frame, it also is respectively different.Suppose that maximum available bit minimizing does not have vision loss, is coded in identical Q with coding method of the present invention then_MaxThe sequence of value.

Table 3: utilize the highest quantization parameter to use bit rate (BR) [k bps] and the Y-PSNR (PSNR) [dB] of film sequence of the method for pedestal method and recommendation

Described in table 3, method of the present invention can further reduce bit rate 13.3% significantly, and (PSNR) loses about 0.29dB for Y-PSNR.(in order to assess the relevant pseudomorphism of any B frame) can the deterministic bit rate minimizing not introduce visual pseudomorphism by the sequence visual inspection under full motion in the video sequence of decoding.Notice that when utilizing method of the present invention, the value that can increase quantization parameter surpasses Q_Max, and obtain the more bits rate and reduce and do not have a vision loss.

Conclusion

The invention provides a kind of method, be used for the enhancing of skip mode in the enhancing of Direct Model in the framework B image of the video compression standard of (MPEG4/ part 10) H.264 and the P image.System of the present invention utilizes Huber cost function calculated distortion, revises Lagrange's multiplier, and the lagrangian values of trooping is to select to be used for the coding mode of encoded pixels module.Test has shown the method for the present invention of utilizing, just can obtain significant bit-rate reduction with small Y-PSNR (PSNR) loss, and not have subjective visual quality to descend.As additives, when other the value where applicable no longer of scheme such as further increase quantization parameter, it is particularly useful that these characteristics make that method of the present invention reduces for the bit rate in any video coding system, and this video coding system utilizes the distortion rate optimization framework that coding mode is determined.

The method and apparatus of combine digital figure image intensifying has below been described.Under the situation that does not deviate from scope of the present invention, those of ordinary skill in the art can make a change and revise material and the arrangement of parts of the present invention.

Claims

1. method that execution pattern is selected in video compression and coded system, described method comprises:

Come the Code And Decode picture element module with each possible coding mode;

Calculate the distortion value of each coding mode;

Calculate the bit-rates values of each coding mode;

Use described distortion value, described bit-rates values and Lagrange's multiplier to calculate the lagrangian values of each coding mode;

The set of recognition coding pattern, wherein the Zui Xiao lagrangian values of calculating with this set in the ratio of the lagrangian values that is associated of each coding mode more than or equal to the threshold error value;

When coding mode 0 belongs to the set of the coding mode of identifying, select coding mode 0; And

When coding mode 0 does not belong to the set of the coding mode of identifying, select the coding mode that is associated with the minimum lagrangian values of calculating.

2. method according to claim 1, wherein said distortion value is calculated as the Huber functional value sum of the error between the pixel in the picture element module of the pixel in the original picture element module and decoding, wherein said Huber function has reduced the influence of outlier, and described Huber function is lower than mean square error function to the weighting of the outlier in the picture element module.

3. method according to claim 1 is wherein calculated described bit-rates values and is comprised one group of motion vector of calculation code and one group of necessary total number of bits of conversion coefficient.

4. method according to claim 1, wherein said Lagrange's multiplier comprises the Lagrange's multiplier of slow change, it is as the function of quantized value.

5. method that execution pattern is selected in video compression and coded system, described method comprises:

Come the Code And Decode picture element module with each possible coding mode;

Calculate the distortion value of each coding mode;

Calculate the bit-rates values of each coding mode;

The Lagrange's multiplier of using described distortion value, described bit-rates values and slowly changing is calculated the lagrangian values of each coding mode, when calculating described lagrangian values, the Lagrange's multiplier of described slow change changes slowlyer as the function of quantized value than standard Lagrange's multiplier, to emphasize distortion value with respect to bit-rates values; And

Select coding mode by using the lagrangian values of calculating.

6. method according to claim 5 wherein reduces outlier by use the function of the influence of distortion value is calculated described distortion value.

7. method according to claim 6, wherein said function comprises the Huber function.

8. method according to claim 5 is wherein calculated described bit-rates values and is comprised one group of motion vector of calculation code and one group of necessary total number of bits of conversion coefficient.

9. one kind is used for from the method for a plurality of coding modes selection coding modes, and described method comprises:

At each coding mode from described a plurality of coding modes,

Reduce outlier to the function of the influence of distortion value by use, the distortion value of calculation code pattern;

Calculate the bit-rates values of this coding mode; And

Based on bit-rates values and the Lagrange's multiplier of described distortion value, this coding mode, calculate lagrangian values;

When coding mode 0 belongs to the set of the coding mode of identifying, select coding mode 0;

When coding mode 0 does not belong to the set of the coding mode of identifying, select the coding mode that is associated with the minimum lagrangian values of calculating; And

Use selected coding mode a plurality of video images of encoding.

10. method according to claim 9, the influence that wherein reduces outlier are in order to select the coding mode of a pixel groups, and it is identical with the coding mode that is used for the coding sets of adjacent pixels.

11. method according to claim 9, wherein when the error amount between the decoded pixel value group of the original pixel value group of video image and this video image during greater than the threshold error value, described distortion value equals described error amount * described threshold error value-(described threshold error value)²/ 2.

12. method according to claim 9, wherein when the error amount between the decoded pixel value group of the original pixel value group of video image and this video image was equal to or less than the threshold error value, described distortion value equaled (described error amount)²/ 2.

13. an equipment that is used for selecting from a plurality of coding modes coding mode, described equipment comprises:

Be used for reducing outlier by use the function of the influence of distortion value is calculated device from the distortion value of each coding mode of described a plurality of coding modes;

The device that is used for the bit-rates values of each coding mode of calculating;

For the device that calculates lagrangian values based on bit-rates values and the Lagrange's multiplier of described distortion value, this coding mode for each coding mode from described a plurality of coding modes;

The device that is used for the set of recognition coding pattern, wherein the Zui Xiao lagrangian values of calculating with this set in the ratio of the lagrangian values that is associated of each coding mode more than or equal to the threshold error value;

Be used for when coding mode 0 belongs to the set of the coding mode of identifying, selecting coding mode 0; And when coding mode 0 does not belong to the set of the coding mode of identifying, the device of the coding mode that selection and the minimum lagrangian values of calculating are associated; And

Be used for to use the encode device of a plurality of video images of selected coding mode.

14. equipment according to claim 13, the influence that wherein reduces outlier are in order to select the coding mode of a pixel groups, it is identical with the coding mode that is used for the coding sets of adjacent pixels.

15. equipment according to claim 13, wherein when the error amount between the decoded pixel value group of the original pixel value group of video image and this video image during greater than the threshold error value, described distortion value equals described error amount * described threshold error value-(described threshold error value)²/ 2.

16. equipment according to claim 13, wherein when the error amount between the decoded pixel value group of the original pixel value group of video image and this video image was equal to or less than the threshold error value, described distortion value equaled (described error amount)²/ 2.

17. a method that is used for selecting from a plurality of coding modes coding mode, described method comprises:

At each coding mode from described a plurality of coding modes,

Reduce outlier calculates this coding mode to the function of the influence of distortion value distortion value by use;

Calculate the bit-rates values of this coding mode; And

Based on the distortion value of (i) this coding mode, the (ii) bit-rates values of this coding mode and the Lagrange's multiplier that (iii) slowly changes, calculate lagrangian values, when calculating described lagrangian values, described Lagrange's multiplier with than the low speed of reference Lagrange's multiplier as the function of quantization parameter and change, to emphasize distortion value with respect to bit-rates values;

From described a plurality of coding modes, select coding mode based on the lagrangian values of calculating; And

Use selected coding mode a plurality of video images of encoding.

18. H.264 method according to claim 17 is wherein saidly being stipulated in the standard with reference to Lagrange's multiplier.

19. an equipment that is used for selecting from a plurality of coding modes coding mode, described equipment comprises:

Be used for reducing outlier calculates the distortion value of each coding mode to the function of the influence of distortion value device by use;

Be used at each coding modes of described a plurality of coding modes based on the distortion value of (i) this coding mode, the (ii) bit-rates values of this coding mode and the device that the Lagrange's multiplier that (iii) slowly changes is calculated the lagrangian values of described coding mode, when calculating described lagrangian values, described Lagrange's multiplier with than the low speed of reference Lagrange's multiplier as the function of quantization parameter and change, to emphasize distortion value with respect to bit-rates values;

Be used for from described a plurality of coding modes, select the device of coding mode based on the lagrangian values of calculating; And

20. H.264 equipment according to claim 19 wherein saidly stipulated in the standard with reference to Lagrange's multiplier.

21. a method that is used for selecting from a plurality of coding modes coding mode, described a plurality of coding modes comprise coding mode 0, and described method comprises:

At each coding mode from described a plurality of coding modes,

Calculate the distortion value of this coding mode;

Calculate the bit-rates values of this coding mode; And

Based on (i) this distortion value, (ii) this bit-rates values and the Lagrange's multiplier that (iii) slowly changes are calculated the lagrangian values of this coding mode, when calculating described lagrangian values, the Lagrange's multiplier of described slow change changes with the function that the standard Lagrange's multiplier is compared as quantized value slowlyer, to emphasize distortion value with respect to bit-rates values;

Minimum lagrangian values in the lagrangian values of determining to calculate;

The set of recognition coding pattern, wherein said minimum lagrangian values with this set in the ratio of the lagrangian values that is associated of each coding mode more than or equal to the threshold error value;

When coding mode 0 does not belong to the set of the coding mode of identifying, the coding mode that selection and described minimum lagrangian values are associated.

22. method according to claim 21, wherein said coding mode 0 are the Direct Model codings.

23. method according to claim 21, wherein said coding mode 0 are the skip mode codings.

24. method according to claim 21, wherein said coding mode 0 are the coding modes of transmitting moving vector information not.

25. an equipment that is used for selecting from a plurality of coding modes coding mode, described a plurality of coding modes comprise coding mode 0, and described equipment comprises:

The device that is used for the distortion value of each coding mode of calculating;

Be used for each coding mode at described a plurality of coding modes, based on the distortion value of (i) this coding mode, the Lagrange's multiplier that (ii) slowly changes and (iii) the bit-rates values of this coding mode calculate the device of lagrangian values, when calculating described lagrangian values, the Lagrange's multiplier of described slow change changes with the function that the standard Lagrange's multiplier is compared as quantized value slowlyer, to emphasize distortion value with respect to bit-rates values;

The device that is used for the minimum lagrangian values of definite lagrangian values of calculating;

The device that is used for the set of recognition coding pattern, the ratio of wherein said minimum lagrangian values and the lagrangian values of calculating for each coding mode in this set is more than or equal to the threshold error value; And

Be used for when coding mode 0 belongs to the set of the coding mode of identifying, selecting coding mode 0; And when coding mode 0 does not belong to the set of the coding mode of identifying, the device of the coding mode that selection and described minimum lagrangian values are associated.

26. equipment according to claim 25, wherein said coding mode 0 are the Direct Model codings.

27. equipment according to claim 25, wherein said coding mode 0 are the skip mode codings.

28. equipment according to claim 25, wherein said coding mode 0 are the coding modes of transmitting moving vector information not.

29. the equipment that execution pattern is selected in video compression and coded system, described equipment comprises:

Be used for coming with each possible coding mode the device of Code And Decode picture element module;

Be used for using described distortion value, described bit-rates values and Lagrange's multiplier to calculate the device of the lagrangian values of each coding mode;

The device that is used for the set of recognition coding pattern, wherein the Zui Xiao lagrangian values of calculating with this set in the ratio of the lagrangian values that is associated of each coding mode more than or equal to the threshold error value; And

Be used for when coding mode 0 belongs to the set of the coding mode of identifying, selecting coding mode 0; And when coding mode 0 does not belong to the set of the coding mode of identifying, the device of the coding mode that selection and the minimum lagrangian values of calculating are associated.

30. equipment according to claim 29, wherein said distortion value is calculated as the Huber functional value sum of the error between the pixel in the picture element module of the pixel in the original picture element module and decoding, and wherein said Huber function is lower than mean square error function to the weighting of the outlier in the picture element module.

31. equipment according to claim 29, the device that wherein be used for to calculate described bit-rates values comprises the device for one group of motion vector of calculation code and one group of necessary total number of bits of conversion coefficient.

32. equipment according to claim 29, wherein said Lagrange's multiplier comprises the Lagrange's multiplier of slow change, and it is as the function of quantized value.

33. the equipment that execution pattern is selected in video compression and coded system, described equipment comprises:

The Lagrange's multiplier that is used for using described distortion value, described bit-rates values and slowly changes is calculated the device of the lagrangian values of each coding mode, when calculating described lagrangian values, the Lagrange's multiplier of described slow change changes slowlyer as the function of quantized value than standard Lagrange's multiplier, to emphasize distortion value with respect to bit-rates values; And

Be used for by using the lagrangian values of calculating to select the device of coding mode.

34. equipment according to claim 33 wherein reduces outlier by use the function of the influence of distortion value is calculated described distortion value.

35. equipment according to claim 34, wherein said function comprises the Huber function.

36. equipment according to claim 33, the device that wherein be used for to calculate described bit-rates values comprises the device for one group of motion vector of calculation code and one group of necessary total number of bits of conversion coefficient.