US8325220B2 - Stereoscopic image display method and apparatus, method for generating 3D image data from a 2D image data input and an apparatus for generating 3D image data from a 2D image data input - Google Patents

Stereoscopic image display method and apparatus, method for generating 3D image data from a 2D image data input and an apparatus for generating 3D image data from a 2D image data input

Info

Publication number
US8325220B2
US8325220B2 (application US12/095,183; US9518306A)
Authority
US
United States
Prior art keywords
regions
image data
image
region
focus
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related, expires
Application number
US12/095,183
Other versions
US20080303894A1 (en)
Inventor
Fabian Edgar Ernst
Bart Gerard Bernard Barenbrug
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Koninklijke Philips NV
Original Assignee
Koninklijke Philips Electronics NV
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Koninklijke Philips Electronics NV
Assigned to KONINKLIJKE PHILIPS ELECTRONICS N.V. Assignment of assignors interest (see document for details). Assignors: BARENBRUG, BART GERARD BERNARD; ERNST, FABIAN EDGAR
Publication of US20080303894A1
Application granted
Publication of US8325220B2
Expired - Fee Related
Adjusted expiration

Abstract

2D image data are converted into 3D image data. The image is divided, on the basis of focusing characteristics, into two or more regions, and it is determined to which region an edge separating two regions belongs. The regions are depth ordered in accordance with the rule that a region comprising an edge is closer to the viewer than an adjacent region, and 3D depth information is assigned to the regions in accordance with the established depth order of the regions. Preferably a depth is assigned to each of the regions in dependence on an average or median focusing characteristic of the region.

Description

The invention relates to a stereoscopic image display method wherein 2D image data are converted into 3D image data and wherein focus information is extracted from the 2D image data and used for generating the 3D image data.
The invention also relates to a stereoscopic image display device comprising an input for 2D image data and a converter to convert the input 2D image data into 3D image data, the converter comprising a focus information extractor for extracting focus information from the 2D image data.
The invention also relates to an image display data conversion method wherein 2D image data are converted into 3D image data and wherein focus information is extracted from the 2D image data and used for generating the 3D image data.
The invention further relates to a 3D image signal.
The invention further relates to a computer program product to be loaded by a computer arrangement, comprising instructions to generate 3D image data on the basis of a 2D image data input, the computer arrangement comprising processing means.
A stereoscopic image display method and device of the kind described in the opening paragraph are disclosed in EP 1 021 049, in which a 3-dimensional video image is generated from a 2-dimensional video input. The known device and method use a foreground/background discriminating circuit which discriminates on the basis of focus information extracted from the 2-dimensional video input. A parallax control signal is output on the basis of edge detection, wherein sharp edges are placed in the foreground of the 3D image.
Although the known method and device are relatively simple, it has been found that the rendered 3D images are occasionally confusing, i.e. the depth of vision (the 3D effect) is difficult to distinguish.
It is an object of the invention to improve 3D image rendering based on a 2D image input.
To this end, the method in accordance with the invention is characterized in that, on the basis of focus characteristics, the image is divided into two or more regions; it is determined to which region of the image an edge separating two regions belongs; a depth order is established between the regions following the rule that a region comprising an edge is closer to the viewer than an adjacent region; and 3D depth information is assigned to the regions in accordance with the established depth order.
In the prior art method of EP 1 021 049, edge detection is also performed and sharp edges are placed in the foreground. This scheme, however, sometimes produces confusing results: when the background happens to be in focus and the foreground out of focus, parts of the image that are in reality in the foreground are given background parallax and vice versa. The parallax information then cues the viewer that certain parts of the 3D image are in the foreground and others in the background, while the actual content of the image provides the completely opposite cue, i.e. what is foreground according to the parallax cue is background according to the actual content.
The 3D sensation is then confusing at best and often lost, especially since the depth cue given by the known method is usually limited. It is assumed that the human brain is capable of reconstructing a stereoscopic sense from even an imperfect depth cue. In the prior art method and device, however, the depth cues are sometimes at odds with each other, and may even change from scene to scene: in one scene the depth cues may be correct, followed by a sudden shift to conflicting depth cues wherein a foreground figure hides behind a background tree. The depth sensation is then lost, or at least a very annoying conflict between depth cues is perceived by the viewer.
The method in accordance with the invention solves or at least reduces this problem. The image is divided into regions on the basis of focus information, for instance the blur radius: the pixels or blocks of the image are clustered into a number of regions having a similar focus characteristic, e.g. the average blur per block, so that each region has an averaged focusing characteristic. It is then determined to which region an edge separating two regions belongs. This may be done, for example, by comparing the sharpness (blur) of a detected edge to the average blur of the regions bordering either side of the edge: a blurred edge belongs to a bordering region having a high blur, whereas a sharp edge belongs to a region having a low blur. A depth ordering is performed on the regions, following the rule that a region comprising an edge is closer to the viewer than the adjacent region, and 3D information is assigned to the regions in accordance with the depth ordering. The various regions of the image thus form depth layers. Dividing the image into regions is performed by clustering pixels or blocks into regions. Although this clustering could be done on a pixel-by-pixel basis, more robust results are obtained when, prior to division of the image into regions, a focusing characteristic is determined per block of pixels and the blocks are clustered into regions. Blocks are small parts of the image of n×m pixels, usually square (n=m), where n and m are typically 2, 4, 8 or 16.
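By way of illustration, the aggregation from pixels to blocks might look as follows in Python/NumPy. This is a minimal sketch assuming a per-pixel blur-radius map `sigma_map` has already been estimated; all names are illustrative rather than taken from the patent.

```python
import numpy as np

def block_blur_map(sigma_map: np.ndarray, block: int = 8) -> np.ndarray:
    """Aggregate a per-pixel blur-radius map into per-block medians.

    The median is used because it is more robust to outlier pixel
    estimates than the mean; the text allows either statistic.
    """
    h, w = sigma_map.shape
    h_b, w_b = h // block, w // block
    # Crop to a whole number of blocks, then view as a grid of tiles.
    cropped = sigma_map[:h_b * block, :w_b * block]
    tiles = cropped.reshape(h_b, block, w_b, block).swapaxes(1, 2)
    return np.median(tiles, axis=(2, 3))
```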
The advantage of the method over the known method is clear for, e.g., an image in which a person is seated partially behind a flower arrangement, where the person is in focus and the flower arrangement is not. Using the known method, the person, being in focus and thus having sharp image edges, is given a parallax so that it seems to be in the foreground, and the image portion depicting the flower arrangement, having blurred edges, is given a parallax corresponding with the background. This conflicts with the true situation, since the person is partially behind the flower arrangement and not the other way around. The known method and device thus confront the viewer with two conflicting, in fact irreconcilable, depth cues: the parallax depth cue, putting the person in the foreground in front of the flower arrangement, contradicts the image information depth cue, which shows the person seated behind the flower arrangement.
The method in accordance with the invention does not provide conflicting depth cues. The image is divided into at least two regions, for instance an in-focus region comprising the person and an out-of-focus region comprising the flower arrangement. The edges separating the region comprising the flower arrangement from the region comprising the person are formed by the blurred edges of the flower arrangement. Thus the region comprising the flower arrangement is placed in the foreground, in accordance with the rule that a region comprising the edge separating two regions is closer to the viewer than the other region. Out-of-focus foreground regions, which are bounded by blurred edges, are placed in front of in-focus background regions. Thus, if there are two regions, an out-of-focus foreground flower arrangement in front of an in-focus person, the correct parallax is assigned to both regions. If there are three regions, an out-of-focus foreground flower arrangement, an in-focus person and an out-of-focus background, the correct 3D information is provided for all three. It is emphasized that, in this example, the method in accordance with the invention provides results that go against the very core of the teaching of EP 1 021 049, which dictates that depth ordering is done by placing sharp edges in the foreground.
Preferably the 3D depth information is assigned in dependence on the focusing characteristics of the regions. The average focusing characteristic provides a clue as to the difference in depth between the regions; this can be used to improve the 3D effect.
In preferred embodiments the number of regions is two or three. Clustering the pixels or blocks of the image into two or three regions has proven to give good results while requiring only limited computing power. Almost all images have an in-focus part and an out-of-focus part, the out-of-focus part sometimes being foreground and sometimes background, so two regions often suffice. Occasionally the out-of-focus part comprises a foreground part and a background part, for instance a foreground tree and a background forest with an intermediate in-focus region, in which case three regions usually suffice.
In a preferred embodiment a statistical distribution is made of focusing characteristics of pixels or blocks of the image and the number of regions is determined in dependence on the statistical distribution.
It is found that the focusing characteristics, such as the blur radius, often cluster around a limited number of peaks: one corresponding to a small blur radius, i.e. in focus or nearly in focus, and another or others at larger blur radii, corresponding to out-of-focus parts of the image. Using these statistical data allows a quick determination of the number of regions into which the image can be divided.
The image display device in accordance with the invention comprises means for performing the method steps in accordance with the invention.
The invention is also embodied in a transmitter comprising means for performing the method steps in accordance with the invention.
These and other objects of the invention will be apparent from and elucidated with reference to the embodiments described hereinafter.
In the drawings:
FIG. 1 illustrates the thin lens model;
FIGS. 2A-2C illustrate a possible method for determining blur radius;
FIG. 3 illustrates the relation between blur radius and focal plane;
FIG. 4 illustrates a statistical distribution of blur radii;
FIGS. 5A and 5B illustrate a method of determining regions;
FIG. 6 illustrates a method for deciding to which regions an edge belongs;
FIG. 7 illustrates a method in accordance with the invention;
FIG. 8 illustrates a display device in accordance with the invention; and
FIG. 9 illustrates a transmitter in accordance with the invention.
The figures are not drawn to scale. Generally, identical components are denoted by the same reference numerals in the figures.
In a simple optical system, like a convex thin lens, objects at a particular distance from the lens are clearly depicted (in focus) on the image plane, while objects at other distances are imaged blurred (defocused) in proportion to their distance from the plane of focus. The latter situation for a point source is depicted in FIG. 1.
The blur behavior is according to the thin lens formula:
$\frac{1}{f} = \frac{1}{u} + \frac{1}{v}$  (1)
in which f represents the focal length of the lens, u is the object distance and v is the image distance. From the geometric relations in FIG. 1 and the lens formula, the formula for the distance u can be derived:
$u = \frac{fs}{s - f - k\sigma f}$ if $u > u_0$  (2)

$u = \frac{fs}{s - f + k\sigma f}$ if $u < u_0$  (3)
wherein u0 denotes the distance for which points are in focus, s is the image plane to lens distance, and k is a constant determined by the characteristics of the lens system. The parameters f, s and k are camera parameters, which can be determined from camera calibration. Estimating the distance u of an object thus involves determining the camera parameters and estimating the blur radius σ. There is thus a relation between the blurriness of an image, i.e. a focus characteristic, and the distance.
For 2D-to-3D conversion, disparity (inverse depth) is a more relevant quantity than depth itself, as for instance the parallax for rendered views is linear in disparity. Using the above expressions, a relation can be found between the blur radius σ and the disparity difference between points in focus and out of focus:
$\left| \frac{1}{u} - \frac{1}{u_0} \right| = \frac{k\sigma}{s}$  (4)
In other words, the disparity difference to the focal plane is proportional to the blur radius. Moreover, as the amount of disparity for rendered views can usually be changed to accommodate the preference of the user and/or the capabilities of the display, accurate determination of the camera-related constant k/s is not necessary; all that is needed is a determination of the blur radius σ, i.e. of a focus characteristic. In the following description, the blur radius is taken as the focus characteristic for the simple reason that there is a simple relation between distance and blur radius. However, although determining the blur radius as the focus characteristic is preferred, other measures of blurriness could also be determined within the concept of the invention.
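As a concrete illustration of equations (2) to (4), the following Python sketch computes the two candidate object distances and the disparity offset for a given blur radius. The camera parameters f, s and k are assumed known from calibration, and the reconstruction of the formulas above (including the sign split between the two branches) follows the surrounding text, so treat the exact algebra as an assumption.

```python
def depth_candidates(sigma: float, f: float, s: float, k: float):
    """Candidate object distances u for a blur radius sigma (eqs. 2-3).

    A single blur radius yields two candidates: one behind the focal
    plane (u > u0) and one in front of it (u < u0); a single image
    cannot tell the two apart.
    """
    u_far = f * s / (s - f - k * sigma * f)   # branch for u > u0 (eq. 2)
    u_near = f * s / (s - f + k * sigma * f)  # branch for u < u0 (eq. 3)
    return u_near, u_far

def disparity_offset(sigma: float, k: float, s: float) -> float:
    """|1/u - 1/u0| = k * sigma / s (eq. 4); the sign of the offset
    must come from the depth ordering, not from the blur radius."""
    return k * sigma / s
```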
FIGS. 2A-2C schematically illustrate a possible method for determining the blur radius σ. In FIG. 2A a blurred edge with a blur radius σ is schematically shown; the horizontal axis denotes position, the vertical axis luminance. In FIG. 2B a filtering function is shown which is the second derivative of a Gaussian filter with width s. Convolution of the edge of FIG. 2A with the filter of FIG. 2B provides a function having two peaks (FIG. 2C). The distance d_h between the peaks can be measured reasonably accurately, and the relation between the blur radius σ, filter width s and peak distance d_h is as follows:
$\sigma^2 = (d_h/2)^2 - s^2$  (5)
This exemplary algorithm is robust, and the results obtained for various types of content were good. Various filter widths s are taken, and for each pixel and each filter width a value for the blur radius σ is found. Taking an average or median value of σ per pixel, and then determining an average or median value of σ over a block, wherein more pronounced edges (which have a larger height in FIG. 2C) are given a larger weight, proved to give robust results. A reasonably good distinction in the determined values of σ between the in-focus and out-of-focus regions is found.
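A minimal sketch of this estimator for a single 1D luminance profile crossing an edge, using SciPy's Gaussian derivative filter; the multi-width averaging and edge-height weighting described above are omitted for brevity, and the function names are illustrative.

```python
import numpy as np
from scipy.ndimage import gaussian_filter1d

def blur_radius_from_profile(profile: np.ndarray, s: float) -> float:
    """Estimate the blur radius of an edge from a 1D luminance profile.

    Filtering with the second derivative of a Gaussian of width s turns
    a blurred step edge into one positive and one negative peak; the
    peak distance d_h gives the blur radius via eq. (5):
        sigma^2 = (d_h / 2)^2 - s^2
    """
    response = gaussian_filter1d(profile.astype(float), sigma=s, order=2)
    d_h = abs(int(np.argmax(response)) - int(np.argmin(response)))
    sigma_sq = (d_h / 2.0) ** 2 - s ** 2
    return float(np.sqrt(sigma_sq)) if sigma_sq > 0 else 0.0
```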
The relation between u and the blur radius σ is schematically shown in FIG. 3 and follows from equation (4).
If the parameters k and s are known from calibration, a true estimate of the absolute distance to the focal plane can be made once the blur radius σ is known. Since this does not reveal whether a blurred object is in front of the focal plane or behind it, at least two images at different focal distances would also be needed for true depth estimation from the blur radius σ. However, neither of these requirements is usually met for arbitrary externally supplied image data such as video. A good distinction can nevertheless be made between out-of-focus regions and in-focus regions of the image and, if there are more regions, between the various regions.
Since the formula for the disparity difference relates the absolute value of the disparity difference to the blur radius, the equation has two separate solutions. Hence determination of two different values of the blur radius σ does not by itself enable depth ordering, as the same value of σ may result from an object closer to the viewer or further away. This is schematically shown in FIG. 4 for two different values of the blur radius (σ1 and σ2); in principle four different combinations of image planes are possible.
FIG. 4 shows a typical distribution of blur radii for blocks within an image, wherein the horizontal axis denotes the percentage of blocks with a certain blur radius. Two modes centered on peaks at values σ1 and σ2 can clearly be distinguished, corresponding in this example to the in-focus and out-of-focus parts of the image. Such a distribution alone, however, does not enable an accurate depth ordering, for two reasons. First, as explained in relation to FIG. 3, there is ambiguity as to the actual relative position of the image planes corresponding to the peaks, since more than one solution is possible. Secondly, the peaks in the σ distribution are quite broad. This indicates that the actual blur values have a high numerical uncertainty and may not be suitable for deriving depth ordering information, as the blur radius differences within each mode (the spread of the peaks), e.g. within the out-of-focus region, may exceed the blur radius differences between modes. Hence using only the actual numerical values of the blur radius to decide on the depth ordering of each block introduces a large amount of noise.
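A sketch of how such a distribution could be used to choose the number of regions, assuming a per-block blur map as input; the bin count and the minimum peak share are illustrative parameters, not values from the patent.

```python
import numpy as np

def count_blur_modes(block_sigmas: np.ndarray, bins: int = 32,
                     min_share: float = 0.05) -> int:
    """Count peaks in the distribution of per-block blur radii.

    A histogram bin counts as a peak if it holds at least min_share of
    all blocks and exceeds both neighbours; the number of peaks suggests
    how many regions (depth layers) to cluster the image into.
    """
    hist, _ = np.histogram(block_sigmas.ravel(), bins=bins)
    share = hist / max(hist.sum(), 1)
    peaks = 0
    for i in range(1, bins - 1):
        if share[i] >= min_share and share[i - 1] < share[i] >= share[i + 1]:
            peaks += 1
    return max(peaks, 1)
```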
To nevertheless obtain a reliable depth ordering, the method and device in accordance with the invention execute two steps.
In a first step the pixels or blocks of the image are clustered on the basis of their focusing characteristic, thereby forming regions within the image. Within the broadest scope of the invention pixels could also be clustered; however, the spread in the values of σ for pixels is even larger than for blocks. More robust results are obtained when, prior to clustering, a focusing characteristic (in the examples given, an average or median value of the blur radius σ) is assigned on a block basis and the blocks are clustered into regions on the basis of the block values of σ. To each region an average or median blur radius is assigned. Clustering may be done in various manners.
A simple iterative clustering algorithm may be used which always divides the image into two or more clusters, starting from a heuristic initial clustering. The decision whether there are one, two or more clusters is then based on the similarity of the characteristics of the clusters.
FIGS. 5A and 5B illustrate such a method, wherein it is assumed that there are two large regions: one in focus and more or less in the middle, surrounded by an out-of-focus region. The initial clustering consists of assigning the blocks on the left, top and right border (say ¼ of the image) to the 'background' cluster C2, and the other blocks to the 'foreground' cluster C1 (see FIG. 5A). This choice originates from the selection of blocks for background motion model estimation: heuristically, one expects that the object of interest (usually the foreground) is somewhere in the center of the image, and that the borders of the image do not contain objects of interest. For background motion model estimation it is assumed that the object of interest in the center is the foreground; it is, however, not necessary to make such an assumption in the clustering stage. It has been observed that most of the time the center cluster is in focus.
As the initial clustering is rather coarse and based on heuristics, a robust method to arrive at initial estimates of the blur radii for each cluster is as follows.
A number of feature points (in our case 28), regularly distributed inside the clusters, is selected. The initial blur radius value σ1 (respectively σ2) of a cluster is the median of the blur radii σ of the feature points inside that cluster.
Then an iterative procedure is carried out to refine this clustering:
Step 1: Reassign the blocks. A sweep is made over the image, and each block B on a cluster boundary is assigned to the cluster to whose mean focus estimate it has the smallest deviation:
$B \to C_1$ if $|\sigma_B - \sigma_1| < |\sigma_B - \sigma_2|$, else $B \to C_2$  (6)
Step 2: Update the values of σ1 and σ2. Blocks have been reassigned to clusters C1 and C2, so new average or median cluster blur radii σ1 and σ2 are computed for each of the two (or more, if there are more) clusters.
Step 3: Iterate. A new sweep is made, see step 1.
This process converges after a few (typically 4) iterations.
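The refinement loop can be sketched as follows, with two simplifications relative to the text above: every block is reassigned on each sweep (not only blocks on the current cluster boundary), and the cluster medians are taken over whole clusters instead of over 28 regularly distributed feature points.

```python
import numpy as np

def cluster_blocks(sigma_blocks: np.ndarray, iters: int = 4):
    """Cluster a per-block blur-radius map into two focus regions.

    Returns a label map (1 = center cluster C1, 2 = border cluster C2)
    plus the final median blur radii sigma1 and sigma2.
    """
    h, w = sigma_blocks.shape
    labels = np.ones((h, w), dtype=int)   # 'foreground' cluster C1
    m = max(min(h, w) // 4, 1)            # border width, say 1/4 of the image
    labels[:m, :] = 2                     # top border   -> C2 (FIG. 5A)
    labels[:, :m] = 2                     # left border  -> C2
    labels[:, -m:] = 2                    # right border -> C2
    s1 = s2 = 0.0
    for _ in range(iters):                # typically converges in ~4 sweeps
        s1 = float(np.median(sigma_blocks[labels == 1]))
        s2 = float(np.median(sigma_blocks[labels == 2]))
        labels = np.where(np.abs(sigma_blocks - s1) < np.abs(sigma_blocks - s2), 1, 2)
        if len(np.unique(labels)) < 2:    # degenerate case: one cluster left
            break
    return labels, s1, s2
```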
FIG. 5B shows the result of such an iteration: two regions are formed, a foreground region C1 with a median blur radius σ1, and a background region C2 with a median blur radius σ2.
Typically this method provides two regions: an out-of-focus region and an in-focus region. These regions need not be connected; e.g. the in-focus region may comprise two separate sub-regions, as may the out-of-focus region. When the statistics show evidence of three regions, i.e. three peaks in the σ distribution, it is possible to start with three regions. An initial clustering may also be found by determining the peaks in the σ diagram and simply assigning each block to the peak with the best matching σ.
Once the image is divided into regions C1, C2, C3, etc., it is possible to assign a region blur radius σi to each of the regions. The next step in the method and device in accordance with the invention is to determine the mutual position of the regions, i.e. which region is in front of which region: a decision on depth ordering has to be made. To do so, use is made of the principle that an edge belongs to the foremost object. FIG. 6 schematically illustrates a method for determining from this principle which edge belongs to which region. FIG. 6 shows along the horizontal axis a position parameter, such as the x or y coordinate, or a coordinate perpendicular to a transition between two regions; vertically the blur radius is shown. FIG. 6 schematically shows the transition between an in-focus region with a low value of the blur radius σ and an out-of-focus region with a high value of σ. The width W schematically illustrates the blurriness of the edge: an out-of-focus edge will have a larger width W than an in-focus edge. This is shown schematically in the top part of FIG. 6, having a small W and thus a sharp transition, and in the bottom part, showing a large width W and thus a blurred transition. In the top part, the edge separating the regions C1 and C2 belongs to the region C1 with low blur radius σ1; region C1 is thus foreground, indicated in the figure by C1(F), and region C2 is background, indicated by C2(B). In the bottom part, the width W is large and the edge separating the regions C1 and C2 belongs to the region C2 with high blur radius σ2; the 'blurred' region C2 is thus foreground, indicated in FIG. 6 by C2(F), and region C1 is background, indicated by C1(B). By taking various measurement points along lines perpendicular to the transition lines between the regions, and taking an average, or deciding for each measurement point to which region the edge seems to belong and then voting between the different measurements, it is easily found whether the edge belongs to the in-focus region, in which case the in-focus region lies in front of the out-of-focus region, or to the out-of-focus region, in which case the out-of-focus region lies in front of the in-focus region. To put it differently, the width W depends only on the σ of one of the regions, and not, or at least hardly, on the σ of the other region. This characteristic can be used to determine to which of two regions the edge separating them belongs.
This is one example of a method for establishing to which region an edge belongs.
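A sketch of this voting scheme, assuming profiles of the blur estimate (or luminance) have already been sampled along lines perpendicular to the region boundary; the 10%-90% width measure and the direct comparison of W against the region blur radii (assumed to be in the same pixel units) are illustrative simplifications.

```python
import numpy as np

def edge_owner(profiles, sigma1: float, sigma2: float) -> int:
    """Vote on which region owns the edge between clusters C1 and C2.

    Each profile is a 1D array sampled perpendicular to the boundary.
    Its 10%-90% transition width W is compared with the widths expected
    from the two region blur radii; the closer match gets the vote.
    Returns the owning, i.e. foreground, cluster: 1 or 2.
    """
    votes = {1: 0, 2: 0}
    for p in profiles:
        p = np.asarray(p, dtype=float)
        p = (p - p.min()) / max(p.max() - p.min(), 1e-9)  # normalise to 0..1
        if p[0] > p[-1]:
            p = p[::-1]                                   # make profile rising
        lo = int(np.argmax(p >= 0.1))                     # first sample >= 10%
        hi = int(np.argmax(p >= 0.9))                     # first sample >= 90%
        width = max(hi - lo, 1)                           # transition width W
        votes[1 if abs(width - sigma1) <= abs(width - sigma2) else 2] += 1
    return 1 if votes[1] >= votes[2] else 2
```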
A different method is, for instance, to segment the image, i.e. to find luminance or color edges in the image near the transitions between the regions, and compare them to the edges between the regions as they follow from the preceding clustering step.
Using luminance segmentation, different methods may be used to find which edge belongs to which region. One way is to look at the orientation of luminance edges in the various regions near the transition between the regions. The luminance edge corresponding to the transition between regions is determined solely by the foreground image, and the edge or edges belonging to the foreground image often follow the transition, i.e. they are parallel to it. Luminance edges in the background tend to have no relation to the transition.
Yet another method is the following: the image is segmented based on focus, as explained above, and luminance edges are found near the transitions between the regions. By determining the edge between regions in two different ways, by luminance segmentation and by clustering on the basis of blur radius, it may be established to which region an edge belongs. Ideally the two determinations would completely coincide, but this is not the case. It has been found that clustering of blocks tends, on average, to extend the region to which an edge belongs slightly beyond the luminance edge, because the whole edge, or at least a major part of it, is assigned the blur radius of the foreground object to which the edge belongs. There is thus a slight bias in clustering which extends a clustered region to include the edge belonging to that cluster. This bias does not occur when edges are determined solely from differences in luminance, because luminance segmentation draws the transition between the regions in the middle of the edge separating them. There is thus a small difference in the determined position of the edge: the clustering method based on blur radius determination described above tends to overextend the border of the clustered foreground region so as to include the edge belonging to that region, whereas no such tendency exists for edges determined solely by luminance segmentation. To put it differently: luminance segmentation puts the edge exactly in the middle of the luminance transition, whereas clustering segmentation overestimates the size of the foreground region. This effect is also called morphological dilation: the clustering slightly dilates, i.e. increases in size, the form of the foreground object, drawing foreground object edges into the foreground cluster. This seemingly negative effect can be put to good use by comparing the edge as determined by luminance segmentation to the same edge as determined by blur radius segmentation, which makes it possible to establish to which region an edge belongs. Blur radius determination, or more generally determination of focus characteristics, may be done using alternative algorithms, as may the clustering. Depending on the algorithms used, the foreground region so determined will overextend or underextend with respect to the edge determined by luminance segmentation. In both cases it is possible to determine to which region an edge belongs by comparing the regions determined by luminance segmentation to the regions determined by determination and clustering of focusing characteristics.
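This comparison can be reduced to a very small test, sketched below under the assumption that the clustering overextends (dilates) the foreground region over the edge, as described above; with an algorithm that underextends instead, the test simply flips.

```python
import numpy as np

def foreground_from_dilation(labels: np.ndarray, edge_mask: np.ndarray) -> int:
    """Pick the foreground cluster by exploiting the dilation bias.

    labels    - per-pixel cluster map (1 or 2) from blur-based clustering,
                upsampled from blocks to pixels
    edge_mask - boolean mask marking the luminance edge between regions

    Because the clustering tends to dilate the foreground region over
    the whole edge, luminance-edge pixels predominantly carry the
    foreground label.
    """
    edge_labels = labels[edge_mask]
    in_c1 = int(np.count_nonzero(edge_labels == 1))
    in_c2 = int(np.count_nonzero(edge_labels == 2))
    return 1 if in_c1 > in_c2 else 2
```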
Depth ordering can be done simply on the basis of which region is foreground and which region is background, i.e. a fixed difference in parallax can be used to distinguish the foreground and background regions, or the foremost, intermediate and background regions, independent of the actual values σi.
Preferably, however, the blur radius estimates for the regions are converted into a depth or inverse depth value. Given the depth ordering and the σ values, the disparity of blurred objects may be taken as the disparity of the in-focus objects, i.e. of the region with the lowest σ, plus a constant times the difference in blur radius between foreground and background:
$\frac{1}{u} = \frac{1}{u_0} + K\Delta\sigma$
wherein Δσ is the difference in σ, K is a constant and u0 is the distance to the focus plane. If the σ of the in-focus region is very small, Δσ equals the σ of the out-of-focus region. The cluster with the lowest blur value is assigned the depth u0; all other clusters are assigned a depth value based on their depth ordering with respect to the cluster with the lowest blur radius value. In the case of only two clusters, in-focus and out-of-focus, K is positive if the foreground is in focus and negative if the out-of-focus region is the foreground.
For single-image blur radius estimation the constants u0 and K cannot be recovered; for this, multiple images with different focal settings would be needed. However, if the depth map is only used for rendering, it is most of the time translated and scaled anyhow to match the capabilities of the screen and the preferences of the user. For an autostereoscopic display device, u0 may for instance be chosen such that the in-focus region is rendered in the plane of the screen, to obtain a maximally sharp image. The out-of-focus region can then be rendered behind or in front of the screen, depending on the depth ordering.
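A sketch of this depth assignment; the per-region sign is a small generalization of the two-cluster rule in the text, and the default values of u0 and K are arbitrary placeholders, since the text notes they are display and user preferences rather than recoverable quantities.

```python
def assign_inverse_depths(region_sigmas: dict, foreground: int,
                          u0: float = 1.0, K: float = 0.1) -> dict:
    """Assign an inverse depth 1/u = 1/u0 + K * delta_sigma per region.

    The region with the smallest blur radius is placed at the screen
    plane u0; the sign given to K moves a blurred foreground region in
    front of the screen and blurred background regions behind it.
    """
    in_focus = min(region_sigmas, key=region_sigmas.get)
    sigma0 = region_sigmas[in_focus]
    inv_depth = {}
    for region, sigma in region_sigmas.items():
        sign = 1.0 if region == foreground else -1.0
        inv_depth[region] = 1.0 / u0 + sign * K * (sigma - sigma0)
    return inv_depth
```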
FIG. 7 shows a method in accordance with the invention. From an input 2D signal, image blocks are formed in step 2; block focus characteristics, for instance the block blur radius σB, are determined in step 3; and the blocks are clustered into two or more clusters in step 4. In step 6 the relation between edges and regions is determined. This may be done directly from the focus characteristics (see FIG. 6), or, in parallel, the image may be luminance segmented and the image edges obtained by luminance segmentation (step 5) compared in step 6 to the edges determined by clustering, wherein comparing the results leads to a determination of which edge belongs to which region and thereby of which regions are positioned in front of which regions, i.e. the depth ordering of the regions (step 7). In accordance with a preferred embodiment, the depth is determined from the focus characteristics (step 8), which in the examples given is the blur radius, and the resulting 3D output signal is provided (step 9).
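Wiring the earlier sketches together gives an end-to-end outline of FIG. 7. Here `estimate_pixel_blur` and `sample_boundary_profiles` are hypothetical placeholders (a per-pixel application of `blur_radius_from_profile`, and extraction of profiles perpendicular to the C1/C2 boundary, respectively); everything else uses the helper sketches above.

```python
import numpy as np

def convert_2d_to_3d(image: np.ndarray, block: int = 8):
    """End-to-end sketch of the FIG. 7 pipeline (illustrative only)."""
    sigma_map = estimate_pixel_blur(image)                 # steps 2-3 (placeholder)
    sigma_blocks = block_blur_map(sigma_map, block=block)
    labels, s1, s2 = cluster_blocks(sigma_blocks)          # step 4
    # Steps 5-7: depth ordering; edge_owner() is used here, but
    # foreground_from_dilation() could serve equally well.
    profiles = sample_boundary_profiles(sigma_map, labels) # placeholder
    fg = edge_owner(profiles, s1, s2)
    inv_depth = assign_inverse_depths({1: s1, 2: s2}, fg)  # step 8
    depth_map = np.where(labels == 1, inv_depth[1], inv_depth[2])
    return image, depth_map                                # step 9: 2D + depth
```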
FIG. 8 shows an image device in accordance with the invention. The image device has means for performing all the steps of the method: an input 1 for receiving a 2D input signal, a former 2 for forming image blocks, a computer 3 for computing block focus characteristics, a clusterer 4 for clustering image regions on the basis of focus, an image edge detector 5, an edge-region relationship determinator 6, a depth orderer 7 and a depth information assigner 8. It furthermore comprises an output 9 for outputting a 3D signal to a 3D display screen 10. Such a display device may for instance be an autostereoscopic display device.
FIG. 9 shows a transmitter in accordance with the invention. The difference from FIG. 8 is that the display screen itself is not an integral part of the device. Such a transmitter may for instance read DVDs carrying a 2D signal and convert the 2D signal into a 3D signal for use in a separately sold 3D display device. It may also be a device which produces a DVD holding 3D signals from a DVD holding a 2D signal; the 3D signals may thus be provided to a DVD burner, or for instance sent to another location. 3D image signals comprising information on the division of the image into regions and the depth order of the regions, and, in preferred embodiments, also the focus characteristics of the various regions, likewise form embodiments of the invention. The information may be given in a header in the signal, which header specifies which blocks belong to which regions, or the dividing lines between the regions, the order of the regions and preferably also the focusing characteristics of the regions, preferably the region blur radii. A 3D signal made with the prior art methods and devices does not comprise such information. A 3D signal in accordance with the invention could for instance be generated as follows: a customer has a 3D display device but a normal 2D digital camera; the user sends a 2D home video or digital image to an internet site; the original 2D signal is converted into a 3D signal, which is sent back to the user, who can display the video or image on his 3D display.
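The description above specifies what such a signal must carry (region membership, depth order, optionally per-region focus characteristics) but no concrete layout; the following dataclass is therefore only an illustrative header, and every field name is an assumption.

```python
from dataclasses import dataclass, field

@dataclass
class Region3DHeader:
    """Illustrative header for a 3D image signal per the description."""
    block_size: int                       # e.g. 8 for 8x8-pixel blocks
    block_to_region: list                 # region id per block, row-major
    depth_order: list                     # region ids, foremost first
    region_blur_radii: dict = field(default_factory=dict)  # optional sigmas
```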
In short the invention can be described as follows:
2D image data are converted into 3D image data. The image is divided, on the basis of focusing characteristics, into two or more regions, and it is determined to which region an edge separating two regions belongs. The regions are depth ordered in accordance with the rule that a region comprising an edge is closer to the viewer than an adjacent region, and 3D depth information is assigned to the regions in accordance with the established depth order of the regions. Preferably a depth is assigned to each of the regions in dependence on an average or median focusing characteristic of the region.
The invention is also embodied in any computer program product for a method or device in accordance with the invention. A computer program product should be understood to mean any physical realization of a collection of commands enabling a processor (generic or special purpose), after a series of loading steps (which may include intermediate conversion steps, such as translation to an intermediate language, and a final processor language), to get the commands into the processor and to execute any of the characteristic functions of the invention. In particular, the computer program product may be realized as data on a carrier such as a disk or tape, data present in a memory, data traveling over a network connection (wired or wireless), or program code on paper. Apart from program code, characteristic data required for the program may also be embodied as a computer program product.
Some of the steps required for the working of the method may be already present in the functionality of the processor instead of described in the computer program product, such as data input and output steps.
It should be noted that the above-mentioned embodiments illustrate rather than limit the invention, and that those skilled in the art will be able to design many alternative embodiments without departing from the scope of the appended claims.
In the claims, any reference signs placed between parentheses shall not be construed as limiting the claim.
The word "comprising" does not exclude the presence of elements or steps other than those listed in a claim. The invention can be implemented by means of hardware comprising several distinct elements, and by means of a suitably programmed computer. In a device claim enumerating several means, several of these means can be embodied by one and the same item of hardware. The invention may be implemented by any combination of features of the various preferred embodiments described above. In particular, any embodiment shown or claimed in relation to an encoding method or encoder has, unless otherwise indicated or impossible, a corresponding embodiment for a decoding method or decoder, and such decoding methods and decoders are embodiments of the invention and are claimed herewith.

Claims (12)

1. A method of converting image display data wherein 2D image data are converted into 3D image data comprising the 2D image data and depth information, the method comprising:
extracting focus information (σ) from the 2D image data,
generating at least some of the 3D image data using the extracted focus information,
dividing the 2D image into two or more regions (C1, C2) based on the extracted focus information such that pixels or blocks of the 2D image are clustered into regions, the regions having a respective focusing characteristic (σ1, σ2),
the method characterized in that it further comprises:
determining a luminance edge near a transition between two regions of the two or more regions;
establishing a depth order between the two regions following the rule that a region comprising the luminance edge is closer to the viewer than an adjacent region and
assigning depth information to the two regions in accordance with the established depth order of the two regions.
US12/095,183 | Priority: 2005-12-02 | Filed: 2006-11-27 | Stereoscopic image display method and apparatus, method for generating 3D image data from a 2D image data input and an apparatus for generating 3D image data from a 2D image data input | Expired - Fee Related | US8325220B2 (en)

Applications Claiming Priority (4)

Application Number | Priority Date | Filing Date | Title
EP05111623.4 | 2005-12-02 | |
EP05111623 | 2005-12-02 | |
PCT/IB2006/054458 (WO2007063478A2) | 2005-12-02 | 2006-11-27 | Stereoscopic image display method and apparatus, method for generating 3d image data from a 2d image data input and an apparatus for generating 3d image data from a 2d image data input

Publications (2)

Publication Number | Publication Date
US20080303894A1 (en) | 2008-12-11
US8325220B2 (en) | 2012-12-04

Family

ID=38057450

Family Applications (1)

Application Number | Title | Priority Date | Filing Date
US12/095,183 (US8325220B2, Expired - Fee Related) | Stereoscopic image display method and apparatus, method for generating 3D image data from a 2D image data input and an apparatus for generating 3D image data from a 2D image data input | 2005-12-02 | 2006-11-27

Country Status (8)

Country | Link
US (1) | US8325220B2 (en)
EP (1) | EP1958149B1 (en)
JP (1) | JP5073670B2 (en)
KR (1) | KR101370356B1 (en)
CN (1) | CN101322155B (en)
AT (1) | ATE542194T1 (en)
RU (1) | RU2411690C2 (en)
WO (1) | WO2007063478A2 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
US20120218382A1 (en)* | 2010-08-02 | 2012-08-30 | Ron Zass | Multiclass clustering with side information from multiple sources and the application of converting 2d video to 3d
US20150138319A1 (en)* | 2011-08-25 | 2015-05-21 | Panasonic Intellectual Property Corporation Of America | Image processor, 3d image capture device, image processing method, and image processing program
US11533464B2 (en) | 2018-08-21 | 2022-12-20 | Samsung Electronics Co., Ltd. | Method for synthesizing intermediate view of light field, system for synthesizing intermediate view of light field, and method for compressing light field

Families Citing this family (59)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
US8330801B2 (en)* | 2006-12-22 | 2012-12-11 | Qualcomm Incorporated | Complexity-adaptive 2D-to-3D video sequence conversion
WO2008118113A1 (en)* | 2007-03-23 | 2008-10-02 | Thomson Licensing | System and method for region classification of 2d images for 2d-to-3d conversion
KR101545008B1 (en)* | 2007-06-26 | 2015-08-18 | Koninklijke Philips N.V. | Method and system for encoding a 3d video signal, enclosed 3d video signal, method and system for decoder for a 3d video signal
DE102007058779B4 (en)* | 2007-12-06 | 2021-01-14 | Robert Bosch GmbH | Device of a motor vehicle for generating an image suitable for image analysis
AU2009210672B2 (en)* | 2008-02-08 | 2013-09-19 | Google Llc | Panoramic camera with multiple image sensors using timed shutters
CN102132573B (en)* | 2008-08-26 | 2013-10-23 | Koninklijke Philips Electronics N.V. | Method and system for encoding 3d video signal, encoder for encoding 3-d video signal, encoded 3d video signal, method and system for decoding 3d video signal, decoder for decoding 3d video signal
US8345956B2 (en)* | 2008-11-03 | 2013-01-01 | Microsoft Corporation | Converting 2D video into stereo video
JP2010128450A (en)* | 2008-12-01 | 2010-06-10 | Nippon Telegraph & Telephone Corp. (NTT) | Three-dimensional display object, three-dimensional image forming apparatus, method and program for forming three-dimensional image
CN101751664B (en)* | 2008-12-02 | 2013-04-17 | Himax Technologies, Inc. | Stereo Depth Information Generation System and Method
KR20100080704A (en)* | 2009-01-02 | 2010-07-12 | Samsung Electronics Co., Ltd. | Method and apparatus for obtaining image data
JP4903240B2 (en)* | 2009-03-31 | 2012-03-28 | Sharp Corporation | Video processing apparatus, video processing method, and computer program
US9124874B2 (en) | 2009-06-05 | 2015-09-01 | Qualcomm Incorporated | Encoding of three-dimensional conversion information with two-dimensional video sequence
JP5369952B2 (en)* | 2009-07-10 | 2013-12-18 | Sony Corporation | Information processing apparatus and information processing method
US9083958B2 (en)* | 2009-08-06 | 2015-07-14 | Qualcomm Incorporated | Transforming video data in accordance with three dimensional input formats
US8878912B2 (en)* | 2009-08-06 | 2014-11-04 | Qualcomm Incorporated | Encapsulating three-dimensional video data in accordance with transport protocols
US8629899B2 (en)* | 2009-08-06 | 2014-01-14 | Qualcomm Incorporated | Transforming video data in accordance with human visual system feedback metrics
US8254760B2 (en) | 2009-08-28 | 2012-08-28 | Apple Inc. | Pixel analysis and frame alignment for background frames
KR101082545B1 (en) | 2010-01-28 | 2011-11-10 | Pantech Co., Ltd. | Mobile communication terminal had a function of transformation for a picture
WO2011097306A1 (en)* | 2010-02-04 | 2011-08-11 | Sony Corporation | 2d to 3d image conversion based on image content
KR101690297B1 (en)* | 2010-04-12 | 2016-12-28 | Samsung Display Co., Ltd. | Image converting device and three dimensional image display device including the same
KR101674568B1 (en)* | 2010-04-12 | 2016-11-10 | Samsung Display Co., Ltd. | Image converting device and three dimensional image display device including the same
KR20120005328A (en) | 2010-07-08 | 2012-01-16 | Samsung Electronics Co., Ltd. | Stereoscopic glasses and display device including the same
US20130113795A1 (en)* | 2010-07-26 | 2013-05-09 | City University Of Hong Kong | Method for generating multi-view images from a single image
KR20120023268A (en)* | 2010-09-01 | 2012-03-13 | Samsung Electronics Co., Ltd. | Display apparatus and image generating method thereof
US9165367B2 (en)* | 2010-09-02 | 2015-10-20 | Samsung Electronics Co., Ltd. | Depth estimation system for two-dimensional images and method of operation thereof
KR101638919B1 (en)* | 2010-09-08 | 2016-07-12 | LG Electronics Inc. | Mobile terminal and method for controlling the same
US9305398B2 (en) | 2010-10-08 | 2016-04-05 | City University Of Hong Kong | Methods for creating and displaying two and three dimensional images on a digital canvas
TWI532009B (en)* | 2010-10-14 | 2016-05-01 | Altek Corporation | Method and apparatus for generating image with shallow depth of field
JP2012100116A (en)* | 2010-11-02 | 2012-05-24 | Sony Corp | Display processing device, display processing method, and program
KR20120059367A (en)* | 2010-11-30 | 2012-06-08 | Samsung Electronics Co., Ltd. | Apparatus for processing image based on energy value, and methods thereof
KR101188105B1 (en)* | 2011-02-11 | 2012-10-09 | Thinkware Corporation | Apparatus and method for providing argumented reality using image information
KR101685418B1 (en) | 2011-04-27 | 2016-12-12 | Hanwha Techwin Co., Ltd. | Monitoring system for generating 3-dimensional picture
JP5868026B2 (en)* | 2011-05-24 | 2016-02-24 | Toshiba Corporation | Ultrasonic diagnostic equipment
CN102857772B (en)* | 2011-06-29 | 2015-11-11 | MStar Software R&D (Shenzhen) Ltd. | Image treatment method and image processor
WO2013009099A2 (en) | 2011-07-12 | 2013-01-17 | Samsung Electronics Co., Ltd. | Device and method for blur processing
US8749548B2 (en)* | 2011-09-01 | 2014-06-10 | Samsung Electronics Co., Ltd. | Display system with image conversion mechanism and method of operation thereof
CN102426693B (en)* | 2011-10-28 | 2013-09-11 | IRICO Group Corporation | Method for converting 2D into 3D based on gradient edge detection algorithm
CN104054044A (en)* | 2011-11-21 | 2014-09-17 | Nikon Corporation | Display device and display control program
JP2013172190A (en)* | 2012-02-17 | 2013-09-02 | Sony Corp | Image processing device and image processing method and program
US9286658B2 (en)* | 2012-03-22 | 2016-03-15 | Qualcomm Incorporated | Image enhancement
KR20130127867A (en)* | 2012-05-15 | 2013-11-25 | Samsung Electronics Co., Ltd. | Stereo vision apparatus and control method thereof
ES2533051T3 (en)* | 2012-06-27 | 2015-04-07 | Vestel Elektronik Sanayi Ve Ticaret A.S. | Procedure and device to determine a depth image
SG11201508332YA (en)* | 2013-04-09 | 2015-11-27 | Bitanimate Inc | Two-dimensional video to three-dimensional video conversion method and system
JP2015149547A (en)* | 2014-02-05 | 2015-08-20 | Sony Corporation | Image processing method, image processing apparatus, and electronic apparatus
US9807372B2 (en)* | 2014-02-12 | 2017-10-31 | Htc Corporation | Focused image generation single depth information from multiple images from multiple sensors
JP6603983B2 (en)* | 2014-09-22 | 2019-11-13 | Casio Computer Co., Ltd. | Image processing apparatus, method, and program
CN104301706B (en)* | 2014-10-11 | 2017-03-15 | 成都斯斐德科技有限公司 | A kind of synthetic method for strengthening bore hole stereoscopic display effect
CN104796684A (en)* | 2015-03-24 | 2015-07-22 | 深圳市广之爱文化传播有限公司 | Naked eye 3D (three-dimensional) video processing method
WO2017048927A1 (en)* | 2015-09-18 | 2017-03-23 | The Regents Of The University Of California | Cameras and depth estimation of images acquired in a distorting medium
EP3185209B1 (en)* | 2015-12-23 | 2019-02-27 | STMicroelectronics (Research & Development) Limited | Depth maps generated from a single sensor
CN105701823A (en)* | 2016-01-14 | 2016-06-22 | 无锡北邮感知技术产业研究院有限公司 | Method of using occlusion relation to recover depth order
KR101825218B1 (en)* | 2016-04-08 | 2018-02-02 | Korea Advanced Institute of Science and Technology (KAIST) | Apparatus and method for generaing depth information
CN105957053B (en)* | 2016-04-19 | 2019-01-01 | Shenzhen Skyworth-RGB Electronics Co., Ltd. | Two dimensional image depth of field generation method and device
FR3074385B1 (en) | 2017-11-28 | 2020-07-03 | Stmicroelectronics (Crolles 2) Sas | SWITCHES AND PHOTONIC INTERCONNECTION NETWORK INTEGRATED IN AN OPTOELECTRONIC CHIP
KR101921608B1 (en) | 2018-01-29 | 2018-11-26 | Korea Advanced Institute of Science and Technology (KAIST) | Apparatus and method for generating depth information
JP7137313B2 (en) | 2018-02-15 | 2022-09-14 | Canon Inc. | Output device, image processing method and program
US10972714B2 (en)* | 2018-02-15 | 2021-04-06 | Canon Kabushiki Kaisha | Image processing apparatus, image processing method and storage medium for storing program
US10523922B2 (en)* | 2018-04-06 | 2019-12-31 | Zspace, Inc. | Identifying replacement 3D images for 2D images via ranking criteria
US11941782B2 (en)* | 2020-06-16 | 2024-03-26 | Adobe Inc. | GPU-based lens blur rendering using depth maps

Citations (7)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
EP1021049A2 (en) | 1999-01-14 | 2000-07-19 | Sony Corporation | Stereoscopic video display method and apparatus
US20020118275A1 (en)* | 2000-08-04 | 2002-08-29 | Harman Philip Victor | Image conversion and encoding technique
US6477267B1 (en)* | 1995-12-22 | 2002-11-05 | Dynamic Digital Depth Research Pty Ltd. | Image conversion and encoding techniques
WO2004061765A2 (en) | 2003-01-06 | 2004-07-22 | Koninklijke Philips Electronics N.V. | Method and apparatus for depth ordering of digital images
US20050031166A1 (en)* | 2003-05-29 | 2005-02-10 | Kikuo Fujimura | Visual tracking using depth data
WO2005013623A1 (en) | 2003-08-05 | 2005-02-10 | Koninklijke Philips Electronics N.V. | Multi-view image generation
US20070024614A1 (en)* | 2005-07-26 | 2007-02-01 | Tam Wa J | Generating a depth map from a two-dimensional source image for stereoscopic and multiview imaging

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
US5790086A (en)* | 1995-01-04 | 1998-08-04 | Visualabs Inc. | 3-D imaging system
GB9511519D0 (en)* | 1995-06-07 | 1995-08-02 | Richmond Holographic Res | Autostereoscopic display with enlargeable image volume
GB9616262D0 (en)* | 1996-08-02 | 1996-09-11 | Philips Electronics Nv | Post-processing generation of focus/defocus effects for computer graphics images
JP3500056B2 (en)* | 1997-11-10 | 2004-02-23 | Sanyo Electric Co., Ltd. | Apparatus and method for converting 2D image to 3D image
JP3639108B2 (en)* | 1998-03-31 | 2005-04-20 | Sony Computer Entertainment Inc. | Drawing apparatus, drawing method, and providing medium
CA2418013A1 (en)* | 2003-02-06 | 2004-08-06 | Peter Brown Horsley | Three-dimensional color television system adapted for depth separation by image focal length
CN100353760C (en)* | 2004-09-10 | 2007-12-05 | Zhang Baoan | Combined wide-screen television system

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
US6477267B1 (en)* | 1995-12-22 | 2002-11-05 | Dynamic Digital Depth Research Pty Ltd. | Image conversion and encoding techniques
US7999844B2 (en)* | 1995-12-22 | 2011-08-16 | Dynamic Digital Depth Research Pty Ltd. | Image conversion and encoding techniques
EP1021049A2 (en) | 1999-01-14 | 2000-07-19 | Sony Corporation | Stereoscopic video display method and apparatus
US20020118275A1 (en)* | 2000-08-04 | 2002-08-29 | Harman Philip Victor | Image conversion and encoding technique
WO2004061765A2 (en) | 2003-01-06 | 2004-07-22 | Koninklijke Philips Electronics N.V. | Method and apparatus for depth ordering of digital images
US20050031166A1 (en)* | 2003-05-29 | 2005-02-10 | Kikuo Fujimura | Visual tracking using depth data
WO2005013623A1 (en) | 2003-08-05 | 2005-02-10 | Koninklijke Philips Electronics N.V. | Multi-view image generation
US7764827B2 (en)* | 2003-08-05 | 2010-07-27 | Koninklijke Philips Electronics N.V. | Multi-view image generation
US20070024614A1 (en)* | 2005-07-26 | 2007-02-01 | Tam Wa J | Generating a depth map from a two-dimensional source image for stereoscopic and multiview imaging

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Elder et al: "Local Scale Control for Edge Detection and Blur Estimation"; IEEE Transactions on Pattern Analysis and Machine Intelligence, IEEE Service Center, Los Alamitos, CA, US, vol. 20, No. 7, Jul. 1998, pp. 699-716.

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
US20120218382A1 (en)* | 2010-08-02 | 2012-08-30 | Ron Zass | Multiclass clustering with side information from multiple sources and the application of converting 2d video to 3d
US20150138319A1 (en)* | 2011-08-25 | 2015-05-21 | Panasonic Intellectual Property Corporation Of America | Image processor, 3d image capture device, image processing method, and image processing program
US9438890B2 (en)* | 2011-08-25 | 2016-09-06 | Panasonic Intellectual Property Corporation Of America | Image processor, 3D image capture device, image processing method, and image processing program
US11533464B2 (en) | 2018-08-21 | 2022-12-20 | Samsung Electronics Co., Ltd. | Method for synthesizing intermediate view of light field, system for synthesizing intermediate view of light field, and method for compressing light field

Also Published As

Publication number | Publication date
CN101322155B | 2013-03-27
KR101370356B1 | 2014-03-05
CN101322155A | 2008-12-10
WO2007063478A3 | 2007-10-11
RU2008126927A | 2010-01-10
EP1958149B1 | 2012-01-18
JP5073670B2 | 2012-11-14
KR20080077391A | 2008-08-22
WO2007063478A2 | 2007-06-07
ATE542194T1 | 2012-02-15
US20080303894A1 | 2008-12-11
JP2009517951A | 2009-04-30
EP1958149A2 | 2008-08-20
RU2411690C2 | 2011-02-10

Similar Documents

Publication | Title
US8325220B2 (en) | Stereoscopic image display method and apparatus, method for generating 3D image data from a 2D image data input and an apparatus for generating 3D image data from a 2D image data input
US8411934B2 (en) | System and method for depth map extraction using region-based filtering
JP3862140B2 (en) | Method and apparatus for segmenting a pixelated image, recording medium, program, and image capture device
US8542929B2 (en) | Image processing method and apparatus
US10298905B2 (en) | Method and apparatus for determining a depth map for an angle
KR100953076B1 (en) | Multipoint Matching Method and Device Using Object or Background Separation
KR102464523B1 (en) | Method and apparatus for processing image property maps
KR100745691B1 (en) | Binocular or multi-view stereo matching apparatus and its method using occlusion area detection
CN106851124A (en) | Image processing method, processing unit and electronic installation based on the depth of field
KR100888081B1 (en) | Conversion procedure and device for converting 2D video signal to 3D video signal
JP2012507907A (en) | Method and apparatus for generating a depth map
JP2010510573A (en) | System and method for synthesizing a three-dimensional image
US20140340486A1 (en) | Image processing system, image processing method, and image processing program
US10708505B2 (en) | Image processing apparatus, method, and storage medium
EP4260554B1 (en) | Apparatus and method for processing a depth map
KR101458986B1 (en) | A Real-time Multi-view Image Synthesis Method By Using Kinect
US10074209B2 (en) | Method for processing a current image of an image sequence, and corresponding computer program and processing device
KR20060129371A (en) | Create Depth Map
CN114677393A (en) | Depth image processing method, depth image processing device, image pickup apparatus, conference system, and medium
JP4862004B2 (en) | Depth data generation apparatus, depth data generation method, and program thereof
KR101849696B1 (en) | Method and apparatus for obtaining informaiton of lighting and material in image modeling system
Tian et al. | Upsampling range camera depth maps using high-resolution vision camera and pixel-level confidence classification

Legal Events

Date | Code | Title | Description
AS | Assignment

Owner name: KONINKLIJKE PHILIPS ELECTRONICS N V, NETHERLANDS

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST; ASSIGNORS: ERNST, FABIAN EDGAR; BARENBRUG, BART GERARD BERNARD; REEL/FRAME: 021009/0659

Effective date: 20070802

STCF | Information on status: patent grant

Free format text: PATENTED CASE

FPAY | Fee payment

Year of fee payment: 4

FEPP | Fee payment procedure

Free format text: MAINTENANCE FEE REMINDER MAILED (ORIGINAL EVENT CODE: REM.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

LAPS | Lapse for failure to pay maintenance fees

Free format text: PATENT EXPIRED FOR FAILURE TO PAY MAINTENANCE FEES (ORIGINAL EVENT CODE: EXP.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

STCH | Information on status: patent discontinuation

Free format text: PATENT EXPIRED DUE TO NONPAYMENT OF MAINTENANCE FEES UNDER 37 CFR 1.362

FP | Lapsed due to failure to pay maintenance fee

Effective date: 20201204

