The invention people is that east in Frank A Heng Laite, interior jar (unit of capacitance) Mo Shili, William J Napier, Denier S Charlie Simpkins, Frank's Wrobleski, Kevin Kang Luoyi and the charles WK lattice, denomination of invention are the U.S. Patent application of " utilizing the 3D location to strengthen the method and system of TV applications ".
The application relates on February 14th, 2005 and submits be entitled as " utilizing the free space location to strengthen the method and system (Method and System for Enhancing TelevisionApplication Using Free-Space Pointing) of TV applications " the 60/653rd, No. 229 U.S. Provisional Patent Application, and requiring its right of priority, the content of this application is incorporated this paper by reference into.
Embodiment
The present invention will be described in detail with reference to the accompanying drawings.In different accompanying drawings, same or analogous element is represented with same drawing reference numeral.In addition, following detailed description of the invention is not a limitation of the present invention.On the contrary, scope of the present invention is limited by claims.
For being provided, context so that discuss, at first can realize wherein that with reference to 3 couples in figure an exemplary integratedmedium system 200 of the present invention is described in detail.But those of ordinary skills can predict, and the present invention is not limited in this class media system and realizes, but can comprise more or less assembly therein.In native system, I/O (I/O)bus 210 links together each assembly in the media system 200.210 representatives of this I/O bus are multiple to be used for any one of mechanism that the signal between the media system assembly is transmitted and technology.For example, this I/O bus 210 can comprise the independent audio that is used for transmitting audio signal " plug " cable of right quantity, the concentric cable that is used for transmission video signal, the two-wire system serial transmission line that is used for transmission of control signals or infrared ray or radio-frequency (RF) transceiver, optical fiber or any other transmission mechanism that is used to transmit the signal of other type.
In this exemplary embodiment,media system 200 comprises TV/monitor 212, video cassette register (VCR) 214, digitized video dish (DVD) record/playback apparatus 216, audio/video tuner 218 and small-sizeddish player 220, and these devices all are connected on the I/O bus 210.VCR 214,DVD 216 and small-sizeddish player 220 can be monochromatic light dish or single magnetic tape type equipment, also can be multiple optical disk or many magnetic tape types equipment.They can be unit or be integrated in together independently.In addition,media system 200 also comprises a microphone/speaker system 222,video camera 224 and wireless I/O control device 226.According to an illustrative embodiment of the invention, wireless I/O control device 226 is media system remote control unit of supporting the 3D location, and the button that it has minimal amount navigates to be used for support, and communicates by RF signal and entertainment systems 200.For example, wireless I/O control device 226 can be utilize gyrostat (gyroscope) thereby or other mechanism limit the 3D locating device that screen position and displacement vector are determined required special command.On wireless I/O control device 226, also can comprise and to be enabled in one group of button of " click " primitive (primitive) that describes below, and comprise " retreating " button.In another exemplary embodiment, wireless I/O control device 226 is media system remote control unit that the assembly by IR signal andmedia system 200 communicates.In another embodiment, wireless I/O control device 134 can be similar to exemplary media system telechiric device in appearance, and increasing simultaneously has the characteristic that can allow the user that tracking ball or other navigation mechanism of mouse position are set in the display ofmedia system 100.
Media system 200 also comprises system controller 228.An exemplary embodiment according to the present invention is described, andsystem controller 228 can be operated with storage and show the entertainment system data that can obtain from a plurality of entertainment system datas source, and can control and the relevant various features of each assembly of system.As shown in Figure 3, in case of necessity,system controller 228 can be connected to each assembly of system directly or indirectly by I/O bus 210.In an exemplary embodiment, except that I/O bus 210, perhaps be used for substituting I/O bus 210,system controller 228 is equipped with wireless communication transmitter (or transceiver), and it can communicate by IR signal or RF Signals ﹠ Systems assembly.No matter controlling medium is what, andsystem controller 228 all is configured to control each media component in themedia system 200 by graphic user interface hereinafter described.
Shown in Fig. 3 was further,media system 200 can be set for the media program of reception from a plurality of source of media and service provider.In this exemplary embodiment,media system 200 from following data source any or all reception information and selectively to its transmission information: electrophone 230, satellite broadcasting 232 (for example antenna) via satellite, very high frequency(VHF) of broadcasting and television network 234 (VHF) or ultrahigh frequency (UHF) radio communication (for example by aerial antenna),telephone network 236 and wire line MODEM 238 (perhaps other internet content source).Those skilled in the art will envision that media component shown in Figure 3 and source of media only are examples,media system 200 can comprise these assemblies more or still less.For example, the input for other form of said system comprises AM/FM wireless device and satellite radio device.
Fig. 4 is the block scheme that illustrates according toexemplary system controller 228 of the presentinvention.System controller 228 for example can be embodied as set-top box, and for example compriseprocessor 300,storer 302,display controller 304, miscellaneous equipment controller (for example, relevant device), one or moredata storage device 308 and I/O interface 310 with other assembly of system 200.These assemblies communicate bybus 312 and processor 300.Those skilled in the art will envision that, can use one or more processing units to realize processor 300.Memory device 302 can for example comprise DRAM or SRM, ROM, in these memory devices some can be designated as the cache memory that the data used by software and/or these programs ofprocessor 300 operation are treated in storage, and these softwares and data comprise software and/or the data that are associated with the graphic user interface that is described below.Display controller 304 can be presented at GUI screen and the object that describes below to be communicated with other thing together by the demonstration ofprocessor 300 operations with control monitor 212.The convergent-divergent that scalable GUI according to an illustrative embodiment of the invention provides and resolution is irrelevant, thus makemonitor 212 to show with resolution arbitrarily.Device controller 306 provides other assembly ofmedia system 200 and the interface between the processor 300.Data-carrier store 308 can comprise one or more hard disk drives, floppy disk, CD-ROM equipment or other high capacity memory device.Input/output interface 310 can comprise and one or more in the multiple interfaces comprises for example keyboard interface, RF interface, IR interface and microphone/speech interface.According to exemplified embodiment of the present invention, I/O interface 310 will comprise the interface that is used to receive with the positional information of the motion association of radio positioner.
Graphic user interface is according to an illustrative embodiment of the invention carried out the instruction sequence that is contained in thestorer 302 bysystem controller 228 answer processors and is generated and control, and selects information with the display media item.Above-mentioned instruction can be read into thestorer 302 from other computer readable medium (for example data storage device 308) or from the computing machine that is external in media system 200.Make processor thing together with other inmonitor 212 generate graphical user interface object and control to the execution that is contained in the instruction sequence in the storer 302.In optional embodiment, can use hard-wired circuit to replace or make up and realize the present invention with software instruction.As mentioning in background technology part, traditional interface architecture related with television industry has serious defective aspect the experience ability providing simply for the user and comprehensively select.Therefore, the control architecture of Miao Shuing has overcome these restrictions in this article, and therefore is suitable for using with TV, does not get rid of certainly with other device and uses.In the interface that control architecture, graphic user interface and/or the various algorithm that also should predict the innovation of describing in this article can be applicable to use with computing machine and other non-television equipment.In order to distinguish these various application of illustrative embodiments of the present invention, use term " TV " and " TV " to refer to the subclass of display apparatus in this manual, and term " GUI ", " GUI screen " and " display screen " are common, and refer to television indicator, graphoscope and other any display equipment.More particularly, the intention of term " TV " and " TV " is meant and is need not to use adapter just TV signal (for example can be converted to other form, the subclass that can show the display apparatus of TV signal (for example, NTSC signal, PAL signal or SECAM signal) under the situation computer video form).In addition, term " TV " and " TV " refer to usually the display subclass (for example sofa is to the recreation room TV of family) that can watch from several feet or farther distance, and (for example, the display from the chair to the desktop) that graphoscope is normally closely watched.
Described can be used for realizing according to the control architecture that comprises scalable graphical interfaces of the present invention after, several embodiment at these interfaces are described now.According to an illustrative embodiment of the invention, but but user interface has shown the options of category grouping.The user points to interested one or more kind with remote control unit, and by select button amplifying, or press " retreating " button convergent-divergent and return original state.The action that user's each amplification or convergent-divergent return original state all makes the amplification stage and/or the content that are shown in the selectable program in the screen by user interface change.According to exemplary embodiment, the change of each amplification stage can be consistent, promptly changes amplification stage with predetermined rank.Illustrative embodiments of the present invention also provides to incorporate into has several visualization techniques to realize being amplified to the user interface of significant proportion.These technology comprise building block and can obtain the combination of scalable and wieldy technology (especially adopting user interface to improve user's visual memory with the technology of calling party interface object again apace).
User interface is visual experience to a great extent.In such environment, illustrative embodiments of the present invention has used the user can remember the ability of the object's position in this visual environment.This realizes by (dependable) position stable, that be correlated with is provided for the user interface options.Each object all has corresponding position in scalable layout.In case the user finds interested object, will remember the direction that object is set by the light of nature.If should to as if interested especially, user's this project of accessing more than once then, this can strengthen the user to the memory to the path of this object.User interface according to an illustrative embodiment of the invention provides visual memory system (visual mnemonics), it can help the user remember the position of interested project.This visual memory system comprises mobile (pan) and convergent-divergent animation, the virtual transitions and consistent zoom function (functionality) of looking unfamiliar into the geography sensation of motion by user interface, based on the embodiment that describes below, they will become more apparent together with other thing.
Organization mechanism is provided to make the user to select from extremely many program set, and shielding simultaneously and a large amount of related details of selection group.According to the present invention, can use various types of organization mechanism, provide embodiment below.
At first arrive Fig. 8, wherein described the use of the exemplary control architecture that comprises scalable graphic user interface according to an illustrative embodiment of the invention in demonstration and selection music media program with reference to Fig. 5.In the accompanying drawing of back, also can see different substitute and/or the additional GUI screen that is associated with interface according to the present invention.Fig. 5 has described to be in the scalable GUI of maximum deflated state.Wherein, interface display one group of shape (shape) 500.Being presented in eachshape 500 istext 502 and/orpicture 504, and these parts that are used to describe by GUI can be operated media program selection group.As shown in Figure 5,shape 500 is a rectangle, andtext 502 and/orpicture 504 described medium types.Yet, those skilled in the art will appreciate that these GUI that at first see organize the others that can represent that the available medium of user are selected, for example, artist, produce time, artistical residential area, the length of program or the further feature of selection.And if be used for not necessarily rectangle of shape that in GUI summary describes various groupings.Thetext 500 and/orpicture 504 inshape group 500, or as to the substituting oftext 500 and/orpicture 504, the down-scaled version of album cover (album cover) and other icon can be used for further browsing prompting for the user provides.The background parts of GUI506 can be shown as pure color, maybe can be the part of the picture of mapping graph for example, remembers the locus of kind to help the user, thereby needs less reading just can use this interface afterwards.Select steady arm (vernier) 508 to follow the motion of input equipment, and when the user presses button (not shown in Fig. 5) in this equipment, point out the position that to amplify.
According to an exemplary embodiment of the present invention, input equipment can be a wireless mouse, for example, by Gyration inc. (revolution company limited), (be positioned at the boulevard, Saratoga 12930 in California, Saratoga mansion C, 95070 (12930 Saratoga Avenue, Bldg.C, Saratoga, CA 95070)) wireless mouse made, it and the graphical user interface coupling of supporting location, click, rolling, hover and convergent-divergent building block (will carry out more detailed description below).Can bring a feature of this exemplary input equipment of beneficial effect to be when using together with the present invention, it only has two buttons and a scroll wheel, and three inputs just start object.One in these buttons can be configured to amplify (selection) button, and another button can be configured to dwindle (retreating) button.Compare with traditional remote control control module for example shown in Figure 1, the present invention waits aspect these that simplify GUI that the user faced when selecting his or her media program by the number that significantly reduces button.Additional preferred (but not being essential) feature of input equipment according to an illustrative embodiment of the invention is that they provide the ability of " 3D location 3D locatees " for the user.The phrase that uses in this instructions " 3D location " refers to that user's aerial three-dimensional (or more) space free before display screen moves the ability of input equipment, and user interface directly is converted to these motions the respective capabilities of the motion of vernier on screen.Therefore, " 3D location " is different from traditional computer mouse location technology, traditional computer mouse location technology uses the surface (for example table surface or mouse pad) that is different from display screen as agency (proxy) surface, and mouse moves at the vernier that these surperficial relative motions are converted in the computer display.Use 3D to locate the framework of controlling according to an illustrative embodiment of the invention and further simplified user's selection course, and the chance that posture (gesture) is incorporated into above-mentioned interface as diacritic input is provided simultaneously.Posture can be counted as can time dependently discern motor pattern, and this pattern can be converted into GUI order, for example with x, y, z, yaw (yaw), pitching (pitch) and wave dimension or their functions of the motion of sub-portfolio arbitrarily.Yet those skilled in the art should be realized that the input equipment that is fit to arbitrarily can use with scalable GUI according to the present invention.The embodiment of the input equipment that other is fit to include but not limited to tracking ball, touch pads (touchpad), traditional TV remote control equipment, voice-input device, arbitrarily can be with user's posture transmission/convert the equipment of GUI order to, or their combination in any.Its each aspect that is intended that the GUI function that is described in this article can both use at least one posture and voice command to start in framework according to the present invention.Optionally realize comprising use vernier and/or other teleswitch or or even phonetic entry come the program of identification selection.
If Fig. 6 has shown the figure that the user selects the words of kind 3 (for example, by indisplay 212 thatvernier 508 is mobile onkind 3 region surrounded at straight-flanked ring, and pressing the button) that thekind 3 that is shown is amplified in input equipment from Fig. 5.Interface of the present invention can produce the convergent-divergent from Fig. 5 to Fig. 6, thereby makes the clear convergent-divergent that produced of user.The embodiment of the convergent-divergent/transitions of above-mentioned generation will be described below.Taken screen in themost display 212 in case comprise the shape 516 ofkind 3, above-mentioned interface display in mentioned kind, have an artist of special edition.In this embodiment, seven different artists and/or their works have been shown.The unselected kind 515 adjacent with kind 3 (its in Fig. 5 for dwindling view) is still adjacent with thekind 3 of amplifying on view, but it is in the state of partly being sheared at the edge of display 212.These unselected kinds can be carried out fast browsing to their selection by select finger 508.Yet should be realized that other illustrative embodiments of the present invention can be ignored is sheared adjacent object, only shows the selection that is not sheared on the contrary.In the artist group (for example, the artist organizes 512) each can both comprise the image that dwindles album cover, artistical photo or can be by the artistic work (if above-mentioned classification comprises the playlist that the user sets up) of customization.
The user then can select artist's group to be used for further browsing and/or selecting.Fig. 7 has shown that the response user viavernier 508 being positioned and starting input equipment and selected the view of further amplification of theartist 3, wherein, can see the picture of album cover 520.The same as figure from the GUI screen transformation of Fig. 5 and Fig. 6, non-selected, contiguous artist's (beingartist #2,6 and 7 in this embodiment) is shown as the side in the demonstration that is exaggerated, the user can useselect finger 508 that they are clicked, thereby sees these artistical whole views.Except the image 520 of album cover, in this part of interface, artist information 524 can show as the program in artist's group.These information can comprise for example artistical photo, biography, trival matters, works catalogue, influence, link network address and other relevant data.Each comprised photo in the photo volume picture 520 is conferred titles of nobility on the picture of face, and includes text data alternatively.Comprise under the situation of the list of artists that the user creates that at photo volume picture 520 graphic user interface can show the picture by this interface is selected automatically or the user selects in advance.
At last, when the user selected album cover picture 520 from organizing 521, above-mentioned interface zoomed to album cover as shown in Figure 8.Continuation along with convergent-divergent, this album cover can move back for or become and comprise the like this view of some projects, for example, thelittler version 528 of the artist ofspecial edition 530 and title,song tabulation 532, thefurther information 536 relevant, album cover with special edition and be used for playing back content, modification is classified, is linked to artist's webpage, or find out and thecontrol information 534 of selecting relevant any out of Memory.Adjacent special edition 538 demonstrates those and can selectsteady arm 508 to select by using, so that their content of interface display.That is as above mentioned is such, and the selectable embodiment of the present invention can for example be amplified to and only show selecteed object (for example, special edition 5), and ignores the cutting out section of non-selected object (for example,special edition 4 and 6).This final convergent-divergent provides an embodiment of semantic property convergent-divergent (semantic zooming), and wherein sightless some GUI element is revealed in the zoom level formerly.Variously be used to realize that technology according to an illustrative embodiment of the invention is described below.
As shown in Fig. 5-8 and the instructions, the graphic user interface of this illustrative embodiments provides browsing music collections.Can also be used for the video collection according to interface of the present invention, for example be used for medium, video request program, video segment and the home theater of DVD, VHS band, other record.Other audio frequency uses and comprises browsing radio programming, teaching tape, the historical accounts of past events and sound montage.Can also use the present invention to organize and visit printing or text media, for example News Stories and e-book.
According to the description of front, those skilled in the art will appreciate that scalable graphic user interface according to the present invention provides fast and easily browsed the ability of a large amount of (or less) media item for the user.This ability is attributable to the numerous characteristics at the interface of illustrative embodiments of the present invention, these characteristics include but not limited to: (1) uses image as all or part of selection information that is used for the specific media item, (2) use convergent-divergent to provide fast and select required many as far as possible or the least possible information with the user, and (3) use several GUI technology, they are combined and all are positioned at conplane sensation to all interfaces of user, thereby can finish the navigation of GUI, and be remembered by user's sense of orientation.Can various GUI screens " reasoningly " be coupled together together with other thing according to the back one side of GUI of the present invention, it keeps realizing continuously from a GUI screen to next GUI screen by making GUI object as much as possible, for example, by showing edge adjacent, that be surrounded on the non-selected object at current GUI screen edge.Alternatively, if expectation obtains view clearly and other GUI technology provides ground rational feedback, then can ignore the object that is sheared.As using in this article, phrase " GUI screen " refers to be presented at one time one group of GUI object in one or more display units.The GUI screen can be presented in the same display of output medium item, and perhaps it can be present in the different displays.Display can be the TV display, computer monitor or other suitable GUI output device arbitrarily.
Another improves the user and feel that the GUI screen is that internuncial GUI effect is when carrying out convergent-divergent or mobile animation (panning animation) effect that produces when the contiguous object that the user selects with the zoom level of current selecteed object equates.Turn back to embodiment shown in Figure 5, when the user watched this GUI screen at first, his or her observation point concentrated on a little near 550.Yet when he or she selected kind 3 to amplify, his or her observation point transferred to a little 552.According to an illustrative embodiment of the invention, processing and amplifying is energized so that the POV center is transferred to a little 552 from putting 550.This mobile animation can be used to change various GUI, for example from the change of zoom level, or an object is changed to the change of another object in the same GUI amplification stage.Therefore, if (for example) is arranged in the non-selected kind 515 (kind 2) that the user of the GUI screen of Fig. 6 selects the leftmost side, the animation that then will be moved, this will be to the user visually with impression left or west " moving ".It is consistent sensation that illustrative embodiments of the present invention uses above-mentioned technology that the directed movement between the GUI screen is provided, thereby makes the user browse GUI between the amplification stage and between the media item at identical amplification stage more apace.
Can use various data structures and algorithm to realize according to scalable GUI of the present invention.For example, be used for that image viewer in display photos moves and the data structure of convergent-divergent and the paper of algorithm below for example in be described: " Quantum Treemaps andBubblemaps for a Zoomable Image Browser ", Benjamin B.Bederson, UIST2001, ACM Symposium on User Interface Software and Technology, CHI Letters, 3 (2), (Benjamin B compares gloomyly to be entitled as " the quantum number figure and the bubbling figure that are used for scalable image viewer " pp.71-80, UIST2001, the ACM forum of user interface software and technology, the CHI journal, 3 (2), the 71-80 page or leaf), it is merged in this paper as a reference.Yet, that medium are selected in order to be provided for, can in a large amount of application, switch with high grade, and can make the user control selected image to carry out the GUI of various medium selection functions, need additional structure and algorithm with lower grade.
Scalable GUI can be conceptualized as and can support to move and the convergent-divergent animation around the scene of the user interface components in the vision area of display device.In order to finish this effect, scalable GUI according to an illustrative embodiment of the invention can use scene graph data structure (scenegraph data structure) to realize.Each node in scene graph is represented some part of user interface components, for example button or text label or a class boundary face assembly.The child node presentation graphic element (line, text, image etc.) of intra-node.For example, an application in scene graph can be represented as the node of each feeder node with the graphic element that is used for its interface.The node of two kinds of specific types is called as camera (camera) and layering (layer) in this article.Camera is the node that vision area is provided to the other parts of scene graph by being conceived to hierarchical node.Below these hierarchical nodes, be user interface element.The steering logic that is used for scalable interface is adjusted the variation of camera sight line by programming, moves or zooming effect to provide.
Fig. 9 has shown the scene graph that comprises scalable substantially interface element, and these elements can be used for realizing illustrative embodiments of the present invention, and specifically, this scene graph has comprised acamera node 900 and a hierarchical node 902.Dot-and-dash line betweencamera node 900 and thehierarchical node 902 represents thatcamera node 900 is configured to the child node ofhierarchical node 902 is drawn (render) in the vision area of camera.Thedisplay apparatus 904 that is coupled makes the user see the vision area of camera.Hierarchical node has threechild nodes 904 describing circle and a pair of ellipse.This scene graph also further points out, by the node 912-918 rectangle that in above-mentioned circle, drawn, and at rectangle three triangles that drawn.Scene graph is gone into other scene graph of other data structure by contact by root node 920.Among the node 906-918 each has by using transformation of local coordinates's matrix to make it carry out the ability of proportional zoom and location with respect to its father node.Figure 10 (a) and 10 (b) show respectively by camera and dwindle with first amplification stage and the appearance of the scene graph that amplifies with second amplification stage.
Drawing scene graph can be achieved as follows.When display 904 needed to upgrade, for example the user had triggered the amplification from the view of Figure 10 (a) to the view of Figure 10 (b), and the incident that then repaints is called the camera node 900 that is attached to display 904 with drafting camera node 900 self.Make camera node 900 notice hierarchical nodes 902 draw the interior zone of camera vision area so conversely.Hierarchical node 902 is drawn these child nodes self by the child node of notifying it and itself is drawn, or the like.The current transformation matrix and the bounding rectangle that are used for the above-mentioned zone renewal are transmitted (pass) in each step, and obtain alternatively revising to notify each node proper proportion that they should be used for drawing and skew.Because the applied scene figure that operates in scalable GUI according to the present invention can comprise thousands of nodes, so the zone that each node can detect transformation matrix and will be updated really can be seen by the user with the drafting operation of guaranteeing them.Although previously described embodiment has described the scene graph that comprises a camera node and a hierarchical node, should predict, illustrative embodiments of the present invention can embed a plurality of cameras and layering.These cameras that are embedded into can provide the user node element for the user, for example indicate the little figure that dwindles in the whole scalable interface current sight line of user zone, and can allow user interface components convergent-divergent and moving independently.
When coordinating the operation of use (for example as the exemplary cinematic browser that will describe with reference to Figure 14-18 below) at the scalable interface of use, the storage and the resource requirement that are used for each application may be above the total effectively storages at media system more.Just hint need be with some or all codes and the data unloading of these application when the user no longer sees them for this.Yet, in scalable GUI according to the present invention, provide such outward appearance to expect, that is, make some or all of application always look like activation for the user.In order to satisfy the target of above-mentioned two antagonism, be the state that the application of " curtain outer (off-screen) " can enter into interim hang-up for user's sight line.In order in scalable GUI according to the present invention, to realize incident being sent to this function application when enter and withdraw from vision area to indicate them.A mode that realizes above-mentioned incident is to increase logic in the code of render component, when enters vision area thereby detect the user.Yet this notification logic of this meaning person is called in each drafting incident, and the more important thing is that it can not easily detect the user and when browse vision area away from assembly.The method that another kind is used for incident is sent to application is notification logic to be incorporated into GUI browse element (for example hyperlink and button), thereby make to change the camera vision area so that the interested assembly of institute is included, notice is sent to said modules at them.Yet this needs the programmer to come watchfully (vigilantly) that information code is increased in all possible UI of the browsing element.
According to an illustrative embodiments, can use counting yield node monitoring algorithms to notify and use the GUI assembly and/or be applied in when enter and withdraw from the camera vision area.On high grade, this node monitoring algorithms has three main levels of handling: (1) initialization, (2) vision area changes assessment, and (3) scene graph changes assessment.Initialisation stage is calculated by vision area and is changed the node amount that evaluation stage is used, and the correct data structure of initialization.Vision area changes evaluation stage and is called when vision area changes, and notifies all to enter or withdraw from the monitor node of vision area.At last, scene graph changes evaluation stage and is updated in the calculating that initial level produces, and these calculate owing to the variation in scene graph becomes invalid.For example, if the ancestor node of monitor node (ancestor node) has changed the position in scene graph, then the calculating of carrying out in initial level need be recomputated.
In these levels, vision area changes the remainder of evaluation stage drive node monitoring algorithms.Enter and withdraw from vision area when in order to describe node, initialization step is determined the border rectangle of expectation node, and it is converted to the local coordinate system of vision area from its local coordinate system.In this manner, detection node does not enter and need carry out a series of coordinate conversion in each vision area change place.Because the father node of this node can have transition matrix, so this initial step need travel through scene graph up to camera node from this node.As described below, if in the scene graph data structure, use the camera node that embeds, then need a plurality of borders rectangle to be contained in the node that a plurality of positions occur.
Calculate the border rectangle that is used for each monitor node in the vision area coordinate system after, initialisation stage is increased to vision area with boundary matrix and changes in the assessment data structure.Above-mentioned node monitoring algorithms uses basic building block to be used for each dimension of screen.In the scalable interface according to some illustrative embodiments, this comprises x dimension, y dimension and ratio dimension.Yet as described below, other illustrative embodiments can have additional or different dimensions.The ratio dimension has been described the amplification stage at the node of vision area, and describes by following formula:
Wherein, s is a ratio, and d is the distance of another point from a point of this node to the local coordinate of this node, and d ' be in this vision area above-mentioned point to the distance of another point.
Figure 11 has shown the exemplary elements piece that the scene that is used to detect a dimension enters and withdraws from.Processing in the x dimension is described below, but those skilled in the art should be realized that the dimension that can handle other in a similarmanner.Region unit 1100 comprises the benchmark (references) that is converted the border rectangular coordinates.This comprises a left side and the right side (top and bottom or minimum and the maximum ratio) side-play amount of rectangle.Left and right side-play amount is stored in respectively inconversion block 1102 and 1104, andconversion block 1102 and 1104 itself places orderly doubly linked list, thereby makes low side-play amount of numbering towards reference position.Current vision area border is stored in sight line border block 1106.Block 1106 has the pointer that sensing just exceeds the conversion block on the left side of view and right side, and for example, the conversion block that is located immediately at the right side of the block that is pointed to by the view left side appears in the view, unless the above-mentioned block that is pointed to by the view left side is pointed to by the vision area right side.
When vision area changes, for each dimension following processing takes place.At first, detect view left side and view right side pointer to confirm whether they need to move to comprise or to get rid of conversion block.Next, if one or two in the above-mentioned pointer need move, then they skip over (slide over) above-mentioned conversion block to arrive their new positions.Then, for each conversion block by view left side and view right side pointer process, above-mentioned node monitoring algorithms is carried out below with the conversion block notice coding of describing.This notice coding is determined the vision area whether its node separately might enter or withdraw from.If have, then this node is added in the aftertreatment tabulation (post processing list).At last, in the ending of this processing that is used for each dimension, whether its vision area state of each node that detects in the above-mentioned aftertreatment tabulation change (becoming again opposite then with change earlier) has taken place truly.If change has really taken place, then above-mentioned algorithm transmission incident is to said modules.Should be noted that if vision area jumps to the new region at scalable interface fast, then above-mentioned algorithm can detect more puppets and enter and withdraw from incident.
Conversion block notice coding can be realized as look-up table, and whether this look-up table determines to move in and out for the node of this dimension detected.Shown exemplary above-mentioned table below.
| The node side | The view side | The view moving direction | The local notice that intersects | Intersect notice fully |
| A left side | A left side | A left side | Do not have | Enter |
| A left side | A left side | Right | Do not have | Withdraw from |
| Right | A left side | A left side | Enter | Do not have |
| Right | A left side | Right | Withdraw from | Do not have |
| A left side | Right | A left side | Withdraw from | Do not have |
| A left side | Right | Right | Enter | Do not have |
| Right | Right | A left side | Do not have | Withdraw from |
| Right | Right | Right | Do not have | Enter |
Table 1-conversion notification list
1st, 2 and 3 hurdles are inputs of conversion notification list.Whether specifically, the node monitoring algorithms uses the combination of node side, view side and view moving direction to address this table, entered, withdraw from or not influence with the node of determining this assessment.The node side that conversion block characterized thathurdle 1 refers to by vision areapointer process.Hurdle 2 refers to the vision area side, andhurdle 3 refers to the direction of this vision area side shifting when vision area is passed through the conversion block of node.But whetherOutput bar 4 or 5 should notifiedly be selected according to this node in above-mentioned node section or complete apparent time.For example, in some implementations, ideal situation is only just to notify this application after visual fully using (for example video flowing window), and this is that to be loaded into because of the video window that part is visual visually be fissile among the scalable GUI.
When the output of this table was represented to enter and withdrawed from, this node monitoring algorithms was added this node to the aftertreatment tabulation.Output bar in the table 1 is filled (populate) based on following rule.If this node intersects in all dimensions, then in above-mentioned post-processing step, send and enter notice.If this node in view and present one or more dimension stop to intersect, then will send OPT-OUT NOTICE.In order to reduce the node number in the aftertreatment tabulation, the conversion block notice is coded in this node added to and detects before the tabulation and the intersecting of other dimension.When only having one or two dimension to intersect in this dimension (for example, three or more), removed post-processing step from all numbers.When user interface object (for example, use) was wanted notified its vision area state in GUI, it registered a function (function) with above-mentioned node monitoring algorithms.When above-mentioned application entered or withdraws from view, the node monitoring algorithms was called the function that this application is registered with the parameter of indication what's going on.As a kind of selection, can use the message transmission to come exercise notice.In this case, each application has event queue.Above-mentioned application tells how the node monitoring algorithms communicates by letter with its event queue.For example, it can point out the address of formation.Then, when the node supervision detected conversion, its was set up and comprises the data structure of notifying cause, and is placed in the formation of this application.
Except use is used for the node monitor for notifications that application memory is managed, this algorithm can also be used for other function according to scalable GUI of the present invention.For example, the node monitoring algorithms can be used for changing the application behavior based on the focus of User, for example switches in the current visual application by audio frequency being exported focus.What other was used for the node monitoring algorithms is applied in amplification stage when changing, and will be written into or unload higher resolution, and image is synthesized.This draws object less, the approaching more coupling display of resolution by making drawing implementer (renderer), has reduced its computational load.Except the vision area that makes node monitoring algorithms monitoring camera, make it monitor that it also is useful browsing coding (navigation code), this is browsed coding and tells the vision area position that this vision area finished after the animation manufacturing.This earlier notification component they will appear in the view, and can make scalable GUI according to embodiment of the present invention avoid notice sent to because mobile animation and the node that overflowed.
In order to understand the operation of nodal operation monitoring algorithms better, with reference to Figure 12 (a), 12 (b), 13 (a) and 13 (b) embodiment is described below.Figure 12 (a) and 12 (b) show the part of graphic user interface two different amplification stages, scalable.Low amplification stage place at Figure 12 (a) can see three nodes: circle, triangle and ellipse.In Figure 12 (b), this view is exaggerated, thereby makes ellipse and circle only partly as seen, and triangle is invisible fully.These nodes can for example characterize application or the user interface components that depends on the validity event notice, and are therefore followed the tracks of by node monitoring algorithms according to an illustrative embodiment of the invention.In this embodiment, clearly show the border rectangle that is used for each node, in GUI, can not show these border rectangles usually although those skilled in the art should be realized that at Figure 12 (a) and 12 (b).Each limit to each border rectangle in 12 (a) and 12 (b) marks, and these marks will be used for showing the limit of border rectangle and the corresponding relation between the above-described conversion block data structure.
Figure 13 (a) shows the exemplary nodes monitoring data structure of the horizontal dimensions of dwindling view that is used for figure (12).In the figure, each limit of node boundary rectangle uses conversion block to characterize.The order from left to right that the level conversion piece occurs in GUI with them is shown among Figure 13 (a).For example, Yuan Xing left side CA left sideAt first occur, follow leg-of-mutton left side TA left side, or the like, up to the right of ellipse ERightTill the appearance.Mark is come with empty sign conversion block in the two ends of this tabulation.The region unit that in Figure 13 (a), has also shown the respective pointer of the level conversion piece that is used for each node and points to their border rectangles.In the bottom of Figure 13 (a) is view data boundary structure, and it comprises the pointer that sensing just is positioned at the conversion block outside front view.For this view that dwindles, all nodes are visible fully, and therefore their all conversion blocks between the conversion block that points to by view data boundary structure.
Figure 13 (b) has shown the node monitoring data structure of the zoomed-in view that is used for Figure 12 (b).In the figure, the view boundary member of data structure is changed as can be seen, is used for leg-of-mutton the right T thereby make it point toRightAnd oval the right ERightConversion block, this be because above-mentioned two border rectangular edges just in time be positioned at outside current (amplification) view.
After providing above-mentioned example data structure and GUI scene, the relevant treatment when carrying out scale conversion in the node monitoring algorithms is as described below.At first from the left side of view, the node monitoring algorithms moves right view left side pointer, up to arriving the conversion block that just is in outside the left side view.Shown in Figure 13 (b), view left side pointer at first passes through CA left sideConversion block.For this embodiment, suppose that circular node characterizes the application related with scalable GUI or other user interface object (but its incomplete apparent time in view needs notice).After being given to the above-mentioned input of node monitoring algorithms, table 1 illustrates this circular node should receive OPT-OUT NOTICE for horizontal dimensions.Certainly, the node monitoring algorithms was assembled incompatible notice from all dimensions usually before notifying this node, to avoid sending unnecessary OPT-OUT NOTICE.Next, view left side pointer is by leg-of-mutton left side TA left sideIf this triangular nodes is asked notice when it leaves view fully, then this node monitoring algorithms do not need to point out notice in each table 1.Yet, pass through T at the view pointerRightThe time, table 1 illustrates this triangle and withdraws from view fully, and should be notified.The view pointer stops at here, because the right C of circular boundary rectangleRightIn view still as seen.
The node monitoring algorithms is similar from the processing on right side.View right side pointer moves left to oval the right ERightWhether ask all or part of notice according to ellipse, this node monitoring algorithms meeting or can be according to the oval notice that sends of table 1 pair.Can use similar data structure and top and bottom border rectangle value to handle vertical dimensions in a similar manner.Those skilled in the art it should further be appreciated that when the more accurate notice of needs, can use a plurality of borders rectangle to approach the node of non-rectangle.In addition, motion by other dimension be followed the tracks of and be handled to the present invention imagination can by the node monitoring algorithms, for example, the 3rd how much (degree of depth or ratio) dimensions, and non-how much dimensions, for example time, Content Ratings (adult, PG-13 etc.) and content type (drama, comedy etc.).According to the number of the dimension of using, this algorithm is the intersection of detection boundaries section, rectangle and n dimension hypercube more accurately.
Except above-described node monitoring algorithms, illustrative embodiments of the present invention provides the semantic convergent-divergent algorithm of the resolution unanimity that can use in scalable GUI according to an illustrative embodiment of the invention.Semantic convergent-divergent is meant the details that according to the amplification stage of assembly the assembly among the scalable GUI is increased, removes or changes assembly.For example, in the film browser interface that is described below, when the user with camera lens during enough near the image of film, this browser changes to show film metadata and playback controls.The calculating of this amplification stage is based on number of pixels that assembly uses in display apparatus.This scalable GUI can store the thresholding amplification stage, and when this thresholding amplification stage indication will switch, and for example never the view of film metadata and playback controls switches to the view with film metadata and playback controls.
TV and graphoscope have the very big monitor resolution of variation range.Some monitors have sufficiently high resolution, thereby to such an extent as to make and can be presented on the picture in the lower resolution displays and text is too little can not read fully.This also for use semantic convergent-divergent, especially (for example, the application in HDTV) has problems at high resolution display.In this case, the semantic convergent-divergent sign indicating number of drawing based on the number of pixels that shows will change image before can reading more detailed view.The thresholding of revising semantic convergent-divergent change assembly only can play effect to a resolution procedurally.
Ideal results is that all monitor resolution are all carried out consistent semantic convergent-divergent.A kind of solution is to use the display mode of low resolution in high-resolution monitor, so that the resolution in all displays is consistent.Yet if semantic convergent-divergent still carries out as expectation, high-resolution monitor user is preferred coming rendered images with the resolution of they the bests.Therefore, illustrative embodiments of the present invention provides a kind of and supports described different resolution to show and do not exist aforementioned semanteme to watch the semantic zoom technology of problem.This can be for example by realizing at the built-in upright virtual monitor of scene.Compensate monitor resolution by using embedded virtual camera node 1200 and increasing logic, figure 14 illustrates this technology.This virtual camera node 1200 has defined the vision area that its size is mapped to user's line-of-sight distance and monitor dimension.For example, big virtual camera vision area is represented the user or is before very near-earth is sitting in monitor, or having enough big monitor differentiates many details.Alternatively, little vision area is represented the user away from monitor, and needs big font and image.This scalable GUI coding can be based on seeing in above-mentioned virtual camera, and use the semantic scale conversion in the amplification stage of assembly of user best viewed status.
The principalphase machine node 1202 that is attached to displayapparatus 1204 has by configuration so that it shows the vision area of the various things thatvirtual camera 1200 is showing.Because icon and text were not mapped to pixel before thisprincipal phase machine 1202,, quality do not descend so can not taking place from virtual camera.This result is that the monitor display of pinpoint accuracy shows high-quality image, and can not trigger the semantic convergent-divergent change that makes demonstration be difficult to read.
According to an illustrative embodiment of the invention, above-mentioned processing is achieved as follows.Each camera in scene graph has relevant transition matrix (T with node1To Tn).These matrixes are converted to local coordinate system at the next node of display with the local coordinate system of node.In the drawings, T1From its vision area with coordinate conversion to the display coordinate.Equally, T2Its local coordinate system is transformed into the vision area of camera.Ifleaf node 1206 need be drawn some contents in display, the transition matrix below then it calculates:
A=T1T2…Tn
This calculating can be performed by scene graph the time.Support semantic convergent-divergent to be based onvirtual camera 1200 owing to change assembly, so the calculating below carrying out:
B=T4T5…Tn
Usually can determine T in advance with checking scene graph by the resolution of inquiry monitor1To T3Therefore, from A determine B by following formula to these matrix inversions and the realization of multiplying each other:
B=(T1T2T3)-1A
For calculated in advance T1To T3The time in-problem this situation, for example,, logic can be increased to virtual camera, thereby interception (intercept) will be used for being plotted to the transition matrix of display if figure API has hidden additional conversion.These transition matrixes that are blocked are then inverted and multiply each other (as mentioned above), with computing semantic convergent-divergent thresholding.
A favorable characteristics at scalable interface according to an illustrative embodiment of the invention be it have can be when navigation interface the ability of maintaining context.All interface assemblies appear to be present in the scalable world, and the user only needs to move and convergent-divergent reaches arbitrarily them.Above-mentioned semantic zoom technology changes the outward appearance of said modules according to convergent-divergent or amplification stage.Figure 15 (a) and 15 (b) provide the embodiment of the semantic convergent-divergent that is used for assembly, and wherein the version that dwindles of this assembly (Figure 15 (a)) is a picture, and amplified version (Figure 15 (b)) comprises identical picture and some control and details.These more more detailed embodiment are provided below.A challenge relevant with semantic convergent-divergent is can change suddenly between view, and when changing between two above-mentioned views, and for example the α switch technology of mixing can visually not provide satisfied result.
Therefore, illustrative embodiments of the present invention provides the image and the text of some public (common) in all component view, to provide focus to transitions when carrying out semantic convergent-divergent.For example, in Figure 15 (a) and 15 (b), common elements is a picture.Transitions between amplified version and down-scaled version can for example use above-mentioned node monitoring algorithms to trigger in the following manner.At first, when the down-scaled version of assembly is transformed into amplified version, carry out registration with the reception incident in the vision area of principal phase machine with above-mentioned node monitoring algorithms.Then, when incident takes place, can show that displaying is from their down-scaled version position convergent-divergents and the animation that is transformed into the common elements of their amplified version positions.Simultaneously, the camera vision area continues to be advanced to said modules.
After browsing another illustrative embodiments of describing below with reference to Figure 16-20, will become more apparent according to these abilities of graphic user interface of the present invention.In Figure 16-20, open beginning GUI screen 1400 and show a plurality of organization objects of operating as media groups.The media groups that is exemplary home theater, film, TV, physical culture, radio, music and news is fully represented can comprise different, more or less media groups certainly.When of starting by the user in these icons, will show a plurality of images according to the GUI of this illustrative embodiments of the present invention, each in these a plurality of images is grouped into special classification and kind.For example, if the user starts " film " icon among Figure 16, then can show the GUI screen of Figure 17.In Figure 17, shown a large amount of (for example 120 or more) alternatives.These alternatives can be classified as special group, for example, and action, allusion, comedy, drama, domestic play and new issue film.Those of ordinary skill in the art should be realized that the kind that can provide more or less.In this illustrative embodiments, the image of media item can be to select related feature front cover with each film.Can not allow to show the detailed description of many groups relatively of this options image although the block in Figure 17 is too little, but in application, the amplification stage of these images can make the identity by image identifying these films related with film, even if to such an extent as to some or all in these texts can be too little can not easily the readings.
Can place on one group of cine-oriented image with vernier (not shown in Figure 17) and drive input equipment and select indication for of this group provides.In this embodiment, the user selects the drama group, and then graphical interface of user shows the zoom version in the drama image sets, as shown in figure 18.The same with aforementioned embodiments, when the GUI screen of GUI from Figure 17 switches to GUI screen among Figure 18, also can show transitions, for example, before the convergent-divergent or in the middle of the convergent-divergent process, this GUI can move to the center of the GUI screen of view from Figure 17 the center of drama group image.Should be noted that although the zoom version of the drama group of Figure 18 has only shown the subclass of all images in the drama group, this zoom version can comprise all images in selected group alternatively.Show in the amplified version of arbitrarily given GUI screen or do not show that all images in selected group can be based on the number of the media item in (for example) group, and the minimum that is used for the media item of special zoom level expects that amplification stage carries out.Back feature according to GUI of the present invention can pre-determine by system planner/service provider, or customizes via the software setting among the GUI by the user.For example, can be by one among service provider and the terminal user and two number and minimum and/or maximum amplification stage that disposes the media item in the group.Above-mentioned feature makes those users that for example have bad eyesight can increase the amplification stage of the media item that is shown.On the contrary, the extraordinary user of eyesight can reduce amplification stage, and is increased in any given incident and is presented at media item number in the GUI screen, and reduces the browsing time.
An exemplary transitions that can use in graphic user interface according to the present invention is called as " footwear are to details (shoe-to-detail) " view effect.When being activated, this transitions obtains image that dwindles and the image that simultaneously this is dwindled contraction and is transformed into littler view, that is, and and the convergent-divergent that next is more high-grade.The amplification stage that uses from the GUI screen among Figure 17 is transformed into the bigger amplification stage that uses in the GUI screen in Figure 18, and this causes GUI is that the image that is shown in the amplified version of Figure 18 discloses out additional details.Whether this GUI is shown in current selecteed zoom level well according to these details, comes the optionally details of the various zoom level of show or hide.No matter they are different for the visual capabilities camera convergent-divergent how of naked eyes with attempting to offer an explanation details, illustrative embodiments of the present invention has been stipulated configurable zoom level parameter, and these parameters have been stipulated to show all images when and when shown transfer point between the image version with the details of being arrested.This transfer point can be based on the inside resolution that is independent of iamge description, rather than based on the resolution of TV/ monitor 212.Utilize this mode, GUI according to the present invention is consistent, no matter and how little the resolution of the display device that uses in media system be.
In this illustrative embodiments, can be by being provided for the amplification of the additional amount of particular image on the vernier process particular image.This feature shows that in Figure 19 wherein, vernier is at film Apollo Personnel investigation Program 13 " image on roll.Although it is not shown in Figure 19, compare with the respective image that is in low amplification stage in the GUI screen among Figure 18, above-mentioned additional amplification for example can make the quoted passage " Houston, we have run into trouble " in the feature front cover of realizing with the related media item become clear more readable.The user is for the selection of this image, and for example by pressing the button on input equipment, further convergent-divergent is to show the details that shows among Figure 20.This also provides the embodiment of another foregoing semantic convergent-divergent, and this is to be displayed in the GUI screen of Figure 20 because of sightless various information in the GUI of Figure 19 screen and control element.For example, about film Apollo Personnel investigation Program 13 " the information that mainly comprises film projection time, price and performer be shown.The information that those skilled in the art will appreciate that other type can here provide.This GUI additionally comprises the GUI controlling object, and GUI control is used for buying this film to for example comprising, watch movie trailer or turn back to the button control object of last GUI screen (it can also be pressed large buttons at input equipment and realize).Can also use hyperlink to allow the user for example to jump to and the related GUI screen of associated film (at place, the lower right corner of the GUI of Figure 20 screen sign), or with this film in the relevant information of performer.In this embodiment, some or all movie name under title " film catalogue (Filmography) " can be embodied as hyperlink, when the user starts by input equipment, these links will make that GUI is that the film that is instructed to shows and the corresponding GUI screen of GUI screen among Figure 20.
When starting hyperlink, the user can also use transitions.Because hyperlink can produce by high amplification stage, therefore, jump to the media item that is linked simply and can make the user lose he or she selects " mapping graph " to media item tracking.Correspondingly, illustrative embodiments of the present invention provides transitions with when starting hyperlink, helps to keep user's the rational topognosia in ground.An exemplary transitions that uses is the conversion of jumping for this reason.In the starting stage of this transitions, GUI dwindles and the direction of the item that points to hyperlink moves.Dwindle and moved before purpose image and original image are all seen by the user and continue to carry out.Reuse the embodiment of Figure 20, if the user selects hyperlink to be used for " Saving Private Ryan ", then the phase one of the jump effect of this hyperlink will comprise and dwindle and move towards image " Saving Private Ryan ", up to image that is used for " Saving Private Ryan " and Apollo Personnel investigation Program 13 " be visual position for the user.At this, this transitions brings the eye impressions that move up towards the purpose image with arc to the user.In case target image appears in the view, the subordinate phase of transitions is brought to the user and is amplified and for example move to the eye impressions of purpose image with second half of above-mentioned arc.Hopping time promptly shows the time quantum of phase one and subordinate phase, can be to fix between any two hyperlink image term.Alternatively, hopping time can change, for example based on the distance of advancing on GUI.For example, hopping time can be turned to by parameter: hopping time (HopTime)=Alog (magnification ratio grade/jump summit ratio grade)+B (distance between the hyperlink media item)+C, wherein, A, B and C are chosen as steady state value suitably.
The node monitoring algorithms of describing with reference to Fig. 9-13 (b) can also be with changing between the zoom level of helping describe in the exemplary GUI screen of the exemplary GUI screen of Figure 19 and Figure 20 above.Compare with the image of low convergent-divergent grade, comprise the stronger calculating and/or the storage of drafting needs of the GUI screen of text and/or control element (they are invisible in other zoom level of selecteed image).Correspondingly, this node monitoring algorithms can be used in an exemplary embodiment of the present invention embodiment, with the navigation nodes by monitoring GUI discerning scaled media item more apace, thereby help to be written in advance GUI screen, the GUI screen that for example in Figure 20, shows.
Be comprised in the exemplary realization of the present invention is screen position and based on the Navigation Control of semanteme.When the user vernier is positioned near or be in screen in these related zones of control (these controls therein are correct, as shown in figure 21) in the time, these control areas appearance.For example, when playing film, F.F., these so-called skill (trick) functions such as to retreat, suspend, stop be suitable semantically.In this illustrative embodiments, the screen area of distributing to these functions is in the lower right corner, and near vernier is positioned at these zones or when being positioned at wherein, demonstration is used for the icon sets of these skill functions.When the definite functions that is used finishes or vernier when being located in other position in the screen again, these icons then disappear.Can also use same technology to cover other navigation characteristic of selecting as text search and indoor screen.In this exemplary realization, these are controlled on all screens is semantic relevant, and joins with their zone and be positioned at the upper right corner.Vernier be positioned at these zones or these regional near the time, demonstration is used for the icon sets of these Navigation Control.Be activated or vernier when being positioned at other position of screen again in function, these icons then disappear.Should be noted that for the purpose of user's training, related control icon can show temporarily alternatively initially that (for example 5 seconds) are in some or all of screen associated, so that unskilled user notices their existence.
Behind some embodiment that provide according to scalable graphic user interface of the present invention, exemplary architecture and the structure (infrastructure) of using above-mentioned interface described now.Figure 22 provides Organization Chart, wherein, is played the part of by primitive (primitive) 1902 (being called as " atom " in the drawings) with the scalable interface that various senior application 1900 (for example, film, TV, radio and physical culture) are related.In this illustrative embodiments, primitive 1902 comprises location, click, convergent-divergent, hover and rolling, and those skilled in the art should be realized that other primitive also can be included in this group certainly, for example, moves and pulls.As mentioned above, operation location and click primitive, with the position of definite vernier, and when for example user starts the amplification in the handheld input device or dwindles button, trigger event.These primitives have been simplified navigation and have been removed the needs that are used for repetition upper and lower, left and right button.As mentioned above, the convergent-divergent primitive is used for scanning possible selection, and gives user's context when the user dwindles his or her selection.This design makes this interface select and show that arbitrarily size stipulates according to a large amount of medium.The rolling primitive is handled the input from the scroll wheel input equipment in the exemplary handheld input device, and can be used for for example quickening linear menu navigation.Hover button primitive dynamically amplifies the selection (and/or changing the content of selecting) below the pointer, so that the user can browse potential selection and need not actual execution.Can in GUI according to the present invention, start the operation of aforementioned each primitive in a large amount of different modes.For example, location, click, hover, hover and convergent-divergent can be related with the different gestures that the user can carry out.These postures can be passed to system via input equipment (no matter whether it is 3D steady arm, tracking ball, touch pads etc.), and are converted into the startup of suitable primitive.Equally, each primitive can be related with voice command separately.
Have various software and hardware structures 1904 between rudimentary primitive 1902 and senior application 1900, they are used for producing and the scalable related image of GUI according to the present invention.As seeing among Figure 22, this structure 1904 can comprise handheld input device/steady arm, application programming interfaces (API), scalable GUI screen, developing instrument etc.
The illustrative embodiments of front is fully descriptive in itself.Being provided to user's zoom level number and special information and control with each grade changes.Those skilled in the art should be realized that the invention provides revolutionary technology is used to use scalable interface to present the media item of big and group, thereby make that the user can easily search for, browses, tissue and playback media item, for example film and music.Graphic user interface according to the present invention organizing media item in virtual surface is selected, thereby makes similar selection be combined in together.Begin most, the view that dwindles on above-mentioned surface is transmitted at this interface, and under most of situation, is to cannot see actual selection in this grade, and only sees their group name.Along with the inside gradually convergent-divergent of user, show more details about groups of media items or selection.Can obtain different control in each zoom level, thereby make the user can play the group or the single selection of selection or arrive virtual other parts to browse other relevant media item.Convergent-divergent graphic user interface according to an illustrative embodiment of the invention can comprise the image kind that is nested into any degree of depth and the kind of kind.Media item can comprise the broadcasting of content, the broadcasting provider of local storage, via from the direct connection of content supplier or the information that on reciprocity basis, receives.Media item can provide with the form of schedule, and wherein, date information provides at the GUI of some grades.In addition, framework according to an illustrative embodiment of the invention and GUI can also be applied to the television commerce that the item that will be used for selecting therein is sold to the user.
In order to make one's options, traditional TV applications hypothesis user must consciously and wittingly select from one group of screen associated zone or object.Usually, in the use/down/arrow on a left side/right side and make screen give prominence to interesting areas to finish selection.When the user achieved the goal, she or he for example pressed " OK " button and handles.By contrast, illustrative embodiments of the present invention allows the user directly to point to any interesting areas on the screen.This makes that selection is more directly perceived and quick.Allow also to show that the information that strengthens shows as a part of locating, be called " semantic hover " herein.Semantic hover embodiment is in the illustrative embodiments of the present invention, and the feasible extend information about this program of hover is shown on the programm name that TV uses.In addition, also allow interesting areas is taked associative operation.The user just easily points to the screen area that can comprise or not comprise interested concrete object, and the operation that does not all have correlativity everywhere on use and the screen.In application according to the present invention, an embodiment of this operation moves to the left side of screen to use the channel selection characteristic with vernier.The user can use scroll wheel to change channel or only point to slider bar so that corresponding channel to be set then.Use traditional navigation scheme, these all are not easy to realize.
According to an illustrative embodiment of the invention, be to handle shake and unnecessary mobile ability about another unique property that 3D device and screen navigation system are combined.The situation of this and desktop mouse and PC screen is very inequality.Under such situation, because friction and static ability, mouse is in the plane static, and unnecessary moving filtered naturally.Yet, using the 3D device, it is held in user's hand with angle arbitrarily, produces very different controling environment.Algorithm that is associated with the mobile data of handling the 3D device and/or the application of accepting data must be handled shake and unnecessary moving.Another feature of this combination according to the embodiment of the present invention is a precision.Usually, the positioning resolution of desktop mouse is better than the 3D device.This means that desktop mouse can point to the littler screen object that can point to than 3D device.
System has according to an illustrative embodiment of the invention solved whole three problems.With regard to algorithm itself, modeling and processing have been carried out in shake and the mobile of expectation, so that only use moving of expectation.In addition, only because the user presses the button of 3D locating device or rolls that its scroll wheel produces, the mobile of 3D device is modeled and compensated.In addition, in the downstream application (downstream application) that receives with user's mobile phase mobile data related, that handled of 3D locating device, the knowledge that has comprised 3D resolution and mobility in the design of this application screen, it also provides or carries out the semanteme feedback so that the easier control of system.
The benefit of 3D positioning combination screen navigation also comprises because this pairing can appear at new characteristic and the function in the various application.According to different application class, listed some embodiment below.It should be noted that the whole GUI screens shown in Figure 23-33 can use above-mentioned technology to navigate.
TV:In TV guide, there is following exemplary beneficial effect:
1. show extraneous information in back, the rendition list location, and need not click or require screen to change (semantic hover) about program.For example referring to Figure 23 that this technology is shown (a) and 23 (b).Wherein, the electronic program guides programme information that has a plurality of channels and be used to be presented at these channels on the GUI screen (its can be for example from above advanced gui screen access).In the moment shown in Figure 23 (a), the channel information pane ofcurrent selection 2300 show withchannel 15 from the program that morning, 8:30 play to 9:00, i.e. " Happy Day (happy one day) " information of being associated.Vernier 2302 does not point to the rendition list at this moment.Shown in Figure 23 (b) another constantly, for example, vernier has been moved to and has pointed to indicatedchannel 15 9:00 plays in the morning " Beverly Hillbillies (Bei Fuli person from the mountains) " by mobile 3 D locating device correspondingly before TV screen.According to this illustrative embodiments of the present invention, these information in thechannel information pane 2300 be presented at current selection automatically of making become the information about the program reselected shown in Figure 23 (b), and need not except from any input user's the location.In this embodiment, the GUI screen also can have live feed (live feed) zone 2304, and it shows the video and/or audio that is associated with the channel of current selection.
2. between the rendition list, roll and simplified navigation long linear list.In addition, this rolling can for example the 3D locating device be located on TV and realization easily by using.Referring to for example Figure 24 (a) and 24 (b).Among Figure 24 (a), the program guide that the current demonstration of GUI screen has channel 2-10 and respective programinformation thereof.By vernier 2400 being moved to the screen portions shown in the tabulation below, can activate rolling function automatically, downwards so that program guide rolls and need the user to take other any operation.The GUI screen that this rolling action produces is shown in Figure 24 (b).
3. for example screen controller of volume and channel just appears when being requested.For example, Figure 25 (a) shows such GUI screen, has wherein selected special television channel, and the video that is associated with this channel and audio feed are just exported by system.By mobile 3 D steady arm before TV, and correspondingly vernier 2500 is moved to the right-hand side of GUI screen, volume control slider bar 2502 is presented on the screen and can be handled by the user, for example by dragging volume icon 2504 up or down.If user's mobile 3 D steady arm is so that vernier moves to the left-hand side of GUI screen, then the indicated channel transform controller 2506, shown in Figure 25 (c).Some performance guide information also can be presented on the flyout panel 2508, and it can for example be handled by pointing to label (shown in Figure 25 (d)).Eject 2512 by pointing to channel control strip 2510, showing, be located at the channel (Figure 25 (e)) of the position of pointing on the channel control strip 2510 with indication.Subsequently at the fixed time on inherent this position of section input (for example, pressing the button etc.) or the result that suspends be to change channel and correspondingly to change information (Figure 25 (f)) in the display pane 2508.
4. visual browser presents a plurality of selections simultaneously to the user, and the user only just can therefrom select by point operation.This of illustrative embodiments of the present invention illustrates in aforesaid Figure 17-20 on the one hand.
5. when user's hover or option, show automatic cross-linked suggestion.This of illustrative embodiments of the present invention on the one hand can with various GUI screens that interface described herein and system are associated in see.For example, shown in Figure 26 (a), the initial user of " Paid Programming (sponsored program) " that can show based on thedisplay pane 2602 by the GUI screen top shown in the figure selects, and the cross-linked of other " request (on-demand) " GUI screen of selecting based on this initial user is provided in theright pane 2600 of program guide.For example, if the user operates " Movielink (film link) " link indisplay pane 2600, for example by vernier being moved on it and clicking " Movielink " text, then the result as this operation shows the GUI screen shown in Figure 26 (b).
6. the transfer control on the screen (for example playback, F.F., play, stop, scene skipping, end and beginning) can be used for the simple control of video playback.Similar with above-mentioned volume and channel selection controller, can the predetermined portions that vernier moves to the viewing area be shown these controllers in response to the user.For example referring to shown in Figure 27 (a) and 27 (b), wherein in Figure 27 (b) in response tovernier 2702 being moved in the zone that limits by frame 2704 and on screen demonstration transfer control 2700.
Photo:In photo application, have following exemplary beneficial effect:
1. behind location on the photograph collection, the user can automatically or alternatively be increased to playlist by ole Server OLE with it.For example shown in Figure 28 (a), point to special photo of " Europe (Europe) " concentrating (realizing by the part with the bookshelf of the mode enlarged photograph very identical with the method for above describing about film and Figure 17-20) and produce " hover amplification " effect, whereinphoto 2800 is exaggerated and is increased to automatically theplaylist part 2801 of GUI screen.Thelittle version 2 802 ofphoto 2800 is placed intoplaylist part 2801 to be added in this tabulation to represent it.
2. the enlarged photograph volume can provide more contextual information and can be used for hierarchical classification.Along with increasing of photo playlist content, will scrollcontroller 2805 appear with the quantity (see Figure 28 (b)) of restriction any time at the little version image ofplaylist part 2801 demonstrations.
3. the transfer control on the screen, for example " Play (broadcast) " in Figure 28 (a) and 28 (b) and " Clear (cancellation) " button can be used for easily the lantern slide playing programs being controlled.
Family movie:In family's film applications, have following exemplary beneficial effect:
1. behind the location of home videos collection, the user can automatically or alternatively add it in playlist to by ole Server OLE, shown in Figure 29 (a).Wherein, by pointing to the image of " Hats Part3 (cap third part) " in thecollection zone 2902 in the GUI screen, it is concentrated from " Ballet (ballet) "add playlist 2900 to.Select " Recital (recital) " button of GUI upper left will collect the interior image modification in zone for being stored image as the part of " Recital " collection, from " Recital " is concentrated another family movie is added to theplaylist 2900, shown in Figure 29 (b).
2. the enlarged photograph volume can provide more contextual information and be used for classification.
3. the transfer control on the screen, for example " Play " and " Clear " button can be used for the simple control that home videos and home videos playlist are play, shown in Figure 28 (a) and 28 (b).
Schedule:In calendar application, have following exemplary beneficial effect:
1. the user can be at hover on the calender, and need not to click the additional information of just seeing about the agreement of this day, shown in Figure 30 (a) and 30 (b).
2. scalable one day of user is to see additional detail or to use the text input system on the screen to edit these details on screen.
3. the user can be to the schedule classification to find out the conflict between the interested different members.
Internet browsing:In internet browsing is used, have following exemplary beneficial effect:
1. the user can surf the web on TV with mode closely similar on desktop computer, for example shown in Figure 31 (a).Its difference is to need to handle the complicacy of 3D.
2., then can use the text input on the instrument realization screen on the screen, for example shown in Figure 31 (b) if the user does not want to use keyboard.
Music:In music application, have following exemplary beneficial effect:
1. the user can point to music list or visual music performance, and automatically it is increased to playlist or only with its broadcast.For example, the user can navigate to the part of the bookshelf of the album cover shown in Figure 32 (a), and selects this special edition by pointing to special special edition.This can produce the screen display shown in Figure 32 (b), wherein can select independent track and it is added to theplaylist part 3200 of GUI screen.
2. the user can navigate cross-linked by pointing to interested cross-linked (similar musician or similar special edition) simply.
3. the transfer control on thescreen 3202 can be used for the simple broadcast of controlling song or playlist shown in Figure 32 (c), and also can add in any visual demonstration.
Shopping:In shopping is used, have following exemplary beneficial effect:
1. the user can point to shopping bulleted list or visual commodity image, and automatically it is added to shopping cart or only buy commodity.For example, shown in Figure 33 (a),, can use some general phrase or image that different classes of commodity to be sold in a large number are shown in first scaling layer.When suspending on the special category of user with " Handbags (handbag) " of vernier 3300 in for example Figure 33 (b), this image is amplified a little to represent current selection.Automatically or by other user import (for example pressing the button) then, can realize the amplification of " Handbags " classification, the bookshelf of handbag for example has been shown in Figure 33 (c).Amplify by hover on " Smooth Leather (smooth leather) " classification of the commodity in Figure 33 (c), the position of vernier in bookshelf indicated current selection once more.When pointing to this classification and through after a while after, realizing automatically or import realization in response to the concrete user who produces by hand held equipment and amplify once more again, thus such other the more detailed view of generation shown in Figure 33 (d).Select the image of special handbag can pass through for example above-described convergent-divergent, mobile (panning) and/or conversion effect, the amplification that produces the detailed view shown in Figure 33 (e).
2. the user can navigate cross-linked by pointing to interested cross-linked (as other commodity in identical shop, the object that matches, similar designer) simply.For example, shown in Figure 33 (e), thecross-linked image 3300 that is shown as of the footwear that match.If activate this link, the user then can enter the detailed view of this project, realizes and need not to navigate by the screen to various higher levels, shown in Figure 33 (f).
3. the user can obtain the more information of interested project simply by hover or convergent-divergent.
Allow the user only to point to the commodity of screen to indicate it to want.Alternatively, gesture itself can think to use the part of input.Use to consider (for example shake, unnecessary mobile, screen resolution) and to utilize (for example semantic hover, convergent-divergent, the relevant control of screen picture) 3D to navigate by adjusting, correct powerful interactive television-j o-application can be provided.On May 2nd, 2005 submit to be entitled as " the unconscious mobile method and apparatus that is used for removing the 3D locating device " the 11/119th, the extra discussion of the unconscious mobile example technique that is used to control the 3D locating device is provided in No. 987 U.S. Patent applications, and its content is incorporated this paper by reference into.
Above-described illustrative embodiments trends towards each side of the present invention is described, rather than restrictive.Therefore the present invention can have multiple variant in detailed enforcement, and these variants can be drawn from the explanation of this paper by those skilled in the art.As described in claim hereinafter, think that whole these variants and modification are all in scope and spirit of the present invention.Unless offer some clarification on, the element that uses in the instructions of the present invention, action or indication should not be interpreted as to the present invention be decision or essence.Equally, " one " used herein trends towards comprising one or more projects.