Movatterモバイル変換


[0]ホーム

URL:


CN107371060A - Video image synthesis system and methods for using them based on TV output - Google Patents

Video image synthesis system and methods for using them based on TV output
Download PDF

Info

Publication number
CN107371060A
CN107371060ACN201710737444.7ACN201710737444ACN107371060ACN 107371060 ACN107371060 ACN 107371060ACN 201710737444 ACN201710737444 ACN 201710737444ACN 107371060 ACN107371060 ACN 107371060A
Authority
CN
China
Prior art keywords
image
video
service
audio
unit
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201710737444.7A
Other languages
Chinese (zh)
Other versions
CN107371060B (en
Inventor
佟飞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Zhiwang Times Technology Co Ltd
Original Assignee
Beijing Zhiwang Times Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Zhiwang Times Technology Co LtdfiledCriticalBeijing Zhiwang Times Technology Co Ltd
Publication of CN107371060ApublicationCriticalpatent/CN107371060A/en
Application grantedgrantedCritical
Publication of CN107371060BpublicationCriticalpatent/CN107371060B/en
Activelegal-statusCriticalCurrent
Anticipated expirationlegal-statusCritical

Links

Classifications

Landscapes

Abstract

The invention discloses a kind of video image synthesis system and methods for using them based on TV output, including television video image subsystems, service creation image subsystems, video image synthesis subsystem, video frequency output subsystem;Exported after described video image synthon system synthesis television video image and service creation image and give video frequency output subsystem, the video image after synthesis is sent to television set by the video frequency output subsystem.The present invention utilizes " voice " this tie, incorporates each road resource, has put into practice the integrated service based on TV output and concurrent services.By the way that television video image and service creation image are organically synthesized together, so that people are while TV is seen, another kind of fine, happy and joyful service can also be enjoyed, so that the TV customer group the most vast for occupying 99.3% TV popularity rate tried to be the first from video screen experienced it is intelligent bring their happy, improve their qualities of life and Happiness Index.

Description

Video image synthesis system and methods for using them based on TV output
Technical field
The present invention relates to ntelligent television technolog field, more particularly to a kind of video image based on TV output to synthesize systemSystem.
Background technology
The mankind always pursue the simplification of service without end.Although current scientific and technological level can allow television screen to doIncreasing, definition more and more higher is obtained, but often still can not take into account the most simple demand of common people:For example, when you are seeingWhen TV, can usually emerge some ideas or demand suddenly in brains, such as, the weather of today how, neighbouring supermarket hasWhich promotional product, the demand taken out, the demand of net purchase, demand of trip service etc..Although television screen is very big, do not haveIt can be your demand vacating space to have one-inch place;Although mobile phone very little, you have to while TV is seen, crawl thatMobile phone interface of tens times less than television screen and check weather forecast, take-away, trip service;Although heat, you are alsoIt is helplessly to come into that crowded crowd of supermarket, is found out the reason for Discount Promotion message.
The content of the invention
The present invention in view of the shortcomings of the prior art, proposes a kind of video image synthesis system based on TV output and applicationMethod, reach while TV is seen, the purpose for enjoying other service creation images can be synthesized in television interfaces.
The present invention above-mentioned technical purpose technical scheme is that:
A kind of video image synthesis system based on TV output, it is characterised in that:Including television video image subsystems, serviceGenerate image subsystems, video image synthesis subsystem, video frequency output subsystem;Described video image synthon system synthesisExported after television video image and service creation image and give video frequency output subsystem, the video frequency output subsystem is by after synthesisVideo image is sent to television set.
It is preferred that the HDMI sounds that described television video image generation subsystem includes being used to connect television set top box regardFrequency signal element, HDMI audio frequency and video separators, described HDMI audio frequency and video separator receives HDMI audio-video signals, and isolatesHDMI video signal, and HDMI video signal output is synthesized into subsystem to video image.
It is preferred that described service creation image subsystems include echo cancellor and noise reduction and enhancement unit, speech recognitionWith semantic understanding unit, service image generation unit;The speech recognition and the elimination of semantic understanding unit reception of echoes and noise reductionAnd strengthen later voice signal, speech recognition and semantic understanding are carried out, then the result of semantic understanding is sent to service imageGeneration unit;The service image generation unit includes cloud service image generation unit and local service image generation unit;Described cloud service image generation unit includes cloud service unit and cloud service elementary area, the local service imageGeneration unit includes local service unit and local service elementary area;The service image generation unit is by cloud service imageOr local service image is sent to video image synthesis subsystem.
It is preferred that the echo cancellor and noise reduction unit include being used for the HDMI audio frequency and video separation for connecting TV set-top boxDevice, the audio receiver for connecting tv audio output end, audio selector, echo cancellation unit, audio output one are singleMember, speech enhan-cement and noise reduction unit, playing module, audio collection unit, the unit of audio output two;The audio selector receivesThe audio signals of HDMI audio frequency and video separators or the audio signal for receiving audio receiver, be sent respectively to echo cancellation unit andThe unit of audio output one, the playing module receive the audio letter that television set is reduced after the audio signal of the unit of audio output oneNumber;After the echo cancellation unit receives the audio signal of television set and the audio signal of audio collection device respectively, power down is filteredRetain audio collection signal depending on the audio signal of machine, and the audio collection signal of reservation is sent out after speech enhan-cement and noise reductionThe unit of audio output two is given, the unit of audio output two will filter out tv audio signal and pass through speech enhan-cement and dropThe audio collection signal made an uproar is sent to speech recognition and semantic understanding unit.
It is preferred that position of the service creation image in video screen can be with self-defined setting, including the image canOutput self-defined can also be set in the lower right corner of video screen, the upper left corner, its image area size.
It is preferred that the cloud service image generation unit receives the semantic instructions of speech recognition and semantic understanding unit,Generated according to semantic instructions and export corresponding cloud service image;The cloud service content includes neighbouring supermarket promotion letterBreath, online shopping mall's service, weather service, restaurant's new product service, medicine purchase service, name doctor Medical service, trip are serviced, taken outService, other services.
It is preferred that described cloud service image, including cloud system directly invoke third party's opening API data-interface,And the independently developed service of cloud system, to generate cloud service image.
It is preferred that the independently developed service of cloud system includes integrating third party's data, third party's resource, root are integratedDemand data is generated according to user preference, and then generates cloud service image.
A kind of application process of the video image synthesis system based on TV output, it is characterised in that:Comprise the following steps:
Step 1: opening television set, television set is waited to enter a stable video clip;
Step 2: user sends phonetic order in room voice far-field range;
Step 3: phonetic order is sent to speech recognition and semantic understanding unit by voice collector;
Step 4: voice recognition and semantic understanding unit carry out semantic understanding and the semantic data after semantic understanding are sent into clothesBusiness image generation unit;
Step 5: service image generation unit generates cloud service image or local service image according to the content of semantic understanding,And image is sent back into video image synthesis subsystem;
Step 6: video image synthon system synthesis television video image and service creation image, and by the image after synthesisIt is sent to video frequency output subsystem;
Step 7: the video image after synthesis is sent to television set by video frequency output subsystem.
The turn on television set of the step 1 also includes starting television boot-strap by voice command;
The room voice far-field range of the step 2 is both the scope on 5 meters or so of voice collector periphery in room;The roomInterior voice collector is arranged in each room lamp connection box or in room audio amplifier.
Advantages of the present invention effect
1st, the present invention utilize " voice " this tie, incorporate each road resource, put into practice based on TV output integrated service andSuch a new services form for more pressing close to common people's demand of concurrent services.By by television video image and service creationImage is organically synthesized together so that people are while TV is seen, additionally it is possible to enjoy another kind it is fine, happy and joyfulService:People heartily, with being full of excitement, indiscriminately ad. as one wishes according to the preference of individual can send voice need to voice systemInstruction is asked, voice demand instruction is converted into service image and including in some given zone of video screen by voice system at onceDomain.
2nd, the present invention make those be also not equipped with smart mobile phone, be also not equipped with computer, will not also operating computer user,No longer feel sorry for these services oneself can not be enjoyed, they can equally enjoy smart mobile phone on the tv screenThe function of function, PC terminals, so that occupying the TV customer group the most vast of 99.3% TV popularity rate from TV screenTried to be the first on curtain experienced it is intelligent bring their happy, improve their qualities of life and Happiness Index.
3rd, the present invention takes full advantage of the advantage of television set giant-screen, and television set giant-screen is no longer only only for TV programme instituteAccount for, meanwhile, also as the outlet terminal of all kinds of intelligent Services;TV giant-screen no longer only serves the TV programme on line, togetherWhen, the huge numbers of families that also serve under line.
Brief description of the drawings
Fig. 1 is present system frame diagram;
Fig. 2 is TV viewing screen image subsystems schematic diagram of the present invention;
Fig. 3 is service creation image subsystems schematic diagram of the present invention;
Fig. 4 is service image generation unit structure chart of the present invention;
Fig. 5 is echo cancellor and noise reduction of the present invention and enhancement unit structural representation;
Fig. 6 is cloud service content schematic diagram of the present invention;
Fig. 7 is cloud service image construction schematic diagram of the present invention;
Fig. 8 is the independently developed service content schematic diagram of cloud system of the present invention.
Embodiment
The present invention is described in further detail below in conjunction with accompanying drawing.
First, design principle of the invention
The present invention purpose to be reached, be people while TV is seen, send phonetic order, the system is by these phonetic ordersBecome service result image to show on the tv screen.Here, three problems to be solved:
First will solve the problems, such as:User somewhere sends voice, is that appointed place sends voice or required locationVoice is sent, whom phonetic incepting object is.It is to send phonetic order against the air in room or send language against microphoneSound instructs.
The user of the system the random angle of voice far-field range can shave one's head out phonetic order in room, without handPhonetic order is sent by microphone.Principle is that the system is assembled with voice collector, language in the light switch box in each roomThe scope that 5 meters or so of sound collector is referred to as voice far-field range, in the range of this, voice recognition rate of accuracy reached to 95 withOn.Bedroom, study, dining room, lavatory, the parlor of such as house, typically it is no more than the scope of 5 meters of square, general each roomElectric light will be installed, so, the phonetic order that user sends whenever and wherever possible in room can be received.
Second will solve the problems, such as:When television program plays, the voice of sound of television and people are conflictedWhat if.Then its solution method and principle are believed tv audio as shown in figure 5, take the audio signal of television set firstNumber it is divided into two-way, the reduction for television audio signals all the way, all the way as reference signal, the reference signal is used to disappear in echoExcept in unit, when voice audio signals and the television audio signals mixing of people, removed with reference signal in mixed audio signalTelevision audio signals, and retain the voice signal of people.Meanwhile the voice signal of people may be because surrounding environment such as air-conditioning etc.Influence by many interference, strengthen the voice signal of people by speech enhan-cement and noise reduction means, finally, will optimize enhancedVoice signal is sent to speech recognition and semantic understanding unit.
3rd will solve the problems, such as:How phonetic order is changed into video image.Described video image is exactly in electricityA browser area is opened up in screen curtain setting range, described composite video image is exactly television video+browser, thisInvention system is shown in browser area on the tv screen by service creation image or for service result image.
2nd, based on above inventive principle, a kind of video image synthesis system based on TV output, as shown in figure 1, includingTelevision video image subsystems, service creation image subsystems, video image synthesis subsystem, video frequency output subsystem;It is describedVideo image synthon system synthesis television video image and service creation image after export and give video frequency output subsystem, it is describedVideo image after synthesis is sent to television set by video frequency output subsystem.
As shown in Fig. 2 described television video image generation subsystem includes being used for the HDMI for connecting television set top boxAudio-video signal unit, HDMI audio frequency and video separators, described HDMI audio frequency and video separator receives HDMI audio-video signals, and dividesHDMI video signal is separated out, and HDMI video signal output is synthesized into subsystem to video image.
As shown in Figure 3, Figure 4, described service creation image subsystems include echo cancellor and noise reduction and enhancement unit, languageSound identifies and semantic understanding unit, service image generation unit;The speech recognition and semantic understanding unit reception of echoes eliminateWith noise reduction and strengthen later voice signal, carry out speech recognition and semantic understanding, then the result of semantic understanding is sent to clothesBusiness image generation unit;The service image generation unit includes cloud service image generation unit and local service image generatesUnit;Described cloud service image generation unit includes cloud service unit and cloud service elementary area, the local clothesBusiness image generation unit includes local service unit and local service elementary area;The service image generation unit takes high in the cloudsBusiness image or local service image are sent to video image synthesis subsystem.
As shown in figure 5, the echo cancellor and noise reduction unit include being used for the HDMI audio frequency and video point for connecting TV set-top boxAudio receiver, audio selector from device, for connecting tv audio output end, echo cancellation unit, audio output oneUnit, speech enhan-cement and noise reduction unit, playing module, audio collection unit, the unit of audio output two;The audio selector connectsReceive the audio signal of HDMI audio frequency and video separators or receive the audio signal of audio receiver, be sent respectively to echo cancellation unitWith the unit of audio output one, the playing module receives the audio letter that television set is reduced after the audio signal of the unit of audio output oneNumber;After the echo cancellation unit receives the audio signal of television set and the audio signal of audio collection device respectively, power down is filteredRetain audio collection signal depending on the audio signal of machine, and the audio collection signal of reservation is sent out after speech enhan-cement and noise reductionThe unit of audio output two is given, the unit of audio output two will filter out tv audio signal and pass through speech enhan-cement and dropThe audio collection signal made an uproar is sent to speech recognition and semantic understanding unit.
Position of the service creation image in video screen can be with self-defined setting, including the image is exportable in electricityThe lower right corner, the upper left corner of screen curtain, its image area size self-defined can also be set.
As shown in fig. 6, the cloud service image generation unit receives speech recognition and the semanteme of semantic understanding unit refers toOrder, generated according to semantic instructions and export corresponding cloud service image;The cloud service content promotes including neighbouring supermarketInformation, online shopping mall's service, weather service, restaurant's new product service, medicine purchase service, name doctor Medical service, trip service, are outerThe service of selling, other services.
As shown in fig. 7, described cloud service image, including cloud system directly invoke third party's opening API data and connectMouthful, and the independently developed service of cloud system, to generate cloud service image.
As shown in figure 8, the independently developed service of cloud system includes integrating third party's data, third party's money is integratedSource, demand data is generated according to user preference, and then generate cloud service image.
A kind of application process of the video image synthesis system based on TV output, comprises the following steps:
Step 1: opening television set, television set is waited to enter a stable video clip;
Step 2: user sends phonetic order in room voice far-field range;
Step 3: phonetic order is sent to speech recognition and semantic understanding unit by voice collector;
Step 4: voice recognition and semantic understanding unit carry out semantic understanding and the semantic data after semantic understanding are sent into clothesBusiness image generation unit;
Step 5: service image generation unit generates cloud service image or local service image according to the content of semantic understanding,And image is sent back into video image synthesis subsystem;
Step 6: video image synthon system synthesis television video image and service creation image, and by the image after synthesisIt is sent to video frequency output subsystem;
Step 7: the video image after synthesis is sent to television set by video frequency output subsystem.
The turn on television set of the step 1 also includes starting television boot-strap by voice command;
The room voice far-field range of the step 2 is both the scope on 5 meters or so of voice collector periphery in room;The roomInterior voice collector is arranged in each room lamp connection box or in room audio amplifier.
This specific embodiment is only explanation of the invention, and it is not limitation of the present invention, people in the artMember can make the modification of no creative contribution to the present embodiment as needed after this specification is read, but as long as at thisAll protected in the right of invention by Patent Law.

Claims (10)

  1. A kind of 3. video image synthesis system based on TV output according to claim 1, it is characterised in that:DescribedService creation image subsystems include echo cancellor and noise reduction and enhancement unit, speech recognition and semantic understanding unit, service graphAs generation unit;The speech recognition and the elimination of semantic understanding unit reception of echoes and noise reduction simultaneously strengthen later voice signal,Speech recognition and semantic understanding are carried out, then the result of semantic understanding is sent to service image generation unit;The service imageGeneration unit includes cloud service image generation unit and local service image generation unit;Described cloud service image generationUnit includes cloud service unit and cloud service elementary area, and the local service image generation unit includes local service listMember and local service elementary area;Cloud service image or local service image are sent to and regarded by the service image generation unitFrequency image synthesizes subsystem.
  2. A kind of 4. video image synthesis system based on TV output according to claim 3, it is characterised in that:Described timeSound eliminates and noise reduction unit includes being used for the HDMI audio frequency and video separator, defeated for connecting tv audio for connecting TV set-top boxGo out audio receiver, audio selector, echo cancellation unit, the unit of audio output one, speech enhan-cement and the noise reduction unit at end, broadcastAmplification module, audio collection unit, the unit of audio output two;The audio selector receives the audio letter of HDMI audio frequency and video separatorsNumber or receive audio receiver audio signal, be sent respectively to echo cancellation unit and the unit of audio output one, the broadcastingThe audio signal of television set is reduced after the audio signal of the module reception unit of audio output one;The echo cancellation unit connects respectivelyAfter receiving the audio signal of television set and the audio signal of audio collection device, filter out the audio signal of television set and retain audio and adoptCollect signal, and the audio collection signal of reservation is sent to the unit of audio output two, the sound after speech enhan-cement and noise reductionFrequency output Unit two will filter out tv audio signal and be sent to language by the audio collection signal of speech enhan-cement and noise reductionSound identifies and semantic understanding unit.
CN201710737444.7A2017-08-092017-08-24Video image synthesis system based on television output and application methodActiveCN107371060B (en)

Applications Claiming Priority (2)

Application NumberPriority DateFiling DateTitle
CN201710677714X2017-08-09
CN2017106777142017-08-09

Publications (2)

Publication NumberPublication Date
CN107371060Atrue CN107371060A (en)2017-11-21
CN107371060B CN107371060B (en)2023-08-08

Family

ID=60312022

Family Applications (1)

Application NumberTitlePriority DateFiling Date
CN201710737444.7AActiveCN107371060B (en)2017-08-092017-08-24Video image synthesis system based on television output and application method

Country Status (1)

CountryLink
CN (1)CN107371060B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN108289267A (en)*2018-04-142018-07-17北京智网时代科技有限公司Eliminate echo cancelling device, method, speaker, the voice frequency sender of TV interference
WO2020060904A1 (en)2018-09-182020-03-26Roku, Inc.Audio cancellation and content recognition of audio received over hdmi/arc

Citations (7)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US20050120391A1 (en)*2003-12-022005-06-02Quadrock Communications, Inc.System and method for generation of interactive TV content
CN101459817A (en)*2007-12-112009-06-17中国科学院声学研究所Service information publishing method and system
WO2012144963A1 (en)*2011-04-222012-10-26Netas Telekomunikasyon Anonim SirketiEstablishing audio and video communication by means of a camera and a microphone embedded in a television and the system that supports it
CN102802031A (en)*2012-07-132012-11-28李映红Interactive system and method in allusion to television programs
CN203055434U (en)*2012-07-302013-07-10刘强Family speech interactive terminal based on cloud technique
CN105045122A (en)*2015-06-242015-11-11张子兴Intelligent household natural interaction system based on audios and videos
CN105163160A (en)*2015-08-292015-12-16天脉聚源(北京)科技有限公司Method and device for improving information synthesis security

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US20050120391A1 (en)*2003-12-022005-06-02Quadrock Communications, Inc.System and method for generation of interactive TV content
CN101459817A (en)*2007-12-112009-06-17中国科学院声学研究所Service information publishing method and system
WO2012144963A1 (en)*2011-04-222012-10-26Netas Telekomunikasyon Anonim SirketiEstablishing audio and video communication by means of a camera and a microphone embedded in a television and the system that supports it
CN102802031A (en)*2012-07-132012-11-28李映红Interactive system and method in allusion to television programs
CN203055434U (en)*2012-07-302013-07-10刘强Family speech interactive terminal based on cloud technique
CN105045122A (en)*2015-06-242015-11-11张子兴Intelligent household natural interaction system based on audios and videos
CN105163160A (en)*2015-08-292015-12-16天脉聚源(北京)科技有限公司Method and device for improving information synthesis security

Cited By (5)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN108289267A (en)*2018-04-142018-07-17北京智网时代科技有限公司Eliminate echo cancelling device, method, speaker, the voice frequency sender of TV interference
WO2020060904A1 (en)2018-09-182020-03-26Roku, Inc.Audio cancellation and content recognition of audio received over hdmi/arc
EP3854103A4 (en)*2018-09-182022-05-25Roku, Inc. AUDIO CANCELLATION AND RECOGNITION OF AUDIO CONTENT RECEIVED OVER HDMI/ARC
US11392342B2 (en)2018-09-182022-07-19Roku, Inc.Audio cancellation and content recognition of audio received over HDMI/ARC
US11625215B2 (en)2018-09-182023-04-11Roku, Inc.Audio cancellation and content recognition of audio received over HDMI/ARC

Also Published As

Publication numberPublication date
CN107371060B (en)2023-08-08

Similar Documents

PublicationPublication DateTitle
CN110970014B (en)Voice conversion, file generation, broadcasting and voice processing method, equipment and medium
CN103152614B (en)Second display is used to carry out the system and method across service search of voice driven
CN103916709B (en) Server and method for controlling the server
CN109036374B (en)Data processing method and device
US11234094B2 (en)Information processing device, information processing method, and information processing system
JP6473262B1 (en) Distribution server, distribution program, and terminal
CN109218035A (en)Processing method, electronic equipment, server and the video playback apparatus of group information
CN109257659A (en)Subtitle adding method, device, electronic equipment and computer readable storage medium
CN106488264A (en)Singing the live middle method, system and device for showing the lyrics
CN103440603A (en)Order system based on augmented reality
CN110324702B (en)Information pushing method and device in video playing process
KR102208822B1 (en)Apparatus, method for recognizing voice and method of displaying user interface therefor
CN107578777A (en)Word-information display method, apparatus and system, audio recognition method and device
CN107480766A (en)The method and system of the content generation of multi-modal virtual robot
CN103873919B (en)A kind of information processing method and electronic equipment
CN110517686A (en)Intelligent sound box end voice opens the method and system of application
CN103945140B (en)The generation method and system of video caption
CN107371060A (en)Video image synthesis system and methods for using them based on TV output
CN101867742A (en) A TV System Based on Voice Control
CN103729121B (en)Image display and its operating method
US20210136323A1 (en)Information processing device, information processing method, and program
CN101729827A (en)Voice service method, system, digital television receiving terminal and front-end device
CN108153508A (en)A kind of method and device of audio frequency process
CN103095927A (en)Displaying and voice outputting method and system based on mobile communication terminal and glasses
CN207283737U (en)Video image synthesis system based on TV output

Legal Events

DateCodeTitleDescription
PB01Publication
PB01Publication
SE01Entry into force of request for substantive examination
SE01Entry into force of request for substantive examination
GR01Patent grant
GR01Patent grant

[8]ページ先頭

©2009-2025 Movatter.jp