CN106254939B

Movatterモバイル変換

Info

Publication number: CN106254939B
Application number: CN201610875775.2A
Authority: CN
Inventors: 张亮; 伍亮雄; 刘鸣
Original assignee: Beijing Xiaomi Mobile Software Co Ltd
Current assignee: Beijing Xiaomi Mobile Software Co Ltd
Priority date: 2016-09-30
Filing date: 2016-09-30
Publication date: 2020-02-07
Anticipated expiration: 2036-09-30
Also published as: CN106254939A

Abstract

Translated fromChinese

本公开是关于信息提示方法及装置。该方法包括：在播放视频时，获取所述视频中的音频帧；对所述音频帧进行分析，获得所述音频帧对应的人物信息；将所述音频帧对应的人物信息进行提示。该技术方案，在播放该视频时，通过获取该视频中的音频帧，并对该音频帧进行自动分析，可以获取到该音频帧对应的配音演员的信息，进而将音频帧对应的人物信息进行提示，从而使用户在观看该视频时，可以充分了解该视频中的配音演员的具体信息，这有利于进一步提高用户的观看体验，可以克服了相关技术中视频中未标识出配音演员的信息而导致用户无法了解配音演员的缺陷。

The present disclosure relates to an information prompting method and apparatus. The method includes: when a video is played, acquiring an audio frame in the video; analyzing the audio frame to obtain character information corresponding to the audio frame; and prompting the character information corresponding to the audio frame. In the technical solution, when the video is played, by acquiring the audio frame in the video and automatically analyzing the audio frame, the information of the voice actor corresponding to the audio frame can be obtained, and then the character information corresponding to the audio frame can be obtained. prompt, so that the user can fully understand the specific information of the voice actor in the video when watching the video, which is conducive to further improving the user's viewing experience, and can overcome the fact that the information of the voice actor is not identified in the video in the related art. Defects that prevent users from understanding voice actors.

Description

Translated fromChinese

信息提示方法及装置Information prompting method and device

技术领域technical field

本公开涉及终端技术领域，尤其涉及信息提示方法及装置。The present disclosure relates to the technical field of terminals, and in particular, to a method and device for prompting information.

背景技术Background technique

目前，用户在观看视频时，通常只能看到该视频的部分人物信息，如该视频的主演、导演等，而这些信息并不完整，无法使用户充分了解该视频的各种人物信息。At present, when a user is watching a video, he can usually only see part of the character information of the video, such as the starring actor and director of the video, and the information is incomplete, so that the user cannot fully understand the various character information of the video.

发明内容SUMMARY OF THE INVENTION

本公开实施例提供了信息提示方法及装置。所述技术方案如下：Embodiments of the present disclosure provide an information prompting method and apparatus. The technical solution is as follows:

根据本公开实施例的第一方面，提供一种信息提示方法，包括：According to a first aspect of the embodiments of the present disclosure, an information prompting method is provided, including:

在播放视频时，获取所述视频中的音频帧；When playing the video, obtain the audio frame in the video;

对所述音频帧进行分析，获得所述音频帧对应的人物信息；The audio frame is analyzed to obtain character information corresponding to the audio frame;

将所述音频帧对应的人物信息进行提示。The character information corresponding to the audio frame is prompted.

在一个实施例中，所述方法还包括：In one embodiment, the method further includes:

在播放所述视频时，获取所述视频中与所述音频帧相应的视频帧；When playing the video, obtain the video frame corresponding to the audio frame in the video;

对所述视频帧进行识别，获得所述视频帧对应的人物信息；Identifying the video frame to obtain character information corresponding to the video frame;

所述将所述音频帧对应的人物信息进行提示，包括：The prompting of the character information corresponding to the audio frame includes:

当所述视频帧对应的人物信息与所述音频帧对应的人物信息不匹配时，将所述音频帧对应的人物信息进行提示。When the character information corresponding to the video frame does not match the character information corresponding to the audio frame, the character information corresponding to the audio frame is prompted.

在一个实施例中，所述对所述视频帧进行识别，获得所述视频帧对应的人物信息，包括：In one embodiment, the identifying the video frame to obtain the character information corresponding to the video frame includes:

将所述视频帧的图像与至少一个预设图像进行匹配；matching the image of the video frame with at least one preset image;

当所述视频帧的图像与所述至少一个预设图像中的目标图像相匹配时，将所述目标图像对应的人物信息确定为所述视频帧对应的人物信息。When the image of the video frame matches the target image in the at least one preset image, the character information corresponding to the target image is determined as the character information corresponding to the video frame.

在一个实施例中，所述对所述音频帧进行分析，获得所述音频帧对应的人物信息，包括：In one embodiment, the analyzing the audio frame to obtain character information corresponding to the audio frame includes:

获取所述音频帧的声音参数；obtain the sound parameters of the audio frame;

将所述音频帧的声音参数与至少一个预设声音的声音参数进行匹配；matching the sound parameters of the audio frame with the sound parameters of at least one preset sound;

当所述音频帧的声音参数与所述至少一个预设声音中的目标声音的声音参数相匹配时，将所述目标声音的人物信息确定为所述音频帧对应的人物信息。When the sound parameter of the audio frame matches the sound parameter of the target sound in the at least one preset sound, the character information of the target sound is determined as the character information corresponding to the audio frame.

在一个实施例中，所述声音参数包括：响度、音调、音色中的至少一项。In one embodiment, the sound parameter includes: at least one of loudness, pitch, and timbre.

在一个实施例中，所述音频帧对应的人物信息包括：所述音频帧对应的人物的称呼、职位、联系方式中的至少一项信息。In one embodiment, the character information corresponding to the audio frame includes: at least one item of information from the title, position, and contact information of the character corresponding to the audio frame.

根据本公开实施例的第二方面，提供一种信息提示装置，包括：According to a second aspect of the embodiments of the present disclosure, an information prompting device is provided, including:

第一获取模块，用于在播放视频时，获取所述视频中的音频帧；The first acquisition module is used to acquire audio frames in the video when playing the video;

第二获取模块，用于对所述音频帧进行分析，获得所述音频帧对应的人物信息；A second acquisition module, configured to analyze the audio frame to obtain character information corresponding to the audio frame;

提示模块，用于将所述音频帧对应的人物信息进行提示。The prompting module is used for prompting the character information corresponding to the audio frame.

在一个实施例中，所述装置还包括：In one embodiment, the apparatus further comprises:

第三获取模块，用于在播放所述视频时，获取所述视频中与所述音频帧相应的视频帧；A third acquisition module, configured to acquire the video frame corresponding to the audio frame in the video when the video is played;

第四获取模块，用于对所述视频帧进行识别，获得所述视频帧对应的人物信息；a fourth acquisition module, configured to identify the video frame and obtain character information corresponding to the video frame;

所述提示模块包括：The prompt module includes:

第一提示子模块，用于当所述视频帧对应的人物信息与所述音频帧对应的人物信息不匹配时，将所述音频帧对应的人物信息进行提示。The first prompting submodule is configured to prompt the character information corresponding to the audio frame when the character information corresponding to the video frame does not match the character information corresponding to the audio frame.

在一个实施例中，所述第四获取模块包括：In one embodiment, the fourth obtaining module includes:

第一匹配子模块，用于将所述视频帧的图像与至少一个预设图像进行匹配；a first matching submodule for matching the image of the video frame with at least one preset image;

第一确定子模块，用于当所述视频帧的图像与所述至少一个预设图像中的目标图像相匹配时，将所述目标图像对应的人物信息确定为所述视频帧对应的人物信息。a first determination submodule, configured to determine the character information corresponding to the target image as the character information corresponding to the video frame when the image of the video frame matches the target image in the at least one preset image .

在一个实施例中，所述第二获取模块包括：In one embodiment, the second obtaining module includes:

获取子模块，用于获取所述音频帧的声音参数；Obtaining a submodule for obtaining the sound parameters of the audio frame;

第二匹配子模块，用于将所述音频帧的声音参数与至少一个预设声音的声音参数进行匹配；a second matching submodule, configured to match the sound parameters of the audio frame with the sound parameters of at least one preset sound;

第二确定子模块，用于当所述音频帧的声音参数与所述至少一个预设声音中的目标声音的声音参数相匹配时，将所述目标声音的人物信息确定为所述音频帧对应的人物信息。a second determination submodule, configured to determine the character information of the target sound as the audio frame corresponding to the sound parameter of the audio frame when the sound parameter of the audio frame matches the sound parameter of the target sound in the at least one preset sound character information.

根据本公开实施例的第三方面，提供了一种信息提示装置，包括：According to a third aspect of the embodiments of the present disclosure, an information prompting device is provided, including:

处理器；processor;

用于存储处理器可执行指令的存储器；memory for storing processor-executable instructions;

其中，所述处理器被配置为：wherein the processor is configured to:

在播放所述视频时，获取所述视频中的音频帧；When playing the video, obtain the audio frame in the video;

本公开的实施例提供的技术方案可以包括以下有益效果：The technical solutions provided by the embodiments of the present disclosure may include the following beneficial effects:

本公开的实施例提供的技术方案，在播放该视频时，通过获取该视频中的音频帧，并对该音频帧进行自动分析，可以获取到该音频帧对应的配音演员的信息，进而将音频帧对应的人物信息进行提示，从而使用户在观看该视频时，可以充分了解该视频中的配音演员的具体信息，这有利于进一步提高用户的观看体验，可以克服了相关技术中视频中未标识出配音演员的信息而导致用户无法了解配音演员的缺陷。According to the technical solution provided by the embodiments of the present disclosure, when the video is played, by acquiring the audio frame in the video and automatically analyzing the audio frame, the information of the voice actor corresponding to the audio frame can be acquired, and then the audio The corresponding character information of the frame is prompted, so that the user can fully understand the specific information of the voice actor in the video when watching the video, which is conducive to further improving the user's viewing experience, and can overcome the unidentified video in the related art. The defect that users cannot understand the voice actor due to the information of the voice actor.

应当理解的是，以上的一般描述和后文的细节描述仅是示例性和解释性的，并不能限制本公开。It is to be understood that the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the present disclosure.

附图说明Description of drawings

此处的附图被并入说明书中并构成本说明书的一部分，示出了符合本公开的实施例，并与说明书一起用于解释本公开的原理。The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the disclosure and together with the description serve to explain the principles of the disclosure.

图1是根据一示例性实施例示出的一种信息提示方法的流程图。Fig. 1 is a flow chart of a method for prompting information according to an exemplary embodiment.

图2是根据一示例性实施例示出的另一种信息提示方法的流程图。Fig. 2 is a flow chart of another method for prompting information according to an exemplary embodiment.

图3是根据一示例性实施例示出的又一种信息提示方法的流程图。Fig. 3 is a flow chart of yet another method for prompting information according to an exemplary embodiment.

图4是根据一示例性实施例示出的再一种信息提示方法的流程图。Fig. 4 is a flow chart of yet another method for prompting information according to an exemplary embodiment.

图5是根据一示例性实施例示出的一种信息提示装置的框图。Fig. 5 is a block diagram of an information prompting apparatus according to an exemplary embodiment.

图6是根据一示例性实施例示出的另一种信息提示装置的框图。Fig. 6 is a block diagram of another information prompting apparatus according to an exemplary embodiment.

图7是根据一示例性实施例示出的又一种信息提示装置的框图。Fig. 7 is a block diagram of yet another information prompting apparatus according to an exemplary embodiment.

图8是根据一示例性实施例示出的再一种信息提示装置的框图。Fig. 8 is a block diagram of still another information prompting apparatus according to an exemplary embodiment.

图9是根据一示例性实施例示出的适用于信息提示装置的框图。Fig. 9 is a block diagram showing an apparatus suitable for information prompting according to an exemplary embodiment.

具体实施方式Detailed ways

这里将详细地对示例性实施例进行说明，其示例表示在附图中。下面的描述涉及附图时，除非另有表示，不同附图中的相同数字表示相同或相似的要素。以下示例性实施例中所描述的实施方式并不代表与本公开相一致的所有实施方式。相反，它们仅是与如所附权利要求书中所详述的、本公开的一些方面相一致的装置和方法的例子。Exemplary embodiments will be described in detail herein, examples of which are illustrated in the accompanying drawings. Where the following description refers to the drawings, the same numerals in different drawings refer to the same or similar elements unless otherwise indicated. The implementations described in the illustrative examples below are not intended to represent all implementations consistent with this disclosure. Rather, they are merely examples of apparatus and methods consistent with some aspects of the present disclosure as recited in the appended claims.

目前，用户在观看电影或者电视剧等视频时，由于相关技术中的视频仅标识出了该视频的主演、导演等，并未标识出配音演员的信息，这就导致用户只能看到该视频的部分人物信息，如该视频的主演、导演等，而由于这些信息并不完整，因而，用户在观看视频时，可能并不知道当前视频中的配音演员，更不知道配音演员是谁，这就给用户带来一些困惑，无法使用户充分了解该视频的各种人物信息，同时也不利于配音演员的发展。At present, when a user is watching a movie or a TV series, because the video in the related art only identifies the main actor, director, etc. of the video, but does not identify the information of the voice actor, the user can only see the information of the video. Some character information, such as the main actor and director of the video, and because these information are incomplete, users may not know the voice actor in the current video when watching the video, let alone who the voice actor is. It brings some confusion to the user, cannot make the user fully understand the various character information of the video, and is also not conducive to the development of voice actors.

为了解决上述技术问题，本公开实施例提供了一种信息提示方法，该方法可用于信息提示程序、系统或装置中，且该方法对应的执行主体可以是手机、平板、计算机等终端，如图1所示，该方法包括步骤S101至步骤S103：In order to solve the above technical problems, an embodiment of the present disclosure provides an information prompting method, which can be used in an information prompting program, system, or device, and the execution body corresponding to the method can be a terminal such as a mobile phone, a tablet, or a computer, as shown in the figure 1, the method includes steps S101 to S103:

在步骤S101中，在播放视频时，获取视频中的音频帧；In step S101, when playing the video, obtain the audio frame in the video;

该视频可以是电影、电视剧等。The video can be a movie, a TV series, or the like.

获取该视频中的音频帧时，可以按照预设时间间隔进行获取，或者按照其他条件获取，例如：在监测到与该音频帧相应的视频帧中的人物图像为预设人物图像时，再获取该音频帧。When acquiring the audio frame in the video, it can be acquired according to a preset time interval, or acquired according to other conditions, for example: when it is detected that the person image in the video frame corresponding to the audio frame is a preset person image, then acquire the audio frame.

在步骤S102中，对音频帧进行分析，获得音频帧对应的人物信息；In step S102, the audio frame is analyzed to obtain character information corresponding to the audio frame;

该音频帧对应的人物信息可以包括：该音频帧对应的配音人员的称呼、职位、联系方式等至少一项信息。The character information corresponding to the audio frame may include: at least one item of information such as the title, position, and contact information of the dubbing personnel corresponding to the audio frame.

由于不同的音频帧可能对应不同的配音演员，因而，当该音频帧包括多帧时，对应的人物信息可能也有多个。Since different audio frames may correspond to different voice actors, when the audio frame includes multiple frames, there may also be multiple corresponding character information.

另外，该音频帧对应的人物信息和与该音频帧相应的视频帧对应的人物信息可能匹配、也可能不匹配，即该音频帧对应的配音人员和与音频帧相应的视频帧对应的扮演者可以是同一个人或者不是同一个人。In addition, the character information corresponding to the audio frame and the character information corresponding to the video frame corresponding to the audio frame may or may not match, that is, the dubbing person corresponding to the audio frame and the actor corresponding to the video frame corresponding to the audio frame may or may not match. It can be the same person or not.

在步骤S103中，将音频帧对应的人物信息进行提示。In step S103, the character information corresponding to the audio frame is prompted.

在播放该视频时，通过获取该视频中的音频帧，并对该音频帧进行自动分析，可以获取到该音频帧对应的配音演员的信息，进而将音频帧对应的人物信息进行提示，从而使用户在观看该视频时，可以充分了解该视频中的配音演员的具体信息，这有利于进一步提高用户的观看体验，可以克服了相关技术中视频中未标识出配音演员的信息而导致用户无法了解配音演员的缺陷，例如：在观看《功夫熊猫》视频时，在对音频帧进行分析后，可以获取到阿宝父亲的配音演员是成龙，则可以将配音演员——成龙进行提示以增加用户的观影体验。When playing the video, by acquiring the audio frame in the video and automatically analyzing the audio frame, the information of the voice actor corresponding to the audio frame can be obtained, and then the character information corresponding to the audio frame can be prompted, so that the When watching the video, the user can fully understand the specific information of the voice actor in the video, which is conducive to further improving the user's viewing experience, and can overcome the fact that the information of the voice actor is not identified in the video in the related art, which causes the user to be unable to understand. Defects of voice actors, for example: when watching the video of "Kung Fu Panda", after analyzing the audio frames, it can be obtained that the voice actor of A Bao's father is Jackie Chan, then the voice actor - Jackie Chan can be prompted to increase the user's reputation. Movie viewing experience.

其次，在将该音频帧对应的人物信息进行提示时，可以将该音频帧对应的人物信息在片头字幕、片尾字幕中提示，或者在该音频帧对应的视频帧的画面中提示。Secondly, when prompting the character information corresponding to the audio frame, the character information corresponding to the audio frame may be prompted in the credits, or in the picture of the video frame corresponding to the audio frame.

如图2所示，在一个实施例中，方法还包括：As shown in Figure 2, in one embodiment, the method further includes:

在步骤S201中，在播放视频时，获取视频中与音频帧相应的视频帧；In step S201, when playing the video, obtain the video frame corresponding to the audio frame in the video;

与该音频帧相应的视频帧为与该音频帧的播放时间相同的需要同步播放的视频帧(例如：如果该音频帧在该视频中的播放时间为t1—t2，则与该音频帧相应的视频帧的播放时间也为t1—t2)，或者为与该音频帧预先设置了关联关系的视频帧(例如：当该视频中的音频帧与视频帧之间不太同步时，为了避免播放紊乱，音频帧与其相应的视频帧之间预先可能设置了关联关系)。The video frame corresponding to the audio frame is a video frame that needs to be played synchronously with the same playback time of the audio frame (for example: if the playback time of the audio frame in the video is t1-t2, then the corresponding audio frame The playback time of the video frame is also t1-t2), or a video frame with a preset association relationship with the audio frame (for example: when the audio frame in the video is not very synchronized with the video frame, in order to avoid playback disorder , an association relationship may be set in advance between the audio frame and its corresponding video frame).

在步骤S202中，对视频帧进行识别，获得视频帧对应的人物信息；In step S202, the video frame is identified, and the character information corresponding to the video frame is obtained;

另外，步骤S201和步骤S202可以与步骤S101和步骤S102同步执行，也可以在其后执行。In addition, step S201 and step S202 may be performed synchronously with step S101 and step S102, or may be performed thereafter.

上述步骤S103可被执行为：The above step S103 can be performed as:

在步骤A1中，当视频帧对应的人物信息与音频帧对应的人物信息不匹配时，将音频帧对应的人物信息进行提示。In step A1, when the person information corresponding to the video frame does not match the person information corresponding to the audio frame, the person information corresponding to the audio frame is prompted.

在播放该视频时，还可以获取视频中与音频帧相应的视频帧，并对视频帧进行识别，从而获得视频帧对应的人物信息，而当视频帧对应的人物信息与音频帧对应的人物信息不匹配时，说明该音频帧对应的配音演员和视频帧对应的扮演者并不是同一个人，进而说明了该视频帧对应的音频帧的声源源自其他配音演员而非该视频帧对应的扮演者本人，因而，可以将音频帧对应的人物信息进行提示，从而使用户在观看该视频时，可以充分了解该视频中的配音演员的具体信息，以进一步提高用户的观看体验。When the video is played, the video frame corresponding to the audio frame in the video can also be obtained, and the video frame can be identified to obtain the character information corresponding to the video frame, and when the character information corresponding to the video frame and the character information corresponding to the audio frame are obtained When it does not match, it means that the voice actor corresponding to the audio frame and the actor corresponding to the video frame are not the same person, and further explain that the sound source of the audio frame corresponding to the video frame originates from other voice actors rather than the actor corresponding to the video frame. Therefore, the character information corresponding to the audio frame can be prompted, so that the user can fully understand the specific information of the voice actor in the video when watching the video, so as to further improve the user's viewing experience.

另外，当视频帧对应的人物信息与音频帧对应的人物信息相匹配时，说明该视频帧对应的演员和视频帧对应的演员是同一个演员，进而说明了该视频帧对应的音频帧并非源自其他配音演员，而是该视频帧对应的演员本人，又由于视频中的视频帧对应的演员的信息是被携带在视频中的，因而，可以不再将音频帧对应的人物信息进行提示，以避免重复提示。In addition, when the character information corresponding to the video frame matches the character information corresponding to the audio frame, it means that the actor corresponding to the video frame and the actor corresponding to the video frame are the same actor, which further means that the audio frame corresponding to the video frame is not the source From other voice actors, but the actor corresponding to the video frame, and because the information of the actor corresponding to the video frame in the video is carried in the video, the character information corresponding to the audio frame can no longer be prompted. to avoid repeated prompts.

如图3所示，在一个实施例中，上述图2所示的步骤S202可被执行为：As shown in FIG. 3, in one embodiment, step S202 shown in FIG. 2 above may be executed as:

在步骤B1中，将视频帧的图像与至少一个预设图像进行匹配；In step B1, the image of the video frame is matched with at least one preset image;

至少一个预设图像可以来自预存储在本地的图像，或者是网络侧的图像，而这些图像中每个图像的人物信息预先已被识别出。At least one preset image may come from a pre-stored image locally or an image on the network side, and the character information of each image in these images has been identified in advance.

另外，预设图像可以来自特征的图像库，，例如，可以来自影视剧演员图像库、舞台剧演员图像库、电视剧演员图像库、常用的嘉宾图像库等。在将该视频帧的图像与至少一个预设图像进行匹配时，可以将该视频帧的图像与至少一个预设图像中的每个图像进行对比，例如：可以将视频帧的图像的图像参数与至少一个预设图像中的每个图像的图像参数进行对比，其中，该图像参数可以是颜色参数、纹理特征、形状特征等。In addition, the preset image may come from a feature image library, for example, may come from a film and television actor image library, a stage actor image library, a TV drama actor image library, a commonly used guest image library, and the like. When the image of the video frame is matched with the at least one preset image, the image of the video frame can be compared with each image in the at least one preset image, for example, the image parameters of the image of the video frame can be compared with the image parameters of the at least one preset image. Image parameters of each image in the at least one preset image are compared, where the image parameters may be color parameters, texture features, shape features, and the like.

在步骤B2中，当视频帧的图像与至少一个预设图像中的目标图像相匹配时，将目标图像对应的人物信息确定为视频帧对应的人物信息，其中，该目标图像可以是该至少一个预设图像中的任一图像。In step B2, when the image of the video frame matches the target image in the at least one preset image, the character information corresponding to the target image is determined as the character information corresponding to the video frame, wherein the target image may be the at least one Any of the preset images.

当视频帧的图像与至少一个预设图像中的目标图像相匹配时，说明该视频帧的图像与该目标图像的相似度极高，基本可以确定该视频帧的图像中的人物与目标图像中的人物是同一个人物，因而，可以将预先已标注出的该目标图像对应的人物信息自动确定为该视频帧对应的人物信息。When the image of the video frame matches the target image in at least one preset image, it means that the image of the video frame has a very high similarity with the target image, and it can basically be determined that the person in the image of the video frame and the target image The characters in the target image are the same character, so the pre-marked character information corresponding to the target image can be automatically determined as the character information corresponding to the video frame.

如图4所示，在一个实施例中，上述图1中的步骤S102可被执行为：As shown in FIG. 4 , in one embodiment, step S102 in FIG. 1 can be executed as:

在步骤C1中，获取音频帧的声音参数；In step C1, obtain the sound parameter of the audio frame;

其中，该声音参数包括但不限于：响度、音调、音色中的至少一项。Wherein, the sound parameter includes but is not limited to: at least one of loudness, pitch, and timbre.

在步骤C2中，将音频帧的声音参数与至少一个预设声音的声音参数进行匹配；In step C2, the sound parameters of the audio frame are matched with the sound parameters of at least one preset sound;

在将该音频帧的声音参数与至少一个预设声音的声音参数进行匹配时，可以将该音频帧的声音参数与至少一个预设声音的声音参数进行对比。When the sound parameters of the audio frame are matched with the sound parameters of the at least one preset sound, the sound parameters of the audio frame may be compared with the sound parameters of the at least one preset sound.

至少一个预设声音可以包括预存储在本地的声音，或者是网络侧的的声音，而至少一个预设声音中每个声音的人物信息预先已被识别出。The at least one preset voice may include pre-stored voices locally or voices on the network side, and the character information of each voice in the at least one preset voice has been identified in advance.

而该预设声音可以来自特定声音库，例如，可以来自配音演员声音库。The preset sound may come from a specific sound library, for example, a voice actor sound library.

另外，在一实施例中，为了快速地找到匹配的目标声音，在将音频帧的声音参数与至少一个预设声音的声音参数进行匹配时，可以根据音频帧的声音参数先确定出该音频帧对应的人物的年龄范围、性别等人物属性信息，进而，根据这些人物属性信息从该至少一个预设声音中确定出与该人物属性信息相匹配的待选择声音，最后再将该音频帧的声音参数与该待选择声音中各声音的声音参数进行匹配，从而从这些待选择声音中确定出最终的目标声音。In addition, in an embodiment, in order to quickly find the matching target sound, when the sound parameters of the audio frame are matched with the sound parameters of at least one preset sound, the audio frame can be first determined according to the sound parameters of the audio frame. Character attribute information such as the age range and gender of the corresponding character, and then, according to the character attribute information, the to-be-selected voice that matches the character attribute information is determined from the at least one preset voice, and finally the sound of the audio frame is determined. The parameters are matched with the sound parameters of the sounds in the to-be-selected sounds, so that the final target sound is determined from the to-be-selected sounds.

在步骤C3中，当音频帧的声音参数与至少一个预设声音中的目标声音的声音参数相匹配时，将目标声音的人物信息确定为音频帧对应的人物信息。In step C3, when the sound parameter of the audio frame matches the sound parameter of the target sound in the at least one preset sound, the character information of the target sound is determined as the character information corresponding to the audio frame.

当音频帧的声音参数与至少一个预设声音中的目标声音的声音参数相匹配时，说明该音频帧的声音参数与目标声音的声音参数的相似度极高，基本可以确定该音频帧对应的人物与目标声音对应的人物是同一个人，因而，可以将目标声音的人物信息确定为音频帧对应的人物信息。When the sound parameter of the audio frame matches the sound parameter of the target sound in at least one preset sound, it means that the sound parameter of the audio frame is very similar to the sound parameter of the target sound, and it can be basically determined that the sound parameter of the audio frame corresponds to the sound parameter of the target sound. The person and the person corresponding to the target sound are the same person, therefore, the person information of the target sound can be determined as the person information corresponding to the audio frame.

在一个实施例中，声音参数包括：响度、音调、音色中的至少一项。In one embodiment, the sound parameter includes: at least one of loudness, pitch, and timbre.

在一个实施例中，音频帧对应的人物信息包括：音频帧对应的人物的称呼、职位、联系方式中的至少一项信息。In one embodiment, the character information corresponding to the audio frame includes: at least one item of information from the title, position, and contact information of the character corresponding to the audio frame.

该音频帧对应的人物信息为该音频帧的配音人员，该音频帧对应的人物的称呼可以是该音频帧的配音人员的名字、昵称等；The character information corresponding to the audio frame is the dubbing personnel of the audio frame, and the title of the character corresponding to the audio frame can be the name, nickname, etc. of the dubbing personnel of the audio frame;

音频帧对应的人物信息包括但不限于上述至少一项信息，例如，还可以包括该音频帧对应的人物的地址等信息。The character information corresponding to the audio frame includes but is not limited to at least one of the above-mentioned information, for example, information such as the address of the character corresponding to the audio frame may also be included.

对应本公开实施例提供的上述信息提示方法，本公开实施例还提供一种信息提示装置，如图5所示，该装置包括：Corresponding to the above-mentioned information prompting method provided by the embodiment of the present disclosure, the embodiment of the present disclosure further provides an information prompting device, as shown in FIG. 5 , the device includes:

第一获取模块501，被配置为在播放视频时，获取视频中的音频帧；Thefirst acquisition module 501 is configured to acquire audio frames in the video when playing the video;

第二获取模块502，被配置为对音频帧进行分析，获得音频帧对应的人物信息；Thesecond acquisition module 502 is configured to analyze the audio frame to obtain character information corresponding to the audio frame;

提示模块503，被配置为将音频帧对应的人物信息进行提示。The promptingmodule 503 is configured to prompt the character information corresponding to the audio frame.

如图6所示，在一个实施例中，装置还包括：As shown in Figure 6, in one embodiment, the device further includes:

第三获取模块601，被配置为在播放视频时，获取视频中与音频帧相应的视频帧；Thethird acquisition module 601 is configured to acquire video frames corresponding to audio frames in the video when playing the video;

第四获取模块602，被配置为对视频帧进行识别，获得视频帧对应的人物信息；The fourth obtainingmodule 602 is configured to identify the video frame and obtain the character information corresponding to the video frame;

提示模块503可以包括：Theprompt module 503 may include:

第一提示子模块5031，被配置为当视频帧对应的人物信息与音频帧对应的人物信息不匹配时，将音频帧对应的人物信息进行提示。Thefirst prompting sub-module 5031 is configured to prompt the character information corresponding to the audio frame when the character information corresponding to the video frame does not match the character information corresponding to the audio frame.

如图7所示，在一个实施例中，第四获取模块602可以包括：As shown in FIG. 7, in one embodiment, the fourth obtainingmodule 602 may include:

第一匹配子模块6021，被配置为将视频帧的图像与至少一个预设图像进行匹配；The first matching sub-module 6021 is configured to match the image of the video frame with at least one preset image;

第一确定子模块6022，被配置为当视频帧的图像与至少一个预设图像中的目标图像相匹配时，将目标图像对应的人物信息确定为视频帧对应的人物信息。The first determination sub-module 6022 is configured to determine the person information corresponding to the target image as the person information corresponding to the video frame when the image of the video frame matches the target image in the at least one preset image.

如图8所示，在一个实施例中，第二获取模块502可以包括：As shown in FIG. 8, in one embodiment, the second obtainingmodule 502 may include:

获取子模块5021，被配置为获取音频帧的声音参数；Obtaining submodule 5021, configured to obtain the sound parameters of the audio frame;

第二匹配子模块5022，被配置为将音频帧的声音参数与至少一个预设声音的声音参数进行匹配；Thesecond matching submodule 5022 is configured to match the sound parameters of the audio frame with the sound parameters of at least one preset sound;

第二确定子模块5023，被配置为当音频帧的声音参数与至少一个预设声音中的目标声音的声音参数相匹配时，将所述目标声音的人物信息确定为所述音频帧对应的人物信息。Thesecond determination sub-module 5023 is configured to determine the character information of the target sound as the character corresponding to the audio frame when the sound parameter of the audio frame matches the sound parameter of the target sound in at least one preset sound information.

在一个实施例中，声音参数包括：响度、音调、音色中的至少一项。In one embodiment, the sound parameter includes at least one of loudness, pitch, and timbre.

根据本公开实施例的第三方面，提供一种信息提示装置，包括：According to a third aspect of the embodiments of the present disclosure, an information prompting device is provided, including:

处理器；processor;

其中，处理器被配置为：where the processor is configured as:

上述处理器还可被配置为：The above processor can also be configured to:

所述方法还包括：The method also includes:

所述对所述视频帧进行识别，获得所述视频帧对应的人物信息，包括：The identifying the video frame to obtain the character information corresponding to the video frame, including:

所述对所述音频帧进行分析，获得所述音频帧对应的人物信息，包括：The described audio frame is analyzed to obtain character information corresponding to the audio frame, including:

当所述音频帧的声音参数与所述至少一个预设声音参数中的目标声音的声音参数相匹配时，将所述目标声音的人物信息确定为所述音频帧对应的人物信息。When the sound parameter of the audio frame matches the sound parameter of the target sound in the at least one preset sound parameter, the character information of the target sound is determined as the character information corresponding to the audio frame.

所述声音参数包括：响度、音调、音色中的至少一项。The sound parameters include: at least one of loudness, pitch, and timbre.

所述音频帧对应的人物信息包括：所述音频帧对应的人物的称呼、职位、联系方式中的至少一项信息。The character information corresponding to the audio frame includes: at least one item of information from the title, position, and contact information of the character corresponding to the audio frame.

图9是根据一示例性实施例示出的一种用于信息提示装置900的框图，该装置适用于终端设备。例如，装置900可以是移动电话，计算机，数字广播终端，消息收发设备，游戏控制台，平板设备，医疗设备，健身设备，个用户数字助理等。FIG. 9 is a block diagram of anapparatus 900 for prompting information according to an exemplary embodiment, and the apparatus is suitable for a terminal device. For example,apparatus 900 may be a mobile phone, computer, digital broadcast terminal, messaging device, game console, tablet device, medical device, fitness device, personal digital assistant, or the like.

参照图9，装置900可以包括以下一个或至少两个组件：处理组件902，存储器904，电源组件906，多媒体组件908，音频组件910，输入/输出(I/O)接口912，传感器组件914，以及通信组件916。9, theapparatus 900 may include one or at least two of the following components: aprocessing component 902, amemory 904, apower supply component 906, amultimedia component 908, anaudio component 910, an input/output (I/O)interface 912, asensor component 914, And thecommunication component 916 .

处理组件902通常控制装置900的整体操作，诸如与显示，电话呼叫，数据通信，相机操作和记录操作相关联的操作。处理组件902可以包括一个或至少两个处理器920来执行指令，以完成上述的方法的全部或部分步骤。此外，处理组件902可以包括一个或至少两个模块，便于处理组件902和其他组件之间的交互。例如，处理组件902可以包括多媒体模块，以方便多媒体组件908和处理组件902之间的交互。Theprocessing component 902 generally controls the overall operation of theapparatus 900, such as operations associated with display, phone calls, data communications, camera operations, and recording operations. Theprocessing component 902 may include one or at least twoprocessors 920 to execute instructions to perform all or part of the steps of the above-described methods. Additionally,processing component 902 may include one or at least two modules to facilitate interaction betweenprocessing component 902 and other components. For example,processing component 902 may include a multimedia module to facilitate interaction betweenmultimedia component 908 andprocessing component 902.

存储器904被配置为存储各种类型的数据以支持在装置900的操作。这些数据的示例包括用于在装置900上操作的任何存储对象或方法的指令，联系用户数据，电话簿数据，消息，图片，视频等。存储器904可以由任何类型的易失性或非易失性存储设备或者它们的组合实现，如静态随机存取存储器(SRAM)，电可擦除可编程只读存储器(EEPROM)，可擦除可编程只读存储器(EPROM)，可编程只读存储器(PROM)，只读存储器(ROM)，磁存储器，快闪存储器，磁盘或光盘。Memory 904 is configured to store various types of data to support operations atdevice 900 . Examples of such data include instructions for any storage object or method operating ondevice 900, contact user data, phonebook data, messages, pictures, videos, and the like.Memory 904 may be implemented by any type of volatile or non-volatile storage device or combination thereof, such as static random access memory (SRAM), electrically erasable programmable read only memory (EEPROM), erasable Programmable Read Only Memory (EPROM), Programmable Read Only Memory (PROM), Read Only Memory (ROM), Magnetic Memory, Flash Memory, Magnetic or Optical Disk.

电源组件906为装置900的各种组件提供电源。电源组件906可以包括电源管理系统，一个或至少两个电源，及其他与为装置900生成、管理和分配电源相关联的组件。Power supply assembly 906 provides power to the various components ofdevice 900 .Power supply components 906 may include a power management system, one or at least two power supplies, and other components associated with generating, managing, and distributing power todevice 900 .

多媒体组件908包括在所述装置900和用户之间的提供一个输出接口的屏幕。在一些实施例中，屏幕可以包括液晶显示器(LCD)和触摸面板(TP)。如果屏幕包括触摸面板，屏幕可以被实现为触摸屏，以接收来自用户的输入信号。触摸面板包括一个或至少两个触摸传感器以感测触摸、滑动和触摸面板上的手势。所述触摸传感器可以不仅感测触摸或滑动动作的边界，而且还检测与所述触摸或滑动操作相关的持续时间和压力。在一些实施例中，多媒体组件908包括一个前置摄像头和/或后置摄像头。当装置900处于操作模式，如拍摄模式或视频模式时，前置摄像头和/或后置摄像头可以接收外部的多媒体数据。每个前置摄像头和后置摄像头可以是一个固定的光学透镜系统或具有焦距和光学变焦能力。Multimedia component 908 includes a screen that provides an output interface between thedevice 900 and the user. In some embodiments, the screen may include a liquid crystal display (LCD) and a touch panel (TP). If the screen includes a touch panel, the screen may be implemented as a touch screen to receive input signals from a user. The touch panel includes one or at least two touch sensors to sense touch, swipe, and gestures on the touch panel. The touch sensor may not only sense the boundaries of a touch or swipe action, but also detect the duration and pressure associated with the touch or swipe action. In some embodiments, themultimedia component 908 includes a front-facing camera and/or a rear-facing camera. When theapparatus 900 is in an operation mode, such as a shooting mode or a video mode, the front camera and/or the rear camera may receive external multimedia data. Each of the front and rear cameras can be a fixed optical lens system or have focal length and optical zoom capability.

音频组件910被配置为输出和/或输入音频信号。例如，音频组件910包括一个麦克风(MIC)，当装置900处于操作模式，如呼叫模式、记录模式和响度识别模式时，麦克风被配置为接收外部音频信号。所接收的音频信号可以被进一步存储在存储器904或经由通信组件916发送。在一些实施例中，音频组件910还包括一个扬声器，用于输出音频信号。Audio component 910 is configured to output and/or input audio signals. For example,audio component 910 includes a microphone (MIC) that is configured to receive external audio signals whendevice 900 is in operating modes, such as call mode, record mode, and loudness recognition mode. The received audio signal may be further stored inmemory 904 or transmitted viacommunication component 916 . In some embodiments,audio component 910 also includes a speaker for outputting audio signals.

I/O接口912为处理组件902和外围接口模块之间提供接口，上述外围接口模块可以是键盘，点击轮，按钮等。这些按钮可包括但不限于：主页按钮、音量按钮、启动按钮和锁定按钮。The I/O interface 912 provides an interface between theprocessing component 902 and a peripheral interface module, which may be a keyboard, a click wheel, a button, or the like. These buttons may include, but are not limited to: home button, volume buttons, start button, and lock button.

传感器组件914包括一个或至少两个传感器，用于为装置900提供各个方面的状态评估。例如，传感器组件914可以检测到装置900的打开/关闭状态，组件的相对定位，例如所述组件为装置900的显示器和小键盘，传感器组件914还可以检测装置900或装置900一个组件的位置改变，用户与装置900接触的存在或不存在，装置900方位或加速/减速和装置900的温度变化。传感器组件914可以包括接近传感器，被配置用来在没有任何的物理接触时检测附近物体的存在。传感器组件914还可以包括光传感器，如CMOS或CCD图像传感器，用于在成像应用中使用。在一些实施例中，该传感器组件914还可以包括加速度传感器，陀螺仪传感器，磁传感器，压力传感器或温度传感器。Sensor assembly 914 includes one or at least two sensors for providing status assessment of various aspects ofdevice 900 . For example, thesensor assembly 914 can detect the open/closed state of thedevice 900, the relative positioning of components, such as the display and keypad of thedevice 900, and thesensor assembly 914 can also detect a change in the position of thedevice 900 or a component of thedevice 900 , the presence or absence of user contact with thedevice 900 , the orientation or acceleration/deceleration of thedevice 900 and the temperature change of thedevice 900 .Sensor assembly 914 may include a proximity sensor configured to detect the presence of nearby objects in the absence of any physical contact.Sensor assembly 914 may also include a light sensor, such as a CMOS or CCD image sensor, for use in imaging applications. In some embodiments, thesensor assembly 914 may also include an acceleration sensor, a gyroscope sensor, a magnetic sensor, a pressure sensor, or a temperature sensor.

通信组件916被配置为便于装置900和其他设备之间有线或无线方式的通信。装置900可以接入基于通信标准的无线网络，如WiFi，2G或3G，或它们的组合。在一个示例性实施例中，通信组件916经由广播信道接收来自外部广播管理系统的广播信号或广播相关信息。在一个示例性实施例中，所述通信组件916还包括近场通信(NFC)模块，以促进短程通信。例如，在NFC模块可基于射频识别(RFID)技术，红外数据协会(IrDA)技术，超宽带(UWB)技术，蓝牙(BT)技术和其他技术来实现。Communication component 916 is configured to facilitate wired or wireless communication betweenapparatus 900 and other devices.Device 900 may access wireless networks based on communication standards, such as WiFi, 2G or 3G, or a combination thereof. In one exemplary embodiment, thecommunication component 916 receives broadcast signals or broadcast related information from an external broadcast management system via a broadcast channel. In an exemplary embodiment, thecommunication component 916 also includes a near field communication (NFC) module to facilitate short-range communication. For example, the NFC module may be implemented based on radio frequency identification (RFID) technology, infrared data association (IrDA) technology, ultra-wideband (UWB) technology, Bluetooth (BT) technology and other technologies.

在示例性实施例中，装置900可以被一个或至少两个应用专用集成电路(ASIC)、数字信号处理器(DSP)、数字信号处理设备(DSPD)、可编程逻辑器件(PLD)、现场可编程门阵列(FPGA)、控制器、微控制器、微处理器或其他电子组件实现，用于执行上述方法。In an exemplary embodiment,apparatus 900 may be implemented by one or at least two application specific integrated circuits (ASICs), digital signal processors (DSPs), digital signal processing devices (DSPDs), programmable logic devices (PLDs), field programmable A programmed gate array (FPGA), controller, microcontroller, microprocessor or other electronic component implementation is used to perform the above method.

在示例性实施例中，还提供了一种包括指令的非临时性计算机可读存储介质，例如包括指令的存储器904，上述指令可由装置900的处理器920执行以完成上述方法。例如，所述非临时性计算机可读存储介质可以是ROM、随机存取存储器(RAM)、CD-ROM、磁带、软盘和光数据存储设备等。In an exemplary embodiment, there is also provided a non-transitory computer-readable storage medium including instructions, such as amemory 904 including instructions, executable by theprocessor 920 of theapparatus 900 to perform the method described above. For example, the non-transitory computer-readable storage medium may be ROM, random access memory (RAM), CD-ROM, magnetic tape, floppy disk, optical data storage device, and the like.

一种非临时性计算机可读存储介质，当所述存储介质中的指令由上述装置900的处理器执行时，使得上述装置900能够执行一种信息提示方法，包括：A non-transitory computer-readable storage medium, when the instructions in the storage medium are executed by the processor of the above-mentionedapparatus 900, the above-mentionedapparatus 900 can execute an information prompting method, comprising:

本领域技术用户员在考虑说明书及实践这里公开的公开后，将容易想到本公开的其它实施方案。本申请旨在涵盖本公开的任何变型、用途或者适应性变化，这些变型、用途或者适应性变化遵循本公开的一般性原理并包括本公开未公开的本技术领域中的公知常识或惯用技术手段。说明书和实施例仅被视为示例性的，本公开的真正范围和精神由下面的权利要求指出。Other embodiments of the present disclosure will readily occur to those skilled in the art upon consideration of the specification and practice of the disclosure disclosed herein. This application is intended to cover any variations, uses, or adaptations of the present disclosure that follow the general principles of the present disclosure and include common knowledge or techniques in the technical field not disclosed by the present disclosure . The specification and examples are to be regarded as exemplary only, with the true scope and spirit of the disclosure being indicated by the following claims.

应当理解的是，本公开并不局限于上面已经描述并在附图中示出的精确结构，并且可以在不脱离其范围进行各种修改和改变。本公开的范围仅由所附的权利要求来限制。It is to be understood that the present disclosure is not limited to the precise structures described above and illustrated in the accompanying drawings, and that various modifications and changes may be made without departing from the scope thereof. The scope of the present disclosure is limited only by the appended claims.