CN105939250A

Info

Publication number: CN105939250A
Application number: CN201610357380.3A
Authority: CN
Inventors: 韩旭; 赖卫华
Original assignee: Meizu Technology Co Ltd
Current assignee: Meizu Technology Co Ltd
Priority date: 2016-05-25
Filing date: 2016-05-25
Publication date: 2016-09-14

Abstract

The invention relates to an audio processing method and apparatus. The audio processing method comprises the following steps: acquiring at least two original audio files for communicating with contacts in an application; acquiring an audio synthesis instruction; and responding to the audio synthesis instruction, and synthesizing the acquired at least two original audio files into a target audio file according to a preset rule. A plurality of audio files in an instant messaging application are acquired, these audio files are synthesized into the target audio file, and after the target audio file is synthesized, effective voice messages are more concentrated, thereby being convenient for the management and storage of a user.

Description

Audio processing method and apparatus

Technical field

The present invention relates to the technical field of audio file processing, and in particular to an audio processing method and apparatus.

Background

With the development of the mobile Internet, instant messaging tools of all kinds have come into wide use and have promoted communication between people. A traditional instant messaging tool includes a voice chat function, in particular short-voice chat: one end generates an audio file by recording speech and sends it to the other end, so that the user can hear a friend's voice message, which enhances interaction between friends. However, the short-voice audio files of these traditional instant messaging tools are numerous and are stored under different file directories, which makes it difficult for the user to classify the audio files and manage them conveniently.

Summary of the invention

In view of this, it is necessary to provide an audio processing method and apparatus that address the above technical problem of audio files being difficult to manage.

An audio processing method comprises the following steps:

obtaining at least two original audio files of communication with contacts in an application;

obtaining an audio synthesis instruction;

in response to the audio synthesis instruction, synthesizing the obtained at least two original audio files into a target audio file according to a preset rule.

In one embodiment, the step of synthesizing, in response to the audio synthesis instruction, the obtained at least two original audio files into a target audio file according to a preset rule includes:

obtaining the generation time of each original audio file;

sorting the corresponding at least two original audio files in order of generation time and synthesizing them in that order into the target audio file.

In one embodiment, the step of obtaining at least two original audio files of communication with contacts in an application includes:

receiving, in the application, voice messages from the same contact or from different contacts;

obtaining a play instruction, and playing the voice messages in response to the play instruction;

recording the playback sound of at least two of the voice messages from the same contact or from different contacts, to generate at least two original audio files.

In one embodiment, the step of obtaining at least two original audio files of communication with contacts in an application includes:

obtaining an audio selection instruction;

in response to the audio selection instruction, obtaining at least two original audio files of communication with the same contact or with different contacts in the application.

In one embodiment, after the step of synthesizing, in response to the audio synthesis instruction, the obtained at least two original audio files into a target audio file according to a preset rule, the method further includes:

obtaining a text information generation instruction and, in response to that instruction, reading the target audio file, recognizing its content, and generating text information.

An audio processing method, including:

obtaining a first audio file and a second text file of communication with a contact in an application;

obtaining a file synthesis instruction;

in response to the file synthesis instruction, converting the first audio file into a third text file;

synthesizing the second text file and the third text file into a target text file according to a preset rule.

An audio processing apparatus, including:

an original audio acquisition module, configured to obtain at least two original audio files of communication with contacts in an application;

a synthesis instruction acquisition module, configured to obtain an audio synthesis instruction;

a target audio synthesis module, configured to synthesize, in response to the audio synthesis instruction, the obtained at least two original audio files into a target audio file according to a preset rule.

In one embodiment, the target audio synthesis module includes:

a generation time acquisition submodule, configured to obtain the generation time of each original audio file;

a sequential synthesis submodule, configured to sort the corresponding at least two original audio files in order of generation time and synthesize them in that order into the target audio file.

In one embodiment, the original audio acquisition module includes:

a message receiving submodule, configured to receive, in the application, voice messages from the same contact or from different contacts;

a message playing submodule, configured to obtain a play instruction and play the voice messages in response to the play instruction;

a sound recording submodule, configured to record the playback sound of at least two of the voice messages from the same contact or from different contacts, to generate at least two original audio files.

In one embodiment, the original audio acquisition module includes:

an audio selection submodule, configured to obtain an audio selection instruction;

an original audio acquisition submodule, configured to obtain, in response to the audio selection instruction, at least two original audio files of communication with the same contact or with different contacts in the application.

In one embodiment, the apparatus further includes:

a text information generation module, configured to obtain a text information generation instruction and, in response to that instruction, read the target audio file, recognize its content, and generate text information.

An audio processing apparatus, including:

a file acquisition module, configured to obtain a first audio file and a second text file of communication with a contact in an application;

a synthesis instruction acquisition module, configured to obtain a file synthesis instruction;

a file conversion module, configured to convert, in response to the file synthesis instruction, the first audio file into a third text file;

a target text synthesis module, configured to synthesize the second text file and the third text file into a target text file according to a preset rule.

The above audio processing method and apparatus obtain multiple audio files in an instant messaging application and synthesize them into a target audio file. After synthesis, the target audio file concentrates the useful voice messages in one place, which makes them easier for the user to manage and save.

Brief description of the drawings

Fig. 1A is a schematic flowchart of the audio processing method of one embodiment;

Fig. 1B is a schematic flowchart of the audio processing method of another embodiment;

Fig. 1C is a schematic flowchart of the audio processing method of another embodiment;

Fig. 1D is a schematic flowchart of the audio processing method of another embodiment;

Fig. 1E is a schematic flowchart of the audio processing method of another embodiment;

Fig. 2 is a schematic flowchart of the audio processing method of one embodiment;

Fig. 3A is a module block diagram of the audio processing apparatus of one embodiment;

Fig. 3B is a module block diagram of the target audio synthesis module of one embodiment;

Fig. 3C is a module block diagram of the original audio acquisition module of one embodiment;

Fig. 3D is a module block diagram of the original audio acquisition module of another embodiment;

Fig. 3E is a module block diagram of the audio processing apparatus of another embodiment;

Fig. 4 is a module block diagram of the audio processing apparatus of one embodiment;

Fig. 5A is a schematic diagram of the interactive interface of the application of one embodiment;

Fig. 5B is a schematic diagram of the interface after a voice message of the application of one embodiment receives a long-press instruction;

Fig. 5C is a schematic diagram of the voice message selection interface of the application of one embodiment;

Fig. 5D is a schematic diagram of the interface of the application of one embodiment with the merge confirmation button popped up;

Fig. 5E is a schematic diagram of the interface for generating text information of one embodiment;

Fig. 6 is a schematic diagram of the contact selection interface of the application of one embodiment.

Detailed description

To facilitate understanding of the present invention, the invention is described more fully below with reference to the relevant drawings, which show preferred embodiments of the invention. The invention can, however, be implemented in many different forms and is not limited to the embodiments described herein. Rather, these embodiments are provided so that the disclosure of the invention will be understood more thoroughly and completely.

As shown in Fig. 1A, in one embodiment, an audio processing method is provided, comprising the following steps:

Step 120: obtain at least two original audio files of communication with contacts in an application.

In this embodiment, the application is an instant messaging application that enables communication between two or more parties, who exchange information through it. The exchanged information includes voice information, text information, image information, video information, and so on. Specifically, a contact is a party or object with which the user interacts in the application; for example, a contact may be a friend in the application, or a member of a group in the application. The contacts may be the same contact or different contacts; that is, the original audio files may be obtained from at least two voice messages sent by the same contact, or from at least two voice messages sent by different contacts.

In this embodiment, an original audio file is the audio file corresponding to a voice message exchanged through the instant messaging application; the original audio file contains the voice message of the communication with the contact.

Step 140: obtain an audio synthesis instruction.

Specifically, in this step the user's audio synthesis instruction is obtained; the audio synthesis instruction is used to synthesize the at least two original audio files.

Step 160: in response to the audio synthesis instruction, synthesize the obtained at least two original audio files into a target audio file according to a preset rule.

In this embodiment, in response to the audio synthesis instruction, the obtained at least two original audio files are synthesized into a target audio file according to a preset rule. The synthesized target audio file concentrates the voice messages in one place. When there are many voice messages, this helps with managing and saving them and makes the association between the voice messages stronger.

As shown in Fig. 1B, in one embodiment, step 160 includes:

Step 162: obtain the generation time of each original audio file.

In this embodiment, an original audio file may be obtained when the application receives a voice message during interaction, that is, the original audio file is generated locally when the voice message is received; it may also be generated by recording sound. The generation time of an original audio file is the recording time of the corresponding voice message, or the time at which the corresponding voice message was received.

Step 164: sort the corresponding at least two original audio files in order of generation time and synthesize them in that order into the target audio file.

Specifically, when the target audio file is synthesized, the original audio files are sorted in order of their generation times and synthesized in that order into the target audio file, so that the order in which the voice messages play in the target audio file corresponds to the generation times of the original audio files. The voice messages in the target audio file are thus arranged in order, which makes the target audio file easier to follow.
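The ordering in steps 162 and 164 can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: it assumes the original audio files are uncompressed WAV files that share one sample format, that each generation time is available as a Unix timestamp, and that plain concatenation is an acceptable preset rule. The names `OriginalAudio` and `synthesize_target_audio` are hypothetical.

```python
import wave
from dataclasses import dataclass


@dataclass
class OriginalAudio:
    path: str          # location of the WAV file (assumed format)
    created_at: float  # generation time: recording or receipt time, Unix timestamp


def synthesize_target_audio(originals, target_path):
    """Concatenate the originals into one WAV file, ordered by generation time.

    Returns the source paths in the order they were written.
    """
    if len(originals) < 2:
        raise ValueError("at least two original audio files are required")

    # Step 162/164: order the files by their generation time.
    ordered = sorted(originals, key=lambda a: a.created_at)

    params = None
    chunks = []
    for audio in ordered:
        with wave.open(audio.path, "rb") as src:
            fmt = (src.getnchannels(), src.getsampwidth(), src.getframerate())
            if params is None:
                params = fmt
            elif fmt != params:
                # Assumption: no resampling; mismatched formats are rejected.
                raise ValueError(f"{audio.path} has a different audio format")
            chunks.append(src.readframes(src.getnframes()))

    # Write all frames, in generation-time order, into the target file.
    with wave.open(target_path, "wb") as dst:
        dst.setnchannels(params[0])
        dst.setsampwidth(params[1])
        dst.setframerate(params[2])
        for chunk in chunks:
            dst.writeframes(chunk)

    return [a.path for a in ordered]
```

Compressed formats commonly used for voice messages would need decoding first; that step is outside this sketch.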

As shown in Fig. 1C, in one embodiment, step 120 includes:

Step 122: receive, in the application, voice messages from the same contact or from different contacts.

Specifically, during interaction in the instant messaging application, voice messages from the same contact or from different contacts are received in the application. In one embodiment, when communicating with one contact, multiple voice messages from that contact are received. In another embodiment, during group interaction in the application, voice messages from different contacts are received. In a further embodiment, the user communicates with different contacts, simultaneously or at different times, and receives the voice messages of these contacts respectively.

Step 124: obtain a play instruction, and play the voice message in response to the play instruction.

In this embodiment, after a voice message is received, the play instruction for that voice message is obtained, and the voice message is played in response. In one embodiment, a click on the entry corresponding to the voice message in the interactive interface of the application is obtained, and the voice message is played.

Step 126: record the playback sound of at least two of the voice messages from the same contact or from different contacts, to generate at least two original audio files.

Specifically, while a voice message plays, its playback sound is recorded to generate an original audio file. Because this embodiment generates the original audio file by recording the playback sound of the voice message, there is no need to look up the storage address of the original audio file, which can effectively improve the efficiency of obtaining original audio files.

The original audio files may be recorded according to a recording instruction, or recorded automatically while a voice message plays. In one embodiment, each time a voice message is played, one corresponding original audio file is recorded and generated. In another embodiment, the original audio file is recorded and generated according to a recording instruction: for example, a play instruction is obtained, the voice message is played in response, a recording instruction is obtained, and the playback sound of the voice message is recorded according to the recording instruction. In yet another embodiment, a selection instruction is obtained, a voice message is selected according to the selection instruction, the voice message is played, and its playback sound is recorded.

As shown in Fig. 1D, in one embodiment, step 120 includes:

Step 120a: obtain an audio selection instruction.

Specifically, the audio selection instruction is used to select and determine the original audio files to be synthesized. The original audio files may be obtained from the storage folder corresponding to the application, by looking up the storage addresses of the original audio files, or by clicking the display entries of the corresponding messages in the interactive interface of the application.

Step 120b: in response to the audio selection instruction, obtain at least two original audio files of communication with the same contact or with different contacts in the application.

In this embodiment, the voice messages in the synthesized target audio file may come from the same contact or from different contacts, and the different contacts may belong to the same group or to different groups. This makes the management of voice messages more flexible and makes the voice messages in the synthesized target audio file more closely related.
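The selection in steps 120a and 120b can be illustrated with a small sketch. It assumes that the audio selection instruction arrives as a set of message identifiers chosen in the interface; the `VoiceMessage` record and the `select_original_audio` function are hypothetical names, not part of the described method.

```python
from dataclasses import dataclass
from typing import Iterable, List, Optional, Set


@dataclass(frozen=True)
class VoiceMessage:
    message_id: str
    contact: str              # sender of the voice message
    group: Optional[str]      # group the message was sent in, or None for a direct chat
    audio_path: str           # storage address of the original audio file


def select_original_audio(messages: Iterable[VoiceMessage],
                          selected_ids: Set[str]) -> List[VoiceMessage]:
    """Return the messages named by the selection, in their listed order.

    The selection may mix contacts and groups freely; the only constraint
    taken from the text is that at least two files are obtained.
    """
    chosen = [m for m in messages if m.message_id in selected_ids]
    if len(chosen) < 2:
        raise ValueError("at least two original audio files must be selected")
    return chosen
```

Keeping the contact and group on each record is a design choice for the sketch: it lets later steps label or group the merged audio without another lookup.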

As shown in Fig. 1E, in one embodiment, the method further includes, after step 160:

Step 180: obtain a text information generation instruction and, in response to that instruction, read the target audio file, recognize its content, and generate text information.

In this embodiment, a text information generation instruction is obtained; this instruction is used to convert an audio file into a text file. Specifically, after the text information generation instruction is obtained, the target audio file is read in response, the voice messages in the target audio file are converted into text information, and a text file is generated. The voice messages thereby become visible as text, which improves the readability of the messages and makes message management more convenient.
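Step 180 can be sketched as below. The text does not name a speech recognition engine, so the recognizer is passed in as a caller-supplied function; `generate_text_file` and the `.txt` naming convention are assumptions made for illustration only.

```python
from pathlib import Path
from typing import Callable


def generate_text_file(target_audio_path: str,
                       recognize: Callable[[bytes], str]) -> str:
    """Read the target audio file, recognize it, and save the text beside it.

    `recognize` is a hypothetical stand-in for whatever speech-to-text
    engine is available; it receives the raw file bytes and returns text.
    Returns the path of the generated text file.
    """
    audio = Path(target_audio_path)
    text = recognize(audio.read_bytes())

    # Assumption: the text file shares the audio file's name with a .txt suffix.
    text_path = audio.with_suffix(".txt")
    text_path.write_text(text, encoding="utf-8")
    return str(text_path)
```

Passing the recognizer as a parameter keeps the sketch independent of any particular engine and makes it easy to exercise with a stub.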

As shown in Fig. 2, in one embodiment, an audio processing method is provided, including:

Step 220: obtain a first audio file and a second text file of communication with a contact in an application.

Specifically, the first audio file is the audio file corresponding to a voice message exchanged through the instant messaging application, and the second text file is the text file corresponding to a text message exchanged through the instant messaging application. In this step, the first audio file and the second text file may be obtained by clicking to select message entries in the interactive interface of the application, or through the storage addresses corresponding to the first audio file and the second text file.

Step 240: obtain a file synthesis instruction.

Specifically, a file synthesis instruction is used to synthesize at least two files of the same type or at least two files of different types. In this embodiment, the file synthesis instruction is used to synthesize an audio file and a text file.

Step 260: in response to the file synthesis instruction, convert the first audio file into a third text file.

In this embodiment, in response to the file synthesis instruction, the first audio file is first converted into a third text file. Specifically, the audio information of the first audio file is read and recognized to generate text information, and a third text file containing that text information is generated.

Step 280: synthesize the second text file and the third text file into a target text file according to a preset rule.

In this embodiment, after the third text file is generated, the second text file and the third text file are synthesized into a target text file according to a preset rule.

By synthesizing an audio file and a text file into a target text file, this embodiment makes the voice messages visible as text, improves the readability of the messages, and makes message management more convenient.
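Steps 220 to 280 can be sketched as follows. The text leaves the preset rule open; this sketch assumes chronological interleaving by message timestamp and a caller-supplied recognizer. `TextEntry`, `AudioEntry`, and `synthesize_target_text` are hypothetical names.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class TextEntry:
    timestamp: float  # time the message was exchanged (Unix timestamp, assumed)
    sender: str
    text: str


@dataclass
class AudioEntry:
    timestamp: float
    sender: str
    audio_path: str   # storage address of the first audio file


def synthesize_target_text(second_text: List[TextEntry],
                           first_audio: List[AudioEntry],
                           recognize: Callable[[str], str]) -> str:
    """Convert the audio entries to text, then merge them with the text entries."""
    # Step 260: convert each audio entry into "third text" via the recognizer,
    # a hypothetical speech-to-text function taking a file path.
    third_text = [
        TextEntry(a.timestamp, a.sender, recognize(a.audio_path))
        for a in first_audio
    ]

    # Step 280: the assumed preset rule is ordering by timestamp.
    merged = sorted(second_text + third_text, key=lambda e: e.timestamp)
    return "\n".join(f"{e.sender}: {e.text}" for e in merged)
```

Other preset rules, such as grouping by contact, would only change the sort key or the final formatting.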

Fig. 5A shows an application scenario of the audio processing method of a specific embodiment. During the user's interaction, multiple voice messages 502 are received and displayed as entries in the interactive interface 504 of the application. The application may be installed on a desktop computer, or on a mobile terminal such as a smartphone, a tablet computer, a personal digital assistant, or a smart wearable electronic device. As shown in Fig. 5B, when the entry corresponding to a voice message 502 receives a long-press instruction, a voice merge prompt button 506 pops up, prompting whether to merge voice messages. After the voice merge prompt button 506 is triggered, as shown in Fig. 5C, the user clicks the entries of the corresponding voice messages 502 to select them. After the voice messages are selected, as shown in Fig. 5D, a merge confirmation button 508 pops up; after the confirmation button 509 is triggered, the synthesis instruction is obtained and the selected voice messages are synthesized into a target audio file. As shown in Fig. 5E, after the text information generation instruction is obtained, text information 510 is generated.

In another embodiment, as shown in Fig. 6, the user can select voice messages of different contacts 602 for synthesis. For example, the user can select multiple voice messages of the same contact for synthesis, or select any one or more voice messages of one contact together with any one or more voice messages of another contact.

As shown in Fig. 3A, in one embodiment, an audio processing apparatus is provided, including:

an original audio acquisition module 310, configured to obtain at least two original audio files of communication with contacts in an application;

a synthesis instruction acquisition module 330, configured to obtain an audio synthesis instruction;

a target audio synthesis module 350, configured to synthesize, in response to the audio synthesis instruction, the obtained at least two original audio files into a target audio file according to a preset rule.

As shown in Fig. 3B, in one embodiment, the target audio synthesis module 350 specifically includes:

a generation time acquisition submodule 351, configured to obtain the generation time of each original audio file;

a sequential synthesis submodule 353, configured to sort the corresponding at least two original audio files in order of generation time and synthesize them in that order into the target audio file.

As shown in Fig. 3C, in one embodiment, the original audio acquisition module 310 includes:

a message receiving submodule 311, configured to receive, in the application, voice messages from the same contact or from different contacts;

a message playing submodule 313, configured to obtain a play instruction and play the voice messages in response to the play instruction;

a sound recording submodule 315, configured to record the playback sound of at least two of the voice messages from the same contact or from different contacts, to generate at least two original audio files.

As shown in Fig. 3D, in one embodiment, the original audio acquisition module 310 includes:

an audio selection submodule 317, configured to obtain an audio selection instruction;

an original audio acquisition submodule 319, configured to obtain, in response to the audio selection instruction, at least two original audio files of communication with the same contact or with different contacts in the application.

As shown in Fig. 3E, in one embodiment, the apparatus further includes:

a text information generation module 370, configured to obtain a text information generation instruction and, in response to that instruction, read the target audio file, recognize its content, and generate text information.

As shown in Fig. 4, in one embodiment, an audio processing apparatus is provided, including:

a file acquisition module 410, configured to obtain a first audio file and a second text file of communication with a contact in an application;

a synthesis instruction acquisition module 430, configured to obtain a file synthesis instruction;

a file conversion module 450, configured to convert, in response to the file synthesis instruction, the first audio file into a third text file;

a target text synthesis module 470, configured to synthesize the second text file and the third text file into a target text file according to a preset rule.

It should be noted that the modules included in the above apparatus embodiments are divided according to functional logic, but the division is not limited to the above, as long as the corresponding functions can be realized. In addition, the specific names of the functional modules are only for ease of distinguishing them from one another and do not limit the protection scope of the present invention.

In addition, a person of ordinary skill in the art will understand that all or part of the steps of the methods in the above embodiments can be completed by a program instructing the relevant hardware, and the corresponding program can be stored in a readable storage medium.

The technical features of the embodiments described above can be combined arbitrarily. For brevity, not all possible combinations of the technical features in the above embodiments are described; however, as long as a combination of these technical features involves no contradiction, it should be considered within the scope of this specification.

The embodiments described above express only several implementations of the present invention. Their description is relatively specific and detailed, but it should not therefore be construed as limiting the scope of the patent. It should be noted that a person of ordinary skill in the art can make several variations and improvements without departing from the inventive concept, and these all fall within the protection scope of the present invention. Therefore, the protection scope of this patent shall be subject to the appended claims.

Claims

1. An audio processing method, characterized by comprising:

obtaining an audio synthesis instruction;

The audio processing method according to claim 1, characterized in that the step of synthesizing, in response to the audio synthesis instruction, the obtained at least two original audio files into a target audio file according to a preset rule comprises:

obtaining the generation time of each original audio file;

The audio processing method according to claim 1, characterized in that the step of obtaining at least two original audio files of communication with contacts in an application comprises:

obtaining an audio selection instruction;

The audio processing method according to claim 1, characterized in that, after the step of synthesizing, in response to the audio synthesis instruction, the obtained at least two original audio files into a target audio file according to a preset rule, the method further comprises:

6. An audio processing method, characterized by comprising:

obtaining a file synthesis instruction;

7. An audio processing apparatus, characterized by comprising:

The audio processing apparatus according to claim 1, characterized in that the target audio synthesis module comprises:

The audio processing apparatus according to claim 1, characterized in that the original audio acquisition module comprises:

The audio processing apparatus according to claim 1, characterized in that the original audio acquisition module comprises:

an audio selection submodule, configured to obtain an audio selection instruction;

11. The audio processing apparatus according to claim 1, characterized by further comprising:

12. An audio processing apparatus, characterized by comprising: