Movatterモバイル変換


[0]ホーム

URL:


CN107886957A - Voice wake-up method and device combined with voiceprint recognition - Google Patents

Voice wake-up method and device combined with voiceprint recognition
Download PDF

Info

Publication number
CN107886957A
CN107886957ACN201711145883.5ACN201711145883ACN107886957ACN 107886957 ACN107886957 ACN 107886957ACN 201711145883 ACN201711145883 ACN 201711145883ACN 107886957 ACN107886957 ACN 107886957A
Authority
CN
China
Prior art keywords
voice
verified
preset
vector
authority credentials
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201711145883.5A
Other languages
Chinese (zh)
Inventor
陈东鹏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Speakin Technologies Co ltd
Original Assignee
Speakin Technologies Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Speakin Technologies Co ltdfiledCriticalSpeakin Technologies Co ltd
Priority to CN201711145883.5ApriorityCriticalpatent/CN107886957A/en
Publication of CN107886957ApublicationCriticalpatent/CN107886957A/en
Pendinglegal-statusCriticalCurrent

Links

Classifications

Landscapes

Abstract

The embodiment of the invention discloses a voice awakening method and device combined with voiceprint recognition. The method comprises the steps of judging whether the content of the voice to be verified is a preset awakening word or not through the MFCC characteristics of the voice to be verified, if so, extracting an i-vector through a preset deep neural network model, carrying out voiceprint recognition through the i-vector to confirm the identity of a speaker, acquiring the authority value of the voice to be verified, judging whether the speaker has enough authority or not according to the comparison result of the authority value of the speaker and the authority value corresponding to the preset awakening word corresponding to the voice to be verified, and if so, executing the operation corresponding to the preset awakening word corresponding to the voice to be verified.

Description

The voice awakening method and device of a kind of combination Application on Voiceprint Recognition
Technical field
The present invention relates to the voice awakening method and device of vocal print application field, more particularly to a kind of combination Application on Voiceprint Recognition.
Background technology
Voice, which wakes up, refers to user by saying a default wake-up word to realize that electronic equipment is extensive from holding stateNormal operating conditions is arrived again.Waken up by voice, user can be called out in the case where being inconvenient to click on electronic curtain by voiceFunction of waking up realizes the operation to electronic equipment.
But the current electronic product for possessing voice arousal function lacks the function to speaker's identity identification, can not sentenceDisconnected speaker's identity, therefore authority can not be further opened, it can only realize that some are simple, the equipment without user right is graspedMake.
Therefore, the voice arousal function that result in Current electronic product lacks subscription authentication function, can not carry out user's mirrorWeigh to realize the technical problem of equipment operation that is more complicated and needing user right.
The content of the invention
The invention provides a kind of voice awakening method of combination Application on Voiceprint Recognition and device, solves Current electronic productVoice arousal function lacks subscription authentication function, can not carry out subscription authentication to realize more complicated and need setting for user rightThe technical problem of standby operation.
The invention provides a kind of voice awakening method of combination Application on Voiceprint Recognition, including:
S1:Receive voice to be verified and carry out feature extraction, obtain the MFCC features of voice to be verified;
S2:The MFCC features of voice to be verified in predetermined period are cached;
S3:Whether the content that voice to be verified is judged according to the MFCC features of the voice to be verified of caching is preset wake-upWord, if so, then performing step S4;
S4:The MFCC features of the voice to be verified of caching are inputted in preset deep neural network model, obtained to be testedDemonstrate,prove the i-vector vectors of voice;
S5:The preset i-vector vectors of the i-vector vector sums of voice to be verified are compared, according to comparingWhether the matching fraction gone out obtains the authority credentials of voice to be verified, judge the authority credentials of voice to be verified more than or equal to be verifiedAuthority credentials corresponding to preset wake-up word corresponding to voice, if so, then performing preset wake-up word corresponding with voice to be verifiedCorresponding operation.
Preferably, step S4 is specifically included:
S41:The MFCC features of the voice to be verified of caching are cascaded;
S42:MFCC features after cascade are inputted in preset deep neural network model, obtain the i- of voice to be verifiedVector vectors, and preset deep neural network model is updated to by the MFCC features after cascade by new preset depthNeural network model.
Preferably, step S5 is specifically included:
S51:The preset i-vector vectors of the i-vector vector sums of voice to be verified are subjected to positive naturalization processing, will justThe i-vector vectors that the i-vector vector sums of voice to be verified after naturalization processing are preset pass through outline linear distinction pointAnalysis model is compared, and gets the matching fraction for comparing and drawing;
S52:Matching fraction is added into migration fraction, obtains new matching fraction;
S53:The authority credentials of voice to be verified is obtained according to new matching fraction, judge voice to be verified authority credentials whetherMore than or equal to authority credentials corresponding to wake-up word preset corresponding to voice to be verified, if so, then performing step S54;
S54:Perform and operated corresponding to preset wake-up word corresponding with voice to be verified.
Preferably, step S5 also includes:Step S55;
Step S53 is specifically included:The authority credentials of voice to be verified is obtained according to new matching fraction, judges voice to be verifiedAuthority credentials whether be more than or equal to authority credentials corresponding to preset wake-up word corresponding to voice to be verified, if so, then performing stepRapid S54, if it is not, then performing step S55;
S55:The prompting of sending permission deficiency.
The invention provides a kind of voice Rouser of combination Application on Voiceprint Recognition, including:
Feature unit, for receiving voice to be verified and carrying out feature extraction, obtain the MFCC features of voice to be verified;
Buffer unit, for being cached to the MFCC features of the voice to be verified in predetermined period;
Wakeup unit, the MFCC features for the voice to be verified according to caching judge voice to be verified content whether bePreset wake-up word, if so, then performing step S4;
Vector location, for the MFCC features of the voice to be verified of caching to be inputted to preset deep neural network modelIn, the i-vector for obtaining voice to be verified is vectorial;
Comparing unit, the i-vector vector preset for the i-vector vector sums by voice to be verified are compared,The matching fraction drawn according to comparing obtains the authority credentials of voice to be verified, judge voice to be verified authority credentials whether be more than orEqual to authority credentials corresponding to wake-up word preset corresponding to voice to be verified, if so, then performing corresponding with voice to be verified pre-Operated corresponding to the wake-up word put.
Preferably, vector location specifically includes:
Subelement is cascaded, for the MFCC features of the voice to be verified of caching to be cascaded;
Subelement is obtained, for the MFCC features after cascade to be inputted in preset deep neural network model, acquisition is treatedThe i-vector vectors of voice are verified, and are updated to preset deep neural network model newly by the MFCC features after cascadePreset deep neural network model.
Preferably, comparing unit specifically includes:
Coupling subelement, the i-vector preset for the i-vector vector sums by voice to be verified vectors are just being returnedChange is handled, and the i-vector vectors that the i-vector vector sums of the voice to be verified after positive naturalization is handled are preset pass through outline lineProperty distinction analysis model be compared, get and compare the matching fraction that draws;
Subelement is compensated, for matching fraction to be added into migration fraction, obtains new matching fraction;
Judgment sub-unit, for obtaining the authority credentials of voice to be verified according to new matching fraction, judge voice to be verifiedAuthority credentials whether be more than or equal to authority credentials corresponding to preset wake-up word corresponding to voice to be verified, if so, then triggering is heldRow subelement;
Subelement is performed, for performing operation corresponding to preset wake-up word corresponding with voice to be verified.
Preferably, comparing unit also includes:Prompt subelement;
Judgment sub-unit is specifically used for the authority credentials that voice to be verified is obtained according to new matching fraction, judges language to be verifiedWhether the authority credentials of sound is more than or equal to authority credentials corresponding to wake-up word preset corresponding to voice to be verified, if so, then triggeringSubelement is performed, if it is not, then triggering prompting subelement;
Subelement is prompted, the prompting for sending permission deficiency.
As can be seen from the above technical solutions, the present invention has advantages below:
The invention provides a kind of voice awakening method of combination Application on Voiceprint Recognition, including:S1:Voice to be verified is received to go forward side by sideRow feature extraction, obtain the MFCC features of voice to be verified;S2:The MFCC features of voice to be verified in predetermined period are carried outCaching;S3:Whether the content that voice to be verified is judged according to the MFCC features of the voice to be verified of caching is preset wake-up word,If so, then perform step S4;S4:The MFCC features of the voice to be verified of caching are inputted to preset deep neural network modelIn, the i-vector for obtaining voice to be verified is vectorial;S5:By the i-vector that the i-vector of voice to be verified is vectorial and presetVector is compared, and the matching fraction drawn according to comparison obtains the authority credentials of voice to be verified, judges the power of voice to be verifiedWhether limit value is more than or equal to authority credentials corresponding to preset wake-up word corresponding to voice to be verified, if so, then perform with it is to be testedDemonstrate,prove and operated corresponding to wake-up word preset corresponding to voice.
The MFCC features that voice to be verified is first passed through in the present invention judge whether the content in voice to be verified is presetWord is waken up, if it is, extracting i-vector vectors by preset deep neural network model, is entered by i-vector vectorsRow Application on Voiceprint Recognition confirms speaker's identity, obtains the authority credentials of voice to be verified, and according to the authority credentials of speaker with it is to be verifiedPreset wake-up comparative result of authority credentials corresponding to word judges whether speaker has enough authorities corresponding to voice, ifHave, then perform and operated corresponding to preset wake-up word corresponding with voice to be verified, the voice for solving Current electronic product is called outFunction of waking up lacks subscription authentication function, can not carry out subscription authentication to realize equipment operation that is more complicated and needing user rightTechnical problem.
Brief description of the drawings
In order to illustrate more clearly about the embodiment of the present invention or technical scheme of the prior art, below will be to embodiment or existingThere is the required accompanying drawing used in technology description to be briefly described, it should be apparent that, drawings in the following description are only thisSome embodiments of invention, for those of ordinary skill in the art, without having to pay creative labor, may be used alsoTo obtain other accompanying drawings according to these accompanying drawings.
Fig. 1 is a kind of stream of one embodiment of the voice awakening method of combination Application on Voiceprint Recognition provided in an embodiment of the present inventionJourney schematic diagram;
Fig. 2 is a kind of another embodiment of the voice awakening method of combination Application on Voiceprint Recognition provided in an embodiment of the present inventionSchematic flow sheet;
Fig. 3 is a kind of knot of one embodiment of the voice Rouser of combination Application on Voiceprint Recognition provided in an embodiment of the present inventionStructure schematic diagram.
Embodiment
The embodiments of the invention provide a kind of voice awakening method of combination Application on Voiceprint Recognition and device, solves Current electronicThe voice arousal function of product lacks subscription authentication function, can not carry out subscription authentication to realize more complicated and need user to weighThe technical problem of the equipment operation of limit.
To enable goal of the invention, feature, the advantage of the present invention more obvious and understandable, below in conjunction with the present inventionAccompanying drawing in embodiment, the technical scheme in the embodiment of the present invention is clearly and completely described, it is clear that disclosed belowEmbodiment be only part of the embodiment of the present invention, and not all embodiment.Based on the embodiment in the present invention, this areaAll other embodiment that those of ordinary skill is obtained under the premise of creative work is not made, belongs to protection of the present inventionScope.
Referring to Fig. 1, the embodiments of the invention provide a kind of implementation of one of voice awakening method of combination Application on Voiceprint RecognitionExample, including:
Step 101:Receive voice to be verified and carry out feature extraction, obtain the MFCC features of voice to be verified;
It should be noted that if desired being waken up to equipment and Application on Voiceprint Recognition is, it is necessary to receive voice to be verified and carry outFeature extraction, obtain the MFCC features of voice to be verified.
Step 102:The MFCC features of voice to be verified in predetermined period are cached;
It should be noted that in order to save the memory space of electronic equipment, only to the voice to be verified in predetermined periodMFCC features are stored, and predetermined period could be arranged to nearest three seconds, as only to the voice to be verified in nearest three secondsMFCC features are stored.
Step 103:Whether the content that voice to be verified is judged according to the MFCC features of the voice to be verified of caching is presetWake-up word, if so, then performing step S4;
It should be noted that when voice to be verified is preset wake-up word, then follow-up step is performed, if language to be verifiedThe non-preset wake-up word of sound, then electronic equipment continue to keep standby.
Step 104:The MFCC features of the voice to be verified of caching are inputted in preset deep neural network model, obtainedThe i-vector vectors of voice to be verified;
It should be noted that when voice to be verified is preset wake-up word, then according to the MFCC features of voice to be verifiedExtract the i-vector vectors of voice to be verified;
Voiceprint Recognition System is come the system of automatic identification speaker's identity, sound groove recognition technology in e category according to the speciality of voiceIn one kind of biometric authentication technology, speaker's identity is verified using voice, this technology has preferable convenience, steadyThe features such as qualitative, measurability and security, as a kind of contactless collection and identification technology, the procurement cost of vocal print compared withIt is low, obtain it is convenient and using simple, before having huge applications in fields such as bank, social security, public security, smart home and mobile paymentsScape.
Step 105:The preset i-vector vectors of the i-vector vector sums of voice to be verified are compared, according to thanThe authority credentials of voice to be verified is obtained to the matching fraction drawn, judges whether the authority credentials of voice to be verified is more than or equal to and treatsAuthority credentials corresponding to wake-up word preset corresponding to voice is verified, if so, then performing step 106;
It should be noted that can by vectorial are compared with preset i-vector vectors of the i-vector of voice to be verifiedTo obtain matching fraction, and the identity of speaker can confirm that according to matching fraction, confirming the identity of speaker can getWhether the authority credentials of speaker, i.e., the authority credentials of voice to be verified, the authority credentials of voice to be verified are more than or equal to language to be verifiedPreset wake-up comparison of authority credentials corresponding to word corresponding to sound can decide whether to perform next step.
Step 106:Perform and operated corresponding to preset wake-up word corresponding with voice to be verified.
It should be noted that when the authority credentials of voice to be verified is more than or equal to wake-up preset corresponding to voice to be verifiedCorresponding to word during authority credentials, then perform corresponding to preset wake-up word corresponding with voice to be verified and operate.
The MFCC features that voice to be verified is first passed through in the present embodiment judge whether the content in voice to be verified is presetWake-up word, if it is, extracting i-vector vectors by preset deep neural network model, pass through i-vector vectorsCarry out Application on Voiceprint Recognition and confirm speaker's identity, obtain the authority credentials of voice to be verified, and according to the authority credentials of speaker with it is to be testedPreset wake-up comparative result of authority credentials corresponding to word judges whether speaker has enough authorities corresponding to card voice, ifHave, then perform and operated corresponding to preset wake-up word corresponding with voice to be verified, the voice for solving Current electronic product is called outFunction of waking up lacks subscription authentication function, can not carry out subscription authentication to realize equipment operation that is more complicated and needing user rightTechnical problem.
It is a kind of one embodiment of the voice awakening method of combination Application on Voiceprint Recognition provided in an embodiment of the present invention above, withIt is a kind of another embodiment of the voice awakening method of combination Application on Voiceprint Recognition provided in an embodiment of the present invention down.
Referring to Fig. 2, the embodiments of the invention provide a kind of another reality of the voice awakening method of combination Application on Voiceprint RecognitionExample is applied, including:
Step 201:Receive voice to be verified and carry out feature extraction, obtain the MFCC features of voice to be verified;
It should be noted that need to such as obtain more high-dimensional MFCC features, then can by carrying out a point window to discrete signal,Fourier transform is carried out after dividing window, and increases the number of wave filter group, then calculates mel cepstrum coefficients and can be obtained by more higher-dimensionMFCC coefficients.
Step 202:The MFCC features of voice to be verified in predetermined period are cached;
It should be noted that in order to save the memory space of electronic equipment, although electronic equipment, which is in, continues listening state,But only the MFCC features of the voice to be verified in predetermined period are stored, predetermined period could be arranged to nearest three seconds,As only stored to the MFCC features of the voice to be verified in nearest three seconds.
Step 203:Whether the content that voice to be verified is judged according to the MFCC features of the voice to be verified of caching is presetWake-up word, if so, then perform step 204;
It should be noted that when voice to be verified is preset wake-up word, then wake-up device is switched to just by holding stateNormal working condition simultaneously performs follow-up step, if the non-preset wake-up word of voice to be verified, electronic equipment continue to keep standbyState.
Step 204:The MFCC features of the voice to be verified of caching are cascaded;
It should be noted that it is that the MFCC features of the voice to be verified of caching are carried out into cascade to refer to time upper adjacent languageSound frame coefficient vector is stitched together as a longer vector.
Step 205:MFCC features after cascade are inputted in preset deep neural network model, obtain voice to be verifiedI-vector vectors, and by the MFCC features after cascade preset deep neural network model is updated to new presetDeep neural network model;
It should be noted that traditional i-vector vectors are to be extracted by mixed Gauss model (GMM), using bigSpeaker's speech data that the speaker of amount is unrelated, channel is unrelated, is pre-processed, and extracts MFCC features, general for trainingBackground model UBM and global disparity matrix T, the algorithm for training UBM are expectation maximization (ExpectationMaximization,EM);
Training finishes, and preserves universal background model and global disparity matrix, in registration and test phase, to the every of speakerDuan Yuyin, i-vector vectors corresponding to every section of voice are extracted using formula (1):
Ms=mu+Tωs
Wherein, muIt is UBM Gaussian mean super vector;T is the global disparity matrix of low-rank, for characterizing global disparity skyBetween information;ωsIt is the global disparity factor.
Using the i-vector vector extracting methods based on deep neural network (DNN) in the present embodiment, language is usedThe deep neural network (DNN) of sound identification, regards the MFCC features after cascade as phoneme model (Tri-phone) and is input to itBefore in the DNN networks that train, Short Time Speech frame is classified according to posterior probability, each frame and corresponding posterior probability canBe used to train a new UBM, UBM can be so trained by way of supervised learning, instead of in traditional UBM trainingUnsupervised EM algorithms;
I-vector vectors are extracted by deep neural network and possess high accuracy rate, noise robustness and channel robustness, adaptationThe advantages that various texts.
Step 206:The preset i-vector vectors of the i-vector vector sums of voice to be verified are subjected to positive naturalization processing,The i-vector vectors that the i-vector vector sums of voice to be verified after positive naturalization is handled are preset are linearly distinguished by outlineProperty analysis model be compared, get and compare the matching fraction that draws;
It should be noted that positive naturalization processing refers to i-vector vectorial and preset the i-vector of voice to be verifiedVector is changed into identical length;
Preset i-vector vectors can have multiple, represent different users respectively, and such as A electronic equipment, but A existsA, B, C, D and E i-vector vectors are stored in electronic equipment, assign A, B, C, D the authority credentials different with E;
It is compared the i-vector of voice to be verified is vectorial, can gets with each preset i-vector vectorsDifferent matching fractions.
Step 207:Matching fraction is added into migration fraction, obtains new matching fraction;
It should be noted that because the extraneous factor such as noise can to match fraction shifts, be according to extraneous circumstanceCompensated by preset migration fraction, obtain new matching fraction.
Step 208:The authority credentials of voice to be verified is obtained according to new matching fraction, judges the authority credentials of voice to be verifiedWhether authority credentials corresponding to wake-up word preset corresponding to voice to be verified is more than or equal to, if so, step 209 is then performed, ifIt is no, then perform step 210;
It should be noted that according to each new matching fraction, the identity of speaker is can confirm that, such as voice to be verifiedI-vector vectors and E preset i-vector Vectors matching fraction highests and be more than predetermined threshold value, then judge to speak artificiallyE, gets E authority credentials, and judges whether E authority credentials is more than or equal to wake-up word pair preset corresponding to voice to be verifiedThe authority credentials answered, the wake-up word said such as E are to pay, and the authority credentials for paying needs is arranged to 5, but E authority credentials only has 3,The step of so can determining to perform according to this judged result;
It if speaker is F, can also obtain matching fraction, but match fraction and be less than predetermined threshold value, therefore F does not possessAny authority credentials.
Step 209:Perform and operated corresponding to preset wake-up word corresponding with voice to be verified;
It should be noted that if E authority credentials is 5, then is equal to pay the authority credentials needed, then electronic equipment is heldRow delivery operation corresponding with paying this wake-up word.
Step 210:The prompting of sending permission deficiency.
If it should be noted that E authority credentials be 3, less than pay need authority credentials, then by word, light orThe prompting of the mode such as person's loudspeaker sending permission deficiency.
Sound groove recognition technology in e is as a kind of identity validation technology of Remote Non-touch, with reference to across media interactive communications with answeringWith service platform, there is huge applications prospect in fields such as bank, social security, public security, smart home, mobile payments, by vocal printIdentification technology with voice arousal function be combined improve user interactive voice experience, veritably contact-free can realize that identity is recognizedCard;
And traditional mixed Gauss model is instead of in sound groove recognition technology in e with deep neural network, there is accuracy rateHigh, noise robustness and channel robustness, the advantages that various texts is adapted to, and support cross-platform and across channel deployment, had and calculate speedThe advantages such as fast, low in energy consumption, occupying system resources are few are spent, can easily be deployed on mobile electronic device, there is wide applicationProspect;
The MFCC features that voice to be verified is first passed through in the present embodiment judge whether the content in voice to be verified is presetWake-up word;If it is, extracting i-vector vectors by preset deep neural network model, pass through i-vector vectorsCarry out Application on Voiceprint Recognition and confirm speaker's identity, obtain the authority credentials of voice to be verified, and according to the authority credentials of speaker with it is to be testedPreset wake-up comparative result of authority credentials corresponding to word judges whether speaker has enough authorities corresponding to card voice, ifHave, then perform and operated corresponding to preset wake-up word corresponding with voice to be verified, the voice for solving Current electronic product is called outFunction of waking up lacks subscription authentication function, can not carry out subscription authentication to realize equipment operation that is more complicated and needing user rightTechnical problem.
It is a kind of another embodiment of the voice awakening method of combination Application on Voiceprint Recognition provided in an embodiment of the present invention above,It is a kind of one embodiment of the voice Rouser of combination Application on Voiceprint Recognition provided in an embodiment of the present invention below.
Referring to Fig. 3, the embodiments of the invention provide a kind of implementation of one of voice Rouser of combination Application on Voiceprint RecognitionExample, including:
Feature unit 301, for receiving voice to be verified and carrying out feature extraction, the MFCC for obtaining voice to be verified is specialSign;
Buffer unit 302, for being cached to the MFCC features of the voice to be verified in predetermined period;
Wakeup unit 303, the MFCC features for the voice to be verified according to caching judge that the content of voice to be verified isNo is preset wake-up word, if so, then performing step S4;
Vector location 304, for the MFCC features of the voice to be verified of caching to be inputted to preset deep neural network mouldIn type, the i-vector vectors of voice to be verified are obtained;
Comparing unit 305, the i-vector vector preset for the i-vector vector sums by voice to be verified are comparedIt is right, according to the authority credentials for comparing the matching fraction acquisition voice to be verified drawn, judge whether the authority credentials of voice to be verified is bigIn or equal to authority credentials corresponding to wake-up word preset corresponding to voice to be verified, if so, then performing corresponding with voice to be verifiedPreset wake-up word corresponding to operate.
Preferably, vector location 304 specifically includes:
Subelement 3041 is cascaded, for the MFCC features of the voice to be verified of caching to be cascaded;
Subelement 3042 is obtained, for the MFCC features after cascade to be inputted in preset deep neural network model, is obtainedThe i-vector vectors of voice to be verified are taken, and are updated preset deep neural network model by the MFCC features after cascadeFor new preset deep neural network model.
Preferably, comparing unit 305 specifically includes:
Coupling subelement 3051, the i-vector preset for the i-vector vector sums by voice to be verified vectors are carried outPositive naturalization processing, the preset i-vector vectors of the i-vector vector sums of the voice to be verified after positive naturalization processing are passed through generalIt is compared by linear distinction analysis model, gets the matching fraction for comparing and drawing;
Subelement 3052 is compensated, for matching fraction to be added into migration fraction, obtains new matching fraction;
Judgment sub-unit 3053, for obtaining the authority credentials of voice to be verified according to new matching fraction, judge to be verifiedWhether the authority credentials of voice is more than or equal to authority credentials corresponding to wake-up word preset corresponding to voice to be verified, if so, then touchingHair performs subelement 3054;
Subelement 3054 is performed, for performing operation corresponding to preset wake-up word corresponding with voice to be verified.
Preferably, comparing unit 305 also includes:Prompt subelement 3055;
Judgment sub-unit 3053 is specifically used for the authority credentials that voice to be verified is obtained according to new matching fraction, judges to be testedWhether the authority credentials for demonstrate,proving voice is more than or equal to authority credentials corresponding to wake-up word preset corresponding to voice to be verified, if so, thenTriggering performs subelement 3054, if it is not, then triggering prompting subelement 3055;
Subelement 3055 is prompted, the prompting for sending permission deficiency.
It is apparent to those skilled in the art that for convenience and simplicity of description, the system of foregoing description,The specific work process of device and unit, the corresponding process in preceding method embodiment is may be referred to, will not be repeated here.
In several embodiments provided herein, it should be understood that disclosed apparatus and method, it can be passed throughIts mode is realized.For example, device embodiment described above is only schematical, for example, the division of the unit, onlyOnly a kind of division of logic function, there can be other dividing mode when actually realizing, such as multiple units or component can be tiedAnother system is closed or is desirably integrated into, or some features can be ignored, or do not perform.It is another, it is shown or discussedMutual coupling or direct-coupling or communication connection can be the INDIRECT COUPLINGs or logical by some interfaces, device or unitLetter connection, can be electrical, mechanical or other forms.
The unit illustrated as separating component can be or may not be physically separate, show as unitThe part shown can be or may not be physical location, you can with positioned at a place, or can also be distributed to multipleOn NE.Some or all of unit therein can be selected to realize the mesh of this embodiment scheme according to the actual needs's.
In addition, each functional unit in each embodiment of the present invention can be integrated in a processing unit, can alsoThat unit is individually physically present, can also two or more units it is integrated in a unit.Above-mentioned integrated listMember can both be realized in the form of hardware, can also be realized in the form of SFU software functional unit.
If the integrated unit is realized in the form of SFU software functional unit and is used as independent production marketing or useWhen, it can be stored in a computer read/write memory medium.Based on such understanding, technical scheme is substantiallyThe part to be contributed in other words to prior art or all or part of the technical scheme can be in the form of software productsEmbody, the computer software product is stored in a storage medium, including some instructions are causing a computerEquipment (can be personal computer, server, or network equipment etc.) performs the complete of each embodiment methods described of the present inventionPortion or part steps.And foregoing storage medium includes:USB flash disk, mobile hard disk, read-only storage (ROM, Read-OnlyMemory), random access memory (RAM, Random Access Memory), magnetic disc or CD etc. are various can store journeyThe medium of sequence code.
Described above, the above embodiments are merely illustrative of the technical solutions of the present invention, rather than its limitations;Although with reference to beforeEmbodiment is stated the present invention is described in detail, it will be understood by those within the art that:It still can be to precedingState the technical scheme described in each embodiment to modify, or equivalent substitution is carried out to which part technical characteristic;And theseModification is replaced, and the essence of appropriate technical solution is departed from the spirit and scope of various embodiments of the present invention technical scheme.

Claims (8)

CN201711145883.5A2017-11-172017-11-17Voice wake-up method and device combined with voiceprint recognitionPendingCN107886957A (en)

Priority Applications (1)

Application NumberPriority DateFiling DateTitle
CN201711145883.5ACN107886957A (en)2017-11-172017-11-17Voice wake-up method and device combined with voiceprint recognition

Applications Claiming Priority (1)

Application NumberPriority DateFiling DateTitle
CN201711145883.5ACN107886957A (en)2017-11-172017-11-17Voice wake-up method and device combined with voiceprint recognition

Publications (1)

Publication NumberPublication Date
CN107886957Atrue CN107886957A (en)2018-04-06

Family

ID=61777214

Family Applications (1)

Application NumberTitlePriority DateFiling Date
CN201711145883.5APendingCN107886957A (en)2017-11-172017-11-17Voice wake-up method and device combined with voiceprint recognition

Country Status (1)

CountryLink
CN (1)CN107886957A (en)

Cited By (22)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN108646902A (en)*2018-05-152018-10-12广州势必可赢网络科技有限公司Control method and device for standby enabling signal
CN108665895A (en)*2018-05-032018-10-16百度在线网络技术(北京)有限公司Methods, devices and systems for handling information
CN108766446A (en)*2018-04-182018-11-06上海问之信息科技有限公司Method for recognizing sound-groove, device, storage medium and speaker
CN108877790A (en)*2018-05-212018-11-23江西午诺科技有限公司Speaker control method, device, readable storage medium storing program for executing and mobile terminal
CN108962260A (en)*2018-06-252018-12-07福来宝电子(深圳)有限公司A kind of more human lives enable audio recognition method, system and storage medium
CN109215646A (en)*2018-08-152019-01-15北京百度网讯科技有限公司Voice interaction processing method, device, computer equipment and storage medium
CN109524011A (en)*2018-10-222019-03-26四川虹美智能科技有限公司A kind of refrigerator awakening method and device based on Application on Voiceprint Recognition
CN109887511A (en)*2019-04-242019-06-14武汉水象电子科技有限公司A kind of voice wake-up optimization method based on cascade DNN
CN109978145A (en)*2019-03-292019-07-05联想(北京)有限公司A kind of processing method and processing device
CN110060693A (en)*2019-04-162019-07-26Oppo广东移动通信有限公司Model training method and device, electronic equipment and storage medium
CN110517686A (en)*2019-09-262019-11-29合肥飞尔智能科技有限公司Intelligent sound box end voice opens the method and system of application
CN110574103A (en)*2018-06-292019-12-13华为技术有限公司 A voice control method, wearable device and terminal
CN110853632A (en)*2018-08-212020-02-28蔚来汽车有限公司Voice recognition method based on voiceprint information and intelligent interaction equipment
CN111192574A (en)*2018-11-142020-05-22奇酷互联网络科技(深圳)有限公司Intelligent voice interaction method, mobile terminal and computer readable storage medium
WO2020102991A1 (en)*2018-11-202020-05-28深圳市欢太科技有限公司Method and apparatus for waking up device, storage medium and electronic device
CN111243603A (en)*2020-01-092020-06-05厦门快商通科技股份有限公司Voiceprint recognition method, system, mobile terminal and storage medium
CN111640433A (en)*2020-06-012020-09-08珠海格力电器股份有限公司Voice interaction method, storage medium, electronic equipment and intelligent home system
CN111816193A (en)*2020-08-122020-10-23深圳市友杰智新科技有限公司Voice awakening method and device based on multi-segment network and storage medium
CN111862965A (en)*2019-04-282020-10-30阿里巴巴集团控股有限公司Awakening processing method and device, intelligent sound box and electronic equipment
CN115019806A (en)*2021-12-282022-09-06北京荣耀终端有限公司 Voiceprint recognition method and device
CN115312040A (en)*2022-08-082022-11-08中国电信股份有限公司Voice wake-up method and device, electronic equipment and computer readable storage medium
CN116469397A (en)*2023-03-152023-07-21深圳市声扬科技有限公司 Method, device and electronic device for setting device operation authority

Citations (6)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN104658533A (en)*2013-11-202015-05-27中兴通讯股份有限公司Terminal unlocking method and device as well as terminal
US20150245154A1 (en)*2013-07-112015-08-27Intel CorporationMechanism and apparatus for seamless voice wake and speaker verification
CN106448684A (en)*2016-11-162017-02-22北京大学深圳研究生院Deep-belief-network-characteristic-vector-based channel-robust voiceprint recognition system
CN106601229A (en)*2016-11-152017-04-26华南理工大学Voice awakening method based on soc chip
CN106601258A (en)*2016-12-122017-04-26广东顺德中山大学卡内基梅隆大学国际联合研究院Speaker identification method capable of information channel compensation based on improved LSDA algorithm
CN107146615A (en)*2017-05-162017-09-08南京理工大学 Speech Recognition Method and System Based on Secondary Recognition of Matching Model

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US20150245154A1 (en)*2013-07-112015-08-27Intel CorporationMechanism and apparatus for seamless voice wake and speaker verification
CN104658533A (en)*2013-11-202015-05-27中兴通讯股份有限公司Terminal unlocking method and device as well as terminal
CN106601229A (en)*2016-11-152017-04-26华南理工大学Voice awakening method based on soc chip
CN106448684A (en)*2016-11-162017-02-22北京大学深圳研究生院Deep-belief-network-characteristic-vector-based channel-robust voiceprint recognition system
CN106601258A (en)*2016-12-122017-04-26广东顺德中山大学卡内基梅隆大学国际联合研究院Speaker identification method capable of information channel compensation based on improved LSDA algorithm
CN107146615A (en)*2017-05-162017-09-08南京理工大学 Speech Recognition Method and System Based on Secondary Recognition of Matching Model

Cited By (28)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN108766446A (en)*2018-04-182018-11-06上海问之信息科技有限公司Method for recognizing sound-groove, device, storage medium and speaker
CN108665895A (en)*2018-05-032018-10-16百度在线网络技术(北京)有限公司Methods, devices and systems for handling information
CN108665895B (en)*2018-05-032021-05-25百度在线网络技术(北京)有限公司Method, device and system for processing information
CN108646902A (en)*2018-05-152018-10-12广州势必可赢网络科技有限公司Control method and device for standby enabling signal
CN108877790A (en)*2018-05-212018-11-23江西午诺科技有限公司Speaker control method, device, readable storage medium storing program for executing and mobile terminal
CN108962260A (en)*2018-06-252018-12-07福来宝电子(深圳)有限公司A kind of more human lives enable audio recognition method, system and storage medium
CN110574103B (en)*2018-06-292020-10-23华为技术有限公司 A voice control method, wearable device and terminal
CN110574103A (en)*2018-06-292019-12-13华为技术有限公司 A voice control method, wearable device and terminal
CN109215646A (en)*2018-08-152019-01-15北京百度网讯科技有限公司Voice interaction processing method, device, computer equipment and storage medium
CN110853632A (en)*2018-08-212020-02-28蔚来汽车有限公司Voice recognition method based on voiceprint information and intelligent interaction equipment
CN109524011A (en)*2018-10-222019-03-26四川虹美智能科技有限公司A kind of refrigerator awakening method and device based on Application on Voiceprint Recognition
CN111192574A (en)*2018-11-142020-05-22奇酷互联网络科技(深圳)有限公司Intelligent voice interaction method, mobile terminal and computer readable storage medium
WO2020102991A1 (en)*2018-11-202020-05-28深圳市欢太科技有限公司Method and apparatus for waking up device, storage medium and electronic device
CN109978145A (en)*2019-03-292019-07-05联想(北京)有限公司A kind of processing method and processing device
CN109978145B (en)*2019-03-292021-09-14联想(北京)有限公司Processing method and device
CN110060693A (en)*2019-04-162019-07-26Oppo广东移动通信有限公司Model training method and device, electronic equipment and storage medium
CN109887511A (en)*2019-04-242019-06-14武汉水象电子科技有限公司A kind of voice wake-up optimization method based on cascade DNN
CN111862965A (en)*2019-04-282020-10-30阿里巴巴集团控股有限公司Awakening processing method and device, intelligent sound box and electronic equipment
CN111862965B (en)*2019-04-282024-10-15阿里巴巴集团控股有限公司Wake-up processing method and device, intelligent sound box and electronic equipment
CN110517686A (en)*2019-09-262019-11-29合肥飞尔智能科技有限公司Intelligent sound box end voice opens the method and system of application
CN111243603A (en)*2020-01-092020-06-05厦门快商通科技股份有限公司Voiceprint recognition method, system, mobile terminal and storage medium
CN111640433A (en)*2020-06-012020-09-08珠海格力电器股份有限公司Voice interaction method, storage medium, electronic equipment and intelligent home system
CN111816193A (en)*2020-08-122020-10-23深圳市友杰智新科技有限公司Voice awakening method and device based on multi-segment network and storage medium
CN111816193B (en)*2020-08-122020-12-15深圳市友杰智新科技有限公司Voice awakening method and device based on multi-segment network and storage medium
CN115019806A (en)*2021-12-282022-09-06北京荣耀终端有限公司 Voiceprint recognition method and device
CN115019806B (en)*2021-12-282025-02-25北京荣耀终端有限公司 Voiceprint recognition method and device
CN115312040A (en)*2022-08-082022-11-08中国电信股份有限公司Voice wake-up method and device, electronic equipment and computer readable storage medium
CN116469397A (en)*2023-03-152023-07-21深圳市声扬科技有限公司 Method, device and electronic device for setting device operation authority

Similar Documents

PublicationPublication DateTitle
CN107886957A (en)Voice wake-up method and device combined with voiceprint recognition
JP7110292B2 (en) Neural network for speaker verification
CN110289003B (en) A voiceprint recognition method, model training method and server
US9940935B2 (en)Method and device for voiceprint recognition
Chavan et al.An overview of speech recognition using HMM
EP3989217B1 (en)Method for detecting an audio adversarial attack with respect to a voice input processed by an automatic speech recognition system, corresponding device, computer program product and computer-readable carrier medium
CN107886943A (en)Voiceprint recognition method and device
CN106898355B (en)Speaker identification method based on secondary modeling
CN111462756B (en)Voiceprint recognition method and device, electronic equipment and storage medium
CN106373575A (en) A user voiceprint model construction method, device and system
CN107517207A (en)Server, auth method and computer-readable recording medium
CN111524527A (en)Speaker separation method, device, electronic equipment and storage medium
WO2014114116A1 (en)Method and system for voiceprint recognition
CN107610707A (en)A kind of method for recognizing sound-groove and device
CN112949708A (en)Emotion recognition method and device, computer equipment and storage medium
CN111199741A (en)Voiceprint identification method, voiceprint verification method, voiceprint identification device, computing device and medium
CN114913859B (en)Voiceprint recognition method, voiceprint recognition device, electronic equipment and storage medium
CN112037772B (en)Response obligation detection method, system and device based on multiple modes
CN110517696A (en)A kind of offline Voiceprint Recognition System of implantable
CN106709804A (en)Interactive wealth planning consulting robot system
CN113870840A (en)Voice recognition method, device and related equipment
CN112802481A (en)Voiceprint verification method, voiceprint recognition model training method, device and equipment
CN113436633A (en)Speaker recognition method, speaker recognition device, computer equipment and storage medium
Perdana et al.Voice recognition system for user authentication using gaussian mixture model
CN115862634A (en)Voiceprint recognition method and embedded device

Legal Events

DateCodeTitleDescription
PB01Publication
PB01Publication
SE01Entry into force of request for substantive examination
SE01Entry into force of request for substantive examination
RJ01Rejection of invention patent application after publication
RJ01Rejection of invention patent application after publication

Application publication date:20180406


[8]ページ先頭

©2009-2025 Movatter.jp