CN105430654A - Method and device used for identifying number attribution information - Google Patents

Method and device used for identifying number attribution information

Info

Publication number
CN105430654A
Authority
CN
China
Prior art keywords
note
title
sender
subsample
sample
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201510728723.8A
Other languages
Chinese (zh)
Other versions
CN105430654B (en)
Inventor
汪平仄
张涛
陈志军
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Xiaomi Technology Co Ltd
Xiaomi Inc
Original Assignee
Xiaomi Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xiaomi Inc
Priority to CN201510728723.8A
Publication of CN105430654A
Application granted
Publication of CN105430654B
Active
Anticipated expiration

Abstract

The invention discloses a method and a device for identifying number attribution information. The method comprises: acquiring a sample short message set; extracting, from the sample short messages of the set, titles used for identifying number attribution information; merging the titles of the sample short messages that correspond to the same short message sending party number; and identifying the attribution information of that sending party number according to the merged information. The attribution information of the sending party numbers in the sample short message set is thereby identified automatically, which avoids the waste of human resources caused by manual identification and improves identification efficiency.

Description

Method and device for identifying number attribution information
Technical field
The application relates to the field of communication technology, and in particular to a method and a device for identifying the attribution information of a number.
Background technology
With the rapid development of mobile communication technology, terminals have become a necessity in the work and life of many people. While bringing convenience, terminals also bring hidden dangers to people's lives, for example communications that the user does not wish to receive, such as fraudulent calls and spam short messages.
In the related art, each telephone number is identified and labeled manually to establish a correspondence between telephone numbers and attribution information. When a calling party's telephone number or a dial instruction for a telephone number is received, the attribution information corresponding to the telephone number is looked up through the correspondence, and a reminder is given according to that attribution information. However, because numbers change and many numbers need manual identification, this consumes substantial human resources, and identifying the attribution information of telephone numbers is inefficient.
Summary of the invention
To overcome the problems in the related art, the present disclosure provides a method and a device for identifying the attribution information of a number.
According to a first aspect of the embodiments of the disclosure, a method for identifying the attribution information of a number is provided, the method comprising:
Obtaining a sample short message set;
Extracting, from the sample short messages of the set, titles used for identifying number attribution information;
Merging the titles of the sample short messages corresponding to the same sending-party number;
Identifying the attribution information of the sending-party number according to the merged information.
Optionally, obtaining the sample short message set comprises:
Obtaining the history short messages within a preset time period;
Identifying the sender number of each history short message;
Determining the history short messages whose sender number is a notification-class short message number as sample short messages, thereby obtaining the sample short message set.
Optionally, extracting the titles used for identifying number attribution information from the sample short messages of the set comprises:
When a sample short message contains a special symbol group, extracting the information enclosed by the special symbol group from the sample short message, and determining the title of the sample short message according to the extracted information;
When a sample short message does not contain the special symbol group, determining the title of the sample short message to be empty information.
Optionally, the method further comprises:
Determining, as sending-party numbers, the sender numbers of those sample short messages in the set that contain the special symbol group;
Filtering out of the sample short message set the sub-sample short message set corresponding to each sending-party number, where each sample short message in a sub-sample short message set comprises the sending-party number, the recipient number, and the title.
Optionally, merging the titles of the sample short messages corresponding to the same sending-party number comprises:
Counting the recipient numbers corresponding to each title in the sub-sample short message set to obtain a sub-sample short message merged set, where each entry of the merged set comprises the sending-party number, a title, and the count of recipient numbers for that title, and where the sub-sample short message set comprises the sample short messages corresponding to the same sending-party number.
Optionally, identifying the attribution information of the sending-party number according to the merged information comprises:
Calculating, with the following formula, the probability of each title in the sub-sample short message merged set:
$$P(\mathrm{title}_i) = \frac{C(\mathrm{title}_i)}{\sum_{k=1}^{n} C(\mathrm{title}_k)}, \quad i \in (1, n)$$
where $P(\mathrm{title}_i)$ is the probability of title $\mathrm{title}_i$ in the merged set, $C(\mathrm{title}_i)$ is the count of recipient numbers corresponding to $\mathrm{title}_i$ in the merged set, $C(\mathrm{title}_k)$ is the count of recipient numbers corresponding to $\mathrm{title}_k$ in the merged set, and $n$ is the number of titles in the merged set;
Determining the titles whose probability is greater than a probability threshold as the attribution information of the sending-party number.
Optionally, before the probability of each title in the sub-sample short message merged set is calculated with the formula, the method further comprises:
Judging whether the count of recipient numbers corresponding to each title in the merged set is less than a count threshold;
Deleting the titles whose count of recipient numbers is less than the count threshold.
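The filtering and probability steps above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the function names, the dictionary layout (title mapped to its count of recipient numbers), and the default threshold values are all hypothetical. The probability of a title is its recipient count divided by the sum of the recipient counts of all titles that remain after filtering.

```python
from typing import Dict, List


def title_probabilities(counts: Dict[str, int]) -> Dict[str, float]:
    """P(title_i) = C(title_i) / sum over k of C(title_k)."""
    total = sum(counts.values())
    if total == 0:
        return {}
    return {title: count / total for title, count in counts.items()}


def identify_attribution(
    counts: Dict[str, int],
    min_recipients: int = 2,       # hypothetical count threshold
    prob_threshold: float = 0.5,   # hypothetical probability threshold
) -> List[str]:
    # Delete titles whose recipient count is below the count threshold.
    kept = {t: c for t, c in counts.items() if c >= min_recipients}
    # Keep the titles whose probability exceeds the probability threshold.
    probs = title_probabilities(kept)
    return [t for t, p in probs.items() if p > prob_threshold]
```

In this sketch an empty title (a message without a special symbol group) is treated like any other title; whether it should be excluded from the result is a design choice the text leaves open.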
Optionally, the method further comprises:
Determining the association between sending-party numbers and attribution information according to the attribution information of each sending-party number;
Identifying a target number to be identified according to the association between sending-party numbers and attribution information, and determining the attribution information of the target number, where the target number comprises the number that a calling party is about to dial, the number received by a called party, the number that a short message sender is about to send to, or the number received by a short message recipient.
According to the second aspect of disclosure embodiment, a kind of recognition device of attaching information of number is provided, comprises:
Note collection acquisition module, is configured to obtain sample note collection;
Title abstraction module, is configured to the title extracted from the sample note of described sample note collection for identification number attaching information;
Title merges module, is configured to the title of sample note corresponding for same number of sender to merge;
First attaching information identification module, is configured to the attaching information identifying described number of sender according to pooling information.
Optionally, described note collection acquisition module comprises:
Note obtains submodule, is configured to obtain the history note in preset time period;
Number Reorganization submodule, is configured to identify the sender number of described history note;
Sample note collection determination submodule, is configured to be notify that the history note of class note number is defined as sample note by described sender number, obtains sample note collection.
Optionally, described title abstraction module comprises:
Title extracts submodule, when being configured to comprise special symbol group in the sample note of described sample note collection, extracts the information between special symbol group from described sample note, determines the title of described sample note according to the information extracted; When not comprising described special symbol group in the sample note of described sample note collection, the title of described sample note is defined as sky information.
Optionally, described device also comprises:
Number of sender determination module, the sender number being configured to described sample note to concentrate the sample note that comprises described special symbol group corresponding is defined as number of sender;
Subsample note collection determination module, be configured to concentrate from described sample note filter out subsample note collection corresponding to each number of sender respectively, described subsample note concentrates each sample note to comprise number of sender, short message receiver number and title.
Optionally, described title merging module comprises:
Merge collection and determine submodule, be configured to the short message receiver number number that calculating subsample note concentrates each title corresponding, obtain subsample note and merge collection, described subsample note merges concentrates each sample note to comprise number of sender, title, short message receiver number number, and described subsample note collection comprises sample note corresponding to same number of sender.
Optionally, described first attaching information identification module comprises:
Probability calculation submodule, configured to calculate, with the following formula, the probability of each title in the sub-sample short message merged set:
$$P(\mathrm{title}_i) = \frac{C(\mathrm{title}_i)}{\sum_{k=1}^{n} C(\mathrm{title}_k)}, \quad i \in (1, n)$$
where $P(\mathrm{title}_i)$ is the probability of title $\mathrm{title}_i$ in the merged set, $C(\mathrm{title}_i)$ is the count of recipient numbers corresponding to $\mathrm{title}_i$ in the merged set, $C(\mathrm{title}_k)$ is the count of recipient numbers corresponding to $\mathrm{title}_k$ in the merged set, and $n$ is the number of titles in the merged set;
Attribution information determination submodule, configured to determine the titles whose probability is greater than a probability threshold as the attribution information of the sending-party number.
Optionally, described first attaching information identification module also comprises:
Attaching information filters submodule, judges that described subsample note merges and concentrates short message receiver number number corresponding to title whether to be less than number threshold value; Title corresponding for the short message receiver number number being less than number threshold value is deleted.
Optionally, described device also comprises:
Association determination module, configured to determine the association between sending-party numbers and attribution information according to the attribution information of each sending-party number;
Second attribution information identification module, configured to identify a target number to be identified according to the association between sending-party numbers and attribution information, and to determine the attribution information of the target number, where the target number comprises the number that a calling party is about to dial, the number received by a called party, the number that a short message sender is about to send to, or the number received by a short message recipient.
According to the third aspect of disclosure embodiment, a kind of recognition device of attaching information of number is provided, comprises:
Processor;
For the memory of storage of processor executable instruction;
Wherein, described processor is configured to:
Obtain sample note collection;
The title being used for identification number attaching information is extracted from the sample note of described sample note collection;
The title of sample note corresponding for same number of sender is merged;
The attaching information of described number of sender is identified according to pooling information.
The technical scheme that embodiment of the present disclosure provides can comprise following beneficial effect:
The disclosure obtains sample note collection, then from the sample note of sample note collection, extract the title being used for identification number attaching information, the title of sample note corresponding for same number of sender is merged, the attaching information of number of sender is identified according to pooling information, realize automatically identifying the attaching information that sample note concentrates number of sender, the waste of human resource avoiding manual identified number to cause, improves recognition efficiency simultaneously.
In the disclosure, because the sender number of notice class note is different from the sender number of other conventional notes, therefore by sender number, the present embodiment can identify whether history note is notice class note, thus notice class note is defined as sample note, obtain sample note collection, thus improve the efficiency of the attaching information of follow-up identification number.
In the disclosure, when comprising described special symbol group in sample note, the information between special symbol group can be extracted from sample note, according to the title of the information determination sample note extracted, when not comprising special symbol group in sample note, the title of this sample note can be defined as sky information, thus improve the efficiency of the title determining sample note.
The disclosure can determine, as sending-party numbers, the sender numbers of those sample short messages in the set that contain the special symbol group, which avoids the case where the merged information is empty. In addition, the sub-sample short message set corresponding to each sending-party number is filtered out of the sample short message set, so that all sample short messages of a sending-party number are included in its sub-sample short message set: both those that contain the special symbol group and those that do not. This improves the accuracy of the subsequent identification of the sending-party number's attribution information, and avoids the error that would arise, when many sample short messages lack the special symbol group, from determining the attribution information only from the sample short messages that contain it.
The disclosure calculates title by the short message receiver number number that title is corresponding and merges the probability concentrated in subsample note, and title larger for probability is defined as the attaching information of number of sender, improve the accuracy determining attaching information, and decrease the quantity of attaching information, bring facility to user.
The disclosure was calculating the concentrated each title of subsample note merging before subsample note merges concentrated probable value, also comprised: judge whether the short message receiver number number that described subsample note merges concentrated title corresponding is less than number threshold value; Title corresponding for the short message receiver number number being less than number threshold value is deleted.When calculating the concentrated each title of subsample note merging in the probable value that note merging in subsample is concentrated, calculate the subsample note after deleting title and merge the probable value concentrating each title to concentrate in the merging of subsample note, thus reduce the amount of calculation of calculating probability.
It should be understood that the general description above and the detailed description below are only exemplary and explanatory, and do not limit the disclosure.
Brief description of the drawings
The accompanying drawings, which are incorporated into and form part of this specification, show embodiments consistent with the disclosure and, together with the specification, serve to explain the principles of the disclosure.
Fig. 1 is the flow chart of the recognition methods of the attaching information of a kind of number shown in the disclosure one exemplary embodiment.
Fig. 2 is the application scenarios figure of the recognition methods of the attaching information of a kind of number of the disclosure according to an exemplary embodiment.
Fig. 3 is the block diagram of the recognition device of the attaching information of a kind of number of the disclosure according to an exemplary embodiment.
Fig. 4 is the block diagram of the recognition device of the attaching information of the another kind of number of the disclosure according to an exemplary embodiment.
Fig. 5 is the block diagram of the recognition device of the attaching information of the another kind of number of the disclosure according to an exemplary embodiment.
Fig. 6 is the block diagram of the recognition device of the attaching information of the another kind of number of the disclosure according to an exemplary embodiment.
Fig. 7 is the block diagram of the recognition device of the attaching information of the another kind of number of the disclosure according to an exemplary embodiment.
Fig. 8 is the block diagram of the recognition device of the attaching information of the another kind of number of the disclosure according to an exemplary embodiment.
Fig. 9 is the block diagram of the recognition device of the attaching information of the another kind of number of the disclosure according to an exemplary embodiment.
Figure 10 is the block diagram of the recognition device of the attaching information of the another kind of number of the disclosure according to an exemplary embodiment.
Fig. 11 is a block diagram of a device for identifying the attribution information of a number according to an exemplary embodiment of the disclosure.
Embodiment
Exemplary embodiments are described in detail here, examples of which are shown in the accompanying drawings. When the description below refers to the drawings, the same numbers in different drawings denote the same or similar elements unless otherwise indicated. The implementations described in the following exemplary embodiments do not represent all implementations consistent with the disclosure. Rather, they are merely examples of devices and methods consistent with some aspects of the disclosure, as detailed in the appended claims.
The terms used in the disclosure are only for the purpose of describing specific embodiments and are not intended to limit the disclosure. The singular forms "a", "said" and "the" used in the disclosure and the appended claims are also intended to include the plural forms, unless the context clearly indicates otherwise. It should also be understood that the term "and/or" as used herein refers to and includes any and all possible combinations of one or more of the associated listed items.
It should be understood that although the terms first, second, third, etc. may be used in the disclosure to describe various information, the information should not be limited by these terms. These terms are only used to distinguish information of the same type from one another. For example, without departing from the scope of the disclosure, first information may also be called second information, and similarly, second information may also be called first information. Depending on the context, the word "if" as used herein may be interpreted as "when", "upon", or "in response to determining".
Fig. 1 is a flow chart of a method for identifying the attribution information of a number according to an exemplary embodiment of the disclosure. The method comprises the following steps:
In step 101, a sample short message set is obtained.
In step 102, titles used for identifying number attribution information are extracted from the sample short messages of the set.
In step 103, the titles of the sample short messages corresponding to the same sending-party number are merged.
In step 104, the attribution information of the sending-party number is identified according to the merged information.
The embodiments of the disclosure may be used in a terminal. The terminal may be an intelligent terminal, for example a smartphone, smart band, or smart watch with a communication function. The intelligent terminal may obtain the sample short message set locally or from a server, then extract from the sample short messages of the set the titles used for identifying number attribution information, merge the titles of the sample short messages corresponding to the same sending-party number, and identify the attribution information of the sending-party number according to the merged information.
The embodiments of the disclosure may also be used in a server, which may be a single server, a server cluster, a cloud server, etc. The server obtains the sample short message set, then extracts from the sample short messages of the set the titles used for identifying number attribution information, merges the titles of the sample short messages corresponding to the same sending-party number, and identifies the attribution information of the sending-party number according to the merged information.
As seen from the above-described embodiment, sample note collection can be obtained, then from the sample note of sample note collection, extract the title being used for identification number attaching information, the title of sample note corresponding for same number of sender is merged, the attaching information of number of sender is identified according to pooling information, realize automatically identifying the attaching information that sample note concentrates number of sender, the waste of human resource avoiding manual identified number to cause, improves recognition efficiency simultaneously.
In an optional implementation, the embodiments of the disclosure are used in a terminal. After the terminal identifies the attribution information of the sending-party numbers in the sample short message set, it can obtain the association between sending-party numbers and attribution information and store it. The terminal can then identify a target number to be identified according to that association and determine the attribution information of the target number. The target number may comprise the number that a calling party is about to dial, the number received by a called party, the number that a short message sender is about to send to, or the number received by a short message recipient.
The association between numbers and attribution information can be applied in several ways. For example, when the terminal receives a calling party's telephone number, it looks up the corresponding attribution information through the association and gives a reminder according to it. The reminder may display the attribution information on the screen, announce it by voice, and so on, which helps avoid losses caused by answering dangerous calls.
As another example, when the terminal receives a dial instruction for a telephone number, it looks up the corresponding attribution information through the association and gives a reminder according to it, which helps avoid losses caused by dialing dangerous telephone numbers and saves call costs.
As another example, when the terminal receives a short message, it looks up the attribution information corresponding to the message's sender number through the association, and intercepts the message or gives a reminder according to it, which is convenient for the user.
In another optional implementation, the embodiments of the disclosure are used in a server. After the server identifies the attribution information of the sending-party numbers in the sample short message set, it can obtain the association between sending-party numbers and attribution information and send it to the terminal, so that the terminal identifies a target number to be identified according to the association and determines the attribution information of the target number.
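The terminal-side lookup described above can be sketched as a small in-memory table. This is a hedged illustration only: the class `AttributionTable`, the function `reminder_for`, and the mapping layout (number mapped to a list of attribution titles) are hypothetical names introduced here, and the text does not prescribe how the association is stored or how the reminder is worded.

```python
from typing import Dict, List, Mapping


class AttributionTable:
    """Hypothetical local store for the number-to-attribution association."""

    def __init__(self) -> None:
        self._by_number: Dict[str, List[str]] = {}

    def update(self, association: Mapping[str, List[str]]) -> None:
        # Store an association computed locally or received from a server.
        for number, names in association.items():
            self._by_number[number] = list(names)

    def lookup(self, target_number: str) -> List[str]:
        # Return the attribution information, or an empty list if unknown.
        return list(self._by_number.get(target_number, []))


def reminder_for(table: AttributionTable, target_number: str) -> str:
    """Build the reminder text for an incoming or outgoing number."""
    names = table.lookup(target_number)
    if not names:
        return "Unknown number: " + target_number
    return target_number + ": " + ", ".join(names)
```

The same `lookup` call would serve all of the cases above (incoming call, dial instruction, received short message); only the action taken on the result differs.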
Fig. 2 is an application scenario diagram of a method for identifying the attribution information of a number according to an exemplary embodiment of the disclosure. The scheme of the disclosure may be executed on a cloud server. After determining the association between sending-party numbers and attribution information, the cloud server can push the association to terminals, so that each terminal identifies a target number to be identified according to the association and determines the attribution information of the target number. The terminal may be a smartphone, smart band, iPad, etc.
As for when the server sends the association, the server may broadcast it to every terminal once it has been determined, and each terminal may store it locally. Alternatively, the server may send the association to a terminal on request, when it receives an attribution information acquisition request initiated by that terminal.
How the terminal identifies and labels numbers according to the association between sending-party numbers and attribution information is not limited here.
In this embodiment, the server centrally identifies the attribution information of the sending-party numbers in the sample short message set, determines the association between sending-party numbers and attribution information, and pushes it to each terminal so that the terminals share it. This makes the association more comprehensive, and determining it centrally on the server avoids the waste of resources that would result if every terminal determined it separately.
Next, each step in Fig. 1 is described in turn.
Regarding step 101, the sample short message set may be the set of short messages obtained within a certain period. To save computation, it may be restricted to the notification-class short messages obtained within that period. In an optional implementation, the sample short message set is obtained as follows:
A1: obtain the history short messages within a preset time period.
The preset time period can be set in advance, for example to one month or one week. The history short messages may be messages sent by different senders to different recipients. In this step, a history short message comprises at least the sender number; it may further comprise the message content and the recipient number.
A2: identify the sender number of each history short message.
A3: determine the history short messages whose sender number is a notification-class short message number as sample short messages, obtaining the sample short message set.
Because the sender numbers of notification-class short messages differ from those of ordinary short messages, this embodiment can tell from the sender number whether a history short message is a notification-class short message. Taking only notification-class short messages as samples improves the efficiency of subsequently determining the correspondence between numbers and attribution information.
It should be understood that, besides the method above, the judgment methods of the related art may also be used to decide whether a history short message is a notification-class short message; this is not limited here.
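Steps A1 to A3 can be sketched in Python as follows. This is a minimal illustration, not the patented implementation: the `Sms` record, the `NOTIFICATION_PREFIXES` rule, and the 30-day default window are assumptions introduced here, since the text does not say how a notification-class number is recognised.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Iterable, List


@dataclass(frozen=True)
class Sms:
    """Hypothetical record for one history short message."""
    sender: str
    recipient: str
    body: str
    received_at: datetime


# Assumption: notification-class senders are recognised by a number prefix.
NOTIFICATION_PREFIXES = ("106",)


def is_notification_number(sender: str) -> bool:
    return sender.startswith(NOTIFICATION_PREFIXES)


def build_sample_set(
    history: Iterable[Sms],
    now: datetime,
    window: timedelta = timedelta(days=30),  # assumed preset time period
) -> List[Sms]:
    start = now - window
    # A1: keep messages inside the preset period.
    # A2/A3: keep those whose sender number is notification-class.
    return [
        m for m in history
        if start <= m.received_at <= now and is_notification_number(m.sender)
    ]
```

Any other classifier could replace `is_notification_number` without changing the rest of the pipeline.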
Regarding step 102, titles used for identifying number attribution information can be extracted from the sample short messages of the set. The attribution information of a number may be the name of the party the number belongs to, for example the name of the company or organization that owns the number. It may also describe the purpose of the number's communications, for example that the purpose of a short message is "reminder".
There are many ways to extract a title, for example word segmentation or keyword matching. The disclosure gives one method, which extracts the title through a special symbol group and comprises:
B1: when a sample short message in the set contains a special symbol group, extract the information enclosed by the special symbol group from the sample short message, and determine the title of the sample short message according to the extracted information.
B2: when a sample short message in the set does not contain the special symbol group, determine the title of the sample short message to be empty information.
The special symbol group may be a pair of annotation symbols such as brackets, including braces "{ }", square brackets "[ ]", round brackets "( )", hexagonal brackets "〔 〕", angle brackets "< >", and square-head brackets "【 】".
This embodiment describes one concrete way of extracting the title. When a sample short message contains the special symbol group, the information enclosed by the group can be extracted from the message content, for example with a regular expression. The extracted information is part of the message content: when the special symbol group is square brackets, for instance, the content inside the square brackets is extracted. When a sample short message does not contain the special symbol group, its title can be determined to be empty information. Empty information may be the absence of any character, or a specific character that indicates that the attribution information is empty.
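The regular-expression approach mentioned above can be sketched as follows. The bracket list, the choice of the first (leftmost) match as the title, and the empty string as "empty information" are assumptions made for illustration; the names `BRACKET_PAIRS`, `EMPTY_TITLE`, and `extract_title` are hypothetical.

```python
import re

# Assumption: the empty string stands for "empty information".
EMPTY_TITLE = ""

# Assumed set of special symbol groups (opening, closing).
BRACKET_PAIRS = [
    ("【", "】"), ("[", "]"), ("〔", "〕"),
    ("{", "}"), ("<", ">"), ("(", ")"),
]

# One alternative per bracket pair; each captures the non-empty text
# between an opening symbol and its matching closing symbol.
_TITLE_RE = re.compile(
    "|".join(
        "{o}([^{o}{c}]+){c}".format(o=re.escape(o), c=re.escape(c))
        for o, c in BRACKET_PAIRS
    )
)


def extract_title(body: str) -> str:
    """Return the text inside the first special symbol group, if any."""
    match = _TITLE_RE.search(body)
    if match is None:
        return EMPTY_TITLE
    # Exactly one capture group is set: the one for the pair that matched.
    return next(g for g in match.groups() if g is not None).strip()
```

Round brackets often enclose ordinary remarks rather than a sender name, so a practical rule set might drop that pair or require the group to sit at the start or end of the message.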
Regarding step 103, after title extraction every sample short message in the set has a corresponding title. The titles of the sample short messages corresponding to the same sending-party number can be merged to obtain the merged information.
Merging can be done in two ways. All titles in a sub-sample short message set can be merged directly to obtain the merged information of that set's sending-party number. Alternatively, the recipient numbers corresponding to each title in the sub-sample short message set can be counted to obtain a sub-sample short message merged set, in which each entry comprises the sending-party number, a title, and the count of recipient numbers for that title. In both cases a sub-sample short message set comprises the sample short messages corresponding to the same sending-party number.
In addition, for subsample note collection, number of sender can be the sender number that sample note concentrates all sample notes, then sample note corresponding for same number of sender can be combined as subsample note collection, corresponding subsample note collection is had for each sender number, the title of the sample note that can directly subsample note be concentrated merges, and obtains the pooling information that each sender number is corresponding.But owing to concentrating in sample note, title may be there is in the short message content of sample note, also title may not be there is, the subsample note that then not necessarily each sender number is corresponding is concentrated exists title, then adopting said method may there is pooling information is empty situation, increases amount of calculation.
To avoid this situation, steps C1 and C2 may additionally be performed before the titles of the sample short messages corresponding to the same short message sender number are merged:
C1: determining, as short message sender numbers, the sender numbers of the sample short messages in the sample short message set that contain the special symbol group.
C2: filtering out from the sample short message set the sub-sample short message set corresponding to each short message sender number, where each sample short message in the sub-sample short message set comprises the short message sender number, a short message recipient number and a title.
Because the special symbol group encloses annotation information about the short message, the sender number of a short message containing the special symbol group can be regarded as a short message sender number that has an associated title.
In the sample short message set, the content of a sample short message may or may not contain the special symbol group. This embodiment filters out the sample short messages that contain the special symbol group and determines their sender numbers to be short message sender numbers; multiple short message sender numbers form a short message sender number set. The sub-sample short message sets are then divided according to the short message sender numbers. This ensures that every short message sender number has at least one title and thereby avoids empty merged information.
Here, filtering out the sub-sample short message set corresponding to each short message sender number means selecting from the sample short message set the sample short messages whose sender number is that short message sender number, which yields the sub-sample short message set for that number. When there are multiple short message sender numbers, a sub-sample short message set is filtered out for each, so that each short message sender number corresponds to one sub-sample short message set. For each sub-sample short message set, step 103 can be performed to obtain the merged sub-sample short message set for that short message sender number.
Every sample short message in a sub-sample short message set has a corresponding title; there may be one title, several titles, or none (empty). Because different sample short messages in the same sub-sample short message set may have different titles, an association between each sample short message and its title can be established. For example, a triple can be created for each sample short message, comprising the short message sender number, the short message recipient number and the title. It can be understood that, within one sub-sample short message set, the short message sender number is the same in every triple.
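Steps C1 and C2, together with the triples just described, might be sketched as follows. The sketch assumes that messages arrive as (sender, recipient, content) tuples and that square brackets are the only special symbol group; the names `titles_of` and `build_subsample_sets` are hypothetical. It also treats a message as containing the special symbol group only when the brackets enclose some text.

```python
import re
from collections import defaultdict


def titles_of(content):
    """Texts enclosed in square brackets; an empty list means empty information."""
    return [t.strip() for t in re.findall(r"\[([^\]]*)\]", content) if t.strip()]


def build_subsample_sets(messages):
    """Group (sender, recipient, content) tuples into per-sender triple lists."""
    triples = [(s, r, titles_of(c)) for s, r, c in messages]

    # C1: only senders with at least one titled message become
    # short message sender numbers.
    target_senders = {s for s, _, titles in triples if titles}

    # C2: collect ALL messages of those senders, including untitled ones.
    subsets = defaultdict(list)
    for s, r, titles in triples:
        if s in target_senders:
            subsets[s].append((s, r, titles))
    return dict(subsets)
```

Keeping the untitled messages in each group matters for the later counting step, because they contribute to the empty-title entry of the merged set.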
In this step, the sender numbers of the sample short messages that contain the special symbol group are determined to be short message sender numbers, which avoids empty merged information. In addition, the sub-sample short message set for each short message sender number is filtered out from the whole sample short message set, so that all sample short messages corresponding to the short message sender number are included. A sub-sample short message set therefore contains both sample short messages that include the special symbol group and sample short messages that do not. This improves the accuracy of the subsequent identification of the attribution information, because it avoids the error that would arise, when many sample short messages lack the special symbol group, from determining the attribution information solely from the sample short messages that contain it.
It can further be understood that step 102, step C1 and step C2 may be carried out in either order. The title of every sample short message in the sample short message set may be extracted first and the sub-sample short message sets divided according to the extraction result; alternatively, the sub-sample short message sets may be divided first and the title of each sample short message then extracted from them. In both cases the aim is to obtain sub-sample short message sets that comprise the short message sender number, the short message recipient number and the title.
For example, the titles of all sample short messages may be extracted first, so that each sample short message comprises a sender number, a recipient number and a title. The sender numbers of the sample short messages that contain the special symbol group are then determined to be short message sender numbers, and the sub-sample short message set corresponding to each short message sender number is filtered out from the sample short message set, each sample short message in it comprising the short message sender number, the short message recipient number and the title.
As another example, after a sample short message set comprising message content, sender number and recipient number is obtained, the sender numbers of the sample short messages that contain the special symbol group may be determined to be short message sender numbers. An initial sub-sample short message set corresponding to each short message sender number is then filtered out from the sample short message set, each sample short message in it comprising the message content, the sender number and the recipient number. Finally, the title used for identifying number attribution information is extracted from the content of each sample short message in the initial sub-sample short message set, yielding the final sub-sample short message set for that short message sender number, in which each sample short message comprises the short message sender number, the short message recipient number and the title.
On this basis, once sub-sample short message sets comprising the short message sender number, the short message recipient number and the title have been obtained by the above method, the titles of the sample short messages corresponding to the same short message sender number can be merged according to the titles in the sub-sample short message set.
In one optional implementation, after such a sub-sample short message set has been obtained, merging the titles of the sample short messages corresponding to the same short message sender number may comprise: calculating the number of short message recipient numbers corresponding to each title in the sub-sample short message set to obtain a merged sub-sample short message set, in which each entry comprises the short message sender number, a title and the number of short message recipient numbers.
Because the sub-sample short message set records the short message sender number, the short message recipient number and the title of every sample short message, the number of short message recipient numbers corresponding to each title can be counted. This yields a merged sub-sample short message set that associates the short message sender number, each title and the corresponding number of short message recipient numbers.
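The counting step could look like the sketch below. It assumes that each entry of the sub-sample set is a (sender, recipient, titles) triple, that a recipient is counted once per title however many messages it received, and that empty information is represented by the empty string; `merge_titles` is a hypothetical name.

```python
from collections import defaultdict


def merge_titles(subset):
    """Map each title to the number of distinct recipient numbers that received it.

    `subset` holds (sender, recipient, titles) triples for ONE sender number.
    Messages without a title are counted under the empty string.
    """
    recipients_by_title = defaultdict(set)
    for _sender, recipient, titles in subset:
        for title in (titles or [""]):
            recipients_by_title[title].add(recipient)
    return {title: len(rs) for title, rs in recipients_by_title.items()}
```

Since all triples share the same sender number, the returned mapping together with that number carries the same information as a list of <sender number, title, count> entries.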
In another optional implementation, after such a sub-sample short message set has been obtained, all titles in the sub-sample short message set may be merged to obtain the merged information for the short message sender number of that set.
Regarding step 104: in one optional implementation, the attribution information of the short message sender number can be determined directly from the merged information. For example, when the titles in the sub-sample short message set are merged directly in step 103, the merged titles are used as the attribution information of the short message sender number of that set. This approach is suitable when there are few titles and little empty information, and it determines the attribution information efficiently.
In another optional implementation, when a merged sub-sample short message set comprising the short message sender number, titles and numbers of short message recipient numbers is obtained in step 103, it can be judged whether the number of short message recipient numbers corresponding to each title in the merged set is greater than a number threshold. The titles whose number of short message recipient numbers is greater than the number threshold are used as the attribution information of the short message sender number. This approach reduces the amount of attribution information.
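The threshold-based selection amounts to a simple filter over the merged set. In this sketch the merged set is assumed to be a mapping from title to recipient-number count, and `titles_above_threshold` is a hypothetical name.

```python
def titles_above_threshold(counts, number_threshold):
    """Keep the titles whose recipient-number count exceeds the threshold."""
    return [title for title, n in counts.items() if n > number_threshold]
```

Note that the comparison is strict, following the wording "greater than a number threshold".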
In yet another optional implementation, when a merged sub-sample short message set comprising the short message sender number, titles and numbers of short message recipient numbers is obtained in step 103, the probability value of each title within the merged sub-sample short message set can be calculated with the following formula:
$$P(\mathrm{title}_i) = \frac{C(\mathrm{title}_i)}{\sum_{k=1}^{n} C(\mathrm{title}_k)}, \quad i \in (1, n)$$
where $P(\mathrm{title}_i)$ denotes the probability value of title $\mathrm{title}_i$ in the merged sub-sample short message set, $C(\mathrm{title}_i)$ denotes the number of short message recipient numbers corresponding to title $\mathrm{title}_i$ in the merged set, $C(\mathrm{title}_k)$ denotes the number of short message recipient numbers corresponding to title $\mathrm{title}_k$ in the merged set, and $n$ denotes the number of titles in the merged set. The titles whose probability value is greater than a probability threshold are determined to be the attribution information of the short message sender number.
After the number of short message recipient numbers corresponding to each title has been determined, the above formula is used to calculate the probability value of each title in the merged sub-sample short message set, so that titles with a higher probability are determined to be the attribution information of the short message sender number.
This embodiment calculates the probability of each title within the merged sub-sample short message set from the number of short message recipient numbers corresponding to the title, and determines the titles with a higher probability to be the attribution information of the short message sender number. This improves the accuracy of the determined attribution information and reduces its quantity, which is convenient for the user.
Further, the step of determining the titles whose probability value is greater than the probability threshold to be the attribution information may comprise: selecting the maximum probability value from the calculated probability values and, when the maximum probability value is greater than the probability threshold, determining the title corresponding to the maximum probability value to be the attribution information of the short message sender number. In this way each number has a single piece of attribution information, which is more convenient still for the user.
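The probability calculation and the maximum-probability selection can be sketched as below. The merged set is again assumed to be a mapping from title to recipient-number count; the function names and the use of `None` for "no attribution determined" are choices made for this example.

```python
def title_probabilities(counts):
    """P(title_i) = C(title_i) / sum over k of C(title_k)."""
    total = sum(counts.values())
    if total == 0:
        return {}
    return {title: n / total for title, n in counts.items()}


def pick_attribution(counts, probability_threshold):
    """Return the most probable title if it exceeds the threshold, else None."""
    probabilities = title_probabilities(counts)
    if not probabilities:
        return None
    best = max(probabilities, key=probabilities.get)
    return best if probabilities[best] > probability_threshold else None
```

For counts of 3 and 1, for instance, the probabilities are 0.75 and 0.25, so a threshold of 0.5 selects the first title and a threshold of 0.8 selects nothing.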
Further, before the probability value of each title in the merged sub-sample short message set is calculated, the method may also comprise: judging whether the number of short message recipient numbers corresponding to each title in the merged sub-sample short message set is less than a number threshold, and deleting the titles whose number of short message recipient numbers is less than the number threshold. The probability values are then calculated for the titles remaining in the merged sub-sample short message set after deletion, which reduces the amount of computation.
The technical features of the above embodiments can be combined arbitrarily as long as the combination involves no conflict or contradiction. For reasons of space the combinations are not described one by one, but any combination of the technical features of the above embodiments also falls within the scope disclosed in this specification.
The present disclosure also gives one specific example. In this example, the method for identifying the attribution information of a number comprises:
S1: obtaining a notification-type sample short message set S.
S2: filtering out from the notification-type sample short message set the sample short messages that contain the special symbol group to obtain a short message set $S_{sub}$. The sender numbers in $S_{sub}$ are taken as short message sender numbers, giving a short message sender number set N (number(1), number(2), ..., number(t), ...).
The following operations S3 to S7 are performed for each short message sender number.
S3: filtering out from the notification-type sample short message set S the initial sub-sample short message set $S_{number}$ whose sender number is number(t). Every sample short message in $S_{number}$ may comprise a triple: <number(t), short message recipient number, message content>. For example:
Triple 1: <106988888888, 13488888888, "[Tencent Technology] [QQ Mailbox mail reminder] Sender: Mr. Zhang, Taobao. Subject: ...">.
Triple 2: <106988888888, 13444444444, "The verification code for this operation is 5889 (valid for 20 minutes). Please complete the verification. [Tencent Technology] [warm reminder]">.
Triple 3: <106988888888, 13455555555, "Mr. Zhang has paid 150.00 yuan to you (134*5555). Check it now. [Alipay]">.
Triple 4: <106988888888, 13466666666, "Dear customer, Mr. Zhang called you at 10:30 on May 10. Please reply promptly.">.
S4: when a sample short message in the initial sub-sample short message set contains the special symbol group, the information between the symbols of the group is extracted from the sample short message by a regular expression and the title of the sample short message is determined according to the extracted information; when a sample short message does not contain the special symbol group, its title is determined to be empty information. The association between the short message sender number, the short message recipient number and the title of every sample short message is established from the determined titles. The triples in the above example can then be replaced with new triples <number(t), short message recipient number, title>, giving the final sub-sample short message set:
New triple 1: <106988888888, 13488888888, {Tencent Technology, QQ Mailbox mail reminder}>.
New triple 2: <106988888888, 13444444444, {Tencent Technology, warm reminder}>.
New triple 3: <106988888888, 13455555555, {Alipay}>.
New triple 4: <106988888888, 13466666666, { }>.
S5: calculating the number of short message recipient numbers corresponding to each title in the sub-sample short message set, that is, by how many numbers each title was received. In the above example, "Tencent Technology" was received by "13488888888" and "13444444444", so the number of recipient numbers corresponding to "Tencent Technology" is 2. The association between the short message sender number, each title and its number of recipient numbers can thus be generated, giving the following merged sub-sample short message set:
<106988888888, "Tencent Technology", 2>;
<106988888888, "QQ Mailbox mail reminder", 1>;
<106988888888, "warm reminder", 1>;
<106988888888, "Alipay", 1>;
<106988888888, "", 1>.
S6: a number threshold may be preset, and the titles whose number of recipient numbers is less than the number threshold are deleted. For example, with the number threshold set to 2, the titles whose number of recipient numbers is less than 2 are deleted, leaving the following information:
<106988888888, "Tencent Technology", 2>.
It can be understood that if only one title remains after step S6, that title can be determined directly to be the attribution information. If several titles remain, step S7 can be performed for a further screening.
S7: calculating the probability value of each title within the merged sub-sample short message set with the following formula:
$$P(\mathrm{title}_i) = \frac{C(\mathrm{title}_i)}{\sum_{k=1}^{n} C(\mathrm{title}_k)}, \quad i \in (1, n)$$
where $P(\mathrm{title}_i)$ denotes the probability value of title $\mathrm{title}_i$ in the merged sub-sample short message set, $C(\mathrm{title}_i)$ denotes the number of short message recipient numbers corresponding to title $\mathrm{title}_i$ in the merged set, $C(\mathrm{title}_k)$ denotes the number of short message recipient numbers corresponding to title $\mathrm{title}_k$ in the merged set, and $n$ denotes the number of titles in the merged set. The titles whose probability value is greater than a probability threshold are determined to be the attribution information of the short message sender number.
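Steps S2 to S7 can be strung together as in the following sketch, which reuses the four example messages. It is an illustration under several assumptions rather than the implementation of the disclosure: only square brackets count as the special symbol group, recipients are counted once per title, the maximum-probability variant is used in S7, and the names `identify_attribution`, `MESSAGES`, `count_threshold` and `probability_threshold` as well as the default threshold values are invented for the example.

```python
import re
from collections import defaultdict

TITLE_RE = re.compile(r"\[([^\]]+)\]")

SENDER = "106988888888"
MESSAGES = [
    (SENDER, "13488888888",
     "[Tencent Technology][QQ Mailbox mail reminder] Sender: Mr. Zhang ..."),
    (SENDER, "13444444444",
     "Your code is 5889 (valid for 20 minutes). [Tencent Technology][warm reminder]"),
    (SENDER, "13455555555", "Mr. Zhang has paid 150.00 yuan to you. [Alipay]"),
    (SENDER, "13466666666", "Dear customer, Mr. Zhang called you at 10:30."),
]


def identify_attribution(messages, count_threshold=2, probability_threshold=0.5):
    """Map each short message sender number to one title, where one qualifies.

    `messages` must be a list of (sender, recipient, content) tuples.
    """
    # S2: senders with at least one bracketed message.
    senders = {s for s, _, content in messages if TITLE_RE.search(content)}
    result = {}
    for sender in senders:
        # S3-S5: count distinct recipients per title ("" = empty information).
        recipients = defaultdict(set)
        for s, recipient, content in messages:
            if s != sender:
                continue
            for title in TITLE_RE.findall(content) or [""]:
                recipients[title.strip()].add(recipient)
        counts = {t: len(rs) for t, rs in recipients.items()}
        # S6: delete titles whose count is below the number threshold.
        kept = {t: n for t, n in counts.items() if n >= count_threshold}
        if not kept:
            continue
        # S7: probability of the most frequent remaining title.
        best = max(kept, key=kept.get)
        if kept[best] / sum(kept.values()) > probability_threshold:
            result[sender] = best
    return result
```

With the example data only "Tencent Technology" survives step S6, so its probability in S7 is 2/2 = 1 and it is selected for sender number 106988888888.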
Corresponding to the foregoing embodiments of the method for identifying the attribution information of a number, the present disclosure also provides embodiments of a device for identifying the attribution information of a number and of a terminal to which the device is applied.
Fig. 3 is a block diagram of a device for identifying the attribution information of a number according to an exemplary embodiment of the present disclosure. The device comprises: a short message set acquisition module 31, a title extraction module 32, a title merging module 33 and a first attribution information identification module 34.
The short message set acquisition module 31 is configured to obtain a sample short message set.
The title extraction module 32 is configured to extract, from the sample short messages of the sample short message set, titles used for identifying number attribution information.
The title merging module 33 is configured to merge the titles of the sample short messages corresponding to the same short message sender number.
The first attribution information identification module 34 is configured to identify the attribution information of the short message sender number according to the merged information.
As can be seen from the above embodiment, a sample short message set is obtained, titles used for identifying number attribution information are extracted from its sample short messages, the titles of the sample short messages corresponding to the same short message sender number are merged, and the attribution information of the short message sender number is identified according to the merged information. The attribution information of the short message sender numbers in the sample short message set is thus identified automatically, which avoids the waste of human resources caused by manual identification and improves identification efficiency.
Fig. 4 is a block diagram of another device for identifying the attribution information of a number according to an exemplary embodiment of the present disclosure. On the basis of the embodiment shown in Fig. 3, the short message set acquisition module 31 in this embodiment comprises: a short message acquisition submodule 311, a number identification submodule 312 and a sample short message set determination submodule 313.
The short message acquisition submodule 311 is configured to obtain the historical short messages within a preset time period.
The number identification submodule 312 is configured to identify the sender numbers of the historical short messages.
The sample short message set determination submodule 313 is configured to determine the historical short messages whose sender number is a notification-type short message number to be sample short messages, thereby obtaining the sample short message set.
As can be seen from the above embodiment, because the sender numbers of notification-type short messages differ from the sender numbers of other ordinary short messages, this embodiment can identify from the sender number whether a historical short message is a notification-type short message. The notification-type short messages are determined to be sample short messages to obtain the sample short message set, which improves the efficiency of the subsequent identification of number attribution information.
Fig. 5 is a block diagram of another device for identifying the attribution information of a number according to an exemplary embodiment of the present disclosure. On the basis of the embodiment shown in Fig. 3, the title extraction module 32 in this embodiment comprises a title extraction submodule 321.
The title extraction submodule 321 is configured to: when a sample short message in the sample short message set contains a special symbol group, extract the information between the symbols of the special symbol group from the sample short message and determine the title of the sample short message according to the extracted information; and when a sample short message in the sample short message set does not contain the special symbol group, determine the title of the sample short message to be empty information.
As can be seen from the above embodiment, when a sample short message contains the special symbol group, the information between the symbols of the group can be extracted from the sample short message and the title determined according to the extracted information; when a sample short message does not contain the special symbol group, its title can be determined to be empty information. This improves the efficiency of determining the titles of the sample short messages.
Fig. 6 is a block diagram of another device for identifying the attribution information of a number according to an exemplary embodiment of the present disclosure. On the basis of the embodiment shown in Fig. 5, the device in this embodiment further comprises: a short message sender number determination module 35 and a sub-sample short message set determination module 36.
The short message sender number determination module 35 is configured to determine, as short message sender numbers, the sender numbers of the sample short messages in the sample short message set that contain the special symbol group.
The sub-sample short message set determination module 36 is configured to filter out from the sample short message set the sub-sample short message set corresponding to each short message sender number, where each sample short message in the sub-sample short message set comprises the short message sender number, a short message recipient number and a title.
As can be seen from the above embodiment, the sender numbers of the sample short messages that contain the special symbol group are determined to be short message sender numbers, and the sub-sample short message set corresponding to each short message sender number is filtered out from the sample short message set, so that all sample short messages corresponding to the short message sender number are included. A sub-sample short message set therefore contains both sample short messages that include the special symbol group and sample short messages that do not. This improves the accuracy of the subsequent identification of the attribution information, because it avoids the error that would arise, when many sample short messages lack the special symbol group, from determining the attribution information solely from the sample short messages that contain it.
Fig. 7 is a block diagram of another device for identifying the attribution information of a number according to an exemplary embodiment of the present disclosure. On the basis of the embodiment shown in Fig. 3, the title merging module 33 in this embodiment comprises a merged set determination submodule 331.
The merged set determination submodule 331 is configured to calculate the number of short message recipient numbers corresponding to each title in a sub-sample short message set to obtain a merged sub-sample short message set, in which each entry comprises the short message sender number, a title and the number of short message recipient numbers, where the sub-sample short message set comprises the sample short messages corresponding to the same short message sender number.
Fig. 8 is a block diagram of another device for identifying the attribution information of a number according to an exemplary embodiment of the present disclosure. On the basis of the embodiment shown in Fig. 7, the first attribution information identification module 34 in this embodiment comprises: a probability value calculation submodule 341 and an attribution information determination submodule 342.
The probability value calculation submodule 341 is configured to calculate the probability value of each title within the merged sub-sample short message set with the following formula:
$$P(\mathrm{title}_i) = \frac{C(\mathrm{title}_i)}{\sum_{k=1}^{n} C(\mathrm{title}_k)}, \quad i \in (1, n)$$
where $P(\mathrm{title}_i)$ denotes the probability value of title $\mathrm{title}_i$ in the merged sub-sample short message set, $C(\mathrm{title}_i)$ denotes the number of short message recipient numbers corresponding to title $\mathrm{title}_i$ in the merged set, $C(\mathrm{title}_k)$ denotes the number of short message recipient numbers corresponding to title $\mathrm{title}_k$ in the merged set, and $n$ denotes the number of titles in the merged set.
The attribution information determination submodule 342 is configured to determine the titles whose probability value is greater than a probability threshold to be the attribution information of the short message sender number.
As can be seen from the above embodiment, the probability of each title within the merged sub-sample short message set is calculated from the number of short message recipient numbers corresponding to the title, and the titles with a higher probability are determined to be the attribution information of the short message sender number. This improves the accuracy of the determined attribution information and reduces its quantity, which is convenient for the user.
Fig. 9 is a block diagram of another device for identifying the attribution information of a number according to an exemplary embodiment of the present disclosure. On the basis of the embodiment shown in Fig. 8, the first attribution information identification module 34 in this embodiment further comprises an attribution information filtering submodule 343.
The attribution information filtering submodule 343 is configured to judge whether the number of short message recipient numbers corresponding to each title in the merged sub-sample short message set is less than a number threshold, and to delete the titles whose number of short message recipient numbers is less than the number threshold.
As can be seen from the above embodiment, before the probability value of each title in the merged sub-sample short message set is calculated, it is judged whether the number of short message recipient numbers corresponding to each title is less than the number threshold, and the titles whose number of short message recipient numbers is less than the number threshold are deleted. The probability values are then calculated for the titles remaining in the merged sub-sample short message set after deletion, which reduces the amount of computation.
Fig. 10 is a block diagram of another device for identifying the attribution information of a number according to an exemplary embodiment of the present disclosure. On the basis of the embodiment shown in Fig. 3, the device in this embodiment further comprises: an association determination module 37 and a second attribution information identification module 38.
The association determination module 37 is configured to determine the association between short message sender numbers and attribution information according to the attribution information of each short message sender number.
The second attribution information identification module 38 is configured to identify a target number to be identified according to the association between sender numbers and attribution information, and to determine the attribution information of the target number. The target number comprises a number to be called by a calling party, a number received by a called party, a number to which a short message sending party is to send a message, or a number received by a short message receiving party.
Correspondingly, the present disclosure also provides a device for identifying the attribution information of a number. The device comprises a processor and a memory for storing processor-executable instructions, wherein the processor is configured to:
obtain a sample short message set;
extract, from the sample short messages of the sample short message set, titles used for identifying number attribution information;
merge the titles of the sample short messages corresponding to the same short message sender number; and
identify the attribution information of the short message sender number according to the merged information.
For the implementation of the functions and effects of each unit in the above device, refer to the implementation of the corresponding steps in the above method; details are not repeated here.
Because the device embodiments substantially correspond to the method embodiments, refer to the description of the method embodiments for relevant details. The device embodiments described above are merely schematic. The units described as separate components may or may not be physically separate, and the components shown as units may or may not be physical units; they may be located in one place or distributed over multiple network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of the present disclosure. Those of ordinary skill in the art can understand and implement it without creative effort.
Fig. 11 is a schematic structural diagram of a device 1100 for identifying the attribution information of a number according to an exemplary embodiment. For example, the device 1100 may be provided as a server. Referring to Fig. 11, the device 1100 comprises a processing component 1122, which further comprises one or more processors, and memory resources represented by a memory 1132 for storing instructions executable by the processing component 1122, such as application programs. The application programs stored in the memory 1132 may comprise one or more modules, each corresponding to a set of instructions. In addition, the processing component 1122 is configured to execute the instructions so as to perform the above method for identifying the attribution information of a number.
The device 1100 may also comprise a power supply component 1126 configured to perform power management of the device 1100, a wired or wireless network interface 1150 configured to connect the device 1100 to a network, and an input/output (I/O) interface 1158. The device 1100 can operate based on an operating system stored in the memory 1132, such as Windows Server™, Mac OS X™, Unix™, Linux™, FreeBSD™ or the like.
Those skilled in the art, at consideration specification and after putting into practice invention disclosed herein, will easily expect other embodiment of the present disclosure.The disclosure is intended to contain any modification of the present disclosure, purposes or adaptations, and these modification, purposes or adaptations are followed general principle of the present disclosure and comprised the undocumented common practise in the art of the disclosure or conventional techniques means.Specification and embodiment are only regarded as exemplary, and true scope of the present disclosure and spirit are pointed out by claim below.
It should be understood that the present disclosure is not limited to the precise structures described above and illustrated in the accompanying drawings, and that various modifications and changes may be made without departing from its scope. The scope of the present disclosure is limited only by the appended claims.
The foregoing is merely the preferred embodiments of the present disclosure and is not intended to limit the present disclosure. Any modification, equivalent replacement, improvement, or the like made within the spirit and principles of the present disclosure shall fall within the scope of protection of the present disclosure.

Claims (17)

CN201510728723.8A | 2015-10-30 | 2015-10-30 | Method and device used for identifying number attribution information | Active | CN105430654B (en)

Priority Applications (1)

Application Number | Priority Date | Filing Date | Title
CN201510728723.8A (CN105430654B (en)) | 2015-10-30 | 2015-10-30 | Method and device used for identifying number attribution information

Applications Claiming Priority (1)

Application Number | Priority Date | Filing Date | Title
CN201510728723.8A (CN105430654B (en)) | 2015-10-30 | 2015-10-30 | Method and device used for identifying number attribution information

Publications (2)

Publication Number | Publication Date
CN105430654A (en) | 2016-03-23
CN105430654B (en) | 2018-12-11

Family

ID=55508523

Family Applications (1)

Application Number | Status | Publication | Priority Date | Filing Date | Title
CN201510728723.8A | Active | CN105430654B (en) | 2015-10-30 | 2015-10-30 | Method and device used for identifying number attribution information

Country Status (1)

Country | Link
CN (1) | CN105430654B (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
CN106101464A (en)* | 2016-05-26 | 2016-11-09 | 北京小米移动软件有限公司 | Number mark method and device
CN108494977A (en)* | 2018-02-09 | 2018-09-04 | 北京泰迪熊移动科技有限公司 | The recognition methods of note number, device and system
CN109561402A (en)* | 2017-09-26 | 2019-04-02 | 中国电信股份有限公司 | Information acquisition method, device and mobile terminal
CN113810547A (en)* | 2020-06-16 | 2021-12-17 | 中国移动通信集团重庆有限公司 | Method, device and computing device for voice call security protection

Citations (3)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
WO2011126506A1 (en)* | 2010-04-07 | 2011-10-13 | Apple Inc. | Transitioning between circuit switched calls and video calls
CN103369095A (en)* | 2012-03-30 | 2013-10-23 | 北京千橡网景科技发展有限公司 | Method and device for type identification of incoming call or text message
CN104618877A (en)* | 2015-01-30 | 2015-05-13 | 广东欧珀移动通信有限公司 | Short message arranging method and device

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
CN106101464A (en)* | 2016-05-26 | 2016-11-09 | 北京小米移动软件有限公司 | Number mark method and device
CN109561402A (en)* | 2017-09-26 | 2019-04-02 | 中国电信股份有限公司 | Information acquisition method, device and mobile terminal
CN108494977A (en)* | 2018-02-09 | 2018-09-04 | 北京泰迪熊移动科技有限公司 | The recognition methods of note number, device and system
CN113810547A (en)* | 2020-06-16 | 2021-12-17 | 中国移动通信集团重庆有限公司 | Method, device and computing device for voice call security protection
CN113810547B (en)* | 2020-06-16 | 2023-12-15 | 中国移动通信集团重庆有限公司 | Method, device and computing equipment for voice call security protection

Also Published As

Publication number | Publication date
CN105430654B (en) | 2018-12-11

Similar Documents

Publication | Title
EP2873204B1 (en) | Method and system for delivering reminder information
CN101729639B (en) | The call record method of mobile terminal and device
CN102857636B (en) | A kind of address list method of operation of mobile terminal and operating system
WO2013166922A1 (en) | Information processing method and terminal
CN105430654A (en) | Method and device used for identifying number attribution information
CN104407873A (en) | Method and device based on calendar management application
CN105072238A (en) | Method and apparatus for creating contact list according to note information of newly-added number
CN104754151A (en) | Bank queuing communication method and system
CN104994209A (en) | Contact information obtaining method based on communication software chatting records and system
CN103167089A (en) | Maintenance method of a mobile terminal and its address book
CN107920154A (en) | The processing method and terminal of Stranger Calls
CN102883289A (en) | Communication processing method, client and mobile terminal
CN102547614A (en) | Automatic reminding method based on mobile phone and mobile phone
CN103220211A (en) | Method, device and mobile terminal for processing SNS messages
CN103037355B (en) | The method for remote updating of mobile terminal addressbook and numbering directory management server
CN103037338A (en) | Signature inserting method of contact information and communication terminal
CN108206893A (en) | call processing method and device
WO2014023182A1 (en) | Method and terminal for processing message service
US9313327B2 (en) | Method and apparatus for managing contact information
CN104468976A (en) | Method and device for intelligently prompting user to send message
CN104836881B (en) | Information control method and electronic equipment
CN103249016A (en) | Method and device for displaying short message and mobile terminal
CN104796519A (en) | Terminal
CN101345955A (en) | Mobile terminal reminding method
CN109508937A (en) | Express delivery information reminding method and device, storage medium and express delivery cabinet system

Legal Events

Code | Title
C06 | Publication
PB01 | Publication
C10 | Entry into substantive examination
SE01 | Entry into force of request for substantive examination
GR01 | Patent grant
GR01 | Patent grant
