Movatterモバイル変換


[0]ホーム

URL:


US20160004770A1 - Generation and use of an email frequent word list - Google Patents

Generation and use of an email frequent word list
Download PDF

Info

Publication number
US20160004770A1
US20160004770A1US14/857,517US201514857517AUS2016004770A1US 20160004770 A1US20160004770 A1US 20160004770A1US 201514857517 AUS201514857517 AUS 201514857517AUS 2016004770 A1US2016004770 A1US 2016004770A1
Authority
US
United States
Prior art keywords
word list
frequent word
words
mailbox
universal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US14/857,517
Inventor
Ashish Consul
Suryanarayana Murty Gorti
Andrew E. Goodsell
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Technology Licensing LLC
Original Assignee
Microsoft Corp
Microsoft Technology Licensing LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Corp, Microsoft Technology Licensing LLCfiledCriticalMicrosoft Corp
Priority to US14/857,517priorityCriticalpatent/US20160004770A1/en
Publication of US20160004770A1publicationCriticalpatent/US20160004770A1/en
Assigned to MICROSOFT CORPORATIONreassignmentMICROSOFT CORPORATIONASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS).Assignors: CONSUL, ASHISH, GOODSELL, ANDREW E., GORTI, SURYANARAYANA MURTY
Assigned to MICROSOFT TECHNOLOGY LICENSING, LLCreassignmentMICROSOFT TECHNOLOGY LICENSING, LLCASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS).Assignors: MICROSOFT CORPORATION
Abandonedlegal-statusCriticalCurrent

Links

Images

Classifications

Definitions

Landscapes

Abstract

Technologies are described herein for generating a mailbox specific frequent word list associated with a mailbox. In one method, an index scan is performed on catalogs to retrieve search data mapping words to emails containing the words. The search data is provided across multiple mailboxes. A universal frequent word list is generated based on the search data. The mailbox specific frequent word list is generated based on the universal frequent word list.

Description

Claims (20)

What is claimed is:
1. A method for generating a mailbox specific frequent word list associated with a mailbox, comprising:
performing an index scan on catalogs to retrieve search data mapping words to emails containing the words, the search data provided across multiple mailboxes;
generating a universal frequent word list based on the search data; and
generating the mailbox specific frequent word list based on the universal frequent word list.
2. The method ofclaim 1, wherein performing an index scan on catalogs to retrieve search data mapping words to emails containing the words comprises:
receiving a request for the mailbox specific frequent word list;
upon receiving the request, determining whether the universal frequent word list has been created; and
upon determining that the universal frequent word list has not been created, performing the index scan on catalogs to retrieve search data mapping words to emails containing the words.
3. The method ofclaim 2, wherein performing an index scan on catalogs to retrieve search data mapping words to emails containing the words further comprises:
upon determining that the universal frequent word list has been created, determining whether the universal frequent word list is current;
upon determining that the universal frequent word list is not current, performing the index scan on catalogs to retrieve search data mapping words to emails containing the words; and
upon determining that the universal frequent word list is current, proceeding to generating the mailbox specific frequent word list based on the universal frequent word list.
4. The method ofclaim 2, wherein the search data maps each of the words to one or more email identifiers, each of which corresponds to one of the emails.
5. The method ofclaim 1, wherein generating a universal frequent word list based on the search data comprises:
determining a word frequency in the search data by counting a number of emails that contain each of the words; and
generating the universal frequent word list by mapping each of the words to the associated word frequency.
6. The method ofclaim 5, wherein the word frequency comprises a ratio between the number of emails in the mailbox that contain each of the words and a total number of emails in the mailbox.
7. The method ofclaim 1, wherein frequent word list for the mailbox is stored as a compressed folder associated item (“FAI”).
8. The method ofclaim 1, wherein the catalogs comprise a global catalog created and maintained by an email server across multiple domains.
9. The method ofclaim 1, further comprising generating tailored advertising based on the mailbox specific frequent word list.
10. The method ofclaim 1, further comprising adding at least a portion of the words in the mailbox specific frequent word list to a transcription dictionary, the transcription dictionary adapted to provide a vocabulary enabling a voice transcription application to phonetically map audio sequences to words in the transcription dictionary.
11. The method ofclaim 1, further comprising generating email tags based on the mailbox specific frequent word list, the email tags adapted to associate searchable categories to the emails.
12. A method for generating a mailbox specific frequent word list associated with a mailbox, comprising:
receiving a request for the mailbox specific frequent word list;
determining whether a universal frequent word list has been created, the universal frequent word list comprising a mapping of words to corresponding word frequencies across multiple mailboxes;
upon determining that the universal frequent word list has not been created, performing an index scan on a global catalog to retrieve an inverted index mapping words to email identifiers corresponding to emails containing the words;
generating the universal frequent word list based on the inverted index; and
generating the mailbox specific frequent word list based on the universal frequent word list by filtering the words and the corresponding word frequencies associated with the mailbox.
13. The method ofclaim 12, further comprising:
upon determining that the universal frequent word list has been created, determining whether the universal frequent word list is current;
upon determining that the universal frequent word list is not current, performing the index scan on the global catalog to retrieve the inverted index mapping words to email identifiers corresponding to emails containing the words; and
upon determining that the universal frequent word list is current, proceeding directly to generating the mailbox specific frequent word list based on the universal frequent word list by filtering the words and the corresponding word frequencies associated with the mailbox.
14. The method ofclaim 12, wherein receiving a request for the mailbox specific frequent word list comprises receiving the request for the mailbox specific frequent word list through an application programming interface.
15. The method ofclaim 12, wherein the request specifies a minimum value or a maximum value of the word frequencies associated with the words included in the mailbox specific frequent word list.
16. The method ofclaim 12, wherein the request specifies a number of words included in the mailbox specific frequent word list.
17. The method ofclaim 12, wherein the mailbox specific frequent word list is sorted according to the word frequencies.
18. A computer-readable storage medium having stored thereon an application programming interface for generating a mailbox specific frequent word list associated with a mailbox, the API comprising:
a first object adapted to receive, from a caller application, a request for the mailbox specific frequent word list;
a second object adapted to perform an index scan on a global catalog to retrieve an inverted index mapping words to email identifiers corresponding to emails containing the words;
a third object adapted to generate a universal frequent word list based on the inverted index, the universal frequent word list comprising a mapping of the words to corresponding word frequencies, each of the word frequencies specifying a number of emails that contain one of the words;
a fourth object adapted to generate the mailbox specific frequent word list based on the universal frequent word list by filtering the words and the corresponding word frequencies associated with the mailbox; and
a fifth object adapted to transmit the mailbox specific frequent word list to the caller application in response to the first object receiving the request.
19. The computer-readable storage medium ofclaim 18, wherein the caller application is associated with an advertising application adapted to generated tailored advertising based on the mailbox specific frequent word list.
20. The computer-readable storage medium ofclaim 18, wherein the caller application is associated with a voice transcription application adapted to add at least a portion of the words in the mailbox specific frequent word list to a transcription dictionary, the transcription dictionary adapted to provide a vocabulary enabling the voice transcription application to phonetically map audio sequences to words in the transcription dictionary.
US14/857,5172008-06-192015-09-17Generation and use of an email frequent word listAbandonedUS20160004770A1 (en)

Priority Applications (1)

Application NumberPriority DateFiling DateTitle
US14/857,517US20160004770A1 (en)2008-06-192015-09-17Generation and use of an email frequent word list

Applications Claiming Priority (2)

Application NumberPriority DateFiling DateTitle
US12/142,622US9165056B2 (en)2008-06-192008-06-19Generation and use of an email frequent word list
US14/857,517US20160004770A1 (en)2008-06-192015-09-17Generation and use of an email frequent word list

Related Parent Applications (1)

Application NumberTitlePriority DateFiling Date
US12/142,622ContinuationUS9165056B2 (en)2008-06-192008-06-19Generation and use of an email frequent word list

Publications (1)

Publication NumberPublication Date
US20160004770A1true US20160004770A1 (en)2016-01-07

Family

ID=41432273

Family Applications (2)

Application NumberTitlePriority DateFiling Date
US12/142,622Expired - Fee RelatedUS9165056B2 (en)2008-06-192008-06-19Generation and use of an email frequent word list
US14/857,517AbandonedUS20160004770A1 (en)2008-06-192015-09-17Generation and use of an email frequent word list

Family Applications Before (1)

Application NumberTitlePriority DateFiling Date
US12/142,622Expired - Fee RelatedUS9165056B2 (en)2008-06-192008-06-19Generation and use of an email frequent word list

Country Status (1)

CountryLink
US (2)US9165056B2 (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US8032537B2 (en)2008-12-102011-10-04Microsoft CorporationUsing message sampling to determine the most frequent words in a user mailbox
US20120143894A1 (en)*2010-12-022012-06-07Microsoft CorporationAcquisition of Item Counts from Hosted Web Services
WO2012090196A1 (en)*2010-12-302012-07-05Melamed GalMethod and system for processing content
KR101655876B1 (en)*2012-01-052016-09-09삼성전자 주식회사Operating Method For Conversation based on a Message and Device supporting the same
US9235565B2 (en)*2012-02-142016-01-12Facebook, Inc.Blending customized user dictionaries
WO2017131753A1 (en)*2016-01-292017-08-03Entit Software LlcText search of database with one-pass indexing including filtering

Family Cites Families (21)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US6029195A (en)1994-11-292000-02-22Herz; Frederick S. M.System for customized electronic identification of desirable objects
US6941304B2 (en)*1998-11-172005-09-06Kana Software, Inc.Method and apparatus for performing enterprise email management
US6829607B1 (en)*2000-04-242004-12-07Microsoft CorporationSystem and method for facilitating user input by automatically providing dynamically generated completion information
US6778941B1 (en)2000-11-142004-08-17Qualia Computing, Inc.Message and user attributes in a message filtering method and system
US8744835B2 (en)2001-03-162014-06-03Meaningful Machines LlcContent conversion method and apparatus
US6760694B2 (en)2001-03-212004-07-06Hewlett-Packard Development Company, L.P.Automatic information collection system using most frequent uncommon words or phrases
JP4489994B2 (en)2001-05-112010-06-23富士通株式会社 Topic extraction apparatus, method, program, and recording medium for recording the program
US7284191B2 (en)2001-08-132007-10-16Xerox CorporationMeta-document management system with document identifiers
NL1020670C2 (en)2002-05-242003-11-25Oce Tech Bv Determining a semantic image.
US7370034B2 (en)*2003-10-152008-05-06Xerox CorporationSystem and method for performing electronic information retrieval using keywords
WO2006014343A2 (en)*2004-07-022006-02-09Text-Tech, LlcAutomated evaluation systems and methods
US7461056B2 (en)*2005-02-092008-12-02Microsoft CorporationText mining apparatus and associated methods
US20060282303A1 (en)*2005-06-082006-12-14Microsoft CorporationDistributed organizational analyzer
US20070016614A1 (en)*2005-07-152007-01-18Novy Alon R JMethod and apparatus for providing structured data for free text messages
US7716217B2 (en)2006-01-132010-05-11Bluespace Software CorporationDetermining relevance of electronic content
US7574349B2 (en)*2006-03-292009-08-11Xerox CorporationStatistical language-model based system for detection of missing attachments
US7979469B2 (en)2006-06-142011-07-12Research In Motion LimitedHandheld electronic device and associated method employing a multiple-axis input device and arranging words of an existing message thread in various linguistic categories for selection during text entry
US20130246338A1 (en)*2006-09-152013-09-19Ashok DoddapaneniSystem and method for indexing a capture system
US8150798B2 (en)*2006-10-102012-04-03Wells Fargo Bank, N.A.Method and system for automated coordination and organization of electronic communications in enterprises
US7860885B2 (en)*2007-12-052010-12-28Palo Alto Research Center IncorporatedInbound content filtering via automated inference detection
US7836061B1 (en)*2007-12-292010-11-16Kaspersky Lab, ZaoMethod and system for classifying electronic text messages and spam messages

Also Published As

Publication numberPublication date
US9165056B2 (en)2015-10-20
US20090319483A1 (en)2009-12-24

Similar Documents

PublicationPublication DateTitle
US7333976B1 (en)Methods and systems for processing contact information
US8886598B1 (en)Tag-based synchronization
CN110268431B (en)Searching and filtering message content
US20160004770A1 (en)Generation and use of an email frequent word list
US9792356B2 (en)System and method for supporting natural language queries and requests against a user's personal data cloud
US10002183B2 (en)Resource efficient document search
US20170357661A1 (en)Providing content items in response to a natural language query
JP5313337B2 (en) Providing search results for mobile computing devices
US8458189B1 (en)Automatic tagging between structured/unstructured data
US8831951B2 (en)Verbal labels for electronic messages
KR102050528B1 (en)Providing actionable content to a computing device based on user actions
US8903929B2 (en)Forgotten attachment detection
US20130191738A1 (en)Intelligent automatic expansion/contraction of abbreviations in text-based electronic communications
US20050234848A1 (en)Methods and systems for information capture and retrieval
JP2018026178A (en) Data settings for user contact entries
CN102150158A (en)Method, system, and apparatus for arranging content search results
US8600970B2 (en)Server-side search of email attachments
US11481454B2 (en)Search engine results for low-frequency queries
US9123021B2 (en)Searching linked content using an external search system
US20140344259A1 (en)Answering people-related questions
US8612451B1 (en)Searching for data structures
US10999230B2 (en)Relevant content surfacing in computer productivity platforms
US20100250580A1 (en)Searching documents using a dynamically defined ignore string
US10691702B1 (en)Generating ranked lists of entities
US10956514B2 (en)System and method for directed analysis of content using artifical intelligence for storage and recall

Legal Events

DateCodeTitleDescription
ASAssignment

Owner name:MICROSOFT CORPORATION, WASHINGTON

Free format text:ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:CONSUL, ASHISH;GORTI, SURYANARAYANA MURTY;GOODSELL, ANDREW E.;REEL/FRAME:039473/0898

Effective date:20080618

ASAssignment

Owner name:MICROSOFT TECHNOLOGY LICENSING, LLC, WASHINGTON

Free format text:ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:MICROSOFT CORPORATION;REEL/FRAME:044850/0237

Effective date:20170905

STCBInformation on status: application discontinuation

Free format text:ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION


[8]ページ先頭

©2009-2025 Movatter.jp