









| labeling | a form of data processing where documents are analyzed, |
| and the analysis results (referred to as “labels”) are made | |
| available for later processing stages. For example, a | |
| topical analysis of documents is a labeling of the | |
| documents by subject. | |
| retrieving | a form of data mining where a subset of a document |
| corpus is returned in response to a query. Preferably, the | |
| documents are each given a rank pertaining to their | |
| relevance to the query, and are sorted by decreasing | |
| relevance. | |
| categorizing | a form of data mining where several “categories” are |
| defined, and the documents of a corpus are labeled | |
| according the category to which they fit best. A common | |
| variation is multilabel categorizing, where each document | |
| may fit zero or more categories. Preferably, information | |
| is given regarding the quality of the fit. | |
| clustering | a form of data mining similar to categorization, |
| with the difference that the “categories” are not predefined, | |
| and the data mining must reveal them automatically. | |
| classifying | a process performed on a stream of incoming |
| documents, where each is labeled and then forwarded | |
| for relevant additional processing (manual or automatic) | |
| based on the labels that have been discovered. | |
| filtering | a process performed on a stream of incoming |
| documents, where each is labeled and then forwarded or | |
| discarded based on the labels that have been discovered. | |
| salient terms | terms whose appearance in a document |
| provides information relevant to its correct labeling, and | |
| consequently to all forms of data mining subsequent | |
| to labeling. | |
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US10/771,315US20040163035A1 (en) | 2003-02-05 | 2004-02-05 | Method for automatic and semi-automatic classification and clustering of non-deterministic texts |
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US44498203P | 2003-02-05 | 2003-02-05 | |
| US10/771,315US20040163035A1 (en) | 2003-02-05 | 2004-02-05 | Method for automatic and semi-automatic classification and clustering of non-deterministic texts |
| Publication Number | Publication Date |
|---|---|
| US20040163035A1true US20040163035A1 (en) | 2004-08-19 |
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US10/771,409Active2027-03-11US7792671B2 (en) | 2003-02-05 | 2004-02-05 | Augmentation and calibration of output from non-deterministic text generators by modeling its characteristics in specific environments |
| US10/771,315AbandonedUS20040163035A1 (en) | 2003-02-05 | 2004-02-05 | Method for automatic and semi-automatic classification and clustering of non-deterministic texts |
| US12/059,660AbandonedUS20080183468A1 (en) | 2003-02-05 | 2008-03-31 | Augmentation and calibration of output from non-deterministic text generators by modeling its characteristics in specific environments |
| US12/876,207Expired - LifetimeUS8195459B1 (en) | 2003-02-05 | 2010-09-06 | Augmentation and calibration of output from non-deterministic text generators by modeling its characteristics in specific environments |
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US10/771,409Active2027-03-11US7792671B2 (en) | 2003-02-05 | 2004-02-05 | Augmentation and calibration of output from non-deterministic text generators by modeling its characteristics in specific environments |
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US12/059,660AbandonedUS20080183468A1 (en) | 2003-02-05 | 2008-03-31 | Augmentation and calibration of output from non-deterministic text generators by modeling its characteristics in specific environments |
| US12/876,207Expired - LifetimeUS8195459B1 (en) | 2003-02-05 | 2010-09-06 | Augmentation and calibration of output from non-deterministic text generators by modeling its characteristics in specific environments |
| Country | Link |
|---|---|
| US (4) | US7792671B2 (en) |
| EP (2) | EP1590798A2 (en) |
| IL (1) | IL170065A (en) |
| WO (2) | WO2004072780A2 (en) |
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20060067578A1 (en)* | 2004-09-30 | 2006-03-30 | Fuji Xerox Co., Ltd. | Slide contents processor, slide contents processing method, and storage medium storing program |
| US20070050445A1 (en)* | 2005-08-31 | 2007-03-01 | Hugh Hyndman | Internet content analysis |
| US20080027888A1 (en)* | 2006-07-31 | 2008-01-31 | Microsoft Corporation | Optimization of fact extraction using a multi-stage approach |
| US20090012970A1 (en)* | 2007-07-02 | 2009-01-08 | Dror Daniel Ziv | Root cause analysis using interactive data categorization |
| US20090249253A1 (en)* | 2008-03-31 | 2009-10-01 | Palm, Inc. | Displaying mnemonic abbreviations for commands |
| US20090248647A1 (en)* | 2008-03-25 | 2009-10-01 | Omer Ziv | System and method for the quality assessment of queries |
| US20110072052A1 (en)* | 2008-05-28 | 2011-03-24 | Aptima Inc. | Systems and methods for analyzing entity profiles |
| US8725732B1 (en)* | 2009-03-13 | 2014-05-13 | Google Inc. | Classifying text into hierarchical categories |
| US20210312123A1 (en)* | 2020-04-03 | 2021-10-07 | Jon Ward | Systems and Methods For Cloud-Based Productivity Tools |
| US11244011B2 (en)* | 2015-10-23 | 2022-02-08 | International Business Machines Corporation | Ingestion planning for complex tables |
| US20230028717A1 (en)* | 2020-08-27 | 2023-01-26 | Capital One Services, Llc | Representing Confidence in Natural Language Processing |
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP1590798A2 (en)* | 2003-02-05 | 2005-11-02 | Verint Systems Inc. | Method for automatic and semi-automatic classification and clustering of non-deterministic texts |
| US7856355B2 (en)* | 2005-07-05 | 2010-12-21 | Alcatel-Lucent Usa Inc. | Speech quality assessment method and system |
| US20070078806A1 (en)* | 2005-10-05 | 2007-04-05 | Hinickle Judith A | Method and apparatus for evaluating the accuracy of transcribed documents and other documents |
| US20100027768A1 (en)* | 2006-11-03 | 2010-02-04 | Foskett James J | Aviation text and voice communication system |
| US8126891B2 (en)* | 2008-10-21 | 2012-02-28 | Microsoft Corporation | Future data event prediction using a generative model |
| US8379801B2 (en)* | 2009-11-24 | 2013-02-19 | Sorenson Communications, Inc. | Methods and systems related to text caption error correction |
| US9070360B2 (en)* | 2009-12-10 | 2015-06-30 | Microsoft Technology Licensing, Llc | Confidence calibration in automatic speech recognition systems |
| US8930189B2 (en) | 2011-10-28 | 2015-01-06 | Microsoft Corporation | Distributed user input to text generated by a speech to text transcription service |
| US9870520B1 (en)* | 2013-08-02 | 2018-01-16 | Intuit Inc. | Iterative process for optimizing optical character recognition |
| FR3010809B1 (en)* | 2013-09-18 | 2017-05-19 | Airbus Operations Sas | METHOD AND DEVICE FOR AUTOMATIC MANAGEMENT ON BOARD AN AIRCRAFT AUDIO MESSAGE AIRCRAFT. |
| US11481087B2 (en)* | 2014-03-27 | 2022-10-25 | Sony Corporation | Electronic device and method for identifying input commands of a user |
| US9858923B2 (en)* | 2015-09-24 | 2018-01-02 | Intel Corporation | Dynamic adaptation of language models and semantic tracking for automatic speech recognition |
| CN108777141B (en)* | 2018-05-31 | 2022-01-25 | 康键信息技术(深圳)有限公司 | Test apparatus, test method, and storage medium |
| CN110110303A (en)* | 2019-03-28 | 2019-08-09 | 苏州八叉树智能科技有限公司 | Newsletter archive generation method, device, electronic equipment and computer-readable medium |
| US12001206B2 (en) | 2020-01-16 | 2024-06-04 | Honeywell International Inc. | Methods and systems for remote operation of vehicles using hands-free functionality |
| CN111581455B (en)* | 2020-04-28 | 2023-03-21 | 北京字节跳动网络技术有限公司 | Text generation model generation method and device and electronic equipment |
| CN114637829B (en)* | 2022-02-21 | 2024-09-24 | 阿里巴巴(中国)有限公司 | Recorded text processing method, device and computer readable storage medium |
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5625748A (en)* | 1994-04-18 | 1997-04-29 | Bbn Corporation | Topic discriminator using posterior probability or confidence scores |
| US6397181B1 (en)* | 1999-01-27 | 2002-05-28 | Kent Ridge Digital Labs | Method and apparatus for voice annotation and retrieval of multimedia data |
| US20020178002A1 (en)* | 2001-05-24 | 2002-11-28 | International Business Machines Corporation | System and method for searching, analyzing and displaying text transcripts of speech after imperfect speech recognition |
| US6598054B2 (en)* | 1999-01-26 | 2003-07-22 | Xerox Corporation | System and method for clustering data objects in a collection |
| US20040083101A1 (en)* | 2002-10-23 | 2004-04-29 | International Business Machines Corporation | System and method for data mining of contextual conversations |
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5550930A (en)* | 1991-06-17 | 1996-08-27 | Microsoft Corporation | Method and system for training a handwriting recognizer at the time of misrecognition |
| GB9709341D0 (en)* | 1997-05-08 | 1997-06-25 | British Broadcasting Corp | Method of and apparatus for editing audio or audio-visual recordings |
| AU2001245927A1 (en)* | 2000-03-24 | 2001-10-08 | Dragon Systems, Inc. | Lexical analysis of telephone conversations with call center agents |
| US6839667B2 (en)* | 2001-05-16 | 2005-01-04 | International Business Machines Corporation | Method of speech recognition by presenting N-best word candidates |
| US6963834B2 (en) | 2001-05-29 | 2005-11-08 | International Business Machines Corporation | Method of speech recognition using empirically determined word candidates |
| EP1590798A2 (en)* | 2003-02-05 | 2005-11-02 | Verint Systems Inc. | Method for automatic and semi-automatic classification and clustering of non-deterministic texts |
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5625748A (en)* | 1994-04-18 | 1997-04-29 | Bbn Corporation | Topic discriminator using posterior probability or confidence scores |
| US6598054B2 (en)* | 1999-01-26 | 2003-07-22 | Xerox Corporation | System and method for clustering data objects in a collection |
| US6397181B1 (en)* | 1999-01-27 | 2002-05-28 | Kent Ridge Digital Labs | Method and apparatus for voice annotation and retrieval of multimedia data |
| US20020178002A1 (en)* | 2001-05-24 | 2002-11-28 | International Business Machines Corporation | System and method for searching, analyzing and displaying text transcripts of speech after imperfect speech recognition |
| US20040083101A1 (en)* | 2002-10-23 | 2004-04-29 | International Business Machines Corporation | System and method for data mining of contextual conversations |
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7698645B2 (en)* | 2004-09-30 | 2010-04-13 | Fuji Xerox Co., Ltd. | Presentation slide contents processor for categorizing presentation slides and method for processing and categorizing slide contents |
| US20060067578A1 (en)* | 2004-09-30 | 2006-03-30 | Fuji Xerox Co., Ltd. | Slide contents processor, slide contents processing method, and storage medium storing program |
| US20070050445A1 (en)* | 2005-08-31 | 2007-03-01 | Hugh Hyndman | Internet content analysis |
| US20080027888A1 (en)* | 2006-07-31 | 2008-01-31 | Microsoft Corporation | Optimization of fact extraction using a multi-stage approach |
| US7668791B2 (en) | 2006-07-31 | 2010-02-23 | Microsoft Corporation | Distinguishing facts from opinions using a multi-stage approach |
| US20090012970A1 (en)* | 2007-07-02 | 2009-01-08 | Dror Daniel Ziv | Root cause analysis using interactive data categorization |
| US9015194B2 (en)* | 2007-07-02 | 2015-04-21 | Verint Systems Inc. | Root cause analysis using interactive data categorization |
| US20090248647A1 (en)* | 2008-03-25 | 2009-10-01 | Omer Ziv | System and method for the quality assessment of queries |
| US20090249253A1 (en)* | 2008-03-31 | 2009-10-01 | Palm, Inc. | Displaying mnemonic abbreviations for commands |
| US9053088B2 (en)* | 2008-03-31 | 2015-06-09 | Qualcomm Incorporated | Displaying mnemonic abbreviations for commands |
| US12216687B2 (en) | 2008-05-28 | 2025-02-04 | Aptima, Inc. | Systems and methods for analyzing entity profiles |
| US9123022B2 (en) | 2008-05-28 | 2015-09-01 | Aptima, Inc. | Systems and methods for analyzing entity profiles |
| US9594825B2 (en) | 2008-05-28 | 2017-03-14 | Aptima, Inc. | Systems and methods for analyzing entity profiles |
| US20110072052A1 (en)* | 2008-05-28 | 2011-03-24 | Aptima Inc. | Systems and methods for analyzing entity profiles |
| US11461373B2 (en) | 2008-05-28 | 2022-10-04 | Aptima, Inc. | Systems and methods for analyzing entity profiles |
| US8725732B1 (en)* | 2009-03-13 | 2014-05-13 | Google Inc. | Classifying text into hierarchical categories |
| US11244011B2 (en)* | 2015-10-23 | 2022-02-08 | International Business Machines Corporation | Ingestion planning for complex tables |
| US20210312123A1 (en)* | 2020-04-03 | 2021-10-07 | Jon Ward | Systems and Methods For Cloud-Based Productivity Tools |
| US11687710B2 (en)* | 2020-04-03 | 2023-06-27 | Braincat, Inc. | Systems and methods for cloud-based productivity tools |
| US11720753B2 (en)* | 2020-08-27 | 2023-08-08 | Capital One Services, Llc | Representing confidence in natural language processing |
| US20230028717A1 (en)* | 2020-08-27 | 2023-01-26 | Capital One Services, Llc | Representing Confidence in Natural Language Processing |
| Publication number | Publication date |
|---|---|
| EP1590796A1 (en) | 2005-11-02 |
| US20080183468A1 (en) | 2008-07-31 |
| WO2004072955A1 (en) | 2004-08-26 |
| IL170065A (en) | 2013-02-28 |
| WO2004072780A3 (en) | 2004-11-11 |
| WO2004072780A2 (en) | 2004-08-26 |
| US7792671B2 (en) | 2010-09-07 |
| US20040158469A1 (en) | 2004-08-12 |
| US8195459B1 (en) | 2012-06-05 |
| EP1590798A2 (en) | 2005-11-02 |
| Publication | Publication Date | Title |
|---|---|---|
| US20040163035A1 (en) | Method for automatic and semi-automatic classification and clustering of non-deterministic texts | |
| US10431214B2 (en) | System and method of determining a domain and/or an action related to a natural language input | |
| US7415409B2 (en) | Method to train the language model of a speech recognition system to convert and index voicemails on a search engine | |
| US11182435B2 (en) | Model generation device, text search device, model generation method, text search method, data structure, and program | |
| CN108197282B (en) | File data classification method and device, terminal, server and storage medium | |
| US7272558B1 (en) | Speech recognition training method for audio and video file indexing on a search engine | |
| CN101533401B (en) | Voice data retrieval system and voice data retrieval method | |
| US9229974B1 (en) | Classifying queries | |
| US8126897B2 (en) | Unified inverted index for video passage retrieval | |
| US20230214579A1 (en) | Intelligent character correction and search in documents | |
| US20100070263A1 (en) | Speech data retrieving web site system | |
| CN107748784B (en) | Method for realizing structured data search through natural language | |
| US20040249808A1 (en) | Query expansion using query logs | |
| CN109446376B (en) | A method and system for classifying speech by word segmentation | |
| JP2013521567A (en) | System including client computing device, method of tagging media objects, and method of searching a digital database including audio tagged media objects | |
| CN111881283B (en) | Business keyword library creation method, intelligent chat guiding method and device | |
| CN109508441B (en) | Method and device for realizing data statistical analysis through natural language and electronic equipment | |
| US20220058213A1 (en) | Systems and methods for identifying dynamic types in voice queries | |
| CN108710653A (en) | One kind, which is painted, originally reads aloud order method, apparatus and system | |
| CN113177061B (en) | Searching method and device and electronic equipment | |
| CN119128120B (en) | Consultation retrieval method and system based on demand label configuration | |
| JP3921837B2 (en) | Information discrimination support device, recording medium storing information discrimination support program, and information discrimination support method | |
| WO2006118360A1 (en) | Issue trend analysis system | |
| CN111090977A (en) | Intelligent writing system and intelligent writing method | |
| CN113722447B (en) | Voice search method based on multi-strategy matching |
| Date | Code | Title | Description |
|---|---|---|---|
| AS | Assignment | Owner name:VERINT SYSTEMS INC., NEW YORK Free format text:ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:ARIEL, ASSAF;BRAND, MICHAEL;HOROWITZ, ITSIK;AND OTHERS;REEL/FRAME:014967/0368;SIGNING DATES FROM 20040129 TO 20040202 | |
| AS | Assignment | Owner name:LEHMAN COMMERCIAL PAPER INC., AS ADMINISTRATIVE AG Free format text:SECURITY AGREEMENT;ASSIGNOR:VERINT SYSTEMS INC.;REEL/FRAME:019588/0613 Effective date:20070525 | |
| AS | Assignment | Owner name:CREDIT SUISSE AS ADMINISTRATIVE AGENT, NEW YORK Free format text:ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:VERINT SYSTEMS INC.;LEHMAN COMMERCIAL PAPER INC.;REEL/FRAME:022793/0888 Effective date:20090604 | |
| STCB | Information on status: application discontinuation | Free format text:ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION | |
| AS | Assignment | Owner name:VERINT AMERICAS INC., NEW YORK Free format text:RELEASE OF SECURITY INTEREST;ASSIGNOR:CREDIT SUISSE AG;REEL/FRAME:026206/0340 Effective date:20110429 Owner name:VERINT SYSTEMS INC., NEW YORK Free format text:RELEASE OF SECURITY INTEREST;ASSIGNOR:CREDIT SUISSE AG;REEL/FRAME:026206/0340 Effective date:20110429 Owner name:VERINT VIDEO SOLUTIONS INC., NEW YORK Free format text:RELEASE OF SECURITY INTEREST;ASSIGNOR:CREDIT SUISSE AG;REEL/FRAME:026206/0340 Effective date:20110429 |