US20110320189A1 - Systems and methods for filtering dictated and non-dictated sections of documents

Publication number
US20110320189A1
Authority
US
United States
Prior art keywords
dictated
documents
text
sections
document
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US13/228,617
Inventor
Alwin B. Carus
Larissa Lapshina
Bernardo Rechea
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nuance Communications Inc
Original Assignee
Dictaphone Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2006-02-27
Filing date
2011-09-09
Publication date
2011-12-29
Application filed by Dictaphone Corp
Priority to US13/228,617
Assigned to DICTAPHONE CORPORATION. Assignment of assignors interest (see document for details). Assignors: CARUS, ALWIN B., LAPSHINA, LARISSA, RECHEA, BERNARDO
Publication of US20110320189A1
Assigned to NUANCE COMMUNICATIONS, INC. Merger (see document for details). Assignors: DICTAPHONE CORPORATION
Assigned to NUANCE COMMUNICATIONS, INC. Assignment of assignors interest (see document for details). Assignors: DICTAPHONE CORPORATION
Legal status: Abandoned (current)

Abstract

A system and method for filtering documents to determine section boundaries between dictated and non-dictated text. The system and method identify portions of a text report that correspond to an original dictation and, correspondingly, those portions that are not part of the original dictation. The system and method include comparing tokenized and normalized forms of the original dictation and the final report, determining mismatches between the two forms, and applying machine-learning techniques to identify document headers, footers, page turns, macros, and lists automatically and accurately.
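
The comparison step in the abstract can be illustrated with a small sketch. The abstract does not say how tokenization, normalization, or alignment are performed, so everything below is an assumption made for illustration: the `normalize` helper, the use of Python's standard `difflib.SequenceMatcher` for alignment, and the sample texts are all hypothetical and are not taken from the patent.

```python
import difflib
import re


def normalize(text):
    """Lowercase the text and split it into alphanumeric tokens (assumed normalization)."""
    return re.findall(r"[a-z0-9]+", text.lower())


def mismatched_report_spans(dictation, report):
    """Return the report tokens and the token ranges that have no match in the dictation."""
    d_tokens = normalize(dictation)
    r_tokens = normalize(report)
    # Align the two token sequences; autojunk is disabled so frequent words are not ignored.
    matcher = difflib.SequenceMatcher(a=d_tokens, b=r_tokens, autojunk=False)
    spans = []
    for tag, _i1, _i2, j1, j2 in matcher.get_opcodes():
        # "insert" and "replace" mark report tokens that differ from the dictation.
        if tag in ("insert", "replace") and j2 > j1:
            spans.append((j1, j2))
    return r_tokens, spans


# Hypothetical sample data: a header and a signature line surround the dictated sentence.
dictation = "the patient is a 45 year old male with chest pain"
report = ("General Hospital Page 1 The patient is a 45-year-old male "
          "with chest pain. Signed electronically")

tokens, spans = mismatched_report_spans(dictation, report)
non_dictated = [" ".join(tokens[a:b]) for a, b in spans]
print(non_dictated)  # ['general hospital page 1', 'signed electronically']
```

In this toy case the mismatched spans coincide with the header and footer text. In the approach the abstract describes, such mismatches would feed a machine-learning stage rather than being treated as final section boundaries.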

Claims (16)

34. A system for filtering dictated and non-dictated sections of documents, the system comprising:
a central processing unit; and
a computer code operatively associated with the central processing unit, the computer code including instructions to cause the central processing unit to:
gather a first set of documents having dictated and non-dictated section boundaries;
featurize text in at least one document from the first set of documents;
differentiate dictated and non-dictated sections of text in the at least one document from the first set of documents;
categorize text of a second set of documents to identify dictated and non-dictated sections of text within at least one document from the second set of documents; and
output dictated sections of the at least one document from the second set of documents to an automatic speech recognition process.
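
The claimed steps (gather, featurize, differentiate, categorize, output) can be sketched as a small training and classification pipeline. The claim does not name the features, the classifier, or the unit of text, so the line-level granularity, the `featurize` features, the tiny naive Bayes model, and the sample documents below are all hypothetical choices made only to show the flow.

```python
import math
import re
from collections import Counter, defaultdict


def featurize(line):
    """Turn one line into a list of feature names (illustrative features only)."""
    feats = ["w=" + w for w in re.findall(r"[a-z]+", line.lower())]
    if line.isupper():
        feats.append("all_caps")
    if re.search(r"\d", line):
        feats.append("has_digit")
    if ":" in line:
        feats.append("has_colon")
    return feats


class NaiveBayes:
    """Small multinomial naive Bayes with add-one smoothing (an assumed model choice)."""

    def fit(self, samples):
        self.label_counts = Counter()
        self.feat_counts = defaultdict(Counter)
        self.vocab = set()
        for line, label in samples:
            self.label_counts[label] += 1
            for feat in featurize(line):
                self.feat_counts[label][feat] += 1
                self.vocab.add(feat)
        return self

    def predict(self, line):
        total = sum(self.label_counts.values())
        best_label, best_score = None, -math.inf
        for label, count in self.label_counts.items():
            score = math.log(count / total)
            denom = sum(self.feat_counts[label].values()) + len(self.vocab)
            for feat in featurize(line):
                score += math.log((self.feat_counts[label][feat] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Step 1: a first set of documents with known dictated / non-dictated boundaries.
first_set = [
    ("MEMORIAL HOSPITAL", "non-dictated"),
    ("Patient ID: 12345", "non-dictated"),
    ("Page 2 of 3", "non-dictated"),
    ("Electronically signed by: Dr. Smith", "non-dictated"),
    ("the patient presents with chest pain", "dictated"),
    ("she denies fever or cough", "dictated"),
    ("lungs are clear to auscultation", "dictated"),
    ("the patient was discharged home", "dictated"),
]

# Steps 2 and 3: featurize the text and learn to differentiate the two kinds of section.
model = NaiveBayes().fit(first_set)

# Step 4: categorize text from a second, unlabeled set of documents.
second_set = ["Page 1 of 3", "the patient denies chest pain", "Account ID: 99871"]

# Step 5: keep only dictated text, which would then be passed on to speech recognition.
dictated = [line for line in second_set if model.predict(line) == "dictated"]
print(dictated)  # ['the patient denies chest pain']
```

A real system would need far more training data and richer features than this toy example, and the hand-off to an automatic speech recognition process is represented here only by the resulting list.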
US13/228,617 | Priority date 2006-02-27 | Filing date 2011-09-09 | Systems and methods for filtering dictated and non-dictated sections of documents | Abandoned | US20110320189A1 (en)

Priority Applications (1)

Application Number | Priority Date | Filing Date | Title
US13/228,617 (US20110320189A1, en) | 2006-02-27 | 2011-09-09 | Systems and methods for filtering dictated and non-dictated sections of documents

Applications Claiming Priority (2)

Application Number | Priority Date | Filing Date | Title
US11/362,646 (US8036889B2, en) | 2006-02-27 | 2006-02-27 | Systems and methods for filtering dictated and non-dictated sections of documents
US13/228,617 (US20110320189A1, en) | 2006-02-27 | 2011-09-09 | Systems and methods for filtering dictated and non-dictated sections of documents

Related Parent Applications (1)

Application Number | Title | Priority Date | Filing Date
US11/362,646 (Continuation; US8036889B2, en) | Systems and methods for filtering dictated and non-dictated sections of documents | 2006-02-27 | 2006-02-27

Publications (1)

Publication Number | Publication Date
US20110320189A1 (en) | 2011-12-29

Family

ID=38445102

Family Applications (2)

Application Number | Title | Priority Date | Filing Date
US11/362,646 (Active, 2029-08-05; US8036889B2, en) | Systems and methods for filtering dictated and non-dictated sections of documents | 2006-02-27 | 2006-02-27
US13/228,617 (Abandoned; US20110320189A1, en) | Systems and methods for filtering dictated and non-dictated sections of documents | 2006-02-27 | 2011-09-09

Family Applications Before (1)

Application Number | Title | Priority Date | Filing Date
US11/362,646 (Active, 2029-08-05; US8036889B2, en) | Systems and methods for filtering dictated and non-dictated sections of documents | 2006-02-27 | 2006-02-27

Country Status (3)

Country | Link
US (2) | US8036889B2 (en)
EP (1) | EP1996912A4 (en)
WO (1) | WO2007101192A2 (en)

Cited By (16)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
US20100324895A1 (en)* | 2009-01-15 | 2010-12-23 | K-Nfb Reading Technology, Inc. | Synchronization for document narration
US20140040713A1 (en)* | 2012-08-02 | 2014-02-06 | Steven C. Dzik | Selecting content portions for alignment
US8903723B2 (en) | 2010-05-18 | 2014-12-02 | K-Nfb Reading Technology, Inc. | Audio synchronization for document narration with user-selected playback
US8914419B2 (en) | 2012-10-30 | 2014-12-16 | International Business Machines Corporation | Extracting semantic relationships from table structures in electronic documents
CN104516942A (en)* | 2013-09-26 | 2015-04-15 | 国际商业机器公司 | Concept driven automatic section identification
US9223830B1 (en) | 2012-10-26 | 2015-12-29 | Audible, Inc. | Content presentation analysis
US9280906B2 (en) | 2013-02-04 | 2016-03-08 | Audible. Inc. | Prompting a user for input during a synchronous presentation of audio content and textual content
US9286290B2 (en) | 2014-04-25 | 2016-03-15 | International Business Machines Corporation | Producing insight information from tables using natural language processing
US9317486B1 (en) | 2013-06-07 | 2016-04-19 | Audible, Inc. | Synchronizing playback of digital content with captured physical content
US9367196B1 (en) | 2012-09-26 | 2016-06-14 | Audible, Inc. | Conveying branched content
US9489360B2 (en) | 2013-09-05 | 2016-11-08 | Audible, Inc. | Identifying extra material in companion content
US9632647B1 (en) | 2012-10-09 | 2017-04-25 | Audible, Inc. | Selecting presentation positions in dynamic content
US9679608B2 (en) | 2012-06-28 | 2017-06-13 | Audible, Inc. | Pacing content
US9792027B2 (en) | 2011-03-23 | 2017-10-17 | Audible, Inc. | Managing playback of synchronized content
CN107357775A (en)* | 2017-06-05 | 2017-11-17 | 百度在线网络技术(北京)有限公司 | The text error correction method and device of Recognition with Recurrent Neural Network based on artificial intelligence
US10671812B2 (en)* | 2018-03-22 | 2020-06-02 | Equifax Inc. | Text classification using automatically generated seed data

Families Citing this family (15)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
US8165870B2 (en)* | 2005-02-10 | 2012-04-24 | Microsoft Corporation | Classification filter for processing data for creating a language model
WO2006088941A2 (en)* | 2005-02-14 | 2006-08-24 | Teresis Media Management, Inc. | Multipurpose media players
US8577683B2 (en) | 2008-08-15 | 2013-11-05 | Thomas Majchrowski & Associates, Inc. | Multipurpose media players
US8036889B2 (en)* | 2006-02-27 | 2011-10-11 | Nuance Communications, Inc. | Systems and methods for filtering dictated and non-dictated sections of documents
US7996768B2 (en)* | 2006-05-18 | 2011-08-09 | International Business Machines Corporation | Operations on document components filtered via text attributes
US7809170B2 (en)* | 2006-08-10 | 2010-10-05 | Louisiana Tech University Foundation, Inc. | Method and apparatus for choosing and evaluating sample size for biometric training process
US8386923B2 (en) | 2007-05-08 | 2013-02-26 | Canon Kabushiki Kaisha | Document generation apparatus, method, and storage medium
US20090216532A1 (en)* | 2007-09-26 | 2009-08-27 | Nuance Communications, Inc. | Automatic Extraction and Dissemination of Audio Impression
US9412372B2 (en)* | 2012-05-08 | 2016-08-09 | SpeakWrite, LLC | Method and system for audio-video integration
WO2015031449A1 (en)* | 2013-08-30 | 2015-03-05 | 3M Innovative Properties Company | Method of classifying medical documents
US9779087B2 (en)* | 2013-12-13 | 2017-10-03 | Google Inc. | Cross-lingual discriminative learning of sequence models with posterior regularization
US9953646B2 (en) | 2014-09-02 | 2018-04-24 | Belleau Technologies | Method and system for dynamic speech recognition and tracking of prewritten script
US11531874B2 (en)* | 2015-11-06 | 2022-12-20 | Google Llc | Regularizing machine learning models
US10832049B2 (en)* | 2018-05-31 | 2020-11-10 | International Business Machines Corporation | Electronic document classification system optimized for combining a plurality of contemporaneously scanned documents
US20250036878A1 (en)* | 2023-07-26 | 2025-01-30 | Micro Focus Llc | Augmented question and answer (q&a) with large language models

Citations (7)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
US4864502A (en)* | 1987-10-07 | 1989-09-05 | Houghton Mifflin Company | Sentence analyzer
US6067514A (en)* | 1998-06-23 | 2000-05-23 | International Business Machines Corporation | Method for automatically punctuating a speech utterance in a continuous speech recognition system
US6671670B2 (en)* | 2001-06-27 | 2003-12-30 | Telelogue, Inc. | System and method for pre-processing information used by an automated attendant
US20050108010A1 (en)* | 2003-10-01 | 2005-05-19 | Dictaphone Corporation | System and method for post processing speech recognition output
US7630892B2 (en)* | 2004-09-10 | 2009-12-08 | Microsoft Corporation | Method and apparatus for transducer-based text normalization and inverse text normalization
US8036889B2 (en)* | 2006-02-27 | 2011-10-11 | Nuance Communications, Inc. | Systems and methods for filtering dictated and non-dictated sections of documents
US8165870B2 (en)* | 2005-02-10 | 2012-04-24 | Microsoft Corporation | Classification filter for processing data for creating a language model

Family Cites Families (37)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
JP2986345B2 (en)* | 1993-10-18 | 1999-12-06 | インターナショナル・ビジネス・マシーンズ・コーポレイション | Voice recording indexing apparatus and method
US5680628A (en)* | 1995-07-19 | 1997-10-21 | Inso Corporation | Method and apparatus for automated search and retrieval process
US5794177A (en)* | 1995-07-19 | 1998-08-11 | Inso Corporation | Method and apparatus for morphological analysis and generation of natural language text
US5960447A (en)* | 1995-11-13 | 1999-09-28 | Holt; Douglas | Word tagging and editing system for speech recognition
US6031625A (en)* | 1996-06-14 | 2000-02-29 | Alysis Technologies, Inc. | System for data extraction from a print data stream
JP3597697B2 (en)* | 1998-03-20 | 2004-12-08 | 富士通株式会社 | Document summarizing apparatus and method
US6195637B1 (en)* | 1998-03-25 | 2001-02-27 | International Business Machines Corp. | Marking and deferring correction of misrecognition errors
US6064965A (en)* | 1998-09-02 | 2000-05-16 | International Business Machines Corporation | Combined audio playback in speech recognition proofreader
US6122614A (en)* | 1998-11-20 | 2000-09-19 | Custom Speech Usa, Inc. | System and method for automating transcription services
US6185524B1 (en)* | 1998-12-31 | 2001-02-06 | Lernout & Hauspie Speech Products N.V. | Method and apparatus for automatic identification of word boundaries in continuous text and computation of word boundary scores
US20030004724A1 (en)* | 1999-02-05 | 2003-01-02 | Jonathan Kahn | Speech recognition program mapping tool to align an audio file to verbatim text
US6611802B2 (en)* | 1999-06-11 | 2003-08-26 | International Business Machines Corporation | Method and system for proofreading and correcting dictated text
US6711585B1 (en)* | 1999-06-15 | 2004-03-23 | Kanisa Inc. | System and method for implementing a knowledge management system
US6704709B1 (en)* | 1999-07-28 | 2004-03-09 | Custom Speech Usa, Inc. | System and method for improving the accuracy of a speech recognition program
US20020152076A1 (en)* | 2000-11-28 | 2002-10-17 | Jonathan Kahn | System for permanent alignment of text utterances to their associated audio utterances
US7366979B2 (en)* | 2001-03-09 | 2008-04-29 | Copernicus Investments, Llc | Method and apparatus for annotating a document
US7120581B2 (en)* | 2001-05-31 | 2006-10-10 | Custom Speech Usa, Inc. | System and method for identifying an identical audio segment using text comparison
US7668718B2 (en)* | 2001-07-17 | 2010-02-23 | Custom Speech Usa, Inc. | Synchronized pattern recognition source data processed by manual or automatic means for creation of shared speaker-dependent speech user profile
US6708148B2 (en)* | 2001-10-12 | 2004-03-16 | Koninklijke Philips Electronics N.V. | Correction device to mark parts of a recognized text
US6928407B2 (en)* | 2002-03-29 | 2005-08-09 | International Business Machines Corporation | System and method for the automatic discovery of salient segments in speech transcripts
US7236931B2 (en)* | 2002-05-01 | 2007-06-26 | Usb Ag, Stamford Branch | Systems and methods for automatic acoustic speaker adaptation in computer-assisted transcription systems
EP1376999A1 (en)* | 2002-06-21 | 2004-01-02 | BRITISH TELECOMMUNICATIONS public limited company | Spoken alpha-numeric sequence entry system with repair mode
US20060190249A1 (en)* | 2002-06-26 | 2006-08-24 | Jonathan Kahn | Method for comparing a transcribed text file with a previously created file
US7516070B2 (en)* | 2003-02-19 | 2009-04-07 | Custom Speech Usa, Inc. | Method for simultaneously creating audio-aligned final and verbatim text with the assistance of a speech recognition program as may be useful in form completion using a verbal entry method
US7958443B2 (en)* | 2003-02-28 | 2011-06-07 | Dictaphone Corporation | System and method for structuring speech recognized text into a pre-selected document format
US20050144184A1 (en)* | 2003-10-01 | 2005-06-30 | Dictaphone Corporation | System and method for document section segmentation
US7818308B2 (en)* | 2003-10-01 | 2010-10-19 | Nuance Communications, Inc. | System and method for document section segmentation
JP4808160B2 (en)* | 2003-11-21 | 2011-11-02 | ニュアンス コミュニケーションズ オーストリア ゲーエムベーハー | Text segmentation and labeling using user interaction with topic-specific language model and topic-specific label statistics
JP2007512612A (en)* | 2003-11-28 | 2007-05-17 | コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ | Method and apparatus for transcribing audio signals
US7379946B2 (en)* | 2004-03-31 | 2008-05-27 | Dictaphone Corporation | Categorization of information using natural language processing and predefined templates
US20070074102A1 (en)* | 2005-09-29 | 2007-03-29 | Reiner Kraft | Automatically determining topical regions in a document
US7584103B2 (en)* | 2004-08-20 | 2009-09-01 | Multimodal Technologies, Inc. | Automated extraction of semantic content and generation of a structured document from speech
US8335688B2 (en)* | 2004-08-20 | 2012-12-18 | Multimodal Technologies, Llc | Document transcription system training
US7487138B2 (en)* | 2004-08-25 | 2009-02-03 | Symantec Operating Corporation | System and method for chunk-based indexing of file system content
US7937263B2 (en)* | 2004-12-01 | 2011-05-03 | Dictaphone Corporation | System and method for tokenization of text using classifier models
US7565282B2 (en)* | 2005-04-14 | 2009-07-21 | Dictaphone Corporation | System and method for adaptive automatic error correction
US8301448B2 (en)* | 2006-03-29 | 2012-10-30 | Nuance Communications, Inc. | System and method for applying dynamic contextual grammars and language models to improve automatic speech recognition accuracy

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
US4864502A (en)* | 1987-10-07 | 1989-09-05 | Houghton Mifflin Company | Sentence analyzer
US6067514A (en)* | 1998-06-23 | 2000-05-23 | International Business Machines Corporation | Method for automatically punctuating a speech utterance in a continuous speech recognition system
US6671670B2 (en)* | 2001-06-27 | 2003-12-30 | Telelogue, Inc. | System and method for pre-processing information used by an automated attendant
US20050108010A1 (en)* | 2003-10-01 | 2005-05-19 | Dictaphone Corporation | System and method for post processing speech recognition output
US7630892B2 (en)* | 2004-09-10 | 2009-12-08 | Microsoft Corporation | Method and apparatus for transducer-based text normalization and inverse text normalization
US8165870B2 (en)* | 2005-02-10 | 2012-04-24 | Microsoft Corporation | Classification filter for processing data for creating a language model
US8036889B2 (en)* | 2006-02-27 | 2011-10-11 | Nuance Communications, Inc. | Systems and methods for filtering dictated and non-dictated sections of documents

Cited By (20)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
US20100324895A1 (en)* | 2009-01-15 | 2010-12-23 | K-Nfb Reading Technology, Inc. | Synchronization for document narration
US9478219B2 (en) | 2010-05-18 | 2016-10-25 | K-Nfb Reading Technology, Inc. | Audio synchronization for document narration with user-selected playback
US8903723B2 (en) | 2010-05-18 | 2014-12-02 | K-Nfb Reading Technology, Inc. | Audio synchronization for document narration with user-selected playback
US9792027B2 (en) | 2011-03-23 | 2017-10-17 | Audible, Inc. | Managing playback of synchronized content
US9679608B2 (en) | 2012-06-28 | 2017-06-13 | Audible, Inc. | Pacing content
US9799336B2 (en) | 2012-08-02 | 2017-10-24 | Audible, Inc. | Identifying corresponding regions of content
US20140040713A1 (en)* | 2012-08-02 | 2014-02-06 | Steven C. Dzik | Selecting content portions for alignment
US10109278B2 (en)* | 2012-08-02 | 2018-10-23 | Audible, Inc. | Aligning body matter across content formats
US9367196B1 (en) | 2012-09-26 | 2016-06-14 | Audible, Inc. | Conveying branched content
US9632647B1 (en) | 2012-10-09 | 2017-04-25 | Audible, Inc. | Selecting presentation positions in dynamic content
US9223830B1 (en) | 2012-10-26 | 2015-12-29 | Audible, Inc. | Content presentation analysis
US8914419B2 (en) | 2012-10-30 | 2014-12-16 | International Business Machines Corporation | Extracting semantic relationships from table structures in electronic documents
US9280906B2 (en) | 2013-02-04 | 2016-03-08 | Audible. Inc. | Prompting a user for input during a synchronous presentation of audio content and textual content
US9317486B1 (en) | 2013-06-07 | 2016-04-19 | Audible, Inc. | Synchronizing playback of digital content with captured physical content
US9489360B2 (en) | 2013-09-05 | 2016-11-08 | Audible, Inc. | Identifying extra material in companion content
CN104516942A (en)* | 2013-09-26 | 2015-04-15 | 国际商业机器公司 | Concept driven automatic section identification
US9286290B2 (en) | 2014-04-25 | 2016-03-15 | International Business Machines Corporation | Producing insight information from tables using natural language processing
CN107357775A (en)* | 2017-06-05 | 2017-11-17 | 百度在线网络技术(北京)有限公司 | The text error correction method and device of Recognition with Recurrent Neural Network based on artificial intelligence
US11314921B2 (en)* | 2017-06-05 | 2022-04-26 | Baidu Online Network Technology (Beijing) Co., Ltd. | Text error correction method and apparatus based on recurrent neural network of artificial intelligence
US10671812B2 (en)* | 2018-03-22 | 2020-06-02 | Equifax Inc. | Text classification using automatically generated seed data

Also Published As

Publication number | Publication date
US8036889B2 (en) | 2011-10-11
EP1996912A2 (en) | 2008-12-03
WO2007101192A3 (en) | 2008-08-07
EP1996912A4 (en) | 2017-05-03
WO2007101192A2 (en) | 2007-09-07
US20070203707A1 (en) | 2007-08-30

Similar Documents

Publication | Publication Date | Title
US8036889B2 (en) | | Systems and methods for filtering dictated and non-dictated sections of documents
US8768694B2 (en) | | Verification of extracted data
US9552809B2 (en) | | Document transcription system training
US9520124B2 (en) | | Discriminative training of document transcription system
US8447588B2 (en) | | Region-matching transducers for natural language processing
US8266169B2 (en) | | Complex queries for corpus indexing and search
US7996227B2 (en) | | System and method for inserting a description of images into audio recordings
US20020099744A1 (en) | | Method and apparatus providing capitalization recovery for text
JP2007256714A (en) | | Caption correction apparatus
CN109766434B (en) | | Abstract generation method and device
US20230028897A1 (en) | | System and method for caption validation and sync error correction
CN102591852A (en) | | Automatic typesetting method and automatic typesetting system for patent images
CN117727412A (en) | | Noise filtering method and system for electronic medical record, electronic equipment and storage medium
CN120753649A (en) | | Emotion analysis method, medium and device

Legal Events

Date | Code | Title | Description
 | AS | Assignment | Owner name: DICTAPHONE CORPORATION, CONNECTICUT. Free format text: ASSIGNMENT OF ASSIGNORS INTEREST; ASSIGNORS: CARUS, ALWIN B.; LAPSHINA, LARISSA; RECHEA, BERNARDO; REEL/FRAME: 026892/0108. Effective date: 20060314
 | AS | Assignment | Owner name: NUANCE COMMUNICATIONS, INC., MASSACHUSETTS. Free format text: MERGER; ASSIGNOR: DICTAPHONE CORPORATION; REEL/FRAME: 028952/0397. Effective date: 20060207
 | AS | Assignment | Owner name: NUANCE COMMUNICATIONS, INC., MASSACHUSETTS. Free format text: ASSIGNMENT OF ASSIGNORS INTEREST; ASSIGNOR: DICTAPHONE CORPORATION; REEL/FRAME: 029596/0836. Effective date: 20121211
 | STCB | Information on status: application discontinuation | Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION

