Movatterモバイル変換


[0]ホーム

URL:


US20090144276A1 - Computerized data mining system and program product - Google Patents

Computerized data mining system and program product
Download PDF

Info

Publication number
US20090144276A1
US20090144276A1US12/348,580US34858009AUS2009144276A1US 20090144276 A1US20090144276 A1US 20090144276A1US 34858009 AUS34858009 AUS 34858009AUS 2009144276 A1US2009144276 A1US 2009144276A1
Authority
US
United States
Prior art keywords
data
model
data mining
customized
existing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US12/348,580
Inventor
Feng-Wei Chen Russell
Ameet M. Kini
Marcelo Cunha Loureiro
John A. Medicke, Jr.
Betsy M. Plunket
Ashish Sureka
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by IndividualfiledCriticalIndividual
Priority to US12/348,580priorityCriticalpatent/US20090144276A1/en
Publication of US20090144276A1publicationCriticalpatent/US20090144276A1/en
Abandonedlegal-statusCriticalCurrent

Links

Images

Classifications

Definitions

Landscapes

Abstract

Under the present invention, a data exploration system, a customized model system and an existing model system are provided. The data exploration system analyzes user data to identify statistical information such as data distribution, data relationships, data outliners and invalid or missing data values. The customized model center iteratively generates customized data mining models in parallel based on permutations of the user data, user-provided business parameters and/or a set of model generation algorithms. The existing model system provides users with a library of existing data mining models, assembled based on the business parameters, from which they can choose one or more. In any event, any customized or existing data mining models selected can be run against the user data in parallel.

Description

Claims (11)

1. A computerized data mining system, comprising:
a central processing unit;
a memory operably associated with the central processing unit; and
a data mining system storable in the memory and executable by the central processing unit, the data mining system comprising:
a data exploration system for receiving and analyzing user data to provide statistical information about the user data, wherein the statistical information comprises data relationships, data outliners, invalid data values and standard deviations;
a customized model system for generating and ranking customized data mining models, and for executing a selected customized data mining model on the user data, wherein the customized data mining models are generated using multiple iterations based on permutations of at least one of the user data, business parameters and a set of model generation algorithms, wherein the business parameters comprise a business taxonomy and a set of model goals, wherein the customized model system comprises:
a model generation system for generating the customized data mining models in parallel using multiple iterations based on the permutations of at least one of the user data, the business parameters and the set of model generation algorithms;
a model ranking system for ranking the customized data mining models based on the business parameters, for identifying a predetermined quantity of the ranked customized data mining models, and for providing comparative data corresponding to the predetermined quantity of the ranked customized data mining models;
a customized model selection system for selecting at least one customized mining model from the predetermined quantity; and
a customized model execution system for executing the selected at least one customized data mining model on the user data; and
an existing model system for selecting at least one existing data mining model from a library of existing data mining models, and for executing the selected at least one existing data mining model in parallel on the user data, and outputting a result of the executing of the selected at least one customized data mining model to a user.
7. A computer-readable storage medium storing computer instructions, which when executed, enables a computer system to mine data, the computer instructions comprising:
receiving and analyzing user data to provide statistical information about the user data, wherein the statistical information comprises data relationships, data outliners, invalid data values and standard deviations;
generating and ranking customized data mining models, and executing a selected customized data mining model on the user data, wherein the customized data mining models are generated using multiple iterations based on permutations of at least one of the user data, business parameters and a set of model generation algorithms, wherein the generating and ranking comprises:
generating the customized data mining models in parallel using multiple iterations based on the permutations of at least one of the user data, the business parameters and the set of model generation algorithms, wherein the business parameters comprise a business taxonomy and a set of model goals;
ranking the customized data mining models based on the business parameters, identifying a predetermined quantity of the ranked customized data mining models, and providing comparative data corresponding to the predetermined quantity of the ranked customized data mining models;
selecting at least one customized mining model from the predetermined_quantity;
US12/348,5802003-11-242009-01-05Computerized data mining system and program productAbandonedUS20090144276A1 (en)

Priority Applications (1)

Application NumberPriority DateFiling DateTitle
US12/348,580US20090144276A1 (en)2003-11-242009-01-05Computerized data mining system and program product

Applications Claiming Priority (2)

Application NumberPriority DateFiling DateTitle
US10/720,792US7523106B2 (en)2003-11-242003-11-24Computerized data mining system, method and program product
US12/348,580US20090144276A1 (en)2003-11-242009-01-05Computerized data mining system and program product

Related Parent Applications (1)

Application NumberTitlePriority DateFiling Date
US10/720,792ContinuationUS7523106B2 (en)2003-11-242003-11-24Computerized data mining system, method and program product

Publications (1)

Publication NumberPublication Date
US20090144276A1true US20090144276A1 (en)2009-06-04

Family

ID=34591635

Family Applications (2)

Application NumberTitlePriority DateFiling Date
US10/720,792Expired - LifetimeUS7523106B2 (en)2003-11-242003-11-24Computerized data mining system, method and program product
US12/348,580AbandonedUS20090144276A1 (en)2003-11-242009-01-05Computerized data mining system and program product

Family Applications Before (1)

Application NumberTitlePriority DateFiling Date
US10/720,792Expired - LifetimeUS7523106B2 (en)2003-11-242003-11-24Computerized data mining system, method and program product

Country Status (1)

CountryLink
US (2)US7523106B2 (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US20110010374A1 (en)*2008-06-262011-01-13Alibaba Group Holding LimitedFiltering Information Using Targeted Filtering Schemes
US20110145286A1 (en)*2009-12-152011-06-16Chalklabs, LlcDistributed platform for network analysis
US20110153664A1 (en)*2009-12-222011-06-23International Business Machines CorporationSelective Storing of Mining Models for Enabling Interactive Data Mining
US20120084251A1 (en)*2010-10-052012-04-05International Business Machines CorporationProbabilistic data mining model comparison
CN109325071A (en)*2018-10-312019-02-12福建南威软件有限公司A method of reference template realizes fast large according to mining analysis
CN110245174A (en)*2019-06-132019-09-17浙江华坤道威数据科技有限公司A kind of enterprise customization DMP system and its application method

Families Citing this family (23)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US20050102303A1 (en)*2003-11-122005-05-12International Business Machines CorporationComputer-implemented method, system and program product for mapping a user data schema to a mining model schema
US7509337B2 (en)*2005-07-052009-03-24International Business Machines CorporationSystem and method for selecting parameters for data mining modeling algorithms in data mining applications
US7512626B2 (en)*2005-07-052009-03-31International Business Machines CorporationSystem and method for selecting a data mining modeling algorithm for data mining applications
EP1941432A4 (en)*2005-10-252011-04-20Angoss Software Corp STRATEGY ARBORESCENCES FOR DATA EXPLORATION
US7565335B2 (en)*2006-03-152009-07-21Microsoft CorporationTransform for outlier detection in extract, transfer, load environment
US7636698B2 (en)*2006-03-162009-12-22Microsoft CorporationAnalyzing mining pattern evolutions by comparing labels, algorithms, or data patterns chosen by a reasoning component
US20070220034A1 (en)*2006-03-162007-09-20Microsoft CorporationAutomatic training of data mining models
US7730024B2 (en)*2006-03-202010-06-01Microsoft CorporationDistributed data mining using analysis services servers
US7801836B2 (en)*2006-09-272010-09-21Infosys Technologies Ltd.Automated predictive data mining model selection using a genetic algorithm
US8180713B1 (en)2007-04-132012-05-15Standard & Poor's Financial Services LlcSystem and method for searching and identifying potential financial risks disclosed within a document
US10127299B2 (en)*2009-09-142018-11-13International Business Machines CorporationAnalytics information directories within a comprehensive framework for composing and executing analytics applications in business level languages
US10242406B2 (en)*2009-09-142019-03-26International Business Machines CorporationAnalytics integration workbench within a comprehensive framework for composing and executing analytics applications in business level languages
US8401993B2 (en)*2009-09-142013-03-19International Business Machines CorporationAnalytics integration server within a comprehensive framework for composing and executing analytics applications in business level languages
US8762299B1 (en)2011-06-272014-06-24Google Inc.Customized predictive analytical model training
CN103514240B (en)*2012-11-292016-10-12Tcl集团股份有限公司A kind of kinsfolk's relation excavation method and system based on remote controller
CN103853848A (en)*2014-03-272014-06-11华为技术有限公司Method and device for establishing social monitoring subnetwork
GB2549314A (en)2016-04-142017-10-18Ge Aviation Systems LlcSystems and methods for providing data exploration techniques
US20180101529A1 (en)*2016-10-102018-04-12Proekspert ASData science versioning and intelligence systems and methods
CN106547849B (en)*2016-10-182019-11-26华南师范大学A kind of construction method for the multi-tenant database meeting tenant's differentiated demand
CN107766424B (en)*2017-09-132020-09-15深圳市宇数科技有限公司Data exploration management method and system, electronic equipment and storage medium
US11354669B2 (en)*2018-06-282022-06-07International Business Machines CorporationCollaborative analytics for fraud detection through a shared public ledger
CA3105486C (en)2018-07-092023-09-26Rutgers, The State University Of New JerseyData exploration as search over automated pre-generated plot objects
US12346749B2 (en)2022-06-092025-07-01Sap SeAdaptive application server request balancing

Citations (22)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US5526293A (en)*1993-12-171996-06-11Texas Instruments Inc.System and method for controlling semiconductor wafer processing
US5526281A (en)*1993-05-211996-06-11Arris Pharmaceutical CorporationMachine-learning approach to modeling biological activity for molecular design and to modeling other characteristics
US5621652A (en)*1995-03-211997-04-15Vlsi Technology, Inc.System and method for verifying process models in integrated circuit process simulators
US5680590A (en)*1990-09-211997-10-21Parti; MichaelSimulation system and method of using same
US5692107A (en)*1994-03-151997-11-25Lockheed Missiles & Space Company, Inc.Method for generating predictive models in a computer system
US5875284A (en)*1990-03-121999-02-23Fujitsu LimitedNeuro-fuzzy-integrated data processing system
US6094654A (en)*1996-12-062000-07-25International Business Machines CorporationData management system for file and database management
US6185549B1 (en)*1998-04-292001-02-06Lucent Technologies Inc.Method for mining association rules in data
US6240411B1 (en)*1998-06-152001-05-29Exchange Applications, Inc.Integrating campaign management and data mining
US6393387B1 (en)*1998-03-062002-05-21Perot Systems CorporationSystem and method for model mining complex information technology systems
US20020099581A1 (en)*2001-01-222002-07-25Chu Chengwen RobertComputer-implemented dimension engine
US20020127529A1 (en)*2000-12-062002-09-12Cassuto Nadav YehudahPrediction model creation, evaluation, and training
US6519602B2 (en)*1999-11-152003-02-11International Business Machine CorporationSystem and method for the automatic construction of generalization-specialization hierarchy of terms from a database of terms and associated meanings
US6532412B2 (en)*2000-11-022003-03-11General Electric Co.Apparatus for monitoring gas turbine engine operation
US6539300B2 (en)*2001-07-102003-03-25Makor Issues And Rights Ltd.Method for regional system wide optimal signal timing for traffic control based on wireless phone networks
US20030059837A1 (en)*2000-01-072003-03-27Levinson Douglas A.Method and system for planning, performing, and assessing high-throughput screening of multicomponent chemical compositions and solid forms of compounds
US6553366B1 (en)*1998-10-022003-04-22Ncr CorporationAnalytic logical data model
US20030212691A1 (en)*2002-05-102003-11-13Pavani KuntalaData mining model building using attribute importance
US6677963B1 (en)*1999-11-162004-01-13Verizon Laboratories Inc.Computer-executable method for improving understanding of business data by interactive rule manipulation
US6687696B2 (en)*2000-07-262004-02-03Recommind Inc.System and method for personalized search, information filtering, and for generating recommendations utilizing statistical latent class models
US6920458B1 (en)*2000-09-222005-07-19Sas Institute Inc.Model repository
US7117480B2 (en)*2001-11-272006-10-033M Innovative Properties CompanyReusable software components for invoking computational models

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
SU903873A1 (en)1980-06-051982-02-07Предприятие П/Я Г-4934Generator of random numbers for simulating general population by objects of a sample
JP2826138B2 (en)1988-11-121998-11-18株式会社豊田中央研究所 Mobile body interference check device
JPH1065159A (en)1996-08-221998-03-06Sharp Corp Model parameter optimization device for circuit simulation

Patent Citations (24)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US6456989B1 (en)*1990-03-122002-09-24Fujitsu LimitedNeuro-fuzzy-integrated data processing system
US5875284A (en)*1990-03-121999-02-23Fujitsu LimitedNeuro-fuzzy-integrated data processing system
US5680590A (en)*1990-09-211997-10-21Parti; MichaelSimulation system and method of using same
US5526281A (en)*1993-05-211996-06-11Arris Pharmaceutical CorporationMachine-learning approach to modeling biological activity for molecular design and to modeling other characteristics
US5526293A (en)*1993-12-171996-06-11Texas Instruments Inc.System and method for controlling semiconductor wafer processing
US5692107A (en)*1994-03-151997-11-25Lockheed Missiles & Space Company, Inc.Method for generating predictive models in a computer system
US5621652A (en)*1995-03-211997-04-15Vlsi Technology, Inc.System and method for verifying process models in integrated circuit process simulators
US6094654A (en)*1996-12-062000-07-25International Business Machines CorporationData management system for file and database management
US6393387B1 (en)*1998-03-062002-05-21Perot Systems CorporationSystem and method for model mining complex information technology systems
US6185549B1 (en)*1998-04-292001-02-06Lucent Technologies Inc.Method for mining association rules in data
US6240411B1 (en)*1998-06-152001-05-29Exchange Applications, Inc.Integrating campaign management and data mining
US6826556B1 (en)*1998-10-022004-11-30Ncr CorporationTechniques for deploying analytic models in a parallel
US6553366B1 (en)*1998-10-022003-04-22Ncr CorporationAnalytic logical data model
US6519602B2 (en)*1999-11-152003-02-11International Business Machine CorporationSystem and method for the automatic construction of generalization-specialization hierarchy of terms from a database of terms and associated meanings
US6677963B1 (en)*1999-11-162004-01-13Verizon Laboratories Inc.Computer-executable method for improving understanding of business data by interactive rule manipulation
US20030059837A1 (en)*2000-01-072003-03-27Levinson Douglas A.Method and system for planning, performing, and assessing high-throughput screening of multicomponent chemical compositions and solid forms of compounds
US6687696B2 (en)*2000-07-262004-02-03Recommind Inc.System and method for personalized search, information filtering, and for generating recommendations utilizing statistical latent class models
US6920458B1 (en)*2000-09-222005-07-19Sas Institute Inc.Model repository
US6532412B2 (en)*2000-11-022003-03-11General Electric Co.Apparatus for monitoring gas turbine engine operation
US20020127529A1 (en)*2000-12-062002-09-12Cassuto Nadav YehudahPrediction model creation, evaluation, and training
US20020099581A1 (en)*2001-01-222002-07-25Chu Chengwen RobertComputer-implemented dimension engine
US6539300B2 (en)*2001-07-102003-03-25Makor Issues And Rights Ltd.Method for regional system wide optimal signal timing for traffic control based on wireless phone networks
US7117480B2 (en)*2001-11-272006-10-033M Innovative Properties CompanyReusable software components for invoking computational models
US20030212691A1 (en)*2002-05-102003-11-13Pavani KuntalaData mining model building using attribute importance

Cited By (14)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US8725746B2 (en)*2008-06-262014-05-13Alibaba Group Holding LimitedFiltering information using targeted filtering schemes
US9201953B2 (en)2008-06-262015-12-01Alibaba Group Holding LimitedFiltering information using targeted filtering schemes
US20110010374A1 (en)*2008-06-262011-01-13Alibaba Group Holding LimitedFiltering Information Using Targeted Filtering Schemes
US8972443B2 (en)2009-12-152015-03-03Chalklabs, LlcDistributed platform for network analysis
US8352495B2 (en)2009-12-152013-01-08Chalklabs, LlcDistributed platform for network analysis
WO2011081909A3 (en)*2009-12-152011-09-22Chalklabs, LlcDistributed platform for network analysis
US20110145286A1 (en)*2009-12-152011-06-16Chalklabs, LlcDistributed platform for network analysis
US8380740B2 (en)2009-12-222013-02-19International Business Machines CorporationSelective storing of mining models for enabling interactive data mining
US8538988B2 (en)2009-12-222013-09-17International Business Machines CorporationSelective storing of mining models for enabling interactive data mining
US20110153664A1 (en)*2009-12-222011-06-23International Business Machines CorporationSelective Storing of Mining Models for Enabling Interactive Data Mining
US20120084251A1 (en)*2010-10-052012-04-05International Business Machines CorporationProbabilistic data mining model comparison
US8990145B2 (en)*2010-10-052015-03-24International Business Machines CorporationProbabilistic data mining model comparison
CN109325071A (en)*2018-10-312019-02-12福建南威软件有限公司A method of reference template realizes fast large according to mining analysis
CN110245174A (en)*2019-06-132019-09-17浙江华坤道威数据科技有限公司A kind of enterprise customization DMP system and its application method

Also Published As

Publication numberPublication date
US20050114360A1 (en)2005-05-26
US7523106B2 (en)2009-04-21

Similar Documents

PublicationPublication DateTitle
US7523106B2 (en)Computerized data mining system, method and program product
US9824472B2 (en)Determining alternative visualizations for data based on an initial data visualization
Saied et al.Mining multi-level API usage patterns
US8356278B2 (en)Method, system and program product for detecting deviation from software development best practice resource in a code sharing system
US7945583B2 (en)Technique for data mining using a web service
US7349919B2 (en)Computerized method, system and program product for generating a data mining model
CN113076104A (en)Page generation method, device, equipment and storage medium
US20170269971A1 (en)Migrating enterprise workflows for processing on a crowdsourcing platform
Wagner et al.Problem characterization and abstraction for visual analytics in behavior-based malware pattern analysis
US11379772B2 (en)Systems and methods for analyzing computer input to provide suggested next action for automation
US11809310B2 (en)Homomorphic encryption-based testing computing system
CN111949306A (en)Pushing method and system supporting fragmented learning of open-source project
US20180129325A1 (en)Credit Navigation System and Method
US7937311B1 (en)Apparatuses, methods, and systems for exchange fund transactions
CN116307503B (en)Method for constructing domain model flow
US20240111577A1 (en)System and method for determining critical sequences of actions causing predetermined events during application operations
CN113077169A (en)Method, system, equipment and medium for recommending customer service personnel based on label
US11042536B1 (en)Systems and methods for automated data visualization
Parulian et al.Segmentation of Libraries, CMS, and PHP Frameworks Based on Code Characteristics: Implementation of Clustering Using K-Means
CN115438151B (en)Method, device, equipment and medium for determining standard clause
US11886468B2 (en)Fingerprint-based data classification
US12443908B2 (en)Data distillery for signal detection
US20210357809A1 (en)Model improvement system and model improvement method
KarstenImproving usability at BetterBe through API analysis
CN119271777A (en) Intelligent reply method, device, equipment and medium based on natural language processing

Legal Events

DateCodeTitleDescription
STCBInformation on status: application discontinuation

Free format text:ABANDONED -- FAILURE TO PAY ISSUE FEE


[8]ページ先頭

©2009-2025 Movatter.jp