WO2003090122A2 - Using neural networks for data mining - Google Patents

Using neural networks for data mining

Info

Publication number
WO2003090122A2
Authority
WO
WIPO (PCT)
Prior art keywords
neugent
training
data
request
consultation
Prior art date
Application number
PCT/US2003/011983
Other languages
French (fr)
Other versions
WO2003090122A8 (en)
Inventor
Qian Yang
Charles Edward Garofalo
Yogesh Gupta
Ronald Cass
Kirk Wilson
Igor Sedukhin
Original Assignee
Computer Associates Think, Inc.
Priority date
Filing date
Publication date
Application filed by Computer Associates Think, Inc. filed Critical Computer Associates Think, Inc.
Priority to KR10-2004-7016751A priority Critical patent/KR20040101532A/en
Priority to AU2003241302A priority patent/AU2003241302A1/en
Priority to JP2003586796A priority patent/JP2006511850A/en
Priority to BRPI0309388-3A priority patent/BR0309388A/en
Priority to EP03731033A priority patent/EP1520237A2/en
Priority to CA002480951A priority patent/CA2480951A1/en
Publication of WO2003090122A2 publication Critical patent/WO2003090122A2/en
Priority to IL16462304A priority patent/IL164623A0/en
Publication of WO2003090122A8 publication Critical patent/WO2003090122A8/en

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00 Administration; Management
    • G06Q10/10 Office automation; Time management
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F15/00 Digital computers in general; Data processing equipment in general
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/245 Query processing
    • G06F16/2458 Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2462 Approximate or statistical queries
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F7/00 Methods or arrangements for processing data by operating upon the order or content of the data handled
    • G06F7/06 Arrangements for sorting, selecting, merging, or comparing data on individual record carriers
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/045 Combinations of networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G06N3/088 Non-supervised learning, e.g. competitive learning
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00 Commerce
    • G06Q30/02 Marketing; Price estimation or determination; Fundraising

Definitions

  • This application relates to data mining.
  • the application relates to using neural nets and other artificial intelligence techniques for data mining.
  • the data mining system includes a client and a service broker configured to include an interface to receive a consultation request from the client.
  • the service broker forwards the consultation request to a Neugent to invoke a consultation of the Neugent. After the Neugent is consulted, the service broker forwards to the client a result object returned by the Neugent.
  • the service broker also may include a training interface, and receives through the training interface a training request from the client, the training request including training data.
  • the service broker forwards the training request including the training data to the Neugent to invoke training of the Neugent with the training data.
  • the training request may include a parameter specifying a ratio to split the training data between training the Neugent and testing or validating the Neugent.
  • the service broker may forward to the client a training result object returned by the Neugent after training of the Neugent.
  • the application also provides a method for providing to a remote client machine a service to consult a Neugent.
  • the method includes receiving a consultation request from the remote client machine, forwarding the consultation request to the Neugent to invoke a consultation of the Neugent, and forwarding to the remote client machine a result object returned by the Neugent.
  • the application also provides a method for providing to a remote client machine a service to train a Neugent.
  • the method includes receiving a train request from the remote client machine, forwarding the train request to the Neugent to invoke training of the Neugent, and forwarding to the remote client machine a training result object returned by the Neugent.
  • FIG. 1A shows a block diagram of a data mining system, according to one embodiment of the present disclosure
  • FIG. 1B shows a schematic view of a data mining system, according to another embodiment
  • FIG. 2A shows a flow chart of a method for providing to a remote client machine a service to consult a Neugent, according to one embodiment
  • FIG. 2B shows a flow chart of a method for providing to a remote client machine a service to train a Neugent, according to one embodiment
  • FIG. 3 shows a schematic view of a functional-link net structure
  • FIGS. 4A and 4B show class diagrams for web services interface methods of Value Predict Neugent, according to one embodiment;
  • FIGS. 5A, 5C, 5E, 5G and 5I show object schemas for assorted Neugents classes, according to another embodiment.
  • FIGS. 5B, 5D, 5F, 5H, 5J and 5K show class diagrams for the web service interface of the Neugents classes
  • FIGS. 6A, 6C, 6E, 6G and 6I show object schemas for assorted Neugents classes, according to a third embodiment
  • FIGS. 6B, 6D, 6F, 6H, 6J and 6K show class diagrams for the web service interface of the Neugents classes, according to the third embodiment
  • FIGS. 7A through 7F show class diagrams for web service interface of assorted Neugents classes, according to a fourth embodiment;
  • FIG. 7G shows an object schema for the Value Predict Neugent, according to the fourth embodiment
  • FIGS. 8A, 8D, 8F, 8H and 8J show object schemas for assorted Neugents classes, according to a fifth embodiment
  • FIGS. 8B, 8C, 8E, 8G, 8I and 8K show class diagrams for the web service interface of the Neugents classes, according to the fifth embodiment;
  • FIG. 9A shows an object schema for Value Predict Neugent, according to a sixth embodiment;
  • FIGS. 9B and 9C show class diagrams for the web service interface of the Value Predict Neugent, according to the sixth embodiment.
  • FIGS. 10A and 10C through 10F show class diagrams for the web service interface of assorted Neugents classes, according to a seventh embodiment.
  • FIG. 10B shows an object schema for Value Predict Neugent, according to the seventh embodiment.
  • the tools may include one or more computer programs or software modules stored on a conventional program storage device or computer readable medium, and/or transmitted via a computer network or other transmission medium.
  • a data mining system 10 includes a client 11, one or more Neugents 13, and a service broker 15.
  • the service broker 15 may be configured as a server, and includes an interface to receive a consultation request from the client.
  • the service broker may also receive a train request from the client, and typically is (although it need not be) a remote server.
  • Neugents 13 are further described below.
  • a method for providing to a remote client machine a service to consult a Neugent is described with reference to FIGS. 1A and 2A.
  • service broker 15 receives a consultation request from the remote client machine (step S21)
  • service broker forwards the consultation request to a Neugent 13 to invoke a consultation of the Neugent (step S22).
  • the service broker 15 forwards to the client a result object returned by the Neugent (step S23).
  • the consultation request includes data for consulting a Neugent 13.
  • the Neugent performs a predictive analysis of the data included in the consultation request.
  • the consultation request includes identification of a source of data for consulting a Neugent 13.
  • the Neugent 13 performs a predictive analysis of input data obtained from the source identified in the consultation request.
  • the service broker is a remote server.
  • the consultation request from the client 11 to the remote server may include an Extensible Markup Language document.
  • the Neugent may be server-.
  • a method for providing to a remote client machine a service to train a Neugent according to one embodiment is described with reference to FIGS. 1A and 2B.
  • service broker 15 receives a train request from the remote client machine (step S26)
  • the service broker forwards the train request to a Neugent 13 to invoke training of the Neugent (step S27).
  • the service broker forwards to the client a training result object returned by the Neugent (step S28).
  • a Neugent may group training data patterns into clusters, with each cluster corresponding to a group of similar data patterns, and predict a probability of membership of an input pattern to a selected group.
  • a Neugent may group training non-numeric (for example, textual) patterns into clusters, with each cluster corresponding to a group of similar non-numeric patterns, and predict a probability of membership of an input non-numeric pattern to a selected group.
  • a Neugent may form a cluster model by grouping training data patterns into a plurality of clusters, with each cluster corresponding to a group of similar data patterns, and determining for each cluster probabilities of transition from the cluster to each of the other clusters.
  • the Neugent predicts a probability of an event occurring by applying an input pattern to the cluster model.
  • a Neugent may form an input-output model associated with a set of training data patterns, and predict an output value by applying the model to an input pattern.
  • the Neugent may include a functional-link net.
  • a Neugent may form rules associated with corresponding relationships in a set of training data patterns, and predict an outcome by applying the rules to an input pattern.
  • Neugents technologies include assorted methodologies for recognizing patterns in data and for using those patterns to make predictions on new data. New data is analyzed to determine the pattern into which it falls, thereby providing a prediction of future behavior based on the behavior that has characterized the pattern in the past.
  • One group of underlying methodologies is often referred to as neural net technology.
  • a neural net is a weighted network of interconnected input/output nodes.
  • Neugent technology covers a broader range of pattern recognition methodologies, in addition to neural net models.
  • Neugents may include ClusteringNeugent, DecisionNeugent, EventPredictNeugent, TextClusteringNeugent and ValuePredictNeugent model methodologies.
  • ClusteringNeugent uses a cluster model methodology which groups patterns that are alike, and predicts the probability of membership to a specific group.
  • DecisionNeugent uses a decision tree model methodology which uncovers rules and relationships in data, formulates rules to describe those relationships, and predicts outcomes based upon the discovered rules.
  • EventPredictNeugent uses a cluster model methodology with transition calculation to predict the probability of an event occurring.
  • TextClusteringNeugent uses a cluster model methodology which groups training data patterns comprising textual (or non-numeric) material that are alike, and predicts a probability that specified textual (or non-numeric) data with which the model is consulted is a member of (or belongs to) a specific group.
  • ValuePredictNeugent uses a functional-link neural net model methodology to predict the value of a variable (or values for a set of variables).
  • a functional-link net is one type of neural net which can be used to model a functional relationship between input and output.
  • a functional-link net may be used to approximate any scalar function with a vector of inputs, x, and an output y, and therefore is a universal approximator.
  • The structure of a functional-link net with non-linearity fully contained in a functional-link layer is illustrated in FIG. 3.
  • the nodes in the functional-link layer have associated non-linear basis functions. Since non-linearity is fully contained in the functional-link layer, and the rest of the net may be linear, linear training techniques such as regression-based training may be used with a functional-link net structure. Linear training refers to techniques that solve for the parameters in the net through linear algebra techniques. Examples of functional-link net methodologies are described in commonly owned U.S. Patents Nos. 4,979,126, 5,734,796, 6,134,537 and 6,212,509 which are incorporated herein in their entirety by reference.
  • Some methodologies associated with EventPredictNeugent are described in commonly-owned U.S. Patent No. 6,327,550 which is incorporated herein by reference. Additional clustering, neural net, decision tree and other predictive modeling methodologies are described in the following commonly-owned U.S. Patent Applications, which are also incorporated herein by reference:
  • Consult is a process of providing new data to a Neugent (also referred to as data for consulting a Neugent) so that the Neugent uses its model, developed during training, to provide a prediction from that data.
  • the train and consult methods are made available to client programs through Web services technology.
  • only data may be passed between a client and a Neugent. Accordingly, the methodologies described in this disclosure place no burden on the client to maintain a predictive model.
  • the complexity of client/server interfaces may be reduced by simplifying protocols and by hiding issues (for example, making them transparent to the user) of platform technology mismatches.
  • Web services technology may be based on invoking procedures in a remote server (also referred to herein as "Web Service Broker" or "WSB"), such as by transmitting an Extensible Markup Language (XML) document, which is a text document, over the HTTP protocol, as depicted in FIG. 1B.
  • In order for Web Service Broker 45 to invoke the train and consult methods of a Neugent 43, the structure of the XML documents calling the corresponding methods of the Neugent is precisely specified.
  • the training and consultation API of the Neugents preferably is rigorously defined so that its methods can be invoked by the WSB.
  • Each of the Neugents mentioned above defines its own specification for training and consulting services (see, for example, FIGS. 4A-10F).
  • the common elements of each Neugent interface include input data, train result and consult result.
  • a collection of data is passed to the Neugent.
  • Data passed to the train service and the consult service may be referred to as training data (also referred to herein as "trainData") or consultation data (also referred to herein as "consultData"), respectively.
  • additional parameters may be passed when training the Neugent, such as to determine the percentage of the training data split between training the model and validating or testing the model.
  • the Neugents typically use numeric data as input.
  • the TextClusteringNeugent also accommodates textual (or other non-numeric) data and the DecisionNeugent accommodates alpha-numeric data.
  • each Neugent returns an object as a result of a training session. The object provides information about the result of the training session.
  • an object representing the Neugent may be returned as part of the structure of the train result.
  • For each Neugent type, the Neugent returns an object as a result of a consultation. Neugents may differ, however, with regard to a structure of the consultation return object. See, for example, FIGS. 5A-5K, in which only the TextClusteringNeugent and the ClusteringNeugent return similarly structured objects.
  • the ValuePredictNeugent may return the ValuePredictNeugent object itself as part of the returned consultation object.
  • Neugents train and consult services may be mapped to the architecture of the Neugent class (discussed below) .
  • the WSB API Interface is discussed exemplarily below for the ValuePredictNeugent only.
  • the WSB API can include a number of classes, with the ValuePredictNeugent class including train and consult methods.
  • the ValuePredictNeugent class may include the following train and consult methods: ValueNeugentTrainResult train(Collection of Pattern trainData, Double validationPercentage, Boolean returnResultFlag); and ValueNeugentConsultResult consult(Collection of Pattern consultData).
  • the user sets up a collection of data under the Pattern class.
  • the Pattern class is a container for a row of data passed to the train or consult method. After passing the data collection into the train or consult method, a ValueNeugentTrainResult object, or a ValueNeugentConsultResult object is returned.
  • the ValueNeugentTrainResult class contains the results from the ValuePredictNeugent train method, and may include the following fields (FIG. 4A): trainStatus indicates a process status when it returns, and is checked in order to determine whether the train method returned successfully; modelTrainError indicates an overall training error of a model (for all model outputs); modelValidationError indicates an overall validation error of the model (for all model outputs); numberOfData indicates a number of patterns used for training; trainError indicates for each output in the OFldNList property of the Neugent instance a corresponding training error; validationError is the validation error for each individual target in OFldNList and is the same as modelValidationError when there is only one output; trainQualityScore indicates for each output in the OFldNList property of the Neugent instance a corresponding training quality score; validationQualityScore indicates for each output in the OFldNList property of the Neugent instance a validation quality score; trainResult is
  • the ValueNeugentConsultResult class contains the results from the ValuePredictNeugent consult method, and may include the following fields (FIG. 4B): consultError indicates for each output on the OFldNList of the Neugent object a corresponding error, and is empty if the target value is not included on the consult data source; consultQualityScore indicates for each output on the OFldNList of the Neugent object a corresponding quality score, and is empty if the target value is not included on the consult data source; consultResult is a collection consisting of pattern label and predicted values of each output for each pattern; originalConsultOutput is a collection consisting of pattern label and original output values for each pattern; rawConsultResult is a collection consisting of pattern label and binary output values for each pattern, and is used for binary output in discrete Neugent; and neugentObject is a shortcut to a model that uses the ValueNeugentTrainResult object.
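The train and consult interface described in the preceding items can be illustrated with a minimal Python sketch. The class and field names (Pattern, ValueNeugentTrainResult, ValueNeugentConsultResult, trainData, validationPercentage, returnResultFlag, trainStatus, modelTrainError, modelValidationError, numberOfData, consultResult, originalConsultOutput) come from the text above. Everything else is an assumption made for illustration only: the layout of Pattern, the "OK" status value, treating validationPercentage as a value from 0 to 100 that holds out the tail of the data, and the toy model itself, which simply predicts the mean training target and is not the functional-link net the disclosure describes. Only a subset of the listed result fields is shown.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Pattern:
    """Container for one row of data passed to train or consult (layout assumed)."""
    label: str
    inputs: List[float]
    target: Optional[float] = None  # may be absent in consult data


@dataclass
class ValueNeugentTrainResult:
    trainStatus: str
    modelTrainError: Optional[float]
    modelValidationError: Optional[float]
    numberOfData: int


@dataclass
class ValueNeugentConsultResult:
    consultResult: Dict[str, float] = field(default_factory=dict)
    originalConsultOutput: Dict[str, float] = field(default_factory=dict)


class ValuePredictNeugent:
    """Toy stand-in: the 'model' is only the mean training target (an assumption)."""

    def __init__(self) -> None:
        self._mean: Optional[float] = None

    def _mse(self, rows: List[Pattern]) -> Optional[float]:
        # Mean squared error of the constant prediction; None for an empty set.
        if not rows:
            return None
        return sum((p.target - self._mean) ** 2 for p in rows) / len(rows)

    def train(self, trainData: List[Pattern], validationPercentage: float = 0.0,
              returnResultFlag: bool = True) -> ValueNeugentTrainResult:
        # Assumption: validationPercentage is 0..100 and the tail is held out.
        # returnResultFlag is accepted for signature parity but unused here,
        # because the excerpt does not define its semantics.
        n_val = int(len(trainData) * validationPercentage / 100.0)
        n_fit = len(trainData) - n_val
        fit, val = trainData[:n_fit], trainData[n_fit:]
        self._mean = sum(p.target for p in fit) / len(fit)
        return ValueNeugentTrainResult(
            trainStatus="OK",  # hypothetical status value
            modelTrainError=self._mse(fit),
            modelValidationError=self._mse(val),
            numberOfData=len(fit),
        )

    def consult(self, consultData: List[Pattern]) -> ValueNeugentConsultResult:
        result = ValueNeugentConsultResult()
        for p in consultData:
            result.consultResult[p.label] = self._mean
            if p.target is not None:
                result.originalConsultOutput[p.label] = p.target
        return result
```

In this sketch the client only builds Pattern rows and reads fields off the returned objects, which mirrors the point that the client does not have to maintain the predictive model.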

Abstract

A data mining system and method are provided. The system includes at least one client and a service broker configured to include an interface to receive a consultation request from the client. The service broker forwards the consultation request to a Neugent to invoke a consultation of the Neugent, and forwards to the client a result object returned by the Neugent.

Description

USING NEURAL NETWORKS FOR DATA MINING
CROSS-REFERENCE TO RELATED APPLICATIONS
This application claims the benefit of the following co-pending U.S. provisional applications:
(a) Serial No. 60/374,064, filed April 19, 2002 and entitled "PROCESSING MIXED NUMERIC AND/OR NON-NUMERIC DATA";
(b) Serial No. 60/374,020, filed April 19, 2002 and entitled "AUTOMATIC NEURAL-NET MODEL GENERATION AND MAINTENANCE";
(c) Serial No. 60/374,024, filed April 19, 2002 and entitled "VIEWING MULTI-DIMENSIONAL DATA THROUGH HIERARCHICAL VISUALIZATION";
(d) Serial No. 60/374,041, filed April 19, 2002 and entitled "METHOD AND APPARATUS FOR DISCOVERING EVOLUTIONARY CHANGES WITHIN A SYSTEM";
(e) Serial No. 60/373,977, filed April 19, 2002 and entitled "AUTOMATIC MODEL MAINTENANCE THROUGH LOCAL NETS"; and
(f) Serial No. 60/373,780, filed April 19, 2002 and entitled "USING NEURAL NETWORKS FOR DATA MINING".
TECHNICAL FIELD
This application relates to data mining. In particular, the application relates to using neural nets and other artificial intelligence techniques for data mining.
DESCRIPTION OF RELATED ART
As use of computers and other information and communication appliances proliferates in the current information age, data, numeric as well as non-numeric (for example, textual), is frequently collected from numerous sources, such as the Internet. Further, large amounts of data exist in many databases. Much of the data is collected for archiving purposes only and therefore, in many instances, is stored without organization. Sifting through the morass of data to extract useful information for a specific purpose may be a substantial challenge.
For example, business concerns are finding an increasing need, in order to remain competitive in their business market, to effectively analyze and extract useful information from data they and/or others have collected and use the extracted information to improve operation of the business. This, however, often may be a daunting task. Data mining is the analysis of large quantities of data in order to extract useful information from the data, such as for making predictions over new data (also called predictive analysis). A number of data mining products are available. However, current commercial products which allow data mining of the wealth of information on the Web require the client application to maintain a predictive model, although a service broker may collect or store raw data and forward it to the client upon demand. Since the client must maintain the predictive model, the resources of the client machine may be overwhelmed when the application is executed.
SUMMARY
This application provides a data mining system. In one embodiment, the data mining system includes a client and a service broker configured to include an interface to receive a consultation request from the client. The service broker forwards the consultation request to a Neugent to invoke a consultation of the Neugent. After the Neugent is consulted, the service broker forwards to the client a result object returned by the Neugent. The service broker also may include a training interface, and receives through the training interface a training request from the client, the training request including training data. The service broker forwards the training request including the training data to the Neugent to invoke training of the Neugent with the training data. The training request may include a parameter specifying a ratio to split the training data between training the Neugent and testing or validating the Neugent. The service broker may forward to the client a training result object returned by the Neugent after training of the Neugent.
The application also provides a method for providing to a remote client machine a service to consult a Neugent. In one embodiment, the method includes receiving a consultation request from the remote client machine, forwarding the consultation request to the Neugent to invoke a consultation of the Neugent, and forwarding to the remote client machine a result object returned by the Neugent.
The application also provides a method for providing to a remote client machine a service to train a Neugent. According to one embodiment, the method includes receiving a train request from the remote client machine, forwarding the train request to the Neugent to invoke training of the Neugent, and forwarding to the remote client machine a training result object returned by the Neugent.
BRIEF DESCRIPTION OF THE DRAWINGS
The features of the present application can be more readily understood from the following detailed description with reference to the accompanying drawings wherein:
FIG. 1A shows a block diagram of a data mining system, according to one embodiment of the present disclosure;
FIG. 1B shows a schematic view of a data mining system, according to another embodiment;
FIG. 2A shows a flow chart of a method for providing to a remote client machine a service to consult a Neugent, according to one embodiment;
FIG. 2B shows a flow chart of a method for providing to a remote client machine a service to train a Neugent, according to one embodiment;
FIG. 3 shows a schematic view of a functional-link net structure;
FIGS. 4A and 4B show class diagrams for web services interface methods of Value Predict Neugent, according to one embodiment;
FIGS. 5A, 5C, 5E, 5G and 5I show object schemas for assorted Neugents classes, according to another embodiment;
FIGS. 5B, 5D, 5F, 5H, 5J and 5K show class diagrams for the web service interface of the Neugents classes;
FIGS. 6A, 6C, 6E, 6G and 6I show object schemas for assorted Neugents classes, according to a third embodiment;
FIGS. 6B, 6D, 6F, 6H, 6J and 6K show class diagrams for the web service interface of the Neugents classes, according to the third embodiment;
FIGS. 7A through 7F show class diagrams for web service interface of assorted Neugents classes, according to a fourth embodiment;
FIG. 7G shows an object schema for the Value Predict Neugent, according to the fourth embodiment;
FIGS. 8A, 8D, 8F, 8H and 8J show object schemas for assorted Neugents classes, according to a fifth embodiment;
FIGS. 8B, 8C, 8E, 8G, 8I and 8K show class diagrams for the web service interface of the Neugents classes, according to the fifth embodiment;
FIG. 9A shows an object schema for Value Predict Neugent, according to a sixth embodiment;
FIGS. 9B and 9C show class diagrams for the web service interface of the Value Predict Neugent, according to the sixth embodiment;
FIGS. 10A and 10C through 10F show class diagrams for the web service interface of assorted Neugents classes, according to a seventh embodiment; and
FIG. 10B shows an object schema for Value Predict Neugent, according to the seventh embodiment.
DETAILED DESCRIPTION
This application provides tools (in the form of systems and methodologies) for data mining. For example, the tools may include one or more computer programs or software modules stored on a conventional program storage device or computer readable medium, and/or transmitted via a computer network or other transmission medium.
A data mining system, according to a client-server paradigm, is explained below with reference to FIG. 1A. It should be understood, however, that the tools of the present application are not limited to a client-server programming model, and may be adapted for use in peer-to-peer systems, message passing systems, as well as other programming models. A data mining system 10 includes a client 11, one or more Neugents 13, and a service broker 15. The service broker 15 may be configured as a server, and includes an interface to receive a consultation request from the client. The service broker may also receive a train request from the client, and typically is (although it need not be) a remote server. Neugents 13 are further described below.
A method for providing to a remote client machine a service to consult a Neugent, in accordance with one embodiment, is described with reference to FIGS. 1A and 2A. After the service broker 15 receives a consultation request from the remote client machine (step S21), the service broker forwards the consultation request to a Neugent 13 to invoke a consultation of the Neugent (step S22). After the Neugent 13 is consulted, the service broker 15 forwards to the client a result object returned by the Neugent (step S23).
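The consult flow of steps S21 through S23 can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the disclosed implementation: the ServiceBroker class, its register and handle_consult methods, and the dictionary keys "neugent" and "consultData" are hypothetical names, and the request is modeled as an in-process dictionary rather than a message received over a network.

```python
from typing import Any, Dict, Protocol


class Neugent(Protocol):
    """Anything with a consult method can be brokered (hypothetical interface)."""

    def consult(self, consult_data: Any) -> Any: ...


class ServiceBroker:
    """Sketch of a broker that forwards requests and relays result objects."""

    def __init__(self) -> None:
        self._neugents: Dict[str, Neugent] = {}

    def register(self, name: str, neugent: Neugent) -> None:
        # Make a Neugent reachable under a name the client can refer to.
        self._neugents[name] = neugent

    def handle_consult(self, request: Dict[str, Any]) -> Any:
        # Step S21: the consultation request has been received from the client.
        name = request["neugent"]
        if name not in self._neugents:
            raise LookupError(f"unknown Neugent: {name}")
        # Step S22: forward the request to the Neugent to invoke a consultation.
        result_object = self._neugents[name].consult(request["consultData"])
        # Step S23: forward the result object returned by the Neugent.
        return result_object
```

The broker holds no model state of its own in this sketch; it only routes the request and relays the result, so the model stays with the Neugent.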
The consultation request, according to one embodiment, includes data for consulting a Neugent 13. The Neugent performs a predictive analysis of the data included in the consultation request.
According to another embodiment, the consultation request includes identification of a source of data for consulting a Neugent 13. The Neugent 13 performs a predictive analysis of input data obtained from the source identified in the consultation request.
According to another embodiment, the service broker is a remote server. The consultation request from the client 11 to the remote server may include an Extensible Markup Language document. The Neugent may be server-.
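As an illustration of a consultation request carried as an XML document, the following Python sketch builds and parses one with the standard library. The element and attribute names (consultRequest, consultData, pattern, value, neugent, label) are hypothetical; the excerpt states only that the request may include an XML document and does not give its schema.

```python
import xml.etree.ElementTree as ET
from typing import List, Tuple

Row = Tuple[str, List[float]]


def build_consult_request(neugent_name: str, rows: List[Row]) -> str:
    """Serialize labeled rows of numeric data into a (hypothetical) XML request."""
    root = ET.Element("consultRequest", {"neugent": neugent_name})
    data = ET.SubElement(root, "consultData")
    for label, values in rows:
        pattern = ET.SubElement(data, "pattern", {"label": label})
        for v in values:
            ET.SubElement(pattern, "value").text = repr(float(v))
    return ET.tostring(root, encoding="unicode")


def parse_consult_request(xml_text: str) -> Tuple[str, List[Row]]:
    """Recover the Neugent name and the rows on the server side."""
    root = ET.fromstring(xml_text)
    rows: List[Row] = []
    for pattern in root.find("consultData").findall("pattern"):
        values = [float(v.text) for v in pattern.findall("value")]
        rows.append((pattern.get("label"), values))
    return root.get("neugent"), rows
```

Because the document is plain text, it could be sent as the body of an HTTP request; the transport itself is left out of this sketch.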
A method for providing to a remote client machine a service to train a Neugent, according to one embodiment, is described with reference to FIGS. 1A and 2B. After the service broker 15 receives a train request from the remote client machine (step S26), the service broker forwards the train request to a Neugent 13 to invoke training of the Neugent (step S27). After the Neugent is trained, the service broker forwards to the client a training result object returned by the Neugent (step S28).
A Neugent may group training data patterns into clusters, with each cluster corresponding to a group of similar data patterns, and predict a probability of membership of an input pattern to a selected group.
A Neugent may group training non-numeric (for example, textual) patterns into clusters, with each cluster corresponding to a group of similar non-numeric patterns, and predict a probability of membership of an input non-numeric pattern to a selected group.
A Neugent may form a cluster model by grouping training data patterns into a plurality of clusters, with each cluster corresponding to a group of similar data patterns, and determining for each cluster probabilities of transition from the cluster to each of the other clusters. The Neugent predicts a probability of an event occurring by applying an input pattern to the cluster model.
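The transition calculation in such a cluster model can be sketched in Python under strong simplifying assumptions. The sketch assumes the training patterns have already been assigned to clusters and arrive as a time-ordered sequence of cluster labels, estimates transition probabilities from counts of consecutive pairs, and treats an "event" as landing in one of a chosen set of clusters on the next step. None of these choices is specified in the text, the function names are hypothetical, and self-transitions are counted here as well.

```python
from collections import Counter, defaultdict
from typing import Dict, Hashable, Iterable, Sequence

Transitions = Dict[Hashable, Dict[Hashable, float]]


def transition_probabilities(cluster_sequence: Sequence[Hashable]) -> Transitions:
    """Estimate P(next cluster | current cluster) from consecutive label pairs."""
    counts = defaultdict(Counter)
    for current, following in zip(cluster_sequence, cluster_sequence[1:]):
        counts[current][following] += 1
    probabilities: Transitions = {}
    for current, followers in counts.items():
        total = sum(followers.values())
        probabilities[current] = {c: n / total for c, n in followers.items()}
    return probabilities


def event_probability(probabilities: Transitions, current_cluster: Hashable,
                      event_clusters: Iterable[Hashable]) -> float:
    """Probability that the next step lands in any cluster marked as the event."""
    targets = set(event_clusters)
    row = probabilities.get(current_cluster, {})
    return sum(p for cluster, p in row.items() if cluster in targets)
```

To consult such a model, an input pattern would first be mapped to its cluster (not shown), and that cluster's row of transition probabilities would then give the event probability.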
A Neugent may form an input-output model associated with a set of training data patterns, and predict an output value by applying the model to an input pattern. The Neugent may include a functional-link net. A Neugent may form rules associated with corresponding relationships in a set of training data patterns, and predict an outcome by applying the rules to an input pattern.
Neugents technologies include assorted methodologies for recognizing patterns in data and for using those patterns to make predictions on new data. New data is analyzed to determine the pattern into which it falls, thereby providing a prediction of future behavior based on the behavior that has characterized the pattern in the past.
One group of underlying methodologies is often referred to as neural net technology. A neural net is a weighted network of interconnected input/output nodes. Neugent technology covers a broader range of pattern recognition methodologies, in addition to neural net models.
For example, Neugents may include ClusteringNeugent, DecisionNeugent, EventPredictNeugent, TextClusteringNeugent and ValuePredictNeugent model methodologies.
ClusteringNeugent uses a cluster model methodology which groups patterns that are alike, and predicts the probability of membership to a specific group.
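One way to picture a membership probability, offered only as an assumption-laden sketch, is to score an input pattern against cluster centroids. The disclosure does not specify how membership is computed; the softmax over negative Euclidean distance used here, the sample data and the function names are all hypothetical, and the grouping of training patterns is taken as already done.

```python
import math


def centroid(points):
    """Component-wise mean of a list of equal-length numeric patterns."""
    return [sum(component) / len(points) for component in zip(*points)]


def membership_probabilities(pattern, centroids):
    """Softmax over negative Euclidean distances to each cluster centroid."""
    weights = {
        name: math.exp(-math.dist(pattern, center))
        for name, center in centroids.items()
    }
    total = sum(weights.values())
    return {name: weight / total for name, weight in weights.items()}


# Hypothetical training patterns, already grouped into two clusters.
clusters = {
    "low": [[0.0, 0.1], [0.2, 0.0], [0.1, 0.2]],
    "high": [[5.0, 5.1], [5.2, 4.9], [4.9, 5.0]],
}
centroids = {name: centroid(points) for name, points in clusters.items()}
probs = membership_probabilities([0.3, 0.2], centroids)
```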
DecisionNeugent uses a decision tree model methodology which uncovers rules and relationships in data, formulates rules to describe those relationships, and predicts outcomes based upon the discovered rules.
EventPredictNeugent uses a cluster model methodology with transition calculation to predict the probability of an event occurring.
TextClusteringNeugent uses a cluster model methodology which groups training data patterns comprising textual (or non-numeric) material that are alike, and predicts a probability that specified textual (or non-numeric) data with which the model is consulted is a member of (or belongs to) a specific group.
ValuePredictNeugent uses a functional-link neural net model methodology to predict the value of a variable (or values for a set of variables) .
A functional-link net is one type of neural net which can be used to model a functional relationship between input and output. A functional-link net may be used to approximate any scalar function with a vector of inputs, x, and an output y, and therefore is a universal approximator.
The structure of a functional-link net with non-linearity fully contained in a functional-link layer is illustrated in FIG. 3. The nodes in the functional-link layer have associated non-linear basis functions. Since non-linearity is fully contained in the functional-link layer, and the rest of the net may be linear, linear training techniques such as regression-based training may be used with a functional-link net structure. Linear training refers to techniques that solve for the parameters in the net through linear algebra techniques. Examples of functional-link net methodologies are described in commonly owned U.S. Patents Nos. 4,979,126, 5,734,796, 6,134,537 and 6,212,509 which are incorporated herein in their entirety by reference.
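The idea of confining the non-linearity to a functional-link layer and solving the remaining linear weights by regression can be sketched as below. This is a minimal illustration under stated assumptions, not the method of the referenced patents: the basis functions (1, x, sin x, cos x), the scalar input, the normal-equation solver and all function names are chosen here purely for the example.

```python
import math


def functional_link_features(x):
    """Expand a scalar input through fixed non-linear basis functions (assumed)."""
    return [1.0, x, math.sin(x), math.cos(x)]


def solve_linear_system(a, b):
    """Solve a * w = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    w = [0.0] * n
    for i in range(n - 1, -1, -1):
        tail = sum(m[i][j] * w[j] for j in range(i + 1, n))
        w[i] = (m[i][n] - tail) / m[i][i]
    return w


def train_output_weights(xs, ys):
    """Least-squares fit of the linear output layer via the normal equations."""
    phi = [functional_link_features(x) for x in xs]
    k = len(phi[0])
    gram = [[sum(p[i] * p[j] for p in phi) for j in range(k)] for i in range(k)]
    rhs = [sum(p[i] * y for p, y in zip(phi, ys)) for i in range(k)]
    return solve_linear_system(gram, rhs)


def predict(weights, x):
    """Output of the net: a linear combination of the functional-link nodes."""
    return sum(w * f for w, f in zip(weights, functional_link_features(x)))


# Hypothetical training data drawn from y = 2 + 3*sin(x).
xs = [i * 0.5 for i in range(-10, 11)]
ys = [2.0 + 3.0 * math.sin(x) for x in xs]
weights = train_output_weights(xs, ys)
```

Because only the output weights are unknown, training reduces to a single linear solve with no iterative optimization.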
Some methodologies associated with EventPredictNeugent are described in commonly-owned U.S. Patent No. 6,327,550 which is incorporated herein by reference. Additional clustering, neural net, decision tree and other predictive modeling methodologies are described in the following commonly-owned U.S. Patent Applications, which are also incorporated herein by reference:
Serial No. 60/374,064, filed April 19, 2002 and entitled PROCESSING MIXED NUMERIC AND/OR NON-NUMERIC DATA;
Serial No. 60/374,020, filed April 19, 2002 and entitled AUTOMATIC NEURAL-NET MODEL GENERATION AND MAINTENANCE;
Serial No. 60/374,024, filed April 19, 2002 and entitled VIEWING MULTI-DIMENSIONAL DATA THROUGH HIERARCHICAL VISUALIZATION;
Serial No. 60/374,041, filed April 19, 2002 and entitled METHOD AND APPARATUS FOR DISCOVERING EVOLUTIONARY CHANGES WITHIN A SYSTEM;
Serial No. 60/373,977, filed April 19, 2002 and entitled AUTOMATIC MODEL MAINTENANCE THROUGH and
Serial No. 60/373,780, filed April 19, 2002 and entitled "USING NEURAL NETWORKS FOR DATA MINING".
Each Neugent provides the following methods, which are commonly referred to collectively as an "Application Programmer Interface", or "API", and referred to in connection with Web services simply as "services".
Train is a process of providing data (also referred to more specifically as training data patterns) to a Neugent so that the Neugent performs statistical or other analysis of the training data patterns which provides a basis for future predictions. The output of training a Neugent is a model or other data classification which becomes the means by which the Neugent recognizes patterns.
Consult is a process of providing new data to a Neugent (also referred to as data for consulting the Neugent) so that the Neugent uses its model, developed during training, to provide a prediction from the new data.
A Web service enabled implementation of the train and consult methods of the Neugents, according to one embodiment, is described below, with reference to the drawings, including FIGS. 5A through 10F. The train and consult methods are made available to client programs through Web services technology.
Typically, only data may be passed between a client and a Neugent. Accordingly, the methodologies described in this disclosure place no burden on the client to maintain a predictive model. The complexity of client/server interfaces may be reduced by simplifying protocols and by hiding issues of platform technology mismatches (for example, making them transparent to the user).
For example, Web services technology may be based on invoking procedures in a remote server (also referred to herein as "Web Service Broker" or "WSB"), such as by transmitting an Extended Mark-up Language (XML) document, which is a text document, over the HTTP protocol, as depicted in FIG. 1B. In order for Web Service Broker 45 to invoke the train and consult methods of a Neugent 43, the structure of the XML documents calling the corresponding methods of the Neugent is precisely specified. The training and consultation API of the Neugents preferably is rigorously defined so that it can be invoked by the WSB.
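Purely as an illustration of a consult call carried as an XML text document, the following builds and parses such a document. The element and attribute names (`consultRequest`, `consultData`, `pattern`, `value`, `neugent`, `label`) are hypothetical; the actual document structure is the one specified for each Neugent and is not reproduced here. Transmission over HTTP is omitted.

```python
import xml.etree.ElementTree as ET


def build_consult_request(neugent_name, patterns):
    """Serialize labeled rows of consult data into an XML text document."""
    root = ET.Element("consultRequest", {"neugent": neugent_name})
    data = ET.SubElement(root, "consultData")
    for label, values in patterns:
        pattern = ET.SubElement(data, "pattern", {"label": label})
        for value in values:
            ET.SubElement(pattern, "value").text = repr(value)
    return ET.tostring(root, encoding="unicode")


def parse_consult_request(document):
    """Recover the target Neugent name and the patterns on the broker side."""
    root = ET.fromstring(document)
    patterns = [
        (p.get("label"), [float(v.text) for v in p.findall("value")])
        for p in root.find("consultData").findall("pattern")
    ]
    return root.get("neugent"), patterns


xml_text = build_consult_request("ValuePredictNeugent", [("row-1", [0.5, 1.25])])
name, patterns = parse_consult_request(xml_text)
```

Since the document is plain text, the client and the broker need to agree only on this structure, not on a platform or programming language.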
In addition, an interface is implemented within each respective Neugent.
Each of the Neugents mentioned above defines its own specification for training and consulting services (see, for example, FIGS. 4A-10F). The common elements of each Neugent interface include input data, train result and consult result.
For both the train and consult services, a collection of data is passed to the Neugent. Data passed to the train service and the consult service may be referred to as training data (also referred to herein as "trainData") or consultation data (also referred to herein as "consultData"), respectively. In some cases (for example, the ValuePredictNeugent), additional parameters may be passed when training the Neugent, such as to determine the percentage of the training data split between training the model and validating or testing the model. The Neugents typically use numeric data as input. However, the TextClusteringNeugent also accommodates textual (or other non-numeric) data and the DecisionNeugent accommodates alpha-numeric data.
Except for EventPredictNeugent, each Neugent returns an object as a result of a training session. The object provides information about the result of the training session. For ValuePredictNeugent, an object representing the Neugent may be returned as part of the structure of the train result.
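The percentage-based division of training data mentioned above for the ValuePredictNeugent might look like the following sketch. How the patterns are actually selected is not stated in the disclosure; this example simply holds out the trailing rows, and the function name `split_patterns` and its rounding rule are assumptions.

```python
def split_patterns(patterns, validation_percentage):
    """Split patterns into (training, validation) lists by percentage."""
    if not 0.0 <= validation_percentage <= 100.0:
        raise ValueError("validation_percentage must be between 0 and 100")
    n_validation = round(len(patterns) * validation_percentage / 100.0)
    cut = len(patterns) - n_validation
    # Assumption: the last rows are held out; a shuffled split is equally possible.
    return patterns[:cut], patterns[cut:]


# Hypothetical training data: ten rows of two numeric values each.
rows = [[float(i), float(2 * i)] for i in range(10)]
train_rows, validation_rows = split_patterns(rows, 30.0)
```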
For each Neugent type, the Neugent returns an object as a result of a consultation. Neugents may differ, however, with regard to a structure of the consultation return object. See, for example, FIGS. 5A-5K, in which only the TextClusteringNeugent and the ClusteringNeugent return similarly structured objects. The ValuePredictNeugent may return the ValuePredictNeugent object itself as part of the returned consultation object.
The specification of Neugents train and consult services may be mapped to the architecture of the Neugent class (discussed below) .
The WSB API Interface is discussed exemplarily below for the ValuePredictNeugent only.
The WSB API can include a number of classes, with the ValuePredictNeugent class including train and consult methods.
For example, the ValuePredictNeugent class may include the following train and consult methods:
ValueNeugentTrainResult train(Collection of Pattern trainData, Double validationPercentage, Boolean returnResultFlag); and
ValueNeugentConsultResult consult(Collection of Pattern consultData).
The user sets up a collection of data under the Pattern class. The Pattern class is a container for a row of data passed to the train or consult method. After passing the data collection into the train or consult method, a ValueNeugentTrainResult object, or a ValueNeugentConsultResult object is returned.
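A rough Python rendering of this calling pattern is sketched below. Only the class, method and parameter names come from the signatures above; everything else is assumed for illustration: the layout of `Pattern`, the reduced set of result fields, the status string "OK", the mean-squared training error, and above all the toy model, which predicts the mean training target rather than using a functional-link net.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Pattern:
    """Container for one row of data (field layout is assumed)."""
    label: str
    inputs: List[float]
    target: Optional[float] = None


@dataclass
class ValueNeugentTrainResult:
    trainStatus: str        # assumed status value: "OK" on success
    numberOfData: int       # number of patterns used for training
    modelTrainError: float  # assumed here to be a mean squared error


@dataclass
class ValueNeugentConsultResult:
    consultResult: List[Tuple[str, float]]  # (pattern label, predicted value)


class ValuePredictNeugent:
    """Toy stand-in that predicts the mean training target for every pattern."""

    def __init__(self) -> None:
        self._mean: Optional[float] = None

    def train(self, trainData, validationPercentage, returnResultFlag):
        # validationPercentage and returnResultFlag are accepted only for
        # signature parity; their behavior is not modeled in this sketch.
        targets = [p.target for p in trainData]
        self._mean = sum(targets) / len(targets)
        error = sum((t - self._mean) ** 2 for t in targets) / len(targets)
        return ValueNeugentTrainResult("OK", len(trainData), error)

    def consult(self, consultData):
        if self._mean is None:
            raise RuntimeError("train must be called before consult")
        return ValueNeugentConsultResult([(p.label, self._mean) for p in consultData])


neugent = ValuePredictNeugent()
train_result = neugent.train(
    [Pattern("a", [1.0], 2.0), Pattern("b", [2.0], 4.0)], 0.0, True
)
consult_result = neugent.consult([Pattern("c", [3.0])])
```

A caller would check `trainStatus` on the returned train result before issuing consultations.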
The ValueNeugentTrainResult class contains the results from the ValuePredictNeugent train method, and may include the following fields (FIG. 4A):
trainStatus indicates a process status when the method returns, and is checked in order to determine whether the train method returned successfully;
modelTrainError indicates an overall training error of a model (for all model outputs);
modelValidationError indicates an overall validation error of the model (for all model outputs);
numberOfData indicates a number of patterns used for training;
trainError indicates for each output in the OFldNList property of the Neugent instance a corresponding training error;
validationError is the validation error for each individual target in OFldNList and is the same as modelValidationError when there is only one output;
trainQualityScore indicates for each output in the OFldNList property of the Neugent instance a corresponding training quality score;
validationQualityScore indicates for each output in the OFldNList property of the Neugent instance a validation quality score;
trainResult is a collection consisting of pattern label and model predicted values of each target for each pattern;
validationResult is an inner collection consisting of pattern label and model predicted values of each target for each pattern;
rawTrainResult is a collection consisting of pattern label and raw values (before clip) of each target for each pattern, and is used for binary output in discrete Neugent;
rawValidationResult is a collection consisting of pattern label and raw values (before clip) of each target for each pattern used for validation, and is used for binary output in discrete Neugent;
originalTrainOutput is a collection consisting of pattern label and original values of each target for each pattern used for training;
originalValidationOutput is a collection consisting of pattern label and original values of each target for each pattern used for validation; and
neugentModel is a shortcut to the model that uses the ValueNeugentTrainResult object.
The ValueNeugentConsultResult class contains the results from the ValuePredictNeugent consult method, and may include the following fields (FIG. 4B):
consultError indicates for each output on the OFldNList of the Neugent object a corresponding error, and is empty if the target value is not included on the consult data source;
consultQualityScore indicates for each output on the OFldNList of the Neugent object a corresponding quality score, and is empty if the target value is not included on the consult data source;
consultResult is a collection consisting of pattern label and predicted values of each output for each pattern;
originalConsultOutput is a collection consisting of pattern label and original output values for each pattern;
rawConsultResult is a collection consisting of pattern label and binary output values for each pattern, and is used for binary output in discrete Neugent; and
neugentObject is a shortcut to a model that uses the ValueNeugentTrainResult object.
Class diagrams for additional exemplary embodiments are shown in FIGS. 5A-5K, 6A-6K, 7A-7G, 8A-8K, 9A-9C and 10A-10F. Similarly named fields have similar functionality as described above. In the interest of clarity, a description of the fields in the additional exemplary embodiments is omitted.
The above specific embodiments are illustrative, and many variations can be introduced on these embodiments without departing from the spirit of the disclosure or from the scope of the appended claims. Elements and/or features of different illustrative embodiments may be combined with each other and/or substituted for each other within the scope of this disclosure and appended claims.
For example, although some embodiments described herein use a combination of ClusteringNeugent, DecisionNeugent, EventPredictNeugent, TextClusteringNeugent and ValuePredictNeugent methodologies, the matter recited in the appended claims may be practiced with a selected subset of these Neugents, with or without other Neugents technologies which use clustering, neural net, decision tree and/or other predictive modeling methodologies. Additional variations may be apparent to one of ordinary skill in the art from reading the following U.S. provisional applications Nos. 60/374,064, 60/374,020, 60/374,024, 60/374,041, 60/373,977 and 60/373,780, each filed April 19, 2002.

Claims

What is claimed is:
1. A data mining system comprising: a client; and a service broker configured to include an interface to receive a consultation request from the client, wherein the service broker forwards the consultation request to a Neugent to invoke a consultation of the Neugent, and forwards to the client a result object returned by the Neugent.
2. The system of claim 1, wherein the consultation request includes data for consulting the Neugent.
3. The system of claim 2, wherein the Neugent performs a predictive analysis of the data included in the consultation request.
4. The system of claim 1, wherein the consultation request includes identification of a source of data for consulting the Neugent.
5. The system of claim 4, wherein the Neugent performs a predictive analysis of input data obtained from the source identified in the consultation request.
6. The system of claim 1, wherein the service broker receives through the interface a training request from the client, the training request including training data, and forwards the training request including the training data to the Neugent to invoke training of the Neugent with the training data.
7. The system of claim 6, wherein the training request includes a parameter specifying a ratio to split the training data between training the Neugent and testing the Neugent.
8. The system of claim 6, wherein the service broker forwards to the client a training result object returned by the Neugent after training of the Neugent.
9. The system of claim 1, wherein the Neugent groups training data patterns into clusters, each cluster corresponding to a group of similar data patterns, and predicts a probability of membership of an input pattern to a selected group.
10. The system of claim 1, wherein the Neugent groups training non-numeric patterns into clusters, each cluster corresponding to a group of similar non-numeric patterns, and predicts a probability of membership of an input non-numeric pattern to a selected group.
11. The system of claim 1, wherein the Neugent forms a cluster model by grouping training data patterns into a plurality of clusters, each cluster corresponding to a group of similar data patterns, and determining for each cluster probabilities of transition from the cluster to each of the other clusters, and predicts a probability of an event occurring by applying an input pattern to the cluster model.
12. The system of claim 1, wherein the Neugent forms an input-output model associated with a set of training data patterns, and predicts an output value by applying the model to an input pattern.
13. The system of claim 1, wherein the Neugent forms rules associated with corresponding relationships in a set of training data patterns, and predicts an outcome by applying the rules to an input pattern.
14. The system of claim 1, wherein the Neugent includes a functional-link net.
15. The system of claim 1, wherein the service broker is a remote server.
16. The system of claim 15, wherein the consultation request includes an Extended Markup Language document.
17. The system of claim 15, wherein the Neugent is server-side.
18. A method for providing to a remote client machine a service to consult a Neugent, comprising: receiving a consultation request from the remote client machine; forwarding the consultation request to the Neugent to invoke a consultation of the Neugent; and forwarding to the remote client machine a result object returned by the Neugent.
19. A computer system, comprising: a processor; and a program storage device readable by the computer system, tangibly embodying a program of instructions executable by the processor to perform the method of claim 18.
20. A program storage device readable by a machine, tangibly embodying a program of instructions executable by the machine to perform the method of claim 18.
21. A computer data signal embodied in a transmission medium which embodies instructions executable by a computer to perform the method of claim 18.
22. A method for providing to a remote client machine a service to train a Neugent, comprising: receiving a train request from the remote client machine; forwarding the train request to the Neugent to invoke training of the Neugent; and forwarding to the remote client machine a training result object returned by the Neugent.
23. A computer system, comprising: a processor; and a program storage device readable by the computer system, tangibly embodying a program of instructions executable by the processor to perform the method of claim 22.
24. A program storage device readable by a machine, tangibly embodying a program of instructions executable by the machine to perform the method of claim 22.
25. A computer data signal embodied in a transmission medium which embodies instructions executable by a computer to perform the method of claim 22.
PCT/US2003/011983 2002-04-19 2003-04-18 Using neural networks for data mining WO2003090122A2 (en)

Priority Applications (7)

Application Number Priority Date Filing Date Title
KR10-2004-7016751A KR20040101532A (en) 2002-04-19 2003-04-18 Using neural networks for data mining
AU2003241302A AU2003241302A1 (en) 2002-04-19 2003-04-18 Using neural networks for data mining
JP2003586796A JP2006511850A (en) 2002-04-19 2003-04-18 Using neural networks for data mining
BRPI0309388-3A BR0309388A (en) 2002-04-19 2003-04-18 using neural networks for data exploration
EP03731033A EP1520237A2 (en) 2002-04-19 2003-04-18 Using neural networks for data mining
CA002480951A CA2480951A1 (en) 2002-04-19 2003-04-18 Using neural networks for data mining
IL16462304A IL164623A0 (en) 2002-04-19 2004-10-14 Using neural networks for data mining

Applications Claiming Priority (12)

Application Number Priority Date Filing Date Title
US37404102P 2002-04-19 2002-04-19
US37402002P 2002-04-19 2002-04-19
US37402402P 2002-04-19 2002-04-19
US37406402P 2002-04-19 2002-04-19
US37378002P 2002-04-19 2002-04-19
US37397702P 2002-04-19 2002-04-19
US60/374,041 2002-04-19
US60/374,024 2002-04-19
US60/373,977 2002-04-19
US60/374,064 2002-04-19
US60/373,780 2002-04-19
US60/374,020 2002-04-19

Publications (2)

Publication Number Publication Date
WO2003090122A2 true WO2003090122A2 (en) 2003-10-30
WO2003090122A8 WO2003090122A8 (en) 2005-02-03

Family

ID=29255729

Family Applications (2)

Application Number Title Priority Date Filing Date
PCT/US2003/011983 WO2003090122A2 (en) 2002-04-19 2003-04-18 Using neural networks for data mining
PCT/US2003/012021 WO2003090160A2 (en) 2002-04-19 2003-04-18 Processing mixed numeric and/or non-numeric data

Family Applications After (1)

Application Number Title Priority Date Filing Date
PCT/US2003/012021 WO2003090160A2 (en) 2002-04-19 2003-04-18 Processing mixed numeric and/or non-numeric data

Country Status (10)

Country Link
US (3) US7716148B2 (en)
EP (2) EP1520237A2 (en)
JP (2) JP2005523533A (en)
KR (1) KR20050007306A (en)
CN (1) CN1647079A (en)
AU (2) AU2003241302A1 (en)
BR (2) BR0309322A (en)
CA (2) CA2480951A1 (en)
IL (2) IL164628A0 (en)
WO (2) WO2003090122A2 (en)

Family Cites Families (76)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4193115A (en) 1977-12-15 1980-03-11 The United States Of America As Represented By The Secretary Of Commerce Method and apparatus for implementation of the CMAC mapping algorithm
US4215396A (en) 1978-08-24 1980-07-29 Texas Instruments Incorporated Intelligent programmable process control system
US4438497A (en) 1981-07-20 1984-03-20 Ford Motor Company Adaptive strategy to control internal combustion engine
US4649515A (en) 1984-04-30 1987-03-10 Westinghouse Electric Corp. Methods and apparatus for system fault diagnosis and control
JPH0789283B2 (en) 1984-11-02 1995-09-27 株式会社日立製作所 Formula processing control system
US4670848A (en) 1985-04-10 1987-06-02 Standard Systems Corporation Artificial intelligence system
US4663703A (en) 1985-10-02 1987-05-05 Westinghouse Electric Corp. Predictive model reference adaptive controller
US4754410A (en) 1986-02-06 1988-06-28 Westinghouse Electric Corp. Automated rule based process control method with feedback and apparatus therefor
US4858147A (en) 1987-06-15 1989-08-15 Unisys Corporation Special purpose neurocomputer system for solving optimization problems
FR2625347B1 (en) 1987-12-23 1990-05-04 Labo Electronique Physique NEURON NETWORK STRUCTURE AND CIRCUIT AND ARRANGEMENT OF NEURON NETWORKS
US4979126A (en) 1988-03-30 1990-12-18 Ai Ware Incorporated Neural network with non-linear transformations
US4928484A (en) 1988-12-20 1990-05-29 Allied-Signal Inc. Nonlinear multivariable control system
US4972363A (en) 1989-02-01 1990-11-20 The Boeing Company Neural network using stochastic processing
JPH0660826B2 (en) 1989-02-07 1994-08-10 動力炉・核燃料開発事業団 Plant abnormality diagnosis method
US5119468A (en) 1989-02-28 1992-06-02 E. I. Du Pont De Nemours And Company Apparatus and method for controlling a process using a trained parallel distributed processing network
JPH0738186B2 (en) 1989-03-13 1995-04-26 シャープ株式会社 Self-expanding neural network
US5033087A (en) 1989-03-14 1991-07-16 International Business Machines Corp. Method and apparatus for the automatic determination of phonological rules as for a continuous speech recognition system
JPH0375860A (en) * 1989-08-18 1991-03-29 Hitachi Ltd Personalized terminal
JP2821189B2 (en) 1989-09-01 1998-11-05 株式会社日立製作所 Learning type decision support system
US5140523A (en) 1989-09-05 1992-08-18 Ktaadn, Inc. Neural network for predicting lightning
JPH0711256B2 (en) 1989-09-06 1995-02-08 本田技研工業株式会社 Control device for internal combustion engine
IT1232989B (en) * 1989-09-14 1992-03-13 Rizzi & Co Spa Luigi SINGLE-PASS LEATHER SHAVING MACHINE
CA2031765C (en) 1989-12-08 1996-02-20 Masahide Nomura Method and system for performing control conforming with characteristics of controlled system
US5111531A (en) 1990-01-08 1992-05-05 Automation Technology, Inc. Process control using neural network
US5142665A (en) * 1990-02-20 1992-08-25 International Business Machines Corporation Neural network shell for application programs
US5052043A (en) 1990-05-07 1991-09-24 Eastman Kodak Company Neural network with back propagation controlled through an output confidence measure
US5113483A (en) 1990-06-15 1992-05-12 Microelectronics And Computer Technology Corporation Neural network with semi-localized non-linear mapping of the input space
US5142612A (en) 1990-08-03 1992-08-25 E. I. Du Pont De Nemours & Co. (Inc.) Computer neural network supervisory process control system and method
US5175678A (en) 1990-08-15 1992-12-29 Elsag International B.V. Method and procedure for neural control of dynamic processes
US5335291A (en) 1991-09-20 1994-08-02 Massachusetts Institute Of Technology Method and apparatus for pattern mapping system with self-reliability check
US5483650A (en) 1991-11-12 1996-01-09 Xerox Corporation Method of constant interaction-time clustering applied to document browsing
US5442778A (en) 1991-11-12 1995-08-15 Xerox Corporation Scatter-gather: a cluster-based method and apparatus for browsing large document collections
US5349541A (en) 1992-01-23 1994-09-20 Electric Power Research Institute, Inc. Method and apparatus utilizing neural networks to predict a specified signal value within a multi-element system
KR970008532B1 (en) 1993-08-10 1997-05-24 재단법인 한국전자통신연구소 Neural network
US5420937A (en) * 1993-09-01 1995-05-30 The Phoenix Group, Inc. Fingerprint information extraction by twin tracker border line analysis
US5485390A (en) 1993-11-30 1996-01-16 The United States Of America As Represented By The Secretary Of The Air Force Inductive-deductive process design for machined parts
US5835901A (en) 1994-01-25 1998-11-10 Martin Marietta Corporation Perceptive system including a neural network
US5848373A (en) * 1994-06-24 1998-12-08 Delorme Publishing Company Computer aided map location system
US5848402A (en) 1994-07-07 1998-12-08 Ai Ware, Inc. Universal system for artificial intelligence based learning, categorization, and optimization
JPH0877010A (en) * 1994-09-07 1996-03-22 Hitachi Ltd Method and device for data analysis
KR100305874B1 (en) 1995-06-02 2001-11-30 모리시타 요이찌 Multi television set
GB9517988D0 (en) * 1995-09-04 1995-11-08 Ibm Interactive visualisation aid
US6134537A (en) 1995-09-29 2000-10-17 Ai Ware, Inc. Visualization and self organization of multidimensional data through equalized orthogonal mapping
US5734796A (en) 1995-09-29 1998-03-31 Ai Ware, Inc. Self-organization of pattern data with dimension reduction through learning of non-linear variance-constrained mapping
US5787422A (en) 1996-01-11 1998-07-28 Xerox Corporation Method and apparatus for information access employing overlapping clusters
US5822741A (en) 1996-02-05 1998-10-13 Lockheed Martin Corporation Neural network/conceptual clustering fraud detection architecture
US6278986B1 (en) 1996-06-27 2001-08-21 Yamaha Hatsudoki Kabushiki Kaisha Integrated controlling system
US5790121A (en) 1996-09-06 1998-08-04 Sklar; Peter Clustering user interface
US6025843A (en) 1996-09-06 2000-02-15 Peter Sklar Clustering user interface
JP3825845B2 (en) 1996-09-27 2006-09-27 ヤマハ発動機株式会社 Evolutionary control method
US6032139A (en) 1996-09-27 2000-02-29 Yamaha Hatsudoki Kabushiki Kaisha Electronic controller using genetic evolution techniques suitable for controlling a motor
US5933818A (en) * 1997-06-02 1999-08-03 Electronic Data Systems Corporation Autonomous knowledge discovery system and method
US6003029A (en) * 1997-08-22 1999-12-14 International Business Machines Corporation Automatic subspace clustering of high dimensional data for data mining applications
US6128609A (en) 1997-10-14 2000-10-03 Ralph E. Rose Training a neural network using differential input
US6134541A (en) 1997-10-31 2000-10-17 International Business Machines Corporation Searching multidimensional indexes using associated clustering and dimension reduction information
US6122628A (en) 1997-10-31 2000-09-19 International Business Machines Corporation Multidimensional data clustering and dimension reduction for indexing and searching
US5983224A (en) * 1997-10-31 1999-11-09 Hitachi America, Ltd. Method and apparatus for reducing the computational requirements of K-means data clustering
GB9803466D0 (en) 1998-02-19 1998-04-15 Chemical Computing Group Inc Discrete QSAR:a machine to determine structure activity and relationships for high throughput screening
US6327550B1 (en) 1998-05-26 2001-12-04 Computer Associates Think, Inc. Method and apparatus for system state monitoring using pattern recognition and neural networks
US6269351B1 (en) 1999-03-31 2001-07-31 Dryken Technologies, Inc. Method and system for training an artificial neural network
US6236942B1 (en) 1998-09-15 2001-05-22 Scientific Prediction Incorporated System and method for delineating spatially dependent objects, such as hydrocarbon accumulations from seismic data
US6289354B1 (en) 1998-10-07 2001-09-11 International Business Machines Corporation System and method for similarity searching in high-dimensional data space
US6496832B2 (en) 1998-10-20 2002-12-17 University Of Minnesota Visualization spreadsheet
WO2000029829A1 (en) 1998-11-16 2000-05-25 California Institute Of Technology Simultaneous determination of equilibrium and kinetic properties
WO2000046701A1 (en) 1999-02-08 2000-08-10 Huntsman Ici Chemicals Llc Method for retrieving semantically distant analogies
US6629097B1 (en) 1999-04-28 2003-09-30 Douglas K. Keith Displaying implicit associations among items in loosely-structured data sets
US6216267B1 (en) * 1999-07-26 2001-04-10 Rockwell Collins, Inc. Media capture and compression communication system using holographic optical classification, voice recognition and neural network decision processing
US6496812B1 (en) 2000-05-13 2002-12-17 Object Power, Inc. Method and system for measuring and valuing contributions by group members to the achievement of a group goal
US6950786B1 (en) 2000-10-10 2005-09-27 Schlumberger Technology Corporation Method and apparatus for generating a cross plot in attribute space from a plurality of attribute data sets and generating a class data set from the cross plot
US6697791B2 (en) 2001-05-04 2004-02-24 International Business Machines Corporation System and method for systematic construction of correlation rules for event management
US7777743B2 (en) 2002-04-19 2010-08-17 Computer Associates Think, Inc. Viewing multi-dimensional data through hierarchical visualization
US7444310B2 (en) 2002-04-19 2008-10-28 Computer Associates Think, Inc. Automatic model maintenance through local nets
BR0309322A (en) * 2002-04-19 2007-03-06 Computer Ass Think Inc mixed numeric and / or non-numeric processing data
US7483868B2 (en) 2002-04-19 2009-01-27 Computer Associates Think, Inc. Automatic neural-net model generation and maintenance
EP1570424A2 (en) 2002-10-01 2005-09-07 Target Discovery, Inc. Methods and compositions utilizing evolutionary computation techniques and differential data sets
US7103874B2 (en) 2003-10-23 2006-09-05 Microsoft Corporation Model-based management of computer systems and distributed applications

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
No Search *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7533095B2 (en) 2005-04-19 2009-05-12 International Business Machines Corporation Data mining within a message handling system
CN105490858A (en) * 2015-12-15 2016-04-13 北京理工大学 Dynamic link prediction method of network structure
CN105490858B (en) * 2015-12-15 2018-08-03 北京理工大学 A kind of dynamic link prediction technique of network structure

Also Published As

Publication number Publication date
AU2003221986A1 (en) 2003-11-03
JP2006511850A (en) 2006-04-06
WO2003090122A8 (en) 2005-02-03
EP1504412B1 (en) 2018-04-11
US20040220900A1 (en) 2004-11-04
IL164623A0 (en) 2005-12-18
US7321879B2 (en) 2008-01-22
CA2481432A1 (en) 2003-10-30
BR0309388A (en) 2007-02-21
US20040019574A1 (en) 2004-01-29
EP1520237A2 (en) 2005-04-06
US7533006B2 (en) 2009-05-12
CN1647079A (en) 2005-07-27
IL164628A0 (en) 2005-12-18
WO2003090160A8 (en) 2004-11-18
WO2003090160A2 (en) 2003-10-30
US20040215430A1 (en) 2004-10-28
US7716148B2 (en) 2010-05-11
EP1504412A2 (en) 2005-02-09
CA2480951A1 (en) 2003-10-30
KR20050007306A (en) 2005-01-17
AU2003241302A1 (en) 2003-11-03
JP2005523533A (en) 2005-08-04
BR0309322A (en) 2007-03-06

Similar Documents

Publication Publication Date Title
US7321879B2 (en) Using neural networks for data mining
US20220076165A1 (en) Systems and methods for automating data science machine learning analytical workflows
US11586463B2 (en) Automated process flow learning
Calvanese et al. The MASTRO system for ontology-based data access
Van Der Aalst Process discovery from event data: Relating models and logs through abstractions
US9928526B2 (en) Methods and systems that predict future actions from instrumentation-generated events
Fischer et al. A taxonomy and archetypes of smart services for smart living
Misaghian et al. An approach for requirements prioritization based on tensor decomposition
Becker et al. Scalable decision support for digital preservation
Yamashita et al. Measuring change impact based on usage profiles
Ochoa et al. Context-aware workflow management for smart manufacturing: A literature review of semantic web-based approaches
Ataei et al. NeoMycelia: A software reference architecture for big data systems
Zender et al. Ontology-based meta automl
Albashiri et al. An investigation into the issues of Multi-Agent Data Mining
US11748248B1 (en) Scalable systems and methods for discovering and documenting user expectations
Safdar et al. Using multi-objective search and machine learning to infer rules constraining product configurations
Yasin et al. Big data services requirements analysis
Yenduri et al. A review on soft computing approaches for predicting maintainability of software: State‐of‐the‐art, technical challenges, and future directions
Daosabah et al. Integrating context and intention for optimal semantic web service composition using AI planning
CN113918534A (en) Policy processing system and method
KR20040101532A (en) Using neural networks for data mining
Bodenstaff et al. Representing and validating digital business processes
Wang Intelligent Web Services Architecture Evolution Via An Automated Learning-Based Refactoring Framework
US20230196203A1 (en) Agnostic machine learning training integrations
Derguech Business Capability-centric Management of Services and Process Models.

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ OM PH PL PT RO RU SD SE SG SK SL TJ TM TN TR TT TZ UA UG UZ VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 EP: the EPO has been informed by WIPO that EP was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (PCT application filed before 20040101)
WWE WIPO information: entry into national phase

Ref document number: 200407723

Country of ref document: ZA

WWE WIPO information: entry into national phase

Ref document number: 2003241302

Country of ref document: AU

WWE WIPO information: entry into national phase

Ref document number: 2480951

Country of ref document: CA

Ref document number: 2968/DELNP/2004

Country of ref document: IN

WWE WIPO information: entry into national phase

Ref document number: 2003731033

Country of ref document: EP

WWE WIPO information: entry into national phase

Ref document number: 1020047016751

Country of ref document: KR

WWE WIPO information: entry into national phase

Ref document number: 2003586796

Country of ref document: JP

Ref document number: 2003808838X

Country of ref document: CN

WWP WIPO information: published in national office

Ref document number: 1020047016751

Country of ref document: KR

D17 Declaration under Article 17(2)(a)
WWP WIPO information: published in national office

Ref document number: 2003731033

Country of ref document: EP

ENP Entry into the national phase

Ref document number: PI0309388

Country of ref document: BR