WO2006083596A2 - Responding to situations using knowledge representation and inference - Google Patents

Responding to situations using knowledge representation and inference

Info

Publication number
WO2006083596A2
Authority
WO
WIPO (PCT)
Prior art keywords
nodes
data
response
knowledge
receiving data
Prior art date
Application number
PCT/US2006/002204
Other languages
French (fr)
Other versions
WO2006083596A3 (en)
Inventor
Rakesh Gupta
Pedro Vasco Calais
Original Assignee
Honda Motor Co., Ltd.
Priority date
Filing date
Publication date
Application filed by Honda Motor Co., Ltd.
Priority to JP2007553151A (JP2008529163A)
Publication of WO2006083596A2
Publication of WO2006083596A3

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00 Computing arrangements using knowledge-based models
    • G06N5/04 Inference or reasoning models

Definitions

  • the present invention generally relates to the field of machine learning, and more specifically, to responding to situations based on distributed knowledge capture and inference.
  • Robots represent a major step in applying machine technology toward assisting persons in the home.
  • Potential applications include assisted living, wherein a robot could help bring an elderly person his medicine or glasses or assist the handicapped. Additional applications may encompass a myriad of daily activities, such as performing household chores, attending infants and responding to calls and queries.
  • Through visual and voice recognition techniques, robots may be able to recognize and greet their users by name.
  • robots should be able to learn through human interaction and other methods.
  • Fundamental to these goals is the ability to endow robots in indoor environments with the ability to effectively interact with their users and with other people and the environment.
  • robots must be able to respond "appropriately" to given situations, that is, so as to satisfy the perceived desires of their users.
  • the robot need not find the "right" response, but rather the one that reflects the majority consensus opinion. This is referred to as "common sense."
  • a robot must be instilled with a knowledge base, and with a means of formulating responses to perceived situations.
  • robots should be capable of adding to their knowledge bases online.
  • a robot may observe a baby crying, as shown in Figure (Fig.) 1.
  • An internal knowledge base might indicate that several responses might be appropriate, including feeding, entertaining and calming the baby.
  • a second database or algorithm may include or calculate some indicia of the relative likelihood that a particular response is most appropriate in this situation. The robot will initiate the most likely response according to distributed knowledge. Over time, the robot would modify the likelihood information according to changes in the knowledge base.
  • Conventional solutions to such problems have included rule-based systems, which represent an important part of reasoning in artificial intelligence (AI).
  • Although rule-based systems provide efficient and elegant knowledge representation, they exhibit several weaknesses that reduce their usability in the large-domain, real-time reasoning applications of interest.
  • the handcrafted rules require manual effort by specialists in the domain who are fluent in the pertinent representations.
  • maintaining the consistency of the large set of rules required to deal with a large domain becomes increasingly difficult as the number of rules grows.
  • the rule sets are generally not scalable to the millions of pieces of knowledge required.
  • the systems may break down when rules conflict.
  • the reasoning process is limited to literal matching of the preconditions of the rules.
  • a knowledge database is assembled automatically, based on distributed knowledge capture. Specifically, the knowledge is contributed by many human subjects, in response to templates containing written queries. By conducting this activity over the worldwide web, contributions from a great number of people may be practically collected. As a benefit, the knowledge base embodies the "common sense," that is, the consensus, of the subjects.
  • the MSN provides a compact and efficient semantic representation suitable for extraction of knowledge for inference purposes.
  • the MSN includes relationships between individual pieces of knowledge, as well as notions of real-life situations and responses. It thereby serves as the basis for task and response selection. Additional linguistic techniques, including expansion and contraction, are used to increase the overlap of knowledge within the MSN, making it “denser,” and thereby more useful.
  • Figure 1 illustrates a hypothetical situation perceived by a robot, and candidate responses determined by the robot.
  • Figure 2 shows one embodiment of the method of the invention.
  • Figure 3 illustrates one embodiment for the derivation of a multidimensional semantic network.
  • Figure 4 illustrates exemplary nodes and edges of a multidimensional semantic network.
  • Figure 5 illustrates exemplary dimensions of a multidimensional semantic network.
  • Figure 6 illustrates a portion of an exemplary multidimensional semantic network.
  • Figure 7 illustrates exemplary compound relations within a multidimensional semantic network.
  • Figure 8 illustrates one embodiment of the derivation of a Bayes network.
  • Figure 9 illustrates an exemplary portion of a Bayes network and an exemplary response calculation.
  • Figure 10 illustrates exemplary response to multiple situations.
  • Figure 11 illustrates a system for implementing one embodiment of the present invention.
  • Figure 12 illustrates an example of merging multiple synonyms into a single node.
  • the method of the present invention is shown in Figure 2.
  • "Raw" knowledge is assembled into a knowledge database 210.
  • the knowledge may derive from a variety of sources, including databases and humans.
  • Database 210 may be assembled automatically, and resides within a robot or other mobile machine that serves a user.
  • the knowledge may be organized in a graphical representation for more efficient subsequent processing, as will be discussed.
  • an appropriate subset of the knowledge database is extracted and optimized 220.
  • a number of candidate responses are determined, and the response that is most likely desired by the user is performed by the robot.
  • database 210 and response determination 220 are refined through experience as the robot interacts with and learns from its environment and user.
  • Knowledge database operations 210 are further illustrated in Figure 3.
  • distributed knowledge is captured 306.
  • knowledge may be derived from many sources, potentially comprising databases, articles and contribution from human subjects.
  • knowledge is collected from multiple human subjects responding to queries made available over the worldwide web.
  • the queries have the form of "sentence templates," e.g., natural-language sentences with blanks.
  • a respondent enters words or phrases as he or she deems appropriate to complete the sentences.
  • the knowledge base embodies the frequency of response, and thus consensus, of the respondents.
  • the scope of the knowledge may be constrained to matters regarding, e.g., indoor or other environments, as well as to real objects, according to the expected application.
  • the database may be object-centric. That is, objects are characterized by their properties, and actions taken by the robot are assumed to be grounded in the properties.
  • the knowledge may focus on articles in the home, such as trash cans and coffee makers.
  • the knowledge may also focus on actions commonly performed in connection with various properties of such articles. For example, if the trash can is full, there is associated a need to empty it.
  • the database is more "dense," that is, it contains repetitions of knowledge regarding notions of articles and actions within the environment of interest.
  • the phrase "the cup from the supermarket" may be reduced by eliminating descriptors, yielding the noun reference "the cup." Applying the rule that "A of B" gives "B" as the object reduces "bunch of flowers" to "flowers." Similarly, removal of determiners gives "book" from "the book." Where a noun is preceded by an adjective, adjective removal gives, e.g., "car" from "red car." A description of this can be found in Brill, "A Simple Rule-based Part-of-speech Tagger," Proceedings of ANLP-92, 3rd Conference on Applied Natural Language Processing, Trento, IT, 1992, which is incorporated by reference herein in its entirety.
  • Processing step 312 may also perform action-object pairing to capture relationships among words. For example, for the phrase "put the coffee in the closet," the word "put" may be identified as an action, and associated with object words "coffee" and "closet." This enables candidate actions to be identified for objects.
  • Step 312 may also perform tagging, lemmatizing or chunking. Tagging attributes a word type, e.g., verb, noun, etc., to each word. Lemmatizing converts a word into its "root" form, e.g., converts "is" to "be." Chunking takes a set of tagged words and converts them into a phrase. For example, a word tagged as a noun may be converted into a related phrase that is also tagged as a noun. These operations may also be used to add or improve relations among words.
  • a preliminary multidimensional semantic network is constructed 318.
  • An MSN is a multidimensional graph wherein each dimension represents a type of semantic network.
  • the MSN includes nodes and edges.
  • the nodes represent a variety of types of knowledge, including concepts, objects and properties, and each node comprises a name and knowledge type. For example, a node may have the name "dog" and type "object.”
  • Edges define relationships between pairs of nodes, and each edge comprises a label and semantic type.
  • the label identifies the particular edge instance, while the semantic type indicates the nature of the relationship between the pair of nodes connected.
  • all of the edges have a common semantic type.
  • Two additional types of nodes are situations and actions, as illustrated in Figure 4. A situation is the conjunction of an object and a property, while an action is the conjunction of an atomic action, i.e., an elemental action, and an object. These are thus examples of compound nodes, which advantageously embed notions of causality and responses to situations. Situation and action nodes are respectively generated from cause and response data within the database. Causes generate edges between pairs of situations, while responses generate edges between situations and actions.
  • Also shown in Figure 4 are examples of the edge labels (i.e., edge types) has_obj(ect), has_prop(erty), and has_action. These and other edge labels and explanatory examples are listed in the table of Figure 5.
  • the situation (compound) node "floor_slippery" will be connected to the node "floor" by an edge with label "has_object," indicating that "floor" is a conjunctive object of "floor_slippery."
  • "Floor_slippery" will also be connected to the node "slippery" by an edge with label "has_property," indicating that "slippery" is a conjunctive property of "floor_slippery."
  • the nodes "sweep_floor" and "sweep" would be connected by an edge with label "has_action," indicating that "sweep" is an action associated with "sweep_floor."
  • Other edge labels include “response,” whereby a situation is related to an action.
  • “dry_floor_mat” is an action that might be performed by a robot when the situation "floor_mat_wet” is perceived.
  • the edge label "patient" indicates that one node might be the beneficiary of another node representing an action, e.g., "infant" benefits from "shake_infant."
  • Causality is expressed by the edge label "causes," e.g., "trash_in_trash_can" causes "house_clean."
  • Lemmas are edge labels that may connect nodes that might appear together in a glossary entry, while Synonyms connect nodes with the same or similar meanings. Hypernyms and hyponyms relate nodes that are more or less generic than other nodes. For example, tableware is a hypernym of knife, while poodle is a hyponym of dog.
  • an MSN is a directed labeled graph defined by
  • this graph includes T nodes and L edges, each of which has a label.
  • a description of this can be found in Gupta and Pedro, "Praxi-Net: Knowledge Representation and Inference using the Open Mind Indoor Commonsense Data," Honda Research Institute USA, Inc., 2004, which is incorporated by reference herein in its entirety.
  • step 324 mitigates this shortcoming.
  • Step 324 expands the MSN by adding edges that are defined by consulting auxiliary references, such as thesauri or lexical databases. In one embodiment, this is accomplished automatically using online references.
  • the added edges typically comprise synonyms, hypernyms and hyponyms.
  • the compound nodes "baby_crying” and “infant_crying” are each related to node “crying" by edges with label “has_action.” Therefore, existing nodes “baby” and “infant” can be linked by a synonym edge.
  • When the MSN has been fully expanded, it will contain redundant edges, or edges that will likely not be useful.
  • the MSN is condensed 330.
  • some synonyms may not be bidirectional. That is, there may have been a basis for establishing that a first node is a synonym of a second node, but not vice versa, and thus resulting in a non-bidirectional synonym edge.
  • Such edges are considered to be of lower utility than bidirectional synonyms, and as a consequence they are pruned from the MSN.
  • Unnecessary synonyms may be present, as indicated by "baby" and "infant" in the example of Figure 12. Such synonyms are merged into a single node to eliminate redundancy as shown. However, it is important not to lose the original senses of merged words, and so a hash table is created to preserve merged synonyms. For every pair of synonyms $(n_1, n_2)$, an entry $(n_2 \rightarrow n_1)$ is created and used to redirect the pertinent edges from $n_2$ to $n_1$. The resultant MSN exhibits an optimal balance between utility and complexity.
  • the optimized MSN thus created is available for use in determining a response to a situation 220.
  • a Bayesian network i.e., a Bayes network.
  • a Bayes network is also comprised of nodes and edges.
  • the nodes represent random variables, which may be discrete or continuous.
  • the edges connect pairs of nodes; if there is an arrow from a node X to another node Y, X is said to be a parent of Y.
  • a parent node has a direct influence on a child node, that is, a parent node generates an expectation for the nodes below.
  • Each node $X_i$ has a conditional probability distribution $P(X_i \mid \mathrm{Parents}(X_i))$ that quantifies the effect of the parents on the node below. Accordingly, the probability of a given node can be computed iteratively by working upward through the chain of nodes and edges. A discussion of this can be found in Artificial Intelligence: A Modern Approach, second edition, S. J. Russell and P. Norvig, Pearson Education Inc., 2003.
  • the causes and responses within the MSN pertinent to the situation of interest serve as the basis for extracting the Bayes network.
  • the Bayes network thus extracted from the MSN is a simplified causal network with situations and responses as nodes. Furthermore, causes and response edges generate compound nodes including situations and actions.
  • Figure 9 shows an exemplary Bayes network synthesized from an MSN in response to the situation node 910 "floor_mat_wet," or state S0.
  • This node is the parent of situation node 920 "floor_mat_dirty" (S1) and action node 930 "dry_floor_mat" (R1).
  • a set of conditional node probabilities is computed within the Bayes network 816. These are shown at 940 in the example of Figure 9. For example, when prompted with the situation "floor_mat_wet," for every five persons queried during knowledge collection, three, or 0.6 of all respondents, responded that this situation causes "floor_mat_dirty," while two persons, or 0.4 of all respondents, responded that this situation demands the response "dry_floor_mat." These node probabilities may be expressed mathematically as P (S1
  • the method of the invention is not limited to robotic applications.
  • a similar method may be applied to security applications, wherein one or more responses to a perceived anomalous situation within, for example, an airport or other facility, are calculated based on an appropriate knowledge database.
  • Persons skilled in the art will appreciate that other applications may be accommodated by the method described as well.
  • Computer system 1100 comprises an input module 1110, a memory device 1114, a processor 1116, and an output module 1118.
  • an image processor 1112 can be part of the main processor 1116 or a dedicated device to perceive situations as digital images captured in a preferred image format.
  • memory device 1114 may be a standalone memory device, (e.g., a random access memory chip, flash memory, or the like), or an on-chip memory with the processor 1116 (e.g., cache memory).
  • computer system 1100 can be a standalone system, such as, a server, a personal computer, or the like.
  • computer system 1100 can be part of a larger system such as, for example, a robot having a vision system, a security system (e.g., airport security system), or the like.
  • computer system 1100 comprises an input module 1110 to receive the digital images I.
  • the digital images, I may be received directly from an imaging device 1101, for example, a digital camera 1101a (e.g., robotic eyes), a video system 1101b (e.g., closed circuit television), image scanner, or the like.
  • the input module 1110 may be a network interface to receive digital images from another network system, for example, an image database, another vision system, Internet servers, or the like.
  • the network interface may be a wired interface, such as, a USB, RS-232 serial port, Ethernet card, or the like, or may be a wireless interface module, such as, a wireless device configured to communicate using a wireless protocol, e.g., Bluetooth, WiFi, IEEE 802.11, or the like.
  • An optional image processor 1112 may be part of the processor 1116 or a dedicated component of the system 1100.
  • the image processor 1112 could be used to pre-process the digital images, I, received through the input module 1110 to convert them to the preferred format on which the processor 1116 operates. For example, if the digital images, I, received through the input module 1110 come from a digital camera 1101a in a JPEG format and the processor is configured to operate on raster image data, image processor 1112 can be used to convert from JPEG to raster image data.
  • The digital images, I, once in the preferred image format if an image processor 1112 is used, are stored in the memory device 1114 to be processed by processor 1116.
  • Processor 1116 applies a set of instructions that when executed perform one or more of the methods according to the present invention, e.g., synthesizing an MSN, a Bayes network, calculating response probabilities and the like, as described herein. While executing the set of instructions, processor 1116 accesses memory device 1114 to perform the operations according to methods of the present invention on the image data stored therein.
  • Processor 1116 interprets a situation from the input images, I, and outputs indications of calculated responses through the output module 1118 to an external device 1125 (e.g., a database 1125a, a network element or server 1125b, a display device 1125c, or the like).
  • output module 1118 can be wired or wireless.
  • Output module 1118 may be a storage drive interface, (e.g., hard-drive or optical drive driver), a network interface device (e.g., an Ethernet interface card, wireless network card, or the like), or a display driver (e.g., a graphics card, or the like), or any other such device for outputting the response or responses determined.
  • Advantages of the present invention include scalability to accommodate on the order of millions of pieces of information.
  • the method of the invention is independent of the language and actual words that comprise the knowledge base. Where multiple responses are possible for a given situation, the consensus response is determined based on the relative frequencies of response of the subjects who contribute the knowledge.
  • the restriction to indoor domains makes the knowledge base sufficiently dense to be statistically useful for inferencing. Furthermore, inferencing over multiple situations is possible. Knowledge regarding synonymous concepts is combined to make the knowledge base more dense for improved inferencing.
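
The node, edge and synonym-merging notions excerpted above can be illustrated with a minimal Python sketch. It is not the implementation disclosed in the specification: the class name `MSN`, its method names, the "synonym" edge label and the "toddler" node are hypothetical choices made for illustration, and edges are stored simply as (source, label, target) triples. The sketch builds compound situation nodes, prunes one-way synonym edges, and merges "infant" into "baby" while recording the redirection in a hash table.

```python
class MSN:
    """Toy multidimensional semantic network (illustrative only)."""

    def __init__(self):
        self.node_types = {}   # node name -> knowledge type
        self.edges = set()     # (source, label, target) triples
        self.merged = {}       # hash table: merged synonym -> surviving node

    def add_node(self, name, node_type):
        self.node_types[name] = node_type

    def add_edge(self, source, label, target):
        self.edges.add((source, label, target))

    def add_situation(self, obj, prop):
        """Create a compound node: the conjunction of an object and a property."""
        name = f"{obj}_{prop}"
        self.add_node(obj, "object")
        self.add_node(prop, "property")
        self.add_node(name, "situation")
        self.add_edge(name, "has_object", obj)
        self.add_edge(name, "has_property", prop)
        return name

    def prune_one_way_synonyms(self):
        """Drop synonym edges that have no matching reverse edge."""
        self.edges = {
            (s, label, t)
            for (s, label, t) in self.edges
            if label != "synonym" or (t, "synonym", s) in self.edges
        }

    def merge_synonym(self, keep, drop):
        """Merge node `drop` into `keep` and redirect its edges."""
        self.merged[drop] = keep            # preserve the original word
        redirected = set()
        for s, label, t in self.edges:
            s2 = keep if s == drop else s
            t2 = keep if t == drop else t
            if s2 != t2:                    # discard self-loops left by the merge
                redirected.add((s2, label, t2))
        self.edges = redirected
        self.node_types.pop(drop, None)

    def resolve(self, name):
        """Map a possibly merged word to the node that now represents it."""
        return self.merged.get(name, name)


msn = MSN()
msn.add_situation("floor", "slippery")
msn.add_situation("baby", "crying")
msn.add_situation("infant", "crying")
msn.add_edge("baby", "synonym", "infant")
msn.add_edge("infant", "synonym", "baby")
msn.add_edge("toddler", "synonym", "baby")   # one-way: no reverse edge exists
msn.prune_one_way_synonyms()
msn.merge_synonym(keep="baby", drop="infant")
print(msn.resolve("infant"))
print(sorted(msn.edges))
```

After the merge, the edge that previously pointed from "infant_crying" to "infant" points to "baby", and the lookup table still records that "infant" was one of the original words.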

Abstract

A system, apparatus and application for providing robots with the ability to intelligently respond to perceived situations are described. A knowledge database is assembled automatically, based on distributed knowledge capture. The knowledge base embodies the 'common sense,' that is, the consensus, of the subjects who contribute the knowledge. Systems are provided to automatically preprocess, or 'clean' the information to make it more useful. The knowledge thus refined is utilized to construct a multidimensional semantic network, or MSN. The MSN provides a compact and efficient semantic representation suitable for extraction of knowledge for inference purposes and serves as the basis for task and response selection. When the robot perceives a situation that warrants a response, an appropriate subset of the MSN is extracted into a Bayes network. The resultant network is refined, and used to derive a set of response probabilities, which the robot uses to formulate a response.

Description

RESPONDING TO SITUATIONS USING KNOWLEDGE REPRESENTATION AND INFERENCE
INVENTORS: RAKESH GUPTA, VASCO CALAIS PEDRO
FIELD OF THE INVENTION
[0001] The present invention generally relates to the field of machine learning, and more specifically, to responding to situations based on distributed knowledge capture and inference.
BACKGROUND OF THE INVENTION
[0002] Humanoid robots represent a major step in applying machine technology toward assisting persons in the home. Potential applications include assisted living, wherein a robot could help bring an elderly person his medicine or glasses or assist the handicapped. Additional applications may encompass a myriad of daily activities, such as performing household chores, attending infants and responding to calls and queries. Through visual and voice recognition techniques, robots may be able to recognize and greet their users by name. In addition, robots should be able to learn through human interaction and other methods.
[0003] Fundamental to these goals is the ability to endow robots in indoor environments with the ability to effectively interact with their users and with other people and the environment. In particular, robots must be able to respond "appropriately" to given situations, that is, so as to satisfy the perceived desires of their users. Importantly, the robot need not find the "right" response, but rather the one that reflects the majority consensus opinion. This is referred to as "common sense." Thus, a robot must be instilled with a knowledge base, and with a means of formulating responses to perceived situations. Furthermore, robots should be capable of adding to their knowledge bases online.
[0004] For example, a robot may observe a baby crying, as shown in Figure (Fig.) 1. An internal knowledge base might indicate that several responses might be appropriate, including feeding, entertaining and calming the baby. A second database or algorithm may include or calculate some indicia of the relative likelihood that a particular response is most appropriate in this situation. The robot will initiate the most likely response according to distributed knowledge. Over time, the robot would modify the likelihood information according to changes in the knowledge base.
[0005] Conventional solutions to such problems have included rule-based systems, which represent an important part of reasoning in artificial intelligence (AI). Although rule-based systems provide efficient and elegant knowledge representation, they exhibit several weaknesses that reduce their usability in the large-domain, real-time reasoning applications of interest. First, the handcrafted rules require manual effort by specialists in the domain who are fluent in the pertinent representations. Second, maintaining the consistency of the large set of rules required to deal with a large domain becomes increasingly difficult as the number of rules grows. As a consequence, the rule sets are generally not scalable to the millions of pieces of knowledge required. Third, the systems may break down when rules conflict. Finally, when retrieving the knowledge from the knowledge base, the reasoning process is limited to literal matching of the preconditions of the rules.
[0006] Other conventional approaches have involved a variety of mechanisms for storing knowledge and for formulating responses to situations. For example, MindNet (Dolan, Richardson, & Vanderwende 1998) receives knowledge from a dictionary, but can comprehend only a limited number of relations (e.g. used_for). Cyc (Lenat & Guha 1990) relies upon manual formation of rules. Cyc includes more than a million rules entered by over 50 people over the last 15 years; it initially utilized a human-like reasoning system but has evolved to specialize in defense applications. The information embodied in the MIT Media Lab Common Sense reasoning project (Liu & Singh 2004) is too broad, and the knowledge is not dense enough for the deep inferences required. Similarly, other common sense knowledge bases have attempted to capture very broad but overly sparse human common sense knowledge (Liu, Lieberman, & Selker 2003; Eagle, Singh, & Pentland 2003; Mueller 1998; Guha et al. 1990).
[0007] Attempts to mitigate these shortcomings have involved alternative techniques including knowledge capture, linguistic tools and Bayesian reasoning. For example, common sense knowledge may be gathered from non-specialist "netizens," using distributed techniques, as with the Open Mind Initiative (Stork 1999; 2000). While offering advantages over other methods, distributed knowledge capture results in "messy" knowledge, e.g., having redundancy, missing relationships, mis-spelling and error. Thus, processing is required to refine such knowledge into a form useful for providing robots with knowledge.
[0008] Accordingly, there is a need for an improved method for providing robots with the ability to satisfy perceived desires or requests of their users. The method should be reliable and flexible, guided by notions of common sense and instilled with the ability to learn through interaction with humans and the environment.
SUMMARY OF THE INVENTION
[0009] The present invention meets these needs with a method, apparatus and application for providing robots with the ability to intelligently respond to perceived situations. According to one aspect of the invention, a knowledge database is assembled automatically, based on distributed knowledge capture. Specifically, the knowledge is contributed by many human subjects, in response to templates containing written queries. By conducting this activity over the worldwide web, contributions from a great number of people may be practically collected. As a benefit, the knowledge base embodies the "common sense," that is, the consensus, of the subjects.
[0010] As can be appreciated, the "raw" knowledge thus gathered is initially "noisy." That is, the knowledge contains, e.g., redundancy, error and mis-spelling. Means such as linguistic tools are provided to automatically preprocess, or "clean" the information to make it more accurate and useful. The knowledge thus refined is next utilized to construct a multidimensional semantic network, or MSN. The MSN provides a compact and efficient semantic representation suitable for extraction of knowledge for inference purposes. The MSN includes relationships between individual pieces of knowledge, as well as notions of real-life situations and responses. It thereby serves as the basis for task and response selection. Additional linguistic techniques, including expansion and contraction, are used to increase the overlap of knowledge within the MSN, making it "denser," and thereby more useful.
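As a rough illustration of the cleaning step of paragraph [0010], the following Python sketch applies three of the noun-phrase reductions mentioned in the specification's examples: the "A of B" rule, removal of trailing descriptors, and removal of determiners. It is a simplification under stated assumptions, not the disclosed method: the function name `reduce_noun_phrase` and both word lists are hypothetical, and adjective removal is omitted because it would need a part-of-speech tagger.

```python
import re

# Hypothetical word lists chosen for illustration; a real system would rely on
# a part-of-speech tagger rather than fixed lists.
DETERMINERS = {"a", "an", "the", "this", "that", "these", "those"}
DESCRIPTOR_PREPOSITIONS = {"from", "in", "on", "at", "with"}


def reduce_noun_phrase(phrase):
    """Reduce a contributed phrase to a shorter noun reference."""
    words = re.findall(r"[a-z']+", phrase.lower())

    # Rule "A of B" gives "B": keep only what follows the first "of".
    if "of" in words:
        words = words[words.index("of") + 1:]

    # Drop a trailing descriptor such as "from the supermarket".
    for i, word in enumerate(words):
        if i > 0 and word in DESCRIPTOR_PREPOSITIONS:
            words = words[:i]
            break

    # Remove determiners such as "the".
    words = [w for w in words if w not in DETERMINERS]
    return " ".join(words)


for raw in ["the cup from the supermarket", "bunch of flowers", "the book"]:
    print(raw, "->", reduce_noun_phrase(raw))
```

Mapping differently worded contributions onto the same short reference is one way the overlap, or density, of the knowledge can be increased.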
[0011] When the robot perceives a situation that warrants a response, an appropriate subset of the MSN is extracted into a Bayes network. The latter is refined, and used to derive a set of response probabilities, which the robot uses to formulate a response.
[0012] In a practical implementation, the system thus described can be scaled to accommodate millions of pieces of knowledge, and to find an appropriate response to a given situation. The system is independent of word usage, and weights the common sense responses by frequency of occurrence to handle conflicts.
[0013] The features and advantages described in the specification are not all inclusive and, in particular, many additional features and advantages will be apparent to one of ordinary skill in the art in view of the drawings, specification, and claims. Moreover, it should be noted that the language used in the specification has been principally selected for readability and instructional purposes, and may not have been selected to delineate or circumscribe the inventive subject matter.
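The frequency weighting mentioned in paragraph [0012] can be sketched in a few lines of Python. The sketch uses the floor-mat example from the specification, in which three of five respondents say that "floor_mat_wet" causes "floor_mat_dirty" and two say that it calls for "dry_floor_mat". It covers only the conversion of answer counts into conditional probabilities for a single parent node, not Bayes-network inference as a whole. The function name `conditional_probabilities` and the (edge label, target) tuple layout are assumptions made for illustration.

```python
from collections import Counter


def conditional_probabilities(answers):
    """Turn respondents' answers for one situation into relative frequencies.

    `answers` is a list of (edge_label, target_node) pairs, one per respondent.
    """
    counts = Counter(answers)
    total = sum(counts.values())
    return {key: n / total for key, n in counts.items()}


# Five answers collected for the situation "floor_mat_wet".
answers = (
    [("causes", "floor_mat_dirty")] * 3
    + [("response", "dry_floor_mat")] * 2
)
probs = conditional_probabilities(answers)

# Among the candidate responses, pick the most frequently suggested one.
responses = [key for key in probs if key[0] == "response"]
best_response = max(responses, key=probs.get)

for (label, node), p in probs.items():
    print(f"P({node} | floor_mat_wet) via '{label}' = {p:.1f}")
print("chosen response:", best_response[1])
```

Because the probabilities are relative frequencies, conflicting contributions do not break the system; they only shift weight between candidate nodes.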
BRIEF DESCRIPTION OF THE DRAWINGS
[0014] The invention has other advantages and features which will be more readily apparent from the following detailed description of the invention and the appended claims, when taken in conjunction with the accompanying drawings, in which:
[0015] Figure 1 illustrates a hypothetical situation perceived by a robot, and candidate responses determined by the robot.
[0016] Figure 2 shows one embodiment of the method of the invention.
[0017] Figure 3 illustrates one embodiment for the derivation of a multidimensional semantic network.
[0018] Figure 4 illustrates exemplary nodes and edges of a multidimensional semantic network.
[0019] Figure 5 illustrates exemplary dimensions of a multidimensional semantic network.
[0020] Figure 6 illustrates a portion of an exemplary multidimensional semantic network.
[0021] Figure 7 illustrates exemplary compound relations within a multidimensional semantic network.
[0022] Figure 8 illustrates one embodiment of the derivation of a Bayes network.
[0023] Figure 9 illustrates an exemplary portion of a Bayes network and an exemplary response calculation.
[0024] Figure 10 illustrates exemplary response to multiple situations.
[0025] Figure 11 illustrates a system for implementing one embodiment of the present invention.
[0026] Figure 12 illustrates an example of merging multiple synonyms into a single node.
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
[0027] The Figures and the following description relate to preferred embodiments of the present invention by way of illustration only. It should be noted that from the following discussion, alternative embodiments of the structures and methods disclosed herein will be readily recognized as viable alternatives that may be employed without departing from the principles of the claimed invention.
[0028] The method of the present invention according to one embodiment is shown in Figure 2. "Raw" knowledge is assembled into a knowledge database 210. The knowledge may derive from a variety of sources, including databases and humans. Database 210 may be assembled automatically, and resides within a robot or other mobile machine that serves a user. The knowledge may be organized in a graphical representation for more efficient subsequent processing, as will be discussed. When a situation is perceived by the robot, an appropriate subset of the knowledge database is extracted and optimized 220. Then, a number of candidate responses are determined, and the response that is most likely desired by the user is performed by the robot. According to one embodiment, database 210 and response determination 220 are refined through experience as the robot interacts with and learns from its environment and user. [0029] Knowledge database operations 210 are further illustrated in Figure 3. First, distributed knowledge is captured 306. Such knowledge may be derived from many sources, potentially comprising databases, articles and contribution from human subjects. According to one embodiment, knowledge is collected from multiple human subjects responding to queries made available over the worldwide web. The queries have the form of "sentence templates," e.g., natural-language sentences with blanks. A respondent enters words or phrases as he or she deems appropriate to complete the sentences. As responses from more subjects are collected, the knowledge base embodies the frequency of response, and thus consensus, of the respondents. This ultimately enables the robot to respond to situations based on a notion of "common sense." For example, if in real life more people respond to a crying baby by feeding the baby versus entertaining the baby, the robot would first feed a crying baby, and, if unsuccessful, would resort to entertaining the crying baby.
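By way of illustration only, the frequency-based notion of consensus described above can be sketched as follows. The function name `rank_responses` and the sample answers are hypothetical and are not taken from the knowledge database; the sketch merely assumes that each completed sentence template yields one candidate response.

```python
from collections import Counter

def rank_responses(answers):
    """Return (response, relative frequency) pairs, most frequent first."""
    counts = Counter(answers)
    total = sum(counts.values())
    # most_common() orders by descending count
    return [(resp, n / total) for resp, n in counts.most_common()]

# Hypothetical completions of a template about a crying baby
answers = ["feed baby", "feed baby", "entertain baby", "feed baby", "entertain baby"]
ranking = rank_responses(answers)
```

Under these assumed answers, the robot would try "feed baby" first and fall back to "entertain baby" if that fails.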
[0030] To enhance efficiency, the scope of the knowledge may be constrained to matters regarding, e.g., indoor or other environments, as well as to real objects, according to the expected application. Furthermore, the database may be object-centric. That is, objects are characterized by their properties, and actions taken by the robot are assumed to be grounded in the properties. For example, for domestic applications, the knowledge may focus on articles in the home, such as trash cans and coffee makers. In addition, the knowledge may also focus on actions commonly performed in connection with various properties of such articles. For example, if the trash can is full, there is an associated need to empty it. As a benefit, the database is more "dense," that is, it contains repetitions of knowledge regarding notions of articles and actions within the environment of interest. A discussion of this can be found in Gupta and Kochenderfer, "Common Sense Data Acquisition for Indoor Mobile Robots," Nineteenth National Conference on Artificial Intelligence (AAAI-04), 2004, which is incorporated by reference herein in its entirety.
[0031] The raw data thus captured is "noisy." In other words, the data requires refinement before it may be practically useful. For example, the data may contain spelling errors, which, if uncorrected, would create non-existent concepts and reduce the amount of valid data. Also, redundant words may be present. Accordingly, processing 312 may be applied to "clean" the data using such tools as spell checking and non-essential word removal. For example, the phrase "the cup from the supermarket" may be reduced by eliminating descriptors, yielding the noun reference "the cup." Applying the rule that "A of B" gives "B" as the object reduces "bunch of flowers" to "flowers." Similarly, removal of determiners gives "book" from "the book." Where a noun is preceded by an adjective, adjective removal gives, e.g., "car" from "red car." A description of this can be found in Brill, "A Simple Rule-based Part-of-speech Tagger," Proceedings of ANLP-92, 3rd Conference on Applied Natural Language Processing, Trento, IT, 1992, which is incorporated by reference herein in its entirety.
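The cleaning rules described above can be sketched, under stated assumptions, as a few list operations. The word lists `DETERMINERS`, `ADJECTIVES` and `PREPOSITIONS` are hypothetical stand-ins; a practical system would presumably obtain this information from a part-of-speech tagger rather than from fixed lists.

```python
# Hypothetical word lists standing in for a part-of-speech tagger
DETERMINERS = {"a", "an", "the"}
ADJECTIVES = {"red", "big", "old"}
PREPOSITIONS = ("from", "in", "at")

def clean_phrase(phrase):
    """Reduce a noun phrase to its head noun using the rules in the text."""
    words = phrase.lower().split()
    # Rule "A of B" -> "B": keep only what follows the first "of"
    if "of" in words:
        words = words[words.index("of") + 1:]
    # Drop trailing descriptors such as "from the supermarket"
    for prep in PREPOSITIONS:
        if prep in words:
            words = words[:words.index(prep)]
    # Remove determiners and adjectives
    words = [w for w in words if w not in DETERMINERS and w not in ADJECTIVES]
    return " ".join(words)
```

For instance, `clean_phrase("bunch of flowers")` yields "flowers" and `clean_phrase("red car")` yields "car". This sketch applies determiner removal in the same pass, so "the cup from the supermarket" is reduced to "cup".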
[0032] In addition, some sources of data may be untrustworthy, and mistakes may be present. If the database contains sufficient information, mistakes and untrustworthy information may be mitigated or removed by taking the consensus of cumulative evidence.
[0033] Processing step 312 may also perform action-object pairing to capture relationships among words. For example, for the phrase "put the coffee in the closet," the word "put" may be identified as an action, and associated with object words "coffee" and "closet." This enables candidate actions to be identified for objects.
[0034] Step 312 may also perform tagging, lemmatizing or chunking. Tagging attributes a word type, e.g., verb, noun, etc., to each word. Lemmatizing converts a word into its "root" form, e.g., converts "is" to "be." Chunking combines a set of tagged words into a tagged phrase. For example, a word tagged as a noun may be converted into a related phrase that is also tagged as a noun. These operations may also be used to add or improve relations among words.
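As a minimal sketch of tagging, lemmatizing and action-object pairing, the following assumes a toy lexicon in place of a trained tagger. `LEXICON`, `tag_and_lemmatize` and `pair_action_objects` are hypothetical names introduced for illustration only.

```python
# Hypothetical toy lexicon: word -> (part-of-speech tag, lemma)
LEXICON = {
    "put": ("VERB", "put"),
    "the": ("DET", "the"),
    "coffee": ("NOUN", "coffee"),
    "in": ("PREP", "in"),
    "closet": ("NOUN", "closet"),
    "is": ("VERB", "be"),
}

def tag_and_lemmatize(phrase):
    """Return (lemma, tag) for every word of the phrase found in the lexicon."""
    return [(LEXICON[w][1], LEXICON[w][0])
            for w in phrase.lower().split() if w in LEXICON]

def pair_action_objects(phrase):
    """Pair the first verb of the phrase with each noun in it."""
    tagged = tag_and_lemmatize(phrase)
    verbs = [lemma for lemma, tag in tagged if tag == "VERB"]
    nouns = [lemma for lemma, tag in tagged if tag == "NOUN"]
    if not verbs:
        return []
    return [(verbs[0], noun) for noun in nouns]
```

Applied to "put the coffee in the closet," this sketch associates the action "put" with the objects "coffee" and "closet."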
[0035] Returning again to Figure 3, a preliminary multidimensional semantic network (MSN) is constructed 318. An MSN is a multidimensional graph wherein each dimension represents a type of semantic network. The MSN includes nodes and edges. The nodes represent a variety of types of knowledge, including concepts, objects and properties, and each node comprises a name and knowledge type. For example, a node may have the name "dog" and type "object."
[0036] Edges define relationships between pairs of nodes, and each edge comprises a label and semantic type. The label identifies the particular edge instance, while the semantic type indicates the nature of the relationship between the pair of nodes connected. Within each dimension of the MSN, all of the edges have a common semantic type.
[0037] Two additional types of nodes are situations and actions, as illustrated in Figure 4. A situation is the conjunction of an object and a property, while an action is the conjunction of an atomic action, i.e., an elemental action, and an object. These are thus examples of compound nodes, which advantageously embed notions of causality and responses to situations. Situation and action nodes are respectively generated from cause and response data within the database. Causes generate edges between pairs of situations, while responses generate edges between situations and actions.
[0038] Also shown in Figure 4 are examples of the edge labels (i.e., edge types) has_obj(ect), has_prop(erty), and has_action. These and other edge labels and explanatory examples are listed in the table of Figure 5. For example, the situation (compound) node "floor_slippery" will be connected to the node "floor" by an edge with label "has_object," indicating that "floor" is a conjunctive object of "floor_slippery." "Floor_slippery" will also be connected to the node "slippery" by an edge with label "has_property," indicating that "slippery" is a conjunctive property of "floor_slippery." Similarly, the nodes "sweep_floor" and "sweep" would be connected by an edge with label "has_action," indicating that "sweep" is an action associated with "sweep_floor."
[0039] Other edge labels include "response," whereby a situation is related to an action. For example, "dry_floor_mat" is an action that might be performed by a robot when the situation "floor_mat_wet" is perceived. The edge label "patient" indicates that one node might be the beneficiary of another node representing an action, e.g., "infant" benefits from "shake_infant." Causality is expressed by the edge label "causes," e.g., "trash_in_trash_can" causes "house_clean." Lemmas are edge labels that may connect nodes that might appear together in a glossary entry, while synonyms connect nodes with the same or similar meanings. Hypernyms and hyponyms relate nodes that are more or less generic than other nodes. For example, tableware is a hypernym of knife, while poodle is a hyponym of dog.
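The construction of compound nodes and their edges may be sketched as follows. The helper names are hypothetical, and the "has_object" edge emitted for an action node is an assumption inferred from the statement that an action is the conjunction of an atomic action and an object.

```python
def situation_edges(obj, prop):
    """Build a situation node and the edges to its object and property."""
    name = f"{obj}_{prop}"
    return name, [(name, "has_object", obj), (name, "has_property", prop)]

def action_edges(atomic_action, obj):
    """Build an action node and the edges to its atomic action and object."""
    name = f"{atomic_action}_{obj}"
    # The has_object edge is assumed by analogy with situation nodes
    return name, [(name, "has_action", atomic_action), (name, "has_object", obj)]

def response_edge(situation, action):
    """Relate a situation to an action that may be performed in response."""
    return (situation, "response", action)

situation, s_edges = situation_edges("floor", "slippery")
action, a_edges = action_edges("sweep", "floor")
```

Each edge is represented here as a (source, label, target) triple; this representation is a design choice of the sketch rather than something the description prescribes.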
[0040] Mathematically, an MSN is a directed labeled graph defined by
$G = \{N, T, E, L, \alpha(x), \beta(y)\}$ where
• $N = \{n_1, n_2, n_3, \ldots, n_x\}$ defines the node set,
• $T = \{t_1, t_2, t_3, \ldots, t_x\}$ defines the node type set,
• $E = \{e_1, e_2, e_3, \ldots, e_y\}$ defines the edge set,
• $L = \{l_1, l_2, l_3, \ldots, l_y\}$ defines the edge label set,
• $\alpha(x)$ defines the type of a node such that $\forall x,\ \alpha(x) = t_x,\ t_x \in T$, and
• $\beta(y)$ defines the label of the edge where $\forall y,\ \beta(y) = l_y,\ l_y \in L$.
By way of illustration, this graph includes T nodes and L edges, each of which has a label. A description of this can be found in Gupta and Pedro, "Praxi-Net: Knowledge Representation and Inference using the Open Mind Indoor Commonsense Data," Honda Research Institute USA, Inc., 2004, which is incorporated by reference herein in its entirety.
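One possible in-memory representation of such a directed labeled graph is sketched below. The class name `MSN` and its methods are hypothetical; the dictionary `node_type` plays the role of the node typing function and the third field of each edge tuple plays the role of the edge labeling function.

```python
from dataclasses import dataclass, field

@dataclass
class MSN:
    node_type: dict = field(default_factory=dict)  # node name -> node type
    edges: list = field(default_factory=list)      # (source, target, label)

    def add_node(self, name, ntype):
        self.node_type[name] = ntype

    def add_edge(self, src, dst, label):
        # Edges may only connect nodes that are already in the node set
        if src not in self.node_type or dst not in self.node_type:
            raise KeyError("both endpoints must be existing nodes")
        self.edges.append((src, dst, label))

    def dimension(self, label):
        """Return the edges of one dimension, i.e., all edges sharing a label."""
        return [(s, d) for s, d, l in self.edges if l == label]

msn = MSN()
msn.add_node("floor_slippery", "situation")
msn.add_node("floor", "object")
msn.add_node("slippery", "property")
msn.add_edge("floor_slippery", "floor", "has_object")
msn.add_edge("floor_slippery", "slippery", "has_property")
```

Selecting a single dimension then amounts to filtering by label, e.g., `msn.dimension("has_object")`.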
[0041] The preliminary MSN thus obtained is relatively sparse, e.g., it lacks a sufficient number of semantic relations between nodes to be practically useful. This is in part a consequence of the fact that two people will use the same nomenclature for a particular well-known object less than twenty percent of the time. For example, one person might ask "Where can I find the goblet?" but another might substitute "chalice" for "goblet." Returning to Figure 3, step 324 mitigates this shortcoming. Step 324 expands the MSN by adding edges that are defined by consulting auxiliary references, such as thesauri or lexical databases. In one embodiment, this is accomplished automatically using online references. The added edges typically comprise synonyms, hypernyms and hyponyms. Advantageously, "vocabulary robustness" is thereby fostered, since the stored knowledge will be representative of a larger composite of the population. In addition, additional inferences can be made through this process. A description of this can be found in Gupta and Pedro, "Praxi-Net: Knowledge Representation and Inference using the Open Mind Indoor Commonsense Data," which is cited above.
[0042] Additional MSN expansion may be accomplished by analyzing compound relations. For example, if two compound nodes have common intermediate nodes, then they may be related by adding edges. For example, as illustrated in Figure 7, the compound nodes "baby_crying" and "infant_crying" are each related to node "crying" by edges with label "has_action." Therefore, existing nodes "baby" and "infant" can be linked by a synonym edge.
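The compound-relation analysis may be sketched as follows, assuming each compound node is described by its object and by the intermediate node it shares with other compounds. The function `infer_synonyms` is hypothetical, and the sketch deliberately omits any further evidence a practical system would likely require before linking two nodes, since unrelated objects can share an intermediate node.

```python
from itertools import combinations

def infer_synonyms(compounds):
    """compounds maps a compound node name to (object, shared intermediate node).

    Objects whose compound nodes share an intermediate node are linked by
    synonym edges in both directions.
    """
    by_shared = {}
    for _name, (obj, shared) in compounds.items():
        by_shared.setdefault(shared, set()).add(obj)
    edges = set()
    for objs in by_shared.values():
        for a, b in combinations(sorted(objs), 2):
            edges.add((a, "synonym", b))
            edges.add((b, "synonym", a))
    return edges

compounds = {
    "baby_crying": ("baby", "crying"),
    "infant_crying": ("infant", "crying"),
}
new_edges = infer_synonyms(compounds)
```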
[0043] When the MSN has been fully expanded, it will contain redundant edges, or edges that will likely not be useful. Returning to Figure 3, to maximize efficiency, the MSN is condensed 330. For example, some synonyms may not be bidirectional. That is, there may have been a basis for establishing that a first node is a synonym of a second node, but not vice versa, and thus resulting in a non-bidirectional synonym edge. Such edges are considered to be of lower utility than bidirectional synonyms, and as a consequence they are pruned from the MSN.
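Pruning of non-bidirectional synonym edges can be sketched as a single pass over the edge list. The function name and the (source, label, target) edge representation are assumptions made for illustration, and the sample edges are hypothetical.

```python
def prune_one_way_synonyms(edges):
    """Drop synonym edges that have no matching edge in the reverse direction."""
    edge_set = set(edges)
    kept = []
    for src, label, dst in edges:
        if label == "synonym" and (dst, label, src) not in edge_set:
            continue  # one-way synonym: considered of lower utility
        kept.append((src, label, dst))
    return kept

edges = [
    ("baby", "synonym", "infant"),
    ("infant", "synonym", "baby"),
    ("goblet", "synonym", "cup"),        # hypothetical one-way synonym
    ("floor_wet", "has_object", "floor"),
]
pruned = prune_one_way_synonyms(edges)
```

Edges with other labels pass through unchanged.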
[0044] Unnecessary synonyms may be present, as indicated by "baby" and "infant" in the example of Figure 12. Such synonyms are merged into a single node to eliminate redundancy as shown. However, it is important not to lose the original senses of merged words, and so a hash table is created to preserve merged synonyms. For every pair of synonyms (n1, n2), an entry {n2 -> n1} is created and used to redirect the pertinent edges from n2 to n1. The resultant MSN exhibits an optimal balance between utility and complexity.
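The synonym merge with its redirect table may be sketched as follows. A Python dictionary stands in for the hash table; the function name `merge_synonyms` and the choice to drop self-loops created by merging are assumptions of this sketch.

```python
def merge_synonyms(edges, pairs):
    """Merge each n2 into n1 for the given (n1, n2) synonym pairs.

    Returns the redirected edge set and the redirect table, which preserves
    the original word of every merged node.
    """
    redirect = {n2: n1 for n1, n2 in pairs}

    def resolve(node):
        # Follow redirects; assumes the table contains no cycles
        while node in redirect:
            node = redirect[node]
        return node

    merged = set()
    for src, label, dst in edges:
        s, d = resolve(src), resolve(dst)
        if s != d:  # skip self-loops produced by the merge
            merged.add((s, label, d))
    return merged, redirect

edges = [
    ("infant", "synonym", "baby"),
    ("baby", "synonym", "infant"),
    ("infant_crying", "has_object", "infant"),
    ("baby_crying", "has_object", "baby"),
]
merged, redirect = merge_synonyms(edges, [("baby", "infant")])
```

After the merge, edges that pointed at "infant" point at "baby," while `redirect` still records that "infant" was an original word.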
[0045] Returning to Figure 2, the optimized MSN thus created is available for use in determining a response to a situation 220. As shown in more detail in Figure 8, once a situation is perceived, the relevant subset of the MSN is extracted 806 into a Bayesian network, i.e., a Bayes network. A Bayes network is also comprised of nodes and edges. The nodes represent random variables, which may be discrete or continuous. The edges connect pairs of nodes; if there is an arrow from a node X to another node Y, X is said to be a parent of Y. Generally, a parent node has a direct influence on a child node, that is, a parent node generates an expectation for the nodes below. Each node $X_i$ has a conditional probability distribution $P(X_i \mid \mathrm{Parents}(X_i))$ that quantifies the effect of the parents on the node below. Accordingly, the probability of a given node can be computed iteratively by working upward through the chain of nodes and edges. A discussion of this can be found in Artificial Intelligence: A Modern Approach, second edition, S. J. Russell and P. Norvig, Pearson Education Inc., 2003.
[0046] According to one embodiment, the causes and responses within the MSN pertinent to the situation of interest serve as the basis for extracting the Bayes network. The Bayes network thus extracted from the MSN is a simplified causal network with situations and responses as nodes. Furthermore, causes and response edges generate compound nodes including situations and actions.
[0047] For example, Figure 9 shows an exemplary Bayes network synthesized from an MSN in response to the situation node 910 "floor_mat_wet," or state S0. This node is the parent of situation node 920 "floor_mat_dirty" (S1) and action node 930 "dry_floor_mat" (R1).
[0048] Referring back to Figure 8, a set of conditional node probabilities is computed within the Bayes network 816. These are shown at 940 in the example of Figure 9. For example, when prompted with the situation "floor_mat_wet," for every five persons queried during knowledge collection, three, or 0.6 of all respondents, responded that this situation causes "floor_mat_dirty," while two persons, or 0.4 of all respondents, responded that this situation demands the response "dry_floor_mat." These node probabilities may be expressed mathematically as P(S1|S0) = 0.6 and P(R1|S0) = 0.4, respectively.
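The derivation of conditional node probabilities from response frequencies can be sketched as a count over (parent, child) observations. The function `build_cpt` and the observation format are hypothetical; the five observations reproduce the three-to-two split given in the example above.

```python
from collections import Counter, defaultdict

def build_cpt(observations):
    """Map (parent, child) observations to {parent: {child: P(child | parent)}}."""
    counts = defaultdict(Counter)
    for parent, child in observations:
        counts[parent][child] += 1
    return {
        parent: {child: n / sum(c.values()) for child, n in c.items()}
        for parent, c in counts.items()
    }

# Five respondents prompted with the situation S0 ("floor_mat_wet")
observations = [
    ("S0", "S1"), ("S0", "S1"), ("S0", "S1"),  # causes "floor_mat_dirty"
    ("S0", "R1"), ("S0", "R1"),                # response "dry_floor_mat"
]
cpt = build_cpt(observations)
```

The probabilities of the children of each parent sum to one by construction.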
[0049] The remaining response probabilities in Figure 9 are similarly computed. This is indicated at step 818 of Figure 8, and is shown in Figure 9 at 970. As a result, of the three possible responses to the given situation, namely R1, R2 and R3, the consensus response (that is, the one with the highest probability) would be R2, with probability 0.48. Thus, according to the example of Figure 9, a robot would choose R2 (clean_floor_mat) as the most appropriate response to the given situation "floor_mat_wet."
[0050] The method of the invention is not limited to the simple example given, but is capable of accommodating more complex situations. For example, inferencing multiple responses to multiple situations may be carried out concurrently as shown in Figure 10. For each of the perceived situations 1010, the method described above would be applied to determine an appropriate response 1020 by robot 130.
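The response calculation may be sketched by multiplying conditional probabilities along each path from the perceived situation to a response node. The values for S0 are those stated in the text. The placement of R2 and R3 as children of S1, and the values 0.8 and 0.2, are assumptions chosen only so that R2 comes out at the stated 0.48; they are not taken from Figure 9. The sketch also assumes the network is acyclic.

```python
def response_probabilities(cpt, start, responses):
    """Propagate probability from the perceived situation to response nodes."""
    result = {}
    stack = [(start, 1.0)]
    while stack:
        node, p = stack.pop()
        for child, p_child in cpt.get(node, {}).items():
            reach = p * p_child
            if child in responses:
                result[child] = result.get(child, 0.0) + reach
            else:
                stack.append((child, reach))  # intermediate situation node
    return result

cpt = {
    "S0": {"S1": 0.6, "R1": 0.4},  # stated in the text
    "S1": {"R2": 0.8, "R3": 0.2},  # assumed for illustration
}
probs = response_probabilities(cpt, "S0", {"R1", "R2", "R3"})
best = max(probs, key=probs.get)
```

Handling several perceived situations would amount to calling `response_probabilities` once per situation and selecting the best response for each.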
[0051] The method of the invention is not limited to robotic applications. For example, a similar method may be applied to security applications, wherein one or more responses to a perceived anomalous situation within, for example, an airport or other facility, are calculated based on an appropriate knowledge database. Persons skilled in the art will appreciate that other applications may be accommodated by the method described as well.
[0052] Now referring to Figure 11, a system according to one embodiment of the present invention is shown. Computer system 1100 comprises an input module 1110, a memory device 1114, a processor 1116, and an output module 1118. In an alternative embodiment, an image processor 1112 can be part of the main processor 1116 or a dedicated device to perceive situations as digital images captured in a preferred image format. Similarly, memory device 1114 may be a standalone memory device (e.g., a random access memory chip, flash memory, or the like), or an on-chip memory with the processor 1116 (e.g., cache memory). Likewise, computer system 1100 can be a standalone system, such as a server, a personal computer, or the like. Alternatively, computer system 1100 can be part of a larger system such as, for example, a robot having a vision system, a security system (e.g., airport security system), or the like.
[0053] According to this embodiment, computer system 1100 comprises an input module 1110 to receive the digital images I. The digital images, I, may be received directly from an imaging device 1101, for example, a digital camera 1101a (e.g., robotic eyes), a video system 1101b (e.g., closed circuit television), image scanner, or the like. Alternatively, the input module 1110 may be a network interface to receive digital images from another network system, for example, an image database, another vision system, Internet servers, or the like. The network interface may be a wired interface, such as a USB, RS-232 serial port, Ethernet card, or the like, or may be a wireless interface module, such as a wireless device configured to communicate using a wireless protocol, e.g., Bluetooth, WiFi, IEEE 802.11, or the like.
[0054] An optional image processor 1112 may be part of the processor 1116 or a dedicated component of the system 1100. The image processor 1112 could be used to pre-process the digital images I received through the input module 1110 to convert the digital images, I, to the preferred format on which the processor 1116 operates. For example, if the digital images, I, received through the input module 1110 come from a digital camera 1101a in a JPEG format and the processor is configured to operate on raster image data, image processor 1112 can be used to convert from JPEG to raster image data.
[0055] The digital images, I, once in the preferred image format if an image processor 1112 is used, are stored in the memory device 1114 to be processed by processor 1116. Processor 1116 applies a set of instructions that when executed perform one or more of the methods according to the present invention, e.g., synthesizing an MSN, a Bayes network, calculating response probabilities and the like, as described herein. While executing the set of instructions, processor 1116 accesses memory device 1114 to perform the operations according to methods of the present invention on the image data stored therein.
[0056] Processor 1116 interprets a situation from the input images, I, and outputs indications of calculated responses through the output module 1118 to an external device 1125 (e.g., a database 1125a, a network element or server 1125b, a display device 1125c, or the like). Like the input module 1110, output module 1118 can be wired or wireless. Output module 1118 may be a storage drive interface, (e.g., hard-drive or optical drive driver), a network interface device (e.g., an Ethernet interface card, wireless network card, or the like), or a display driver (e.g., a graphics card, or the like), or any other such device for outputting the response or responses determined.
[0057] Advantages of the present invention include scalability to accommodate on the order of millions of pieces of information. The method of the invention is independent of the language and actual words that comprise the knowledge base. Where multiple responses are possible for a given situation, the consensus response is determined based on the relative frequencies of response of the subjects who contribute the knowledge. The restriction to indoor domains makes the knowledge base sufficiently dense to be statistically useful for inferencing. Furthermore, inferencing over multiple situations is possible. Knowledge regarding synonymous concepts is combined to make the knowledge base more dense for improved inferencing.
[0058] Upon reading this disclosure, those of skill in the art will appreciate still additional alternative structural and functional designs for a system and a process for responding to situations through knowledge representation and inference through the disclosed principles of the present invention. Thus, while particular embodiments and applications of the present invention have been illustrated and described, it is to be understood that the invention is not limited to the precise construction and components disclosed herein and that various modifications, changes and variations which will be apparent to those skilled in the art may be made in the arrangement, operation and details of the method and apparatus of the present invention disclosed herein without departing from the spirit and scope of the invention as defined in the appended claims.

Claims

CLAIMS WHAT IS CLAIMED IS:
1. A method for analyzing a knowledge database, comprising the steps of: receiving data collected by a distributed capture technique; and deriving a Bayes network from said data.
2. The method of claim 1, wherein the step of receiving data further comprises receiving said data from multiple users or from multiple locations.
3. The method of claim 2, wherein said distributed capture technique comprises the step of prompting a plurality of users to supply one or more words in response to a query.
4. The method of claim 1, further comprising the step of associating causality and response relations to one or more situations.
5. The method of claim 1, further comprising the step of processing said data, including at least one of: checking a spelling of a word; removing a template by eliminating a non-essential word; tagging a part of speech to a word; lemmatizing a word into its root form; chunking two or more tagged words into a tagged phrase; or removing extraneous data.
6. The method of claim 1, further comprising the step of synthesizing a multidimensional semantic net (MSN) from said data.
7. The method of claim 6, wherein said MSN comprises: two or more nodes comprising at least one of: object, property, atom action, action, or situation; and one or more edges that connect pairs of said nodes.
8. The method of claim 7, further comprising the steps of: determining semantic relationships between said nodes; and adding edges based on said semantic relationships.
9. The method of claim 8, further comprising the step of pruning at least one of: non-bidirectional synonyms, hypernym pairs or hyponym pairs.
10. The method of claim 1, wherein said Bayes network comprises a plurality of nodes, each node including a relative probability calculated from said data.
11. The method of claim 10, wherein each said relative probability is calculated from a frequency of said data within the knowledge database.
12. The method of claim 11, wherein said nodes further comprise situation nodes and response nodes.
13. A system for analyzing a knowledge database, comprising: means for receiving data collected by a distributed capture technique; and means for deriving a Bayes network from said data.
14. The system of claim 13, wherein said means for receiving data further comprises receiving data from multiple users or from multiple locations.
15. The system of claim 14, wherein said distributed capture technique comprises means for prompting a plurality of users to supply one or more words in response to a query.
16. The system of claim 15, wherein said means for receiving data further comprises means for associating causality and response relations to one or more situations.
17. The system of claim 13, wherein said means for receiving data further comprises means for processing said data, including at least one of: means for checking a spelling of a word; means for removing a template by eliminating a non-essential word; means for tagging a part of speech to a word; means for lemmatizing a word into its root form; means for chunking to combine two or more tagged words into a tagged phrase; or means for removing extraneous data.
18. The system of claim 13, wherein said means for receiving data further comprises means for synthesizing a multidimensional semantic net (MSN) from said data.
19. The system of claim 18, wherein said MSN comprises: two or more nodes comprising at least one of: object, property, atom action, action, or situation; and one or more edges that connect pairs of said nodes.
20. The system of claim 19, wherein said means for receiving data further comprises: means for determining semantic relationships between nodes; and means for adding edges based on said semantic relationships.
21. The system of claim 20, wherein said means for receiving data further comprises means for pruning at least one of: non-bidirectional synonyms, hypernym pairs or hyponym pairs.
22. The system of claim 13, wherein said means for deriving a Bayes network comprises means for deriving a plurality of nodes, each node including a relative probability calculated from said data.
23. The system of claim 22, wherein each said relative probability is calculated from a frequency of said data within the knowledge database.
24. The system of claim 23, wherein said nodes further comprise situation nodes and response nodes.
25. A computer program product, comprising a computer-readable medium having computer program instructions and data embodied thereon for implementing a method for analyzing a knowledge database, the method comprising the steps of: receiving data collected by a distributed capture technique; and deriving a Bayes network from said data.
26. The computer program product of claim 25, wherein the step of receiving data further comprises receiving said data from multiple users or from multiple locations.
27. The computer program product of claim 26, wherein said distributed capture technique comprises the step of prompting a plurality of users to supply one or more words in response to a query.
28. The computer program product of claim 25, wherein said receiving data further comprises the steps of associating causality and response relations to one or more situations.
29. The computer program product of claim 25, wherein said receiving data further comprises the step of processing said data, including at least one of the steps of: checking a spelling of a word; removing a template by eliminating a non-essential word; tagging a part of speech to a word; lemmatizing a word into its root form; chunking two or more tagged words into a tagged phrase; or removing extraneous data.
30. The computer program product of claim 25, wherein the step of receiving data further comprises synthesizing a multidimensional semantic net (MSN) from said data.
31. The computer program product of claim 30, wherein said MSN comprises: two or more nodes comprising at least one of object, property, atom action, action, or situation; and one or more edges that connect pairs of said nodes.
32. The computer program product of claim 31, wherein said receiving data further comprises the steps of: determining semantic relationships between said nodes; and adding edges based on said semantic relationships.
33. The computer program product of claim 32, wherein said receiving data further comprises the step of pruning at least one of: non-bidirectional synonyms, hypernym pairs or hyponym pairs.
34. The computer program product of claim 25, wherein said Bayes network comprises a plurality of nodes, each node including a relative probability calculated from said data.
35. The computer program product of claim 34, wherein each said relative probability is calculated from a frequency of said data within the knowledge database.
36. The computer program product of claim 35, wherein said nodes further comprise situation nodes and response nodes.
PCT/US2006/002204 2005-01-28 2006-01-19 Responding to situations using knowledge representation and inference WO2006083596A2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2007553151A JP2008529163A (en) 2005-01-28 2006-01-19 Responding to situations using knowledge representation and reasoning

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US11/046,343 2005-01-28
US11/046,343 US7725418B2 (en) 2005-01-28 2005-01-28 Responding to situations using multidimensional semantic net and Bayes inference

Publications (2)

Publication Number Publication Date
WO2006083596A2 true WO2006083596A2 (en) 2006-08-10
WO2006083596A3 WO2006083596A3 (en) 2008-04-10

Family

ID=36777748

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2006/002204 WO2006083596A2 (en) 2005-01-28 2006-01-19 Responding to situations using knowledge representation and inference

Country Status (3)

Country Link
US (1) US7725418B2 (en)
JP (1) JP2008529163A (en)
WO (1) WO2006083596A2 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2009140348A (en) * 2007-12-07 2009-06-25 Toyota Motor Corp Behavior determination device, behavior learning device, and robot device

Families Citing this family (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7013308B1 (en) 2000-11-28 2006-03-14 Semscript Ltd. Knowledge storage and retrieval system and method
US8666928B2 (en) 2005-08-01 2014-03-04 Evi Technologies Limited Knowledge repository
US7593911B1 (en) * 2005-10-12 2009-09-22 At&T Corp. System and method for applying rule sets and rule interactions
US7966093B2 (en) * 2007-04-17 2011-06-21 Yefim Zhuk Adaptive mobile robot system with knowledge-driven architecture
US8838659B2 (en) * 2007-10-04 2014-09-16 Amazon Technologies, Inc. Enhanced knowledge repository
US10867133B2 (en) * 2008-05-01 2020-12-15 Primal Fusion Inc. System and method for using a knowledge representation to provide information based on environmental inputs
KR101021836B1 (en) * 2008-10-09 2011-03-17 한국전자통신연구원 System for cooperation of multiple mobile robot using dynamic behavior binding and method thereof
US9805089B2 (en) * 2009-02-10 2017-10-31 Amazon Technologies, Inc. Local business and product search system and method
US20120221589A1 (en) * 2009-08-25 2012-08-30 Yuval Shahar Method and system for selecting, retrieving, visualizing and exploring time-oriented data in multiple subject records
US9110882B2 (en) 2010-05-14 2015-08-18 Amazon Technologies, Inc. Extracting structured knowledge from unstructured text
KR101573993B1 (en) 2010-09-17 2015-12-02 노키아 코포레이션 Method and apparatus for segmenting context information
KR101945185B1 (en) * 2012-01-12 2019-02-07 삼성전자주식회사 robot and method to recognize and handle exceptional situations
TWI474315B (en) * 2012-05-25 2015-02-21 Univ Nat Taiwan Normal Infant cries analysis method and system
US20140059011A1 (en) * 2012-08-27 2014-02-27 International Business Machines Corporation Automated data curation for lists
US9984332B2 (en) * 2013-11-05 2018-05-29 Npc Robotics Corporation Bayesian-centric autonomous robotic learning
CN106951963B (en) * 2017-03-29 2020-05-22 苏州大学 Knowledge refining method and device
CN108038250A (en) * 2017-12-29 2018-05-15 重庆柚瓣家科技有限公司 The interaction systems and method of suitable aging content are realized by robot
DE102020211648A1 (en) 2020-09-17 2022-03-17 Robert Bosch Gesellschaft mit beschränkter Haftung Apparatus and method for controlling a robotic device
US20230281565A1 (en) * 2022-03-04 2023-09-07 HireTeamMate Incorporated System and method for generating lower-dimension graph representations in talent acquisition platforms

Family Cites Families (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4638445A (en) 1984-06-08 1987-01-20 Mattaboni Paul J Autonomous mobile robot
US4884217A (en) 1987-09-30 1989-11-28 E. I. Du Pont De Nemours And Company Expert system with three classes of rules
US5717598A (en) 1990-02-14 1998-02-10 Hitachi, Ltd. Automatic manufacturability evaluation method and system
US5555403A (en) 1991-11-27 1996-09-10 Business Objects, S.A. Relational database access system using semantically dynamic objects
US5392382A (en) 1992-12-01 1995-02-21 Schoppers; Marcel J. Automated plan synthesizer and plan execution method
US5835684A (en) 1994-11-09 1998-11-10 Amada Company, Ltd. Method for planning/controlling robot motion
DE4440859C2 (en) 1994-11-15 1998-08-06 Alexander Kaske Method and device for controlling an autonomously exploring robot
JPH09245015A (en) 1996-03-13 1997-09-19 Toshiba Corp Hierarchical planning method
WO1997046929A2 (en) 1996-06-04 1997-12-11 Werbos Paul J 3-brain architecture for an intelligent decision and control system
JP3952544B2 (en) 1996-09-17 2007-08-01 株式会社東芝 Distributed system
US6135396A (en) 1997-02-07 2000-10-24 Ge-Harris Railway Electronics, Llc System and method for automatic train operation
US6353814B1 (en) 1997-10-08 2002-03-05 Michigan State University Developmental learning machine and method
JP2001157979A (en) 1999-11-30 2001-06-12 Sony Corp Robot device, and control method thereof
US6687685B1 (en) 2000-04-07 2004-02-03 Dr. Red Duke, Inc. Automated medical decision making utilizing bayesian network knowledge domain modeling
US6604094B1 (en) 2000-05-25 2003-08-05 Symbionautics Corporation Simulating human intelligence in computers using natural language dialog
US6766316B2 (en) 2001-01-18 2004-07-20 Science Applications International Corporation Method and system of ranking and clustering for document indexing and retrieval
EP1262844A1 (en) 2001-06-01 2002-12-04 Sony International (Europe) GmbH Method for controlling a man-machine-interface unit
US7379993B2 (en) 2001-09-13 2008-05-27 Sri International Prioritizing Bayes network alerts
US20050278362A1 (en) 2003-08-12 2005-12-15 Maren Alianna J Knowledge discovery system
US7917460B2 (en) * 2004-06-30 2011-03-29 Northrop Grumman Corporation Systems and methods for generating a decision network from text

Non-Patent Citations (1)

Title
Gupta, R. and Kochenderfer, M.J.: 'Common Sense Data Acquisition for Indoor Mobile Robots', American Association for Artificial Intelligence, July 2004 *

Cited By (1)

Publication number Priority date Publication date Assignee Title
JP2009140348A (en) * 2007-12-07 2009-06-25 Toyota Motor Corp Behavior determination device, behavior learning device, and robot device

Also Published As

Publication number Publication date
JP2008529163A (en) 2008-07-31
WO2006083596A3 (en) 2008-04-10
US7725418B2 (en) 2010-05-25
US20060184491A1 (en) 2006-08-17

Similar Documents

Publication Publication Date Title
US7725418B2 (en) Responding to situations using multidimensional semantic net and Bayes inference
US7370022B2 (en) Building plans for household tasks from distributed knowledge
CN110114764B (en) Providing dietary assistance in conversation
US7603330B2 (en) Meta learning for question classification
US8019713B2 (en) Commonsense reasoning about task instructions
US8712780B2 (en) Systems and methods for picture based communication
CN113704428B (en) Intelligent inquiry method, intelligent inquiry device, electronic equipment and storage medium
Satapathy et al. Sentiment analysis in the bio-medical domain
Grassi et al. Knowledge triggering, extraction and storage via human–robot verbal interaction
Kejriwal et al. Transfer-based taxonomy induction over concept labels
Takagi et al. Just keep tweeting, dear: web-mining methods for helping a social robot understand user needs
Alishahi et al. A computational model of learning semantic roles from child-directed language
Dominey Emergence of grammatical constructions: evidence from simulation and grounded agent experiments
Assem et al. QASAR: self-supervised learning framework for extractive question answering
Yang et al. An Ontology-Supported Case-Based Reasoning Technique for FAQ Proxy Service.
CN113806620A (en) Content recommendation method, device, system and storage medium
Ritt-Benmimoun et al. Three idioms, three dialects, one history: Egyptian, Nigerian and Tunisian Arabic
Russell Real-time topic and sentiment analysis in human-robot conversation
Chang et al. Interactive Healthcare Robot using Attention-based Question-Answer Retrieval and Medical Entity Extraction Models
Dennler et al. The RoSiD Tool: Empowering Users to Design Multimodal Signals for Human-Robot Collaboration
Funakoshi Non-axiomatic term logic: A computational theory of cognitive symbolic reasoning
Kop How to summarize an OWL domain ontology
Belabbes et al. An ontology-based approach for building and querying ICH video datasets
Gupta et al. Knowledge representation and Bayesian inference for response to situations
Nguyen et al. SimplerVoice: A Key Message & Visual Description Generator System for Illiteracy

Legal Events

Date Code Title Description
121 EP: The EPO has been informed by WIPO that EP was designated in this application
WWE WIPO information: entry into national phase (Ref document number: 2007553151; Country of ref document: JP)
122 EP: PCT application non-entry in European phase (Ref document number: 06719163; Country of ref document: EP; Kind code of ref document: A2)