CN101185074A - User interface for facts query engine with snippets from information sources that include query terms and answer terms

Info

Publication number
CN101185074A
Authority
CN
China
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CNA2006800187948A
Other languages
English (en)
Other versions
CN101185074B (zh)
Inventor
Andrew William Hogue
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Google LLC
Original Assignee
Google LLC
Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/951 Indexing; Web crawling techniques
    • G06F16/953 Querying, e.g. by the use of web search engines
    • G06F16/9538 Presentation of query results
    • Y GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10 TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10S TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00 Data processing: database and file management or data structures
    • Y10S707/99931 Database or file accessing
    • Y10S707/99933 Query processing, i.e. searching
    • Y10S707/99934 Query formulation, input preparation, or translation
    • Y10S707/99935 Query augmenting and refining, e.g. inexact access

Abstract

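A method and system are disclosed for providing snippets of the source documents of an answer to a fact query. Snippets of the source documents from which the factual answer to the fact query was extracted may be provided in response to a user request for the source documents. The snippets include terms of the fact query and terms of the answer. The snippets may be displayed together with the uniform resource locators (URLs) of the source documents.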

Description

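User interface for facts query engine with snippets from information sources that include query terms and answer terms
Related Applications
This application is related to the following applications, each of which is incorporated herein by reference:
U.S. Patent Application No. 11/097,688, "Corroborating Facts Extracted from Multiple Sources," filed March 31, 2005;
U.S. Patent Application No. 11/097,676, "Bloom Filters for Query Simulation," filed March 31, 2005;
U.S. Patent Application No. 11/097,690, "Selecting the Best Answer to a Fact Query from Among a Set of Potential Answers," filed March 31, 2005; and
U.S. Patent Application No. 11/024,784, "Supplementing Search Results with Information of Interest," filed December 30, 2004.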
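Technical Field
The disclosed embodiments relate generally to queries for facts, and more particularly to a user interface for a fact query engine and to snippets of sources that contain query terms and answer terms.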
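Background
The World Wide Web (also known as the Web) and the web pages within it are a vast source of factual information. Users may look to web pages for answers to factual questions such as "What is the capital of Poland?" or "What is the birth date of George Washington?" Web search engines, however, may be unhelpful to users in this regard, because they generally do not provide simple, succinct answers to fact queries such as those above. Instead, web search engines provide a list of web pages determined to match the user's query, and the user must sort through the matching pages to find the answer.
Attempts to build search engines that can provide quick answers to factual questions have their own shortcomings. For example, some search engines draw their facts from a single source, such as a particular encyclopedia. This limits the types of questions these engines can answer. For instance, a search engine based on an encyclopedia is unlikely to answer many questions about popular culture, such as questions about movies and songs, and is also unlikely to answer many questions about products, services, retail and wholesale businesses, and so on. If the set of sources used by such a search engine is expanded, however, the expansion may introduce multiple possible answers to a fact query, some of which may be contradictory or ambiguous. Furthermore, as the set of sources expands, information may come from unreliable sources or from sources of unknown reliability.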
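Summary
According to one aspect of the invention, a method for displaying the sources of a fact is disclosed. The method includes receiving a fact query that includes one or more terms; identifying an answer to the fact query including one or more terms; identifying one or more source documents that include one or more terms of the query and one or more terms of the answer; generating a snippet of at least one of the source documents, the snippet including one or more terms of the query and one or more terms of the answer; and generating a response that includes the snippet.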
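Brief Description of the Drawings
FIG. 1 illustrates a network, according to some embodiments of the invention.
FIG. 2 illustrates a data structure for an object and its associated facts in a fact repository, according to some embodiments of the invention.
FIG. 3 illustrates a data structure for a fact index, according to some embodiments of the invention.
FIG. 4 illustrates a data structure for a list of possible answers, according to some embodiments of the invention.
FIGS. 5A-5C are flow diagrams of a process for selecting an answer to a fact query and displaying the answer and the sources of the answer, according to some embodiments of the invention.
FIG. 6 illustrates a presentation of an answer to a fact query, according to some embodiments of the invention.
FIG. 7 illustrates a presentation of a list of sources of an answer to a fact query, according to some embodiments of the invention.
FIG. 8 illustrates a system for selecting an answer to a fact query and displaying the answer and a list of sources of the answer, according to some embodiments of the invention.
Like reference numerals refer to corresponding parts throughout the drawings.
Detailed Description
A query engine may store factual information gathered from many disparate sources and return answers in response to user queries for factual information (or "fact queries"). Gathering information from many sources expands the range of factual information available to the query engine, but it also introduces the possibility of multiple possible answers. The query engine may identify possible answers and select the best of them to present to the user, or it may determine that none of the possible answers is suitable for presentation. The query engine may also provide a list of the sources of the answer, including portions of text from each source. The portion or portions of text are called a snippet and may include terms of the fact query and terms of the answer. Because a snippet shows the answer identified or selected by the query engine in its original context, the list of sources gives the user the basis for the answer and may help the user evaluate its veracity.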
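FIG. 1 illustrates a network 100, according to some embodiments of the invention. The network 100 includes one or more clients 102 and a query engine 106. A client 102 may include a client application (not shown). The network 100 also includes one or more communication networks 104 that couple these components.
The client application provides a user (not shown) of the client 102 with an interface to the query engine 106. Using a client application running on the client 102, the user can submit searches for documents (for example, web searches) and fact queries to the query engine 106 and view the responses from the query engine 106. The client application may include a web browser. Examples of web browsers include FIREFOX, INTERNET EXPLORER, and OPERA.
The query engine 106 provides a platform for storing factual information and responding to fact queries, as well as for handling other types of searches. The query engine 106 can handle searches for documents, such as web searches, and can also handle queries for factual information. The query engine 106 includes a query server 108, which provides a front end to the query engine 106. The query server 108 receives queries from the client 102, directs them to the components of the query engine 106 that handle fact queries and other searches, generates responses, and transmits the responses to the client 102. The query server 108 may be distributed over multiple computers. In other embodiments, the query server may handle more or fewer functions. For example, in other embodiments response generation may be handled elsewhere in the query engine 106.
The query engine 106 includes a first search controller 110, a first cache 112, a document index 114, and a document database 116 for handling document searches. In some embodiments, these components may be deployed over multiple computers in order to provide fast access to a large number of documents. For example, the document database 116 may be deployed over N servers, with a mapping function such as a "modulo N" function determining which documents are stored on each of the N servers. N may be an integer greater than 1, for instance an integer between 2 and 8196. Similarly, the document index 114 may be distributed over multiple servers, as may the first cache 112. The first search controller 110 may also be distributed over multiple computers.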
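The "modulo N" mapping mentioned above can be illustrated with a short sketch. The function name `shard_for` and the use of CRC32 as the hash are assumptions made for illustration only; the text specifies nothing beyond a modulo-N style mapping function.

```python
import zlib


def shard_for(doc_id: str, num_servers: int) -> int:
    """Return the index of the server assumed to hold the given document."""
    if num_servers < 2:
        raise ValueError("the text assumes N is an integer greater than 1")
    # CRC32 is an assumed, stable hash; any deterministic hash would do.
    return zlib.crc32(doc_id.encode("utf-8")) % num_servers
```

Because the hash is deterministic, every component that knows N can compute the same server index for a document without consulting a central directory.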
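The first search controller 110 is coupled to the query server 108. The first search controller 110 is also coupled to the first cache 112, the document index 114, and the document database 116. The first search controller 110 is configured to receive document search queries from the query server 108 and to transmit the queries to the first cache 112, the document index 114, and the document database 116. The first cache 112 is used to increase search efficiency by temporarily storing previously located search results.
The first search controller 110 receives document search results from the first cache 112 and/or the document index 114 and constructs an ordered list of search results. The first search controller 110 then returns the list of located documents to the query server 108 for onward transmission to the client 102. The document search results received by the first search controller 110 from the first cache 112 and/or the document index 114 may be accompanied by snippets of the located documents.
The query engine 106 also includes a second search controller 118, a second cache 120, a fact index 122, and a fact repository 124. In some embodiments, these components may be deployed over multiple computers in order to provide faster access to a large number of facts. For example, the fact repository 124 may be deployed over N servers, with a mapping function such as a "modulo N" function determining which facts are stored on each of the N servers. N may be an integer greater than 1, for instance an integer between 2 and 8196. Similarly, the fact index 122 may be distributed over multiple servers, as may the second cache 120. The second search controller 118 may also be distributed over multiple computers.
The second search controller 118 is coupled to the query server 108. The second search controller 118 is also coupled to the second cache 120, the fact index 122, and the fact repository 124. The second search controller 118 is configured to receive queries for answers to factual questions from the query server 108 and to transmit the queries to the second cache 120 and to the fact repository 124 (via the fact index 122). The second cache 120 is used to increase fact retrieval efficiency by temporarily storing previously located search results.
The second search controller 118 receives, from the second cache 120 and/or the fact repository 124, facts that are possible answers to the fact query. The second search controller 118 selects one of the possible answers as the best answer to present to the user. The answer is passed to the query server 108, where a response including the answer is generated and transmitted to the client 102 for presentation to the user. In response to user selection of an icon displayed on or next to the answer to the fact query, the query server 108 may identify the list of sources associated with the answer and pass the list of sources to the first search controller 110. The first search controller 110 accesses the documents corresponding to the sources, together with snippets for at least a subset of the source documents. In some embodiments, the snippets include terms from the query and terms from the answer.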
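The fact repository 124 stores factual information extracted from a plurality of documents. A document from which a particular fact is extracted is a source document (or "source") of that fact. In other words, a source of a fact includes that fact within its contents. Source documents may include, without limitation, web pages. Within the fact repository 124, the entities, concepts, and the like for which the fact repository 124 may hold factual information are represented by objects. An object may have one or more facts associated with it. Each object is a collection of facts; an object that has no facts associated with it (an empty object) may be treated as a non-existent object within the fact repository 124. Within each object, each fact associated with the object is stored as an attribute-value pair. Each fact also includes a list of source documents that include the fact within their contents and from which the fact was extracted. Further details about objects and facts in the fact repository are described below in relation to FIG. 2.
To look up information in the fact repository 124, the second search controller 118 searches the fact index 122 for the terms in the search query. This yields lists of fact repository locations (corresponding to facts or objects) that match the various terms in the search query. Using the logical structure of the search query (which may be regarded as a Boolean expression or tree), the second search controller 118 then forms logical combinations of these location lists to identify possible facts, if any, that match the search query.
The fact index 122 provides an index to the fact repository 124 and facilitates efficient lookup of information in the fact repository 124. The fact index 122 may index the fact repository 124 based on one or more parameters. For example, the fact index 122 may have an index (which may be called a main index or term index) that maps unique terms to locations in the fact repository 124. Further details about the fact index 122 are described below in relation to FIG. 3.
It should be appreciated that although any component of the query engine 106 may be distributed over multiple computers, for convenience of explanation the components of the query engine 106 are discussed as though they were implemented on a single computer.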
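FIG. 2 illustrates an example data structure for an object in the fact repository 124, according to some embodiments of the invention. As described above, the fact repository includes objects, each of which may include one or more facts. Each object 200 includes a unique identifier, such as an object ID 202. The object 200 includes one or more facts 204. Each fact 204 includes a unique identifier for that fact, such as a fact ID 210. Each fact 204 includes an attribute 212 and a value 214. For example, the facts included in an object representing George Washington may include facts with the attributes "date of birth" and "date of death," whose values are the actual dates of birth and death, respectively. A fact 204 may include a link 216 to another object, which may be an object identifier such as the object ID 202 of another object in the fact repository 124. The link 216 allows objects to have facts whose values are other objects. For example, the object "United States" may have a fact with the attribute "president" whose value is "George W. Bush," where "George W. Bush" is another object in the fact repository 124. In some embodiments, the value field 214 stores the name of the linked object and the link 216 stores the object identifier of the linked object. In some other embodiments, facts 204 do not include a link field 216, because the value 214 of a fact 204 may itself store a link to another object.
Each fact 204 may also include one or more metrics 218. The metrics may provide an indication of the quality of the fact. In some embodiments, the metrics include a confidence level and an importance level. The confidence level indicates the likelihood that the fact is correct. The importance level indicates the relevance of the fact to the object, compared with other facts for the same object. In other words, the importance level measures how important the fact is to an understanding of the entity or concept represented by the object.
Each fact 204 includes a list of sources 220 that include the fact and from which the fact was extracted. Each source may be identified by a uniform resource locator (URL) or web address.
In some embodiments, some facts include an agent field 222 that identifies the module that extracted the fact. For example, the agent may be a specialized module that extracts facts from a specific source, a module that extracts facts from free text in documents throughout the Web, and so on.
In some embodiments, an object 200 may have one or more special facts, such as a name fact 206 and a property fact 208. A name fact 206 is a fact that conveys a name for the entity or concept represented by the object 200. For example, for an object representing the country Spain, there may be a fact conveying that the name of the object is "Spain." A name fact 206, being a special instance of a general fact 204, includes the same parameters as any other fact 204: an attribute, a value, a fact ID, metrics, sources, and so on. The attribute 224 of a name fact 206 indicates that the fact is a name fact, and the value is the actual name. The name may be a string of text. An object 200 may have one or more name facts, because many entities or concepts can have more than one name. For example, an object representing Spain may have name facts conveying the country's common name "Spain" and its official name "Kingdom of Spain." As another example, an object representing the U.S. Patent and Trademark Office may have name facts conveying the agency's acronyms "PTO" and "USPTO" and its official name "United States Patent and Trademark Office."
A property fact 208 is a fact that conveys a statement about the entity or concept represented by the object 200 that may be of interest. For example, for an object representing Spain, a property fact may convey that Spain is a country in Europe. A property fact 208, being a special instance of a general fact 204, also includes the same parameters as other facts 204 (attribute, value, fact ID, metrics, sources, and so on). The attribute field 226 of a property fact 208 indicates that the fact is a property fact, and the value field is a string of text that conveys the statement of interest. For example, for the object representing Spain, the value of a property fact may be the text string "is a country in Europe." An object 200 may have zero or more property facts.
It should be appreciated that the data structure illustrated in FIG. 2 and described above is merely exemplary. The data structure of the fact repository 124 may take other forms. Other fields may be included in facts, and some of the fields described above may be omitted. In addition, each object may have special facts besides name facts and property facts, such as facts conveying a type or category (for example, person, place, movie, actor, and so on) for categorizing the entity or concept represented by the object. In some embodiments, an object's name and/or properties are represented by special records whose format differs from that of the fact records 204 associated with the object's attribute-value pairs.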
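The object and fact records of FIG. 2 could be modeled along the following lines. This is a minimal sketch under stated assumptions: the class names `Fact` and `FactObject`, the field names, and the convention of marking name and property facts with the attribute strings "name" and "property" are all illustrative and are not taken from the specification.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Fact:
    fact_id: int                       # fact ID 210
    attribute: str                     # attribute 212
    value: str                         # value 214
    link: Optional[int] = None         # link 216: object ID of a linked object
    metrics: dict = field(default_factory=dict)   # metrics 218, e.g. confidence
    sources: list = field(default_factory=list)   # sources 220, e.g. URLs
    agent: Optional[str] = None        # agent 222: module that extracted the fact


@dataclass
class FactObject:
    object_id: int                     # object ID 202
    facts: list = field(default_factory=list)

    def names(self):
        """Return the values of all name facts (assumed attribute 'name')."""
        return [f.value for f in self.facts if f.attribute == "name"]


# Example object for Spain with two name facts and one property fact.
spain = FactObject(object_id=1, facts=[
    Fact(1, "name", "Spain", sources=["http://example.com/spain"]),
    Fact(2, "name", "Kingdom of Spain"),
    Fact(3, "property", "is a country in Europe"),
])
```

Treating name and property facts as ordinary `Fact` records mirrors the statement that they are special instances of the general fact 204.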
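FIG. 3 illustrates an example fact index, according to some embodiments of the invention. As described above, the fact index 122 may index the fact repository based on one or more parameters. In some embodiments, the fact index 300 may be such an index. The fact index 300 maps unique terms to facts, or to locations of information in the fact repository 124. As used herein, a term is a word (such as "Spain" or "George") or a number (such as "123" or "-9"). In some embodiments, a term may also consist of two or more words, such as "United States" or "birth date." The fact index 300 includes a plurality of sets 303 of terms and associated term location records and, optionally, an index header 302 with information about the index 300 (such as information about the size of the index, information about the mapping function used to locate the sets, and so on). Each set 303 contains a term 304 and one or more term location records 306 that identify the location of each occurrence of the term in the fact repository 124. Each term location record has an object identifier 308 (identifying the object in which the term occurs), a fact identifier 310 (identifying the fact within the object), a fact field identifier 312 (identifying the field within the fact), and a token identifier 314 (identifying the token within the field). These four fields map a term to a location in the fact repository 124. It should be appreciated, however, that the fact index 300 is merely an example and that other forms of the fact index 300, and other fact indexes, are possible. In some embodiments, when a term location record 306 points to an object as a whole (for example, when the term is the name of the object), the fact identifier 310, the field identifier 312, and the token identifier 314 have predetermined or null values.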
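A term index of the kind shown in FIG. 3 can be sketched as a mapping from each term to a list of four-part location records. The names `TermLocation` and `build_index`, the whitespace tokenization, and the lowercase normalization are assumptions made for this illustration.

```python
from collections import defaultdict
from typing import NamedTuple


class TermLocation(NamedTuple):
    object_id: int   # object identifier 308
    fact_id: int     # fact identifier 310
    field_id: str    # fact field identifier 312
    token_id: int    # token identifier 314


def build_index(facts):
    """Build a term index from (object_id, fact_id, {field_name: text}) tuples."""
    index = defaultdict(list)
    for object_id, fact_id, fields in facts:
        for field_id, text in fields.items():
            for token_id, token in enumerate(text.lower().split()):
                index[token].append(
                    TermLocation(object_id, fact_id, field_id, token_id))
    return index


index = build_index([
    (1, 10, {"attribute": "capital", "value": "Warsaw"}),
    (2, 20, {"attribute": "capital", "value": "Madrid"}),
])
```

Looking up `index["capital"]` yields one location record per fact that uses the term, which is the kind of location list the second search controller 118 is described as combining logically.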
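FIG. 4 illustrates an example list of possible answers to a fact query, according to some embodiments of the invention. The second search controller 118 receives a list of one or more possible answers to a fact query from the second cache 120 or from a search of the fact repository 124, selects the best answer from the list of possible answers, and passes the best answer to the query server 108 for further processing; further details are described below in relation to FIGS. 5A-5C. FIG. 4 illustrates an example list 400 of possible answers. The list 400 includes one or more possible answers 403. Each possible answer 403 has one or more fields. The object ID 404 identifies the object that includes the fact that is the possible answer. The object name 406 identifies the name of the entity or concept represented by the object identified by the object ID 404. The object name 406 may be the value of a name fact included in the object (see the object data structure described above). The fact attribute 408 identifies the attribute of the fact that is the possible answer. The fact value 410 identifies the value of the fact that is the possible answer. The answer field 412 identifies which of the three fields (object name 406, fact attribute 408, or fact value 410) holds the actual answer that is responsive to the fact query (that is, the kind of answer the user is looking for). The QA type 414 identifies the type of question posed by the fact query (that is, the type of question the user is asking and, implicitly, the kind of answer that is responsive to it). The score 416 indicates the score of the possible answer. The score is a metric that attempts to measure the quality of the possible answer as an accurate and responsive answer. The fact query 418 is an internal query, generated by the second search controller 118, that led to the identification of the possible answer. The fact query 418 is generated based on the user query (the query entered by the user at the client 102). Further details about the QA type 414, the score 416, and the fact query 418 are described below in relation to FIGS. 5A-5C. In some embodiments, a possible answer 403 may be represented by more or fewer fields of information. In some embodiments, the list 400 includes a list header 402 that contains information applicable to the entire list 400. For example, the header 402 may include a copy of the user query, a pointer to the top entry of the list 400, or other data structures that facilitate access to the items or records in the list 400.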
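FIGS. 5A-5C illustrate an example process for selecting the best answer to a fact query and presenting that answer, according to some embodiments of the invention. The answer to a fact query is the fact in the fact repository 124 that is identified as the best response to the fact query. Upon receiving a fact query, the query engine 106 processes the query, identifies possible answers, selects the best answer, and generates a response that includes the answer. The query engine 106 may also generate a response that includes a list of the sources of the answer.
The query engine 106 receives a query (502). The query is entered by a user at the client 102 and transmitted by the client 102 to the query engine 106. The query includes one or more terms. The query entered by the user is the user query.
The user query is processed (504). The user query is passed to the first search controller 110 and the second search controller 118. Because the user query includes one or more terms, it may be used as a search query for documents, such as a web search, and passed to the system components that handle such searches, such as the first search controller 110. Document searches such as web searches are well known in the art and need not be described further.
The user query is also passed to the second search controller 118. The user query is preprocessed and analyzed to determine whether it fits any of one or more QA types. Preprocessing may include eliminating "stop words" (for example, the definite and indefinite articles and prepositions in English) and expanding the words and/or phrases in the user query to include their respective synonyms or equivalents. For example, the phrase "birth date" may be expanded to include its synonyms "date birth" (that is, "date of birth" without the stop word "of") and "birthday." Analysis may include parsing the user query and analyzing its text. If the user query is determined to fit any of the QA types, a fact query corresponding to the respective QA type may be generated for the user query. A fact query is a query internal to the query engine 106 and is used to access the second cache 120 and the fact repository 124 (via the fact index 122) to find possible answers. If the user query is determined not to fit any QA type, further processing of the user query by the second search controller 118 may be aborted, because from the standpoint of the second search controller 118 the user query is not a fact query. It should be appreciated that a user query may be determined to fit more than one QA type, so that more than one fact query may be generated for a single user query. Each of these fact queries is used to access the fact repository 124, the second cache 120, and the fact index 122 to find possible answers.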
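The preprocessing step described above (stop-word removal followed by synonym expansion) might look like the sketch below. The stop-word set, the synonym table, and the function name `preprocess` are hypothetical; the text does not say how these lists are stored or applied.

```python
# Assumed, tiny word lists for illustration only.
STOP_WORDS = {"a", "an", "the", "of", "is", "what"}
SYNONYMS = {("birth", "date"): [("date", "birth"), ("birthday",)]}


def preprocess(query):
    """Drop stop words, then return the query plus synonym-expanded variants."""
    tokens = [t for t in query.lower().split() if t not in STOP_WORDS]
    variants = [tuple(tokens)]
    for phrase, alternatives in SYNONYMS.items():
        n = len(phrase)
        for i in range(len(tokens) - n + 1):
            if tuple(tokens[i:i + n]) == phrase:
                for alt in alternatives:
                    # Substitute the synonym for the matched phrase.
                    variants.append(
                        tuple(tokens[:i]) + alt + tuple(tokens[i + n:]))
    return variants
```

For the query "What is the birth date of George Washington", the sketch yields the stripped query and two variants, one using "date birth" and one using "birthday".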
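A user query may fit one or more QA types. A QA type is a mapping of question to answer that indicates which question the user query is asking and what kind of answer is responsive to the fact query. In some embodiments, there are three general QA types: name and attribute to value ("NA-V"); attribute and value, or property, to name ("AV-N"); and name to property, type, or name ("N-PTN"). In some embodiments, there are additional specialized QA types for handling particular types of questions. In some embodiments, these specialized QA types may be special instances of the general QA types.
In the NA-V type, the user (through the terms of the user query) provides an object name and an attribute and wants to know the value of the corresponding attribute for the object with the given name. An example of an NA-V type query is "What is the capital of Poland," where "Poland" is the object name and "capital" is the attribute of "Poland" whose value is sought. The answer to this query may be the value of the fact that is associated with the object named "Poland" and has the attribute "capital." In this case, the value of the fact with the attribute "capital" may be the string "Warsaw." The value may also be the object identifier of the object named "Warsaw," in which case the name "Warsaw" may be substituted for the object identifier and returned as a possible answer.
In the AV-N type, the user provides an attribute and a value (or a property, since a property is simply a particular attribute-value pair, as described above) and wants to know the name of the object that has the given value for the given attribute. In a sense, this is a "reverse lookup." An example of an AV-N type query is "Which country has Warsaw as its capital," in which case "capital" is the attribute and "Warsaw" is the value. A possible answer may be the name of the object with this attribute-value pair, namely "Poland."
In the N-PTN type, the user provides a name and wants to know a property, a type, or an alternate name of the object associated with the given name. An example of an N-PTN type query is "What is the NRA." "NRA" is the name of the object for which the user wants to know a property, a type, or an alternate name. One possible property answer for "NRA" is "a Second Amendment rights advocacy group." A type answer, which conveys a categorization of the entity or concept represented by the object, may be "organization" for "NRA," indicating that the NRA is an organization, as opposed to other types such as person, book, movie, and so on. An alternate name for "NRA" may be "National Rifle Association," which is the official name of the entity represented by the object with the (acronym) name "NRA."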
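In some embodiments, a fact query may include additional constraints. For example, the fact query may specify that a certain term may match only in a particular field and not in other fields. Another constraint may be that any possible answer must match a particular type (such as person, book, and so on). Such constraints may be generated by the second search controller 118 while it analyzes and processes the user query.
After the user query has been processed and one or more fact queries have been generated, the fact queries are used to access the fact repository 124 (via the fact index 122) and the second cache 120 to find possible answers (506). The possible answers are the facts that match one or more of the fact queries. The possible answers are scored (508). The score of a possible answer provides an indication of its quality as an accurate and responsive answer.
In some embodiments, the score of a possible answer is the product of a plurality of factor values. In some embodiments, one or more of the factor values may be normalized values between 0 and 1, inclusive. The factors actually used to determine the score may vary with the QA type of the fact query that the possible answer matches. In some embodiments, because the score, as a product of factors between 0 and 1 inclusive, can stay the same or decrease toward 0 but can never increase, the scoring of any particular possible answer may be abandoned once its score falls below a predetermined threshold. This may indicate that the possible answer is of such poor quality that further scoring would be wasteful.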
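The product-of-factors scoring with early abandonment can be sketched as follows. The function name `score_answer` and the cutoff value 0.05 are assumptions; the text says only that scoring may stop once the running score falls below a predetermined threshold.

```python
def score_answer(factors, cutoff=0.05):
    """Multiply normalized factors; return None if scoring is abandoned early."""
    score = 1.0
    for factor in factors:
        if not 0.0 <= factor <= 1.0:
            raise ValueError("factors are assumed to be normalized to [0, 1]")
        score *= factor
        # The product can only shrink, so a low score can never recover.
        if score < cutoff:
            return None
    return score
```

Returning `None` rather than a small number makes it explicit that the remaining factors were never evaluated.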
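In some embodiments, the factors may be based on the QA type, the metrics of the fact that matched the fact query (such as the confidence metric and the importance metric), the agent that extracted the matching fact, the degree to which a field in the fact matches the fact query, the degree to which particular fields in the fact match the fact query completely, and so on. It should be appreciated that the factors described above are merely exemplary; other factors may be included in addition to those described above, and some of the factors described above may be omitted.
After each possible answer has been scored, the possible answers are gathered into a list of possible answers, such as the possible answer list 400 described above in relation to FIG. 4. In some embodiments, only a predetermined number of top-scoring answers are gathered into the possible answer list 400. For example, the possible answer list may include only the 100 highest-scoring possible answers. In some embodiments, further processing of the possible answer list 400 is handled by the second search controller 118.
Continuing in FIG. 5B, a number of the highest-scoring possible answers are identified from the possible answer list 400 (510). This number may be a predetermined number that specifies how many top-scoring answers are to be processed further. As long as identified top-scoring possible answers remain to be processed (512-No), the next top-scoring answer is processed. The processing involves identifying supporting answers for the respective top-scoring answer (514) and determining a supported score for it based on its own score and the scores of its supporting answers (516). The identification of supporting answers is described in detail below.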
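In some embodiments, the supported score is determined by converting the score of each top-scoring answer and the scores of its supporting answers into odds space values. A score s is converted into an odds space value x:
$$x = \frac{s}{1 - s}$$
The converted values (that is, the odds space conversions of the scores) are summed to produce a value X, and the sum X is converted back into a probability space value, by inverting the conversion above, to obtain the supported score S for the top-scoring answer:
$$S = \frac{X}{1 + X}$$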
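The odds space combination can be worked through in a few lines. This sketch assumes the conversion x = s / (1 - s) and its mathematical inverse S = X / (1 + X) for the return to probability space; the function names are illustrative, and scores are assumed to lie in [0, 1).

```python
def to_odds(s):
    """Convert a probability-like score in [0, 1) to an odds space value."""
    if not 0.0 <= s < 1.0:
        raise ValueError("score must be in [0, 1)")
    return s / (1.0 - s)


def to_probability(x):
    """Inverse of to_odds: convert an odds space value back to [0, 1)."""
    return x / (1.0 + x)


def supported_score(answer_score, supporting_scores):
    """Sum the odds of an answer and its supporting answers, then convert back."""
    total = sum(to_odds(s) for s in [answer_score, *supporting_scores])
    return to_probability(total)
```

For example, an answer scored 0.5 with one supporting answer also scored 0.5 has odds 1 + 1 = 2 and therefore a supported score of 2/3, so agreement between sources raises the score without ever letting it reach 1.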
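After supported scores have been determined for the identified top-scoring answers (512-Yes), the top-scoring answer with the highest supported score (hereinafter the "best-supported answer") is identified (518). For the best-supported answer, the top-scoring answers in the possible answer list that contradict it are identified (520). For these contradicting answers, a contradicting score C, which is the supported score of the contradicting answers, is determined (522). In addition, in some embodiments, the top-scoring answers in the possible answer list that are unrelated to the best-supported answer are identified (524). For these unrelated answers, an unrelated score U, which is the supported score of the unrelated answers, is determined (526). It should be appreciated that the process for determining the contradicting score C and the unrelated score U is similar to the process for determining S: supporting answers are identified, scores are converted into odds space values, the odds space values are summed, and the sum is converted back into a probability space value. The determination of contradicting and unrelated answers is described below.
Whether two possible answers are supporting, contradicting, or unrelated is determined by comparing the fields of the two answers. The fields of interest of each answer, namely the name, attribute, and value, are grouped into inputs and outputs. For example, in some embodiments, for an NA-V type query the inputs are the name and the attribute and the output is the value. For an AV-N type query, the inputs are the attribute and the value and the output is the name. Two possible answers are compared by pairwise comparison of their input fields and of their output fields. The comparison takes into account the type of data in the fields, that is, whether the data in a field is a string of words, a date, a number, and so on. The sources of the answers may also be taken into account.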
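In some embodiments, the result of a pairwise field comparison is one of five classifications:
●Not comparable: the fields have different data types (for example, a string of words versus a date) and therefore cannot be compared;
●Dissimilar: the fields are of the same data type but are not the same at all;
●Somewhat similar: the fields have some similarity, but it is difficult to conclude whether they refer to the same thing;
●Very similar: the fields are nearly identical; and
●Identical: the fields are exactly the same.
The actual determination of whether fields are identical may vary with the data type. For example, for numbers, if the numbers are small integers, they must be exactly equal to be considered identical. If the numbers are very large integers or floating-point numbers, they may be considered identical if they are within a certain percentage of each other.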
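Based on the pairwise field comparisons, the relationship between two answers is classified as follows:
●If the answers come from the same source, the two answers are classified as "complementary." An answer that is complementary to answer A is ignored;
●If the answers have identical or very similar inputs but outputs that are only somewhat similar, the two answers are classified as "may support" (that is, answer A "may support" answer B). An answer A that "may support" answer B is ignored;
●If the two answers have identical or very similar inputs and identical or very similar outputs, the two answers are classified as "supporting," unless the two answers come from the same source. The scores of "supporting" answers contribute to the determination of the supported score;
●If the inputs are identical or very similar but the outputs are dissimilar or not comparable, the two answers are "contradictory"; and
●If the inputs are dissimilar or not comparable, the two answers are "unrelated."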
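The classification rules listed above translate directly into a small decision function. The function name, the string labels, and the "undetermined" fallback for inputs that are only somewhat similar (a case the rules above do not cover) are assumptions made for this sketch.

```python
def classify_relation(input_cmp, output_cmp, same_source):
    """Classify two answers from their pairwise input and output comparisons.

    input_cmp and output_cmp are one of: "not comparable", "dissimilar",
    "somewhat similar", "very similar", "identical".
    """
    close = {"identical", "very similar"}
    if same_source:
        return "complementary"          # ignored when scoring
    if input_cmp in close:
        if output_cmp in close:
            return "supporting"         # contributes to the supported score
        if output_cmp == "somewhat similar":
            return "may support"        # ignored when scoring
        return "contradictory"          # dissimilar or not comparable outputs
    if input_cmp in {"dissimilar", "not comparable"}:
        return "unrelated"
    return "undetermined"               # assumed fallback, not in the text
```

Checking the source first reflects the exception stated in the "supporting" rule: two matching answers from the same source do not corroborate each other.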
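The supported score S of the best-supported answer is compared with a predetermined threshold T (528). The threshold T is the minimum score that the supported score S must exceed if the best-supported answer is to be considered further. If S is less than or equal to T (528-No), the processor performing the processing shown in FIG. 5B (for example, the second search controller 118 or the query engine 106) may generate a response indicating that the query engine 106 cannot provide an answer (534). For example, the second search controller 118 may pass a response indicating that an answer is unavailable to the query server 108, and the query server 108 may generate a response to that effect and transmit it to the client 102 for presentation to the user.
If S is greater than T (528-Yes), a check is made to see whether the supported score S of the best-supported answer exceeds the best supported score C of the contradicting answers by at least a first predetermined margin. In one embodiment, this check is made by comparing S with the contradicting score C multiplied by a constant α (530). The constant α represents the minimum ratio of S to C that must be reached for the best-supported answer to be selected as the best answer to the fact query. In other words, S must be at least α times the contradicting score C. If S is less than αC (530-No), the processor performing the processing shown in FIG. 5B (for example, the second search controller 118 or the query engine 106) may generate a response indicating that the query engine 106 cannot provide an answer (534).
If S is equal to or greater than αC (530-Yes), another check is made to see whether the supported score S of the best-supported answer exceeds the best supported score U of the unrelated answers by at least a second predetermined margin. In one embodiment, this check is made by comparing S with the unrelated score U multiplied by a constant β (532). The constant β represents the minimum ratio of S to U that must be reached before the best-supported answer can be selected as the best answer to the fact query. In other words, S must be at least β times the unrelated score U. If S is less than βU (532-No), the processor performing the processing shown in FIG. 5B (for example, the second search controller 118 or the query engine 106) may generate a response indicating that the query engine 106 cannot provide an answer (534). If S is equal to or greater than βU (532-Yes), the best-supported answer is selected as the answer to the fact query and is processed further, as described in more detail below with reference to FIG. 5C.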
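The three checks at steps 528, 530, and 532 can be summarized in one function. The default values for T, α, and β below are placeholders chosen for illustration; the text describes them only as predetermined constants.

```python
def select_answer(S, C, U, T=0.5, alpha=2.0, beta=2.0):
    """Return True if the best-supported answer passes all three checks."""
    if S <= T:            # step 528: must exceed the minimum threshold T
        return False
    if S < alpha * C:     # step 530: must be at least alpha times C
        return False
    if S < beta * U:      # step 532: must be at least beta times U
        return False
    return True
```

A result of `False` corresponds to step 534, where the engine responds that it cannot provide an answer rather than returning a weakly supported one.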
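Continuing in FIG. 5C, after the best-supported answer has been selected as the best answer to the fact query, the query server 108 generates a response (536). The response includes the best-supported answer. The response may include an identifier and/or a hyperlink (for example, a URL) for a source of the best-supported answer. In some embodiments, the response may also include a link that, when clicked by the user at the client 102, generates a request for a list of the sources of the best-supported answer. In some embodiments, the response may also include the results of a document search, such as a web search, based on the user query. The document search results may be passed to the query server 108 from the components of the query engine 106 that handle such searches, such as the first search controller 110. The response is transmitted to the client 102 for presentation to the user (538). An example response that includes an answer and the results of a document search using the user query is described in detail below with reference to FIG. 6.
Having seen the response presented at the client 102, the user may request a list of the sources of the answer. In some embodiments, the user may make this request by clicking on the link included in the response (as described above), which generates the request for the list of sources.
The query engine 106 receives the request for the list of sources of the answer (540). The sources of the answer are identified (542). In some embodiments, the sources of the answer may be identified by looking up the sources 220 (FIG. 2) of the answer fact in the fact repository 124. A snippet generation request is sent to the first search controller 110, along with the list of sources, the user query, the fact query 418 that matched the answer, and the answer. The snippet generation request is submitted to the first cache 112, the document index 114, and/or the document database 116. In some embodiments, if the list of sources is longer than a predetermined limit, a subset of the list of sources may be selected by the first search controller 110 and submitted to the first cache 112, the document index 114, and/or the document database 116. The first cache 112, the document index 114, and/or the document database 116, or the one or more processors to which the snippet generation request is submitted, generate a snippet for each of the listed sources (544). Each snippet may include one contiguous portion of text or a plurality of non-contiguous portions of text from the respective source. For a particular snippet, if the portions of text selected for inclusion in the snippet are not contiguous within the source, the portions may be separated by ellipses.
Each snippet is generated so that it includes as many terms of the user query and/or the fact query, and as many terms of the answer, as possible. The scatter of the query terms and answer terms within the source (that is, how dispersed the query terms and answer terms are in the source document) may be analyzed to help generate the snippet. The one or more portions of text with the least scatter of query terms and answer terms are selected for inclusion in the snippet.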
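One way to approximate "least scatter" is to find the shortest run of tokens that contains every query or answer term occurring in the source. The sketch below makes that substitution explicitly: it uses a standard sliding-window search, produces only a single contiguous portion, and strips trailing punctuation with an assumed character set. The function names and the `context` parameter are illustrative and are not the method of the specification.

```python
def best_window(tokens, terms):
    """Return (start, end) of the shortest token span covering all present terms."""
    wanted = {t.lower() for t in terms}
    lowered = [t.lower().strip(".,;:!?\"'()") for t in tokens]
    present = wanted & set(lowered)
    if not present:
        return None
    counts, best, left = {}, None, 0
    for right, tok in enumerate(lowered):
        if tok in present:
            counts[tok] = counts.get(tok, 0) + 1
        # Shrink from the left while the window still covers every term.
        while len(counts) == len(present):
            if best is None or right - left < best[1] - best[0]:
                best = (left, right)
            ltok = lowered[left]
            if ltok in counts:
                counts[ltok] -= 1
                if counts[ltok] == 0:
                    del counts[ltok]
            left += 1
    return best


def make_snippet(text, query_terms, answer_terms, context=2):
    """Build a one-portion snippet around the tightest cluster of terms."""
    tokens = text.split()
    span = best_window(tokens, list(query_terms) + list(answer_terms))
    if span is None:
        return ""
    start = max(0, span[0] - context)
    end = min(len(tokens), span[1] + 1 + context)
    prefix = "... " if start > 0 else ""
    suffix = " ..." if end < len(tokens) else ""
    return prefix + " ".join(tokens[start:end]) + suffix
```

A fuller implementation could select several short windows and join them with ellipses, matching the non-contiguous case described in the text.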
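A response that includes the snippets is generated (546). The response includes the list of sources and a snippet for each source, the snippets including terms of the user/fact query and terms of the answer. The response may also include the answer, the user query, and a hyperlink to each source. The response is transmitted to the client 102 for presentation to the user (548).
In some embodiments, when the response is presented to the user, the user/fact query terms and the answer terms in each snippet are highlighted to make them more prominent. As used herein, highlighting a term in a snippet refers to any manner of making the term more prominent when presented to the user, including but not limited to making the term bold, underlining the term, italicizing the term, changing the font color of the term, and/or adding a background color to the area around the term. An example response that includes a list of sources and snippets is described below with reference to FIG. 7.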
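Bold highlighting of query and answer terms could be done as in the following sketch. The choice of HTML `<b>` tags, whole-word case-insensitive matching, and the function name `highlight` are assumptions; the text permits any method that makes the terms more prominent.

```python
import html
import re


def highlight(snippet_text, terms):
    """Wrap each whole-word occurrence of a term in <b> tags (HTML-escaped)."""
    escaped = html.escape(snippet_text)
    # Longest terms first so that longer alternatives win in the regex.
    words = sorted({html.escape(t) for t in terms if t}, key=len, reverse=True)
    if not words:
        return escaped
    pattern = re.compile(
        r"\b(" + "|".join(re.escape(w) for w in words) + r")\b",
        re.IGNORECASE)
    return pattern.sub(r"<b>\1</b>", escaped)
```

Escaping the snippet before inserting tags keeps text taken from arbitrary source documents from being interpreted as markup.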
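In some embodiments, a query submitted to the query engine 106 may be represented by a URL that includes the terms of the user query and one or more other parameters. For example, a query for the terms "britney spears parents" may be represented by the URL "http://www.google.com/search?hl=en&q=britney+spears+parents". In some embodiments, a request to display the list of sources of the answer may be made by adding an additional parameter, such as "&fsrc=1", to the query URL. Thus, for the URL above, if a list of the sources of the answer to the query "britney spears parents" is desired, the query URL may look like "http://www.google.com/search?hl=en&q=britney+spears+parents&fsrc=1". In some embodiments, the link in the response containing the answer that triggers the request for the list of sources of the answer when clicked by the user is the query URL for the user query with the additional parameter added.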
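The two example URLs above can be reproduced with the Python standard library. The helper name `query_url` and its `want_sources` flag are illustrative; the base URL and the `hl`, `q`, and `fsrc` parameters are the ones quoted in the text.

```python
from urllib.parse import urlencode


def query_url(terms, want_sources=False,
              base="http://www.google.com/search"):
    """Build a query URL, optionally requesting the list of answer sources."""
    params = [("hl", "en"), ("q", terms)]
    if want_sources:
        # Additional parameter described in the text for requesting sources.
        params.append(("fsrc", "1"))
    # urlencode encodes spaces as '+', matching the example URLs.
    return base + "?" + urlencode(params)
```

Because the source-list request differs from the ordinary query only by one extra parameter, the response page can offer it as a plain hyperlink.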
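In some other embodiments, the query engine 106 may receive, together with the fact query, a predetermined special operator that instructs the query engine 106 to find an answer to the fact query and return the answer and a list of the sources of the answer, without first returning a list of documents found by using the query as input to a document search. For example, the user may enter "Z:X of Y", with "Z:" being the special operator, to instruct the query engine 106 to find an answer to the fact query "X of Y" and a list of the sources of the answer. In a sense, using the operator together with the query merges the query with the request for the list of sources in which an answer to the query can be found. In some embodiments, the link in the response that generates the request for the list of sources of the answer, as described above, adds the special operator to the original query when the user selects (for example, clicks on) the link, and submits the query with the special operator to the query engine 106.
FIG. 6 illustrates an example response to a fact query, as presented to the user at the client 102, that includes an answer and the results of a document search using the fact query as input, according to some embodiments of the invention. The response 600 may show a search box 602 with the original user query. The response 600 includes an answer 604 to the query, a hyperlink 606 to a source of the answer, and a link 608 that, when clicked by the user, triggers a request for a list of the sources of the answer. In some embodiments, the link 608 may be the query URL for the user query with the source list request parameter added, as described above. In some embodiments, if the answer fact 604 has only one source in the fact repository 124, the link that triggers the request for the list of sources of the answer may be omitted from the response 600. The response may also include a list 610 of the results of a document search, such as a web search, that uses the fact query as input.
FIG. 7 illustrates an example response to a request for the list of sources of an answer, according to some embodiments of the invention. The response 700 includes a search box 702 with the original user query. In some embodiments, the search box 702 may also include the special operator that, as described above, may be used together with a fact query to request a list of sources. For example, in the search box 702, "factsources:" is the special operator and "britney spears parents" is the original user query. In some other embodiments, the operator may be omitted from the presentation of the query in the search box, for example if the trigger for the request for the list of sources was the user clicking on a link, such as the link 608, that includes the query URL with the source list request parameter. The response may also include the answer 704 to the fact query and a list of one or more sources 706 of the answer, along with a URL, a hyperlink, and a snippet 708 for each source. In some embodiments, the query terms and answer terms may be highlighted in each snippet 708. In the snippets 708, the query terms and answer terms are highlighted by being shown in bold.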
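FIG. 8 is a block diagram illustrating a fact query answering system 800, according to some embodiments of the invention. The system 800 typically includes one or more processing units (CPUs) 802, one or more network or other communications interfaces 810, memory 812, and one or more communication buses 814 for interconnecting these components. The system 800 optionally includes a user interface 804 comprising a display device 806 and a keyboard/mouse 808. The memory 812 includes high-speed random access memory, such as DRAM, SRAM, DDR RAM, or other random access solid-state memory devices, and may include non-volatile memory, such as one or more magnetic disk storage devices, optical disk storage devices, flash memory devices, or other non-volatile solid-state storage devices. The memory 812 optionally includes one or more storage devices located remotely from the CPU(s) 802. In some embodiments, the memory 812 stores the following programs, modules, and data structures, or a subset thereof:
●an operating system 816 that includes procedures for handling various basic system services and for performing hardware-dependent tasks;
●a query receipt and processing module 818 for receiving queries and processing them, such as by parsing a query to determine its QA type and generating fact queries;
●an answer identification module 820 for identifying possible answers to a fact query;
●an answer scoring module 822 for determining the scores and supported scores of answers;
●an answer comparison module 824 for comparing answers to determine whether they are supporting, contradictory, and so on;
●an answer selection module 825 for selecting a possible answer as the answer to present to the user;
●a source identification module 826 for identifying the sources of an answer;
●a document index interface 828 for interfacing with a document index when searching for documents;
●a document storage interface 830 for interfacing with a document storage system when requesting and receiving snippets;
●a fact index interface 832 for interfacing with a fact index when searching for facts;
●a fact storage interface 834 for interfacing with a fact storage system; and
●a response generation module 838 for generating the responses that are transmitted to the client 102.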
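In some embodiments, the memory 812 of the system 800 includes the fact index rather than the fact index interface 832. The system 800 also includes a document storage system 840 for storing the contents of documents, some of which may serve as sources of answer facts. The document storage system includes a snippet generator 842 for accessing the contents of documents and generating snippets from those contents, and a snippet term highlighting module 836 for highlighting the query terms and answer terms in the snippets. The system 800 also includes a fact storage system 844 for storing facts. Each fact stored in the fact storage system 844 includes a corresponding list of the sources from which the respective fact was extracted.
Each of the elements identified above is stored in one or more of the memory devices mentioned above and corresponds to a set of instructions for performing the functions described above. The modules or programs (that is, sets of instructions) identified above need not be implemented as separate software programs, procedures, or modules, and thus various subsets of these modules may be combined or otherwise rearranged in different embodiments. In some embodiments, the memory 812 may store a subset of the modules and data structures identified above. Furthermore, the memory 812 may store additional modules and data structures not described above.
Although FIG. 8 shows a fact query answering system, FIG. 8 is intended as a functional description of the various features, which may be present in a set of servers, rather than as a structural schematic of the embodiments described herein. In practice, as those skilled in the art will recognize, items shown separately could be combined and some items could be separated. For example, some items shown separately in FIG. 8 could be implemented on a single server, and single items could be implemented by one or more servers. The actual number of servers used to implement a fact query answering system, and how features are allocated among them, will vary from one implementation to another and may depend in part on the amount of data traffic that the system must handle during peak usage periods as well as during average usage periods.
The foregoing description, for purposes of explanation, has been given with reference to specific embodiments. However, the illustrative discussion above is not intended to be exhaustive or to limit the invention to the precise forms disclosed. Many modifications and variations are possible in view of the above teachings. The embodiments were chosen and described in order to best explain the principles of the invention and its practical applications, and thereby to enable those skilled in the art to best utilize the invention and the various embodiments with the various modifications suited to the particular use contemplated.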

Claims (15)

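1. A method for displaying the sources of a fact, comprising:
receiving a fact query, the fact query including one or more terms;
identifying an answer to the fact query, the answer including one or more terms;
identifying source documents that include one or more terms of the fact query and one or more terms of the answer;
generating a snippet for at least one of the source documents, the snippet including one or more terms of the fact query and one or more terms of the answer; and
generating a response that includes the snippet.
2. The method of claim 1, wherein generating the response further comprises highlighting, in the snippet, the one or more terms of the fact query and the one or more terms of the answer.
3. The method of claim 1, further comprising responding to user selection of a link in a displayed presentation of the answer to the fact query.
4. The method of claim 3, wherein generating the response is responsive to the user selection of the link.
5. The method of claim 1, further comprising receiving a request to list one or more source documents.
6. The method of claim 5, wherein generating the response is responsive to the request to list one or more source documents.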
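7. A system for displaying the sources of a fact, comprising:
a query receiver for receiving a fact query, the fact query including one or more terms;
an answer identifier for identifying an answer to the fact query, the answer including one or more terms;
a source document identifier for identifying source documents, the source documents including one or more terms of the fact query and one or more terms of the answer;
a snippet generator for generating a snippet for at least one of the source documents, the snippet including one or more terms of the fact query and one or more terms of the answer; and
a response generator for generating a response that includes the snippet.
8. The system of claim 7, wherein the response generator further includes instructions for highlighting, in the generated snippet, the one or more terms of the fact query and the one or more terms of the answer.
9. The system of claim 7, wherein the query receiver further includes instructions for receiving a request to list one or more source documents.
10. The system of claim 7, wherein the query receiver further includes instructions for responding to user selection of a link in the displayed answer to the fact query.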
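11. A system for displaying the sources of a fact, comprising:
means for receiving a fact query, the fact query including one or more terms;
means for identifying an answer to the fact query, the answer including one or more terms;
means for identifying source documents, the source documents including one or more terms of the fact query and one or more terms of the answer;
means for generating a snippet for at least one of the source documents, the snippet including one or more terms of the fact query and one or more terms of the answer; and
means for generating a response that includes the snippet.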
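12. A computer program product for use in conjunction with a computer system, the computer program product comprising a computer readable storage medium and a computer program mechanism embedded therein, the computer program mechanism comprising:
receiving a fact query, the fact query including one or more terms;
identifying an answer to the fact query, the answer including one or more terms;
identifying source documents, the source documents including one or more terms of the fact query and one or more terms of the answer;
generating a snippet for at least one of the source documents, the snippet including one or more terms of the fact query and one or more terms of the answer; and
generating a response that includes the snippet.
13. The computer program product of claim 12, the computer program mechanism further comprising highlighting, in the snippet, the one or more terms of the fact query and the one or more terms of the answer.
14. The computer program product of claim 12, the computer program mechanism further comprising receiving a request to list one or more source documents.
15. The computer program product of claim 12, the computer program mechanism further comprising responding to user selection of a link in the displayed answer to the fact query.
CN2006800187948A 2005-03-31 2006-03-24 User interface for facts query engine with snippets from information sources that include query terms and answer terms Active CN101185074B (zh)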

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US11/097,689 US7587387B2 (en) 2005-03-31 2005-03-31 User interface for facts query engine with snippets from information sources that include query terms and answer terms
US11/097,689 2005-03-31
PCT/US2006/010965 WO2006104951A1 (en) 2005-03-31 2006-03-24 User interface for facts query engine with snippets from information sources that include query terms and answer terms

Publications (2)

Publication Number Publication Date
CN101185074A true CN101185074A (zh) 2008-05-21
CN101185074B CN101185074B (zh) 2010-06-23

Family

ID=36590811

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2006800187948A Active CN101185074B (zh) 2005-03-31 2006-03-24 User interface for facts query engine with snippets from information sources that include query terms and answer terms

Country Status (6)

Country Link
US (5) US7587387B2 (zh)
EP (1) EP1872283A1 (zh)
JP (1) JP5074378B2 (zh)
CN (1) CN101185074B (zh)
CA (1) CA2603085C (zh)
WO (1) WO2006104951A1 (zh)

US10713261B2 (en) 2013-03-13 2020-07-14 Google Llc Generating insightful connections between graph entities
US9098543B2 (en) 2013-03-14 2015-08-04 Wal-Mart Stores, Inc. Attribute detection
US8965915B2 (en) 2013-03-17 2015-02-24 Alation, Inc. Assisted query formation, validation, and result previewing in a database having a complex schema
US9235653B2 (en) 2013-06-26 2016-01-12 Google Inc. Discovering entity actions for an entity graph
US9342622B2 (en) 2013-06-27 2016-05-17 Google Inc. Two-phase construction of data graphs from disparate inputs
US9514113B1 (en) 2013-07-29 2016-12-06 Google Inc. Methods for automatic footnote generation
US9842113B1 (en) 2013-08-27 2017-12-12 Google Inc. Context-based file selection
US9785696B1 (en) 2013-10-04 2017-10-10 Google Inc. Automatic discovery of new entities using graph reconciliation
WO2015051480A1 (en) 2013-10-09 2015-04-16 Google Inc. Automatic definition of entity collections
US9798829B1 (en) 2013-10-22 2017-10-24 Google Inc. Data graph interface
US10002117B1 (en) 2013-10-24 2018-06-19 Google Llc Translating annotation tags into suggested markup
US9529791B1 (en) 2013-12-12 2016-12-27 Google Inc. Template and content aware document and template editing
US20150186514A1 (en) * 2013-12-26 2015-07-02 Iac Search & Media, Inc. Central aggregator architechture for question and answer search engine
US9495457B2 (en) 2013-12-26 2016-11-15 Iac Search & Media, Inc. Batch crawl and fast crawl clusters for question and answer search engine
US9659056B1 (en) 2013-12-30 2017-05-23 Google Inc. Providing an explanation of a missing fact estimate
US9430573B2 (en) 2014-01-14 2016-08-30 Microsoft Technology Licensing, Llc Coherent question answering in search results
RU2586577C2 (en) 2014-01-15 2016-06-10 Abbyy Infopoisk Llc Filtering arcs in a syntactic graph
US9471689B2 (en) * 2014-05-29 2016-10-18 International Business Machines Corporation Managing documents in question answering systems
US10318753B2 (en) * 2014-06-30 2019-06-11 Vescel, Llc Semantic data structure and method
US10019513B1 (en) 2014-08-12 2018-07-10 Google Llc Weighted answer terms for scoring answer passages
US10180964B1 (en) 2014-08-13 2019-01-15 Google Llc Candidate answer passages
US9703840B2 (en) 2014-08-13 2017-07-11 International Business Machines Corporation Handling information source ingestion in a question answering system
US9916348B1 (en) 2014-08-13 2018-03-13 Google Llc Answer facts from structured content
US9940367B1 (en) 2014-08-13 2018-04-10 Google Llc Scoring candidate answer passages
US9703763B1 (en) 2014-08-14 2017-07-11 Google Inc. Automatic document citations by utilizing copied content for candidate sources
US9483582B2 (en) 2014-09-12 2016-11-01 International Business Machines Corporation Identification and verification of factual assertions in natural language
US10331673B2 (en) * 2014-11-24 2019-06-25 International Business Machines Corporation Applying level of permanence to statements to influence confidence ranking
US9626358B2 (en) 2014-11-26 2017-04-18 Abbyy Infopoisk Llc Creating ontologies by analyzing natural language texts
US10296644B2 (en) 2015-03-23 2019-05-21 Microsoft Technology Licensing, Llc Salient terms and entities for caption generation and presentation
US10755294B1 (en) 2015-04-28 2020-08-25 Intuit Inc. Method and system for increasing use of mobile devices to provide answer content in a question and answer based customer support system
US10095783B2 (en) 2015-05-25 2018-10-09 Microsoft Technology Licensing, Llc Multiple rounds of results summarization for improved latency and relevance
US10475044B1 (en) 2015-07-29 2019-11-12 Intuit Inc. Method and system for question prioritization based on analysis of the question content and predicted asker engagement before answer content is generated
US10176264B2 (en) 2015-12-01 2019-01-08 Microsoft Technology Licensing, Llc Generating topic pages based on data sources
US10854101B1 (en) * 2016-03-09 2020-12-01 Naveed Iftikhar Multi-media method for enhanced recall and retention of educational material
US10552843B1 (en) 2016-12-05 2020-02-04 Intuit Inc. Method and system for improving search results by recency boosting customer support content for a customer self-help system associated with one or more financial management systems
US10748157B1 (en) 2017-01-12 2020-08-18 Intuit Inc. Method and system for determining levels of search sophistication for users of a customer self-help system to personalize a content search user experience provided to the users and to increase a likelihood of user satisfaction with the search experience
US11200269B2 (en) 2017-06-15 2021-12-14 Microsoft Technology Licensing, Llc Method and system for highlighting answer phrases
US20180365318A1 (en) * 2017-06-19 2018-12-20 Microsoft Technology Licensing, Llc Semantic analysis of search results to generate snippets responsive to receipt of a query
US10922367B2 (en) * 2017-07-14 2021-02-16 Intuit Inc. Method and system for providing real time search preview personalization in data management systems
US11093951B1 (en) 2017-09-25 2021-08-17 Intuit Inc. System and method for responding to search queries using customer self-help systems associated with a plurality of data management systems
US11436642B1 (en) 2018-01-29 2022-09-06 Intuit Inc. Method and system for generating real-time personalized advertisements in data management self-help systems
US11269665B1 (en) 2018-03-28 2022-03-08 Intuit Inc. Method and system for user experience personalization in data management systems using machine learning
US10740541B2 (en) * 2018-05-24 2020-08-11 Microsoft Technology Licensing, Llc Fact validation in document editors
US11049604B2 (en) 2018-09-26 2021-06-29 International Business Machines Corporation Cognitive monitoring of online user profiles to detect changes in online behavior
US11822918B2 (en) * 2018-10-13 2023-11-21 Affirm, Inc. Code search and code navigation
US11037049B2 (en) 2018-10-29 2021-06-15 International Business Machines Corporation Determining rationale of cognitive system output
US11289092B2 (en) * 2019-09-25 2022-03-29 International Business Machines Corporation Text editing using speech recognition
US11526756B1 (en) * 2020-06-24 2022-12-13 Amazon Technologies, Inc. Artificial intelligence system with composite models for multiple response-string queries

Family Cites Families (361)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4888690A (en) 1985-01-11 1989-12-19 Wang Laboratories, Inc. Interactive error handling means in database management
US5010478A (en) 1986-04-11 1991-04-23 Deran Roger L Entity-attribute value database system with inverse attribute for selectively relating two different entities
US4899292A (en) 1988-03-02 1990-02-06 Image Storage/Retrieval Systems, Inc. System for storing and retrieving text and associated graphics
US5133075A (en) 1988-12-19 1992-07-21 Hewlett-Packard Company Method of monitoring changes in attribute values of object in an object-oriented database
US5440730A (en) 1990-08-09 1995-08-08 Bell Communications Research, Inc. Time index access structure for temporal databases having concurrent multiple versions
CA2048306A1 (en) 1990-10-02 1992-04-03 Steven P. Miller Distributed configuration profile for computing system
US5347653A (en) 1991-06-28 1994-09-13 Digital Equipment Corporation System for reconstructing prior versions of indexes using records indicating changes between successive versions of the indexes
US5694590A (en) 1991-09-27 1997-12-02 The Mitre Corporation Apparatus and method for the detection of security violations in multilevel secure databases
US5649183A (en) 1992-12-08 1997-07-15 Microsoft Corporation Method for compressing full text indexes with document identifiers and location offsets
US5574898A (en) 1993-01-08 1996-11-12 Atria Software, Inc. Dynamic software version auditor which monitors a process to provide a list of objects that are accessed
US7082426B2 (en) 1993-06-18 2006-07-25 Cnet Networks, Inc. Content aggregation method and apparatus for an on-line product catalog
US5519608A (en) 1993-06-24 1996-05-21 Xerox Corporation Method for extracting from a text corpus answers to questions stated in natural language by using linguistic analysis and hypothesis generation
US5546507A (en) 1993-08-20 1996-08-13 Unisys Corporation Apparatus and method for generating a knowledge base
US5544051A (en) 1993-09-17 1996-08-06 Digital Equipment Corporation Document management system using multiple threaded processes and having asynchronous repository responses and no busy cursor
US5560005A (en) 1994-02-25 1996-09-24 Actamed Corp. Methods and systems for object-based relational distributed databases
US6216138B1 (en) 1994-04-22 2001-04-10 Brooks Automation Inc. Computer interface system for automatically generating graphical representations of computer operations linked together according to functional relationships
US5680622A (en) 1994-06-30 1997-10-21 Borland International, Inc. System and methods for quickly detecting shareability of symbol and type information in header files
US5675785A (en) 1994-10-04 1997-10-07 Hewlett-Packard Company Data warehouse which is accessed by a user using a schema of virtual tables
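JP2809341B2 (en) 1994-11-18 1998-10-08 Matsushita Electric Industrial Co., Ltd. Information summarizing method, information summarizing device, weighting method, and teletext receiving device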
US6029195A (en) 1994-11-29 2000-02-22 Herz; Frederick S. M. System for customized electronic identification of desirable objects
US5608903A (en) 1994-12-15 1997-03-04 Novell, Inc. Method and apparatus for moving subtrees in a distributed network directory
US5717911A (en) 1995-01-23 1998-02-10 Tandem Computers, Inc. Relational database system and method with high availability compliation of SQL programs
US5793966A (en) 1995-12-01 1998-08-11 Vermeer Technologies, Inc. Computer system and computer-implemented process for creation and maintenance of online services
US5724571A (en) 1995-07-07 1998-03-03 Sun Microsystems, Inc. Method and apparatus for generating query responses in a computer-based document retrieval system
US5717951A (en) 1995-08-07 1998-02-10 Yabumoto; Kan W. Method for storing and retrieving information on a magnetic storage medium via data blocks of variable sizes
US6026388A (en) 1995-08-16 2000-02-15 Textwise, Llc User interface and other enhancements for natural language information retrieval system and method
US5963940A (en) 1995-08-16 1999-10-05 Syracuse University Natural language information retrieval system and method
US6006221A (en) * 1995-08-16 1999-12-21 Syracuse University Multilingual document retrieval system and method using semantic vector matching
US5838979A (en) 1995-10-31 1998-11-17 Peritus Software Services, Inc. Process and tool for scalable automated data field replacement
US5701470A (en) 1995-12-08 1997-12-23 Sun Microsystems, Inc. System and method for space efficient object locking using a data subarray and pointers
US5815415A (en) 1996-01-19 1998-09-29 Bentley Systems, Incorporated Computer system for portable persistent modeling
US6076088A (en) 1996-02-09 2000-06-13 Paik; Woojin Information extraction system and method using concept relation concept (CRC) triples
US5802299A (en) 1996-02-13 1998-09-01 Microtouch Systems, Inc. Interactive system for authoring hypertext document collections
US5778378A (en) 1996-04-30 1998-07-07 International Business Machines Corporation Object oriented information retrieval framework mechanism
US6014661A (en) 1996-05-06 2000-01-11 Ivee Development Ab System and method for automatic analysis of data bases and for user-controlled dynamic querying
US6101515A (en) 1996-05-31 2000-08-08 Oracle Corporation Learning system for classification of terminology
US5920859A (en) 1997-02-05 1999-07-06 Idd Enterprises, L.P. Hypertext document retrieval system and method
US5819210A (en) 1996-06-21 1998-10-06 Xerox Corporation Method of lazy contexted copying during unification
US6052693A (en) 1996-07-02 2000-04-18 Harlequin Group Plc System for assembling large databases through information extracted from text sources
US5987460A (en) 1996-07-05 1999-11-16 Hitachi, Ltd. Document retrieval-assisting method and system for the same and document retrieval service using the same with document frequency and term frequency
US5819265A (en) 1996-07-12 1998-10-06 International Business Machines Corporation Processing names in a text
US5778373A (en) 1996-07-15 1998-07-07 At&T Corp Integration of an information server database schema by generating a translation map from exemplary files
US5787413A (en) 1996-07-29 1998-07-28 International Business Machines Corporation C++ classes for a digital library
US6820093B2 (en) 1996-07-30 2004-11-16 Hyperphrase Technologies, Llc Method for verifying record code prior to an action based on the code
US5870739A (en) 1996-09-20 1999-02-09 Novell, Inc. Hybrid query apparatus and method
US5826258A (en) 1996-10-02 1998-10-20 Junglee Corporation Method and apparatus for structuring the querying and interpretation of semistructured information
JP3099756B2 (en) 1996-10-31 2000-10-16 Fuji Xerox Co., Ltd. Document processing device, word extraction device, and word extraction method
US6480194B1 (en) 1996-11-12 2002-11-12 Silicon Graphics, Inc. Computer-related method, system, and program product for controlling data visualization in external dimension(s)
US6498921B1 (en) 1999-09-01 2002-12-24 Chi Fai Ho Method and system to answer a natural-language question
US5966126A (en) 1996-12-23 1999-10-12 Szabo; Andrew J. Graphic user interface for database system
US6285999B1 (en) 1997-01-10 2001-09-04 The Board Of Trustees Of The Leland Stanford Junior University Method for node ranking in a linked database
AUPO525497A0 (en) 1997-02-21 1997-03-20 Mills, Dudley John Network-based classified information systems
US6134555A (en) 1997-03-10 2000-10-17 International Business Machines Corporation Dimension reduction using association rules for data mining application
US5822743A (en) 1997-04-08 1998-10-13 1215627 Ontario Inc. Knowledge-based information retrieval system
US5882743A (en) 1997-04-21 1999-03-16 Kimberly-Clark Worldwide, Inc. Absorbent folded hand towel
US5946692A (en) 1997-05-08 1999-08-31 At & T Corp Compressed representation of a data base that permits AD HOC querying
US6038560A (en) 1997-05-21 2000-03-14 Oracle Corporation Concept knowledge base search and retrieval system
US5974254A (en) 1997-06-06 1999-10-26 National Instruments Corporation Method for detecting differences between graphical programs
CA2242158C (en) 1997-07-01 2004-06-01 Hitachi, Ltd. Method and apparatus for searching and displaying structured document
US5893093A (en) 1997-07-02 1999-04-06 The Sabre Group, Inc. Information search and retrieval with geographical coordinates
US5987448A (en) 1997-07-25 1999-11-16 Claritech Corporation Methodology for displaying search results using character recognition
AU735024B2 (en) 1997-07-25 2001-06-28 British Telecommunications Public Limited Company Scheduler for a software system
AU753202B2 (en) 1997-07-25 2002-10-10 British Telecommunications Public Limited Company Software system generation
DE69803575T2 (en) 1997-07-25 2002-08-29 British Telecomm Visualisation in a modular software system
US5909689A (en) 1997-09-18 1999-06-01 Sony Corporation Automatic update of file versions for files shared by several computers which record in respective file directories temporal information for indicating when the files have been created
US6073130A (en) 1997-09-23 2000-06-06 At&T Corp. Method for improving the results of a search in a structured database
US6442540B2 (en) 1997-09-29 2002-08-27 Kabushiki Kaisha Toshiba Information retrieval apparatus and information retrieval method
US6018741A (en) 1997-10-22 2000-01-25 International Business Machines Corporation Method and system for managing objects in a dynamic inheritance tree
US6112210A (en) 1997-10-31 2000-08-29 Oracle Corporation Apparatus and method for null representation in database object storage
US5943670A (en) 1997-11-21 1999-08-24 International Business Machines Corporation System and method for categorizing objects in combined categories
US6188403B1 (en) 1997-11-21 2001-02-13 Portola Dimensional Systems, Inc. User-friendly graphics generator using direct manipulation
US6349275B1 (en) 1997-11-24 2002-02-19 International Business Machines Corporation Multiple concurrent language support system for electronic catalogue using a concept based knowledge representation
US6212526B1 (en) 1997-12-02 2001-04-03 Microsoft Corporation Method for apparatus for efficient mining of classification models from databases
US6094650A (en) 1997-12-15 2000-07-25 Manning & Napier Information Services Database analysis using a probabilistic ontology
FI106089B (en) 1997-12-23 2000-11-15 Sonera Oyj Tracking of a mobile terminal in a mobile communication system
US6105030A (en) 1998-02-27 2000-08-15 Oracle Corporation Method and apparatus for copying data that resides in a database
US6173287B1 (en) 1998-03-11 2001-01-09 Digital Equipment Corporation Technique for ranking multimedia annotations of interest
JPH11265400A (en) * 1998-03-13 1999-09-28 Omron Corp Information processing device and method, network system, and recording medium
US6044366A (en) 1998-03-16 2000-03-28 Microsoft Corporation Use of the UNPIVOT relational operator in the efficient gathering of sufficient statistics for data mining
US6078918A (en) 1998-04-02 2000-06-20 Trivada Corporation Online predictive memory
US6112203A (en) 1998-04-09 2000-08-29 Altavista Company Method for ranking documents in a hyperlinked environment using connectivity and selective content analysis
US6567846B1 (en) 1998-05-15 2003-05-20 E.Piphany, Inc. Extensible user interface for a distributed messaging framework in a computer network
US6122647A (en) 1998-05-19 2000-09-19 Perspecta, Inc. Dynamic generation of contextual links in hypertext documents
US6742003B2 (en) 2001-04-30 2004-05-25 Microsoft Corporation Apparatus and accompanying methods for visualizing clusters of data and hierarchical cluster classifications
US6327574B1 (en) 1998-07-07 2001-12-04 Encirq Corporation Hierarchical models of consumer attributes for targeting content in a privacy-preserving manner
US6240546B1 (en) 1998-07-24 2001-05-29 International Business Machines Corporation Identifying date fields for runtime year 2000 system solution process, method and article of manufacture
US7409381B1 (en) 1998-07-30 2008-08-05 British Telecommunications Public Limited Company Index to a semi-structured database
US6665837B1 (en) 1998-08-10 2003-12-16 Overture Services, Inc. Method for identifying related pages in a hyperlinked database
US6694482B1 (en) 1998-09-11 2004-02-17 Sbc Technology Resources, Inc. System and methods for an architectural framework for design of an adaptive, personalized, interactive content delivery system
US6470330B1 (en) 1998-11-05 2002-10-22 Sybase, Inc. Database system with methods for estimation and usage of index page cluster ratio (IPCR) and data page cluster ratio (DPCR)
FR2787957B1 (en) 1998-12-28 2001-10-05 Inst Nat Rech Inf Automat Method for processing a query
US6572661B1 (en) 1999-01-11 2003-06-03 Cisco Technology, Inc. System and method for automated annotation of files
US6529900B1 (en) 1999-01-14 2003-03-04 International Business Machines Corporation Method and apparatus for data visualization
US6377943B1 (en) 1999-01-20 2002-04-23 Oracle Corp. Initial ordering of tables for database queries
US7003719B1 (en) 1999-01-25 2006-02-21 West Publishing Company, Dba West Group System, method, and software for inserting hyperlinks into documents
US6565610B1 (en) 1999-02-11 2003-05-20 Navigation Technologies Corporation Method and system for text placement when forming maps
GB9903451D0 (en) 1999-02-16 1999-04-07 Hewlett Packard Co Similarity searching for documents
US6574635B2 (en) 1999-03-03 2003-06-03 Siebel Systems, Inc. Application instantiation based upon attributes and values stored in a meta data repository, including tiering of application layers objects and components
US6584464B1 (en) * 1999-03-19 2003-06-24 Ask Jeeves, Inc. Grammar template query system
US6763496B1 (en) 1999-03-31 2004-07-13 Microsoft Corporation Method for promoting contextual information to display pages containing hyperlinks
US6397228B1 (en) 1999-03-31 2002-05-28 Verizon Laboratories Inc. Data enhancement techniques
US6263328B1 (en) 1999-04-09 2001-07-17 International Business Machines Corporation Object oriented query model and process for complex heterogeneous database queries
US20030195872A1 (en) 1999-04-12 2003-10-16 Paul Senn Web-based information content analyzer and information dimension dictionary
US6304864B1 (en) 1999-04-20 2001-10-16 Textwise Llc System for retrieving multimedia information from the internet using multiple evolving intelligent agents
US6629097B1 (en) 1999-04-28 2003-09-30 Douglas K. Keith Displaying implicit associations among items in loosely-structured data sets
US6721713B1 (en) 1999-05-27 2004-04-13 Andersen Consulting Llp Business alliance identification in a web architecture framework
US6885990B1 (en) 1999-05-31 2005-04-26 Nippon Telegraph And Telephone Company Speech recognition based on interactive information retrieval scheme using dialogue control to reduce user stress
US6606625B1 (en) 1999-06-03 2003-08-12 University Of Southern California Wrapper induction by hierarchical data analysis
US6711585B1 (en) 1999-06-15 2004-03-23 Kanisa Inc. System and method for implementing a knowledge management system
US6438543B1 (en) 1999-06-17 2002-08-20 International Business Machines Corporation System and method for cross-document coreference
US6473898B1 (en) 1999-07-06 2002-10-29 Pcorder.Com, Inc. Method for compiling and selecting data attributes
US6873982B1 (en) 1999-07-16 2005-03-29 International Business Machines Corporation Ordering of database search results based on user feedback
EP1072987A1 (en) 1999-07-29 2001-01-31 International Business Machines Corporation Geographic web browser and iconic hyperlink cartography
US6341306B1 (en) * 1999-08-13 2002-01-22 Atomica Corporation Web-based information retrieval responsive to displayed word identified by a text-grabbing algorithm
CA2281331A1 (en) 1999-09-03 2001-03-03 Cognos Incorporated Database management system
US6845354B1 (en) 1999-09-09 2005-01-18 Institute For Information Industry Information retrieval system with a neuro-fuzzy structure
US6754873B1 (en) 1999-09-20 2004-06-22 Google Inc. Techniques for finding related hyperlinked documents using link-based analysis
GB2371901B (en) 1999-09-21 2004-06-23 Andrew E Borthwick A probabilistic record linkage model derived from training data
US6105020A (en) 1999-10-11 2000-08-15 International Business Machines Corporation System and method for identifying and constructing star joins for execution by bitmap ANDing
AU2702701A (en) 1999-10-15 2001-04-23 Milind Kotwal Method of categorization and indexing of information
US6665666B1 (en) 1999-10-26 2003-12-16 International Business Machines Corporation System, method and program product for answering questions using a search engine
US6850896B1 (en) 1999-10-28 2005-02-01 Market-Touch Corporation Method and system for managing and providing sales data using world wide web
JP3888812B2 (en) 1999-11-01 2007-03-07 Fujitsu Limited Fact data integration method and device
US6804667B1 (en) 1999-11-30 2004-10-12 Ncr Corporation Filter for checking for duplicate entries in database
US6963867B2 (en) 1999-12-08 2005-11-08 A9.Com, Inc. Search query processing to provide category-ranked presentation of search results
US7305380B1 (en) 1999-12-15 2007-12-04 Google Inc. Systems and methods for performing in-context searching
US6865582B2 (en) 2000-01-03 2005-03-08 Bechtel Bwxt Idaho, Llc Systems and methods for knowledge discovery in spatial data
US6546388B1 (en) 2000-01-14 2003-04-08 International Business Machines Corporation Metadata search results ranking system
US6606659B1 (en) 2000-01-28 2003-08-12 Websense, Inc. System and method for controlling access to internet sites
US6665659B1 (en) 2000-02-01 2003-12-16 James D. Logan Methods and apparatus for distributing and using metadata via the internet
US6567936B1 (en) 2000-02-08 2003-05-20 Microsoft Corporation Data clustering using error-tolerant frequent item sets
AU2001241564A1 (en) 2000-02-17 2001-08-27 E-Numerate Solutions, Inc. Rdl search engine
US6584646B2 (en) * 2000-02-29 2003-07-01 Katoh Electrical Machinery Co., Ltd. Tilt hinge for office automation equipment
US6901403B1 (en) 2000-03-02 2005-05-31 Quovadx, Inc. XML presentation of general-purpose data sources
US6311194B1 (en) 2000-03-15 2001-10-30 Taalee, Inc. System and method for creating a semantic web and its applications in browsing, searching, profiling, personalization and advertising
US20030120659A1 (en) 2000-03-20 2003-06-26 Sridhar Mandayam Anandampillai Systems for developing websites and methods therefor
US6738767B1 (en) 2000-03-20 2004-05-18 International Business Machines Corporation System and method for discovering schematic structure in hypertext documents
US6502102B1 (en) 2000-03-27 2002-12-31 Accenture Llp System, method and article of manufacture for a table-driven automated scripting architecture
US6643641B1 (en) * 2000-04-27 2003-11-04 Russell Snyder Web search engine with graphic snapshots
EP1156430A2 (en) 2000-05-17 2001-11-21 Matsushita Electric Industrial Co., Ltd. Information retrieval system
US6957213B1 (en) 2000-05-17 2005-10-18 Inquira, Inc. Method of utilizing implicit references to answer a query
US7325201B2 (en) 2000-05-18 2008-01-29 Endeca Technologies, Inc. System and method for manipulating content in a hierarchical data-driven search and navigation system
US7062483B2 (en) 2000-05-18 2006-06-13 Endeca Technologies, Inc. Hierarchical data-driven search and navigation system and method for information retrieval
US7401131B2 (en) 2000-05-22 2008-07-15 Verizon Business Global Llc Method and system for implementing improved containers in a global ecosystem of interrelated services
WO2001090921A2 (en) 2000-05-25 2001-11-29 Kanisa, Inc. System and method for automatically classifying text
US6754677B1 (en) 2000-05-30 2004-06-22 Outlooksoft Corporation Method and system for facilitating information exchange
US6487495B1 (en) 2000-06-02 2002-11-26 Navigation Technologies Corporation Navigation applications using related location-referenced keywords
US6963876B2 (en) 2000-06-05 2005-11-08 International Business Machines Corporation System and method for searching extended regular expressions
US6745189B2 (en) 2000-06-05 2004-06-01 International Business Machines Corporation System and method for enabling multi-indexing of objects
US7328349B2 (en) 2001-12-14 2008-02-05 Bbn Technologies Corp. Hash-based systems and methods for detecting, preventing, and tracing network worms and viruses
US20020042707A1 (en) 2000-06-19 2002-04-11 Gang Zhao Grammar-packaged parsing
US7162499B2 (en) 2000-06-21 2007-01-09 Microsoft Corporation Linked value replication
GB0015233D0 (en) 2000-06-21 2000-08-16 Canon Kk Indexing method and apparatus
MXPA03000110A (en) 2000-06-22 2006-06-08 Mayer Yaron Search system and method for finding and contacting dates in instant messengers on the network and/or in other methods capable of finding and creating an immediate contact
US7003506B1 (en) 2000-06-23 2006-02-21 Microsoft Corporation Method and system for creating an embedded search link document
US6578032B1 (en) 2000-06-28 2003-06-10 Microsoft Corporation Method and system for performing phrase/word clustering and cluster merging
US7080085B1 (en) 2000-07-12 2006-07-18 International Business Machines Corporation System and method for ensuring referential integrity for heterogeneously scoped references in an information management system
US6728728B2 (en) 2000-07-24 2004-04-27 Israel Spiegler Unified binary model and methodology for knowledge representation and for data and information mining
US6675159B1 (en) 2000-07-27 2004-01-06 Science Applic Int Corp Concept-based search and retrieval system
US7640489B2 (en) 2000-08-01 2009-12-29 Sun Microsystems, Inc. Methods and systems for inputting data into spreadsheet documents
US7100082B2 (en) 2000-08-04 2006-08-29 Sun Microsystems, Inc. Check creation and maintenance for product knowledge management
US7100083B2 (en) 2000-08-04 2006-08-29 Sun Microsystems, Inc. Checks for product knowledge management
US7146536B2 (en) 2000-08-04 2006-12-05 Sun Microsystems, Inc. Fact collection for product knowledge management
US7080073B1 (en) 2000-08-18 2006-07-18 Firstrain, Inc. Method and apparatus for focused crawling
IT1318683B1 (en) 2000-08-22 2003-08-27 Ausimont Spa Blends of fluorinated and acrylic elastomers
US20030217052A1 (en) 2000-08-24 2003-11-20 Celebros Ltd. Search engine method and apparatus
WO2002019272A1 (en) 2000-09-01 2002-03-07 Togethersoft Corporation Methods and systems for animating a workflow and a project plan
US6556991B1 (en) 2000-09-01 2003-04-29 E-Centives, Inc. Item name normalization
US6823495B1 (en) 2000-09-14 2004-11-23 Microsoft Corporation Mapping tool graphical user interface
US6832218B1 (en) 2000-09-22 2004-12-14 International Business Machines Corporation System and method for associating search results
US7493308B1 (en) 2000-10-03 2009-02-17 A9.Com, Inc. Searching documents using a dimensional database
US20020065815A1 (en) 2000-10-04 2002-05-30 Xcelerix, Inc. Systems and methods for searching a database
US6684205B1 (en) 2000-10-18 2004-01-27 International Business Machines Corporation Clustering hypertext with applications to web searching
JP2002157276A (en) * 2000-11-16 2002-05-31 Hitachi Software Eng Co Ltd Problem solving support method and system
US7013308B1 (en) 2000-11-28 2006-03-14 Semscript Ltd. Knowledge storage and retrieval system and method
US20020174099A1 (en) 2000-11-28 2002-11-21 Anthony Raj Minimal identification
US8402068B2 (en) 2000-12-07 2013-03-19 Half.Com, Inc. System and method for collecting, associating, normalizing and presenting product and vendor information on a distributed network
US6781599B2 (en) 2001-01-04 2004-08-24 At&T System and method for visualizing massive multi-digraphs
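JP2002230035A (ja) 2001-01-05 2002-08-16 Internatl Business Mach Corp <Ibm> Information organizing method, information processing apparatus, information processing system, storage medium, and program transmission apparatus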
US6879969B2 (en) 2001-01-21 2005-04-12 Volvo Technological Development Corporation System and method for real-time recognition of driving patterns
US6693651B2 (en) 2001-02-07 2004-02-17 International Business Machines Corporation Customer self service iconic interface for resource search results display and selection
US7143099B2 (en) 2001-02-08 2006-11-28 Amdocs Software Systems Limited Historical data warehousing system
US7216073B2 (en) 2001-03-13 2007-05-08 Intelligate, Ltd. Dynamic natural language understanding
US6820081B1 (en) 2001-03-19 2004-11-16 Attenex Corporation System and method for evaluating a structured message store for message redundancy
US20020147738A1 (en) 2001-04-06 2002-10-10 Reader Scot A. Method and appratus for finding patent-relevant web documents
US6920477B2 (en) 2001-04-06 2005-07-19 President And Fellows Of Harvard College Distributed, compressed Bloom filter Web cache server
JP4159366B2 (ja) 2001-04-12 2008-10-01 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ Method and system for registering user preferences
US7031955B1 (en) 2001-04-27 2006-04-18 I2 Technologies Us, Inc. Optimization using a multi-dimensional data model
US20020169770A1 (en) 2001-04-27 2002-11-14 Kim Brian Seong-Gon Apparatus and method that categorize a collection of documents into a hierarchy of categories that are defined by the collection of documents
US6961723B2 (en) 2001-05-04 2005-11-01 Sun Microsystems, Inc. System and method for determining relevancy of query responses in a distributed network search mechanism
US7020662B2 (en) 2001-05-29 2006-03-28 Sun Microsystems, Inc. Method and system for determining a directory entry's class of service based on the value of a specifier in the entry
MXPA03011976A (es) 2001-06-22 2005-07-01 Nervana Inc System and method for knowledge retrieval, management, delivery and presentation.
US7263656B2 (en) 2001-07-16 2007-08-28 Canon Kabushiki Kaisha Method and device for scheduling, generating and processing a document comprising blocks of information
WO2003009251A1 (en) 2001-07-18 2003-01-30 Hyunjae Tech Co., Ltd System for automatic recognizing licence number of other vehicles on observation vehicles and method thereof
JP4571404B2 (ja) 2001-07-26 2010-10-27 インターナショナル・ビジネス・マシーンズ・コーポレーション Data processing method, data processing system, and program
CA2354443A1 (en) 2001-07-31 2003-01-31 Ibm Canada Limited-Ibm Canada Limitee Method and system for visually constructing xml schemas using an object-oriented model
WO2003017023A2 (en) 2001-08-14 2003-02-27 Quigo Technologies, Inc. System and method for extracting content for submission to a search engine
US7398201B2 (en) 2001-08-14 2008-07-08 Evri Inc. Method and system for enhanced data searching
US7386832B2 (en) 2001-08-31 2008-06-10 Siebel Systems, Inc. Configurator using structure to provide a user interface
US7058653B2 (en) 2001-09-17 2006-06-06 Ricoh Company, Ltd. Tree system diagram output method, computer program and recording medium
US7403938B2 (en) 2001-09-24 2008-07-22 Iac Search & Media, Inc. Natural language query processing
US7020641B2 (en) 2001-10-22 2006-03-28 Sun Microsystems, Inc. Method, system, and program for maintaining a database of data objects
US7197449B2 (en) 2001-10-30 2007-03-27 Intel Corporation Method for extracting name entities and jargon terms using a suffix tree data structure
CN100461156C (zh) 2001-11-09 2009-02-11 无锡永中科技有限公司 Integrated data processing system
US6826572B2 (en) 2001-11-13 2004-11-30 Overture Services, Inc. System and method allowing advertisers to manage search listings in a pay for placement search system using grouping
JP3931214B2 (ja) 2001-12-17 2007-06-13 日本アイ・ビー・エム株式会社 Data analysis apparatus and program
US6965900B2 (en) 2001-12-19 2005-11-15 X-Labs Holdings, Llc Method and apparatus for electronically extracting application specific multidimensional information from documents selected from a set of documents electronically extracted from a library of electronically searchable documents
US6988018B2 (en) 2001-12-26 2006-01-17 Eames John D System and method for analyzing controlling forming sections of a paper machine in operation
US7096231B2 (en) 2001-12-28 2006-08-22 American Management Systems, Inc. Export engine which builds relational database directly from object model
US7209906B2 (en) 2002-01-14 2007-04-24 International Business Machines Corporation System and method for implementing a metrics engine for tracking relationships over time
US7398461B1 (en) 2002-01-24 2008-07-08 Overture Services, Inc. Method for ranking web page search results
US20030149567A1 (en) 2002-02-04 2003-08-07 Tony Schmitz Method and system for using natural language in computer resource utilization analysis via a communications network
EP1485825A4 (en) 2002-02-04 2008-03-19 Cataphora Inc DETAILED EXPLORATION TECHNIQUE OF SOCIOLOGICAL DATA AND CORRESPONDING APPARATUS
US7421660B2 (en) 2003-02-04 2008-09-02 Cataphora, Inc. Method and apparatus to visually present discussions for data mining purposes
US20030154071A1 (en) 2002-02-11 2003-08-14 Shreve Gregory M. Process for the document management and computer-assisted translation of documents utilizing document corpora constructed by intelligent agents
US7343365B2 (en) 2002-02-20 2008-03-11 Microsoft Corporation Computer system architecture for automatic context associations
US7165024B2 (en) 2002-02-22 2007-01-16 Nec Laboratories America, Inc. Inferring hierarchical descriptions of a set of documents
US6928436B2 (en) 2002-02-28 2005-08-09 Ilog Sa Interactive generation of graphical visualizations of large data structures
JP4098539B2 (ja) 2002-03-15 2008-06-11 富士通株式会社 Profile information recommendation method, program, and apparatus
US7043521B2 (en) 2002-03-21 2006-05-09 Rockwell Electronic Commerce Technologies, Llc Search agent for searching the internet
JP3896014B2 (ja) * 2002-03-22 2007-03-22 株式会社東芝 Information collection system, information collection method, and program for causing a computer to execute information collection
CA2479228C (en) 2002-03-27 2011-08-09 British Telecommunications Public Limited Company Network security system
US20040030731A1 (en) 2002-04-03 2004-02-12 Liviu Iftode System and method for accessing files in a network
US6857053B2 (en) 2002-04-10 2005-02-15 International Business Machines Corporation Method, system, and program for backing up objects by creating groups of objects
US20030208665A1 (en) 2002-05-01 2003-11-06 Jih-Kwon Peir Reducing data speculation penalty with early cache hit/miss prediction
TWI256562B (en) 2002-05-03 2006-06-11 Ind Tech Res Inst Method for named-entity recognition and verification
US6963880B1 (en) 2002-05-10 2005-11-08 Oracle International Corporation Schema evolution of complex objects
US20040015481A1 (en) 2002-05-23 2004-01-22 Kenneth Zinda Patent data mining
US7003522B1 (en) 2002-06-24 2006-02-21 Microsoft Corporation System and method for incorporating smart tags in online content
US20040003067A1 (en) 2002-06-27 2004-01-01 Daniel Ferrin System and method for enabling a user interface with GUI meta data
GB0215464D0 (en) 2002-07-04 2002-08-14 Hewlett Packard Co Combining data descriptions
WO2004019264A1 (en) 2002-08-22 2004-03-04 Agency For Science, Technology And Research Prediction by collective likelihood from emerging patterns
US20040059726A1 (en) 2002-09-09 2004-03-25 Jeff Hunter Context-sensitive wordless search
US7158983B2 (en) 2002-09-23 2007-01-02 Battelle Memorial Institute Text analysis technique
US20040064447A1 (en) 2002-09-27 2004-04-01 Simske Steven J. System and method for management of synonymic searching
DE10245900A1 (de) 2002-09-30 2004-04-08 Neven jun., Hartmut, Prof.Dr. Image-based query system for search engines for mobile devices with a built-in camera
US6886010B2 (en) 2002-09-30 2005-04-26 The United States Of America As Represented By The Secretary Of The Navy Method for data and text mining and literature-based discovery
US7096217B2 (en) 2002-10-31 2006-08-22 International Business Machines Corporation Global query correlation attributes
US20050108256A1 (en) 2002-12-06 2005-05-19 Attensity Corporation Visualization of integrated structured and unstructured data
US7277879B2 (en) 2002-12-17 2007-10-02 Electronic Data Systems Corporation Concept navigation in data storage systems
US7181450B2 (en) 2002-12-18 2007-02-20 International Business Machines Corporation Method, system, and program for use of metadata to create multidimensional cubes in a relational database
US20040122846A1 (en) 2002-12-19 2004-06-24 Ibm Corporation Fact verification system
US7107528B2 (en) 2002-12-20 2006-09-12 International Business Machines Corporation Automatic completion of dates
US7278111B2 (en) 2002-12-26 2007-10-02 Yahoo! Inc. Systems and methods for selecting a date or range of dates
US7472182B1 (en) 2002-12-31 2008-12-30 Emc Corporation Data collection policy for storage devices
GB0304639D0 (en) 2003-02-28 2003-04-02 Kiq Ltd Classification using re-sampling of probability estimates
US7020666B2 (en) 2003-03-07 2006-03-28 Microsoft Corporation System and method for unknown type serialization
US7146538B2 (en) 2003-03-28 2006-12-05 Hewlett-Packard Development Company, L.P. Bus interface module
US7051023B2 (en) 2003-04-04 2006-05-23 Yahoo! Inc. Systems and methods for generating concept units from search queries
EP1629359A4 (en) 2003-04-07 2008-01-09 Sevenecho Llc METHOD, SYSTEM AND SOFTWARE FOR CUSTOMIZING PERSONALIZED NARRATIVE PRESENTATIONS
US7747571B2 (en) 2003-04-15 2010-06-29 At&T Intellectual Property, I,L.P. Methods, systems, and computer program products for implementing logical and physical data models
US20040243552A1 (en) 2003-05-30 2004-12-02 Dictaphone Corporation Method, system, and apparatus for viewing data
US8095544B2 (en) 2003-05-30 2012-01-10 Dictaphone Corporation Method, system, and apparatus for validation
US7111025B2 (en) 2003-04-30 2006-09-19 International Business Machines Corporation Information retrieval system and method using index ANDing for improving performance
EP1477892B1 (en) 2003-05-16 2015-12-23 Sap Se System, method, computer program product and article of manufacture for inputting data in a computer system
US7853508B2 (en) 2003-05-19 2010-12-14 Serena Software, Inc. Method and system for object-oriented management of multi-dimensional data
JP2004362223A (ja) 2003-06-04 2004-12-24 Hitachi Ltd Information mining system
US7926103B2 (en) 2003-06-05 2011-04-12 Hewlett-Packard Development Company, L.P. System and method for preventing replay attacks
US7836391B2 (en) 2003-06-10 2010-11-16 Google Inc. Document search engine including highlighting of confident results
US9026901B2 (en) 2003-06-20 2015-05-05 International Business Machines Corporation Viewing annotations across multiple applications
US8321470B2 (en) 2003-06-20 2012-11-27 International Business Machines Corporation Heterogeneous multi-level extendable indexing for general purpose annotation systems
US7162473B2 (en) 2003-06-26 2007-01-09 Microsoft Corporation Method and system for usage analyzer that determines user accessed sources, indexes data subsets, and associated metadata, processing implicit queries based on potential interest to users
US7739588B2 (en) 2003-06-27 2010-06-15 Microsoft Corporation Leveraging markup language data for semantically labeling text strings and data and for providing actions based on semantically labeled text strings and data
US7565425B2 (en) 2003-07-02 2009-07-21 Amazon Technologies, Inc. Server architecture and methods for persistently storing and serving event data
WO2005008358A2 (en) 2003-07-22 2005-01-27 Kinor Technologies Inc. Information access using ontologies
WO2005010727A2 (en) 2003-07-23 2005-02-03 Praedea Solutions, Inc. Extracting data from semi-structured text documents
US7814093B2 (en) 2003-07-25 2010-10-12 Microsoft Corporation Method and system for building a report for execution against a data store
US7444515B2 (en) 2003-08-14 2008-10-28 Washington University Method and apparatus for detecting predefined signatures in packet payload using Bloom filters
CA2536265C (en) 2003-08-21 2012-11-13 Idilia Inc. System and method for processing a query
US7174328B2 (en) 2003-09-02 2007-02-06 International Business Machines Corp. Selective path signatures for query processing over a hierarchical tagged data structure
US7409406B2 (en) 2003-09-08 2008-08-05 International Business Machines Corporation Uniform search system and method for selectively sharing distributed access-controlled documents
US20050057566A1 (en) 2003-09-11 2005-03-17 International Business Machines Corporation Rich graphic visualization generation from abstract data representation
US7644076B1 (en) 2003-09-12 2010-01-05 Teradata Us, Inc. Clustering strings using N-grams
US7236982B2 (en) 2003-09-15 2007-06-26 Pic Web Services, Inc. Computer systems and methods for platform independent presentation design
US7496560B2 (en) 2003-09-23 2009-02-24 Amazon Technologies, Inc. Personalized searchable library with highlighting capabilities
US7158980B2 (en) 2003-10-02 2007-01-02 Acer Incorporated Method and apparatus for computerized extracting of scheduling information from a natural language e-mail
AU2003290397A1 (en) 2003-10-15 2005-04-27 Dharamdas Gautam Goradia Interactive wisdom system
KR100533810B1 (ko) 2003-10-16 2005-12-07 한국전자통신연구원 Semi-automatic method for constructing a knowledge base for an encyclopedia question-answering system
US7620624B2 (en) 2003-10-17 2009-11-17 Yahoo! Inc. Systems and methods for indexing content for fast and scalable retrieval
US20050083413A1 (en) 2003-10-20 2005-04-21 Logicalis Method, system, apparatus, and machine-readable medium for use in connection with a server that uses images or audio for initiating remote function calls
GB0325626D0 (en) 2003-11-03 2003-12-10 Infoshare Ltd Data aggregation
US20050108630A1 (en) 2003-11-19 2005-05-19 Wasson Mark D. Extraction of facts from text
US7512553B2 (en) 2003-12-05 2009-03-31 International Business Machines Corporation System for automated part-number mapping
US8150824B2 (en) 2003-12-31 2012-04-03 Google Inc. Systems and methods for direct navigation to specific portion of target document
US20050149851A1 (en) 2003-12-31 2005-07-07 Google Inc. Generating hyperlinks and anchor text in HTML and non-HTML documents
US7424467B2 (en) 2004-01-26 2008-09-09 International Business Machines Corporation Architecture for an indexer with fixed width sort and variable width sort
US20050187898A1 (en) 2004-02-05 2005-08-25 Nec Laboratories America, Inc. Data Lookup architecture
WO2005083597A1 (en) 2004-02-20 2005-09-09 Dow Jones Reuters Business Interactive, Llc Intelligent search and retrieval system and method
US7756823B2 (en) 2004-03-26 2010-07-13 Lockheed Martin Corporation Dynamic reference repository
US20050216464A1 (en) 2004-03-27 2005-09-29 Microsoft Corporation Automated authoring tool and method to facilitate inclusion of maps and other geographical data into travelogues
US20050219929A1 (en) 2004-03-30 2005-10-06 Navas Julio C Method and apparatus achieving memory and transmission overhead reductions in a content routing network
US7725498B2 (en) 2004-04-22 2010-05-25 International Business Machines Corporation Techniques for identifying mergeable data
US7260573B1 (en) 2004-05-17 2007-08-21 Google Inc. Personalizing anchor text scores in a search engine
US20050268212A1 (en) 2004-05-28 2005-12-01 Michael Dagel System, apparatus, and method for desktop-based creation and publication of a periodic community newsletter
US7716225B1 (en) 2004-06-17 2010-05-11 Google Inc. Ranking documents based on user behavior and/or feature data
US7685112B2 (en) 2004-06-17 2010-03-23 The Regents Of The University Of California Method and apparatus for retrieving and indexing hidden pages
US8051207B2 (en) 2004-06-25 2011-11-01 Citrix Systems, Inc. Inferring server state in s stateless communication protocol
US7493335B2 (en) 2004-07-02 2009-02-17 Graphlogic Inc. Object process graph relational database interface
US20060020582A1 (en) 2004-07-22 2006-01-26 International Business Machines Corporation Method and system for processing abstract derived entities defined in a data abstraction model
US20060036504A1 (en) 2004-08-11 2006-02-16 Allocca William W Dynamically classifying items for international delivery
US7809695B2 (en) 2004-08-23 2010-10-05 Thomson Reuters Global Resources Information retrieval systems with duplicate document detection and presentation functions
US20060047691A1 (en) 2004-08-31 2006-03-02 Microsoft Corporation Creating a document index from a flex- and Yacc-generated named entity recognizer
US20060053175A1 (en) 2004-09-03 2006-03-09 Biowisdom Limited System and method for creating, editing, and utilizing one or more rules for multi-relational ontology creation and maintenance
US20060053171A1 (en) 2004-09-03 2006-03-09 Biowisdom Limited System and method for curating one or more multi-relational ontologies
US20060074910A1 (en) 2004-09-17 2006-04-06 Become, Inc. Systems and methods of retrieving topic specific information
JP4587756B2 (ja) 2004-09-21 2010-11-24 ルネサスエレクトロニクス株式会社 Semiconductor integrated circuit device
US20060064411A1 (en) 2004-09-22 2006-03-23 William Gross Search engine using user intent
US7809763B2 (en) 2004-10-15 2010-10-05 Oracle International Corporation Method(s) for updating database object metadata
US20060085386A1 (en) 2004-10-19 2006-04-20 Microsoft Corporation Two pass calculation to optimize formula calculations for a spreadsheet
US20060149700A1 (en) 2004-11-11 2006-07-06 Gladish Randolph J System and method for automatic geospatial web network generation via metadata transformation
US7822768B2 (en) 2004-11-23 2010-10-26 International Business Machines Corporation System and method for automating data normalization using text analytics
US9137115B2 (en) 2004-12-06 2015-09-15 Bmc Software, Inc. System and method for resource reconciliation in an enterprise management system
US20060167991A1 (en) 2004-12-16 2006-07-27 Heikes Brian D Buddy list filtering
US20060143227A1 (en) 2004-12-27 2006-06-29 Helm Martin W System and method for persisting software objects
US8719779B2 (en) 2004-12-28 2014-05-06 Sap Ag Data object association based on graph theory techniques
US7769579B2 (en) 2005-05-31 2010-08-03 Google Inc. Learning facts from semi-structured text
US7464090B2 (en) 2006-01-27 2008-12-09 Google Inc. Object categorization for information extraction
US7672971B2 (en) 2006-02-17 2010-03-02 Google Inc. Modular architecture for entity normalization
US7685136B2 (en) 2005-01-12 2010-03-23 International Business Machines Corporation Method, system and program product for managing document summary information
US20060173824A1 (en) 2005-02-01 2006-08-03 Metalincs Corporation Electronic communication analysis and visualization
EP1859378A2 (en) 2005-03-03 2007-11-28 Washington University Method and apparatus for performing biosequence similarity searching
US7587387B2 (en) 2005-03-31 2009-09-08 Google Inc. User interface for facts query engine with snippets from information sources that include query terms and answer terms
US7953720B1 (en) * 2005-03-31 2011-05-31 Google Inc. Selecting the best answer to a fact query from among a set of potential answers
US9208229B2 (en) 2005-03-31 2015-12-08 Google Inc. Anchor text summarization for corroboration
US20060238919A1 (en) 2005-04-20 2006-10-26 The Boeing Company Adaptive data cleaning
US20060248456A1 (en) 2005-05-02 2006-11-02 Ibm Corporation Assigning a publication date for at least one electronic document
US7809722B2 (en) 2005-05-09 2010-10-05 Like.Com System and method for enabling search and retrieval from image files based on recognized information
US20060259462A1 (en) 2005-05-12 2006-11-16 Sybase, Inc. System and Methodology for Real-time Content Aggregation and Syndication
US7590647B2 (en) 2005-05-27 2009-09-15 Rage Frameworks, Inc Method for extracting, interpreting and standardizing tabular data from unstructured documents
US20060277169A1 (en) 2005-06-02 2006-12-07 Lunt Tracy T Using the quantity of electronically readable text to generate a derivative attribute for an electronic file
US20080005064A1 (en) 2005-06-28 2008-01-03 Yahoo! Inc. Apparatus and method for content annotation and conditional annotation retrieval in a search context
US7630977B2 (en) 2005-06-29 2009-12-08 Xerox Corporation Categorization including dependencies between different category systems
US20070005593A1 (en) 2005-06-30 2007-01-04 Microsoft Corporation Attribute-based data retrieval and association
US20070022085A1 (en) 2005-07-22 2007-01-25 Parashuram Kulkarni Techniques for unsupervised web content discovery and automated query generation for crawling the hidden web
CA2545232A1 (en) 2005-07-29 2007-01-29 Cognos Incorporated Method and system for creating a taxonomy from business-oriented metadata content
US8666928B2 (en) 2005-08-01 2014-03-04 Evi Technologies Limited Knowledge repository
US7797282B1 (en) 2005-09-29 2010-09-14 Hewlett-Packard Development Company, L.P. System and method for modifying a training set
US7493317B2 (en) 2005-10-20 2009-02-17 Omniture, Inc. Result-based triggering for presentation of online content
US7730013B2 (en) 2005-10-25 2010-06-01 International Business Machines Corporation System and method for searching dates efficiently in a collection of web documents
KR100755678B1 (ko) 2005-10-28 2007-09-05 삼성전자주식회사 Apparatus and method for detecting named entities
US7574449B2 (en) 2005-12-02 2009-08-11 Microsoft Corporation Content matching
US8260785B2 (en) 2006-02-17 2012-09-04 Google Inc. Automatic object reference identification and linking in a browseable fact repository
US7991797B2 (en) 2006-02-17 2011-08-02 Google Inc. ID persistence through normalization
US20070179965A1 (en) 2006-01-27 2007-08-02 Hogue Andrew W Designating data objects for analysis
US7774328B2 (en) 2006-02-17 2010-08-10 Google Inc. Browseable fact repository
US7454398B2 (en) 2006-02-17 2008-11-18 Google Inc. Support for object search
US7555471B2 (en) 2006-01-27 2009-06-30 Google Inc. Data object visualization
US8954426B2 (en) 2006-02-17 2015-02-10 Google Inc. Query language
US8700568B2 (en) 2006-02-17 2014-04-15 Google Inc. Entity normalization via name normalization
US9286404B2 (en) 2006-06-28 2016-03-15 Nokia Technologies Oy Methods of systems using geographic meta-metadata in information retrieval and document displays
US9176984B2 (en) 2006-07-31 2015-11-03 Ricoh Co., Ltd Mixed media reality retrieval of differentially-weighted links
US7685201B2 (en) 2006-09-08 2010-03-23 Microsoft Corporation Person disambiguation using name entity extraction-based clustering
US8458207B2 (en) 2006-09-15 2013-06-04 Microsoft Corporation Using anchor text to provide context
US7698336B2 (en) 2006-10-26 2010-04-13 Microsoft Corporation Associating geographic-related information with objects
US8108501B2 (en) 2006-11-01 2012-01-31 Yahoo! Inc. Searching and route mapping based on a social network, location, and time
US7917154B2 (en) 2006-11-01 2011-03-29 Yahoo! Inc. Determining mobile content for a social network based on location and time
KR100865973B1 (ko) 2007-02-08 2008-10-30 (주)올라웍스 Method for searching for a specific person in a video, and method and apparatus for generating a copyright report on a specific person in a video
US9058370B2 (en) 2007-02-27 2015-06-16 International Business Machines Corporation Method, system and program product for defining imports into and exports out from a database system using spread sheets by use of a control language
US20080267504A1 (en) 2007-04-24 2008-10-30 Nokia Corporation Method, device and computer program product for integrating code-based and optical character recognition technologies into a mobile visual search
US8316007B2 (en) 2007-06-28 2012-11-20 Oracle International Corporation Automatically finding acronyms and synonyms in a corpus
US8024281B2 (en) 2008-02-29 2011-09-20 Red Hat, Inc. Alpha node hashing in a rule engine

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103092945A (zh) * 2013-01-11 2013-05-08 北京百度网讯科技有限公司 Search method and device based on interface return
CN103092945B (zh) * 2013-01-11 2019-11-26 北京百度网讯科技有限公司 Search method and device based on interface return

Also Published As

Publication number Publication date
CA2603085C (en) 2013-12-31
US20140129538A1 (en) 2014-05-08
CN101185074B (zh) 2010-06-23
US20120278301A1 (en) 2012-11-01
US8224802B2 (en) 2012-07-17
US20060224582A1 (en) 2006-10-05
US7587387B2 (en) 2009-09-08
JP5074378B2 (ja) 2012-11-14
EP1872283A1 (en) 2008-01-02
US8065290B2 (en) 2011-11-22
US20110295888A1 (en) 2011-12-01
JP2008535095A (ja) 2008-08-28
WO2006104951A1 (en) 2006-10-05
CA2603085A1 (en) 2006-10-05
US8650175B2 (en) 2014-02-11
US20090313247A1 (en) 2009-12-17

Similar Documents

Publication Publication Date Title
CN101185074B (zh) User interface for facts query engine with snippets from information sources that include query terms and answer terms
US9348871B2 (en) Method and system for assessing relevant properties of work contexts for use by information services
US7895595B2 (en) Automatic method and system for formulating and transforming representations of context used by information services
CA2583042C (en) Providing information relating to a document
US20010049674A1 (en) Methods and systems for enabling efficient employment recruiting
US20080306968A1 (en) Method and system for extracting, analyzing, storing, comparing and reporting on data stored in web and/or other network repositories and apparatus to detect, prevent and obfuscate information removal from information servers
Lal et al. Comparison of ranking algorithms with dataspace
Grossman et al. IIT Intranet Mediator: Bringing data together on a corporate intranet
Rowley Knowledge organisation in a Web‐based environment
Choudhary A comparative analysis of various web search engines
EP1254413A2 (en) System and method for database searching
Nolan et al. Exploring Data Science Jobs with Web Scraping and Text Mining
Arnold dpf
Khalil et al. A COMPARISON OF SEARCH ENGINE's

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CP01 Change in the name or title of a patent holder
CP01 Change in the name or title of a patent holder

Address after: California, USA

Patentee after: Google LLC

Address before: California, USA

Patentee before: Google Inc.