US6351755B1 - System and method for associating an extensible set of data with documents downloaded by a web crawler - Google Patents
System and method for associating an extensible set of data with documents downloaded by a web crawler Download PDFInfo
- Publication number
- US6351755B1 US6351755B1 US09/433,006 US43300699A US6351755B1 US 6351755 B1 US6351755 B1 US 6351755B1 US 43300699 A US43300699 A US 43300699A US 6351755 B1 US6351755 B1 US 6351755B1
- Authority
- US
- United States
- Prior art keywords
- queue element
- document
- queue
- records
- value
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/951—Indexing; Web crawling techniques
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Transfer Between Computers (AREA)
Abstract
Description
TABLE 1 | |||
Priority | Priority Weight | ||
1 | 32 | ||
2 | 16 | ||
3 | 8 | ||
4 | 4 | ||
5 | 2 | ||
6 | 1 | ||
TABLE 2 |
Mapping a Random Value z to a Priority Level |
Priority Level | Range of z For |
||
1 | 0.0 to 0.5079 | ||
2 | 0.5080 to 0.7619 | ||
3 | 0.7620 to 0.8888 | ||
4 | 0.8889 to 0.9524 | ||
5 | 0.9525 to 0.9841 | ||
6 | 0.9842 to 1.0000 | ||
Claims (29)
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/433,006 US6351755B1 (en) | 1999-11-02 | 1999-11-02 | System and method for associating an extensible set of data with documents downloaded by a web crawler |
PCT/US2000/029496 WO2001033428A1 (en) | 1999-11-02 | 2000-10-26 | System and method for associating an extensible set of data with documents downloaded by a web crawler |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/433,006 US6351755B1 (en) | 1999-11-02 | 1999-11-02 | System and method for associating an extensible set of data with documents downloaded by a web crawler |
Publications (1)
Publication Number | Publication Date |
---|---|
US6351755B1 true US6351755B1 (en) | 2002-02-26 |
Family
ID=23718477
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US09/433,006 Expired - Lifetime US6351755B1 (en) | 1999-11-02 | 1999-11-02 | System and method for associating an extensible set of data with documents downloaded by a web crawler |
Country Status (2)
Country | Link |
---|---|
US (1) | US6351755B1 (en) |
WO (1) | WO2001033428A1 (en) |
Cited By (115)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20020016730A1 (en) * | 2000-04-25 | 2002-02-07 | Icplanet Acquisition Corporation | Method,system, and computer program product for employment market statistics generation and analysis |
US20020016809A1 (en) * | 2000-04-25 | 2002-02-07 | Icplanet Acquisition Corporation | System and method for scheduling execution of cross-platform computer processes |
US6424966B1 (en) * | 1998-06-30 | 2002-07-23 | Microsoft Corporation | Synchronizing crawler with notification source |
US20020099737A1 (en) * | 2000-11-21 | 2002-07-25 | Porter Charles A. | Metadata quality improvement |
US20020198962A1 (en) * | 2001-06-21 | 2002-12-26 | Horn Frederic A. | Method, system, and computer program product for distributing a stored URL and web document set |
US20030014512A1 (en) * | 2001-07-10 | 2003-01-16 | Murata Kikai Kabushiki Kaisha | Communication apparatus and structured document management apparatus |
US20030041077A1 (en) * | 2001-01-24 | 2003-02-27 | Davis Russell T. | RDX enhancement of system and method for implementing reusable data markup language (RDL) |
US20030056138A1 (en) * | 2001-08-22 | 2003-03-20 | Wenge Ren | Method and system for implementing OSPF redundancy |
US6547829B1 (en) * | 1999-06-30 | 2003-04-15 | Microsoft Corporation | Method and system for detecting duplicate documents in web crawls |
US20030158835A1 (en) * | 2002-02-19 | 2003-08-21 | International Business Machines Corporation | Plug-in parsers for configuring search engine crawler |
US20030172134A1 (en) * | 2000-09-11 | 2003-09-11 | Konstantin Zervas | Method for dynamic caching |
US6681255B1 (en) * | 2000-04-19 | 2004-01-20 | Icplanet Corporation | Regulating rates of requests by a spider engine to web sites by creating instances of a timing module |
US20040045040A1 (en) * | 2000-10-24 | 2004-03-04 | Hayward Monte Duane | Method of sizing an embedded media player page |
US20040047596A1 (en) * | 2000-10-31 | 2004-03-11 | Louis Chevallier | Method for processing video data designed for display on a screen and device therefor |
US20040064500A1 (en) * | 2001-11-20 | 2004-04-01 | Kolar Jennifer Lynn | System and method for unified extraction of media objects |
US20040098378A1 (en) * | 2002-11-19 | 2004-05-20 | Gur Kimchi | Distributed client server index update system and method |
US20040201619A1 (en) * | 2001-05-23 | 2004-10-14 | Konstantin Zervas | Method for optimizing utilization of client capacity |
US20040260679A1 (en) * | 2003-06-19 | 2004-12-23 | International Business Machines Corporation | Personalized indexing and searching for information in a distributed data processing system |
US20040260680A1 (en) * | 2003-06-19 | 2004-12-23 | International Business Machines Corporation | Personalized indexing and searching for information in a distributed data processing system |
US6883135B1 (en) * | 2000-01-28 | 2005-04-19 | Microsoft Corporation | Proxy server using a statistical model |
US20050086216A1 (en) * | 2000-02-17 | 2005-04-21 | E-Numerate Solutions, Inc. | RDL search engine |
US20050086191A1 (en) * | 2001-03-23 | 2005-04-21 | Lars Werner | Method for retrieving documents |
US6892196B1 (en) | 1999-12-22 | 2005-05-10 | Accenture Llp | System, method and article of manufacture for a user programmable diary interface link |
US6910029B1 (en) * | 2000-02-22 | 2005-06-21 | International Business Machines Corporation | System for weighted indexing of hierarchical documents |
US6931397B1 (en) * | 2000-02-11 | 2005-08-16 | International Business Machines Corporation | System and method for automatic generation of dynamic search abstracts contain metadata by crawler |
US6941379B1 (en) * | 2000-05-23 | 2005-09-06 | International Business Machines Corporation | Congestion avoidance for threads in servers |
US20050198042A1 (en) * | 1999-05-21 | 2005-09-08 | E-Numerate Solutions, Inc. | Chart view for reusable data markup language |
US20050210006A1 (en) * | 2004-03-18 | 2005-09-22 | Microsoft Corporation | Field weighting in text searching |
US20060069982A1 (en) * | 2004-09-30 | 2006-03-30 | Microsoft Corporation | Click distance determination |
US20060074871A1 (en) * | 2004-09-30 | 2006-04-06 | Microsoft Corporation | System and method for incorporating anchor text into ranking search results |
US20060074903A1 (en) * | 2004-09-30 | 2006-04-06 | Microsoft Corporation | System and method for ranking search results using click distance |
US20060129536A1 (en) * | 2000-04-18 | 2006-06-15 | Foulger Michael G | Interactive intelligent searching with executable suggestions |
US20060136411A1 (en) * | 2004-12-21 | 2006-06-22 | Microsoft Corporation | Ranking search results using feature extraction |
US20060200460A1 (en) * | 2005-03-03 | 2006-09-07 | Microsoft Corporation | System and method for ranking search results using file types |
US7139747B1 (en) * | 2000-11-03 | 2006-11-21 | Hewlett-Packard Development Company, L.P. | System and method for distributed web crawling |
US20060294100A1 (en) * | 2005-03-03 | 2006-12-28 | Microsoft Corporation | Ranking search results using language types |
US20070016562A1 (en) * | 2000-04-25 | 2007-01-18 | Cooper Jeremy S | System and method for proximity searching position information using a proximity parameter |
US20070022170A1 (en) * | 2000-04-25 | 2007-01-25 | Foulger Michael G | System and method related to generating an email campaign |
US20070038622A1 (en) * | 2005-08-15 | 2007-02-15 | Microsoft Corporation | Method ranking search results using biased click distance |
US20070113091A1 (en) * | 2005-11-16 | 2007-05-17 | Sun Microsystems, Inc. | Extensible fingerprinting functions and content addressed storage system using the same |
US20070118441A1 (en) * | 2005-11-22 | 2007-05-24 | Robert Chatwani | Editable electronic catalogs |
US20070130205A1 (en) * | 2005-12-05 | 2007-06-07 | Microsoft Corporation | Metadata driven user interface |
US20070130207A1 (en) * | 2005-11-22 | 2007-06-07 | Ebay Inc. | System and method for managing shared collections |
US20070150804A1 (en) * | 2000-04-18 | 2007-06-28 | Kforce Inc. | Method, system, and computer program product for propagating remotely configurable posters of host site content |
US20070239701A1 (en) * | 2006-03-29 | 2007-10-11 | International Business Machines Corporation | System and method for prioritizing websites during a webcrawling process |
US20070255601A1 (en) * | 2006-04-27 | 2007-11-01 | Guidewire Software, Inc. | Insurance policy revisioning method and apparatus |
US20070263019A1 (en) * | 2006-05-09 | 2007-11-15 | Naohiro Furukawa | Information management method and information management system |
US7305610B1 (en) * | 2000-04-06 | 2007-12-04 | Google, Inc. | Distributed crawling of hyperlinked documents |
US20080012569A1 (en) * | 2005-05-21 | 2008-01-17 | Hall David R | Downhole Coils |
US20080033806A1 (en) * | 2006-07-20 | 2008-02-07 | Howe Karen N | Targeted advertising for playlists based upon search queries |
US7356768B1 (en) * | 2002-11-27 | 2008-04-08 | Adobe Systems Incorporated | Using document templates to assemble a collection of documents |
US7401067B2 (en) | 1997-11-14 | 2008-07-15 | Adobe Systems Incorporated | Retrieving documents transitively linked to an initial document |
US7421648B1 (en) | 1999-05-21 | 2008-09-02 | E-Numerate Solutions, Inc. | Reusable data markup language |
US20080222091A1 (en) * | 1997-11-14 | 2008-09-11 | Adobe Systems Incorporated | Retrieving Documents Transitively Linked to an Initial Document |
US20080282139A1 (en) * | 1999-05-21 | 2008-11-13 | E-Numerate Solutions, Inc. | Tree view for reusable data markup language |
US20090055436A1 (en) * | 2007-08-20 | 2009-02-26 | Olakunle Olaniyi Ayeni | System and Method for Integrating on Demand/Pull and Push Flow of Goods-and-Services Meta-Data, Including Coupon and Advertising, with Mobile and Wireless Applications |
US20090106221A1 (en) * | 2007-10-18 | 2009-04-23 | Microsoft Corporation | Ranking and Providing Search Results Based In Part On A Number Of Click-Through Features |
US20090106235A1 (en) * | 2007-10-18 | 2009-04-23 | Microsoft Corporation | Document Length as a Static Relevance Feature for Ranking Search Results |
US20090106223A1 (en) * | 2007-10-18 | 2009-04-23 | Microsoft Corporation | Enterprise relevancy ranking using a neural network |
US20090112720A1 (en) * | 2007-10-31 | 2009-04-30 | Tyler Close | Identifying And Displaying Messages Containing An Identifier |
US20090119280A1 (en) * | 2007-11-02 | 2009-05-07 | Christopher Waters | Hosted searching of private local area network information with support for add-on applications |
US20090248622A1 (en) * | 2008-03-26 | 2009-10-01 | International Business Machines Corporation | Method and device for indexing resource content in computer networks |
US20090259651A1 (en) * | 2008-04-11 | 2009-10-15 | Microsoft Corporation | Search results ranking using editing distance and document information |
US20100017403A1 (en) * | 2004-09-27 | 2010-01-21 | Microsoft Corporation | System and method for scoping searches using index keys |
US20100017850A1 (en) * | 2008-07-21 | 2010-01-21 | Workshare Technology, Inc. | Methods and systems to fingerprint textual information using word runs |
US20100064347A1 (en) * | 2008-09-11 | 2010-03-11 | Workshare Technology, Inc. | Methods and systems for protect agents using distributed lightweight fingerprints |
US7707157B1 (en) | 2004-03-25 | 2010-04-27 | Google Inc. | Document near-duplicate detection |
US7725452B1 (en) * | 2003-07-03 | 2010-05-25 | Google Inc. | Scheduler for search engine crawler |
US20100250516A1 (en) * | 2009-03-28 | 2010-09-30 | Microsoft Corporation | Method and apparatus for web crawling |
US20100299727A1 (en) * | 2008-11-18 | 2010-11-25 | Workshare Technology, Inc. | Methods and systems for exact data match filtering |
US7877369B2 (en) | 2007-11-02 | 2011-01-25 | Paglo Labs, Inc. | Hosted searching of private local area network information |
US20110022960A1 (en) * | 2009-07-27 | 2011-01-27 | Workshare Technology, Inc. | Methods and systems for comparing presentation slide decks |
US20110071859A1 (en) * | 2009-09-24 | 2011-03-24 | Guidewire Software, Inc. | Method and Apparatus for Pricing Insurance Policies |
US20110153589A1 (en) * | 2009-12-21 | 2011-06-23 | Ganesh Vaitheeswaran | Document indexing based on categorization and prioritization |
US7987172B1 (en) | 2004-08-30 | 2011-07-26 | Google Inc. | Minimizing visibility of stale content in web searching including revising web crawl intervals of documents |
US8042112B1 (en) | 2003-07-03 | 2011-10-18 | Google Inc. | Scheduler for search engine crawler |
US8140505B1 (en) * | 2005-03-31 | 2012-03-20 | Google Inc. | Near-duplicate document detection for web crawling |
US8285703B1 (en) * | 2009-05-13 | 2012-10-09 | Softek Solutions, Inc. | Document crawling systems and methods |
US20120317075A1 (en) * | 2011-06-13 | 2012-12-13 | Suresh Pasumarthi | Synchronizing primary and secondary repositories |
US20130144967A1 (en) * | 2011-12-05 | 2013-06-06 | International Business Machines Corporation | Scalable Queuing System |
US20130144858A1 (en) * | 2011-01-21 | 2013-06-06 | Google Inc. | Scheduling resource crawls |
US8577610B2 (en) | 2011-12-21 | 2013-11-05 | Telenav Inc. | Navigation system with point of interest harvesting mechanism and method of operation thereof |
US8595475B2 (en) | 2000-10-24 | 2013-11-26 | AOL, Inc. | Method of disseminating advertisements using an embedded media player page |
US8620020B2 (en) | 2008-11-20 | 2013-12-31 | Workshare Technology, Inc. | Methods and systems for preventing unauthorized disclosure of secure information using image fingerprinting |
US8666964B1 (en) | 2005-04-25 | 2014-03-04 | Google Inc. | Managing items in crawl schedule |
US8676783B1 (en) * | 2011-06-28 | 2014-03-18 | Google Inc. | Method and apparatus for managing a backlog of pending URL crawls |
US8738635B2 (en) | 2010-06-01 | 2014-05-27 | Microsoft Corporation | Detection of junk in search result ranking |
US8793706B2 (en) | 2010-12-16 | 2014-07-29 | Microsoft Corporation | Metadata-based eventing supporting operations on data |
US9170990B2 (en) | 2013-03-14 | 2015-10-27 | Workshare Limited | Method and system for document retrieval with selective document comparison |
US9262383B2 (en) | 1999-05-21 | 2016-02-16 | E-Numerate Solutions, Inc. | System, method, and computer program product for processing a markup document |
US9262384B2 (en) | 1999-05-21 | 2016-02-16 | E-Numerate Solutions, Inc. | Markup language system, method, and computer program product |
US9268748B2 (en) | 1999-05-21 | 2016-02-23 | E-Numerate Solutions, Inc. | System, method, and computer program product for outputting markup language documents |
US9495462B2 (en) | 2012-01-27 | 2016-11-15 | Microsoft Technology Licensing, Llc | Re-ranking search results |
US9513961B1 (en) * | 2014-04-02 | 2016-12-06 | Google Inc. | Monitoring application loading |
US9613340B2 (en) | 2011-06-14 | 2017-04-04 | Workshare Ltd. | Method and system for shared document approval |
US20170228588A1 (en) * | 2012-08-16 | 2017-08-10 | Groupon, Inc. | Method, apparatus, and computer program product for classification of documents |
US9922022B2 (en) * | 2016-02-01 | 2018-03-20 | Microsoft Technology Licensing, Llc. | Automatic template generation based on previous documents |
US9948676B2 (en) | 2013-07-25 | 2018-04-17 | Workshare, Ltd. | System and method for securing documents prior to transmission |
US10025759B2 (en) | 2010-11-29 | 2018-07-17 | Workshare Technology, Inc. | Methods and systems for monitoring documents exchanged over email applications |
US10133723B2 (en) | 2014-12-29 | 2018-11-20 | Workshare Ltd. | System and method for determining document version geneology |
US10497051B2 (en) | 2005-03-30 | 2019-12-03 | Ebay Inc. | Methods and systems to browse data items |
US10574729B2 (en) | 2011-06-08 | 2020-02-25 | Workshare Ltd. | System and method for cross platform document sharing |
US10783326B2 (en) | 2013-03-14 | 2020-09-22 | Workshare, Ltd. | System for tracking changes in a collaborative document editing environment |
US10839149B2 (en) | 2016-02-01 | 2020-11-17 | Microsoft Technology Licensing, Llc. | Generating templates from user's past documents |
US10880359B2 (en) | 2011-12-21 | 2020-12-29 | Workshare, Ltd. | System and method for cross platform document sharing |
US10911492B2 (en) | 2013-07-25 | 2021-02-02 | Workshare Ltd. | System and method for securing documents prior to transmission |
US10963584B2 (en) | 2011-06-08 | 2021-03-30 | Workshare Ltd. | Method and system for collaborative editing of a remotely stored document |
US11030163B2 (en) | 2011-11-29 | 2021-06-08 | Workshare, Ltd. | System for tracking and displaying changes in a set of related electronic documents |
CN113065055A (en) * | 2021-04-21 | 2021-07-02 | 平安国际智慧城市科技股份有限公司 | News information capturing method and device, electronic equipment and storage medium |
US11182551B2 (en) | 2014-12-29 | 2021-11-23 | Workshare Ltd. | System and method for determining document version geneology |
US11188978B2 (en) | 2002-12-31 | 2021-11-30 | Ebay Inc. | Method and system to generate a listing in a network-based commerce system |
US11263679B2 (en) | 2009-10-23 | 2022-03-01 | Ebay Inc. | Product identification using multiple services |
US11347579B1 (en) | 2021-04-29 | 2022-05-31 | Bank Of America Corporation | Instinctive slither application assessment engine |
US11567907B2 (en) | 2013-03-14 | 2023-01-31 | Workshare, Ltd. | Method and system for comparing document versions encoded in a hierarchical representation |
US11763013B2 (en) | 2015-08-07 | 2023-09-19 | Workshare, Ltd. | Transaction document management system and method |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1269760B1 (en) * | 2000-03-02 | 2013-03-20 | TiVo, Inc. | System and method for internet access to personal television service |
CN109213912A (en) * | 2018-08-16 | 2019-01-15 | 北京神州泰岳软件股份有限公司 | A kind of method and network data crawl dispatching device of crawl network data |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5748954A (en) * | 1995-06-05 | 1998-05-05 | Carnegie Mellon University | Method for searching a queued and ranked constructed catalog of files stored on a network |
US5832494A (en) * | 1993-06-14 | 1998-11-03 | Libertech, Inc. | Method and apparatus for indexing, searching and displaying data |
US5875446A (en) * | 1997-02-24 | 1999-02-23 | International Business Machines Corporation | System and method for hierarchically grouping and ranking a set of objects in a query context based on one or more relationships |
US5944783A (en) * | 1997-07-29 | 1999-08-31 | Lincom Corporation | Apparatus and method for data transfers through software agents using client-to-server and peer-to-peer transfers |
US6006217A (en) * | 1997-11-07 | 1999-12-21 | International Business Machines Corporation | Technique for providing enhanced relevance information for documents retrieved in a multi database search |
US6038610A (en) * | 1996-07-17 | 2000-03-14 | Microsoft Corporation | Storage of sitemaps at server sites for holding information regarding content |
US6094649A (en) * | 1997-12-22 | 2000-07-25 | Partnet, Inc. | Keyword searches of structured databases |
US6145003A (en) * | 1997-12-17 | 2000-11-07 | Microsoft Corporation | Method of web crawling utilizing address mapping |
-
1999
- 1999-11-02 US US09/433,006 patent/US6351755B1/en not_active Expired - Lifetime
-
2000
- 2000-10-26 WO PCT/US2000/029496 patent/WO2001033428A1/en active Application Filing
Patent Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5832494A (en) * | 1993-06-14 | 1998-11-03 | Libertech, Inc. | Method and apparatus for indexing, searching and displaying data |
US5748954A (en) * | 1995-06-05 | 1998-05-05 | Carnegie Mellon University | Method for searching a queued and ranked constructed catalog of files stored on a network |
US6038610A (en) * | 1996-07-17 | 2000-03-14 | Microsoft Corporation | Storage of sitemaps at server sites for holding information regarding content |
US5875446A (en) * | 1997-02-24 | 1999-02-23 | International Business Machines Corporation | System and method for hierarchically grouping and ranking a set of objects in a query context based on one or more relationships |
US5944783A (en) * | 1997-07-29 | 1999-08-31 | Lincom Corporation | Apparatus and method for data transfers through software agents using client-to-server and peer-to-peer transfers |
US6006217A (en) * | 1997-11-07 | 1999-12-21 | International Business Machines Corporation | Technique for providing enhanced relevance information for documents retrieved in a multi database search |
US6145003A (en) * | 1997-12-17 | 2000-11-07 | Microsoft Corporation | Method of web crawling utilizing address mapping |
US6094649A (en) * | 1997-12-22 | 2000-07-25 | Partnet, Inc. | Keyword searches of structured databases |
Cited By (253)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7937409B2 (en) | 1997-11-14 | 2011-05-03 | Adobe Systems Incorporated | Retrieving documents transitively linked to an initial document |
US7401067B2 (en) | 1997-11-14 | 2008-07-15 | Adobe Systems Incorporated | Retrieving documents transitively linked to an initial document |
US20080222091A1 (en) * | 1997-11-14 | 2008-09-11 | Adobe Systems Incorporated | Retrieving Documents Transitively Linked to an Initial Document |
US20080252912A1 (en) * | 1997-11-14 | 2008-10-16 | Adobe Systems Incorporated | Retrieving Documents Transitively Linked To An Initial Document |
US8005843B2 (en) | 1997-11-14 | 2011-08-23 | Adobe Systems Incorporated | Retrieving documents transitively linked to an initial document |
US6424966B1 (en) * | 1998-06-30 | 2002-07-23 | Microsoft Corporation | Synchronizing crawler with notification source |
US8489982B2 (en) | 1999-05-21 | 2013-07-16 | E-Numerate Solutions, Inc. | Reusable data markup language |
US9262383B2 (en) | 1999-05-21 | 2016-02-16 | E-Numerate Solutions, Inc. | System, method, and computer program product for processing a markup document |
US9262384B2 (en) | 1999-05-21 | 2016-02-16 | E-Numerate Solutions, Inc. | Markup language system, method, and computer program product |
US8185816B2 (en) | 1999-05-21 | 2012-05-22 | E-Numerate Solutions, Inc. | Combining reusable data markup language documents |
US7512875B2 (en) | 1999-05-21 | 2009-03-31 | E-Numerate Solutions, Inc. | Chart view for reusable data markup language |
US20090083619A1 (en) * | 1999-05-21 | 2009-03-26 | E-Numerate Solutions, Inc. | Reusable data markup language |
US9268748B2 (en) | 1999-05-21 | 2016-02-23 | E-Numerate Solutions, Inc. | System, method, and computer program product for outputting markup language documents |
US20050198042A1 (en) * | 1999-05-21 | 2005-09-08 | E-Numerate Solutions, Inc. | Chart view for reusable data markup language |
US20080282139A1 (en) * | 1999-05-21 | 2008-11-13 | E-Numerate Solutions, Inc. | Tree view for reusable data markup language |
US7421648B1 (en) | 1999-05-21 | 2008-09-02 | E-Numerate Solutions, Inc. | Reusable data markup language |
US7650355B1 (en) | 1999-05-21 | 2010-01-19 | E-Numerate Solutions, Inc. | Reusable macro markup language |
US6547829B1 (en) * | 1999-06-30 | 2003-04-15 | Microsoft Corporation | Method and system for detecting duplicate documents in web crawls |
US6892196B1 (en) | 1999-12-22 | 2005-05-10 | Accenture Llp | System, method and article of manufacture for a user programmable diary interface link |
US7603616B2 (en) | 2000-01-28 | 2009-10-13 | Microsoft Corporation | Proxy server using a statistical model |
US20050086583A1 (en) * | 2000-01-28 | 2005-04-21 | Microsoft Corporation | Proxy server using a statistical model |
US20050165778A1 (en) * | 2000-01-28 | 2005-07-28 | Microsoft Corporation | Adaptive Web crawling using a statistical model |
US7328401B2 (en) | 2000-01-28 | 2008-02-05 | Microsoft Corporation | Adaptive web crawling using a statistical model |
US6883135B1 (en) * | 2000-01-28 | 2005-04-19 | Microsoft Corporation | Proxy server using a statistical model |
US6931397B1 (en) * | 2000-02-11 | 2005-08-16 | International Business Machines Corporation | System and method for automatic generation of dynamic search abstracts contain metadata by crawler |
US7401076B2 (en) * | 2000-02-17 | 2008-07-15 | E-Numerate Solutions, Inc. | RDL search engine |
US20050086216A1 (en) * | 2000-02-17 | 2005-04-21 | E-Numerate Solutions, Inc. | RDL search engine |
US6910029B1 (en) * | 2000-02-22 | 2005-06-21 | International Business Machines Corporation | System for weighted indexing of hierarchical documents |
US7305610B1 (en) * | 2000-04-06 | 2007-12-04 | Google, Inc. | Distributed crawling of hyperlinked documents |
US8812478B1 (en) | 2000-04-06 | 2014-08-19 | Google Inc. | Distributed crawling of hyperlinked documents |
US8266134B1 (en) | 2000-04-06 | 2012-09-11 | Google Inc. | Distributed crawling of hyperlinked documents |
US20070204219A1 (en) * | 2000-04-18 | 2007-08-30 | Foulger Michael G | Method, system, and computer program product for propagating remotely configurable posters of host site content |
US20100223275A1 (en) * | 2000-04-18 | 2010-09-02 | Foulger Michael G | Interactive Intelligent Searching with Executable Suggestions |
US8219516B2 (en) | 2000-04-18 | 2012-07-10 | Archeron Limited Llc | Interactive intelligent searching with executable suggestions |
US20070150804A1 (en) * | 2000-04-18 | 2007-06-28 | Kforce Inc. | Method, system, and computer program product for propagating remotely configurable posters of host site content |
US7730008B2 (en) | 2000-04-18 | 2010-06-01 | Foulger Michael G | Database interface and database analysis system |
US8266242B2 (en) | 2000-04-18 | 2012-09-11 | Archeron Limited L.L.C. | Method, system, and computer program product for propagating remotely configurable posters of host site content |
US8055605B2 (en) | 2000-04-18 | 2011-11-08 | Archeron Limited Llc | Interactive intelligent searching with executable suggestions |
US20060129536A1 (en) * | 2000-04-18 | 2006-06-15 | Foulger Michael G | Interactive intelligent searching with executable suggestions |
US20040210589A1 (en) * | 2000-04-19 | 2004-10-21 | Cooper Jeremy S. | Regulating rates of requests by a spider engine to web sites by creating instances of a timing module |
US7949748B2 (en) * | 2000-04-19 | 2011-05-24 | Archeron Limited Llc | Timing module for regulating hits by a spidering engine |
US6681255B1 (en) * | 2000-04-19 | 2004-01-20 | Icplanet Corporation | Regulating rates of requests by a spider engine to web sites by creating instances of a timing module |
US20080270604A1 (en) * | 2000-04-19 | 2008-10-30 | Archeron Limited Llc | Timing Module for Regulating Hits by a Spidering Engine |
US7401155B2 (en) | 2000-04-19 | 2008-07-15 | Archeron Limited Llc | Method and system for downloading network data at a controlled data transfer rate |
US20080244027A1 (en) * | 2000-04-25 | 2008-10-02 | Foulger Michael G | System and Method Related to Generating and Tracking an Email Campaign |
US20070022170A1 (en) * | 2000-04-25 | 2007-01-25 | Foulger Michael G | System and method related to generating an email campaign |
US20020016730A1 (en) * | 2000-04-25 | 2002-02-07 | Icplanet Acquisition Corporation | Method,system, and computer program product for employment market statistics generation and analysis |
US7693950B2 (en) | 2000-04-25 | 2010-04-06 | Foulger Michael G | System and method related to generating and tracking an email campaign |
US20020016809A1 (en) * | 2000-04-25 | 2002-02-07 | Icplanet Acquisition Corporation | System and method for scheduling execution of cross-platform computer processes |
US20070016562A1 (en) * | 2000-04-25 | 2007-01-18 | Cooper Jeremy S | System and method for proximity searching position information using a proximity parameter |
US7783621B2 (en) | 2000-04-25 | 2010-08-24 | Cooper Jeremy S | System and method for proximity searching position information using a proximity parameter |
US8156499B2 (en) | 2000-04-25 | 2012-04-10 | Icp Acquisition Corporation | Methods, systems and articles of manufacture for scheduling execution of programs on computers having different operating systems |
US8015047B2 (en) | 2000-04-25 | 2011-09-06 | Archeron Limited Llc | Method, system, and computer program product for employment market statistics generation and analysis |
US20090094541A1 (en) * | 2000-04-25 | 2009-04-09 | Foulger Michael G | Methods, Systems and Computer Program Products for Scheduling Executions of Programs |
US7386594B2 (en) | 2000-04-25 | 2008-06-10 | Archeron Limited Llc | System and method related to generating an email campaign |
US7469405B2 (en) | 2000-04-25 | 2008-12-23 | Kforce Inc. | System and method for scheduling execution of cross-platform computer processes |
US6941379B1 (en) * | 2000-05-23 | 2005-09-06 | International Business Machines Corporation | Congestion avoidance for threads in servers |
US20030172134A1 (en) * | 2000-09-11 | 2003-09-11 | Konstantin Zervas | Method for dynamic caching |
US7613792B2 (en) * | 2000-09-11 | 2009-11-03 | Handmark, Inc. | Method for dynamic caching |
US9595050B2 (en) | 2000-10-24 | 2017-03-14 | Aol Inc. | Method of disseminating advertisements using an embedded media player page |
US9454775B2 (en) | 2000-10-24 | 2016-09-27 | Aol Inc. | Systems and methods for rendering content |
US20040045040A1 (en) * | 2000-10-24 | 2004-03-04 | Hayward Monte Duane | Method of sizing an embedded media player page |
US8595475B2 (en) | 2000-10-24 | 2013-11-26 | AOL, Inc. | Method of disseminating advertisements using an embedded media player page |
US8819404B2 (en) | 2000-10-24 | 2014-08-26 | Aol Inc. | Method of disseminating advertisements using an embedded media player page |
US8918812B2 (en) | 2000-10-24 | 2014-12-23 | Aol Inc. | Method of sizing an embedded media player page |
US20040047596A1 (en) * | 2000-10-31 | 2004-03-11 | Louis Chevallier | Method for processing video data designed for display on a screen and device therefor |
US7139747B1 (en) * | 2000-11-03 | 2006-11-21 | Hewlett-Packard Development Company, L.P. | System and method for distributed web crawling |
US9110931B2 (en) | 2000-11-21 | 2015-08-18 | Microsoft Technology Licensing, Llc | Fuzzy database retrieval |
US20020099737A1 (en) * | 2000-11-21 | 2002-07-25 | Porter Charles A. | Metadata quality improvement |
US7925967B2 (en) | 2000-11-21 | 2011-04-12 | Aol Inc. | Metadata quality improvement |
US9009136B2 (en) | 2000-11-21 | 2015-04-14 | Microsoft Technology Licensing, Llc | Methods and systems for enhancing metadata |
US20050038809A1 (en) * | 2000-11-21 | 2005-02-17 | Abajian Aram Christian | Internet streaming media workflow architecture |
US10210184B2 (en) | 2000-11-21 | 2019-02-19 | Microsoft Technology Licensing, Llc | Methods and systems for enhancing metadata |
US8209311B2 (en) | 2000-11-21 | 2012-06-26 | Aol Inc. | Methods and systems for grouping uniform resource locators based on masks |
US20050177568A1 (en) * | 2000-11-21 | 2005-08-11 | Diamond Theodore G. | Full-text relevancy ranking |
US20070130131A1 (en) * | 2000-11-21 | 2007-06-07 | Porter Charles A | System and process for searching a network |
US20050193014A1 (en) * | 2000-11-21 | 2005-09-01 | John Prince | Fuzzy database retrieval |
US8095529B2 (en) | 2000-11-21 | 2012-01-10 | Aol Inc. | Full-text relevancy ranking |
US20110004604A1 (en) * | 2000-11-21 | 2011-01-06 | AOL, Inc. | Grouping multimedia and streaming media search results |
US8700590B2 (en) | 2000-11-21 | 2014-04-15 | Microsoft Corporation | Grouping multimedia and streaming media search results |
US7752186B2 (en) | 2000-11-21 | 2010-07-06 | Aol Inc. | Grouping multimedia and streaming media search results |
US20020103920A1 (en) * | 2000-11-21 | 2002-08-01 | Berkun Ken Alan | Interpretive stream metadata extraction |
US7720836B2 (en) | 2000-11-21 | 2010-05-18 | Aol Inc. | Internet streaming media workflow architecture |
US20030041077A1 (en) * | 2001-01-24 | 2003-02-27 | Davis Russell T. | RDX enhancement of system and method for implementing reusable data markup language (RDL) |
US9600842B2 (en) | 2001-01-24 | 2017-03-21 | E-Numerate Solutions, Inc. | RDX enhancement of system and method for implementing reusable data markup language (RDL) |
US20050086191A1 (en) * | 2001-03-23 | 2005-04-21 | Lars Werner | Method for retrieving documents |
US20040201619A1 (en) * | 2001-05-23 | 2004-10-14 | Konstantin Zervas | Method for optimizing utilization of client capacity |
US7743344B2 (en) | 2001-05-23 | 2010-06-22 | Handmark, Inc. | Method for optimizing utilization of client capacity |
US20020198962A1 (en) * | 2001-06-21 | 2002-12-26 | Horn Frederic A. | Method, system, and computer program product for distributing a stored URL and web document set |
US20030014512A1 (en) * | 2001-07-10 | 2003-01-16 | Murata Kikai Kabushiki Kaisha | Communication apparatus and structured document management apparatus |
US20030056138A1 (en) * | 2001-08-22 | 2003-03-20 | Wenge Ren | Method and system for implementing OSPF redundancy |
US7490161B2 (en) | 2001-08-22 | 2009-02-10 | Nokia Inc. | Method and system for implementing OSPF redundancy |
US20040064500A1 (en) * | 2001-11-20 | 2004-04-01 | Kolar Jennifer Lynn | System and method for unified extraction of media objects |
US8527495B2 (en) * | 2002-02-19 | 2013-09-03 | International Business Machines Corporation | Plug-in parsers for configuring search engine crawler |
US20030158835A1 (en) * | 2002-02-19 | 2003-08-21 | International Business Machines Corporation | Plug-in parsers for configuring search engine crawler |
WO2003090033A2 (en) * | 2002-04-17 | 2003-10-30 | Horn Frederic A | Method, system, and computer program product for distributing a stored url and web document set |
WO2003090033A3 (en) * | 2002-04-17 | 2004-02-26 | Frederic A Horn | Method, system, and computer program product for distributing a stored url and web document set |
US20040098378A1 (en) * | 2002-11-19 | 2004-05-20 | Gur Kimchi | Distributed client server index update system and method |
US7356768B1 (en) * | 2002-11-27 | 2008-04-08 | Adobe Systems Incorporated | Using document templates to assemble a collection of documents |
US9092414B2 (en) | 2002-11-27 | 2015-07-28 | Adobe Systems Incorporated | Using document templates to assemble a collection of documents |
US9842174B2 (en) | 2002-11-27 | 2017-12-12 | Adobe Systems Incorporated | Using document templates to assemble a collection of documents |
US11188978B2 (en) | 2002-12-31 | 2021-11-30 | Ebay Inc. | Method and system to generate a listing in a network-based commerce system |
US20040260679A1 (en) * | 2003-06-19 | 2004-12-23 | International Business Machines Corporation | Personalized indexing and searching for information in a distributed data processing system |
US7289983B2 (en) | 2003-06-19 | 2007-10-30 | International Business Machines Corporation | Personalized indexing and searching for information in a distributed data processing system |
US20070271247A1 (en) * | 2003-06-19 | 2007-11-22 | Best Steven F | Personalized Indexing And Searching For Information In A Distributed Data Processing System |
US7865494B2 (en) | 2003-06-19 | 2011-01-04 | International Business Machines Corporation | Personalized indexing and searching for information in a distributed data processing system |
US20040260680A1 (en) * | 2003-06-19 | 2004-12-23 | International Business Machines Corporation | Personalized indexing and searching for information in a distributed data processing system |
US20100241621A1 (en) * | 2003-07-03 | 2010-09-23 | Randall Keith H | Scheduler for Search Engine Crawler |
US7725452B1 (en) * | 2003-07-03 | 2010-05-25 | Google Inc. | Scheduler for search engine crawler |
US10216847B2 (en) | 2003-07-03 | 2019-02-26 | Google Llc | Document reuse in a search engine crawler |
US8161033B2 (en) | 2003-07-03 | 2012-04-17 | Google Inc. | Scheduler for search engine crawler |
US8707312B1 (en) | 2003-07-03 | 2014-04-22 | Google Inc. | Document reuse in a search engine crawler |
US8775403B2 (en) | 2003-07-03 | 2014-07-08 | Google Inc. | Scheduler for search engine crawler |
US10621241B2 (en) * | 2003-07-03 | 2020-04-14 | Google Llc | Scheduler for search engine crawler |
US9679056B2 (en) | 2003-07-03 | 2017-06-13 | Google Inc. | Document reuse in a search engine crawler |
US8042112B1 (en) | 2003-07-03 | 2011-10-18 | Google Inc. | Scheduler for search engine crawler |
US8707313B1 (en) | 2003-07-03 | 2014-04-22 | Google Inc. | Scheduler for search engine crawler |
US20140324818A1 (en) * | 2003-07-03 | 2014-10-30 | Google Inc. | Scheduler for Search Engine Crawler |
US20050210006A1 (en) * | 2004-03-18 | 2005-09-22 | Microsoft Corporation | Field weighting in text searching |
US7584221B2 (en) | 2004-03-18 | 2009-09-01 | Microsoft Corporation | Field weighting in text searching |
US7707157B1 (en) | 2004-03-25 | 2010-04-27 | Google Inc. | Document near-duplicate detection |
US8364686B1 (en) | 2004-03-25 | 2013-01-29 | Google Inc. | Document near-duplicate detection |
US7962491B1 (en) | 2004-03-25 | 2011-06-14 | Google Inc. | Document near-duplicate detection |
US7987172B1 (en) | 2004-08-30 | 2011-07-26 | Google Inc. | Minimizing visibility of stale content in web searching including revising web crawl intervals of documents |
US8407204B2 (en) | 2004-08-30 | 2013-03-26 | Google Inc. | Minimizing visibility of stale content in web searching including revising web crawl intervals of documents |
US8782032B2 (en) | 2004-08-30 | 2014-07-15 | Google Inc. | Minimizing visibility of stale content in web searching including revising web crawl intervals of documents |
US20100017403A1 (en) * | 2004-09-27 | 2010-01-21 | Microsoft Corporation | System and method for scoping searches using index keys |
US8843486B2 (en) | 2004-09-27 | 2014-09-23 | Microsoft Corporation | System and method for scoping searches using index keys |
US7827181B2 (en) | 2004-09-30 | 2010-11-02 | Microsoft Corporation | Click distance determination |
US20060069982A1 (en) * | 2004-09-30 | 2006-03-30 | Microsoft Corporation | Click distance determination |
US20060074871A1 (en) * | 2004-09-30 | 2006-04-06 | Microsoft Corporation | System and method for incorporating anchor text into ranking search results |
US8082246B2 (en) | 2004-09-30 | 2011-12-20 | Microsoft Corporation | System and method for ranking search results using click distance |
US7761448B2 (en) | 2004-09-30 | 2010-07-20 | Microsoft Corporation | System and method for ranking search results using click distance |
US7739277B2 (en) | 2004-09-30 | 2010-06-15 | Microsoft Corporation | System and method for incorporating anchor text into ranking search results |
US20060074903A1 (en) * | 2004-09-30 | 2006-04-06 | Microsoft Corporation | System and method for ranking search results using click distance |
US7716198B2 (en) | 2004-12-21 | 2010-05-11 | Microsoft Corporation | Ranking search results using feature extraction |
US20060136411A1 (en) * | 2004-12-21 | 2006-06-22 | Microsoft Corporation | Ranking search results using feature extraction |
US20060294100A1 (en) * | 2005-03-03 | 2006-12-28 | Microsoft Corporation | Ranking search results using language types |
US7792833B2 (en) | 2005-03-03 | 2010-09-07 | Microsoft Corporation | Ranking search results using language types |
US20060200460A1 (en) * | 2005-03-03 | 2006-09-07 | Microsoft Corporation | System and method for ranking search results using file types |
US11455680B2 (en) | 2005-03-30 | 2022-09-27 | Ebay Inc. | Methods and systems to process a selection of a browser back button |
US10559027B2 (en) | 2005-03-30 | 2020-02-11 | Ebay Inc. | Methods and systems to process a selection of a browser back button |
US11461835B2 (en) | 2005-03-30 | 2022-10-04 | Ebay Inc. | Method and system to dynamically browse data items |
US10497051B2 (en) | 2005-03-30 | 2019-12-03 | Ebay Inc. | Methods and systems to browse data items |
US11455679B2 (en) | 2005-03-30 | 2022-09-27 | Ebay Inc. | Methods and systems to browse data items |
US8140505B1 (en) * | 2005-03-31 | 2012-03-20 | Google Inc. | Near-duplicate document detection for web crawling |
US8548972B1 (en) * | 2005-03-31 | 2013-10-01 | Google Inc. | Near-duplicate document detection for web crawling |
US8666964B1 (en) | 2005-04-25 | 2014-03-04 | Google Inc. | Managing items in crawl schedule |
US20080012569A1 (en) * | 2005-05-21 | 2008-01-17 | Hall David R | Downhole Coils |
US7599917B2 (en) | 2005-08-15 | 2009-10-06 | Microsoft Corporation | Ranking search results using biased click distance |
US20070038622A1 (en) * | 2005-08-15 | 2007-02-15 | Microsoft Corporation | Method ranking search results using biased click distance |
US20070113091A1 (en) * | 2005-11-16 | 2007-05-17 | Sun Microsystems, Inc. | Extensible fingerprinting functions and content addressed storage system using the same |
US7844774B2 (en) * | 2005-11-16 | 2010-11-30 | Sun Microsystems, Inc. | Extensible fingerprinting functions and content addressed storage system using the same |
US20070118441A1 (en) * | 2005-11-22 | 2007-05-24 | Robert Chatwani | Editable electronic catalogs |
US9672551B2 (en) | 2005-11-22 | 2017-06-06 | Ebay Inc. | System and method for managing shared collections |
US20070130207A1 (en) * | 2005-11-22 | 2007-06-07 | Ebay Inc. | System and method for managing shared collections |
US8977603B2 (en) * | 2005-11-22 | 2015-03-10 | Ebay Inc. | System and method for managing shared collections |
US10229445B2 (en) | 2005-11-22 | 2019-03-12 | Ebay Inc. | System and method for managing shared collections |
US20070130205A1 (en) * | 2005-12-05 | 2007-06-07 | Microsoft Corporation | Metadata driven user interface |
US8095565B2 (en) | 2005-12-05 | 2012-01-10 | Microsoft Corporation | Metadata driven user interface |
US7966337B2 (en) | 2006-03-29 | 2011-06-21 | International Business Machines Corporation | System and method for prioritizing websites during a webcrawling process |
US7475069B2 (en) * | 2006-03-29 | 2009-01-06 | International Business Machines Corporation | System and method for prioritizing websites during a webcrawling process |
US20070239701A1 (en) * | 2006-03-29 | 2007-10-11 | International Business Machines Corporation | System and method for prioritizing websites during a webcrawling process |
US20080256046A1 (en) * | 2006-03-29 | 2008-10-16 | Blackman David L | System and method for prioritizing websites during a webcrawling process |
US8676703B2 (en) | 2006-04-27 | 2014-03-18 | Guidewire Software, Inc. | Insurance policy revisioning method and apparatus |
US20070255601A1 (en) * | 2006-04-27 | 2007-11-01 | Guidewire Software, Inc. | Insurance policy revisioning method and apparatus |
US20100070311A1 (en) * | 2006-04-27 | 2010-03-18 | Clark Allan Heydon | Insurance Policy Revisioning Method |
US20070263019A1 (en) * | 2006-05-09 | 2007-11-15 | Naohiro Furukawa | Information management method and information management system |
US9633356B2 (en) | 2006-07-20 | 2017-04-25 | Aol Inc. | Targeted advertising for playlists based upon search queries |
US20080033806A1 (en) * | 2006-07-20 | 2008-02-07 | Howe Karen N | Targeted advertising for playlists based upon search queries |
US20090055436A1 (en) * | 2007-08-20 | 2009-02-26 | Olakunle Olaniyi Ayeni | System and Method for Integrating on Demand/Pull and Push Flow of Goods-and-Services Meta-Data, Including Coupon and Advertising, with Mobile and Wireless Applications |
US20090106223A1 (en) * | 2007-10-18 | 2009-04-23 | Microsoft Corporation | Enterprise relevancy ranking using a neural network |
US20090106235A1 (en) * | 2007-10-18 | 2009-04-23 | Microsoft Corporation | Document Length as a Static Relevance Feature for Ranking Search Results |
US7840569B2 (en) | 2007-10-18 | 2010-11-23 | Microsoft Corporation | Enterprise relevancy ranking using a neural network |
US20090106221A1 (en) * | 2007-10-18 | 2009-04-23 | Microsoft Corporation | Ranking and Providing Search Results Based In Part On A Number Of Click-Through Features |
US9348912B2 (en) | 2007-10-18 | 2016-05-24 | Microsoft Technology Licensing, Llc | Document length as a static relevance feature for ranking search results |
US20090112720A1 (en) * | 2007-10-31 | 2009-04-30 | Tyler Close | Identifying And Displaying Messages Containing An Identifier |
US9117202B2 (en) * | 2007-10-31 | 2015-08-25 | Hewlett-Packard Development Company, L.P. | Identifying and displaying messages containing an identifier |
US7877369B2 (en) | 2007-11-02 | 2011-01-25 | Paglo Labs, Inc. | Hosted searching of private local area network information |
US20090119280A1 (en) * | 2007-11-02 | 2009-05-07 | Christopher Waters | Hosted searching of private local area network information with support for add-on applications |
US8285704B2 (en) | 2007-11-02 | 2012-10-09 | Citrix Online Llc | Hosted searching of private local area network information with support for add-on application |
US20110106786A1 (en) * | 2007-11-02 | 2011-05-05 | Christopher Waters | Hosted searching of private local area network information with support for add-on application |
US8285705B2 (en) | 2007-11-02 | 2012-10-09 | Citrix Online Llc | Hosted searching of private local area network information |
US20110106787A1 (en) * | 2007-11-02 | 2011-05-05 | Christopher Waters | Hosted searching of private local area network information |
US7877368B2 (en) | 2007-11-02 | 2011-01-25 | Paglo Labs, Inc. | Hosted searching of private local area network information with support for add-on applications |
US8359317B2 (en) * | 2008-03-26 | 2013-01-22 | International Business Machines Corporation | Method and device for indexing resource content in computer networks |
US20090248622A1 (en) * | 2008-03-26 | 2009-10-01 | International Business Machines Corporation | Method and device for indexing resource content in computer networks |
US20090259651A1 (en) * | 2008-04-11 | 2009-10-15 | Microsoft Corporation | Search results ranking using editing distance and document information |
US8812493B2 (en) | 2008-04-11 | 2014-08-19 | Microsoft Corporation | Search results ranking using editing distance and document information |
US8286171B2 (en) * | 2008-07-21 | 2012-10-09 | Workshare Technology, Inc. | Methods and systems to fingerprint textual information using word runs |
US9614813B2 (en) | 2008-07-21 | 2017-04-04 | Workshare Technology, Inc. | Methods and systems to implement fingerprint lookups across remote agents |
US9473512B2 (en) | 2008-07-21 | 2016-10-18 | Workshare Technology, Inc. | Methods and systems to implement fingerprint lookups across remote agents |
US20100064372A1 (en) * | 2008-07-21 | 2010-03-11 | Workshare Technology, Inc. | Methods and systems to implement fingerprint lookups across remote agents |
US20100017850A1 (en) * | 2008-07-21 | 2010-01-21 | Workshare Technology, Inc. | Methods and systems to fingerprint textual information using word runs |
US20100064347A1 (en) * | 2008-09-11 | 2010-03-11 | Workshare Technology, Inc. | Methods and systems for protect agents using distributed lightweight fingerprints |
US8555080B2 (en) | 2008-09-11 | 2013-10-08 | Workshare Technology, Inc. | Methods and systems for protect agents using distributed lightweight fingerprints |
US20100299727A1 (en) * | 2008-11-18 | 2010-11-25 | Workshare Technology, Inc. | Methods and systems for exact data match filtering |
US9092636B2 (en) | 2008-11-18 | 2015-07-28 | Workshare Technology, Inc. | Methods and systems for exact data match filtering |
US10963578B2 (en) | 2008-11-18 | 2021-03-30 | Workshare Technology, Inc. | Methods and systems for preventing transmission of sensitive data from a remote computer device |
US8620020B2 (en) | 2008-11-20 | 2013-12-31 | Workshare Technology, Inc. | Methods and systems for preventing unauthorized disclosure of secure information using image fingerprinting |
US8670600B2 (en) | 2008-11-20 | 2014-03-11 | Workshare Technology, Inc. | Methods and systems for image fingerprinting |
US8712992B2 (en) * | 2009-03-28 | 2014-04-29 | Microsoft Corporation | Method and apparatus for web crawling |
US20100250516A1 (en) * | 2009-03-28 | 2010-09-30 | Microsoft Corporation | Method and apparatus for web crawling |
US8285703B1 (en) * | 2009-05-13 | 2012-10-09 | Softek Solutions, Inc. | Document crawling systems and methods |
US8473847B2 (en) | 2009-07-27 | 2013-06-25 | Workshare Technology, Inc. | Methods and systems for comparing presentation slide decks |
US20110022960A1 (en) * | 2009-07-27 | 2011-01-27 | Workshare Technology, Inc. | Methods and systems for comparing presentation slide decks |
US9984415B2 (en) | 2009-09-24 | 2018-05-29 | Guidewire Software, Inc. | Method and apparatus for pricing insurance policies |
US11900472B2 (en) | 2009-09-24 | 2024-02-13 | Guidewire Software, Inc. | Method and apparatus for managing revisions and tracking of insurance policy elements |
US11080790B2 (en) | 2009-09-24 | 2021-08-03 | Guidewire Software, Inc. | Method and apparatus for managing revisions and tracking of insurance policy elements |
US20110071859A1 (en) * | 2009-09-24 | 2011-03-24 | Guidewire Software, Inc. | Method and Apparatus for Pricing Insurance Policies |
US20110071858A1 (en) * | 2009-09-24 | 2011-03-24 | Guidewire Software, Inc. | Method and apparatus for managing revisions and tracking of insurance policy elements |
US11263679B2 (en) | 2009-10-23 | 2022-03-01 | Ebay Inc. | Product identification using multiple services |
US8983958B2 (en) * | 2009-12-21 | 2015-03-17 | Business Objects Software Limited | Document indexing based on categorization and prioritization |
US20110153589A1 (en) * | 2009-12-21 | 2011-06-23 | Ganesh Vaitheeswaran | Document indexing based on categorization and prioritization |
US8738635B2 (en) | 2010-06-01 | 2014-05-27 | Microsoft Corporation | Detection of junk in search result ranking |
US11042736B2 (en) | 2010-11-29 | 2021-06-22 | Workshare Technology, Inc. | Methods and systems for monitoring documents exchanged over computer networks |
US10025759B2 (en) | 2010-11-29 | 2018-07-17 | Workshare Technology, Inc. | Methods and systems for monitoring documents exchanged over email applications |
US10445572B2 (en) | 2010-11-29 | 2019-10-15 | Workshare Technology, Inc. | Methods and systems for monitoring documents exchanged over email applications |
US8793706B2 (en) | 2010-12-16 | 2014-07-29 | Microsoft Corporation | Metadata-based eventing supporting operations on data |
US8868541B2 (en) * | 2011-01-21 | 2014-10-21 | Google Inc. | Scheduling resource crawls |
US20130144858A1 (en) * | 2011-01-21 | 2013-06-06 | Google Inc. | Scheduling resource crawls |
US10574729B2 (en) | 2011-06-08 | 2020-02-25 | Workshare Ltd. | System and method for cross platform document sharing |
US10963584B2 (en) | 2011-06-08 | 2021-03-30 | Workshare Ltd. | Method and system for collaborative editing of a remotely stored document |
US11386394B2 (en) | 2011-06-08 | 2022-07-12 | Workshare, Ltd. | Method and system for shared document approval |
US20120317075A1 (en) * | 2011-06-13 | 2012-12-13 | Suresh Pasumarthi | Synchronizing primary and secondary repositories |
US8862543B2 (en) * | 2011-06-13 | 2014-10-14 | Business Objects Software Limited | Synchronizing primary and secondary repositories |
US9613340B2 (en) | 2011-06-14 | 2017-04-04 | Workshare Ltd. | Method and system for shared document approval |
US8676783B1 (en) * | 2011-06-28 | 2014-03-18 | Google Inc. | Method and apparatus for managing a backlog of pending URL crawls |
US11030163B2 (en) | 2011-11-29 | 2021-06-08 | Workshare, Ltd. | System for tracking and displaying changes in a set of related electronic documents |
US20130144967A1 (en) * | 2011-12-05 | 2013-06-06 | International Business Machines Corporation | Scalable Queuing System |
US10880359B2 (en) | 2011-12-21 | 2020-12-29 | Workshare, Ltd. | System and method for cross platform document sharing |
US8577610B2 (en) | 2011-12-21 | 2013-11-05 | Telenav Inc. | Navigation system with point of interest harvesting mechanism and method of operation thereof |
US9495462B2 (en) | 2012-01-27 | 2016-11-15 | Microsoft Technology Licensing, Llc | Re-ranking search results |
US20170228588A1 (en) * | 2012-08-16 | 2017-08-10 | Groupon, Inc. | Method, apparatus, and computer program product for classification of documents |
US11715315B2 (en) | 2012-08-16 | 2023-08-01 | Groupon, Inc. | Systems, methods and computer readable media for identifying content to represent web pages and creating a representative image from the content |
US10339375B2 (en) * | 2012-08-16 | 2019-07-02 | Groupon, Inc. | Method, apparatus, and computer program product for classification of documents |
US11068708B2 (en) | 2012-08-16 | 2021-07-20 | Groupon, Inc. | Method, apparatus, and computer program product for classification of documents |
US11341191B2 (en) | 2013-03-14 | 2022-05-24 | Workshare Ltd. | Method and system for document retrieval with selective document comparison |
US10783326B2 (en) | 2013-03-14 | 2020-09-22 | Workshare, Ltd. | System for tracking changes in a collaborative document editing environment |
US11567907B2 (en) | 2013-03-14 | 2023-01-31 | Workshare, Ltd. | Method and system for comparing document versions encoded in a hierarchical representation |
US9170990B2 (en) | 2013-03-14 | 2015-10-27 | Workshare Limited | Method and system for document retrieval with selective document comparison |
US10911492B2 (en) | 2013-07-25 | 2021-02-02 | Workshare Ltd. | System and method for securing documents prior to transmission |
US9948676B2 (en) | 2013-07-25 | 2018-04-17 | Workshare, Ltd. | System and method for securing documents prior to transmission |
US9513961B1 (en) * | 2014-04-02 | 2016-12-06 | Google Inc. | Monitoring application loading |
US11182551B2 (en) | 2014-12-29 | 2021-11-23 | Workshare Ltd. | System and method for determining document version geneology |
US10133723B2 (en) | 2014-12-29 | 2018-11-20 | Workshare Ltd. | System and method for determining document version geneology |
US11763013B2 (en) | 2015-08-07 | 2023-09-19 | Workshare, Ltd. | Transaction document management system and method |
US9922022B2 (en) * | 2016-02-01 | 2018-03-20 | Microsoft Technology Licensing, Llc. | Automatic template generation based on previous documents |
US10839149B2 (en) | 2016-02-01 | 2020-11-17 | Microsoft Technology Licensing, Llc. | Generating templates from user's past documents |
CN113065055A (en) * | 2021-04-21 | 2021-07-02 | 平安国际智慧城市科技股份有限公司 | News information capturing method and device, electronic equipment and storage medium |
CN113065055B (en) * | 2021-04-21 | 2024-04-02 | 深圳赛安特技术服务有限公司 | News information capturing method and device, electronic equipment and storage medium |
US11347579B1 (en) | 2021-04-29 | 2022-05-31 | Bank Of America Corporation | Instinctive slither application assessment engine |
US11663071B2 (en) | 2021-04-29 | 2023-05-30 | Bank Of America Corporation | Instinctive slither application assessment engine |
Also Published As
Publication number | Publication date |
---|---|
WO2001033428A1 (en) | 2001-05-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6351755B1 (en) | System and method for associating an extensible set of data with documents downloaded by a web crawler | |
US6263364B1 (en) | Web crawler system using plurality of parallel priority level queues having distinct associated download priority levels for prioritizing document downloading and maintaining document freshness | |
US6321265B1 (en) | System and method for enforcing politeness while scheduling downloads in a web crawler | |
US10216847B2 (en) | Document reuse in a search engine crawler | |
US6377984B1 (en) | Web crawler system using parallel queues for queing data sets having common address and concurrently downloading data associated with data set in each queue | |
US6952730B1 (en) | System and method for efficient filtering of data set addresses in a web crawler | |
US10621241B2 (en) | Scheduler for search engine crawler | |
US6301614B1 (en) | System and method for efficient representation of data set addresses in a web crawler | |
CN105956183B (en) | The multilevel optimization's storage method and system of mass small documents in a kind of distributed data base | |
US6898592B2 (en) | Scoping queries in a search engine | |
US20060041606A1 (en) | Indexing system for a computer file store | |
US20020178341A1 (en) | System and method for indexing and retriving cached objects | |
US8799291B2 (en) | Forensic index method and apparatus by distributed processing | |
WO2010123705A2 (en) | System and method for performing longest common prefix strings searches | |
US20120239652A1 (en) | Hardware Accelerated Application-Based Pattern Matching for Real Time Classification and Recording of Network Traffic | |
US7836108B1 (en) | Clustering by previous representative | |
WO2021016050A1 (en) | Multi-record index structure for key-value stores | |
CN109634933A (en) | The method, apparatus and system of data processing | |
US20020107986A1 (en) | Methods and systems for replacing data transmission request expressions | |
US6931491B2 (en) | Hardware-assisted tuple space | |
US7653787B2 (en) | System for storing web site names and caching audio resources of the most visited web sites | |
US7779057B2 (en) | Method and apparatus for retrieving and sorting entries from a directory | |
CN104537017B (en) | A kind of file search method and device based on path | |
CN117951172A (en) | Key field processing method and device, electronic equipment and storage medium | |
CN116049180A (en) | Tenant data processing method and device for Paas platform |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: ALTA VISTA COMPANY, CALIFORNIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:NAJORK, MARC ALEXANDER;HEYDON, CLARK ALLAN;REEL/FRAME:010368/0055 Effective date: 19991102 |
|
STCF | Information on status: patent grant |
Free format text: PATENTED CASE |
|
AS | Assignment |
Owner name: OVERTURE SERVICES, INC., CALIFORNIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:ALTA VISTA COMPANY;REEL/FRAME:014394/0899 Effective date: 20030425 |
|
FPAY | Fee payment |
Year of fee payment: 4 |
|
AS | Assignment |
Owner name: YAHOO! INC,CALIFORNIA Free format text: MERGER;ASSIGNOR:OVERTURE SERVICES, INC;REEL/FRAME:021652/0654 Effective date: 20081001 Owner name: YAHOO! INC, CALIFORNIA Free format text: MERGER;ASSIGNOR:OVERTURE SERVICES, INC;REEL/FRAME:021652/0654 Effective date: 20081001 |
|
FPAY | Fee payment |
Year of fee payment: 8 |
|
FPAY | Fee payment |
Year of fee payment: 12 |
|
FEPP | Fee payment procedure |
Free format text: PAYER NUMBER DE-ASSIGNED (ORIGINAL EVENT CODE: RMPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
AS | Assignment |
Owner name: EXCALIBUR IP, LLC, CALIFORNIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:YAHOO! INC.;REEL/FRAME:038383/0466 Effective date: 20160418 |
|
AS | Assignment |
Owner name: YAHOO! INC., CALIFORNIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:EXCALIBUR IP, LLC;REEL/FRAME:038951/0295 Effective date: 20160531 |
|
AS | Assignment |
Owner name: EXCALIBUR IP, LLC, CALIFORNIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:YAHOO! INC.;REEL/FRAME:038950/0592 Effective date: 20160531 |
|
AS | Assignment |
Owner name: R2 SOLUTIONS LLC, TEXAS Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:EXCALIBUR IP, LLC;REEL/FRAME:055283/0483 Effective date: 20200428 |