WO2006052618A3 - A method, apparatus, and system for clustering and classification - Google Patents

A method, apparatus, and system for clustering and classification Download PDF

Info

Publication number
WO2006052618A3
WO2006052618A3 PCT/US2005/039718 US2005039718W WO2006052618A3 WO 2006052618 A3 WO2006052618 A3 WO 2006052618A3 US 2005039718 W US2005039718 W US 2005039718W WO 2006052618 A3 WO2006052618 A3 WO 2006052618A3
Authority
WO
WIPO (PCT)
Prior art keywords
documents
clusters
machines
hash
classification
Prior art date
Application number
PCT/US2005/039718
Other languages
French (fr)
Other versions
WO2006052618A2 (en
Inventor
Seth Patinkin
Original Assignee
Cuttr Inc
Seth Patinkin
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Cuttr Inc, Seth Patinkin filed Critical Cuttr Inc
Priority to CA002590476A priority Critical patent/CA2590476A1/en
Priority to JP2007540011A priority patent/JP2008529105A/en
Priority to EP05824443A priority patent/EP1825386A4/en
Publication of WO2006052618A2 publication Critical patent/WO2006052618A2/en
Publication of WO2006052618A3 publication Critical patent/WO2006052618A3/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/10Office automation; Time management
    • G06Q10/107Computer-aided management of electronic mailing [e-mailing]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L51/00User-to-user messaging in packet-switching networks, transmitted according to store-and-forward or real-time protocols, e.g. e-mail
    • H04L51/21Monitoring or handling of messages
    • H04L51/212Monitoring or handling of messages using filtering or selective blocking

Abstract

The invention provides a method, apparatus and system for classification and clustering electronic data streams such as email, images and sound files for identification, sorting and efficient storage. The inventive systems disclose labeling a document as belonging to a predefined class though computer methods that comprise the steps of identifying an electronic data stream using one or more learning machines and comparing the outputs from the machines to determine the label to associate with the data. The method further utilizes learning machines in combination with hashing schemes to cluster and classify documents. In one embodiment hash apparatuses and methods taxonomize clusters. In yet another embodiment, clusters of documents utilize geometric hash to contain the documents in a data corpus without the overhead of search and storage.
PCT/US2005/039718 2004-11-04 2005-11-03 A method, apparatus, and system for clustering and classification WO2006052618A2 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
CA002590476A CA2590476A1 (en) 2004-11-04 2005-11-03 A method, apparatus, and system for clustering and classification
JP2007540011A JP2008529105A (en) 2004-11-04 2005-11-03 Method, apparatus and system for clustering and classification
EP05824443A EP1825386A4 (en) 2004-11-04 2005-11-03 A method, apparatus, and system for clustering and classification

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US10/983,258 2004-11-04
US10/983,258 US7574409B2 (en) 2004-11-04 2004-11-04 Method, apparatus, and system for clustering and classification

Publications (2)

Publication Number Publication Date
WO2006052618A2 WO2006052618A2 (en) 2006-05-18
WO2006052618A3 true WO2006052618A3 (en) 2006-08-31

Family

ID=36263371

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2005/039718 WO2006052618A2 (en) 2004-11-04 2005-11-03 A method, apparatus, and system for clustering and classification

Country Status (5)

Country Link
US (2) US7574409B2 (en)
EP (1) EP1825386A4 (en)
JP (1) JP2008529105A (en)
CA (1) CA2590476A1 (en)
WO (1) WO2006052618A2 (en)

Families Citing this family (490)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6735253B1 (en) 1997-05-16 2004-05-11 The Trustees Of Columbia University In The City Of New York Methods and architecture for indexing and editing compressed video over the world wide web
US20040073617A1 (en) * 2000-06-19 2004-04-15 Milliken Walter Clark Hash-based systems and methods for detecting and preventing transmission of unwanted e-mail
US6973574B2 (en) * 2001-04-24 2005-12-06 Microsoft Corp. Recognizer of audio-content in digital signals
US8046832B2 (en) 2002-06-26 2011-10-25 Microsoft Corporation Spam detector with challenges
US7299261B1 (en) 2003-02-20 2007-11-20 Mailfrontier, Inc. A Wholly Owned Subsidiary Of Sonicwall, Inc. Message classification using a summary
US8266215B2 (en) 2003-02-20 2012-09-11 Sonicwall, Inc. Using distinguishing properties to classify messages
US7406502B1 (en) 2003-02-20 2008-07-29 Sonicwall, Inc. Method and system for classifying a message based on canonical equivalent of acceptable items included in the message
RU2635259C1 (en) * 2016-06-22 2017-11-09 Общество с ограниченной ответственностью "Аби Девелопмент" Method and device for determining type of digital document
US8938595B2 (en) * 2003-08-05 2015-01-20 Sepaton, Inc. Emulated storage system
US7451487B2 (en) 2003-09-08 2008-11-11 Sonicwall, Inc. Fraudulent message detection
US7184929B2 (en) * 2004-01-28 2007-02-27 Microsoft Corporation Exponential priors for maximum entropy models
US7191175B2 (en) 2004-02-13 2007-03-13 Attenex Corporation System and method for arranging concept clusters in thematic neighborhood relationships in a two-dimensional visual display space
US7770014B2 (en) 2004-04-30 2010-08-03 Microsoft Corporation Randomized signal transforms and their applications
US7574409B2 (en) * 2004-11-04 2009-08-11 Vericept Corporation Method, apparatus, and system for clustering and classification
WO2006096612A2 (en) 2005-03-04 2006-09-14 The Trustees Of Columbia University In The City Of New York System and method for motion estimation and mode decision for low-complexity h.264 decoder
US7801841B2 (en) * 2005-06-20 2010-09-21 New York University Method, system and software arrangement for reconstructing formal descriptive models of processes from functional/modal data using suitable ontology
US8572018B2 (en) * 2005-06-20 2013-10-29 New York University Method, system and software arrangement for reconstructing formal descriptive models of processes from functional/modal data using suitable ontology
US7930353B2 (en) * 2005-07-29 2011-04-19 Microsoft Corporation Trees of classifiers for detecting email spam
US20070076869A1 (en) * 2005-10-03 2007-04-05 Microsoft Corporation Digital goods representation based upon matrix invariants using non-negative matrix factorizations
US8566928B2 (en) 2005-10-27 2013-10-22 Georgia Tech Research Corporation Method and system for detecting and responding to attacking networks
US8065370B2 (en) 2005-11-03 2011-11-22 Microsoft Corporation Proofs to filter spam
US7809670B2 (en) * 2005-12-09 2010-10-05 Microsoft Corporation Classification of malware using clustering that orders events in accordance with the time of occurance
US7647630B2 (en) * 2005-12-15 2010-01-12 International Business Machines Corporation Associating security information with information objects in a data processing system
US7512792B2 (en) * 2005-12-15 2009-03-31 International Business Machines Corporation Reference monitor method for enforcing information flow policies
US20070143601A1 (en) * 2005-12-15 2007-06-21 Arroyo Diana J System and method for authorizing information flows
US7730141B2 (en) * 2005-12-16 2010-06-01 Microsoft Corporation Graphical interface for defining mutually exclusive destinations
US20070162761A1 (en) 2005-12-23 2007-07-12 Davis Bruce L Methods and Systems to Help Detect Identity Fraud
US7765212B2 (en) * 2005-12-29 2010-07-27 Microsoft Corporation Automatic organization of documents through email clustering
US8839418B2 (en) * 2006-01-18 2014-09-16 Microsoft Corporation Finding phishing sites
US7526530B2 (en) * 2006-05-05 2009-04-28 Adobe Systems Incorporated System and method for cacheing web files
US8706470B2 (en) 2006-05-08 2014-04-22 David T. Lorenzen Methods of offering guidance on common language usage utilizing a hashing function consisting of a hash triplet
JP4812508B2 (en) * 2006-05-12 2011-11-09 富士通株式会社 System that handles presence information
US8055597B2 (en) * 2006-05-16 2011-11-08 Sony Corporation Method and system for subspace bounded recursive clustering of categorical data
US7640220B2 (en) 2006-05-16 2009-12-29 Sony Corporation Optimal taxonomy layer selection method
US7761394B2 (en) * 2006-05-16 2010-07-20 Sony Corporation Augmented dataset representation using a taxonomy which accounts for similarity and dissimilarity between each record in the dataset and a user's similarity-biased intuition
US7630946B2 (en) * 2006-05-16 2009-12-08 Sony Corporation System for folder classification based on folder content similarity and dissimilarity
US7844557B2 (en) 2006-05-16 2010-11-30 Sony Corporation Method and system for order invariant clustering of categorical data
US7664718B2 (en) * 2006-05-16 2010-02-16 Sony Corporation Method and system for seed based clustering of categorical data using hierarchies
US8601065B2 (en) * 2006-05-31 2013-12-03 Cisco Technology, Inc. Method and apparatus for preventing outgoing spam e-mails by monitoring client interactions
US7558771B2 (en) * 2006-06-07 2009-07-07 Gm Global Technology Operations, Inc. System and method for selection of prediction tools
EP2039089B1 (en) * 2006-06-29 2009-07-29 International Business Machines Corporation Method and system for low-redundancy e-mail handling
US8615800B2 (en) 2006-07-10 2013-12-24 Websense, Inc. System and method for analyzing web content
US8020206B2 (en) 2006-07-10 2011-09-13 Websense, Inc. System and method of analyzing web content
US8707459B2 (en) 2007-01-19 2014-04-22 Digimarc Corporation Determination of originality of content
US8010511B2 (en) 2006-08-29 2011-08-30 Attributor Corporation Content monitoring and compliance enforcement
US8738749B2 (en) 2006-08-29 2014-05-27 Digimarc Corporation Content monitoring and host compliance evaluation
US8010534B2 (en) * 2006-08-31 2011-08-30 Orcatec Llc Identifying related objects using quantum clustering
JP2010503072A (en) * 2006-09-02 2010-01-28 ティーティービー テクノロジーズ,エルエルシー Computer-based meeting preparation method and execution system
US7945627B1 (en) 2006-09-28 2011-05-17 Bitdefender IPR Management Ltd. Layout-based electronic communication filtering systems and methods
US7672912B2 (en) * 2006-10-26 2010-03-02 Microsoft Corporation Classifying knowledge aging in emails using Naïve Bayes Classifier
US9654495B2 (en) 2006-12-01 2017-05-16 Websense, Llc System and method of analyzing web addresses
US8224905B2 (en) * 2006-12-06 2012-07-17 Microsoft Corporation Spam filtration utilizing sender activity data
US7788265B2 (en) * 2006-12-21 2010-08-31 Finebrain.Com Ag Taxonomy-based object classification
JP2008158861A (en) * 2006-12-25 2008-07-10 Neocsm Corp Reception system and program
US20080168453A1 (en) * 2007-01-09 2008-07-10 Caterpillar Inc. Work prioritization system and method
GB2445764A (en) * 2007-01-22 2008-07-23 Surfcontrol Plc Resource access filtering system and database structure for use therewith
US8930331B2 (en) 2007-02-21 2015-01-06 Palantir Technologies Providing unique views of data based on changes or rules
US8027983B1 (en) 2007-04-10 2011-09-27 Google Inc. Enhanced query performance using fixed length hashing of multidimensional data
US8423616B2 (en) * 2007-05-03 2013-04-16 Microsoft Corporation Identifying and correlating electronic mail messages
US8612523B1 (en) * 2007-05-22 2013-12-17 Trend Micro Incorporated Methods and apparatus for detecting botnet attacks
US8086675B2 (en) 2007-07-12 2011-12-27 International Business Machines Corporation Generating a fingerprint of a bit sequence
CL2007002345A1 (en) * 2007-08-10 2009-09-11 Pablo Zegers Fernandez Method for solving complex problems through cascading learning.
US7765204B2 (en) * 2007-09-27 2010-07-27 Microsoft Corporation Method of finding candidate sub-queries from longer queries
US8572184B1 (en) 2007-10-04 2013-10-29 Bitdefender IPR Management Ltd. Systems and methods for dynamically integrating heterogeneous anti-spam filters
US8010614B1 (en) * 2007-11-01 2011-08-30 Bitdefender IPR Management Ltd. Systems and methods for generating signatures for electronic communication classification
US7904399B2 (en) * 2007-11-15 2011-03-08 International Business Machines Corporation Method and apparatus for determining decision points for streaming conversational data
US8635285B2 (en) * 2007-12-22 2014-01-21 Paul D'Amato Email categorization methods, coding, and tools
US7836061B1 (en) * 2007-12-29 2010-11-16 Kaspersky Lab, Zao Method and system for classifying electronic text messages and spam messages
JP5035049B2 (en) * 2008-03-17 2012-09-26 富士通株式会社 E-mail relay system and control method of e-mail relay system
US8745731B2 (en) * 2008-04-03 2014-06-03 Microsoft Corporation Clustering botnet behavior using parameterized models
WO2009126785A2 (en) 2008-04-10 2009-10-15 The Trustees Of Columbia University In The City Of New York Systems and methods for image archaeology
US8676001B2 (en) 2008-05-12 2014-03-18 Google Inc. Automatic discovery of popular landmarks
US8682819B2 (en) * 2008-06-19 2014-03-25 Microsoft Corporation Machine-based learning for automatically categorizing data on per-user basis
EP2318955A1 (en) * 2008-06-30 2011-05-11 Websense, Inc. System and method for dynamic and real-time categorization of webpages
US10027688B2 (en) 2008-08-11 2018-07-17 Damballa, Inc. Method and system for detecting malicious and/or botnet-related domain names
CN101650717B (en) * 2008-08-13 2013-07-31 阿里巴巴集团控股有限公司 Method and system for saving storage space of database
JP5156540B2 (en) * 2008-08-22 2013-03-06 株式会社日立製作所 Hash value generator
US8180838B2 (en) * 2008-08-29 2012-05-15 Microsoft Corporation Efficiently managing modular data storage systems
WO2010030794A1 (en) 2008-09-10 2010-03-18 Digital Infuzion, Inc. Machine learning methods and systems for identifying patterns in data
US8984390B2 (en) 2008-09-15 2015-03-17 Palantir Technologies, Inc. One-click sharing for screenshots and related documents
US8379937B1 (en) * 2008-09-29 2013-02-19 Videomining Corporation Method and system for robust human ethnicity recognition using image feature-based probabilistic graphical models
US8472728B1 (en) * 2008-10-31 2013-06-25 The Rubicon Project, Inc. System and method for identifying and characterizing content within electronic files using example sets
US8170966B1 (en) 2008-11-04 2012-05-01 Bitdefender IPR Management Ltd. Dynamic streaming message clustering for rapid spam-wave detection
US8606724B2 (en) * 2008-11-06 2013-12-10 International Business Machines Corporation Policy evolution with machine learning
KR101266358B1 (en) 2008-12-22 2013-05-22 한국전자통신연구원 A distributed index system based on multi-length signature files and method thereof
WO2010075408A1 (en) * 2008-12-22 2010-07-01 The Trustees Of Columbia University In The City Of New York System and method for annotating and searching media
US8671069B2 (en) 2008-12-22 2014-03-11 The Trustees Of Columbia University, In The City Of New York Rapid image annotation via brain state decoding and visual pattern mining
US8621625B1 (en) * 2008-12-23 2013-12-31 Symantec Corporation Methods and systems for detecting infected files
US20100169178A1 (en) * 2008-12-26 2010-07-01 Microsoft Corporation Advertising Method for Image Search
US8578485B2 (en) * 2008-12-31 2013-11-05 Sonicwall, Inc. Identification of content by metadata
US20100191734A1 (en) * 2009-01-23 2010-07-29 Rajaram Shyam Sundar System and method for classifying documents
DE102009006857A1 (en) * 2009-01-30 2010-08-19 Living-E Ag A method for automatically classifying a text by a computer system
EP2216947A1 (en) * 2009-02-10 2010-08-11 Alcatel Lucent Method of identifying spam messages
US20100256977A1 (en) * 2009-04-01 2010-10-07 Microsoft Corporation Maximum entropy model with continuous features
US8234344B2 (en) * 2009-05-06 2012-07-31 Business Objects S.A. Providing analytical view of email information
US8515956B2 (en) * 2009-05-11 2013-08-20 H5 Method and system for clustering datasets
US8396287B2 (en) 2009-05-15 2013-03-12 Google Inc. Landmarks from digital photo collections
US8103672B2 (en) * 2009-05-20 2012-01-24 Detectent, Inc. Apparatus, system, and method for determining a partial class membership of a data record in a class
US8719302B2 (en) * 2009-06-09 2014-05-06 Ebh Enterprises Inc. Methods, apparatus and software for analyzing the content of micro-blog messages
US8925087B1 (en) * 2009-06-19 2014-12-30 Trend Micro Incorporated Apparatus and methods for in-the-cloud identification of spam and/or malware
US8381290B2 (en) * 2009-07-17 2013-02-19 Exelis Inc. Intrusion detection systems and methods
US8713018B2 (en) 2009-07-28 2014-04-29 Fti Consulting, Inc. System and method for displaying relationships between electronically stored information to provide classification suggestions via inclusion
US20110029476A1 (en) * 2009-07-29 2011-02-03 Kas Kasravi Indicating relationships among text documents including a patent based on characteristics of the text documents
US8612446B2 (en) 2009-08-24 2013-12-17 Fti Consulting, Inc. System and method for generating a reference set for use during document review
US8332624B2 (en) * 2009-08-26 2012-12-11 Nokia Corporation Method and apparatus for encoding decision diagrams
US20110055221A1 (en) * 2009-08-26 2011-03-03 Sovio Sampo Juhani Method and apparatus for obtaining decision diagrams from unknown hash identifiers
US8924439B2 (en) * 2009-09-01 2014-12-30 Verizon Patent And Licensing Inc. System for and method of partial file hashing
WO2011029474A1 (en) * 2009-09-09 2011-03-17 Universität Bremen Document comparison
US9355171B2 (en) * 2009-10-09 2016-05-31 Hewlett Packard Enterprise Development Lp Clustering of near-duplicate documents
US8244767B2 (en) * 2009-10-09 2012-08-14 Stratify, Inc. Composite locality sensitive hash based processing of documents
US8943144B2 (en) * 2009-12-10 2015-01-27 International Business Machines Corporation Consolidating duplicate messages for a single destination on a computer network
US20110154495A1 (en) * 2009-12-21 2011-06-23 Stranne Odd Wandenor Malware identification and scanning
US9001673B2 (en) 2009-12-29 2015-04-07 Ebay Inc. Outgoing communications inventory
US8578497B2 (en) 2010-01-06 2013-11-05 Damballa, Inc. Method and system for detecting malware
US8826438B2 (en) * 2010-01-19 2014-09-02 Damballa, Inc. Method and system for network-based detecting of malware from behavioral clustering
US8447741B2 (en) * 2010-01-25 2013-05-21 Sepaton, Inc. System and method for providing data driven de-duplication services
US9838349B2 (en) 2010-03-08 2017-12-05 Microsoft Technology Licensing, Llc Zone classification of electronic mail messages
US8914368B2 (en) 2010-03-31 2014-12-16 International Business Machines Corporation Augmented and cross-service tagging
US8909950B1 (en) 2010-04-18 2014-12-09 Aptima, Inc. Systems and methods of power management
US8682898B2 (en) * 2010-04-30 2014-03-25 International Business Machines Corporation Systems and methods for discovering synonymous elements using context over multiple similar addresses
US20120030080A1 (en) 2010-07-29 2012-02-02 Accenture Global Services Gmbh Systemic Risk Monitoring System And Method For Revenue Agencies
US9516058B2 (en) 2010-08-10 2016-12-06 Damballa, Inc. Method and system for determining whether domain names are legitimate or malicious
US9165051B2 (en) * 2010-08-24 2015-10-20 Board Of Trustees Of The University Of Illinois Systems and methods for detecting a novel data class
US8392424B2 (en) * 2010-08-24 2013-03-05 Mitsubishi Electric Research Laboratories, Inc. Method for hierarchical signal quantization and hashing
US8788500B2 (en) * 2010-09-10 2014-07-22 International Business Machines Corporation Electronic mail duplicate detection
US8898177B2 (en) 2010-09-10 2014-11-25 International Business Machines Corporation E-mail thread hierarchy detection
US9177259B1 (en) * 2010-11-29 2015-11-03 Aptima Inc. Systems and methods for recognizing and reacting to spatiotemporal patterns
US8607140B1 (en) * 2010-12-21 2013-12-10 Google Inc. Classifying changes to resources
US8661341B1 (en) * 2011-01-19 2014-02-25 Google, Inc. Simhash based spell correction
US9122639B2 (en) 2011-01-25 2015-09-01 Sepaton, Inc. Detection and deduplication of backup sets exhibiting poor locality
US8631489B2 (en) 2011-02-01 2014-01-14 Damballa, Inc. Method and system for detecting malicious domain names at an upper DNS hierarchy
US8554907B1 (en) * 2011-02-15 2013-10-08 Trend Micro, Inc. Reputation prediction of IP addresses
US8527516B1 (en) * 2011-02-25 2013-09-03 Google Inc. Identifying similar digital text volumes
US8495737B2 (en) 2011-03-01 2013-07-23 Zscaler, Inc. Systems and methods for detecting email spam and variants thereof
US9547693B1 (en) 2011-06-23 2017-01-17 Palantir Technologies Inc. Periodic database search manager for multiple data sources
US9092482B2 (en) 2013-03-14 2015-07-28 Palantir Technologies, Inc. Fair scheduling for mixed-query loads
US8799240B2 (en) 2011-06-23 2014-08-05 Palantir Technologies, Inc. System and method for investigating large amounts of data
US9280532B2 (en) 2011-08-02 2016-03-08 Palantir Technologies, Inc. System and method for accessing rich objects via spreadsheets
US8732574B2 (en) 2011-08-25 2014-05-20 Palantir Technologies, Inc. System and method for parameterizing documents for automatic workflow generation
US8504542B2 (en) 2011-09-02 2013-08-06 Palantir Technologies, Inc. Multi-row transactions
US20150046221A1 (en) * 2011-09-17 2015-02-12 Autogrid Inc. Load forecasting from individual customer to system level based on price
US9020952B2 (en) * 2011-10-04 2015-04-28 Reputation.Com, Inc. Methods and systems for providing unique signatures
US8762300B2 (en) * 2011-10-18 2014-06-24 Ming Chuan University Method and system for document classification
US9054876B1 (en) * 2011-11-04 2015-06-09 Google Inc. Fast efficient vocabulary computation with hashed vocabularies applying hash functions to cluster centroids that determines most frequently used cluster centroid IDs
US9111218B1 (en) 2011-12-27 2015-08-18 Google Inc. Method and system for remediating topic drift in near-real-time classification of customer feedback
US9110984B1 (en) 2011-12-27 2015-08-18 Google Inc. Methods and systems for constructing a taxonomy based on hierarchical clustering
US9436758B1 (en) 2011-12-27 2016-09-06 Google Inc. Methods and systems for partitioning documents having customer feedback and support content
US9002848B1 (en) 2011-12-27 2015-04-07 Google Inc. Automatic incremental labeling of document clusters
US9367814B1 (en) 2011-12-27 2016-06-14 Google Inc. Methods and systems for classifying data using a hierarchical taxonomy
US8972404B1 (en) 2011-12-27 2015-03-03 Google Inc. Methods and systems for organizing content
US8977620B1 (en) 2011-12-27 2015-03-10 Google Inc. Method and system for document classification
TWI473474B (en) * 2012-01-06 2015-02-11 Univ Nat Central Method for classifying email
US9189746B2 (en) * 2012-01-12 2015-11-17 Microsoft Technology Licensing, Llc Machine-learning based classification of user accounts based on email addresses and other account information
US9098941B2 (en) * 2012-01-23 2015-08-04 Ayasdi, Inc. Systems and methods for graphical layout
US9130778B2 (en) * 2012-01-25 2015-09-08 Bitdefender IPR Management Ltd. Systems and methods for spam detection using frequency spectra of character strings
US8954519B2 (en) * 2012-01-25 2015-02-10 Bitdefender IPR Management Ltd. Systems and methods for spam detection using character histograms
US9922190B2 (en) 2012-01-25 2018-03-20 Damballa, Inc. Method and system for detecting DGA-based malware
US20130205235A1 (en) * 2012-02-03 2013-08-08 TrueMaps LLC Apparatus and Method for Comparing and Statistically Adjusting Search Engine Results
US9477749B2 (en) * 2012-03-02 2016-10-25 Clarabridge, Inc. Apparatus for identifying root cause using unstructured data
US10600002B2 (en) 2016-08-04 2020-03-24 Loom Systems LTD. Machine learning techniques for providing enriched root causes based on machine-generated data
US10740692B2 (en) 2017-10-17 2020-08-11 Servicenow, Inc. Machine-learning and deep-learning techniques for predictive ticketing in information technology systems
US11416325B2 (en) 2012-03-13 2022-08-16 Servicenow, Inc. Machine-learning and deep-learning techniques for predictive ticketing in information technology systems
US9798802B2 (en) * 2012-03-23 2017-10-24 Avast Software B.V. Systems and methods for extraction of policy information
US9069798B2 (en) * 2012-05-24 2015-06-30 Mitsubishi Electric Research Laboratories, Inc. Method of text classification using discriminative topic transformation
US9390240B1 (en) 2012-06-11 2016-07-12 Dell Software Inc. System and method for querying data
US9578060B1 (en) 2012-06-11 2017-02-21 Dell Software Inc. System and method for data loss prevention across heterogeneous communications platforms
US9501744B1 (en) * 2012-06-11 2016-11-22 Dell Software Inc. System and method for classifying data
US9779260B1 (en) 2012-06-11 2017-10-03 Dell Software Inc. Aggregation and classification of secure data
US9015744B1 (en) * 2012-06-25 2015-04-21 IMBD.com, Inc. Ascertaining events in media
US9501572B2 (en) 2012-06-29 2016-11-22 Google Inc. Content placement criteria expansion
US9430755B2 (en) * 2012-06-29 2016-08-30 Yahoo! Inc. System and method to enable communication group identification
US9336302B1 (en) 2012-07-20 2016-05-10 Zuci Realty Llc Insight and algorithmic clustering for automated synthesis
US9411327B2 (en) 2012-08-27 2016-08-09 Johnson Controls Technology Company Systems and methods for classifying data in building automation systems
US10547674B2 (en) 2012-08-27 2020-01-28 Help/Systems, Llc Methods and systems for network flow analysis
US10084806B2 (en) 2012-08-31 2018-09-25 Damballa, Inc. Traffic simulation to identify malicious activity
US9680861B2 (en) 2012-08-31 2017-06-13 Damballa, Inc. Historical analysis to identify malicious activity
US9166994B2 (en) 2012-08-31 2015-10-20 Damballa, Inc. Automation discovery to identify malicious activity
US9894088B2 (en) 2012-08-31 2018-02-13 Damballa, Inc. Data mining to identify malicious activity
US9773039B2 (en) 2012-09-14 2017-09-26 Fti Consulting, Inc. Computer-implemented system and method for identifying near duplicate documents
US9348677B2 (en) 2012-10-22 2016-05-24 Palantir Technologies Inc. System and method for batch evaluation programs
US9256836B2 (en) 2012-10-31 2016-02-09 Open Text Corporation Reconfigurable model for auto-classification system and method
CN103810424B (en) * 2012-11-05 2017-02-08 腾讯科技(深圳)有限公司 Method and device for identifying abnormal application programs
US9087301B2 (en) 2012-12-21 2015-07-21 International Business Machines Corporation Hardware architecture for simulating a neural network of neurons
US9053429B2 (en) * 2012-12-21 2015-06-09 International Business Machines Corporation Mapping neural dynamics of a neural model on to a coarsely grained look-up table
US10354187B2 (en) 2013-01-17 2019-07-16 Hewlett Packard Enterprise Development Lp Confidentiality of files using file vectorization and machine learning
US9380431B1 (en) 2013-01-31 2016-06-28 Palantir Technologies, Inc. Use of teams in a mobile application
US8966637B2 (en) 2013-02-08 2015-02-24 PhishMe, Inc. Performance benchmarking for simulated phishing attacks
US9356948B2 (en) 2013-02-08 2016-05-31 PhishMe, Inc. Collaborative phishing attack detection
US9632781B2 (en) 2013-02-26 2017-04-25 Qualcomm Incorporated Vector register addressing and functions based on a scalar register data value
US10037314B2 (en) 2013-03-14 2018-07-31 Palantir Technologies, Inc. Mobile reports
US9965937B2 (en) 2013-03-15 2018-05-08 Palantir Technologies Inc. External malware data item clustering and analysis
US8868486B2 (en) 2013-03-15 2014-10-21 Palantir Technologies Inc. Time-sensitive cube
US10275778B1 (en) 2013-03-15 2019-04-30 Palantir Technologies Inc. Systems and user interfaces for dynamic and interactive investigation based on automatic malfeasance clustering of related data in various data structures
US8909656B2 (en) 2013-03-15 2014-12-09 Palantir Technologies Inc. Filter chains with associated multipath views for exploring large data sets
US8788405B1 (en) 2013-03-15 2014-07-22 Palantir Technologies, Inc. Generating data clusters with customizable analysis strategies
US8937619B2 (en) 2013-03-15 2015-01-20 Palantir Technologies Inc. Generating an object time series from data objects
US9230280B1 (en) * 2013-03-15 2016-01-05 Palantir Technologies Inc. Clustering data based on indications of financial malfeasance
US8917274B2 (en) 2013-03-15 2014-12-23 Palantir Technologies Inc. Event matrix based on integrated data
US9122681B2 (en) 2013-03-15 2015-09-01 Gordon Villy Cormack Systems and methods for classifying electronic information using advanced active learning techniques
US9766832B2 (en) 2013-03-15 2017-09-19 Hitachi Data Systems Corporation Systems and methods of locating redundant data using patterns of matching fingerprints
US9204448B2 (en) 2013-03-26 2015-12-01 Empire Technology Department Llc Predictive spectral allocation in mobile networks
US20140304340A1 (en) * 2013-04-08 2014-10-09 International Business Machines Corporation System and method to break communication threads
US8799799B1 (en) 2013-05-07 2014-08-05 Palantir Technologies Inc. Interactive geospatial map
JP6193779B2 (en) * 2013-06-03 2017-09-06 株式会社デンソーアイティーラボラトリ Feature value conversion device, learning device, recognition device, and feature value conversion program
US9342795B1 (en) * 2013-06-05 2016-05-17 Emc Corporation Assisted learning for document classification
US9256611B2 (en) 2013-06-06 2016-02-09 Sepaton, Inc. System and method for multi-scale navigation of data
RU2541123C1 (en) * 2013-06-06 2015-02-10 Закрытое акционерное общество "Лаборатория Касперского" System and method of rating electronic messages to control spam
WO2014200512A1 (en) 2013-06-11 2014-12-18 Empire Technology Development, Llc Smooth transition between predictive and mobile-assisted spectral allocation
US9571511B2 (en) 2013-06-14 2017-02-14 Damballa, Inc. Systems and methods for traffic classification
US8938686B1 (en) 2013-10-03 2015-01-20 Palantir Technologies Inc. Systems and methods for analyzing performance of an entity
US9678973B2 (en) 2013-10-15 2017-06-13 Hitachi Data Systems Corporation Multi-node hybrid deduplication
US11270316B2 (en) * 2013-10-16 2022-03-08 Salesforce.Com, Inc. Systems, methods, and apparatuses for implementing automatic entry of customer relationship management (CRM) data into a CRM database system
US9116975B2 (en) 2013-10-18 2015-08-25 Palantir Technologies Inc. Systems and user interfaces for dynamic and interactive simultaneous querying of multiple data stores
US8924872B1 (en) 2013-10-18 2014-12-30 Palantir Technologies Inc. Overview user interface of emergency call data of a law enforcement agency
US20150169682A1 (en) * 2013-10-18 2015-06-18 Google Inc. Hash Learning
US20150112879A1 (en) * 2013-10-23 2015-04-23 Mastercard International Incorporated Systems and methods for evaluating pricing of real estate
US9021384B1 (en) 2013-11-04 2015-04-28 Palantir Technologies Inc. Interactive vehicle information map
US8868537B1 (en) 2013-11-11 2014-10-21 Palantir Technologies, Inc. Simple web search
US9584989B2 (en) 2013-11-25 2017-02-28 At&T Intellectual Property I, L.P. System and method for crowd-sourcing mobile messaging spam detection and defense
US9105000B1 (en) 2013-12-10 2015-08-11 Palantir Technologies Inc. Aggregating data from a plurality of data sources
US10579647B1 (en) 2013-12-16 2020-03-03 Palantir Technologies Inc. Methods and systems for analyzing entity performance
US10025834B2 (en) 2013-12-16 2018-07-17 Palantir Technologies Inc. Methods and systems for analyzing entity performance
US11403711B1 (en) 2013-12-23 2022-08-02 Massachusetts Mutual Life Insurance Company Method of evaluating heuristics outcome in the underwriting process
US10489861B1 (en) 2013-12-23 2019-11-26 Massachusetts Mutual Life Insurance Company Methods and systems for improving the underwriting process
US10356032B2 (en) 2013-12-26 2019-07-16 Palantir Technologies Inc. System and method for detecting confidential information emails
US8832832B1 (en) 2014-01-03 2014-09-09 Palantir Technologies Inc. IP reputation
US9379951B2 (en) * 2014-01-10 2016-06-28 Instep Software, Llc Method and apparatus for detection of anomalies in integrated parameter systems
CN103793908A (en) * 2014-01-17 2014-05-14 首都医科大学 Method for constructing prediction model of multifunctional veins based on brain nuclear magnetic resonance image
US20150206064A1 (en) * 2014-01-19 2015-07-23 Jacob Levman Method for supervised machine learning
US9483162B2 (en) 2014-02-20 2016-11-01 Palantir Technologies Inc. Relationship visualizations
US9009827B1 (en) 2014-02-20 2015-04-14 Palantir Technologies Inc. Security sharing system
US9727376B1 (en) 2014-03-04 2017-08-08 Palantir Technologies, Inc. Mobile tasks
US8935201B1 (en) 2014-03-18 2015-01-13 Palantir Technologies Inc. Determining and extracting changed data from a data source
US9857958B2 (en) 2014-04-28 2018-01-02 Palantir Technologies Inc. Systems and user interfaces for dynamic and interactive access of, investigation of, and analysis of data objects stored in one or more databases
US10796243B2 (en) 2014-04-28 2020-10-06 Hewlett Packard Enterprise Development Lp Network flow classification
US10333877B2 (en) * 2014-04-29 2019-06-25 At&T Intellectual Property I, L.P. Methods of generating signatures from groups of electronic messages and related methods and systems for identifying spam messages
US9928465B2 (en) 2014-05-20 2018-03-27 Oath Inc. Machine learning and validation of account names, addresses, and/or identifiers
US9349016B1 (en) 2014-06-06 2016-05-24 Dell Software Inc. System and method for user-context-based data loss prevention
US9619557B2 (en) 2014-06-30 2017-04-11 Palantir Technologies, Inc. Systems and methods for key phrase characterization of documents
US9535974B1 (en) 2014-06-30 2017-01-03 Palantir Technologies Inc. Systems and methods for identifying key phrase clusters within documents
US10572496B1 (en) 2014-07-03 2020-02-25 Palantir Technologies Inc. Distributed workflow system and database with access controls for city resiliency
US9202249B1 (en) * 2014-07-03 2015-12-01 Palantir Technologies Inc. Data item clustering and analysis
US9256664B2 (en) 2014-07-03 2016-02-09 Palantir Technologies Inc. System and method for news events detection and visualization
US9419992B2 (en) 2014-08-13 2016-08-16 Palantir Technologies Inc. Unwanted tunneling alert system
WO2016036163A2 (en) * 2014-09-03 2016-03-10 삼성전자 주식회사 Method and apparatus for learning and recognizing audio signal
US9454281B2 (en) 2014-09-03 2016-09-27 Palantir Technologies Inc. System for providing dynamic linked panels in user interface
US9426173B2 (en) * 2014-09-05 2016-08-23 Luminoso Technologies, Inc. System and method for elimination of spam in a data stream according to information density
US9767172B2 (en) 2014-10-03 2017-09-19 Palantir Technologies Inc. Data aggregation and analysis system
US9501851B2 (en) 2014-10-03 2016-11-22 Palantir Technologies Inc. Time-series analysis system
US9996603B2 (en) * 2014-10-14 2018-06-12 Adobe Systems Inc. Detecting homologies in encrypted and unencrypted documents using fuzzy hashing
US9984133B2 (en) 2014-10-16 2018-05-29 Palantir Technologies Inc. Schematic and database linking system
US9805099B2 (en) * 2014-10-30 2017-10-31 The Johns Hopkins University Apparatus and method for efficient identification of code similarity
US9229952B1 (en) 2014-11-05 2016-01-05 Palantir Technologies, Inc. History preserving data pipeline system and method
US9043894B1 (en) 2014-11-06 2015-05-26 Palantir Technologies Inc. Malicious software detection in a computing system
US9792289B2 (en) * 2014-11-07 2017-10-17 Semandex Networks Inc. Systems and methods for file clustering, multi-drive forensic analysis and data protection
EP3222024A1 (en) * 2014-11-21 2017-09-27 Bluvector, Inc. System and method for network data characterization
US10409909B2 (en) 2014-12-12 2019-09-10 Omni Ai, Inc. Lexical analyzer for a neuro-linguistic behavior recognition system
US10409910B2 (en) 2014-12-12 2019-09-10 Omni Ai, Inc. Perceptual associative memory for a neuro-linguistic behavior recognition system
CN104699747B (en) * 2014-12-15 2017-10-31 宁波大学 A kind of approximate member's querying method based on high dimensional data filter
US9367872B1 (en) 2014-12-22 2016-06-14 Palantir Technologies Inc. Systems and user interfaces for dynamic and interactive investigation of bad actor behavior based on automatic clustering of related data in various data structures
US9348920B1 (en) 2014-12-22 2016-05-24 Palantir Technologies Inc. Concept indexing among database of documents using machine learning techniques
US10552994B2 (en) 2014-12-22 2020-02-04 Palantir Technologies Inc. Systems and interactive user interfaces for dynamic retrieval, analysis, and triage of data items
US10362133B1 (en) 2014-12-22 2019-07-23 Palantir Technologies Inc. Communication data processing architecture
US10452651B1 (en) 2014-12-23 2019-10-22 Palantir Technologies Inc. Searching charts
US9665716B2 (en) * 2014-12-23 2017-05-30 Mcafee, Inc. Discovery of malicious strings
US9335911B1 (en) 2014-12-29 2016-05-10 Palantir Technologies Inc. Interactive user interface for dynamic data analysis exploration and query processing
US9817563B1 (en) 2014-12-29 2017-11-14 Palantir Technologies Inc. System and method of generating data points from one or more data stores of data items for chart creation and manipulation
US9870205B1 (en) 2014-12-29 2018-01-16 Palantir Technologies Inc. Storing logical units of program code generated using a dynamic programming notebook user interface
US9519679B2 (en) 2015-01-19 2016-12-13 Sas Institute Inc. Techniques for query homogenization in cache operations
US9727560B2 (en) 2015-02-25 2017-08-08 Palantir Technologies Inc. Systems and methods for organizing and identifying documents via hierarchies and dimensions of tags
US10326748B1 (en) 2015-02-25 2019-06-18 Quest Software Inc. Systems and methods for event-based authentication
US11275747B2 (en) * 2015-03-12 2022-03-15 Yahoo Assets Llc System and method for improved server performance for a deep feature based coarse-to-fine fast search
CN106034146B (en) * 2015-03-12 2019-10-22 阿里巴巴集团控股有限公司 Information interacting method and system
US9891808B2 (en) 2015-03-16 2018-02-13 Palantir Technologies Inc. Interactive user interfaces for location-based data analysis
US10417613B1 (en) 2015-03-17 2019-09-17 Quest Software Inc. Systems and methods of patternizing logged user-initiated events for scheduling functions
US9886467B2 (en) 2015-03-19 2018-02-06 Plantir Technologies Inc. System and method for comparing and visualizing data entities and data entity series
US9930065B2 (en) 2015-03-25 2018-03-27 University Of Georgia Research Foundation, Inc. Measuring, categorizing, and/or mitigating malware distribution paths
US9990506B1 (en) 2015-03-30 2018-06-05 Quest Software Inc. Systems and methods of securing network-accessible peripheral devices
US9641555B1 (en) 2015-04-10 2017-05-02 Dell Software Inc. Systems and methods of tracking content-exposure events
US10298602B2 (en) * 2015-04-10 2019-05-21 Cofense Inc. Suspicious message processing and incident response
US9563782B1 (en) 2015-04-10 2017-02-07 Dell Software Inc. Systems and methods of secure self-service access to content
US9842218B1 (en) 2015-04-10 2017-12-12 Dell Software Inc. Systems and methods of secure self-service access to content
US9842220B1 (en) 2015-04-10 2017-12-12 Dell Software Inc. Systems and methods of secure self-service access to content
US9906539B2 (en) 2015-04-10 2018-02-27 PhishMe, Inc. Suspicious message processing and incident response
US9569626B1 (en) 2015-04-10 2017-02-14 Dell Software Inc. Systems and methods of reporting content-exposure events
US20160314184A1 (en) * 2015-04-27 2016-10-27 Google Inc. Classifying documents by cluster
US10103953B1 (en) 2015-05-12 2018-10-16 Palantir Technologies Inc. Methods and systems for analyzing entity performance
JP6344311B2 (en) * 2015-05-26 2018-06-20 ソニー株式会社 Display device, information processing system, and control method
US9734436B2 (en) 2015-06-05 2017-08-15 At&T Intellectual Property I, L.P. Hash codes for images
US9672257B2 (en) 2015-06-05 2017-06-06 Palantir Technologies Inc. Time-series data storage and processing database system
US9384203B1 (en) 2015-06-09 2016-07-05 Palantir Technologies Inc. Systems and methods for indexing and aggregating data records
US10242001B2 (en) 2015-06-19 2019-03-26 Gordon V. Cormack Systems and methods for conducting and terminating a technology-assisted review
US9407652B1 (en) 2015-06-26 2016-08-02 Palantir Technologies Inc. Network anomaly detection
US10001765B2 (en) 2015-07-02 2018-06-19 buildpulse, Inc. Advanced identification and classification of sensors and other points in a building automation system
US10509770B2 (en) 2015-07-13 2019-12-17 Samsung Electronics Co., Ltd. Heuristic interface for enabling a computer device to utilize data property-based data placement inside a nonvolatile memory device
US10282324B2 (en) * 2015-07-13 2019-05-07 Samsung Electronics Co., Ltd. Smart I/O stream detection based on multiple attributes
US11461010B2 (en) 2015-07-13 2022-10-04 Samsung Electronics Co., Ltd. Data property-based data placement in a nonvolatile memory device
US9454785B1 (en) 2015-07-30 2016-09-27 Palantir Technologies Inc. Systems and user interfaces for holistic, data-driven investigation of bad actor behavior based on clustering and scoring of related data
US9996595B2 (en) 2015-08-03 2018-06-12 Palantir Technologies, Inc. Providing full data provenance visualization for versioned datasets
US10536352B1 (en) 2015-08-05 2020-01-14 Quest Software Inc. Systems and methods for tuning cross-platform data collection
US9456000B1 (en) 2015-08-06 2016-09-27 Palantir Technologies Inc. Systems, methods, user interfaces, and computer-readable media for investigating potential malicious communications
US10489391B1 (en) 2015-08-17 2019-11-26 Palantir Technologies Inc. Systems and methods for grouping and enriching data items accessed from one or more databases for presentation in a user interface
US9600146B2 (en) 2015-08-17 2017-03-21 Palantir Technologies Inc. Interactive geospatial map
US9537880B1 (en) 2015-08-19 2017-01-03 Palantir Technologies Inc. Anomalous network monitoring, user behavior detection and database system
US10853378B1 (en) 2015-08-25 2020-12-01 Palantir Technologies Inc. Electronic note management via a connected entity graph
US11150917B2 (en) 2015-08-26 2021-10-19 Palantir Technologies Inc. System for data aggregation and analysis of data from a plurality of data sources
US9485265B1 (en) 2015-08-28 2016-11-01 Palantir Technologies Inc. Malicious activity detection system capable of efficiently processing data accessed from databases and generating alerts for display in interactive user interfaces
US10706434B1 (en) 2015-09-01 2020-07-07 Palantir Technologies Inc. Methods and systems for determining location information
US9576015B1 (en) 2015-09-09 2017-02-21 Palantir Technologies, Inc. Domain-specific language for dataset transformations
US9454564B1 (en) 2015-09-09 2016-09-27 Palantir Technologies Inc. Data integrity checks
WO2017053779A1 (en) 2015-09-24 2017-03-30 Trustees Of Boston University Data storage and retrieval system using online supervised hashing
US10417579B2 (en) 2015-09-25 2019-09-17 Mcafee, Inc. Multi-label classification for overlapping classes
RU2632133C2 (en) 2015-09-29 2017-10-02 Общество С Ограниченной Ответственностью "Яндекс" Method (versions) and system (versions) for creating prediction model and determining prediction model accuracy
RU2622626C2 (en) * 2015-09-30 2017-06-16 Акционерное общество "Лаборатория Касперского" System and method for detecting phishing scripts
US10218588B1 (en) 2015-10-05 2019-02-26 Quest Software Inc. Systems and methods for multi-stream performance patternization and optimization of virtual meetings
US10296617B1 (en) 2015-10-05 2019-05-21 Palantir Technologies Inc. Searches of highly structured data
US10157358B1 (en) 2015-10-05 2018-12-18 Quest Software Inc. Systems and methods for multi-stream performance patternization and interval-based prediction
US10044745B1 (en) 2015-10-12 2018-08-07 Palantir Technologies, Inc. Systems for computer network security risk assessment including user compromise analysis associated with a network of devices
US10089382B2 (en) * 2015-10-19 2018-10-02 Conduent Business Services, Llc Transforming a knowledge base into a machine readable format for an automated system
US10534326B2 (en) 2015-10-21 2020-01-14 Johnson Controls Technology Company Building automation system with integrated building information model
US10630631B1 (en) 2015-10-28 2020-04-21 Wells Fargo Bank, N.A. Message content cleansing
US10783268B2 (en) 2015-11-10 2020-09-22 Hewlett Packard Enterprise Development Lp Data allocation based on secure information retrieval
WO2017096454A1 (en) * 2015-12-07 2017-06-15 Adlib Publishing Systems Inc. Clustering documents based on textual content
US9542446B1 (en) 2015-12-17 2017-01-10 Palantir Technologies, Inc. Automatic generation of composite datasets based on hierarchical fields
US9823818B1 (en) 2015-12-29 2017-11-21 Palantir Technologies Inc. Systems and interactive user interfaces for automatic generation of temporal representation of data objects
US9612723B1 (en) 2015-12-30 2017-04-04 Palantir Technologies Inc. Composite graphical interface with shareable data-objects
US11947785B2 (en) 2016-01-22 2024-04-02 Johnson Controls Technology Company Building system with a building graph
US11268732B2 (en) 2016-01-22 2022-03-08 Johnson Controls Technology Company Building energy management system with energy analytics
WO2017137439A1 (en) * 2016-02-08 2017-08-17 Koninklijke Philips N.V. Device for and method of determining clusters
US10061845B2 (en) 2016-02-18 2018-08-28 Fmr Llc Analysis of unstructured computer text to generate themes and determine sentiment
US10698938B2 (en) 2016-03-18 2020-06-30 Palantir Technologies Inc. Systems and methods for organizing and identifying documents via hierarchies and dimensions of tags
US9818067B2 (en) * 2016-03-24 2017-11-14 Accenture Global Solutions Limited Self-learning log classification system
US11062236B2 (en) * 2016-03-24 2021-07-13 Accenture Global Solutions Limited Self-learning analytical attribute and clustering segmentation system
US10528889B2 (en) * 2016-03-25 2020-01-07 Futurewei Technologies, Inc. Stereoscopic learning for classification
US10142391B1 (en) 2016-03-25 2018-11-27 Quest Software Inc. Systems and methods of diagnosing down-layer performance problems via multi-stream performance patternization
CN105893477A (en) * 2016-03-25 2016-08-24 中国科学技术大学 Distance preserving Hash method based on double-circuit neural network
US11768004B2 (en) 2016-03-31 2023-09-26 Johnson Controls Tyco IP Holdings LLP HVAC device registration in a distributed building management system
US10580012B2 (en) * 2016-03-31 2020-03-03 ZenDesk, Inc. Article-suggestion system for automatically resolving customer-service requests
KR102179399B1 (en) * 2016-05-02 2020-11-16 삼성전자주식회사 Smart i/o stream detection based on multiple attributes
US10417451B2 (en) 2017-09-27 2019-09-17 Johnson Controls Technology Company Building system with smart entity personal identifying information (PII) masking
US11774920B2 (en) 2016-05-04 2023-10-03 Johnson Controls Technology Company Building system with user presentation composition based on building context
US10505756B2 (en) 2017-02-10 2019-12-10 Johnson Controls Technology Company Building management system with space graphs
US10901373B2 (en) 2017-06-15 2021-01-26 Johnson Controls Technology Company Building management system with artificial intelligence for unified agent based control of building subsystems
CN106682506B (en) * 2016-05-06 2020-03-17 腾讯科技(深圳)有限公司 Virus program detection method and terminal
US11068546B2 (en) 2016-06-02 2021-07-20 Nuix North America Inc. Computer-implemented system and method for analyzing clusters of coded documents
US10007674B2 (en) 2016-06-13 2018-06-26 Palantir Technologies Inc. Data revision control in large-scale data analytic systems
US20170372371A1 (en) * 2016-06-23 2017-12-28 International Business Machines Corporation Machine learning to manage contact with an inactive customer to increase activity of the customer
RU2634180C1 (en) * 2016-06-24 2017-10-24 Акционерное общество "Лаборатория Касперского" System and method for determining spam-containing message by topic of message sent via e-mail
US9552548B1 (en) * 2016-07-01 2017-01-24 Intraspexion Inc. Using classified text and deep learning algorithms to identify risk and provide early warning
US10324609B2 (en) 2016-07-21 2019-06-18 Palantir Technologies Inc. System for providing dynamic linked panels in user interface
US10719188B2 (en) 2016-07-21 2020-07-21 Palantir Technologies Inc. Cached database and synchronization system for providing dynamic linked panels in user interface
JP6708043B2 (en) * 2016-07-28 2020-06-10 富士通株式会社 Data search program, data search method, and data search device
US9753935B1 (en) 2016-08-02 2017-09-05 Palantir Technologies Inc. Time-series data storage and processing database system
US10789119B2 (en) 2016-08-04 2020-09-29 Servicenow, Inc. Determining root-cause of failures based on machine-generated textual data
US10963634B2 (en) * 2016-08-04 2021-03-30 Servicenow, Inc. Cross-platform classification of machine-generated textual data
US10438133B2 (en) * 2016-08-17 2019-10-08 Global Eprocure Spend data enrichment and classification
US10437840B1 (en) 2016-08-19 2019-10-08 Palantir Technologies Inc. Focused probabilistic entity resolution from multiple data sources
US10657182B2 (en) * 2016-09-20 2020-05-19 International Business Machines Corporation Similar email spam detection
US11080301B2 (en) 2016-09-28 2021-08-03 Hewlett Packard Enterprise Development Lp Storage allocation based on secure data comparisons via multiple intermediaries
RU2638710C1 (en) 2016-10-10 2017-12-15 Акционерное общество "Лаборатория Касперского" Methods of detecting malicious elements of web pages
US10542017B1 (en) * 2016-10-13 2020-01-21 Symantec Corporation Systems and methods for personalizing security incident reports
US10133588B1 (en) 2016-10-20 2018-11-20 Palantir Technologies Inc. Transforming instructions for collaborative updates
US10929775B2 (en) 2016-10-26 2021-02-23 Accenture Global Solutions Limited Statistical self learning archival system
US10853858B2 (en) * 2016-10-28 2020-12-01 Walmart Apollo, Llc Systems and methods for optimizing normalization of product attributes for a webpage of an online retailer
US10318630B1 (en) 2016-11-21 2019-06-11 Palantir Technologies Inc. Analysis of large bodies of textual data
US10594640B2 (en) * 2016-12-01 2020-03-17 Oath Inc. Message classification
US11205103B2 (en) 2016-12-09 2021-12-21 The Research Foundation for the State University Semisupervised autoencoder for sentiment analysis
US10884875B2 (en) 2016-12-15 2021-01-05 Palantir Technologies Inc. Incremental backup of computer data files
US10620618B2 (en) 2016-12-20 2020-04-14 Palantir Technologies Inc. Systems and methods for determining relationships between defects
US10223099B2 (en) 2016-12-21 2019-03-05 Palantir Technologies Inc. Systems and methods for peer-to-peer build sharing
US10460602B1 (en) 2016-12-28 2019-10-29 Palantir Technologies Inc. Interactive vehicle information mapping system
US10684033B2 (en) 2017-01-06 2020-06-16 Johnson Controls Technology Company HVAC system with automated device pairing
US20220292069A1 (en) * 2017-01-23 2022-09-15 Oliver Wendel Gamble Method and System for Enhancement and Cross Relating Messages Received and Stored on a Mobile Device
US10636048B2 (en) * 2017-01-27 2020-04-28 Oath Inc. Name-based classification of electronic account users
US11900287B2 (en) 2017-05-25 2024-02-13 Johnson Controls Tyco IP Holdings LLP Model predictive maintenance system with budgetary constraints
US10515098B2 (en) 2017-02-10 2019-12-24 Johnson Controls Technology Company Building management smart entity creation and maintenance using time series data
US11360447B2 (en) 2017-02-10 2022-06-14 Johnson Controls Technology Company Building smart entity system with agent based communication and control
US20190361412A1 (en) 2017-02-10 2019-11-28 Johnson Controls Technology Company Building smart entity system with agent based data ingestion and entity creation using time series data
US10854194B2 (en) 2017-02-10 2020-12-01 Johnson Controls Technology Company Building system with digital twin based data ingestion and processing
US11764991B2 (en) 2017-02-10 2023-09-19 Johnson Controls Technology Company Building management system with identity management
WO2018148591A1 (en) * 2017-02-10 2018-08-16 Secured FTP Hosting, LLC d/b/a SmartFile System for describing and tracking the creation and evolution of digital files
US10169486B2 (en) 2017-02-10 2019-01-01 Johnson Controls Technology Company Building management system with timeseries processing
US10325224B1 (en) 2017-03-23 2019-06-18 Palantir Technologies Inc. Systems and methods for selecting machine learning training data
WO2018175912A1 (en) 2017-03-24 2018-09-27 Johnson Controls Technology Company Building management system with dynamic channel communication
US10606866B1 (en) 2017-03-30 2020-03-31 Palantir Technologies Inc. Framework for exposing network activities
US11327737B2 (en) 2017-04-21 2022-05-10 Johnson Controls Tyco IP Holdings LLP Building management system with cloud management of gateway configurations
US10235461B2 (en) 2017-05-02 2019-03-19 Palantir Technologies Inc. Automated assistance for generating relevant and valuable search results for an entity of interest
US10482382B2 (en) 2017-05-09 2019-11-19 Palantir Technologies Inc. Systems and methods for reducing manufacturing failure rates
US10788229B2 (en) 2017-05-10 2020-09-29 Johnson Controls Technology Company Building management system with a distributed blockchain database
US10896097B1 (en) 2017-05-25 2021-01-19 Palantir Technologies Inc. Approaches for backup and restoration of integrated databases
GB201708818D0 (en) 2017-06-02 2017-07-19 Palantir Technologies Inc Systems and methods for retrieving and processing data
US11022947B2 (en) 2017-06-07 2021-06-01 Johnson Controls Technology Company Building energy optimization system with economic load demand response (ELDR) optimization and ELDR user interfaces
US10956406B2 (en) 2017-06-12 2021-03-23 Palantir Technologies Inc. Propagated deletion of database records and derived data
WO2019018304A1 (en) 2017-07-17 2019-01-24 Johnson Controls Technology Company Systems and methods for agent based building simulation for optimal control
US10403011B1 (en) 2017-07-18 2019-09-03 Palantir Technologies Inc. Passing system with an interactive user interface
EP3655824A1 (en) 2017-07-21 2020-05-27 Johnson Controls Technology Company Building management system with dynamic work order generation with adaptive diagnostic task details
US20190034066A1 (en) 2017-07-27 2019-01-31 Johnson Controls Technology Company Building management system with central plantroom dashboards
US11334552B2 (en) 2017-07-31 2022-05-17 Palantir Technologies Inc. Lightweight redundancy tool for performing transactions
US11531859B2 (en) 2017-08-08 2022-12-20 Samsung Electronics Co., Ltd. System and method for hashed compressed weighting matrix in neural networks
US10417224B2 (en) 2017-08-14 2019-09-17 Palantir Technologies Inc. Time series database processing system
US10708297B2 (en) 2017-08-25 2020-07-07 Ecrime Management Strategies, Inc. Security system for detection and mitigation of malicious communications
US11227108B2 (en) * 2017-09-11 2022-01-18 Nec Corporation Convolutional neural network architecture with adaptive filters
US10311368B2 (en) * 2017-09-12 2019-06-04 Sas Institute Inc. Analytic system for graphical interpretability of and improvement of machine learning models
EP3675433A4 (en) * 2017-09-14 2020-09-30 Mitsubishi Electric Corporation Email inspection device, email inspection method, and email inspection program
US10216695B1 (en) 2017-09-21 2019-02-26 Palantir Technologies Inc. Database system for time series data storage, processing, and analysis
US11314788B2 (en) 2017-09-27 2022-04-26 Johnson Controls Tyco IP Holdings LLP Smart entity management for building management systems
US11120012B2 (en) 2017-09-27 2021-09-14 Johnson Controls Tyco IP Holdings LLP Web services platform with integration and interface of smart entities with enterprise applications
US10559181B2 (en) 2017-09-27 2020-02-11 Johnson Controls Technology Company Building risk analysis system with risk combination for multiple threats
US10962945B2 (en) 2017-09-27 2021-03-30 Johnson Controls Technology Company Building management system with integration of data into smart entities
US11281169B2 (en) 2017-11-15 2022-03-22 Johnson Controls Tyco IP Holdings LLP Building management system with point virtualization for online meters
US10809682B2 (en) 2017-11-15 2020-10-20 Johnson Controls Technology Company Building management system with optimized processing of building system data
US11127235B2 (en) 2017-11-22 2021-09-21 Johnson Controls Tyco IP Holdings LLP Building campus with integrated smart environment
KR101980977B1 (en) * 2017-11-23 2019-05-21 성균관대학교산학협력단 Method for User based Application Grouping under Multi-User Environment and Table Top Display Apparatus for Performing the Same
RU2693324C2 (en) 2017-11-24 2019-07-02 Общество С Ограниченной Ответственностью "Яндекс" Method and a server for converting a categorical factor value into its numerical representation
US11201882B2 (en) * 2017-11-30 2021-12-14 Nec Corporation Of America Detection of malicious network activity
US10614069B2 (en) 2017-12-01 2020-04-07 Palantir Technologies Inc. Workflow driven database partitioning
US11281726B2 (en) 2017-12-01 2022-03-22 Palantir Technologies Inc. System and methods for faster processor comparisons of visual graph features
US11016986B2 (en) 2017-12-04 2021-05-25 Palantir Technologies Inc. Query-based time-series data display and processing system
WO2019140390A1 (en) * 2018-01-12 2019-07-18 Gamalon, Inc. Probabilistic modeling system and method
US11030705B1 (en) * 2018-02-28 2021-06-08 Intuit Inc. Quick serve tax application
US11599369B1 (en) 2018-03-08 2023-03-07 Palantir Technologies Inc. Graphical user interface configuration system
US11954713B2 (en) 2018-03-13 2024-04-09 Johnson Controls Tyco IP Holdings LLP Variable refrigerant flow system with electricity consumption apportionment
US10834128B1 (en) * 2018-03-30 2020-11-10 Fireeye, Inc. System and method for identifying phishing cyber-attacks through deep machine learning via a convolutional neural network (CNN) engine
US10754822B1 (en) 2018-04-18 2020-08-25 Palantir Technologies Inc. Systems and methods for ontology migration
US10885021B1 (en) 2018-05-02 2021-01-05 Palantir Technologies Inc. Interactive interpreter and graphical user interface
GB201807534D0 (en) 2018-05-09 2018-06-20 Palantir Technologies Inc Systems and methods for indexing and searching
US11544445B2 (en) 2018-05-18 2023-01-03 Visa International Service Association System, method, and computer program product for classifying service request messages
US11372893B2 (en) 2018-06-01 2022-06-28 Ntt Security Holdings Corporation Ensemble-based data curation pipeline for efficient label propagation
US11431475B2 (en) 2018-06-15 2022-08-30 Dynatrace Llc Method and system for log data analytics based on SuperMinHash signatures
US11119630B1 (en) 2018-06-19 2021-09-14 Palantir Technologies Inc. Artificial intelligence assisted evaluations and user interface for same
US11521108B2 (en) 2018-07-30 2022-12-06 Microsoft Technology Licensing, Llc Privacy-preserving labeling and classification of email
US11159469B2 (en) * 2018-09-12 2021-10-26 Commvault Systems, Inc. Using machine learning to modify presentation of mailbox objects
KR102211785B1 (en) 2018-09-17 2021-02-04 데이터로그, 엘엘씨 Log inspection system and related methods
KR102103518B1 (en) * 2018-09-18 2020-04-22 이승일 A system that generates text and picture data from video data using artificial intelligence
US11611440B2 (en) * 2018-09-20 2023-03-21 Intralinks, Inc. Deal room platform using blockchain
US11416518B2 (en) 2018-10-30 2022-08-16 Optum, Inc. Machine learning for machine-assisted data classification
US11016648B2 (en) 2018-10-30 2021-05-25 Johnson Controls Technology Company Systems and methods for entity visualization and management with an entity node editor
US11927925B2 (en) 2018-11-19 2024-03-12 Johnson Controls Tyco IP Holdings LLP Building system with a time correlated reliability data stream
US20200234220A1 (en) 2019-01-18 2020-07-23 Johnson Controls Technology Company Smart building automation system with employee productivity features
JP7338160B2 (en) * 2019-01-25 2023-09-05 日本電気株式会社 Electronic mail transmission determination device, electronic mail transmission determination method, and electronic mail transmission determination program
US10788798B2 (en) 2019-01-28 2020-09-29 Johnson Controls Technology Company Building management system with hybrid edge-cloud processing
US11094405B2 (en) * 2019-01-30 2021-08-17 International Business Machines Corporation Cognitive care plan recommendation system
US11645043B2 (en) * 2019-02-11 2023-05-09 Dynatrace Llc Method and system for calculating minwise hash signatures from weighted sets
CN109889891B (en) * 2019-03-05 2023-03-24 腾讯科技(深圳)有限公司 Method, device and storage medium for acquiring target media file
CN110110080A (en) * 2019-03-29 2019-08-09 平安科技(深圳)有限公司 Textual classification model training method, device, computer equipment and storage medium
US11080309B2 (en) 2019-04-04 2021-08-03 Paypal, Inc. Validating cluster results
US11308036B2 (en) * 2019-04-11 2022-04-19 EMC IP Holding Company LLC Selection of digest hash function for different data sets
CN112099725A (en) * 2019-06-17 2020-12-18 华为技术有限公司 Data processing method and device and computer readable storage medium
CN111143625B (en) * 2019-09-03 2023-04-25 西北工业大学 Cross-modal retrieval method based on semi-supervised multi-modal hash coding
KR102198462B1 (en) * 2019-12-09 2021-01-05 주식회사 티맥스소프트 Design remodeling testing method for remodeling text-based display design to thereby achieve graphic-based display design and design remodeling testing device using the same
US11095600B2 (en) 2019-12-09 2021-08-17 Oracle International Corporation End-to-end email tag prediction
CN111159409B (en) * 2019-12-31 2023-06-02 腾讯科技(深圳)有限公司 Text classification method, device, equipment and medium based on artificial intelligence
US11894944B2 (en) 2019-12-31 2024-02-06 Johnson Controls Tyco IP Holdings LLP Building data platform with an enrichment loop
US20210200792A1 (en) 2019-12-31 2021-07-01 Johnson Controls Technology Company Building data platform with graph projections
US11556878B2 (en) 2020-02-05 2023-01-17 Accenture Global Solutions Limited Systems and methods for managing organizational structures
US11537386B2 (en) 2020-04-06 2022-12-27 Johnson Controls Tyco IP Holdings LLP Building system with dynamic configuration of network resources for 5G networks
KR102322125B1 (en) * 2020-05-19 2021-11-04 국방과학연구소 Parameter estimation method for decompression of compressed data
CN111611261B (en) * 2020-05-24 2023-09-01 山东三宏信息科技有限公司 Garbage classification and identification system based on text decoupling
US11874809B2 (en) 2020-06-08 2024-01-16 Johnson Controls Tyco IP Holdings LLP Building system with naming schema encoding entity type and entity relationships
US11528347B2 (en) * 2020-06-25 2022-12-13 Nokia Solutions And Networks Oy Inter-packet communication of machine learning information
CN111767707B (en) * 2020-06-30 2023-10-31 平安科技(深圳)有限公司 Method, device, equipment and storage medium for detecting Leideogue cases
KR102473854B1 (en) * 2020-07-02 2022-12-02 주식회사 에프에스 Industrial occupation code classificational system and the method including the same
US11775408B2 (en) * 2020-08-03 2023-10-03 Adp, Inc. Sparse intent clustering through deep context encoders
JP7212017B2 (en) * 2020-09-18 2023-01-24 ヤフー株式会社 Information processing device, system, learning device, information processing method, and program
US11397773B2 (en) 2020-09-30 2022-07-26 Johnson Controls Tyco IP Holdings LLP Building management system with semantic model integration
US11954154B2 (en) 2020-09-30 2024-04-09 Johnson Controls Tyco IP Holdings LLP Building management system with semantic model integration
CN112347246B (en) * 2020-10-15 2024-04-02 中科曙光南京研究院有限公司 Self-adaptive document clustering method and system based on spectrum decomposition
US20220138492A1 (en) 2020-10-30 2022-05-05 Johnson Controls Technology Company Data preprocessing and refinement tool
US11544177B2 (en) * 2020-11-19 2023-01-03 Ebay Inc. Mapping of test cases to test data for computer software testing
US20220215345A1 (en) * 2021-01-07 2022-07-07 Verizon Media Inc. Computerized system and method for multi-class, multi-label classification of electronic messages
US11348617B1 (en) 2021-03-08 2022-05-31 Bank Of America Corporation System for implementing content retrofitting using information vectorization
EP4060962A1 (en) * 2021-03-15 2022-09-21 AO Kaspersky Lab System and method for creating a signature of a spam message
JP2024511974A (en) 2021-03-17 2024-03-18 ジョンソン・コントロールズ・タイコ・アイピー・ホールディングス・エルエルピー System and method for determining equipment energy waste
US11798258B2 (en) 2021-05-03 2023-10-24 Bank Of America Corporation Automated categorization and assembly of low-quality images into electronic documents
US11704352B2 (en) 2021-05-03 2023-07-18 Bank Of America Corporation Automated categorization and assembly of low-quality images into electronic documents
CN113190929B (en) * 2021-05-21 2022-12-30 北京理工大学 Workpiece family construction method of delayed reconstruction manufacturing system based on machine learning
US11769066B2 (en) 2021-11-17 2023-09-26 Johnson Controls Tyco IP Holdings LLP Building data platform with digital twin triggers and actions
US11899723B2 (en) 2021-06-22 2024-02-13 Johnson Controls Tyco IP Holdings LLP Building data platform with context based twin function processing
US11847598B2 (en) * 2021-08-13 2023-12-19 Edgeverve Systems Limited Method and system for analyzing process flows for a process performed by users
US20230076010A1 (en) * 2021-08-23 2023-03-09 Paypal, Inc. Hardline Threshold Softening
US11881041B2 (en) 2021-09-02 2024-01-23 Bank Of America Corporation Automated categorization and processing of document images of varying degrees of quality
US20230104862A1 (en) * 2021-09-28 2023-04-06 Red Hat, Inc. Systems and methods for identifying computing devices
US11656881B2 (en) 2021-10-21 2023-05-23 Abbyy Development Inc. Detecting repetitive patterns of user interface actions
US11796974B2 (en) 2021-11-16 2023-10-24 Johnson Controls Tyco IP Holdings LLP Building data platform with schema extensibility for properties and tags of a digital twin
US11934966B2 (en) 2021-11-17 2024-03-19 Johnson Controls Tyco IP Holdings LLP Building data platform with digital twin inferences
US11704311B2 (en) 2021-11-24 2023-07-18 Johnson Controls Tyco IP Holdings LLP Building data platform with a distributed digital twin
US11714930B2 (en) 2021-11-29 2023-08-01 Johnson Controls Tyco IP Holdings LLP Building data platform with digital twin based inferences and predictions for a graphical building model
CN114338437B (en) * 2022-01-13 2023-12-29 北京邮电大学 Network traffic classification method and device, electronic equipment and storage medium
CN116595164B (en) * 2023-07-17 2023-10-31 浪潮通用软件有限公司 Method, system, equipment and storage medium for generating bill abstract information
CN117235580A (en) * 2023-09-26 2023-12-15 复旦大学 Sample type detection and category confidence determination model generation method and device

Family Cites Families (44)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US1261167A (en) 1917-10-25 1918-04-02 Robert C Russell Index.
US5032987A (en) 1988-08-04 1991-07-16 Digital Equipment Corporation System with a plurality of hash tables each using different adaptive hashing functions
US6088039A (en) 1996-01-04 2000-07-11 Compaq Computer Corporation Compression of grey scale images of text
US5909677A (en) 1996-06-18 1999-06-01 Digital Equipment Corporation Method for determining the resemblance of documents
US5991808A (en) 1997-06-02 1999-11-23 Digital Equipment Corporation Task processing optimization in a multiprocessor system
US6438740B1 (en) 1997-08-21 2002-08-20 Compaq Information Technologies Group, L.P. System and method for dynamically identifying free registers
US5974481A (en) 1997-09-15 1999-10-26 Digital Equipment Corporation Method for estimating the probability of collisions of fingerprints
US5953503A (en) 1997-10-29 1999-09-14 Digital Equipment Corporation Compression protocol with multiple preset dictionaries
US6711568B1 (en) 1997-11-25 2004-03-23 Krishna Asur Bharat Method for estimating coverage of web search engines
US6389436B1 (en) 1997-12-15 2002-05-14 International Business Machines Corporation Enhanced hypertext categorization using hyperlinks
US6269362B1 (en) 1997-12-19 2001-07-31 Alta Vista Company System and method for monitoring web pages by comparing generated abstracts
US6073135A (en) 1998-03-10 2000-06-06 Alta Vista Company Connectivity server for locating linkage information between Web pages
US6119124A (en) 1998-03-26 2000-09-12 Digital Equipment Corporation Method for clustering closely resembling data objects
US6195698B1 (en) 1998-04-13 2001-02-27 Compaq Computer Corporation Method for selectively restricting access to computer systems
US6385609B1 (en) * 1998-04-23 2002-05-07 Lucent Technologies Inc. System and method for analyzing and displaying telecommunications switch report output
WO2002091211A1 (en) * 2001-05-07 2002-11-14 Biowulf Technologies, Llc Kernels and methods for selecting kernels for use in learning machines
US6161130A (en) * 1998-06-23 2000-12-12 Microsoft Corporation Technique which utilizes a probabilistic classifier to detect "junk" e-mail by automatically updating a training and re-training the classifier based on the updated training set
US6292762B1 (en) 1998-07-13 2001-09-18 Compaq Computer Corporation Method for determining a random permutation of variables by applying a test function
US6665837B1 (en) 1998-08-10 2003-12-16 Overture Services, Inc. Method for identifying related pages in a hyperlinked database
US6445834B1 (en) 1998-10-19 2002-09-03 Sony Corporation Modular image query system
US6256409B1 (en) 1998-10-19 2001-07-03 Sony Corporation Method for determining a correlation between images using multi-element image descriptors
US6349269B1 (en) * 1998-12-11 2002-02-19 Dell U.S.A., L.P. Thermal management data prediction system
US6732149B1 (en) 1999-04-09 2004-05-04 International Business Machines Corporation System and method for hindering undesired transmission or receipt of electronic messages
US6487555B1 (en) 1999-05-07 2002-11-26 Alta Vista Company Method and apparatus for finding mirrored hosts by analyzing connectivity and IP addresses
US6286006B1 (en) 1999-05-07 2001-09-04 Alta Vista Company Method and apparatus for finding mirrored hosts by analyzing urls
US7406603B1 (en) * 1999-08-31 2008-07-29 Intertrust Technologies Corp. Data protection systems and methods
US7685423B1 (en) 2000-02-15 2010-03-23 Silverbrook Research Pty Ltd Validation protocol and system
US6560600B1 (en) 2000-10-25 2003-05-06 Alta Vista Company Method and apparatus for ranking Web page search results
US6658423B1 (en) 2001-01-24 2003-12-02 Google, Inc. Detecting duplicate and near-duplicate files
US6769016B2 (en) * 2001-07-26 2004-07-27 Networks Associates Technology, Inc. Intelligent SPAM detection system using an updateable neural analysis engine
US7333966B2 (en) * 2001-12-21 2008-02-19 Thomson Global Resources Systems, methods, and software for hyperlinking names
US7295966B2 (en) * 2002-01-14 2007-11-13 Microsoft Corporation System for normalizing a discourse representation structure and normalized data structure
US7177799B2 (en) * 2002-01-14 2007-02-13 Microsoft Corporation Semantic analysis system for interpreting linguistic structures output by a natural language linguistic analysis system
US7451458B2 (en) * 2002-08-02 2008-11-11 Tuchow Jonathan A Software methods of an optical networking apparatus with multiple multi-protocol optical networking modules having packet filtering resources
US6667700B1 (en) * 2002-10-30 2003-12-23 Nbt Technology, Inc. Content-based segmentation scheme for data compression in storage and transmission including hierarchical segment representation
US6732157B1 (en) * 2002-12-13 2004-05-04 Networks Associates Technology, Inc. Comprehensive anti-spam system, method, and computer program product for filtering unwanted e-mail messages
US7349386B1 (en) * 2003-02-18 2008-03-25 Cisco Technology, Inc. Method and apparatus for transporting MPEG streams on IP networks including removing null packets
US7389178B2 (en) * 2003-12-11 2008-06-17 Greenroad Driving Technologies Ltd. System and method for vehicle driver behavior analysis and evaluation
US7463774B2 (en) * 2004-01-07 2008-12-09 Microsoft Corporation Global localization by fast image matching
US7487321B2 (en) * 2004-04-19 2009-02-03 Cisco Technology, Inc. Method and system for memory leak detection
US7574409B2 (en) * 2004-11-04 2009-08-11 Vericept Corporation Method, apparatus, and system for clustering and classification
JP4592438B2 (en) * 2005-02-08 2010-12-01 株式会社東芝 Semiconductor integrated circuit layout method, manufacturing method, and layout program
US20070112701A1 (en) * 2005-08-15 2007-05-17 Microsoft Corporation Optimization of cascaded classifiers
US7281664B1 (en) * 2005-10-05 2007-10-16 Leapfrog Enterprises, Inc. Method and system for hierarchical management of a plurality of regions of an encoded surface used by a pen computer

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
AIROLDI E. ET AL.: "ScamSlam: An Architecture for Learning the Criminal Relations behind Scam Spam", DATA PRIVACY LAB. INST. FOR SOFTWARE RESEARCH INTL. CMU-ISRI-04-121, May 2004 (2004-05-01), pages 1 - 6, XP008119533 *
AVERY M. ET AL.: "eMailSift:Adapting Graph Mining Techniques for Email Classification", TECHNICAL REPORT CSE 2004-7, July 2004 (2004-07-01), pages 2 - 5 *
DAMIANI E. ET AL.: "An Open Digest based Technique for Spam Detection", PROC. OF 2004 INTL WORKSHOP ON SECURITY IN PARALLEL AND DISTRIBUTED SYSTEMS, September 2004 (2004-09-01), SAN FRANCISCO, CA, pages 1 - 6, XP008119297 *
METZGER J. ET AL.: "A Multiagent based Peer-to-Peer Network in Java for Distributed Spam Filtering", SPRINGER-VERLAG BERLIN HEIDELBERG, June 2003 (2003-06-01), pages 616 - 625, XP008119296 *

Also Published As

Publication number Publication date
US7574409B2 (en) 2009-08-11
WO2006052618A2 (en) 2006-05-18
CA2590476A1 (en) 2006-05-18
US8010466B2 (en) 2011-08-30
EP1825386A2 (en) 2007-08-29
US20100017487A1 (en) 2010-01-21
US20060095521A1 (en) 2006-05-04
JP2008529105A (en) 2008-07-31
EP1825386A4 (en) 2009-07-29

Similar Documents

Publication Publication Date Title
WO2006052618A3 (en) A method, apparatus, and system for clustering and classification
WO2007038389A3 (en) Method and apparatus for identifying and classifying network documents as spam
SG155922A1 (en) Apparatus and method for analysing a video broadcast
EP2169571A3 (en) Methods and systems for managing data
WO2009098468A3 (en) A method and system of indexing numerical data
TW200742442A (en) Scalability techniques based on content information
WO2007143614A3 (en) Techniques to associate media information with related information
WO2008144964A8 (en) Detecting name entities and new words
WO2004089397A3 (en) Method and apparatus for grouping content items
CA2682585A1 (en) Open profile content identification
WO2007059232A3 (en) Methods and apparatus for probe-based clustering
DE602005021581D1 (en) Method and device for classifying image pages by means of summaries
JP2005525600A (en) Embedding and extracting watermarks
WO2008085637A3 (en) Clustered search processing
WO2017160654A3 (en) Systems, methods, and computer readable media for extracting data from portable document format (pdf) files
WO2007059216A3 (en) Methods and apparatus for rank-based response set clustering
EP1587009A3 (en) Content propagation for enhanced document retrieval
UA92368C2 (en) Scalability method basing of on content information
WO2008152805A1 (en) Image recognizing apparatus and image recognizing method
WO2008100849A3 (en) Semantics-based method and system for document analysis
WO2008057474A3 (en) Methods and systems for analyzing data in media material having a layout
EP2375347A3 (en) Systems and methods for classifying and transferring information in a storage network
WO2007143223A3 (en) System and method for entity based information categorization
WO2008049023A9 (en) Method and system for offline indexing of content and classifying stored data
WO2009124200A3 (en) Ink tags in a smart pen computing system

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KM KN KP KR KZ LC LK LR LS LT LU LV LY MA MD MG MK MN MW MX MZ NA NG NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SM SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): BW GH GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LT LU LV MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 2590476

Country of ref document: CA

NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 2007540011

Country of ref document: JP

WWE Wipo information: entry into national phase

Ref document number: 2005824443

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 2136/CHENP/2007

Country of ref document: IN

WWP Wipo information: published in national office

Ref document number: 2005824443

Country of ref document: EP