USRE41080E1 - Voice activated/voice responsive item locater - Google Patents
- Publication number
- USRE41080E1 (application US11/592,316)
- Authority
- US
- United States
- Prior art keywords
- user
- voice
- location
- processor
- feedback
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime, expires
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/08—Logistics, e.g. warehousing, loading or distribution; Inventory or stock management
- G06Q10/087—Inventory or stock management, e.g. order filling, procurement or balancing against orders
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/06—Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
Definitions
- the present invention relates to voice activated/voice responsive item locators, i.e. item directories which direct a user such as a consumer or shopper, to a specific location to view, treat, retrieve, order, purchase or otherwise use the information obtained in the system.
- the present invention could be used at retail stores to locate items to be purchased. Alternatively, it could be used at a production facility or distribution facility having a large number of parts, to locate specific parts for an employee. In other embodiments, it could be used in non-commercial entities, such as a public library to locate a particular book.
- the locator of the present invention relies upon a specific software module to accomplish voice recognition and response, and includes manager programming for customization, updates and modifications.
- U.S. Pat. No. 5,111,501 to Masanobu Shimanuki describes a telephone terminal device equipped with a transmitter microphone, a receiver, a speech recognition unit that receives and recognizes speech signals from the transmitter microphone, and a circuit that reduces the level of signals sent from a telephone network to the receiver when the speech recognition unit receives speech signals from the transmitter microphone.
- this device is preferably equipped with a speech reproduction unit that reproduces the speech information stored in a memory, in response to the information of recognition result from the speech recognition unit, and a circuit that prevents transmission of signals from the telephone network to the receiver when the regenerated speech information is sent to the receiver.
- it is desirable for this device to be provided with a circuit that prevents generation of ringing tones when an incoming call arrives.
- U.S. Pat. No. 5,136,634 to David C. Rae et al. describes a voice operated facsimile machine network, including a method and apparatus for transmitting specifically requested graphic and/or textual data from an unattended database storage location to a requestor's facsimile machine over a telephone line. The apparatus includes a host computer, such as a PC, modified with a facsimile transmission board and a voice generation board.
- the host computer receives incoming phone calls and prompts the caller using the voice board to select data files by using the DTMF keys of a standard telephone handset.
- the PC can be left unattended and can run automatically in the facsimile transmission mode. Callers can immediately access needed textual and image data with the use of just a standard telephone and facsimile machine.
- Multiple workstation nodes can be configured in a network setup to handle a high volume of calls in real time and to allow multiple data services to operate simultaneously.
- U.S. Pat. No. 5,165,095 to Mark A. Borcherding describes a method for dialing a telephone, using voice recognition to initiate the dialing and to determine the correct telephone number.
- the dialing is initiated with a spoken dial command that is recognized by using speaker independent templates that are stored locally with respect to the caller's telephone.
- the correct telephone number is recognized by using speaker dependent templates that are downloaded from a central database or by using speaker independent templates stored locally.
- U.S. Pat. No. 5,168,548 to Steven Kaufman et al. describes a reporting system in which a speech recognizer is used to select selections of text from a report form stored in a computer and to insert recognized terms in the text, thereby generating a report text under voice control.
- a command interpreter, also responsive to spoken words, initiates creation of the report text and its subsequent storing, printing and transmission.
- the command processor is responsive to respective spoken commands to select a destination telephone number and to cause the report text to be sent to apparatus for converting report text to image data and for modulating an audio band signal with the image data for facsimile transmission over telephone lines.
- U.S. Pat. No. 5,222,121 to Keiko Shimada describes a voice recognition dialing unit of a telephone mounted on a vehicle or similar mobile body and which allows a call to be originated with ease.
- When the user of the telephone enters a voice command on a voice inputting section, the dialing unit originates a call automatically and thereby connects the other party to the telephone line.
- the operations for call origination and the verifications are performed between the user and the unit in an interactive sequence.
- the unit has a particular call origination procedure in which, when the other party recognized by the unit is wrong as determined by the user by verification, lower place candidates for the other party are called up in response to a particular voice command.
- the unit indicates the other party by voicing a name for verification purpose.
- the alternative embodiment selects and stores only the name of the other party in response to an entered voice signal and, in the event of response for verification, combines the name having been stored and response information stored beforehand to produce composite response voice.
- U.S. Pat. No. 5,231,670 to Richard S. Goldbor et al. describes a system and method for generating text from a voice input that divides the processing of each speech event into a dictation event and a text event.
- Each dictation event handles the processing of data relating to the input into the system, and each text event deals with the generation of text from the inputted voice signals.
- the system and method creates a data structure for storing certain information relating to each individual event. Such data structures enable the system and method to process both simple spoken words as well as spoken commands and to provide the necessary text generation in response to the spoken words or to execute an appropriate function in response to a command.
- Speech recognition includes the ability to distinguish between dictation text and commands.
- U.S. Pat. No. 5,239,586 to Kuniyoshi Marui describes a voice recognition system which comprises a handset and a hands-free microphone for generating an input audio signal, a high-pass filter for eliminating low frequency components from the signal from the handset or hands-free microphone, a signal level controller for adjusting the level of the high-pass signal in response to the use of either the handset or hands-free microphone, a storer for storing the speech data and a controller for controlling the storer so that a user's utterance is stored or the user's utterance is recognized by comparing the utterance to speech data already stored.
- the handset hook switch provides an on-hook control signal to reduce amplifier gain during hands-free microphone operation.
- U.S. Pat. No. 5,301,227 to Shoichi Kamei et al. describes an automatic dial telephone that is usable in a motor vehicle. When a voice input is provided during a period in which input of the names of called parties is awaited, a voice pattern of the name of the called party is compared with reference patterns of called parties stored in a reference pattern storing device, to determine the degree of similarity therebetween.
- the names of the called parties are output to a user in the order of decreasing degree of similarity.
- a command word for confirmation is awaited from a user for a predetermined time period.
- When a voice confirmation command is input and is recognized during this waiting period, a telephone number corresponding to the name of the called party is supplied to a channel. Consequently, the command word for confirmation may be input only if the name of the called party outputted is one desired by the user.
- Sensors continually monitor the driving condition of the motor vehicle in which the telephone is installed. When the operation of the steering wheel or brakes of the motor vehicle exceeds a predetermined threshold or the speed of the motor vehicle is excessive, the sensors generate safety signals that inhibit the operation of the telephone.
- U.S. Pat. No. 5,333,276 to E. Earle Thompson et al. describes a communication system which is provided with multiple purpose personal communication devices.
- Each communication device includes a touch-sensitive visual display to communicate text and graphic information to and from the user and for operating the communication device.
- Voice activation and voice control capabilities are included within communication devices to perform the same functions as the touch-sensitive visual display.
- the communication device includes a built-in modem, audio input and output, telephone jacks and wireless communication.
- a plurality of application modules are used with personal communication devices to perform a wide variety of communication functions such as information retrieval, on-line data base services, electronic and voice mail.
- Communication devices and application modules cooperate to allow integrating multiple functions such as real time communication, information storage and processing, specialized information services, and remote control of other equipment into an intuitively user friendly apparatus.
- the system includes both desktop and hand-held communication devices with the same full range of communication capabilities provided in each type of communication device.
- U.S. Pat. No. 5,349,636 to Roberto Irribarren describes a communication system for verbal telephonic communication which has a voice message system for storing and retrieving voice messages integrated with a computer database accessing system for storing and retrieving text messages from a separate computer system and for converting the text messages into voice.
- the systems are integrated via a network which coordinates the functions of each individual system.
- the input/output ports of the voice message system and the computer database accessing system are connected in a parallel fashion to at least one telephone line.
- a user may access both voice messages and database information, including text or electronic mail messages, with a single telephone call.
- facsimile messages can be stored, retrieved and manipulated with a single telephone call.
- U.S. Pat. No. 5,406,618 to Stephen B. Knuth et al. describes a telephone answering device that is activated by a proximity sensor when a user crosses its field of detection and whose operation is controlled by simple voice commands.
- the device incorporates speaker-independent voice recognition circuitry to respond to spoken commands of the user that are elicited by a system generated voice request menu.
- the telephone answering device performs all the basic functions of a telephone answering machine in response to these simple commands and there is no need for the user to manually operate the telephone answering device.
- U.S. Pat. No. 5,602,963 to W. Michael Bissonnette et al. describes a small, portable, hand-held electronic personal organizer which performs voice recognition on words spoken by a user to input data into the organizer and records voice messages from the user.
- the spoken words and the voice messages are input via a microphone.
- the voice messages are compressed before being converted into digital signals for storage.
- the stored digital voice messages are reconverted into analog signals and then expanded for reproduction using a speaker.
- the organizer is capable of a number of different functions, including voice training, memo record, reminder, manual reminder, timer setting, message review, waiting message, calendar, phone group select, number retrieval, add phone number, security and “no” logic.
- data is principally entered by voice and occasionally through use of a limited keypad, and voice recordings are made and played back as appropriate.
- a visual display provides feedback to the user. During the various functions, the user can edit various different data within the organizer by eliminating or correcting such data or entering new data.
- U.S. Pat. No. 5,621,658 to Brion K. Jackson describes an action contained within an electronic mail object which is communicated from a data processing system to another data processing system via an audio device.
- the action is executable on a data processing system.
- the action is converted to a predetermined audio pattern.
- the electronic mail object may contain text in addition to an action.
- the text is also converted to an audio pattern.
- the audio patterns are then communicated to the audio device over telephone lines or other communication medium.
- the audio device records the object.
- a user can provide the recorded object to a data processing system, which then executes the action and converts the text audio patterns back to text.
- the action can be converted to text and displayed on the data processing system.
- U.S. Pat. No. 5,631,745 to John J. Wong et al. describes a telephone terminal adapted for business or home use that includes the ability to receive and send facsimiles, a voice answering function and a computer modem.
- Various input and output devices may be used for the facsimile function.
- a voice annotated facsimile may be sent and received.
- When the facsimile is viewed on a video monitor or ordinary television set, an accompanying voice message is heard through the sound system of the monitor or television set.
- the terminal has an architecture including a central processor and an internal bus structure to which several types of memory, various input-output devices and an interface with the telephone line are connected, among others.
- Audio Random Access Memory (ARAM) is used for storing both facsimile data and voice data.
- U.S. Pat. No. 5,671,328 to Gregory P. Fitzpatrick et al. describes a method and data processing system which are disclosed for automatically creating voice processing template entities.
- the invention automatically assembles a plurality of commands received by the data processing system, at least one of said commands having a voice recognition criteria component associated therewith, counts the occurrences of the plurality of commands, assembles voice recognition criteria components associated with the plurality of commands, and, as a result of the occurrence count exceeding a predefined minimum, constructs a voice recognition template entry by associating the assembled voice recognition criteria components with the assembled plurality of commands.
- U.S. Pat. No. 5,850,627 to Joel M. Gould et al. describes a word recognition system which can: respond to the input of a character string from a user by limiting the words it will recognize to words having a related, but not necessarily the same, string; score signals generated after a user has been prompted to generate a given word against words other than the prompted word to determine if the signal should be used to train the prompted words; vary the number of signals a user is prompted to generate to train a given word as a function of how well the training signals score against each other or prior models for the prompted word; create a new acoustic model of a phrase by concatenating prior acoustic models of the words in the phrase; obtain information from another program running on the same computer, such as its commands or the context of text being entered into it, and use that information to vary which words it can recognize; determine which program unit, such as an application program or dialog box, currently has input focus on its computer and create a vocabulary state associated with that program
- a voice activated/voice responsive item locator system is disclosed to enable a user to speak into the system and have the system respond with location information for an item requested by the user.
- For example, a shopper at a home supply store may pick up a locator phone or just speak into a wall mounted or otherwise situated microphone and say “Locate Outdoor Paint” or “Find Hammers” or simply state what is sought without the use of a verb, e.g. “Caulking”.
- the system may reply either with voice or visual (words on a screen, or map), or both voice and visual, e.g. “Aisle 3, Shelf 4”. In some instances the system will reply, for example, with a “Repeat”, or “Restate in different words” or “Please talk to information desk” or other default instructions.
- the locator system may be a stand alone device, but in most embodiments would be part of an internal connected system. It could be an intranet or secured internet system, but would in many cases be a storewide system with a plurality of user locations (units, phones, or microphones, with feedback at each location).
- the system will include an embedded voice-driven interface for speech control of: (1) operational instructions; (2) core system locator function operations, that is, recognition of specific requests and responses thereto; and, (3) optional and default functions.
- the present invention device is both operated by speech (speech or voice activated) and speech responsive (voice answers and instructions to the user from the system).
- the present invention device relies upon automatic speech recognition (ASR), either in place of or in addition to manual locator systems, e.g. books, list, map and computer directories.
- user feedback features are included wherein both audio and visual feedback is given to a user in response to recognizable voice signals.
- FIG. 1 shows a general schematic diagram showing software and functional features of a present invention item locator system.
- FIG. 2 shows a schematic diagram illustrating the physical functions of a present invention voice recognition item locator device.
- FIG. 3 shows a schematic diagram of a present invention device illustrating details of a voice recognition submodule.
- the present invention is a voice activated/voice responsive item locator and system.
- By “item” is meant a place or thing that a user desires to locate.
- An item could be a particular brand of canned string beans, a type of outdoor stain, a booth at a convention, a particular part in inventory for sale, assemblage or distribution, a particular automobile in a production facility lot or in a large parking garage, or a room, a functional group or a person in an office building or the like.
- the response may be in the form of a word or sentence presented visually or audibly and it may designate an aisle, a shelf, a bin number, a room number, a row and slot or space, etc.
- the voice recognition system digitizes words spoken via a receiver (microphone) handset, headset, or built-in microphone for conversion from analog to digital utilizing a continuous speech recognition digital signal processor (DSP).
- the main support structure may be a conventional type housing for phones and other communication devices, may be of a different shape or configuration or may be built into a device such as a wall or desk unit, with or without monitor. They could be portable or permanently affixed and could be powered by any means available, e.g. AC or DC current.
- the system would be wireless for the user and would, in that respect, operate like a cell phone, two way radio, “walkie talkie” or other short distance wireless device, but would have a processor at a central or fixed location having the same features as described above, i.e., the DSP with programming capabilities, etc.
- the DSP is connected to a programmable microprocessor and, either by customized input or a standard program, the system enables the user to quickly enter voice-activated fields, e.g., “Where is . . . ”, “Find . . . ”, etc.
- Verification of voice recognition accuracy is optional and may be accomplished via synthesized voice playback and/or a screen confirmation which requires a “YES” or “NO” to execute or open for revision.
- a screen, e.g., LCD, enables visual feedback during input phase, with support for deletion, insertion, correction, etc. Cancellation of the entire command or programming instructions may be possible at any time (prior to execution), via keystroke or voice command.
- the essential features of the present invention involve the creation of a voice based guide or locator to offer enhanced convenience and speed to users for location of one or more items.
- FIG. 1 shows a general schematic diagram of a present invention system showing general software features and functional features.
- the present invention device includes a central processor 1 which may be an external or internal component, i.e., within a single unit or at a separate location from audio receivers and transmitters, e.g., microphones/speakers for user inputs and feedback to users.
- the system may be preprogrammed with the user being required to follow concise instructions for activation and operation, or may be programmable to alter, add or enhance ease or methods of use, e.g. through a limited access code, for manager inputs 3 of user instructions.
- manager inputs 3 shall include functional selections and inputs of items and their locations, with provision for subsequent access for modification.
- This programming may include direct keyboard, voice, etc., and, as mentioned, may include security capabilities for preventing unauthorized use, e.g., voice identification (user recognition) or user security code system, as well as other options which may be included therein, such as a “help” detailed manager instruction section.
- the user operation unit(s) 5 provide functional access, which may be passive, i.e., the user speaks, picks up a phone, presses a button, or otherwise takes some action to activate the system; or it may be active, i.e., a proximity sensor, a periodicity timer, or other internal mechanism may automatically activate the system and could trigger an audio or visual query, such as “May I help you locate a product?”
- recognition/non-recognition response 7 results from processing the user inputs to central processor 1 , and audio and/or video response unit(s) 9 provide feedback 11 to the user, either by answering the inquiry, conditionally defaulting, e.g., asking the user to repeat or restate the question, or fully defaulting, e.g., directing the user to a courtesy desk or check out counter for help.
- FIG. 2 shows a schematic diagram illustrating a present invention voice activated/voice responsive item locator system, showing the physical arrangement and function of components.
- symbol 17 indicates an optional user prompter proximity sensor
- symbol 21 is a microphone or equivalent component for voice input.
- the voice input is sent to audio controller 19 and to automatic speech recognition unit 23 and is converted from analog to digital signals.
- the speech recognition unit 23 communicates with a continuous speech signal recognizer 41 and a continuous speech signal interpreter 43 .
- CPU/Memory 25 compares the digital signal to the set up or dictionary of digital words or phrases in memory.
- the system processor 27 and data storage 31 operate to respond with an answer or a default instruction or a query by providing digital text to text-to-speech generator 29 , which provides audio feedback to a user via audio controller 19 and speaker 33 . Feedback to a user may also be provided on visual screen 37 via display controller 35 .
- Keypad 39 is used for manager set up and modifications.
- FIG. 3 shows the details of one preferred embodiment of the submodule used in the present invention device.
- the voice recognition component converts an acoustic signal into a subsequence of labels.
- the system takes the raw acoustic data, and processes it through the recognizer.
- the recognizer matches it against a set of models using a decoder that generates a recognition token.
- This token represents what the user said as either a single word or utterance.
- the recognizer itself does not interpret the meaning of the recognized output; that is the function of the interpreter (described later).
- the recognizer uses Hidden Markov Models (HMMs) to provide for a continuous speech recognition engine. HMMs do not process the acoustic signal directly but instead split the signal into a sequence of discrete observations.
- Each acoustic model represents a short sound.
- the interpreter combines these sounds into words using a dictionary.
- This dictionary specifies the pronunciation of each word in terms of the acoustic models.
- the interpreter then joins sets of models together (using a Viterbi decoder) in a series of pre-defined connections such that paths can be established to provide for a degree of “natural language” recognition; in other words, the user can say “Find hammers”, “Where are hammers” or “hammers” and they are all understood to mean the same thing.
- these sets of models and dictionaries are interchangeable, allowing the same voice recognition component to be used in a variety of applications.
- As the voice recognition component is running continuously, there needs to be a way to distinguish background conversations that might accidentally trigger an unwanted action by the device. For example, two people standing by a voice-activated device might be discussing locations of different goods in a supermarket and be misinterpreted or undesirably responded to. To avoid this problem, the recognition unit requires a command word to trigger before beginning further recognition.
- the trigger word is a user-definable setting.
- initialization 51 initiates monitoring 53 for a trigger word from a user.
- When a word is received, it is analyzed to determine whether or not a trigger word 55 has been received. If not, signal 57 returns the status to monitoring 53 for a new word. This loop continues until a trigger word is recognized and an inactivity timer 59 is started.
- the monitor 61 proceeds with the monitoring for the next word and waits for timer pop 65 .
- timer pop 65 returns to the monitor 53 to continue the monitoring process and the voice data is sent to interpretation 67 .
- an action 75 if process and feedback function 77 is performed.
- signal 79 prompts user 71 .
- timer 59 begins again.
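The trigger-word loop described for FIG. 3 can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the `Locator` class, the `interpret` helper, the `LOCATIONS` table, the trigger word "locator", the five-second timeout and the retry limit are all hypothetical. Speech recognition itself is outside the sketch; each call to `hear` receives an already recognized utterance as text. Returning to monitoring after a successful answer is also an assumption, since the description does not state what happens after feedback is given.

```python
import re
from dataclasses import dataclass
from typing import Optional

# Hypothetical item-to-location table; in the described system this data
# would come from manager inputs.
LOCATIONS = {
    "hammers": "Aisle 3, Shelf 4",
    "outdoor paint": "Aisle 7, Shelf 2",
    "caulking": "Aisle 5, Shelf 1",
}

# Optional leading phrases that are all treated as "locate this item".
_PREFIX = re.compile(r"^(locate|find|where (is|are))\s+", re.IGNORECASE)


def interpret(utterance: str) -> Optional[str]:
    """Map a recognized utterance to a location, or None if not understood."""
    item = _PREFIX.sub("", utterance.strip()).lower()
    return LOCATIONS.get(item)


@dataclass
class Locator:
    trigger_word: str = "locator"     # user-definable setting (assumed value)
    timeout_s: float = 5.0            # inactivity timer length (assumed value)
    max_retries: int = 2              # "Repeat" prompts before a full default
    armed_at: Optional[float] = None  # time the inactivity timer last started
    misses: int = 0

    def hear(self, utterance: str, now: float) -> Optional[str]:
        """Process one utterance heard at time `now`; return feedback or None."""
        # Timer pop: too long since the timer started, so resume monitoring.
        if self.armed_at is not None and now - self.armed_at > self.timeout_s:
            self.armed_at = None
            self.misses = 0
        if self.armed_at is None:
            # Monitoring: ignore everything except the trigger word.
            if utterance.strip().lower() == self.trigger_word:
                self.armed_at = now
            return None
        location = interpret(utterance)
        if location is not None:
            # Understood: give feedback and return to monitoring (assumption).
            self.armed_at = None
            self.misses = 0
            return location
        self.misses += 1
        if self.misses > self.max_retries:
            # Full default: send the user to a person for help.
            self.armed_at = None
            self.misses = 0
            return "Please talk to information desk"
        # Conditional default: prompt the user and restart the timer.
        self.armed_at = now
        return "Repeat"
```

With this sketch, "Find hammers" spoken before the trigger word is ignored, while "Where are hammers" or just "hammers" spoken after it both return the same location string. Passing the clock in as `now` keeps the timer logic easy to exercise without waiting in real time.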
Abstract
Description
Claims (31)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US11/592,316 USRE41080E1 (en) | 2000-08-31 | 2006-11-02 | Voice activated/voice responsive item locater |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/653,658 US6813341B1 (en) | 2000-08-31 | 2000-08-31 | Voice activated/voice responsive item locator |
US11/592,316 USRE41080E1 (en) | 2000-08-31 | 2006-11-02 | Voice activated/voice responsive item locater |
Related Parent Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/653,658 Reissue US6813341B1 (en) | 2000-08-31 | 2000-08-31 | Voice activated/voice responsive item locator |
Publications (1)
Publication Number | Publication Date |
---|---|
USRE41080E1 (en) | 2010-01-19 |
Family
ID=33159998
Family Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/653,658 Ceased US6813341B1 (en) | 2000-08-31 | 2000-08-31 | Voice activated/voice responsive item locator |
US11/592,316 Expired - Lifetime USRE41080E1 (en) | 2000-08-31 | 2006-11-02 | Voice activated/voice responsive item locater |
Family Applications Before (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/653,658 Ceased US6813341B1 (en) | 2000-08-31 | 2000-08-31 | Voice activated/voice responsive item locator |
Country Status (1)
Country | Link |
---|---|
US (2) | US6813341B1 (en) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20100187306A1 (en) * | 2005-08-01 | 2010-07-29 | Worthwhile Products | Inventory control system |
US20110153614A1 (en) * | 2005-08-01 | 2011-06-23 | Worthwhile Products | Inventory control system process |
US20120245934A1 (en) * | 2011-03-25 | 2012-09-27 | General Motors Llc | Speech recognition dependent on text message content |
US8823491B2 (en) | 2012-01-12 | 2014-09-02 | International Business Machines Corporation | Security-enhanced radio frequency object locator system, method and program storage device |
Families Citing this family (38)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7292678B2 (en) * | 2000-08-31 | 2007-11-06 | Lamson Holdings Llc | Voice activated, voice responsive product locator system, including product location method utilizing product bar code and aisle-situated, aisle-identifying bar code |
US7398209B2 (en) | 2002-06-03 | 2008-07-08 | Voicebox Technologies, Inc. | Systems and methods for responding to natural language speech utterance |
US7693720B2 (en) * | 2002-07-15 | 2010-04-06 | Voicebox Technologies, Inc. | Mobile systems and methods for responding to natural language speech utterance |
US20040199426A1 (en) * | 2003-04-04 | 2004-10-07 | International Business Machines Corporation | Enhanced customer service apparatus, method, and system |
US6975709B2 (en) * | 2003-07-08 | 2005-12-13 | Telcordia Technologies, Inc. | Triggered playback of recorded messages to incoming telephone calls to a cellular phone |
US20050256720A1 (en) * | 2004-05-12 | 2005-11-17 | Iorio Laura M | Voice-activated audio/visual locator with voice recognition |
US7742923B2 (en) * | 2004-09-24 | 2010-06-22 | Microsoft Corporation | Graphic user interface schemes for supporting speech recognition input systems |
US7640160B2 (en) | 2005-08-05 | 2009-12-29 | Voicebox Technologies, Inc. | Systems and methods for responding to natural language speech utterance |
US7620549B2 (en) | 2005-08-10 | 2009-11-17 | Voicebox Technologies, Inc. | System and method of supporting adaptive misrecognition in conversational speech |
US7949529B2 (en) | 2005-08-29 | 2011-05-24 | Voicebox Technologies, Inc. | Mobile systems and methods of supporting natural language human-machine interactions |
US8832064B2 (en) * | 2005-11-30 | 2014-09-09 | At&T Intellectual Property Ii, L.P. | Answer determination for natural language questioning |
US7388494B2 (en) * | 2005-12-20 | 2008-06-17 | Pitney Bowes Inc. | RFID systems and methods for probabalistic location determination |
US20080087725A1 (en) * | 2006-10-11 | 2008-04-17 | Qing Liu | Fixture based Item Locator System |
US8073681B2 (en) | 2006-10-16 | 2011-12-06 | Voicebox Technologies, Inc. | System and method for a cooperative conversational voice user interface |
US20080170676A1 (en) * | 2007-01-17 | 2008-07-17 | Sony Corporation | Voice recognition advertisements |
US7818176B2 (en) | 2007-02-06 | 2010-10-19 | Voicebox Technologies, Inc. | System and method for selecting and presenting advertisements based on natural language processing of voice-based input |
US8140335B2 (en) | 2007-12-11 | 2012-03-20 | Voicebox Technologies, Inc. | System and method for providing a natural language voice user interface in an integrated voice navigation services environment |
US8584939B2 (en) * | 2008-05-09 | 2013-11-19 | Lutron Electronics Co., Inc. | Merchandise display systems for lighting control devices |
US8589161B2 (en) | 2008-05-27 | 2013-11-19 | Voicebox Technologies, Inc. | System and method for an integrated, multi-modal, multi-device natural language voice services environment |
US9305548B2 (en) | 2008-05-27 | 2016-04-05 | Voicebox Technologies Corporation | System and method for an integrated, multi-modal, multi-device natural language voice services environment |
US20090304161A1 (en) * | 2008-06-05 | 2009-12-10 | Nathan Marshall Pettyjohn | system and method utilizing voice search to locate a product in stores from a phone |
US9147212B2 (en) * | 2008-06-05 | 2015-09-29 | Aisle411, Inc. | Locating products in stores using voice search from a communication device |
US8326637B2 (en) | 2009-02-20 | 2012-12-04 | Voicebox Technologies, Inc. | System and method for processing multi-modal device interactions in a natural language voice services environment |
US9171541B2 (en) | 2009-11-10 | 2015-10-27 | Voicebox Technologies Corporation | System and method for hybrid processing in a natural language voice services environment |
US8880397B2 (en) * | 2011-10-21 | 2014-11-04 | Wal-Mart Stores, Inc. | Systems, devices and methods for list display and management |
US9047857B1 (en) * | 2012-12-19 | 2015-06-02 | Rawles Llc | Voice commands for transitioning between device states |
EP2887348B1 (en) * | 2013-12-18 | 2022-05-04 | Harman International Industries, Incorporated | Voice recognition query response system |
WO2016044290A1 (en) | 2014-09-16 | 2016-03-24 | Kennewick Michael R | Voice commerce |
US9898459B2 (en) | 2014-09-16 | 2018-02-20 | Voicebox Technologies Corporation | Integration of domain information into state transitions of a finite state transducer for natural language processing |
EP3207467A4 (en) | 2014-10-15 | 2018-05-23 | VoiceBox Technologies Corporation | System and method for providing follow-up responses to prior natural language inputs of a user |
US10431214B2 (en) | 2014-11-26 | 2019-10-01 | Voicebox Technologies Corporation | System and method of determining a domain and/or an action related to a natural language input |
US10614799B2 (en) | 2014-11-26 | 2020-04-07 | Voicebox Technologies Corporation | System and method of providing intent predictions for an utterance prior to a system detection of an end of the utterance |
US10853761B1 (en) | 2016-06-24 | 2020-12-01 | Amazon Technologies, Inc. | Speech-based inventory management system and method |
US11315071B1 (en) * | 2016-06-24 | 2022-04-26 | Amazon Technologies, Inc. | Speech-based storage tracking |
US10331784B2 (en) | 2016-07-29 | 2019-06-25 | Voicebox Technologies Corporation | System and method of disambiguating natural language processing requests |
US10235353B1 (en) * | 2017-09-15 | 2019-03-19 | Dell Products Lp | Natural language translation interface for networked devices |
US10515640B2 (en) * | 2017-11-08 | 2019-12-24 | Intel Corporation | Generating dialogue based on verification scores |
CN117765926A (en) * | 2024-02-19 | 2024-03-26 | 上海蜜度科技股份有限公司 | Speech synthesis method, system, electronic equipment and medium |
Citations (41)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4783803A (en) | 1985-11-12 | 1988-11-08 | Dragon Systems, Inc. | Speech recognition apparatus and method |
US5111501A (en) | 1989-02-09 | 1992-05-05 | Kabushiki Kaisha Toshiba | Telephone terminal device having speech recognition unit |
US5136634A (en) | 1989-03-10 | 1992-08-04 | Spectrafax Corp. | Voice operated facsimile machine network |
US5165095A (en) | 1990-09-28 | 1992-11-17 | Texas Instruments Incorporated | Voice telephone dialing |
US5168548A (en) | 1990-05-17 | 1992-12-01 | Kurzweil Applied Intelligence, Inc. | Integrated voice controlled report generating and communicating system |
US5222121A (en) | 1989-06-19 | 1993-06-22 | Nec Corporation | Voice recognition dialing unit |
US5231670A (en) | 1987-06-01 | 1993-07-27 | Kurzweil Applied Intelligence, Inc. | Voice controlled system and method for generating text from a voice controlled input |
US5239586A (en) | 1987-05-29 | 1993-08-24 | Kabushiki Kaisha Toshiba | Voice recognition system used in telephone apparatus |
US5301227A (en) | 1989-04-17 | 1994-04-05 | Sanyo Electic Co., Ltd. | Automatic dial telephone |
US5335276A (en) | 1992-12-16 | 1994-08-02 | Texas Instruments Incorporated | Communication system and methods for enhanced information transfer |
US5349636A (en) | 1991-10-28 | 1994-09-20 | Centigram Communications Corporation | Interface system and method for interconnecting a voice message system and an interactive voice response system |
US5390278A (en) | 1991-10-08 | 1995-02-14 | Bell Canada | Phoneme based speech recognition |
US5406618A (en) | 1992-10-05 | 1995-04-11 | Phonemate, Inc. | Voice activated, handsfree telephone answering device |
US5426284A (en) | 1990-12-12 | 1995-06-20 | Engineered Data Products, Inc. | Apparatus for locating and tracking information storage items using predefined labels |
US5602963A (en) | 1993-10-12 | 1997-02-11 | Voice Powered Technology International, Inc. | Voice activated personal organizer |
US5621658A (en) | 1993-07-13 | 1997-04-15 | International Business Machines Corporation | Method and apparatus for communicating an electronic action from a data processing system to another data processing system via an audio device |
US5631745A (en) | 1992-05-14 | 1997-05-20 | Current Logic | Multi-function telecommunications instrument |
US5671328A (en) | 1992-12-30 | 1997-09-23 | International Business Machines Corporation | Method and apparatus for automatic creation of a voice recognition template entry |
US5786764A (en) | 1995-06-07 | 1998-07-28 | Engellenner; Thomas J. | Voice activated electronic locating systems |
US5832063A (en) | 1996-02-29 | 1998-11-03 | Nynex Science & Technology, Inc. | Methods and apparatus for performing speaker independent recognition of commands in parallel with speaker dependent recognition of names, words or phrases |
US5850627A (en) | 1992-11-13 | 1998-12-15 | Dragon Systems, Inc. | Apparatuses and methods for training and operating speech recognition systems |
US5884221A (en) | 1991-01-17 | 1999-03-16 | Highwaymaster Communications, Inc. | Vehicle locating and communicating method and apparatus |
US5899973A (en) | 1995-11-04 | 1999-05-04 | International Business Machines Corporation | Method and apparatus for adapting the language model's size in a speech recognition system |
US5930336A (en) | 1996-09-30 | 1999-07-27 | Matsushita Electric Industrial Co., Ltd. | Voice dialing server for branch exchange telephone systems |
US5991712A (en) | 1996-12-05 | 1999-11-23 | Sun Microsystems, Inc. | Method, apparatus, and product for automatic generation of lexical features for speech recognition systems |
EP1003119A2 (en) | 1998-11-19 | 2000-05-24 | Ncr International Inc. | System and methods for mapping and conveying product location |
US6092045A (en) | 1997-09-19 | 2000-07-18 | Nortel Networks Corporation | Method and apparatus for speech recognition |
US6123259A (en) | 1998-04-30 | 2000-09-26 | Fujitsu Limited | Electronic shopping system including customer relocation recognition |
US6148291A (en) | 1998-01-26 | 2000-11-14 | K & T Of Lorain, Ltd. | Container and inventory monitoring methods and systems |
US6157705A (en) | 1997-12-05 | 2000-12-05 | E*Trade Group, Inc. | Voice control of a server |
US6236715B1 (en) | 1997-04-15 | 2001-05-22 | Nortel Networks Corporation | Method and apparatus for using the control channel in telecommunications systems for voice dialing |
US6260012B1 (en) | 1998-02-27 | 2001-07-10 | Samsung Electronics Co., Ltd | Mobile phone having speaker dependent voice recognition method and apparatus |
US6394278B1 (en) | 2000-03-03 | 2002-05-28 | Sort-It, Incorporated | Wireless system and method for sorting letters, parcels and other items |
US6408307B1 (en) | 1995-01-11 | 2002-06-18 | Civix-Ddi, Llc | System and methods for remotely accessing a selected group of items of interest from a database |
US6462616B1 (en) | 1998-09-24 | 2002-10-08 | Ericsson Inc. | Embedded phonetic support and TTS play button in a contacts database |
US6507352B1 (en) | 1998-12-23 | 2003-01-14 | Ncr Corporation | Apparatus and method for displaying a menu with an interactive retail terminal |
US6529940B1 (en) * | 1998-05-28 | 2003-03-04 | David R. Humble | Method and system for in-store marketing |
US6547141B1 (en) | 2001-10-10 | 2003-04-15 | Vernon D. Lepore | Inventory locating device |
US6598025B1 (en) | 2000-12-29 | 2003-07-22 | Ncr Corporation | Geospatial inventory control |
US6870464B2 (en) | 2000-04-04 | 2005-03-22 | Leading Information Technology Institute, Inc. | Inventory control system |
US7231380B1 (en) | 1999-10-09 | 2007-06-12 | Innovaport Llc | Apparatus and method for providing products location information to customers in a store |
- 2000-08-31: US09/653,658, published as US6813341B1 (en), status: Ceased
- 2006-11-02: US11/592,316, published as USRE41080E1 (en), status: Expired - Lifetime
Patent Citations (42)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4783803A (en) | 1985-11-12 | 1988-11-08 | Dragon Systems, Inc. | Speech recognition apparatus and method |
US5239586A (en) | 1987-05-29 | 1993-08-24 | Kabushiki Kaisha Toshiba | Voice recognition system used in telephone apparatus |
US5231670A (en) | 1987-06-01 | 1993-07-27 | Kurzweil Applied Intelligence, Inc. | Voice controlled system and method for generating text from a voice controlled input |
US5111501A (en) | 1989-02-09 | 1992-05-05 | Kabushiki Kaisha Toshiba | Telephone terminal device having speech recognition unit |
US5136634A (en) | 1989-03-10 | 1992-08-04 | Spectrafax Corp. | Voice operated facsimile machine network |
US5301227A (en) | 1989-04-17 | 1994-04-05 | Sanyo Electic Co., Ltd. | Automatic dial telephone |
US5222121A (en) | 1989-06-19 | 1993-06-22 | Nec Corporation | Voice recognition dialing unit |
US5168548A (en) | 1990-05-17 | 1992-12-01 | Kurzweil Applied Intelligence, Inc. | Integrated voice controlled report generating and communicating system |
US5165095A (en) | 1990-09-28 | 1992-11-17 | Texas Instruments Incorporated | Voice telephone dialing |
US5426284A (en) | 1990-12-12 | 1995-06-20 | Engineered Data Products, Inc. | Apparatus for locating and tracking information storage items using predefined labels |
US5884221A (en) | 1991-01-17 | 1999-03-16 | Highwaymaster Communications, Inc. | Vehicle locating and communicating method and apparatus |
US5390278A (en) | 1991-10-08 | 1995-02-14 | Bell Canada | Phoneme based speech recognition |
US5349636A (en) | 1991-10-28 | 1994-09-20 | Centigram Communications Corporation | Interface system and method for interconnecting a voice message system and an interactive voice response system |
US5631745A (en) | 1992-05-14 | 1997-05-20 | Current Logic | Multi-function telecommunications instrument |
US5406618A (en) | 1992-10-05 | 1995-04-11 | Phonemate, Inc. | Voice activated, handsfree telephone answering device |
US5850627A (en) | 1992-11-13 | 1998-12-15 | Dragon Systems, Inc. | Apparatuses and methods for training and operating speech recognition systems |
US5335276A (en) | 1992-12-16 | 1994-08-02 | Texas Instruments Incorporated | Communication system and methods for enhanced information transfer |
US5671328A (en) | 1992-12-30 | 1997-09-23 | International Business Machines Corporation | Method and apparatus for automatic creation of a voice recognition template entry |
US5621658A (en) | 1993-07-13 | 1997-04-15 | International Business Machines Corporation | Method and apparatus for communicating an electronic action from a data processing system to another data processing system via an audio device |
US5602963A (en) | 1993-10-12 | 1997-02-11 | Voice Powered Technology International, Inc. | Voice activated personal organizer |
US6408307B1 (en) | 1995-01-11 | 2002-06-18 | Civix-Ddi, Llc | System and methods for remotely accessing a selected group of items of interest from a database |
US5786764A (en) | 1995-06-07 | 1998-07-28 | Engellenner; Thomas J. | Voice activated electronic locating systems |
US5899973A (en) | 1995-11-04 | 1999-05-04 | International Business Machines Corporation | Method and apparatus for adapting the language model's size in a speech recognition system |
US5832063A (en) | 1996-02-29 | 1998-11-03 | Nynex Science & Technology, Inc. | Methods and apparatus for performing speaker independent recognition of commands in parallel with speaker dependent recognition of names, words or phrases |
US5930336A (en) | 1996-09-30 | 1999-07-27 | Matsushita Electric Industrial Co., Ltd. | Voice dialing server for branch exchange telephone systems |
US5991712A (en) | 1996-12-05 | 1999-11-23 | Sun Microsystems, Inc. | Method, apparatus, and product for automatic generation of lexical features for speech recognition systems |
US6236715B1 (en) | 1997-04-15 | 2001-05-22 | Nortel Networks Corporation | Method and apparatus for using the control channel in telecommunications systems for voice dialing |
US6092045A (en) | 1997-09-19 | 2000-07-18 | Nortel Networks Corporation | Method and apparatus for speech recognition |
US6157705A (en) | 1997-12-05 | 2000-12-05 | E*Trade Group, Inc. | Voice control of a server |
US6148291A (en) | 1998-01-26 | 2000-11-14 | K & T Of Lorain, Ltd. | Container and inventory monitoring methods and systems |
US6260012B1 (en) | 1998-02-27 | 2001-07-10 | Samsung Electronics Co., Ltd | Mobile phone having speaker dependent voice recognition method and apparatus |
US6123259A (en) | 1998-04-30 | 2000-09-26 | Fujitsu Limited | Electronic shopping system including customer relocation recognition |
US6529940B1 (en) * | 1998-05-28 | 2003-03-04 | David R. Humble | Method and system for in-store marketing |
US6462616B1 (en) | 1998-09-24 | 2002-10-08 | Ericsson Inc. | Embedded phonetic support and TTS play button in a contacts database |
EP1003119A2 (en) | 1998-11-19 | 2000-05-24 | Ncr International Inc. | System and methods for mapping and conveying product location |
US6442530B1 (en) | 1998-11-19 | 2002-08-27 | Ncr Corporation | Computer-based system and method for mapping and conveying product location |
US6507352B1 (en) | 1998-12-23 | 2003-01-14 | Ncr Corporation | Apparatus and method for displaying a menu with an interactive retail terminal |
US7231380B1 (en) | 1999-10-09 | 2007-06-12 | Innovaport Llc | Apparatus and method for providing products location information to customers in a store |
US6394278B1 (en) | 2000-03-03 | 2002-05-28 | Sort-It, Incorporated | Wireless system and method for sorting letters, parcels and other items |
US6870464B2 (en) | 2000-04-04 | 2005-03-22 | Leading Information Technology Institute, Inc. | Inventory control system |
US6598025B1 (en) | 2000-12-29 | 2003-07-22 | Ncr Corporation | Geospatial inventory control |
US6547141B1 (en) | 2001-10-10 | 2003-04-15 | Vernon D. Lepore | Inventory locating device |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20100187306A1 (en) * | 2005-08-01 | 2010-07-29 | Worthwhile Products | Inventory control system |
US20110153614A1 (en) * | 2005-08-01 | 2011-06-23 | Worthwhile Products | Inventory control system process |
US8374926B2 (en) | 2005-08-01 | 2013-02-12 | Worthwhile Products | Inventory control system |
US8577759B2 (en) | 2005-08-01 | 2013-11-05 | Worthwhile Products | Inventory control system process |
US20120245934A1 (en) * | 2011-03-25 | 2012-09-27 | General Motors Llc | Speech recognition dependent on text message content |
US9202465B2 (en) * | 2011-03-25 | 2015-12-01 | General Motors Llc | Speech recognition dependent on text message content |
US8823491B2 (en) | 2012-01-12 | 2014-09-02 | International Business Machines Corporation | Security-enhanced radio frequency object locator system, method and program storage device |
Also Published As
Publication number | Publication date |
---|---|
US6813341B1 (en) | 2004-11-02 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
USRE41080E1 (en) | Voice activated/voice responsive item locater | |
US7292678B2 (en) | Voice activated, voice responsive product locator system, including product location method utilizing product bar code and aisle-situated, aisle-identifying bar code | |
US7136465B2 (en) | Voice activated, voice responsive product locator system, including product location method utilizing product bar code and product-situated, location-identifying bar code | |
US7747342B2 (en) | Product location method utilizing product bar code and aisle-situated, aisle-identifying bar code | |
US6463413B1 (en) | Speech recognition training for small hardware devices | |
US8396710B2 (en) | Distributed voice user interface | |
US6940951B2 (en) | Telephone application programming interface-based, speech enabled automatic telephone dialer using names | |
US6584439B1 (en) | Method and apparatus for controlling voice controlled devices | |
US7203651B2 (en) | Voice control system with multiple voice recognition engines | |
US20020193989A1 (en) | Method and apparatus for identifying voice controlled devices | |
US20020142787A1 (en) | Method to select and send text messages with a mobile | |
US20050043948A1 (en) | Speech recognition method remote controller, information terminal, telephone communication terminal and speech recognizer | |
US20030093281A1 (en) | Method and apparatus for machine to machine communication using speech | |
US20040117188A1 (en) | Speech based personal information manager | |
US20060287854A1 (en) | Voice integration platform | |
US20080154601A1 (en) | Method and system for providing menu and other services for an information processing system using a telephone or other audio interface | |
US6563911B2 (en) | Speech enabled, automatic telephone dialer using names, including seamless interface with computer-based address book programs | |
KR20030044899A (en) | Method and apparatus for a voice controlled foreign language translation device | |
US6671354B2 (en) | Speech enabled, automatic telephone dialer using names, including seamless interface with computer-based address book programs, for telephones without private branch exchanges | |
GB2317782A (en) | Voice dialling server for branch exchange telephone systems | |
CN1381831A (en) | Phonetic recognition device independent unconnected with loudspeaker | |
US20010056345A1 (en) | Method and system for speech recognition of the alphabet | |
US20050092833A1 (en) | Product location method utilizing product bar code and product-situated, aisle-identifying bar code | |
KR20020020585A (en) | System and method for managing conversation -type interface with agent and media for storing program source thereof | |
KR20010000595A (en) | Mobile phone controlled by interactive speech and control method thereof | |
Legal Events
Code | Title | Description |
---|---|---|
AS | Assignment | Owner name: IVOICE.COM, INC., NEW JERSEY; Free format text: ASSIGNMENT OF ASSIGNORS INTEREST; ASSIGNOR: MAHONEY, JEROME R.; REEL/FRAME: 020546/0998; Effective date: 20000807. Owner name: LAMSON HOLDINGS LLC, NEVADA; Free format text: ASSIGNMENT OF ASSIGNORS INTEREST; ASSIGNOR: IVOICE, INC.; REEL/FRAME: 020547/0181; Effective date: 20060621. Owner name: IVOICE, INC., NEW JERSEY; Free format text: CHANGE OF NAME; ASSIGNOR: IVOICE.COM, INC.; REEL/FRAME: 020547/0087; Effective date: 20010824 |
FPAY | Fee payment | Year of fee payment: 8 |
AS | Assignment | Owner name: XYLON LLC, NEVADA; Free format text: MERGER; ASSIGNOR: LAMSON HOLDINGS LLC; REEL/FRAME: 036250/0764; Effective date: 20150623 |
FPAY | Fee payment | Year of fee payment: 12 |
AS | Assignment | Owner name: INTELLECTUAL VENTURES ASSETS 191 LLC, DELAWARE; Free format text: ASSIGNMENT OF ASSIGNORS INTEREST; ASSIGNOR: XYLON LLC; REEL/FRAME: 062708/0435; Effective date: 20221222 |
AS | Assignment | Owner name: INTELLECTUAL VENTURES ASSETS 186 LLC, DELAWARE; Free format text: SECURITY INTEREST; ASSIGNOR: MIND FUSION, LLC; REEL/FRAME: 063295/0001; Effective date: 20230214. Owner name: INTELLECTUAL VENTURES ASSETS 191 LLC, DELAWARE; Free format text: SECURITY INTEREST; ASSIGNOR: MIND FUSION, LLC; REEL/FRAME: 063295/0001; Effective date: 20230214 |
AS | Assignment | Owner name: MIND FUSION, LLC, WASHINGTON; Free format text: ASSIGNMENT OF ASSIGNORS INTEREST; ASSIGNOR: INTELLECTUAL VENTURES ASSETS 191 LLC; REEL/FRAME: 064270/0685; Effective date: 20230214 |
AS | Assignment | Owner name: THINKLOGIX, LLC, TEXAS; Free format text: ASSIGNMENT OF ASSIGNORS INTEREST; ASSIGNOR: MIND FUSION, LLC; REEL/FRAME: 064357/0554; Effective date: 20230715 |