US6216267B1 - Media capture and compression communication system using holographic optical classification, voice recognition and neural network decision processing - Google Patents

Media capture and compression communication system using holographic optical classification, voice recognition and neural network decision processing Download PDF

Info

Publication number
US6216267B1
US6216267B1 US09/360,512 US36051299A US6216267B1 US 6216267 B1 US6216267 B1 US 6216267B1 US 36051299 A US36051299 A US 36051299A US 6216267 B1 US6216267 B1 US 6216267B1
Authority
US
United States
Prior art keywords
signal
video
processing
audio
processing system
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
US09/360,512
Inventor
James P. Mitchell
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Rockwell Collins Inc
Original Assignee
Rockwell Collins Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Rockwell Collins Inc filed Critical Rockwell Collins Inc
Priority to US09/360,512 priority Critical patent/US6216267B1/en
Assigned to ROCKWELL COLLINS, INC. reassignment ROCKWELL COLLINS, INC. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: MITCHELL, JAMES P.
Application granted granted Critical
Publication of US6216267B1 publication Critical patent/US6216267B1/en
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/60Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks

Definitions

  • the present invention generally relates to the field of media communications systems, and particularly to an audio/video system utilizing optical processing.
  • optical processors using holographic image processing/classification techniques are capable of processing information in parallel such that much more complex image processing algorithms such as compression, correlations, and transform decompositions may be processed in much shorter amount of time than with traditional semiconductor processors.
  • Such optically implemented image classification and processing algorithms may provide optimized transmission of video, image and graphics file signals over lower bandwidth channels.
  • voice command recognition, control and neural network signal processing may be implemented with the system by taking advantage of the parallel processing and data classification provided by an optical processing system.
  • the present invention is directed to a system for capturing and processing video and audio for optimized transmission over a lower bandwidth communication channel.
  • the system includes a digital processing system for receiving a video and an audio signal from a media source or sensor wherein the video and audio signals are representative of the media delivered or captured by the media source or sensor, at least one or more optical processing systems coupled to the digital processing system for performing signal classification and processing algorithms on the video and audio signals, and a transceiver for transmitting the combined signals over a lower bandwidth communications channel wherein the signal processing algorithms and classification performed by the optical processing system optimizes the video and audio signals for transmission over the lower bandwidth communications channel.
  • the system provides optical processing for voice recognition and neural network processing.
  • the present system level invention is further directed to a method for transmitting audio and video signals over a lower bandwidth communications channel.
  • the method includes steps for capturing audio and images and providing the information as a multimedia signal, optimizing the signal for transmission over the lower bandwidth communications channel by performing separate compression algorithms on the video and audio signals with an optical processing system, and transmitting the optimized multimedia signal over the lower bandwidth communications channel.
  • FIG. 1 is a block diagram of a video and audio multimedia system capable of optically processing a video signal and an audio signal in accordance with the present invention
  • FIG. 2 is a block diagram of the hardware of a computer processing system operable to tangibly embody a digital processing system of the video and audio multimedia system of the present invention
  • FIG. 3 is a block diagram of a holographic optical processing system for optically processing a video signal and an audio signal in accordance with the present invention.
  • the multimedia system 100 receives video images or files captured with a video transducer 110 of a media source 108 and captures audio signals with an audio transducer 112 .
  • Video transducer 110 may include, for example, a charge coupled device (CCD) array and related hardware for capturing and buffering frames of video.
  • Audio transducer may comprise, for example, a microphone and preamplifier for capturing and amplifying sounds and providing the sound as a signal representative of the sound at a predetermined signal level.
  • the images and sounds captured by video transducer 110 and audio transducer 112 of media source 108 are provided to a digital processing system 114 that may be, for example, an electronic computer system (such as shown and described with respect to FIG. 2 ).
  • An optical processing system 116 couples to digital processing system 116 for performing signal processing algorithms utilizing an optical signal processing apparatus.
  • multimedia system 100 receives a video signal and an audio signal from media source 110 that are provided to digital processing system 114 as a multimedia signal intended to be transmitted to a remote device or location with a transceiver 120 coupled to digital processing system 114 .
  • the bandwidth of the channel 122 over which the image is to be transmitted is too narrow for real-time transmission of a complete, full bandwidth video and audio multimedia signal.
  • the video and audio signals are processed by an optical processing system 116 coupled to digital processing system 114 prior to transmission such that the video and audio signals are optimized for transmission over the limited bandwidth channel 122 .
  • Optical processing system 116 implements a library of definable and programmable signal transforms and holographic image correlators for implementing a wide range of signal decomposition functions including Fourier transforms, Hartley transforms, discrete valued transforms (e.g., z-transforms), transform inversions, signal compression, filtering, etc.
  • a data storage device 118 may be coupled to digital processing system 114 for storing video and audio signal data during processing as required, or for longer term storage of the video and audio signals.
  • the video and audio signals may be transmitted as a processed multimedia signal via channel 122 to be received by a second transceiver 124 disposed at a remote location.
  • the received multimedia signal is processed by a second digital processing system 126 coupled to transceiver 124 for reconstructing the video and audio signals from the received multimedia signal.
  • a second optical processing system 128 coupled to digital processing system 126 implements algorithms for reconstructing the video and audio signals (e.g., inverse transforms) from the received multimedia signal.
  • the video signal may then be displayed on display 134 , and the audio signal may be reproduced with an audio transducer 132 (e.g., an amplifier and speaker).
  • an audio transducer 132 e.g., an amplifier and speaker
  • a data storage device 130 coupled to digital processing system 126 may be used for storing the video and audio signals during processing as required, or for longer term storage of the video and audio signals.
  • the definable and programmable transforms or correlations performed by optical processing systems 116 and 128 may be optimally selected for the particular utilization of media source 108 .
  • a first optical processing algorithm and system may be selected for processing of landscapes
  • a second algorithm or system may be selected for processing buildings
  • a third algorithm or system may be selected for processing vehicles
  • a fourth algorithm or system may be selected for processing images of human beings
  • a fifth algorithm may be selected for processing conversations, and so on.
  • optical processing system 116 or optical processing system 128 is configured to perform a compression algorithm on the video or audio signals as received and delivered through a Transmission Control Protocol/Internet Protocol (TCP/IP) packet communications network.
  • TCP/IP Transmission Control Protocol/Internet Protocol
  • media source 108 may receive video or audio signals, images, graphics files or multimedia files from a TCP/IP packet communications network (e.g., the Internet).
  • the computer system 200 may be utilized for either digital processing system 114 or digital processing system 126 and generally includes a central bus 218 for transferring data among the components of computer system 200 .
  • a clock 210 provides a timing reference signal to the components of computer system 200 via bus 218 and to a central processing unit 212 .
  • Central processing unit 212 is utilized for interpreting and executing instructions and for performing calculations for computer system 200 .
  • Central processing unit 212 may be a special purpose processor such as a digital signal processor.
  • a random access memory (RAM) device 214 couples to bus 218 and to central processing unit 212 for operating as memory for central processing unit 212 and for other devices coupled to bus 218 .
  • a read-only memory device (ROM) 216 is coupled to the components of computer system 200 via bus 218 for operating as memory for storing instructions or data that are normally intended to be read but not to be altered except under specific circumstances (e.g., when the instructions or data are desired to be updated).
  • ROM device 216 typically stores instructions for performing basic input and output functions for computer system 200 and for loading an operating system into RAM device 214 .
  • An input device controller 220 is coupled to bus 218 for allowing an input device 222 to provide input signals into computer system 200 .
  • Input device 222 may be a keyboard, mouse, joystick, trackpad or trackball, microphone, modem, or a similar input device.
  • input device 222 may be a graphical or tactile input device such as a touch pad for inputting data with a finger or a stylus such.
  • Such a graphical or tactile input device 222 may be overlaid upon a screen of a display device 226 for correlating the coordinates of a tactile input with information displayed on display 226 .
  • Display 226 is controlled by a video controller 224 that provides a video signal received via bus 218 to display 226 .
  • Display 226 may be any type of display or monitor suitable for displaying information generated by computer system 200 such as cathode ray tube (CRT), a liquid crystal display (LCD), gas or plasma display, or a field emission display panel.
  • CTR cathode ray tube
  • LCD liquid crystal display
  • FED plasma display
  • a peripheral bus controller 228 couples peripheral devices to central bus 218 of computer system 200 via a peripheral bus 228 .
  • Peripheral bus 230 is preferably in compliance with a standard bus architecture such as an Electrical Industries Association Recommended Standard 232 (RS-232) standard, an Institute of Electrical and Electronics Engineers (IEEE) 1394 serial bus standard, a Peripheral Component Interconnect (PCI) standard, or a Universal Serial Bus (USB) standard, etc.
  • RS-232 Electrical Industries Association Recommended Standard 232
  • IEEE Institute of Electrical and Electronics Engineers
  • PCI Peripheral Component Interconnect
  • USB Universal Serial Bus
  • a mass storage device controller 232 controls a mass storage device 234 for storing large quantities of data or information, such as a quantity of information larger than the capacity of RAM device 214 .
  • Mass storage device 232 is typically non-volatile memory and may be a disk drive such as a hard disk drive, floppy disk drive, optical disk drive, floptical disk drive, etc.
  • optical processing system 300 may be utilized as one or both of optical processing systems 116 and 128 discussed with respect to FIG. 2 .
  • Optical processing system 300 may be utilized to perform a correlation algorithm or the like type of algorithm (e.g., convolution, cross-correlation, etc.).
  • Digital processing system 114 provides the audio signals 330 and the video signals 332 to be processed by optical processing system 300 .
  • An audio scan signal 318 to be processed in conjunction with audio signals 330 is coupled to a spatial light modulator (SLM) 314 for modulating the light beam output of a laser 310 impinging upon SLM 314 .
  • SLM spatial light modulator
  • Audio scan signal 318 may be, for example, a codebook of voice phonemes for decoding phonemes in audio signal 330 .
  • digital processing system 114 provides video signals 332 to a second SLM 316 for modulating the light beam output of a second laser 312 .
  • a video scan signal 320 is provided to SLM 316 such for processing in conjunction with video signals 332 .
  • video scan signal may be a codebook of images to be correlated with images present in video signals 332 .
  • Optical processing system 300 may be thereby utilized to perform correlation processes between audio signals 330 and audio scan signal 318 , and between video signals 332 and video scan signal 320 .
  • Other types of optical processing algorithms may also be implemented by optical processing system 300 .
  • Optical processing system 300 may be utilized to perform an autocorrelation, for example, to remove undesirable noise from audio signal 330 or video signals 332 . Further, optical processing system 300 may be utilized to perform a convolution algorithm, for example, in order to perform digital filtering on audio signals 330 or video signals 332 . Additional signal processing techniques (e.g., compression, signal transforms, etc.) may also be implemented by optical processing system 300 .
  • the modulated laser beams are applied to a lens system 322 for directing and focusing the laser beams through a photorefractive (PR) crystal 324 and thereby impinge upon a detector 328 .
  • PR photorefractive
  • the modulated light beams from lasers 310 and 312 impinge upon photorefractive crystal 324 at different angles of incidence, thereby forming a holographic grating within the crystal.
  • PR crystal 324 may contain data stored holographically that may be further utilized in a signal processing algorithm. For example, a correlation may be performed on audio signals 330 or video signals 332 and the data stored in PR crystal 324 .
  • a third light beam from a third laser 326 is applied to PR crystal 324 for reading out the holographic output onto detector 328 as an output signal such that an appropriate signal processing algorithm on audio signals 330 or video signals 332 318 is obtained.
  • Detector 328 may be a charge-coupled device (CCD) for converting optical signals processed by optical processing system into a digital signal readable by digital processing systems 114 or 126 for further signal processing in a discrete-time domain, discrete-amplitude range.
  • CCD charge-coupled device
  • multimedia processing system 100 may be utilized to process audio signals and video signals received by media source 108 .
  • Audio transducer 112 may include a microphone and preamplifier for capturing an audio signal. The audio signal may be incorporated as part of the video signal captured by video transducer 110 , for example when capturing the image of a person speaking such that the contents of the speech is captured and processed, and the audio and video signals are combined into a multimedia signal (data signal). Additionally, audio transducer 112 may capture voice commands from a user of multimedia system 100 such that the user may operate and control multimedia system 100 through spoken commands and utterances with voice recognition algorithms. Thus, optical processing system 116 may be utilized to process voice commands (i.e. audio instructions) received via audio transducer 112 .
  • voice commands i.e. audio instructions
  • Optical processing system 116 is capable of processing audio command signal faster than a semiconductor based processor since optical processing may be performed in parallel rather than serially as with semiconductor processors.
  • a book of commands may be stored in PR crystal 324 , and a correlation preformed on audio signals 330 to determine which of the commands holographically stored in PR crystal 324 corresponds to the spoken command represented by audio signals 330 .
  • optical processing system 116 may be utilized to implement neural network decision processing algorithms for controlling the operation of multimedia system 100 , for pattern recognition and image and speech analysis. Since multiple data sets may be stored in PR crystal 324 as holographic images, signal processing on all of the data set images stored in PR crystal may be performed simultaneously such that a neural network computing system may be implemented.

Abstract

A system and method for capturing and processing audio and video images for optimized transmission over a lower bandwidth communications channel are disclosed. A digital processing system receives video and audio signals from a media source such that the video and audio signals are representative of captured images and sounds as a multimedia signal. An optical processing system is coupled to the digital processing system and performs signal processing algorithms on the multimedia signal. A transceiver transmits the multimedia signal over a lower bandwidth communications channel wherein the multimedia signal is optimized for transmission over the lower bandwidth communications channel by the signal processing algorithms performed by the optical processing system.

Description

FIELD OF THE INVENTION
The present invention generally relates to the field of media communications systems, and particularly to an audio/video system utilizing optical processing.
BACKGROUND OF THE INVENTION
Traditional semiconductor based digital electronic processors are typically serial devices processing data in a serial manner, i.e. a first operation is performed on a first set of data before a second set of data is fetched and operated upon. Although advents in semiconductor based vocoders and processor architectures, such as predictive branching and higher microprocessor speed, have provided systems capable of performing increasingly faster operations, the fundamental serial structure of semiconductor processing systems inherent in the device technology (e.g., von Neumann architecture) have limited the speed at which complex processing algorithms such as video signal processing and compression may be performed with a general purpose semiconductor based processor. Further, although specialized semiconductor processors have been developed having architectures optimized for signal processing algorithms (e.g., digital signal processors, Harvard architecture), semiconductor devices still exhibit a signal processing limit. These problems become apparent when it is desired to transmit video or audio over a limited bandwidth channel. Since video and audio signals many times have a greater bandwidth than the channel over which it is desired to transmit, compression algorithms are utilized to reduce the bandwidth such that transmission over the lower bandwidth channel may be achieved. However, video and some audio compression algorithms require large amounts of processing power. Using traditional semiconductor processors to perform the compression algorithms in real-time or near real-time many times requires too much processing time to accomplish high quality image or audio compression. Therefore, at best, semiconductor processors only provide lossy video compression (where some information quality and content is sacrificed) as real time compression is approached.
However, optical processors using holographic image processing/classification techniques are capable of processing information in parallel such that much more complex image processing algorithms such as compression, correlations, and transform decompositions may be processed in much shorter amount of time than with traditional semiconductor processors. Such optically implemented image classification and processing algorithms may provide optimized transmission of video, image and graphics file signals over lower bandwidth channels. Furthermore, by combining an optical processor with a media communication system, voice command recognition, control and neural network signal processing may be implemented with the system by taking advantage of the parallel processing and data classification provided by an optical processing system. Thus, there lies a need for a media processing and transmission system that utilizes optical processing to provide faster and more optimized transmission of video and audio signals over lower bandwidth communications channels and that further utilizes optical processing to implement voice command recognition and neural network decision processing.
SUMMARY OF THE INVENTION
The present invention is directed to a system for capturing and processing video and audio for optimized transmission over a lower bandwidth communication channel. In one embodiment, the system includes a digital processing system for receiving a video and an audio signal from a media source or sensor wherein the video and audio signals are representative of the media delivered or captured by the media source or sensor, at least one or more optical processing systems coupled to the digital processing system for performing signal classification and processing algorithms on the video and audio signals, and a transceiver for transmitting the combined signals over a lower bandwidth communications channel wherein the signal processing algorithms and classification performed by the optical processing system optimizes the video and audio signals for transmission over the lower bandwidth communications channel. In a further embodiment, the system provides optical processing for voice recognition and neural network processing.
The present system level invention is further directed to a method for transmitting audio and video signals over a lower bandwidth communications channel. In one embodiment, the method includes steps for capturing audio and images and providing the information as a multimedia signal, optimizing the signal for transmission over the lower bandwidth communications channel by performing separate compression algorithms on the video and audio signals with an optical processing system, and transmitting the optimized multimedia signal over the lower bandwidth communications channel.
It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the invention as claimed.
The accompanying drawings, which are incorporated in and constitute a part of the specification, illustrate an embodiment of the invention and together with the general description, serve to explain the principles of the invention.
BRIEF DESCRIPTION OF THE DRAWINGS
The numerous advantages of the present invention may be better understood by those skilled in the art by reference to the accompanying figures in which:
FIG. 1 is a block diagram of a video and audio multimedia system capable of optically processing a video signal and an audio signal in accordance with the present invention;
FIG. 2 is a block diagram of the hardware of a computer processing system operable to tangibly embody a digital processing system of the video and audio multimedia system of the present invention; and
FIG. 3 is a block diagram of a holographic optical processing system for optically processing a video signal and an audio signal in accordance with the present invention.
DETAILED DESCRIPTION OF THE INVENTION
Reference will now be made in detail to the presently preferred embodiment of the invention, an example of which is illustrated in the accompanying drawings.
Referring now to FIG. 1, a block diagram of a video and audio multimedia system capable of optically processing a video source and an audio source will be discussed. The multimedia system 100 receives video images or files captured with a video transducer 110 of a media source 108 and captures audio signals with an audio transducer 112. Video transducer 110 may include, for example, a charge coupled device (CCD) array and related hardware for capturing and buffering frames of video. Audio transducer may comprise, for example, a microphone and preamplifier for capturing and amplifying sounds and providing the sound as a signal representative of the sound at a predetermined signal level. The images and sounds captured by video transducer 110 and audio transducer 112 of media source 108 are provided to a digital processing system 114 that may be, for example, an electronic computer system (such as shown and described with respect to FIG. 2). An optical processing system 116 couples to digital processing system 116 for performing signal processing algorithms utilizing an optical signal processing apparatus.
In operation, multimedia system 100 receives a video signal and an audio signal from media source 110 that are provided to digital processing system 114 as a multimedia signal intended to be transmitted to a remote device or location with a transceiver 120 coupled to digital processing system 114. Typically, the bandwidth of the channel 122 over which the image is to be transmitted is too narrow for real-time transmission of a complete, full bandwidth video and audio multimedia signal. The video and audio signals are processed by an optical processing system 116 coupled to digital processing system 114 prior to transmission such that the video and audio signals are optimized for transmission over the limited bandwidth channel 122. Optical processing system 116 implements a library of definable and programmable signal transforms and holographic image correlators for implementing a wide range of signal decomposition functions including Fourier transforms, Hartley transforms, discrete valued transforms (e.g., z-transforms), transform inversions, signal compression, filtering, etc. A data storage device 118 may be coupled to digital processing system 114 for storing video and audio signal data during processing as required, or for longer term storage of the video and audio signals.
After optical system 116 performs the desired signal processing, the video and audio signals may be transmitted as a processed multimedia signal via channel 122 to be received by a second transceiver 124 disposed at a remote location. The received multimedia signal is processed by a second digital processing system 126 coupled to transceiver 124 for reconstructing the video and audio signals from the received multimedia signal. A second optical processing system 128 coupled to digital processing system 126 implements algorithms for reconstructing the video and audio signals (e.g., inverse transforms) from the received multimedia signal. Upon reconstruction of the video and audio signals, the video signal may then be displayed on display 134, and the audio signal may be reproduced with an audio transducer 132 (e.g., an amplifier and speaker). A data storage device 130 coupled to digital processing system 126 may be used for storing the video and audio signals during processing as required, or for longer term storage of the video and audio signals. As required by the particular application in which multimedia system 100 is utilized, the definable and programmable transforms or correlations performed by optical processing systems 116 and 128 may be optimally selected for the particular utilization of media source 108. For example, a first optical processing algorithm and system may be selected for processing of landscapes, a second algorithm or system may be selected for processing buildings, a third algorithm or system may be selected for processing vehicles, a fourth algorithm or system may be selected for processing images of human beings, a fifth algorithm may be selected for processing conversations, and so on. In one particular embodiment, either of optical processing system 116 or optical processing system 128, alone or in combination, is configured to perform a compression algorithm on the video or audio signals as received and delivered through a Transmission Control Protocol/Internet Protocol (TCP/IP) packet communications network. For example, media source 108 may receive video or audio signals, images, graphics files or multimedia files from a TCP/IP packet communications network (e.g., the Internet).
Referring now to FIG. 2, a computer hardware system operable to tangibly embody a digital processing system of a multimedia system of the present invention will be discussed. The computer system 200 may be utilized for either digital processing system 114 or digital processing system 126 and generally includes a central bus 218 for transferring data among the components of computer system 200. A clock 210 provides a timing reference signal to the components of computer system 200 via bus 218 and to a central processing unit 212. Central processing unit 212 is utilized for interpreting and executing instructions and for performing calculations for computer system 200. Central processing unit 212 may be a special purpose processor such as a digital signal processor. A random access memory (RAM) device 214 couples to bus 218 and to central processing unit 212 for operating as memory for central processing unit 212 and for other devices coupled to bus 218. A read-only memory device (ROM) 216 is coupled to the components of computer system 200 via bus 218 for operating as memory for storing instructions or data that are normally intended to be read but not to be altered except under specific circumstances (e.g., when the instructions or data are desired to be updated). ROM device 216 typically stores instructions for performing basic input and output functions for computer system 200 and for loading an operating system into RAM device 214.
An input device controller 220 is coupled to bus 218 for allowing an input device 222 to provide input signals into computer system 200. Input device 222 may be a keyboard, mouse, joystick, trackpad or trackball, microphone, modem, or a similar input device. Further, input device 222 may be a graphical or tactile input device such as a touch pad for inputting data with a finger or a stylus such. Such a graphical or tactile input device 222 may be overlaid upon a screen of a display device 226 for correlating the coordinates of a tactile input with information displayed on display 226. Display 226 is controlled by a video controller 224 that provides a video signal received via bus 218 to display 226. Display 226 may be any type of display or monitor suitable for displaying information generated by computer system 200 such as cathode ray tube (CRT), a liquid crystal display (LCD), gas or plasma display, or a field emission display panel. Preferably, display 226 is a flat-panel display having a depth being shallower than its width. A peripheral bus controller 228 couples peripheral devices to central bus 218 of computer system 200 via a peripheral bus 228. Peripheral bus 230 is preferably in compliance with a standard bus architecture such as an Electrical Industries Association Recommended Standard 232 (RS-232) standard, an Institute of Electrical and Electronics Engineers (IEEE) 1394 serial bus standard, a Peripheral Component Interconnect (PCI) standard, or a Universal Serial Bus (USB) standard, etc. A mass storage device controller 232 controls a mass storage device 234 for storing large quantities of data or information, such as a quantity of information larger than the capacity of RAM device 214. Mass storage device 232 is typically non-volatile memory and may be a disk drive such as a hard disk drive, floppy disk drive, optical disk drive, floptical disk drive, etc.
Referring now to FIG. 3, an optical processing system for optically processing video and audio signals in accordance with the present invention will be discussed. The optical processing system of FIG. 3 may be utilized as one or both of optical processing systems 116 and 128 discussed with respect to FIG. 2. Optical processing system 300 may be utilized to perform a correlation algorithm or the like type of algorithm (e.g., convolution, cross-correlation, etc.). Digital processing system 114 provides the audio signals 330 and the video signals 332 to be processed by optical processing system 300. An audio scan signal 318 to be processed in conjunction with audio signals 330 is coupled to a spatial light modulator (SLM) 314 for modulating the light beam output of a laser 310 impinging upon SLM 314. Audio scan signal 318 may be, for example, a codebook of voice phonemes for decoding phonemes in audio signal 330. Likewise, digital processing system 114 provides video signals 332 to a second SLM 316 for modulating the light beam output of a second laser 312. A video scan signal 320 is provided to SLM 316 such for processing in conjunction with video signals 332. For example, video scan signal may be a codebook of images to be correlated with images present in video signals 332. Optical processing system 300 may be thereby utilized to perform correlation processes between audio signals 330 and audio scan signal 318, and between video signals 332 and video scan signal 320. Other types of optical processing algorithms may also be implemented by optical processing system 300. Optical processing system 300 may be utilized to perform an autocorrelation, for example, to remove undesirable noise from audio signal 330 or video signals 332. Further, optical processing system 300 may be utilized to perform a convolution algorithm, for example, in order to perform digital filtering on audio signals 330 or video signals 332. Additional signal processing techniques (e.g., compression, signal transforms, etc.) may also be implemented by optical processing system 300.
The modulated laser beams are applied to a lens system 322 for directing and focusing the laser beams through a photorefractive (PR) crystal 324 and thereby impinge upon a detector 328. The modulated light beams from lasers 310 and 312 impinge upon photorefractive crystal 324 at different angles of incidence, thereby forming a holographic grating within the crystal. Further, PR crystal 324 may contain data stored holographically that may be further utilized in a signal processing algorithm. For example, a correlation may be performed on audio signals 330 or video signals 332 and the data stored in PR crystal 324. A third light beam from a third laser 326 is applied to PR crystal 324 for reading out the holographic output onto detector 328 as an output signal such that an appropriate signal processing algorithm on audio signals 330 or video signals 332 318 is obtained. Detector 328 may be a charge-coupled device (CCD) for converting optical signals processed by optical processing system into a digital signal readable by digital processing systems 114 or 126 for further signal processing in a discrete-time domain, discrete-amplitude range.
It will be seen that multimedia processing system 100 may be utilized to process audio signals and video signals received by media source 108. Audio transducer 112 may include a microphone and preamplifier for capturing an audio signal. The audio signal may be incorporated as part of the video signal captured by video transducer 110, for example when capturing the image of a person speaking such that the contents of the speech is captured and processed, and the audio and video signals are combined into a multimedia signal (data signal). Additionally, audio transducer 112 may capture voice commands from a user of multimedia system 100 such that the user may operate and control multimedia system 100 through spoken commands and utterances with voice recognition algorithms. Thus, optical processing system 116 may be utilized to process voice commands (i.e. audio instructions) received via audio transducer 112. Optical processing system 116 is capable of processing audio command signal faster than a semiconductor based processor since optical processing may be performed in parallel rather than serially as with semiconductor processors. For example, a book of commands may be stored in PR crystal 324, and a correlation preformed on audio signals 330 to determine which of the commands holographically stored in PR crystal 324 corresponds to the spoken command represented by audio signals 330. Furthermore, optical processing system 116 may be utilized to implement neural network decision processing algorithms for controlling the operation of multimedia system 100, for pattern recognition and image and speech analysis. Since multiple data sets may be stored in PR crystal 324 as holographic images, signal processing on all of the data set images stored in PR crystal may be performed simultaneously such that a neural network computing system may be implemented.
It is believed that the video capture and compression system using holographic optical correlation, signal transform decomposition, voice recognition and neural network decision processing of the present invention and many of its attendant advantages will be understood by the foregoing description, and it will be apparent that various changes may be made in the form, construction and arrangement of the components thereof without departing from the scope and spirit of the invention or without sacrificing all of its material advantages. The form herein before described being merely an explanatory embodiment thereof. It is the intention of the following claims to encompass and include such changes.

Claims (20)

What is claimed is:
1. A system for capturing and processing video and audio for optimized transmission over a lower bandwidth communications channel, comprising: a digital processing system for receiving a video signal and an audio signal from
a media source, the video signal being representative of an image captured by the media source and the audio signal being representative of sounds captured by the media source;
an optical processing system, coupled to said digital processing system, for performing a signal processing algorithm on the video signal and a for performing a signal processing algorithm on the audio signal; and
a transceiver for transmitting the video and audio signals over a lower bandwidth communications channel as a multimedia signal, wherein the signal processing algorithms performed by said optical processing system optimize the video and audio signals for transmission over the lower bandwidth communications channel;
said optical processing system comprising first and second spatial light modulators, said first spatial light modulator encoding audio signals onto a first light beam and said second spatial light modulator encoding video signals onto a second light beam, a lens system for focusing the first and second light beams through a photorefractive crystal, said photorefractive crystal being actuated with a control light beam such that a holographic output is provide by said photorefractive crystal, and a detector for detecting the holographic output of said photorefractive crystal and providing an output in response thereto, the output of said detector being representative of a signal processing routine performed on the video or audio signals by said optical processing system.
2. A system as claimed in claim 1, said digital processing system comprising a semiconductor processor and a memory coupled to said semiconductor processor, said semiconductor processor for executing a program of instructions stored in said memory.
3. A system as claimed in claim 1, said optical processing system being configured to perform a compression algorithm on the video or audio signal.
4. A system as claimed in claim 1, said optical processing system being configured to perform a correlation algorithm on the video signal or the audio signal, respectively, and a reference signal.
5. A system as claimed in claim 1, said optical processing system being configured to perform a transform algorithm on the video signal or the audio signal.
6. A system as claimed in claim 1, said optical processing system being configured to perform an inverse transform on the video signal or the audio signal.
7. A system as claimed in claim 1, the audio signal being captured with an audio transducer, said optical processing system being configured to perform voice command recognition of the audio signal for controlling the system.
8. A system as claimed in claim 1, said optical processing system being configured to perform a neural network algorithm.
9. A system as claimed in claims 1, said optical processing system being configured to perform a compression algorithm on the video or audio signals as received and delivered through a TCP/IP packet communications network.
10. A system for capturing and processing video and audio signals for optimized transmission over a lower bandwidth communications channel, comprising:
means for receiving a video signal and an audio signal, the video signal being representative of captured images, and the audio signal being representative of captured sounds;
processing means, coupled to said signal receiving means, for performing an optical signal processing algorithm on the video signal and the audio signal; and
means for transmitting the video signal and the audio signal over a lower bandwidth communications channel as a multimedia signal, wherein the optical signal processing algorithm performed by said processing means optimizes the video and audio signals for transmission over the lower bandwidth communications channel;
said processing system comprising first and second means for encoding a signal onto a beam of light, said first encoding means encoding the video signal onto a first light beam and said encoding means encoding the audio signal onto a second light beam, means for focusing the first and second light beams through means for storing data holographically, said holographic data storage means being actuated with a control light beam such that a holographic output is provide by said holographic data storage means, and means for detecting the holographic output of said holographic data storage means and providing an output in response thereto, the output of said detecting means being representative of a signal processing routine performed on the video or audio signals by said processing means.
11. A system as claimed in claim 10, said signal receiving means comprising electronic processing means and means, coupled to said electronic processing means, for storing a program of instructions, said electronic processing means executing a program of instructions stored in said storing means.
12. A system as claimed in claim 10, said processing means being configured to perform a compression algorithm on the video or audio signals.
13. A system as claimed in claim 10, said processing means being configured to perform a correlation algorithm on the video or audio signals, respectively, and a reference signal.
14. A system as claimed in claim 10, said processing means being configured to perform a transform algorithm on the video or audio signals.
15. A system as claimed in claim 10, said processing means being configured to perform an inverse transform on the video or audio signals.
16. A system as claimed in claim 10, the audio signal being captured with means for transducing an audio signal, said processing means being configured to perform voice command recognition of the audio signal for controlling the system.
17. A system as claimed in claim 10, said processing means being configured to perform a neural network algorithm.
18. A system as claimed in claim 10, said processing means being configured to perform a compression algorithm on the video or audio signals as received and delivered through a TCP/IP packet communications network.
19. A method for transmitting a video signal and an audio signal over a lower bandwidth communications channel, comprising:
capturing images and sounds and providing the images and sounds as a multimedia signal;
optimizing the multimedia signal for transmission over a lower bandwidth communications channel by performing signal processing algorithms on the multimedia signal with an optical processing system; and
transmitting the optimized multimedia signal over the lower bandwidth communications channel;
the optical processing system comprising first and second spatial light modulators, said first spatial light modulator encoding audio signals onto a first light beam and said second spatial light modulator encoding video signals onto a second light beam, a lens system for focusing the first and second light beams through a photorefractive crystal, said photorefractive crystal being actuated with a control light beam such that a holographic output is provide by said photorefractive crystal, and a detector for detecting the holographic output of said photorefractive crystal and providing an output in response thereto, the output of said detector being representative of a signal processing routine performed on the video or audio signals by said optical processing system.
20. A method as claimed in claim 19, further comprising the steps of receiving the optimized multimedia signal and decoding the optimized multimedia signal with an optical processing system whereby the images may be displayed on a display and the sounds may be reproduced.
US09/360,512 1999-07-26 1999-07-26 Media capture and compression communication system using holographic optical classification, voice recognition and neural network decision processing Expired - Lifetime US6216267B1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US09/360,512 US6216267B1 (en) 1999-07-26 1999-07-26 Media capture and compression communication system using holographic optical classification, voice recognition and neural network decision processing

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US09/360,512 US6216267B1 (en) 1999-07-26 1999-07-26 Media capture and compression communication system using holographic optical classification, voice recognition and neural network decision processing

Publications (1)

Publication Number Publication Date
US6216267B1 true US6216267B1 (en) 2001-04-10

Family

ID=23418293

Family Applications (1)

Application Number Title Priority Date Filing Date
US09/360,512 Expired - Lifetime US6216267B1 (en) 1999-07-26 1999-07-26 Media capture and compression communication system using holographic optical classification, voice recognition and neural network decision processing

Country Status (1)

Country Link
US (1) US6216267B1 (en)

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6487526B1 (en) * 1999-04-14 2002-11-26 Rockwell Collins Vector correlator for speech VOCODER using optical processor
US20030139181A1 (en) * 2002-01-22 2003-07-24 Honeywell International, Inc. VHF ground station selection algorithm
US20040019574A1 (en) * 2002-04-19 2004-01-29 Zhuo Meng Processing mixed numeric and/or non-numeric data
EP1411641A2 (en) * 2002-10-15 2004-04-21 Communications Research Laboratory, Independent Administrative Institution Error correction decoding using optical signal processing
WO2004107072A1 (en) * 2003-05-30 2004-12-09 Trw Limited An optical driver assistance system for a motor vehicle
WO2005107399A2 (en) * 2004-04-30 2005-11-17 Vulcan Inc. Voice control of multimedia content
WO2005107407A2 (en) * 2004-04-30 2005-11-17 Vulcan Inc. Voice control of television-related information
US7675461B1 (en) 2007-09-18 2010-03-09 Rockwell Collins, Inc. System and method for displaying radar-estimated terrain
US8049644B1 (en) 2007-04-17 2011-11-01 Rcokwell Collins, Inc. Method for TAWS depiction on SVS perspective displays
US20140099068A1 (en) * 2001-05-02 2014-04-10 Mantrose Group Limited Liability Company System and methodology for utilizing a portable media player
NO342409B1 (en) * 2007-10-26 2018-05-14 Hung Nguyen Method and system for transmitting multimedia content using an existing digital audio transmission protocol
CN110493596A (en) * 2019-09-02 2019-11-22 西北工业大学 A kind of video coding framework neural network based
US20220115002A1 (en) * 2020-10-14 2022-04-14 Beijing Horizon Robotics Technology Research And Development Co., Ltd. Speech recognition method, speech recognition device, and electronic equipment

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4156914A (en) * 1977-08-18 1979-05-29 Baird Corporation Computer image display system and processor therefor
US5253530A (en) * 1991-08-12 1993-10-19 Letcher Iii John H Method and apparatus for reflective ultrasonic imaging
US5659665A (en) * 1994-12-08 1997-08-19 Lucent Technologies Inc. Method and apparatus for including speech recognition capabilities in a computer system
US5793871A (en) * 1996-11-26 1998-08-11 California Institute Of Technology Optical encryption interface
US5953452A (en) * 1992-11-05 1999-09-14 The Johns Hopkins University Optical-digital method and processor for pattern recognition

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4156914A (en) * 1977-08-18 1979-05-29 Baird Corporation Computer image display system and processor therefor
US5253530A (en) * 1991-08-12 1993-10-19 Letcher Iii John H Method and apparatus for reflective ultrasonic imaging
US5953452A (en) * 1992-11-05 1999-09-14 The Johns Hopkins University Optical-digital method and processor for pattern recognition
US5659665A (en) * 1994-12-08 1997-08-19 Lucent Technologies Inc. Method and apparatus for including speech recognition capabilities in a computer system
US5793871A (en) * 1996-11-26 1998-08-11 California Institute Of Technology Optical encryption interface

Non-Patent Citations (7)

* Cited by examiner, † Cited by third party
Title
An Optical Fourier/Electronic Neurocomputer Automated Inspection system. David E. Glover. Global Holonetics Corporation, Fairfield, IA 52556.*
Digital Video Compression-An overview. Joseph B. Waltrich, Member, IEEE. Journal of Lightwave technology, vol. 11, No. 1, Jan. 1993.*
Digital Video Compression—An overview. Joseph B. Waltrich, Member, IEEE. Journal of Lightwave technology, vol. 11, No. 1, Jan. 1993.*
Method and apparatus for machine processing using wavelet/wavelet-packet bases. Coifman Ronald et al. WO 91/18362. Nov. 28, 1991.*
Optical Computing Techniques for Image/Video compression. Akitoshi Yoshida and John H. Reif, Fellow, IEEE. Proceeding of the IEEE, vol. 82, No. 6, Jun. 1994. *
Optical Hartley Transforms. John D. Villasensor, member IEEE. Proceedings of IEEE, vol. 82, No. 3, Mar. 1994.*
Two-dimensional all-optical parallel-to series TDM and demultiplexing by spectral-holographic methods. Peter H. Handel, Physics Dept. Univ. of Missouri, St. Louis, Mo 63121.*

Cited By (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6487526B1 (en) * 1999-04-14 2002-11-26 Rockwell Collins Vector correlator for speech VOCODER using optical processor
US9819898B2 (en) 2001-05-02 2017-11-14 Callahan Cellular L.L.C. System and methodology for utilizing a portable media player
US9124852B2 (en) * 2001-05-02 2015-09-01 Mantrose Group Limited Liability Company System and methodology for utilizing a portable media player
US20140099068A1 (en) * 2001-05-02 2014-04-10 Mantrose Group Limited Liability Company System and methodology for utilizing a portable media player
US20030139181A1 (en) * 2002-01-22 2003-07-24 Honeywell International, Inc. VHF ground station selection algorithm
US7184763B2 (en) 2002-01-22 2007-02-27 Honeywell International, Inc. VHF ground station selection algorithm
US20040019574A1 (en) * 2002-04-19 2004-01-29 Zhuo Meng Processing mixed numeric and/or non-numeric data
US7423793B2 (en) 2002-10-15 2008-09-09 National Institute Of Information And Communication Technology Method and apparatus for decoding digital signal
EP1411641A2 (en) * 2002-10-15 2004-04-21 Communications Research Laboratory, Independent Administrative Institution Error correction decoding using optical signal processing
US20040175184A1 (en) * 2002-10-15 2004-09-09 Communications Res. Lab., Ind. Admin. Inst. Method and apparatus for decoding digital signal
EP1411641A3 (en) * 2002-10-15 2004-11-17 National Institute of Information and Communications Technology, Independent Administrative Institution Error correction decoding using optical signal processing
WO2004107072A1 (en) * 2003-05-30 2004-12-09 Trw Limited An optical driver assistance system for a motor vehicle
US20060075429A1 (en) * 2004-04-30 2006-04-06 Vulcan Inc. Voice control of television-related information
WO2005107407A3 (en) * 2004-04-30 2007-07-26 Vulcan Inc Voice control of television-related information
WO2005107399A3 (en) * 2004-04-30 2007-03-01 Vulcan Inc Voice control of multimedia content
US20060041926A1 (en) * 2004-04-30 2006-02-23 Vulcan Inc. Voice control of multimedia content
WO2005107407A2 (en) * 2004-04-30 2005-11-17 Vulcan Inc. Voice control of television-related information
WO2005107399A2 (en) * 2004-04-30 2005-11-17 Vulcan Inc. Voice control of multimedia content
US8049644B1 (en) 2007-04-17 2011-11-01 Rcokwell Collins, Inc. Method for TAWS depiction on SVS perspective displays
US7675461B1 (en) 2007-09-18 2010-03-09 Rockwell Collins, Inc. System and method for displaying radar-estimated terrain
NO342409B1 (en) * 2007-10-26 2018-05-14 Hung Nguyen Method and system for transmitting multimedia content using an existing digital audio transmission protocol
CN110493596A (en) * 2019-09-02 2019-11-22 西北工业大学 A kind of video coding framework neural network based
US20220115002A1 (en) * 2020-10-14 2022-04-14 Beijing Horizon Robotics Technology Research And Development Co., Ltd. Speech recognition method, speech recognition device, and electronic equipment

Similar Documents

Publication Publication Date Title
US6216267B1 (en) Media capture and compression communication system using holographic optical classification, voice recognition and neural network decision processing
CN101313483B (en) Configuration of echo cancellation
US8441515B2 (en) Method and apparatus for minimizing acoustic echo in video conferencing
JP2000215021A (en) Voice control input method for portable input device
US20190066654A1 (en) Adaptive suppression for removing nuisance audio
US20210287652A1 (en) System and method for data augmentation of feature-based voice data
EP3522570A2 (en) Spatial audio signal filtering
KR102191736B1 (en) Method and apparatus for speech enhancement with artificial neural network
CN111654806B (en) Audio playing method and device, storage medium and electronic equipment
US6487526B1 (en) Vector correlator for speech VOCODER using optical processor
US10079028B2 (en) Sound enhancement through reverberation matching
JP4952368B2 (en) Sound collector
CN110517682A (en) Audio recognition method, device, equipment and storage medium
US11665373B2 (en) Virtual spectator experience for live events
CN112261236B (en) Method and equipment for mute processing in multi-person voice
WO2022173982A1 (en) Comparing acoustic relative transfer functions from at least a pair of time frames
CN116508099A (en) Deep learning-based speech enhancement
Shi et al. Automatic gain control for parametric array loudspeakers
EP4303868A1 (en) Audio signal processing method, devices, system, and storage medium
CN111145769A (en) Audio processing method and device
WO2017136587A1 (en) Adaptive suppression for removing nuisance audio
US11532314B2 (en) Amplitude-independent window sizes in audio encoding
JP2003298916A (en) Imaging apparatus, data processing apparatus and method, and program
US20230230599A1 (en) Data augmentation system and method for multi-microphone systems
WO2023234939A1 (en) Methods and systems for audio processing using visual information

Legal Events

Date Code Title Description
AS Assignment

Owner name: ROCKWELL COLLINS, INC., IOWA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:MITCHELL, JAMES P.;REEL/FRAME:010136/0446

Effective date: 19990726

STCF Information on status: patent grant

Free format text: PATENTED CASE

REMI Maintenance fee reminder mailed
FPAY Fee payment

Year of fee payment: 4

SULP Surcharge for late payment
REMI Maintenance fee reminder mailed
FPAY Fee payment

Year of fee payment: 8

SULP Surcharge for late payment

Year of fee payment: 7

FPAY Fee payment

Year of fee payment: 12