WO2007138617A1 - Video camera for desktop videocommunication - Google Patents

Video camera for desktop videocommunication Download PDF

Info

Publication number
WO2007138617A1
WO2007138617A1 PCT/IT2006/000395 IT2006000395W WO2007138617A1 WO 2007138617 A1 WO2007138617 A1 WO 2007138617A1 IT 2006000395 W IT2006000395 W IT 2006000395W WO 2007138617 A1 WO2007138617 A1 WO 2007138617A1
Authority
WO
WIPO (PCT)
Prior art keywords
video camera
digital
audio
microphones
fire
Prior art date
Application number
PCT/IT2006/000395
Other languages
French (fr)
Inventor
Andrea Santilli
Original Assignee
Asdsp S.R.L.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Asdsp S.R.L. filed Critical Asdsp S.R.L.
Priority to PCT/IT2006/000395 priority Critical patent/WO2007138617A1/en
Publication of WO2007138617A1 publication Critical patent/WO2007138617A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/141Systems for two-way working between two video terminals, e.g. videophone
    • H04N7/142Constructional details of the terminal equipment, e.g. arrangements of the camera and the display
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41Structure of client; Structure of client peripherals
    • H04N21/414Specialised client platforms, e.g. receiver in car or embedded in a mobile appliance
    • H04N21/4143Specialised client platforms, e.g. receiver in car or embedded in a mobile appliance embedded in a Personal Computer [PC]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41Structure of client; Structure of client peripherals
    • H04N21/422Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
    • H04N21/42203Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS] sound input device, e.g. microphone
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41Structure of client; Structure of client peripherals
    • H04N21/422Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
    • H04N21/4223Cameras
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/478Supplemental services, e.g. displaying phone caller identification, shopping application
    • H04N21/4788Supplemental services, e.g. displaying phone caller identification, shopping application communicating with other users, e.g. chatting

Definitions

  • This invention refers to an integrated digital video camera, to be used for desktop videoconferencing. Background of the invention
  • the audio quality is still poor and strongly limited by the audio capturing devices which are presently available.
  • hand-free communication is often made possible only by wearing a headsets with built in microphone (placed very close to the user's mouth) : using the PC loudspeakers and a fixed microphone (either tabletop or built in the webcam) would require the use of an acoustic echo canceller (AEC) to have a full-duplex communication.
  • AEC acoustic echo canceller
  • EP 1 377 041 discloses a web cam which is composed by 5 or more cameras, positioned along radii around a table, supported by a plastic rod and carrying array microphones on a basis thereof. This contruction is comparatively expensive and hindering. Summary of the invention
  • Fig. 1 shows a structure of webcam and microphones according to a preferred embodiment of the invention
  • Fig. 2 shows a structure of webcam and microphones according to an alternative embodiment
  • Fig. 3 shows a structure of webcam and microphones according to another alternative embodiment of the invention. Best way to carry out the invention
  • the overall device to be employed for PC based desktop videoconferencing comprises a video camera lens and an audio capturing device.
  • a video camera lens a conventional webcam can be used.
  • a high-speed USB 2 link is a sufficient even though a IEEE 1394 (FireWire) interface is to be preferred for the ASIO 2 audio I/O support, which is preferably employed.
  • the audio system is separated from the video camera lens, since they are usually independent and work on different kind of circuit.
  • video camera and microphones form a unit, directly connected to the host system (for instance a personal computer) . This arrangement allows large space sparings and to optimise the system configuration.
  • a broadside linear array microphones (for instance, four elements 1 to 4) can be easily designed so as to exhibit a size smaller than the width of most PC monitors including desktop LCD screens and laptop lid displays. With the typical microphone spacing (4-6 cm) it is pretty easy to place the camera lens 5 between the central two microphones 2 and 3.
  • an alternative embodiment uses a single end-fire array (microphones 1 and 2 are shown in Fig.2) could be positioned either on the bottom or on the top of the camera lens 5.
  • another alternative embodiment includes a double end-fire array, where each end-fire array is placed on each side of the central camera lens. In such an arrangement, the two end-fire outputs are mixed together.
  • many existing beamforming algorithms able to steer a single beam to the front direction can achieve a significant signal-to-noise ratio improvement compared to a single microphone solution. Because of its low sensitivity to calibration errors, a weighed delay and sum beamformer is chosen here.
  • cardioid microphones instead of the omnidirectional capsules used in most of the commercial microphone arrays, a significant back rejection is achieved with no extra complexity: anyway, special care must be taken while designing the plastic case as cardioid microphones require large openings .

Abstract

An integrated digital video camera is described, to be used for desktop videoconferencing, including a digital camera which captures video image data, a microphone array, a digital audio interface for capturing microphone signals, a digital audio interface and a high speed digital interface.

Description

VIDEO CAMERA FOR DESKTOP VIDEOCOMMUNICATION Field of the invention
This invention refers to an integrated digital video camera, to be used for desktop videoconferencing. Background of the invention
Desktop videoconferencing is becoming more and more popular, mainly due to the availability of Internet connections exhibiting both low costs and high speed and video-communication solutions based on free software: presently most the Internet Messaging ■ Service providers have added a voice chat and video chat functionalities and hence a large customer base is getting accustomed to PC based videoconferencing (software videophone) .
Despite the video quality is mainly limited by the connection speed and hence is expected to quickly improve in the near future, the audio quality is still poor and strongly limited by the audio capturing devices which are presently available. In particular, hand-free communication is often made possible only by wearing a headsets with built in microphone (placed very close to the user's mouth) : using the PC loudspeakers and a fixed microphone (either tabletop or built in the webcam) would require the use of an acoustic echo canceller (AEC) to have a full-duplex communication. In order to have a conventional acoustic echo canceller properly working, an accurate control of the audio latency is required: this is very difficult or impossible with the typical consumer level, Windows compatible audio I/O interfaces; professional level audio cards have always low and consistent latency (usually support ASIO 2 drivers) and so are suitable for AEC applications.
Several microphone array products are already available on the market but they are usually designed for speech recognition (speech-to-text) applications and so no special care is taken to make it possible for such devices to be used as input source of an echo canceller based, hand-free system: furthermore they should be placed in the same place where a webcam is usually placed: the top border of a PC monitor. So integrating a microphone array into a digital video camera (webcam) make it possible for this two peripherals to share the same location and the same digital connection while further benefits can be achieved by integrating a constant latency audio I/O interface (ASIO 2) in order to make AEC integration viable.
EP 1 377 041 discloses a web cam which is composed by 5 or more cameras, positioned along radii around a table, supported by a plastic rod and carrying array microphones on a basis thereof. This contruction is comparatively expensive and hindering. Summary of the invention
All of the above problems are brilliantly solved by this invention, which refers to an integrated digital video camera, to be used for desktop videoconferencing, characterised in that it includes a digital camera lens which captures video image data, a microphone array, a digital audio input interface for capturing microphone signals, digital audio output interface to send the received signals to loudspeakers and a high speed digital interface to send both audio and video data to the host system. Brief description of the drawings Fig. 1 shows a structure of webcam and microphones according to a preferred embodiment of the invention;
Fig. 2 shows a structure of webcam and microphones according to an alternative embodiment; and
Fig. 3 shows a structure of webcam and microphones according to another alternative embodiment of the invention. Best way to carry out the invention
As already stated, this invention refers to an integrated digital video camera, to be used for desktop video conferencing. According to a preferred embodiment, the overall device to be employed for PC based desktop videoconferencing comprises a video camera lens and an audio capturing device. As a video camera lens, a conventional webcam can be used. Preferably, in order to get a high-quality grade of the overall solution, it is desirable to choose the best state-of-the-art optical and CCD sensors, in order to provide true VGA resolution at 30 fps . For this kind of digital cameras a high-speed USB 2 link is a sufficient even though a IEEE 1394 (FireWire) interface is to be preferred for the ASIO 2 audio I/O support, which is preferably employed. In conventional devices, the audio system is separated from the video camera lens, since they are usually independent and work on different kind of circuit. According to the present invention, video camera and microphones form a unit, directly connected to the host system (for instance a personal computer) . This arrangement allows large space sparings and to optimise the system configuration.
As shown in Fig. 1, according to a preferred embodiment, a broadside linear array microphones (for instance, four elements 1 to 4) can be easily designed so as to exhibit a size smaller than the width of most PC monitors including desktop LCD screens and laptop lid displays. With the typical microphone spacing (4-6 cm) it is pretty easy to place the camera lens 5 between the central two microphones 2 and 3.
As shown in Fig. 2, an alternative embodiment uses a single end-fire array (microphones 1 and 2 are shown in Fig.2) could be positioned either on the bottom or on the top of the camera lens 5.
As shown in Fig. 3, another alternative embodiment includes a double end-fire array, where each end-fire array is placed on each side of the central camera lens. In such an arrangement, the two end-fire outputs are mixed together.
In the above embodiments, many existing beamforming algorithms, able to steer a single beam to the front direction can achieve a significant signal-to-noise ratio improvement compared to a single microphone solution. Because of its low sensitivity to calibration errors, a weighed delay and sum beamformer is chosen here. By using cardioid microphones instead of the omnidirectional capsules used in most of the commercial microphone arrays, a significant back rejection is achieved with no extra complexity: anyway, special care must be taken while designing the plastic case as cardioid microphones require large openings .
According to the second and the third embodiments , shown in Figs. 2 and 3 respectively, it is possible to attain a higher directivity (the so called "superdirectivity" ) with respect to the prior art devices, and his adaptive generalisation allows to drive a null of the directional response toward the strongest interfering noise sources and hence to achieve a significant background noise rejection, without affecting the speech coming from the front direction.
Beside the proposed array geometries and the related beamforming algorithms many other combinations of array geometries and beamforming algorithms can be used, and the above embodiments are by no means intended to limit the scope of the present invention, which is mainly directed to a combined microphone array to be used as voice capture device for hand free communication, combined with a digital camera lens in order to achieve an audio/video peripheral of reasonably small size, so that it can be installed on the top of a video screen, as it can be argued from the annexed claims.

Claims

1) An integrated digital video camera, to be used for desktop videoconferencing, characterised in that it includes a digital camera lens (5) which captures video image data, a microphone array (1 to 4) , a digital audio input interface for capturing microphone signals, a digital audio output interface to send the receeived signals to the loudspeakers and a high speed digital interface to send both audio and video data to the host system.
2) A video camera as claimed in claim 1) , characterised in that true VGA resolution at 30 fps is provided.
3) A video camera as claimed in claim 1) or 2) , characterised in that it employs an ASIO 2 audio I/O support.
4) A video camera as claimed in claim 3) , characterised in that a high-speed USB 2 link or a IEEE 1394 (FireWire) interface is provided for the ASIO 2 audio I/O support.
5) A video camera as claimed in any previous claim, characterised in that a broadside linear array microphones (1 to 4) are designed so as to exhibit a size smaller than the width of most PC monitors including desktop LCD screens and laptop lid displays .
6) A video camera as claimed in claim 5) , characterised in that microphones are spaced by 4-6 cm from each other.
7) A video camera as in claim 6) , characterised in that the camera lens (5) is positioned between the two central microphones (2 and 3) .
8} A video camera as claimed in any claim 1) to 4) , characterised in that a single end-fire microphone array is provided .
9) A video camera as claimed in claim 8) , characterised in that the end-fire array is positioned either on the bottom or on the top of the camera lens (5) .
10) A video camera as claimed in any claim 1) to 4) , characterised in that a double end-fire arrays is provided.
11) A video camera as claimed in claim 10) , characterised in that the end-fire arrays are positioned apart from the video camera lens (5) .
12) A video camera as claimed in claim 10) or 11) , characterised in that the two end fire output are mixed together. 13) A video camera as claimed in any previous claim, characterised in that one beamforming algorithm, able to steer a single beam to the front direction, is employed, in order to attain a significant signal-to-noise ratio (RSD) improvement.
14) A video camera as set forth in any previous claim, characterised in that a weighed delay and sum beamformer is chosen.
15) A video camera as claimed in any previous claim, characterised in that a cardioid microphones is used.
PCT/IT2006/000395 2006-05-25 2006-05-25 Video camera for desktop videocommunication WO2007138617A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/IT2006/000395 WO2007138617A1 (en) 2006-05-25 2006-05-25 Video camera for desktop videocommunication

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/IT2006/000395 WO2007138617A1 (en) 2006-05-25 2006-05-25 Video camera for desktop videocommunication

Publications (1)

Publication Number Publication Date
WO2007138617A1 true WO2007138617A1 (en) 2007-12-06

Family

ID=37649468

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IT2006/000395 WO2007138617A1 (en) 2006-05-25 2006-05-25 Video camera for desktop videocommunication

Country Status (1)

Country Link
WO (1) WO2007138617A1 (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030160862A1 (en) * 2002-02-27 2003-08-28 Charlier Michael L. Apparatus having cooperating wide-angle digital camera system and microphone array
EP1377041A2 (en) * 2002-06-27 2004-01-02 Microsoft Corporation Integrated design for omni-directional camera and microphone array
US20040012669A1 (en) * 2002-03-25 2004-01-22 David Drell Conferencing system with integrated audio driver and network interface device
US20040041902A1 (en) * 2002-04-11 2004-03-04 Polycom, Inc. Portable videoconferencing system

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030160862A1 (en) * 2002-02-27 2003-08-28 Charlier Michael L. Apparatus having cooperating wide-angle digital camera system and microphone array
US20040012669A1 (en) * 2002-03-25 2004-01-22 David Drell Conferencing system with integrated audio driver and network interface device
US20040041902A1 (en) * 2002-04-11 2004-03-04 Polycom, Inc. Portable videoconferencing system
EP1377041A2 (en) * 2002-06-27 2004-01-02 Microsoft Corporation Integrated design for omni-directional camera and microphone array

Similar Documents

Publication Publication Date Title
JP4252377B2 (en) System for omnidirectional camera and microphone array
US9247334B2 (en) Portable electronic device
EP2172054B1 (en) Microphone array for a camera speakerphone
US7822338B2 (en) Camera for electronic device
US11477413B2 (en) System and method for providing wide-area imaging and communications capability to a handheld device
US9736427B1 (en) Communication system
US9294839B2 (en) Augmentation of a beamforming microphone array with non-beamforming microphones
US7724284B2 (en) Multi-camera system and method having a common processing block
EP2823631B1 (en) Portable electronic device with directional microphones for stereo recording
US8451315B2 (en) System and method for distributed meeting capture
KR102327160B1 (en) Apparatus and method for processing image received through a plurality of cameras
US10609476B2 (en) Display apparatus and communication terminal
US20080205874A1 (en) Wide coverage image capturing and standable web phone
WO2007138617A1 (en) Video camera for desktop videocommunication
WO2007136648A2 (en) Imaging panels including arrays of audio and video input and output elements
US20240064420A1 (en) Cameras for multiple views
KR200249455Y1 (en) Camera having microphone and speaker integrated therein
JP6169195B2 (en) Delegate unit and conference system equipped with the delegate unit
Liles CONFERENCE MICS AND SYSTEMS.
TW201735652A (en) Intelligence TV external?device connecting piece
JP2020184656A (en) Sound acquisition control system, information terminal, sound acquisition control method, and program
US20090103756A1 (en) Multi-Media Device
West Microphones for speech and speech recognition
CA2947565A1 (en) Connecting piece for external device of smart television
TWM355519U (en) Photographic device and moving equipment

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 06766300

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 06766300

Country of ref document: EP

Kind code of ref document: A1