WO2007138617A1 - Video camera for desktop videocommunication - Google Patents
Video camera for desktop videocommunication Download PDFInfo
- Publication number
- WO2007138617A1 WO2007138617A1 PCT/IT2006/000395 IT2006000395W WO2007138617A1 WO 2007138617 A1 WO2007138617 A1 WO 2007138617A1 IT 2006000395 W IT2006000395 W IT 2006000395W WO 2007138617 A1 WO2007138617 A1 WO 2007138617A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- video camera
- digital
- audio
- microphones
- fire
- Prior art date
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/14—Systems for two-way working
- H04N7/141—Systems for two-way working between two video terminals, e.g. videophone
- H04N7/142—Constructional details of the terminal equipment, e.g. arrangements of the camera and the display
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/41—Structure of client; Structure of client peripherals
- H04N21/414—Specialised client platforms, e.g. receiver in car or embedded in a mobile appliance
- H04N21/4143—Specialised client platforms, e.g. receiver in car or embedded in a mobile appliance embedded in a Personal Computer [PC]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/41—Structure of client; Structure of client peripherals
- H04N21/422—Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
- H04N21/42203—Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS] sound input device, e.g. microphone
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/41—Structure of client; Structure of client peripherals
- H04N21/422—Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
- H04N21/4223—Cameras
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/47—End-user applications
- H04N21/478—Supplemental services, e.g. displaying phone caller identification, shopping application
- H04N21/4788—Supplemental services, e.g. displaying phone caller identification, shopping application communicating with other users, e.g. chatting
Definitions
- This invention refers to an integrated digital video camera, to be used for desktop videoconferencing. Background of the invention
- the audio quality is still poor and strongly limited by the audio capturing devices which are presently available.
- hand-free communication is often made possible only by wearing a headsets with built in microphone (placed very close to the user's mouth) : using the PC loudspeakers and a fixed microphone (either tabletop or built in the webcam) would require the use of an acoustic echo canceller (AEC) to have a full-duplex communication.
- AEC acoustic echo canceller
- EP 1 377 041 discloses a web cam which is composed by 5 or more cameras, positioned along radii around a table, supported by a plastic rod and carrying array microphones on a basis thereof. This contruction is comparatively expensive and hindering. Summary of the invention
- Fig. 1 shows a structure of webcam and microphones according to a preferred embodiment of the invention
- Fig. 2 shows a structure of webcam and microphones according to an alternative embodiment
- Fig. 3 shows a structure of webcam and microphones according to another alternative embodiment of the invention. Best way to carry out the invention
- the overall device to be employed for PC based desktop videoconferencing comprises a video camera lens and an audio capturing device.
- a video camera lens a conventional webcam can be used.
- a high-speed USB 2 link is a sufficient even though a IEEE 1394 (FireWire) interface is to be preferred for the ASIO 2 audio I/O support, which is preferably employed.
- the audio system is separated from the video camera lens, since they are usually independent and work on different kind of circuit.
- video camera and microphones form a unit, directly connected to the host system (for instance a personal computer) . This arrangement allows large space sparings and to optimise the system configuration.
- a broadside linear array microphones (for instance, four elements 1 to 4) can be easily designed so as to exhibit a size smaller than the width of most PC monitors including desktop LCD screens and laptop lid displays. With the typical microphone spacing (4-6 cm) it is pretty easy to place the camera lens 5 between the central two microphones 2 and 3.
- an alternative embodiment uses a single end-fire array (microphones 1 and 2 are shown in Fig.2) could be positioned either on the bottom or on the top of the camera lens 5.
- another alternative embodiment includes a double end-fire array, where each end-fire array is placed on each side of the central camera lens. In such an arrangement, the two end-fire outputs are mixed together.
- many existing beamforming algorithms able to steer a single beam to the front direction can achieve a significant signal-to-noise ratio improvement compared to a single microphone solution. Because of its low sensitivity to calibration errors, a weighed delay and sum beamformer is chosen here.
- cardioid microphones instead of the omnidirectional capsules used in most of the commercial microphone arrays, a significant back rejection is achieved with no extra complexity: anyway, special care must be taken while designing the plastic case as cardioid microphones require large openings .
Abstract
An integrated digital video camera is described, to be used for desktop videoconferencing, including a digital camera which captures video image data, a microphone array, a digital audio interface for capturing microphone signals, a digital audio interface and a high speed digital interface.
Description
VIDEO CAMERA FOR DESKTOP VIDEOCOMMUNICATION Field of the invention
This invention refers to an integrated digital video camera, to be used for desktop videoconferencing. Background of the invention
Desktop videoconferencing is becoming more and more popular, mainly due to the availability of Internet connections exhibiting both low costs and high speed and video-communication solutions based on free software: presently most the Internet Messaging ■ Service providers have added a voice chat and video chat functionalities and hence a large customer base is getting accustomed to PC based videoconferencing (software videophone) .
Despite the video quality is mainly limited by the connection speed and hence is expected to quickly improve in the near future, the audio quality is still poor and strongly limited by the audio capturing devices which are presently available. In particular, hand-free communication is often made possible only by wearing a headsets with built in microphone (placed very close to the user's mouth) : using the PC loudspeakers and a fixed microphone (either tabletop or built in the webcam) would require the use of an acoustic echo canceller (AEC) to have a full-duplex communication. In order to have a conventional acoustic echo canceller properly working, an accurate control of the audio latency is required: this is very difficult or impossible with the typical consumer level, Windows compatible audio I/O interfaces; professional level audio cards have always low and consistent latency (usually support ASIO 2 drivers) and so are suitable for AEC applications.
Several microphone array products are already available on the market but they are usually designed for speech recognition (speech-to-text) applications and so no special care is taken to make it possible for such devices to be used as input source of an echo canceller based, hand-free system: furthermore they should be placed in the same place where a webcam is usually placed: the top border of a PC monitor. So integrating a microphone array into a
digital video camera (webcam) make it possible for this two peripherals to share the same location and the same digital connection while further benefits can be achieved by integrating a constant latency audio I/O interface (ASIO 2) in order to make AEC integration viable.
EP 1 377 041 discloses a web cam which is composed by 5 or more cameras, positioned along radii around a table, supported by a plastic rod and carrying array microphones on a basis thereof. This contruction is comparatively expensive and hindering. Summary of the invention
All of the above problems are brilliantly solved by this invention, which refers to an integrated digital video camera, to be used for desktop videoconferencing, characterised in that it includes a digital camera lens which captures video image data, a microphone array, a digital audio input interface for capturing microphone signals, digital audio output interface to send the received signals to loudspeakers and a high speed digital interface to send both audio and video data to the host system. Brief description of the drawings Fig. 1 shows a structure of webcam and microphones according to a preferred embodiment of the invention;
Fig. 2 shows a structure of webcam and microphones according to an alternative embodiment; and
Fig. 3 shows a structure of webcam and microphones according to another alternative embodiment of the invention. Best way to carry out the invention
As already stated, this invention refers to an integrated digital video camera, to be used for desktop video conferencing. According to a preferred embodiment, the overall device to be employed for PC based desktop videoconferencing comprises a video camera lens and an audio capturing device. As a video camera lens, a conventional webcam can be used. Preferably, in order to get a high-quality grade of the overall solution, it is desirable to choose the best state-of-the-art optical and CCD sensors, in order
to provide true VGA resolution at 30 fps . For this kind of digital cameras a high-speed USB 2 link is a sufficient even though a IEEE 1394 (FireWire) interface is to be preferred for the ASIO 2 audio I/O support, which is preferably employed. In conventional devices, the audio system is separated from the video camera lens, since they are usually independent and work on different kind of circuit. According to the present invention, video camera and microphones form a unit, directly connected to the host system (for instance a personal computer) . This arrangement allows large space sparings and to optimise the system configuration.
As shown in Fig. 1, according to a preferred embodiment, a broadside linear array microphones (for instance, four elements 1 to 4) can be easily designed so as to exhibit a size smaller than the width of most PC monitors including desktop LCD screens and laptop lid displays. With the typical microphone spacing (4-6 cm) it is pretty easy to place the camera lens 5 between the central two microphones 2 and 3.
As shown in Fig. 2, an alternative embodiment uses a single end-fire array (microphones 1 and 2 are shown in Fig.2) could be positioned either on the bottom or on the top of the camera lens 5.
As shown in Fig. 3, another alternative embodiment includes a double end-fire array, where each end-fire array is placed on each side of the central camera lens. In such an arrangement, the two end-fire outputs are mixed together.
In the above embodiments, many existing beamforming algorithms, able to steer a single beam to the front direction can achieve a significant signal-to-noise ratio improvement compared to a single microphone solution. Because of its low sensitivity to calibration errors, a weighed delay and sum beamformer is chosen here. By using cardioid microphones instead of the omnidirectional capsules used in most of the commercial microphone arrays, a significant back rejection is achieved with no extra complexity:
anyway, special care must be taken while designing the plastic case as cardioid microphones require large openings .
According to the second and the third embodiments , shown in Figs. 2 and 3 respectively, it is possible to attain a higher directivity (the so called "superdirectivity" ) with respect to the prior art devices, and his adaptive generalisation allows to drive a null of the directional response toward the strongest interfering noise sources and hence to achieve a significant background noise rejection, without affecting the speech coming from the front direction.
Beside the proposed array geometries and the related beamforming algorithms many other combinations of array geometries and beamforming algorithms can be used, and the above embodiments are by no means intended to limit the scope of the present invention, which is mainly directed to a combined microphone array to be used as voice capture device for hand free communication, combined with a digital camera lens in order to achieve an audio/video peripheral of reasonably small size, so that it can be installed on the top of a video screen, as it can be argued from the annexed claims.
Claims
1) An integrated digital video camera, to be used for desktop videoconferencing, characterised in that it includes a digital camera lens (5) which captures video image data, a microphone array (1 to 4) , a digital audio input interface for capturing microphone signals, a digital audio output interface to send the receeived signals to the loudspeakers and a high speed digital interface to send both audio and video data to the host system.
2) A video camera as claimed in claim 1) , characterised in that true VGA resolution at 30 fps is provided.
3) A video camera as claimed in claim 1) or 2) , characterised in that it employs an ASIO 2 audio I/O support.
4) A video camera as claimed in claim 3) , characterised in that a high-speed USB 2 link or a IEEE 1394 (FireWire) interface is provided for the ASIO 2 audio I/O support.
5) A video camera as claimed in any previous claim, characterised in that a broadside linear array microphones (1 to 4) are designed so as to exhibit a size smaller than the width of most PC monitors including desktop LCD screens and laptop lid displays .
6) A video camera as claimed in claim 5) , characterised in that microphones are spaced by 4-6 cm from each other.
7) A video camera as in claim 6) , characterised in that the camera lens (5) is positioned between the two central microphones (2 and 3) .
8} A video camera as claimed in any claim 1) to 4) , characterised in that a single end-fire microphone array is provided .
9) A video camera as claimed in claim 8) , characterised in that the end-fire array is positioned either on the bottom or on the top of the camera lens (5) .
10) A video camera as claimed in any claim 1) to 4) , characterised in that a double end-fire arrays is provided.
11) A video camera as claimed in claim 10) , characterised in
that the end-fire arrays are positioned apart from the video camera lens (5) .
12) A video camera as claimed in claim 10) or 11) , characterised in that the two end fire output are mixed together. 13) A video camera as claimed in any previous claim, characterised in that one beamforming algorithm, able to steer a single beam to the front direction, is employed, in order to attain a significant signal-to-noise ratio (RSD) improvement.
14) A video camera as set forth in any previous claim, characterised in that a weighed delay and sum beamformer is chosen.
15) A video camera as claimed in any previous claim, characterised in that a cardioid microphones is used.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/IT2006/000395 WO2007138617A1 (en) | 2006-05-25 | 2006-05-25 | Video camera for desktop videocommunication |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/IT2006/000395 WO2007138617A1 (en) | 2006-05-25 | 2006-05-25 | Video camera for desktop videocommunication |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2007138617A1 true WO2007138617A1 (en) | 2007-12-06 |
Family
ID=37649468
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/IT2006/000395 WO2007138617A1 (en) | 2006-05-25 | 2006-05-25 | Video camera for desktop videocommunication |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2007138617A1 (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030160862A1 (en) * | 2002-02-27 | 2003-08-28 | Charlier Michael L. | Apparatus having cooperating wide-angle digital camera system and microphone array |
EP1377041A2 (en) * | 2002-06-27 | 2004-01-02 | Microsoft Corporation | Integrated design for omni-directional camera and microphone array |
US20040012669A1 (en) * | 2002-03-25 | 2004-01-22 | David Drell | Conferencing system with integrated audio driver and network interface device |
US20040041902A1 (en) * | 2002-04-11 | 2004-03-04 | Polycom, Inc. | Portable videoconferencing system |
-
2006
- 2006-05-25 WO PCT/IT2006/000395 patent/WO2007138617A1/en active Application Filing
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030160862A1 (en) * | 2002-02-27 | 2003-08-28 | Charlier Michael L. | Apparatus having cooperating wide-angle digital camera system and microphone array |
US20040012669A1 (en) * | 2002-03-25 | 2004-01-22 | David Drell | Conferencing system with integrated audio driver and network interface device |
US20040041902A1 (en) * | 2002-04-11 | 2004-03-04 | Polycom, Inc. | Portable videoconferencing system |
EP1377041A2 (en) * | 2002-06-27 | 2004-01-02 | Microsoft Corporation | Integrated design for omni-directional camera and microphone array |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP4252377B2 (en) | System for omnidirectional camera and microphone array | |
US9247334B2 (en) | Portable electronic device | |
EP2172054B1 (en) | Microphone array for a camera speakerphone | |
US7822338B2 (en) | Camera for electronic device | |
US11477413B2 (en) | System and method for providing wide-area imaging and communications capability to a handheld device | |
US9736427B1 (en) | Communication system | |
US9294839B2 (en) | Augmentation of a beamforming microphone array with non-beamforming microphones | |
US7724284B2 (en) | Multi-camera system and method having a common processing block | |
EP2823631B1 (en) | Portable electronic device with directional microphones for stereo recording | |
US8451315B2 (en) | System and method for distributed meeting capture | |
KR102327160B1 (en) | Apparatus and method for processing image received through a plurality of cameras | |
US10609476B2 (en) | Display apparatus and communication terminal | |
US20080205874A1 (en) | Wide coverage image capturing and standable web phone | |
WO2007138617A1 (en) | Video camera for desktop videocommunication | |
WO2007136648A2 (en) | Imaging panels including arrays of audio and video input and output elements | |
US20240064420A1 (en) | Cameras for multiple views | |
KR200249455Y1 (en) | Camera having microphone and speaker integrated therein | |
JP6169195B2 (en) | Delegate unit and conference system equipped with the delegate unit | |
Liles | CONFERENCE MICS AND SYSTEMS. | |
TW201735652A (en) | Intelligence TV external?device connecting piece | |
JP2020184656A (en) | Sound acquisition control system, information terminal, sound acquisition control method, and program | |
US20090103756A1 (en) | Multi-Media Device | |
West | Microphones for speech and speech recognition | |
CA2947565A1 (en) | Connecting piece for external device of smart television | |
TWM355519U (en) | Photographic device and moving equipment |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 06766300 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 06766300 Country of ref document: EP Kind code of ref document: A1 |