WO1993012613A1 - Quantization table adjustment - Google Patents

Quantization table adjustment Download PDF

Info

Publication number
WO1993012613A1
WO1993012613A1 PCT/US1992/010644 US9210644W WO9312613A1 WO 1993012613 A1 WO1993012613 A1 WO 1993012613A1 US 9210644 W US9210644 W US 9210644W WO 9312613 A1 WO9312613 A1 WO 9312613A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
compression
quantization table
jpeg
dct coefficients
Prior art date
Application number
PCT/US1992/010644
Other languages
French (fr)
Inventor
Eric C. Peters
Original Assignee
Avid Technology, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Avid Technology, Inc. filed Critical Avid Technology, Inc.
Publication of WO1993012613A1 publication Critical patent/WO1993012613A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/18Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a set of transform coefficients
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/117Filters, e.g. for pre-processing or post-processing
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/124Quantisation
    • H04N19/126Details of normalisation or weighting functions, e.g. normalisation matrices or variable uniform quantisers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/146Data rate or code amount at the encoder output
    • H04N19/15Data rate or code amount at the encoder output by monitoring actual compressed data size at the memory before deciding storage at the transmission buffer
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/154Measured or subjectively estimated visual quality after decoding, e.g. measurement of distortion
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/176Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/124Quantisation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/60Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding

Definitions

  • This invention relates to hardware designs coupled with software-based algorithms for capture, compression, decompression, and playback of digital image sequences, particularly in an editing environment.
  • JPEG Joint Photographic Experts Group
  • DVI vector quantization
  • DCT Discrete Cosine Transform
  • the JPEG standard has wide implications for image capture and storage, image transmission, and image playback.
  • a color photograph can be compressed by 10 to 1 with virtually no visible loss of quality. Compression of 30 to 1 can be achieved with loss that is so minimal that most people cannot see the difference. Compression factors of 100 to 1 and more can be achieved while maintaining image quality acceptable for a wide range of purposes.
  • the creation of the JPEG standard has spurred a variety of important hardware developments.
  • the DCT algorithm used by the JPEG standard is extremely complex. It requires converting an image from the spatial domain to the frequency domain, the quantization of the various frequency components, followed by Huf ⁇ man coding of the resulting components.
  • the conversion from spatial to frequency domain, the quantization, and the Huffman coding are all computationally intensive.
  • Hardware vendors have responded by building specialized integrated circuits to implement the JPEG algorithm.
  • JPEG chip (the CL550B) that not only implements the JPEG standard in hardware, but can process an image with a resolution of, for example. 720 x 488 pixels (CCTRR 601 video standard) in just l/30th of a second.
  • the same chip can be used to compress or decompress images or image sequences.
  • the availability of this JPEG chip has spurred computer vendors and system integrators to design new products that incorporate the JPEG chip for motion video.
  • the implementation of the chip in a hardware and software environment capable of processing images with a resolution of 640 x 480 pixels or greater at a rate of 30 frames per second in an editing environment introduces multiple problems. It is often desirable to vary the quality of an image during compression in order to optimize the degree of data compression. For example, during some portions of a sequence, detail may not be important, and quality can be sacrificed by compressing the data to a greater degree. " Other portions may require greater quality, and hence this greater degree of compression may be unsuitable. In prior implementations of the JPEG algorithm, quality is adjusted by scaling the elements of a quantization table (discussed in detail hereinbelow).
  • the present invention is a method that allows for quality changes during compression to enable optimum data compression for all portions of a sequence, while allowing playback with a single quantization table.
  • This invention relates to an apparatus and method for adjusting the post decompression quality of a compressed image.
  • the image quality adjustment is performed by constructing a quantization table that specifies the high frequency image components to be filtered, and by subsequently filtering out those components specified by the table.
  • Fig. 1 is a block diagram of a video image capture and playback system implementing data compression
  • Fig. 2 is a schematic illustration of data compression and decompression according to the JPEG algorithm. Description of the Preferred Embodiment
  • FIG. 1 A block diagram according to a preferred embodiment of a system for capture, compression, storage, decompression, and playback of images is illustrated in Fig. 1.
  • an image digitizer (frame grabber) 10, captures and digitizes the images from an analog source, such as videotape.
  • Image digitizer 10 may be, for example, a Truevision NuVista+ board.
  • the NuVista+ board is preferably modified and augmented with a pixel engine as described in copending application "Image Digitizer Including Pixel Engine” by B. Joshua Rosen et al., filed December 13, 1991, to provide better data throughput for a variety of image formats and modes of operation.
  • the compression processor 12 compresses the data according to a compression algorithm.
  • this algorithm is the JPEG algorithm. introduced above.
  • C-Cube produces a compression processor (CL550B) hased on the JPEG algorith that is appropriate for use as compression processor 12.
  • Compression processor 12 may be a processor that implements the new MPEG (Motion Picture Experts Group) algorithm, or a processor that implements any of a variety of other image compression algorithms known to those skilled in the art.
  • the compressed data from the processor 12 is preferably input to a compressed data buffer 14 which is interfaced to host computer 16 connected to disk 18.
  • the compressed data buffer 14 preferably implements a DMA process in order to absorb speed differences between compression processor 12 and disk 18, and further to permit data transfer between processor 12 and disk 18 with a single pass through the CPU of host computer 16.
  • the host computer 16 may be, for example, an Apple Macintosh. JPEG Encoding and Decoding
  • Fig. 2 illustrates the key steps in data compression and decompression according to the JPEG algorithm for a single component of what will generally be a three-component image.
  • JPEG JPEG standard
  • an image described in the RGB color space will be transformed into the YUV color space via a 3 x 3 multiplier prior to compression. This conversion sacrifices some color information, but preserves the more important detail information.
  • the algorithm works with blocks of 8 x 8 pixels from the image.
  • Each 8 x 8 block is input to the compressor, goes through the illustrated steps, and the compressed data is output as a data stream.
  • the first step in the JPEG algorithm is a Forward Discrete Cosine Transform (FDCT).
  • FDCT Forward Discrete Cosine Transform
  • each 8 x 8 block of pixels can be thought of as a 64-point discrete signal which is a function of two spatial dimensions.
  • the FDCT computes the "spectrum" of this signal in the form of 64 two-dimensional "spatial frequencies,” termed DCT coefficients.
  • the DCT coefficients represent the relative amounts of the two-dimensional spatial frequencies contained in the 64-point discrete signal.
  • the coefficient with zero frequency in both dimensions is called the "DC coefficient” and the remaining 63 coefficients are called the "AC coefficients.”
  • each pixel component corresponds to 8 bits, as is the case in 24 bit color.
  • each coefficient is described by greater than 8 bits.
  • the number of bits per coefficient is 12. Therefore, at this point, the algorithm has actually led to an expansion, rather than a compression of data.
  • pixel values usually vary slowly across an image, most of the pixel information will be contained in the lower spatial frequencies. For typical 8 8 pixel blocks, most of the spatial frequencies at the high end of the spectrum will have zero or negligible amplitude. Data compression can then be achieved by "throwing out" these coefficients, which is the purpose of the next step.
  • each of the 64 DCT coefficients is quantized in accordance with a 64-element quantization table.
  • This table is specified by the user.
  • the C-Cube chip allows user adjustability of this table via software inputs to the chip.
  • Each element in the table is any integer from 1 to 255, according to the JPEG standard.
  • Each element is the quantizer step size for a corresponding DCT coefficient.
  • Quantization is achieved by dividing each DCT coefficient by its corresponding quantizer step size, and rounding to the nearest integer, a very lossy process.
  • the elements of the table are chosen so that the generally large lower frequency components are represented by a smaller number of bits, and the negligible higher frequency components become zero.
  • the goal is to represent each DCT coefficient by no more precision than is necessary for a desired image quality. Since the coefficients, therefore, depend on human visual parameters, the table is sometimes called a psycho-visual weighing table.
  • Compression is achieved by the use of run-length encoding, which puts an en ⁇ -of-block code at the start of the sequence of zeros that will typically form, the end of the 64 coefficient string. The zeros, therefore, don't contribute to the length of the data stream.
  • the coefficients After the coefficients have been quantized, they are ordered into a "zig-zag" sequence, as illustrated in Fig. 2. This sequence facilitates the run-length encoding.
  • This sequence facilitates the run-length encoding.
  • the DC coefficient is generally one of the largest coefficients, and furthermore since it is a measure of the average value of the 64 pixels in the 8 x 8 block, there is generally a strong correlation between the DC coefficients of adjacent blocks, and therefore, the DC component is encoded as the difference from the DC term of the previous block in the compression order.
  • the final step is entropy coding, wherein additional compression is achieved by encoding the quantized DCT coefficients according to their statistical characteristics. This is a lossless step. As this step is not as relevant to the methods of the present invention as those of the previous steps, the reader is referred to Wallace, cited above for a detailed discussion.
  • Image Quality Adjustment From the above discussion, it can be seen that image quality can be adjusted by scaling the values of the quantization table. For higher quality images, the elements should be small, since the larger the elements, the greater the loss.
  • a variable quaHty scaling factor (1-255) called the quantization factor or Q-factor is used with JPEG to adjust the degree of quantization of the compressed image.
  • Q-factor the quantization factor
  • the problem with the above method is that if the quantization table values are scaled during image capture, they must be correspondingly descaled during image playback. To illustrate the importance of this, imagine the result if the quantization table element corresponding to the DC coefficient is multipHed by a factor of 10 at some point during image capture in an effort to increase the degree of data compression. If at playback, the original quantization table is used (prior to the upward scaling), the DC coefficient will be 10 times too small. Since the DC component primarily corresponds to brightness, the result is dramatic.
  • the method of the present invention is an alternate method for adjusting quaHty during image capture which permits playback using a single quantization table. According to the invention, the DCT coefficients are filtered during image capture according to the following technique.
  • the DC coefficient is the most important in terms of human perception.
  • the higher the frequency of a coefficient the finer the detail it describes in an image. Humans are much less sensitive to these high frequency components. Therefore, according to the invention, if image quaHty is to be lowered to further compress the data, the high frequency components are filtered out.
  • the cut-off frequency of the filter determines the degree of compression. This method is in clear contradistinction to the prior method of adjusting the Q-factor.
  • the coefficients are sequenced in a zig-zag pattern as part of the quantization step.
  • a filter according to one embodiment of the invention can be characterized as a diagonal line indicating the cutoff frequency.
  • the effect of throwing out the higher frequency components is a blur of the image to an extent determined by the cutoff frequency. This artifact is often acceptable, depending on the scene and the quaHty required.
  • the artifact caused by the filtering can be made more tolerable to the eye by adjusting the filter in the following manner. If in addition to throwing out all frequency components above cutoff, the frequency components just below cutoff are muted, the artifact is made less harsh.
  • the filter described above can be created by hand-creating quantization tables.
  • the table elements 5 should be large, preferably as large as possible without overflowing the arithmetic of the system.
  • the table elements can be exactly as used in standard JPEG implementations.
  • the table elements below but near cut-off are increased by some amount to mute the corresponding frequency components as described 0 above.
  • this muting is greatest at cutoff, decreasing as the DC coefficient is approached.
  • the filter can be easily adjusted during image capture to control the degree of data compression by changing the quantization table.
  • the filter is user adjusted.
  • the filter may be automatically adjusted by the system when it senses bottlenecks forming. 1
  • the invention was developed as a method for adjusting quaHty during image capture in such a way that playback can take place in the absence of the history of such adjustment. It should be 0 clear that this is achieved when the images are played back using the original quantization tables. This is because only the least important coefficients are affected by the filtering. In contrast, in the prior methods for quality adjustment, all coefficients were affected to the same degree.
  • Subsampling introduces artifacts called aliases to the signal. These 5 frequencies can be predicted and removed by increasing the Q table entries for them.
  • the interrupt routine gets activated on each frame. It computes the current frame size and compares it with the desired target size, then it adjusts the table by moving the filter cut-off frequency to approach the target. What is claimed is:

Abstract

The method for adjusting quality during image capture includes computing a discrete cosine transform of a digital image to create DCT coefficients. A quantization table is generated that specifies frequency bands to be filtered and the DCT coefficients are digitized using the quantization table. It is preferred that the DCT coefficients be ordered in a zig-zag sequence to facilitate run-length encoding.

Description

QUANTIZATION TABLE ADJUSTMENT
Background of the Invention This invention relates to hardware designs coupled with software-based algorithms for capture, compression, decompression, and playback of digital image sequences, particularly in an editing environment.
The idea of taking motion video, digitizing it, compressing the digital datastream, and storing it on some kind of media for later playback is not new. RCA's Sarnoff labs began working on this in the early days of the video disk, seeking to create a digital rather than an analog approach. This technology has since become known as Digital Video Interactive (DVD. Another group, led by Phillips in Europe, has also worked on a digital motion video approach for a product they call CDI (Compact Disk Interactive). Both DVI and CDI seek to store motion video and sound on CD-ROM disks for playback in low cost players. In the case of DVI, the compression is done in batch mode, and takes a long time, but the playback hardware is low cost. CDI is less specific about the compression approach, and mainly provides a format for the data to be stored on the disk.
A few years ago, a standards-making body known as CCITT, based in France, working in conjunction with ISO, the International Standards Organization, created a working group to focus on image compression. This group, called the Joint Photographic Experts Group (JPEG) met for many years to determine the most effective way to compress digital images. They evaluated a wide range of compression schemes, including vector quantization (the technique used by DVI) and DCT (Discrete Cosine Transform). After exhaustive qualitative tests and careful study, the JPEG group picked the DCT approach, and also defined in detail the various ways this approach could be used for image compression. The group published a proposed ISO standard that is generally referred to as the JPEG standard. This standard is now in its final form, and is awaiting ratification by ISO, which is expected. The JPEG standard has wide implications for image capture and storage, image transmission, and image playback. A color photograph can be compressed by 10 to 1 with virtually no visible loss of quality. Compression of 30 to 1 can be achieved with loss that is so minimal that most people cannot see the difference. Compression factors of 100 to 1 and more can be achieved while maintaining image quality acceptable for a wide range of purposes.
The creation of the JPEG standard has spurred a variety of important hardware developments. The DCT algorithm used by the JPEG standard is extremely complex. It requires converting an image from the spatial domain to the frequency domain, the quantization of the various frequency components, followed by Hufϊman coding of the resulting components. The conversion from spatial to frequency domain, the quantization, and the Huffman coding are all computationally intensive. Hardware vendors have responded by building specialized integrated circuits to implement the JPEG algorithm.
One vendor, C-Cube of San Jose, California, has created a JPEG chip (the CL550B) that not only implements the JPEG standard in hardware, but can process an image with a resolution of, for example. 720 x 488 pixels (CCTRR 601 video standard) in just l/30th of a second. This means that the JPEG algorithm can be applied to a digitized video sequence, and the resulting compressed data can be stored for later playback. The same chip can be used to compress or decompress images or image sequences. The availability of this JPEG chip has spurred computer vendors and system integrators to design new products that incorporate the JPEG chip for motion video. However, the implementation of the chip in a hardware and software environment capable of processing images with a resolution of 640 x 480 pixels or greater at a rate of 30 frames per second in an editing environment introduces multiple problems. It is often desirable to vary the quality of an image during compression in order to optimize the degree of data compression. For example, during some portions of a sequence, detail may not be important, and quality can be sacrificed by compressing the data to a greater degree. " Other portions may require greater quality, and hence this greater degree of compression may be unsuitable. In prior implementations of the JPEG algorithm, quality is adjusted by scaling the elements of a quantization table (discussed in detail hereinbelow). If these elements are scaled during compression, they must be correspondingly re-scaled during decompression in order to obtain a suitable image. This re-scaling is cumbersome to implement and can cause delays during playback. The present invention is a method that allows for quality changes during compression to enable optimum data compression for all portions of a sequence, while allowing playback with a single quantization table.
Summary of the Invention This invention relates to an apparatus and method for adjusting the post decompression quality of a compressed image. The image quality adjustment is performed by constructing a quantization table that specifies the high frequency image components to be filtered, and by subsequently filtering out those components specified by the table. Brief Description of the Drawing
Fig. 1 is a block diagram of a video image capture and playback system implementing data compression,
Fig. 2 is a schematic illustration of data compression and decompression according to the JPEG algorithm. Description of the Preferred Embodiment
A block diagram according to a preferred embodiment of a system for capture, compression, storage, decompression, and playback of images is illustrated in Fig. 1. As shown, an image digitizer (frame grabber) 10, captures and digitizes the images from an analog source, such as videotape. Image digitizer 10 may be, for example, a Truevision NuVista+ board. However, the NuVista+ board is preferably modified and augmented with a pixel engine as described in copending application "Image Digitizer Including Pixel Engine" by B. Joshua Rosen et al., filed December 13, 1991, to provide better data throughput for a variety of image formats and modes of operation.
The compression processor 12 compresses the data according to a compression algorithm. Preferably, this algorithm is the JPEG algorithm. introduced above. As discussed above, C-Cube produces a compression processor (CL550B) hased on the JPEG algorith that is appropriate for use as compression processor 12. However, other embodiments are within the scope of the invention. Compression processor 12 may be a processor that implements the new MPEG (Motion Picture Experts Group) algorithm, or a processor that implements any of a variety of other image compression algorithms known to those skilled in the art.
The compressed data from the processor 12 is preferably input to a compressed data buffer 14 which is interfaced to host computer 16 connected to disk 18. The compressed data buffer 14 preferably implements a DMA process in order to absorb speed differences between compression processor 12 and disk 18, and further to permit data transfer between processor 12 and disk 18 with a single pass through the CPU of host computer 16. The host computer 16 may be, for example, an Apple Macintosh. JPEG Encoding and Decoding
Detailed discussions of the JPEG algorithm and its implementation are contained in "The JPEG Still Picture Compression Standard" by G.K. Wallace, in Communications of the ACM, Vol. 34, April 1991, and in "Digital Compression and Coding of Continuous-Tone Still Images, Part 1, Requirements and Guidelines," ISO/TEC JTC1 Committee Draft 10918-1, February, 1991, both of which are incorporated herein by reference.
Fig. 2 illustrates the key steps in data compression and decompression according to the JPEG algorithm for a single component of what will generally be a three-component image. In the JPEG standard, an image described in the RGB color space will be transformed into the YUV color space via a 3 x 3 multiplier prior to compression. This conversion sacrifices some color information, but preserves the more important detail information. The algorithm works with blocks of 8 x 8 pixels from the image.
Each 8 x 8 block is input to the compressor, goes through the illustrated steps, and the compressed data is output as a data stream.
The first step in the JPEG algorithm is a Forward Discrete Cosine Transform (FDCT). As described in Wallace, cited above, each 8 x 8 block of pixels can be thought of as a 64-point discrete signal which is a function of two spatial dimensions. The FDCT computes the "spectrum" of this signal in the form of 64 two-dimensional "spatial frequencies," termed DCT coefficients. The DCT coefficients represent the relative amounts of the two-dimensional spatial frequencies contained in the 64-point discrete signal. The coefficient with zero frequency in both dimensions is called the "DC coefficient" and the remaining 63 coefficients are called the "AC coefficients." Typically each pixel component corresponds to 8 bits, as is the case in 24 bit color. According to the JPEG algorithm, each coefficient is described by greater than 8 bits. In the C-Cube chip discussed above, the number of bits per coefficient is 12. Therefore, at this point, the algorithm has actually led to an expansion, rather than a compression of data. However, since pixel values usually vary slowly across an image, most of the pixel information will be contained in the lower spatial frequencies. For typical 8 8 pixel blocks, most of the spatial frequencies at the high end of the spectrum will have zero or negligible amplitude. Data compression can then be achieved by "throwing out" these coefficients, which is the purpose of the next step.
The next step in the JPEG algorithm is quantization, wherein each of the 64 DCT coefficients is quantized in accordance with a 64-element quantization table. This table is specified by the user. The C-Cube chip allows user adjustability of this table via software inputs to the chip. Each element in the table is any integer from 1 to 255, according to the JPEG standard. Each element is the quantizer step size for a corresponding DCT coefficient. Quantization is achieved by dividing each DCT coefficient by its corresponding quantizer step size, and rounding to the nearest integer, a very lossy process. The elements of the table are chosen so that the generally large lower frequency components are represented by a smaller number of bits, and the negligible higher frequency components become zero. The goal is to represent each DCT coefficient by no more precision than is necessary for a desired image quality. Since the coefficients, therefore, depend on human visual parameters, the table is sometimes called a psycho-visual weighing table.
Compression is achieved by the use of run-length encoding, which puts an enά-of-block code at the start of the sequence of zeros that will typically form, the end of the 64 coefficient string. The zeros, therefore, don't contribute to the length of the data stream.
After the coefficients have been quantized, they are ordered into a "zig-zag" sequence, as illustrated in Fig. 2. This sequence facilitates the run-length encoding. Before going on'to this step, it should be noted, that since the DC coefficient is generally one of the largest coefficients, and furthermore since it is a measure of the average value of the 64 pixels in the 8 x 8 block, there is generally a strong correlation between the DC coefficients of adjacent blocks, and therefore, the DC component is encoded as the difference from the DC term of the previous block in the compression order.
The final step is entropy coding, wherein additional compression is achieved by encoding the quantized DCT coefficients according to their statistical characteristics. This is a lossless step. As this step is not as relevant to the methods of the present invention as those of the previous steps, the reader is referred to Wallace, cited above for a detailed discussion.
The above steps are essentially reversed, as illustrated in Fig. ib, during playback. Here too, the reader is referred to Wallace for further details.
Image Quality Adjustment From the above discussion, it can be seen that image quality can be adjusted by scaling the values of the quantization table. For higher quality images, the elements should be small, since the larger the elements, the greater the loss.
In prior art systems, this is precisely the technique used to adjust image quality during image capture. A variable quaHty scaling factor (1-255) called the quantization factor or Q-factor is used with JPEG to adjust the degree of quantization of the compressed image. For sequences requiring high quaHty, low Q-factors are used. For sequences in which quality can be sacrificed, high Q-factors are used. It can be imagined that a user may want to continuously adjust the quaHty over the range of the Q-factor at the time of capture as scenes change.
The problem with the above method is that if the quantization table values are scaled during image capture, they must be correspondingly descaled during image playback. To illustrate the importance of this, imagine the result if the quantization table element corresponding to the DC coefficient is multipHed by a factor of 10 at some point during image capture in an effort to increase the degree of data compression. If at playback, the original quantization table is used (prior to the upward scaling), the DC coefficient will be 10 times too small. Since the DC component primarily corresponds to brightness, the result is dramatic. The method of the present invention is an alternate method for adjusting quaHty during image capture which permits playback using a single quantization table. According to the invention, the DCT coefficients are filtered during image capture according to the following technique.
As has already been discussed, the DC coefficient is the most important in terms of human perception. The higher the frequency of a coefficient, the finer the detail it describes in an image. Humans are much less sensitive to these high frequency components. Therefore, according to the invention, if image quaHty is to be lowered to further compress the data, the high frequency components are filtered out. The cut-off frequency of the filter determines the degree of compression. This method is in clear contradistinction to the prior method of adjusting the Q-factor. As described above and illustrated in Fig. 2, the coefficients are sequenced in a zig-zag pattern as part of the quantization step. A filter according to one embodiment of the invention can be characterized as a diagonal line indicating the cutoff frequency. The effect of throwing out the higher frequency components is a blur of the image to an extent determined by the cutoff frequency. This artifact is often acceptable, depending on the scene and the quaHty required.
Furthermore, the artifact caused by the filtering can be made more tolerable to the eye by adjusting the filter in the following manner. If in addition to throwing out all frequency components above cutoff, the frequency components just below cutoff are muted, the artifact is made less harsh.
The filter described above can be created by hand-creating quantization tables. For all frequencies above cutoff, the table elements 5 should be large, preferably as large as possible without overflowing the arithmetic of the system. For frequencies below cutoff, the table elements can be exactly as used in standard JPEG implementations. However, preferably, the table elements below but near cut-off are increased by some amount to mute the corresponding frequency components as described 0 above. Preferably, this muting is greatest at cutoff, decreasing as the DC coefficient is approached.
The filter can be easily adjusted during image capture to control the degree of data compression by changing the quantization table. In one mode of operation, the filter is user adjusted. However, in another mode of 5 operation, the filter may be automatically adjusted by the system when it senses bottlenecks forming.1
As stated above, the invention was developed as a method for adjusting quaHty during image capture in such a way that playback can take place in the absence of the history of such adjustment. It should be 0 clear that this is achieved when the images are played back using the original quantization tables. This is because only the least important coefficients are affected by the filtering. In contrast, in the prior methods for quality adjustment, all coefficients were affected to the same degree.
Subsampling introduces artifacts called aliases to the signal. These 5 frequencies can be predicted and removed by increasing the Q table entries for them.
1 The interrupt routine gets activated on each frame. It computes the current frame size and compares it with the desired target size, then it adjusts the table by moving the filter cut-off frequency to approach the target. What is claimed is:

Claims

1. Method for adjusting quality during image capture comprising: computing a discrete cosine transform (DCT) of a digital image to create DCT coefficients; creating a quantization table that specifies frequency bands to be filtered; and quantizing the DCT coefficients by means of the quantization table whereby image quality is adjusted by scaling the values of the quantization table.
2. The method of claim 1 wherein high-frequency components are filtered out to further compress the data.
3. The method of claim 1 further including ordering the quantized coefficients in a zig-zag sequence to facilitate run-length encoding.
4. The method of claim 1 wherein the frequency bands to be filtered are user adjusted.
5. The method of claim 1 wherein the frequency bands to be filtered are automatically adjusted.
6. The method of claim 1 wherein only the least important DCT coefficients are affected during the quantizing step.
PCT/US1992/010644 1991-12-13 1992-12-10 Quantization table adjustment WO1993012613A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US80711791A 1991-12-13 1991-12-13
US807,117 1991-12-13

Publications (1)

Publication Number Publication Date
WO1993012613A1 true WO1993012613A1 (en) 1993-06-24

Family

ID=25195617

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US1992/010644 WO1993012613A1 (en) 1991-12-13 1992-12-10 Quantization table adjustment

Country Status (3)

Country Link
US (1) US6023531A (en)
AU (1) AU3274593A (en)
WO (1) WO1993012613A1 (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1997005748A1 (en) * 1995-07-28 1997-02-13 Polaroid Corporation Jpeg compression circuit with filtering
US5715018A (en) * 1992-04-10 1998-02-03 Avid Technology, Inc. Digital advertisement insertion system
WO1999013646A2 (en) * 1997-09-08 1999-03-18 Limt Technology Ab Image signal processing method and apparatus
US5909250A (en) * 1993-04-16 1999-06-01 Media 100 Inc. Adaptive video compression using variable quantization
US5926223A (en) * 1993-04-16 1999-07-20 Media 100 Inc. Adaptive video decompression
EP1079636A1 (en) * 1999-08-13 2001-02-28 Nokia Multimedia Terminals Oy Method and arrangement for reducing the volume or rate of an encoded digital video bitstream

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5355450A (en) 1992-04-10 1994-10-11 Avid Technology, Inc. Media composer with adjustable source material compression
US6678461B1 (en) 1992-04-10 2004-01-13 Avid Technology, Inc. Media recorder for capture and playback of live and prerecorded audio and/or video information
US6353685B1 (en) 1998-09-01 2002-03-05 Divio, Inc. Method and apparatus for image compression
JP2002191050A (en) * 2000-12-22 2002-07-05 Fuji Xerox Co Ltd Image coder and method
US6606418B2 (en) * 2001-01-16 2003-08-12 International Business Machines Corporation Enhanced compression of documents
US7092578B2 (en) * 2001-10-23 2006-08-15 Agilent Technologies, Inc. Signaling adaptive-quantization matrices in JPEG using end-of-block codes
US7053953B2 (en) * 2001-12-21 2006-05-30 Eastman Kodak Company Method and camera system for blurring portions of a verification image to show out of focus areas in a captured archival image
US7403561B2 (en) * 2003-04-04 2008-07-22 Avid Technology, Inc. Fixed bit rate, intraframe compression and decompression of video
US7433519B2 (en) * 2003-04-04 2008-10-07 Avid Technology, Inc. Bitstream format for compressed image data
TWI491261B (en) * 2010-06-24 2015-07-01 Mstar Semiconductor Inc Image coding method for facilitating run length coding and image encoding device thereof
US20140328406A1 (en) 2013-05-01 2014-11-06 Raymond John Westwater Method and Apparatus to Perform Optimal Visually-Weighed Quantization of Time-Varying Visual Sequences in Transform Space

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4672441A (en) * 1985-04-17 1987-06-09 Siemens Aktiengesellschaft Method and apparatus for picture data reduction for digital video signals
DE3940554A1 (en) * 1988-12-10 1990-06-13 Fuji Photo Film Co Ltd COMPRESSION CODING DEVICE AND EXPANSION DECODING DEVICE FOR AN IMAGE SIGNAL
US4982282A (en) * 1988-12-09 1991-01-01 Fuji Photo Film Co. Ltd. Image signal compression encoding apparatus and image signal expansion reproducing apparatus
US5038209A (en) * 1990-09-27 1991-08-06 At&T Bell Laboratories Adaptive buffer/quantizer control for transform video coders
WO1991014339A1 (en) * 1990-03-15 1991-09-19 Thomson Consumer Electronics S.A. Digital image coding with quantization level computation

Family Cites Families (66)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US34824A (en) * 1862-04-01 Improvement in-grain-separators
US3813485A (en) * 1972-01-05 1974-05-28 Ibm System for compression of digital data
US4191971A (en) * 1977-05-30 1980-03-04 Rca Corporation System for connecting a plurality of video sending television apparatus
US4302775A (en) * 1978-12-15 1981-11-24 Compression Labs, Inc. Digital video compression system and methods utilizing scene adaptive coding with rate buffer feedback
US4394774A (en) * 1978-12-15 1983-07-19 Compression Labs, Inc. Digital video compression system and methods utilizing scene adaptive coding with rate buffer feedback
US4937685A (en) * 1983-12-02 1990-06-26 Lex Computer And Management Corporation Method of display presentation for video editing
US4599689A (en) * 1983-02-28 1986-07-08 Data Translations, Inc. Continuous data transfer system
US4574351A (en) * 1983-03-03 1986-03-04 International Business Machines Corporation Apparatus for compressing and buffering data
US4704730A (en) * 1984-03-12 1987-11-03 Allophonix, Inc. Multi-state speech encoder and decoder
FR2575351B1 (en) * 1984-12-21 1988-05-13 Thomson Csf ADAPTIVE METHOD OF ENCODING AND DECODING A SUITE OF IMAGES BY TRANSFORMATION, AND DEVICES FOR CARRYING OUT SAID METHOD
DE3605032A1 (en) * 1986-02-18 1987-08-20 Thomson Brandt Gmbh DIGITAL MESSAGE TRANSMISSION METHOD
JPS62222783A (en) * 1986-03-24 1987-09-30 Kokusai Denshin Denwa Co Ltd <Kdd> Highly efficient encoding system for animation picture
JPS62230281A (en) * 1986-03-31 1987-10-08 Toshiba Corp Picture transmission system
FR2597282B1 (en) * 1986-04-11 1995-03-17 Guichard Jacques QUANTIFICATION METHOD IN TRANSFORMATION CODING FOR TRANSMISSION OF IMAGE SIGNALS
KR910000707B1 (en) * 1986-05-26 1991-01-31 미쓰비시덴기 가부시기가이샤 Method and apparatus for encoding transmitting
US4704628A (en) * 1986-07-16 1987-11-03 Compression Labs, Inc. Combined intraframe and interframe transform coding system
DE3626916A1 (en) * 1986-08-08 1988-02-11 Thomson Brandt Gmbh METHOD FOR TRANSMITTING A VIDEO SIGNAL
US4988982A (en) * 1987-03-25 1991-01-29 The Grass Valley Group, Inc. Touch pad machine control
US4729020A (en) * 1987-06-01 1988-03-01 Delta Information Systems System for formatting digital signals to be transmitted
GB8722394D0 (en) * 1987-09-23 1987-10-28 British Telecomm Video coder
US4785349A (en) * 1987-10-05 1988-11-15 Technology Inc. 64 Digital video decompression system
US4897855A (en) * 1987-12-01 1990-01-30 General Electric Company DPCM system with adaptive quantizer having unchanging bin number ensemble
FR2625635B1 (en) * 1987-12-30 1994-04-15 Thomson Grand Public ADAPTIVE METHOD OF ENCODING AND DECODING A SUITE OF IMAGES BY TRANSFORMATION, AND DEVICES FOR CARRYING OUT SAID METHOD
JP2629238B2 (en) * 1988-02-05 1997-07-09 ソニー株式会社 Decoding device and decoding method
US4951139A (en) * 1988-03-30 1990-08-21 Starsignal, Inc. Computer-based video compression system
FR2633133B1 (en) * 1988-06-17 1990-10-05 Thomson Csf METHOD FOR CONTROLLING THE FILLING OF THE BUFFER MEMORY OF AN IMAGE ENCODER, AND CONTROLLING DEVICE FOR CARRYING OUT SAID METHOD
US4962463A (en) * 1988-07-01 1990-10-09 Digital Equipment Corporation Video imaging device with image altering controls and related method
JPH0752951B2 (en) * 1988-10-13 1995-06-05 富士写真フイルム株式会社 Image data compression processing method and apparatus
US5179651A (en) * 1988-11-08 1993-01-12 Massachusetts General Hospital Apparatus for retrieval and processing of selected archived images for display at workstation terminals
US5073821A (en) * 1989-01-30 1991-12-17 Matsushita Electric Industrial Co., Ltd. Orthogonal transform coding apparatus for reducing the amount of coded signals to be processed and transmitted
US5146564A (en) * 1989-02-03 1992-09-08 Digital Equipment Corporation Interface between a system control unit and a service processing unit of a digital computer
FR2643531B1 (en) * 1989-02-21 1996-04-26 Thomson Csf INFORMATION COMPRESSION METHOD AND DEVICE FOR COMPATIBLE DECODING OF A FAMILY OF INCREASING RESOLUTIONS TELEVISION SIGNALS
US5130797A (en) * 1989-02-27 1992-07-14 Mitsubishi Denki Kabushiki Kaisha Digital signal processing system for parallel processing of subsampled data
JP2525666B2 (en) * 1989-04-03 1996-08-21 富士写真フイルム株式会社 Image filing method
US5050230A (en) * 1989-11-29 1991-09-17 Eastman Kodak Company Hybrid residual-based hierarchical storage and display method for high resolution digital images in a multiuse environment
ES2093649T3 (en) * 1990-02-06 1997-01-01 Alcatel Italia SYSTEM, PACKAGE STRUCTURE AND DEVICE TO PROCESS THE INFORMATION PROVIDED BY A SIGNAL ENCODER.
US5164980A (en) * 1990-02-21 1992-11-17 Alkanox Corporation Video telephone system
US5021891A (en) * 1990-02-27 1991-06-04 Qualcomm, Inc. Adaptive block size image compression method and system
US5107345A (en) * 1990-02-27 1992-04-21 Qualcomm Incorporated Adaptive block size image compression method and system
US5270832A (en) * 1990-03-14 1993-12-14 C-Cube Microsystems System for compression and decompression of video data using discrete cosine transform and coding techniques
US5191548A (en) * 1990-03-14 1993-03-02 C-Cube Microsystems System for compression and decompression of video data using discrete cosine transform and coding techniques
US5253078A (en) * 1990-03-14 1993-10-12 C-Cube Microsystems, Inc. System for compression and decompression of video data using discrete cosine transform and coding techniques
US5341318A (en) * 1990-03-14 1994-08-23 C-Cube Microsystems, Inc. System for compression and decompression of video data using discrete cosine transform and coding techniques
US5047853A (en) * 1990-03-16 1991-09-10 Apple Computer, Inc. Method for compresssing and decompressing color video data that uses luminance partitioning
US5046119A (en) * 1990-03-16 1991-09-03 Apple Computer, Inc. Method and apparatus for compressing and decompressing color video data with an anti-aliasing mode
FR2660139B1 (en) * 1990-03-23 1995-08-25 France Etat ENCODING AND TRANSMISSION METHOD FOR AT LEAST TWO QUALITY LEVELS OF DIGITAL IMAGES BELONGING TO A SEQUENCE OF IMAGES, AND CORRESPONDING DEVICES.
FR2660138B1 (en) * 1990-03-26 1992-06-12 France Telecom Cnet DEVICE FOR CODING / DECODING IMAGE SIGNALS.
JPH0828820B2 (en) * 1990-05-28 1996-03-21 村田機械株式会社 Image data coding circuit
US5237675A (en) * 1990-06-04 1993-08-17 Maxtor Corporation Apparatus and method for efficient organization of compressed data on a hard disk utilizing an estimated compression factor
DE69130275T2 (en) * 1990-07-31 1999-04-08 Canon Kk Image processing method and apparatus
US5801716A (en) * 1990-08-16 1998-09-01 Canon Kabushiki Kaisha Pipeline structures for full-color computer graphics
US5138459A (en) * 1990-11-20 1992-08-11 Personal Computer Cameras, Inc. Electronic still video camera with direct personal computer (pc) compatible digital format output
JP3069377B2 (en) * 1990-12-20 2000-07-24 キヤノン株式会社 Image processing apparatus and control method for image processing apparatus
US5228126A (en) * 1991-01-08 1993-07-13 Radius Inc. Image data accelerated processing apparatus and method
US5061924B1 (en) * 1991-01-25 1996-04-30 American Telephone & Telegraph Efficient vector codebook
JPH04256298A (en) * 1991-02-08 1992-09-10 Toshiba Corp Moving picture encoder
US5122875A (en) * 1991-02-27 1992-06-16 General Electric Company An HDTV compression system
US5191645A (en) * 1991-02-28 1993-03-02 Sony Corporation Of America Digital signal processing system employing icon displays
CA2062200A1 (en) * 1991-03-15 1992-09-16 Stephen C. Purcell Decompression processor for video applications
EP0514663A3 (en) * 1991-05-24 1993-07-14 International Business Machines Corporation An apparatus and method for motion video encoding employing an adaptive quantizer
ES2110504T3 (en) * 1991-06-04 1998-02-16 Qualcomm Inc SYSTEM OF COMPRESSION OF IMAGES BY SIZE SELF-ADAPTIVE OF BLOCKS.
HU215861B (en) * 1991-06-11 1999-03-29 Qualcomm Inc. Methods for performing speech signal compression by variable rate coding and decoding of digitized speech samples and means for impementing these methods
EP0526064B1 (en) * 1991-08-02 1997-09-10 The Grass Valley Group, Inc. Video editing system operator interface for visualization and interactive control of video material
US5309528A (en) * 1991-12-13 1994-05-03 Avid Technology, Inc. Image digitizer including pixel engine
US5355450A (en) * 1992-04-10 1994-10-11 Avid Technology, Inc. Media composer with adjustable source material compression
US5287420A (en) * 1992-04-08 1994-02-15 Supermac Technology Method for image compression on a personal computer

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4672441A (en) * 1985-04-17 1987-06-09 Siemens Aktiengesellschaft Method and apparatus for picture data reduction for digital video signals
US4982282A (en) * 1988-12-09 1991-01-01 Fuji Photo Film Co. Ltd. Image signal compression encoding apparatus and image signal expansion reproducing apparatus
DE3940554A1 (en) * 1988-12-10 1990-06-13 Fuji Photo Film Co Ltd COMPRESSION CODING DEVICE AND EXPANSION DECODING DEVICE FOR AN IMAGE SIGNAL
WO1991014339A1 (en) * 1990-03-15 1991-09-19 Thomson Consumer Electronics S.A. Digital image coding with quantization level computation
US5038209A (en) * 1990-09-27 1991-08-06 At&T Bell Laboratories Adaptive buffer/quantizer control for transform video coders

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
COMMUNICATIONS OF THE ASSOCIATION FOR COMPUTING MACHINERY vol. 34, no. 4, April 1991, NEW YORK US pages 30 - 44 G.K.WALLACE 'The JPEG still picture compression standard' cited in the application *

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5715018A (en) * 1992-04-10 1998-02-03 Avid Technology, Inc. Digital advertisement insertion system
US5909250A (en) * 1993-04-16 1999-06-01 Media 100 Inc. Adaptive video compression using variable quantization
US5926223A (en) * 1993-04-16 1999-07-20 Media 100 Inc. Adaptive video decompression
US6072836A (en) * 1993-04-16 2000-06-06 Media 100 Inc. Adaptive video compression and decompression
WO1997005748A1 (en) * 1995-07-28 1997-02-13 Polaroid Corporation Jpeg compression circuit with filtering
WO1999013646A2 (en) * 1997-09-08 1999-03-18 Limt Technology Ab Image signal processing method and apparatus
WO1999013646A3 (en) * 1997-09-08 1999-05-27 Limt Technology Ab Image signal processing method and apparatus
EP1079636A1 (en) * 1999-08-13 2001-02-28 Nokia Multimedia Terminals Oy Method and arrangement for reducing the volume or rate of an encoded digital video bitstream

Also Published As

Publication number Publication date
US6023531A (en) 2000-02-08
AU3274593A (en) 1993-07-19

Similar Documents

Publication Publication Date Title
US6687407B2 (en) Quantization table adjustment
US6023531A (en) Quantization table adjustment
EP0519962B1 (en) Digital image coding using a random scanning of image frames
US8031769B2 (en) Method and device for controlling quantization scales of a video encoding bit stream
US5416604A (en) Image compression method for bit-fixation and the apparatus therefor
CA2452550C (en) An apparatus and method for encoding digital image data in a lossless manner
US6222881B1 (en) Using numbers of non-zero quantized transform signals and signal differences to determine when to encode video signals using inter-frame or intra-frame encoding
US5787204A (en) Image signal decoding device capable of removing block distortion with simple structure
US6301304B1 (en) Architecture and method for inverse quantization of discrete cosine transform coefficients in MPEG decoders
JP3298915B2 (en) Encoding device
US6330369B1 (en) Method and apparatus for limiting data rate and image quality loss in lossy compression of sequences of digital images
JPH06189285A (en) Quantization/inverse quantization circuit for picture data compression/expansion device
WO1994022108A1 (en) Rapid thumbnail image reconstruction of dct compressed image data
JPH0832037B2 (en) Image data compression device
JP2891773B2 (en) Method and apparatus for processing digital image sequences
JP3469438B2 (en) Image signal processing method and apparatus, recording medium
JP3302091B2 (en) Encoding device and encoding method
JP2922598B2 (en) Image coding method
CA2640597C (en) Fixed bit rate, intraframe compression and decompression of video
JPH0549021A (en) High efficient coder
JPH08163561A (en) Picture data compression device
JP3038022B2 (en) Electronic camera device
JP3192133B2 (en) Electronic camera device
JPH0746407A (en) Picture data compressing device and picture data restoring device
JPH09224246A (en) Image compression coding and image compression decoding device

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AU CA CS HU JP KR PL

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): AT BE CH DE DK ES FR GB GR IE IT LU MC NL PT SE

DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase

Ref country code: CA