WO2000048319A1 - Method and apparatus for truncated decoding - Google Patents

Method and apparatus for truncated decoding Download PDF

Info

Publication number
WO2000048319A1
WO2000048319A1 PCT/US2000/003299 US0003299W WO0048319A1 WO 2000048319 A1 WO2000048319 A1 WO 2000048319A1 US 0003299 W US0003299 W US 0003299W WO 0048319 A1 WO0048319 A1 WO 0048319A1
Authority
WO
WIPO (PCT)
Prior art keywords
data
block
value
numerator
denominator
Prior art date
Application number
PCT/US2000/003299
Other languages
French (fr)
Inventor
Tetsujiro Kondo
James J. Carrig
Yasuhiro Fujimori
Sugata Ghosal
Original Assignee
Sony Electronics, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sony Electronics, Inc. filed Critical Sony Electronics, Inc.
Priority to AU36977/00A priority Critical patent/AU3697700A/en
Publication of WO2000048319A1 publication Critical patent/WO2000048319A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/184Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being bits, e.g. of the compressed video stream
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/85Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
    • H04N19/89Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression involving methods or arrangements for detection of transmission errors at the decoder
    • H04N19/895Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression involving methods or arrangements for detection of transmission errors at the decoder in combination with error concealment
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/90Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
    • H04N19/98Adaptive-dynamic-range coding [ADRC]

Definitions

  • the present invention relates to the recovery of data. More particularly, the present invention relates to the recovery of lost/ damaged block data in a bitstream of compressed data.
  • the discrete data points that make up a digital image are known as pixels.
  • each pixel is represented independently using 8 bits, but other representations also are used for the purposes of compression analysis.
  • Most of the alternative representations begin by dividing this raw data into disjoint sets. For historical reasons, these sets are referred to as "blocks", even though they may not have a traditional block shape.
  • the alternative representation then characterizes the data by some block-wide information and per-pixel information.
  • Per-pixel information may indicate where the pixel value lies within the range specified by the global information. For compression to be achieved, the per-pixel information must use only a few bits of storage so that the total number of bits used is less than that required to store the raw image.
  • the block data is comprised of the MIN, DR and Qbit number (defined below), and the pixel data is comprised of Q codes.
  • a Q code is a Qbit number that corresponds to one value in the set ⁇ MIN, MIN+1, ....,MAX ⁇ .
  • a method and apparatus for hardware efficient decoding of compression coefficients In one embodiment, a numerator of an equation used to compute a compression coefficient is computed. The denominator is also computed. The numerator and denominator values are truncated such that each numerator and denominator are equal in length to a predetermined constant K. A K-bit integer division is then executed to determine the value of the compression constant.
  • Figure 1 generally illustrates the processes and apparatus of signal encoding, transmission, and decoding.
  • Figure 2 is a flow diagram illustrating one embodiment of the decoding process in accordance with the teachings of the present invention.
  • Figure 3 is a flow diagram generally illustrating one embodiment of the data recovery process of the present invention.
  • Figure 4 is a flow chart illustrating one embodiment of the process of the present invention.
  • FIGS 5a and 5b illustrate embodiments of the system of the present invention.
  • Figure 6 is a flow diagram of one embodiment of the Qbit and Motion Flag recovery process of the present invention.
  • Figure 7 is a table illustrating one embodiment of candidate decodings.
  • FIGS 8a, 8b, 8c, 8d illustrate embodiments of measurements utilized in the Qbit and Motion Flag recovery process of Figure 6.
  • Figure 9 illustrates one embodiment of a table used to determine a square error probability function utilized in the Qbit and Motion Flag recovery process of Figure 6.
  • Figure 10 illustrates one embodiment of a Qbit, Motion Flag and auxiliary information recovery process in accordance with one embodiment of the present invention.
  • Figure 11 illustrates the use of a post-amble in one embodiment of a bidirectional Qbit and Motion Flag recovery process.
  • Figures 12a, 12b and 12c illustrate an alternate embodiment for evaluating candidate decodings.
  • Figure 13 illustrates the use of smoothness measures in accordance with the teachings of one embodiment of the present invention.
  • Figures 14a, 14b, 14c, 14d and 14e illustrate an alternate embodiment of a process for evaluating candidate decodings.
  • Figure 15a illustrates an alternate process for evaluating candidate decodings and Figure 15b illustrates one embodiment for determining weighting values.
  • the present invention provides a method for decoding a bitstream to provide for a robust error recovery.
  • numerous details are set forth, in order to provide a thorough understanding of the present invention. However, it will be apparent to one skilled in the art that these specific details are not required in order to practice the present invention. In other instances, well known electrical structures and circuits are shown in block diagram form in order not to obscure the present invention unnecessarily.
  • ADRC Adaptive Dynamic Range Coding
  • MAX' is the averaged value of x' in the case of q - 2 a - 1 ;
  • MIN' is the averaged value of x' in the case of q - 0 ;
  • DR' MAX' - MIN'
  • DR represents a dynamic range value
  • MAX represents the maximum level of a block
  • MIN represents the minimum level of a block
  • x represents the signal level of each sample
  • Q represents the number of quantization bits (qbit)
  • q represents the quantization code (encoded data or Q code)
  • x' represents the decoded level of each sample
  • [_• J represent a truncation operation performed on the value within the square brackets.
  • Signal 100 is a data stream input to Encoder 110.
  • Encoder 110 follows the Adaptive Dynamic Range Coding ("ADRC") compression algorithm and generates Packets 1, . . . ⁇ for transmission along Transmission Media 135.
  • Decoder 120 receives Packets 1, . . . N from Transmission Media 135 and generates Signal 130.
  • Signal 130 is a reconstruction of Signal 100.
  • ADRC Adaptive Dynamic Range Coding
  • Encoder 110 and Decoder 120 can be implemented a variety of ways to perform the functionality described herein.
  • Encoder 110 and /or Decoder 120 are embodied as software stored on media and executed by a general purpose or specifically configured computer system, typically including a central processing unit, memory and one or more input /output devices and coprocessors.
  • the Encoder 110 and/or Decoder 120 may be implemented as logic to perform the functionality described herein.
  • Encoder 110 and /or Decoder 120 can be implemented as a combination of hardware, software or firmware.
  • Signal 100 is a color video image comprising a sequence of video frames, each frame including information representative of an image in an interlaced video system.
  • Each frame is composed of two fields, wherein one field contains data of the even lines of the image and the other field containing the odd lines of the image.
  • the data includes pixel values which describe the color components of a corresponding location in the image.
  • the color components consist of the luminance signal Y, and color difference signals U, and V. It is readily apparent the process of the present invention can be applied to signals other than interlaced video signals. Furthermore, it is apparent that the present invention is not limited to implementations in the Y, U, V color space, but can be applied to images represented in other color spaces.
  • Encoder 110 divides the Y, U, and V signals and processes each group of signals independently in accordance with the ADRC algorithm.
  • the following description for purposes of simplifying the discussion, describes the processing of the Y signal; however, the encoding steps are replicated for the U and V signals.
  • Encoder 110 groups Y signals across two subsequent frames, referred to herein as a frame pair, of Signal 100 into three dimensional blocks ("3D") blocks.
  • 3D three dimensional blocks
  • a 3D block is generated from grouping two 2D blocks from the same localized area across a given frame pair, wherein a two dimensional 2D block is created by grouping localized pixels within a frame or a field. It is contemplated that the process described herein can be applied to different block structures. The grouping of signals will be further described in the image-to-block mapping section below.
  • Encoder 110 calculates whether there is a change in pixel values between the 2D blocks forming the 3D block.
  • a Motion Flag is set if there are substantial changes in values. As is known in the art, use of a Motion Flag allows Encoder 110 to reduce the number of quantization codes when there is localized image repetition within each frame pair.
  • Encoder 110 encodes signals on a frame by frame basis for a stream of frames representing a sequence of video frames. In another embodiment, Encoder 110 encodes signals on a field by field basis for a stream of fields representing a sequence of video fields. Accordingly, Motion Flags are not used and 2D blocks are used to calculate the MIN, MAX, and DR values.
  • Encoder 110 references the calculated DR against a threshold table (not shown) to determine the number of quantization bits (“Qbits") used to encode pixels within the block corresponding to the DR. Encoding of a pixel results in a quantization code ("Q code").
  • Q code quantization code
  • the Qbit selection is derived from the DR of a 3D block. Accordingly, all pixels within a given 3D block are encoded using the same Qbit, resulting in a 3D encoded block.
  • the collection of Q codes, MIN, Motion Flag, and DR for a 3D encoded block is referred to as a 3D ADRC block.
  • 2D blocks are encoded and the collection of Q codes, MIN, and DR for a given 2D block results in 2D ADRC blocks.
  • the threshold table consists of a row of DR threshold values.
  • a Qbit corresponds to the number of quantization bits used to encode a range of DR values between two adjacent DRs within a row of the threshold table.
  • the threshold table includes multiple rows and selection of a row depends on the desired transmission rate. Each row in the threshold table is identified by a threshold index.
  • a detailed description of one embodiment of threshold selection is described below in the discussion of partial buffering.
  • a further description of ADRC encoding and buffering is disclosed in US Patent no. 4,722,003 entitled “High Efficiency Coding Apparatus” and US Patent no. 4,845,560 also entitled “High Efficiency Coding Apparatus", assigned to an assignee of the present invention.
  • the Q codes are referred to as variable length data ("VL-data").
  • the DR, MIN, and Motion Flag are referred to as block attributes or compression constants.
  • the block attributes, together with the threshold index, constitute the fixed length data (“FL-data").
  • the term block attribute describes a parameter associated with a component of a signal element, wherein a signal element includes multiple components.
  • Frames, block attributes, and VL-data describe a variety of components within a video signal. The boundaries, location, and quantity of these components are dependent on the transmission and compression properties of a video signal.
  • the encoded data is formed into packets and transmitted across a transmission media or stored on a storage media.
  • Figure 2 is a simplified flow diagram illustrating one embodiment of decoding process performed by Decoder 120.
  • Figure 2 further describes, in different combinations of Qbit, Motion Flag, DR, MIN and pixel data, an innovative process for error recovery. More particularly, a method and apparatus for recovering lost or damaged (lost/ damaged) compression constants of data to be decoded is described.
  • the lost/damaged compression constants may include DR, MIN and/or MAX.
  • the data received in packets is processed to decode the data encoded in the received bitstream.
  • data is received.
  • the data may be deshuffled, step 210, if the data was shuffled at time of encoding.
  • ADRC decoding is then applied to the data, step 245, in accordance with the teachings known in the art.
  • a recovery process is performed to recover the Qbit and Motion Flag values that were located in lost packets.
  • the Qbit value is lost typically due to DR loss (due to lost packets).
  • the location of Q code bits corresponding to a pixel cannot be determined from the data bitstream. If a Qbit or Motion Flag value is improperly determined then this error will propagate to subsequent data as the starting point of subsequent blocks in the bitstream will be incorrectly identified.
  • a number of techniques may be used to recover Qbit or Motion Flag values.
  • Figure 3 describes the general process for recovering the Qbit and Motion Flag values in accordance with the teachings of the present invention. This particular embodiment describes the process using multiple blocks of data to recover the Qbit and Motion Flag values; however, the particular number of blocks could be one or more blocks.
  • step 305 candidate decodings based on specified parameters are generated for the three blocks examined.
  • step 315 each candidate decoding is scored on the likelihood that it is an accurate decoding and at step 320, the candidate decoding with the best score is used.
  • the Qbit and Motion Flag values identified enable the subsequent decoding of pixels of the affected blocks.
  • any DR or MIN values that were lost due to lost packets are recovered, step 265.
  • a variety of recovery processes known to one skilled in the art can be applied to recover DR and MIN, including least squares or the averaging of values of adjacent blocks.
  • DR is recovered in accordance with the following equation: Edge-matching ADRC:
  • DR' is the recovered DR value
  • represents the number of terms used (e.g., the number of neighboring values yj used)
  • ej is the i-th encoded value in an ADRC block and e
  • y is a decoded value of a corresponding adjacent block pixel
  • MI ⁇ is the MI ⁇ value of the block
  • Q is the Qbit value.
  • MI ⁇ is determined according to the following equation: Edge-matching ADRC:
  • MI ⁇ ' is the recovered MI ⁇ value and DR is the DR value of the block.
  • DR may be simplified to eliminate second order terms to obtain a more cost-effective solution.
  • MI ⁇ ' formulae do not contain second order terms, no such simplification is necessary.
  • DR may be determined as follows:
  • DR and MI ⁇ may be recovered according to the following equations: ⁇ on edge-matching ADRC:
  • MI ⁇ is determined according to the following equation:
  • the equation can be expanded to a quotient consisting of a numerator calculation and denominator calculation:
  • DR 2 ⁇ MN .
  • the numerator value is determined using full precision.
  • the denominator is determined using full precision.
  • the numerator and the denominator are reduced to at least K-bits in length. For example, in one embodiment: while (n ⁇ 2 K or d > 2 K ), then n/2 (shift off LSB) d/2 (shift off LSB).
  • the numerator and denominator are shifted the same number of bits.
  • the numerator and denominator may be shifted differing number of bits; in such an embodiment, the different amounts of shifts may be compensated for in subsequent computations.
  • K is selected such that integer division can be performed using cost efficient logic while maintaining an acceptable image quality.
  • K is selected such that the maximum error is not usually detectable, e.g., the maximum error is not greater than 3%.
  • K is selected to be 13.
  • the division operation is performed if the answer does not overflow or underflow. If an underflow or overflow occurs, the value is respectively clipped to the lower bound or upper bound. This may be determined by comparing the numerator to values corresponding to the product of the bound and the denominator. In the present example, the following steps would therefore be performed: For MIN: ii n ⁇ ⁇ U M - d)
  • MIN — (K bit division) d
  • L M represents the lower bound of MIN allowed by auxiliary information
  • U M represents the upper bound of MIN allowed by auxiliary information
  • Auxiliary information in one embodiment, consists of predefined compression information used for a particular application.
  • the lower and upper bounds are respectively the lower and upper bounds of the range of pixel values represented by the number of quantization bits used.
  • the range can be restricted to MIN + DR ⁇ MAX, where MAX is the maximum pixel value. In one embodiment, for 8 bit encoding, MAX is equal to 255.
  • MAX is equal to 255.
  • the bounds for DR are similarly determined as discussed above.
  • the value is clipped for consistency with the auxiliary information available, for example other available compression constants.
  • the value is clipped according to the following equation which comprises the functions for clipping to a lower bound or upper bound: For MIN: where max represents a maximum function, min represents a minimum function, and U MD represents the upper bound defined by MIN+DR; Similarly, for DR:
  • circuitry used to implement the hardware efficient implementation described above is illustrated in the simplified functional block diagram of Figure 5a.
  • the circuitry may be implemented in specially configured logic, such as large scale integration (LSI) logic or programmable gate arrays.
  • LSI large scale integration
  • the circuitry may be implemented as code executed by a dedicated, specially configured or general purpose processor, which executes instructions stored in a memory or other storage device.
  • the present invention may be implemented as a combination of the above.
  • numerator logic 510 determines the full precision value of the numerator portion of the computation performed to estimate a lost/damaged compression constant in the encoded domain.
  • Denominator logic 520 similarly determines a full precision value of the denominator portion of the computation.
  • a least significant bit shift operation is performed on the numerator and denominator until the numerator and denominator are each, at a minimum, K- bits in length.
  • the shift operation is performed by computation logic 550.
  • K-bit shift registers may be used to shift out least significant bits of the values generated by logic 510, 520 until the numerator and denominator are K-bits in length.
  • K is chosen, for example, empirically, such that the amount of logic /hardware required to perform subsequent operations is minimized for efficiency while maintaining an acceptable level of precision.
  • Computation logic 550 performs an integer division of the numerator and denominator. If an overflow or underflow will occur, the value is clipped to an upper bound or lower bound respectively defined by known compression constants.
  • Clip logic 560 is optionally included to clip the output of computation logic 550 to be consistent with other available compression constants.
  • the value may be clipped to a lower bound of the compression constant as best estimated based upon auxiliary information that defines the range of compression values. This information may be predefined according to the particular embodiment of encoding and decoding processes.
  • this structure provides a fast, cost efficient circuit for estimating lost/ damaged compression constants.
  • the output of the circuit is preferably coupled to additional logic (not shown) which decodes using data including the recovered compression constant.
  • additional logic not shown
  • the decoded data is used to drive a display device.
  • FIG. 5b An alternate embodiment of the circuit for recovering lost /damaged compression constants is shown in Figure 5b.
  • the methods described herein can be implemented on a specially configured or general purpose processor system 570. Instructions are stored in the memory 590 and accessed by the processor 575 to perform many of the steps described herein.
  • An input 580 receives the input bitstream and forwards the data to the processor 575.
  • the output 585 outputs the data.
  • the output may consist of the decoded data, such as image data decoded once the compression constant is recovered, sufficient to drive an external device such as display 595.
  • the output 585 outputs the recovered compression constant.
  • the recovered compression constant is then input to other circuitry (not shown) to generate the decoded data.
  • step 270 ADRC decoding is applied to those blocks not previously decoded.
  • a pixel recovery process is executed, step 275, to recover any erroneous pixel data that may have occurred due to lost packets or random errors.
  • step 280 a 3:1:0 -> 4:2:2 back conversion is performed, step 280, to place the image in the desired format for display.
  • Figure 6 illustrates one particular embodiment of the Qbit and Motion Flag recovery process of the decoding process of the present invention.
  • the inputs to the process are adjacent block information.
  • the block attributes include a compression constant and pixel data for the three blocks to be processed. Error flags indicating the location of the lost data are also input.
  • the error flags can be generated in a variety of ways known to one skilled in the art and will not be discussed further herein except to say that the flags indicate which bits were transmitted by damaged or lost packets.
  • the candidate decodings are generated.
  • the candidate decodings can be generated a variety of ways. For example, although the processing burden would be quite significant, the candidate decodings can include all possible decodings. Alternately, the candidate decodings can be generated based on pre-specified parameters to narrow the number of candidate decodings to be evaluated.
  • the candidate decodings are determined based on the possible key values used to shuffle the encoded data.
  • candidate decodings are further limited by the length of the bits remaining to be decoded and knowledge of how many blocks remain. For example, as will be discussed, if processing the last block typically the decoding length of that block is known.
  • shuffling is performed using a masking key.
  • a key referred to herein as KEY
  • KEY is used to mask a bitstream of Q codes.
  • KEY may be used to mask a bitstream of Q codes corresponding to three blocks of data.
  • Each key element (di) of the masking key is generated by the combination of certain compression constants, used to encode a corresponding block of data.
  • the MF and Qbit values are used to define KEY.
  • KEY is formed according to the following:
  • KEY values are regenerated depending upon the values used to create the masking keys.
  • the regenerated KEY values are used to unmask the received bitstream of Q codes resulting in candidate encoded data.
  • the MF or Qbit value used to generate the mask is not correct, the corresponding Q codes will exhibit a low level of correlation, which will be typically readily detectable.
  • Figure 7 illustrates possible cases for the present embodiment, where the value x indicates an unknown value (which may be due to packet loss).
  • the variable mj is defined as the Motion Flag of the i-th block
  • qi is the number of the quantization bits of the i-th block
  • nj is the number of possible candidates of the i-th block
  • di is the value of a key element of the i-th block.
  • the i-th block is defined within each group.
  • the number of blocks within each group is three.
  • a key for the three block group is generated as, do + 10-d ⁇ + 100 -0 2 - Assuming that in the first block the Motion Flag is unknown and the number of quantization bits is 2, mo equals x and qo equals 2.
  • di 5 -mi + q , the set of possible digits for do consists of ⁇ 2 and 7). Thus, the number of possible values (no) is 2.
  • the third block has a Motion Flag value of 1 and an unknown number of quantization bits.
  • the candidate decodings generated are evaluated or scored on the likelihood that it is a correct decoding of the data. Furthermore, at step 320, the candidate decoding with the best score is selected to be used.
  • the score may be derived from an analysis of how pixels or blocks of a particular candidate decoding fit in with other pixels of the image.
  • the score is derived based upon a criteria indicative of error, such as a square error and correlation. For example, with respect to correlation, it is a fairly safe assumption that the adjacent pixels will be somewhat closely correlated. Thus, a significant or a lack of correlation is indicative that the candidate decoding is or is not the correct decoding.
  • the present embodiment utilizes four subscoring criteria which are subsequently combined into a final score.
  • the square error measure is generated; in step 620, horizontal correlation is determined; in step 625, vertical correlation is determined; and at step 630 temporal activity is measured.
  • Each step utilizes an M- by-2-N matrix in accordance with M candidates, N blocks and 2 frames /block of data.
  • horizontal and vertical correlation is discussed herein, it should be recognized that a variety of correlation measurements, including diagonal correlation, can be used.
  • a confidence measure is generated for each criterion to normalize the measurements generated.
  • a probability function for each of the different criteria is generated. These probability functions are then combined, for example, by multiplying the probability values to generate a score, for example, the likelihood function shown in Figure 6, step 675.
  • the score for the candidate decoding is subsequently compared against all candidate decoding scores to determine the likely candidate.
  • a variety of techniques can be used to evaluate the candidate decodings and generate the "scorings" for each candidate. For example, confidence measures are one way of normalizing the criteria. Furthermore, a variety of confidence measures, besides the ones described below, can be used. Similarly, multiplying the probability values based on each criterion to generate a total likelihood function is just one way of combining the variety of criteria examined.
  • the encoding processes facilitate the determination of the best candidate decoding because typically the candidate decodings which are not the likely candidate will have a relatively poor score, while decodings that are the likely candidate will have a significantly better score.
  • Figures 8a, 8b, 8c and 8d provide illustrations of the different measurements performed at steps 615, 620, 625 and 630 of Figure 6 to generate the scoring and total score for a particular candidate decoding.
  • Figure 8a illustrates the square error to evaluate a candidate decoded pixel Xj as compared to its decoded neighbors yi i, wherein the suffix "i,j" is corresponding to the neighboring address of "i".
  • some of the largest terms are removed to remove any influences due to spikes, that is the terms that arise due to legitimate edges in the image.
  • the three largest terms of (xry )* ⁇ may be discarded to remove spikes.
  • Figure 8b illustrates the temporal activity criteria. This is applicable only when it is or is assumed to be a motion block.
  • the temporal activity criteria assumes that the better the candidate decoding, the smaller the differences between blocks. Thus the worse the candidate decoding, the larger the differences between blocks. Spatial correlation assumes that the more likely candidate decodings will result in heavy correlations as real images tend to change in a slow consistent way.
  • the horizontal correlation process illustrated in Figure 8c and vertical correlation process illustrated by Figure 8d utilize that assumption.
  • the confidence measures, steps 635, 640, 645, and 650 of Figure 6, provide a process for normalizing the criteria determined in the previous steps (steps 615, 620, 625 and 630).
  • the confidence measure for the square error takes values from the interval [0,1], and confidence is equal to 0 if the errors are equal and equal to 1 if one error is 0.
  • Other measures or methods to normalize are also contemplated.
  • the confidence measure for the spatial correlation is: maximum(Y,0) - maximum(X,0) where Y is the best correlation value and X is the correlation for the current candidate decoding.
  • the probability function is generated for each of the different criteria.
  • a variety of methods can be used to generate the probability measure. For example, a score can be prescribed to a confidence measure. If the confidence measure is greater than a predetermined value, e.g., 0.8, the base score is decreased by 10; if between 0.5 and 0.8, the base score decreased by 5...
  • Figure 9 illustrates one embodiment in which a table used to generate the probability function for the square error measurement criteria.
  • the table includes empirically determined data containing arbitrarily binned confidence and square error measures and known candidate decodings. More particularly, the table can be generated by using undamaged data and assuming that the DR was corrupted or lost. Keys and confidence measures for correct and incorrect decodings are then generated.
  • the table reflects the probability ratio of correct to incorrect decodings. Using this table, for a particular squared error value (row) and confidence value (column), the probability can be determined. For example, it can therefore be seen that for a variety of square error measures at a confidence measure of zero, there is approximately a 40% to 50% probability that the candidate is correct. If the confidence is not 0, but small, the probability drops significantly. Similar probability tables are generated for the correlation and temporal measurements based on corresponding empirically determined criteria measurements and confidence measurements.
  • the probabilities generated are considered data to generate "scores" in the present embodiment. Other techniques to score candidate decodings may also be used.
  • the different probabilities are combined into a likelihood
  • Pi, j , and Pi, j is the probability function for candidate i, block j.
  • the candidate is therefore selected as the one that maximizes the function L[.
  • DR and MIN values are recovered where necessary.
  • a variety of techniques, from default values, averaging, squared error functions to more sophisticated techniques, including those discussed in Kondo, Fujimori, Nakaya and Uchida, "A New Concealment Method for Digital VCRs", IEEE Visual Signal Processing and Communications, September 20-22, 1993, Melbourne Australia, may be used.
  • the recovered values are utilized to generate the candidate decodings as discussed above.
  • the DR and MIN values are determined during the Qbit determination process. This is illustrated in Figure 10.
  • the Motion Flag and number of quantization bits are used in the encoding process and later used during the recovery process to narrow the number of possible candidate decodings. Other information can also be used.
  • the value of DR and /or value of MIN may also be used to encode the data.
  • a portion of bits of DR are used for encoding (e.g., the two least significant bits of DR). Although the DR data is encoded, the number of possible candidate decodings is increased significantly as variables are added.
  • the DR and MIN are therefore recovered using the auxiliary information provided, e.g., the encoded two bits of the sum of DRi, DR 2 and DR 3 . This improves the process of candidate selection at the cost of additional overhead to examine the larger number of candidate decodings.
  • the process is applied to each subsequent block of a buffer; if all or some of the FL-data is available, the number of candidate decodings can be reduced, possibly to one candidate decoding given all the FL-data for a block is available.
  • the Qbit and Motion Flag recovery process be avoided altogether as the process is a relatively time consuming one.
  • blocks are processed from the beginning of a buffer until a block with lost Qbit/Motion Flag information is reached. This is referred to as forward Qbit and Motion Flag recovery.
  • the end of the buffer is referenced to determine the location of the end of the last block of the buffer and the data is recovered from the end of the buffer until a block with lost Qbit/Motion Flag data is reached. This is referred to as backward Qbit and Motion Flag recovery.
  • the blocks are variable in length, due the length of the VL- data; therefore there is a need to determine the number of bits forming the VL-data of a block so that the position of subsequent blocks in the buffer can be accurately located.
  • a post-amble of a predetermined and preferably easily recognizable pattern is placed in the buffer to fill the unused bit locations.
  • the post-amble will be located between the block and the end of the buffer.
  • review of patterns of bits enables the system to locate the beginning of the post- amble and therefore the end of the last block in the buffer.
  • This information can be used in two ways. If the last block contains damaged Qbit/Motion Flag data and the beginning of the last block is known (e.g., the preceding blocks have been successfully decoded), the difference between the end of the immediate preceding block and the beginning of the post-amble corresponds to the length of the block. This information can be used to calculate the Qbit and /or Motion Flag of the block. The starting location of the post-amble can also be used to perform Qbit and Motion Flag recovery starting at the last block and proceeding towards the beginning of the buffer. Thus, the Qbit and Motion Flag recovery process can be implemented bidirectionally. Figure 11 illustrates the use of a post-amble in the bidirectional Qbit and Motion Flag recovery process.
  • the buffer 1100 includes FL- data 1103 for the N groups of blocks of VL-data. Each group consists of a plurality of blocks (e.g., 3 blocks).
  • the first two groups 1105, 1110 are decoded and the third group 1115 cannot immediately be decoded due to damaged DR/Motion Flag data.
  • the Qbit/Motion Flag recovery process is required in order to recover the damaged data.
  • the process refers to the end of the buffer, determined by looking for the post-amble pattern 1120. The beginning of the post-amble and therefore the end of the last group of blocks are determined.
  • the DR/Motion Flag data is indicative of the length of the VL-data
  • the beginning of the VL data of the last block, and therefore the end of the immediate preceding block is determined. Therefore, the blocks can be decoded , e.g., blocks 1125, 1130, 1135 until a block 1140 with damaged data is reached.
  • the damaged 1115, 1140 and obstructed blocks 1150 are then recovered, for example, using the Qbit/Motion Flag recovery process described above.
  • the bidirectional process is not limited to a sequence of forward and reverse processing; processing can occur in either or both directions. Furthermore, in some embodiments, it may be desirable to perform such processing in parallel to improve efficiency. Finally, it is contemplated that undamaged obstructed blocks may be recovered by directly accessing the Qbit/Motion Flag information without executing the Qbit/Motion Flag recovery process described above.
  • the smoothness of the image using each candidate decoding is evaluated.
  • the Laplacian measurement is performed.
  • the Laplacian measurement measures a second-order image surface property, e.g., surface curvature.
  • the Laplacian measurement will result in a value that is approximately zero.
  • Figure 12a illustrates one embodiment of the Laplacian kernel. It is contemplated that other embodiments may also be used.
  • the kernel "L" represents a 3x3 region. To measure smoothness of the image region, 3x3 subregions of the image ( Figure 12b) are convolved with the kernel and the convolved values are averaged. The size of the region and subregion (and therefore kernel size) can be varied according to application.
  • the process utilizes a kernel and subregion size of 3x3 and a region size of 8x8, the individual elements identified by indices i,j.
  • the candidate decoded values x[i][j] are normalized.
  • the values can be normalized according to the following equation:
  • the normalized values are used to compute a block Laplacian value Lx indicative of smoothness according to the following:
  • the Laplacian evaluation can also be achieved using candidate encoded values q[i][j] .
  • the basic process is the same as the candidate decoded value case of Figure 12c.
  • This embodiment utilizes a kernel and subregion size of 3x3 and a region size 8x8, the individual elements identifies by the indices i,j.
  • the candidate encoded values q[i][j] are normalized. For example, the values can be normalized according to the following equation:
  • the normalized values are used to compute the block Laplacian value L q indicative of smoothness according to the following equation:
  • higher order image surface properties can be used as a smoothness measure.
  • higher order kernels would be used.
  • a fourth order block Laplacian measurement may be performed using a fourth order kernel.
  • Such a fourth order kernel can be realized using two second order Laplacian computations in cascade.
  • the evaluation process can be dependent upon whether the image has an activity or motion larger than a predetermined level. If the image portion is evaluated to have larger motion than a predetermined level, then it may be preferable to perform the measurements on a field basis as opposed to on a frame basis. This is explained with reference to Figure 13.
  • Figure 13 explains the process using smoothness measures; however, it is contemplated that this process can be implemented using a variety of types of measures.
  • Frame 1305 of an image region is composed of field 0 and field 1. If motion is not detected, step 1310, the smoothness measurement is computed by computing the block Laplacian value for the block within each frame, step 1315. If larger motion than a predetermined level is detected, block Laplacian measurements are performed on each field, steps 1320, 1325, and the two measurements are combined, step 1330, e.g. averaged, to generate the smoothness measurement.
  • Motion can be detected /measured a variety of ways.
  • the extent of change between fields is evaluated and motion is detected if it exceeds a predetermined threshold.
  • Motion detection and the use of frame information and field information to generate recovered values can be applied to any portion of the process that requires a recovered value to be generated.
  • motion detection and the selective use of frame information and field information to generate recovered values can be applied to DR/MIN recovery, pixel recovery as well as Qbit and Motion Flag recovery processes.
  • the recovery process will utilize existing information on a field basis or frame basis.
  • this process can be combined with the application of weighting values that are selected based upon levels of correlation in particular directions (e.g., horizontal or vertical).
  • candidate decodings are evaluated based upon intra block and inter block measurements.
  • block refers to a portion of a frame or field.
  • the intra block measurement evaluates the candidate decoded image portion, e.g., the smoothness of the image portion.
  • the inter block measurement measures how well the candidate decoding fits with the neighboring image portions.
  • Figures 14a and 14b illustrate the combined inter block and intra block evaluation. In particular, Figure 14a shows an acceptable candidate decoding as both the inter block and intra block measurements are good, whereas in Figure 14b the inter block measurement is poor, even though the intra block measurement is quite good.
  • Examples of intra block measurements include the smoothness measurement described above.
  • Examples of inter block measurements include the square error measurements described earlier.
  • An alternative inter block measurement is the ratio of compatible boundary pixels and the total number of boundary pixels at the candidate ADRC block.
  • Figure 14d illustrates an image portion (block) of data of a encoded values 1450 consisting of q values from which candidate decoded values x are generated and neighboring decoded data 1455 consisting of y values.
  • the intra block measure is computed to generate a measure, e.g., block Laplacian L x .
  • the inter block measure S x is computed to generate a measure of compatibility between adjacent blocks.
  • the combined measure M x is generated. The combined measure provides the information used to select a candidate decoding.
  • S x is computed as the number of neighboring data that lies in a valid range for each boundary pixel of candidate decoding (see Figure 14e).
  • Figure 14e is a chart illustrating a valid range for one embodiment which shows a valid range of each observed quantized value qi.
  • a median technique is applied.
  • the value of MIN is recovered as the median of all MINi values computed as: where qi represents the encoded pixel value and yi represents the decoded pixel neighboring qi.
  • s DR/(2 Q - 1).
  • ADRC, s DR/2 Q , where Q represents the number of quantization bits per pixel (Qbit value).
  • the values used may be temporally proximate or spatially proximate.
  • the values of yi may be the decoded value of the neighboring pixel in an adjacent frame/field or the same field.
  • the values of yi may be the decoded value of the pixel from the same location as qi in an adjacent frame/field or the same field.
  • any DR and /or MIN recovery technique may be combined with a clipping process to improve recovery accuracy and prevent data overflow during the recovery process.
  • the clipping process restricts the recovered data to a predetermined range of values; thus those values outside the range are clipped to the closest range bound.
  • the clipping process restricts values in the range [LQ, UQ], where LQ, UQ respectively represent the lower and upper bounds of the range of pixel values represented by the number of quantization bits Q.
  • the values can be further restricted to MIN + DR ⁇ Num, where Num represents the minimum pixel value; in the present embodiment, Num is 255.
  • UQ +1 LQ+I.
  • val max(min(val, min(U Q ,255-MIN)),L Q ) where min and max respectively represent minimum and maximum functions.
  • boundary pixels yj used to generate an recovered DR and /or MIN can be filtered to only use those that appear to correlate best, thereby better recovering DR and MIN. Those boundary pixels not meeting the criteria are not used.
  • a boundary pixel yj is considered valid for DR calculations if there exists a value of DR such that LQ ⁇ DR ⁇ UQ and an original pixel yi would have been encoded as qi.
  • a pixel is valid if the following equations are satisfied:
  • the value can then be clipped into the valid range.
  • This process forces the DR recovered value into the interior of the valid region as defined by the threshold table, reducing the accuracy for points whose true DR lies near the threshold table boundary.
  • the valid pixel selection process is modified to relax the upper and lower bounds, allowing border pixels that encroach into the neighboring valid region. By including points just outside the boundary, it is more likely that the recovered value will take on a value near that of the upper or lower bound.
  • the relaxed bounds L'Q and U'Q are computed by means of a relaxation constant r. In one embodiment, r is set to a value of .5. Other values can be used:
  • a recovered value of the DR or MIN value to be recovered is generated in one direction and at step 1515, a recovered value is generated in another direction.
  • a recovered value is generated in another direction.
  • boundary pixels along horizontal borders are used to generate a first recovered value, "hest”, and boundary pixels along vertical borders are used to generated a second recovered value, "vest”.
  • boundary pixels between adjacent fields are used to generate a first recovered value and boundary pixels between adjacent frames are used to generate a second recovered value.
  • the recovered values are weighted according to correlation calculations indicative of the level of correlation in each direction.
  • the weighted first and second recovered values are combined to generate a combined recovered value, step 1525.
  • the process is not limited to generated weighted recovered values in only two directions; the number of recovered values that are weighted and combined can be varied according to the application.
  • the combined recovered value is computed as follows:
  • vc represents the vertical correlation
  • hest represents a DR recovered value based only on left and right boundary information
  • vest represents a DR recovered value based only on top and bottom boundary information
  • a represents the weighting value
  • the weighting value can be determined a variety of ways.
  • Figure 15b illustrates one embodiment for determining weighting values as a function of the difference between the horizontal correlation and vertical correlation. More particularly, a was chosen to be: ⁇ ⁇ 35
  • the temporal correlation can be evaluated and used to weight recovered values.
  • a combination of temporal and spatial correlation can be performed. For example, one recovered value is generated between fields as a temporal recovered value. Another recovered value is generated within one field as a spatial recovered value. The final recovered value is computed as the combination value with a combination of temporal and spatial correlation. The correlation combination can be replaced with a motion quantity.
  • the techniques described herein can be applied to audio data.
  • a low complexity modification to the least squares technique is used.
  • the blinking experienced due to recovered DR values is reduced.
  • QV represents a list of encoded values from the image section or ADRC block whose DR is being recovered having a set of points qi and Y is a list of decoded values taken from the vertical or horizontal neighbors of the points in QV, where yi represents a vertical or horizontal neighbor of qi.
  • yi represents a vertical or horizontal neighbor of qi.
  • each point qi may have up to four decoded neighbors, one pixel or point may give rise to as many as four (qi, y pairings.
  • the unconstrained least squares estimate of DR (DR u is) is thus:
  • the unconstrained least squares estimate is can be clipped to assure consistency with the threshold table and the equation MI ⁇ + DR ⁇ 255 which is enforced during encoding (Typically, for non-edge-matching ADRC, permissible DR values are in the range of 1-256).
  • the least squares estimate is clipped (DR ⁇ sc ) by:
  • the estimation can be enhanced by selecting the pixels that are more suitable for DR estimation to calculate the estimate of DR.
  • flat regions in an image provide pixels which are more suitable for DR estimation than those regions in which high activity occurs.
  • a sharp edge in the edge may decrease the accuracy of the estimate.
  • the following embodiment provides a computationally light method for selecting the pixels to use to calculate an estimate of DR.
  • the least squares estimate (DR ⁇ se ), e.g., DR u ⁇ s or DR ⁇ sc . is computed.
  • the list of encoded values QV is transformed into candidate decoded values X, where Xi are members of X derived from qi.
  • the xj value is a recovered decoded value formed using the first estimate of DR.
  • the Xi value is defined according to the following equation:
  • New X and Y lists may then be formed by considering only the matches where X[ and yi are close and the least squares estimate recomputed to generate an updated estimate.
  • new lists X and Y are generated by selecting only those matches where gi is less than some threshold. If the new lists are sufficiently long, these lists may be used to generate a refined least squares estimate DR r ⁇ s .
  • DR estimation can be improved by clipping potential DR values and recomputing a DR estimate.
  • the clipped method (DR c i s ) may be combined with other DR estimates , e.g., DR ⁇ se in a weighted average to produce a final DR value.

Abstract

A method and apparatus for hardware efficient decoding of compression coefficients. In one embodiment, a numerator of an equation used to compute (405) a compression coefficient is computed. The denominator is also computed (410). The numerator and denominator values are truncated (415) such that each numerator and denominator are equal in length to a predetermined constant K. A K-bit integer division (425) is then executed to determine the value of the compression constant.

Description

METHOD AND APPARATUS FOR TRUNCATED DECODING
BACKGROUND OF THE INVENTION
1. FIELD OF THE INVENTION
The present invention relates to the recovery of data. More particularly, the present invention relates to the recovery of lost/ damaged block data in a bitstream of compressed data.
2. ART BACKGROUND
It is often desirable to compress data, such as video images or sound data, for transmission and storage. Typically, when data is compressed, compression constants are generated. In some instances block- wide data is generated. These constants are transmitted or stored along with the compressed image. Problems can arise if the compression constants are lost or damaged prior to decompression of the data. As an illustration, the discussion below illustrates the problems that arise if image data compression constants are lost.
The discrete data points that make up a digital image are known as pixels. For example, each pixel is represented independently using 8 bits, but other representations also are used for the purposes of compression analysis. Most of the alternative representations begin by dividing this raw data into disjoint sets. For historical reasons, these sets are referred to as "blocks", even though they may not have a traditional block shape. The alternative representation then characterizes the data by some block-wide information and per-pixel information.
Examples of block-wide information include the minimum pixel value (MIN), the maximum pixel value (MAX), and the dynamic range of the pixel values (DR), where DR=MAX-MIN or DR=1+MAX-MIN. Per-pixel information may indicate where the pixel value lies within the range specified by the global information. For compression to be achieved, the per-pixel information must use only a few bits of storage so that the total number of bits used is less than that required to store the raw image. In one example, the block data is comprised of the MIN, DR and Qbit number (defined below), and the pixel data is comprised of Q codes. A Q code is a Qbit number that corresponds to one value in the set {MIN, MIN+1, ....,MAX}. Since the Qbit number is generally small and the DR value may be relatively large, it is generally not possible to represent all pixel values exactly. Therefore, some quantization error is introduced when pixel values are reduced to Q code values. For instance, if the Qbit number is 3, then it is generally possible to represent 23 = 8 values from the set {MIN, MIN+1,..., MAX} without any error. Pixels with other values are rounded to one of these eight values. This rounding introduces quantization error.
If any of the block information, e.g., MIN, MAX or DR, is lost, the damage to the image is potentially large as many pixels are affected. For this reason, it is desirable to have techniques for accurately estimating or recovering the values of this lost data.
SUMMARY OF THE INVENTION
A method and apparatus for hardware efficient decoding of compression coefficients. In one embodiment, a numerator of an equation used to compute a compression coefficient is computed. The denominator is also computed. The numerator and denominator values are truncated such that each numerator and denominator are equal in length to a predetermined constant K. A K-bit integer division is then executed to determine the value of the compression constant.
BRIEF DESCRIPTION OF THE DRAWINGS
The objects, features and advantages of the present invention will be apparent to one skilled in the art in light of the following detailed description in which:
Figure 1 generally illustrates the processes and apparatus of signal encoding, transmission, and decoding. Figure 2 is a flow diagram illustrating one embodiment of the decoding process in accordance with the teachings of the present invention.
Figure 3 is a flow diagram generally illustrating one embodiment of the data recovery process of the present invention.
Figure 4 is a flow chart illustrating one embodiment of the process of the present invention.
Figures 5a and 5b illustrate embodiments of the system of the present invention.
Figure 6 is a flow diagram of one embodiment of the Qbit and Motion Flag recovery process of the present invention.
Figure 7 is a table illustrating one embodiment of candidate decodings.
Figures 8a, 8b, 8c, 8d illustrate embodiments of measurements utilized in the Qbit and Motion Flag recovery process of Figure 6.
Figure 9 illustrates one embodiment of a table used to determine a square error probability function utilized in the Qbit and Motion Flag recovery process of Figure 6.
Figure 10 illustrates one embodiment of a Qbit, Motion Flag and auxiliary information recovery process in accordance with one embodiment of the present invention.
Figure 11 illustrates the use of a post-amble in one embodiment of a bidirectional Qbit and Motion Flag recovery process.
Figures 12a, 12b and 12c illustrate an alternate embodiment for evaluating candidate decodings.
Figure 13 illustrates the use of smoothness measures in accordance with the teachings of one embodiment of the present invention.
Figures 14a, 14b, 14c, 14d and 14e illustrate an alternate embodiment of a process for evaluating candidate decodings.
Figure 15a illustrates an alternate process for evaluating candidate decodings and Figure 15b illustrates one embodiment for determining weighting values. DETAILED DESCRIPTION
The present invention provides a method for decoding a bitstream to provide for a robust error recovery. In the following description, for purposes of explanation, numerous details are set forth, in order to provide a thorough understanding of the present invention. However, it will be apparent to one skilled in the art that these specific details are not required in order to practice the present invention. In other instances, well known electrical structures and circuits are shown in block diagram form in order not to obscure the present invention unnecessarily.
The signal processing methods and structures are described from the perspective of one embodiment in which the signals are video signals. However, it is contemplated that the methods and apparatus described herein are applicable to a variety of types of signals including audio signals or other digital bitstreams of data, wherein each signal is composed of multiple signal elements. Furthermore the embodiment of the process described herein utilizes the Adaptive Dynamic Range Coding ("ADRC") process to compress data; however a variety of coding techniques and algorithms may be used. For a more detailed discussion on ADRC, see "Adaptive Dynamic Range Coding Scheme for Future HDTV Digital VTR", Kondo, Fujimori and Nakaya, Fourth International Workshop on HDTV and Beyond, September 4-6, 1991, Turin, Italy.
In the above paper, three different kinds of ADRC are explained. These are achieved according to the following equations: Non-edge-matching ADRC:
DR = MAX -MIN + 1
Figure imgf000006_0001
(q + 0.5)- DR . ._- .
— -J + MIN
2β
Edge-matching ADRC:
DR = MAX -MIN
Figure imgf000007_0001
x = 1^. + MIN + 0.5 2β - l
Multi-stage ADRC:
DR = MAX -MIN + l
Figure imgf000007_0002
l(q + .5} DR] + MIN
where MAX' is the averaged value of x' in the case of q - 2a - 1 ; MIN' is the averaged value of x' in the case of q - 0 ; and DR' = MAX' - MIN'
Figure imgf000007_0003
where DR represents a dynamic range value, MAX represents the maximum level of a block, MIN represents the minimum level of a block, x represents the signal level of each sample, Q represents the number of quantization bits (qbit), q represents the quantization code (encoded data or Q code), x' represents the decoded level of each sample, and the square brackets
[_• J represent a truncation operation performed on the value within the square brackets.
The signal encoding, transmission, and subsequent decoding processes are generally illustrated in Figure 1. Signal 100 is a data stream input to Encoder 110. Encoder 110 follows the Adaptive Dynamic Range Coding ("ADRC") compression algorithm and generates Packets 1, . . . Ν for transmission along Transmission Media 135. Decoder 120 receives Packets 1, . . . N from Transmission Media 135 and generates Signal 130. Signal 130 is a reconstruction of Signal 100.
Encoder 110 and Decoder 120 can be implemented a variety of ways to perform the functionality described herein. In one embodiment, Encoder 110 and /or Decoder 120 are embodied as software stored on media and executed by a general purpose or specifically configured computer system, typically including a central processing unit, memory and one or more input /output devices and coprocessors. Alternately, the Encoder 110 and/or Decoder 120 may be implemented as logic to perform the functionality described herein. In addition, Encoder 110 and /or Decoder 120 can be implemented as a combination of hardware, software or firmware.
In the present embodiment Signal 100 is a color video image comprising a sequence of video frames, each frame including information representative of an image in an interlaced video system. Each frame is composed of two fields, wherein one field contains data of the even lines of the image and the other field containing the odd lines of the image. The data includes pixel values which describe the color components of a corresponding location in the image. For example, in the present embodiment, the color components consist of the luminance signal Y, and color difference signals U, and V. It is readily apparent the process of the present invention can be applied to signals other than interlaced video signals. Furthermore, it is apparent that the present invention is not limited to implementations in the Y, U, V color space, but can be applied to images represented in other color spaces.
Referring back to Figure 1, Encoder 110 divides the Y, U, and V signals and processes each group of signals independently in accordance with the ADRC algorithm. The following description, for purposes of simplifying the discussion, describes the processing of the Y signal; however, the encoding steps are replicated for the U and V signals.
In the present embodiment, Encoder 110 groups Y signals across two subsequent frames, referred to herein as a frame pair, of Signal 100 into three dimensional blocks ("3D") blocks. For one embodiment, a 3D block is generated from grouping two 2D blocks from the same localized area across a given frame pair, wherein a two dimensional 2D block is created by grouping localized pixels within a frame or a field. It is contemplated that the process described herein can be applied to different block structures. The grouping of signals will be further described in the image-to-block mapping section below.
Continuing with the present embodiment, for a given 3D block, Encoder 110 calculates whether there is a change in pixel values between the 2D blocks forming the 3D block. A Motion Flag is set if there are substantial changes in values. As is known in the art, use of a Motion Flag allows Encoder 110 to reduce the number of quantization codes when there is localized image repetition within each frame pair. Encoder 110 also detects the maximum pixel intensity value ("MAX") and the minimum pixel intensity value ("MIN") within a 3D block. Using values MAX and MIN, Encoder 110 calculates the dynamic range ("DR") for a given 3D block of data. For one embodiment DR = MAX - MIN + 1 in the case of non-edge-matching ADRC. For edge-matching ADRC, DR = MAX - MIN.
In an alternative embodiment, Encoder 110 encodes signals on a frame by frame basis for a stream of frames representing a sequence of video frames. In another embodiment, Encoder 110 encodes signals on a field by field basis for a stream of fields representing a sequence of video fields. Accordingly, Motion Flags are not used and 2D blocks are used to calculate the MIN, MAX, and DR values.
In the present embodiment, Encoder 110 references the calculated DR against a threshold table (not shown) to determine the number of quantization bits ("Qbits") used to encode pixels within the block corresponding to the DR. Encoding of a pixel results in a quantization code ("Q code"). The Q codes are the relevant compressed image data used for storage or transmission purposes.
In one embodiment, the Qbit selection is derived from the DR of a 3D block. Accordingly, all pixels within a given 3D block are encoded using the same Qbit, resulting in a 3D encoded block. The collection of Q codes, MIN, Motion Flag, and DR for a 3D encoded block is referred to as a 3D ADRC block. Alternately, 2D blocks are encoded and the collection of Q codes, MIN, and DR for a given 2D block results in 2D ADRC blocks.
A number of threshold tables can be implemented. In one embodiment, the threshold table consists of a row of DR threshold values. A Qbit corresponds to the number of quantization bits used to encode a range of DR values between two adjacent DRs within a row of the threshold table. In an alternative embodiment, the threshold table includes multiple rows and selection of a row depends on the desired transmission rate. Each row in the threshold table is identified by a threshold index. A detailed description of one embodiment of threshold selection is described below in the discussion of partial buffering. A further description of ADRC encoding and buffering is disclosed in US Patent no. 4,722,003 entitled "High Efficiency Coding Apparatus" and US Patent no. 4,845,560 also entitled "High Efficiency Coding Apparatus", assigned to an assignee of the present invention.
The Q codes are referred to as variable length data ("VL-data"). In addition, the DR, MIN, and Motion Flag are referred to as block attributes or compression constants. The block attributes, together with the threshold index, constitute the fixed length data ("FL-data"). Furthermore, in view of the above discussion, the term block attribute describes a parameter associated with a component of a signal element, wherein a signal element includes multiple components.
Frames, block attributes, and VL-data describe a variety of components within a video signal. The boundaries, location, and quantity of these components are dependent on the transmission and compression properties of a video signal. The encoded data is formed into packets and transmitted across a transmission media or stored on a storage media. Figure 2 is a simplified flow diagram illustrating one embodiment of decoding process performed by Decoder 120. Figure 2 further describes, in different combinations of Qbit, Motion Flag, DR, MIN and pixel data, an innovative process for error recovery. More particularly, a method and apparatus for recovering lost or damaged (lost/ damaged) compression constants of data to be decoded is described. In the present embodiment, the lost/damaged compression constants may include DR, MIN and/or MAX. Referring to Figure 2, the data received in packets is processed to decode the data encoded in the received bitstream. At step 205, data is received. In one embodiment, the data may be deshuffled, step 210, if the data was shuffled at time of encoding. ADRC decoding is then applied to the data, step 245, in accordance with the teachings known in the art.
A recovery process is performed to recover the Qbit and Motion Flag values that were located in lost packets. The Qbit value is lost typically due to DR loss (due to lost packets). When the Qbit or Motion Flag value is unknown, the location of Q code bits corresponding to a pixel cannot be determined from the data bitstream. If a Qbit or Motion Flag value is improperly determined then this error will propagate to subsequent data as the starting point of subsequent blocks in the bitstream will be incorrectly identified. A number of techniques may be used to recover Qbit or Motion Flag values.
Figure 3 describes the general process for recovering the Qbit and Motion Flag values in accordance with the teachings of the present invention. This particular embodiment describes the process using multiple blocks of data to recover the Qbit and Motion Flag values; however, the particular number of blocks could be one or more blocks. Referring to Figure 3, based on the detection of an error in the bitstream, step 305, candidate decodings based on specified parameters are generated for the three blocks examined. At step 315, each candidate decoding is scored on the likelihood that it is an accurate decoding and at step 320, the candidate decoding with the best score is used. The Qbit and Motion Flag values identified enable the subsequent decoding of pixels of the affected blocks.
Referring back to the decoding process of Figure 2, once the best decoding is selected, any DR or MIN values that were lost due to lost packets are recovered, step 265. A variety of recovery processes known to one skilled in the art can be applied to recover DR and MIN, including least squares or the averaging of values of adjacent blocks.
In one embodiment, DR is recovered in accordance with the following equation: Edge-matching ADRC:
(2δ - l)' ∑( θ.5 - /N)-
DR'= ι=l
ι=l
Νon edge-matching ADRC:
2Ω ■ j_.(y , - MIN)- {e, + 0.5)
DR'= ι=l
∑(e, + 0.5)2 ι=l where DR' is the recovered DR value, Ν represents the number of terms used (e.g., the number of neighboring values yj used), ej is the i-th encoded value in an ADRC block and e;ε {0,1,...2Q-1}; y is a decoded value of a corresponding adjacent block pixel, MIΝ is the MIΝ value of the block and Q is the Qbit value.
In one embodiment, MIΝ is determined according to the following equation: Edge-matching ADRC:
Figure imgf000012_0001
Νon edge-matching ADRC
Figure imgf000012_0002
where MIΝ' is the recovered MIΝ value and DR is the DR value of the block.
Alternately, the above DR' formulae may be simplified to eliminate second order terms to obtain a more cost-effective solution. As the MIΝ' formulae do not contain second order terms, no such simplification is necessary. Thus, DR may be determined as follows:
Edge-matching ADRC:
(2G - l)" ∑( , - 0.5 -MIN)
DR'= .=1
Σ 1=1 «. Non edge-matching ADRC:
Figure imgf000013_0001
In another embodiment, if the DR and MIΝ of the same block are damaged at the same time, DR and MIΝ may be recovered according to the following equations: Νon edge-matching ADRC:
N-∑(e,+0.5)-y,-∑(e,+0.5)-∑y,
DR'= ι=l ι=l
N-∑(e,+0.5)2 ∑(e,+0.5)
1=1 ι=l
Figure imgf000013_0002
Edge-matching ADRC:
Figure imgf000013_0003
DR' y,-
MIN'= ι=l 2MΪ
N
Alternately, a hardware efficient implementation may be used to determine the compression constants DR and MIΝ. In one embodiment, MIΝ is determined according to the following equation:
1 N DR
MIN'= — y (e,+0.5)
N ! y,
2β where ej represents encoded data of the block.
The equation can be expanded to a quotient consisting of a numerator calculation and denominator calculation:
2Q+i-∑ ιyl-DR-∑ -(2-el+l)
MIN'=
2βMN Similarly, with respect to DR, DR is estimated as follows:
2β-∑(y,- /N)
DR'-- 1=1
∑fe+0.5) ι=l
The above equation can be expanded to:
2β+1-∑(y,-M/N)
DR'= 1=1
∑(2-e,+l)
(=1
The equations above can be efficiently implemented and executed in hardware. One embodiment of the process is illustrated in Figure 4. At step 405, the numerator value is determined using full precision. At step 410, the denominator is determined using full precision. At step 415, the numerator and the denominator are reduced to at least K-bits in length. For example, in one embodiment: while (n ≥ 2K or d > 2K ), then n/2 (shift off LSB) d/2 (shift off LSB).
In one embodiment, the numerator and denominator are shifted the same number of bits. Alternately, the numerator and denominator may be shifted differing number of bits; in such an embodiment, the different amounts of shifts may be compensated for in subsequent computations.
A value of K is selected such that integer division can be performed using cost efficient logic while maintaining an acceptable image quality. In one embodiment, K is selected such that the maximum error is not usually detectable, e.g., the maximum error is not greater than 3%. In the present embodiment in which an 8 bit encoding is used, K is selected to be 13.
At step 425, the division operation is performed if the answer does not overflow or underflow. If an underflow or overflow occurs, the value is respectively clipped to the lower bound or upper bound. This may be determined by comparing the numerator to values corresponding to the product of the bound and the denominator. In the present example, the following steps would therefore be performed: For MIN: ii n ≥ {UM - d)
MIN = UM (clip to upper bound) else if n ≤ (LM - d)
MIN = LM (clip to lower bound) else
MIN = — (K bit division) d where LM represents the lower bound of MIN allowed by auxiliary information, and UM represents the upper bound of MIN allowed by auxiliary information.
Auxiliary information, in one embodiment, consists of predefined compression information used for a particular application. In one embodiment, the lower and upper bounds are respectively the lower and upper bounds of the range of pixel values represented by the number of quantization bits used. Furthermore, the range can be restricted to MIN + DR ≤ MAX, where MAX is the maximum pixel value. In one embodiment, for 8 bit encoding, MAX is equal to 255. Similarly, for DR: if n ≥ (UD - d)
DR = UD (clip to upper bound) else if n ≤ (Ld - d)
DR = LD (clip to lower bound) n else DR = — (K bit division) d where LD represents the lower bound of DR allowed by auxiliary information, and UD represents the upper bound of DR allowed by auxiliary information. The bounds for DR are similarly determined as discussed above.
At step 430, the value is clipped for consistency with the auxiliary information available, for example other available compression constants. In the present example, the value is clipped according to the following equation which comprises the functions for clipping to a lower bound or upper bound: For MIN:
Figure imgf000016_0001
where max represents a maximum function, min represents a minimum function, and UMD represents the upper bound defined by MIN+DR; Similarly, for DR:
DR = max(min(DR,[/MD -MIN),LD )
One embodiment of circuitry used to implement the hardware efficient implementation described above is illustrated in the simplified functional block diagram of Figure 5a. For example, the circuitry may be implemented in specially configured logic, such as large scale integration (LSI) logic or programmable gate arrays. Alternately, as is illustrated in Figure 5b, the circuitry may be implemented as code executed by a dedicated, specially configured or general purpose processor, which executes instructions stored in a memory or other storage device. Furthermore, the present invention may be implemented as a combination of the above.
Referring to Figure 5a, numerator logic 510 determines the full precision value of the numerator portion of the computation performed to estimate a lost/damaged compression constant in the encoded domain. Denominator logic 520 similarly determines a full precision value of the denominator portion of the computation. A least significant bit shift operation is performed on the numerator and denominator until the numerator and denominator are each, at a minimum, K- bits in length. In one embodiment, the shift operation is performed by computation logic 550. Alternately, K-bit shift registers may be used to shift out least significant bits of the values generated by logic 510, 520 until the numerator and denominator are K-bits in length.
As will be described in more detail below, K is chosen, for example, empirically, such that the amount of logic /hardware required to perform subsequent operations is minimized for efficiency while maintaining an acceptable level of precision.
Computation logic 550 performs an integer division of the numerator and denominator. If an overflow or underflow will occur, the value is clipped to an upper bound or lower bound respectively defined by known compression constants.
Clip logic 560 is optionally included to clip the output of computation logic 550 to be consistent with other available compression constants. For example, the value may be clipped to a lower bound of the compression constant as best estimated based upon auxiliary information that defines the range of compression values. This information may be predefined according to the particular embodiment of encoding and decoding processes. Thus, this structure provides a fast, cost efficient circuit for estimating lost/ damaged compression constants.
The output of the circuit is preferably coupled to additional logic (not shown) which decodes using data including the recovered compression constant. In one embodiment in which the data is image data, the decoded data is used to drive a display device.
An alternate embodiment of the circuit for recovering lost /damaged compression constants is shown in Figure 5b. The methods described herein can be implemented on a specially configured or general purpose processor system 570. Instructions are stored in the memory 590 and accessed by the processor 575 to perform many of the steps described herein. An input 580 receives the input bitstream and forwards the data to the processor 575. The output 585 outputs the data. In one embodiment, the output may consist of the decoded data, such as image data decoded once the compression constant is recovered, sufficient to drive an external device such as display 595. In another embodiment, the output 585 outputs the recovered compression constant. The recovered compression constant is then input to other circuitry (not shown) to generate the decoded data.
Referring back to Figure 2, at step 270, ADRC decoding is applied to those blocks not previously decoded. A pixel recovery process is executed, step 275, to recover any erroneous pixel data that may have occurred due to lost packets or random errors. In addition a 3:1:0 -> 4:2:2 back conversion is performed, step 280, to place the image in the desired format for display.
Figure 6 illustrates one particular embodiment of the Qbit and Motion Flag recovery process of the decoding process of the present invention. In this particular embodiment, the inputs to the process are adjacent block information. The block attributes include a compression constant and pixel data for the three blocks to be processed. Error flags indicating the location of the lost data are also input. The error flags can be generated in a variety of ways known to one skilled in the art and will not be discussed further herein except to say that the flags indicate which bits were transmitted by damaged or lost packets.
At step 605, the candidate decodings are generated. The candidate decodings can be generated a variety of ways. For example, although the processing burden would be quite significant, the candidate decodings can include all possible decodings. Alternately, the candidate decodings can be generated based on pre-specified parameters to narrow the number of candidate decodings to be evaluated.
In one embodiment, the candidate decodings are determined based on the possible key values used to shuffle the encoded data. In addition, it should be noted that candidate decodings are further limited by the length of the bits remaining to be decoded and knowledge of how many blocks remain. For example, as will be discussed, if processing the last block typically the decoding length of that block is known. In one embodiment, shuffling is performed using a masking key. Thus, during the encoding process, a key, referred to herein as KEY, is used to mask a bitstream of Q codes. KEY may be used to mask a bitstream of Q codes corresponding to three blocks of data. Each key element (di) of the masking key is generated by the combination of certain compression constants, used to encode a corresponding block of data.
For example, in one embodiment, the MF and Qbit values are used to define KEY. Alternately, the masking key is generated from DR and MIN values. More particularly, for 4 bit ADRC encoding which uses MF and Qbit values to generate KEY, the value of the key elements composing KEY are determined in accordance with the following equation: di = 5 -mi + qi where i=0, 1, 2 and qi represents the number of quantization bits; qi = 0, 1, 2, 3, 4 and m represents the motion flag (MF) value, for example, 0 for a stationary block and 1 for a motion block.
Continuing with the present example, if KEY is generated using three blocks, KEY is formed according to the following:
Figure imgf000019_0001
It therefore follows that during recovery of MF of Qbit data, possible KEY values are regenerated depending upon the values used to create the masking keys. The regenerated KEY values are used to unmask the received bitstream of Q codes resulting in candidate encoded data. Thus, if the MF or Qbit value used to generate the mask is not correct, the corresponding Q codes will exhibit a low level of correlation, which will be typically readily detectable.
Continuing with the present example, Figure 7 illustrates possible cases for the present embodiment, where the value x indicates an unknown value (which may be due to packet loss). This is further explained by example. The variable mj is defined as the Motion Flag of the i-th block, qi is the number of the quantization bits of the i-th block, nj is the number of possible candidates of the i-th block and di is the value of a key element of the i-th block. The i-th block is defined within each group.
In this example, the number of blocks within each group is three. A key for the three block group is generated as, do + 10-dι + 100 -02- Assuming that in the first block the Motion Flag is unknown and the number of quantization bits is 2, mo equals x and qo equals 2. Following the equation described above to generate the key element, di = 5 -mi + q , the set of possible digits for do consists of {2 and 7). Thus, the number of possible values (no) is 2.
Assuming the second block to have a Motion Flag value of 1 and one quantization bit, and the value for di is 5- 1+1 = 6 and
Figure imgf000020_0001
= 1. The third block has a Motion Flag value of 1 and an unknown number of quantization bits. Thus, the digit d2 includes a set consisting of {6, 7, 8, 9} and n2 = 4. Thus, the number of possible candidates of this group, M, is 2 1 4 = 8, and the keys used to generate the candidate decodings are the variations of 662, 667, 762, 767, 862, 867, 962, 967. This process is preferably used for each group which was affected by data loss.
As noted in Figure 3, at step 315, once the data has been decoded in accordance with the key data, the candidate decodings generated are evaluated or scored on the likelihood that it is a correct decoding of the data. Furthermore, at step 320, the candidate decoding with the best score is selected to be used.
A variety of techniques can be used to score the candidate decodings. For example, the score may be derived from an analysis of how pixels or blocks of a particular candidate decoding fit in with other pixels of the image. Preferably the score is derived based upon a criteria indicative of error, such as a square error and correlation. For example, with respect to correlation, it is a fairly safe assumption that the adjacent pixels will be somewhat closely correlated. Thus, a significant or a lack of correlation is indicative that the candidate decoding is or is not the correct decoding.
As is shown in Figure 6, four different criteria are analyzed to select the best candidate decoding. However, one, two, three or more different criteria can be analyzed to select the best candidate decoding. Referring to Figure 6, the present embodiment utilizes four subscoring criteria which are subsequently combined into a final score. In particular, in step 615, the square error measure is generated; in step 620, horizontal correlation is determined; in step 625, vertical correlation is determined; and at step 630 temporal activity is measured. Each step utilizes an M- by-2-N matrix in accordance with M candidates, N blocks and 2 frames /block of data. Although horizontal and vertical correlation is discussed herein, it should be recognized that a variety of correlation measurements, including diagonal correlation, can be used.
At steps 635, 640, 645, 650, a confidence measure is generated for each criterion to normalize the measurements generated. At steps 655, 660, 665 and 670, a probability function for each of the different criteria is generated. These probability functions are then combined, for example, by multiplying the probability values to generate a score, for example, the likelihood function shown in Figure 6, step 675. The score for the candidate decoding is subsequently compared against all candidate decoding scores to determine the likely candidate.
A variety of techniques can be used to evaluate the candidate decodings and generate the "scorings" for each candidate. For example, confidence measures are one way of normalizing the criteria. Furthermore, a variety of confidence measures, besides the ones described below, can be used. Similarly, multiplying the probability values based on each criterion to generate a total likelihood function is just one way of combining the variety of criteria examined.
The encoding processes facilitate the determination of the best candidate decoding because typically the candidate decodings which are not the likely candidate will have a relatively poor score, while decodings that are the likely candidate will have a significantly better score.
Figures 8a, 8b, 8c and 8d provide illustrations of the different measurements performed at steps 615, 620, 625 and 630 of Figure 6 to generate the scoring and total score for a particular candidate decoding. Figure 8a illustrates the square error to evaluate a candidate decoded pixel Xj as compared to its decoded neighbors yi i, wherein the suffix "i,j" is corresponding to the neighboring address of "i".
Optionally, some of the largest terms are removed to remove any influences due to spikes, that is the terms that arise due to legitimate edges in the image. For example, the three largest terms of (xry )*^ may be discarded to remove spikes.
Figure 8b illustrates the temporal activity criteria. This is applicable only when it is or is assumed to be a motion block. The temporal activity criteria assumes that the better the candidate decoding, the smaller the differences between blocks. Thus the worse the candidate decoding, the larger the differences between blocks. Spatial correlation assumes that the more likely candidate decodings will result in heavy correlations as real images tend to change in a slow consistent way. The horizontal correlation process illustrated in Figure 8c and vertical correlation process illustrated by Figure 8d utilize that assumption.
The confidence measures, steps 635, 640, 645, and 650 of Figure 6, provide a process for normalizing the criteria determined in the previous steps (steps 615, 620, 625 and 630). In one embodiment, for example, the confidence measure for the square error takes values from the interval [0,1], and confidence is equal to 0 if the errors are equal and equal to 1 if one error is 0. Other measures or methods to normalize are also contemplated.
Similarly, the confidence measure for the spatial correlation is: maximum(Y,0) - maximum(X,0) where Y is the best correlation value and X is the correlation for the current candidate decoding. The temporal activity confidence measure is determined according to the following equation: conf = (a-b) / (a+b) where a = max (X, M_TH) and b = max (Y,M_TH) where M_TH is the motion threshold for the candidate block and Y is the best measurement, that is the smallest temporal activity, and X equals the current candidate measurement of temporal activity. At steps 655, 660, 665 and 670, Figure 6, the probability function is generated for each of the different criteria. A variety of methods can be used to generate the probability measure. For example, a score can be prescribed to a confidence measure. If the confidence measure is greater than a predetermined value, e.g., 0.8, the base score is decreased by 10; if between 0.5 and 0.8, the base score decreased by 5...
Figure 9 illustrates one embodiment in which a table used to generate the probability function for the square error measurement criteria. The table includes empirically determined data containing arbitrarily binned confidence and square error measures and known candidate decodings. More particularly, the table can be generated by using undamaged data and assuming that the DR was corrupted or lost. Keys and confidence measures for correct and incorrect decodings are then generated.
The table reflects the probability ratio of correct to incorrect decodings. Using this table, for a particular squared error value (row) and confidence value (column), the probability can be determined. For example, it can therefore be seen that for a variety of square error measures at a confidence measure of zero, there is approximately a 40% to 50% probability that the candidate is correct. If the confidence is not 0, but small, the probability drops significantly. Similar probability tables are generated for the correlation and temporal measurements based on corresponding empirically determined criteria measurements and confidence measurements.
The probabilities generated are considered data to generate "scores" in the present embodiment. Other techniques to score candidate decodings may also be used. At step 1875, the different probabilities are combined into a likelihood
function Lj = 7lj-Pi,j, where 7tj is a multiplication function of probability functions
Pi,j, and Pi,j,is the probability function for candidate i, block j. The candidate is therefore selected as the one that maximizes the function L[. Referring back to Figure 6, it may be necessary to recover certain block attributes that were transmitted in lost packets. Therefore, at step 610, DR and MIN values are recovered where necessary. A variety of techniques, from default values, averaging, squared error functions to more sophisticated techniques, including those discussed in Kondo, Fujimori, Nakaya and Uchida, "A New Concealment Method for Digital VCRs", IEEE Visual Signal Processing and Communications, September 20-22, 1993, Melbourne Australia, may be used. The recovered values are utilized to generate the candidate decodings as discussed above.
Alternately, the DR and MIN values are determined during the Qbit determination process. This is illustrated in Figure 10. In particular, as noted above, in the present embodiment, the Motion Flag and number of quantization bits are used in the encoding process and later used during the recovery process to narrow the number of possible candidate decodings. Other information can also be used. Thus the value of DR and /or value of MIN may also be used to encode the data. Alternately, a portion of bits of DR are used for encoding (e.g., the two least significant bits of DR). Although the DR data is encoded, the number of possible candidate decodings is increased significantly as variables are added. Referring to Figure 10, K-M candidate decodings are therefore generated, where K is the number of candidate values for the unknown data, e.g. K=4 if two bits of the sum of DRi, DR2 and DR3 is encoded (DRi, DR2 and DR3 represent the DR values of the blocks of the group). The DR and MIN are therefore recovered using the auxiliary information provided, e.g., the encoded two bits of the sum of DRi, DR2 and DR3. This improves the process of candidate selection at the cost of additional overhead to examine the larger number of candidate decodings.
It should be noted that generally, the more neighboring blocks that are decoded, the better the Qbit and Motion Flag recovery process. Furthermore, in some embodiments the process is applied to each subsequent block of a buffer; if all or some of the FL-data is available, the number of candidate decodings can be reduced, possibly to one candidate decoding given all the FL-data for a block is available. However, it is desirable that the Qbit and Motion Flag recovery process be avoided altogether as the process is a relatively time consuming one. Furthermore, it is desirable to use as much information as possible to perform Qbit and Motion Flag recovery.
In one embodiment, blocks are processed from the beginning of a buffer until a block with lost Qbit/Motion Flag information is reached. This is referred to as forward Qbit and Motion Flag recovery. In another embodiment, the end of the buffer is referenced to determine the location of the end of the last block of the buffer and the data is recovered from the end of the buffer until a block with lost Qbit/Motion Flag data is reached. This is referred to as backward Qbit and Motion Flag recovery.
As noted earlier, the blocks are variable in length, due the length of the VL- data; therefore there is a need to determine the number of bits forming the VL-data of a block so that the position of subsequent blocks in the buffer can be accurately located. During the encoding process, a post-amble of a predetermined and preferably easily recognizable pattern is placed in the buffer to fill the unused bit locations. During the decoding process, the post-amble will be located between the block and the end of the buffer. As the pattern is one that is easily recognizable, review of patterns of bits enables the system to locate the beginning of the post- amble and therefore the end of the last block in the buffer.
This information can be used in two ways. If the last block contains damaged Qbit/Motion Flag data and the beginning of the last block is known (e.g., the preceding blocks have been successfully decoded), the difference between the end of the immediate preceding block and the beginning of the post-amble corresponds to the length of the block. This information can be used to calculate the Qbit and /or Motion Flag of the block. The starting location of the post-amble can also be used to perform Qbit and Motion Flag recovery starting at the last block and proceeding towards the beginning of the buffer. Thus, the Qbit and Motion Flag recovery process can be implemented bidirectionally. Figure 11 illustrates the use of a post-amble in the bidirectional Qbit and Motion Flag recovery process. Referring to Figure 11, the buffer 1100 includes FL- data 1103 for the N groups of blocks of VL-data. Each group consists of a plurality of blocks (e.g., 3 blocks). In the present example, the first two groups 1105, 1110 are decoded and the third group 1115 cannot immediately be decoded due to damaged DR/Motion Flag data. At this point, the Qbit/Motion Flag recovery process is required in order to recover the damaged data. Rather than continue processing groups in the forward direction, the process refers to the end of the buffer, determined by looking for the post-amble pattern 1120. The beginning of the post-amble and therefore the end of the last group of blocks are determined. As the DR/Motion Flag data is indicative of the length of the VL-data, the beginning of the VL data of the last block, and therefore the end of the immediate preceding block, is determined. Therefore, the blocks can be decoded , e.g., blocks 1125, 1130, 1135 until a block 1140 with damaged data is reached. The damaged 1115, 1140 and obstructed blocks 1150 are then recovered, for example, using the Qbit/Motion Flag recovery process described above.
The bidirectional process is not limited to a sequence of forward and reverse processing; processing can occur in either or both directions. Furthermore, in some embodiments, it may be desirable to perform such processing in parallel to improve efficiency. Finally, it is contemplated that undamaged obstructed blocks may be recovered by directly accessing the Qbit/Motion Flag information without executing the Qbit/Motion Flag recovery process described above.
As noted earlier, a variety of scoring techniques may be used to determine the best candidate decoding to select as the decoding. In an alternate embodiment, the smoothness of the image using each candidate decoding is evaluated. In one embodiment, the Laplacian measurement is performed. The Laplacian measurement measures a second-order image surface property, e.g., surface curvature. For a linear image surface, i.e., smooth surface, the Laplacian measurement will result in a value that is approximately zero. The process will be explained with reference to Figures 12a, 12b, and 12c. Figure 12a illustrates one embodiment of the Laplacian kernel. It is contemplated that other embodiments may also be used. The kernel "L" represents a 3x3 region. To measure smoothness of the image region, 3x3 subregions of the image (Figure 12b) are convolved with the kernel and the convolved values are averaged. The size of the region and subregion (and therefore kernel size) can be varied according to application.
One embodiment of the process is described with reference to Figure 12c. This embodiment utilizes a kernel and subregion size of 3x3 and a region size of 8x8, the individual elements identified by indices i,j. At step 1260, the candidate decoded values x[i][j] are normalized. For example, the values can be normalized according to the following equation:
Figure imgf000027_0001
Figure imgf000027_0002
"here, Xmmeeaan = ''' ^ , 0 ≤ U < 8
At step 1265, the normalized values are used to compute a block Laplacian value Lx indicative of smoothness according to the following:
#][/]= Σ ∑ [m][n]-x'[i + m][j + n , 0≤ i,j < S m = -ϊ n=-l
Figure imgf000027_0003
x 64
The closer the block Laplacian value is to zero, the smoother the image portion. Thus a score can be measured based upon the block Laplacian value, and the decoding with the least Laplacian value is the correct one. The Laplacian evaluation can also be achieved using candidate encoded values q[i][j] . The basic process is the same as the candidate decoded value case of Figure 12c. This embodiment utilizes a kernel and subregion size of 3x3 and a region size 8x8, the individual elements identifies by the indices i,j. At step 1260, the candidate encoded values q[i][j] are normalized. For example, the values can be normalized according to the following equation:
Figure imgf000028_0001
where, Q =
At step 1265, the normalized values are used to compute the block Laplacian value Lq indicative of smoothness according to the following equation:
Figure imgf000028_0002
The closer the block Laplacian value is to zero, the smoother the image portion. Thus a score can be measured based upon the block Laplacian value and the candidate with the smallest Laplacian value is the correct one.
Other variations are also contemplated. In alternative embodiments, higher order image surface properties can be used as a smoothness measure. In those cases, higher order kernels would be used. For example, a fourth order block Laplacian measurement may be performed using a fourth order kernel. Such a fourth order kernel can be realized using two second order Laplacian computations in cascade.
The evaluation process can be dependent upon whether the image has an activity or motion larger than a predetermined level. If the image portion is evaluated to have larger motion than a predetermined level, then it may be preferable to perform the measurements on a field basis as opposed to on a frame basis. This is explained with reference to Figure 13. Figure 13 explains the process using smoothness measures; however, it is contemplated that this process can be implemented using a variety of types of measures.
Frame 1305 of an image region is composed of field 0 and field 1. If motion is not detected, step 1310, the smoothness measurement is computed by computing the block Laplacian value for the block within each frame, step 1315. If larger motion than a predetermined level is detected, block Laplacian measurements are performed on each field, steps 1320, 1325, and the two measurements are combined, step 1330, e.g. averaged, to generate the smoothness measurement.
Motion can be detected /measured a variety of ways. In one embodiment, the extent of change between fields is evaluated and motion is detected if it exceeds a predetermined threshold.
Motion detection and the use of frame information and field information to generate recovered values (typically to replace lost or damaged values) can be applied to any portion of the process that requires a recovered value to be generated. For example, motion detection and the selective use of frame information and field information to generate recovered values can be applied to DR/MIN recovery, pixel recovery as well as Qbit and Motion Flag recovery processes. Thus, based on the level of motion detected, the recovery process will utilize existing information on a field basis or frame basis. Furthermore, this process can be combined with the application of weighting values that are selected based upon levels of correlation in particular directions (e.g., horizontal or vertical).
In another embodiment of the Qbit and Motion Flag recovery process, candidate decodings are evaluated based upon intra block and inter block measurements. In the following discussion, the term "block" refers to a portion of a frame or field. The intra block measurement evaluates the candidate decoded image portion, e.g., the smoothness of the image portion. The inter block measurement measures how well the candidate decoding fits with the neighboring image portions. Figures 14a and 14b illustrate the combined inter block and intra block evaluation. In particular, Figure 14a shows an acceptable candidate decoding as both the inter block and intra block measurements are good, whereas in Figure 14b the inter block measurement is poor, even though the intra block measurement is quite good.
Examples of intra block measurements include the smoothness measurement described above. Examples of inter block measurements include the square error measurements described earlier. An alternative inter block measurement is the ratio of compatible boundary pixels and the total number of boundary pixels at the candidate ADRC block.
An example of an inter block and intra block evaluation of an 8x8 block that is ADRC encoded will be explained with respect to Figures 14c, 14d and 14e. Figure 14d illustrates an image portion (block) of data of a encoded values 1450 consisting of q values from which candidate decoded values x are generated and neighboring decoded data 1455 consisting of y values. As set forth in the flow chart of Figure 14c, at step 1405, the intra block measure is computed to generate a measure, e.g., block Laplacian Lx. At step 1410, the inter block measure Sx is computed to generate a measure of compatibility between adjacent blocks. At step 1415, the combined measure Mx is generated. The combined measure provides the information used to select a candidate decoding.
In the present embodiment, Sx is computed as the number of neighboring data that lies in a valid range for each boundary pixel of candidate decoding (see Figure 14e). Figure 14e is a chart illustrating a valid range for one embodiment which shows a valid range of each observed quantized value qi. Thus LQ < DR < UQ, where LQ, UQ respectively represent the lower and upper bounds of DR corresponding to the number of quantization bits = Q. Preferably Sx is normalized according to the following: Sx = Sx/number of boundary pixels.
In the present embodiment the combined measure Mx is computed according to the following equation: Mx = Sx + (1-LX). Alternatively, the combined measure may be weighted such that the following equation would be used: Mx =w ■ SX + (1 - w) • (1-LX), where w is the weighting value, typically an empirically determined weighting value.
Other embodiments for determining DR and MIN values that have been lost /damaged are also contemplated. For example, the earlier described equations can be modified to recover DR and MIN values with higher accuracy. In an alternate embodiment, a median technique is applied. In one embodiment of the median technique, the value of MIN is recovered as the median of all MINi values computed as:
Figure imgf000031_0001
where qi represents the encoded pixel value and yi represents the decoded pixel neighboring qi. For edge-matching ADRC, s = DR/(2Q - 1). For non-edge-matching
ADRC, s = DR/2Q, where Q represents the number of quantization bits per pixel (Qbit value).
The values used may be temporally proximate or spatially proximate. The values of yi may be the decoded value of the neighboring pixel in an adjacent frame/field or the same field. The values of yi may be the decoded value of the pixel from the same location as qi in an adjacent frame/field or the same field.
In addition, any DR and /or MIN recovery technique may be combined with a clipping process to improve recovery accuracy and prevent data overflow during the recovery process. The clipping process restricts the recovered data to a predetermined range of values; thus those values outside the range are clipped to the closest range bound. In one embodiment, the clipping process restricts values in the range [LQ, UQ], where LQ, UQ respectively represent the lower and upper bounds of the range of pixel values represented by the number of quantization bits Q. The values can be further restricted to MIN + DR < Num, where Num represents the minimum pixel value; in the present embodiment, Num is 255. Furthermore, in the present embodiment, where applicable, UQ +1 = LQ+I. Combining the criteria into a single equation results for an unbounded recovered value (val') for the DR, the final clipped recovered value (val) is obtained from the following equation: val = max(min(val, min(UQ,255-MIN)),LQ) where min and max respectively represent minimum and maximum functions.
In an alternate embodiment, the boundary pixels yj used to generate an recovered DR and /or MIN can be filtered to only use those that appear to correlate best, thereby better recovering DR and MIN. Those boundary pixels not meeting the criteria are not used. In one embodiment, a boundary pixel yj is considered valid for DR calculations if there exists a value of DR such that LQ≤ DR<UQ and an original pixel yi would have been encoded as qi. Thus, a pixel is valid if the following equations are satisfied:
_ (__y__ - MIN) -- m > J_ maxfø, - 0.5,0) δ
(y -MIN)- m
-^M r < UQ min , -0.5, m)
where m represents the maximum quantization level = 2Q-1. A DR recovered value (val') can then be computed according to the following equation: m - ∑(y, -MIN)- q, vaV=
β,2
The value can then be clipped into the valid range. Thus this process forces the DR recovered value into the interior of the valid region as defined by the threshold table, reducing the accuracy for points whose true DR lies near the threshold table boundary.
Due to quantization noise, the DR of stationary ADRC blocks varies slightly from frame to frame. If this variance crosses an ADRC encoding boundary, and if the DR is recovered on several consecutive frames, then the DR recovered value with valid pixel selection tends to overshoot at each crossing, resulting in a noticeable blinking effect in the display. In an attempt to reduce the occurrence of this effect, in one embodiment, the valid pixel selection process is modified to relax the upper and lower bounds, allowing border pixels that encroach into the neighboring valid region. By including points just outside the boundary, it is more likely that the recovered value will take on a value near that of the upper or lower bound. The relaxed bounds L'Q and U'Q are computed by means of a relaxation constant r. In one embodiment, r is set to a value of .5. Other values can be used:
L'Q = ΓLQ_I + (1-r) LQ
U'Q = (l-r)UQ + rUQ+ι
The discussion above sets forth a number of ways to recover DR and MIN when the values have been damaged or lost. Further enhancements can be realized by examining the correlation between data temporally and/or spatially, and weighting corresponding calculated recovered values accordingly. More particularly, if there is a large correlation in a particular direction or across time, e.g., horizontal correlation, there is a strong likelihood that the image features continue smoothly in that direction that has a large correlation and therefore an recovered value using highly correlated data typically generates a better estimate. To take advantage of this, boundary data is broken down into corresponding directions (e.g., vertical, horizontal, field-to-field) and weighted according to the correlation measurement to generate a final recovered value.
One embodiment of the process is described with reference to Figure 15a. At step 1510, a recovered value of the DR or MIN value to be recovered is generated in one direction and at step 1515, a recovered value is generated in another direction. For example, if the process is spatially adaptive, then boundary pixels along horizontal borders are used to generate a first recovered value, "hest", and boundary pixels along vertical borders are used to generated a second recovered value, "vest". Alternately, if the process is temporally adaptive, then boundary pixels between adjacent fields are used to generate a first recovered value and boundary pixels between adjacent frames are used to generate a second recovered value.
At step 1520, the recovered values are weighted according to correlation calculations indicative of the level of correlation in each direction. The weighted first and second recovered values are combined to generate a combined recovered value, step 1525. The process is not limited to generated weighted recovered values in only two directions; the number of recovered values that are weighted and combined can be varied according to the application.
A variety of known techniques can be used to generate a correlation value indicative of the level of correlation in a particular direction. Furthermore, a variety of criteria can be used to select the weighting factor in view of the levels of correlation. Typically, if one correlation is much larger than the other, the combined recovered value should be based primarily on the corresponding recovered value. In one embodiment, the combined recovered value is computed as follows:
Figure imgf000034_0001
- ct)vest : hc ≥ vc vaϊ= \
I (1 - a) hest + ccvest : he < vc I
where he represents the horizontal correlation, vc represents the vertical correlation, hest represents a DR recovered value based only on left and right boundary information, and vest represents a DR recovered value based only on top and bottom boundary information, and a represents the weighting value.
The weighting value can be determined a variety of ways. Figure 15b illustrates one embodiment for determining weighting values as a function of the difference between the horizontal correlation and vertical correlation. More particularly, a was chosen to be: < α35
0.35
Figure imgf000034_0002
As noted above, the adaptive correlation process is applicable to both DR and MIN recovery. It is preferred, however, that the MIN recovery is clipped to insure that MIN + DR ≤ 255, therefore the function val = max(min(val', 255 - MIN), 0) can be used. Furthermore, as noted above, the temporal correlation can be evaluated and used to weight recovered values. In addition, a combination of temporal and spatial correlation can be performed. For example, one recovered value is generated between fields as a temporal recovered value. Another recovered value is generated within one field as a spatial recovered value. The final recovered value is computed as the combination value with a combination of temporal and spatial correlation. The correlation combination can be replaced with a motion quantity. Other variations are also contemplated. For example, the techniques described herein can be applied to audio data.
In an alternate embodiment, a low complexity modification to the least squares technique is used. Using this embodiment, the blinking experienced due to recovered DR values is reduced. For purposes of the following discussion, QV represents a list of encoded values from the image section or ADRC block whose DR is being recovered having a set of points qi and Y is a list of decoded values taken from the vertical or horizontal neighbors of the points in QV, where yi represents a vertical or horizontal neighbor of qi. As each point qi may have up to four decoded neighbors, one pixel or point may give rise to as many as four (qi, y pairings. The unconstrained least squares estimate of DR (DRuis) is thus:
Figure imgf000035_0001
where Q is the number of quantization bits, MIΝ is the minimum value transmitted as a block attribute. The above equation assumes non-edge-matching ADRC; for edge-matching ADRC, 2Q is replaced with 2Q-1 and (0.5 + qi) is replaced with qj.
The unconstrained least squares estimate is can be clipped to assure consistency with the threshold table and the equation MIΝ + DR < 255 which is enforced during encoding (Typically, for non-edge-matching ADRC, permissible DR values are in the range of 1-256). Thus, the least squares estimate is clipped (DRιsc) by:
(DR)ιsc = max(min(UB,DRuis),LB) where UB represents the upper bound and LB represents the lower bound and min and max respectively represent minimum and maximum functions.
In an alternate embodiment, the estimation can be enhanced by selecting the pixels that are more suitable for DR estimation to calculate the estimate of DR. For example, flat regions in an image provide pixels which are more suitable for DR estimation than those regions in which high activity occurs. In particular, a sharp edge in the edge may decrease the accuracy of the estimate. The following embodiment provides a computationally light method for selecting the pixels to use to calculate an estimate of DR.
In one embodiment, the least squares estimate (DRιse), e.g., DRuιs or DRιsc. is computed. Using this estimate, the list of encoded values QV is transformed into candidate decoded values X, where Xi are members of X derived from qi. The xj value is a recovered decoded value formed using the first estimate of DR. The Xi value is defined according to the following equation:
Edge-matching ADRC: J , = MIN + l 0.5 + q> ' J^
2β - l
Non-edge-matching ADRC: , =
Figure imgf000036_0001
Assuming DRιse is a reasonable estimate of the true DR, then anywhere that xi is relatively close to yi, may be judged to be a low activity area and thus a desirable matching. New X and Y lists may then be formed by considering only the matches where X[ and yi are close and the least squares estimate recomputed to generate an updated estimate. The criteria for determining what is considered "close" can be determined a number of ways. In one embodiment, an ADRC encoding of the error function is used. This approach is desirable as it is computationally inexpensive. For the process, a list E, consisting of the points ei = | yϊ - Xi | is defined. Defining emin and emax as respectively the smallest and largest values from the list, then eDR = emax- emin. An encoded error value can then defined as: gi = (ei - emin)nl/eDR where nl represents the number of quantization levels for requantizing ei in a similar manner to the ADRC process described above.
Thus, new lists X and Y are generated by selecting only those matches where gi is less than some threshold. If the new lists are sufficiently long, these lists may be used to generate a refined least squares estimate DRrιs. The threshold for gj and the number of matches needed before refining the least squares estimation is preferably empirically determined. For example, in one embodiment for an process involving 8x8x2 horizontally subsampled blocks and where nl is 10, only matches corresponding to gi = 0 are used and the estimate is refined only when the new lists contain at least 30 matches.
In an alternate embodiment, DR estimation can be improved by clipping potential DR values and recomputing a DR estimate. In particular, in one embodiment, a list D is composed of member di which contains the DR value that would cause xi to equal yi. More precisely: di = 2Q(yi - MIN)/(0.5 + qi) Improvement is seen by clipping each di. That is, di' = max(min(UB,di), LB) where DRcis is then computed to be the average of di'. The clipped method (DRcis) may be combined with other DR estimates , e.g., DRιse in a weighted average to produce a final DR value. For example, the weighted average DReSt is determined according to the following: DReSt = wi(DRcis) + W2(DRιse).
The weights wi and W2 are preferably empirically determined by examining resultant estimations and images generated therefrom from particular weightings. In one embodiment wi = 0.22513 and w2 = 0.80739.
The invention has been described in conjunction with the preferred embodiment. It is evident that numerous alternatives, modifications, variations and uses will be apparent to those skilled in the art in light of the foregoing description.

Claims

CLAIMSWhat is claimed is:
1. A method for recovering compression constants in a bitstream of data comprising the steps of: computing (405) a numerator of a quotient determinative of the compression constant to be recovered; computing (410) a denominator of the quotient; shifting off (415) a least significant bit from the numerator and the denominator until the numerator and denominator are less than 2K ; and performing (425) K-bit division, where K is a constant.
2. The method as set forth in claim 1, wherein the encoded data is selected from the set comprising image data and sound data.
3. The method as set forth in claim 1, wherein the encoded data comprises image data having blocks and each block comprises compression constants selected from the group comprising MIN, DR and MAX, wherein MIN represents a minimum data value in the block, DR represents a dynamic range of the block, and MAX represents a maximum data value in the block.
4. The method as set forth in claim 3, wherein the compression constant to be recovered is MIN and the quotient comprises:
Edge-matching ADRC:
Figure imgf000039_0001
Νon edge-matching ADRC:
PR
MIN-- y,- , + 0.5)
N rsO (e ι"=l where MIΝ' is the recovered MIΝ value, Ν represents a number of neighboring data to use, Q represents a number of quantization bits used to encode, ei represents encoded data of the block, yi represents decoded neighboring data and DR represents the dynamic range of the block.
5. The method as set forth in claim 3, wherein the compression constant is DR and the quotient is selected from the group comprising: Edge-matching ADRC:
(2β - l ∑( , - 0.5 - MIN)-e,
DR'= ι=l
N
Σ .e<2
1=1
PR'=
Figure imgf000040_0001
Νon edge-matching ADRC:
2β - X(y, - /N). (e, + 0.5)
PR'= ι=l
∑(e, + 0.5)2
1=1
Figure imgf000040_0002
where DR' is the recovered DR value, Ν represents a number of neighboring data to use, Q represents a number of quantization bits used to encode, qi represents encoded data of the block, yi represents decoded data of neighboring data and MIΝ represents a mininum data value in the block.
6. A system for recovering at least one lost /damaged compression constant for encoded data comprising numerator logic (10) configured to compute a numerator of a quotient determinative of the compression constant to be recovered; denominator logic (20) configured to compute a denominator of the quotient; shift logic (50) configured to shift off a least significant bit from the numerator and the denominator until the numerator and the denominator are less than 2K ; and computation logic (50) configured to perform K-bit division, where K is a constant.
7. The system as set forth in claim 6, wherein the encoded data is selected from the set comprising image data and sound data.
8. The system as set forth in claim 6, wherein the encoded data comprises image data and each block comprises compression constants selected from the group comprising MIN, DR and MAX compression constants, wherein MIN represents a minimum data value in the block, DR represents a dynamic range of the block, and MAX represents a maximum data value in the block.
9. The system as set forth in claim 8, wherein MIN is estimated according to the following:
Edge-matching ADRC:
Figure imgf000041_0001
Non edge-matching ADRC:
1 N PR
MIN'= — ■ y (e, + 0.5)
where MIN' is the recovered MIN value, N represents a number of neighboring data to use, Q represents a number of quantization bits used to encode, ej represents the encoded data of the block, yi represents decoded data of the neighboring data and DR represents a dynamic range of the block.
10. The system as set forth in claim 8, wherein DR is estimated according to a formula selected from the group comprising: Edge-matching ADRC: (2β -ι)- ∑(y, -o.5 - /N)- e,
DR'= ι=l
Figure imgf000042_0001
Νon edge-matching ADRC:
2β - ∑(y, -M/N). (e, + o.5)
DR'= ι=l
∑(e, +0.5)2 ι=l
Figure imgf000042_0002
where DR' is the recovered DR value, Ν represents a number of neighboring data to use, Q represents a number of quantization bits used to encode, qj represents encoded data of the block, yi represents decoded data of the neighboring data and MIΝ represents a mininum data value in the block.
11. The system as set forth in claim 6, further comprising full precision registers configured to store a numerator and denominator of an equation used to estimate.
12. The system as set forth in claim 6, wherein a the numerator logic, denominator logic, first shift logic and second shift logic are formed in a processor.
13. A computer readable medium containing executable instructions, which, when executed in a processing system, causes the system to perform the steps for recovery of a compression constant of a bitstream of encoded data, comprising: computing (405) a numerator of a quotient determinative of the compression constant to be recovered; computing (410) a denominator of the quotient; shifting off (415) a least significant bit from the numerator and the denominator until the numerator and the denominator are less than 2K ; and performing (425) K-bit division, where K is a constant.
14. The computer readable medium as set forth in claim 13, wherein the encoded data is selected from the set comprising image data and sound data.
15. The computer readable medium as set forth in claim 13, wherein the encoded data comprises compression constants selected from the group comprising MIN, DR and MAX, wherein MIN represents a minimum data value in the block, DR represents a dynamic range of the block, and MAX represents a maximum data value in the block.
16. The computer readable medium as set forth in claim 15, wherein the compression constant is MIN and the quotient comprises:
Edge-matching ADRC:
Figure imgf000043_0001
Non edge-matching ADRC:
Figure imgf000043_0002
where MIN' is the recovered MIN value, N represents a number of neighboring data to use, Q represents a number of quantization bits used to encode, ei represents the encoded data of the block, yj represents decoded data of the neighboring data and DR represents the dynamic range of the block.
17. The computer readable medium as set forth in claim 15, wherein the compression constant is DR and the quotient is selected from the group comprising: Edge-matching ADRC:
Figure imgf000044_0001
Νon edge-matching ADRC:
Figure imgf000044_0002
2β - ∑( -MIN)
DR'= ^
∑(e, + 0.5) ι=l where DR' is the recovered DR value, Ν represents a number of neighboring data to use, Q represents a number of quantization bits used to encode, qi represents encoded data of the block, yj represents decoded data of the neighboring data and MIΝ represents a mininum data value in the block.
18. An apparatus for recovering at least one lost/ damaged compression constant for a block of encoded data comprising: means for computing (405) a numerator of a quotient determinative of the compression constant to be recovered; means for computing (410) a denominator of the quotient; means for shifting off (415) a least significant bit from both the numerator and the denominator until the numerator and the denominator are less than 2 ; and means for performing (425) K-bit division, where K is a constant.
19. The apparatus as set forth in claim 18, wherein the encoded data comprises image data and each block comprises MIN, DR and MAX compression constants selected from the group compressions, wherein MIN represents a minimum data value in the block, DR represents a dynamic range of the block, and MAX represents a maximum data value in the block.
PCT/US2000/003299 1999-02-12 2000-02-09 Method and apparatus for truncated decoding WO2000048319A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU36977/00A AU3697700A (en) 1999-02-12 2000-02-09 Method and apparatus for truncated decoding

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US09/249,500 1999-02-12
US09/249,500 US6535148B1 (en) 1999-02-12 1999-02-12 Method and apparatus for truncated decoding

Publications (1)

Publication Number Publication Date
WO2000048319A1 true WO2000048319A1 (en) 2000-08-17

Family

ID=22943714

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2000/003299 WO2000048319A1 (en) 1999-02-12 2000-02-09 Method and apparatus for truncated decoding

Country Status (3)

Country Link
US (2) US6535148B1 (en)
AU (1) AU3697700A (en)
WO (1) WO2000048319A1 (en)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4003282B2 (en) * 1998-03-13 2007-11-07 株式会社ニコン Electronic camera
US6473876B1 (en) 1999-06-29 2002-10-29 Sony Corporation Method and apparatus for encoding of bitstreams using rotation
US8374237B2 (en) * 2001-03-02 2013-02-12 Dolby Laboratories Licensing Corporation High precision encoding and decoding of video images
GB2373661B (en) * 2001-03-23 2005-05-11 Advanced Risc Mach Ltd A data processing apparatus and method for performing an adaptive filter operation on an input data sample
US7370120B2 (en) * 2001-12-07 2008-05-06 Propel Software Corporation Method and system for reducing network latency in data communication
US9577667B2 (en) 2002-04-23 2017-02-21 Ntt Docomo, Inc. System and method for arithmetic encoding and decoding
US20070076971A1 (en) * 2005-09-30 2007-04-05 Nokia Corporation Compression of images for computer graphics
JP4508132B2 (en) * 2006-02-27 2010-07-21 ソニー株式会社 Imaging device, imaging circuit, and imaging method
WO2012144876A2 (en) 2011-04-21 2012-10-26 한양대학교 산학협력단 Method and apparatus for encoding/decoding images using a prediction method adopting in-loop filtering

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5134479A (en) * 1990-02-16 1992-07-28 Sharp Kabushiki Kaisha NTSC high resolution television converting apparatus for converting television signals of an NTSC system into high resolution television signals
US5469474A (en) * 1992-06-24 1995-11-21 Nec Corporation Quantization bit number allocation by first selecting a subband signal having a maximum of signal to mask ratios in an input signal
US5649053A (en) * 1993-10-30 1997-07-15 Samsung Electronics Co., Ltd. Method for encoding audio signals
US5786857A (en) * 1993-10-01 1998-07-28 Texas Instruments Incorporated Image processing system

Family Cites Families (112)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3311879A (en) 1963-04-18 1967-03-28 Ibm Error checking system for variable length data
US3805232A (en) 1972-01-24 1974-04-16 Honeywell Inf Systems Encoder/decoder for code words of variable length
FR2387557A1 (en) 1977-04-14 1978-11-10 Telediffusion Fse NOISE VISIBILITY REDUCTION SYSTEMS ON TELEVISION IMAGES
GB2073534B (en) 1980-04-02 1984-04-04 Sony Corp Error concealment in digital television signals
GB2084432A (en) 1980-09-18 1982-04-07 Sony Corp Error concealment in digital television signals
US4509150A (en) 1980-12-31 1985-04-02 Mobil Oil Corporation Linear prediction coding for compressing of seismic data
US4532628A (en) 1983-02-28 1985-07-30 The Perkin-Elmer Corporation System for periodically reading all memory locations to detect errors
US4574393A (en) 1983-04-14 1986-03-04 Blackwell George F Gray scale image processor
JPH0746864B2 (en) 1984-08-22 1995-05-17 ソニー株式会社 High efficiency encoder
CA1251555A (en) 1984-12-19 1989-03-21 Tetsujiro Kondo High efficiency technique for coding a digital video signal
JPH0793724B2 (en) 1984-12-21 1995-10-09 ソニー株式会社 High efficiency coding apparatus and coding method for television signal
JP2512894B2 (en) 1985-11-05 1996-07-03 ソニー株式会社 High efficiency coding / decoding device
JP2670259B2 (en) 1985-11-29 1997-10-29 ソニー株式会社 High efficiency coding device
JPH0746862B2 (en) 1985-11-30 1995-05-17 ソニー株式会社 Frame dropping compression encoding and decoding method
JP2612557B2 (en) 1985-12-18 1997-05-21 ソニー株式会社 Data transmission receiving system and data decoding device
JPS62231569A (en) 1986-03-31 1987-10-12 Fuji Photo Film Co Ltd Quantizing method for estimated error
JP2751201B2 (en) 1988-04-19 1998-05-18 ソニー株式会社 Data transmission device and reception device
JP2508439B2 (en) 1987-05-29 1996-06-19 ソニー株式会社 High efficiency encoder
EP0293644B1 (en) 1987-06-02 1992-03-25 Siemens Aktiengesellschaft Method for determining movement vector fields from digital image sequences
US4885636A (en) 1987-06-22 1989-12-05 Eastman Kodak Company Block adaptive linear predictive coding with adaptive gain and bias
US5122873A (en) 1987-10-05 1992-06-16 Intel Corporation Method and apparatus for selectively encoding and decoding a digital motion video signal at multiple resolution levels
US5093872A (en) 1987-11-09 1992-03-03 Interand Corporation Electronic image compression method and apparatus using interlocking digitate geometric sub-areas to improve the quality of reconstructed images
JP2629238B2 (en) 1988-02-05 1997-07-09 ソニー株式会社 Decoding device and decoding method
SE503549C2 (en) 1988-09-15 1996-07-01 Telia Ab Encryption with subsequent source encoding
US4953023A (en) 1988-09-29 1990-08-28 Sony Corporation Coding apparatus for encoding and compressing video data
JP2900385B2 (en) 1988-12-16 1999-06-02 ソニー株式会社 Framing circuit and method
US5150210A (en) 1988-12-26 1992-09-22 Canon Kabushiki Kaisha Image signal restoring apparatus
JP3018366B2 (en) 1989-02-08 2000-03-13 ソニー株式会社 Video signal processing circuit
JPH02248161A (en) 1989-03-20 1990-10-03 Fujitsu Ltd Data transmission system
US5185746A (en) 1989-04-14 1993-02-09 Mitsubishi Denki Kabushiki Kaisha Optical recording system with error correction and data recording distributed across multiple disk drives
JPH02280462A (en) 1989-04-20 1990-11-16 Fuji Photo Film Co Ltd Picture data compression method
DE69031638T2 (en) 1989-05-19 1998-03-19 Canon Kk System for the transmission of image information
US5208816A (en) 1989-08-18 1993-05-04 At&T Bell Laboratories Generalized viterbi decoding algorithms
JPH03141752A (en) 1989-10-27 1991-06-17 Hitachi Ltd Picture signal transmitting method
US5166987A (en) 1990-04-04 1992-11-24 Sony Corporation Encoding apparatus with two stages of data compression
US5101446A (en) 1990-05-31 1992-03-31 Aware, Inc. Method and apparatus for coding an image
JPH0474063A (en) 1990-07-13 1992-03-09 Matsushita Electric Ind Co Ltd Coding method for picture
JP2650472B2 (en) 1990-07-30 1997-09-03 松下電器産業株式会社 Digital signal recording apparatus and digital signal recording method
JP2969867B2 (en) 1990-08-31 1999-11-02 ソニー株式会社 High-efficiency encoder for digital image signals.
GB9019538D0 (en) 1990-09-07 1990-10-24 Philips Electronic Associated Tracking a moving object
US5416651A (en) 1990-10-31 1995-05-16 Sony Corporation Apparatus for magnetically recording digital data
US5243428A (en) 1991-01-29 1993-09-07 North American Philips Corporation Method and apparatus for concealing errors in a digital television
US5636316A (en) 1990-12-05 1997-06-03 Hitachi, Ltd. Picture signal digital processing unit
ES2143136T3 (en) 1990-12-28 2000-05-01 Canon Kk APPARATUS FOR IMAGE PROCESSING.
JP2906671B2 (en) 1990-12-28 1999-06-21 ソニー株式会社 Highly efficient digital video signal encoding apparatus and method
EP0495501B1 (en) 1991-01-17 1998-07-08 Sharp Kabushiki Kaisha Image coding and decoding system using an orthogonal transform and bit allocation method suitable therefore
DE69230922T2 (en) 1991-01-17 2000-11-30 Mitsubishi Electric Corp Video signal encoder with block exchange technology
US5455629A (en) 1991-02-27 1995-10-03 Rca Thomson Licensing Corporation Apparatus for concealing errors in a digital video processing system
JP3125451B2 (en) 1991-11-05 2001-01-15 ソニー株式会社 Signal processing method
JPH04358486A (en) 1991-06-04 1992-12-11 Toshiba Corp High efficiency code signal processing unit
JP2766919B2 (en) 1991-06-07 1998-06-18 三菱電機株式会社 Digital signal recording / reproducing device, digital signal recording device, digital signal reproducing device
US5263026A (en) 1991-06-27 1993-11-16 Hughes Aircraft Company Maximum likelihood sequence estimation based equalization within a mobile digital cellular receiver
JP3141896B2 (en) 1991-08-09 2001-03-07 ソニー株式会社 Digital video signal recording device
DE69217150T2 (en) 1991-09-30 1997-07-17 Philips Electronics Nv Motion vector estimation, motion picture coding and storage
JPH05103309A (en) 1991-10-04 1993-04-23 Canon Inc Method and device for transmitting information
US5398078A (en) 1991-10-31 1995-03-14 Kabushiki Kaisha Toshiba Method of detecting a motion vector in an image coding apparatus
JP3278881B2 (en) 1991-12-13 2002-04-30 ソニー株式会社 Image signal generator
US5473479A (en) 1992-01-17 1995-12-05 Sharp Kabushiki Kaisha Digital recording and/or reproduction apparatus of video signal rearranging components within a fixed length block
JP3360844B2 (en) 1992-02-04 2003-01-07 ソニー株式会社 Digital image signal transmission apparatus and framing method
JPH05236427A (en) 1992-02-25 1993-09-10 Sony Corp Device and method for encoding image signal
US5307175A (en) 1992-03-27 1994-04-26 Xerox Corporation Optical image defocus correction
JP3259323B2 (en) 1992-04-13 2002-02-25 ソニー株式会社 De-interleave circuit
US5325203A (en) 1992-04-16 1994-06-28 Sony Corporation Adaptively controlled noise reduction device for producing a continuous output
JP3438233B2 (en) 1992-05-22 2003-08-18 ソニー株式会社 Image conversion apparatus and method
US5359694A (en) 1992-07-27 1994-10-25 Teknekron Communications Systems, Inc. Method and apparatus for converting image data
US5438369A (en) 1992-08-17 1995-08-01 Zenith Electronics Corporation Digital data interleaving system with improved error correctability for vertically correlated interference
US5481554A (en) 1992-09-02 1996-01-02 Sony Corporation Data transmission apparatus for transmitting code data
JPH06153180A (en) 1992-09-16 1994-05-31 Fujitsu Ltd Picture data coding method and device
JPH06121192A (en) 1992-10-08 1994-04-28 Sony Corp Noise removing circuit
DE69324650T2 (en) 1992-11-06 1999-09-09 Gold Star Co Mixing method for a digital video tape recorder
US5689302A (en) 1992-12-10 1997-11-18 British Broadcasting Corp. Higher definition video signals from lower definition sources
US5477276A (en) 1992-12-17 1995-12-19 Sony Corporation Digital signal processing apparatus for achieving fade-in and fade-out effects on digital video signals
JPH06205386A (en) 1992-12-28 1994-07-22 Canon Inc Picture reproduction device
US5805762A (en) 1993-01-13 1998-09-08 Hitachi America, Ltd. Video recording device compatible transmitter
US5416847A (en) 1993-02-12 1995-05-16 The Walt Disney Company Multi-band, digital audio noise filter
US5737022A (en) 1993-02-26 1998-04-07 Kabushiki Kaisha Toshiba Motion picture error concealment using simplified motion compensation
JP3259428B2 (en) 1993-03-24 2002-02-25 ソニー株式会社 Apparatus and method for concealing digital image signal
KR100261072B1 (en) 1993-04-30 2000-07-01 윤종용 Digital signal processing system
KR940026915A (en) 1993-05-24 1994-12-10 오오가 노리오 Digital video signal recording device and playback device and recording method
US5499057A (en) 1993-08-27 1996-03-12 Sony Corporation Apparatus for producing a noise-reducded image signal from an input image signal
JP3557626B2 (en) 1993-08-27 2004-08-25 ソニー株式会社 Image restoration apparatus and method
US5406334A (en) 1993-08-30 1995-04-11 Sony Corporation Apparatus and method for producing a zoomed image signal
KR960012931B1 (en) 1993-08-31 1996-09-25 대우전자 주식회사 Channel error concealing method for classified vector quantized video
JP3590996B2 (en) 1993-09-30 2004-11-17 ソニー株式会社 Hierarchical encoding and decoding apparatus for digital image signal
US5663764A (en) 1993-09-30 1997-09-02 Sony Corporation Hierarchical encoding and decoding apparatus for a digital image signal
JP2862064B2 (en) 1993-10-29 1999-02-24 三菱電機株式会社 Data decoding device, data receiving device, and data receiving method
US5617333A (en) 1993-11-29 1997-04-01 Kokusai Electric Co., Ltd. Method and apparatus for transmission of image data
JP3271108B2 (en) 1993-12-03 2002-04-02 ソニー株式会社 Apparatus and method for processing digital image signal
JPH07203428A (en) 1993-12-28 1995-08-04 Canon Inc Image processing method and its device
JP3321972B2 (en) 1994-02-15 2002-09-09 ソニー株式会社 Digital signal recording device
JP3161217B2 (en) 1994-04-28 2001-04-25 松下電器産業株式会社 Image encoding recording device and recording / reproducing device
JP3336754B2 (en) 1994-08-19 2002-10-21 ソニー株式会社 Digital video signal recording method and recording apparatus
JP3845870B2 (en) 1994-09-09 2006-11-15 ソニー株式会社 Integrated circuit for digital signal processing
US5577053A (en) 1994-09-14 1996-11-19 Ericsson Inc. Method and apparatus for decoder optimization
JPH08140091A (en) 1994-11-07 1996-05-31 Kokusai Electric Co Ltd Image transmission system
US5594807A (en) 1994-12-22 1997-01-14 Siemens Medical Systems, Inc. System and method for adaptive filtering of images based on similarity between histograms
US5852470A (en) 1995-05-31 1998-12-22 Sony Corporation Signal converting apparatus and signal converting method
US5946044A (en) 1995-06-30 1999-08-31 Sony Corporation Image signal converting method and image signal converting apparatus
FR2736743B1 (en) 1995-07-10 1997-09-12 France Telecom METHOD FOR CONTROLLING THE OUTPUT RATE OF AN ENCODER OF DIGITAL DATA REPRESENTATIVE OF IMAGE SEQUENCES
JP3617879B2 (en) 1995-09-12 2005-02-09 株式会社東芝 Disk repair method and disk repair device for real-time stream server
KR0155900B1 (en) 1995-10-18 1998-11-16 김광호 Phase error detecting method and phase tracking loop circuit
US5724369A (en) 1995-10-26 1998-03-03 Motorola Inc. Method and device for concealment and containment of errors in a macroblock-based video codec
KR100196872B1 (en) 1995-12-23 1999-06-15 전주범 Apparatus for restoring error of image data in image decoder
KR100197366B1 (en) 1995-12-23 1999-06-15 전주범 Apparatus for restoring error of image data
US5751862A (en) 1996-05-08 1998-05-12 Xerox Corporation Self-timed two-dimensional filter
JP3352887B2 (en) * 1996-09-09 2002-12-03 株式会社東芝 Divider with clamp, information processing apparatus provided with this divider with clamp, and clamp method in division processing
US6134269A (en) 1996-09-25 2000-10-17 At&T Corp Fixed or adaptive deinterleaved transform coding for image coding and intra coding of video
JP3106985B2 (en) 1996-12-25 2000-11-06 日本電気株式会社 Electronic watermark insertion device and detection device
KR100196840B1 (en) 1996-12-27 1999-06-15 전주범 Apparatus for reconstucting bits error in the image decoder
KR100239302B1 (en) 1997-01-20 2000-01-15 전주범 Countour coding method and apparatus using vertex coding
EP1025707B1 (en) 1997-10-23 2008-07-23 Sony Electronics Inc. Apparatus and method for mapping an image to blocks to provide for robust error recovery in a lossy transmission environment
US6137915A (en) 1998-08-20 2000-10-24 Sarnoff Corporation Apparatus and method for error concealment for hierarchical subband coding and decoding

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5134479A (en) * 1990-02-16 1992-07-28 Sharp Kabushiki Kaisha NTSC high resolution television converting apparatus for converting television signals of an NTSC system into high resolution television signals
US5469474A (en) * 1992-06-24 1995-11-21 Nec Corporation Quantization bit number allocation by first selecting a subband signal having a maximum of signal to mask ratios in an input signal
US5786857A (en) * 1993-10-01 1998-07-28 Texas Instruments Incorporated Image processing system
US5649053A (en) * 1993-10-30 1997-07-15 Samsung Electronics Co., Ltd. Method for encoding audio signals

Also Published As

Publication number Publication date
US6295008B1 (en) 2001-09-25
US6535148B1 (en) 2003-03-18
AU3697700A (en) 2000-08-29

Similar Documents

Publication Publication Date Title
CA2306897C (en) Source coding to provide for robust error recovery during transmission losses
KR100704313B1 (en) Apparatus and method for the recovery of compression constants in the encoded domain, computer readable medium
US6535148B1 (en) Method and apparatus for truncated decoding
EP1027651B1 (en) Apparatus and method for providing robust error recovery for errors that occur in a lossy transmission environment
EP1025647B1 (en) Apparatus and method for recovery of lost/damaged data in a bitstream of data based on compatibility
EP1025707B1 (en) Apparatus and method for mapping an image to blocks to provide for robust error recovery in a lossy transmission environment
US6581170B1 (en) Source coding to provide for robust error recovery during transmission losses
US6282684B1 (en) Apparatus and method for recovery of data in a lossy transmission environment
WO2000048320A1 (en) Method and apparatus for truncated decoding
EP1040444B1 (en) Apparatus and method for recovery of quantization codes in a lossy transmission environment
US20030212944A1 (en) Method and apparatus for error data recovery
EP1025648B1 (en) Apparatus and method for localizing transmission errors to provide robust error recovery in a lossy transmission environment
EP1025538B1 (en) Apparatus and method for recovery of data in a lossy transmission environment
EP1025705B1 (en) Apparatus and method for partial buffering transmitted data to provide robust error recovery in a lossy transmission environment
US6178266B1 (en) Method and apparatus for the recovery of compression constants in the encoded domain
EP1151615B1 (en) Apparatus and method for the recovery of compression constants in the encoded domain

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CR CU CZ DE DK DM EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

122 Ep: pct application non-entry in european phase