Distributed Coding of Dynamic Scenes with
Motion-Compensated Wavelets
This work discusses distributed coding of two correlated video signals.
The video signals are captured from a dynamic scene where each signal
is temporally decorrelated by a motion-compensated Haar wavelet.
The two cameras operate independently, however, the central decoder is
able to exploit the coded information from all cameras to achieve the best
reconstruction of the correlated video signals.
The coding system utilizes nested lattice codes for the transform
coefficients and exploits side information at the decoder.
The efficiency of the decoder is improved by disparity compensation
of one video signal.
When compared to decoding without side information, decoding of the quantized
transform coefficients with disparity compensated side information improves
rate-distortion performance.
(Article)
1. Distributed Coding Scheme
Scene information that is acquired by more than one sensor can be coded
efficiently if the correlation among sensor signals is exploited.
In one possible compression scenario, encoders of the sensor signals
are connected and compress the sensor signals jointly.
In an alternative compression scenario, each encoder operates independently
but relies on a joint decoding unit that receives all coded sensor signals.
This is also known as distributed source coding.
A special case of this scenario is source coding with side information.
Wyner and Ziv showed that for certain cases the encoder does not need the
side information to which the decoder has access to achieve the
rate-distortion bound.
The figure depicts the distributed coding scheme for dynamic scenes.
The dynamic scene is represented by the image sequences
sk[x,y] and wk[x,y].
The coding scheme comprises of Encoder 1 and Encoder 2 that
operate independently as well as of Decoder 2 that is dependent on
Decoder 1.
The side information for Decoder 2 can be improved by considering
the spatial camera positions and performing disparity compensation.
As the video signals are not stationary, Decoder 2 is decoding with
feed-back.
2. Experimental Results
For the experiments, we select the stereoscopic MPEG-4 sequence
Funfair in QCIF resolution.
We divide each view with 224 frames at 30 fps into groups of K=32
pictures.
The GOPs of the left view are encoded with Encoder 1 at high
quality by setting the quantization parameter QP=2, where Q = 2 QP.
This coded version of the left view is used for disparity compensation.
The compensated frames provide the side information for Decoder 2
to decode the right view.
The left figure shows the luminance PSNR over the total bit-rate of the
distributed codec Encoder 2 for the sequences Funfair 2.
The sequence is the right view of the stereoscopic sequence.
The rate-distortion points are obtained by varying the quantization
parameter for the nested lattice in Encoder 2.
When compared to decoding without side information, decoding with
coefficient side information reduces the bit-rate by up to 5%.
Decoding with disparity-compensated side information reduces the bit-rate
by up to 8%.
Utilizing more accurate geometrical information about the scene will
improve the side information for Decoder 2 and, hence, will further
reduce the bit-rate of Encoder 2.
The right figure shows the bit-rate difference between decoding with side
information and decoding without side information over the luminance PSNR
at Decoder 2 for the sequence Funfair 2 (right view).
The bit-rate savings due to side information are depicted for weak temporal
filtering with K=8 pictures per GOP and strong temporal filtering
with K=32 pictures per GOP.
Note that both the coded signal (right view) and the side information
(left view) are encoded with the same GOP length K.
It is observed that strong temporal filtering results in lower
bit-rate savings due to side information when compared to the bit-rate
savings due to side information for weaker temporal filtering.
Obviously, there is a trade-off between the level of temporal decorrelation
and the efficiency of multi-view side information.
This trade-off is also found in the theoretical investigation on the
efficiency of video coding with side information.
Copyright Markus Flierl, July 15, 2004
|