Image Compression, Packet Video and Video Processing
Fall Semester, 2008
Stanford University Image Systems Engineering Toolbox - This toolbox is about 8.3MBytes. The toolbox contains matlab routines (and in many cases, Mex files) for image processing, motion (affine), calibration, color conversion, some image compression and dithering, file utilities (pgm, ppm, mpeg), image filtering, image pyramids, graphics support, image statistics, optics, optimization, and general Matlab utility functions. It is a good library to have if you do not have the Mathworks image processing toolbox.
General Image Processing and Computer Vision Tools:
Intel hosts a series of free tools in both image processing and computer vision that have direct application to the video compression problem. The two tool sets: Image Processing Library (IPL) and OpenCV provide C libraries for image and video processing. In particular, the OpenCV library has support for frame grabbers, directX and also supports a series of Matlab toolboxes (Calibration and OpenCV Wrapper).
Since Intel is no longer releasing IPL (version 2.5), I have included it on the CD-ROM (It is 21 MBytes). OpenCV no longer requires it.
OpenCV may be obtained via their network site at Source Forge. There are
If you are unable to download these programs, Please let me know, and I make them available to the class via CD.
Wavelet Toolbox - UVIWave version 3.0 - This toolbox (and manual) was used by earlier image compression classes. The highest matlab version that was tested was 5.3 (It should run on version 6.0 and 6.5).
Image Processing and Computer Vision Tools from Peter Kovesi MatlabFns.tar.gz
Virtual Dub (with MPEG-2). This package will allow you to view sequences of files, as well as avi and mpeg1/2 data. The program will identify IPB frames and extract to PNG and BMP sequences.
The homework assignments occasionally call for test data for motion estimation, or compression examples. I will add more images as the semester progresses:
Color NTSC (640x480x24) Image for use with homework #2
SMPTE BARS image (from http://www.barney-wol.net/testpatterns/testpatterns.html)
Readyuvtestntsc.m script file to read 486x720 YCbCr data set.
Black and White Still Images:
The following are 352x288 (8 bit pixels):
Matlab script to convert double precision floating point to binary representation (useful for arithmetic encoder): num2bin.m
Matlab script to convert binary representation to double precision floating point number (useful for arithmetic decoder): bin2num.m
MATLAB Script to find the minimum distortion (LBG example from VQ Charts) findmindistortion.m
MATLAB Script to do the first step of the Rao and Hwang example testvectrun.m
The external file may say 352x240, but they are 352x288, 8 bit pixels.
Arithmetic Compression Articles:
I. H. Witten, R. M. Neal, J. G. Cleary, "Arithmetic Coding for Data Compression", Communications of the ACM, Vol 30, No. 6, pp 520-540, June 1987.
C.Stiller and J.Konrad, "Estimating motion in image sequences," IEEE Signal Processing Magazine, 16:70--91, July 1999.
J. R. Bergen, P. J. Burt, R. Hingorani, S. Peleg, "A Three-Frame Algorithm for Estimating Two-Component Image Motion", IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol 14, No. 9, Sept, 1992, pp. 886-896.
C.V. Stewart, "Robust Parameter
Estimation in Computer Vision,"
P. J. Burt and E. H. Adelson. A multiresolution spline with application to image mosaics. ACM Transactions on Graphics, 2(4):217--236, Oct. 1983
P. Heckbert, "Projective Mappings for Image Warping", Excerpted from pages 17-21 of Fundamentals of Texture mapping and Image Warping, Paul Heckbert, Master's thesis, UCB/CSD 89/516, CS Division, U.C. Berkeley, June 1989.
F. Odone, A. Fusiello, E. Trucco, "Robust Motion Segmentation for Content-based Video Coding", In 6th RIAO (Recherche d'Informations Assiste par Ordinateur) International Conference, 2000
S. Peleg and J. Herman, "Panoramic Mosaics by Manifold Projection", CVPR97, pp. 338-343,June 1997.
S. Peleg, M. Ben-Ezra, "Stereo
panorama with a single camera", Int. Conf. Computer Vision and
Pattern Recognition ICPR 99, pp. 395401,
B. Rousso, S. Peleg and I. Finci, "Mosaicing
with Generalized Strips", DARPA Image Understanding Workshop, pp.
R. Szeliski, "Image Alignment and Stitching: A Tutorial", Microsoft Research Technical Report (Preliminary Draft), MSR-TR-2004-92, January 26, 2005.
R. Szeliski, "Image Mosaicing for Tele-Reality Applications", IEEE WACV94, pp. 44-53, Dec. 1994.
R. Szeliski and J. Coughlan, "Spline-Based Image Registration", International Journal of Computer Vision 22(3), 199-218 (1997).
R. Szliski and H. Shum, "Creating Full View Panoramic Image Mosaics and Environment Maps", SIGRAPH 97, pp. 251-258, LA, Aug. 1997.
I. Zoghlami, O. Faugeras and R. Deriche, "Using geometric corners to build a 2D mosaic from a set of images", CVPR97, pp. 420-425, Puerto-Rico, Jun. 1997.
E.Dubois, "The sampling and reconstruction of time-varying imagery with application in video systems. Proceedings of IEEE, 73:502--522, 1985.
R.A. Beuker and I.A. Shah. “Analysis of interlaced video signals and its applications.” IEEE Trans. Image Process., 3(5):501--512, Sept. 1994.
B.K. Gunturk, Y. Altunbasak and R.M. Mersereau, “Multiframe Resolution Enhancement Methods for Compressed Video”, IEEE Signal Processing Letters, 9(6):170-174, June 2002.
A. Patti, M. Sezan, and M. Tekalp. “Superresolution video reconstruction with arbitrary sampling lattices and nonzero aperature time”, In IEEE Trans. on Image Processing, pages 1064--1078, August 1997
A. J. Patti, M. Sezan, and A. M. Tekalp, “Robust methods for high quality stills from interlaced video in the presence of dominant motion,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 7, no. 2, pp. 328--342, Apr. 1997
R. Schultz and R. Stevenson, “ Extraction of high-resolution frames from video sequences”, IEEE T. Image Proces., 5(6), pp. 996--1011, 1996.
P.E. Eren, M.L. Sezan, A.M Tekalp, “ Robust, object-based high-resolution image reconstruction from low-resolution video”, Image Processing, IEEE Transactions on , Volume: 6 Issue: 10 , Oct. 1997 Page(s): 1446 -1451
A.M. Tekalp, M.K. Ozkan, M.I. Sezan, “ High-resolution image reconstruction from lower-resolution image sequences and space-varying image restoration ”, Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on , Volume: 3 , 1992 Page(s): 169 -172 vol.3
M. Irani and S. Peleg, “ Motion Analysis for Image Enhancement: Resolution, Occlusion, and Transparency” J. on Visual Communications and Image Representation, Vol 4 No. 4, December 1993, pp. 324-335.
S.P. Kim, N.K. Bose,H.M Valenzuela, “ Recursive reconstruction of high resolution image from noisy undersampled multiframes” Acoustics, Speech and Signal Processing [see also IEEE Transactions on Signal Processing], IEEE Transactions on , Volume: 38 Issue: 6 , June 1990 Page(s): 1013 -1027
Brady, N., “ MPEG-4 standardized methods for the compression of arbitrarily shaped video objects” Circuits and Systems for Video Technology, IEEE Transactions on , Volume: 9 Issue: 8 , Dec. 1999, Page(s): 1170 -1189
Kan-Li Huang; Yi-Shin Tung; Ja-Ling Wu; Po-Kang Hsiao; Hsien-Shuo Chen, “ A frame-based MPEG characteristics extraction tool and its application in video transcoding” Consumer Electronics, IEEE Transactions on , Volume: 48 Issue: 3 , Aug. 2002 Page(s): 522 -532
Lienhart, R.; Wernicke, A., “ Localizing and segmenting text in images and videos ” Circuits and Systems for Video Technology, IEEE Transactions on , Volume: 12 Issue: 4 , April 2002, Page(s): 256-268
Changick Kim; Jenq-Neng Hwang, “ Fast and automatic video object segmentation and tracking for content-based applications” Circuits and Systems for Video Technology, IEEE Transactions on , Volume: 12 Issue: 2 , Feb. 2002, Page(s): 122 -129
Tsaig, Y.; Averbuch, A., “ Automatic segmentation of moving objects in video sequences: a region labeling approach” Circuits and Systems for Video Technology, IEEE Transactions on , Volume: 12 Issue: 7 , July 2002, Page(s): 597 -612
Shao-Yi Chien; Shyh-Yih Ma; Liang-Gee Chen, “ Efficient moving object segmentation algorithm using background registration technique” Circuits and Systems for Video Technology, IEEE Transactions on , Volume: 12 Issue: 7 , July 2002, Page(s): 577 -586
Koskinen, L.; Laiho, M.; Paasio, A.; Halonen, K., “ MPEG-4 based modifications for an CNN segmentation chip” Cellular Neural Networks and Their Applications, 2002. (CNNA 2002). Proceedings of the 2002 7th IEEE International Workshop on , 22-24 Jul 2002, Page(s): 71 -77
Jeong-Hwan Ahn; Chang-Su Kim; Kuo, C.-C.J.; Yo-Sung Ho, “ Motion-compensated compression of 3D animation models” Electronics Letters , Volume: 37 Issue: 24 , 22 Nov. 2001, Page(s): 1445 -1446
Minami, G.; Zixiang Xiong; Wang, A.; Mehrotra, S., “ 3-D wavelet coding of video with arbitrary regions of support” Circuits and Systems for Video Technology, IEEE Transactions on , Volume: 11 Issue: 9 , Sept. 2001, Page(s): 1063 -1068
Zhidong Yan; Kumar, S.; Kuo, C.-C.J., “ Error-resilient coding of 3-D graphic models via adaptive mesh segmentation” Circuits and Systems for Video Technology, IEEE Transactions on , Volume: 11 Issue: 7 , July 2001, Page(s): 860 -873
Strat, T.M., “ Object-based encoding: next-generation video compression",” MPEG-4. 2001 Proceedings of Workshop and Exhibition on , 2002, Page(s): 53 -57
Yan Lu; Wen Gao; Feng Wu, “ Sprite generation for frame-based video coding",” Image Processing, 2001. Proceedings. 2001 International Conference on , Volume: 1 , 2001, Page(s): 473 -476 vol.1
Match Moving Tools (Links to Company)
MatchMover - Trial Version Available – Note: With the purchase by Autodesk, MatchMover has disappeared.
Movimento- Trial Version Available – Note: With the purchase by Autodesk, Movimento has disappeared.
VTour – Trial Version Available – Note: With the purchase by Autodesk, VTour has disappeared.
Stitcher – Trial Version Available
Voodoo Camera Tracker – Free tracker
Updated: September 13, 2009