Video compression formats: A primer

technology

Examining compression standards Motion JPEG, MPEG-4, and H.264.

By Ryan Zatolokin, Axis Communications, inc.

Video compression technologies reduce and remove redundant video data so a digital video file can effectively be sent over a network and stored on computer disks. With efficient compression techniques, a significant reduction in file size can be achieved with little or no adverse effect on visual quality.

This graph shows a bit-rate comparison, given the same level of image quality, among the following video standards: Motion JPEG, MPEG-4 Part 2 (no motion compensation), MPEG-4 Part 2 (with motion compensation), and H.264 (baseline profile).

Different compression technologies, both proprietary and industry-standard, are available. Today most video vendors use standard compression techniques; standards are particularly relevant to video compression because video may be used for different purposes, and in some surveillance applications, must be viewable years from the recording date. Using standard-based compression enables users to select from different vendors, rather than being tied to a single supplier.

At Axis Communications we use three video compression standards: Motion JPEG, MPEG-4 Part 2 (often referred to simply as MPEG-4), and H.264. H.264 is the latest and most-efficient video compression standard. Here are descriptions of each of those formats.

Motion JPEG—Motion JPEG or M-JPEG is a digital video sequence that is made up of a series of individual JPEG images. (JPEG stands for Joint Photographic Experts Group.) When 16 image frames or more are shown per second, the viewer perceives motion video. Full motion video is perceived at 30 (NTSC) or 25 (PAL) frames per second.

One of the advantages of Motion JPEG is that each image in a video sequence can have the same guaranteed quality that is determined by the compression level chosen for the network camera or video encoder. The higher the compression level, the lower the file size and image quality. In some situations, such as in low light or when a scene becomes complex, the image file size may become quite large and use more bandwidth and storage space. To prevent an increase in the bandwidth and storage used, Axis network video products allow the user to set a maximum file size for an image frame.

Because there is no dependency between the frames in Motion JPEG, a Motion JPEG video is robust, meaning that if one frame is dropped during transmission, the rest of the video will not be affected.

Motion JPEG is an unlicensed standard. It has broad compatibility and is popular in applications where individual frames in a video sequence are required—for examples, for analysis—and where lower frame rates, typically 5 frames per second or lower, are used. Motion JPEG may also be needed for applications that require integration with systems that support only Motion JPEG.

The main disadvantage of Motion JPEG is that it makes no use of any video compression techniques to reduce the data since it is a series of still, complete images. The result is that it has a relatively high bit rate or low compression ratio for the delivered quality compared with video compression standards such as MPEG-4 and H.264.

MPEG-4—When MPEG-4 is mentioned in video surveillance applications, it is usually referring to MPEG-4 Part 2, also known as MPEG-4 Visual. Like all MPEG (Moving Picture Experts Group) standards, it is a licensed standard, so users must pay a license fee per monitoring station. MPEG-4 supports low-bandwidth applications and applications that require high-quality images, no limitations in frame rate and with virtually unlimited bandwidth.

H.264 or MPEG-4 Part 10/AVC—H.264, also known as MPEG-4 Part 10/AVC for Advanced Video Coding, is the latest MPEG standard for video encoding. H.264 is expected to become the video standard of choice in the coming years. This is because an H.264 encoder can, without compromising image quality, reduce the size of a digital video file by more than 80 percent compared with the Motion JPEG format and as much as 50 percent more than with the MPEG-4 standard. This means that much less network bandwidth and storage space are required for a video file. Or seen another way, much higher video quality can be achieved for a given bit rate.

H.264 was jointly defined by standardization organizations in the telecommunications (ITU-T’s Video Coding Experts Group) and IT industries (ISO/IEC Moving Picture Experts Group), and is expected to be more widely adopted than previous standards. In the video surveillance industry, H.264 will most likely find the quickest traction in applications where there are demands for high frame rates and high resolution, such as in the surveillance of highways, airports and casinos, where the use of 30/25 (NTSC/PAL) frames per second is the norm. This is where the economies of reduced bandwidth and storage needs will deliver the biggest savings.

H.264 is also expected to accelerate the adoption of megapixel cameras because the highly efficient compression technology can reduce the large file sizes and bit rates generated without compromising image quality. While H.264 provides savings in network bandwidth and storage costs, it will require higher-performance network cameras and monitoring stations.

Axis’s H.264 encoders use the baseline profile, which means that only I- and P-frames are used. This profile is ideal for network cameras and video encoders because low latency is achieved because B-frames are not used. Low latency is essential in video surveillance applications where live monitoring takes place, especially when PTZ cameras or PTZ dome cameras are used.

When comparing the performance of MPEG standards such as MPEG-4 and H.264, it is important to note that results may vary between encoders that use the same standard. This is because the designer of an encoder can choose to implement different sets of tools defined by a standard. As long as the output of an encoder conforms to a standard’s format and decoder, it is possible to make different implementations. An MPEG standard, therefore, cannot guarantee a given bit rate or quality, and comparisons cannot be properly made without first defining how the standards are implemented in an encoder. A decoder, unlike an encoder, must implement all the required parts of a standard in order to decode a compliant bit stream. A standard specifies exactly how a decompression algorithm should restore every bit of a compressed video.

At Axis we compared bit rates of different encoders using the same level of image quality and different compression standards. Specifically, the standards were Motion JPEG, MPEG-4 Part 2 (no motion compensation), MPEG-4 Part 2 (with motion compensation), and H.264 (baseline profile).

Our H.264 encoder generated up to 50 percent fewer bits per second for a sample video sequence than an MPEG-4 encoder with motion compensation. The H.264 encoder was at least three times more efficient than an MPEG-4 encoder with no motion compensation, and at least six times more efficient than with Motion JPEG.

Ryan Zatolokin is senior technologist with Axis Communications, Inc. This article is excerpted from an article that is available on Axis’s website. That article discusses topics including image compression versus video compression, as well as variable and constant bit rate.