Jump to content

zlib

From Wikipedia, the free encyclopedia
(Redirected from ZLIB)
zlib
Initial release1 May 1995 (1995-05-01)
Stable release
1.3.1[1] / 22 January 2024; 2 years ago (22 January 2024)
Written inC
Operating systemCross-platform
TypeData compression
Licensezlib License
Websitezlib.net
Repository

zlib (/ˈzlɪb/ or "zeta-lib", /ˈztəˌlɪb/)[2][3] is a data format and a lossless data compression software library created by Jean-loup Gailly and Mark Adler. The library implements the Deflate algorithm and supports compressing and decompressing data using the zlib data format, the gzip file format, or simply as a Deflate stream.[4] zlib is also a crucial component of many software platforms, including Linux, macOS, and iOS. It has also been used in gaming consoles such as the PlayStation 4, PlayStation 3, Wii U, Wii, Xbox One and Xbox 360.

The first public version of Zlib, 0.9, was released on 1 May 1995 and was originally intended for use with the libpng image library. It is free software, distributed under the zlib License.

RFC 1950 standardized the zlib data format.[5]

Data format

[edit]

The zlib data format contains a 2-byte header, an optional extra header field, Deflate-compressed data, and a 4-byte trailer.

The first bit (bit 0), in a byte, is the least-significant. Multi-byte values use big-endian.[6]

Offset (bytes) Field[7] Size (bytes) Description
0 CMF 1

This byte is divided into two 4-bit fields:

  • Bits 0 to 3: CM (compression method). Must be 8 (Deflate).
  • Bits 4 to 7: CINFO (compression info). Base-2 logarithm of the LZ77 window size, minus eight. The maximum allowed value is 7 (i.e. 32 KiB window size). 6 indicates 16 KiB, 5 indicates 8 KiB, etc...
1 FLG 1

Flags.

  • Bits 0 to 4: FCHECK. This value must be chosen such that CMF*256 + FLG is a multiple of 31.
  • Bit 5: FDICT. See DICTID below.
  • Bits 6 to 7: FLEVEL (compression level). Fastest (0), fast (1), default (2) or maximum (3). FLEVEL is not needed for decompression. It indicates if recompression might be worthwhile.
2 DICTID 0 or 4 Adler-32 checksum of the preset DEFLATE dictionary used during compression. Present if the FDICT flag is set. The decompressor can use this checksum to determine which dictionary has been used by the compressor (e.g. inflateSetDictionary in zlib.h).
Varies Compressed data Varies Deflate stream.
ADLER32 4 Adler-32 checksum of the uncompressed data (excluding any dictionary data).

Library

[edit]

Encapsulation

[edit]

Deflate compression typically uses the zlib data format or the gzip file format, which add a header and a trailer to the compressed data. This provides stream identification and error detection that are not provided by a Deflate stream.

The zlib format (at least 6 bytes) is smaller than the gzip format (at least 18 bytes) as the latter stores filesystem metadata.

Algorithm

[edit]

As of September 2018, zlib only supports one algorithm, called DEFLATE, which uses a combination of a variation of LZ77 (Lempel–Ziv 1977) and Huffman coding.[8] This algorithm provides good compression on a wide variety of data with minimal use of system resources. This is also the algorithm used in the Zip archive format. The header makes allowance for other algorithms, but none are currently implemented.

Resource use

[edit]

zlib provides facilities for control of processor and memory use. A compression level value may be supplied that trades speed for compression. There are also facilities for conserving memory, useful in restricted memory environments, such as some embedded systems.

Strategy

[edit]

The compression can be optimized for specific types of data. If one is using the library to always compress specific types of data, then using a specific strategy may improve compression and performance. For example, if the data contain long lengths of repeated bytes, the run-length encoding (RLE) strategy may give good results at a higher speed. For general data, the default strategy is preferred.

Error handling

[edit]

Errors in compressed data may be detected and skipped. Further, if "full-flush" points are written to the compressed stream, then corrupt data can be skipped, and the decompression will resynchronize at the next flush point—although no error recovery of the corrupt data is provided. Full-flush points are useful for large data streams on unreliable channels, where some data loss is unimportant, such as in some multimedia applications. However, creating many flush points can affect the speed as well as the amount (ratio) of compression.

Data length

[edit]

There is no limit to the length of data that can be compressed or decompressed. Repeated calls to the library allow an unlimited number of blocks of data to be handled. Some ancillary code (counters) may suffer from overflow for long data streams, but this does not affect the actual compression or decompression.

When compressing a long (or infinite) data stream, it is advisable to write regular full-flush points.

Applications

[edit]

Today, zlib is something of a de facto standard, to the point that zlib and DEFLATE are often used interchangeably in standards documents, with thousands of applications relying on it for compression, either directly or indirectly.[9] These include:

zlib is also used in many embedded devices, such as the Apple iPhone and Sony PlayStation 3, because the code is portable, liberally licensed, and has a relatively small memory footprint.

Forks

[edit]

A commonly used library built on an old codebase, zlib is also frequently forked by third-parties that claim improvements to this library:

See also

[edit]

References

[edit]
  1. ^ Adler, Mark (22 January 2024). "[Zlib-announce] zlib 1.3.1 released". Retrieved 23 January 2024.
  2. ^ Stutz, Michael (October 22, 2004). The Linux Cookbook, 2nd Edition: Tips and Techniques for Everyday Use. No Starch Press. ISBN 9781593270315. Archived from the original on January 25, 2023. Retrieved October 28, 2021 – via Google Books.
  3. ^ Loki Software; Hall, John R. (October 22, 2001). Programming Linux Games. No Starch Press. ISBN 9781886411494. Archived from the original on July 11, 2024. Retrieved October 28, 2021 – via Google Books.
  4. ^ "zlib 1.3.1 Manual". zlib.net. Archived from the original on 8 February 2024.
  5. ^ RFC 1950. doi:10.17487/RFC1950.
  6. ^ Overall conventions. p. 3. doi:10.17487/RFC1950. RFC 1950.
  7. ^ Data format. p. 4. doi:10.17487/RFC1950. RFC 1950.
  8. ^ RFC 1951. doi:10.17487/RFC1951.
  9. ^ Gailly, Jean-loup; Adler, Mark (2002-04-18), zlib Applications, archived from the original on 2019-06-10, retrieved 2007-12-09
  10. ^ "c++ - Why does cURL use zlib?". Stack Overflow.
  11. ^ "ORCFile in HDP 2: Better Compression, Better Performance". 6 September 2013. Archived from the original on 6 August 2020. Retrieved 16 May 2020.
  12. ^ System.IO.Compression.DeflateStream Archived 2016-02-06 at the Wayback Machine. MSDN Library.
  13. ^ zlib-flate(1) – Debian General Commands Manual
  14. ^ "qpdf". Archived from the original on 2024-07-11. Retrieved 2021-03-28.
  15. ^ "MySQL :: MySQL 8.0 Reference Manual :: 15.9.1.5 How Compression Works for InnoDB Tables". dev.mysql.com. Archived from the original on 2021-08-01. Retrieved 2021-07-14.
  16. ^ "github Intel ZLIB". github. 25 January 2024. Archived from the original on 25 January 2024. Retrieved 25 January 2024.
  17. ^ "Fighting Cancer: The Unexpected Benefit Of Open Sourcing Our Code". The Cloudflare Blog. 8 July 2015. Archived from the original on 21 September 2019. Retrieved 18 January 2020.
  18. ^ Rosbach, HK (24 February 2023). "zlib-ng/zlib-ng: zlib replacement with optimizations for "next generation" systems". GitHub. zlib-ng. Archived from the original on 20 April 2020. Retrieved 18 January 2020.
[edit]