xref: /third_party/python/Doc/library/struct.rst (revision 7db96d56)
17db96d56Sopenharmony_ci:mod:`struct` --- Interpret bytes as packed binary data
27db96d56Sopenharmony_ci=======================================================
37db96d56Sopenharmony_ci
47db96d56Sopenharmony_ci.. module:: struct
57db96d56Sopenharmony_ci   :synopsis: Interpret bytes as packed binary data.
67db96d56Sopenharmony_ci
77db96d56Sopenharmony_ci**Source code:** :source:`Lib/struct.py`
87db96d56Sopenharmony_ci
97db96d56Sopenharmony_ci.. index::
107db96d56Sopenharmony_ci   pair: C; structures
117db96d56Sopenharmony_ci   triple: packing; binary; data
127db96d56Sopenharmony_ci
137db96d56Sopenharmony_ci--------------
147db96d56Sopenharmony_ci
157db96d56Sopenharmony_ciThis module converts between Python values and C structs represented
167db96d56Sopenharmony_cias Python :class:`bytes` objects.  Compact :ref:`format strings <struct-format-strings>`
177db96d56Sopenharmony_cidescribe the intended conversions to/from Python values.
187db96d56Sopenharmony_ciThe module's functions and objects can be used for two largely
197db96d56Sopenharmony_cidistinct applications, data exchange with external sources (files or
207db96d56Sopenharmony_cinetwork connections), or data transfer between the Python application
217db96d56Sopenharmony_ciand the C layer.
227db96d56Sopenharmony_ci
237db96d56Sopenharmony_ci.. note::
247db96d56Sopenharmony_ci
257db96d56Sopenharmony_ci   When no prefix character is given, native mode is the default. It
267db96d56Sopenharmony_ci   packs or unpacks data based on the platform and compiler on which
277db96d56Sopenharmony_ci   the Python interpreter was built.
287db96d56Sopenharmony_ci   The result of packing a given C struct includes pad bytes which
297db96d56Sopenharmony_ci   maintain proper alignment for the C types involved; similarly,
307db96d56Sopenharmony_ci   alignment is taken into account when unpacking.  In contrast, when
317db96d56Sopenharmony_ci   communicating data between external sources, the programmer is
327db96d56Sopenharmony_ci   responsible for defining byte ordering and padding between elements.
337db96d56Sopenharmony_ci   See :ref:`struct-alignment` for details.
347db96d56Sopenharmony_ci
357db96d56Sopenharmony_ciSeveral :mod:`struct` functions (and methods of :class:`Struct`) take a *buffer*
367db96d56Sopenharmony_ciargument.  This refers to objects that implement the :ref:`bufferobjects` and
377db96d56Sopenharmony_ciprovide either a readable or read-writable buffer.  The most common types used
387db96d56Sopenharmony_cifor that purpose are :class:`bytes` and :class:`bytearray`, but many other types
397db96d56Sopenharmony_cithat can be viewed as an array of bytes implement the buffer protocol, so that
407db96d56Sopenharmony_cithey can be read/filled without additional copying from a :class:`bytes` object.
417db96d56Sopenharmony_ci
427db96d56Sopenharmony_ci
437db96d56Sopenharmony_ciFunctions and Exceptions
447db96d56Sopenharmony_ci------------------------
457db96d56Sopenharmony_ci
467db96d56Sopenharmony_ciThe module defines the following exception and functions:
477db96d56Sopenharmony_ci
487db96d56Sopenharmony_ci
497db96d56Sopenharmony_ci.. exception:: error
507db96d56Sopenharmony_ci
517db96d56Sopenharmony_ci   Exception raised on various occasions; argument is a string describing what
527db96d56Sopenharmony_ci   is wrong.
537db96d56Sopenharmony_ci
547db96d56Sopenharmony_ci
557db96d56Sopenharmony_ci.. function:: pack(format, v1, v2, ...)
567db96d56Sopenharmony_ci
577db96d56Sopenharmony_ci   Return a bytes object containing the values *v1*, *v2*, ... packed according
587db96d56Sopenharmony_ci   to the format string *format*.  The arguments must match the values required by
597db96d56Sopenharmony_ci   the format exactly.
607db96d56Sopenharmony_ci
617db96d56Sopenharmony_ci
627db96d56Sopenharmony_ci.. function:: pack_into(format, buffer, offset, v1, v2, ...)
637db96d56Sopenharmony_ci
647db96d56Sopenharmony_ci   Pack the values *v1*, *v2*, ... according to the format string *format* and
657db96d56Sopenharmony_ci   write the packed bytes into the writable buffer *buffer* starting at
667db96d56Sopenharmony_ci   position *offset*.  Note that *offset* is a required argument.
677db96d56Sopenharmony_ci
687db96d56Sopenharmony_ci
697db96d56Sopenharmony_ci.. function:: unpack(format, buffer)
707db96d56Sopenharmony_ci
717db96d56Sopenharmony_ci   Unpack from the buffer *buffer* (presumably packed by ``pack(format, ...)``)
727db96d56Sopenharmony_ci   according to the format string *format*.  The result is a tuple even if it
737db96d56Sopenharmony_ci   contains exactly one item.  The buffer's size in bytes must match the
747db96d56Sopenharmony_ci   size required by the format, as reflected by :func:`calcsize`.
757db96d56Sopenharmony_ci
767db96d56Sopenharmony_ci
777db96d56Sopenharmony_ci.. function:: unpack_from(format, /, buffer, offset=0)
787db96d56Sopenharmony_ci
797db96d56Sopenharmony_ci   Unpack from *buffer* starting at position *offset*, according to the format
807db96d56Sopenharmony_ci   string *format*.  The result is a tuple even if it contains exactly one
817db96d56Sopenharmony_ci   item.  The buffer's size in bytes, starting at position *offset*, must be at
827db96d56Sopenharmony_ci   least the size required by the format, as reflected by :func:`calcsize`.
837db96d56Sopenharmony_ci
847db96d56Sopenharmony_ci
857db96d56Sopenharmony_ci.. function:: iter_unpack(format, buffer)
867db96d56Sopenharmony_ci
877db96d56Sopenharmony_ci   Iteratively unpack from the buffer *buffer* according to the format
887db96d56Sopenharmony_ci   string *format*.  This function returns an iterator which will read
897db96d56Sopenharmony_ci   equally sized chunks from the buffer until all its contents have been
907db96d56Sopenharmony_ci   consumed.  The buffer's size in bytes must be a multiple of the size
917db96d56Sopenharmony_ci   required by the format, as reflected by :func:`calcsize`.
927db96d56Sopenharmony_ci
937db96d56Sopenharmony_ci   Each iteration yields a tuple as specified by the format string.
947db96d56Sopenharmony_ci
957db96d56Sopenharmony_ci   .. versionadded:: 3.4
967db96d56Sopenharmony_ci
977db96d56Sopenharmony_ci
987db96d56Sopenharmony_ci.. function:: calcsize(format)
997db96d56Sopenharmony_ci
1007db96d56Sopenharmony_ci   Return the size of the struct (and hence of the bytes object produced by
1017db96d56Sopenharmony_ci   ``pack(format, ...)``) corresponding to the format string *format*.
1027db96d56Sopenharmony_ci
1037db96d56Sopenharmony_ci
1047db96d56Sopenharmony_ci.. _struct-format-strings:
1057db96d56Sopenharmony_ci
1067db96d56Sopenharmony_ciFormat Strings
1077db96d56Sopenharmony_ci--------------
1087db96d56Sopenharmony_ci
1097db96d56Sopenharmony_ciFormat strings describe the data layout when
1107db96d56Sopenharmony_cipacking and unpacking data.  They are built up from :ref:`format characters<format-characters>`,
1117db96d56Sopenharmony_ciwhich specify the type of data being packed/unpacked.  In addition,
1127db96d56Sopenharmony_cispecial characters control the :ref:`byte order, size and alignment<struct-alignment>`.
1137db96d56Sopenharmony_ciEach format string consists of an optional prefix character which
1147db96d56Sopenharmony_cidescribes the overall properties of the data and one or more format
1157db96d56Sopenharmony_cicharacters which describe the actual data values and padding.
1167db96d56Sopenharmony_ci
1177db96d56Sopenharmony_ci
1187db96d56Sopenharmony_ci.. _struct-alignment:
1197db96d56Sopenharmony_ci
1207db96d56Sopenharmony_ciByte Order, Size, and Alignment
1217db96d56Sopenharmony_ci^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
1227db96d56Sopenharmony_ci
1237db96d56Sopenharmony_ciBy default, C types are represented in the machine's native format and byte
1247db96d56Sopenharmony_ciorder, and properly aligned by skipping pad bytes if necessary (according to the
1257db96d56Sopenharmony_cirules used by the C compiler).
1267db96d56Sopenharmony_ciThis behavior is chosen so
1277db96d56Sopenharmony_cithat the bytes of a packed struct correspond exactly to the memory layout
1287db96d56Sopenharmony_ciof the corresponding C struct.
1297db96d56Sopenharmony_ciWhether to use native byte ordering
1307db96d56Sopenharmony_ciand padding or standard formats depends on the application.
1317db96d56Sopenharmony_ci
1327db96d56Sopenharmony_ci.. index::
1337db96d56Sopenharmony_ci   single: @ (at); in struct format strings
1347db96d56Sopenharmony_ci   single: = (equals); in struct format strings
1357db96d56Sopenharmony_ci   single: < (less); in struct format strings
1367db96d56Sopenharmony_ci   single: > (greater); in struct format strings
1377db96d56Sopenharmony_ci   single: ! (exclamation); in struct format strings
1387db96d56Sopenharmony_ci
1397db96d56Sopenharmony_ciAlternatively, the first character of the format string can be used to indicate
1407db96d56Sopenharmony_cithe byte order, size and alignment of the packed data, according to the
1417db96d56Sopenharmony_cifollowing table:
1427db96d56Sopenharmony_ci
1437db96d56Sopenharmony_ci+-----------+------------------------+----------+-----------+
1447db96d56Sopenharmony_ci| Character | Byte order             | Size     | Alignment |
1457db96d56Sopenharmony_ci+===========+========================+==========+===========+
1467db96d56Sopenharmony_ci| ``@``     | native                 | native   | native    |
1477db96d56Sopenharmony_ci+-----------+------------------------+----------+-----------+
1487db96d56Sopenharmony_ci| ``=``     | native                 | standard | none      |
1497db96d56Sopenharmony_ci+-----------+------------------------+----------+-----------+
1507db96d56Sopenharmony_ci| ``<``     | little-endian          | standard | none      |
1517db96d56Sopenharmony_ci+-----------+------------------------+----------+-----------+
1527db96d56Sopenharmony_ci| ``>``     | big-endian             | standard | none      |
1537db96d56Sopenharmony_ci+-----------+------------------------+----------+-----------+
1547db96d56Sopenharmony_ci| ``!``     | network (= big-endian) | standard | none      |
1557db96d56Sopenharmony_ci+-----------+------------------------+----------+-----------+
1567db96d56Sopenharmony_ci
1577db96d56Sopenharmony_ciIf the first character is not one of these, ``'@'`` is assumed.
1587db96d56Sopenharmony_ci
1597db96d56Sopenharmony_ciNative byte order is big-endian or little-endian, depending on the
1607db96d56Sopenharmony_cihost system. For example, Intel x86, AMD64 (x86-64), and Apple M1 are
1617db96d56Sopenharmony_cilittle-endian; IBM z and many legacy architectures are big-endian.
1627db96d56Sopenharmony_ciUse :data:`sys.byteorder` to check the endianness of your system.
1637db96d56Sopenharmony_ci
1647db96d56Sopenharmony_ciNative size and alignment are determined using the C compiler's
1657db96d56Sopenharmony_ci``sizeof`` expression.  This is always combined with native byte order.
1667db96d56Sopenharmony_ci
1677db96d56Sopenharmony_ciStandard size depends only on the format character;  see the table in
1687db96d56Sopenharmony_cithe :ref:`format-characters` section.
1697db96d56Sopenharmony_ci
1707db96d56Sopenharmony_ciNote the difference between ``'@'`` and ``'='``: both use native byte order, but
1717db96d56Sopenharmony_cithe size and alignment of the latter is standardized.
1727db96d56Sopenharmony_ci
1737db96d56Sopenharmony_ciThe form ``'!'`` represents the network byte order which is always big-endian
1747db96d56Sopenharmony_cias defined in `IETF RFC 1700 <IETF RFC 1700_>`_.
1757db96d56Sopenharmony_ci
1767db96d56Sopenharmony_ciThere is no way to indicate non-native byte order (force byte-swapping); use the
1777db96d56Sopenharmony_ciappropriate choice of ``'<'`` or ``'>'``.
1787db96d56Sopenharmony_ci
1797db96d56Sopenharmony_ciNotes:
1807db96d56Sopenharmony_ci
1817db96d56Sopenharmony_ci(1) Padding is only automatically added between successive structure members.
1827db96d56Sopenharmony_ci    No padding is added at the beginning or the end of the encoded struct.
1837db96d56Sopenharmony_ci
1847db96d56Sopenharmony_ci(2) No padding is added when using non-native size and alignment, e.g.
1857db96d56Sopenharmony_ci    with '<', '>', '=', and '!'.
1867db96d56Sopenharmony_ci
1877db96d56Sopenharmony_ci(3) To align the end of a structure to the alignment requirement of a
1887db96d56Sopenharmony_ci    particular type, end the format with the code for that type with a repeat
1897db96d56Sopenharmony_ci    count of zero.  See :ref:`struct-examples`.
1907db96d56Sopenharmony_ci
1917db96d56Sopenharmony_ci
1927db96d56Sopenharmony_ci.. _format-characters:
1937db96d56Sopenharmony_ci
1947db96d56Sopenharmony_ciFormat Characters
1957db96d56Sopenharmony_ci^^^^^^^^^^^^^^^^^
1967db96d56Sopenharmony_ci
1977db96d56Sopenharmony_ciFormat characters have the following meaning; the conversion between C and
1987db96d56Sopenharmony_ciPython values should be obvious given their types.  The 'Standard size' column
1997db96d56Sopenharmony_cirefers to the size of the packed value in bytes when using standard size; that
2007db96d56Sopenharmony_ciis, when the format string starts with one of ``'<'``, ``'>'``, ``'!'`` or
2017db96d56Sopenharmony_ci``'='``.  When using native size, the size of the packed value is
2027db96d56Sopenharmony_ciplatform-dependent.
2037db96d56Sopenharmony_ci
2047db96d56Sopenharmony_ci+--------+--------------------------+--------------------+----------------+------------+
2057db96d56Sopenharmony_ci| Format | C Type                   | Python type        | Standard size  | Notes      |
2067db96d56Sopenharmony_ci+========+==========================+====================+================+============+
2077db96d56Sopenharmony_ci| ``x``  | pad byte                 | no value           |                | \(7)       |
2087db96d56Sopenharmony_ci+--------+--------------------------+--------------------+----------------+------------+
2097db96d56Sopenharmony_ci| ``c``  | :c:expr:`char`           | bytes of length 1  | 1              |            |
2107db96d56Sopenharmony_ci+--------+--------------------------+--------------------+----------------+------------+
2117db96d56Sopenharmony_ci| ``b``  | :c:expr:`signed char`    | integer            | 1              | \(1), \(2) |
2127db96d56Sopenharmony_ci+--------+--------------------------+--------------------+----------------+------------+
2137db96d56Sopenharmony_ci| ``B``  | :c:expr:`unsigned char`  | integer            | 1              | \(2)       |
2147db96d56Sopenharmony_ci+--------+--------------------------+--------------------+----------------+------------+
2157db96d56Sopenharmony_ci| ``?``  | :c:expr:`_Bool`          | bool               | 1              | \(1)       |
2167db96d56Sopenharmony_ci+--------+--------------------------+--------------------+----------------+------------+
2177db96d56Sopenharmony_ci| ``h``  | :c:expr:`short`          | integer            | 2              | \(2)       |
2187db96d56Sopenharmony_ci+--------+--------------------------+--------------------+----------------+------------+
2197db96d56Sopenharmony_ci| ``H``  | :c:expr:`unsigned short` | integer            | 2              | \(2)       |
2207db96d56Sopenharmony_ci+--------+--------------------------+--------------------+----------------+------------+
2217db96d56Sopenharmony_ci| ``i``  | :c:expr:`int`            | integer            | 4              | \(2)       |
2227db96d56Sopenharmony_ci+--------+--------------------------+--------------------+----------------+------------+
2237db96d56Sopenharmony_ci| ``I``  | :c:expr:`unsigned int`   | integer            | 4              | \(2)       |
2247db96d56Sopenharmony_ci+--------+--------------------------+--------------------+----------------+------------+
2257db96d56Sopenharmony_ci| ``l``  | :c:expr:`long`           | integer            | 4              | \(2)       |
2267db96d56Sopenharmony_ci+--------+--------------------------+--------------------+----------------+------------+
2277db96d56Sopenharmony_ci| ``L``  | :c:expr:`unsigned long`  | integer            | 4              | \(2)       |
2287db96d56Sopenharmony_ci+--------+--------------------------+--------------------+----------------+------------+
2297db96d56Sopenharmony_ci| ``q``  | :c:expr:`long long`      | integer            | 8              | \(2)       |
2307db96d56Sopenharmony_ci+--------+--------------------------+--------------------+----------------+------------+
2317db96d56Sopenharmony_ci| ``Q``  | :c:expr:`unsigned long   | integer            | 8              | \(2)       |
2327db96d56Sopenharmony_ci|        | long`                    |                    |                |            |
2337db96d56Sopenharmony_ci+--------+--------------------------+--------------------+----------------+------------+
2347db96d56Sopenharmony_ci| ``n``  | :c:expr:`ssize_t`        | integer            |                | \(3)       |
2357db96d56Sopenharmony_ci+--------+--------------------------+--------------------+----------------+------------+
2367db96d56Sopenharmony_ci| ``N``  | :c:expr:`size_t`         | integer            |                | \(3)       |
2377db96d56Sopenharmony_ci+--------+--------------------------+--------------------+----------------+------------+
2387db96d56Sopenharmony_ci| ``e``  | \(6)                     | float              | 2              | \(4)       |
2397db96d56Sopenharmony_ci+--------+--------------------------+--------------------+----------------+------------+
2407db96d56Sopenharmony_ci| ``f``  | :c:expr:`float`          | float              | 4              | \(4)       |
2417db96d56Sopenharmony_ci+--------+--------------------------+--------------------+----------------+------------+
2427db96d56Sopenharmony_ci| ``d``  | :c:expr:`double`         | float              | 8              | \(4)       |
2437db96d56Sopenharmony_ci+--------+--------------------------+--------------------+----------------+------------+
2447db96d56Sopenharmony_ci| ``s``  | :c:expr:`char[]`         | bytes              |                | \(9)       |
2457db96d56Sopenharmony_ci+--------+--------------------------+--------------------+----------------+------------+
2467db96d56Sopenharmony_ci| ``p``  | :c:expr:`char[]`         | bytes              |                | \(8)       |
2477db96d56Sopenharmony_ci+--------+--------------------------+--------------------+----------------+------------+
2487db96d56Sopenharmony_ci| ``P``  | :c:expr:`void \*`        | integer            |                | \(5)       |
2497db96d56Sopenharmony_ci+--------+--------------------------+--------------------+----------------+------------+
2507db96d56Sopenharmony_ci
2517db96d56Sopenharmony_ci.. versionchanged:: 3.3
2527db96d56Sopenharmony_ci   Added support for the ``'n'`` and ``'N'`` formats.
2537db96d56Sopenharmony_ci
2547db96d56Sopenharmony_ci.. versionchanged:: 3.6
2557db96d56Sopenharmony_ci   Added support for the ``'e'`` format.
2567db96d56Sopenharmony_ci
2577db96d56Sopenharmony_ci
2587db96d56Sopenharmony_ciNotes:
2597db96d56Sopenharmony_ci
2607db96d56Sopenharmony_ci(1)
2617db96d56Sopenharmony_ci   .. index:: single: ? (question mark); in struct format strings
2627db96d56Sopenharmony_ci
2637db96d56Sopenharmony_ci   The ``'?'`` conversion code corresponds to the :c:expr:`_Bool` type defined by
2647db96d56Sopenharmony_ci   C99. If this type is not available, it is simulated using a :c:expr:`char`. In
2657db96d56Sopenharmony_ci   standard mode, it is always represented by one byte.
2667db96d56Sopenharmony_ci
2677db96d56Sopenharmony_ci(2)
2687db96d56Sopenharmony_ci   When attempting to pack a non-integer using any of the integer conversion
2697db96d56Sopenharmony_ci   codes, if the non-integer has a :meth:`__index__` method then that method is
2707db96d56Sopenharmony_ci   called to convert the argument to an integer before packing.
2717db96d56Sopenharmony_ci
2727db96d56Sopenharmony_ci   .. versionchanged:: 3.2
2737db96d56Sopenharmony_ci      Added use of the :meth:`__index__` method for non-integers.
2747db96d56Sopenharmony_ci
2757db96d56Sopenharmony_ci(3)
2767db96d56Sopenharmony_ci   The ``'n'`` and ``'N'`` conversion codes are only available for the native
2777db96d56Sopenharmony_ci   size (selected as the default or with the ``'@'`` byte order character).
2787db96d56Sopenharmony_ci   For the standard size, you can use whichever of the other integer formats
2797db96d56Sopenharmony_ci   fits your application.
2807db96d56Sopenharmony_ci
2817db96d56Sopenharmony_ci(4)
2827db96d56Sopenharmony_ci   For the ``'f'``, ``'d'`` and ``'e'`` conversion codes, the packed
2837db96d56Sopenharmony_ci   representation uses the IEEE 754 binary32, binary64 or binary16 format (for
2847db96d56Sopenharmony_ci   ``'f'``, ``'d'`` or ``'e'`` respectively), regardless of the floating-point
2857db96d56Sopenharmony_ci   format used by the platform.
2867db96d56Sopenharmony_ci
2877db96d56Sopenharmony_ci(5)
2887db96d56Sopenharmony_ci   The ``'P'`` format character is only available for the native byte ordering
2897db96d56Sopenharmony_ci   (selected as the default or with the ``'@'`` byte order character). The byte
2907db96d56Sopenharmony_ci   order character ``'='`` chooses to use little- or big-endian ordering based
2917db96d56Sopenharmony_ci   on the host system. The struct module does not interpret this as native
2927db96d56Sopenharmony_ci   ordering, so the ``'P'`` format is not available.
2937db96d56Sopenharmony_ci
2947db96d56Sopenharmony_ci(6)
2957db96d56Sopenharmony_ci   The IEEE 754 binary16 "half precision" type was introduced in the 2008
2967db96d56Sopenharmony_ci   revision of the `IEEE 754 standard <ieee 754 standard_>`_. It has a sign
2977db96d56Sopenharmony_ci   bit, a 5-bit exponent and 11-bit precision (with 10 bits explicitly stored),
2987db96d56Sopenharmony_ci   and can represent numbers between approximately ``6.1e-05`` and ``6.5e+04``
2997db96d56Sopenharmony_ci   at full precision. This type is not widely supported by C compilers: on a
3007db96d56Sopenharmony_ci   typical machine, an unsigned short can be used for storage, but not for math
3017db96d56Sopenharmony_ci   operations. See the Wikipedia page on the `half-precision floating-point
3027db96d56Sopenharmony_ci   format <half precision format_>`_ for more information.
3037db96d56Sopenharmony_ci
3047db96d56Sopenharmony_ci(7)
3057db96d56Sopenharmony_ci   When packing, ``'x'`` inserts one NUL byte.
3067db96d56Sopenharmony_ci
3077db96d56Sopenharmony_ci(8)
3087db96d56Sopenharmony_ci   The ``'p'`` format character encodes a "Pascal string", meaning a short
3097db96d56Sopenharmony_ci   variable-length string stored in a *fixed number of bytes*, given by the count.
3107db96d56Sopenharmony_ci   The first byte stored is the length of the string, or 255, whichever is
3117db96d56Sopenharmony_ci   smaller.  The bytes of the string follow.  If the string passed in to
3127db96d56Sopenharmony_ci   :func:`pack` is too long (longer than the count minus 1), only the leading
3137db96d56Sopenharmony_ci   ``count-1`` bytes of the string are stored.  If the string is shorter than
3147db96d56Sopenharmony_ci   ``count-1``, it is padded with null bytes so that exactly count bytes in all
3157db96d56Sopenharmony_ci   are used.  Note that for :func:`unpack`, the ``'p'`` format character consumes
3167db96d56Sopenharmony_ci   ``count`` bytes, but that the string returned can never contain more than 255
3177db96d56Sopenharmony_ci   bytes.
3187db96d56Sopenharmony_ci
3197db96d56Sopenharmony_ci(9)
3207db96d56Sopenharmony_ci   For the ``'s'`` format character, the count is interpreted as the length of the
3217db96d56Sopenharmony_ci   bytes, not a repeat count like for the other format characters; for example,
3227db96d56Sopenharmony_ci   ``'10s'`` means a single 10-byte string mapping to or from a single
3237db96d56Sopenharmony_ci   Python byte string, while ``'10c'`` means 10
3247db96d56Sopenharmony_ci   separate one byte character elements (e.g., ``cccccccccc``) mapping
3257db96d56Sopenharmony_ci   to or from ten different Python byte objects. (See :ref:`struct-examples`
3267db96d56Sopenharmony_ci   for a concrete demonstration of the difference.)
3277db96d56Sopenharmony_ci   If a count is not given, it defaults to 1.  For packing, the string is
3287db96d56Sopenharmony_ci   truncated or padded with null bytes as appropriate to make it fit. For
3297db96d56Sopenharmony_ci   unpacking, the resulting bytes object always has exactly the specified number
3307db96d56Sopenharmony_ci   of bytes.  As a special case, ``'0s'`` means a single, empty string (while
3317db96d56Sopenharmony_ci   ``'0c'`` means 0 characters).
3327db96d56Sopenharmony_ci
3337db96d56Sopenharmony_ciA format character may be preceded by an integral repeat count.  For example,
3347db96d56Sopenharmony_cithe format string ``'4h'`` means exactly the same as ``'hhhh'``.
3357db96d56Sopenharmony_ci
3367db96d56Sopenharmony_ciWhitespace characters between formats are ignored; a count and its format must
3377db96d56Sopenharmony_cinot contain whitespace though.
3387db96d56Sopenharmony_ci
3397db96d56Sopenharmony_ciWhen packing a value ``x`` using one of the integer formats (``'b'``,
3407db96d56Sopenharmony_ci``'B'``, ``'h'``, ``'H'``, ``'i'``, ``'I'``, ``'l'``, ``'L'``,
3417db96d56Sopenharmony_ci``'q'``, ``'Q'``), if ``x`` is outside the valid range for that format
3427db96d56Sopenharmony_cithen :exc:`struct.error` is raised.
3437db96d56Sopenharmony_ci
3447db96d56Sopenharmony_ci.. versionchanged:: 3.1
3457db96d56Sopenharmony_ci   Previously, some of the integer formats wrapped out-of-range values and
3467db96d56Sopenharmony_ci   raised :exc:`DeprecationWarning` instead of :exc:`struct.error`.
3477db96d56Sopenharmony_ci
3487db96d56Sopenharmony_ci.. index:: single: ? (question mark); in struct format strings
3497db96d56Sopenharmony_ci
3507db96d56Sopenharmony_ciFor the ``'?'`` format character, the return value is either :const:`True` or
3517db96d56Sopenharmony_ci:const:`False`. When packing, the truth value of the argument object is used.
3527db96d56Sopenharmony_ciEither 0 or 1 in the native or standard bool representation will be packed, and
3537db96d56Sopenharmony_ciany non-zero value will be ``True`` when unpacking.
3547db96d56Sopenharmony_ci
3557db96d56Sopenharmony_ci
3567db96d56Sopenharmony_ci
3577db96d56Sopenharmony_ci.. _struct-examples:
3587db96d56Sopenharmony_ci
3597db96d56Sopenharmony_ciExamples
3607db96d56Sopenharmony_ci^^^^^^^^
3617db96d56Sopenharmony_ci
3627db96d56Sopenharmony_ci.. note::
3637db96d56Sopenharmony_ci   Native byte order examples (designated by the ``'@'`` format prefix or
3647db96d56Sopenharmony_ci   lack of any prefix character) may not match what the reader's
3657db96d56Sopenharmony_ci   machine produces as
3667db96d56Sopenharmony_ci   that depends on the platform and compiler.
3677db96d56Sopenharmony_ci
3687db96d56Sopenharmony_ciPack and unpack integers of three different sizes, using big endian
3697db96d56Sopenharmony_ciordering::
3707db96d56Sopenharmony_ci
3717db96d56Sopenharmony_ci    >>> from struct import *
3727db96d56Sopenharmony_ci    >>> pack(">bhl", 1, 2, 3)
3737db96d56Sopenharmony_ci    b'\x01\x00\x02\x00\x00\x00\x03'
3747db96d56Sopenharmony_ci    >>> unpack('>bhl', b'\x01\x00\x02\x00\x00\x00\x03')
3757db96d56Sopenharmony_ci    (1, 2, 3)
3767db96d56Sopenharmony_ci    >>> calcsize('>bhl')
3777db96d56Sopenharmony_ci    7
3787db96d56Sopenharmony_ci
3797db96d56Sopenharmony_ciAttempt to pack an integer which is too large for the defined field::
3807db96d56Sopenharmony_ci
3817db96d56Sopenharmony_ci    >>> pack(">h", 99999)
3827db96d56Sopenharmony_ci    Traceback (most recent call last):
3837db96d56Sopenharmony_ci      File "<stdin>", line 1, in <module>
3847db96d56Sopenharmony_ci    struct.error: 'h' format requires -32768 <= number <= 32767
3857db96d56Sopenharmony_ci
3867db96d56Sopenharmony_ciDemonstrate the difference between ``'s'`` and ``'c'`` format
3877db96d56Sopenharmony_cicharacters::
3887db96d56Sopenharmony_ci
3897db96d56Sopenharmony_ci    >>> pack("@ccc", b'1', b'2', b'3')
3907db96d56Sopenharmony_ci    b'123'
3917db96d56Sopenharmony_ci    >>> pack("@3s", b'123')
3927db96d56Sopenharmony_ci    b'123'
3937db96d56Sopenharmony_ci
3947db96d56Sopenharmony_ciUnpacked fields can be named by assigning them to variables or by wrapping
3957db96d56Sopenharmony_cithe result in a named tuple::
3967db96d56Sopenharmony_ci
3977db96d56Sopenharmony_ci    >>> record = b'raymond   \x32\x12\x08\x01\x08'
3987db96d56Sopenharmony_ci    >>> name, serialnum, school, gradelevel = unpack('<10sHHb', record)
3997db96d56Sopenharmony_ci
4007db96d56Sopenharmony_ci    >>> from collections import namedtuple
4017db96d56Sopenharmony_ci    >>> Student = namedtuple('Student', 'name serialnum school gradelevel')
4027db96d56Sopenharmony_ci    >>> Student._make(unpack('<10sHHb', record))
4037db96d56Sopenharmony_ci    Student(name=b'raymond   ', serialnum=4658, school=264, gradelevel=8)
4047db96d56Sopenharmony_ci
4057db96d56Sopenharmony_ciThe ordering of format characters may have an impact on size in native
4067db96d56Sopenharmony_cimode since padding is implicit. In standard mode, the user is
4077db96d56Sopenharmony_ciresponsible for inserting any desired padding.
4087db96d56Sopenharmony_ciNote in
4097db96d56Sopenharmony_cithe first ``pack`` call below that three NUL bytes were added after the
4107db96d56Sopenharmony_cipacked ``'#'`` to align the following integer on a four-byte boundary.
4117db96d56Sopenharmony_ciIn this example, the output was produced on a little endian machine::
4127db96d56Sopenharmony_ci
4137db96d56Sopenharmony_ci    >>> pack('@ci', b'#', 0x12131415)
4147db96d56Sopenharmony_ci    b'#\x00\x00\x00\x15\x14\x13\x12'
4157db96d56Sopenharmony_ci    >>> pack('@ic', 0x12131415, b'#')
4167db96d56Sopenharmony_ci    b'\x15\x14\x13\x12#'
4177db96d56Sopenharmony_ci    >>> calcsize('@ci')
4187db96d56Sopenharmony_ci    8
4197db96d56Sopenharmony_ci    >>> calcsize('@ic')
4207db96d56Sopenharmony_ci    5
4217db96d56Sopenharmony_ci
4227db96d56Sopenharmony_ciThe following format ``'llh0l'`` results in two pad bytes being added
4237db96d56Sopenharmony_ciat the end, assuming the platform's longs are aligned on 4-byte boundaries::
4247db96d56Sopenharmony_ci
4257db96d56Sopenharmony_ci    >>> pack('@llh0l', 1, 2, 3)
4267db96d56Sopenharmony_ci    b'\x00\x00\x00\x01\x00\x00\x00\x02\x00\x03\x00\x00'
4277db96d56Sopenharmony_ci
4287db96d56Sopenharmony_ci
4297db96d56Sopenharmony_ci.. seealso::
4307db96d56Sopenharmony_ci
4317db96d56Sopenharmony_ci   Module :mod:`array`
4327db96d56Sopenharmony_ci      Packed binary storage of homogeneous data.
4337db96d56Sopenharmony_ci
4347db96d56Sopenharmony_ci   Module :mod:`json`
4357db96d56Sopenharmony_ci      JSON encoder and decoder.
4367db96d56Sopenharmony_ci
4377db96d56Sopenharmony_ci   Module :mod:`pickle`
4387db96d56Sopenharmony_ci      Python object serialization.
4397db96d56Sopenharmony_ci
4407db96d56Sopenharmony_ci
4417db96d56Sopenharmony_ci.. _applications:
4427db96d56Sopenharmony_ci
4437db96d56Sopenharmony_ciApplications
4447db96d56Sopenharmony_ci------------
4457db96d56Sopenharmony_ci
4467db96d56Sopenharmony_ciTwo main applications for the :mod:`struct` module exist, data
4477db96d56Sopenharmony_ciinterchange between Python and C code within an application or another
4487db96d56Sopenharmony_ciapplication compiled using the same compiler (:ref:`native formats<struct-native-formats>`), and
4497db96d56Sopenharmony_cidata interchange between applications using agreed upon data layout
4507db96d56Sopenharmony_ci(:ref:`standard formats<struct-standard-formats>`).  Generally speaking, the format strings
4517db96d56Sopenharmony_ciconstructed for these two domains are distinct.
4527db96d56Sopenharmony_ci
4537db96d56Sopenharmony_ci
4547db96d56Sopenharmony_ci.. _struct-native-formats:
4557db96d56Sopenharmony_ci
4567db96d56Sopenharmony_ciNative Formats
4577db96d56Sopenharmony_ci^^^^^^^^^^^^^^
4587db96d56Sopenharmony_ci
4597db96d56Sopenharmony_ciWhen constructing format strings which mimic native layouts, the
4607db96d56Sopenharmony_cicompiler and machine architecture determine byte ordering and padding.
4617db96d56Sopenharmony_ciIn such cases, the ``@`` format character should be used to specify
4627db96d56Sopenharmony_cinative byte ordering and data sizes.  Internal pad bytes are normally inserted
4637db96d56Sopenharmony_ciautomatically.  It is possible that a zero-repeat format code will be
4647db96d56Sopenharmony_cineeded at the end of a format string to round up to the correct
4657db96d56Sopenharmony_cibyte boundary for proper alignment of consective chunks of data.
4667db96d56Sopenharmony_ci
4677db96d56Sopenharmony_ciConsider these two simple examples (on a 64-bit, little-endian
4687db96d56Sopenharmony_cimachine)::
4697db96d56Sopenharmony_ci
4707db96d56Sopenharmony_ci    >>> calcsize('@lhl')
4717db96d56Sopenharmony_ci    24
4727db96d56Sopenharmony_ci    >>> calcsize('@llh')
4737db96d56Sopenharmony_ci    18
4747db96d56Sopenharmony_ci
4757db96d56Sopenharmony_ciData is not padded to an 8-byte boundary at the end of the second
4767db96d56Sopenharmony_ciformat string without the use of extra padding.  A zero-repeat format
4777db96d56Sopenharmony_cicode solves that problem::
4787db96d56Sopenharmony_ci
4797db96d56Sopenharmony_ci    >>> calcsize('@llh0l')
4807db96d56Sopenharmony_ci    24
4817db96d56Sopenharmony_ci
4827db96d56Sopenharmony_ciThe ``'x'`` format code can be used to specify the repeat, but for
4837db96d56Sopenharmony_cinative formats it is better to use a zero-repeat format like ``'0l'``.
4847db96d56Sopenharmony_ci
4857db96d56Sopenharmony_ciBy default, native byte ordering and alignment is used, but it is
4867db96d56Sopenharmony_cibetter to be explicit and use the ``'@'`` prefix character.
4877db96d56Sopenharmony_ci
4887db96d56Sopenharmony_ci
4897db96d56Sopenharmony_ci.. _struct-standard-formats:
4907db96d56Sopenharmony_ci
4917db96d56Sopenharmony_ciStandard Formats
4927db96d56Sopenharmony_ci^^^^^^^^^^^^^^^^
4937db96d56Sopenharmony_ci
4947db96d56Sopenharmony_ciWhen exchanging data beyond your process such as networking or storage,
4957db96d56Sopenharmony_cibe precise.  Specify the exact byte order, size, and alignment.  Do
4967db96d56Sopenharmony_cinot assume they match the native order of a particular machine.
4977db96d56Sopenharmony_ciFor example, network byte order is big-endian, while many popular CPUs
4987db96d56Sopenharmony_ciare little-endian.  By defining this explicitly, the user need not
4997db96d56Sopenharmony_cicare about the specifics of the platform their code is running on.
5007db96d56Sopenharmony_ciThe first character should typically be ``<`` or ``>``
5017db96d56Sopenharmony_ci(or ``!``).  Padding is the responsibility of the programmer.  The
5027db96d56Sopenharmony_cizero-repeat format character won't work.  Instead, the user must
5037db96d56Sopenharmony_ciexplicitly add ``'x'`` pad bytes where needed.  Revisiting the
5047db96d56Sopenharmony_ciexamples from the previous section, we have::
5057db96d56Sopenharmony_ci
5067db96d56Sopenharmony_ci    >>> calcsize('<qh6xq')
5077db96d56Sopenharmony_ci    24
5087db96d56Sopenharmony_ci    >>> pack('<qh6xq', 1, 2, 3) == pack('@lhl', 1, 2, 3)
5097db96d56Sopenharmony_ci    True
5107db96d56Sopenharmony_ci    >>> calcsize('@llh')
5117db96d56Sopenharmony_ci    18
5127db96d56Sopenharmony_ci    >>> pack('@llh', 1, 2, 3) == pack('<qqh', 1, 2, 3)
5137db96d56Sopenharmony_ci    True
5147db96d56Sopenharmony_ci    >>> calcsize('<qqh6x')
5157db96d56Sopenharmony_ci    24
5167db96d56Sopenharmony_ci    >>> calcsize('@llh0l')
5177db96d56Sopenharmony_ci    24
5187db96d56Sopenharmony_ci    >>> pack('@llh0l', 1, 2, 3) == pack('<qqh6x', 1, 2, 3)
5197db96d56Sopenharmony_ci    True
5207db96d56Sopenharmony_ci
5217db96d56Sopenharmony_ciThe above results (executed on a 64-bit machine) aren't guaranteed to
5227db96d56Sopenharmony_cimatch when executed on different machines.  For example, the examples
5237db96d56Sopenharmony_cibelow were executed on a 32-bit machine::
5247db96d56Sopenharmony_ci
5257db96d56Sopenharmony_ci    >>> calcsize('<qqh6x')
5267db96d56Sopenharmony_ci    24
5277db96d56Sopenharmony_ci    >>> calcsize('@llh0l')
5287db96d56Sopenharmony_ci    12
5297db96d56Sopenharmony_ci    >>> pack('@llh0l', 1, 2, 3) == pack('<qqh6x', 1, 2, 3)
5307db96d56Sopenharmony_ci    False
5317db96d56Sopenharmony_ci
5327db96d56Sopenharmony_ci
5337db96d56Sopenharmony_ci.. _struct-objects:
5347db96d56Sopenharmony_ci
5357db96d56Sopenharmony_ciClasses
5367db96d56Sopenharmony_ci-------
5377db96d56Sopenharmony_ci
5387db96d56Sopenharmony_ciThe :mod:`struct` module also defines the following type:
5397db96d56Sopenharmony_ci
5407db96d56Sopenharmony_ci
5417db96d56Sopenharmony_ci.. class:: Struct(format)
5427db96d56Sopenharmony_ci
5437db96d56Sopenharmony_ci   Return a new Struct object which writes and reads binary data according to
5447db96d56Sopenharmony_ci   the format string *format*.  Creating a ``Struct`` object once and calling its
5457db96d56Sopenharmony_ci   methods is more efficient than calling module-level functions with the
5467db96d56Sopenharmony_ci   same format since the format string is only compiled once.
5477db96d56Sopenharmony_ci
5487db96d56Sopenharmony_ci   .. note::
5497db96d56Sopenharmony_ci
5507db96d56Sopenharmony_ci      The compiled versions of the most recent format strings passed to
5517db96d56Sopenharmony_ci      :class:`Struct` and the module-level functions are cached, so programs
5527db96d56Sopenharmony_ci      that use only a few format strings needn't worry about reusing a single
5537db96d56Sopenharmony_ci      :class:`Struct` instance.
5547db96d56Sopenharmony_ci
5557db96d56Sopenharmony_ci   Compiled Struct objects support the following methods and attributes:
5567db96d56Sopenharmony_ci
5577db96d56Sopenharmony_ci   .. method:: pack(v1, v2, ...)
5587db96d56Sopenharmony_ci
5597db96d56Sopenharmony_ci      Identical to the :func:`pack` function, using the compiled format.
5607db96d56Sopenharmony_ci      (``len(result)`` will equal :attr:`size`.)
5617db96d56Sopenharmony_ci
5627db96d56Sopenharmony_ci
5637db96d56Sopenharmony_ci   .. method:: pack_into(buffer, offset, v1, v2, ...)
5647db96d56Sopenharmony_ci
5657db96d56Sopenharmony_ci      Identical to the :func:`pack_into` function, using the compiled format.
5667db96d56Sopenharmony_ci
5677db96d56Sopenharmony_ci
5687db96d56Sopenharmony_ci   .. method:: unpack(buffer)
5697db96d56Sopenharmony_ci
5707db96d56Sopenharmony_ci      Identical to the :func:`unpack` function, using the compiled format.
5717db96d56Sopenharmony_ci      The buffer's size in bytes must equal :attr:`size`.
5727db96d56Sopenharmony_ci
5737db96d56Sopenharmony_ci
5747db96d56Sopenharmony_ci   .. method:: unpack_from(buffer, offset=0)
5757db96d56Sopenharmony_ci
5767db96d56Sopenharmony_ci      Identical to the :func:`unpack_from` function, using the compiled format.
5777db96d56Sopenharmony_ci      The buffer's size in bytes, starting at position *offset*, must be at least
5787db96d56Sopenharmony_ci      :attr:`size`.
5797db96d56Sopenharmony_ci
5807db96d56Sopenharmony_ci
5817db96d56Sopenharmony_ci   .. method:: iter_unpack(buffer)
5827db96d56Sopenharmony_ci
5837db96d56Sopenharmony_ci      Identical to the :func:`iter_unpack` function, using the compiled format.
5847db96d56Sopenharmony_ci      The buffer's size in bytes must be a multiple of :attr:`size`.
5857db96d56Sopenharmony_ci
5867db96d56Sopenharmony_ci      .. versionadded:: 3.4
5877db96d56Sopenharmony_ci
5887db96d56Sopenharmony_ci   .. attribute:: format
5897db96d56Sopenharmony_ci
5907db96d56Sopenharmony_ci      The format string used to construct this Struct object.
5917db96d56Sopenharmony_ci
5927db96d56Sopenharmony_ci      .. versionchanged:: 3.7
5937db96d56Sopenharmony_ci         The format string type is now :class:`str` instead of :class:`bytes`.
5947db96d56Sopenharmony_ci
5957db96d56Sopenharmony_ci   .. attribute:: size
5967db96d56Sopenharmony_ci
5977db96d56Sopenharmony_ci      The calculated size of the struct (and hence of the bytes object produced
5987db96d56Sopenharmony_ci      by the :meth:`pack` method) corresponding to :attr:`format`.
5997db96d56Sopenharmony_ci
6007db96d56Sopenharmony_ci
6017db96d56Sopenharmony_ci.. _half precision format: https://en.wikipedia.org/wiki/Half-precision_floating-point_format
6027db96d56Sopenharmony_ci
6037db96d56Sopenharmony_ci.. _ieee 754 standard: https://en.wikipedia.org/wiki/IEEE_754-2008_revision
6047db96d56Sopenharmony_ci
6057db96d56Sopenharmony_ci.. _IETF RFC 1700: https://datatracker.ietf.org/doc/html/rfc1700
606