17db96d56Sopenharmony_ci:mod:`struct` --- Interpret bytes as packed binary data 27db96d56Sopenharmony_ci======================================================= 37db96d56Sopenharmony_ci 47db96d56Sopenharmony_ci.. module:: struct 57db96d56Sopenharmony_ci :synopsis: Interpret bytes as packed binary data. 67db96d56Sopenharmony_ci 77db96d56Sopenharmony_ci**Source code:** :source:`Lib/struct.py` 87db96d56Sopenharmony_ci 97db96d56Sopenharmony_ci.. index:: 107db96d56Sopenharmony_ci pair: C; structures 117db96d56Sopenharmony_ci triple: packing; binary; data 127db96d56Sopenharmony_ci 137db96d56Sopenharmony_ci-------------- 147db96d56Sopenharmony_ci 157db96d56Sopenharmony_ciThis module converts between Python values and C structs represented 167db96d56Sopenharmony_cias Python :class:`bytes` objects. Compact :ref:`format strings <struct-format-strings>` 177db96d56Sopenharmony_cidescribe the intended conversions to/from Python values. 187db96d56Sopenharmony_ciThe module's functions and objects can be used for two largely 197db96d56Sopenharmony_cidistinct applications, data exchange with external sources (files or 207db96d56Sopenharmony_cinetwork connections), or data transfer between the Python application 217db96d56Sopenharmony_ciand the C layer. 227db96d56Sopenharmony_ci 237db96d56Sopenharmony_ci.. note:: 247db96d56Sopenharmony_ci 257db96d56Sopenharmony_ci When no prefix character is given, native mode is the default. It 267db96d56Sopenharmony_ci packs or unpacks data based on the platform and compiler on which 277db96d56Sopenharmony_ci the Python interpreter was built. 287db96d56Sopenharmony_ci The result of packing a given C struct includes pad bytes which 297db96d56Sopenharmony_ci maintain proper alignment for the C types involved; similarly, 307db96d56Sopenharmony_ci alignment is taken into account when unpacking. In contrast, when 317db96d56Sopenharmony_ci communicating data between external sources, the programmer is 327db96d56Sopenharmony_ci responsible for defining byte ordering and padding between elements. 337db96d56Sopenharmony_ci See :ref:`struct-alignment` for details. 347db96d56Sopenharmony_ci 357db96d56Sopenharmony_ciSeveral :mod:`struct` functions (and methods of :class:`Struct`) take a *buffer* 367db96d56Sopenharmony_ciargument. This refers to objects that implement the :ref:`bufferobjects` and 377db96d56Sopenharmony_ciprovide either a readable or read-writable buffer. The most common types used 387db96d56Sopenharmony_cifor that purpose are :class:`bytes` and :class:`bytearray`, but many other types 397db96d56Sopenharmony_cithat can be viewed as an array of bytes implement the buffer protocol, so that 407db96d56Sopenharmony_cithey can be read/filled without additional copying from a :class:`bytes` object. 417db96d56Sopenharmony_ci 427db96d56Sopenharmony_ci 437db96d56Sopenharmony_ciFunctions and Exceptions 447db96d56Sopenharmony_ci------------------------ 457db96d56Sopenharmony_ci 467db96d56Sopenharmony_ciThe module defines the following exception and functions: 477db96d56Sopenharmony_ci 487db96d56Sopenharmony_ci 497db96d56Sopenharmony_ci.. exception:: error 507db96d56Sopenharmony_ci 517db96d56Sopenharmony_ci Exception raised on various occasions; argument is a string describing what 527db96d56Sopenharmony_ci is wrong. 537db96d56Sopenharmony_ci 547db96d56Sopenharmony_ci 557db96d56Sopenharmony_ci.. function:: pack(format, v1, v2, ...) 567db96d56Sopenharmony_ci 577db96d56Sopenharmony_ci Return a bytes object containing the values *v1*, *v2*, ... packed according 587db96d56Sopenharmony_ci to the format string *format*. The arguments must match the values required by 597db96d56Sopenharmony_ci the format exactly. 607db96d56Sopenharmony_ci 617db96d56Sopenharmony_ci 627db96d56Sopenharmony_ci.. function:: pack_into(format, buffer, offset, v1, v2, ...) 637db96d56Sopenharmony_ci 647db96d56Sopenharmony_ci Pack the values *v1*, *v2*, ... according to the format string *format* and 657db96d56Sopenharmony_ci write the packed bytes into the writable buffer *buffer* starting at 667db96d56Sopenharmony_ci position *offset*. Note that *offset* is a required argument. 677db96d56Sopenharmony_ci 687db96d56Sopenharmony_ci 697db96d56Sopenharmony_ci.. function:: unpack(format, buffer) 707db96d56Sopenharmony_ci 717db96d56Sopenharmony_ci Unpack from the buffer *buffer* (presumably packed by ``pack(format, ...)``) 727db96d56Sopenharmony_ci according to the format string *format*. The result is a tuple even if it 737db96d56Sopenharmony_ci contains exactly one item. The buffer's size in bytes must match the 747db96d56Sopenharmony_ci size required by the format, as reflected by :func:`calcsize`. 757db96d56Sopenharmony_ci 767db96d56Sopenharmony_ci 777db96d56Sopenharmony_ci.. function:: unpack_from(format, /, buffer, offset=0) 787db96d56Sopenharmony_ci 797db96d56Sopenharmony_ci Unpack from *buffer* starting at position *offset*, according to the format 807db96d56Sopenharmony_ci string *format*. The result is a tuple even if it contains exactly one 817db96d56Sopenharmony_ci item. The buffer's size in bytes, starting at position *offset*, must be at 827db96d56Sopenharmony_ci least the size required by the format, as reflected by :func:`calcsize`. 837db96d56Sopenharmony_ci 847db96d56Sopenharmony_ci 857db96d56Sopenharmony_ci.. function:: iter_unpack(format, buffer) 867db96d56Sopenharmony_ci 877db96d56Sopenharmony_ci Iteratively unpack from the buffer *buffer* according to the format 887db96d56Sopenharmony_ci string *format*. This function returns an iterator which will read 897db96d56Sopenharmony_ci equally sized chunks from the buffer until all its contents have been 907db96d56Sopenharmony_ci consumed. The buffer's size in bytes must be a multiple of the size 917db96d56Sopenharmony_ci required by the format, as reflected by :func:`calcsize`. 927db96d56Sopenharmony_ci 937db96d56Sopenharmony_ci Each iteration yields a tuple as specified by the format string. 947db96d56Sopenharmony_ci 957db96d56Sopenharmony_ci .. versionadded:: 3.4 967db96d56Sopenharmony_ci 977db96d56Sopenharmony_ci 987db96d56Sopenharmony_ci.. function:: calcsize(format) 997db96d56Sopenharmony_ci 1007db96d56Sopenharmony_ci Return the size of the struct (and hence of the bytes object produced by 1017db96d56Sopenharmony_ci ``pack(format, ...)``) corresponding to the format string *format*. 1027db96d56Sopenharmony_ci 1037db96d56Sopenharmony_ci 1047db96d56Sopenharmony_ci.. _struct-format-strings: 1057db96d56Sopenharmony_ci 1067db96d56Sopenharmony_ciFormat Strings 1077db96d56Sopenharmony_ci-------------- 1087db96d56Sopenharmony_ci 1097db96d56Sopenharmony_ciFormat strings describe the data layout when 1107db96d56Sopenharmony_cipacking and unpacking data. They are built up from :ref:`format characters<format-characters>`, 1117db96d56Sopenharmony_ciwhich specify the type of data being packed/unpacked. In addition, 1127db96d56Sopenharmony_cispecial characters control the :ref:`byte order, size and alignment<struct-alignment>`. 1137db96d56Sopenharmony_ciEach format string consists of an optional prefix character which 1147db96d56Sopenharmony_cidescribes the overall properties of the data and one or more format 1157db96d56Sopenharmony_cicharacters which describe the actual data values and padding. 1167db96d56Sopenharmony_ci 1177db96d56Sopenharmony_ci 1187db96d56Sopenharmony_ci.. _struct-alignment: 1197db96d56Sopenharmony_ci 1207db96d56Sopenharmony_ciByte Order, Size, and Alignment 1217db96d56Sopenharmony_ci^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ 1227db96d56Sopenharmony_ci 1237db96d56Sopenharmony_ciBy default, C types are represented in the machine's native format and byte 1247db96d56Sopenharmony_ciorder, and properly aligned by skipping pad bytes if necessary (according to the 1257db96d56Sopenharmony_cirules used by the C compiler). 1267db96d56Sopenharmony_ciThis behavior is chosen so 1277db96d56Sopenharmony_cithat the bytes of a packed struct correspond exactly to the memory layout 1287db96d56Sopenharmony_ciof the corresponding C struct. 1297db96d56Sopenharmony_ciWhether to use native byte ordering 1307db96d56Sopenharmony_ciand padding or standard formats depends on the application. 1317db96d56Sopenharmony_ci 1327db96d56Sopenharmony_ci.. index:: 1337db96d56Sopenharmony_ci single: @ (at); in struct format strings 1347db96d56Sopenharmony_ci single: = (equals); in struct format strings 1357db96d56Sopenharmony_ci single: < (less); in struct format strings 1367db96d56Sopenharmony_ci single: > (greater); in struct format strings 1377db96d56Sopenharmony_ci single: ! (exclamation); in struct format strings 1387db96d56Sopenharmony_ci 1397db96d56Sopenharmony_ciAlternatively, the first character of the format string can be used to indicate 1407db96d56Sopenharmony_cithe byte order, size and alignment of the packed data, according to the 1417db96d56Sopenharmony_cifollowing table: 1427db96d56Sopenharmony_ci 1437db96d56Sopenharmony_ci+-----------+------------------------+----------+-----------+ 1447db96d56Sopenharmony_ci| Character | Byte order | Size | Alignment | 1457db96d56Sopenharmony_ci+===========+========================+==========+===========+ 1467db96d56Sopenharmony_ci| ``@`` | native | native | native | 1477db96d56Sopenharmony_ci+-----------+------------------------+----------+-----------+ 1487db96d56Sopenharmony_ci| ``=`` | native | standard | none | 1497db96d56Sopenharmony_ci+-----------+------------------------+----------+-----------+ 1507db96d56Sopenharmony_ci| ``<`` | little-endian | standard | none | 1517db96d56Sopenharmony_ci+-----------+------------------------+----------+-----------+ 1527db96d56Sopenharmony_ci| ``>`` | big-endian | standard | none | 1537db96d56Sopenharmony_ci+-----------+------------------------+----------+-----------+ 1547db96d56Sopenharmony_ci| ``!`` | network (= big-endian) | standard | none | 1557db96d56Sopenharmony_ci+-----------+------------------------+----------+-----------+ 1567db96d56Sopenharmony_ci 1577db96d56Sopenharmony_ciIf the first character is not one of these, ``'@'`` is assumed. 1587db96d56Sopenharmony_ci 1597db96d56Sopenharmony_ciNative byte order is big-endian or little-endian, depending on the 1607db96d56Sopenharmony_cihost system. For example, Intel x86, AMD64 (x86-64), and Apple M1 are 1617db96d56Sopenharmony_cilittle-endian; IBM z and many legacy architectures are big-endian. 1627db96d56Sopenharmony_ciUse :data:`sys.byteorder` to check the endianness of your system. 1637db96d56Sopenharmony_ci 1647db96d56Sopenharmony_ciNative size and alignment are determined using the C compiler's 1657db96d56Sopenharmony_ci``sizeof`` expression. This is always combined with native byte order. 1667db96d56Sopenharmony_ci 1677db96d56Sopenharmony_ciStandard size depends only on the format character; see the table in 1687db96d56Sopenharmony_cithe :ref:`format-characters` section. 1697db96d56Sopenharmony_ci 1707db96d56Sopenharmony_ciNote the difference between ``'@'`` and ``'='``: both use native byte order, but 1717db96d56Sopenharmony_cithe size and alignment of the latter is standardized. 1727db96d56Sopenharmony_ci 1737db96d56Sopenharmony_ciThe form ``'!'`` represents the network byte order which is always big-endian 1747db96d56Sopenharmony_cias defined in `IETF RFC 1700 <IETF RFC 1700_>`_. 1757db96d56Sopenharmony_ci 1767db96d56Sopenharmony_ciThere is no way to indicate non-native byte order (force byte-swapping); use the 1777db96d56Sopenharmony_ciappropriate choice of ``'<'`` or ``'>'``. 1787db96d56Sopenharmony_ci 1797db96d56Sopenharmony_ciNotes: 1807db96d56Sopenharmony_ci 1817db96d56Sopenharmony_ci(1) Padding is only automatically added between successive structure members. 1827db96d56Sopenharmony_ci No padding is added at the beginning or the end of the encoded struct. 1837db96d56Sopenharmony_ci 1847db96d56Sopenharmony_ci(2) No padding is added when using non-native size and alignment, e.g. 1857db96d56Sopenharmony_ci with '<', '>', '=', and '!'. 1867db96d56Sopenharmony_ci 1877db96d56Sopenharmony_ci(3) To align the end of a structure to the alignment requirement of a 1887db96d56Sopenharmony_ci particular type, end the format with the code for that type with a repeat 1897db96d56Sopenharmony_ci count of zero. See :ref:`struct-examples`. 1907db96d56Sopenharmony_ci 1917db96d56Sopenharmony_ci 1927db96d56Sopenharmony_ci.. _format-characters: 1937db96d56Sopenharmony_ci 1947db96d56Sopenharmony_ciFormat Characters 1957db96d56Sopenharmony_ci^^^^^^^^^^^^^^^^^ 1967db96d56Sopenharmony_ci 1977db96d56Sopenharmony_ciFormat characters have the following meaning; the conversion between C and 1987db96d56Sopenharmony_ciPython values should be obvious given their types. The 'Standard size' column 1997db96d56Sopenharmony_cirefers to the size of the packed value in bytes when using standard size; that 2007db96d56Sopenharmony_ciis, when the format string starts with one of ``'<'``, ``'>'``, ``'!'`` or 2017db96d56Sopenharmony_ci``'='``. When using native size, the size of the packed value is 2027db96d56Sopenharmony_ciplatform-dependent. 2037db96d56Sopenharmony_ci 2047db96d56Sopenharmony_ci+--------+--------------------------+--------------------+----------------+------------+ 2057db96d56Sopenharmony_ci| Format | C Type | Python type | Standard size | Notes | 2067db96d56Sopenharmony_ci+========+==========================+====================+================+============+ 2077db96d56Sopenharmony_ci| ``x`` | pad byte | no value | | \(7) | 2087db96d56Sopenharmony_ci+--------+--------------------------+--------------------+----------------+------------+ 2097db96d56Sopenharmony_ci| ``c`` | :c:expr:`char` | bytes of length 1 | 1 | | 2107db96d56Sopenharmony_ci+--------+--------------------------+--------------------+----------------+------------+ 2117db96d56Sopenharmony_ci| ``b`` | :c:expr:`signed char` | integer | 1 | \(1), \(2) | 2127db96d56Sopenharmony_ci+--------+--------------------------+--------------------+----------------+------------+ 2137db96d56Sopenharmony_ci| ``B`` | :c:expr:`unsigned char` | integer | 1 | \(2) | 2147db96d56Sopenharmony_ci+--------+--------------------------+--------------------+----------------+------------+ 2157db96d56Sopenharmony_ci| ``?`` | :c:expr:`_Bool` | bool | 1 | \(1) | 2167db96d56Sopenharmony_ci+--------+--------------------------+--------------------+----------------+------------+ 2177db96d56Sopenharmony_ci| ``h`` | :c:expr:`short` | integer | 2 | \(2) | 2187db96d56Sopenharmony_ci+--------+--------------------------+--------------------+----------------+------------+ 2197db96d56Sopenharmony_ci| ``H`` | :c:expr:`unsigned short` | integer | 2 | \(2) | 2207db96d56Sopenharmony_ci+--------+--------------------------+--------------------+----------------+------------+ 2217db96d56Sopenharmony_ci| ``i`` | :c:expr:`int` | integer | 4 | \(2) | 2227db96d56Sopenharmony_ci+--------+--------------------------+--------------------+----------------+------------+ 2237db96d56Sopenharmony_ci| ``I`` | :c:expr:`unsigned int` | integer | 4 | \(2) | 2247db96d56Sopenharmony_ci+--------+--------------------------+--------------------+----------------+------------+ 2257db96d56Sopenharmony_ci| ``l`` | :c:expr:`long` | integer | 4 | \(2) | 2267db96d56Sopenharmony_ci+--------+--------------------------+--------------------+----------------+------------+ 2277db96d56Sopenharmony_ci| ``L`` | :c:expr:`unsigned long` | integer | 4 | \(2) | 2287db96d56Sopenharmony_ci+--------+--------------------------+--------------------+----------------+------------+ 2297db96d56Sopenharmony_ci| ``q`` | :c:expr:`long long` | integer | 8 | \(2) | 2307db96d56Sopenharmony_ci+--------+--------------------------+--------------------+----------------+------------+ 2317db96d56Sopenharmony_ci| ``Q`` | :c:expr:`unsigned long | integer | 8 | \(2) | 2327db96d56Sopenharmony_ci| | long` | | | | 2337db96d56Sopenharmony_ci+--------+--------------------------+--------------------+----------------+------------+ 2347db96d56Sopenharmony_ci| ``n`` | :c:expr:`ssize_t` | integer | | \(3) | 2357db96d56Sopenharmony_ci+--------+--------------------------+--------------------+----------------+------------+ 2367db96d56Sopenharmony_ci| ``N`` | :c:expr:`size_t` | integer | | \(3) | 2377db96d56Sopenharmony_ci+--------+--------------------------+--------------------+----------------+------------+ 2387db96d56Sopenharmony_ci| ``e`` | \(6) | float | 2 | \(4) | 2397db96d56Sopenharmony_ci+--------+--------------------------+--------------------+----------------+------------+ 2407db96d56Sopenharmony_ci| ``f`` | :c:expr:`float` | float | 4 | \(4) | 2417db96d56Sopenharmony_ci+--------+--------------------------+--------------------+----------------+------------+ 2427db96d56Sopenharmony_ci| ``d`` | :c:expr:`double` | float | 8 | \(4) | 2437db96d56Sopenharmony_ci+--------+--------------------------+--------------------+----------------+------------+ 2447db96d56Sopenharmony_ci| ``s`` | :c:expr:`char[]` | bytes | | \(9) | 2457db96d56Sopenharmony_ci+--------+--------------------------+--------------------+----------------+------------+ 2467db96d56Sopenharmony_ci| ``p`` | :c:expr:`char[]` | bytes | | \(8) | 2477db96d56Sopenharmony_ci+--------+--------------------------+--------------------+----------------+------------+ 2487db96d56Sopenharmony_ci| ``P`` | :c:expr:`void \*` | integer | | \(5) | 2497db96d56Sopenharmony_ci+--------+--------------------------+--------------------+----------------+------------+ 2507db96d56Sopenharmony_ci 2517db96d56Sopenharmony_ci.. versionchanged:: 3.3 2527db96d56Sopenharmony_ci Added support for the ``'n'`` and ``'N'`` formats. 2537db96d56Sopenharmony_ci 2547db96d56Sopenharmony_ci.. versionchanged:: 3.6 2557db96d56Sopenharmony_ci Added support for the ``'e'`` format. 2567db96d56Sopenharmony_ci 2577db96d56Sopenharmony_ci 2587db96d56Sopenharmony_ciNotes: 2597db96d56Sopenharmony_ci 2607db96d56Sopenharmony_ci(1) 2617db96d56Sopenharmony_ci .. index:: single: ? (question mark); in struct format strings 2627db96d56Sopenharmony_ci 2637db96d56Sopenharmony_ci The ``'?'`` conversion code corresponds to the :c:expr:`_Bool` type defined by 2647db96d56Sopenharmony_ci C99. If this type is not available, it is simulated using a :c:expr:`char`. In 2657db96d56Sopenharmony_ci standard mode, it is always represented by one byte. 2667db96d56Sopenharmony_ci 2677db96d56Sopenharmony_ci(2) 2687db96d56Sopenharmony_ci When attempting to pack a non-integer using any of the integer conversion 2697db96d56Sopenharmony_ci codes, if the non-integer has a :meth:`__index__` method then that method is 2707db96d56Sopenharmony_ci called to convert the argument to an integer before packing. 2717db96d56Sopenharmony_ci 2727db96d56Sopenharmony_ci .. versionchanged:: 3.2 2737db96d56Sopenharmony_ci Added use of the :meth:`__index__` method for non-integers. 2747db96d56Sopenharmony_ci 2757db96d56Sopenharmony_ci(3) 2767db96d56Sopenharmony_ci The ``'n'`` and ``'N'`` conversion codes are only available for the native 2777db96d56Sopenharmony_ci size (selected as the default or with the ``'@'`` byte order character). 2787db96d56Sopenharmony_ci For the standard size, you can use whichever of the other integer formats 2797db96d56Sopenharmony_ci fits your application. 2807db96d56Sopenharmony_ci 2817db96d56Sopenharmony_ci(4) 2827db96d56Sopenharmony_ci For the ``'f'``, ``'d'`` and ``'e'`` conversion codes, the packed 2837db96d56Sopenharmony_ci representation uses the IEEE 754 binary32, binary64 or binary16 format (for 2847db96d56Sopenharmony_ci ``'f'``, ``'d'`` or ``'e'`` respectively), regardless of the floating-point 2857db96d56Sopenharmony_ci format used by the platform. 2867db96d56Sopenharmony_ci 2877db96d56Sopenharmony_ci(5) 2887db96d56Sopenharmony_ci The ``'P'`` format character is only available for the native byte ordering 2897db96d56Sopenharmony_ci (selected as the default or with the ``'@'`` byte order character). The byte 2907db96d56Sopenharmony_ci order character ``'='`` chooses to use little- or big-endian ordering based 2917db96d56Sopenharmony_ci on the host system. The struct module does not interpret this as native 2927db96d56Sopenharmony_ci ordering, so the ``'P'`` format is not available. 2937db96d56Sopenharmony_ci 2947db96d56Sopenharmony_ci(6) 2957db96d56Sopenharmony_ci The IEEE 754 binary16 "half precision" type was introduced in the 2008 2967db96d56Sopenharmony_ci revision of the `IEEE 754 standard <ieee 754 standard_>`_. It has a sign 2977db96d56Sopenharmony_ci bit, a 5-bit exponent and 11-bit precision (with 10 bits explicitly stored), 2987db96d56Sopenharmony_ci and can represent numbers between approximately ``6.1e-05`` and ``6.5e+04`` 2997db96d56Sopenharmony_ci at full precision. This type is not widely supported by C compilers: on a 3007db96d56Sopenharmony_ci typical machine, an unsigned short can be used for storage, but not for math 3017db96d56Sopenharmony_ci operations. See the Wikipedia page on the `half-precision floating-point 3027db96d56Sopenharmony_ci format <half precision format_>`_ for more information. 3037db96d56Sopenharmony_ci 3047db96d56Sopenharmony_ci(7) 3057db96d56Sopenharmony_ci When packing, ``'x'`` inserts one NUL byte. 3067db96d56Sopenharmony_ci 3077db96d56Sopenharmony_ci(8) 3087db96d56Sopenharmony_ci The ``'p'`` format character encodes a "Pascal string", meaning a short 3097db96d56Sopenharmony_ci variable-length string stored in a *fixed number of bytes*, given by the count. 3107db96d56Sopenharmony_ci The first byte stored is the length of the string, or 255, whichever is 3117db96d56Sopenharmony_ci smaller. The bytes of the string follow. If the string passed in to 3127db96d56Sopenharmony_ci :func:`pack` is too long (longer than the count minus 1), only the leading 3137db96d56Sopenharmony_ci ``count-1`` bytes of the string are stored. If the string is shorter than 3147db96d56Sopenharmony_ci ``count-1``, it is padded with null bytes so that exactly count bytes in all 3157db96d56Sopenharmony_ci are used. Note that for :func:`unpack`, the ``'p'`` format character consumes 3167db96d56Sopenharmony_ci ``count`` bytes, but that the string returned can never contain more than 255 3177db96d56Sopenharmony_ci bytes. 3187db96d56Sopenharmony_ci 3197db96d56Sopenharmony_ci(9) 3207db96d56Sopenharmony_ci For the ``'s'`` format character, the count is interpreted as the length of the 3217db96d56Sopenharmony_ci bytes, not a repeat count like for the other format characters; for example, 3227db96d56Sopenharmony_ci ``'10s'`` means a single 10-byte string mapping to or from a single 3237db96d56Sopenharmony_ci Python byte string, while ``'10c'`` means 10 3247db96d56Sopenharmony_ci separate one byte character elements (e.g., ``cccccccccc``) mapping 3257db96d56Sopenharmony_ci to or from ten different Python byte objects. (See :ref:`struct-examples` 3267db96d56Sopenharmony_ci for a concrete demonstration of the difference.) 3277db96d56Sopenharmony_ci If a count is not given, it defaults to 1. For packing, the string is 3287db96d56Sopenharmony_ci truncated or padded with null bytes as appropriate to make it fit. For 3297db96d56Sopenharmony_ci unpacking, the resulting bytes object always has exactly the specified number 3307db96d56Sopenharmony_ci of bytes. As a special case, ``'0s'`` means a single, empty string (while 3317db96d56Sopenharmony_ci ``'0c'`` means 0 characters). 3327db96d56Sopenharmony_ci 3337db96d56Sopenharmony_ciA format character may be preceded by an integral repeat count. For example, 3347db96d56Sopenharmony_cithe format string ``'4h'`` means exactly the same as ``'hhhh'``. 3357db96d56Sopenharmony_ci 3367db96d56Sopenharmony_ciWhitespace characters between formats are ignored; a count and its format must 3377db96d56Sopenharmony_cinot contain whitespace though. 3387db96d56Sopenharmony_ci 3397db96d56Sopenharmony_ciWhen packing a value ``x`` using one of the integer formats (``'b'``, 3407db96d56Sopenharmony_ci``'B'``, ``'h'``, ``'H'``, ``'i'``, ``'I'``, ``'l'``, ``'L'``, 3417db96d56Sopenharmony_ci``'q'``, ``'Q'``), if ``x`` is outside the valid range for that format 3427db96d56Sopenharmony_cithen :exc:`struct.error` is raised. 3437db96d56Sopenharmony_ci 3447db96d56Sopenharmony_ci.. versionchanged:: 3.1 3457db96d56Sopenharmony_ci Previously, some of the integer formats wrapped out-of-range values and 3467db96d56Sopenharmony_ci raised :exc:`DeprecationWarning` instead of :exc:`struct.error`. 3477db96d56Sopenharmony_ci 3487db96d56Sopenharmony_ci.. index:: single: ? (question mark); in struct format strings 3497db96d56Sopenharmony_ci 3507db96d56Sopenharmony_ciFor the ``'?'`` format character, the return value is either :const:`True` or 3517db96d56Sopenharmony_ci:const:`False`. When packing, the truth value of the argument object is used. 3527db96d56Sopenharmony_ciEither 0 or 1 in the native or standard bool representation will be packed, and 3537db96d56Sopenharmony_ciany non-zero value will be ``True`` when unpacking. 3547db96d56Sopenharmony_ci 3557db96d56Sopenharmony_ci 3567db96d56Sopenharmony_ci 3577db96d56Sopenharmony_ci.. _struct-examples: 3587db96d56Sopenharmony_ci 3597db96d56Sopenharmony_ciExamples 3607db96d56Sopenharmony_ci^^^^^^^^ 3617db96d56Sopenharmony_ci 3627db96d56Sopenharmony_ci.. note:: 3637db96d56Sopenharmony_ci Native byte order examples (designated by the ``'@'`` format prefix or 3647db96d56Sopenharmony_ci lack of any prefix character) may not match what the reader's 3657db96d56Sopenharmony_ci machine produces as 3667db96d56Sopenharmony_ci that depends on the platform and compiler. 3677db96d56Sopenharmony_ci 3687db96d56Sopenharmony_ciPack and unpack integers of three different sizes, using big endian 3697db96d56Sopenharmony_ciordering:: 3707db96d56Sopenharmony_ci 3717db96d56Sopenharmony_ci >>> from struct import * 3727db96d56Sopenharmony_ci >>> pack(">bhl", 1, 2, 3) 3737db96d56Sopenharmony_ci b'\x01\x00\x02\x00\x00\x00\x03' 3747db96d56Sopenharmony_ci >>> unpack('>bhl', b'\x01\x00\x02\x00\x00\x00\x03') 3757db96d56Sopenharmony_ci (1, 2, 3) 3767db96d56Sopenharmony_ci >>> calcsize('>bhl') 3777db96d56Sopenharmony_ci 7 3787db96d56Sopenharmony_ci 3797db96d56Sopenharmony_ciAttempt to pack an integer which is too large for the defined field:: 3807db96d56Sopenharmony_ci 3817db96d56Sopenharmony_ci >>> pack(">h", 99999) 3827db96d56Sopenharmony_ci Traceback (most recent call last): 3837db96d56Sopenharmony_ci File "<stdin>", line 1, in <module> 3847db96d56Sopenharmony_ci struct.error: 'h' format requires -32768 <= number <= 32767 3857db96d56Sopenharmony_ci 3867db96d56Sopenharmony_ciDemonstrate the difference between ``'s'`` and ``'c'`` format 3877db96d56Sopenharmony_cicharacters:: 3887db96d56Sopenharmony_ci 3897db96d56Sopenharmony_ci >>> pack("@ccc", b'1', b'2', b'3') 3907db96d56Sopenharmony_ci b'123' 3917db96d56Sopenharmony_ci >>> pack("@3s", b'123') 3927db96d56Sopenharmony_ci b'123' 3937db96d56Sopenharmony_ci 3947db96d56Sopenharmony_ciUnpacked fields can be named by assigning them to variables or by wrapping 3957db96d56Sopenharmony_cithe result in a named tuple:: 3967db96d56Sopenharmony_ci 3977db96d56Sopenharmony_ci >>> record = b'raymond \x32\x12\x08\x01\x08' 3987db96d56Sopenharmony_ci >>> name, serialnum, school, gradelevel = unpack('<10sHHb', record) 3997db96d56Sopenharmony_ci 4007db96d56Sopenharmony_ci >>> from collections import namedtuple 4017db96d56Sopenharmony_ci >>> Student = namedtuple('Student', 'name serialnum school gradelevel') 4027db96d56Sopenharmony_ci >>> Student._make(unpack('<10sHHb', record)) 4037db96d56Sopenharmony_ci Student(name=b'raymond ', serialnum=4658, school=264, gradelevel=8) 4047db96d56Sopenharmony_ci 4057db96d56Sopenharmony_ciThe ordering of format characters may have an impact on size in native 4067db96d56Sopenharmony_cimode since padding is implicit. In standard mode, the user is 4077db96d56Sopenharmony_ciresponsible for inserting any desired padding. 4087db96d56Sopenharmony_ciNote in 4097db96d56Sopenharmony_cithe first ``pack`` call below that three NUL bytes were added after the 4107db96d56Sopenharmony_cipacked ``'#'`` to align the following integer on a four-byte boundary. 4117db96d56Sopenharmony_ciIn this example, the output was produced on a little endian machine:: 4127db96d56Sopenharmony_ci 4137db96d56Sopenharmony_ci >>> pack('@ci', b'#', 0x12131415) 4147db96d56Sopenharmony_ci b'#\x00\x00\x00\x15\x14\x13\x12' 4157db96d56Sopenharmony_ci >>> pack('@ic', 0x12131415, b'#') 4167db96d56Sopenharmony_ci b'\x15\x14\x13\x12#' 4177db96d56Sopenharmony_ci >>> calcsize('@ci') 4187db96d56Sopenharmony_ci 8 4197db96d56Sopenharmony_ci >>> calcsize('@ic') 4207db96d56Sopenharmony_ci 5 4217db96d56Sopenharmony_ci 4227db96d56Sopenharmony_ciThe following format ``'llh0l'`` results in two pad bytes being added 4237db96d56Sopenharmony_ciat the end, assuming the platform's longs are aligned on 4-byte boundaries:: 4247db96d56Sopenharmony_ci 4257db96d56Sopenharmony_ci >>> pack('@llh0l', 1, 2, 3) 4267db96d56Sopenharmony_ci b'\x00\x00\x00\x01\x00\x00\x00\x02\x00\x03\x00\x00' 4277db96d56Sopenharmony_ci 4287db96d56Sopenharmony_ci 4297db96d56Sopenharmony_ci.. seealso:: 4307db96d56Sopenharmony_ci 4317db96d56Sopenharmony_ci Module :mod:`array` 4327db96d56Sopenharmony_ci Packed binary storage of homogeneous data. 4337db96d56Sopenharmony_ci 4347db96d56Sopenharmony_ci Module :mod:`json` 4357db96d56Sopenharmony_ci JSON encoder and decoder. 4367db96d56Sopenharmony_ci 4377db96d56Sopenharmony_ci Module :mod:`pickle` 4387db96d56Sopenharmony_ci Python object serialization. 4397db96d56Sopenharmony_ci 4407db96d56Sopenharmony_ci 4417db96d56Sopenharmony_ci.. _applications: 4427db96d56Sopenharmony_ci 4437db96d56Sopenharmony_ciApplications 4447db96d56Sopenharmony_ci------------ 4457db96d56Sopenharmony_ci 4467db96d56Sopenharmony_ciTwo main applications for the :mod:`struct` module exist, data 4477db96d56Sopenharmony_ciinterchange between Python and C code within an application or another 4487db96d56Sopenharmony_ciapplication compiled using the same compiler (:ref:`native formats<struct-native-formats>`), and 4497db96d56Sopenharmony_cidata interchange between applications using agreed upon data layout 4507db96d56Sopenharmony_ci(:ref:`standard formats<struct-standard-formats>`). Generally speaking, the format strings 4517db96d56Sopenharmony_ciconstructed for these two domains are distinct. 4527db96d56Sopenharmony_ci 4537db96d56Sopenharmony_ci 4547db96d56Sopenharmony_ci.. _struct-native-formats: 4557db96d56Sopenharmony_ci 4567db96d56Sopenharmony_ciNative Formats 4577db96d56Sopenharmony_ci^^^^^^^^^^^^^^ 4587db96d56Sopenharmony_ci 4597db96d56Sopenharmony_ciWhen constructing format strings which mimic native layouts, the 4607db96d56Sopenharmony_cicompiler and machine architecture determine byte ordering and padding. 4617db96d56Sopenharmony_ciIn such cases, the ``@`` format character should be used to specify 4627db96d56Sopenharmony_cinative byte ordering and data sizes. Internal pad bytes are normally inserted 4637db96d56Sopenharmony_ciautomatically. It is possible that a zero-repeat format code will be 4647db96d56Sopenharmony_cineeded at the end of a format string to round up to the correct 4657db96d56Sopenharmony_cibyte boundary for proper alignment of consective chunks of data. 4667db96d56Sopenharmony_ci 4677db96d56Sopenharmony_ciConsider these two simple examples (on a 64-bit, little-endian 4687db96d56Sopenharmony_cimachine):: 4697db96d56Sopenharmony_ci 4707db96d56Sopenharmony_ci >>> calcsize('@lhl') 4717db96d56Sopenharmony_ci 24 4727db96d56Sopenharmony_ci >>> calcsize('@llh') 4737db96d56Sopenharmony_ci 18 4747db96d56Sopenharmony_ci 4757db96d56Sopenharmony_ciData is not padded to an 8-byte boundary at the end of the second 4767db96d56Sopenharmony_ciformat string without the use of extra padding. A zero-repeat format 4777db96d56Sopenharmony_cicode solves that problem:: 4787db96d56Sopenharmony_ci 4797db96d56Sopenharmony_ci >>> calcsize('@llh0l') 4807db96d56Sopenharmony_ci 24 4817db96d56Sopenharmony_ci 4827db96d56Sopenharmony_ciThe ``'x'`` format code can be used to specify the repeat, but for 4837db96d56Sopenharmony_cinative formats it is better to use a zero-repeat format like ``'0l'``. 4847db96d56Sopenharmony_ci 4857db96d56Sopenharmony_ciBy default, native byte ordering and alignment is used, but it is 4867db96d56Sopenharmony_cibetter to be explicit and use the ``'@'`` prefix character. 4877db96d56Sopenharmony_ci 4887db96d56Sopenharmony_ci 4897db96d56Sopenharmony_ci.. _struct-standard-formats: 4907db96d56Sopenharmony_ci 4917db96d56Sopenharmony_ciStandard Formats 4927db96d56Sopenharmony_ci^^^^^^^^^^^^^^^^ 4937db96d56Sopenharmony_ci 4947db96d56Sopenharmony_ciWhen exchanging data beyond your process such as networking or storage, 4957db96d56Sopenharmony_cibe precise. Specify the exact byte order, size, and alignment. Do 4967db96d56Sopenharmony_cinot assume they match the native order of a particular machine. 4977db96d56Sopenharmony_ciFor example, network byte order is big-endian, while many popular CPUs 4987db96d56Sopenharmony_ciare little-endian. By defining this explicitly, the user need not 4997db96d56Sopenharmony_cicare about the specifics of the platform their code is running on. 5007db96d56Sopenharmony_ciThe first character should typically be ``<`` or ``>`` 5017db96d56Sopenharmony_ci(or ``!``). Padding is the responsibility of the programmer. The 5027db96d56Sopenharmony_cizero-repeat format character won't work. Instead, the user must 5037db96d56Sopenharmony_ciexplicitly add ``'x'`` pad bytes where needed. Revisiting the 5047db96d56Sopenharmony_ciexamples from the previous section, we have:: 5057db96d56Sopenharmony_ci 5067db96d56Sopenharmony_ci >>> calcsize('<qh6xq') 5077db96d56Sopenharmony_ci 24 5087db96d56Sopenharmony_ci >>> pack('<qh6xq', 1, 2, 3) == pack('@lhl', 1, 2, 3) 5097db96d56Sopenharmony_ci True 5107db96d56Sopenharmony_ci >>> calcsize('@llh') 5117db96d56Sopenharmony_ci 18 5127db96d56Sopenharmony_ci >>> pack('@llh', 1, 2, 3) == pack('<qqh', 1, 2, 3) 5137db96d56Sopenharmony_ci True 5147db96d56Sopenharmony_ci >>> calcsize('<qqh6x') 5157db96d56Sopenharmony_ci 24 5167db96d56Sopenharmony_ci >>> calcsize('@llh0l') 5177db96d56Sopenharmony_ci 24 5187db96d56Sopenharmony_ci >>> pack('@llh0l', 1, 2, 3) == pack('<qqh6x', 1, 2, 3) 5197db96d56Sopenharmony_ci True 5207db96d56Sopenharmony_ci 5217db96d56Sopenharmony_ciThe above results (executed on a 64-bit machine) aren't guaranteed to 5227db96d56Sopenharmony_cimatch when executed on different machines. For example, the examples 5237db96d56Sopenharmony_cibelow were executed on a 32-bit machine:: 5247db96d56Sopenharmony_ci 5257db96d56Sopenharmony_ci >>> calcsize('<qqh6x') 5267db96d56Sopenharmony_ci 24 5277db96d56Sopenharmony_ci >>> calcsize('@llh0l') 5287db96d56Sopenharmony_ci 12 5297db96d56Sopenharmony_ci >>> pack('@llh0l', 1, 2, 3) == pack('<qqh6x', 1, 2, 3) 5307db96d56Sopenharmony_ci False 5317db96d56Sopenharmony_ci 5327db96d56Sopenharmony_ci 5337db96d56Sopenharmony_ci.. _struct-objects: 5347db96d56Sopenharmony_ci 5357db96d56Sopenharmony_ciClasses 5367db96d56Sopenharmony_ci------- 5377db96d56Sopenharmony_ci 5387db96d56Sopenharmony_ciThe :mod:`struct` module also defines the following type: 5397db96d56Sopenharmony_ci 5407db96d56Sopenharmony_ci 5417db96d56Sopenharmony_ci.. class:: Struct(format) 5427db96d56Sopenharmony_ci 5437db96d56Sopenharmony_ci Return a new Struct object which writes and reads binary data according to 5447db96d56Sopenharmony_ci the format string *format*. Creating a ``Struct`` object once and calling its 5457db96d56Sopenharmony_ci methods is more efficient than calling module-level functions with the 5467db96d56Sopenharmony_ci same format since the format string is only compiled once. 5477db96d56Sopenharmony_ci 5487db96d56Sopenharmony_ci .. note:: 5497db96d56Sopenharmony_ci 5507db96d56Sopenharmony_ci The compiled versions of the most recent format strings passed to 5517db96d56Sopenharmony_ci :class:`Struct` and the module-level functions are cached, so programs 5527db96d56Sopenharmony_ci that use only a few format strings needn't worry about reusing a single 5537db96d56Sopenharmony_ci :class:`Struct` instance. 5547db96d56Sopenharmony_ci 5557db96d56Sopenharmony_ci Compiled Struct objects support the following methods and attributes: 5567db96d56Sopenharmony_ci 5577db96d56Sopenharmony_ci .. method:: pack(v1, v2, ...) 5587db96d56Sopenharmony_ci 5597db96d56Sopenharmony_ci Identical to the :func:`pack` function, using the compiled format. 5607db96d56Sopenharmony_ci (``len(result)`` will equal :attr:`size`.) 5617db96d56Sopenharmony_ci 5627db96d56Sopenharmony_ci 5637db96d56Sopenharmony_ci .. method:: pack_into(buffer, offset, v1, v2, ...) 5647db96d56Sopenharmony_ci 5657db96d56Sopenharmony_ci Identical to the :func:`pack_into` function, using the compiled format. 5667db96d56Sopenharmony_ci 5677db96d56Sopenharmony_ci 5687db96d56Sopenharmony_ci .. method:: unpack(buffer) 5697db96d56Sopenharmony_ci 5707db96d56Sopenharmony_ci Identical to the :func:`unpack` function, using the compiled format. 5717db96d56Sopenharmony_ci The buffer's size in bytes must equal :attr:`size`. 5727db96d56Sopenharmony_ci 5737db96d56Sopenharmony_ci 5747db96d56Sopenharmony_ci .. method:: unpack_from(buffer, offset=0) 5757db96d56Sopenharmony_ci 5767db96d56Sopenharmony_ci Identical to the :func:`unpack_from` function, using the compiled format. 5777db96d56Sopenharmony_ci The buffer's size in bytes, starting at position *offset*, must be at least 5787db96d56Sopenharmony_ci :attr:`size`. 5797db96d56Sopenharmony_ci 5807db96d56Sopenharmony_ci 5817db96d56Sopenharmony_ci .. method:: iter_unpack(buffer) 5827db96d56Sopenharmony_ci 5837db96d56Sopenharmony_ci Identical to the :func:`iter_unpack` function, using the compiled format. 5847db96d56Sopenharmony_ci The buffer's size in bytes must be a multiple of :attr:`size`. 5857db96d56Sopenharmony_ci 5867db96d56Sopenharmony_ci .. versionadded:: 3.4 5877db96d56Sopenharmony_ci 5887db96d56Sopenharmony_ci .. attribute:: format 5897db96d56Sopenharmony_ci 5907db96d56Sopenharmony_ci The format string used to construct this Struct object. 5917db96d56Sopenharmony_ci 5927db96d56Sopenharmony_ci .. versionchanged:: 3.7 5937db96d56Sopenharmony_ci The format string type is now :class:`str` instead of :class:`bytes`. 5947db96d56Sopenharmony_ci 5957db96d56Sopenharmony_ci .. attribute:: size 5967db96d56Sopenharmony_ci 5977db96d56Sopenharmony_ci The calculated size of the struct (and hence of the bytes object produced 5987db96d56Sopenharmony_ci by the :meth:`pack` method) corresponding to :attr:`format`. 5997db96d56Sopenharmony_ci 6007db96d56Sopenharmony_ci 6017db96d56Sopenharmony_ci.. _half precision format: https://en.wikipedia.org/wiki/Half-precision_floating-point_format 6027db96d56Sopenharmony_ci 6037db96d56Sopenharmony_ci.. _ieee 754 standard: https://en.wikipedia.org/wiki/IEEE_754-2008_revision 6047db96d56Sopenharmony_ci 6057db96d56Sopenharmony_ci.. _IETF RFC 1700: https://datatracker.ietf.org/doc/html/rfc1700 606