Name

    AMD_gpu_shader_half_float

Name Strings

    GL_AMD_gpu_shader_half_float

Contact

    Qun Lin, AMD (quentin.lin 'at' amd.com)

Contributors

    Qun Lin, AMD
    Daniel Rakos, AMD
    Donglin Wei, AMD
    Graham Sellers, AMD
    Rex Xu, AMD
    Dominik Witczak, AMD

Status

    Shipping.

Version

    Last Modified Date:         09/21/2016
    Author Revision:            5

Number

    OpenGL Extension #496

Dependencies

    This extension is written against the OpenGL 4.5 (Core Profile)
    Specification.

    This extension is written against version 4.50 of the OpenGL Shading
    Language Specification.

    OpenGL 4.0 and GLSL 4.00 are required.

    This extension interacts with ARB_gpu_shader_int64.

    This extension interacts with AMD_shader_trinary_minmax.

    This extension interacts with AMD_shader_explicit_vertex_parameter.

Overview

    This extension was developed based on the NV_gpu_shader5 extension to
    allow implementations that support half float in shaders to expose the
    feature without the additional requirements that are present in
    NV_gpu_shader5.

    The extension introduces the following features for all shader types:

      * support for half float scalar, vector and matrix data types in
        shaders;

      * new built-in functions to pack and unpack half float types into a
        32-bit integer vector;

      * half float support for all existing single float built-in functions,
        including angle functions, exponential functions, common functions,
        geometric functions, matrix functions, etc.

    This extension is designed to be a functional superset of the
    half-precision floating-point support from NV_gpu_shader5 and to remain
    source compatible with it. The procedures, functions, and tokens shared
    with that extension are therefore identical to those found there.


New Procedures and Functions

    None.

New Tokens

    Returned by the <type> parameter of GetActiveAttrib, GetActiveUniform, and
    GetTransformFeedbackVarying:

    (The tokens are identical to those defined in NV_gpu_shader5.)

        FLOAT16_NV                                      0x8FF8
        FLOAT16_VEC2_NV                                 0x8FF9
        FLOAT16_VEC3_NV                                 0x8FFA
        FLOAT16_VEC4_NV                                 0x8FFB

    (New tokens)

        FLOAT16_MAT2_AMD                                0x91C5
        FLOAT16_MAT3_AMD                                0x91C6
        FLOAT16_MAT4_AMD                                0x91C7
        FLOAT16_MAT2x3_AMD                              0x91C8
        FLOAT16_MAT2x4_AMD                              0x91C9
        FLOAT16_MAT3x2_AMD                              0x91CA
        FLOAT16_MAT3x4_AMD                              0x91CB
        FLOAT16_MAT4x2_AMD                              0x91CC
        FLOAT16_MAT4x3_AMD                              0x91CD


Additions to Chapter 7 of the OpenGL 4.5 (Core Profile) Specification
(Program Objects)

    Modify Section 7.3.1, Program Interfaces

    (add to Table 7.3, OpenGL Shading Language type tokens, p. 108)

    +----------------------------+----------------+------+------+------+
    | Type Name Token            | Keyword        |Attrib| Xfb  |Buffer|
    +----------------------------+----------------+------+------+------+
    | FLOAT16_NV                 | float16_t      |  *   |  *   |  *   |
    | FLOAT16_VEC2_NV            | f16vec2        |  *   |  *   |  *   |
    | FLOAT16_VEC3_NV            | f16vec3        |  *   |  *   |  *   |
    | FLOAT16_VEC4_NV            | f16vec4        |  *   |  *   |  *   |
    | FLOAT16_MAT2_AMD           | f16mat2        |  *   |  *   |  *   |
    | FLOAT16_MAT3_AMD           | f16mat3        |  *   |  *   |  *   |
    | FLOAT16_MAT4_AMD           | f16mat4        |  *   |  *   |  *   |
    | FLOAT16_MAT2x3_AMD         | f16mat2x3      |  *   |  *   |  *   |
    | FLOAT16_MAT2x4_AMD         | f16mat2x4      |  *   |  *   |  *   |
    | FLOAT16_MAT3x2_AMD         | f16mat3x2      |  *   |  *   |  *   |
    | FLOAT16_MAT3x4_AMD         | f16mat3x4      |  *   |  *   |  *   |
    | FLOAT16_MAT4x2_AMD         | f16mat4x2      |  *   |  *   |  *   |
    | FLOAT16_MAT4x3_AMD         | f16mat4x3      |  *   |  *   |  *   |
    +----------------------------+----------------+------+------+------+
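
    As an illustration of the table above (not part of the extension), the
    following C sketch maps a type token to its GLSL keyword. The token
    values are copied from the New Tokens section; the GL_ prefix on the
    macro names and the helper f16_type_keyword are assumptions made for
    this example.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Token values from the New Tokens section. The GL_ prefix is assumed,
   following the usual C header convention. */
#define GL_FLOAT16_NV          0x8FF8
#define GL_FLOAT16_VEC2_NV     0x8FF9
#define GL_FLOAT16_VEC3_NV     0x8FFA
#define GL_FLOAT16_VEC4_NV     0x8FFB
#define GL_FLOAT16_MAT2_AMD    0x91C5
#define GL_FLOAT16_MAT3_AMD    0x91C6
#define GL_FLOAT16_MAT4_AMD    0x91C7
#define GL_FLOAT16_MAT2x3_AMD  0x91C8
#define GL_FLOAT16_MAT2x4_AMD  0x91C9
#define GL_FLOAT16_MAT3x2_AMD  0x91CA
#define GL_FLOAT16_MAT3x4_AMD  0x91CB
#define GL_FLOAT16_MAT4x2_AMD  0x91CC
#define GL_FLOAT16_MAT4x3_AMD  0x91CD

struct f16_type_entry {
    unsigned token;
    const char *keyword;
};

static const struct f16_type_entry f16_types[] = {
    { GL_FLOAT16_NV,         "float16_t" },
    { GL_FLOAT16_VEC2_NV,    "f16vec2"   },
    { GL_FLOAT16_VEC3_NV,    "f16vec3"   },
    { GL_FLOAT16_VEC4_NV,    "f16vec4"   },
    { GL_FLOAT16_MAT2_AMD,   "f16mat2"   },
    { GL_FLOAT16_MAT3_AMD,   "f16mat3"   },
    { GL_FLOAT16_MAT4_AMD,   "f16mat4"   },
    { GL_FLOAT16_MAT2x3_AMD, "f16mat2x3" },
    { GL_FLOAT16_MAT2x4_AMD, "f16mat2x4" },
    { GL_FLOAT16_MAT3x2_AMD, "f16mat3x2" },
    { GL_FLOAT16_MAT3x4_AMD, "f16mat3x4" },
    { GL_FLOAT16_MAT4x2_AMD, "f16mat4x2" },
    { GL_FLOAT16_MAT4x3_AMD, "f16mat4x3" },
};

/* Hypothetical helper: returns the GLSL keyword for a half float type
   token, or NULL if the token is not one of the types listed above. */
static const char *f16_type_keyword(unsigned token)
{
    for (size_t i = 0; i < sizeof f16_types / sizeof f16_types[0]; ++i) {
        if (f16_types[i].token == token)
            return f16_types[i].keyword;
    }
    return NULL;
}
```

    An application could call such a helper on the <type> value returned by
    GetActiveUniform to label half float uniforms in a debugging tool.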


    Modify Section 7.6.1, Loading Uniform Variables

    (modify the last paragraph on p. 132)

    The Uniform*f{v} commands will load count sets of one to four
    floating-point values into a uniform defined as a float, a half float, a
    floating-point vector, a half-precision floating-point vector or an array
    of any of these types. Floating-point values are converted to half float
    by the GL for uniforms defined as a half float, a half float vector or an
    array of those.
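
    The paragraph above does not say how the GL rounds during this
    conversion. The following C sketch shows one plausible float to half
    float conversion, assuming round-to-nearest-even and overflow to
    infinity; both choices, and the helper name float_to_half_bits, are
    assumptions of this example rather than requirements stated here.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical helper: converts a 32-bit float to the bit pattern of a
   16-bit half float (1 sign, 5 exponent, 10 mantissa bits), rounding to
   nearest with ties to even. */
static uint16_t float_to_half_bits(float value)
{
    uint32_t bits;
    memcpy(&bits, &value, sizeof bits);

    uint16_t sign = (uint16_t)((bits >> 16) & 0x8000u);
    uint32_t exponent = (bits >> 23) & 0xFFu;
    uint32_t mantissa = bits & 0x007FFFFFu;

    if (exponent == 0xFFu) {
        /* Infinity or NaN: keep NaN-ness by setting a mantissa bit. */
        return (uint16_t)(sign | 0x7C00u | (mantissa ? 0x0200u : 0u));
    }

    int32_t half_exp = (int32_t)exponent - 127 + 15;

    if (half_exp >= 31) {
        return (uint16_t)(sign | 0x7C00u);       /* too large: infinity */
    }

    if (half_exp <= 0) {
        if (half_exp < -10)
            return sign;                         /* too small: signed zero */

        /* Result is a subnormal half float. */
        mantissa |= 0x00800000u;                 /* restore implicit 1 */
        uint32_t shift = (uint32_t)(14 - half_exp);
        uint32_t half_man = mantissa >> shift;
        uint32_t remainder = mantissa & ((1u << shift) - 1u);
        uint32_t halfway = 1u << (shift - 1u);
        if (remainder > halfway || (remainder == halfway && (half_man & 1u)))
            half_man++;
        return (uint16_t)(sign | half_man);
    }

    /* Normal half float; a rounding carry may bump the exponent. */
    uint32_t half = ((uint32_t)half_exp << 10) | (mantissa >> 13);
    uint32_t remainder = mantissa & 0x1FFFu;
    if (remainder > 0x1000u || (remainder == 0x1000u && (half & 1u)))
        half++;
    return (uint16_t)(sign | half);
}
```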


    Modify Section 7.6.2.1, Uniform Buffer Object Storage

    (modify the first two bullets of the first paragraph on p. 136)

    * Members of type bool, int, uint, float, float16_t and double are respectively
      extracted from a buffer object by reading a single uint, int, uint, float,
      half float or double value at the specified offset.

    * Vectors with N elements with basic data types of bool, int, uint, float,
      float16_t or double are extracted as N values in consecutive memory locations
      beginning at the specified offset, with components stored in order with the
      first (X) component at the lowest offset. The GL data type used for component
      extraction is derived according to the rules for scalar members above.
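
    To make the vector rule concrete, the following C sketch reads an
    N-component half float vector from a byte buffer the way the bullets
    describe: N consecutive half float values starting at the given offset,
    X component first. It assumes each half float occupies 16 bits in host
    byte order; the helper names half_bits_to_float and read_f16_vector are
    hypothetical.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical helper: widens a 16-bit half float bit pattern to float. */
static float half_bits_to_float(uint16_t h)
{
    uint32_t sign = (uint32_t)(h & 0x8000u) << 16;
    uint32_t exponent = (h >> 10) & 0x1Fu;
    uint32_t mantissa = h & 0x03FFu;
    uint32_t bits;

    if (exponent == 0x1Fu) {
        bits = sign | 0x7F800000u | (mantissa << 13);   /* Inf or NaN */
    } else if (exponent != 0) {
        bits = sign | ((exponent + 112u) << 23) | (mantissa << 13);
    } else if (mantissa == 0) {
        bits = sign;                                    /* signed zero */
    } else {
        /* Subnormal half: normalize into a normal float. */
        uint32_t e = 113u;
        while ((mantissa & 0x0400u) == 0) {
            mantissa <<= 1;
            e--;
        }
        bits = sign | (e << 23) | ((mantissa & 0x03FFu) << 13);
    }

    float result;
    memcpy(&result, &bits, sizeof result);
    return result;
}

/* Hypothetical helper: extracts n half float components stored in
   consecutive memory locations starting at `offset`. */
static void read_f16_vector(const unsigned char *buffer, size_t offset,
                            size_t n, float *out)
{
    assert(n >= 1 && n <= 4);
    for (size_t i = 0; i < n; ++i) {
        uint16_t h;
        memcpy(&h, buffer + offset + i * sizeof h, sizeof h);
        out[i] = half_bits_to_float(h);
    }
}
```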


Additions to Chapter 11 of the OpenGL 4.5 (Core Profile) Specification
(Programmable Vertex Processing)

    Modify Section 11.1.1, Vertex Attributes

    (modify Table 11.2, Generic attributes and vector types used by column vectors of
    matrix variables bound to generic attribute index i, p. 366)

    +------------------------------+-------------------------+-----------------------+
    |          Data type           |   Column vector type    |Generic attributes used|
    +------------------------------+-------------------------+-----------------------+
    | mat2, dmat2, f16mat2         | two-component vector    | i, i + 1              |
    | mat2x3, dmat2x3, f16mat2x3   | three-component vector  | i, i + 1              |
    | mat2x4, dmat2x4, f16mat2x4   | four-component vector   | i, i + 1              |
    | mat3x2, dmat3x2, f16mat3x2   | two-component vector    | i, i + 1, i + 2       |
    | mat3, dmat3, f16mat3         | three-component vector  | i, i + 1, i + 2       |
    | mat3x4, dmat3x4, f16mat3x4   | four-component vector   | i, i + 1, i + 2       |
    | mat4x2, dmat4x2, f16mat4x2   | two-component vector    | i, i + 1, i + 2, i + 3|
    | mat4x3, dmat4x3, f16mat4x3   | three-component vector  | i, i + 1, i + 2, i + 3|
    | mat4, dmat4, f16mat4         | four-component vector   | i, i + 1, i + 2, i + 3|
    +------------------------------+-------------------------+-----------------------+
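
    The pattern in Table 11.2 is that a matrix attribute uses one generic
    attribute per column, regardless of component precision. A minimal C
    sketch of that rule follows; the helper matrix_attrib_indices is
    hypothetical and only restates the table.

```c
#include <assert.h>

/* Hypothetical helper: for a matrix attribute with `columns` columns bound
   to generic attribute index `first`, writes the indices used (one per
   column vector) and returns how many were written. */
static unsigned matrix_attrib_indices(unsigned first, unsigned columns,
                                      unsigned indices[4])
{
    assert(columns >= 2 && columns <= 4);
    for (unsigned c = 0; c < columns; ++c)
        indices[c] = first + c;
    return columns;
}
```

    For example, an f16mat3x4 (3 columns of four-component vectors) bound at
    index 5 would use generic attributes 5, 6 and 7.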

    (modify Table 11.3: Scalar and vector vertex attribute types and VertexAttrib*
    commands used to set the values of the corresponding generic attributes, p. 366)

    +-------------------+--------------------------+
    |   Data type       |         Command          |
    +-------------------+--------------------------+
    | float, float16_t  | VertexAttrib1*           |
    | vec2, f16vec2     | VertexAttrib2*           |
    | vec3, f16vec3     | VertexAttrib3*           |
    | vec4, f16vec4     | VertexAttrib4*           |
    +-------------------+--------------------------+


    Modify Section 11.1.2.1, Output Variables

    (modify the last paragraph on p. 374)

    ..., each component of outputs declared as half-precision floating-point
    scalars, vectors, or matrices is considered to consume two basic machine
    units, and each component of any other type ...
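
    As a worked example of this counting rule, the C sketch below totals
    the basic machine units consumed by an output. The value 2 for half
    float components comes from the text above; the values 8 for
    double-precision and 4 for all other types are taken from the elided
    parts of the base specification paragraph as recalled, and should be
    treated as assumptions. The names used are hypothetical.

```c
#include <assert.h>

enum component_kind {
    COMPONENT_HALF,     /* float16_t, f16vecN, f16matCxR */
    COMPONENT_DOUBLE,   /* double, dvecN, dmatCxR        */
    COMPONENT_OTHER     /* every other component type    */
};

/* Hypothetical helper: basic machine units consumed by an output with
   `components` components of the given kind. */
static unsigned xfb_basic_machine_units(enum component_kind kind,
                                        unsigned components)
{
    unsigned per_component;

    assert(components >= 1);
    switch (kind) {
    case COMPONENT_HALF:   per_component = 2; break;
    case COMPONENT_DOUBLE: per_component = 8; break;  /* assumed */
    default:               per_component = 4; break;  /* assumed */
    }
    return per_component * components;
}
```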


Modifications to the OpenGL Shading Language Specification, Version 4.50

    Including the following line in a shader can be used to control the
    language features described in this extension:

      #extension GL_AMD_gpu_shader_half_float : <behavior>

    where <behavior> is as specified in section 3.3.

    New preprocessor #defines are added to the OpenGL Shading Language:

      #define GL_AMD_gpu_shader_half_float       1
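
    For illustration, the C sketch below embeds a small fragment shader
    that enables the extension with the "require" behavior and uses the
    half float types. The shader text has not been compiled against a
    driver here, and the names half_float_fragment_shader and
    source_requires_half_float are hypothetical.

```c
#include <assert.h>
#include <string.h>

/* Example GLSL source (an untested sketch). The #extension line follows
   #version and precedes any use of the half float types. */
static const char *const half_float_fragment_shader =
    "#version 450 core\n"
    "#extension GL_AMD_gpu_shader_half_float : require\n"
    "\n"
    "layout(location = 0) out vec4 color;\n"
    "\n"
    "void main()\n"
    "{\n"
    "    float16_t gain = 0.5HF;   // half-precision literal\n"
    "    f16vec4 base = f16vec4(1.0HF, 0.5HF, 0.25HF, 1.0HF);\n"
    "    color = vec4(base * gain); // explicit widening to vec4\n"
    "}\n";

/* Hypothetical helper: a naive textual check that a shader source
   requires this extension. */
static int source_requires_half_float(const char *source)
{
    return strstr(source,
        "#extension GL_AMD_gpu_shader_half_float : require") != NULL;
}
```

    A shader that should also compile without the extension could instead
    guard its half float code with #ifdef GL_AMD_gpu_shader_half_float.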


Additions to Chapter 3 of the OpenGL Shading Language Specification (Basics)


    Modify Section 3.6, Keywords

    (add the following to the list of reserved keywords at p. 18)

    float16_t f16vec2 f16vec3 f16vec4
    f16mat2  f16mat3  f16mat4
    f16mat2x2 f16mat2x3 f16mat2x4
    f16mat3x2 f16mat3x3 f16mat3x4
    f16mat4x2 f16mat4x3 f16mat4x4


Additions to Chapter 4 of the OpenGL Shading Language Specification
(Variables and Types)


    Modify Section 4.1, Basic Types

    (add to the basic "Transparent Types" table, p. 23)

    +-----------+------------------------------------------------------------+
    | Type      | Meaning                                                    |
    +-----------+------------------------------------------------------------+
    | float16_t | a half-precision floating-point scalar                     |
    | f16vec2   | a two-component half-precision floating-point vector       |
    | f16vec3   | a three-component half-precision floating-point vector     |
    | f16vec4   | a four-component half-precision floating-point vector      |
    | f16mat2   | a 2x2 half-precision floating-point matrix                 |
    | f16mat3   | a 3x3 half-precision floating-point matrix                 |
    | f16mat4   | a 4x4 half-precision floating-point matrix                 |
    | f16mat2x2 | same as a f16mat2                                          |
    | f16mat2x3 | a half-precision floating-point matrix with 2 columns and  |
    |           | 3 rows                                                     |
    | f16mat2x4 | a half-precision floating-point matrix with 2 columns and  |
    |           | 4 rows                                                     |
    | f16mat3x2 | a half-precision floating-point matrix with 3 columns and  |
    |           | 2 rows                                                     |
    | f16mat3x3 | same as a f16mat3                                          |
    | f16mat3x4 | a half-precision floating-point matrix with 3 columns and  |
    |           | 4 rows                                                     |
    | f16mat4x2 | a half-precision floating-point matrix with 4 columns and  |
    |           | 2 rows                                                     |
    | f16mat4x3 | a half-precision floating-point matrix with 4 columns and  |
    |           | 3 rows                                                     |
    | f16mat4x4 | same as a f16mat4                                          |
    +-----------+------------------------------------------------------------+


    Modify Section 4.1.4, Floating-Point Variables

    (replace first paragraph of the section, p. 29)

    Single-precision, double-precision and half-precision floating-point
    variables are available for use in a variety of scalar calculations.
    Generally, the term floating-point will refer to all single-, double- and
    half-precision floating-point. Floating-point variables are defined as in
    the following examples:

        float a, b = 1.5;       // single-precision floating-point
        double c, d = 2.0LF;    // double-precision floating-point
        float16_t e, f = 3.0HF; // half-precision floating-point

    As an input value to one of the processing units, a single-precision,
    double-precision or half-precision floating-point variable is expected to
    match the corresponding IEEE 754 floating-point definition in terms of
    precision and dynamic range.

    (modify grammar rule for "floating-suffix", p. 30)

      floating-suffix: one of
        f F lf LF hf HF

    (modify the fourth sentence of second paragraph on p. 30)

    When the suffix "lf" or "LF" is present, the literal has type double. When the
    suffix "hf" or "HF" is present, the literal has type float16_t. Otherwise, the
    literal has type float.
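
    The suffix rule can be sketched in C as a small classifier over the
    text of a floating-point literal. The sketch assumes its input is
    already a well-formed GLSL floating-point literal and does not validate
    it; the enum and the helper floating_literal_type are hypothetical.

```c
#include <assert.h>
#include <string.h>

enum literal_type { LITERAL_FLOAT, LITERAL_DOUBLE, LITERAL_FLOAT16 };

/* Hypothetical helper: returns the type a well-formed floating-point
   literal has under the modified suffix rule. */
static enum literal_type floating_literal_type(const char *text)
{
    size_t len = strlen(text);

    if (len >= 2) {
        const char *suffix = text + len - 2;
        if (strcmp(suffix, "lf") == 0 || strcmp(suffix, "LF") == 0)
            return LITERAL_DOUBLE;
        if (strcmp(suffix, "hf") == 0 || strcmp(suffix, "HF") == 0)
            return LITERAL_FLOAT16;
    }
    /* No suffix, or the "f"/"F" suffix: the literal has type float. */
    return LITERAL_FLOAT;
}
```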


    Modify Section 4.1.6, Matrices

    (modify the second sentence in the section, p. 30)

    Matrix types beginning with "mat" have single-precision components, matrix
    types beginning with "dmat" have double-precision components and matrix types
    beginning with "f16mat" have half-precision components.


    Modify Section 4.1.10, Implicit Conversions

    (modify the implicit conversion table on p. 37)

    +-----------------------+-------------------------------------------------+
    | Type of expression    | Can be implicitly converted to                  |
    +-----------------------+-------------------------------------------------+
    | int, uint, float16_t  | float                                           |
    | ivec2, uvec2, f16vec2 | vec2                                            |
    | ivec3, uvec3, f16vec3 | vec3                                            |
    | ivec4, uvec4, f16vec4 | vec4                                            |
    | f16mat2               | mat2                                            |
    | f16mat3               | mat3                                            |
    | f16mat4               | mat4                                            |
    | f16mat2x3             | mat2x3                                          |
    | f16mat2x4             | mat2x4                                          |
    | f16mat3x2             | mat3x2                                          |
    | f16mat3x4             | mat3x4                                          |
    | f16mat4x2             | mat4x2                                          |
    | f16mat4x3             | mat4x3                                          |
    | int, uint,            | double                                          |
    | float, float16_t      |                                                 |
    | ivec2, uvec2,         | dvec2                                           |
    | vec2, f16vec2         |                                                 |
    | ivec3, uvec3,         | dvec3                                           |
    | vec3, f16vec3         |                                                 |
    | ivec4, uvec4,         | dvec4                                           |
    | vec4, f16vec4         |                                                 |
    | mat2, f16mat2         | dmat2                                           |
    | mat3, f16mat3         | dmat3                                           |
    | mat4, f16mat4         | dmat4                                           |
    | mat2x3, f16mat2x3     | dmat2x3                                         |
    | mat2x4, f16mat2x4     | dmat2x4                                         |
    | mat3x2, f16mat3x2     | dmat3x2                                         |
    | mat3x4, f16mat3x4     | dmat3x4                                         |
    | mat4x2, f16mat4x2     | dmat4x2                                         |
    | mat4x3, f16mat4x3     | dmat4x3                                         |
    +-----------------------+-------------------------------------------------+
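
    The scalar rows of the table above can be restated as a small predicate.
    The C sketch below covers only the scalar rows shown in this table (any
    rows of the base specification table that are not reproduced here are
    deliberately left out), and its names are hypothetical. Note that no
    row lists float16_t as a destination, so the predicate never allows an
    implicit conversion to half float.

```c
#include <assert.h>

enum scalar_type {
    TYPE_INT, TYPE_UINT, TYPE_FLOAT16, TYPE_FLOAT, TYPE_DOUBLE
};

/* Hypothetical helper: returns 1 if the scalar rows of the table above
   allow an implicit conversion from `from` to `to`, and 0 otherwise. */
static int converts_implicitly(enum scalar_type from, enum scalar_type to)
{
    switch (to) {
    case TYPE_FLOAT:
        return from == TYPE_INT || from == TYPE_UINT ||
               from == TYPE_FLOAT16;
    case TYPE_DOUBLE:
        return from == TYPE_INT || from == TYPE_UINT ||
               from == TYPE_FLOAT || from == TYPE_FLOAT16;
    default:
        return 0;
    }
}
```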


    Modify Section 4.4.2.1, Transform Feedback Layout Qualifiers

    (insert after the fourth paragraph in the section on p. 70)

    ... will be a multiple of 8; if applied to an aggregate containing a
    float16_t, the offset must also be a multiple of 2, and the space taken in
    the buffer will be a multiple of 2.


    Modify Section 4.7.1, Range and Precision

    (insert after the first paragraph in the section on p. 85)

    ... and positive and negative zeros. The precision of stored half-
    precision floating-point variables is described in section 2.3.3.2 "16-Bit
    Floating-Point Numbers" of the OpenGL Specification.

    The following rules apply to all floating-point operations, including
    single-, double- and half-precision operations:...


Additions to Chapter 5 of the OpenGL Shading Language Specification
(Operators and Expressions)


    Modify Section 5.4.1, Conversion and Scalar Constructors

    (add after the first list of constructor examples on p. 97)

      int(float16_t)    // convert a float16_t value to a signed integer
      uint(float16_t)   // convert a float16_t value to an unsigned integer
      bool(float16_t)   // convert a float16_t value to a Boolean
      float(float16_t)  // convert a float16_t value to a float value
      double(float16_t) // convert a float16_t value to a double value
      float16_t(bool)   // convert a Boolean to a float16_t value
      float16_t(int)    // convert a signed integer to a float16_t value
      float16_t(uint)   // convert an unsigned integer to a float16_t value
      float16_t(float)  // convert a float value to a float16_t value
      float16_t(double) // convert a double value to a float16_t value

    (modify the first sentence of last paragraph on p. 98)

    ... other arguments.
    If the basic type (bool, int, float, double, or float16_t) of a parameter to
    a constructor does not match the basic type of the object being constructed,
    the scalar construction rules (above) are used to convert the parameters.


Additions to Chapter 6 of the OpenGL Shading Language Specification
(Statements and Structure)


    Modify Section 6.1, Function Definitions

    (replace the second rule in third paragraph on p. 113)

      2. A match involving a conversion from a signed integer, unsigned
         integer, or floating-point type to a similar type having a larger
         number of bits is better than a match involving any other implicit
         conversion.

Additions to Chapter 8 of the OpenGL Shading Language Specification
(Built-in Functions)

    (insert after the sixth sentence of last paragraph on p. 140)

    ... genDType is used as the argument. Where the input arguments (and
    corresponding output) can be float16_t, f16vec2, f16vec3, f16vec4,
    genF16Type is used as the argument.


    Modify Section 8.1, Angle and Trigonometry Functions

    (add to the table of Angle and Trigonometry Functions on p. 141)

    +------------------------------------------------+----------------------------------------------------+
    | Syntax                                         | Description                                        |
    +------------------------------------------------+----------------------------------------------------+
    | genF16Type radians (genF16Type degrees)        | Converts degrees to radians, i.e., PI/180 *        |
    |                                                | degrees.                                           |
    +------------------------------------------------+----------------------------------------------------+
    | genF16Type degrees (genF16Type radians)        | Converts radians to degrees, i.e., 180/PI *        |
    |                                                | radians.                                           |
    +------------------------------------------------+----------------------------------------------------+
    | genF16Type sin (genF16Type angle)              | The standard trigonometric sine function.          |
    +------------------------------------------------+----------------------------------------------------+
    | genF16Type cos (genF16Type angle)              | The standard trigonometric cosine function.        |
    +------------------------------------------------+----------------------------------------------------+
    | genF16Type tan (genF16Type angle)              | The standard trigonometric tangent.                |
    +------------------------------------------------+----------------------------------------------------+
    | genF16Type asin (genF16Type x)                 | Arc sine. Returns an angle whose sine is x. The    |
    |                                                | range of values returned by this function is       |
    |                                                | [-PI/2, PI/2]. Results are undefined if |x| > 1.   |
    +------------------------------------------------+----------------------------------------------------+
    | genF16Type acos (genF16Type x)                 | Arc cosine. Returns an angle whose cosine is x. The|
    |                                                | range of values returned by this function is       |
    |                                                | [0, PI]. Results are undefined if |x| > 1.         |
    +------------------------------------------------+----------------------------------------------------+
    | genF16Type atan (genF16Type y, genF16Type x)   | Arc tangent. Returns an angle whose tangent is y/x.|
    |                                                | The signs of x and y are used to determine what    |
    |                                                | quadrant the angle is in. The range of values      |
    |                                                | returned by this function is [-PI,PI]. Results are |
    |                                                | undefined if x and y are both 0.                   |
    +------------------------------------------------+----------------------------------------------------+
    | genF16Type atan (genF16Type y_over_x)          | Arc tangent. Returns an angle whose tangent is     |
    |                                                | y_over_x. The range of values returned by this     |
    |                                                | function is [-PI/2, PI/2].                         |
    +------------------------------------------------+----------------------------------------------------+
    | genF16Type sinh (genF16Type x)                 | Returns the hyperbolic sine function               |
    |                                                | (e^x - e^-x) / 2.                                  |
    +------------------------------------------------+----------------------------------------------------+
    | genF16Type cosh (genF16Type x)                 | Returns the hyperbolic cosine function             |
    |                                                | (e^x + e^-x) / 2.                                  |
    +------------------------------------------------+----------------------------------------------------+
    | genF16Type tanh (genF16Type x)                 | Returns the hyperbolic tangent function            |
    |                                                | sinh(x) / cosh(x).                                 |
    +------------------------------------------------+----------------------------------------------------+
    | genF16Type asinh (genF16Type x)                | Arc hyperbolic sine; returns the inverse of sinh.  |
    +------------------------------------------------+----------------------------------------------------+
    | genF16Type acosh (genF16Type x)                | Arc hyperbolic cosine; returns the non-negative    |
    |                                                | inverse of cosh. Results are undefined if x < 1.   |
5bd8deadSopenharmony_ci    +------------------------------------------------+----------------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type atanh (genF16Type x)                | Arc hyperbolic tangent; returns the inverse of     |
5bd8deadSopenharmony_ci    |                                                | tanh. Results are undefined if |x| >= 1.           |
5bd8deadSopenharmony_ci    +------------------------------------------------+----------------------------------------------------+
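
    Non-normative example. The sketch below shows one way a shader might
    guard the two-argument atan against its undefined case (x and y both
    0). The helper name directionAngle, the input vDir, and the output
    fragColor are hypothetical; the hf literal suffix is assumed to be
    accepted for half-precision constants.

```glsl
#version 450
#extension GL_AMD_gpu_shader_half_float : enable

layout(location = 0) in vec2 vDir;        // assumed single-precision input
layout(location = 0) out vec4 fragColor;  // assumed color output

// Hypothetical helper: angle of a 2D direction, in [-PI, PI].
float16_t directionAngle(f16vec2 d)
{
    // atan(y, x) is undefined when both arguments are 0, so return a
    // fixed angle for the zero vector instead of calling it.
    if (d.x == 0.0hf && d.y == 0.0hf)
        return 0.0hf;
    return atan(d.y, d.x);
}

void main()
{
    float16_t angle = directionAngle(f16vec2(vDir));
    // Remap [-PI, PI] to [0, 1] in single precision for display.
    float shade = float(angle) / 6.2831853 + 0.5;
    fragColor = vec4(vec3(shade), 1.0);
}
```

    The explicit f16vec2() and float() constructors are used so that the
    sketch does not rely on any implicit conversion.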
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Modify Section 8.2, Exponential Functions
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (add to the table of Exponential Functions on p. 143)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    +------------------------------------------------+----------------------------------------------------+
5bd8deadSopenharmony_ci    | Syntax                                         | Description                                        |
5bd8deadSopenharmony_ci    +------------------------------------------------+----------------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type pow (genF16Type x, genF16Type y)    | Returns x raised to the y power, i.e., x^y.        |
5bd8deadSopenharmony_ci    |                                                | Results are undefined if x < 0.                    |
5bd8deadSopenharmony_ci    |                                                | Results are undefined if x = 0 and y <= 0.         |
5bd8deadSopenharmony_ci    +------------------------------------------------+----------------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type exp (genF16Type x)                  | Returns the natural exponentiation of x, i.e., e^x.|
5bd8deadSopenharmony_ci    +------------------------------------------------+----------------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type log (genF16Type x)                  | Returns the natural logarithm of x, i.e., returns  |
5bd8deadSopenharmony_ci    |                                                | the value y which satisfies the equation x = e^y.  |
5bd8deadSopenharmony_ci    |                                                | Results are undefined if x <= 0.                   |
5bd8deadSopenharmony_ci    +------------------------------------------------+----------------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type exp2 (genF16Type x)                 | Returns 2 raised to the x power, i.e., 2^x.        |
5bd8deadSopenharmony_ci    +------------------------------------------------+----------------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type log2 (genF16Type x)                 | Returns the base 2 logarithm of x, i.e., returns   |
5bd8deadSopenharmony_ci    |                                                | the value y which satisfies the equation x = 2^y.  |
5bd8deadSopenharmony_ci    |                                                | Results are undefined if x <= 0.                   |
5bd8deadSopenharmony_ci    +------------------------------------------------+----------------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type sqrt (genF16Type x)                 | Returns sqrt(x). Results are undefined if x < 0.   |
5bd8deadSopenharmony_ci    +------------------------------------------------+----------------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type inversesqrt (genF16Type x)          | Returns 1 / sqrt(x). Results are undefined if      |
5bd8deadSopenharmony_ci    |                                                | x <= 0.                                            |
5bd8deadSopenharmony_ci    +------------------------------------------------+----------------------------------------------------+
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Modify Section 8.3, Common Functions
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (add to the table of common functions on p. 144)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    +------------------------------------------------+----------------------------------------------------+
5bd8deadSopenharmony_ci    | Syntax                                         | Description                                        |
5bd8deadSopenharmony_ci    +------------------------------------------------+----------------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type abs(genF16Type x)                   | Returns x if x >= 0; otherwise it returns -x.      |
5bd8deadSopenharmony_ci    +------------------------------------------------+----------------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type sign(genF16Type x)                  | Returns 1.0 if x > 0, 0.0 if x = 0, or -1.0 if     |
5bd8deadSopenharmony_ci    |                                                | x < 0.                                             |
5bd8deadSopenharmony_ci    +------------------------------------------------+----------------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type floor (genF16Type x)                | Returns a value equal to the nearest integer that  |
5bd8deadSopenharmony_ci    |                                                | is less than or equal to x.                        |
5bd8deadSopenharmony_ci    +------------------------------------------------+----------------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type trunc (genF16Type x)                | Returns a value equal to the nearest integer to x  |
5bd8deadSopenharmony_ci    |                                                | whose absolute value is not larger than the        |
5bd8deadSopenharmony_ci    |                                                | absolute value of x.                               |
5bd8deadSopenharmony_ci    +------------------------------------------------+----------------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type round (genF16Type x)                | Returns a value equal to the nearest integer to x. |
5bd8deadSopenharmony_ci    |                                                | The fraction 0.5 will round in a direction chosen  |
5bd8deadSopenharmony_ci    |                                                | by the implementation, presumably the direction    |
5bd8deadSopenharmony_ci    |                                                | that is fastest. This includes the possibility     |
5bd8deadSopenharmony_ci    |                                                | that round(x) returns the same value as            |
5bd8deadSopenharmony_ci    |                                                | roundEven(x) for all values of x.                  |
5bd8deadSopenharmony_ci    +------------------------------------------------+----------------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type roundEven (genF16Type x)            | Returns a value equal to the nearest integer to x. |
5bd8deadSopenharmony_ci    |                                                | A fractional part of 0.5 will round toward the     |
5bd8deadSopenharmony_ci    |                                                | nearest even integer. (Both 3.5 and 4.5 for x will |
5bd8deadSopenharmony_ci    |                                                | return 4.0.)                                       |
5bd8deadSopenharmony_ci    +------------------------------------------------+----------------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type ceil (genF16Type x)                 | Returns a value equal to the nearest integer that  |
5bd8deadSopenharmony_ci    |                                                | is greater than or equal to x.                     |
5bd8deadSopenharmony_ci    +------------------------------------------------+----------------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type fract (genF16Type x)                | Returns x - floor(x).                              |
5bd8deadSopenharmony_ci    +------------------------------------------------+----------------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type mod (genF16Type x, float16_t y)     | Modulus. Returns x - y * floor(x/y).               |
5bd8deadSopenharmony_ci    | genF16Type mod (genF16Type x, genF16Type y)    |                                                    |
5bd8deadSopenharmony_ci    +------------------------------------------------+----------------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type modf(genF16Type x, out genF16Type i)| Returns the fractional part of x and sets i to the |
5bd8deadSopenharmony_ci    |                                                | integer part (as a whole number floating-point     |
5bd8deadSopenharmony_ci    |                                                | value). Both the return value and the output       |
5bd8deadSopenharmony_ci    |                                                | parameter will have the same sign as x.            |
5bd8deadSopenharmony_ci    +------------------------------------------------+----------------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type min(genF16Type x,                   | Returns y if y < x; otherwise it returns x.        |
5bd8deadSopenharmony_ci    |                genF16Type y)                   |                                                    |
5bd8deadSopenharmony_ci    | genF16Type min(genF16Type x,                   |                                                    |
5bd8deadSopenharmony_ci    |                float16_t y)                    |                                                    |
5bd8deadSopenharmony_ci    +------------------------------------------------+----------------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type max(genF16Type x,                   | Returns y if x < y; otherwise it returns x.        |
5bd8deadSopenharmony_ci    |                genF16Type y)                   |                                                    |
5bd8deadSopenharmony_ci    | genF16Type max(genF16Type x,                   |                                                    |
5bd8deadSopenharmony_ci    |                float16_t y)                    |                                                    |
5bd8deadSopenharmony_ci    +------------------------------------------------+----------------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type clamp(genF16Type x,                 | Returns min(max(x, minVal), maxVal).               |
5bd8deadSopenharmony_ci    |                  genF16Type minVal,            |                                                    |
5bd8deadSopenharmony_ci    |                  genF16Type maxVal)            | Results are undefined if minVal > maxVal.          |
5bd8deadSopenharmony_ci    | genF16Type clamp(genF16Type x,                 |                                                    |
5bd8deadSopenharmony_ci    |                  float16_t minVal,             |                                                    |
5bd8deadSopenharmony_ci    |                  float16_t maxVal)             |                                                    |
5bd8deadSopenharmony_ci    +------------------------------------------------+----------------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type mix(genF16Type x,                   | Returns the linear blend of x and y, i.e.,         |
5bd8deadSopenharmony_ci    |                genF16Type y,                   | x * (1 - a) + y * a.                               |
5bd8deadSopenharmony_ci    |                genF16Type a)                   |                                                    |
5bd8deadSopenharmony_ci    | genF16Type mix(genF16Type x,                   |                                                    |
5bd8deadSopenharmony_ci    |                genF16Type y,                   |                                                    |
5bd8deadSopenharmony_ci    |                float16_t a)                    |                                                    |
5bd8deadSopenharmony_ci    +------------------------------------------------+----------------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type mix(genF16Type x,                   | Selects which vector each returned component comes |
5bd8deadSopenharmony_ci    |                genF16Type y,                   | from. For a component of a that is false, the      |
5bd8deadSopenharmony_ci    |                genBType a)                     | corresponding component of x is returned. For a    |
5bd8deadSopenharmony_ci    |                                                | component of a that is true, the corresponding     |
5bd8deadSopenharmony_ci    |                                                | component of y is returned.                        |
5bd8deadSopenharmony_ci    +------------------------------------------------+----------------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type step (genF16Type edge, genF16Type x)| Returns 0.0 if x < edge; otherwise it returns 1.0. |
5bd8deadSopenharmony_ci    | genF16Type step (float16_t edge, genF16Type x) |                                                    |
5bd8deadSopenharmony_ci    +------------------------------------------------+----------------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type smoothstep (genF16Type edge0,       | Returns 0.0 if x <= edge0 and 1.0 if x >= edge1    |
5bd8deadSopenharmony_ci    |                        genF16Type edge1,       | and performs smooth Hermite interpolation between 0|
5bd8deadSopenharmony_ci    |                        genF16Type x)           | and 1 when edge0 < x < edge1. This is useful in    |
5bd8deadSopenharmony_ci    | genF16Type smoothstep (float16_t edge0,        | cases where you would want a threshold function    |
5bd8deadSopenharmony_ci    |                        float16_t edge1,        | with a smooth transition. This is equivalent to:   |
5bd8deadSopenharmony_ci    |                        genF16Type x)           |    genF16Type t;                                   |
5bd8deadSopenharmony_ci    |                                                |    t = clamp((x - edge0) / (edge1 - edge0), 0, 1); |
5bd8deadSopenharmony_ci    |                                                |    return t * t * (3 - 2 * t);                     |
5bd8deadSopenharmony_ci    |                                                | Results are undefined if edge0 >= edge1.           |
5bd8deadSopenharmony_ci    +------------------------------------------------+----------------------------------------------------+
5bd8deadSopenharmony_ci    | genBType isnan (genF16Type x)                  | Returns true if x holds a NaN. Returns false       |
5bd8deadSopenharmony_ci    |                                                | otherwise. Always returns false if NaNs are not    |
5bd8deadSopenharmony_ci    |                                                | implemented.                                       |
5bd8deadSopenharmony_ci    +------------------------------------------------+----------------------------------------------------+
5bd8deadSopenharmony_ci    | genBType isinf (genF16Type x)                  | Returns true if x holds a positive infinity or     |
5bd8deadSopenharmony_ci    |                                                | negative infinity. Returns false otherwise.        |
5bd8deadSopenharmony_ci    +------------------------------------------------+----------------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type fma (genF16Type a, genF16Type b,    | Computes and returns a * b + c.                    |
5bd8deadSopenharmony_ci    |                 genF16Type c)                  |                                                    |
5bd8deadSopenharmony_ci    +------------------------------------------------+----------------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type frexp (genF16Type x,                | Splits x into a floating-point significand in the  |
5bd8deadSopenharmony_ci    |                   out genIType exp)            | range [0.5, 1.0) and an integral exponent of two,  |
5bd8deadSopenharmony_ci    |                                                | such that:                                         |
5bd8deadSopenharmony_ci    |                                                |    x = significand * 2^exp                         |
5bd8deadSopenharmony_ci    |                                                | The significand is returned by the function and the|
5bd8deadSopenharmony_ci    |                                                | exponent is returned in the parameter exp. For a   |
5bd8deadSopenharmony_ci    |                                                | floating-point value of zero, the significand and  |
5bd8deadSopenharmony_ci    |                                                | exponent are both zero. For a floating-point value |
5bd8deadSopenharmony_ci    |                                                | that is an infinity or is not a number, the results|
5bd8deadSopenharmony_ci    |                                                | are undefined.                                     |
5bd8deadSopenharmony_ci    +------------------------------------------------+----------------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type ldexp (genF16Type x,                | Builds a floating-point number from x and the      |
5bd8deadSopenharmony_ci    |                   in genIType exp)             | corresponding integral exponent of two in exp,     |
5bd8deadSopenharmony_ci    |                                                | returning:                                         |
5bd8deadSopenharmony_ci    |                                                |    x * 2^exp                                       |
5bd8deadSopenharmony_ci    |                                                | If this product is too large to be represented in  |
5bd8deadSopenharmony_ci    |                                                | the floating-point type, the result is undefined.  |
5bd8deadSopenharmony_ci    +------------------------------------------------+----------------------------------------------------+
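
    Non-normative example. The equivalence given for smoothstep above can
    be written out for the scalar float16_t case as in the sketch below.
    The function name smoothstepRef and the buffer block Data are
    hypothetical, and half-precision literals with the hf suffix are used
    in place of the integer constants shown in the table so that no
    implicit conversion is assumed.

```glsl
#version 450
#extension GL_AMD_gpu_shader_half_float : enable

layout(local_size_x = 64) in;

// Hypothetical single-precision buffer; values are converted on load/store.
layout(std430, binding = 0) buffer Data {
    float values[];
};

// Scalar form of the smoothstep equivalence from the table.
// As with the built-in, results are undefined if edge0 >= edge1.
float16_t smoothstepRef(float16_t edge0, float16_t edge1, float16_t x)
{
    float16_t t = clamp((x - edge0) / (edge1 - edge0), 0.0hf, 1.0hf);
    return t * t * (3.0hf - 2.0hf * t);
}

void main()
{
    uint i = gl_GlobalInvocationID.x;
    if (i >= uint(values.length()))
        return;

    float16_t x = float16_t(values[i]);
    values[i] = float(smoothstepRef(0.0hf, 1.0hf, x));
}
```

    Replacing the call to smoothstepRef with the built-in smoothstep is
    expected to give matching results up to half-precision rounding.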
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Modify Section 8.4, Floating-Point Pack and Unpack Functions
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (add to the table of pack and unpack functions on p. 149)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    +-----------------------------------+------------------------------------------------------+
5bd8deadSopenharmony_ci    | Syntax                            | Description                                          |
5bd8deadSopenharmony_ci    +-----------------------------------+------------------------------------------------------+
5bd8deadSopenharmony_ci    | uint packFloat2x16(f16vec2 v)     | Returns an unsigned 32-bit integer obtained by       |
5bd8deadSopenharmony_ci    |                                   | packing the components of a two-component half-      |
5bd8deadSopenharmony_ci    |                                   | precision floating-point vector. The first vector    |
5bd8deadSopenharmony_ci    |                                   | component specifies the 16 least significant bits;   |
5bd8deadSopenharmony_ci    |                                   | the second component specifies the 16 most           |
5bd8deadSopenharmony_ci    |                                   | significant bits.                                    |
5bd8deadSopenharmony_ci    +-----------------------------------+------------------------------------------------------+
5bd8deadSopenharmony_ci    | f16vec2 unpackFloat2x16(uint v)   | Returns a two-component half-precision floating-point|
5bd8deadSopenharmony_ci    |                                   | vector built from a 32-bit unsigned integer scalar.  |
5bd8deadSopenharmony_ci    |                                   | The first component of the vector contains the 16    |
5bd8deadSopenharmony_ci    |                                   | least significant bits of the input; the second      |
5bd8deadSopenharmony_ci    |                                   | component contains the 16 most significant bits.     |
5bd8deadSopenharmony_ci    +-----------------------------------+------------------------------------------------------+
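
    Non-normative example. The bit layout described above can be observed
    with a sketch such as the following. The buffer block Result and its
    member names are hypothetical and exist only to make the values
    visible to the application; the bit patterns quoted in the comments
    assume an IEEE 754 binary16 encoding of float16_t.

```glsl
#version 450
#extension GL_AMD_gpu_shader_half_float : enable

layout(local_size_x = 1) in;

// Hypothetical output buffer used only to expose the results.
layout(std430, binding = 0) buffer Result {
    uint bits;       // both components packed into one 32-bit word
    uint lowHalf;    // 16 least significant bits (from v.x)
    uint highHalf;   // 16 most significant bits (from v.y)
    vec2 roundTrip;  // unpacked again and widened to single precision
};

void main()
{
    f16vec2 v = f16vec2(1.0hf, -2.0hf);

    bits     = packFloat2x16(v);
    lowHalf  = bits & 0xFFFFu;  // 0x3C00, the binary16 pattern of 1.0
    highHalf = bits >> 16;      // 0xC000, the binary16 pattern of -2.0

    // Unpacking restores the original components: (1.0, -2.0).
    roundTrip = vec2(unpackFloat2x16(bits));
}
```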
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Modify Section 8.5, Geometric Functions
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (add to the table of geometric functions on p. 152)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    +-------------------------------------------+-----------------------------------------------+
5bd8deadSopenharmony_ci    | Syntax                                    | Description                                   |
5bd8deadSopenharmony_ci    +-------------------------------------------+-----------------------------------------------+
5bd8deadSopenharmony_ci    | float16_t length (genF16Type x)           | Returns the length of vector x, i.e.,         |
5bd8deadSopenharmony_ci    |                                           | sqrt(x[0]*x[0] + x[1]*x[1] + ...)             |
5bd8deadSopenharmony_ci    +-------------------------------------------+-----------------------------------------------+
5bd8deadSopenharmony_ci    | float16_t distance (genF16Type p0,        | Returns the distance between p0 and p1, i.e., |
5bd8deadSopenharmony_ci    |                     genF16Type p1)        | length (p0 - p1)                              |
5bd8deadSopenharmony_ci    +-------------------------------------------+-----------------------------------------------+
5bd8deadSopenharmony_ci    | float16_t dot (genF16Type x, genF16Type y)| Returns the dot product of x and y, i.e.,     |
5bd8deadSopenharmony_ci    |                                           | x[0]*y[0] + x[1]*y[1] + ...                   |
5bd8deadSopenharmony_ci    +-------------------------------------------+-----------------------------------------------+
5bd8deadSopenharmony_ci    | f16vec3 cross (f16vec3 x, f16vec3 y)      | Returns the cross product of x and y, i.e.,   |
5bd8deadSopenharmony_ci    |                                           | |x[1] * y[2] - y[1] * x[2]|                   |
5bd8deadSopenharmony_ci    |                                           | |x[2] * y[0] - y[2] * x[0]|                   |
5bd8deadSopenharmony_ci    |                                           | |x[0] * y[1] - y[0] * x[1]|                   |
5bd8deadSopenharmony_ci    +-------------------------------------------+-----------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type normalize (genF16Type x)       | Returns a vector in the same direction as x   |
5bd8deadSopenharmony_ci    |                                           | but with a length of 1.                       |
5bd8deadSopenharmony_ci    +-------------------------------------------+-----------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type faceforward (genF16Type N,     | If dot(Nref, I) < 0 return N, otherwise return|
5bd8deadSopenharmony_ci    |                         genF16Type I,     | -N.                                           |
5bd8deadSopenharmony_ci    |                         genF16Type Nref)  |                                               |
5bd8deadSopenharmony_ci    +-------------------------------------------+-----------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type reflect (genF16Type I,         | For the incident vector I and surface         |
5bd8deadSopenharmony_ci    |                     genF16Type N)         | orientation N, returns the reflection         |
5bd8deadSopenharmony_ci    |                                           | direction:                                    |
5bd8deadSopenharmony_ci    |                                           |    I - 2 * dot(N, I) * N                      |
5bd8deadSopenharmony_ci    |                                           | N must already be normalized in order to      |
5bd8deadSopenharmony_ci    |                                           | achieve the desired result.                   |
5bd8deadSopenharmony_ci    +-------------------------------------------+-----------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type refract (genF16Type I,         | For the incident vector I and surface normal  |
5bd8deadSopenharmony_ci    |                     genF16Type N,         | N, and the ratio of indices of refraction eta,|
5bd8deadSopenharmony_ci    |                     float16_t eta)        | return the refraction vector. The result is   |
5bd8deadSopenharmony_ci    |                                           | computed by                                   |
5bd8deadSopenharmony_ci    |                                           |    k = 1.0 - eta * eta * (1.0 - dot(N, I) *   |
5bd8deadSopenharmony_ci    |                                           |                dot(N, I))                     |
5bd8deadSopenharmony_ci    |                                           |    if (k < 0.0)                               |
5bd8deadSopenharmony_ci    |                                           |        return genF16Type(0.0)                 |
5bd8deadSopenharmony_ci    |                                           |    else                                       |
5bd8deadSopenharmony_ci    |                                           |        return eta * I - (eta * dot(N, I)      |
5bd8deadSopenharmony_ci    |                                           |                          + sqrt(k)) * N       |
5bd8deadSopenharmony_ci    |                                           | The input parameters for the incident vector  |
5bd8deadSopenharmony_ci    |                                           | I and the surface normal N must already be    |
5bd8deadSopenharmony_ci    |                                           | normalized to get the desired results.        |
5bd8deadSopenharmony_ci    +-------------------------------------------+-----------------------------------------------+
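
    Non-normative example. Because refract returns a zero vector when
    k < 0.0, a shader may want to detect that case and fall back to
    reflect. The sketch below shows one possible way to do so; the helper
    name refractOrReflect, the inputs vIncident and vNormal, the output
    fragColor, and the eta value 0.75 are all hypothetical.

```glsl
#version 450
#extension GL_AMD_gpu_shader_half_float : enable

layout(location = 0) in vec3 vIncident;   // assumed single-precision inputs
layout(location = 1) in vec3 vNormal;
layout(location = 0) out vec4 fragColor;  // assumed color output

// Hypothetical helper: refract, falling back to reflect when refract
// reports the k < 0.0 case by returning a zero vector.
f16vec3 refractOrReflect(f16vec3 I, f16vec3 N, float16_t eta)
{
    // Both reflect and refract expect normalized inputs.
    f16vec3 i = normalize(I);
    f16vec3 n = normalize(N);

    f16vec3 t = refract(i, n, eta);
    if (all(equal(t, f16vec3(0.0hf))))
        return reflect(i, n);
    return t;
}

void main()
{
    float16_t eta = 0.75hf;  // illustrative ratio of indices of refraction
    f16vec3 d = refractOrReflect(f16vec3(vIncident), f16vec3(vNormal), eta);

    // Visualize the resulting direction as a color in [0, 1].
    fragColor = vec4(vec3(d) * 0.5 + 0.5, 1.0);
}
```

    Comparing against the zero vector keeps the sketch short; computing k
    directly, as in the table, is an equally valid way to detect the case.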
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Modify Section 8.6, Matrix Functions
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (modify the first paragraph of the section on p. 154)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    ..., there is a single-precision floating-point version, where all
5bd8deadSopenharmony_ci    arguments and return values are single precision, a double-precision
5bd8deadSopenharmony_ci    floating-point version, where all arguments and return values are double
5bd8deadSopenharmony_ci    precision, and a half-precision floating-point version, where all
5bd8deadSopenharmony_ci    arguments and return values are half precision.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Modify Section 8.7, Vector Relational Functions
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (add to the table of placeholders at the top of p. 156)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    +-------------+-----------------------------+
5bd8deadSopenharmony_ci    | Placeholder | Specific Types Allowed      |
5bd8deadSopenharmony_ci    +-------------+-----------------------------+
5bd8deadSopenharmony_ci    | f16vec      | f16vec2, f16vec3, f16vec4   |
5bd8deadSopenharmony_ci    +-------------+-----------------------------+
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (add to the table of vector relational functions at the bottom of p. 156)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    +-------------------------------------------+-----------------------------------------------+
5bd8deadSopenharmony_ci    | Syntax                                    | Description                                   |
5bd8deadSopenharmony_ci    +-------------------------------------------+-----------------------------------------------+
5bd8deadSopenharmony_ci    | bvec lessThan(f16vec x, f16vec y)         | Returns the component-wise compare of x < y.  |
5bd8deadSopenharmony_ci    +-------------------------------------------+-----------------------------------------------+
5bd8deadSopenharmony_ci    | bvec lessThanEqual(f16vec x, f16vec y)    | Returns the component-wise compare of x <= y. |
5bd8deadSopenharmony_ci    +-------------------------------------------+-----------------------------------------------+
5bd8deadSopenharmony_ci    | bvec greaterThan(f16vec x, f16vec y)      | Returns the component-wise compare of x > y.  |
5bd8deadSopenharmony_ci    +-------------------------------------------+-----------------------------------------------+
5bd8deadSopenharmony_ci    | bvec greaterThanEqual(f16vec x, f16vec y) | Returns the component-wise compare of x >= y. |
5bd8deadSopenharmony_ci    +-------------------------------------------+-----------------------------------------------+
5bd8deadSopenharmony_ci    | bvec equal(f16vec x, f16vec y)            | Returns the component-wise compare of x == y. |
5bd8deadSopenharmony_ci    +-------------------------------------------+-----------------------------------------------+
5bd8deadSopenharmony_ci    | bvec notEqual(f16vec x, f16vec y)         | Returns the component-wise compare of x != y. |
5bd8deadSopenharmony_ci    +-------------------------------------------+-----------------------------------------------+
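
    Non-normative example. The boolean vector returned by a relational
    function can be passed to the genBType overload of mix to select
    components. The sketch below builds a component-wise "smaller of two"
    this way; the helper name selectSmaller and the inputs vA and vB are
    hypothetical.

```glsl
#version 450
#extension GL_AMD_gpu_shader_half_float : enable

layout(location = 0) in vec4 vA;          // assumed single-precision inputs
layout(location = 1) in vec4 vB;
layout(location = 0) out vec4 fragColor;  // assumed color output

// Hypothetical helper: per component, returns a where a < b and b
// otherwise. For inputs without NaNs this matches min(a, b).
f16vec4 selectSmaller(f16vec4 a, f16vec4 b)
{
    bvec4 aIsLess = lessThan(a, b);
    // mix with a boolean selector returns its first argument where the
    // selector is false and its second argument where it is true.
    return mix(b, a, aIsLess);
}

void main()
{
    fragColor = vec4(selectSmaller(f16vec4(vA), f16vec4(vB)));
}
```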
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Modify Section 8.13.1, Derivative Functions
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (add to the table of derivative functions on p. 181)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    +-------------------------------------------+-----------------------------------------------+
5bd8deadSopenharmony_ci    | Syntax                                    | Description                                   |
5bd8deadSopenharmony_ci    +-------------------------------------------+-----------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type dFdx (genF16Type p)            | Returns either dFdxFine(p) or dFdxCoarse(p),  |
5bd8deadSopenharmony_ci    |                                           | based on implementation choice, presumably    |
5bd8deadSopenharmony_ci    |                                           | whichever is the faster, or by whichever is   |
5bd8deadSopenharmony_ci    |                                           | selected in the API through                   |
5bd8deadSopenharmony_ci    |                                           | quality-versus-speed hints.                   |
5bd8deadSopenharmony_ci    +-------------------------------------------+-----------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type dFdy (genF16Type p)            | Returns either dFdyFine(p) or dFdyCoarse(p),  |
5bd8deadSopenharmony_ci    |                                           | based on implementation choice, presumably    |
5bd8deadSopenharmony_ci    |                                           | whichever is the faster, or by whichever is   |
5bd8deadSopenharmony_ci    |                                           | selected in the API through                   |
5bd8deadSopenharmony_ci    |                                           | quality-versus-speed hints.                   |
5bd8deadSopenharmony_ci    +-------------------------------------------+-----------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type dFdxFine (genF16Type p)        | Returns the partial derivative of p with      |
5bd8deadSopenharmony_ci    |                                           | respect to the window x coordinate. Will use  |
5bd8deadSopenharmony_ci    |                                           | local differencing based on the value of p    |
5bd8deadSopenharmony_ci    |                                           | for the current fragment and its immediate    |
5bd8deadSopenharmony_ci    |                                           | neighbor(s).                                  |
5bd8deadSopenharmony_ci    +-------------------------------------------+-----------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type dFdyFine (genF16Type p)        | Returns the partial derivative of p with      |
5bd8deadSopenharmony_ci    |                                           | respect to the window y coordinate. Will use  |
5bd8deadSopenharmony_ci    |                                           | local differencing based on the value of p    |
5bd8deadSopenharmony_ci    |                                           | for the current fragment and its immediate    |
5bd8deadSopenharmony_ci    |                                           | neighbor(s).                                  |
5bd8deadSopenharmony_ci    +-------------------------------------------+-----------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type dFdxCoarse (genF16Type p)      | Returns the partial derivative of p with      |
5bd8deadSopenharmony_ci    |                                           | respect to the window x coordinate. Will use  |
5bd8deadSopenharmony_ci    |                                           | local differencing based on the value of p    |
5bd8deadSopenharmony_ci    |                                           | for the current fragment's neighbors, and     |
5bd8deadSopenharmony_ci    |                                           | will possibly, but not necessarily, include   |
5bd8deadSopenharmony_ci    |                                           | the value of p for the current fragment. That |
5bd8deadSopenharmony_ci    |                                           | is, over a given area, the implementation can |
5bd8deadSopenharmony_ci    |                                           | compute x derivatives in fewer unique         |
5bd8deadSopenharmony_ci    |                                           | locations than would be allowed for           |
5bd8deadSopenharmony_ci    |                                           | dFdxFine(p).                                  |
5bd8deadSopenharmony_ci    +-------------------------------------------+-----------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type dFdyCoarse (genF16Type p)      | Returns the partial derivative of p with      |
5bd8deadSopenharmony_ci    |                                           | respect to the window y coordinate. Will use  |
5bd8deadSopenharmony_ci    |                                           | local differencing based on the value of p    |
5bd8deadSopenharmony_ci    |                                           | for the current fragment's neighbors, and     |
5bd8deadSopenharmony_ci    |                                           | will possibly, but not necessarily, include   |
5bd8deadSopenharmony_ci    |                                           | the value of p for the current fragment. That |
5bd8deadSopenharmony_ci    |                                           | is, over a given area, the implementation can |
5bd8deadSopenharmony_ci    |                                           | compute y derivatives in fewer unique         |
5bd8deadSopenharmony_ci    |                                           | locations than would be allowed for           |
5bd8deadSopenharmony_ci    |                                           | dFdyFine(p).                                  |
5bd8deadSopenharmony_ci    +-------------------------------------------+-----------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type fwidth (genF16Type p)          | Returns abs(dFdx(p)) + abs(dFdy(p)).          |
5bd8deadSopenharmony_ci    +-------------------------------------------+-----------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type fwidthFine (genF16Type p)      | Returns abs(dFdxFine(p)) + abs(dFdyFine(p)).  |
5bd8deadSopenharmony_ci    +-------------------------------------------+-----------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type fwidthCoarse (genF16Type p)    | Returns abs(dFdxCoarse(p)) +                  |
5bd8deadSopenharmony_ci    |                                           |         abs(dFdyCoarse(p)).                   |
5bd8deadSopenharmony_ci    +-------------------------------------------+-----------------------------------------------+
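
    A minimal, non-normative sketch of the derivative overloads follows. The
    input v_value and the output o_color are hypothetical names chosen for
    the example. It spells out the fwidthFine identity from the table next
    to a call to the coarse variant.

    ```glsl
    #version 450 core
    #extension GL_AMD_gpu_shader_half_float : require

    layout(location = 0) in  float16_t v_value;  // hypothetical input
    layout(location = 0) out vec4      o_color;  // hypothetical output

    void main()
    {
        // Fine derivatives use the current fragment and its immediate
        // neighbor(s).
        float16_t gx = dFdxFine(v_value);
        float16_t gy = dFdyFine(v_value);

        // Per the table, fwidthFine(p) is abs(dFdxFine(p)) + abs(dFdyFine(p)).
        float16_t widthFine = abs(gx) + abs(gy);

        // The coarse variant may evaluate derivatives at fewer unique
        // locations, so it can differ from widthFine.
        float16_t widthCoarse = fwidthCoarse(v_value);

        o_color = vec4(float(gx), float(gy),
                       float(widthFine), float(widthCoarse));
    }
    ```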
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Modify Section 8.13.2 Interpolation Functions
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (add to table of interpolation functions on p. 180)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    +-------------------------------------------+-----------------------------------------------+
5bd8deadSopenharmony_ci    | Syntax                                    | Description                                   |
5bd8deadSopenharmony_ci    +-------------------------------------------+-----------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type interpolateAtCentroid (        | Returns the value of the input interpolant    |
5bd8deadSopenharmony_ci    |            genF16Type interpolant)        | sampled at a location inside both the pixel   |
5bd8deadSopenharmony_ci    |                                           | and the primitive being processed. The value  |
5bd8deadSopenharmony_ci    |                                           | obtained would be the same value assigned to  |
5bd8deadSopenharmony_ci    |                                           | the input variable if declared with the       |
5bd8deadSopenharmony_ci    |                                           | centroid qualifier.                           |
5bd8deadSopenharmony_ci    +-------------------------------------------+-----------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type interpolateAtSample (          | Returns the value of the input interpolant    |
5bd8deadSopenharmony_ci    |            genF16Type interpolant,        | variable at the location of sample number     |
5bd8deadSopenharmony_ci    |            int        sample)             | sample. If multisample buffers are not        |
5bd8deadSopenharmony_ci    |                                           | available, the input variable will be         |
5bd8deadSopenharmony_ci    |                                           | evaluated at the center of the pixel. If      |
5bd8deadSopenharmony_ci    |                                           | sample sample does not exist, the position    |
5bd8deadSopenharmony_ci    |                                           | used to interpolate the input variable is     |
5bd8deadSopenharmony_ci    |                                           | undefined.                                    |
5bd8deadSopenharmony_ci    +-------------------------------------------+-----------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type interpolateAtOffset (          | Returns the value of the input interpolant    |
5bd8deadSopenharmony_ci    |            genF16Type interpolant,        | variable sampled at an offset from the center |
5bd8deadSopenharmony_ci    |            f16vec2    offset)             | of the pixel specified by offset. The two     |
5bd8deadSopenharmony_ci    |                                           | floating-point components of offset give the  |
5bd8deadSopenharmony_ci    |                                           | offset in pixels in the x and y directions,   |
5bd8deadSopenharmony_ci    |                                           | respectively. An offset of (0, 0) identifies  |
5bd8deadSopenharmony_ci    |                                           | the center of the pixel. The range and        |
5bd8deadSopenharmony_ci    |                                           | granularity of offsets supported by this      |
5bd8deadSopenharmony_ci    |                                           | function is implementation-dependent.         |
5bd8deadSopenharmony_ci    +-------------------------------------------+-----------------------------------------------+
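
    The three interpolation overloads could be exercised as in the following
    non-normative sketch. The names v_color and o_color, the sample index 0,
    and the quarter-pixel offset are assumptions chosen for illustration.

    ```glsl
    #version 450 core
    #extension GL_AMD_gpu_shader_half_float : require

    layout(location = 0) in  f16vec4 v_color;   // hypothetical input
    layout(location = 0) out vec4    o_color;   // hypothetical output

    void main()
    {
        // Same value the input would have if declared with "centroid".
        f16vec4 atCentroid = interpolateAtCentroid(v_color);

        // Evaluated at sample 0, or at the pixel center when multisample
        // buffers are not available.
        f16vec4 atSample = interpolateAtSample(v_color, 0);

        // The offset is an f16vec2 measured in pixels from the pixel center.
        f16vec4 atOffset = interpolateAtOffset(v_color,
                                               f16vec2(0.25hf, -0.25hf));

        o_color = vec4(atCentroid + atSample + atOffset) / 3.0;
    }
    ```

    Note that the offset argument is itself half precision, so the usable
    range and granularity remain implementation-dependent as stated above.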
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Modify Section 9, Shading Language Grammar for Core Profile
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (add to the list of tokens on p. 187)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      ...
5bd8deadSopenharmony_ci      FLOAT16  F16VEC2  F16VEC3  F16VEC4
5bd8deadSopenharmony_ci      F16MAT2 F16MAT3 F16MAT4
5bd8deadSopenharmony_ci      F16MAT2X2 F16MAT2X3 F16MAT2X4
5bd8deadSopenharmony_ci      F16MAT3X2 F16MAT3X3 F16MAT3X4
5bd8deadSopenharmony_ci      F16MAT4X2 F16MAT4X3 F16MAT4X4
5bd8deadSopenharmony_ci      ...
5bd8deadSopenharmony_ci      FLOAT16CONSTANT
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (add to the rule of "primary_expression" on p. 188)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      primary_expression:
5bd8deadSopenharmony_ci        ...
5bd8deadSopenharmony_ci        FLOAT16CONSTANT
5bd8deadSopenharmony_ci        ...
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (add to the rule of "type_specifier_nonarray" on p. 195)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      type_specifier_nonarray:
5bd8deadSopenharmony_ci        ...
5bd8deadSopenharmony_ci          FLOAT16
5bd8deadSopenharmony_ci          F16VEC2
5bd8deadSopenharmony_ci          F16VEC3
5bd8deadSopenharmony_ci          F16VEC4
5bd8deadSopenharmony_ci          F16MAT2
5bd8deadSopenharmony_ci          F16MAT3
5bd8deadSopenharmony_ci          F16MAT4
5bd8deadSopenharmony_ci          F16MAT2X2
5bd8deadSopenharmony_ci          F16MAT2X3
5bd8deadSopenharmony_ci          F16MAT2X4
5bd8deadSopenharmony_ci          F16MAT3X2
5bd8deadSopenharmony_ci          F16MAT3X3
5bd8deadSopenharmony_ci          F16MAT3X4
5bd8deadSopenharmony_ci          F16MAT4X2
5bd8deadSopenharmony_ci          F16MAT4X3
5bd8deadSopenharmony_ci          F16MAT4X4
5bd8deadSopenharmony_ci        ...
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciDependencies on ARB_gpu_shader_int64
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If the shader enables ARB_gpu_shader_int64, this extension allows
5bd8deadSopenharmony_ci    additional explicit conversions between half-precision floating-point
5bd8deadSopenharmony_ci    types and 64-bit integer types.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Modify Section 5.4.1, Conversion and Scalar Constructors
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (add after the first list of constructor examples on p. 95)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      int64_t(float16_t)    // convert a float16_t value to a signed 64-bit integer
5bd8deadSopenharmony_ci      uint64_t(float16_t)   // convert a float16_t value to an unsigned 64-bit integer
5bd8deadSopenharmony_ci      float16_t(int64_t)    // convert a signed 64-bit integer to a float16_t value
5bd8deadSopenharmony_ci      float16_t(uint64_t)   // convert an unsigned 64-bit integer to a float16_t value
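
    The four constructors might be used as in the compute shader sketch
    below, which is non-normative. The block name Result, its members, and
    the value 12.75 are hypothetical and exist only for this example.

    ```glsl
    #version 450 core
    #extension GL_AMD_gpu_shader_half_float : require
    #extension GL_ARB_gpu_shader_int64 : require

    layout(local_size_x = 1) in;

    layout(std430, binding = 0) buffer Result {   // hypothetical block
        int64_t   asSigned;
        uint64_t  asUnsigned;
        float16_t fromSigned;
        float16_t fromUnsigned;
    };

    void main()
    {
        float16_t h = 12.75hf;

        // Float-to-integer constructors drop the fractional part, as for
        // the other floating-point types in GLSL.
        asSigned   = int64_t(h);
        asUnsigned = uint64_t(h);

        // Integer-to-half constructors; large 64-bit values lie outside
        // the finite range of a half-precision float.
        fromSigned   = float16_t(asSigned);
        fromUnsigned = float16_t(asUnsigned);
    }
    ```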
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciDependencies on AMD_shader_trinary_minmax
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If the shader enables AMD_shader_trinary_minmax, this extension adds
5bd8deadSopenharmony_ci    additional common functions.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Modify Section 8.3, Common Functions
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (add to the table of common functions on p. 144)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    +-------------------------------------------+-----------------------------------------------+
5bd8deadSopenharmony_ci    | Syntax                                    | Description                                   |
5bd8deadSopenharmony_ci    +-------------------------------------------+-----------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type min3(genF16Type x,             | Returns the per-component minimum value of x, |
5bd8deadSopenharmony_ci    |                 genF16Type y,             | y, and z.                                     |
5bd8deadSopenharmony_ci    |                 genF16Type z)             |                                               |
5bd8deadSopenharmony_ci    +-------------------------------------------+-----------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type max3(genF16Type x,             | Returns the per-component maximum value of x, |
5bd8deadSopenharmony_ci    |                 genF16Type y,             | y, and z.                                     |
5bd8deadSopenharmony_ci    |                 genF16Type z)             |                                               |
5bd8deadSopenharmony_ci    +-------------------------------------------+-----------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type mid3(genF16Type x,             | Returns the per-component median value of x,  |
5bd8deadSopenharmony_ci    |                 genF16Type y,             | y, and z.                                     |
5bd8deadSopenharmony_ci    |                 genF16Type z)             |                                               |
5bd8deadSopenharmony_ci    +-------------------------------------------+-----------------------------------------------+
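
    As a non-normative sketch, the shader below applies the three functions
    to half-precision vectors. The inputs v_a, v_b and v_c and the output
    o_color are hypothetical. The comment on mid3 gives an equivalent
    formulation for operands that are not NaN; it is offered as an aid to
    understanding, not as the definition used by the extension.

    ```glsl
    #version 450 core
    #extension GL_AMD_gpu_shader_half_float : require
    #extension GL_AMD_shader_trinary_minmax : require

    layout(location = 0) in  f16vec3 v_a;       // hypothetical inputs
    layout(location = 1) in  f16vec3 v_b;
    layout(location = 2) in  f16vec3 v_c;
    layout(location = 0) out vec4    o_color;   // hypothetical output

    void main()
    {
        f16vec3 lo = min3(v_a, v_b, v_c);   // per-component minimum
        f16vec3 hi = max3(v_a, v_b, v_c);   // per-component maximum

        // Per-component median. For non-NaN operands this matches
        // max(min(x, y), min(max(x, y), z)).
        f16vec3 md = mid3(v_a, v_b, v_c);

        // Show the median in RGB and the min-to-max spread of the first
        // component in alpha.
        o_color = vec4(vec3(md), float(hi.x - lo.x));
    }
    ```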
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciDependencies on AMD_shader_explicit_vertex_parameter
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If the shader enables AMD_shader_explicit_vertex_parameter, this extension
5bd8deadSopenharmony_ci    adds additional interpolation functions.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Modify Section 8.13.2 Interpolation Functions
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (add to table of interpolation functions on p. 180)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    +-------------------------------------------+-----------------------------------------------+
5bd8deadSopenharmony_ci    | Syntax                                    | Description                                   |
5bd8deadSopenharmony_ci    +-------------------------------------------+-----------------------------------------------+
5bd8deadSopenharmony_ci    | genF16Type interpolateAtVertexAMD (       | Returns the value of the input <interpolant>  |
5bd8deadSopenharmony_ci    |            genF16Type interpolant,        | without any interpolation, i.e. the raw       |
5bd8deadSopenharmony_ci    |            uint       vertexIdx)          | output value of the previous shader stage.    |
5bd8deadSopenharmony_ci    |                                           | <vertexIdx> selects for which vertex of the   |
5bd8deadSopenharmony_ci    |                                           | primitive the value of <interpolant> is       |
5bd8deadSopenharmony_ci    |                                           | returned.                                     |
5bd8deadSopenharmony_ci    |                                           |                                               |
5bd8deadSopenharmony_ci    |                                           | This return value is equivalent to            |
5bd8deadSopenharmony_ci    |                                           | interpolating the input <interpolant> using   |
5bd8deadSopenharmony_ci    |                                           | the following set of barycentric coordinates, |
5bd8deadSopenharmony_ci    |                                           | depending on the value of <vertexIdx>:        |
5bd8deadSopenharmony_ci    |                                           |                                               |
5bd8deadSopenharmony_ci    |                                           |  vertexIdx    Barycentric coordinates         |
5bd8deadSopenharmony_ci    |                                           |  0            I=0, J=0, K=1                   |
5bd8deadSopenharmony_ci    |                                           |  1            I=1, J=0, K=0                   |
5bd8deadSopenharmony_ci    |                                           |  2            I=0, J=1, K=0                   |
5bd8deadSopenharmony_ci    |                                           |                                               |
5bd8deadSopenharmony_ci    |                                           | However, this order has no association with   |
5bd8deadSopenharmony_ci    |                                           | the vertex order specified by the application |
5bd8deadSopenharmony_ci    |                                           | in the originating draw.                      |
5bd8deadSopenharmony_ci    |                                           |                                               |
5bd8deadSopenharmony_ci    |                                           | The value of <vertexIdx> must be a constant   |
5bd8deadSopenharmony_ci    |                                           | integer expression with a value in the range  |
5bd8deadSopenharmony_ci    |                                           | [0, 2].                                       |
5bd8deadSopenharmony_ci    +-------------------------------------------+-----------------------------------------------+
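
    The following non-normative sketch reads the raw per-vertex values of a
    half-precision input and averages them. The names v_color and o_color are
    hypothetical. The __explicitInterpAMD qualifier and the extension name
    string come from AMD_shader_explicit_vertex_parameter as understood here,
    not from this section, and should be checked against that specification.

    ```glsl
    #version 450 core
    #extension GL_AMD_gpu_shader_half_float : require
    #extension GL_AMD_shader_explicit_vertex_parameter : require

    // Assumption: inputs read with interpolateAtVertexAMD are declared
    // with the __explicitInterpAMD qualifier.
    layout(location = 0) __explicitInterpAMD in f16vec4 v_color;
    layout(location = 0) out vec4 o_color;      // hypothetical output

    void main()
    {
        // vertexIdx must be a constant integer expression in [0, 2].
        f16vec4 c0 = interpolateAtVertexAMD(v_color, 0u);   // I=0, J=0, K=1
        f16vec4 c1 = interpolateAtVertexAMD(v_color, 1u);   // I=1, J=0, K=0
        f16vec4 c2 = interpolateAtVertexAMD(v_color, 2u);   // I=0, J=1, K=0

        // Equal weights give one color for the whole primitive.
        o_color = vec4((c0 + c1 + c2) / 3.0hf);
    }
    ```

    Because the index order has no association with the application's vertex
    order, a symmetric combination such as this average avoids depending on
    which vertex maps to which index.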
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciErrors
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    None.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciNew State
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    None.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciNew Implementation Dependent State
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    None.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciIssues
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (1) How is the functionality in this extension different from the
5bd8deadSopenharmony_ci        half-precision floating-point types introduced by NV_gpu_shader5?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      RESOLVED: This extension is designed to be source code compatible with
5bd8deadSopenharmony_ci      the half-precision floating-point support in NV_gpu_shader5. However, it
5bd8deadSopenharmony_ci      is a functional superset of that, as it adds the following additional
5bd8deadSopenharmony_ci      features:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        * support for implicit conversions from int, uint and float to float16_t.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        * support for overloaded versions of built-in functions, such as abs,
5bd8deadSopenharmony_ci          sign, min, max, and clamp, that accept float16_t or other
5bd8deadSopenharmony_ci          half-precision floating-point types as parameters.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (2) What should be done to distinguish half-precision floating-point constants?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      RESOLVED: We will use "HF" and "hf" to identify half-precision
5bd8deadSopenharmony_ci      floating-point constants.
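
      A short, non-normative sketch of the suffix in use is given below. The
      output name o_color and the literal values are assumptions made for
      the example.

      ```glsl
      #version 450 core
      #extension GL_AMD_gpu_shader_half_float : require

      layout(location = 0) out vec4 o_color;   // hypothetical output

      void main()
      {
          float16_t a = 1.5hf;            // lower-case suffix
          float16_t b = 0.25HF;           // upper-case suffix
          f16vec2   v = f16vec2(a, b);    // built from half-precision scalars

          // Without a suffix, 1.5 is an ordinary float literal; an explicit
          // constructor is used here to convert it.
          float16_t c = float16_t(1.5);

          o_color = vec4(float(a), float(b), float(c), float(v.x + v.y));
      }
      ```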
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (3) Should we introduce a new uniform API to set up float16_t uniforms in
5bd8deadSopenharmony_ci        the default uniform block?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      RESOLVED: No. CPUs generally do not support the float16_t format
5bd8deadSopenharmony_ci      directly, so most data on the CPU side is stored as single- or
5bd8deadSopenharmony_ci      double-precision floating-point. This extension instead extends the
5bd8deadSopenharmony_ci      Uniform*f{v} commands to support uniforms of float16_t types.
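
      On the shader side, such uniforms are declared like any others, as in
      this non-normative sketch. The names u_gain, u_tint and o_color are
      hypothetical, and the commands named in the comment are only examples
      of the Uniform*f{v} family.

      ```glsl
      #version 450 core
      #extension GL_AMD_gpu_shader_half_float : require

      // Half-precision uniforms in the default uniform block.
      uniform float16_t u_gain;   // hypothetical
      uniform f16vec4   u_tint;   // hypothetical

      layout(location = 0) out vec4 o_color;   // hypothetical output

      void main()
      {
          // The application would supply single-precision values through
          // the existing commands, for example Uniform1f for u_gain and
          // Uniform4f or Uniform4fv for u_tint.
          o_color = vec4(u_tint * u_gain);
      }
      ```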
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (4) Should we support float16_t types as members of uniform blocks,
5bd8deadSopenharmony_ci        shader storage buffer blocks, or as transform feedback varyings?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      RESOLVED: Yes, support all of them. Each component of a float16_t type
5bd8deadSopenharmony_ci      consumes two basic machine units. Some examples:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci          struct S {
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci              float16_t  x;     // rule 1:  align = 2, takes offsets 0-1
5bd8deadSopenharmony_ci              f16vec2    y;     // rule 2:  align = 4, takes offsets 4-7
5bd8deadSopenharmony_ci              f16vec3    z;     // rule 3:  align = 8, takes offsets 8-13
5bd8deadSopenharmony_ci          };
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci          layout(column_major, std140) uniform B1 {
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci              float16_t  a;     // rule 1:  align = 2, takes offsets 0-1
5bd8deadSopenharmony_ci              f16vec2    b;     // rule 2:  align = 4, takes offsets 4-7
5bd8deadSopenharmony_ci              f16vec3    c;     // rule 3:  align = 8, takes offsets 8-13
5bd8deadSopenharmony_ci              float16_t  d[2];  // rule 4:  align = 16, array stride = 16,
5bd8deadSopenharmony_ci                                //          takes offsets 16-47
5bd8deadSopenharmony_ci              f16mat2x3  e;     // rule 5:  align = 16, matrix stride = 16,
5bd8deadSopenharmony_ci                                //          takes offsets 48-79
5bd8deadSopenharmony_ci              f16mat2x3  f[2];  // rule 6:  align = 16, matrix stride = 16,
5bd8deadSopenharmony_ci                                //          array stride = 32, f[0] takes
5bd8deadSopenharmony_ci                                //          offsets 80-111, f[1] takes offsets
5bd8deadSopenharmony_ci                                //          112-143
5bd8deadSopenharmony_ci              S          g;     // rule 9:  align = 16, g.x takes offsets
5bd8deadSopenharmony_ci                                //          144-145, g.y takes offsets 148-151,
5bd8deadSopenharmony_ci                                //          g.z takes offsets 152-157
5bd8deadSopenharmony_ci              S          h[2];  // rule 10: align = 16, array stride = 16, h[0]
5bd8deadSopenharmony_ci                                //          takes offsets 160-175, h[1] takes
5bd8deadSopenharmony_ci                                //          offsets 176-191
5bd8deadSopenharmony_ci          };
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci          layout(row_major, std430) buffer B2 {
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci              float16_t  o;     // rule 1:  align = 2, takes offsets 0-1
5bd8deadSopenharmony_ci              f16vec2    p;     // rule 2:  align = 4, takes offsets 4-7
5bd8deadSopenharmony_ci              f16vec3    q;     // rule 3:  align = 8, takes offsets 8-13
5bd8deadSopenharmony_ci              float16_t  r[2];  // rule 4:  align = 2, array stride = 2, takes
5bd8deadSopenharmony_ci                                //          offsets 14-17
5bd8deadSopenharmony_ci              f16mat2x3  s;     // rule 7:  align = 4, matrix stride = 4, takes
5bd8deadSopenharmony_ci                                //          offsets 20-31
5bd8deadSopenharmony_ci              f16mat2x3  t[2];  // rule 8:  align = 4, matrix stride = 4, array
5bd8deadSopenharmony_ci                                //          stride = 12, t[0] takes offsets
5bd8deadSopenharmony_ci                                //          32-43, t[1] takes offsets 44-55
5bd8deadSopenharmony_ci              S          u;     // rule 9:  align = 8, u.x takes offsets
5bd8deadSopenharmony_ci                                //          56-57, u.y takes offsets 60-63, u.z
5bd8deadSopenharmony_ci                                //          takes offsets 64-69
5bd8deadSopenharmony_ci              S          v[2];  // rule 10: align = 8, array stride = 16, v[0]
5bd8deadSopenharmony_ci                                //          takes offsets 72-87, v[1] takes
5bd8deadSopenharmony_ci                                //          offsets 88-103
5bd8deadSopenharmony_ci          };
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (5) In the OpenGL ES Shading Language, the format of floating-point values
5bd8deadSopenharmony_ci        in UBOs and SSBOs is always single-precision regardless of the
5bd8deadSopenharmony_ci        precision qualifier in the shader. Which format should be used for
5bd8deadSopenharmony_ci        this extension?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      RESOLVED: The format should match the type declared in the shader, i.e.
5bd8deadSopenharmony_ci      if the block member's type is float16_t, the format in the buffer is
5bd8deadSopenharmony_ci      half-precision floating-point, and if the block member's type is float,
5bd8deadSopenharmony_ci      the format is single-precision floating-point. We will provide another
5bd8deadSopenharmony_ci      extension to remain compatible with the ES driver's behavior.
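
      A non-normative sketch of a block that mixes both formats follows. The
      block name Mixed and its members are hypothetical. The offsets in the
      comments follow the std430 alignment rules illustrated under issue (4).

      ```glsl
      #version 450 core
      #extension GL_AMD_gpu_shader_half_float : require

      layout(local_size_x = 1) in;

      layout(std430, binding = 0) buffer Mixed {   // hypothetical block
          float16_t h;    // half precision in the buffer, offsets 0-1
          float     f;    // single precision in the buffer, offsets 4-7
          f16vec2   hv;   // two half-precision values, offsets 8-11
      };

      void main()
      {
          // Each member is stored in the format of its declared type, so
          // the explicit conversions below change the stored width.
          f  = float(h);
          hv = f16vec2(float16_t(f), h);
      }
      ```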
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciRevision History
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Rev.    Date    Author    Changes
5bd8deadSopenharmony_ci    ----  --------  --------  -----------------------------------------
5bd8deadSopenharmony_ci     5    09/21/16  dwitczak  Fixed minor character encoding issues.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     4    08/01/16  rexu      Correct the example of offset calculation for
5bd8deadSopenharmony_ci                              block members. Add limitation of xfb_offset when
5bd8deadSopenharmony_ci                              this qualifier is applied to block members that
5bd8deadSopenharmony_ci                              have float16_t types.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     3    07/11/16  rexu      Clarify that each component of float16_t types
5bd8deadSopenharmony_ci                              consumes two basic machine units. Remove the
5bd8deadSopenharmony_ci                              interaction with NV_gpu_shader5 in that implicit
5bd8deadSopenharmony_ci                              conversions from int, uint and float types to
5bd8deadSopenharmony_ci                              float16_t types are disallowed now. Add new
5bd8deadSopenharmony_ci                              derivative functions: dFdxFine, dFdyFine,
5bd8deadSopenharmony_ci                              dFdxCoarse, dFdyCoarse, fwidthFine, fwidthCoarse.
5bd8deadSopenharmony_ci                              Add the interaction with AMD_shader_trinary_minmax
5bd8deadSopenharmony_ci                              and AMD_shader_explicit_vertex_parameter. Remove
5bd8deadSopenharmony_ci                              two listed issues that are no longer valid for
5bd8deadSopenharmony_ci                              the updated version of this extension. Remove
5bd8deadSopenharmony_ci                              floatBitsToInt and decide to add it when
5bd8deadSopenharmony_ci                              16-bit integer data type is supported.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     2    07/06/16  rexu      Remove sections that involve half-precision
5bd8deadSopenharmony_ci                              floating-point opaque types. Modify allowed rules
5bd8deadSopenharmony_ci                              of implicit conversion relevant to float16_t
5bd8deadSopenharmony_ci                              types. Add the interaction with
5bd8deadSopenharmony_ci                              ARB_gpu_shader_int64. Remove the modification of
5bd8deadSopenharmony_ci                              the first rule of std140 layout. Provide some
5bd8deadSopenharmony_ci                              examples to demonstrate memory storage layout of
5bd8deadSopenharmony_ci                              uniform blocks and shader storage blocks when
5bd8deadSopenharmony_ci                              they have members of float16_t types.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     1    11/14/13  qlin      Initial revision.