Name

    NV_fragment_program

Name Strings

    GL_NV_fragment_program

Contact

    Pat Brown, NVIDIA Corporation (pbrown 'at' nvidia.com)
    Mark J. Kilgard, NVIDIA Corporation (mjk 'at' nvidia.com)

Notice

    Copyright NVIDIA Corporation, 2001-2002.

IP Status

    NVIDIA Proprietary.

Status

    Implemented in CineFX (NV30) Emulation driver, August 2002.
    Shipping in Release 40 NVIDIA driver for CineFX hardware, January 2003.

Version

    Last Modified Date:  2005/05/24
    NVIDIA Revision:     73

Number

    282

Dependencies

    Written based on the wording of the OpenGL 1.2.1 specification and
    requires OpenGL 1.2.1.

    Requires support for the ARB_multitexture extension with at least
    two texture units.

    NV_vertex_program affects the definition of this extension.  The only
    dependency is that both extensions use the same mechanisms for defining
    and binding programs.

    NV_texture_shader trivially affects the definition of this extension.

    NV_texture_rectangle trivially affects the definition of this extension.

    ARB_texture_cube_map trivially affects the definition of this extension.

    EXT_fog_coord trivially affects the definition of this extension.

    NV_depth_clamp affects the definition of this extension.

    ARB_depth_texture and SGIX_depth_texture affect the definition of this
    extension.

    NV_float_buffer affects the definition of this extension.

    ARB_vertex_program affects the definition of this extension.

    ARB_fragment_program affects the definition of this extension.

Overview

    OpenGL mandates a certain set of configurable per-fragment computations
    defining texture lookup, texture environment, color sum, and fog
    operations.  Each of these areas provides a useful but limited set of
    fixed operations.  For example, unextended OpenGL 1.2.1 provides only four
    texture environment modes, color sum, and three fog modes.  Many OpenGL
    extensions have either improved existing functionality or introduced new
    configurable fragment operations.  While these extensions have enabled new
    and interesting rendering effects, the set of effects is limited by the
    set of special modes introduced by the extension.  This lack of
    flexibility is in contrast to the high level of programmability of
    general-purpose CPUs and other (frequently software-based) shading
    languages.  The purpose of this extension is to expose to the OpenGL
    application writer an unprecedented degree of programmability in the
    computation of final fragment colors and depth values.

    This extension provides a mechanism for defining fragment program
    instruction sequences for application-defined fragment programs.  When in
    fragment program mode, a program is executed each time a fragment is
    produced by rasterization.  The inputs for the program are the attributes
    (position, colors, texture coordinates) associated with the fragment and a
    set of constant registers.  A fragment program can perform mathematical
    computations and texture lookups using arbitrary texture coordinates.  The
    results of a fragment program are new color and depth values for the
    fragment.

    This extension defines a programming model including a 4-component vector
    instruction set, 16- and 32-bit floating-point data types, and a
    relatively large set of temporary registers.  The programming model also
    includes a condition code vector which can be used to mask register writes
    at run-time or kill fragments altogether.  The syntax, program
    instructions, and general semantics are similar to those in the
    NV_vertex_program and NV_vertex_program2 extensions, which provide for the
    execution of an arbitrary program each time the GL receives a vertex.

    The fragment program execution environment is designed for efficient
    hardware implementation and to support a wide variety of programs.  By
    design, the entire set of fragment operations defined by existing
    OpenGL per-fragment computation extensions can be implemented using the
    extension's programming model.

    The fragment program execution environment accesses textures via
    arbitrarily computed texture coordinates.  As such, there is no necessary
    correspondence between the texture coordinates and texture maps previously
    lumped into a single "texture unit".  This extension separates the notion
    of "texture coordinate sets" and "texture image units" (texture maps and
    associated parameters), allowing implementations with a different number
    of each.  The initial implementation of this extension will support 8
    texture coordinate sets and 16 texture image units.

Issues

    What limitations exist in this extension?

        RESOLVED:  Very few.  Programs cannot exceed a maximum program length
        (which is no less than 1024 instructions), and can use no more than
        32 to 64 temporary registers, depending on register precision.
        Programs cannot access more than one fragment attribute or program
        parameter (constant) per instruction, but can work around this
        restriction using temporaries.  The number of textures that can be
        used by a program is limited to the number of texture image units
        provided by the implementation (16 in the initial implementation of
        this extension).

        These limits are fairly high.  Additionally, there is no limit on the
        total number of texture lookups that can be performed by a program.
        There is no limit on the length of a texture dependency chain -- one
        can write a program that performs over 1000 consecutive dependent
        texture lookups.  There are no restrictions on dependencies between
        texture mapping instructions and arithmetic instructions.  Texture
        lookups can be performed using arbitrarily computed texture
        coordinates.  Applications can carry out their calculations with full
        32-bit single precision, although two lower-precision modes are also
        available.

    How does texture mapping work with fragment programs?

        RESOLVED:  This extension provides three instructions used to perform
        texture lookups.

        The "TEX" instruction performs a lookup with the (s,t,r) values taken
        from an interpolated texture coordinate, an arbitrarily computed
        vector, or even a program constant.  The "TXP" instruction performs a
        similar lookup, except that it uses the fourth component of the source
        vector to perform a perspective divide, using (s/q, t/q, r/q).  In
        both cases, the GL will automatically compute partial derivatives used
        for filter and LOD selection.

        The "TXD" instruction operates like "TEX", except that it allows the
        program to explicitly specify two additional vectors containing the
        partial derivatives of the texture coordinate with respect to x and y
        window coordinates.

        All three instructions write a filtered texel value to a temporary or
        output register.  Other than the computation of texture coordinates
        and partial derivatives, texture lookups are not performed any
        differently in fragment program mode.  In particular, any applicable
        LOD biases, wrap modes, minification and magnification filters, and
        anisotropic filtering controls are still applied in fragment program
        mode.

        The results of the texture lookup are available to be used arbitrarily
        by subsequent fragment program instructions.  Fragment programs are
        allowed to access any texture map arbitrarily many times.
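
        As an illustration of the three lookups, the C sketch below holds a
        small program string and models the TXP projection on the CPU.  The
        program text is written from the description above and has not been
        validated against a driver, so its exact syntax is an assumption.
        The names example_program, texcoord3, and txp_project are
        hypothetical.

```c
/* Hypothetical program using each lookup instruction once.  f[TEXn] names
 * a texture coordinate set and TEXn a texture image unit; the TXD line
 * deliberately pairs coordinate set 0 with image unit 2 to show that the
 * two are independent. */
const char example_program[] =
    "!!FP1.0\n"
    "TEX R0, f[TEX0], TEX0, 2D;\n"          /* coordinates used as given    */
    "TXP R1, f[TEX1], TEX1, 2D;\n"          /* divides s, t, r by q first   */
    "DDX R2, f[TEX0];\n"                    /* partials w.r.t. window x     */
    "DDY R3, f[TEX0];\n"                    /* partials w.r.t. window y     */
    "TXD R4, f[TEX0], R2, R3, TEX2, 2D;\n"  /* explicit partial derivatives */
    "MUL R0, R0, R1;\n"
    "MUL o[COLR], R0, R4;\n"
    "END\n";

typedef struct { float s, t, r; } texcoord3;

/* CPU model of the TXP projection: (s/q, t/q, r/q). */
texcoord3 txp_project(float s, float t, float r, float q)
{
    texcoord3 out = { s / q, t / q, r / q };
    return out;
}
```

        For example, txp_project(2, 4, 6, 2) yields the coordinates (1, 2, 3)
        that a TXP lookup would use in place of the raw (s,t,r).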

    Can fragment programs be used to compute depth values?

        RESOLVED:  Yes.  A fragment program can perform arbitrary
        computations to compute a final depth value for the fragment, which
        it should write to the "z" component of the o[DEPR] register.  The
        "z" value written should be in the range [0,1], regardless of the
        size of the depth buffer.

        To assist in the computation of the final Z value, a fragment program
        can access the interpolated depth of the fragment (prior to any
        displacement) by reading the "z" component of the f[WPOS] attribute
        register.
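
        To show why a [0,1] depth is independent of the depth buffer size,
        the following C sketch maps such a value onto an n-bit unsigned depth
        code.  The clamping and round-to-nearest steps are assumptions of the
        sketch, not behavior quoted from this extension, and depth_to_fixed
        is a hypothetical helper.

```c
#include <stdint.h>

/* Map a program-computed depth in [0,1] to an n-bit unsigned depth code
 * (1 <= bits <= 32).  Out-of-range input is clamped (an assumption). */
uint32_t depth_to_fixed(double z, unsigned bits)
{
    double max_code = (double)((UINT64_C(1) << bits) - 1);
    if (z < 0.0) z = 0.0;
    if (z > 1.0) z = 1.0;
    return (uint32_t)(z * max_code + 0.5);   /* round to nearest code */
}
```

        The same depth of 1.0 becomes 65535 in a 16-bit buffer and 16777215
        in a 24-bit buffer.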

    How should near and far plane clipping work in fragment program mode if
    the current fragment program computes a depth value?

        RESOLVED:  Geometric clipping to the near and far clip plane should be
        disabled.  Clipping should be done based on the depth values computed
        per-fragment.  The rationale is that per-fragment depth displacement
        operations may effectively move portions of a primitive initially
        outside the clip volume inside, and vice versa.

        Note that under the NV_depth_clamp extension, geometric clipping to
        the near and far clip planes is also disabled, and the fragment depth
        values are clamped to the depth range.  If depth clamp mode is enabled
        when using a fragment program that computes a depth value, the
        computed depth value will be clamped to the depth range.

    Should fragment programs be allowed to use multiple precisions for
    operands and operations?

        RESOLVED:  Yes.  Low-precision operands are generally adequate for
        representing colors.  Allowing low-precision registers also allows for
        a larger number of temporary registers (at lower precision).
        Low-precision operations also provide the opportunity for a higher
        level of performance.

        Applications are free to use only high-precision operations or mix
        high- and low-precision operations as necessary.

    What levels of precision are supported in arithmetic operations?

        RESOLVED:  Arithmetic operations can be performed at three different
        precisions.  32-bit floating point precision (fp32) uses the IEEE
        single-precision standard with a sign bit, 8 exponent bits, and 23
        mantissa bits.  16-bit floating-point precision (fp16) uses a similar
        floating-point representation, but with 5 exponent bits and 10
        mantissa bits.  Additionally, many arithmetic operations can also be
        carried out at 12-bit fixed point precision (fx12), where values in
        the range [-2,+2) are represented as signed values with 10 fraction
        bits.
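
        The fx12 format can be made concrete with a small decoder.  This is a
        minimal sketch that assumes a two's-complement encoding, which the
        text above does not state; fx12_decode is a hypothetical name.

```c
/* Decode the low 12 bits of raw as an fx12 value, assuming two's
 * complement.  With 10 fraction bits the step is 1/1024 and the codes
 * -2048..2047 cover [-2, 2 - 1/1024]. */
double fx12_decode(unsigned raw)
{
    int code = (int)(raw & 0xFFFu);
    if (code >= 2048)
        code -= 4096;               /* sign-extend from 12 bits */
    return code / 1024.0;
}
```

        Under that assumption the largest code 0x7FF decodes to 2047/1024
        (just below +2) and 0x800 decodes to exactly -2, matching the
        half-open range [-2,+2).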

    How should the precision with which operations are carried out be
    specified?  Should we infer the precision from the types of the operands
    or result vectors?  Or should it be an attribute of the instruction?

        RESOLVED:  Applications can optionally specify the precision of
        individual instructions by adding a suffix of "R", "H", or "X" to
        instruction names to select fp32, fp16, or fx12 precision,
        respectively.

        By default, instructions will be carried out using the precision of
        the destination register.  Always inferring the precision from the
        operands has a number of issues.  First, there are a number of
        operations (e.g., TEX/TXP/TXD) where the result type has little to no
        correspondence to the type of the operands.  In these cases, precision
        suffixes are not supported.  Second, one could have instructions
        automatically cast operands and compute results using the type of the
        highest precision operand or result.  This behavior would be
        problematic since all fragment attribute registers and program
        parameters are kept at full precision, but full precision may not be
        needed by the operation.

        The choice of precision level allows programs to trade off precision
        for potentially higher performance.  Giving the program explicit
        control over the precision also allows it to dictate precision
        explicitly and eliminate any uncertainty over type casting.
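
        The suffix rule can be sketched as a helper that an application-side
        program generator might use.  The enum, function names, and operand
        strings below are hypothetical; only the "R", "H", and "X" suffixes
        come from the text above.

```c
#include <stdio.h>

typedef enum { PREC_DEFAULT, PREC_FP32, PREC_FP16, PREC_FX12 } precision;

/* Opcode suffix for a precision; "" leaves the choice to the precision of
 * the destination register. */
const char *precision_suffix(precision p)
{
    switch (p) {
    case PREC_FP32: return "R";
    case PREC_FP16: return "H";
    case PREC_FX12: return "X";
    default:        return "";
    }
}

/* Write "<op><suffix> <operands>;" into buf and return snprintf's result. */
int format_instruction(char *buf, size_t size, const char *op,
                       precision p, const char *operands)
{
    return snprintf(buf, size, "%s%s %s;", op, precision_suffix(p), operands);
}
```

        Calling format_instruction with "MUL", PREC_FP16, and "H0, H1, H2"
        produces the text "MULH H0, H1, H2;".  A generator built this way
        would have to skip the suffix for TEX, TXP, and TXD, where the text
        says suffixes are not supported.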

    For instructions whose specified precision is different than the precision
    of the operands or the result registers, how are the operations performed?
    How are the condition codes updated?

        RESOLVED:  Operations are performed with operands and results at the
        precision specified by the instruction.  After the operation is
        complete, the result is converted to the precision of the destination
        register, after which the condition code is generated.

        In an alternate approach, the condition code could be generated from
        the result.  However, in some cases, the register contents would not
        match the condition code.  In such cases, it may not be reliable to
        use the condition code to prevent division by zero or other special
        cases.
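
        The difference between the two orderings shows up when a small fp32
        result is written to an fp16 register.  The C sketch below models
        only that underflow case: magnitudes below 2^-25, half the smallest
        fp16 denormal of 2^-24, become zero whether the conversion rounds or
        truncates.  All names are hypothetical, and the conversion is a
        deliberately partial model, not a full fp16 implementation.

```c
typedef enum { CC_LT, CC_EQ, CC_GT } cond_code;

/* Partial fp16 conversion: handles underflow to zero only; every other
 * value is passed through unchanged. */
double to_fp16_underflow_only(double x)
{
    const double half_min_denorm = 1.0 / 33554432.0;   /* 2^-25 */
    double mag = x < 0.0 ? -x : x;
    return mag < half_min_denorm ? 0.0 : x;
}

cond_code cc_of(double v)
{
    return v < 0.0 ? CC_LT : (v > 0.0 ? CC_GT : CC_EQ);
}

/* Rule from the text: convert to the destination precision first, then
 * generate the condition code from the converted value. */
cond_code cc_after_write(double result, double *dest)
{
    *dest = to_fp16_underflow_only(result);
    return cc_of(*dest);
}
```

        For a result of 1e-9 the stored value is 0 and the condition code
        reports "equal to zero", so a later guard against dividing by the
        stored register sees a consistent picture.  Generating the code from
        the unconverted result would report "greater than zero" instead.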

    How does this extension interact with the ARB_multisample extension?  In
    the ARB_multisample extension, each fragment has multiple depth values.
    In this extension, a single interpolated depth value may be modified by a
    fragment program.

        RESOLVED:  The depth values for the extra samples are generated by
        computing partials of the computed depth value and using these
        partials to derive the depth values for each of the extra samples.
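
        One plausible reading of this resolution is a first-order
        extrapolation from the fragment center to each sample location.  The
        sketch below assumes that form; the text does not give the exact
        formula, and sample_depth is a hypothetical helper.

```c
/* Depth at a sample displaced by (dx, dy) window units from the fragment
 * center, given the program-computed depth z and its partials dzdx, dzdy.
 * First-order extrapolation is an assumption of this sketch. */
double sample_depth(double z, double dzdx, double dzdy, double dx, double dy)
{
    return z + dzdx * dx + dzdy * dy;
}
```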

    How does this extension interact with polygon offset?  Both extensions
    modify fragment depth values.

        RESOLVED:  As in the base OpenGL spec, the depth offset generated by
        polygon offset is added during polygon rasterization.  The depth value
        provided to programs in f[WPOS].z already includes polygon offset, if
        enabled.  If the depth value is replaced by a fragment program, the
        polygon offset value will NOT be recomputed and added back after
        program execution.

        This is probably not desirable for fragment programs that modify depth
        values since the partials used to generate the offset may not match
        the partials of the computed depth value.  Polygon offset for filled
        polygons can be approximated in a fragment program using the depth
        partials obtained by the DDX and DDY instructions.  This will not work
        properly for line- and point-mode polygons, since the partials used
        for offset are computed over the polygon, while the partials resulting
        from the DDX and DDY instructions are computed along the line (or are
        zero for point-mode polygons).  In addition, separate treatment of
        points, line segments, and polygons is not possible in a fragment
        program.
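
        The approximation mentioned above can be modeled on the CPU.  The
        sketch follows the core OpenGL offset form o = m * factor + r * units
        and uses the permitted approximation m = max(|dz/dx|, |dz/dy|).  The
        function name is hypothetical, and r, the implementation's minimum
        resolvable depth difference, must be supplied by the caller.

```c
/* Approximate polygon offset from depth partials such as those produced
 * by DDX and DDY.  factor and units are the PolygonOffset parameters. */
double approx_polygon_offset(double dzdx, double dzdy,
                             double factor, double units, double r)
{
    double ax = dzdx < 0.0 ? -dzdx : dzdx;
    double ay = dzdy < 0.0 ? -dzdy : dzdy;
    double m = ax > ay ? ax : ay;    /* approximate maximum depth slope */
    return m * factor + r * units;
}
```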

    Should depth component replacement be a property of the fragment program
    or a separate enable?

        RESOLVED:  It should be a program property.  Using the output register
        notation simplifies matters:  depth components are replaced if and
        only if the DEPR register is written to.  This alleviates the
        application and driver burden of maintaining separate state.

    How does this extension affect the handling of q texture coordinates in
    the OpenGL spec?

        RESOLVED:  Fragment programs are allowed to access an associated q
        texture coordinate, so this attribute must be produced by
        rasterization.  In unextended OpenGL 1.2, the q coordinate is
        eliminated in the rasterization portions of the spec after dividing
        each of s, t, and r by it.  This extension updates the specification
        to pass q coordinates through at least to conventional texture
        mapping.  When fragment program mode is disabled, q coordinates will
        be eliminated there in an identical manner.  This modification has the
        added benefit of simplifying the equations used for attribute
        interpolation.

    How should clip w coordinates be handled by this extension?

        RESOLVED:  Fragment programs are allowed to access the reciprocal of
        the clip w coordinate, so this attribute must be produced by
        rasterization.  The OpenGL 1.2 spec doesn't explicitly enumerate the
        attributes associated with the fragment, but we add treatment of the w
        clip coordinate in the appropriate locations.

        The reciprocal of the clip w coordinate in traditional graphics
        hardware is produced by screen-space linear interpolation of the
        reciprocals of the clip w coordinates of the vertices.  However, this
        spec says the clip w coordinate is produced by perspective-correct
        interpolation of the (non-reciprocated) clip w vertex coordinates.
        These two formulations turn out to be equivalent, and the latter is
        more convenient since the core OpenGL spec already contains formulas
        for perspective-correct interpolation of vertex attributes.
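
        The equivalence can be checked numerically along an edge with
        endpoint clip w values wa and wb and screen-space parameter t.  When
        the interpolated attribute is w itself, the numerator of the
        perspective-correct formula collapses to (1-t) + t = 1, leaving the
        reciprocal of the linearly interpolated 1/w.  Both function names in
        this sketch are hypothetical.

```c
/* Screen-space linear interpolation of 1/w, followed by a reciprocal. */
double w_from_linear_recip(double wa, double wb, double t)
{
    return 1.0 / ((1.0 - t) / wa + t / wb);
}

/* Perspective-correct interpolation of an attribute f along an edge:
 *   f = ((1-t)*fa/wa + t*fb/wb) / ((1-t)/wa + t/wb)                  */
double perspective_interp(double fa, double fb,
                          double wa, double wb, double t)
{
    return ((1.0 - t) * fa / wa + t * fb / wb) /
           ((1.0 - t) / wa + t / wb);
}
```

        With wa = 2, wb = 4, and t = 0.5, perspective_interp(wa, wb, wa, wb,
        t) and w_from_linear_recip(wa, wb, t) both evaluate 1/0.375, which
        differs from the naive screen-space average of 3.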

    What is produced by the TEX/TXP/TXD instructions if the requested texture
    image is inconsistent?

        RESOLVED:  The result vector is specified to be (0,0,0,0).  This
        behavior is consistent with the NV_texture_shader extension.  Note
        that, as in NV_texture_shader, these instructions ignore the standard
        hierarchy of texture enables and programs can access textures that are
        not specifically "enabled".

    Should a minimum precision be specified for certain fragment attribute
    registers (in particular COL0, COL1) that may not be generated with full
    fp32 precision?

        RESOLVED:  No.  It is expected that the precision of COL0/COL1 should
        generally be at least as high as that of the frame buffer.

    Fragment color components (f[COL0] and f[COL1]) are generally
    low-precision fixed-point values in the range [0,1].  Is it possible to
    pass unclamped or high-precision color components to fragment programs?

        RESOLVED:  Yes, although you can't exactly call them "colors".
        High-precision per-vertex color values can be written into any unused
        texture coordinate set, either via a MultiTexCoord call or using a
        vertex program.  These "texture coordinates" will be interpolated
        during rasterization, and can be used arbitrarily by a fragment
        program.

        In particular, there is no requirement that per-fragment attributes
        called "texture coordinates" be used for texture mapping.

    Should this specification guarantee that temporary registers are
    initialized to zero?

        RESOLVED:  Yes.  This will allow for the modular construction of
        programs that accumulate results in registers.  For example,
        per-fragment lighting may use MAD instructions to accumulate color
        contributions at each light.  Without zero-initialization, the program
        would require an explicit MOV instruction to load 0 or the use of the
        MUL instruction for the first light.
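
        The benefit is easiest to see in a CPU model of the accumulation.
        In this hypothetical C sketch the accumulator starts at zero, as a
        temporary register would, so every light runs the same
        multiply-add step and the first light needs no special case.

```c
#include <stddef.h>

typedef struct { float r, g, b; } rgb;

/* acc = light[i] * weight[i] + acc for every light, mirroring a chain of
 * MAD instructions that accumulate into one temporary register. */
rgb accumulate_lights(const rgb *light, const float *weight, size_t count)
{
    rgb acc = { 0.0f, 0.0f, 0.0f };   /* models the zero-initialized temp */
    for (size_t i = 0; i < count; ++i) {
        acc.r = light[i].r * weight[i] + acc.r;
        acc.g = light[i].g * weight[i] + acc.g;
        acc.b = light[i].b * weight[i] + acc.b;
    }
    return acc;
}
```

        Because each iteration has the same shape, code for one light can be
        emitted repeatedly without tracking which light comes first.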

    Should this specification support Unicode program strings?

        RESOLVED:  Not necessary.

    Programs defined by NV_vertex_program begin with "!!VP1.0".  Should
    fragment programs have a similar identifier?

        RESOLVED:  Yes, "!!FP1.0", identifying the first revision of this
        fragment program language.
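
        An application or tool can use the identifier to tell program types
        apart before handing a string to the GL.  The check below is a
        minimal sketch; is_fp10_program is a hypothetical helper, not part
        of the extension's API.

```c
#include <string.h>

/* Return nonzero if the first bytes of text are the "!!FP1.0" header.
 * length is the number of valid bytes in text. */
int is_fp10_program(const char *text, size_t length)
{
    static const char header[] = "!!FP1.0";
    size_t n = sizeof header - 1;        /* exclude the terminating NUL */
    return length >= n && memcmp(text, header, n) == 0;
}
```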

    Should per-fragment attributes have equivalent integer names in the
    program language, as per-vertex attributes do in NV_vertex_program?

        RESOLVED:  No.  In NV_vertex_program, "generic" vertex attributes
        could be specified directly by an application using only an attribute
        number.  Those numbers may have no necessary correlation with the
        conventional attribute names, although conventional vertex attributes
        are mapped to attribute numbers.  However, conventional attributes are
        the only outputs of vertex programs and of rasterization.  Therefore,
        there is no need for a similar input-by-number functionality for
        fragment programs.

    Should we provide the ability to issue instructions that do not update
    temporary or output registers?

        RESOLVED:  Yes.  Programs may issue instructions whose only purpose is
        to update the condition code register, and requiring such instructions
        to write to a temporary may require the use of an additional temporary
        and/or defeat possible program optimizations.  We accomplish this by
        adding two write-only temporary pseudo-registers ("RC" and "HC") that
        can be specified as destination registers.

    Do the packing and unpacking instructions in this extension make any
    sense?

        RESOLVED:  Yes.  They are useful for packing and unpacking multiple
        components in a single channel of a floating-point frame buffer.  For
        example, a 128-bit "RGBA" frame buffer could pack 16 8-bit quantities
        or 8 16-bit quantities, all of which could be used in later
        rasterization passes.  See the NV_float_buffer extension for more
        information.
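
        The arithmetic behind "16 8-bit quantities" is four bytes per 32-bit
        channel across four channels.  The C sketch below packs four values
        in [0,1] into one 32-bit word and unpacks them again.  The byte
        order (first component in the low byte), the clamping, and the
        rounding are assumptions of the sketch; pack4ub and unpack4ub are
        hypothetical names and are not claimed to match the extension's own
        pack instructions bit for bit.

```c
#include <stdint.h>

/* Convert x in [0,1] to an 8-bit unsigned code, clamping out-of-range
 * input and rounding to nearest. */
static uint32_t to_ubyte(float x)
{
    if (x < 0.0f) x = 0.0f;
    if (x > 1.0f) x = 1.0f;
    return (uint32_t)(x * 255.0f + 0.5f);
}

/* Pack four components into one 32-bit word, x in the low byte. */
uint32_t pack4ub(float x, float y, float z, float w)
{
    return to_ubyte(x) | (to_ubyte(y) << 8) |
           (to_ubyte(z) << 16) | (to_ubyte(w) << 24);
}

/* Recover the four components as floats in [0,1]. */
void unpack4ub(uint32_t bits, float out[4])
{
    for (int i = 0; i < 4; ++i)
        out[i] = (float)((bits >> (8 * i)) & 0xFFu) / 255.0f;
}
```

        Four such words, one per channel of a 128-bit RGBA pixel, carry the
        16 8-bit quantities mentioned above.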

    Should we provide a method for specifying a fp16 depth component output
    value?

        RESOLVED:  No.  There is no good reason for supporting half-precision
        Z outputs.  Even with 16-bit Z buffers, the 10-bit mantissa of the
        half-precision float is rather limiting.  There would effectively be
        only 11 good bits in the back half of the Z buffer.
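
        The "11 good bits" figure follows from the spacing of fp16 values in
        [0.5, 1.0): with 10 mantissa bits, adjacent values there are 2^-11
        apart, while a 16-bit Z buffer resolves steps of about 2^-16.  The
        helper below is a hypothetical sketch that computes the spacing by
        walking up the binades from the smallest normal value, 2^-14.

```c
/* Spacing between adjacent fp16 values (10 mantissa bits) in the binade
 * containing x.  Intended for finite x with 2^-14 <= x < 65536. */
double fp16_spacing(double x)
{
    double lo = 1.0 / 16384.0;      /* 2^-14, start of the first binade */
    double step = lo / 1024.0;      /* 2^-24, spacing in that binade    */
    while (x >= lo * 2.0) {         /* move up one binade at a time     */
        lo *= 2.0;
        step *= 2.0;
    }
    return step;
}
```

        For any depth in the back half, fp16_spacing returns 1/2048, which
        is 32 times coarser than a step of 1/65536.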
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Should RequestResidentProgramsNV (or a new equivalent function) take a
5bd8deadSopenharmony_ci    target?  Dealing with working sets of different program types is a bit
5bd8deadSopenharmony_ci    messy.  Should we document some limitation if we get programs of different
5bd8deadSopenharmony_ci    types?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        RESOLVED:  In retrospect, it may have been a good idea to attach a
5bd8deadSopenharmony_ci        target to this command, but there isn't a good reason to mess with
5bd8deadSopenharmony_ci        something that already works for vertex programs.  The driver is
5bd8deadSopenharmony_ci        responsible for ensuring consistent results when the program types
5bd8deadSopenharmony_ci        specified are mixed.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    What happens on data type conversions where the original value is not
5bd8deadSopenharmony_ci    exactly representable in the new data type, either due to overflow or
5bd8deadSopenharmony_ci    insufficient precision in the destination type?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        RESOLVED:  In case of overflow, the original value is clamped to
5bd8deadSopenharmony_ci        +/-INF (fp16 or fp32) or the nearest representable value (fx12).  In
5bd8deadSopenharmony_ci        case of imprecision, the conversion either rounds or truncates to the
5bd8deadSopenharmony_ci        nearest representable value.
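
        The fx12 case can be sketched in C as follows.  This is only an
        illustration: the helper names are hypothetical, the fx12 range of
        [-2048/1024, +2047/1024] is assumed from the data type description
        elsewhere in this specification, and round-to-nearest is an assumed
        choice (truncation is equally permitted).  NaN is mapped to 0.0.

```c
#include <math.h>

/* Hypothetical model: an fx12 value is stored as a signed count of
 * 1/1024 steps in the range [-2048, 2047]. */
static int float_to_fx12(float v)
{
    float scaled;

    if (isnan(v))
        return 0;                       /* NaN maps to 0.0 */
    if (v <= -2.0f)
        return -2048;                   /* overflow: clamp to most negative */
    if (v >= 2047.0f / 1024.0f)
        return 2047;                    /* overflow: clamp to most positive */

    scaled = v * 1024.0f;
    /* Assumption: round to nearest; an implementation may truncate. */
    return (int)(scaled >= 0.0f ? scaled + 0.5f : scaled - 0.5f);
}

/* Convert the stored step count back to a float for inspection. */
static float fx12_to_float(int q)
{
    return (float)q / 1024.0f;
}
```

        For example, float_to_fx12(100.0f) clamps to 2047, which
        fx12_to_float() reports as 2047/1024.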
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Should this extension support IEEE-style denorms?  For 32-bit IEEE
5bd8deadSopenharmony_ci    floating point, denorms are numbers smaller in absolute value than 2^-126.
5bd8deadSopenharmony_ci    For 16-bit floats used by this extension, denorms are numbers smaller in
5bd8deadSopenharmony_ci    absolute value than 2^-14.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        RESOLVED:  For 32-bit data types, hardware support for denorms was
5bd8deadSopenharmony_ci        considered too expensive relative to the benefit provided.
5bd8deadSopenharmony_ci        Computational results that would otherwise produce denorms are flushed
5bd8deadSopenharmony_ci        to zero.  For 16-bit data types, hardware denorm support will be
5bd8deadSopenharmony_ci        present.  The expense of hardware denorm support is lower and the
5bd8deadSopenharmony_ci        potential precision benefit is greater for 16-bit data types.
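
        The 32-bit flush-to-zero behavior can be modeled roughly as below.
        The function name is hypothetical, and returning positive zero
        regardless of the sign of the input is an assumption; this issue does
        not say which zero is produced.

```c
#include <float.h>

/* Sketch: flush a 32-bit result to zero if it is a denorm, i.e. nonzero
 * with absolute value below 2^-126 (FLT_MIN). */
static float flush_denorm_fp32(float v)
{
    float mag = (v < 0.0f) ? -v : v;

    if (mag != 0.0f && mag < FLT_MIN)
        return 0.0f;    /* assumption: sign of the zero is not preserved */
    return v;
}
```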
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    OpenGL provides a hierarchy of texture enables.  The texture lookup
5bd8deadSopenharmony_ci    operations in NV_texture_shader effectively override the texture enable
5bd8deadSopenharmony_ci    hierarchy and select a specific texture to enable.  What should be done by
5bd8deadSopenharmony_ci    this extension?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        RESOLVED:  This extension will build upon NV_texture_shader and reduce
5bd8deadSopenharmony_ci        the driver overhead of validating the texture enables.  Texture
5bd8deadSopenharmony_ci        lookups can be specified by instructions like "TEX H0, f[TEX2], TEX2,
5bd8deadSopenharmony_ci        3D", which would indicate to use texture coordinate set number 2 to do
5bd8deadSopenharmony_ci        a lookup in the texture object bound to the TEXTURE_3D target in
5bd8deadSopenharmony_ci        texture image unit 2.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        Each texture unit can have only one "active" target.  Programs are not
5bd8deadSopenharmony_ci        allowed to reference different texture targets in the same texture
5bd8deadSopenharmony_ci        image unit.  In the example above, any other texture instructions
5bd8deadSopenharmony_ci        using texture image unit 2 must specify the 3D texture target.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    What is the interaction with NV_register_combiners?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        RESOLVED:  Register combiners are not available when fragment programs
5bd8deadSopenharmony_ci        are enabled.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        Previous versions of this specification supported the notion of
5bd8deadSopenharmony_ci        combiner programs, where the result of fragment program execution was
5bd8deadSopenharmony_ci        a set of four "texture lookup" values that fed the register combiners.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    For convenience, should we include pseudo-instructions not present in the
5bd8deadSopenharmony_ci    hardware instruction set that are trivially implementable?  For example,
5bd8deadSopenharmony_ci    absolute value and subtract instructions could fall in this category.  An
5bd8deadSopenharmony_ci    "ABS R1,R0" instruction would be equivalent to "MAX R1,R0,-R0", and a "SUB
5bd8deadSopenharmony_ci    R2,R0,R1" would be equivalent to "ADD R2,R0,-R1".
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        RESOLVED:  In general, yes.  A SUB instruction is provided for
5bd8deadSopenharmony_ci        convenience.  This extension does not provide a separate ABS
5bd8deadSopenharmony_ci        instruction because it supports absolute value operations on each
5bd8deadSopenharmony_ci        operand.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Should there be a '+' in the <optionalSign> portion of the grammar?  There
5bd8deadSopenharmony_ci    isn't one in the GL_NV_vertex_program spec.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        RESOLVED:  Yes, for orthogonality/readability.  A '+' obviously adds
5bd8deadSopenharmony_ci        no functionality.  In NV_vertex_program, an <optionalSign> of "-" was
5bd8deadSopenharmony_ci        always a negation operator.  However, in fragment programs, it can
5bd8deadSopenharmony_ci        also be used as a sign for a constant value.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Can the same fragment attribute register, program parameter register, or
5bd8deadSopenharmony_ci    constant be used for multiple operands in the same instruction?  If so,
5bd8deadSopenharmony_ci    can it be used with different swizzle patterns?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        RESOLVED:  Yes and yes.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    This extension allows different limits for the number of texture
5bd8deadSopenharmony_ci    coordinate sets and the number of texture image units (i.e., texture maps
5bd8deadSopenharmony_ci    and associated data).  The state in ActiveTextureARB affects both
5bd8deadSopenharmony_ci    coordinate sets (TexGen, matrix operations) and image units (TexParameter,
5bd8deadSopenharmony_ci    TexEnv).  How should we deal with this?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        RESOLVED:  Continue to use ActiveTextureARB and emit an
5bd8deadSopenharmony_ci        INVALID_OPERATION if the active texture refers to an unsupported
5bd8deadSopenharmony_ci        coordinate set/image unit.  Other options included creating dummy
5bd8deadSopenharmony_ci        (unusable) state for unsupported coordinate sets/image units and
5bd8deadSopenharmony_ci        continuing to use ActiveTextureARB normally, or creating separate state
5bd8deadSopenharmony_ci        and state-setting commands for coordinate sets and image units.
5bd8deadSopenharmony_ci        Separate state is the cleanest solution, but would add more calls and
5bd8deadSopenharmony_ci        potentially cause more programmer confusion.  Dummy state would avoid
5bd8deadSopenharmony_ci        additional error checks, but the demands of dummy state could grow if
5bd8deadSopenharmony_ci        the number of texture image units and texture coordinate sets
5bd8deadSopenharmony_ci        increases.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        The current OpenGL spec is vague as to what state is affected by the
5bd8deadSopenharmony_ci        active texture selector and has no distinction between
5bd8deadSopenharmony_ci        coordinate-related and image-related state.  The state tables could
5bd8deadSopenharmony_ci        use a good clean-up in this area.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The LRP instruction is defined so that the result of "LRP R0, R0, R1, R2"
5bd8deadSopenharmony_ci    is R0*R1+(1-R0)*R2.  There are conflicting precedents here.  The
5bd8deadSopenharmony_ci    definition here matches the "lrp" instruction in the DirectX 8.0 pixel
5bd8deadSopenharmony_ci    shader language.  However, an equivalent RenderMan lerp operation would
5bd8deadSopenharmony_ci    yield a result of (1-R0)*R1+R0*R2.  Which ordering should be implemented?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        RESOLVED:  NVIDIA hardware implements the former operand ordering, and
5bd8deadSopenharmony_ci        there is no good reason to specify a different ordering.  To convert a
5bd8deadSopenharmony_ci        "LRP" using the latter ordering to NV_fragment_program, swap the third
5bd8deadSopenharmony_ci        and fourth arguments.
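
        The two orderings, and the operand swap that converts between them,
        can be checked with a short scalar sketch.  The function names are
        hypothetical; lrp_nv follows the definition given in this issue and
        lerp_alt follows the alternative ordering quoted above.

```c
/* Ordering used by this extension: w*a + (1-w)*b. */
static float lrp_nv(float w, float a, float b)
{
    return w * a + (1.0f - w) * b;
}

/* Alternative ordering quoted above: (1-w)*a + w*b. */
static float lerp_alt(float w, float a, float b)
{
    return (1.0f - w) * a + w * b;
}
```

        With a weight of 0.25, lerp_alt(0.25, 4, 8) and lrp_nv(0.25, 8, 4)
        both produce 5, which is the swap described above.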
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Should this extension provide tracking of matrices or any other state,
5bd8deadSopenharmony_ci    similar to that provided in NV_vertex_program?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        RESOLVED:  No.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Should this extension provide global program parameters -- values shared
5bd8deadSopenharmony_ci    between multiple fragment programs?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        RESOLVED:  No.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Should this extension provide program parameters specific to a program?
5bd8deadSopenharmony_ci    If so, how?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        RESOLVED:  Yes.  These parameters will be called "local parameters".
5bd8deadSopenharmony_ci        This extension will provide both named and numbered local parameters.
5bd8deadSopenharmony_ci        Local parameters can be managed by the driver and eliminate the need
5bd8deadSopenharmony_ci        for applications to manage a global name space.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        Named local parameters work much like standard variable names in most
5bd8deadSopenharmony_ci        programming languages.  They are created using the "DECLARE"
5bd8deadSopenharmony_ci        instruction within the fragment program itself.  For example:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci            DECLARE color = {1,0,0,1};
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        Named local parameters are used simply by referencing the variable
5bd8deadSopenharmony_ci        name.  They do not require the array syntax like the global parameters
5bd8deadSopenharmony_ci        in the NV_vertex_program extension.  They can be updated using the
5bd8deadSopenharmony_ci        commands ProgramNamedParameter4[f,fv]NV.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        Numbered local parameters are not declared.  They are used by simply
5bd8deadSopenharmony_ci        referencing an element of an array called "p".  For example,
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci            MOV R0, p[12];
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        loads the value of numbered local parameter 12 into register R0.
5bd8deadSopenharmony_ci        Numbered local parameters can be updated using the commands
5bd8deadSopenharmony_ci        ProgramLocalParameter4[d,dv,f,fv]ARB.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        The numbered local parameter APIs were added to this extension late in
5bd8deadSopenharmony_ci        its development, and are provided for compatibility with the
5bd8deadSopenharmony_ci        ARB_vertex_program extension, and what will likely be supported in
5bd8deadSopenharmony_ci        ARB_fragment_program as well.  Providing this mechanism allows
5bd8deadSopenharmony_ci        programs to use the same mechanisms to set local parameters in both
5bd8deadSopenharmony_ci        extensions.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Why are the APIs for setting named and numbered local parameters
5bd8deadSopenharmony_ci    different?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        RESOLVED:  The named parameter API was created prior to
5bd8deadSopenharmony_ci        ARB_vertex_program (and the possible future ARB_fragment_program) and
5bd8deadSopenharmony_ci        uses conventions borrowed from NV_vertex_program.  A slightly
5bd8deadSopenharmony_ci        different API was chosen during the ARB standardization process; see
5bd8deadSopenharmony_ci        the ARB_vertex_program specification for more details.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        The named parameter API takes a program ID and a parameter name, and
5bd8deadSopenharmony_ci        sets the parameter for the program with the specified ID.  The
5bd8deadSopenharmony_ci        specified program does not need to be bound (via BindProgramNV) in
5bd8deadSopenharmony_ci        order to modify the values of its named parameters.  The numbered
5bd8deadSopenharmony_ci        parameter API takes a program target enum (FRAGMENT_PROGRAM_NV) and a
5bd8deadSopenharmony_ci        parameter number and modifies the corresponding numbered parameter of
5bd8deadSopenharmony_ci        the currently bound program.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    What should be the initial value of uninitialized local parameters?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        RESOLVED:  (0,0,0,0).  This choice is somewhat arbitrary, but matches
5bd8deadSopenharmony_ci        previous extensions (e.g., NV_vertex_program).
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Should this extension support program parameter arrays?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        RESOLVED:  No hardware support is present.  Note that from the point
5bd8deadSopenharmony_ci        of view of a fragment program, a texture map can be used as a 1-, 2-,
5bd8deadSopenharmony_ci        or 3-dimensional array of constants.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Should this extension support constants in fragment programs?  If
5bd8deadSopenharmony_ci    so, how?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        RESOLVED:  Yes.  Scalar or vector constants can be defined inline
5bd8deadSopenharmony_ci        (e.g., "1.0" or "{1,2,3,4}").  In addition, named constants are
5bd8deadSopenharmony_ci        supported using the "DEFINE" instruction, which allows programmers to
5bd8deadSopenharmony_ci        change the values of constants used in multiple instructions simply by
5bd8deadSopenharmony_ci        changing the value assigned to the named constant.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        Note that because this extension uses program strings, the
5bd8deadSopenharmony_ci        floating-point value of any constants generated on the fly must be
5bd8deadSopenharmony_ci        printed to the program string.  An alternate method that avoids the
5bd8deadSopenharmony_ci        need to print constants is to declare a named local program parameter
5bd8deadSopenharmony_ci        and initialize it with the ProgramNamedParameter4[f,fv]NV calls.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Should named constants be allowed to be redefined?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        RESOLVED:  No.  If you want to redefine the values of constants, you
5bd8deadSopenharmony_ci        can create an equivalent named program parameter by changing the
5bd8deadSopenharmony_ci        "DEFINE" keyword to "DECLARE".
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Should functions used to update or query named local parameters take a
5bd8deadSopenharmony_ci    zero-terminated string (as with most strings in the C programming
5bd8deadSopenharmony_ci    language), or should they require an explicit string length?  If the
5bd8deadSopenharmony_ci    former, should we create a version of LoadProgramNV that does not require
5bd8deadSopenharmony_ci    a string length?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        RESOLVED:  Stick with explicit string length.  Strings that are
5bd8deadSopenharmony_ci        defined as constants can have the length computed at compile-time.
5bd8deadSopenharmony_ci        Strings read from files will have the length known in advance.
5bd8deadSopenharmony_ci        Programs that build strings at run-time also likely keep the length
5bd8deadSopenharmony_ci        up-to-date.  Passing an explicit length saves time, since the driver
5bd8deadSopenharmony_ci        doesn't have to do a strlen().
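
        As an illustration of the compile-time case, the sketch below passes
        the length of a constant name without calling strlen().  The typedefs,
        the LITERAL_LEN macro, and the recording stub are hypothetical
        stand-ins so that the example is self-contained; a real application
        would call the driver's ProgramNamedParameter4fNV entry point with the
        same argument order.

```c
/* Local stand-ins for the GL types used in the prototype. */
typedef unsigned int  gl_uint;
typedef int           gl_sizei;
typedef unsigned char gl_ubyte;

/* State recorded by the stub so the call can be inspected. */
static gl_uint  last_id;
static gl_sizei last_len;
static float    last_value[4];

/* Stub with the same parameter order as ProgramNamedParameter4fNV. */
static void stub_ProgramNamedParameter4fNV(gl_uint id, gl_sizei len,
                                           const gl_ubyte *name,
                                           float x, float y, float z, float w)
{
    (void)name;
    last_id = id;
    last_len = len;
    last_value[0] = x;
    last_value[1] = y;
    last_value[2] = z;
    last_value[3] = w;
}

/* Length of a constant character array, excluding the terminating NUL;
 * evaluated by the compiler, not at run time. */
#define LITERAL_LEN(s) ((gl_sizei)(sizeof(s) - 1))

/* Set the named local parameter "color" of a program to opaque red. */
static void set_color_red(gl_uint program_id)
{
    static const char name[] = "color";

    stub_ProgramNamedParameter4fNV(program_id, LITERAL_LEN(name),
                                   (const gl_ubyte *)name,
                                   1.0f, 0.0f, 0.0f, 1.0f);
}
```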
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    What is the deal with the alpha of the secondary color?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        RESOLVED:  In unextended OpenGL 1.2, the alpha component of the
5bd8deadSopenharmony_ci        secondary color is forced to 0.0.  In the EXT_secondary_color
5bd8deadSopenharmony_ci        extension, the alpha of the per-vertex secondary colors is defined to
5bd8deadSopenharmony_ci        be 0.0.  NV_vertex_program allows vertex programs to produce a
5bd8deadSopenharmony_ci        per-vertex alpha component, but it is forced to zero for the purposes
5bd8deadSopenharmony_ci        of the color sum.  In the NV_register_combiners extension, the alpha
5bd8deadSopenharmony_ci        component of the secondary color is undefined.  What a mess.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        In this extension, the alpha of the secondary color is well-defined
5bd8deadSopenharmony_ci        and can be used normally.  When in vertex program mode
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Why are fragment program instructions involving f[FOGC] or f[TEX0] through
5bd8deadSopenharmony_ci    f[TEX7] automatically carried out at full precision?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        RESOLVED:  This is an artifact of the method by which these
5bd8deadSopenharmony_ci        interpolants are generated in the NVIDIA graphics hardware.  If such
5bd8deadSopenharmony_ci        instructions absolutely must be carried out at lower precision, the
5bd8deadSopenharmony_ci        requirement can be met by first loading the interpolants into a
5bd8deadSopenharmony_ci        temporary register.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    With a different number of texture coordinate sets and texture image
5bd8deadSopenharmony_ci    units, how many copies of each kind of texture state are there?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        RESOLVED:  The intention is that texture state be broken into three
5bd8deadSopenharmony_ci        groups.  (1) There are MAX_TEXTURE_COORDS_NV copies of texture
5bd8deadSopenharmony_ci        coordinate set state, which includes current texture coordinates,
5bd8deadSopenharmony_ci        TexGen state, and texture matrices.  (2) There are
5bd8deadSopenharmony_ci        MAX_TEXTURE_IMAGE_UNITS_NV copies of texture image unit state, which
5bd8deadSopenharmony_ci        includes texture maps, texture parameters, and LOD bias parameters.
5bd8deadSopenharmony_ci        (3) There are MAX_TEXTURE_UNITS_ARB copies of legacy OpenGL texture
5bd8deadSopenharmony_ci        unit state (e.g., texture enables, TexEnv blending state), all of
5bd8deadSopenharmony_ci        which are unused when in fragment program mode.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        It is not necessary that MAX_TEXTURE_UNITS_ARB be equal to the minimum
5bd8deadSopenharmony_ci        of MAX_TEXTURE_COORDS_NV and MAX_TEXTURE_IMAGE_UNITS_NV --
5bd8deadSopenharmony_ci        implementations may choose not to extend fixed-function OpenGL texture
5bd8deadSopenharmony_ci        mapping modes beyond a certain point.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The GLX protocol for LoadProgramNV (and ProgramNamedParameterNV) may end
5bd8deadSopenharmony_ci    up with programs >64KB.  This will overflow the limits of the GLX Render
5bd8deadSopenharmony_ci    protocol, resulting in the need to use the RenderLarge path.  This is an
5bd8deadSopenharmony_ci    issue with vertex programs, also.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        RESOLVED:  Yes, it is.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Should textures used by fragment programs be declared?  For example,
5bd8deadSopenharmony_ci    "TEXTURE TEX3, 2D", indicating that the 2D texture should be used for all
5bd8deadSopenharmony_ci    accesses to texture unit 3.  The dimension could be dropped from the TEX
5bd8deadSopenharmony_ci    family of instructions, and some of the compile-time error checking could
5bd8deadSopenharmony_ci    be dropped.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        RESOLVED:  Maybe it should be, but for better or worse, it isn't.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    It is not all that uncommon to have negative q values with projective
5bd8deadSopenharmony_ci    texture mapping, but results are undefined if any q values are negative in
5bd8deadSopenharmony_ci    this specification.  Why?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        RESOLVED:  This restriction carries on a similar one in the initial
5bd8deadSopenharmony_ci        OpenGL specification.  The motivation for this restriction is that
5bd8deadSopenharmony_ci        when interpolating, it is possible for a fragment to have an
5bd8deadSopenharmony_ci        interpolated q coordinate at or near 0.0.  Since the texture
5bd8deadSopenharmony_ci        coordinates used for projective texture mapping are s/q, t/q, and r/q,
5bd8deadSopenharmony_ci        this will result in a divide-by-zero error or suffer from significant
5bd8deadSopenharmony_ci        numerical instability.  Results will be inaccurate for such fragments.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        Other than the numerical stability issue above, NVIDIA hardware should
5bd8deadSopenharmony_ci        have no problems with negative q coordinates.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Should programs that replace depth have their own special program type,
5bd8deadSopenharmony_ci    such as "!!FPD1.0" and "!!FPDC1.0"?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        RESOLVED:  No.  If a program has an instruction that writes to
5bd8deadSopenharmony_ci        o[DEPR], the final fragment depth value is taken from o[DEPR].z.
5bd8deadSopenharmony_ci        Otherwise, the fragment's original depth value is used.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    What fx12 value should NaN map to?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        RESOLVED:  For the lack of any better choice, 0.0.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    How are special-case encodings (-INF, +INF, -0.0, +0.0, NaN) handled for
5bd8deadSopenharmony_ci    arithmetic and comparison operations?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        RESOLVED:  The special cases for all floating-point operations are
5bd8deadSopenharmony_ci        designed to match the IEEE specification for floating-point numbers as
5bd8deadSopenharmony_ci        closely as possible.  The results produced by special cases should be
5bd8deadSopenharmony_ci        enumerated in the sections of this spec describing the operations.
5bd8deadSopenharmony_ci        There are some cases where the implemented fragment program behavior
5bd8deadSopenharmony_ci        does not match IEEE conventions, and these cases should be noted in
5bd8deadSopenharmony_ci        this specification.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    How can condition codes be used to mask out register writes?  How about
5bd8deadSopenharmony_ci    killing fragments?  What other things can you do?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        RESOLVED:  The following example computes a component-wise |R1-R2|:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci          SUBC R0, R1, R2;      # "C" suffix means update condition code
5bd8deadSopenharmony_ci          MOV  R0 (LT), -R0;    # Conditional write mask in parentheses
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        The first instruction computes a component-wise difference between R1
5bd8deadSopenharmony_ci        and R2, storing R1-R2 in register R0.  The "C" suffix in the
5bd8deadSopenharmony_ci        instruction means to update the condition code based on the sign of
5bd8deadSopenharmony_ci        the result vector components.  The second instruction inverts the sign
5bd8deadSopenharmony_ci        of the components of R0.  However, the "(LT)" portion says that the
5bd8deadSopenharmony_ci        destination register should be updated only if the corresponding
5bd8deadSopenharmony_ci        condition code component is LT (negative).  This means that only those
5bd8deadSopenharmony_ci        components of R0 that are negative are negated, leaving |R1-R2| in R0.
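
        The effect of the two instructions can be emulated on the CPU as
        sketched below.  The types and function names are hypothetical, and
        the model tracks only the LT/EQ/GT outcomes of the condition code; it
        ignores how NaN results would be classified.

```c
typedef struct { float v[4]; } vec4;
typedef enum { COND_LT, COND_EQ, COND_GT } cond_t;

/* SUBC dst, a, b: component-wise a-b, updating the condition code from
 * the sign of each result component. */
static void subc(vec4 *dst, cond_t cc[4], const vec4 *a, const vec4 *b)
{
    int i;
    for (i = 0; i < 4; ++i) {
        float r = a->v[i] - b->v[i];
        dst->v[i] = r;
        cc[i] = (r < 0.0f) ? COND_LT : (r > 0.0f) ? COND_GT : COND_EQ;
    }
}

/* MOV dst (LT), -src: write the negated source only to those components
 * whose condition code is LT; other components are left unchanged. */
static void mov_neg_if_lt(vec4 *dst, const cond_t cc[4], const vec4 *src)
{
    int i;
    for (i = 0; i < 4; ++i) {
        if (cc[i] == COND_LT)
            dst->v[i] = -src->v[i];
    }
}

/* The two-instruction sequence: returns component-wise |r1 - r2|. */
static vec4 abs_diff(vec4 r1, vec4 r2)
{
    vec4 r0;
    cond_t cc[4];

    subc(&r0, cc, &r1, &r2);
    mov_neg_if_lt(&r0, cc, &r0);
    return r0;
}
```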
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        To kill a fragment if the red (x) component of a texture lookup
5bd8deadSopenharmony_ci        returns zero:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci          TEXC R0, f[TEX0], TEX0, 2D;
5bd8deadSopenharmony_ci          KIL EQ.x;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        To kill based on the green (y) component, use "EQ.y" instead.  To kill
5bd8deadSopenharmony_ci        if any of the four components is zero, use "EQ.xyzw" or just "EQ".
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        Fragment programs do not support boolean expressions.  These can
5bd8deadSopenharmony_ci        generally be achieved using conditional write masks.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        To evaluate the expression "(R0.x == 0) && (R1.x == 0)":
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci          MOVC RC.x, R0.x;
5bd8deadSopenharmony_ci          MOVC RC.x (EQ), R1.x;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        To evaluate the expression "(R0.x == 0) || (R1.x == 0)":
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci          MOVC RC.x, R0.x;
5bd8deadSopenharmony_ci          MOVC RC.x (NE), R1.x;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        In both cases, the x component of the condition code will contain "EQ"
5bd8deadSopenharmony_ci        if and only if the condition is TRUE.
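
        The reasoning behind both sequences can be traced with a small scalar
        model.  The names are hypothetical and the condition code is reduced
        to two states, EQ (value was zero) and NE (anything else).  The key
        point is that a masked MOVC updates the condition code only when its
        write is enabled.

```c
typedef enum { ZCC_EQ, ZCC_NE } zcc_t;

/* Condition code produced by writing the value v with a "C" suffix. */
static zcc_t zcc_of(float v)
{
    return (v == 0.0f) ? ZCC_EQ : ZCC_NE;
}

/* MOVC RC.x, R0.x;  MOVC RC.x (EQ), R1.x;
 * The second write happens only if R0.x was zero, so the result is EQ
 * only when both values are zero. */
static zcc_t cc_and_zero(float r0x, float r1x)
{
    zcc_t cc = zcc_of(r0x);
    if (cc == ZCC_EQ)
        cc = zcc_of(r1x);
    return cc;
}

/* MOVC RC.x, R0.x;  MOVC RC.x (NE), R1.x;
 * The second write happens only if R0.x was nonzero, so the result is EQ
 * when either value is zero. */
static zcc_t cc_or_zero(float r0x, float r1x)
{
    zcc_t cc = zcc_of(r0x);
    if (cc == ZCC_NE)
        cc = zcc_of(r1x);
    return cc;
}
```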
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    How can fragment programs be used to implement non-standard texture
5bd8deadSopenharmony_ci    filtering modes?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        RESOLVED:  As one example, consider a case where you want to do linear
5bd8deadSopenharmony_ci        filtering in a 2D texture map, but only horizontally.  To achieve
5bd8deadSopenharmony_ci        this, first set the texture filtering mode to NEAREST.  For a 16 x n
5bd8deadSopenharmony_ci        texture, you might do something like:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci          DEFINE halfTexel = { 0.03125, 0 };   # 1/32 (1/2 a texel)
5bd8deadSopenharmony_ci          ADD R2, f[TEX0], -halfTexel;         # coords of left sample
5bd8deadSopenharmony_ci          ADD R1, f[TEX0], +halfTexel;         # coords of right sample
5bd8deadSopenharmony_ci          TEX R0, R2, TEX0, 2D;                # lookup left sample
5bd8deadSopenharmony_ci          TEX R1, R1, TEX0, 2D;                # lookup right sample
5bd8deadSopenharmony_ci          MUL R2.x, R2.x, 16;                  # scale X coords to texels
5bd8deadSopenharmony_ci          FRC R2.x, R2.x;                      # get fraction, filter weight
5bd8deadSopenharmony_ci          LRP R0, R2.x, R1, R0;                # blend samples based on weight
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        There are plenty of other interesting things that can be done.
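
        To see why the program above yields a horizontal linear filter, its
        arithmetic can be replayed on the CPU for one row of a 16-texel-wide,
        single-channel texture.  This is a sketch only: the function names
        are hypothetical, and clamping the NEAREST lookup to the edge texels
        is an assumed wrap mode.

```c
#define TEX_WIDTH 16

/* floor() for small magnitudes, written out to avoid a libm dependency. */
static float floor_f(float x)
{
    float t = (float)(int)x;            /* truncates toward zero */
    return (t > x) ? t - 1.0f : t;      /* step down for negative values */
}

/* NEAREST lookup in one row.  Assumption: indices clamp to the edges. */
static float tex_nearest(const float row[TEX_WIDTH], float s)
{
    int i = (int)floor_f(s * (float)TEX_WIDTH);
    if (i < 0)
        i = 0;
    if (i > TEX_WIDTH - 1)
        i = TEX_WIDTH - 1;
    return row[i];
}

/* Mirrors the fragment program instruction by instruction. */
static float filter_horizontal(const float row[TEX_WIDTH], float s)
{
    const float half_texel = 0.03125f;       /* DEFINE halfTexel (1/32) */
    float left  = s - half_texel;            /* ADD R2: left sample coord */
    float right = s + half_texel;            /* ADD R1: right sample coord */
    float r0 = tex_nearest(row, left);       /* TEX R0: left sample */
    float r1 = tex_nearest(row, right);      /* TEX R1: right sample */
    float w  = left * (float)TEX_WIDTH;      /* MUL: scale to texels */
    w = w - floor_f(w);                      /* FRC: filter weight */
    return w * r1 + (1.0f - w) * r0;         /* LRP: blend samples */
}
```

        At a texel center the weight is zero and the left sample is returned
        unchanged; halfway between two centers the two samples are averaged.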
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Should this specification provide more examples?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        RESOLVED:  Yes, it should.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Is the OpenGL ARB working on a multi-vendor standard for fragment
5bd8deadSopenharmony_ci    programmability?  Will there be an ARB_fragment_program extension?  If so,
5bd8deadSopenharmony_ci    how will this extension interact with the ARB standard?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        RESOLVED:  Yes, as of July 2002, there was a multi-vendor working
5bd8deadSopenharmony_ci        group and a draft specification.  The ARB extension is expected to
5bd8deadSopenharmony_ci        have several features not present in this extension, such as state
5bd8deadSopenharmony_ci        tracking and global parameters (called "program environment
5bd8deadSopenharmony_ci        parameters").  It will also likely lack certain features found in this
5bd8deadSopenharmony_ci        extension.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Why does the HEMI mapping apply to the third component of signed HILO
5bd8deadSopenharmony_ci    textures, but not to unsigned HILO textures?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        RESOLVED:  This behavior matches the behavior of NV_texture_shader
5bd8deadSopenharmony_ci        (e.g., the DOT_PRODUCT_NV mode).  The HEMI mapping will construct the
5bd8deadSopenharmony_ci        third component of a unit vector whose first two components are
5bd8deadSopenharmony_ci        encoded in the HILO texture.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciNew Procedures and Functions
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    void ProgramNamedParameter4fNV(uint id, sizei len, const ubyte *name,
5bd8deadSopenharmony_ci                                   float x, float y, float z, float w);
5bd8deadSopenharmony_ci    void ProgramNamedParameter4dNV(uint id, sizei len, const ubyte *name,
5bd8deadSopenharmony_ci                                   double x, double y, double z, double w);
5bd8deadSopenharmony_ci    void ProgramNamedParameter4fvNV(uint id, sizei len, const ubyte *name,
5bd8deadSopenharmony_ci                                    const float v[]);
5bd8deadSopenharmony_ci    void ProgramNamedParameter4dvNV(uint id, sizei len, const ubyte *name,
5bd8deadSopenharmony_ci                                    const double v[]);
5bd8deadSopenharmony_ci    void GetProgramNamedParameterfvNV(uint id, sizei len, const ubyte *name,
5bd8deadSopenharmony_ci                                      float *params);
5bd8deadSopenharmony_ci    void GetProgramNamedParameterdvNV(uint id, sizei len, const ubyte *name,
5bd8deadSopenharmony_ci                                      double *params);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    void ProgramLocalParameter4dARB(enum target, uint index,
5bd8deadSopenharmony_ci                                    double x, double y, double z, double w);
5bd8deadSopenharmony_ci    void ProgramLocalParameter4dvARB(enum target, uint index,
5bd8deadSopenharmony_ci                                     const double *params);
5bd8deadSopenharmony_ci    void ProgramLocalParameter4fARB(enum target, uint index,
5bd8deadSopenharmony_ci                                    float x, float y, float z, float w);
5bd8deadSopenharmony_ci    void ProgramLocalParameter4fvARB(enum target, uint index,
5bd8deadSopenharmony_ci                                     const float *params);
5bd8deadSopenharmony_ci    void GetProgramLocalParameterdvARB(enum target, uint index,
5bd8deadSopenharmony_ci                                       double *params);
5bd8deadSopenharmony_ci    void GetProgramLocalParameterfvARB(enum target, uint index,
5bd8deadSopenharmony_ci                                       float *params);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciNew Tokens
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Accepted by the <cap> parameter of Disable, Enable, and IsEnabled, by the
5bd8deadSopenharmony_ci    <pname> parameter of GetBooleanv, GetIntegerv, GetFloatv, and GetDoublev,
5bd8deadSopenharmony_ci    and by the <target> parameter of BindProgramNV, LoadProgramNV,
5bd8deadSopenharmony_ci    ProgramLocalParameter4dARB, ProgramLocalParameter4dvARB,
5bd8deadSopenharmony_ci    ProgramLocalParameter4fARB, ProgramLocalParameter4fvARB,
5bd8deadSopenharmony_ci    GetProgramLocalParameterdvARB, and GetProgramLocalParameterfvARB:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        FRAGMENT_PROGRAM_NV                            0x8870
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Accepted by the <pname> parameter of GetBooleanv, GetIntegerv, GetFloatv,
5bd8deadSopenharmony_ci    and GetDoublev:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        MAX_TEXTURE_COORDS_NV                          0x8871
5bd8deadSopenharmony_ci        MAX_TEXTURE_IMAGE_UNITS_NV                     0x8872
5bd8deadSopenharmony_ci        FRAGMENT_PROGRAM_BINDING_NV                    0x8873
5bd8deadSopenharmony_ci        MAX_FRAGMENT_PROGRAM_LOCAL_PARAMETERS_NV       0x8868
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Accepted by the <name> parameter of GetString:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        PROGRAM_ERROR_STRING_NV                        0x8874
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciAdditions to Chapter 2 of the OpenGL 1.2.1 Specification (OpenGL Operation)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Modify Section 2.11, Clipping (p.39)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (replace the first paragraph of the section, p. 39)  Primitives are clipped
5bd8deadSopenharmony_ci    to the clip volume.  In clip coordinates, the view volume is defined by
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        -w_c <= x_c <= w_c,
5bd8deadSopenharmony_ci        -w_c <= y_c <= w_c, and
5bd8deadSopenharmony_ci        -w_c <= z_c <= w_c.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Clipping to the near and far clip planes is ignored if fragment program
5bd8deadSopenharmony_ci    mode (section 3.11) or texture shaders (see NV_texture_shader
5bd8deadSopenharmony_ci    specification) are enabled and the current fragment program or texture
5bd8deadSopenharmony_ci    shader computes per-fragment depth values.  In this case, the view volume
5bd8deadSopenharmony_ci    is defined by:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        -w_c <= x_c <= w_c and
5bd8deadSopenharmony_ci        -w_c <= y_c <= w_c.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciAdditions to Chapter 3 of the OpenGL 1.2.1 Specification (Rasterization)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Modify Chapter 3 introduction (p. 57)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (p.57, modify 1st paragraph) ... Figure 3.1 diagrams the rasterization
5bd8deadSopenharmony_ci    process.  The color value assigned to a fragment is initially determined
5bd8deadSopenharmony_ci    by the rasterization operations (Sections 3.3 through 3.7) and modified by
5bd8deadSopenharmony_ci    either the execution of the texturing, color sum, and fog operations as
5bd8deadSopenharmony_ci    defined in Sections 3.8, 3.9, and 3.10, or of a fragment program defined
5bd8deadSopenharmony_ci    in Section 3.11.  The final depth value is initially determined by the
5bd8deadSopenharmony_ci    rasterization operations and may be modified by a fragment program.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Note:  Antialiasing Application is renumbered from Section 3.11 to Section
5bd8deadSopenharmony_ci    3.12.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Modify Figure 3.1 (p.58)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci                             Primitive Assembly
5bd8deadSopenharmony_ci                                      |
5bd8deadSopenharmony_ci              +-----------+-----------+-----------+-----------+
5bd8deadSopenharmony_ci              |           |           |           |           |
5bd8deadSopenharmony_ci              |           |           |        Pixel          |
5bd8deadSopenharmony_ci            Point       Line       Polygon     Rectangle   Bitmap
5bd8deadSopenharmony_ci           Raster-     Raster-     Raster-     Raster-     Raster-
5bd8deadSopenharmony_ci           ization     ization     ization     ization     ization
5bd8deadSopenharmony_ci              |           |           |           |           |
5bd8deadSopenharmony_ci              +-----------+-----------+-----------+-----------+
5bd8deadSopenharmony_ci                                      |
5bd8deadSopenharmony_ci                                      |
5bd8deadSopenharmony_ci                    +-----------------+-----------------+
5bd8deadSopenharmony_ci                    |                 |                 |
5bd8deadSopenharmony_ci              Conventional         Texture          Fragment
5bd8deadSopenharmony_ci              Texture Fetch        Shaders          Programs
5bd8deadSopenharmony_ci                    |                 |                 |
5bd8deadSopenharmony_ci                    |  +--------------+                 |
5bd8deadSopenharmony_ci                    |  |                                |
5bd8deadSopenharmony_ci        TEXTURE_    o  o                                |
5bd8deadSopenharmony_ci        SHADER_NV                                       |
5bd8deadSopenharmony_ci        enable      o                                   |
5bd8deadSopenharmony_ci                    |                                   |
5bd8deadSopenharmony_ci                    +-------------+                     |
5bd8deadSopenharmony_ci                    |             |                     |
5bd8deadSopenharmony_ci               Conventional   Register                  |
5bd8deadSopenharmony_ci                  TexEnv      Combiners                 |
5bd8deadSopenharmony_ci                    |             |                     |
5bd8deadSopenharmony_ci                Color Sum         |                     |
5bd8deadSopenharmony_ci                    |             |                     |
5bd8deadSopenharmony_ci                   Fog            |                     |
5bd8deadSopenharmony_ci                    |             |                     |
5bd8deadSopenharmony_ci                    |  +----------+                     |
5bd8deadSopenharmony_ci                    |  |                                |
5bd8deadSopenharmony_ci        REGISTER_   o  o                                |
5bd8deadSopenharmony_ci        COMBINERS_                                      |
5bd8deadSopenharmony_ci        NV enable   o                                   |
5bd8deadSopenharmony_ci                    |                                   |
5bd8deadSopenharmony_ci                    +-----------------+  +--------------+
5bd8deadSopenharmony_ci                                      |  |
5bd8deadSopenharmony_ci                           FRAGMENT_  o  o
5bd8deadSopenharmony_ci                           PROGRAM_
5bd8deadSopenharmony_ci                           NV enable  o
5bd8deadSopenharmony_ci                                      |
5bd8deadSopenharmony_ci                                      |
5bd8deadSopenharmony_ci                                   Coverage
5bd8deadSopenharmony_ci                                  Application
5bd8deadSopenharmony_ci                                      |
5bd8deadSopenharmony_ci                                      v
5bd8deadSopenharmony_ci                            to fragment processing
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Modify Section 3.3, Points (p.61)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    All fragments produced in rasterizing a non-antialiased point are assigned
5bd8deadSopenharmony_ci    the same associated data, which are those of the vertex corresponding to
5bd8deadSopenharmony_ci    the point.  (delete reference to divide by q).
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If antialiasing is enabled, then ...  The data associated with each
5bd8deadSopenharmony_ci    fragment are otherwise the data associated with the point being
5bd8deadSopenharmony_ci    rasterized.  (delete reference to divide by q)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Modify Section 3.4.1, Basic Line Segment Rasterization (p.66)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (Note that t=0 at p_a and t=1 at p_b).  The value of an associated datum f
5bd8deadSopenharmony_ci    for the fragment, whether it be R, G, B, or A (in RGBA mode) or a color
5bd8deadSopenharmony_ci    index (in color index mode), the s, t, r, or q texture coordinate, or the
5bd8deadSopenharmony_ci    clip w coordinate (the depth value, window z, must be found using equation
5bd8deadSopenharmony_ci    3.3, below), is found as
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      f = (1-t) * f_a / w_a + t * f_b / w_b                     (3.2)
5bd8deadSopenharmony_ci          ---------------------------------
5bd8deadSopenharmony_ci                (1-t) / w_a + t / w_b
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    where f_a and f_b are the data associated with the starting and ending
5bd8deadSopenharmony_ci    endpoints of the segment, respectively; w_a and w_b are the clip
5bd8deadSopenharmony_ci    w coordinates of the starting and ending endpoints of the segment,
5bd8deadSopenharmony_ci    respectively.  Note that linear interpolation would use
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      f = (1-t) * f_a + t * f_b.                                (3.3)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    ... A GL implementation may choose to approximate equation 3.2 with 3.3,
5bd8deadSopenharmony_ci    but this will normally lead to unacceptable distortion effects when
5bd8deadSopenharmony_ci    interpolating texture coordinates or clip w coordinates.
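
    The difference between equations 3.2 and 3.3 can be sketched in C as
    follows.  This is an illustration only, not part of the extension; the
    function names interp_perspective and interp_linear are hypothetical.

```c
#include <assert.h>

/* Perspective-correct interpolation along a line segment (equation 3.2).
 * t is the parametric position (0 at p_a, 1 at p_b); w_a and w_b are the
 * clip w coordinates of the two endpoints and are assumed to be nonzero. */
double interp_perspective(double t, double f_a, double w_a,
                          double f_b, double w_b)
{
    assert(w_a != 0.0 && w_b != 0.0);
    double num = (1.0 - t) * f_a / w_a + t * f_b / w_b;
    double den = (1.0 - t) / w_a + t / w_b;
    return num / den;
}

/* Plain linear interpolation (equation 3.3). */
double interp_linear(double t, double f_a, double f_b)
{
    return (1.0 - t) * f_a + t * f_b;
}
```

    For example, with f_a = 0, f_b = 1, w_a = 1, w_b = 3, and t = 0.5,
    equation 3.2 yields 0.25 while equation 3.3 yields 0.5.  The two agree
    whenever w_a equals w_b.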
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Modify Section 3.5.1, Basic Polygon Rasterization (p.71)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Denote a datum at p_a, p_b, or p_c ... is given by
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      f = a * f_a / w_a + b * f_b / w_b + c * f_c / w_c         (3.4)
5bd8deadSopenharmony_ci          ---------------------------------------------
5bd8deadSopenharmony_ci                  a / w_a + b / w_b + c / w_c
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    where w_a, w_b, and w_c are the clip w coordinates of p_a, p_b, and p_c,
5bd8deadSopenharmony_ci    respectively.  a, b, and c are the barycentric coordinates of the fragment
5bd8deadSopenharmony_ci    for which the data are produced. a, b, and c must correspond precisely to
5bd8deadSopenharmony_ci    the exact coordinates ... at the fragment's center.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Just as with line segment rasterization, equation 3.4 may be approximated
5bd8deadSopenharmony_ci    by
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      f = a * f_a + b * f_b + c * f_c;                          (3.5)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    this may yield ... for texture coordinates or clip w coordinates.
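
    Equation 3.4 can be sketched in C as shown below.  The function name
    interp_triangle and the array-based calling convention are assumptions
    made for illustration; bary holds the barycentric coordinates (a, b, c),
    f holds the data at p_a, p_b, and p_c, and w holds the corresponding clip
    w coordinates.

```c
#include <assert.h>

/* Perspective-correct interpolation over a triangle (equation 3.4).
 * All three clip w coordinates are assumed to be nonzero. */
double interp_triangle(const double bary[3], const double f[3],
                       const double w[3])
{
    double num = 0.0, den = 0.0;
    for (int i = 0; i < 3; ++i) {
        assert(w[i] != 0.0);
        num += bary[i] * f[i] / w[i];   /* a * f_a / w_a + ... */
        den += bary[i] / w[i];          /* a / w_a + ...       */
    }
    return num / den;
}
```

    When all three w values are equal, the result reduces to the linear
    form of equation 3.5.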
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Modify Section 3.6.4, Rasterization of Pixel Rectangles (p.100)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    A fragment arising from a group ... are given by those associated with the
5bd8deadSopenharmony_ci    current raster position.  (delete reference to divide by q)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Modify Section 3.7, Bitmaps (p.111)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Otherwise, a rectangular array ... The associated data for each fragment
5bd8deadSopenharmony_ci    are those associated with the current raster position.  (delete reference
5bd8deadSopenharmony_ci    to divide by q)  Once the fragments have been produced ...
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Modify Section 3.8, Texturing (p.112)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    ... an image at the location indicated by a fragment's texture coordinates
5bd8deadSopenharmony_ci    to modify the fragment's primary RGBA color.  Texturing does not affect the
5bd8deadSopenharmony_ci    secondary color.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Texturing is specified only for RGBA mode; its use in color index mode is
5bd8deadSopenharmony_ci    undefined.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Except when in fragment program mode (Section 3.11), the (s,t,r) texture
5bd8deadSopenharmony_ci    coordinates used for texturing are the values s/q, t/q, and r/q,
5bd8deadSopenharmony_ci    respectively, where s, t, r, and q are the texture coordinates associated
5bd8deadSopenharmony_ci    with the fragment.  When in fragment program mode, the (s,t,r) texture
5bd8deadSopenharmony_ci    coordinates are specified by the program.  If q is less than or equal to
5bd8deadSopenharmony_ci    zero, the results of texturing are undefined.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Add new Section 3.11, Fragment Programs (p.140)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Fragment program mode is enabled and disabled with the Enable and Disable
5bd8deadSopenharmony_ci    commands using the symbolic constant FRAGMENT_PROGRAM_NV.  When fragment
5bd8deadSopenharmony_ci    program mode is enabled, standard and extended texturing, color sum, and
5bd8deadSopenharmony_ci    fog application stages are ignored and a general purpose program is
5bd8deadSopenharmony_ci    executed instead.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    A fragment program is a sequence of instructions that execute on a
5bd8deadSopenharmony_ci    per-fragment basis.  In fragment program mode, the currently bound
5bd8deadSopenharmony_ci    fragment program is executed as each fragment is generated by the
5bd8deadSopenharmony_ci    rasterization operations.  Fragment programs execute a finite fixed
5bd8deadSopenharmony_ci    sequence of instructions with no branching or looping, and operate
5bd8deadSopenharmony_ci    independently from the processing of other fragments.  Fragment programs
5bd8deadSopenharmony_ci    are used to compute new color values to be associated with each fragment,
5bd8deadSopenharmony_ci    and can optionally compute a new depth value for each fragment as well.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Fragment program mode is not available in color index mode and is
5bd8deadSopenharmony_ci    considered disabled, regardless of the state of FRAGMENT_PROGRAM_NV.  When
5bd8deadSopenharmony_ci    fragment program mode is enabled, texture shaders and register combiners
5bd8deadSopenharmony_ci    (NV_texture_shader and NV_register_combiners extensions) are disabled,
5bd8deadSopenharmony_ci    regardless of the state of TEXTURE_SHADER_NV and REGISTER_COMBINERS_NV.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.1, Fragment Program Registers
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Fragment programs operate on a set of program registers.  Each program
5bd8deadSopenharmony_ci    register is a 4-component vector, whose components are referred to as "x",
5bd8deadSopenharmony_ci    "y", "z", and "w" respectively.  The components of a fragment register are
5bd8deadSopenharmony_ci    always referred to in this manner, regardless of the meaning of their
5bd8deadSopenharmony_ci    contents.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The four components of each fragment program register have one of two
5bd8deadSopenharmony_ci    different representations:  32-bit floating-point (fp32) or 16-bit
5bd8deadSopenharmony_ci    floating-point (fp16).  More details on these representations can be found
5bd8deadSopenharmony_ci    in Section 3.11.4.1.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    There are several different classes of program registers.  Attribute
5bd8deadSopenharmony_ci    registers (Table X.1) correspond to the fragment's associated data
5bd8deadSopenharmony_ci    produced by rasterization.  Temporary registers (Table X.2) hold
5bd8deadSopenharmony_ci    intermediate results generated by the fragment program.  Output registers
5bd8deadSopenharmony_ci    (Table X.3) hold the final results of a fragment program.  The single
5bd8deadSopenharmony_ci    condition code register is used to mask writes to other registers or to
5bd8deadSopenharmony_ci    determine if a fragment should be discarded.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.1.1, Fragment Program Attribute Registers
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The fragment program attribute registers (Table X.1) hold the location of
5bd8deadSopenharmony_ci    the fragment and the data associated with the fragment produced by
5bd8deadSopenharmony_ci    rasterization.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Fragment Attribute                                    Component
5bd8deadSopenharmony_ci    Register Name    Description                          Interpretation
5bd8deadSopenharmony_ci    --------------   -----------------------------------  --------------
5bd8deadSopenharmony_ci       f[WPOS]       Position of the fragment center.     (x,y,z,1/w)
5bd8deadSopenharmony_ci       f[COL0]       Interpolated primary color           (r,g,b,a)
5bd8deadSopenharmony_ci       f[COL1]       Interpolated secondary color         (r,g,b,a)
5bd8deadSopenharmony_ci       f[FOGC]       Interpolated fog distance/coord      (z,0,0,0)
5bd8deadSopenharmony_ci       f[TEX0]       Texture coordinate (unit 0)          (s,t,r,q)
5bd8deadSopenharmony_ci       f[TEX1]       Texture coordinate (unit 1)          (s,t,r,q)
5bd8deadSopenharmony_ci       f[TEX2]       Texture coordinate (unit 2)          (s,t,r,q)
5bd8deadSopenharmony_ci       f[TEX3]       Texture coordinate (unit 3)          (s,t,r,q)
5bd8deadSopenharmony_ci       f[TEX4]       Texture coordinate (unit 4)          (s,t,r,q)
5bd8deadSopenharmony_ci       f[TEX5]       Texture coordinate (unit 5)          (s,t,r,q)
5bd8deadSopenharmony_ci       f[TEX6]       Texture coordinate (unit 6)          (s,t,r,q)
5bd8deadSopenharmony_ci       f[TEX7]       Texture coordinate (unit 7)          (s,t,r,q)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Table X.1:  Fragment Attribute Registers.  The component interpretation
5bd8deadSopenharmony_ci    column describes the mapping of attribute values to register components.
5bd8deadSopenharmony_ci    For example, the "x" component of f[COL0] holds the red color component,
5bd8deadSopenharmony_ci    and the "x" component of f[TEX0] holds the "s" texture coordinate for
5bd8deadSopenharmony_ci    texture unit 0.  The entries "0" and "1" indicate that the attribute
5bd8deadSopenharmony_ci    register components hold the constants 0 and 1, respectively.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    f[WPOS].x and f[WPOS].y hold the (x,y) window coordinates of the fragment
5bd8deadSopenharmony_ci    center, relative to the lower left corner of the window.  f[WPOS].z
5bd8deadSopenharmony_ci    holds the associated z window coordinate, normally in the range [0,1].
5bd8deadSopenharmony_ci    f[WPOS].w holds the reciprocal of the associated clip w coordinate.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    f[COL0] and f[COL1] hold the associated RGBA primary and secondary colors
5bd8deadSopenharmony_ci    of the fragment, respectively.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    f[FOGC] holds the associated eye distance or fog coordinate normally used
5bd8deadSopenharmony_ci    for fog computations.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    f[TEX0] through f[TEX7] hold the associated texture coordinates for
5bd8deadSopenharmony_ci    texture coordinate sets 0 through 7, respectively.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    All attribute register components are treated as 32-bit floats.  However,
5bd8deadSopenharmony_ci    the components of primary and secondary colors (f[COL0] and f[COL1]) may
5bd8deadSopenharmony_ci    be generated with reduced precision.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The contents of the fragment attribute registers may not be modified by a
5bd8deadSopenharmony_ci    fragment program.  In addition, each fragment program instruction can use
5bd8deadSopenharmony_ci    at most one unique attribute register.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.1.2, Fragment Program Temporary Registers
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The fragment temporary registers (Table X.2) hold intermediate values used
5bd8deadSopenharmony_ci    during the execution of a fragment program.  There are 96 temporary
5bd8deadSopenharmony_ci    register names, but not all can be used simultaneously.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Fragment Temporary
5bd8deadSopenharmony_ci    Register Name       Description
5bd8deadSopenharmony_ci    ------------------  -----------------------------------------------------
5bd8deadSopenharmony_ci        R0-R31          Four 32-bit (fp32) floating point values (s.e8.m23)
5bd8deadSopenharmony_ci        H0-H63          Four 16-bit (fp16) floating point values (s.e5.m10)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Table X.2:  Fragment Temporary Registers.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    In addition to the normal temporary registers, there are two temporary
5bd8deadSopenharmony_ci    pseudo-registers, "RC" and "HC".  RC and HC are treated as unnumbered,
5bd8deadSopenharmony_ci    write-only temporary registers.  The components of RC have a fp32 data
5bd8deadSopenharmony_ci    type; the components of HC have a fp16 data type.  The sole purpose of
5bd8deadSopenharmony_ci    these registers is to permit instructions to modify the condition code
5bd8deadSopenharmony_ci    register (section 3.11.1.4) without overwriting the values in any
5bd8deadSopenharmony_ci    temporary register.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Fragment program instructions can read and write temporary registers.
5bd8deadSopenharmony_ci    There is no restriction on the number of temporary registers that can be
5bd8deadSopenharmony_ci    accessed by any given instruction.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    All temporary registers are initialized to (0,0,0,0) each time a fragment
5bd8deadSopenharmony_ci    program executes.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.1.3, Fragment Program Output Registers
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The fragment program output registers hold the final results of the
5bd8deadSopenharmony_ci    fragment program.  The possible final results of a fragment program are a
5bd8deadSopenharmony_ci    high- or low-precision RGBA fragment color, and a fragment depth value.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci       Output
5bd8deadSopenharmony_ci    Register Name      Description
5bd8deadSopenharmony_ci    -------------      -------------------------------------------------------
5bd8deadSopenharmony_ci       o[COLR]         Final RGBA fragment color, fp32 format
5bd8deadSopenharmony_ci       o[COLH]         Final RGBA fragment color, fp16 format
5bd8deadSopenharmony_ci       o[DEPR]         Final fragment depth value, fp32 format
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Table X.3:  Fragment Program Output Registers.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    o[COLR] and o[COLH] specify the color of a fragment.  These two registers
5bd8deadSopenharmony_ci    are identical, except for the associated data type of the components.  The
5bd8deadSopenharmony_ci    R, G, B, and A components of the fragment color are taken from the x, y,
5bd8deadSopenharmony_ci    z, and w components, respectively, of o[COLR] or o[COLH].  A fragment
5bd8deadSopenharmony_ci    program will fail to load if it writes to both o[COLR] and o[COLH].
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    o[DEPR] can be used to replace the associated depth value of a fragment.
5bd8deadSopenharmony_ci    The new depth value is taken from the z component of o[DEPR].  If a
5bd8deadSopenharmony_ci    fragment program does not write to o[DEPR], the associated depth value is
5bd8deadSopenharmony_ci    unmodified.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    A fragment program will fail to load if it does not write to at least one
5bd8deadSopenharmony_ci    output register.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The fragment program output registers may not be read by a fragment
5bd8deadSopenharmony_ci    program, but may be written to multiple times.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The values of all fragment program output registers are initially
5bd8deadSopenharmony_ci    undefined.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.1.4, Fragment Program Condition Code Register
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The condition code register (CC) is a single four-component vector.  Each
5bd8deadSopenharmony_ci    component of this register is one of four enumerated values:  GT (greater
5bd8deadSopenharmony_ci    than), EQ (equal), LT (less than), or UN (unordered).  The condition code
5bd8deadSopenharmony_ci    register can be used to mask writes to fragment data register components
5bd8deadSopenharmony_ci    or to terminate processing of a fragment altogether (via the KIL
5bd8deadSopenharmony_ci    instruction).
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Most fragment program instructions can optionally update the condition
5bd8deadSopenharmony_ci    code register.  When a fragment program instruction updates the condition
5bd8deadSopenharmony_ci    code register, a condition code component is set to LT if the
5bd8deadSopenharmony_ci    corresponding component of the result vector is less than zero, EQ if it
5bd8deadSopenharmony_ci    is equal to zero, GT if it is greater than zero, and UN if it is NaN (not
5bd8deadSopenharmony_ci    a number).
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The condition code register is initialized to a vector of EQ values each
5bd8deadSopenharmony_ci    time a fragment program executes.
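
    The update and initialization rules above can be sketched in C as
    follows.  The type cc_t and the functions cc_from_result, cc_update, and
    cc_reset are hypothetical names used only for illustration; they are not
    part of the extension.

```c
#include <assert.h>
#include <math.h>
#include <stddef.h>

typedef enum { CC_LT, CC_EQ, CC_GT, CC_UN } cc_t;

/* Map one result component to its condition code value. */
cc_t cc_from_result(float r)
{
    if (isnan(r)) return CC_UN;   /* NaN is unordered */
    if (r < 0.0f) return CC_LT;
    if (r > 0.0f) return CC_GT;
    return CC_EQ;                 /* +0.0 and -0.0 both compare equal to 0 */
}

/* Update all four condition code components from a result vector. */
void cc_update(cc_t cc[4], const float result[4])
{
    assert(cc != NULL && result != NULL);
    for (int i = 0; i < 4; ++i)
        cc[i] = cc_from_result(result[i]);
}

/* State at the start of each fragment program execution. */
void cc_reset(cc_t cc[4])
{
    assert(cc != NULL);
    for (int i = 0; i < 4; ++i)
        cc[i] = CC_EQ;
}
```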
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.2, Fragment Program Parameters
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    In addition to using the registers defined in Section 3.11.1, fragment
5bd8deadSopenharmony_ci    programs may also use fragment program parameters in their computation.
5bd8deadSopenharmony_ci    Fragment program parameters are constant during the execution of fragment
5bd8deadSopenharmony_ci    programs, but some parameters may be modified outside the execution of a
5bd8deadSopenharmony_ci    fragment program.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    There are five different types of program parameters:  embedded scalar
5bd8deadSopenharmony_ci    constants, embedded vector constants, named constants, named local
5bd8deadSopenharmony_ci    parameters, and numbered local parameters.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Embedded scalar constants are written as standard floating-point numbers
5bd8deadSopenharmony_ci    with an optional sign designator ("+" or "-") and optional scientific
5bd8deadSopenharmony_ci    notation (e.g., "E+06", meaning "times 10^6").
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Embedded vector constants are written as a comma-separated array of one to
5bd8deadSopenharmony_ci    four scalar constants, surrounded by braces (like a C/C++ array
5bd8deadSopenharmony_ci    initializer).  Vector constants are always treated as 4-component vectors:
5bd8deadSopenharmony_ci    constants with fewer than four components are expanded to four components by
5bd8deadSopenharmony_ci    filling missing y and z components with 0.0 and missing w components with
5bd8deadSopenharmony_ci    1.0.  Thus, the vector constant "{2}" is equivalent to "{2,0,0,1}",
5bd8deadSopenharmony_ci    "{3,4}" is equivalent to "{3,4,0,1}", and "{5,6,7}" is equivalent to
5bd8deadSopenharmony_ci    "{5,6,7,1}".
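
    The expansion rule for embedded vector constants can be sketched in C as
    follows.  The function name expand_vector_constant is hypothetical and is
    used only for illustration.

```c
#include <assert.h>
#include <stddef.h>

/* Expand an embedded vector constant with 1 to 4 components into a full
 * 4-component vector: missing y and z become 0.0, missing w becomes 1.0. */
void expand_vector_constant(const float *src, size_t count, float out[4])
{
    static const float defaults[4] = { 0.0f, 0.0f, 0.0f, 1.0f };
    assert(src != NULL && count >= 1 && count <= 4);
    for (size_t i = 0; i < 4; ++i)
        out[i] = (i < count) ? src[i] : defaults[i];
}
```

    With this sketch, the input {3,4} expands to {3,4,0,1}, matching the
    example above.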
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Named constants allow fragment program instructions to define scalar or
5bd8deadSopenharmony_ci    vector constants that can be referenced by name.  Named constants are
5bd8deadSopenharmony_ci    created using the DEFINE instruction:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        DEFINE pi = 3.1415926535;
5bd8deadSopenharmony_ci        DEFINE color = {0.2, 0.5, 0.8, 1.0};
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The DEFINE instruction associates a constant name with a scalar or vector
5bd8deadSopenharmony_ci    constant value.  Subsequent fragment program instructions that use the
5bd8deadSopenharmony_ci    constant name are equivalent to those using the corresponding constant
5bd8deadSopenharmony_ci    value.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Named local parameters are similar to named vector constants, but their
5bd8deadSopenharmony_ci    values can be modified after the program is loaded.  Local parameters are
5bd8deadSopenharmony_ci    created using the DECLARE instruction:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        DECLARE fog_color1;
5bd8deadSopenharmony_ci        DECLARE fog_color2 = {0.3, 0.6, 0.9, 0.1};
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The DECLARE instruction creates a 4-component vector associated with the
5bd8deadSopenharmony_ci    local parameter name.  Subsequent fragment program instructions
5bd8deadSopenharmony_ci    referencing the local parameter name are processed as though the current
5bd8deadSopenharmony_ci    value of the local parameter vector were specified instead of the
5bd8deadSopenharmony_ci    parameter name.  A DECLARE instruction can optionally specify an initial
5bd8deadSopenharmony_ci    value for the local parameter, which can be either a scalar or vector
5bd8deadSopenharmony_ci    constant.  Scalar constants are expanded to 4-component vectors by
5bd8deadSopenharmony_ci    replicating the scalar value in each component.  The initial value of
5bd8deadSopenharmony_ci    local parameters not initialized by the program is (0,0,0,0).
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    A named local parameter for a specific program can be updated using the
5bd8deadSopenharmony_ci    calls ProgramNamedParameter4fNV or ProgramNamedParameter4fvNV (section
5bd8deadSopenharmony_ci    5.7).  Named local parameters are accessible only by the program in which
5bd8deadSopenharmony_ci    they are defined.  Modifying a local parameter affects only the
5bd8deadSopenharmony_ci    associated program and does not affect local parameters with the same name
5bd8deadSopenharmony_ci    that are found in any other fragment program.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Numbered local parameters are similar to named local parameters, except
5bd8deadSopenharmony_ci    that they are referred to by number and are not declared in fragment
5bd8deadSopenharmony_ci    programs.  Each fragment program object has an array of four-component
5bd8deadSopenharmony_ci    floating-point vectors that can be used by the program.  The number of
5bd8deadSopenharmony_ci    vectors is given by the implementation-dependent constant
5bd8deadSopenharmony_ci    MAX_FRAGMENT_PROGRAM_LOCAL_PARAMETERS_NV, and must be at least 64.
5bd8deadSopenharmony_ci    Numbered local parameters are accessed by a fragment program as members
5bd8deadSopenharmony_ci    of an array called "p".  For example, the instruction
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        MOV R0, p[31];
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    copies the contents of numbered local parameter 31 into temporary register
5bd8deadSopenharmony_ci    R0.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Constant and local parameter names can be arbitrary strings consisting of
5bd8deadSopenharmony_ci    letters (upper or lower-case), numbers, underscores ("_"), and dollar
5bd8deadSopenharmony_ci    signs ("$").  Keywords defined in the grammar (including instruction
5bd8deadSopenharmony_ci    names) cannot be used as constant names, nor can strings that start with
5bd8deadSopenharmony_ci    numbers, or strings that specify valid temporary register or texture
5bd8deadSopenharmony_ci    numbers (e.g., "R0"-"R31", "H0"-"H63", "TEX0"-"TEX15").  A fragment
5bd8deadSopenharmony_ci    program will fail to load if a DEFINE or DECLARE instruction specifies an
5bd8deadSopenharmony_ci    invalid constant or local parameter name.
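
    A partial check of the naming rules above can be sketched in C as
    follows.  The function names are hypothetical.  The sketch does not test
    for grammar keywords, since the full keyword list is not reproduced here.
    It also assumes that an empty string is invalid and that forms with
    leading zeros (such as "R007") count as register names; neither point is
    stated in the text above.

```c
#include <assert.h>
#include <ctype.h>
#include <stdbool.h>
#include <stdlib.h>
#include <string.h>

/* True if name is `prefix` followed by a decimal number in [0, max]. */
static bool is_numbered(const char *name, const char *prefix, long max)
{
    size_t n = strlen(prefix);
    if (strncmp(name, prefix, n) != 0 || !isdigit((unsigned char)name[n]))
        return false;
    char *end;
    long v = strtol(name + n, &end, 10);
    return *end == '\0' && v >= 0 && v <= max;
}

/* Checks character set, leading digit, and register/texture name clashes.
 * Keyword clashes are NOT checked (see the note above). */
bool is_valid_param_name(const char *name)
{
    assert(name != NULL);
    if (name[0] == '\0' || isdigit((unsigned char)name[0]))
        return false;
    for (const char *p = name; *p != '\0'; ++p) {
        unsigned char c = (unsigned char)*p;
        if (!isalnum(c) && c != '_' && c != '$')
            return false;
    }
    return !is_numbered(name, "R", 31) &&
           !is_numbered(name, "H", 63) &&
           !is_numbered(name, "TEX", 15);
}
```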
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    A fragment program will fail to load if an instruction contains a named
5bd8deadSopenharmony_ci    parameter not specified in a previous DEFINE or DECLARE instruction.  A
5bd8deadSopenharmony_ci    fragment program will also fail to load if a DEFINE or DECLARE instruction
5bd8deadSopenharmony_ci    attempts to re-define a named parameter specified in a previous DEFINE or
5bd8deadSopenharmony_ci    DECLARE instruction.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The contents of the fragment program parameters may not be modified by a
5bd8deadSopenharmony_ci    fragment program.  In addition, each fragment program instruction can
5bd8deadSopenharmony_ci    normally use at most one unique program parameter.  The only exception to
5bd8deadSopenharmony_ci    this rule is when all program parameter references specify named or
5bd8deadSopenharmony_ci    embedded constants that, taken together, contain no more than four unique
5bd8deadSopenharmony_ci    scalar values.  For such instructions, the GL will automatically generate
5bd8deadSopenharmony_ci    an equivalent instruction that references a single merged vector constant.
5bd8deadSopenharmony_ci    This merging allows programs to specify instructions like the following:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        Instruction              Equivalent Instruction
5bd8deadSopenharmony_ci        ---------------------    ---------------------------------------
5bd8deadSopenharmony_ci        MAD R0, R1, 2, -1;       MAD R0, R1, {2,-1,0,0}.x, {2,-1,0,0}.y;
5bd8deadSopenharmony_ci        ADD R0, {1,2,3,4}, 4;    ADD R0, {1,2,3,4}.xyzw, {1,2,3,4}.w;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Before counting the number of unique values, any named constants are first
5bd8deadSopenharmony_ci    converted to the equivalent embedded constants.  When generating a
5bd8deadSopenharmony_ci    combined vector constant, the GL does not perform swizzling, component
5bd8deadSopenharmony_ci    selection, negation, or absolute value operations.  The following
5bd8deadSopenharmony_ci    instructions are invalid, as they contain more than four unique scalar
5bd8deadSopenharmony_ci    values.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        Invalid Instructions
5bd8deadSopenharmony_ci        -----------------------------------
5bd8deadSopenharmony_ci        ADD R0, {1,2,3,4}, -4;
5bd8deadSopenharmony_ci        ADD R0, {1,2,3,4}, |-4|;
5bd8deadSopenharmony_ci        ADD R0, {1,2,3,4}, -{-1,-2,-3,-4};
5bd8deadSopenharmony_ci        ADD R0, {1,2,3,4}, {4,5,6,7}.x;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.3, Fragment Program Specification
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Fragment programs are specified as an array of ubytes.  The array is a
5bd8deadSopenharmony_ci    string of ASCII characters encoding the program.  The command
5bd8deadSopenharmony_ci    LoadProgramNV loads a fragment program when the target parameter is
5bd8deadSopenharmony_ci    FRAGMENT_PROGRAM_NV.  The command BindProgramNV enables a fragment program
5bd8deadSopenharmony_ci    for execution.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    At program load time, the program is parsed into a set of tokens possibly
5bd8deadSopenharmony_ci    separated by whitespace.  Spaces, tabs, newlines, carriage returns, and
5bd8deadSopenharmony_ci    comments are considered whitespace.  Comments begin with the character "#"
5bd8deadSopenharmony_ci    and are terminated by a newline, a carriage return, or the end of the
5bd8deadSopenharmony_ci    program array.  Fragment programs are case-sensitive -- upper and lower
5bd8deadSopenharmony_ci    case letters are treated differently.  The proper choice of case can be
5bd8deadSopenharmony_ci    inferred from the grammar.
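
    The comment rule above can be sketched in Python as a pre-pass over the
    program string.  The function name strip_comments is hypothetical, and a
    real loader would more likely skip comments inside its tokenizer; the
    sketch only shows where a comment starts and ends.

```python
def strip_comments(program):
    """Remove '#' comments, keeping the newline or carriage return that ends them."""
    out = []
    in_comment = False
    for ch in program:
        if in_comment:
            # A comment runs to a newline, a carriage return, or the end
            # of the program string.
            if ch in "\r\n":
                in_comment = False
                out.append(ch)
        elif ch == "#":
            in_comment = True
        else:
            out.append(ch)
    return "".join(out)
```

    Because the terminating newline or carriage return is preserved, tokens
    on either side of a comment remain separated by whitespace.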
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The Backus-Naur Form (BNF) grammar below specifies the syntactically valid
5bd8deadSopenharmony_ci    sequences for fragment programs.  The set of valid tokens can be inferred
5bd8deadSopenharmony_ci    from the grammar.  The token "" represents an empty string and is used to
5bd8deadSopenharmony_ci    indicate optional rules.  A program is invalid if it contains any
5bd8deadSopenharmony_ci    undefined tokens or characters.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <program>              ::= <progPrefix> <instructionSequence> "END"
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <progPrefix>           ::= "!!FP1.0"
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <instructionSequence>  ::= <instructionSequence> <instructionStatement>
5bd8deadSopenharmony_ci                             | <instructionStatement>
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <instructionStatement> ::= <instruction> ";"
5bd8deadSopenharmony_ci                             | <constantDefinition> ";"
5bd8deadSopenharmony_ci                             | <localDeclaration> ";"
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <instruction>          ::= <VECTORop-instruction>
5bd8deadSopenharmony_ci                             | <SCALARop-instruction>
5bd8deadSopenharmony_ci                             | <BINSCop-instruction>
5bd8deadSopenharmony_ci                             | <BINop-instruction>
5bd8deadSopenharmony_ci                             | <TRIop-instruction>
5bd8deadSopenharmony_ci                             | <KILop-instruction>
5bd8deadSopenharmony_ci                             | <TEXop-instruction>
5bd8deadSopenharmony_ci                             | <TXDop-instruction>
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <VECTORop-instruction> ::= <VECTORop> <maskedDstReg> ","
5bd8deadSopenharmony_ci                               <vectorSrc>
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <VECTORop>             ::= "DDX"   | "DDX_SAT"
5bd8deadSopenharmony_ci                             | "DDXR"  | "DDXR_SAT"
5bd8deadSopenharmony_ci                             | "DDXH"  | "DDXH_SAT"
5bd8deadSopenharmony_ci                             | "DDXC"  | "DDXC_SAT"
5bd8deadSopenharmony_ci                             | "DDXRC" | "DDXRC_SAT"
5bd8deadSopenharmony_ci                             | "DDXHC" | "DDXHC_SAT"
5bd8deadSopenharmony_ci                             | "DDY"   | "DDY_SAT"
5bd8deadSopenharmony_ci                             | "DDYR"  | "DDYR_SAT"
5bd8deadSopenharmony_ci                             | "DDYH"  | "DDYH_SAT"
5bd8deadSopenharmony_ci                             | "DDYC"  | "DDYC_SAT"
5bd8deadSopenharmony_ci                             | "DDYRC" | "DDYRC_SAT"
5bd8deadSopenharmony_ci                             | "DDYHC" | "DDYHC_SAT"
5bd8deadSopenharmony_ci                             | "FLR"   | "FLR_SAT"
5bd8deadSopenharmony_ci                             | "FLRR"  | "FLRR_SAT"
5bd8deadSopenharmony_ci                             | "FLRH"  | "FLRH_SAT"
5bd8deadSopenharmony_ci                             | "FLRX"  | "FLRX_SAT"
5bd8deadSopenharmony_ci                             | "FLRC"  | "FLRC_SAT"
5bd8deadSopenharmony_ci                             | "FLRRC" | "FLRRC_SAT"
5bd8deadSopenharmony_ci                             | "FLRHC" | "FLRHC_SAT"
5bd8deadSopenharmony_ci                             | "FLRXC" | "FLRXC_SAT"
5bd8deadSopenharmony_ci                             | "FRC"   | "FRC_SAT"
5bd8deadSopenharmony_ci                             | "FRCR"  | "FRCR_SAT"
5bd8deadSopenharmony_ci                             | "FRCH"  | "FRCH_SAT"
5bd8deadSopenharmony_ci                             | "FRCX"  | "FRCX_SAT"
5bd8deadSopenharmony_ci                             | "FRCC"  | "FRCC_SAT"
5bd8deadSopenharmony_ci                             | "FRCRC" | "FRCRC_SAT"
5bd8deadSopenharmony_ci                             | "FRCHC" | "FRCHC_SAT"
5bd8deadSopenharmony_ci                             | "FRCXC" | "FRCXC_SAT"
5bd8deadSopenharmony_ci                             | "LIT"   | "LIT_SAT"
5bd8deadSopenharmony_ci                             | "LITR"  | "LITR_SAT"
5bd8deadSopenharmony_ci                             | "LITH"  | "LITH_SAT"
5bd8deadSopenharmony_ci                             | "LITC"  | "LITC_SAT"
5bd8deadSopenharmony_ci                             | "LITRC" | "LITRC_SAT"
5bd8deadSopenharmony_ci                             | "LITHC" | "LITHC_SAT"
5bd8deadSopenharmony_ci                             | "MOV"   | "MOV_SAT"
5bd8deadSopenharmony_ci                             | "MOVR"  | "MOVR_SAT"
5bd8deadSopenharmony_ci                             | "MOVH"  | "MOVH_SAT"
5bd8deadSopenharmony_ci                             | "MOVX"  | "MOVX_SAT"
5bd8deadSopenharmony_ci                             | "MOVC"  | "MOVC_SAT"
5bd8deadSopenharmony_ci                             | "MOVRC" | "MOVRC_SAT"
5bd8deadSopenharmony_ci                             | "MOVHC" | "MOVHC_SAT"
5bd8deadSopenharmony_ci                             | "MOVXC" | "MOVXC_SAT"
5bd8deadSopenharmony_ci                             | "PK2H"
5bd8deadSopenharmony_ci                             | "PK2US"
5bd8deadSopenharmony_ci                             | "PK4B"
5bd8deadSopenharmony_ci                             | "PK4UB"
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <SCALARop-instruction> ::= <SCALARop> <maskedDstReg> ","
5bd8deadSopenharmony_ci                               <scalarSrc>
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <SCALARop>             ::= "COS"     | "COS_SAT"
5bd8deadSopenharmony_ci                             | "COSR"    | "COSR_SAT"
5bd8deadSopenharmony_ci                             | "COSH"    | "COSH_SAT"
5bd8deadSopenharmony_ci                             | "COSC"    | "COSC_SAT"
5bd8deadSopenharmony_ci                             | "COSRC"   | "COSRC_SAT"
5bd8deadSopenharmony_ci                             | "COSHC"   | "COSHC_SAT"
5bd8deadSopenharmony_ci                             | "EX2"     | "EX2_SAT"
5bd8deadSopenharmony_ci                             | "EX2R"    | "EX2R_SAT"
5bd8deadSopenharmony_ci                             | "EX2H"    | "EX2H_SAT"
5bd8deadSopenharmony_ci                             | "EX2C"    | "EX2C_SAT"
5bd8deadSopenharmony_ci                             | "EX2RC"   | "EX2RC_SAT"
5bd8deadSopenharmony_ci                             | "EX2HC"   | "EX2HC_SAT"
5bd8deadSopenharmony_ci                             | "LG2"     | "LG2_SAT"
5bd8deadSopenharmony_ci                             | "LG2R"    | "LG2R_SAT"
5bd8deadSopenharmony_ci                             | "LG2H"    | "LG2H_SAT"
5bd8deadSopenharmony_ci                             | "LG2C"    | "LG2C_SAT"
5bd8deadSopenharmony_ci                             | "LG2RC"   | "LG2RC_SAT"
5bd8deadSopenharmony_ci                             | "LG2HC"   | "LG2HC_SAT"
5bd8deadSopenharmony_ci                             | "RCP"     | "RCP_SAT"
5bd8deadSopenharmony_ci                             | "RCPR"    | "RCPR_SAT"
5bd8deadSopenharmony_ci                             | "RCPH"    | "RCPH_SAT"
5bd8deadSopenharmony_ci                             | "RCPC"    | "RCPC_SAT"
5bd8deadSopenharmony_ci                             | "RCPRC"   | "RCPRC_SAT"
5bd8deadSopenharmony_ci                             | "RCPHC"   | "RCPHC_SAT"
5bd8deadSopenharmony_ci                             | "RSQ"     | "RSQ_SAT"
5bd8deadSopenharmony_ci                             | "RSQR"    | "RSQR_SAT"
5bd8deadSopenharmony_ci                             | "RSQH"    | "RSQH_SAT"
5bd8deadSopenharmony_ci                             | "RSQC"    | "RSQC_SAT"
5bd8deadSopenharmony_ci                             | "RSQRC"   | "RSQRC_SAT"
5bd8deadSopenharmony_ci                             | "RSQHC"   | "RSQHC_SAT"
5bd8deadSopenharmony_ci                             | "SIN"     | "SIN_SAT"
5bd8deadSopenharmony_ci                             | "SINR"    | "SINR_SAT"
5bd8deadSopenharmony_ci                             | "SINH"    | "SINH_SAT"
5bd8deadSopenharmony_ci                             | "SINC"    | "SINC_SAT"
5bd8deadSopenharmony_ci                             | "SINRC"   | "SINRC_SAT"
5bd8deadSopenharmony_ci                             | "SINHC"   | "SINHC_SAT"
5bd8deadSopenharmony_ci                             | "UP2H"    | "UP2H_SAT"
5bd8deadSopenharmony_ci                             | "UP2HC"   | "UP2HC_SAT"
5bd8deadSopenharmony_ci                             | "UP2US"   | "UP2US_SAT"
5bd8deadSopenharmony_ci                             | "UP2USC"  | "UP2USC_SAT"
5bd8deadSopenharmony_ci                             | "UP4B"    | "UP4B_SAT"
5bd8deadSopenharmony_ci                             | "UP4BC"   | "UP4BC_SAT"
5bd8deadSopenharmony_ci                             | "UP4UB"   | "UP4UB_SAT"
5bd8deadSopenharmony_ci                             | "UP4UBC"  | "UP4UBC_SAT"
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <BINSCop-instruction>  ::= <BINSCop> <maskedDstReg> ","
5bd8deadSopenharmony_ci                               <scalarSrc> "," <scalarSrc>
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <BINSCop>              ::= "POW"   | "POW_SAT"
5bd8deadSopenharmony_ci                             | "POWR"  | "POWR_SAT"
5bd8deadSopenharmony_ci                             | "POWH"  | "POWH_SAT"
5bd8deadSopenharmony_ci                             | "POWC"  | "POWC_SAT"
5bd8deadSopenharmony_ci                             | "POWRC" | "POWRC_SAT"
5bd8deadSopenharmony_ci                             | "POWHC" | "POWHC_SAT"
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <BINop-instruction>    ::= <BINop> <maskedDstReg> ","
5bd8deadSopenharmony_ci                               <vectorSrc> "," <vectorSrc>
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <BINop>                ::= "ADD"   | "ADD_SAT"
5bd8deadSopenharmony_ci                             | "ADDR"  | "ADDR_SAT"
5bd8deadSopenharmony_ci                             | "ADDH"  | "ADDH_SAT"
5bd8deadSopenharmony_ci                             | "ADDX"  | "ADDX_SAT"
5bd8deadSopenharmony_ci                             | "ADDC"  | "ADDC_SAT"
5bd8deadSopenharmony_ci                             | "ADDRC" | "ADDRC_SAT"
5bd8deadSopenharmony_ci                             | "ADDHC" | "ADDHC_SAT"
5bd8deadSopenharmony_ci                             | "ADDXC" | "ADDXC_SAT"
5bd8deadSopenharmony_ci                             | "DP3"   | "DP3_SAT"
5bd8deadSopenharmony_ci                             | "DP3R"  | "DP3R_SAT"
5bd8deadSopenharmony_ci                             | "DP3H"  | "DP3H_SAT"
5bd8deadSopenharmony_ci                             | "DP3X"  | "DP3X_SAT"
5bd8deadSopenharmony_ci                             | "DP3C"  | "DP3C_SAT"
5bd8deadSopenharmony_ci                             | "DP3RC" | "DP3RC_SAT"
5bd8deadSopenharmony_ci                             | "DP3HC" | "DP3HC_SAT"
5bd8deadSopenharmony_ci                             | "DP3XC" | "DP3XC_SAT"
5bd8deadSopenharmony_ci                             | "DP4"   | "DP4_SAT"
5bd8deadSopenharmony_ci                             | "DP4R"  | "DP4R_SAT"
5bd8deadSopenharmony_ci                             | "DP4H"  | "DP4H_SAT"
5bd8deadSopenharmony_ci                             | "DP4X"  | "DP4X_SAT"
5bd8deadSopenharmony_ci                             | "DP4C"  | "DP4C_SAT"
5bd8deadSopenharmony_ci                             | "DP4RC" | "DP4RC_SAT"
5bd8deadSopenharmony_ci                             | "DP4HC" | "DP4HC_SAT"
5bd8deadSopenharmony_ci                             | "DP4XC" | "DP4XC_SAT"
5bd8deadSopenharmony_ci                             | "DST"   | "DST_SAT"
5bd8deadSopenharmony_ci                             | "DSTR"  | "DSTR_SAT"
5bd8deadSopenharmony_ci                             | "DSTH"  | "DSTH_SAT"
5bd8deadSopenharmony_ci                             | "DSTC"  | "DSTC_SAT"
5bd8deadSopenharmony_ci                             | "DSTRC" | "DSTRC_SAT"
5bd8deadSopenharmony_ci                             | "DSTHC" | "DSTHC_SAT"
5bd8deadSopenharmony_ci                             | "MAX"   | "MAX_SAT"
5bd8deadSopenharmony_ci                             | "MAXR"  | "MAXR_SAT"
5bd8deadSopenharmony_ci                             | "MAXH"  | "MAXH_SAT"
5bd8deadSopenharmony_ci                             | "MAXX"  | "MAXX_SAT"
5bd8deadSopenharmony_ci                             | "MAXC"  | "MAXC_SAT"
5bd8deadSopenharmony_ci                             | "MAXRC" | "MAXRC_SAT"
5bd8deadSopenharmony_ci                             | "MAXHC" | "MAXHC_SAT"
5bd8deadSopenharmony_ci                             | "MAXXC" | "MAXXC_SAT"
5bd8deadSopenharmony_ci                             | "MIN"   | "MIN_SAT"
5bd8deadSopenharmony_ci                             | "MINR"  | "MINR_SAT"
5bd8deadSopenharmony_ci                             | "MINH"  | "MINH_SAT"
5bd8deadSopenharmony_ci                             | "MINX"  | "MINX_SAT"
5bd8deadSopenharmony_ci                             | "MINC"  | "MINC_SAT"
5bd8deadSopenharmony_ci                             | "MINRC" | "MINRC_SAT"
5bd8deadSopenharmony_ci                             | "MINHC" | "MINHC_SAT"
5bd8deadSopenharmony_ci                             | "MINXC" | "MINXC_SAT"
5bd8deadSopenharmony_ci                             | "MUL"   | "MUL_SAT"
5bd8deadSopenharmony_ci                             | "MULR"  | "MULR_SAT"
5bd8deadSopenharmony_ci                             | "MULH"  | "MULH_SAT"
5bd8deadSopenharmony_ci                             | "MULX"  | "MULX_SAT"
5bd8deadSopenharmony_ci                             | "MULC"  | "MULC_SAT"
5bd8deadSopenharmony_ci                             | "MULRC" | "MULRC_SAT"
5bd8deadSopenharmony_ci                             | "MULHC" | "MULHC_SAT"
5bd8deadSopenharmony_ci                             | "MULXC" | "MULXC_SAT"
5bd8deadSopenharmony_ci                             | "RFL"   | "RFL_SAT"
5bd8deadSopenharmony_ci                             | "RFLR"  | "RFLR_SAT"
5bd8deadSopenharmony_ci                             | "RFLH"  | "RFLH_SAT"
5bd8deadSopenharmony_ci                             | "RFLC"  | "RFLC_SAT"
5bd8deadSopenharmony_ci                             | "RFLRC" | "RFLRC_SAT"
5bd8deadSopenharmony_ci                             | "RFLHC" | "RFLHC_SAT"
5bd8deadSopenharmony_ci                             | "SEQ"   | "SEQ_SAT"
5bd8deadSopenharmony_ci                             | "SEQR"  | "SEQR_SAT"
5bd8deadSopenharmony_ci                             | "SEQH"  | "SEQH_SAT"
5bd8deadSopenharmony_ci                             | "SEQX"  | "SEQX_SAT"
5bd8deadSopenharmony_ci                             | "SEQC"  | "SEQC_SAT"
5bd8deadSopenharmony_ci                             | "SEQRC" | "SEQRC_SAT"
5bd8deadSopenharmony_ci                             | "SEQHC" | "SEQHC_SAT"
5bd8deadSopenharmony_ci                             | "SEQXC" | "SEQXC_SAT"
5bd8deadSopenharmony_ci                             | "SFL"   | "SFL_SAT"
5bd8deadSopenharmony_ci                             | "SFLR"  | "SFLR_SAT"
5bd8deadSopenharmony_ci                             | "SFLH"  | "SFLH_SAT"
5bd8deadSopenharmony_ci                             | "SFLX"  | "SFLX_SAT"
5bd8deadSopenharmony_ci                             | "SFLC"  | "SFLC_SAT"
5bd8deadSopenharmony_ci                             | "SFLRC" | "SFLRC_SAT"
5bd8deadSopenharmony_ci                             | "SFLHC" | "SFLHC_SAT"
5bd8deadSopenharmony_ci                             | "SFLXC" | "SFLXC_SAT"
5bd8deadSopenharmony_ci                             | "SGE"   | "SGE_SAT"
5bd8deadSopenharmony_ci                             | "SGER"  | "SGER_SAT"
5bd8deadSopenharmony_ci                             | "SGEH"  | "SGEH_SAT"
5bd8deadSopenharmony_ci                             | "SGEX"  | "SGEX_SAT"
5bd8deadSopenharmony_ci                             | "SGEC"  | "SGEC_SAT"
5bd8deadSopenharmony_ci                             | "SGERC" | "SGERC_SAT"
5bd8deadSopenharmony_ci                             | "SGEHC" | "SGEHC_SAT"
5bd8deadSopenharmony_ci                             | "SGEXC" | "SGEXC_SAT"
5bd8deadSopenharmony_ci                             | "SGT"   | "SGT_SAT"
5bd8deadSopenharmony_ci                             | "SGTR"  | "SGTR_SAT"
5bd8deadSopenharmony_ci                             | "SGTH"  | "SGTH_SAT"
5bd8deadSopenharmony_ci                             | "SGTX"  | "SGTX_SAT"
5bd8deadSopenharmony_ci                             | "SGTC"  | "SGTC_SAT"
5bd8deadSopenharmony_ci                             | "SGTRC" | "SGTRC_SAT"
5bd8deadSopenharmony_ci                             | "SGTHC" | "SGTHC_SAT"
5bd8deadSopenharmony_ci                             | "SGTXC" | "SGTXC_SAT"
5bd8deadSopenharmony_ci                             | "SLE"   | "SLE_SAT"
5bd8deadSopenharmony_ci                             | "SLER"  | "SLER_SAT"
5bd8deadSopenharmony_ci                             | "SLEH"  | "SLEH_SAT"
5bd8deadSopenharmony_ci                             | "SLEX"  | "SLEX_SAT"
5bd8deadSopenharmony_ci                             | "SLEC"  | "SLEC_SAT"
5bd8deadSopenharmony_ci                             | "SLERC" | "SLERC_SAT"
5bd8deadSopenharmony_ci                             | "SLEHC" | "SLEHC_SAT"
5bd8deadSopenharmony_ci                             | "SLEXC" | "SLEXC_SAT"
5bd8deadSopenharmony_ci                             | "SLT"   | "SLT_SAT"
5bd8deadSopenharmony_ci                             | "SLTR"  | "SLTR_SAT"
5bd8deadSopenharmony_ci                             | "SLTH"  | "SLTH_SAT"
5bd8deadSopenharmony_ci                             | "SLTX"  | "SLTX_SAT"
5bd8deadSopenharmony_ci                             | "SLTC"  | "SLTC_SAT"
5bd8deadSopenharmony_ci                             | "SLTRC" | "SLTRC_SAT"
5bd8deadSopenharmony_ci                             | "SLTHC" | "SLTHC_SAT"
5bd8deadSopenharmony_ci                             | "SLTXC" | "SLTXC_SAT"
5bd8deadSopenharmony_ci                             | "SNE"   | "SNE_SAT"
5bd8deadSopenharmony_ci                             | "SNER"  | "SNER_SAT"
5bd8deadSopenharmony_ci                             | "SNEH"  | "SNEH_SAT"
5bd8deadSopenharmony_ci                             | "SNEX"  | "SNEX_SAT"
5bd8deadSopenharmony_ci                             | "SNEC"  | "SNEC_SAT"
5bd8deadSopenharmony_ci                             | "SNERC" | "SNERC_SAT"
5bd8deadSopenharmony_ci                             | "SNEHC" | "SNEHC_SAT"
5bd8deadSopenharmony_ci                             | "SNEXC" | "SNEXC_SAT"
5bd8deadSopenharmony_ci                             | "STR"   | "STR_SAT"
5bd8deadSopenharmony_ci                             | "STRR"  | "STRR_SAT"
5bd8deadSopenharmony_ci                             | "STRH"  | "STRH_SAT"
5bd8deadSopenharmony_ci                             | "STRX"  | "STRX_SAT"
5bd8deadSopenharmony_ci                             | "STRC"  | "STRC_SAT"
5bd8deadSopenharmony_ci                             | "STRRC" | "STRRC_SAT"
5bd8deadSopenharmony_ci                             | "STRHC" | "STRHC_SAT"
5bd8deadSopenharmony_ci                             | "STRXC" | "STRXC_SAT"
5bd8deadSopenharmony_ci                             | "SUB"   | "SUB_SAT"
5bd8deadSopenharmony_ci                             | "SUBR"  | "SUBR_SAT"
5bd8deadSopenharmony_ci                             | "SUBH"  | "SUBH_SAT"
5bd8deadSopenharmony_ci                             | "SUBX"  | "SUBX_SAT"
5bd8deadSopenharmony_ci                             | "SUBC"  | "SUBC_SAT"
5bd8deadSopenharmony_ci                             | "SUBRC" | "SUBRC_SAT"
5bd8deadSopenharmony_ci                             | "SUBHC" | "SUBHC_SAT"
5bd8deadSopenharmony_ci                             | "SUBXC" | "SUBXC_SAT"
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <TRIop-instruction>    ::= <TRIop> <maskedDstReg> ","
5bd8deadSopenharmony_ci                               <vectorSrc> "," <vectorSrc> ","
5bd8deadSopenharmony_ci                               <vectorSrc>
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <TRIop>                ::= "MAD"   | "MAD_SAT"
5bd8deadSopenharmony_ci                             | "MADR"  | "MADR_SAT"
5bd8deadSopenharmony_ci                             | "MADH"  | "MADH_SAT"
5bd8deadSopenharmony_ci                             | "MADX"  | "MADX_SAT"
5bd8deadSopenharmony_ci                             | "MADC"  | "MADC_SAT"
5bd8deadSopenharmony_ci                             | "MADRC" | "MADRC_SAT"
5bd8deadSopenharmony_ci                             | "MADHC" | "MADHC_SAT"
5bd8deadSopenharmony_ci                             | "MADXC" | "MADXC_SAT"
5bd8deadSopenharmony_ci                             | "LRP"   | "LRP_SAT"
5bd8deadSopenharmony_ci                             | "LRPR"  | "LRPR_SAT"
5bd8deadSopenharmony_ci                             | "LRPH"  | "LRPH_SAT"
5bd8deadSopenharmony_ci                             | "LRPX"  | "LRPX_SAT"
5bd8deadSopenharmony_ci                             | "LRPC"  | "LRPC_SAT"
5bd8deadSopenharmony_ci                             | "LRPRC" | "LRPRC_SAT"
5bd8deadSopenharmony_ci                             | "LRPHC" | "LRPHC_SAT"
5bd8deadSopenharmony_ci                             | "LRPXC" | "LRPXC_SAT"
5bd8deadSopenharmony_ci                             | "X2D"   | "X2D_SAT"
5bd8deadSopenharmony_ci                             | "X2DR"  | "X2DR_SAT"
5bd8deadSopenharmony_ci                             | "X2DH"  | "X2DH_SAT"
5bd8deadSopenharmony_ci                             | "X2DC"  | "X2DC_SAT"
5bd8deadSopenharmony_ci                             | "X2DRC" | "X2DRC_SAT"
5bd8deadSopenharmony_ci                             | "X2DHC" | "X2DHC_SAT"
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <KILop-instruction>    ::= <KILop> <ccMask>
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <KILop>                ::= "KIL"
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <TEXop-instruction>    ::= <TEXop> <maskedDstReg> ","
5bd8deadSopenharmony_ci                               <vectorSrc> "," <texImageId>
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <TEXop>                ::= "TEX"  | "TEX_SAT"
5bd8deadSopenharmony_ci                             | "TEXC" | "TEXC_SAT"
5bd8deadSopenharmony_ci                             | "TXP"  | "TXP_SAT"
5bd8deadSopenharmony_ci                             | "TXPC" | "TXPC_SAT"
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <TXDop-instruction>    ::= <TXDop> <maskedDstReg> ","
5bd8deadSopenharmony_ci                               <vectorSrc> "," <vectorSrc> ","
5bd8deadSopenharmony_ci                               <vectorSrc> "," <texImageId>
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <TXDop>                ::= "TXD"  | "TXD_SAT"
5bd8deadSopenharmony_ci                             | "TXDC" | "TXDC_SAT"
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <scalarSrc>            ::= <absScalarSrc>
5bd8deadSopenharmony_ci                             | <baseScalarSrc>
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <absScalarSrc>         ::= <negate> "|" <baseScalarSrc> "|"
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <baseScalarSrc>        ::= <signedScalarConstant>
5bd8deadSopenharmony_ci                             | <negate> <namedScalarConstant>
5bd8deadSopenharmony_ci                             | <negate> <vectorConstant> <scalarSuffix>
5bd8deadSopenharmony_ci                             | <negate> <namedLocalParameter> <scalarSuffix>
5bd8deadSopenharmony_ci                             | <negate> <numberedLocal> <scalarSuffix>
5bd8deadSopenharmony_ci                             | <negate> <srcRegister> <scalarSuffix>
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <vectorSrc>            ::= <absVectorSrc>
5bd8deadSopenharmony_ci                             | <baseVectorSrc>
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <absVectorSrc>         ::= <negate> "|" <baseVectorSrc> "|"
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <baseVectorSrc>        ::= <signedScalarConstant>
5bd8deadSopenharmony_ci                             | <negate> <namedScalarConstant>
5bd8deadSopenharmony_ci                             | <negate> <vectorConstant> <scalarSuffix>
5bd8deadSopenharmony_ci                             | <negate> <vectorConstant> <swizzleSuffix>
5bd8deadSopenharmony_ci                             | <negate> <namedLocalParameter> <scalarSuffix>
5bd8deadSopenharmony_ci                             | <negate> <namedLocalParameter> <swizzleSuffix>
5bd8deadSopenharmony_ci                             | <negate> <numberedLocal> <scalarSuffix>
5bd8deadSopenharmony_ci                             | <negate> <numberedLocal> <swizzleSuffix>
5bd8deadSopenharmony_ci                             | <negate> <srcRegister> <scalarSuffix>
5bd8deadSopenharmony_ci                             | <negate> <srcRegister> <swizzleSuffix>
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <maskedDstReg>         ::= <dstRegister> <optionalWriteMask>
5bd8deadSopenharmony_ci                               <optionalCCMask>
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <dstRegister>          ::= <fragTempReg>
5bd8deadSopenharmony_ci                             | <fragOutputReg>
5bd8deadSopenharmony_ci                             | "RC"
5bd8deadSopenharmony_ci                             | "HC"
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <optionalCCMask>       ::= "(" <ccMask> ")"
5bd8deadSopenharmony_ci                             | ""
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <ccMask>               ::= <ccMaskRule> <swizzleSuffix>
5bd8deadSopenharmony_ci                             | <ccMaskRule> <scalarSuffix>
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <ccMaskRule>           ::= "EQ" | "GE" | "GT" | "LE" | "LT" | "NE" |
5bd8deadSopenharmony_ci                               "TR" | "FL"
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <optionalWriteMask>    ::= ""
5bd8deadSopenharmony_ci                             | "." "x"
5bd8deadSopenharmony_ci                             | "."     "y"
5bd8deadSopenharmony_ci                             | "." "x" "y"
5bd8deadSopenharmony_ci                             | "."         "z"
5bd8deadSopenharmony_ci                             | "." "x"     "z"
5bd8deadSopenharmony_ci                             | "."     "y" "z"
5bd8deadSopenharmony_ci                             | "." "x" "y" "z"
5bd8deadSopenharmony_ci                             | "."             "w"
5bd8deadSopenharmony_ci                             | "." "x"         "w"
5bd8deadSopenharmony_ci                             | "."     "y"     "w"
5bd8deadSopenharmony_ci                             | "." "x" "y"     "w"
                             | "."         "z" "w"
                             | "." "x"     "z" "w"
                             | "."     "y" "z" "w"
                             | "." "x" "y" "z" "w"

    <srcRegister>          ::= <fragAttribReg>
                             | <fragTempReg>

    <fragAttribReg>        ::= "f" "[" <fragAttribRegId> "]"

    <fragAttribRegId>      ::= "WPOS" | "COL0" | "COL1" | "FOGC" | "TEX0"
                             | "TEX1" | "TEX2" | "TEX3" | "TEX4" | "TEX5"
                             | "TEX6" | "TEX7"

    <fragTempReg>          ::= <fragF32Reg>
                             | <fragF16Reg>

    <fragF32Reg>           ::= "R0"  | "R1"  | "R2"  | "R3"
                             | "R4"  | "R5"  | "R6"  | "R7"
                             | "R8"  | "R9"  | "R10" | "R11"
                             | "R12" | "R13" | "R14" | "R15"
                             | "R16" | "R17" | "R18" | "R19"
                             | "R20" | "R21" | "R22" | "R23"
                             | "R24" | "R25" | "R26" | "R27"
                             | "R28" | "R29" | "R30" | "R31"

    <fragF16Reg>           ::= "H0"  | "H1"  | "H2"  | "H3"
                             | "H4"  | "H5"  | "H6"  | "H7"
                             | "H8"  | "H9"  | "H10" | "H11"
                             | "H12" | "H13" | "H14" | "H15"
                             | "H16" | "H17" | "H18" | "H19"
                             | "H20" | "H21" | "H22" | "H23"
                             | "H24" | "H25" | "H26" | "H27"
                             | "H28" | "H29" | "H30" | "H31"
                             | "H32" | "H33" | "H34" | "H35"
                             | "H36" | "H37" | "H38" | "H39"
                             | "H40" | "H41" | "H42" | "H43"
                             | "H44" | "H45" | "H46" | "H47"
                             | "H48" | "H49" | "H50" | "H51"
                             | "H52" | "H53" | "H54" | "H55"
                             | "H56" | "H57" | "H58" | "H59"
                             | "H60" | "H61" | "H62" | "H63"

    <fragOutputReg>        ::= "o" "[" <fragOutputRegName> "]"

    <fragOutputRegName>    ::= "COLR" | "COLH" | "DEPR"

    <numberedLocal>        ::= "p" "[" <localNumber> "]"

    <localNumber>          ::= <integer> from 0 to
                               MAX_FRAGMENT_PROGRAM_LOCAL_PARAMETERS_NV - 1

    <scalarSuffix>         ::= "." <component>

    <swizzleSuffix>        ::= ""
                             | "." <component> <component>
                                   <component> <component>

    <component>            ::= "x" | "y" | "z" | "w"

    <texImageId>           ::= <texImageUnit> "," <texImageTarget>

    <texImageUnit>         ::= "TEX0"  | "TEX1"  | "TEX2"  | "TEX3"
                             | "TEX4"  | "TEX5"  | "TEX6"  | "TEX7"
                             | "TEX8"  | "TEX9"  | "TEX10" | "TEX11"
                             | "TEX12" | "TEX13" | "TEX14" | "TEX15"

    <texImageTarget>       ::= "1D" | "2D" | "3D" | "CUBE" | "RECT"

    <constantDefinition>   ::= "DEFINE" <namedVectorConstant> "="
                               <vectorConstant>
                             | "DEFINE" <namedScalarConstant> "="
                               <scalarConstant>

    <localDeclaration>     ::= "DECLARE" <namedLocalParameter>
                               <optionalLocalValue>

    <optionalLocalValue>   ::= ""
                             | "=" <vectorConstant>
                             | "=" <scalarConstant>

    <vectorConstant>       ::= "{" <vectorConstantList> "}"
                             | <namedVectorConstant>

    <vectorConstantList>   ::= <scalarConstant>
                             | <scalarConstant> "," <scalarConstant>
                             | <scalarConstant> "," <scalarConstant> ","
                               <scalarConstant>
                             | <scalarConstant> "," <scalarConstant> ","
                               <scalarConstant> "," <scalarConstant>

    <scalarConstant>       ::= <signedScalarConstant>
                             | <namedScalarConstant>

    <signedScalarConstant> ::= <optionalSign> <floatConstant>

    <namedScalarConstant>  ::= <identifier>    ((name of a scalar constant
                                                 in a DEFINE instruction))

    <namedVectorConstant>  ::= <identifier>    ((name of a vector constant
                                                 in a DEFINE instruction))

    <namedLocalParameter>  ::= <identifier>    ((name of a local parameter
                                                 in a DECLARE instruction))

    <negate>               ::= "-" | "+" | ""

    <optionalSign>         ::= "-" | "+" | ""

    <identifier>           ::= see text below

    <floatConstant>        ::= see text below


    The <identifier> rule matches a sequence of one or more letters ("A"
    through "Z", "a" through "z", "_", and "$") and digits ("0" through "9");
    the first character must be a letter.  The underscore ("_") and dollar
    sign ("$") count as letters.  Upper and lower case letters are different
    (names are case-sensitive).
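
    As an illustration of the <identifier> rule, the following C sketch
    checks a candidate name.  The function names are hypothetical and are
    not part of this extension; the sketch checks only the character rule
    stated above.

```c
#include <stdbool.h>
#include <stddef.h>

/* Letters per the <identifier> rule: A-Z, a-z, "_" and "$". */
bool is_ident_letter(char c)
{
    return (c >= 'A' && c <= 'Z') || (c >= 'a' && c <= 'z') ||
           c == '_' || c == '$';
}

bool is_ident_digit(char c)
{
    return c >= '0' && c <= '9';
}

/* True if s is a non-empty run of letters and digits that starts
 * with a letter.  Hypothetical helper, not a driver entry point. */
bool is_identifier(const char *s)
{
    if (s == NULL || !is_ident_letter(s[0]))
        return false;
    for (size_t i = 1; s[i] != '\0'; i++) {
        if (!is_ident_letter(s[i]) && !is_ident_digit(s[i]))
            return false;
    }
    return true;
}
```

    Under this sketch, "_tmp$1" and "Color" are accepted, while "9lives"
    and the empty string are rejected.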

    The <floatConstant> rule matches a floating-point constant consisting
    of an integer part, a decimal point, a fraction part, an "e" or
    "E", and an optionally signed integer exponent.  The integer and
    fraction parts both consist of a sequence of one or more digits ("0"
    through "9").  Either the integer part or the fraction part (not
    both) may be missing; either the decimal point or the "e" (or "E")
    and the exponent (not both) may be missing.
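
    One possible reading of the <floatConstant> rule is sketched below in
    C.  The function name is hypothetical.  A leading sign is deliberately
    not accepted here, since the grammar handles it separately through
    <optionalSign>.

```c
#include <ctype.h>
#include <stdbool.h>
#include <stddef.h>

/* True if s matches the <floatConstant> rule as described in the text.
 * Hypothetical helper for illustration only. */
bool is_float_constant(const char *s)
{
    size_t i = 0, int_digits = 0, frac_digits = 0;
    bool has_point = false, has_exp = false;

    while (isdigit((unsigned char)s[i])) { i++; int_digits++; }

    if (s[i] == '.') {
        has_point = true;
        i++;
        while (isdigit((unsigned char)s[i])) { i++; frac_digits++; }
    }

    /* The integer and fraction parts may not both be missing. */
    if (int_digits == 0 && frac_digits == 0)
        return false;

    if (s[i] == 'e' || s[i] == 'E') {
        size_t exp_digits = 0;
        has_exp = true;
        i++;
        if (s[i] == '+' || s[i] == '-')
            i++;
        while (isdigit((unsigned char)s[i])) { i++; exp_digits++; }
        if (exp_digits == 0)
            return false;
    }

    /* The decimal point and the exponent may not both be missing. */
    return (has_point || has_exp) && s[i] == '\0';
}
```

    With this reading, "1.5", "1.", ".5", "1e5", and "1.5E-3" match, while
    "1", ".", and "e5" do not.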

    A fragment program fails to load if it contains more than the maximum
    number of executable instructions.  If ARB_fragment_program is supported,
    this limit is the value of MAX_PROGRAM_INSTRUCTIONS_ARB for the
    FRAGMENT_PROGRAM_ARB target.  Otherwise, the limit is 1024.  Executable
    instructions are those matching the <instruction> rule in the grammar, and
    do not include DEFINE or DECLARE instructions.

    A fragment program fails to load if its total temporary and output
    register count exceeds 64.  Each fp32 temporary or output register used by
    the program (R0-R31, o[COLR], and o[DEPR]) counts as two registers; each
    fp16 temporary or output register used by the program (H0-H63 and o[COLH])
    counts as a single register.
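
    The register accounting above can be sketched in C as follows.  The
    RegUsage structure and the function names are hypothetical, and the
    sketch assumes that every R and H register used is counted
    independently.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical summary of the registers a program uses. */
typedef struct {
    uint32_t r_used;       /* bit i set if Ri (fp32) is used */
    uint64_t h_used;       /* bit i set if Hi (fp16) is used */
    bool writes_colr;      /* o[COLR], fp32 */
    bool writes_colh;      /* o[COLH], fp16 */
    bool writes_depr;      /* o[DEPR], fp32 */
} RegUsage;

/* Number of set bits in v. */
int count_bits(uint64_t v)
{
    int n = 0;
    while (v != 0) {
        v &= v - 1;        /* clear the lowest set bit */
        n++;
    }
    return n;
}

/* fp32 registers cost two units each, fp16 registers one unit each. */
int register_cost(const RegUsage *u)
{
    int fp32 = count_bits(u->r_used) +
               (u->writes_colr ? 1 : 0) + (u->writes_depr ? 1 : 0);
    int fp16 = count_bits(u->h_used) + (u->writes_colh ? 1 : 0);
    return 2 * fp32 + fp16;
}

/* True if the program stays within the limit of 64 stated above. */
bool fits_register_budget(const RegUsage *u)
{
    return register_cost(u) <= 64;
}
```

    For example, a program using R0-R2, H0-H1, and o[COLR] costs
    2*4 + 2 = 10 units.  A program using all of R0-R31 already costs 64
    units, so also writing o[COLR] would exceed the limit.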

    A fragment program fails to load if any instruction sources more than one
    unique fragment attribute register.  Instructions sourcing the same
    attribute register multiple times are acceptable.

    A fragment program fails to load if any instruction sources more than one
    unique program parameter register.  Instructions sourcing the same program
    parameter multiple times are acceptable.

    A fragment program fails to load if multiple texture lookup instructions
    reference different targets for the same texture image unit.

    A fragment program fails to load if it writes to both the o[COLR] and
    o[COLH] output registers.

    The error INVALID_OPERATION is generated by LoadProgramNV if a fragment
    program fails to load because it is not syntactically correct or for one
    of the semantic restrictions listed above.

    The error INVALID_OPERATION is generated by LoadProgramNV if a program is
    loaded for id when id is currently loaded with a program of a different
    target.

    A successfully loaded fragment program is parsed into a sequence of
    instructions.  Each instruction is identified by its tokenized name.  The
    operation of these instructions when executed is defined in Sections
    3.11.4 and 3.11.5.


    Section 3.11.4, Fragment Program Operation

    There are forty-five fragment program instructions.  Fragment program
    instructions may have up to sixteen variants, formed by combining an
    optional suffix of "R", "H", or "X" to specify arithmetic precision
    (section 3.11.4.2), an optional suffix of "C" to allow an update of the
    condition code register (section 3.11.4.4), and an optional suffix of
    "_SAT" to clamp the result vector components to the range [0,1] (section
    3.11.4.4).  For example, the sixteen forms of the "ADD" instruction are
    "ADD", "ADDR", "ADDH", "ADDX", "ADDC", "ADDRC", "ADDHC", "ADDXC",
    "ADD_SAT", "ADDR_SAT", "ADDH_SAT", "ADDX_SAT", "ADDC_SAT", "ADDRC_SAT",
    "ADDHC_SAT", and "ADDXC_SAT".
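
    The way the suffixes combine can be sketched in C as follows.  Both
    function names are hypothetical.  The count assumes four precision
    choices (none, "R", "H", "X") where "X" is allowed, and three
    otherwise.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdio.h>
#include <string.h>

/* Build an instruction name from its base opcode and optional
 * suffixes.  precision is 'R', 'H', 'X', or 0 for no suffix.
 * Returns the value of snprintf(). */
int variant_name(char *out, size_t cap, const char *base,
                 char precision, bool cc_update, bool saturate)
{
    char prec[2] = { precision, '\0' };   /* empty string when 0 */
    return snprintf(out, cap, "%s%s%s%s", base, prec,
                    cc_update ? "C" : "", saturate ? "_SAT" : "");
}

/* Number of forms of an instruction that takes all three kinds of
 * suffix: precision choices x [C] x [_SAT]. */
int count_variants(bool allows_x)
{
    int precisions = allows_x ? 4 : 3;
    return precisions * 2 * 2;
}
```

    For "ADD" with precision 'H', a condition code update, and clamping,
    variant_name() produces "ADDHC_SAT"; count_variants(true) gives the
    sixteen forms listed above.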

    Some mathematical instructions that support precision suffixes, typically
    those that involve complicated floating-point computations, do not support
    the "X" precision suffix.

    The fragment program instructions and their respective input and output
    parameters are summarized in Table X.4.

      Instruction          Inputs  Output   Description
      -----------------    ------  ------   --------------------------------
      ADD[RHX][C][_SAT]    v,v     v        add
      COS[RH ][C][_SAT]    s       ssss     cosine
      DDX[RH ][C][_SAT]    v       v        derivative relative to x
      DDY[RH ][C][_SAT]    v       v        derivative relative to y
      DP3[RHX][C][_SAT]    v,v     ssss     3-component dot product
      DP4[RHX][C][_SAT]    v,v     ssss     4-component dot product
      DST[RH ][C][_SAT]    v,v     v        distance vector
      EX2[RH ][C][_SAT]    s       ssss     exponential base 2
      FLR[RHX][C][_SAT]    v       v        floor
      FRC[RHX][C][_SAT]    v       v        fraction
      KIL                  none    none     conditionally discard fragment
      LG2[RH ][C][_SAT]    s       ssss     logarithm base 2
      LIT[RH ][C][_SAT]    v       v        compute light coefficients
      LRP[RHX][C][_SAT]    v,v,v   v        linear interpolation
      MAD[RHX][C][_SAT]    v,v,v   v        multiply and add
      MAX[RHX][C][_SAT]    v,v     v        maximum
      MIN[RHX][C][_SAT]    v,v     v        minimum
      MOV[RHX][C][_SAT]    v       v        move
      MUL[RHX][C][_SAT]    v,v     v        multiply
      PK2H                 v       ssss     pack two 16-bit floats
      PK2US                v       ssss     pack two unsigned 16-bit scalars
      PK4B                 v       ssss     pack four signed 8-bit scalars
      PK4UB                v       ssss     pack four unsigned 8-bit scalars
      POW[RH ][C][_SAT]    s,s     ssss     exponentiation (x^y)
      RCP[RH ][C][_SAT]    s       ssss     reciprocal
      RFL[RH ][C][_SAT]    v,v     v        reflection vector
      RSQ[RH ][C][_SAT]    s       ssss     reciprocal square root
      SEQ[RHX][C][_SAT]    v,v     v        set on equal
      SFL[RHX][C][_SAT]    v,v     v        set on false
      SGE[RHX][C][_SAT]    v,v     v        set on greater than or equal
      SGT[RHX][C][_SAT]    v,v     v        set on greater than
      SIN[RH ][C][_SAT]    s       ssss     sine
      SLE[RHX][C][_SAT]    v,v     v        set on less than or equal
      SLT[RHX][C][_SAT]    v,v     v        set on less than
      SNE[RHX][C][_SAT]    v,v     v        set on not equal
      STR[RHX][C][_SAT]    v,v     v        set on true
      SUB[RHX][C][_SAT]    v,v     v        subtract
      TEX[C][_SAT]         v       v        texture lookup
      TXD[C][_SAT]         v,v,v   v        texture lookup w/partials
      TXP[C][_SAT]         v       v        projective texture lookup
      UP2H[C][_SAT]        s       v        unpack two 16-bit floats
      UP2US[C][_SAT]       s       v        unpack two unsigned 16-bit scalars
      UP4B[C][_SAT]        s       v        unpack four signed 8-bit scalars
      UP4UB[C][_SAT]       s       v        unpack four unsigned 8-bit scalars
      X2D[RH ][C][_SAT]    v,v,v   v        2D coordinate transformation

    Table X.4:  Summary of fragment program instructions.  "[RHX]" indicates
    an optional arithmetic precision suffix.  "[C]" indicates an optional
    condition code update suffix.  "[_SAT]" indicates an optional clamp of
    result vector components to [0,1].  "v" indicates a 4-component vector
    input or output, "s" indicates a scalar input, and "ssss" indicates a
    scalar output replicated across a 4-component vector.


    Section 3.11.4.1:  Fragment Program Storage Precision

    Registers in fragment programs are stored in two different
    representations: 16-bit floating-point (fp16) and 32-bit floating-point
    (fp32).  There is an additional 12-bit fixed-point representation (fx12)
    used only as an internal representation for instructions with the "X"
    precision qualifier.

    In the 32-bit float (fp32) representation, each component is represented
    in floating-point with eight exponent and twenty-three mantissa bits, as
    in the standard IEEE single-precision format.  If S represents the sign (0
    or 1), E represents the exponent in the range [0,255], and M represents
    the mantissa in the range [0,2^23-1], then an fp32 float is decoded as:

       (-1)^S * 0.0,                           if E == 0,
       (-1)^S * 2^(E-127) * (1 + M/2^23),      if 0 < E < 255,
       (-1)^S * INF,                           if E == 255 and M == 0,
       NaN,                                    if E == 255 and M != 0.

    INF (Infinity) is a special representation indicating numerical overflow.
    NaN (Not a Number) is a special representation indicating the result of
    illegal arithmetic operations, such as computing the square root or
    logarithm of a negative number.  Note that all normal fp32 values, zero,
    and INF have an associated sign.  -0.0 and +0.0 are considered equivalent
    for the purposes of comparisons.

    This representation is identical to the IEEE single-precision
    floating-point standard, except that no special representation is provided
    for denorms -- numbers in the range (-2^-126, +2^-126).  All such numbers
    are flushed to zero.
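
    The fp32 decoding rules above can be sketched in C as follows.  The
    function names are hypothetical, and the result is returned as a
    double only so that every fp32 value is represented exactly.

```c
#include <math.h>
#include <stdint.h>

/* 2^n by repeated multiplication; exact for the exponents used here. */
double fp32_pow2(int n)
{
    double r = 1.0;
    for (; n > 0; n--) r *= 2.0;
    for (; n < 0; n++) r *= 0.5;
    return r;
}

/* Decode a 32-bit pattern following the fp32 rules in the text. */
double decode_fp32(uint32_t bits)
{
    uint32_t s = bits >> 31;
    uint32_t e = (bits >> 23) & 0xFFu;
    uint32_t m = bits & 0x7FFFFFu;
    double sign = s ? -1.0 : 1.0;

    if (e == 0)
        return sign * 0.0;          /* zero; denorm patterns flush to zero */
    if (e == 255)
        return m == 0 ? sign * INFINITY : NAN;
    return sign * fp32_pow2((int)e - 127) * (1.0 + m / 8388608.0);
}
```

    For example, 0x3F800000 decodes to 1.0 and 0xC0000000 to -2.0.  The
    pattern 0x00000001, which is a denorm in IEEE single precision,
    decodes to zero under these rules.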

    In a 16-bit float (fp16) register, each component is represented
    similarly, except with only five exponent and ten mantissa bits.  If S
    represents the sign (0 or 1), E represents the exponent in the range
    [0,31], and M represents the mantissa in the range [0,2^10-1], then an
    fp16 float is decoded as:

       (-1)^S * 0.0,                           if E == 0 and M == 0,
       (-1)^S * 2^-14 * M/2^10,                if E == 0 and M != 0,
       (-1)^S * 2^(E-15) * (1 + M/2^10),       if 0 < E < 31,
       (-1)^S * INF,                           if E == 31 and M == 0,
       NaN,                                    if E == 31 and M != 0.

    One important difference is that the fp16 representation, unlike fp32,
    supports denorms to maximize the limited precision of the 16-bit floating
    point encodings.
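
    A corresponding C sketch for the fp16 decoding rules follows.  The
    function names are hypothetical.  Unlike the fp32 case, patterns with
    E == 0 and M != 0 decode to denorms rather than zero.

```c
#include <math.h>
#include <stdint.h>

/* 2^n by repeated multiplication; exact for the exponents used here. */
double fp16_pow2(int n)
{
    double r = 1.0;
    for (; n > 0; n--) r *= 2.0;
    for (; n < 0; n++) r *= 0.5;
    return r;
}

/* Decode a 16-bit pattern following the fp16 rules in the text. */
double decode_fp16(uint16_t bits)
{
    unsigned s = (unsigned)bits >> 15;
    unsigned e = ((unsigned)bits >> 10) & 0x1Fu;
    unsigned m = (unsigned)bits & 0x3FFu;
    double sign = s ? -1.0 : 1.0;

    if (e == 0)                     /* zero when m == 0, denorm otherwise */
        return sign * fp16_pow2(-14) * (m / 1024.0);
    if (e == 31)
        return m == 0 ? sign * INFINITY : NAN;
    return sign * fp16_pow2((int)e - 15) * (1.0 + m / 1024.0);
}
```

    For example, 0x3C00 decodes to 1.0, 0x7BFF decodes to 65504 (the
    largest finite fp16 value), and 0x0001 decodes to the smallest
    positive denorm, 2^-24.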

    In the 12-bit fixed-point (fx12) format, numbers are represented as signed
    12-bit two's complement integers with 10 fraction bits.  The range of
    representable values is [-2048/1024, +2047/1024].

    Section 3.11.4.2:  Fragment Program Operation Precision

    Fragment program instructions frequently perform mathematical operations.
    Such operations may be performed at one of three different precisions.
    Fragment programs can specify the precision of each instruction by using
    the precision suffix.  If an instruction has a suffix of "R", calculations
    are carried out with 32-bit floating point operands and results.  If an
    instruction has a suffix of "H", calculations are carried out using 16-bit
    floating point operands and results.  If an instruction has a suffix of
    "X", calculations are carried out using 12-bit fixed point operands and
    results.  For example, the instruction "MULR" performs a 32-bit
    floating-point multiply, "MULH" performs a 16-bit floating-point multiply,
    and "MULX" performs a 12-bit fixed-point multiply.  If no precision suffix
    is specified, calculations are carried out using the precision of the
    temporary register receiving the result.

    Fragment program instructions may source registers or constants whose
    precisions differ from the precision specified with the instruction.
    Instructions may also generate intermediate results with a different
    precision than that of the destination register.  In these cases, the
    values sourced are converted to the precision specified by the
    instruction.

    When converting to fx12 format, -INF and any values less than -2048/1024
    become -2048/1024.  +INF and any values greater than +2047/1024 become
    +2047/1024.  NaN becomes 0.
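
    The fx12 conversion can be sketched in C as follows.  The function
    names are hypothetical.  The text above does not state how in-range
    values that are not exactly representable are rounded, so
    round-to-nearest is used here purely as an assumption.

```c
#include <math.h>
#include <stdint.h>

/* Convert v to fx12, returned as a signed count of 1/1024 steps in
 * the range [-2048, 2047]. */
int16_t to_fx12(double v)
{
    double scaled;

    if (v != v)
        return 0;                       /* NaN becomes 0 */
    if (v <= -2048.0 / 1024.0)
        return -2048;                   /* includes -INF */
    if (v >= 2047.0 / 1024.0)
        return 2047;                    /* includes +INF */

    scaled = v * 1024.0;
    /* Assumption: round to nearest; the cast truncates toward zero. */
    return (int16_t)(scaled < 0.0 ? scaled - 0.5 : scaled + 0.5);
}

/* Value represented by a raw fx12 step count. */
double fx12_value(int16_t raw)
{
    return raw / 1024.0;
}
```

    For example, 0.5 converts to 512 steps, while 3.0 and +INF both clamp
    to 2047 steps, which represents 2047/1024.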

    When converting to fp16 format, any values less than or equal to -2^16 are
    converted to -INF.  Any values greater than or equal to +2^16 are
    converted to +INF.  -INF, +INF, NaN, -0.0, and +0.0 are unchanged.  Any
    other values that are not exactly representable in fp16 format are
    converted to one of the two nearest representable values.

    When converting to fp32 format, any values less than or equal to -2^128
    are converted to -INF.  Any values greater than or equal to +2^128 are
    converted to +INF.  -INF, +INF, NaN, -0.0, and +0.0 are unchanged.  Any
    other values that are not exactly representable in fp32 format are
    converted to one of the two nearest representable values.

    Fragment program instructions using the fragment attribute registers
    f[FOGC] or f[TEX0] through f[TEX7] will be carried out at full fp32
    precision, regardless of the precision specified by the instruction.

    Section 3.11.4.3:  Fragment Program Operands

    Except for KIL, fragment program instructions operate on either vector or
    scalar operands, indicated in the grammar (see section 3.11.3) by the
    rules <vectorSrc> and <scalarSrc> respectively.

    The basic set of scalar operands is defined by the grammar rule
    <baseScalarSrc>.  Scalar operands can be scalar constants (embedded or
    named), or single components of vector constants, local parameters, or
    registers allowed by the <srcRegister> rule.  A vector component is
    selected by the <scalarSuffix> rule, where the characters "x", "y", "z",
    and "w" select the x, y, z, and w components, respectively, of the vector.

    The basic set of vector operands is defined by the grammar rule
    <baseVectorSrc>.  Vector operands can include vector constants, local
    parameters, or registers allowed by the <srcRegister> rule.

    Basic vector operands can be swizzled according to the <swizzleSuffix>
    rule.  In its most general form, the <swizzleSuffix> rule matches the
    pattern ".????" where each question mark is one of "x", "y", "z", or "w".
    For such patterns, the x, y, z, and w components of the operand are taken
    from the vector components named by the first, second, third, and fourth
    character of the pattern, respectively.  For example, if the swizzle
    suffix is ".yzzx" and the specified source contains {2,8,9,0}, the
    swizzled operand used by the instruction is {8,9,9,2}.  If the
    <swizzleSuffix> rule matches "", it is treated as though it were ".xyzw".
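
    The swizzle operation can be sketched in C as follows.  The Vec4 type
    and the function names are hypothetical, and the suffix is passed
    without its leading "." for simplicity.

```c
#include <stddef.h>

typedef struct { float v[4]; } Vec4;    /* v[0..3] hold x, y, z, w */

/* Map a component character to its index, or -1 if it is not one of
 * "x", "y", "z", "w". */
int component_index(char c)
{
    switch (c) {
    case 'x': return 0;
    case 'y': return 1;
    case 'z': return 2;
    case 'w': return 3;
    default:  return -1;
    }
}

/* Apply a swizzle suffix such as "yzzx" to src.  An empty suffix is
 * treated as "xyzw".  Returns 0 on success, or -1 if the suffix is
 * not empty and not exactly four valid component characters. */
int swizzle(const Vec4 *src, const char *suffix, Vec4 *out)
{
    Vec4 tmp;

    if (suffix[0] == '\0')
        suffix = "xyzw";
    for (int i = 0; i < 4; i++) {
        int idx = component_index(suffix[i]);   /* -1 for '\0' too */
        if (idx < 0)
            return -1;
        tmp.v[i] = src->v[idx];
    }
    if (suffix[4] != '\0')
        return -1;
    *out = tmp;
    return 0;
}
```

    Applied to the example above, swizzling {2,8,9,0} with "yzzx" yields
    {8,9,9,2}.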

    Operands can optionally be negated according to the <negate> rule in
    <baseScalarSrc> or <baseVectorSrc>.  If the <negate> matches "-", each
    value is negated.

    The absolute value of operands can be taken if the <vectorSrc> or
    <scalarSrc> rules match <absScalarSrc> or <absVectorSrc>.  In this case,
    the absolute value of each component is taken.  In addition, if the
    <negate> rule in <absScalarSrc> or <absVectorSrc> matches "-", the result
    is then negated.

    Instructions requiring vector operands can also use scalar operands in the
    case where the <vectorSrc> rule matches <scalarSrc>.  In such cases, a
    4-component vector is produced by replicating the scalar.

    After operands are loaded, they are converted to a data type corresponding
    to the operation precision specified in the fragment program instruction.

    The following pseudo-code spells out the operand generation process.
    "SrcT" and "InstT" refer to the data types of the specified register or
    constant and the instruction, respectively.  "VecSrcT" and "VecInstT"
    refer to 4-component vectors of the corresponding type.  "absolute" is
    TRUE if the operand matches the <absScalarSrc> or <absVectorSrc> rules,
    and FALSE otherwise.  "negateBase" is TRUE if the <negate> rule in
    <baseScalarSrc> or <baseVectorSrc> matches "-" and FALSE otherwise.
    "negateAbs" is TRUE if the <negate> rule in <absScalarSrc> or
    <absVectorSrc> matches "-" and FALSE otherwise.  The ".c***", ".*c**",
    ".**c*", ".***c" modifiers refer to the x, y, z, and w components obtained
    by the swizzle operation.  TypeConvert() is assumed to convert a scalar of
    type SrcT to a scalar of type InstT using the type conversion process
    specified above.

      VecInstT VectorLoad(VecSrcT source)
      {
          VecSrcT srcVal;
          VecInstT convertedVal;

          srcVal.x = source.c***;
          srcVal.y = source.*c**;
          srcVal.z = source.**c*;
          srcVal.w = source.***c;
          if (negateBase) {
              srcVal.x = -srcVal.x;
              srcVal.y = -srcVal.y;
              srcVal.z = -srcVal.z;
              srcVal.w = -srcVal.w;
          }
          if (absolute) {
              srcVal.x = abs(srcVal.x);
              srcVal.y = abs(srcVal.y);
              srcVal.z = abs(srcVal.z);
              srcVal.w = abs(srcVal.w);
          }
          if (negateAbs) {
              srcVal.x = -srcVal.x;
              srcVal.y = -srcVal.y;
              srcVal.z = -srcVal.z;
              srcVal.w = -srcVal.w;
          }

          convertedVal.x = TypeConvert(srcVal.x);
          convertedVal.y = TypeConvert(srcVal.y);
          convertedVal.z = TypeConvert(srcVal.z);
          convertedVal.w = TypeConvert(srcVal.w);
          return convertedVal;
      }

      InstT ScalarLoad(VecSrcT source)
      {
          SrcT srcVal;
          InstT convertedVal;

          srcVal = source.c***;
          if (negateBase) {
              srcVal = -srcVal;
          }
          if (absolute) {
              srcVal = abs(srcVal);
          }
          if (negateAbs) {
              srcVal = -srcVal;
          }

          convertedVal = TypeConvert(srcVal);
          return convertedVal;
      }


    Section 3.11.4.4, Fragment Program Destination Register Update

    Each fragment program instruction, except for KIL, writes a 4-component
    result vector to a single temporary or output register.

    The four components of the result vector are first optionally clamped to
    the range [0,1].  The components will be clamped if and only if the result
    clamp suffix "_SAT" is present in the instruction name.  The instruction
    "ADD_SAT" will clamp the results to [0,1]; the otherwise equivalent
    instruction "ADD" will not.
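
    The optional result clamp can be sketched in C as follows.  The
    Result4 type and the function names are hypothetical.  The handling
    of NaN is not described in the text above; in this sketch a NaN
    component simply passes through unchanged, which is an assumption.

```c
#include <stdbool.h>

typedef struct { float x, y, z, w; } Result4;

/* Clamp one component to the range [0,1]. */
float clamp01(float v)
{
    return v < 0.0f ? 0.0f : (v > 1.0f ? 1.0f : v);
}

/* Clamp all four components only when the instruction name carries
 * the "_SAT" suffix; otherwise return the result unchanged. */
Result4 apply_result_clamp(Result4 r, bool has_sat_suffix)
{
    if (has_sat_suffix) {
        r.x = clamp01(r.x);
        r.y = clamp01(r.y);
        r.z = clamp01(r.z);
        r.w = clamp01(r.w);
    }
    return r;
}
```

    For a result of {1.5, -0.25, 0.5, 2.0}, the "_SAT" form produces
    {1.0, 0.0, 0.5, 1.0}, while the form without the suffix leaves the
    vector as it is.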
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Since the instruction may be carried out at a different precision than the
5bd8deadSopenharmony_ci    destination register, the components of the results vector are then
5bd8deadSopenharmony_ci    converted to the data type corresponding to destination register.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Writes to individual components of the temporary register are controlled
5bd8deadSopenharmony_ci    by two sets of enables: individual component write masks specified as part
5bd8deadSopenharmony_ci    of the instruction and the optional condition code mask.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The component write mask is specified by the <optionalWriteMask> rule
5bd8deadSopenharmony_ci    found in the <maskedDstReg> rule.  If the optional mask is "", all
5bd8deadSopenharmony_ci    components are enabled.  Otherwise, the optional mask names the individual
5bd8deadSopenharmony_ci    components to enable.  The characters "x", "y", "z", and "w" match the x,
5bd8deadSopenharmony_ci    y, z, and w components respectively.  For example, an optional mask of
5bd8deadSopenharmony_ci    ".xzw" indicates that the x, z, and w components should be enabled for
    writing but the y component should not.  The grammar requires that the
    destination register mask components be listed in "xyzw" order.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The optional condition code mask is specified by the <optionalCCMask> rule
5bd8deadSopenharmony_ci    found in the <maskedDstReg> rule.  If <optionalCCMask> matches "", all
5bd8deadSopenharmony_ci    components are enabled.  Otherwise, the condition code register is loaded
5bd8deadSopenharmony_ci    and swizzled according to the swizzling specified by <swizzleSuffix>.
5bd8deadSopenharmony_ci    Each component of the swizzled condition code is tested according to the
5bd8deadSopenharmony_ci    rule given by <ccMaskRule>.  <ccMaskRule> may have the values "EQ", "NE",
    "LT", "GE", "LE", or "GT", which mean to enable writes if the corresponding
5bd8deadSopenharmony_ci    condition code field evaluates to equal, not equal, less than, greater
5bd8deadSopenharmony_ci    than or equal, less than or equal, or greater than, respectively.
5bd8deadSopenharmony_ci    Comparisons involving condition codes of "UN" (unordered) evaluate to true
5bd8deadSopenharmony_ci    for "NE" and false otherwise.  For example, if the condition code is
5bd8deadSopenharmony_ci    (GT,LT,EQ,GT) and the condition code mask is "(NE.zyxw)", the swizzle
    operation will load (EQ,LT,GT,GT) and the mask will thus enable
    writes on the y, z, and w components.  In addition, "TR" always enables
5bd8deadSopenharmony_ci    writes and "FL" always disables writes, regardless of the condition code.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Each component of the destination register is updated with the result of
5bd8deadSopenharmony_ci    the fragment program if and only if the component is enabled for writes by
5bd8deadSopenharmony_ci    both the component write mask and the optional condition code mask.
5bd8deadSopenharmony_ci    Otherwise, the component of the destination register remains unchanged.
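
    The way the two masks combine can be sketched in C.  Everything named in
    this sketch (the enums, "test_cc", "write_enables", the bit layout of the
    write mask, and the representation of a swizzle as four component
    indices) is an assumption made for illustration and is not defined by this
    extension.

```c
#include <stdbool.h>

/* Hypothetical encodings, invented for this sketch. */
typedef enum { CC_EQ, CC_LT, CC_GT, CC_UN } CondCode;
typedef enum { RULE_EQ, RULE_NE, RULE_LT, RULE_GE,
               RULE_LE, RULE_GT, RULE_TR, RULE_FL } CCRule;

/* Test one condition code field against a mask rule.
 * A field of UN passes only the NE and TR rules. */
static bool test_cc(CCRule rule, CondCode field)
{
    switch (rule) {
    case RULE_EQ: return field == CC_EQ;
    case RULE_NE: return field != CC_EQ;
    case RULE_LT: return field == CC_LT;
    case RULE_GE: return field == CC_GT || field == CC_EQ;
    case RULE_LE: return field == CC_LT || field == CC_EQ;
    case RULE_GT: return field == CC_GT;
    case RULE_TR: return true;
    case RULE_FL: return false;
    }
    return false;
}

/* Return a bit mask of components to write (bit 0 = x ... bit 3 = w).
 * write_mask uses the same bit layout; swz[i] names the condition code
 * component that is tested for destination component i. */
static unsigned write_enables(unsigned write_mask, CCRule rule,
                              const CondCode cc[4], const int swz[4])
{
    unsigned enables = 0;
    for (int i = 0; i < 4; i++) {
        bool mask_on = (write_mask >> i) & 1u;
        if (mask_on && test_cc(rule, cc[swz[i]]))
            enables |= 1u << i;
    }
    return enables;
}
```

    For the example above, a condition code of (GT,LT,EQ,GT), a mask of
    "(NE.zyxw)" (indices {2,1,0,3}), and a full write mask yield enables for
    y, z, and w only.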
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    A fragment program instruction can also optionally update the condition
5bd8deadSopenharmony_ci    code register.  The condition code is updated if the condition code
5bd8deadSopenharmony_ci    register update suffix "C" is present in the instruction name.  The
5bd8deadSopenharmony_ci    instruction "ADDC" will update the condition code; the otherwise
5bd8deadSopenharmony_ci    equivalent instruction "ADD" will not.  If condition code updates are
5bd8deadSopenharmony_ci    enabled, each component of the destination register enabled for writes is
5bd8deadSopenharmony_ci    compared to zero.  The corresponding component of the condition code is
5bd8deadSopenharmony_ci    set to "LT", "EQ", or "GT", if the written component is less than, equal
5bd8deadSopenharmony_ci    to, or greater than zero, respectively.  Condition code components are set
5bd8deadSopenharmony_ci    to "UN" if the written component is NaN.  Note that values of -0.0 and
5bd8deadSopenharmony_ci    +0.0 both evaluate to "EQ".  If a component of the destination register is
5bd8deadSopenharmony_ci    not enabled for writes, the corresponding condition code component is
5bd8deadSopenharmony_ci    unchanged.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    In the following example code,
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        # R1=(-2, 0, 2, NaN)              R0                  CC
5bd8deadSopenharmony_ci        MOVC R0, R1;               # ( -2,  0,   2, NaN) (LT,EQ,GT,UN)
5bd8deadSopenharmony_ci        MOVC R0.xyz, R1.yzwx;      # (  0,  2, NaN, NaN) (EQ,GT,UN,UN)
5bd8deadSopenharmony_ci        MOVC R0 (NE), R1.zywx;     # (  0,  0, NaN,  -2) (EQ,EQ,UN,LT)
5bd8deadSopenharmony_ci
    the first instruction writes (-2,0,2,NaN) to R0 and updates the condition
    code to (LT,EQ,GT,UN).  In the second instruction, only the "x", "y", and
    "z" components of R0 and the condition code are updated, so R0 ends up
    with (0,2,NaN,NaN) and the condition code ends up with (EQ,GT,UN,UN).  In
    the third instruction, the condition code mask disables writes to the x
    component (its condition code field is "EQ"), so R0 ends up with
    (0,0,NaN,-2) and the condition code ends up with (EQ,EQ,UN,LT).
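
    The three-instruction example can be traced with a small C simulation.
    This is a sketch under stated assumptions: the "State" struct, the "movc"
    helper, the bit layout of the write mask, and the restriction to an
    unswizzled "NE" condition code mask are all simplifications invented for
    this illustration.

```c
#include <math.h>
#include <stdbool.h>

typedef enum { CC_EQ, CC_LT, CC_GT, CC_UN } CondCode;

typedef struct {
    float    r0[4];   /* destination register R0 */
    CondCode cc[4];   /* condition code register */
} State;

/* Classify a written component relative to zero. */
static CondCode generate_cc(float v)
{
    if (isnan(v))  return CC_UN;
    if (v < 0.0f)  return CC_LT;
    if (v == 0.0f) return CC_EQ;   /* true for both -0.0 and +0.0 */
    return CC_GT;
}

/* MOVC with a source swizzle, a component write mask (bit i = component i),
 * and an optional unswizzled "NE" condition code mask. */
static void movc(State *s, const float src[4], const int swz[4],
                 unsigned write_mask, bool ne_mask)
{
    CondCode old_cc[4];
    for (int i = 0; i < 4; i++)
        old_cc[i] = s->cc[i];      /* the mask tests the pre-update CC */

    for (int i = 0; i < 4; i++) {
        bool enabled = (write_mask >> i) & 1u;
        if (ne_mask && old_cc[i] == CC_EQ)
            enabled = false;
        if (enabled) {
            s->r0[i] = src[swz[i]];
            s->cc[i] = generate_cc(s->r0[i]);
        }
    }
}
```

    Calling "movc" three times with R1 = (-2, 0, 2, NaN), swizzles {0,1,2,3},
    {1,2,3,0}, and {2,1,3,0}, write masks 0xF, 0x7, and 0xF, and the "NE" mask
    enabled only on the last call reproduces the register and condition code
    values listed in the example.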
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The following pseudocode illustrates the process of writing a result
5bd8deadSopenharmony_ci    vector to the destination register.  In the example, "ccMaskRule" refers
5bd8deadSopenharmony_ci    to the condition code mask rule given by <ccMaskRule> (or "" if no rule is
    specified), "instrMask" refers to the component write mask given by the
5bd8deadSopenharmony_ci    <optionalWriteMask> rule, "updatecc" is TRUE if condition code updates are
5bd8deadSopenharmony_ci    enabled, and "clamp01" is TRUE if [0,1] result clamping is enabled.
5bd8deadSopenharmony_ci    "destination" and "cc" refer to the register selected by <dstRegister> and
5bd8deadSopenharmony_ci    the condition code, respectively.
5bd8deadSopenharmony_ci
      boolean TestCC(CondCode field) {
          switch (ccMaskRule) {
          case "EQ":  return (field == "EQ");
          case "NE":  return (field != "EQ");
          case "LT":  return (field == "LT");
          case "GE":  return (field == "GT" || field == "EQ");
          case "LE":  return (field == "LT" || field == "EQ");
          case "GT":  return (field == "GT");
          case "TR":  return TRUE;
          case "FL":  return FALSE;
          case "":    return TRUE;
          }
      }
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      enum GenerateCC(DstT value) {
        if (isnan(value)) {
5bd8deadSopenharmony_ci          return UN;
5bd8deadSopenharmony_ci        } else if (value < 0) {
5bd8deadSopenharmony_ci          return LT;
5bd8deadSopenharmony_ci        } else if (value == 0) {
5bd8deadSopenharmony_ci          return EQ;
5bd8deadSopenharmony_ci        } else {
5bd8deadSopenharmony_ci          return GT;
5bd8deadSopenharmony_ci        }
5bd8deadSopenharmony_ci      }
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      void UpdateDestination(VecDstT destination, VecInstT result)
5bd8deadSopenharmony_ci      {
          // Working copies of the converted result, the merged destination
          // register, and the merged condition code.
5bd8deadSopenharmony_ci          VecDstT resultDst;
5bd8deadSopenharmony_ci          VecDstT merged;
5bd8deadSopenharmony_ci          VecCC   mergedCC;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci          // Clamp the result vector components to [0,1], if requested.
5bd8deadSopenharmony_ci          if (clamp01) {
5bd8deadSopenharmony_ci              if (result.x < 0)      result.x = 0;
5bd8deadSopenharmony_ci              else if (result.x > 1) result.x = 1;
5bd8deadSopenharmony_ci              if (result.y < 0)      result.y = 0;
5bd8deadSopenharmony_ci              else if (result.y > 1) result.y = 1;
5bd8deadSopenharmony_ci              if (result.z < 0)      result.z = 0;
5bd8deadSopenharmony_ci              else if (result.z > 1) result.z = 1;
5bd8deadSopenharmony_ci              if (result.w < 0)      result.w = 0;
5bd8deadSopenharmony_ci              else if (result.w > 1) result.w = 1;
5bd8deadSopenharmony_ci          }
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci          // Convert the result to the type of the destination register.
5bd8deadSopenharmony_ci          resultDst.x = TypeConvert(result.x);
5bd8deadSopenharmony_ci          resultDst.y = TypeConvert(result.y);
5bd8deadSopenharmony_ci          resultDst.z = TypeConvert(result.z);
5bd8deadSopenharmony_ci          resultDst.w = TypeConvert(result.w);
5bd8deadSopenharmony_ci
          // Merge the converted result into the destination register, under
          // control of the compile- and run-time write masks.
          merged = destination;
          mergedCC = cc;
          if (instrMask.x && TestCC(cc.c***)) {
              merged.x = resultDst.x;
              if (updatecc) mergedCC.x = GenerateCC(resultDst.x);
          }
          if (instrMask.y && TestCC(cc.*c**)) {
              merged.y = resultDst.y;
              if (updatecc) mergedCC.y = GenerateCC(resultDst.y);
          }
          if (instrMask.z && TestCC(cc.**c*)) {
              merged.z = resultDst.z;
              if (updatecc) mergedCC.z = GenerateCC(resultDst.z);
          }
          if (instrMask.w && TestCC(cc.***c)) {
              merged.w = resultDst.w;
              if (updatecc) mergedCC.w = GenerateCC(resultDst.w);
          }

          // Write out the new destination register and condition code.
          destination = merged;
          cc = mergedCC;
5bd8deadSopenharmony_ci      }
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.5, Fragment Program Instruction Set
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The following sections describe the instruction set available to fragment
5bd8deadSopenharmony_ci    programs.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.5.1,  ADD:  Add
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The ADD instruction performs a component-wise add of the two operands to
5bd8deadSopenharmony_ci    yield a result vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = VectorLoad(op1);
5bd8deadSopenharmony_ci      result.x = tmp0.x + tmp1.x;
5bd8deadSopenharmony_ci      result.y = tmp0.y + tmp1.y;
5bd8deadSopenharmony_ci      result.z = tmp0.z + tmp1.z;
5bd8deadSopenharmony_ci      result.w = tmp0.w + tmp1.w;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The following special-case rules apply to addition:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      1. "A+B" is always equivalent to "B+A".
5bd8deadSopenharmony_ci      2. NaN + <x> = NaN, for all <x>.
5bd8deadSopenharmony_ci      3. +INF + <x> = +INF, for all <x> except NaN and -INF.
5bd8deadSopenharmony_ci      4. -INF + <x> = -INF, for all <x> except NaN and +INF.
5bd8deadSopenharmony_ci      5. +INF + -INF = NaN.
5bd8deadSopenharmony_ci      6. -0.0 + <x> = <x>, for all <x>.
5bd8deadSopenharmony_ci      7. +0.0 + <x> = <x>, for all <x> except -0.0.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.5.2,  COS:  Cosine
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The COS instruction approximates the cosine of the angle specified by the
5bd8deadSopenharmony_ci    scalar operand and replicates the approximation to all four components of
5bd8deadSopenharmony_ci    the result vector.  The angle is specified in radians and does not have to
5bd8deadSopenharmony_ci    be in the range [0,2*PI].
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp = ScalarLoad(op0);
5bd8deadSopenharmony_ci      result.x = ApproxCosine(tmp);
5bd8deadSopenharmony_ci      result.y = ApproxCosine(tmp);
5bd8deadSopenharmony_ci      result.z = ApproxCosine(tmp);
5bd8deadSopenharmony_ci      result.w = ApproxCosine(tmp);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The approximation function ApproxCosine is accurate to at least 22 bits
5bd8deadSopenharmony_ci    with an angle in the range [0,2*PI].
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      | ApproxCosine(x) - cos(x) | < 1.0 / 2^22, if 0.0 <= x < 2.0 * PI.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The error in the approximation will typically increase with the absolute
5bd8deadSopenharmony_ci    value of the angle when the angle falls outside the range [0,2*PI].
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The following special-case rules apply to cosine approximation:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      1. ApproxCosine(NaN) = NaN.
5bd8deadSopenharmony_ci      2. ApproxCosine(+/-INF) = NaN.
5bd8deadSopenharmony_ci      3. ApproxCosine(+/-0.0) = +1.0.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.5.3,  DDX:  Derivative Relative to X
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The DDX instruction computes approximate partial derivatives of the four
5bd8deadSopenharmony_ci    components of the single operand with respect to the X window coordinate
5bd8deadSopenharmony_ci    to yield a result vector.  The partial derivative is evaluated at the
5bd8deadSopenharmony_ci    center of the pixel.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      f = VectorLoad(op0);
5bd8deadSopenharmony_ci      result = ComputePartialX(f);
5bd8deadSopenharmony_ci
    Note that the partial derivatives obtained by this instruction are
    approximate, and derivative-of-derivative instruction sequences may not
    yield accurate second derivatives.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    For components with partial derivatives that overflow (including +/-INF
5bd8deadSopenharmony_ci    inputs), the resulting partials may be encoded as large floating-point
5bd8deadSopenharmony_ci    numbers instead of +/-INF.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.5.4,  DDY:  Derivative Relative to Y
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The DDY instruction computes approximate partial derivatives of the four
5bd8deadSopenharmony_ci    components of the single operand with respect to the Y window coordinate
5bd8deadSopenharmony_ci    to yield a result vector.  The partial derivative is evaluated at the
5bd8deadSopenharmony_ci    center of the pixel.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      f = VectorLoad(op0);
5bd8deadSopenharmony_ci      result = ComputePartialY(f);
5bd8deadSopenharmony_ci
    Note that the partial derivatives obtained by this instruction are
    approximate, and derivative-of-derivative instruction sequences may not
    yield accurate second derivatives.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    For components with partial derivatives that overflow (including +/-INF
5bd8deadSopenharmony_ci    inputs), the resulting partials may be encoded as large floating-point
5bd8deadSopenharmony_ci    numbers instead of +/-INF.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.5.5,  DP3:  3-Component Dot Product
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The DP3 instruction computes a three component dot product of the two
5bd8deadSopenharmony_ci    operands (using the x, y, and z components) and replicates the dot product
5bd8deadSopenharmony_ci    to all four components of the result vector.
5bd8deadSopenharmony_ci
      tmp0 = VectorLoad(op0);
      tmp1 = VectorLoad(op1);
      result.x = (tmp0.x * tmp1.x) + (tmp0.y * tmp1.y) +
                 (tmp0.z * tmp1.z);
      result.y = (tmp0.x * tmp1.x) + (tmp0.y * tmp1.y) +
                 (tmp0.z * tmp1.z);
      result.z = (tmp0.x * tmp1.x) + (tmp0.y * tmp1.y) +
                 (tmp0.z * tmp1.z);
      result.w = (tmp0.x * tmp1.x) + (tmp0.y * tmp1.y) +
                 (tmp0.z * tmp1.z);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.5.6,  DP4:  4-Component Dot Product
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The DP4 instruction computes a four component dot product of the two
5bd8deadSopenharmony_ci    operands and replicates the dot product to all four components of the
5bd8deadSopenharmony_ci    result vector.
5bd8deadSopenharmony_ci
      tmp0 = VectorLoad(op0);
      tmp1 = VectorLoad(op1);
      result.x = (tmp0.x * tmp1.x) + (tmp0.y * tmp1.y) +
                 (tmp0.z * tmp1.z) + (tmp0.w * tmp1.w);
      result.y = (tmp0.x * tmp1.x) + (tmp0.y * tmp1.y) +
                 (tmp0.z * tmp1.z) + (tmp0.w * tmp1.w);
      result.z = (tmp0.x * tmp1.x) + (tmp0.y * tmp1.y) +
                 (tmp0.z * tmp1.z) + (tmp0.w * tmp1.w);
      result.w = (tmp0.x * tmp1.x) + (tmp0.y * tmp1.y) +
                 (tmp0.z * tmp1.z) + (tmp0.w * tmp1.w);
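
    The replication performed by DP3 and DP4 can be sketched in C.  The "Vec4"
    type and the "splat", "dp3", and "dp4" helpers are invented for this
    illustration; they operate on operands that have already been loaded and
    swizzled, and they use the host's float arithmetic rather than any
    precision defined by this extension.

```c
/* Hypothetical 4-component vector type for this sketch. */
typedef struct { float x, y, z, w; } Vec4;

/* Replicate a scalar to all four components. */
static Vec4 splat(float s)
{
    Vec4 r = { s, s, s, s };
    return r;
}

/* DP3: uses only x, y, and z of each operand. */
static Vec4 dp3(Vec4 a, Vec4 b)
{
    return splat(a.x * b.x + a.y * b.y + a.z * b.z);
}

/* DP4: uses all four components of each operand. */
static Vec4 dp4(Vec4 a, Vec4 b)
{
    return splat(a.x * b.x + a.y * b.y + a.z * b.z + a.w * b.w);
}
```

    With operands (1,2,3,4) and (5,6,7,8), "dp3" produces 38 in every
    component and "dp4" produces 70 in every component.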
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.5.7,  DST:  Distance Vector
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The DST instruction computes a distance vector from two specially-
5bd8deadSopenharmony_ci    formatted operands.  The first operand should be of the form [NA, d^2,
5bd8deadSopenharmony_ci    d^2, NA] and the second operand should be of the form [NA, 1/d, NA, 1/d],
5bd8deadSopenharmony_ci    where NA values are not relevant to the calculation and d is a vector
5bd8deadSopenharmony_ci    length.  If both vectors satisfy these conditions, the result vector will
5bd8deadSopenharmony_ci    be of the form [1.0, d, d^2, 1/d].
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The exact behavior is specified in the following pseudo-code:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = VectorLoad(op1);
5bd8deadSopenharmony_ci      result.x = 1.0;
5bd8deadSopenharmony_ci      result.y = tmp0.y * tmp1.y;
5bd8deadSopenharmony_ci      result.z = tmp0.z;
5bd8deadSopenharmony_ci      result.w = tmp1.w;
5bd8deadSopenharmony_ci
    Given an arbitrary vector, d^2 can be obtained using the DP3 instruction
    (using the same vector for both operands) and 1/d can be obtained from d^2
    using the RSQ instruction.

    This distance vector is useful for per-fragment light attenuation
    calculations:  a DP3 operation involving the distance vector and an
    attenuation constants vector will yield the attenuation factor.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.5.8,  EX2:  Exponential Base 2
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The EX2 instruction approximates 2 raised to the power of the scalar
5bd8deadSopenharmony_ci    operand and replicates it to all four components of the result
5bd8deadSopenharmony_ci    vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp = ScalarLoad(op0);
5bd8deadSopenharmony_ci      result.x = Approx2ToX(tmp);
5bd8deadSopenharmony_ci      result.y = Approx2ToX(tmp);
5bd8deadSopenharmony_ci      result.z = Approx2ToX(tmp);
5bd8deadSopenharmony_ci      result.w = Approx2ToX(tmp);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The approximation function is accurate to at least 22 bits:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      | Approx2ToX(x) - 2^x | < 1.0 / 2^22, if 0.0 <= x < 1.0,
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    and, in general,
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      | Approx2ToX(x) - 2^x | < (1.0 / 2^22) * (2^floor(x)).
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The following special-case rules apply to exponential approximation:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      1. Approx2ToX(NaN) = NaN.
5bd8deadSopenharmony_ci      2. Approx2ToX(-INF) = +0.0.
5bd8deadSopenharmony_ci      3. Approx2ToX(+INF) = +INF.
5bd8deadSopenharmony_ci      4. Approx2ToX(+/-0.0) = +1.0.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.5.9,  FLR:  Floor
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The FLR instruction performs a component-wise floor operation on the
5bd8deadSopenharmony_ci    operand to generate a result vector.  The floor of a value is defined as
5bd8deadSopenharmony_ci    the largest integer less than or equal to the value.  The floor of 2.3 is
5bd8deadSopenharmony_ci    2.0; the floor of -3.6 is -4.0.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp = VectorLoad(op0);
5bd8deadSopenharmony_ci      result.x = floor(tmp.x);
5bd8deadSopenharmony_ci      result.y = floor(tmp.y);
5bd8deadSopenharmony_ci      result.z = floor(tmp.z);
5bd8deadSopenharmony_ci      result.w = floor(tmp.w);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The following special-case rules apply to floor computation:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      1. floor(NaN) = NaN.
5bd8deadSopenharmony_ci      2. floor(<x>) = <x>, for -0.0, +0.0, -INF, and +INF.  In all cases, the
5bd8deadSopenharmony_ci         sign of the result is equal to the sign of the operand.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.5.10,  FRC:  Fraction
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The FRC instruction extracts the fractional portion of each component of
5bd8deadSopenharmony_ci    the operand to generate a result vector.  The fractional portion of a
5bd8deadSopenharmony_ci    component is defined as the result after subtracting off the floor of the
5bd8deadSopenharmony_ci    component (see FLR), and is always in the range [0.00, 1.00).
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    For negative values, the fractional portion is NOT the number written to
5bd8deadSopenharmony_ci    the right of the decimal point -- the fractional portion of -1.7 is not
5bd8deadSopenharmony_ci    0.7 -- it is 0.3.  0.3 is produced by subtracting the floor of -1.7 (-2.0)
5bd8deadSopenharmony_ci    from -1.7.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp = VectorLoad(op0);
5bd8deadSopenharmony_ci      result.x = tmp.x - floor(tmp.x);
5bd8deadSopenharmony_ci      result.y = tmp.y - floor(tmp.y);
5bd8deadSopenharmony_ci      result.z = tmp.z - floor(tmp.z);
5bd8deadSopenharmony_ci      result.w = tmp.w - floor(tmp.w);
5bd8deadSopenharmony_ci
    The following special-case rules, which can be derived from the rules for
    FLR and ADD, apply to fraction computation:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      1. fraction(NaN) = NaN.
5bd8deadSopenharmony_ci      2. fraction(+/-INF) = NaN.
5bd8deadSopenharmony_ci      3. fraction(+/-0.0) = +0.0.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.5.11,  KIL:  Conditionally Discard Fragment
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The KIL instruction is unlike any other instruction in the instruction
5bd8deadSopenharmony_ci    set.  This instruction evaluates components of a swizzled condition code
5bd8deadSopenharmony_ci    using a test expression identical to that used to evaluate condition code
5bd8deadSopenharmony_ci    write masks (Section 3.11.4.4).  If any condition code component evaluates
5bd8deadSopenharmony_ci    to TRUE, the fragment is discarded.  Otherwise, the instruction has no
5bd8deadSopenharmony_ci    effect.  The condition code components are specified, swizzled, and
5bd8deadSopenharmony_ci    evaluated in the same manner as the condition code write mask.
5bd8deadSopenharmony_ci
      if (TestCC(cc.c***) || TestCC(cc.*c**) ||
          TestCC(cc.**c*) || TestCC(cc.***c)) {
          // Discard the fragment.
      } else {
          // Do nothing.
      }
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If the fragment is discarded, it is treated as though it were not produced
5bd8deadSopenharmony_ci    by rasterization.  In particular, none of the per-fragment operations
5bd8deadSopenharmony_ci    (such as stencil tests, blends, stencil, depth, or color buffer writes)
5bd8deadSopenharmony_ci    are performed on the fragment.
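
    The discard decision can be sketched in C as an "any component passes"
    test.  The enum, the rule callbacks, the "kil" helper, and the
    representation of a swizzle as four component indices are all invented
    for this illustration.

```c
#include <stdbool.h>

typedef enum { CC_EQ, CC_LT, CC_GT, CC_UN } CondCode;

/* Two sample rules; the remaining rules follow the same pattern. */
static bool rule_lt(CondCode f) { return f == CC_LT; }
static bool rule_ne(CondCode f) { return f != CC_EQ; }

/* Return true if the fragment should be discarded: the rule passes for at
 * least one component of the swizzled condition code. */
static bool kil(bool (*rule)(CondCode), const CondCode cc[4],
                const int swz[4])
{
    for (int i = 0; i < 4; i++) {
        if (rule(cc[swz[i]]))
            return true;
    }
    return false;
}
```

    A common pattern is to write a value with a condition code update and
    then discard on "LT", which rejects the fragment if any written component
    was negative.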
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.5.12,  LG2:  Logarithm Base 2
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The LG2 instruction approximates the base 2 logarithm of the scalar
5bd8deadSopenharmony_ci    operand and replicates it to all four components of the result vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp = ScalarLoad(op0);
5bd8deadSopenharmony_ci      result.x = ApproxLog2(tmp);
5bd8deadSopenharmony_ci      result.y = ApproxLog2(tmp);
5bd8deadSopenharmony_ci      result.z = ApproxLog2(tmp);
5bd8deadSopenharmony_ci      result.w = ApproxLog2(tmp);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The approximation function is accurate to at least 22 bits:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      | ApproxLog2(x) - log_2(x) | < 1.0 / 2^22.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Note that for large values of x, there are not enough bits in the
5bd8deadSopenharmony_ci    floating-point storage format to represent a result that precisely.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The following special-case rules apply to logarithm approximation:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      1. ApproxLog2(NaN) = NaN.
5bd8deadSopenharmony_ci      2. ApproxLog2(+INF) = +INF.
5bd8deadSopenharmony_ci      3. ApproxLog2(+/-0.0) = -INF.
5bd8deadSopenharmony_ci      4. ApproxLog2(x) = NaN, -INF < x < -0.0.
5bd8deadSopenharmony_ci      5. ApproxLog2(-INF) = NaN.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.5.13,  LIT:  Compute Light Coefficients
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The LIT instruction accelerates per-fragment lighting by computing
5bd8deadSopenharmony_ci    lighting coefficients for ambient, diffuse, and specular light
5bd8deadSopenharmony_ci    contributions.  The "x" component of the operand is assumed to hold a
5bd8deadSopenharmony_ci    diffuse dot product (n dot VP_pli, as in the vertex lighting equations in
5bd8deadSopenharmony_ci    Section 2.13.1).  The "y" component of the operand is assumed to hold a
5bd8deadSopenharmony_ci    specular dot product (n dot h_i).  The "w" component of the operand is
5bd8deadSopenharmony_ci    assumed to hold the specular exponent of the material (s_rm).
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The "x" component of the result vector receives the value that should be
5bd8deadSopenharmony_ci    multiplied by the ambient light/material product (always 1.0).  The "y"
5bd8deadSopenharmony_ci    component of the result vector receives the value that should be
5bd8deadSopenharmony_ci    multiplied by the diffuse light/material product (n dot VP_pli).  The "z"
5bd8deadSopenharmony_ci    component of the result vector receives the value that should be
5bd8deadSopenharmony_ci    multiplied by the specular light/material product (f_i * (n dot h_i) ^
5bd8deadSopenharmony_ci    s_rm).  The "w" component of the result is the constant 1.0.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Negative diffuse and specular dot products are clamped to 0.0, as is done
5bd8deadSopenharmony_ci    in the standard per-vertex lighting operations.  In addition, if the
5bd8deadSopenharmony_ci    diffuse dot product is zero or negative, the specular coefficient is
5bd8deadSopenharmony_ci    forced to zero.
5bd8deadSopenharmony_ci
      tmp = VectorLoad(op0);
      if (tmp.x < 0) tmp.x = 0;
      if (tmp.y < 0) tmp.y = 0;
      result.x = 1.0;
      result.y = tmp.x;
      result.z = (tmp.x > 0) ? ApproxPower(tmp.y, tmp.w) : 0.0;
      result.w = 1.0;
5bd8deadSopenharmony_ci
    The exponentiation approximation used to compute result.z is identical to
    that used in the POW instruction, including errors and the processing of
    any special cases.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.5.14,  LRP:  Linear Interpolation
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The LRP instruction performs a component-wise linear interpolation to
5bd8deadSopenharmony_ci    yield a result vector.  It interpolates between the components of the
5bd8deadSopenharmony_ci    second and third operands, using the first operand as a weight.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = VectorLoad(op1);
5bd8deadSopenharmony_ci      tmp2 = VectorLoad(op2);
5bd8deadSopenharmony_ci      result.x = tmp0.x * tmp1.x + (1 - tmp0.x) * tmp2.x;
5bd8deadSopenharmony_ci      result.y = tmp0.y * tmp1.y + (1 - tmp0.y) * tmp2.y;
5bd8deadSopenharmony_ci      result.z = tmp0.z * tmp1.z + (1 - tmp0.z) * tmp2.z;
5bd8deadSopenharmony_ci      result.w = tmp0.w * tmp1.w + (1 - tmp0.w) * tmp2.w;
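
    The operand order is easy to get backwards, so a short C sketch may help:
    a weight of 1 selects the second operand and a weight of 0 selects the
    third.  The "Vec4" type and the helper names are invented for this
    illustration.

```c
/* Hypothetical 4-component vector type for this sketch. */
typedef struct { float x, y, z, w; } Vec4;

/* One component of LRP: weight w blends a (second operand) with
 * b (third operand). */
static float lrp_component(float w, float a, float b)
{
    return w * a + (1.0f - w) * b;
}

/* Component-wise LRP of three already-loaded operands. */
static Vec4 lrp(Vec4 w, Vec4 a, Vec4 b)
{
    Vec4 r;
    r.x = lrp_component(w.x, a.x, b.x);
    r.y = lrp_component(w.y, a.y, b.y);
    r.z = lrp_component(w.z, a.z, b.z);
    r.w = lrp_component(w.w, a.w, b.w);
    return r;
}
```

    With a weight of 0.25, a second operand of 8, and a third operand of 4,
    the result is 0.25 * 8 + 0.75 * 4 = 5.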
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.5.15,  MAD:  Multiply and Add
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The MAD instruction performs a component-wise multiply of the first two
5bd8deadSopenharmony_ci    operands, and then does a component-wise add of the product to the third
5bd8deadSopenharmony_ci    operand to yield a result vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = VectorLoad(op1);
5bd8deadSopenharmony_ci      tmp2 = VectorLoad(op2);
5bd8deadSopenharmony_ci      result.x = tmp0.x * tmp1.x + tmp2.x;
5bd8deadSopenharmony_ci      result.y = tmp0.y * tmp1.y + tmp2.y;
5bd8deadSopenharmony_ci      result.z = tmp0.z * tmp1.z + tmp2.z;
5bd8deadSopenharmony_ci      result.w = tmp0.w * tmp1.w + tmp2.w;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
    Section 3.11.5.16,  MAX:  Maximum
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The MAX instruction computes component-wise maximums of the values in the
5bd8deadSopenharmony_ci    two operands to yield a result vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = VectorLoad(op1);
5bd8deadSopenharmony_ci      result.x = max(tmp0.x, tmp1.x);
5bd8deadSopenharmony_ci      result.y = max(tmp0.y, tmp1.y);
5bd8deadSopenharmony_ci      result.z = max(tmp0.z, tmp1.z);
5bd8deadSopenharmony_ci      result.w = max(tmp0.w, tmp1.w);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The following special cases apply to the maximum operation:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      1. max(A,B) is always equivalent to max(B,A).
5bd8deadSopenharmony_ci      2. max(NaN, <x>) == NaN, for all <x>.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
    Section 3.11.5.17,  MIN:  Minimum
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The MIN instruction computes component-wise minimums of the values in the
5bd8deadSopenharmony_ci    two operands to yield a result vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = VectorLoad(op1);
5bd8deadSopenharmony_ci      result.x = min(tmp0.x, tmp1.x);
5bd8deadSopenharmony_ci      result.y = min(tmp0.y, tmp1.y);
5bd8deadSopenharmony_ci      result.z = min(tmp0.z, tmp1.z);
5bd8deadSopenharmony_ci      result.w = min(tmp0.w, tmp1.w);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The following special cases apply to the minimum operation:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      1. min(A,B) is always equivalent to min(B,A).
5bd8deadSopenharmony_ci      2. min(NaN, <x>) == NaN, for all <x>.
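
    The NaN rules for MAX and MIN differ from what many host-language
    helpers do.  As a minimal sketch (the names nan_max, nan_min, vec_max,
    and vec_min are hypothetical and not part of this extension), the
    behavior can be modeled in Python as follows.  Python's built-in max()
    and min() are order-dependent when a NaN is present, so the sketch tests
    for NaN explicitly.

```python
import math

def nan_max(a, b):
    """Scalar maximum that yields NaN if either input is NaN."""
    if math.isnan(a) or math.isnan(b):
        return math.nan
    return a if a >= b else b

def nan_min(a, b):
    """Scalar minimum that yields NaN if either input is NaN."""
    if math.isnan(a) or math.isnan(b):
        return math.nan
    return a if a <= b else b

def vec_max(v0, v1):
    """Component-wise maximum of two 4-component vectors."""
    return [nan_max(a, b) for a, b in zip(v0, v1)]

def vec_min(v0, v1):
    """Component-wise minimum of two 4-component vectors."""
    return [nan_min(a, b) for a, b in zip(v0, v1)]
```

    With these helpers, the result is the same regardless of operand order,
    and any NaN component in either operand produces NaN in the matching
    result component.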
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.5.18,  MOV:  Move
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The MOV instruction copies the value of the operand to yield a result
5bd8deadSopenharmony_ci    vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      result = VectorLoad(op0);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.5.19,  MUL:  Multiply
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The MUL instruction performs a component-wise multiply of the two operands
5bd8deadSopenharmony_ci    to yield a result vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = VectorLoad(op1);
5bd8deadSopenharmony_ci      result.x = tmp0.x * tmp1.x;
5bd8deadSopenharmony_ci      result.y = tmp0.y * tmp1.y;
5bd8deadSopenharmony_ci      result.z = tmp0.z * tmp1.z;
5bd8deadSopenharmony_ci      result.w = tmp0.w * tmp1.w;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The following special-case rules apply to multiplication:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      1. "A*B" is always equivalent to "B*A".
5bd8deadSopenharmony_ci      2. NaN * <x> = NaN, for all <x>.
5bd8deadSopenharmony_ci      3. +/-0.0 * +/-INF = NaN.
5bd8deadSopenharmony_ci      4. +/-0.0 * <x> = +/-0.0, for all <x> except -INF, +INF, and NaN.  The
5bd8deadSopenharmony_ci         sign of the result is positive if the signs of the two operands match
5bd8deadSopenharmony_ci         and negative otherwise.
5bd8deadSopenharmony_ci      5. +/-INF * <x> = +/-INF, for all <x> except -0.0, +0.0, and NaN.  The
5bd8deadSopenharmony_ci         sign of the result is positive if the signs of the two operands match
5bd8deadSopenharmony_ci         and negative otherwise.
5bd8deadSopenharmony_ci      6. +1.0 * <x> = <x>, for all <x>.
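
    The multiplication rules above match IEEE 754 arithmetic.  The following
    sketch assumes a host whose floats are IEEE 754 doubles (as is the case
    for CPython on common platforms); the function name mul is hypothetical.
    It only illustrates the special cases and says nothing about the
    precision of an actual implementation.

```python
import math

def mul(v0, v1):
    """Component-wise multiply of two 4-component vectors."""
    return [a * b for a, b in zip(v0, v1)]

# One special case per component:
#   x: +0.0 * +INF   (zero times infinity)
#   y: -0.0 * 5.0    (signed zero; operand signs differ)
#   z: +INF * -2.0   (signed infinity; operand signs differ)
#   w: +1.0 * NaN    (NaN propagates)
result = mul([0.0, -0.0, math.inf, 1.0], [math.inf, 5.0, -2.0, math.nan])
```

    The sign of a zero result can be inspected with math.copysign(1.0, r),
    since -0.0 == +0.0 compares as equal.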
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.5.20,  PK2H:  Pack Two 16-bit Floats
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The PK2H instruction converts the "x" and "y" components of the single
5bd8deadSopenharmony_ci    operand into 16-bit floating-point format, packs the bit representation of
5bd8deadSopenharmony_ci    these two floats into a 32-bit value, and replicates that value to all
5bd8deadSopenharmony_ci    four components of the result vector.  The PK2H instruction can be
5bd8deadSopenharmony_ci    reversed by the UP2H instruction below.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      /* result obtained by combining raw bits of tmp0.x, tmp0.y */
5bd8deadSopenharmony_ci      result.x = RawBits(tmp0.x) | (RawBits(tmp0.y) << 16);
5bd8deadSopenharmony_ci      result.y = RawBits(tmp0.x) | (RawBits(tmp0.y) << 16);
5bd8deadSopenharmony_ci      result.z = RawBits(tmp0.x) | (RawBits(tmp0.y) << 16);
5bd8deadSopenharmony_ci      result.w = RawBits(tmp0.x) | (RawBits(tmp0.y) << 16);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The result must be written to a register with 32-bit components (an "R"
5bd8deadSopenharmony_ci    register, o[COLR], or o[DEPR]).  A fragment program will fail to load if
5bd8deadSopenharmony_ci    any other register type is specified.
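
    As an illustration of the bit packing, the sketch below models PK2H on
    the host.  It assumes the 16-bit format has the same layout as IEEE 754
    binary16, which is what Python's struct "e" format produces; the names
    half_bits and pk2h are hypothetical.

```python
import struct

def half_bits(value):
    """Return the 16-bit pattern of value converted to half precision."""
    return struct.unpack("<H", struct.pack("<e", value))[0]

def pk2h(x, y):
    """Pack x into bits 0..15 and y into bits 16..31, replicated four times."""
    packed = half_bits(x) | (half_bits(y) << 16)
    return [packed] * 4
```

    For example, pk2h(1.0, -2.0) places 0x3C00 (1.0) in the low half and
    0xC000 (-2.0) in the high half.  Note that struct.pack raises
    OverflowError for values too large for half precision, which is a
    property of this sketch and not of the instruction.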
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.5.21,  PK2US:  Pack Two Unsigned 16-bit Scalars
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The PK2US instruction converts the "x" and "y" components of the single
5bd8deadSopenharmony_ci    operand into a packed pair of 16-bit unsigned scalars.  The scalars are
    represented in a bit pattern where all '0' bits correspond to 0.0 and all
    '1' bits correspond to 1.0.  The bit representations of the two converted
5bd8deadSopenharmony_ci    components are packed into a 32-bit value, and that value is replicated to
5bd8deadSopenharmony_ci    all four components of the result vector.  The PK2US instruction can be
5bd8deadSopenharmony_ci    reversed by the UP2US instruction below.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      if (tmp0.x < 0.0) tmp0.x = 0.0;
5bd8deadSopenharmony_ci      if (tmp0.x > 1.0) tmp0.x = 1.0;
5bd8deadSopenharmony_ci      if (tmp0.y < 0.0) tmp0.y = 0.0;
5bd8deadSopenharmony_ci      if (tmp0.y > 1.0) tmp0.y = 1.0;
5bd8deadSopenharmony_ci      us.x = round(65535.0 * tmp0.x);  /* us is a ushort vector */
5bd8deadSopenharmony_ci      us.y = round(65535.0 * tmp0.y);
5bd8deadSopenharmony_ci      /* result obtained by combining raw bits of us. */
5bd8deadSopenharmony_ci      result.x = ((us.x) | (us.y << 16));
5bd8deadSopenharmony_ci      result.y = ((us.x) | (us.y << 16));
5bd8deadSopenharmony_ci      result.z = ((us.x) | (us.y << 16));
5bd8deadSopenharmony_ci      result.w = ((us.x) | (us.y << 16));
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The result must be written to a register with 32-bit components (an "R"
5bd8deadSopenharmony_ci    register, o[COLR], or o[DEPR]).  A fragment program will fail to load if
5bd8deadSopenharmony_ci    any other register type is specified.
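
    The clamp, scale, and pack steps of PK2US can be sketched as follows.
    The pseudocode above does not say how round() breaks ties, so this
    sketch assumes round-half-up; that choice, and the names clamp01,
    round_half_up, and pk2us, are assumptions made for illustration.  NaN
    inputs are not handled.

```python
import math

def clamp01(v):
    """Clamp v to the range [0.0, 1.0]."""
    return min(max(v, 0.0), 1.0)

def round_half_up(v):
    """Round a non-negative value to the nearest integer (ties go up)."""
    return int(math.floor(v + 0.5))

def pk2us(x, y):
    """Pack x into bits 0..15 and y into bits 16..31, replicated four times."""
    us_x = round_half_up(65535.0 * clamp01(x))
    us_y = round_half_up(65535.0 * clamp01(y))
    packed = us_x | (us_y << 16)
    return [packed] * 4
```

    Out-of-range inputs are clamped before conversion, so pk2us(2.0, -1.0)
    yields the same packed value as pk2us(1.0, 0.0).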
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.5.22,  PK4B:  Pack Four Signed 8-bit Scalars
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The PK4B instruction converts the four components of the single operand
5bd8deadSopenharmony_ci    into 8-bit signed quantities.  The signed quantities are represented in a
    bit pattern where all '0' bits correspond to -128/127 and all '1' bits
    correspond to +127/127.  The bit representations of the four converted
5bd8deadSopenharmony_ci    components are packed into a 32-bit value, and that value is replicated to
5bd8deadSopenharmony_ci    all four components of the result vector.  The PK4B instruction can be
5bd8deadSopenharmony_ci    reversed by the UP4B instruction below.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      if (tmp0.x < -128/127) tmp0.x = -128/127;
5bd8deadSopenharmony_ci      if (tmp0.y < -128/127) tmp0.y = -128/127;
5bd8deadSopenharmony_ci      if (tmp0.z < -128/127) tmp0.z = -128/127;
5bd8deadSopenharmony_ci      if (tmp0.w < -128/127) tmp0.w = -128/127;
5bd8deadSopenharmony_ci      if (tmp0.x > +127/127) tmp0.x = +127/127;
5bd8deadSopenharmony_ci      if (tmp0.y > +127/127) tmp0.y = +127/127;
5bd8deadSopenharmony_ci      if (tmp0.z > +127/127) tmp0.z = +127/127;
5bd8deadSopenharmony_ci      if (tmp0.w > +127/127) tmp0.w = +127/127;
5bd8deadSopenharmony_ci      ub.x = round(127.0 * tmp0.x + 128.0);  /* ub is a ubyte vector */
5bd8deadSopenharmony_ci      ub.y = round(127.0 * tmp0.y + 128.0);
5bd8deadSopenharmony_ci      ub.z = round(127.0 * tmp0.z + 128.0);
5bd8deadSopenharmony_ci      ub.w = round(127.0 * tmp0.w + 128.0);
5bd8deadSopenharmony_ci      /* result obtained by combining raw bits of ub. */
5bd8deadSopenharmony_ci      result.x = ((ub.x) | (ub.y << 8) | (ub.z << 16) | (ub.w << 24));
5bd8deadSopenharmony_ci      result.y = ((ub.x) | (ub.y << 8) | (ub.z << 16) | (ub.w << 24));
5bd8deadSopenharmony_ci      result.z = ((ub.x) | (ub.y << 8) | (ub.z << 16) | (ub.w << 24));
5bd8deadSopenharmony_ci      result.w = ((ub.x) | (ub.y << 8) | (ub.z << 16) | (ub.w << 24));
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The result must be written to a register with 32-bit components (an "R"
5bd8deadSopenharmony_ci    register, o[COLR], or o[DEPR]).  A fragment program will fail to load if
5bd8deadSopenharmony_ci    any other register type is specified.
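
    The biased encoding used by PK4B can be sketched as follows: each
    component is clamped to [-128/127, +127/127], scaled by 127, and offset
    by 128 so that it fits in an unsigned byte.  As with the other pack
    sketches, round-half-up is an assumption, and the names LO, HI, to_byte,
    and pk4b are hypothetical.  NaN inputs are not handled.

```python
import math

LO = -128.0 / 127.0   # value encoded by the byte 0x00
HI = 127.0 / 127.0    # value encoded by the byte 0xFF

def to_byte(v):
    """Clamp v to [LO, HI] and convert it to the biased 8-bit encoding."""
    v = min(max(v, LO), HI)
    return int(math.floor(127.0 * v + 128.0 + 0.5))

def pk4b(x, y, z, w):
    """Pack four biased bytes (x in bits 0..7, w in bits 24..31)."""
    packed = (to_byte(x) | (to_byte(y) << 8) |
              (to_byte(z) << 16) | (to_byte(w) << 24))
    return [packed] * 4
```

    Under this encoding 0.0 maps to the byte 0x80, so pk4b(0.0, 0.0, 0.0,
    0.0) produces 0x80808080 in every component.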
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.5.23,  PK4UB:  Pack Four Unsigned 8-bit Scalars
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The PK4UB instruction converts the four components of the single operand
5bd8deadSopenharmony_ci    into a packed grouping of 8-bit unsigned scalars.  The scalars are
    represented in a bit pattern where all '0' bits correspond to 0.0 and all
    '1' bits correspond to 1.0.  The bit representations of the four
5bd8deadSopenharmony_ci    converted components are packed into a 32-bit value, and that value is
5bd8deadSopenharmony_ci    replicated to all four components of the result vector.  The PK4UB
5bd8deadSopenharmony_ci    instruction can be reversed by the UP4UB instruction below.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      if (tmp0.x < 0.0) tmp0.x = 0.0;
5bd8deadSopenharmony_ci      if (tmp0.x > 1.0) tmp0.x = 1.0;
5bd8deadSopenharmony_ci      if (tmp0.y < 0.0) tmp0.y = 0.0;
5bd8deadSopenharmony_ci      if (tmp0.y > 1.0) tmp0.y = 1.0;
5bd8deadSopenharmony_ci      if (tmp0.z < 0.0) tmp0.z = 0.0;
5bd8deadSopenharmony_ci      if (tmp0.z > 1.0) tmp0.z = 1.0;
5bd8deadSopenharmony_ci      if (tmp0.w < 0.0) tmp0.w = 0.0;
5bd8deadSopenharmony_ci      if (tmp0.w > 1.0) tmp0.w = 1.0;
5bd8deadSopenharmony_ci      ub.x = round(255.0 * tmp0.x);  /* ub is a ubyte vector */
5bd8deadSopenharmony_ci      ub.y = round(255.0 * tmp0.y);
5bd8deadSopenharmony_ci      ub.z = round(255.0 * tmp0.z);
5bd8deadSopenharmony_ci      ub.w = round(255.0 * tmp0.w);
5bd8deadSopenharmony_ci      /* result obtained by combining raw bits of ub. */
5bd8deadSopenharmony_ci      result.x = ((ub.x) | (ub.y << 8) | (ub.z << 16) | (ub.w << 24));
5bd8deadSopenharmony_ci      result.y = ((ub.x) | (ub.y << 8) | (ub.z << 16) | (ub.w << 24));
5bd8deadSopenharmony_ci      result.z = ((ub.x) | (ub.y << 8) | (ub.z << 16) | (ub.w << 24));
5bd8deadSopenharmony_ci      result.w = ((ub.x) | (ub.y << 8) | (ub.z << 16) | (ub.w << 24));
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The result must be written to a register with 32-bit components (an "R"
5bd8deadSopenharmony_ci    register, o[COLR], or o[DEPR]).  A fragment program will fail to load if
5bd8deadSopenharmony_ci    any other register type is specified.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.5.24,  POW:  Exponentiation
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The POW instruction approximates the value of the first scalar operand
5bd8deadSopenharmony_ci    raised to the power of the second scalar operand and replicates it to all
5bd8deadSopenharmony_ci    four components of the result vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = ScalarLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = ScalarLoad(op1);
5bd8deadSopenharmony_ci      result.x = ApproxPower(tmp0, tmp1);
5bd8deadSopenharmony_ci      result.y = ApproxPower(tmp0, tmp1);
5bd8deadSopenharmony_ci      result.z = ApproxPower(tmp0, tmp1);
5bd8deadSopenharmony_ci      result.w = ApproxPower(tmp0, tmp1);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The exponentiation approximation function is defined in terms of the base
5bd8deadSopenharmony_ci    2 exponentiation and logarithm approximation operations in the EX2 and LG2
5bd8deadSopenharmony_ci    instructions, including errors and the processing of any special cases.
5bd8deadSopenharmony_ci    In particular,
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      ApproxPower(a,b) = ApproxExp2(b * ApproxLog2(a)).
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The following special-case rules, which can be derived from the rules in
5bd8deadSopenharmony_ci    the LG2, MUL, and EX2 instructions, apply to exponentiation:
5bd8deadSopenharmony_ci
      1. ApproxPower(<x>, <y>) = NaN, if x < -0.0.
5bd8deadSopenharmony_ci      2. ApproxPower(<x>, <y>) = NaN, if x or y is NaN.
5bd8deadSopenharmony_ci      3. ApproxPower(+/-0.0, +/-0.0) = NaN.
5bd8deadSopenharmony_ci      4. ApproxPower(+INF, +/-0.0) = NaN.
5bd8deadSopenharmony_ci      5. ApproxPower(+1.0, +/-INF) = NaN.
5bd8deadSopenharmony_ci      6. ApproxPower(+/-0.0, <x>) = +0.0, if x > +0.0.
5bd8deadSopenharmony_ci      7. ApproxPower(+/-0.0, <x>) = +INF, if x < -0.0.
5bd8deadSopenharmony_ci      8. ApproxPower(+1.0, <x>)   = +1.0, if -INF < x < +INF.
5bd8deadSopenharmony_ci      9. ApproxPower(+INF, <x>) = +INF, if x > +0.0.
      10. ApproxPower(+INF, <x>) = +0.0, if x < -0.0.
5bd8deadSopenharmony_ci      11. ApproxPower(<x>, +/-0.0) = +1.0, if +0.0 < x < +INF.
5bd8deadSopenharmony_ci      12. ApproxPower(<x>, +1.0) ~= <x>, if x >= +0.0.
      13. ApproxPower(<x>, +INF) = +0.0, if -0.0 <= x < +1.0;
                                   +INF, if x > +1.0.
      14. ApproxPower(<x>, -INF) = +INF, if -0.0 <= x < +1.0;
                                   +0.0, if x > +1.0.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Note that 0^0 is defined here as NaN, since ApproxLog2(0) = -INF, and
5bd8deadSopenharmony_ci    0*(-INF) = NaN.  In many other applications, including the standard C
5bd8deadSopenharmony_ci    pow() function, 0^0 is defined as 1.0.  This behavior can be emulated
    using additional instructions in much the same way that the pow()
5bd8deadSopenharmony_ci    function is implemented on many CPUs.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Note that a logarithm is involved even if the exponent is an integer.
    This means that any exponentiation with a negative base will produce NaN.
    In contrast, it is possible in a "normal" mathematical formulation to
    raise negative numbers to integral powers (e.g., (-3)^2 == 9, and
    (-0.5)^-2 == 4).
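
    The composition ApproxPower(a,b) = ApproxExp2(b * ApproxLog2(a)) can be
    sketched on the host as follows.  The helper names approx_log2,
    approx_exp2, and approx_power are hypothetical, and the special-case
    handling of the logarithm (NaN for negative inputs, -INF for zero) is
    taken from the discussion above.  The sketch uses full host precision,
    so it models only the special cases, not the approximation error.

```python
import math

def approx_log2(a):
    """Base-2 logarithm with NaN for negative input and -INF for zero."""
    if math.isnan(a) or a < 0.0:
        return math.nan
    if a == 0.0:               # matches both +0.0 and -0.0
        return -math.inf
    return math.log2(a)        # log2(+INF) is +INF

def approx_exp2(x):
    """Base-2 exponential that overflows to +INF instead of raising."""
    if math.isnan(x):
        return math.nan
    try:
        return 2.0 ** x
    except OverflowError:
        return math.inf

def approx_power(a, b):
    """Exponentiation via exp2(b * log2(a)); IEEE 0 * INF gives NaN."""
    return approx_exp2(b * approx_log2(a))
```

    With this formulation approx_power(0.0, 0.0) and approx_power(-3.0, 2.0)
    both return NaN, which is the behavior the two notes above describe.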
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.5.25,  RCP:  Reciprocal
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The RCP instruction approximates the reciprocal of the scalar operand and
5bd8deadSopenharmony_ci    replicates it to all four components of the result vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp = ScalarLoad(op0);
5bd8deadSopenharmony_ci      result.x = ApproxReciprocal(tmp);
5bd8deadSopenharmony_ci      result.y = ApproxReciprocal(tmp);
5bd8deadSopenharmony_ci      result.z = ApproxReciprocal(tmp);
5bd8deadSopenharmony_ci      result.w = ApproxReciprocal(tmp);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The approximation function is accurate to at least 22 bits:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      | ApproxReciprocal(x) - (1/x) | < 1.0 / 2^22, if 1.0 <= x < 2.0.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The following special-case rules apply to reciprocation:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      1. ApproxReciprocal(NaN) = NaN.
5bd8deadSopenharmony_ci      2. ApproxReciprocal(+INF) = +0.0.
5bd8deadSopenharmony_ci      3. ApproxReciprocal(-INF) = -0.0.
5bd8deadSopenharmony_ci      4. ApproxReciprocal(+0.0) = +INF.
5bd8deadSopenharmony_ci      5. ApproxReciprocal(-0.0) = -INF.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.5.26,  RFL:  Reflection Vector
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The RFL instruction computes the reflection of the second vector operand
5bd8deadSopenharmony_ci    (the "direction" vector) about the vector specified by the first vector
5bd8deadSopenharmony_ci    operand (the "axis" vector).  Both operands are treated as 3D vectors (the
5bd8deadSopenharmony_ci    w components are ignored).  The result vector is another 3D vector (the
5bd8deadSopenharmony_ci    "reflected direction" vector).  The length of the result vector, ignoring
5bd8deadSopenharmony_ci    rounding errors, should equal that of the second operand.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      axis = VectorLoad(op0);
5bd8deadSopenharmony_ci      direction = VectorLoad(op1);
5bd8deadSopenharmony_ci      tmp.w = (axis.x * axis.x + axis.y * axis.y +
5bd8deadSopenharmony_ci               axis.z * axis.z);
5bd8deadSopenharmony_ci      tmp.x = (axis.x * direction.x + axis.y * direction.y +
5bd8deadSopenharmony_ci               axis.z * direction.z);
5bd8deadSopenharmony_ci      tmp.x = 2.0 * tmp.x;
5bd8deadSopenharmony_ci      tmp.x = tmp.x / tmp.w;
5bd8deadSopenharmony_ci      result.x = tmp.x * axis.x - direction.x;
5bd8deadSopenharmony_ci      result.y = tmp.x * axis.y - direction.y;
5bd8deadSopenharmony_ci      result.z = tmp.x * axis.z - direction.z;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    A fragment program will fail to load if the w component of the result is
5bd8deadSopenharmony_ci    enabled in the component write mask (see the <optionalWriteMask> rule in
5bd8deadSopenharmony_ci    the grammar).
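
    The reflection computation can be sketched on the host as follows; the
    function name rfl is hypothetical.  Because the projection is divided by
    the squared length of the axis, the axis does not need to be normalized.
    A zero-length axis raises ZeroDivisionError in this sketch, which is a
    property of Python and not of the instruction.

```python
def rfl(axis, direction):
    """Reflect direction about axis; only x, y, z are used."""
    ax, ay, az = axis[:3]
    dx, dy, dz = direction[:3]
    axis_len_sq = ax * ax + ay * ay + az * az
    scale = 2.0 * (ax * dx + ay * dy + az * dz) / axis_len_sq
    return [scale * ax - dx, scale * ay - dy, scale * az - dz]
```

    For example, reflecting (1, 0, 1) about the z axis gives (-1, 0, 1), and
    scaling the axis to (0, 0, 2) leaves the result unchanged.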
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.5.27,  RSQ:  Reciprocal Square Root
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The RSQ instruction approximates the reciprocal of the square root of the
5bd8deadSopenharmony_ci    scalar operand and replicates it to all four components of the result
5bd8deadSopenharmony_ci    vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp = ScalarLoad(op0);
5bd8deadSopenharmony_ci      result.x = ApproxRSQRT(tmp);
5bd8deadSopenharmony_ci      result.y = ApproxRSQRT(tmp);
5bd8deadSopenharmony_ci      result.z = ApproxRSQRT(tmp);
5bd8deadSopenharmony_ci      result.w = ApproxRSQRT(tmp);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The approximation function is accurate to at least 22 bits:
5bd8deadSopenharmony_ci
      | ApproxRSQRT(x) - (1/sqrt(x)) | < 1.0 / 2^22, if 1.0 <= x < 4.0.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The following special-case rules apply to reciprocal square roots:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      1. ApproxRSQRT(NaN) = NaN.
5bd8deadSopenharmony_ci      2. ApproxRSQRT(+INF) = +0.0.
5bd8deadSopenharmony_ci      3. ApproxRSQRT(-INF) = NaN.
5bd8deadSopenharmony_ci      4. ApproxRSQRT(+0.0) = +INF.
5bd8deadSopenharmony_ci      5. ApproxRSQRT(-0.0) = -INF.
5bd8deadSopenharmony_ci      6. ApproxRSQRT(x) = NaN, if -INF < x < -0.0.
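
    The special cases for the reciprocal square root can be sketched as
    follows.  The function name approx_rsqrt is hypothetical, and the
    general case uses full host precision, so the sketch models only the
    special-case rules and not the 22-bit approximation.

```python
import math

def approx_rsqrt(x):
    """Reciprocal square root with the special cases listed above."""
    if math.isnan(x):
        return math.nan
    if x == 0.0:
        # +0.0 gives +INF and -0.0 gives -INF.
        return math.copysign(math.inf, x)
    if x < 0.0:
        # Covers -INF and all other negative values.
        return math.nan
    if math.isinf(x):
        return 0.0
    return 1.0 / math.sqrt(x)
```

    The zero test comes before the sign test because -0.0 compares equal to
    0.0 and would otherwise not be distinguishable from +0.0.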
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.5.28,  SEQ:  Set on Equal To
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The SEQ instruction performs a component-wise comparison of the two
5bd8deadSopenharmony_ci    operands.  Each component of the result vector is 1.0 if the corresponding
5bd8deadSopenharmony_ci    component of the first operand is equal to that of the second, and 0.0
5bd8deadSopenharmony_ci    otherwise.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = VectorLoad(op1);
5bd8deadSopenharmony_ci      result.x = (tmp0.x == tmp1.x) ? 1.0 : 0.0;
5bd8deadSopenharmony_ci      result.y = (tmp0.y == tmp1.y) ? 1.0 : 0.0;
5bd8deadSopenharmony_ci      result.z = (tmp0.z == tmp1.z) ? 1.0 : 0.0;
5bd8deadSopenharmony_ci      result.w = (tmp0.w == tmp1.w) ? 1.0 : 0.0;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The following special-case rules apply to SEQ:
5bd8deadSopenharmony_ci
      1. (<x> == <y>) and (<y> == <x>) always produce the same result.
      2. (NaN == <x>) is FALSE for all <x>, including NaN.
      3. (+INF == +INF) and (-INF == -INF) are TRUE.
      4. (-0.0 == +0.0) and (+0.0 == -0.0) are TRUE.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.5.29,  SFL:  Set on False
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The SFL instruction is a degenerate case of the other "Set on"
5bd8deadSopenharmony_ci    instructions that sets all components of the result vector to
5bd8deadSopenharmony_ci    0.0.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      result.x = 0.0;
5bd8deadSopenharmony_ci      result.y = 0.0;
5bd8deadSopenharmony_ci      result.z = 0.0;
5bd8deadSopenharmony_ci      result.w = 0.0;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.5.30,  SGE:  Set on Greater Than or Equal
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The SGE instruction performs a component-wise comparison of the two
5bd8deadSopenharmony_ci    operands.  Each component of the result vector is 1.0 if the corresponding
    component of the first operand is greater than or equal to that of the
5bd8deadSopenharmony_ci    second, and 0.0 otherwise.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = VectorLoad(op1);
5bd8deadSopenharmony_ci      result.x = (tmp0.x >= tmp1.x) ? 1.0 : 0.0;
5bd8deadSopenharmony_ci      result.y = (tmp0.y >= tmp1.y) ? 1.0 : 0.0;
5bd8deadSopenharmony_ci      result.z = (tmp0.z >= tmp1.z) ? 1.0 : 0.0;
5bd8deadSopenharmony_ci      result.w = (tmp0.w >= tmp1.w) ? 1.0 : 0.0;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The following special-case rules apply to SGE:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      1. (NaN >= <x>) and (<x> >= NaN) are FALSE for all <x>.
5bd8deadSopenharmony_ci      2. (+INF >= +INF) and (-INF >= -INF) are TRUE.
5bd8deadSopenharmony_ci      3. (-0.0 >= +0.0) and (+0.0 >= -0.0) are TRUE.
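
    The "Set on" comparisons all share one shape: compare component-wise and
    write 1.0 or 0.0.  The sketch below models this with a generic helper
    and shows SGE as one instance.  The names set_on and sge are
    hypothetical, and the sketch assumes host floats that follow IEEE 754
    comparison rules, as Python floats do.

```python
import math
import operator

def set_on(compare, v0, v1):
    """Apply compare to each component pair; return 1.0 or 0.0 per component."""
    return [1.0 if compare(a, b) else 0.0 for a, b in zip(v0, v1)]

def sge(v0, v1):
    """Set on Greater Than or Equal."""
    return set_on(operator.ge, v0, v1)

# One case per component: NaN operand, equal infinities, signed zeros,
# and an ordinary pair where the first value is smaller.
result = sge([math.nan, math.inf, -0.0, 1.0], [1.0, math.inf, 0.0, 2.0])
```

    Passing operator.eq, operator.gt, operator.le, operator.lt, or
    operator.ne to set_on gives the corresponding model of the other
    comparison instructions.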
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.5.31,  SGT:  Set on Greater Than
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The SGT instruction performs a component-wise comparison of the two
5bd8deadSopenharmony_ci    operands.  Each component of the result vector is 1.0 if the corresponding
    component of the first operand is greater than that of the second, and
5bd8deadSopenharmony_ci    0.0 otherwise.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = VectorLoad(op1);
5bd8deadSopenharmony_ci      result.x = (tmp0.x > tmp1.x) ? 1.0 : 0.0;
5bd8deadSopenharmony_ci      result.y = (tmp0.y > tmp1.y) ? 1.0 : 0.0;
5bd8deadSopenharmony_ci      result.z = (tmp0.z > tmp1.z) ? 1.0 : 0.0;
5bd8deadSopenharmony_ci      result.w = (tmp0.w > tmp1.w) ? 1.0 : 0.0;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The following special-case rules apply to SGT:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      1. (NaN > <x>) and (<x> > NaN) are FALSE for all <x>.
5bd8deadSopenharmony_ci      2. (-0.0 > +0.0) and (+0.0 > -0.0) are FALSE.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.5.32,  SIN:  Sine
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The SIN instruction approximates the sine of the angle specified by the
5bd8deadSopenharmony_ci    scalar operand and replicates it to all four components of the result
5bd8deadSopenharmony_ci    vector.  The angle is specified in radians and does not have to be in the
5bd8deadSopenharmony_ci    range [0,2*PI].
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp = ScalarLoad(op0);
5bd8deadSopenharmony_ci      result.x = ApproxSine(tmp);
5bd8deadSopenharmony_ci      result.y = ApproxSine(tmp);
5bd8deadSopenharmony_ci      result.z = ApproxSine(tmp);
5bd8deadSopenharmony_ci      result.w = ApproxSine(tmp);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The approximation function is accurate to at least 22 bits with an angle
5bd8deadSopenharmony_ci    in the range [0,2*PI].
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      | ApproxSine(x) - sin(x) | < 1.0 / 2^22, if 0.0 <= x < 2.0 * PI.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The error in the approximation will typically increase with the absolute
5bd8deadSopenharmony_ci    value of the angle when the angle falls outside the range [0,2*PI].
5bd8deadSopenharmony_ci
    The following special-case rules apply to sine approximation:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      1. ApproxSine(NaN) = NaN.
5bd8deadSopenharmony_ci      2. ApproxSine(+/-INF) = NaN.
5bd8deadSopenharmony_ci      3. ApproxSine(+/-0.0) = +/-0.0.  The sign of the result is equal to the
5bd8deadSopenharmony_ci         sign of the single operand.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.5.33,  SLE:  Set on Less Than or Equal
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The SLE instruction performs a component-wise comparison of the two
5bd8deadSopenharmony_ci    operands.  Each component of the result vector is 1.0 if the corresponding
5bd8deadSopenharmony_ci    component of the first operand is less than or equal to that of the
5bd8deadSopenharmony_ci    second, and 0.0 otherwise.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = VectorLoad(op1);
5bd8deadSopenharmony_ci      result.x = (tmp0.x <= tmp1.x) ? 1.0 : 0.0;
5bd8deadSopenharmony_ci      result.y = (tmp0.y <= tmp1.y) ? 1.0 : 0.0;
5bd8deadSopenharmony_ci      result.z = (tmp0.z <= tmp1.z) ? 1.0 : 0.0;
5bd8deadSopenharmony_ci      result.w = (tmp0.w <= tmp1.w) ? 1.0 : 0.0;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The following special-case rules apply to SLE:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      1. (NaN <= <x>) and (<x> <= NaN) are FALSE for all <x>.
5bd8deadSopenharmony_ci      2. (+INF <= +INF) and (-INF <= -INF) are TRUE.
5bd8deadSopenharmony_ci      3. (-0.0 <= +0.0) and (+0.0 <= -0.0) are TRUE.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.5.34,  SLT:  Set on Less Than
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The SLT instruction performs a component-wise comparison of the two
5bd8deadSopenharmony_ci    operands.  Each component of the result vector is 1.0 if the corresponding
5bd8deadSopenharmony_ci    component of the first operand is less than that of the second, and 0.0
5bd8deadSopenharmony_ci    otherwise.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = VectorLoad(op1);
5bd8deadSopenharmony_ci      result.x = (tmp0.x < tmp1.x) ? 1.0 : 0.0;
5bd8deadSopenharmony_ci      result.y = (tmp0.y < tmp1.y) ? 1.0 : 0.0;
5bd8deadSopenharmony_ci      result.z = (tmp0.z < tmp1.z) ? 1.0 : 0.0;
5bd8deadSopenharmony_ci      result.w = (tmp0.w < tmp1.w) ? 1.0 : 0.0;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The following special-case rules apply to SLT:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      1. (NaN < <x>) and (<x> < NaN) are FALSE for all <x>.
5bd8deadSopenharmony_ci      2. (-0.0 < +0.0) and (+0.0 < -0.0) are FALSE.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.5.35,  SNE:  Set on Not Equal
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The SNE instruction performs a component-wise comparison of the two
5bd8deadSopenharmony_ci    operands.  Each component of the result vector is 1.0 if the corresponding
5bd8deadSopenharmony_ci    component of the first operand is not equal to that of the second, and 0.0
5bd8deadSopenharmony_ci    otherwise.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = VectorLoad(op1);
5bd8deadSopenharmony_ci      result.x = (tmp0.x != tmp1.x) ? 1.0 : 0.0;
5bd8deadSopenharmony_ci      result.y = (tmp0.y != tmp1.y) ? 1.0 : 0.0;
5bd8deadSopenharmony_ci      result.z = (tmp0.z != tmp1.z) ? 1.0 : 0.0;
5bd8deadSopenharmony_ci      result.w = (tmp0.w != tmp1.w) ? 1.0 : 0.0;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The following special-case rules apply to SNE:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      1. (<x> != <y>) and (<y> != <x>) always produce the same result.
5bd8deadSopenharmony_ci      2. (NaN != <x>) is TRUE for all <x>, including NaN.
5bd8deadSopenharmony_ci      3. (+INF != +INF) and (-INF != -INF) are FALSE.
      4. (-0.0 != +0.0) and (+0.0 != -0.0) are FALSE.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.5.36,  STR:  Set on True
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The STR instruction is a degenerate case of the other "Set on"
5bd8deadSopenharmony_ci    instructions that sets all components of the result vector to 1.0.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      result.x = 1.0;
5bd8deadSopenharmony_ci      result.y = 1.0;
5bd8deadSopenharmony_ci      result.z = 1.0;
5bd8deadSopenharmony_ci      result.w = 1.0;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.5.37,  SUB:  Subtract
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The SUB instruction performs a component-wise subtraction of the second
5bd8deadSopenharmony_ci    operand from the first to yield a result vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = VectorLoad(op1);
5bd8deadSopenharmony_ci      result.x = tmp0.x - tmp1.x;
5bd8deadSopenharmony_ci      result.y = tmp0.y - tmp1.y;
5bd8deadSopenharmony_ci      result.z = tmp0.z - tmp1.z;
5bd8deadSopenharmony_ci      result.w = tmp0.w - tmp1.w;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The SUB instruction is completely equivalent to an identical ADD
5bd8deadSopenharmony_ci    instruction in which the negate operator on the second operand is
5bd8deadSopenharmony_ci    reversed:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      1. "SUB R0, R1, R2" is equivalent to "ADD R0, R1, -R2".
5bd8deadSopenharmony_ci      2. "SUB R0, R1, -R2" is equivalent to "ADD R0, R1, R2".
5bd8deadSopenharmony_ci      3. "SUB R0, R1, |R2|" is equivalent to "ADD R0, R1, -|R2|".
5bd8deadSopenharmony_ci      4. "SUB R0, R1, -|R2|" is equivalent to "ADD R0, R1, |R2|".
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
    Section 3.11.5.38,  TEX:  Texture Lookup
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The TEX instruction performs a filtered texture lookup using the texture
5bd8deadSopenharmony_ci    target given by <texImageTarget> belonging to the texture image unit given
5bd8deadSopenharmony_ci    by <texImageUnit>.  <texImageTarget> values of "1D", "2D", "3D", "CUBE",
5bd8deadSopenharmony_ci    and "RECT" correspond to the texture targets TEXTURE_1D, TEXTURE_2D,
5bd8deadSopenharmony_ci    TEXTURE_3D, TEXTURE_CUBE_MAP_ARB, and TEXTURE_RECTANGLE_NV, respectively.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The (s,t,r) texture coordinates used for the lookup are the x, y, and z
5bd8deadSopenharmony_ci    components of the single operand.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The texture lookup is performed as specified in Section 3.8.  The LOD
5bd8deadSopenharmony_ci    calculations in Section 3.8.5 are performed using an implementation
5bd8deadSopenharmony_ci    dependent method to derive ds/dx, ds/dy, dt/dx, dt/dy, dr/dx, and dr/dy.
5bd8deadSopenharmony_ci    The mapping of filtered texture components to the components of the result
5bd8deadSopenharmony_ci    vector is dependent on the base internal format of the texture and is
5bd8deadSopenharmony_ci    specified in Table X.5.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci                                 Result Vector Components
5bd8deadSopenharmony_ci      Base Internal Format        X      Y      Z      W
5bd8deadSopenharmony_ci      --------------------      -----  -----  -----  -----
5bd8deadSopenharmony_ci      ALPHA                      0.0    0.0    0.0    At
5bd8deadSopenharmony_ci      LUMINANCE                  Lt     Lt     Lt     1.0
5bd8deadSopenharmony_ci      LUMINANCE_ALPHA            Lt     Lt     Lt     At
5bd8deadSopenharmony_ci      INTENSITY                  It     It     It     It
5bd8deadSopenharmony_ci      RGB                        Rt     Gt     Bt     1.0
5bd8deadSopenharmony_ci      RGBA                       Rt     Gt     Bt     At
5bd8deadSopenharmony_ci      HILO_NV (signed)           HIt    LOt    HEMI   1.0
5bd8deadSopenharmony_ci      HILO_NV (unsigned)         HIt    LOt    1.0    1.0
5bd8deadSopenharmony_ci      DSDT_NV                    DSt    DTt    0.0    1.0
5bd8deadSopenharmony_ci      DSDT_MAG_NV                DSt    DTt    MAGt   1.0
5bd8deadSopenharmony_ci      DSDT_MAG_INTENSITY_NV      DSt    DTt    MAGt   It
5bd8deadSopenharmony_ci      FLOAT_R_NV                 Rt     0.0    0.0    1.0
5bd8deadSopenharmony_ci      FLOAT_RG_NV                Rt     Gt     0.0    1.0
5bd8deadSopenharmony_ci      FLOAT_RGB_NV               Rt     Gt     Bt     1.0
5bd8deadSopenharmony_ci      FLOAT_RGBA_NV              Rt     Gt     Bt     At
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Table X.5:  Mapping of filtered texel components to result vector
5bd8deadSopenharmony_ci      components for the TEX instruction.  0.0 and 1.0 indicate that the
5bd8deadSopenharmony_ci      corresponding constant value is written to the result vector.
5bd8deadSopenharmony_ci      DEPTH_COMPONENT textures are treated as ALPHA, LUMINANCE, or INTENSITY,
5bd8deadSopenharmony_ci      as specified in the texture's depth texture mode.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      For HILO_NV textures with signed components, "HEMI" is defined as
5bd8deadSopenharmony_ci      sqrt(MAX(0, 1-(HIt^2+LOt^2))).
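
    The following C sketch illustrates how Table X.5 might be applied for a
    subset of the formats (the six formats from unextended OpenGL).  The
    vec4, texel, and base_format types and the map_texel function are
    hypothetical names chosen for this example; an implementation is free
    to realize the mapping differently.

```c
/* Hypothetical result vector type. */
typedef struct { float x, y, z, w; } vec4;

/* Filtered texel components; only those relevant to a format are read. */
typedef struct { float r, g, b, a, l, i; } texel;

/* Hypothetical enum covering a subset of the rows of Table X.5. */
typedef enum {
    FMT_ALPHA, FMT_LUMINANCE, FMT_LUMINANCE_ALPHA,
    FMT_INTENSITY, FMT_RGB, FMT_RGBA
} base_format;

/* Map filtered texel components to the result vector per Table X.5. */
vec4 map_texel(base_format fmt, texel t)
{
    vec4 r = { 0.0f, 0.0f, 0.0f, 0.0f };
    switch (fmt) {
    case FMT_ALPHA:           r.w = t.a;                                   break;
    case FMT_LUMINANCE:       r.x = r.y = r.z = t.l; r.w = 1.0f;           break;
    case FMT_LUMINANCE_ALPHA: r.x = r.y = r.z = t.l; r.w = t.a;            break;
    case FMT_INTENSITY:       r.x = r.y = r.z = r.w = t.i;                 break;
    case FMT_RGB:             r.x = t.r; r.y = t.g; r.z = t.b; r.w = 1.0f; break;
    case FMT_RGBA:            r.x = t.r; r.y = t.g; r.z = t.b; r.w = t.a;  break;
    }
    return r;
}
```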
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    This instruction specifies a particular texture target, ignoring the
5bd8deadSopenharmony_ci    standard hierarchy of texture enables (TEXTURE_CUBE_MAP_ARB, TEXTURE_3D,
5bd8deadSopenharmony_ci    TEXTURE_2D, TEXTURE_1D) used to select a texture target in unextended
5bd8deadSopenharmony_ci    OpenGL.  If the specified texture target has a consistent set of images, a
5bd8deadSopenharmony_ci    lookup is performed.  Otherwise, the result of the instruction is the
5bd8deadSopenharmony_ci    vector (0,0,0,0).
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Although this instruction allows the selection of any texture target, a
5bd8deadSopenharmony_ci    fragment program cannot use more than one texture target for any given
5bd8deadSopenharmony_ci    texture image unit.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.5.39,  TXD: Texture Lookup with Derivatives
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The TXD instruction performs a filtered texture lookup using the texture
5bd8deadSopenharmony_ci    target given by <texImageTarget> belonging to the texture image unit given
5bd8deadSopenharmony_ci    by <texImageUnit>.  <texImageTarget> values of "1D", "2D", "3D", "CUBE",
5bd8deadSopenharmony_ci    and "RECT" correspond to the texture targets TEXTURE_1D, TEXTURE_2D,
5bd8deadSopenharmony_ci    TEXTURE_3D, TEXTURE_CUBE_MAP_ARB, and TEXTURE_RECTANGLE_NV, respectively.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The (s,t,r) texture coordinates used for the lookup are the x, y, and z
5bd8deadSopenharmony_ci    components of the first operand.  The partial derivatives in the X
5bd8deadSopenharmony_ci    direction (ds/dx, dt/dx, dr/dx) are specified by the x, y, and z
5bd8deadSopenharmony_ci    components of the second operand.  The partial derivatives in the Y
5bd8deadSopenharmony_ci    direction (ds/dy, dt/dy, dr/dy) are specified by the x, y, and z
5bd8deadSopenharmony_ci    components of the third operand.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The texture lookup is performed as specified in Section 3.8.  The LOD
5bd8deadSopenharmony_ci    calculations in Section 3.8.5 are performed using the specified partial
5bd8deadSopenharmony_ci    derivatives.  The mapping of filtered texture components to the components
5bd8deadSopenharmony_ci    of the result vector is dependent on the base internal format of the
5bd8deadSopenharmony_ci    texture and is specified in Table X.5.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    This instruction specifies a particular texture target, ignoring the
5bd8deadSopenharmony_ci    standard hierarchy of texture enables (TEXTURE_CUBE_MAP_ARB, TEXTURE_3D,
5bd8deadSopenharmony_ci    TEXTURE_2D, TEXTURE_1D) used to select a texture target in unextended
5bd8deadSopenharmony_ci    OpenGL.  If the specified texture target has a consistent set of images, a
5bd8deadSopenharmony_ci    lookup is performed.  Otherwise, the result of the instruction is the
5bd8deadSopenharmony_ci    vector (0,0,0,0).
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Although this instruction allows the selection of any texture target, a
5bd8deadSopenharmony_ci    fragment program cannot use more than one texture target for any given
5bd8deadSopenharmony_ci    texture image unit.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.5.40,  TXP: Projective Texture Lookup
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The TXP instruction performs a filtered texture lookup using the texture
5bd8deadSopenharmony_ci    target given by <texImageTarget> belonging to the texture image unit given
5bd8deadSopenharmony_ci    by <texImageUnit>.  <texImageTarget> values of "1D", "2D", "3D", "CUBE",
5bd8deadSopenharmony_ci    and "RECT" correspond to the texture targets TEXTURE_1D, TEXTURE_2D,
5bd8deadSopenharmony_ci    TEXTURE_3D, TEXTURE_CUBE_MAP_ARB, and TEXTURE_RECTANGLE_NV, respectively.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    For cube map textures, the (s,t,r) texture coordinates used for the lookup
5bd8deadSopenharmony_ci    are given by x, y, and z, respectively.  For all other textures, the
5bd8deadSopenharmony_ci    (s,t,r) texture coordinates used for the lookup are given by x/w, y/w, and
5bd8deadSopenharmony_ci    z/w, respectively, where x, y, z, and w are the corresponding components
5bd8deadSopenharmony_ci    of the operand.
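
    A minimal C sketch of the coordinate selection described above follows.
    The vec4 and texcoord types, the txp_coords function, and the
    is_cube_map flag are hypothetical names used only for illustration.
    The behavior when w is zero is not addressed by this sketch.

```c
/* Hypothetical operand and texture coordinate types. */
typedef struct { float x, y, z, w; } vec4;
typedef struct { float s, t, r; } texcoord;

/* Derive the (s,t,r) coordinates used by TXP from its operand. */
texcoord txp_coords(vec4 op, int is_cube_map)
{
    texcoord c;
    if (is_cube_map) {
        /* Cube maps: no projective divide is applied. */
        c.s = op.x;
        c.t = op.y;
        c.r = op.z;
    } else {
        /* All other targets: divide x, y, and z by w. */
        c.s = op.x / op.w;
        c.t = op.y / op.w;
        c.r = op.z / op.w;
    }
    return c;
}
```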
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The texture lookup is performed as specified in Section 3.8.  The LOD
5bd8deadSopenharmony_ci    calculations in Section 3.8.5 are performed using an implementation
5bd8deadSopenharmony_ci    dependent method to derive ds/dx, ds/dy, dt/dx, dt/dy, dr/dx, and dr/dy.
5bd8deadSopenharmony_ci    The mapping of filtered texture components to the components of the result
5bd8deadSopenharmony_ci    vector is dependent on the base internal format of the texture and is
5bd8deadSopenharmony_ci    specified in Table X.5.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    This instruction specifies a particular texture target, ignoring the
5bd8deadSopenharmony_ci    standard hierarchy of texture enables (TEXTURE_CUBE_MAP_ARB, TEXTURE_3D,
5bd8deadSopenharmony_ci    TEXTURE_2D, TEXTURE_1D) used to select a texture target in unextended
5bd8deadSopenharmony_ci    OpenGL.  If the specified texture target has a consistent set of images, a
5bd8deadSopenharmony_ci    lookup is performed.  Otherwise, the result of the instruction is the
5bd8deadSopenharmony_ci    vector (0,0,0,0).
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Although this instruction allows the selection of any texture target, a
5bd8deadSopenharmony_ci    fragment program cannot use more than one texture target for any given
5bd8deadSopenharmony_ci    texture image unit.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.5.41,  UP2H:  Unpack Two 16-Bit Floats
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The UP2H instruction unpacks two 16-bit floats stored together in a 32-bit
5bd8deadSopenharmony_ci    scalar operand.  The first 16-bit float (stored in the 16 least
5bd8deadSopenharmony_ci    significant bits) is written into the "x" and "z" components of the result
5bd8deadSopenharmony_ci    vector; the second is written into the "y" and "w" components of the
5bd8deadSopenharmony_ci    result vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    This operation undoes the type conversion and packing performed by the
5bd8deadSopenharmony_ci    PK2H instruction.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp = ScalarLoad(op0);
5bd8deadSopenharmony_ci      result.x = (fp16) (RawBits(tmp) & 0xFFFF);
5bd8deadSopenharmony_ci      result.y = (fp16) ((RawBits(tmp) >> 16) & 0xFFFF);
5bd8deadSopenharmony_ci      result.z = (fp16) (RawBits(tmp) & 0xFFFF);
5bd8deadSopenharmony_ci      result.w = (fp16) ((RawBits(tmp) >> 16) & 0xFFFF);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Since the source operand must be a 32-bit scalar, a fragment program will
5bd8deadSopenharmony_ci    fail to load if the operand is not obtained from a register with 32-bit
5bd8deadSopenharmony_ci    components or from a program parameter.
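
    For readers who want to experiment on the CPU, the unpack can be
    sketched in C as shown below.  This sketch assumes that fp16 values use
    a 1-bit sign, 5-bit exponent (bias 15), and 10-bit mantissa layout with
    IEEE-style denormals, infinities, and NaNs; that assumption, along with
    the helper names raw_bits, half_to_float, and up2h_bits, belongs to
    this example and should be checked against the fp16 definition
    elsewhere in this specification.

```c
#include <stdint.h>
#include <string.h>

typedef struct { float x, y, z, w; } vec4;

/* Stand-in for RawBits(): reinterpret a 32-bit float as an integer. */
uint32_t raw_bits(float f)
{
    uint32_t u;
    memcpy(&u, &f, sizeof u);
    return u;
}

/* Convert a 16-bit float (assumed s10e5, bias 15) to a 32-bit float. */
float half_to_float(uint16_t h)
{
    uint32_t sign = (uint32_t)(h >> 15) << 31;
    uint32_t exp  = (h >> 10) & 0x1Fu;
    uint32_t man  = h & 0x3FFu;
    uint32_t bits;
    float f;

    if (exp == 0) {
        if (man == 0) {
            bits = sign;                        /* signed zero */
        } else {
            exp = 127 - 15 + 1;                 /* denormal: renormalize */
            while ((man & 0x400u) == 0) { man <<= 1; exp--; }
            bits = sign | (exp << 23) | ((man & 0x3FFu) << 13);
        }
    } else if (exp == 0x1Fu) {
        bits = sign | 0x7F800000u | (man << 13); /* infinity or NaN */
    } else {
        bits = sign | ((exp + 127 - 15) << 23) | (man << 13);
    }
    memcpy(&f, &bits, sizeof f);
    return f;
}

/* UP2H on the raw bits of the scalar operand. */
vec4 up2h_bits(uint32_t bits)
{
    float lo = half_to_float((uint16_t)(bits & 0xFFFFu));
    float hi = half_to_float((uint16_t)((bits >> 16) & 0xFFFFu));
    vec4 r = { lo, hi, lo, hi };
    return r;
}
```

    Given a float operand tmp, the result would be obtained as
    up2h_bits(raw_bits(tmp)).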
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.5.42,  UP2US:  Unpack Two Unsigned 16-Bit Scalars
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The UP2US instruction unpacks two 16-bit unsigned values packed together
5bd8deadSopenharmony_ci    in a 32-bit scalar operand.  The unsigned quantities are encoded where a
5bd8deadSopenharmony_ci    bit pattern of all '0' bits corresponds to 0.0 and a pattern of all '1'
5bd8deadSopenharmony_ci    bits corresponds to 1.0.  The "x" and "z" components of the result vector
5bd8deadSopenharmony_ci    are obtained from the 16 least significant bits of the operand; the "y"
5bd8deadSopenharmony_ci    and "w" components are obtained from the 16 most significant bits.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    This operation undoes the type conversion and packing performed by the
5bd8deadSopenharmony_ci    PK2US instruction.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp = ScalarLoad(op0);
5bd8deadSopenharmony_ci      result.x = ((RawBits(tmp) >> 0)  & 0xFFFF) / 65535.0;
5bd8deadSopenharmony_ci      result.y = ((RawBits(tmp) >> 16) & 0xFFFF) / 65535.0;
5bd8deadSopenharmony_ci      result.z = ((RawBits(tmp) >> 0)  & 0xFFFF) / 65535.0;
5bd8deadSopenharmony_ci      result.w = ((RawBits(tmp) >> 16) & 0xFFFF) / 65535.0;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Since the source operand must be a 32-bit scalar, a fragment program will
5bd8deadSopenharmony_ci    fail to load if the operand is not obtained from a register with 32-bit
5bd8deadSopenharmony_ci    components or from a program parameter.
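
    The pseudocode above translates almost directly to C.  In the sketch
    below, the vec4 type and the up2us_bits function are hypothetical
    names, and the function takes the operand's raw bits (the value of
    RawBits(tmp)) as an integer.

```c
#include <stdint.h>

typedef struct { float x, y, z, w; } vec4;

/* UP2US on the raw bits of the scalar operand. */
vec4 up2us_bits(uint32_t bits)
{
    /* 0x0000 maps to 0.0 and 0xFFFF maps to 1.0. */
    float lo = (float)(bits & 0xFFFFu) / 65535.0f;
    float hi = (float)((bits >> 16) & 0xFFFFu) / 65535.0f;
    vec4 r = { lo, hi, lo, hi };
    return r;
}
```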
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.5.43,  UP4B:  Unpack Four Signed 8-Bit Values
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The UP4B instruction unpacks four 8-bit signed values packed together in a
5bd8deadSopenharmony_ci    32-bit scalar operand.  The signed quantities are encoded where a bit
5bd8deadSopenharmony_ci    pattern of all '0' bits corresponds to -128/127 and a pattern of all '1'
5bd8deadSopenharmony_ci    bits corresponds to +127/127.  The "x" component of the result vector is
5bd8deadSopenharmony_ci    the converted value corresponding to the 8 least significant bits of the
5bd8deadSopenharmony_ci    operand; the "w" component corresponds to the 8 most significant bits.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    This operation undoes the type conversion and packing performed by the
5bd8deadSopenharmony_ci    PK4B instruction.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp = ScalarLoad(op0);
5bd8deadSopenharmony_ci      result.x = (((RawBits(tmp) >> 0) & 0xFF) - 128) / 127.0;
5bd8deadSopenharmony_ci      result.y = (((RawBits(tmp) >> 8) & 0xFF) - 128) / 127.0;
5bd8deadSopenharmony_ci      result.z = (((RawBits(tmp) >> 16) & 0xFF) - 128) / 127.0;
5bd8deadSopenharmony_ci      result.w = (((RawBits(tmp) >> 24) & 0xFF) - 128) / 127.0;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Since the source operand must be a 32-bit scalar, a fragment program will
5bd8deadSopenharmony_ci    fail to load if the operand is not obtained from a register with 32-bit
5bd8deadSopenharmony_ci    components or from a program parameter.
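
    The biased encoding can be illustrated with the C sketch below; the
    vec4 type and the up4b_bits function are hypothetical names, and the
    function takes the operand's raw bits as an integer.  Note that a byte
    value of 128 decodes to 0.0, a byte value of 1 decodes to exactly -1.0,
    and a byte value of 0 decodes to -128/127, slightly below -1.0.

```c
#include <stdint.h>

typedef struct { float x, y, z, w; } vec4;

/* Decode one biased 8-bit field starting at the given bit position. */
float up4b_component(uint32_t bits, unsigned shift)
{
    return ((float)((bits >> shift) & 0xFFu) - 128.0f) / 127.0f;
}

/* UP4B on the raw bits of the scalar operand. */
vec4 up4b_bits(uint32_t bits)
{
    vec4 r;
    r.x = up4b_component(bits, 0);
    r.y = up4b_component(bits, 8);
    r.z = up4b_component(bits, 16);
    r.w = up4b_component(bits, 24);
    return r;
}
```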
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.5.44,  UP4UB:  Unpack Four Unsigned 8-Bit Scalars
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The UP4UB instruction unpacks four 8-bit unsigned values packed together
5bd8deadSopenharmony_ci    in a 32-bit scalar operand.  The unsigned quantities are encoded where a
5bd8deadSopenharmony_ci    bit pattern of all '0' bits corresponds to 0.0 and a pattern of all '1'
5bd8deadSopenharmony_ci    bits corresponds to 1.0.  The "x" component of the result vector is
5bd8deadSopenharmony_ci    obtained from the 8 least significant bits of the operand; the "w"
5bd8deadSopenharmony_ci    component is obtained from the 8 most significant bits.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    This operation undoes the type conversion and packing performed by the
5bd8deadSopenharmony_ci    PK4UB instruction.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp = ScalarLoad(op0);
5bd8deadSopenharmony_ci      result.x = ((RawBits(tmp) >> 0)  & 0xFF) / 255.0;
5bd8deadSopenharmony_ci      result.y = ((RawBits(tmp) >> 8)  & 0xFF) / 255.0;
5bd8deadSopenharmony_ci      result.z = ((RawBits(tmp) >> 16) & 0xFF) / 255.0;
5bd8deadSopenharmony_ci      result.w = ((RawBits(tmp) >> 24) & 0xFF) / 255.0;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Since the source operand must be a 32-bit scalar, a fragment program will
5bd8deadSopenharmony_ci    fail to load if the operand is not obtained from a register with 32-bit
5bd8deadSopenharmony_ci    components or from a program parameter.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.5.45,  X2D:  2D Coordinate Transformation
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The X2D instruction multiplies the 2D offset vector specified by the "x"
5bd8deadSopenharmony_ci    and "y" components of the second vector operand by the 2x2 matrix
5bd8deadSopenharmony_ci    specified by the four components of the third vector operand, and adds the
5bd8deadSopenharmony_ci    transformed offset vector to the 2D vector specified by the "x" and "y"
5bd8deadSopenharmony_ci    components of the first vector operand.  The first component of the sum is
5bd8deadSopenharmony_ci    written to the "x" and "z" components of the result; the second component
5bd8deadSopenharmony_ci    is written to the "y" and "w" components of the result.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The X2D instruction can be used to displace texture coordinates in the
5bd8deadSopenharmony_ci    same manner as the OFFSET_TEXTURE_2D_NV mode in the GL_NV_texture_shader
5bd8deadSopenharmony_ci    extension.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = VectorLoad(op1);
5bd8deadSopenharmony_ci      tmp2 = VectorLoad(op2);
5bd8deadSopenharmony_ci      result.x = tmp0.x + tmp1.x * tmp2.x + tmp1.y * tmp2.y;
5bd8deadSopenharmony_ci      result.y = tmp0.y + tmp1.x * tmp2.z + tmp1.y * tmp2.w;
5bd8deadSopenharmony_ci      result.z = tmp0.x + tmp1.x * tmp2.x + tmp1.y * tmp2.y;
5bd8deadSopenharmony_ci      result.w = tmp0.y + tmp1.x * tmp2.z + tmp1.y * tmp2.w;
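
    As a sketch, the same computation in C is shown below, with the third
    operand interpreted as a row-major 2x2 matrix (x, y in the first row
    and z, w in the second), as the pseudocode implies.  The vec4 type and
    the x2d function are hypothetical names for this example.

```c
typedef struct { float x, y, z, w; } vec4;

/* X2D: base + matrix * offset, replicated into (x,z) and (y,w). */
vec4 x2d(vec4 base, vec4 offset, vec4 matrix)
{
    float sx = base.x + offset.x * matrix.x + offset.y * matrix.y;
    float sy = base.y + offset.x * matrix.z + offset.y * matrix.w;
    vec4 r = { sx, sy, sx, sy };
    return r;
}
```

    With an identity matrix (1,0,0,1), the instruction reduces to adding
    the offset to the base coordinates.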
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.6, Fragment Program Outputs
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Upon completion of fragment program execution, the output registers are
5bd8deadSopenharmony_ci    used to replace the fragment's associated data.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The RGBA color of the fragment is taken from the color output register
5bd8deadSopenharmony_ci    used by the program (COLR or COLH).  The R, G, B, and A color components
5bd8deadSopenharmony_ci    are extracted from the "x", "y", "z", and "w" components, respectively, of
5bd8deadSopenharmony_ci    the output register and are clamped to the range [0,1].
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If the DEPR output register is written by the fragment program, the depth
5bd8deadSopenharmony_ci    value of the fragment is taken from the z component of the DEPR output
5bd8deadSopenharmony_ci    register.  If depth clamping is enabled, the depth value is clamped to the
5bd8deadSopenharmony_ci    range [min(n,f), max(n,f)], where n and f are the near and far depth range
5bd8deadSopenharmony_ci    values.  If depth clamping is disabled, the fragment is discarded if its
5bd8deadSopenharmony_ci    depth value is outside the range [min(n,f), max(n,f)].
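
    The color and depth handling described above can be sketched in C as
    follows.  The vec4 type and the clamp01, final_color, and final_depth
    functions are hypothetical names; the sketch models only the rules
    stated in this section, not the remainder of the fragment pipeline.

```c
typedef struct { float x, y, z, w; } vec4;

/* Clamp a single value to the range [0,1]. */
float clamp01(float v)
{
    return v < 0.0f ? 0.0f : (v > 1.0f ? 1.0f : v);
}

/* RGBA is taken from x, y, z, w of the color output and clamped. */
vec4 final_color(vec4 out)
{
    vec4 c = { clamp01(out.x), clamp01(out.y), clamp01(out.z), clamp01(out.w) };
    return c;
}

/*
 * Apply the depth rule to the z component of DEPR.  n and f are the near
 * and far depth range values.  Returns 1 and stores the depth in *depth if
 * the fragment survives; returns 0 if the fragment is discarded.
 */
int final_depth(float depr_z, float n, float f, int depth_clamp_enabled,
                float *depth)
{
    float lo = n < f ? n : f;
    float hi = n < f ? f : n;

    if (depth_clamp_enabled) {
        if (depr_z < lo) depr_z = lo;
        if (depr_z > hi) depr_z = hi;
    } else if (depr_z < lo || depr_z > hi) {
        return 0;   /* outside [min(n,f), max(n,f)]: discard */
    }
    *depth = depr_z;
    return 1;
}
```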
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 3.11.7, Required Fragment Program State
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The state required for managing fragment programs consists of:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      a bit indicating whether or not fragment program mode is enabled;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      an unsigned integer naming the currently bound fragment program
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      and the state that must be maintained to indicate which integers are
5bd8deadSopenharmony_ci      currently in use as fragment program names.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Fragment program mode is initially disabled.  The initial state of all 128
5bd8deadSopenharmony_ci    fragment program parameter registers is (0,0,0,0).  The initial currently
5bd8deadSopenharmony_ci    bound fragment program is zero.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Each fragment program object consists of:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      an enumerant giving the program target (FRAGMENT_PROGRAM_NV);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      a boolean indicating whether the program is resident;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      an array of type ubyte containing the program string;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      an integer representing the length of the program string array;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      one four-component floating-point vector for each named local
5bd8deadSopenharmony_ci      parameter in the program;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      and a set of MAX_FRAGMENT_PROGRAM_LOCAL_PARAMETERS_NV four-component
5bd8deadSopenharmony_ci      floating-point vectors to hold numbered local parameters, each initially
5bd8deadSopenharmony_ci      set to (0,0,0,0).
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Initially, no program objects exist.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Additionally, the state required during the execution of a fragment
5bd8deadSopenharmony_ci    program consists of:  twelve 4-component floating-point fragment attribute
5bd8deadSopenharmony_ci    registers, thirty-two 128-bit physical temporary registers, and a single
5bd8deadSopenharmony_ci    4-component condition code, whose components have one of four values (LT,
5bd8deadSopenharmony_ci    EQ, GT, or UN).
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Each time a fragment program is executed, the fragment attribute registers
5bd8deadSopenharmony_ci    are initialized with the fragment's location and associated data, all
5bd8deadSopenharmony_ci    temporary register components are initialized to zero, and all condition
5bd8deadSopenharmony_ci    code components are initialized to EQ.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Renumber Section 3.11 to Section 3.12, Antialiasing Application (p.140).
5bd8deadSopenharmony_ci    No changes to the text of the section.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciAdditions to Chapter 4 of the OpenGL 1.2.1 Specification (Per-Fragment
5bd8deadSopenharmony_ciOperations and the Framebuffer)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    None
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciAdditions to Chapter 5 of the OpenGL 1.2.1 Specification (Special Functions)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Add new section 5.7, Programs (after "Flush and Finish")
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Programs are specified as an array of ubytes used to control the operation
5bd8deadSopenharmony_ci    of portions of the GL.  The array is a string of ASCII characters encoding
5bd8deadSopenharmony_ci    the program.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The command
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      LoadProgramNV(enum target, uint id, sizei len, const ubyte *program);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    loads a program.  The target parameter specifies the type of program
5bd8deadSopenharmony_ci    loaded and can be VERTEX_PROGRAM_NV, VERTEX_STATE_PROGRAM_NV, or
5bd8deadSopenharmony_ci    FRAGMENT_PROGRAM_NV.  VERTEX_PROGRAM_NV specifies a program to be executed
5bd8deadSopenharmony_ci    in vertex program mode as each vertex is specified.
5bd8deadSopenharmony_ci    VERTEX_STATE_PROGRAM_NV specifies a program to be run manually to update
5bd8deadSopenharmony_ci    vertex state.  FRAGMENT_PROGRAM_NV specifies a program to be executed in
5bd8deadSopenharmony_ci    fragment program mode as each fragment is rasterized.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Multiple programs can be loaded with different names.  id names the
5bd8deadSopenharmony_ci    program to load.  The name space for programs is the set of positive
5bd8deadSopenharmony_ci    integers (zero is reserved).  The error INVALID_VALUE is generated by
5bd8deadSopenharmony_ci    LoadProgramNV if a program is loaded with an id of zero.  The error
5bd8deadSopenharmony_ci    INVALID_OPERATION is generated by LoadProgramNV if a program is loaded
5bd8deadSopenharmony_ci    for an id that is currently loaded with a program of a different program
5bd8deadSopenharmony_ci    target.  program is a pointer to an array of ubytes that represents the
5bd8deadSopenharmony_ci    program being loaded.  The length of the array in ubytes is indicated by
5bd8deadSopenharmony_ci    len.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    At program load time, the program is parsed into a set of tokens possibly
5bd8deadSopenharmony_ci    separated by white space.  Spaces, tabs, newlines, carriage returns, and
5bd8deadSopenharmony_ci    comments are considered whitespace.  Comments begin with the character "#"
5bd8deadSopenharmony_ci    and are terminated by a newline, a carriage return, or the end of the
5bd8deadSopenharmony_ci    program array.  Tokens are processed in a case-sensitive manner:  upper
5bd8deadSopenharmony_ci    and lower-case letters are not considered equivalent.
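
    One possible way to apply the white space and comment rules in a
    tokenizer is sketched below in C.  The skip_whitespace function is a
    hypothetical helper for illustration; it is not part of the GL API and
    does not reflect any particular implementation's parser.

```c
#include <stddef.h>

/*
 * Return the index of the first byte at or after pos that is neither white
 * space nor part of a comment.  Returns len if only white space and
 * comments remain.
 */
size_t skip_whitespace(const unsigned char *program, size_t len, size_t pos)
{
    while (pos < len) {
        unsigned char c = program[pos];
        if (c == ' ' || c == '\t' || c == '\n' || c == '\r') {
            pos++;                      /* plain white space */
        } else if (c == '#') {
            /* A comment runs to a newline, a carriage return, or the end. */
            while (pos < len && program[pos] != '\n' && program[pos] != '\r')
                pos++;
        } else {
            break;                      /* start of a token */
        }
    }
    return pos;
}
```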
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Each program target has a corresponding Backus-Naur Form (BNF) grammar
5bd8deadSopenharmony_ci    specifying the syntactically valid sequences for programs of the specified
5bd8deadSopenharmony_ci    type.  The set of valid tokens can be inferred from the grammar.  The
5bd8deadSopenharmony_ci    token "" represents an empty string and is used to indicate optional
5bd8deadSopenharmony_ci    rules.  A program is invalid if it contains any undefined tokens or
5bd8deadSopenharmony_ci    characters.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The error INVALID_OPERATION is generated by LoadProgramNV if a program
5bd8deadSopenharmony_ci    fails to load because it is not syntactically correct or fails to satisfy
5bd8deadSopenharmony_ci    all of the semantic restrictions corresponding to the program target.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    A successfully loaded program is parsed into a sequence of instructions.
5bd8deadSopenharmony_ci    Each instruction is identified by its tokenized name.  The operation of
5bd8deadSopenharmony_ci    these instructions is specific to the program target and is defined
5bd8deadSopenharmony_ci    elsewhere.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    A successfully loaded program replaces the program previously assigned to
5bd8deadSopenharmony_ci    the name specified by id.  If the OUT_OF_MEMORY error is generated by
5bd8deadSopenharmony_ci    LoadProgramNV, no change is made to the previous contents of the named
5bd8deadSopenharmony_ci    program.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Querying the value of PROGRAM_ERROR_POSITION_NV returns a ubyte offset
5bd8deadSopenharmony_ci    into the program string most recently passed to LoadProgramNV indicating
5bd8deadSopenharmony_ci    the position of the first error, if any, in the program.  If the program
5bd8deadSopenharmony_ci    fails to load because of a semantic restriction that cannot be determined
5bd8deadSopenharmony_ci    until the program is fully scanned, the error position will be len, the
5bd8deadSopenharmony_ci    length of the program.  If the program loads successfully, the value of
5bd8deadSopenharmony_ci    PROGRAM_ERROR_POSITION_NV is assigned the value negative one.
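
    Applications often want to report the error position as a line and
    column rather than as a byte offset.  A minimal C sketch of that
    conversion follows; error_line_col is a hypothetical application-side
    helper, and it assumes that lines are separated by the newline
    character only.  The value passed as pos is whatever the application
    obtained by querying PROGRAM_ERROR_POSITION_NV.

```c
#include <stddef.h>

/*
 * Convert a byte offset into a 1-based line and column.  Returns 0 if pos
 * is negative (no error was recorded) or greater than len; otherwise
 * stores the line and column and returns 1.  A pos equal to len is
 * accepted, since that value is used for errors found only after the
 * whole program has been scanned.
 */
int error_line_col(const unsigned char *program, size_t len, long pos,
                   size_t *line, size_t *col)
{
    size_t l = 1, c = 1, i;

    if (pos < 0 || (size_t)pos > len)
        return 0;
    for (i = 0; i < (size_t)pos; i++) {
        if (program[i] == '\n') {
            l++;
            c = 1;
        } else {
            c++;
        }
    }
    *line = l;
    *col = c;
    return 1;
}
```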
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    For targets whose programs are executed automatically (e.g., vertex and
5bd8deadSopenharmony_ci    fragment programs), there must be a current program.  The current vertex
5bd8deadSopenharmony_ci    program is executed automatically in vertex program mode as vertices are
5bd8deadSopenharmony_ci    specified.  The current fragment program is executed automatically in
5bd8deadSopenharmony_ci    fragment program mode as fragments are generated by rasterization.
5bd8deadSopenharmony_ci    Current programs for a program target are updated by
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      BindProgramNV(enum target, uint id);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    where target must be VERTEX_PROGRAM_NV or FRAGMENT_PROGRAM_NV.  The error
5bd8deadSopenharmony_ci    INVALID_OPERATION is generated by BindProgramNV if id names a program that
5bd8deadSopenharmony_ci    has a type different from target (for example, if id names a vertex state
5bd8deadSopenharmony_ci    program as described in section 2.14.4).
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Binding to a nonexistent program id does not generate an error.  In
5bd8deadSopenharmony_ci    particular, binding to program id zero does not generate an error.
5bd8deadSopenharmony_ci    However, because program zero cannot be loaded, program zero is always
5bd8deadSopenharmony_ci    nonexistent.  If a program id is successfully loaded with a new vertex
5bd8deadSopenharmony_ci    program and id is also the currently bound vertex program, the new program
5bd8deadSopenharmony_ci    is considered the currently bound vertex program.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The INVALID_OPERATION error is generated when vertex program mode is
5bd8deadSopenharmony_ci    enabled and Begin is called (or when a command that performs an implicit
5bd8deadSopenharmony_ci    Begin is called) if the current vertex program is nonexistent or not
5bd8deadSopenharmony_ci    valid.  A vertex program may not be valid for reasons explained in section
5bd8deadSopenharmony_ci    2.14.5.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The INVALID_OPERATION error is generated when fragment program mode
5bd8deadSopenharmony_ci    is enabled and Begin, another GL command that performs an implicit Begin,
5bd8deadSopenharmony_ci    or any other GL command that generates fragments is called, if the current
5bd8deadSopenharmony_ci    fragment program is nonexistent or not valid.  A fragment program may be
5bd8deadSopenharmony_ci    invalid for reasons explained in Section 3.11.3.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Programs are deleted by calling
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      void DeleteProgramsNV(sizei n, const uint *ids);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    ids contains n names of programs to be deleted.  After a program is
5bd8deadSopenharmony_ci    deleted, it becomes nonexistent, and its name is again unused.  If a
5bd8deadSopenharmony_ci    program that is currently bound is deleted, it is as though BindProgramNV
5bd8deadSopenharmony_ci    has been executed with the same target as the deleted program and program
5bd8deadSopenharmony_ci    zero.  Unused names in ids are silently ignored, as is the value zero.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The command
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      void GenProgramsNV(sizei n, uint *ids);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    returns n currently unused program names in ids.  These names are marked
5bd8deadSopenharmony_ci    as used, for the purposes of GenProgramsNV only, but they become existent
5bd8deadSopenharmony_ci    programs only when they are first loaded using LoadProgramNV.
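
    The life cycle of a program name (unused, marked as used by
    GenProgramsNV, existent after LoadProgramNV, unused again after
    DeleteProgramsNV) can be modeled with the toy C sketch below.  All
    types and functions here (toy_gl, toy_gen, toy_load, toy_bind_fragment,
    toy_delete) are hypothetical, the fixed table size is an arbitrary
    limit of the model, and program parsing is omitted.  Treating a
    generated but never loaded name as deletable is an assumption of this
    sketch.

```c
enum { MAX_IDS = 16 };  /* arbitrary limit of this toy model */

typedef enum { NAME_UNUSED, NAME_GENERATED, NAME_EXISTENT } name_state;
typedef enum { TGT_NONE, TGT_VERTEX, TGT_VERTEX_STATE, TGT_FRAGMENT } toy_target;
typedef enum { ERR_NONE, ERR_INVALID_VALUE, ERR_INVALID_OPERATION,
               ERR_TOY_LIMIT } toy_error;

typedef struct {
    name_state state[MAX_IDS];
    toy_target target[MAX_IDS];
    unsigned   bound_fragment;   /* zero means no existent program bound */
} toy_gl;

void toy_init(toy_gl *gl)
{
    unsigned id;
    for (id = 0; id < MAX_IDS; id++) {
        gl->state[id] = NAME_UNUSED;
        gl->target[id] = TGT_NONE;
    }
    gl->bound_fragment = 0;      /* initially bound program is zero */
}

/* Models GenProgramsNV for n = 1: returns an unused name, or 0 if full. */
unsigned toy_gen(toy_gl *gl)
{
    unsigned id;
    for (id = 1; id < MAX_IDS; id++) {
        if (gl->state[id] == NAME_UNUSED) {
            gl->state[id] = NAME_GENERATED;
            return id;
        }
    }
    return 0;
}

/* Models the name and target checks of LoadProgramNV. */
toy_error toy_load(toy_gl *gl, toy_target tgt, unsigned id)
{
    if (id == 0) return ERR_INVALID_VALUE;          /* zero is reserved */
    if (id >= MAX_IDS) return ERR_TOY_LIMIT;        /* model limit only */
    if (gl->state[id] == NAME_EXISTENT && gl->target[id] != tgt)
        return ERR_INVALID_OPERATION;               /* target mismatch */
    gl->state[id] = NAME_EXISTENT;
    gl->target[id] = tgt;
    return ERR_NONE;
}

/* Models BindProgramNV(FRAGMENT_PROGRAM_NV, id). */
toy_error toy_bind_fragment(toy_gl *gl, unsigned id)
{
    if (id < MAX_IDS && gl->state[id] == NAME_EXISTENT &&
        gl->target[id] != TGT_FRAGMENT)
        return ERR_INVALID_OPERATION;
    gl->bound_fragment = id;     /* nonexistent ids, including 0, are allowed */
    return ERR_NONE;
}

/* Models DeleteProgramsNV for n = 1. */
void toy_delete(toy_gl *gl, unsigned id)
{
    if (id == 0 || id >= MAX_IDS || gl->state[id] == NAME_UNUSED)
        return;                  /* silently ignored */
    if (gl->bound_fragment == id)
        gl->bound_fragment = 0;  /* as though program zero were bound */
    gl->state[id] = NAME_UNUSED;
    gl->target[id] = TGT_NONE;
}
```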
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    An implementation may choose to establish a working set of programs on
5bd8deadSopenharmony_ci    which binding and/or manual execution are performed with higher
5bd8deadSopenharmony_ci    performance.  A program that is currently part of this working set is said
5bd8deadSopenharmony_ci    to be resident.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The command
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      boolean AreProgramsResidentNV(sizei n, const uint *ids,
5bd8deadSopenharmony_ci                                    boolean *residences);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    returns TRUE if all of the n programs named in ids are resident, or if the
5bd8deadSopenharmony_ci    implementation does not distinguish a working set.  If at least one of the
5bd8deadSopenharmony_ci    programs named in ids is not resident, then FALSE is returned, and the
5bd8deadSopenharmony_ci    residence of each program is returned in residences.  Otherwise the
5bd8deadSopenharmony_ci    contents of residences are not changed.  If any of the names in ids are
5bd8deadSopenharmony_ci    nonexistent or zero, FALSE is returned, the error INVALID_VALUE is
5bd8deadSopenharmony_ci    generated, and the contents of residences are indeterminate.  The
5bd8deadSopenharmony_ci    residence status of a single named program can also be queried by calling
5bd8deadSopenharmony_ci    GetProgramivNV (Section 6.1.13) with id set to the name of the program and
5bd8deadSopenharmony_ci    pname set to PROGRAM_RESIDENT_NV.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    AreProgramsResidentNV indicates only whether a program is currently
5bd8deadSopenharmony_ci    resident, not whether it could be made resident.  An implementation
5bd8deadSopenharmony_ci    may choose to make a program resident only on first use, for example.  The
5bd8deadSopenharmony_ci    client may guide the GL implementation in determining which programs
5bd8deadSopenharmony_ci    should be resident by requesting a set of programs to make resident.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The command
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      void RequestResidentProgramsNV(sizei n, const uint *ids);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    requests that the n programs named in ids be made resident.  Not all
5bd8deadSopenharmony_ci    of the programs are guaranteed to become resident, but the
5bd8deadSopenharmony_ci    implementation should make a best effort to make as many of the
5bd8deadSopenharmony_ci    programs resident as possible.  As a result of making the requested
5bd8deadSopenharmony_ci    programs resident, programs not among those requested may become
5bd8deadSopenharmony_ci    non-resident.  Higher priority for residency should be given to
5bd8deadSopenharmony_ci    programs listed earlier in the ids array.  RequestResidentProgramsNV
5bd8deadSopenharmony_ci    silently ignores any name in ids that is zero or does not name an
5bd8deadSopenharmony_ci    existing program.  AreProgramsResidentNV can be called after
5bd8deadSopenharmony_ci    RequestResidentProgramsNV to determine which programs actually became
5bd8deadSopenharmony_ci    resident.
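
    One way to picture the priority rule is a working set with a fixed
    number of slots.  The sketch below is an assumption made purely for
    illustration: the specification does not say how an implementation
    sizes or manages its working set, and ResidencyEntry, find_entry, and
    model_request_resident are hypothetical names.

```c
#include <stddef.h>

/* Hypothetical driver-side record; not part of the GL API. */
typedef struct {
    unsigned id;        /* program name, never zero */
    int      resident;  /* current residency */
    int      keep;      /* scratch flag used while rebuilding the working set */
} ResidencyEntry;

static ResidencyEntry *find_entry(ResidencyEntry *t, size_t count, unsigned id)
{
    size_t i;
    for (i = 0; i < count; i++)
        if (id != 0 && t[i].id == id)
            return &t[i];
    return NULL;
}

/* Rebuilds a working set of at most `capacity` programs (assumed limit). */
void model_request_resident(ResidencyEntry *t, size_t count, size_t capacity,
                            int n, const unsigned *ids)
{
    size_t used = 0;
    size_t j;
    int i;

    for (j = 0; j < count; j++)
        t[j].keep = 0;

    /* Requested programs claim slots first, earliest entries in ids first. */
    for (i = 0; i < n && used < capacity; i++) {
        ResidencyEntry *p = find_entry(t, count, ids[i]);
        if (p == NULL || p->keep)
            continue;           /* zero, nonexistent, or duplicate: ignored */
        p->keep = 1;
        used++;
    }

    /* Programs that were not requested stay resident only while room remains. */
    for (j = 0; j < count && used < capacity; j++) {
        if (t[j].resident && !t[j].keep) {
            t[j].keep = 1;
            used++;
        }
    }

    for (j = 0; j < count; j++)
        t[j].resident = t[j].keep;
}
```

    In this model a request can push previously resident programs out of
    the working set, which is why a follow-up residency query is the only
    reliable way to learn the outcome.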
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The commands
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      void ProgramNamedParameter4fNV(uint id, sizei len, const ubyte *name,
5bd8deadSopenharmony_ci                                     float x, float y, float z, float w);
5bd8deadSopenharmony_ci      void ProgramNamedParameter4dNV(uint id, sizei len, const ubyte *name,
5bd8deadSopenharmony_ci                                     double x, double y, double z, double w);
5bd8deadSopenharmony_ci      void ProgramNamedParameter4fvNV(uint id, sizei len, const ubyte *name,
5bd8deadSopenharmony_ci                                      const float v[]);
5bd8deadSopenharmony_ci      void ProgramNamedParameter4dvNV(uint id, sizei len, const ubyte *name,
5bd8deadSopenharmony_ci                                      const double v[]);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    specify a new value for the named program local parameter <name> belonging
5bd8deadSopenharmony_ci    to the fragment program specified by <id>.  <name> is a pointer to an
5bd8deadSopenharmony_ci    array of ubytes holding the parameter name.  <len> specifies the number of
5bd8deadSopenharmony_ci    ubytes in the array given by <name>.  The new x, y, z, and w components of
5bd8deadSopenharmony_ci    the named local parameter are given by x, y, z, and w, respectively, for
5bd8deadSopenharmony_ci    ProgramNamedParameter4fNV and ProgramNamedParameter4dNV, and by v[0],
5bd8deadSopenharmony_ci    v[1], v[2], and v[3], respectively, for ProgramNamedParameter4fvNV and
5bd8deadSopenharmony_ci    ProgramNamedParameter4dvNV.  The error INVALID_OPERATION is generated if
5bd8deadSopenharmony_ci    <id> specifies a nonexistent program or a program whose type does not
5bd8deadSopenharmony_ci    support named local parameters.  The error INVALID_VALUE is generated
5bd8deadSopenharmony_ci    if <name> does not specify the name of a local parameter in the program
5bd8deadSopenharmony_ci    corresponding to <id>.  The error INVALID_VALUE is also generated if <len>
5bd8deadSopenharmony_ci    is zero.
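
    Because <name> is a counted array of ubytes rather than a terminated
    string, a lookup has to compare exactly <len> bytes.  The following
    sketch models the error rules above under stated assumptions:
    NamedParam, NamedParamProgram, and model_set_named_parameter are
    hypothetical, the order in which the errors are checked is not
    specified by the text, and treating a negative <len> like zero is an
    additional assumption.

```c
#include <stddef.h>
#include <string.h>

enum {
    NP_NO_ERROR          = 0,
    NP_INVALID_VALUE     = 0x0501,
    NP_INVALID_OPERATION = 0x0502
};

/* Hypothetical records standing in for driver-side state. */
typedef struct {
    const char *name;       /* name declared in the program text */
    float       value[4];   /* x, y, z, w */
} NamedParam;

typedef struct {
    int         exists;          /* program has been loaded */
    int         supports_named;  /* nonzero for fragment programs */
    NamedParam *params;
    size_t      param_count;
} NamedParamProgram;

int model_set_named_parameter(NamedParamProgram *prog, int len,
                              const unsigned char *name,
                              float x, float y, float z, float w)
{
    size_t i;

    if (prog == NULL || !prog->exists || !prog->supports_named)
        return NP_INVALID_OPERATION;
    if (len <= 0)                   /* zero per the text; negative by assumption */
        return NP_INVALID_VALUE;

    for (i = 0; i < prog->param_count; i++) {
        const char *decl = prog->params[i].name;
        /* Match only if the lengths agree and all len bytes are equal,
           so "color" does not match "colorScale". */
        if (strlen(decl) == (size_t)len &&
            memcmp(decl, name, (size_t)len) == 0) {
            prog->params[i].value[0] = x;
            prog->params[i].value[1] = y;
            prog->params[i].value[2] = z;
            prog->params[i].value[3] = w;
            return NP_NO_ERROR;
        }
    }
    return NP_INVALID_VALUE;        /* no local parameter with that name */
}
```

    Passing the length explicitly lets a caller name a parameter from
    the middle of a larger buffer without copying or terminating it.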
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The commands
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      void ProgramLocalParameter4fARB(enum target, uint index,
5bd8deadSopenharmony_ci                                      float x, float y, float z, float w);
5bd8deadSopenharmony_ci      void ProgramLocalParameter4fvARB(enum target, uint index,
5bd8deadSopenharmony_ci                                       const float *params);
5bd8deadSopenharmony_ci      void ProgramLocalParameter4dARB(enum target, uint index,
5bd8deadSopenharmony_ci                                      double x, double y, double z, double w);
5bd8deadSopenharmony_ci      void ProgramLocalParameter4dvARB(enum target, uint index,
5bd8deadSopenharmony_ci                                       const double *params);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    update the values of the numbered program local parameter <index>
5bd8deadSopenharmony_ci    belonging to the program object currently bound to <target>.  For
5bd8deadSopenharmony_ci    ProgramLocalParameter4fARB and ProgramLocalParameter4dARB, the four
5bd8deadSopenharmony_ci    components of the parameter are updated with the values of <x>, <y>, <z>,
5bd8deadSopenharmony_ci    and <w>, respectively.  For ProgramLocalParameter4fvARB and
5bd8deadSopenharmony_ci    ProgramLocalParameter4dvARB, the four components of the parameter are
5bd8deadSopenharmony_ci    updated with the array of four values pointed to by <params>.  The error
5bd8deadSopenharmony_ci    INVALID_VALUE is generated if <index> is greater than or equal to the
5bd8deadSopenharmony_ci    number of numbered program local parameters supported by <target>.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciAdditions to Chapter 6 of the OpenGL 1.2.1 Specification (State and
5bd8deadSopenharmony_ciState Requests)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Modify Section 6.1.11, Pointer and String Queries (p. 206)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (modify last paragraph, p. 206) ... The possible values for <name> are
5bd8deadSopenharmony_ci    VENDOR, RENDERER, VERSION, EXTENSIONS, and PROGRAM_ERROR_STRING_NV.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (add after last paragraph of section, p. 207) Queries of
5bd8deadSopenharmony_ci    PROGRAM_ERROR_STRING_NV return a pointer to an implementation-dependent
5bd8deadSopenharmony_ci    program load error string.  If the last call to LoadProgramNV failed to
5bd8deadSopenharmony_ci    load a program, the returned string describes a reason that the program
5bd8deadSopenharmony_ci    failed to load.  Otherwise, a pointer to an empty string (containing only
5bd8deadSopenharmony_ci    a terminator) is returned.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Rename and modify Section 6.1.13, Vertex and Fragment Program Queries
5bd8deadSopenharmony_ci    (from GL_NV_vertex_program).  Portions of this section pertaining to
5bd8deadSopenharmony_ci    fragment programs are copied verbatim.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (insert after discussion of GetProgramParameter[fd]vNV)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The commands
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      void GetProgramNamedParameterfvNV(uint id, sizei len,
5bd8deadSopenharmony_ci                                        const ubyte *name, float *params);
5bd8deadSopenharmony_ci      void GetProgramNamedParameterdvNV(uint id, sizei len,
5bd8deadSopenharmony_ci                                        const ubyte *name, double *params);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    obtain the current program named local parameter value for the parameter
5bd8deadSopenharmony_ci    named <name> belonging to the program given by <id>.  <name> is a pointer
5bd8deadSopenharmony_ci    to an array of ubytes holding the parameter name.  <len> specifies the
5bd8deadSopenharmony_ci    number of ubytes in the array given by <name>.  The error
5bd8deadSopenharmony_ci    INVALID_OPERATION is generated if <id> specifies a nonexistent program or
5bd8deadSopenharmony_ci    a program whose type does not support named local parameters.  The error
5bd8deadSopenharmony_ci    INVALID_VALUE is generated if <name> does not specify the name of a local
5bd8deadSopenharmony_ci    parameter in the program corresponding to <id>.  The error INVALID_VALUE
5bd8deadSopenharmony_ci    is also generated if <len> is zero.  Each named program local parameter is
5bd8deadSopenharmony_ci    an array of four values.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The commands
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      void GetProgramLocalParameterdvARB(enum target, uint index,
5bd8deadSopenharmony_ci                                         double *params);
5bd8deadSopenharmony_ci      void GetProgramLocalParameterfvARB(enum target, uint index,
5bd8deadSopenharmony_ci                                         float *params);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    obtain the current value for the numbered program local parameter <index>
5bd8deadSopenharmony_ci    belonging to the program object currently bound to <target>, and place
5bd8deadSopenharmony_ci    the information in the array <params>.  The error INVALID_ENUM is
5bd8deadSopenharmony_ci    generated if <target> specifies a nonexistent program target or a program
5bd8deadSopenharmony_ci    target that does not support numbered program local parameters.  The error
5bd8deadSopenharmony_ci    INVALID_VALUE is generated if <index> is greater than or equal to the
5bd8deadSopenharmony_ci    implementation-dependent number of supported numbered program local
5bd8deadSopenharmony_ci    parameters for the program target.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    When the program target type is FRAGMENT_PROGRAM_NV, each numbered program
5bd8deadSopenharmony_ci    local parameter returned is an array of four values.  ...
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The command
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      void GetProgramivNV(uint id, enum pname, int *params);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    obtains the program state named by pname for the program named id and
5bd8deadSopenharmony_ci    returns it in params.  pname must be one of PROGRAM_TARGET_NV, PROGRAM_LENGTH_NV, or
5bd8deadSopenharmony_ci    PROGRAM_RESIDENT_NV.  The error INVALID_OPERATION is generated if the
5bd8deadSopenharmony_ci    program named id does not exist.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The command
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      void GetProgramStringNV(uint id, enum pname,
5bd8deadSopenharmony_ci                              ubyte *program);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    obtains the program string for program id.  pname must be
5bd8deadSopenharmony_ci    PROGRAM_STRING_NV.  n ubytes are returned into the array program
5bd8deadSopenharmony_ci    where n is the length of the program in ubytes.  GetProgramivNV with
5bd8deadSopenharmony_ci    PROGRAM_LENGTH_NV can be used to query the length of a program's
5bd8deadSopenharmony_ci    string.  The INVALID_OPERATION error is generated if the program
5bd8deadSopenharmony_ci    named id does not exist.
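
    Retrieving a program string is therefore a two-step query: length
    first, then contents.  The sketch below shows that pattern with the two
    GL queries abstracted behind hypothetical callbacks (QueryLengthFn and
    QueryStringFn, which simplify the real signatures) so that it runs
    without a GL context.  The text above does not mention a terminator,
    so the sketch assumes none is written and appends its own.  The
    demo_* fakes and the program text they return are illustrative only.

```c
#include <stdlib.h>
#include <string.h>

/* Hypothetical callbacks standing in for GetProgramivNV with
   PROGRAM_LENGTH_NV and GetProgramStringNV with PROGRAM_STRING_NV. */
typedef int  (*QueryLengthFn)(unsigned id);
typedef void (*QueryStringFn)(unsigned id, unsigned char *program);

/* Returns a malloc'd, NUL-terminated copy of the program string, or NULL
   on failure.  The caller releases it with free(). */
char *fetch_program_string(unsigned id, QueryLengthFn query_length,
                           QueryStringFn query_string)
{
    int n = query_length(id);
    unsigned char *buf;

    if (n < 0)
        return NULL;
    buf = malloc((size_t)n + 1);    /* one extra byte for our terminator */
    if (buf == NULL)
        return NULL;
    query_string(id, buf);          /* assumed to write exactly n ubytes */
    buf[n] = '\0';
    return (char *)buf;
}

/* In-memory fakes so the sketch can be exercised without a GL context. */
static const char demo_source[] = "!!FP1.0\nMOV o[COLR], f[COL0];\nEND";

static int demo_query_length(unsigned id)
{
    (void)id;
    return (int)strlen(demo_source);
}

static void demo_query_string(unsigned id, unsigned char *program)
{
    (void)id;
    memcpy(program, demo_source, strlen(demo_source));  /* no terminator */
}
```

    Allocating one byte more than the reported length keeps the result
    safe to pass to string functions regardless of what the query wrote.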
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    ...
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The command
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      boolean IsProgramNV(uint id);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    returns TRUE if id is the name of a program object.  If id is zero or
5bd8deadSopenharmony_ci    is a non-zero value that is not the name of a program object, or if an
5bd8deadSopenharmony_ci    error condition occurs, IsProgramNV returns FALSE.  A name returned by
5bd8deadSopenharmony_ci    GenProgramsNV but not yet loaded with a program is not the name of a
5bd8deadSopenharmony_ci    program object.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciAdditions to Appendix F of the OpenGL 1.2.1 Specification (ARB Extensions)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Modify Section F.2.3 (Changes to Section 2.6), p.240
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (modify last paragraph on p.240) ... Multiple sets of texture coordinates
5bd8deadSopenharmony_ci    may be used to specify how multiple texture images are mapped onto a
5bd8deadSopenharmony_ci    primitive.  The number of texture coordinate sets supported is
5bd8deadSopenharmony_ci    implementation dependent, but must be at least 1.  The number of texture
5bd8deadSopenharmony_ci    coordinate sets supported may be queried with the state
5bd8deadSopenharmony_ci    MAX_TEXTURE_COORDS_NV.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Modify Section F.2.4 (Changes to Section 2.7), p.241
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (modify the last paragraph on p.241, carrying over to p.243)
5bd8deadSopenharmony_ci    Implementations may support more than one set of texture coordinates.  The
5bd8deadSopenharmony_ci    commands
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        void MultiTexCoord{1234}{sifd}ARB(enum texture, T coords)
5bd8deadSopenharmony_ci        void MultiTexCoord{1234}{sifd}vARB(enum texture, T coords)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    take the coordinate set to be modified as the <texture> parameter.
5bd8deadSopenharmony_ci    <texture> is a symbolic constant of the form TEXTUREi_ARB, indicating that
5bd8deadSopenharmony_ci    texture coordinate set i is to be modified.  The constants obey
5bd8deadSopenharmony_ci    TEXTUREi_ARB = TEXTURE0_ARB + i (i is in the range 0 to k-1, where k is
5bd8deadSopenharmony_ci    the implementation dependent number of texture coordinate sets defined
5bd8deadSopenharmony_ci    by MAX_TEXTURE_COORDS_NV).
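
    The enum arithmetic can be written out directly.  In this sketch the
    helper names texture_enum_for_set and set_for_texture_enum are
    hypothetical, and k stands for whatever value the implementation
    reports for MAX_TEXTURE_COORDS_NV.

```c
#define TEXTURE0_ARB 0x84C0u    /* value of GL_TEXTURE0_ARB */

/* Returns the TEXTUREi_ARB enum for coordinate set i, or 0 if i is
   outside the range 0 to k-1. */
unsigned texture_enum_for_set(unsigned i, unsigned k)
{
    return (i < k) ? TEXTURE0_ARB + i : 0u;
}

/* Inverse mapping: returns the coordinate set index for a TEXTUREi_ARB
   enum, or -1 if the enum does not name one of the k supported sets. */
int set_for_texture_enum(unsigned texture, unsigned k)
{
    if (texture < TEXTURE0_ARB || texture - TEXTURE0_ARB >= k)
        return -1;
    return (int)(texture - TEXTURE0_ARB);
}
```

    Because the constants are consecutive, a loop over coordinate sets
    can compute each enum instead of listing them by name.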
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Modify Section F.2.5 (Changes to Section 2.8), p.243
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (modify first and second paragraphs of section) ... The client may specify
5bd8deadSopenharmony_ci    up to 5 plus the value of MAX_TEXTURE_COORDS_NV arrays; one each to store
5bd8deadSopenharmony_ci    vertex coordinates...
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    In implementations which support more than one texture coordinate set, the
5bd8deadSopenharmony_ci    command
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        void ClientActiveTextureARB(enum texture)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    is used to select the vertex array client state parameters to be modified
5bd8deadSopenharmony_ci    by the TexCoordPointer command and the array affected by EnableClientState
5bd8deadSopenharmony_ci    and DisableClientState with the parameter TEXTURE_COORD_ARRAY.  This
5bd8deadSopenharmony_ci    command sets the state variable CLIENT_ACTIVE_TEXTURE_ARB.  Each texture
5bd8deadSopenharmony_ci    coordinate set has a client state vector which is selected when this
5bd8deadSopenharmony_ci    command is invoked.  This state vector also includes the vertex array
5bd8deadSopenharmony_ci    state.  This command also selects the texture coordinate set state used
5bd8deadSopenharmony_ci    for queries of client state.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (modify first paragraph on p.244) If the number of supported texture
5bd8deadSopenharmony_ci    coordinate sets (the value of MAX_TEXTURE_COORDS_NV) is k, ...
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Modify Section F.2.6 (Changes to Section 2.10.2), p.244
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (modify first paragraph)  For each texture coordinate set, a 4x4 matrix is
5bd8deadSopenharmony_ci    applied to the corresponding texture coordinates...
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (replace second and third paragraphs) The command
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      void ActiveTextureARB(enum texture);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    specifies the active texture unit selector, ACTIVE_TEXTURE_ARB.  Each
5bd8deadSopenharmony_ci    texture unit contains up to two distinct sub-units:  a texture coordinate
5bd8deadSopenharmony_ci    processing unit (consisting of a texture matrix stack and texture
5bd8deadSopenharmony_ci    coordinate generation state) and a texture image unit (consisting of all
5bd8deadSopenharmony_ci    the texture state defined in Section 3.8).  In implementations with a
5bd8deadSopenharmony_ci    different number of supported texture coordinate sets and texture image
5bd8deadSopenharmony_ci    units, some texture units may consist of only one of the two sub-units.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The active texture unit selector specifies the texture unit accessed by
5bd8deadSopenharmony_ci    commands involving texture coordinate processing.  Such commands include
5bd8deadSopenharmony_ci    those accessing the current matrix stack (if MATRIX_MODE is TEXTURE),
5bd8deadSopenharmony_ci    TexGen (Section 2.10.4), Enable/Disable (if any texture coordinate
5bd8deadSopenharmony_ci    generation enum is selected), as well as queries of the current texture
5bd8deadSopenharmony_ci    coordinates and current raster texture coordinates.  If the texture unit
5bd8deadSopenharmony_ci    number corresponding to the current value of ACTIVE_TEXTURE_ARB is greater
5bd8deadSopenharmony_ci    than or equal to the implementation dependent constant
5bd8deadSopenharmony_ci    MAX_TEXTURE_COORDS_NV, the error INVALID_OPERATION is generated by any
5bd8deadSopenharmony_ci    such command.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The active texture unit selector also selects the texture unit accessed by
5bd8deadSopenharmony_ci    commands involving texture image processing (Section 3.8).  Such commands
5bd8deadSopenharmony_ci    include all variants of TexEnv, TexParameter, and TexImage commands,
5bd8deadSopenharmony_ci    BindTexture, Enable/Disable for any texture target (e.g., TEXTURE_2D), and
5bd8deadSopenharmony_ci    queries of all such state.  If the texture unit number corresponding to
5bd8deadSopenharmony_ci    the current value of ACTIVE_TEXTURE_ARB is greater than or equal to the
5bd8deadSopenharmony_ci    implementation dependent constant MAX_TEXTURE_IMAGE_UNITS_NV, the error
5bd8deadSopenharmony_ci    INVALID_OPERATION is generated by any such command.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    ActiveTextureARB generates the error INVALID_ENUM if an invalid <texture>
5bd8deadSopenharmony_ci    is specified.  <texture> is a symbolic constant of the form TEXTUREi_ARB,
5bd8deadSopenharmony_ci    indicating that texture unit i is to be modified.  The constants obey
5bd8deadSopenharmony_ci    TEXTUREi_ARB = TEXTURE0_ARB + i (i is in the range 0 to k-1, where k is
5bd8deadSopenharmony_ci    the larger of MAX_TEXTURE_COORDS_NV and MAX_TEXTURE_IMAGE_UNITS_NV).
5bd8deadSopenharmony_ci    For compatibility with old OpenGL specifications, the implementation
5bd8deadSopenharmony_ci    dependent constant MAX_TEXTURE_UNITS_ARB specifies the number of
5bd8deadSopenharmony_ci    conventional texture units supported by the implementation.  Its value
5bd8deadSopenharmony_ci    must be no larger than the minimum of MAX_TEXTURE_COORDS_NV and
5bd8deadSopenharmony_ci    MAX_TEXTURE_IMAGE_UNITS_NV.
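
    The interaction between the selector and the two limits can be
    summarized in a small model.  This is a sketch only: TextureLimits and
    the model_* functions are hypothetical, and the limit values are
    supplied by the caller rather than taken from any particular
    implementation.

```c
enum {
    ATM_NO_ERROR          = 0,
    ATM_INVALID_ENUM      = 0x0500,
    ATM_INVALID_OPERATION = 0x0502
};

#define ATM_TEXTURE0_ARB 0x84C0u    /* value of GL_TEXTURE0_ARB */

/* Hypothetical container for the two implementation-dependent limits. */
typedef struct {
    unsigned max_texture_coords;        /* MAX_TEXTURE_COORDS_NV */
    unsigned max_texture_image_units;   /* MAX_TEXTURE_IMAGE_UNITS_NV */
} TextureLimits;

/* Models ActiveTextureARB: accepts TEXTUREi_ARB for i in 0 to k-1, where
   k is the larger of the two limits. */
int model_active_texture(const TextureLimits *lim, unsigned texture,
                         unsigned *active_unit)
{
    unsigned k = lim->max_texture_coords > lim->max_texture_image_units
                     ? lim->max_texture_coords
                     : lim->max_texture_image_units;

    if (texture < ATM_TEXTURE0_ARB || texture - ATM_TEXTURE0_ARB >= k)
        return ATM_INVALID_ENUM;
    *active_unit = texture - ATM_TEXTURE0_ARB;
    return ATM_NO_ERROR;
}

/* Models a command that touches texture coordinate processing state
   (texture matrix stack, TexGen). */
int model_coord_command(const TextureLimits *lim, unsigned active_unit)
{
    return active_unit >= lim->max_texture_coords ? ATM_INVALID_OPERATION
                                                  : ATM_NO_ERROR;
}

/* Models a command that touches texture image state (TexImage,
   TexParameter, BindTexture, and so on). */
int model_image_command(const TextureLimits *lim, unsigned active_unit)
{
    return active_unit >= lim->max_texture_image_units ? ATM_INVALID_OPERATION
                                                       : ATM_NO_ERROR;
}
```

    With hypothetical limits of 8 coordinate sets and 16 image units,
    selecting unit 12 succeeds, image commands on it are accepted, and
    coordinate-processing commands generate INVALID_OPERATION.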
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Modify Section F.2.12 (Changes to Section 3.8.10), p.249
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (modify next-to-last paragraph) Texturing is enabled and disabled
5bd8deadSopenharmony_ci    individually for each texture unit.  If texturing is disabled for one of
5bd8deadSopenharmony_ci    the units, then the fragment resulting from the previous unit is passed
5bd8deadSopenharmony_ci    unaltered to the following unit.  Individual texture units beyond those
5bd8deadSopenharmony_ci    specified by MAX_TEXTURE_UNITS_ARB may be incomplete and are always
5bd8deadSopenharmony_ci    treated as disabled.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Modify Section F.2.15 (Changes to Section 6.1.2), p.251
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (add to end of paragraph) Queries of texture state variables corresponding
5bd8deadSopenharmony_ci    to the texture coordinate processing unit (namely, TexGen state and enables,
5bd8deadSopenharmony_ci    and matrices) will produce an INVALID_OPERATION error if the value of
5bd8deadSopenharmony_ci    ACTIVE_TEXTURE_ARB is greater than or equal to MAX_TEXTURE_COORDS_NV.  All
5bd8deadSopenharmony_ci    other texture state queries will result in an INVALID_OPERATION error if
5bd8deadSopenharmony_ci    the value of ACTIVE_TEXTURE_ARB is greater than or equal to
5bd8deadSopenharmony_ci    MAX_TEXTURE_IMAGE_UNITS_NV.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciAdditions to the AGL/GLX/WGL Specifications
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Program objects are shared between AGL/GLX/WGL rendering contexts if
5bd8deadSopenharmony_ci    and only if the rendering contexts share display lists.  No change
5bd8deadSopenharmony_ci    is made to the AGL/GLX/WGL API.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciDependencies on GL_NV_vertex_program
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If NV_vertex_program is supported, the description of LoadProgramNV in
5bd8deadSopenharmony_ci    Section 2.14.1.7 (up to the BNF description of vertex programs) is
5bd8deadSopenharmony_ci    deleted, as it is replaced by the contents of Section 5.7 in this
5bd8deadSopenharmony_ci    specification.  The general error descriptions in Section 2.14.1.7 common
5bd8deadSopenharmony_ci    to Section 5.7 (like INVALID_OPERATION if the program fails to compile)
5bd8deadSopenharmony_ci    should also be deleted.  Section 2.14.1.8 should also be deleted.  Section
5bd8deadSopenharmony_ci    6.1.13 is modified by this specification as described above.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciDependencies on NV_texture_shader
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If NV_texture_shader is not supported, the comment about texture shaders
5bd8deadSopenharmony_ci    being disabled in fragment program mode is not applicable.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciDependencies on NV_texture_rectangle
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If NV_texture_rectangle is not supported, the references to "RECT" in the
5bd8deadSopenharmony_ci    <texImageTarget> grammar rule and TEXTURE_RECTANGLE_NV are not applicable.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciDependencies on ARB_texture_cube_map
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If ARB_texture_cube_map is not supported, the references to "CUBE" in the
5bd8deadSopenharmony_ci    <texImageTarget> grammar rule and TEXTURE_CUBE_MAP_ARB are not applicable.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciDependencies on EXT_fog_coord
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If EXT_fog_coord is not supported, references to "fog coordinate" in the
5bd8deadSopenharmony_ci    definition of the "FOGC" fragment attribute register should be removed.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciDependencies on NV_depth_clamp
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If NV_depth_clamp is not supported, section 3.11.6 is modified to remove
5bd8deadSopenharmony_ci    discussion of the depth clamp enable and instead indicate that fragments
5bd8deadSopenharmony_ci    with depth values outside [min(n,f), max(n,f)] are always discarded.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciDependencies on ARB_depth_texture and SGIX_depth_texture
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If ARB_depth_texture is not supported, but SGIX_depth_texture is
5bd8deadSopenharmony_ci    supported, the discussion of Table X.5 is modified to indicate that
5bd8deadSopenharmony_ci    DEPTH_COMPONENT textures are treated as LUMINANCE.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If neither extension is supported, the discussion of DEPTH_COMPONENT
5bd8deadSopenharmony_ci    textures in Table X.5 should be removed.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciDependencies on NV_float_buffer
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If NV_float_buffer is not supported, references to FLOAT_R_NV,
5bd8deadSopenharmony_ci    FLOAT_RG_NV, FLOAT_RGB_NV, and FLOAT_RGBA_NV internal texture formats in
5bd8deadSopenharmony_ci    Table X.5 should be removed.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciDependencies on ARB_vertex_program
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    This extension has no explicit dependency on ARB_vertex_program, but the
5bd8deadSopenharmony_ci    APIs for setting and querying numbered local parameters
5bd8deadSopenharmony_ci    (ProgramLocalParameter*ARB and GetProgramLocalParameter*ARB) were taken
5bd8deadSopenharmony_ci    directly from that extension.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciDependencies on ARB_fragment_program
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If ARB_fragment_program is not supported, the maximum number of executable
5bd8deadSopenharmony_ci    instructions in any !!FP1.0 program is 1024.  If ARB_fragment_program is
5bd8deadSopenharmony_ci    supported, the maximum number of executable instructions for an !!FP1.0
5bd8deadSopenharmony_ci    program is at least 1024, but can be larger.  The limit can be queried by
5bd8deadSopenharmony_ci    calling GetProgramivARB with <target> set to FRAGMENT_PROGRAM_ARB and
5bd8deadSopenharmony_ci    <pname> set to MAX_PROGRAM_INSTRUCTIONS_ARB.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciGLX Protocol
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Most of the GLX protocol needed to implement this extension is described
5bd8deadSopenharmony_ci    in the GL_NV_vertex_program extension specification and will not be
5bd8deadSopenharmony_ci    repeated here.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The following two rendering commands are potentially large, and hence can
5bd8deadSopenharmony_ci    be sent in a glXRender or glXRenderLarge request.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        ProgramNamedParameter4fvNV
5bd8deadSopenharmony_ci            2           28+len+p        rendering command length
5bd8deadSopenharmony_ci            2           4218            rendering command opcode
5bd8deadSopenharmony_ci            4           CARD32          id
5bd8deadSopenharmony_ci            4           CARD32          len
5bd8deadSopenharmony_ci            4           FLOAT32         params[0]
5bd8deadSopenharmony_ci            4           FLOAT32         params[1]
5bd8deadSopenharmony_ci            4           FLOAT32         params[2]
5bd8deadSopenharmony_ci            4           FLOAT32         params[3]
5bd8deadSopenharmony_ci            len         LISTofCARD8     name
5bd8deadSopenharmony_ci            p                           unused, p=pad(len)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci         If the command is encoded in a glXRenderLarge request, the command
5bd8deadSopenharmony_ci         opcode and command length fields above are expanded to 4 bytes each:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci            4           32+len+p        rendering command length
5bd8deadSopenharmony_ci            4           4218            rendering command opcode
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        ProgramNamedParameter4dvNV
5bd8deadSopenharmony_ci            2           44+len+p        rendering command length
5bd8deadSopenharmony_ci            2           4219            rendering command opcode
5bd8deadSopenharmony_ci            4           CARD32          id
5bd8deadSopenharmony_ci            4           CARD32          len
5bd8deadSopenharmony_ci            8           FLOAT64         params[0]
5bd8deadSopenharmony_ci            8           FLOAT64         params[1]
5bd8deadSopenharmony_ci            8           FLOAT64         params[2]
5bd8deadSopenharmony_ci            8           FLOAT64         params[3]
5bd8deadSopenharmony_ci            len         LISTofCARD8     name
5bd8deadSopenharmony_ci            p                           unused, p=pad(len)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci         If the command is encoded in a glXRenderLarge request, the command
5bd8deadSopenharmony_ci         opcode and command length fields above are expanded to 4 bytes each:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci            4           48+len+p        rendering command length
5bd8deadSopenharmony_ci            4           4219            rendering command opcode
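
    The command lengths in the tables above follow from the field sizes
    plus the usual X protocol padding, pad(len) = (4 - (len mod 4)) mod 4.
    The helper names in this sketch (glx_pad, named_param4fv_cmd_length,
    named_param4dv_cmd_length) are hypothetical and are shown only to make
    the arithmetic concrete.

```c
/* Bytes needed to round len up to a multiple of 4, i.e. pad(len). */
unsigned glx_pad(unsigned len)
{
    return (4u - (len & 3u)) & 3u;
}

/* ProgramNamedParameter4fvNV: length and opcode fields, id, len,
   four FLOAT32 values, the name bytes, and padding. */
unsigned named_param4fv_cmd_length(unsigned len, int render_large)
{
    unsigned header = render_large ? 8u : 4u;   /* 4+4 or 2+2 bytes */
    return header + 4u + 4u + 4u * 4u + len + glx_pad(len);
}

/* ProgramNamedParameter4dvNV: same layout with four FLOAT64 values. */
unsigned named_param4dv_cmd_length(unsigned len, int render_large)
{
    unsigned header = render_large ? 8u : 4u;
    return header + 4u + 4u + 4u * 8u + len + glx_pad(len);
}
```

    For a five-byte name, for instance, p is 3, so the 4fv command
    occupies 28+5+3 = 36 bytes in a glXRender request and 40 bytes in a
    glXRenderLarge request.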
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The remaining two commands are non-rendering commands.  These commands are
5bd8deadSopenharmony_ci    sent separately (i.e., not as part of a glXRender or glXRenderLarge
5bd8deadSopenharmony_ci    request), using the glXVendorPrivateWithReply request:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        GetProgramNamedParameterfvNV
5bd8deadSopenharmony_ci            1           CARD8           opcode (X assigned)
5bd8deadSopenharmony_ci            1           17              GLX opcode (glXVendorPrivateWithReply)
5bd8deadSopenharmony_ci            2           4+(len+p)/4     request length
5bd8deadSopenharmony_ci            4           1310            vendor specific opcode
5bd8deadSopenharmony_ci            4           GLX_CONTEXT_TAG context tag
5bd8deadSopenharmony_ci            4           INT32           len
5bd8deadSopenharmony_ci            len         LISTofCARD8     name
5bd8deadSopenharmony_ci            p                           unused, p=pad(len)
5bd8deadSopenharmony_ci          =>
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci          If the command succeeds, 4 floats are sent in the reply:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci            1           1               reply
5bd8deadSopenharmony_ci            1                           unused
5bd8deadSopenharmony_ci            2           CARD16          sequence number
5bd8deadSopenharmony_ci            4           4               reply length
5bd8deadSopenharmony_ci            24                          unused
5bd8deadSopenharmony_ci            16          LISTofFLOAT32   params
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci          Otherwise, an empty reply is sent, indicating that a GL error
5bd8deadSopenharmony_ci          occurred:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci            1           1               reply
5bd8deadSopenharmony_ci            1                           unused
5bd8deadSopenharmony_ci            2           CARD16          sequence number
5bd8deadSopenharmony_ci            4           0               reply length
5bd8deadSopenharmony_ci            24                          unused
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        GetProgramNamedParameterdvNV
5bd8deadSopenharmony_ci            1           CARD8           opcode (X assigned)
5bd8deadSopenharmony_ci            1           17              GLX opcode (glXVendorPrivateWithReply)
5bd8deadSopenharmony_ci            2           4+(len+p)/4     request length
5bd8deadSopenharmony_ci            4           1311            vendor specific opcode
5bd8deadSopenharmony_ci            4           GLX_CONTEXT_TAG context tag
5bd8deadSopenharmony_ci            4           INT32           len
5bd8deadSopenharmony_ci            len         LISTofCARD8     name
5bd8deadSopenharmony_ci            p                           unused, p=pad(len)
5bd8deadSopenharmony_ci          =>
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci          If the command succeeds, 4 doubles are sent in the reply:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci            1           1               reply
5bd8deadSopenharmony_ci            1                           unused
5bd8deadSopenharmony_ci            2           CARD16          sequence number
5bd8deadSopenharmony_ci            4           8               reply length
5bd8deadSopenharmony_ci            24                          unused
5bd8deadSopenharmony_ci            32          LISTofFLOAT64   params
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci          Otherwise, an empty reply is sent, indicating that a GL error
5bd8deadSopenharmony_ci          occurred:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci            1           1               reply
5bd8deadSopenharmony_ci            1                           unused
5bd8deadSopenharmony_ci            2           CARD16          sequence number
5bd8deadSopenharmony_ci            4           0               reply length
5bd8deadSopenharmony_ci            24                          unused
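
          The length fields in the GetProgramNamedParameterdvNV encoding
          above can be sketched in C as follows.  This is a minimal,
          illustrative sketch, not part of the protocol definition: the
          function names are hypothetical, and pad(n) is assumed to be the
          usual X protocol padding (the number of bytes needed to round n up
          to a multiple of 4).

```c
#include <assert.h>
#include <stdint.h>

/* Assumed definition of the protocol's pad(n): bytes needed to round n
 * up to the next multiple of 4. */
uint32_t glx_pad(uint32_t n)
{
    return (4u - (n & 3u)) & 3u;
}

/* Request length, in 4-byte units, for GetProgramNamedParameterdvNV.
 * The fixed part is 16 bytes (opcode, GLX opcode, request length,
 * vendor opcode, context tag, len) = 4 units, followed by the padded
 * name. */
uint32_t named_param_request_length(uint32_t len)
{
    assert(len > 0);  /* a zero <len> is an INVALID_VALUE error case */
    return 4u + (len + glx_pad(len)) / 4u;
}

/* Reply length, in 4-byte units, for a successful query: four FLOAT64
 * values of 8 bytes each. */
uint32_t named_param_dv_reply_length(void)
{
    return (4u * 8u) / 4u;
}
```

          For a five-character name, this gives p = 3 and a request length
          of 4 + (5+3)/4 = 6.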
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciErrors
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    INVALID_OPERATION is generated by Begin, DrawPixels, Bitmap, CopyPixels,
5bd8deadSopenharmony_ci    or a command that performs an explicit Begin if FRAGMENT_PROGRAM_NV is
5bd8deadSopenharmony_ci    enabled and the currently bound fragment program does not exist.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    INVALID_OPERATION is generated by ProgramNamedParameter4fNV,
5bd8deadSopenharmony_ci    ProgramNamedParameter4dNV, ProgramNamedParameter4fvNV,
5bd8deadSopenharmony_ci    ProgramNamedParameter4dvNV, GetProgramNamedParameterfvNV, or
5bd8deadSopenharmony_ci    GetProgramNamedParameterdvNV if <id> specifies a nonexistent program or a
5bd8deadSopenharmony_ci    program whose type does not support local parameters.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    INVALID_VALUE is generated by ProgramNamedParameter4fNV,
5bd8deadSopenharmony_ci    ProgramNamedParameter4dNV, ProgramNamedParameter4fvNV,
5bd8deadSopenharmony_ci    ProgramNamedParameter4dvNV, GetProgramNamedParameterfvNV, or
5bd8deadSopenharmony_ci    GetProgramNamedParameterdvNV if <len> is zero.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    INVALID_VALUE is generated by ProgramNamedParameter4fNV,
5bd8deadSopenharmony_ci    ProgramNamedParameter4dNV, ProgramNamedParameter4fvNV,
5bd8deadSopenharmony_ci    ProgramNamedParameter4dvNV, GetProgramNamedParameterfvNV, or
5bd8deadSopenharmony_ci    GetProgramNamedParameterdvNV if <name> does not specify the name of a
5bd8deadSopenharmony_ci    local parameter in the program corresponding to <id>.
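
    The three named-parameter error conditions above can be modeled with a
    small C sketch.  Everything here is illustrative: the program_model
    structure and the function name are hypothetical stand-ins for driver
    state, and the order in which the checks are made is an assumption, since
    this section does not specify which error takes precedence when more than
    one condition applies.  The error code values are the standard ones from
    gl.h.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

#define GL_NO_ERROR          0
#define GL_INVALID_VALUE     0x0501
#define GL_INVALID_OPERATION 0x0502

/* Hypothetical stand-in for the state of the program named by <id>. */
struct program_model {
    int exists;                 /* nonzero if the program exists */
    int supports_local_params;  /* nonzero for program types with locals */
    const char *const *names;   /* names of declared local parameters */
    size_t name_count;
};

/* Returns the GL error that a ProgramNamedParameter*NV or
 * GetProgramNamedParameter*NV call would generate, or GL_NO_ERROR. */
unsigned named_parameter_error(const struct program_model *prog,
                               size_t len, const unsigned char *name)
{
    size_t i;

    if (prog == NULL || !prog->exists || !prog->supports_local_params)
        return GL_INVALID_OPERATION;
    if (len == 0)
        return GL_INVALID_VALUE;

    assert(name != NULL);
    for (i = 0; i < prog->name_count; i++) {
        /* <name> is a counted string, so compare length and bytes. */
        if (strlen(prog->names[i]) == len &&
            memcmp(prog->names[i], name, len) == 0)
            return GL_NO_ERROR;
    }
    return GL_INVALID_VALUE;  /* no local parameter with that name */
}
```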
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    INVALID_OPERATION is generated by any command accessing texture coordinate
5bd8deadSopenharmony_ci    processing state if the texture unit number corresponding to the current
5bd8deadSopenharmony_ci    value of ACTIVE_TEXTURE_ARB is greater than or equal to the implementation
5bd8deadSopenharmony_ci    dependent constant MAX_TEXTURE_COORDS_NV.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    INVALID_OPERATION is generated by any command accessing texture image
5bd8deadSopenharmony_ci    processing state if the texture unit number corresponding to the current
5bd8deadSopenharmony_ci    value of ACTIVE_TEXTURE_ARB is greater than or equal to the implementation
5bd8deadSopenharmony_ci    dependent constant MAX_TEXTURE_IMAGE_UNITS_NV.
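
    Because the two limits are independent, a given active texture unit may be
    legal for one class of state and illegal for the other.  The following C
    sketch models the two checks above.  The function names are hypothetical,
    and the limits are passed in as arguments rather than queried, so that the
    sketch stands alone; TEXTURE0_ARB has its standard value from glext.h.

```c
#include <assert.h>

#define GL_NO_ERROR          0
#define GL_INVALID_OPERATION 0x0502
#define GL_TEXTURE0_ARB      0x84C0

/* Error for a command that accesses texture coordinate processing
 * state, given the current ACTIVE_TEXTURE_ARB value and the value of
 * MAX_TEXTURE_COORDS_NV. */
unsigned texcoord_state_error(unsigned active_texture,
                              unsigned max_texture_coords)
{
    unsigned unit;

    assert(active_texture >= GL_TEXTURE0_ARB);
    unit = active_texture - GL_TEXTURE0_ARB;
    return unit >= max_texture_coords ? GL_INVALID_OPERATION : GL_NO_ERROR;
}

/* Error for a command that accesses texture image processing state,
 * given the current ACTIVE_TEXTURE_ARB value and the value of
 * MAX_TEXTURE_IMAGE_UNITS_NV. */
unsigned teximage_state_error(unsigned active_texture,
                              unsigned max_texture_image_units)
{
    unsigned unit;

    assert(active_texture >= GL_TEXTURE0_ARB);
    unit = active_texture - GL_TEXTURE0_ARB;
    return unit >= max_texture_image_units ? GL_INVALID_OPERATION
                                           : GL_NO_ERROR;
}
```

    With hypothetical limits of 2 coordinate sets and 4 image units, texture
    unit 2 passes the image state check but fails the coordinate state check.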
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (The following are error descriptions copied from GL_NV_vertex_program
5bd8deadSopenharmony_ci     that apply to this extension as well.  These modifications do not affect
5bd8deadSopenharmony_ci     the behavior of that extension.)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    INVALID_VALUE is generated by LoadProgramNV if id is zero.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    INVALID_OPERATION is generated by LoadProgramNV if the program
5bd8deadSopenharmony_ci    corresponding to id is currently loaded but has a program type different
5bd8deadSopenharmony_ci    from that given by target.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    INVALID_OPERATION is generated by LoadProgramNV if the program specified
5bd8deadSopenharmony_ci    is syntactically incorrect for the program type specified by target.  The
5bd8deadSopenharmony_ci    value of PROGRAM_ERROR_POSITION_NV is still updated when this error is
5bd8deadSopenharmony_ci    generated.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    INVALID_OPERATION is generated by LoadProgramNV if the program specified
5bd8deadSopenharmony_ci    fails to conform to any of the semantic restrictions imposed on programs
5bd8deadSopenharmony_ci    of the type specified by target.  The value of PROGRAM_ERROR_POSITION_NV
5bd8deadSopenharmony_ci    is still updated when this error is generated.
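
    Since PROGRAM_ERROR_POSITION_NV is updated on both syntactic and semantic
    failures, an application can use it to report where a load failed.  The
    sketch below is a hypothetical helper, not part of this extension, that
    converts the position into a line and column.  It assumes the position is
    a byte offset into the program string, as described in NV_vertex_program,
    and that the application has already checked that the value is not -1.

```c
#include <assert.h>
#include <stddef.h>

/* One-based line and column of a byte offset within a program string. */
struct source_location {
    int line;
    int column;
};

/* <program> and <length> are the string passed to LoadProgramNV;
 * <position> is the value queried with GetIntegerv for
 * PROGRAM_ERROR_POSITION_NV, assumed to be a byte offset. */
struct source_location locate_program_error(const char *program,
                                            size_t length,
                                            size_t position)
{
    struct source_location loc = {1, 1};
    size_t i;

    assert(program != NULL);
    assert(position <= length);

    /* Count the newlines before the error position; the column restarts
     * at 1 after each one. */
    for (i = 0; i < position; i++) {
        if (program[i] == '\n') {
            loc.line++;
            loc.column = 1;
        } else {
            loc.column++;
        }
    }
    return loc;
}
```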
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    INVALID_OPERATION is generated by BindProgramNV if target does not match
5bd8deadSopenharmony_ci    the type of the program named by id.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    INVALID_VALUE is generated by AreProgramsResidentNV if any of the queried
5bd8deadSopenharmony_ci    programs are zero or do not exist.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    INVALID_OPERATION is generated by GetProgramivNV or GetProgramStringNV if
5bd8deadSopenharmony_ci    the program named id does not exist.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciNew State
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciGet Value                          Type  Get Command              Initial Value  Description         Section   Attribute
5bd8deadSopenharmony_ci---------------------------------  ----  -----------------------  -------------  ------------------  --------  ------------
5bd8deadSopenharmony_ciFRAGMENT_PROGRAM_NV                B     IsEnabled                FALSE          fragment program    3.11      enable
5bd8deadSopenharmony_ci                                                                                 mode enable
5bd8deadSopenharmony_ciFRAGMENT_PROGRAM_BINDING_NV        Z+    GetIntegerv              0              bound fragment      5.7       -
5bd8deadSopenharmony_ci                                                                                 program
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciTable X.6.  New State Introduced by NV_fragment_program.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciGet Value                  Type    Get Command          Initial Value  Description         Section   Attribute
5bd8deadSopenharmony_ci-------------------------  ------  ------------------   -------------  ------------------  --------  ---------
5bd8deadSopenharmony_ciPROGRAM_ERROR_POSITION_NV  Z       GetIntegerv          -1             program error       5.7       -
5bd8deadSopenharmony_ci                                                                       position
5bd8deadSopenharmony_ciPROGRAM_TARGET_NV          Z2      GetProgramivNV       0              program target      6.1.13    -
5bd8deadSopenharmony_ciPROGRAM_LENGTH_NV          Z+      GetProgramivNV       0              program length      6.1.13    -
5bd8deadSopenharmony_ciPROGRAM_RESIDENT_NV        Z2      GetProgramivNV       False          program residency   6.1.13    -
5bd8deadSopenharmony_ciPROGRAM_STRING_NV          ubxn    GetProgramStringNV   ""             program string      6.1.13    -
5bd8deadSopenharmony_ci-                          nxR4    GetProgramNamed-     (0,0,0,0)      named program local 5.7       -
5bd8deadSopenharmony_ci                                   ParameterNV                         parameter value
5bd8deadSopenharmony_ci-                          64+xR4  GetProgramLocal-     (0,0,0,0)      numbered program    5.7       -
5bd8deadSopenharmony_ci                                   ParameterARB                        local parameter
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciTable X.7.  Program Object State common to NV_vertex_program and NV_fragment_program.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciGet Value    Type    Get Command   Initial Value  Description               Section   Attribute
5bd8deadSopenharmony_ci---------    ------  -----------   -------------  -----------------------   --------  ---------
5bd8deadSopenharmony_ci-            12xR4   -             fragment data  fragment attribute        3.11.1.1  -
5bd8deadSopenharmony_ci                                                  registers
5bd8deadSopenharmony_ci-            16xR4   -             (0,0,0,0)      fp32 temporary registers  3.11.1.2  -
5bd8deadSopenharmony_ci-            32xR4   -             (0,0,0,0)      fp16 temporary registers  3.11.1.2  -
5bd8deadSopenharmony_ci-            (Z_4)4  -             (EQ,EQ,EQ,EQ)  condition code register   3.11.1.4  -
5bd8deadSopenharmony_ci                                                  address register
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciTable X.8.  Fragment Program Per-Fragment Execution State.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciNew Implementation Dependent State
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci                                                 Minimum
5bd8deadSopenharmony_ciGet Value                   Type   Get Command    Value       Description    Section  Attribute
5bd8deadSopenharmony_ci---------                   ----   -----------   -------  -----------------  -------  ---------
5bd8deadSopenharmony_ciMAX_TEXTURE_COORDS_NV       Z+     GetIntegerv      2     number of texture  2.6      -
5bd8deadSopenharmony_ci                                                          coordinate sets
5bd8deadSopenharmony_ci                                                          supported
5bd8deadSopenharmony_ciMAX_TEXTURE_IMAGE_UNITS_NV  Z+     GetIntegerv      2     number of texture  2.10.2   -
5bd8deadSopenharmony_ci                                                          image units
5bd8deadSopenharmony_ci                                                          supported
5bd8deadSopenharmony_ciMAX_FRAGMENT_PROGRAM_       Z+     GetIntegerv     64     number of numbered 3.11.7   -
5bd8deadSopenharmony_ci  LOCAL_PARAMETERS_NV                                     local parameters
5bd8deadSopenharmony_ci                                                          supported
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciRevision History
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Rev.    Date    Author   Changes
5bd8deadSopenharmony_ci    ----  -------- --------  --------------------------------------------
5bd8deadSopenharmony_ci     73   05/23/05  pbrown   Fixed cut-and-paste error in the dependency
5bd8deadSopenharmony_ci                             section where it said "NV_texture_rectangle"
5bd8deadSopenharmony_ci                             instead of "ARB_texture_cube_map".
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     72   05/16/04  pbrown   Documented that it's not possible to get results
5bd8deadSopenharmony_ci                             from LG2 that are any more precise than what is
5bd8deadSopenharmony_ci                             available in the fp32 storage format.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     71   04/23/04  pbrown   Fixed incorrect example.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     70   03/20/03  pbrown   Made the instruction count limit for !!FP1.0
5bd8deadSopenharmony_ci                             programs queryable instead of a hard-wired value
5bd8deadSopenharmony_ci                             of 1024.  The limit can be queried using
5bd8deadSopenharmony_ci                             ARB_fragment_program mechanisms, and remains 1024
5bd8deadSopenharmony_ci                             if ARB_fragment_program is unsupported.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     69   02/01/03  pbrown   Removed support for combiner fragment programs
5bd8deadSopenharmony_ci                             (!!FCP1.0).
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     68   01/08/03  pbrown   Correct spec language providing examples of NaNs,
5bd8deadSopenharmony_ci                             such as sqrt(-1) or log(-1).  Division by zero
5bd8deadSopenharmony_ci                             produces an infinity, not a NaN.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     67   12/23/02  pbrown   Fix incorrect syntax of examples of "KIL"
5bd8deadSopenharmony_ci                             instruction. The condition code test is not
5bd8deadSopenharmony_ci                             parenthesized in KIL.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     66   10/31/02  pbrown   Cleaned up special cases of POW, including the
5bd8deadSopenharmony_ci                             fact that "POW dst, 0, 0" produces NaN in this
5bd8deadSopenharmony_ci                             spec, not 1.0.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     65   10/28/02  pbrown   Documented that signed HILO textures will have
5bd8deadSopenharmony_ci                             the hemisphere remapping applied, but unsigned
5bd8deadSopenharmony_ci                             textures will not.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     64   09/17/02  pbrown   Minor typo fixes.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     63   08/14/02  pbrown   Clarified the value of the "other" components
5bd8deadSopenharmony_ci                             of f[FOGC].
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     62   07/24/02  pbrown   Removed PK4UBG and UP4UBG instructions.
5bd8deadSopenharmony_ci                             Simplified the implementation of the temporary
5bd8deadSopenharmony_ci                             and output register limit for combiner
5bd8deadSopenharmony_ci                             programs by counting all four o[TEXn] registers
5bd8deadSopenharmony_ci                             against the limit, whether or not they are
5bd8deadSopenharmony_ci                             written.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     61   07/19/02  pbrown   Renamed ProgramLocalParameter*NV to
5bd8deadSopenharmony_ci                             ProgramNamedParameter*NV to eliminate naming
5bd8deadSopenharmony_ci                             conflicts with ARB_vertex_program (and presumably
5bd8deadSopenharmony_ci                             ARB_fragment_program).
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci                             Added support for numbered program local
5bd8deadSopenharmony_ci                             parameters for compatibility with the ARB vertex
5bd8deadSopenharmony_ci                             program extension (and upcoming ARB fragment
5bd8deadSopenharmony_ci                             program extension), so it's possible to set local
5bd8deadSopenharmony_ci                             parameters the same way in both extensions.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci                             Eliminated the language describing "register
5bd8deadSopenharmony_ci                             slots" and how the "H" and "R" registers overlap.
5bd8deadSopenharmony_ci                             Instead, registers are guaranteed not to overlap,
5bd8deadSopenharmony_ci                             and a semantic limit is added on the number of
5bd8deadSopenharmony_ci                             temporaries and output registers that can be used
5bd8deadSopenharmony_ci                             by a program.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci                             Eliminated the requirement that non-combiner
5bd8deadSopenharmony_ci                             programs actually write a color value; the only
5bd8deadSopenharmony_ci                             requirement is that one output register be
5bd8deadSopenharmony_ci                             written.  When using fragment programs that use
5bd8deadSopenharmony_ci                             depth replacement, there may not be a need to
5bd8deadSopenharmony_ci                             compute color if color writes are currently
5bd8deadSopenharmony_ci                             disabled.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci                             Cleaned up the issues section.  Added several
5bd8deadSopenharmony_ci                             examples of fragment program operation.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci                             Cleaned up GLX protocol.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     59   07/07/02  pbrown   Minor clarifications of texture lookup handling.
5bd8deadSopenharmony_ci                             Documented that DDX and DDY may not always
5bd8deadSopenharmony_ci                             produce infinities.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     58   06/27/02  pbrown   Added clarification that instructions can use the
5bd8deadSopenharmony_ci                             same attribute or parameter register more than
5bd8deadSopenharmony_ci                             once.  Added support for "X" precision on the
5bd8deadSopenharmony_ci                             "set on" instructions.  Removed "X" precision
5bd8deadSopenharmony_ci                             support from DST.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     57   06/27/02  pbrown   Added missing table entries covering the use of
5bd8deadSopenharmony_ci                             floating-point textures.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     56   06/27/02  pbrown   Modified the spec to indicate that depth textures
5bd8deadSopenharmony_ci                             are treated as alpha, luminance, or intensity
5bd8deadSopenharmony_ci                             according to the depth texture mode in ARB_shadow.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     55   06/26/02  pbrown   Corrected the aliased register number and
5bd8deadSopenharmony_ci                             "read-only" mappings for o[DEPR] in combiner
5bd8deadSopenharmony_ci                             programs.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     54   06/05/02  pbrown   Fixed the spec to indicate that near and far
5bd8deadSopenharmony_ci                             frustum clipping is disabled for depth
5bd8deadSopenharmony_ci                             replacement programs.  Fixed the spec to indicate
5bd8deadSopenharmony_ci                             that the register combiners enable is overridden
5bd8deadSopenharmony_ci                             for fragment programs (enabled for combiner
5bd8deadSopenharmony_ci                             programs, disabled for color programs).
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     53   05/20/02  pbrown   Miscellaneous bug fixes for wording and
5bd8deadSopenharmony_ci                             special-case handling errors.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     52   05/16/02  pbrown   Added "_SAT" suffix to clamp result vector
5bd8deadSopenharmony_ci                             components to [0,1].  Fixed special case rules
5bd8deadSopenharmony_ci                             for MUL instruction and the "UN" condition code.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     50   04/19/02  pbrown   Added "$" as a legal character in an identifier
5bd8deadSopenharmony_ci                             name.  Added example for fixed and conditional
5bd8deadSopenharmony_ci                             write masks and condition code updates.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     49   04/16/02  pbrown   Added new query of PROGRAM_ERROR_STRING_NV to
5bd8deadSopenharmony_ci                             return more detailed information on program load
5bd8deadSopenharmony_ci                             failures.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     48   04/02/02  pbrown   Added missing enum value for the
5bd8deadSopenharmony_ci                             FRAGMENT_PROGRAM_BINDING_NV query.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     47   03/15/02  pbrown   Fixed various typos, and an incorrect description
5bd8deadSopenharmony_ci                             of the MAX operation.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     45   01/31/02  pbrown   Renamed the packing and unpacking opcodes to more
5bd8deadSopenharmony_ci                             closely match OpenGL data type naming conventions
5bd8deadSopenharmony_ci                             (PK2 becomes PK2H, PK16 becomes PK2US, PK4
5bd8deadSopenharmony_ci                             becomes PK4B, PKB becomes PK4UB).  Renamed "BEM"
5bd8deadSopenharmony_ci                             instruction to "X2D" to reflect the fact that it
5bd8deadSopenharmony_ci                             does a 2D coordinate transformation (not just a
5bd8deadSopenharmony_ci                             bump mapping operation).  Added PK4UBG and UP4UBG
5bd8deadSopenharmony_ci                             instructions to support sRGB gamma correction
5bd8deadSopenharmony_ci                             when packing and unpacking components.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     44   01/18/02  pbrown   Double the number of available temporaries (16 to
5bd8deadSopenharmony_ci                             32 fp32 vectors).  Add BEM (texture coordinate
5bd8deadSopenharmony_ci                             offset), PKB/UPB (unsigned byte packing), and
5bd8deadSopenharmony_ci                             PK16/UP16 (unsigned short packing) instructions.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     43   01/04/02  pbrown   Documented special cases for comparisons,
5bd8deadSopenharmony_ci                             including the handling of NaN in the SNE
5bd8deadSopenharmony_ci                             instruction. Added automatic generation of a
5bd8deadSopenharmony_ci                             third normal component for HILO textures.
5bd8deadSopenharmony_ci                             Documented the restriction that RFL can't write
5bd8deadSopenharmony_ci                             to the w component of the result.  Trivial fix of
5bd8deadSopenharmony_ci                             the special-cases for RCP.  Fixed minor typo on
5bd8deadSopenharmony_ci                             the TEX instruction.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     40   11/26/01  pbrown   Eliminated "X" precision specifier on those
5bd8deadSopenharmony_ci                             instructions that do complicated math or don't
5bd8deadSopenharmony_ci                             otherwise need it (e.g., "SGE").  Fixed special
5bd8deadSopenharmony_ci                             case math on LG2 instruction.  Eliminated
5bd8deadSopenharmony_ci                             incorrectly specified exponent clamping on LIT
5bd8deadSopenharmony_ci                             instruction.  Fixed description and special-case
5bd8deadSopenharmony_ci                             math on LIT/POW instructions.  Specified that
5bd8deadSopenharmony_ci                             combiner program outputs are clamped to [-1,+1],
5bd8deadSopenharmony_ci                             not [+0,+1].
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     39   11/16/01  pbrown   Added semantic restriction that PK2/PK4 must
5bd8deadSopenharmony_ci                             write to a 32-bit register.  Cleaned up the
5bd8deadSopenharmony_ci                             converse restrictions on UP2/UP4, making sure to
5bd8deadSopenharmony_ci                             allow UP2/UP4 from a program parameter.  Fix
5bd8deadSopenharmony_ci                             section numberings and a few typos.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     36   11/07/01  pbrown   Cleaned up the explanation of the texture mapping
5bd8deadSopenharmony_ci                             restriction that "negative q is undefined".
5bd8deadSopenharmony_ci                             Fixed a nit on the number of condition code
5bd8deadSopenharmony_ci                             values (now 4 with UN - unordered).
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     35   10/29/01  pbrown   Add a SUB instruction for programmer
5bd8deadSopenharmony_ci                             convenience. Moved unresolved issue list back to
5bd8deadSopenharmony_ci                             the "Issues" section.  Fix several minor wording
5bd8deadSopenharmony_ci                             issues.  Clarify register combiners/texture
5bd8deadSopenharmony_ci                             shader/fragment program flow control diagram.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     32   10/19/01  pbrown   Document the fragment program restriction that
5bd8deadSopenharmony_ci                             instructions involving f[FOGC] and f[TEX0-TEX7]
5bd8deadSopenharmony_ci                             are always carried out at fp32 precision.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     31   10/19/01  pbrown   Fixed incorrect description of encoding of fp16
5bd8deadSopenharmony_ci                             denorms.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     30   10/12/01  pbrown   Documented (0,0,0,0) local parameter
5bd8deadSopenharmony_ci                             initialization.  Disallow multiple defines of the
5bd8deadSopenharmony_ci                             same token.  Allow tokens that look like a
5bd8deadSopenharmony_ci                             possible register or texture name, but have
5bd8deadSopenharmony_ci                             numbers that are too big (e.g., "TEX24", "R37").
5bd8deadSopenharmony_ci                             Fixed up several grammar bugs.  Documented that
5bd8deadSopenharmony_ci                             LG2 and RSQ now do not automatically take
5bd8deadSopenharmony_ci                             absolute values, plus new math special cases.