Name

    NV_gpu_program4

Name Strings

    GL_NV_gpu_program4

Contact

    Pat Brown, NVIDIA Corporation (pbrown 'at' nvidia.com)

Status

    Shipping for GeForce 8 Series (November 2006)

Version

    Last Modified Date:         09/11/2014
    NVIDIA Revision:            11

Number

    322

Dependencies

    This extension is written against the OpenGL 2.0 specification.

    OpenGL 2.0 is not required, but we expect all implementations of this
    extension will also support OpenGL 2.0.

    This extension is also written against the ARB_vertex_program
    specification, which provides the basic mechanisms for the assembly
    programming model used by this extension.

    This extension serves as the basis for the NV_fragment_program4,
    NV_geometry_program4, and NV_vertex_program4 extensions, which all build
    on this extension to support fragment, geometry, and vertex programs,
    respectively.  If "GL_NV_gpu_program4" is found in the extension string,
    all of these extensions are supported.
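
    The extension-string test described above can be sketched in C as
    follows.  This is an illustration only, not part of the extension:  the
    helper names has_extension and has_gpu_program4 are hypothetical, and the
    string is assumed to be the space-separated list an OpenGL 2.0
    implementation returns from GetString(EXTENSIONS).  The match is done on
    whole tokens, because one extension name can be a prefix of another.

```c
#include <stddef.h>
#include <string.h>

/* Returns 1 if `name` appears as a complete, space-delimited token in
 * `ext_list`, and 0 otherwise.  A bare strstr() test is not enough, since
 * one extension name can be a prefix of another. */
int has_extension(const char *ext_list, const char *name)
{
    size_t len;
    const char *p;

    if (ext_list == NULL || name == NULL || *name == '\0')
        return 0;

    len = strlen(name);
    p = ext_list;
    while ((p = strstr(p, name)) != NULL) {
        int starts_token = (p == ext_list) || (p[-1] == ' ');
        int ends_token = (p[len] == ' ') || (p[len] == '\0');
        if (starts_token && ends_token)
            return 1;
        p += 1;  /* keep scanning after a partial match */
    }
    return 0;
}

/* Hypothetical convenience wrapper:  per the text above, finding the base
 * name implies NV_fragment_program4, NV_geometry_program4, and
 * NV_vertex_program4 as well. */
int has_gpu_program4(const char *ext_list)
{
    return has_extension(ext_list, "GL_NV_gpu_program4");
}
```

    A caller would pass the extension string once at start-up and cache the
    result rather than re-scanning the string on every use.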

    NV_parameter_buffer_object affects the definition of this extension.

    ARB_texture_rectangle trivially affects the definition of this extension.

    EXT_gpu_program_parameters trivially affects the definition of this
    extension.

    EXT_texture_integer trivially affects the definition of this extension.

    EXT_texture_array trivially affects the definition of this extension.

    EXT_texture_buffer_object trivially affects the definition of this
    extension.

    NV_primitive_restart trivially affects the definition of this extension.

Overview

    This specification documents the common instruction set and basic
    functionality provided by NVIDIA's 4th generation of assembly instruction
    sets supporting programmable graphics pipeline stages.

    The instruction set builds upon the basic framework provided by the
    ARB_vertex_program and ARB_fragment_program extensions to expose
    considerably more capable hardware.  In addition to new capabilities for
    vertex and fragment programs, this extension provides a new program type
    (geometry programs) further described in the NV_geometry_program4
    specification.

    NV_gpu_program4 provides a unified instruction set -- all instruction set
    features are available for all program types, except for a small number of
    features that make sense only for a specific program type.  It provides
    fully capable signed and unsigned integer data types, along with a set of
    arithmetic, logical, and data type conversion instructions capable of
    operating on integers.  It also provides a uniform set of structured
    branching constructs (if tests, loops, and subroutines) that fully support
    run-time condition testing.

    This extension provides several new texture mapping capabilities.  Shadow
    cube maps are supported, where cube map faces can encode depth values.
    Texture lookup instructions can include an immediate texel offset, which
    can assist in advanced filtering.  New instructions are provided to fetch
    a single texel by address in a texture map (TXF) and query the size of a
    specified texture level (TXQ).

    By and large, vertex and fragment programs written to ARB_vertex_program
    and ARB_fragment_program can be ported directly by simply changing the
    program header from "!!ARBvp1.0" or "!!ARBfp1.0" to "!!NVvp4.0" or
    "!!NVfp4.0", and then modifying the code to take advantage of the expanded
    feature set.  There are a small number of areas, documented in this
    specification, where this extension is not a functional superset of
    previous vertex program extensions.
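
    The mechanical part of the port described above, replacing the program
    header, might look like the following C sketch.  The function name
    port_program_header is hypothetical and not part of any API.  The sketch
    only rewrites the header; it does not check a program for the areas where
    this extension is not a functional superset of earlier ones, which still
    need to be reviewed by hand.

```c
#include <stdio.h>
#include <string.h>

/* Copies `src` to `dst`, replacing a leading "!!ARBvp1.0" or "!!ARBfp1.0"
 * header with "!!NVvp4.0" or "!!NVfp4.0".  Returns 0 on success, or -1 if
 * the header is not recognized or `dst` is too small. */
int port_program_header(const char *src, char *dst, size_t dst_size)
{
    static const char arb_vp[] = "!!ARBvp1.0";
    static const char arb_fp[] = "!!ARBfp1.0";
    const char *new_header;
    const char *rest;
    int written;

    if (strncmp(src, arb_vp, sizeof arb_vp - 1) == 0) {
        new_header = "!!NVvp4.0";
        rest = src + (sizeof arb_vp - 1);
    } else if (strncmp(src, arb_fp, sizeof arb_fp - 1) == 0) {
        new_header = "!!NVfp4.0";
        rest = src + (sizeof arb_fp - 1);
    } else {
        return -1;
    }

    /* snprintf reports the length it wanted to write; a value that does
     * not fit in dst_size means the output was truncated. */
    written = snprintf(dst, dst_size, "%s%s", new_header, rest);
    if (written < 0 || (size_t)written >= dst_size)
        return -1;
    return 0;
}
```

    The rewritten string would then be loaded with ProgramStringARB in the
    usual way.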


New Procedures and Functions

    void ProgramLocalParameterI4iNV(enum target, uint index,
                                    int x, int y, int z, int w);
    void ProgramLocalParameterI4ivNV(enum target, uint index,
                                     const int *params);
    void ProgramLocalParametersI4ivNV(enum target, uint index,
                                      sizei count, const int *params);
    void ProgramLocalParameterI4uiNV(enum target, uint index,
                                     uint x, uint y, uint z, uint w);
    void ProgramLocalParameterI4uivNV(enum target, uint index,
                                      const uint *params);
    void ProgramLocalParametersI4uivNV(enum target, uint index,
                                       sizei count, const uint *params);

    void ProgramEnvParameterI4iNV(enum target, uint index,
                                  int x, int y, int z, int w);
    void ProgramEnvParameterI4ivNV(enum target, uint index,
                                   const int *params);
    void ProgramEnvParametersI4ivNV(enum target, uint index,
                                    sizei count, const int *params);
    void ProgramEnvParameterI4uiNV(enum target, uint index,
                                   uint x, uint y, uint z, uint w);
    void ProgramEnvParameterI4uivNV(enum target, uint index,
                                    const uint *params);
    void ProgramEnvParametersI4uivNV(enum target, uint index,
                                     sizei count, const uint *params);

    void GetProgramLocalParameterIivNV(enum target, uint index,
                                       int *params);
    void GetProgramLocalParameterIuivNV(enum target, uint index,
                                        uint *params);
    void GetProgramEnvParameterIivNV(enum target, uint index,
                                     int *params);
    void GetProgramEnvParameterIuivNV(enum target, uint index,
                                      uint *params);

New Tokens

    Accepted by the <pname> parameter of GetBooleanv, GetIntegerv,
    GetFloatv, and GetDoublev:

        MIN_PROGRAM_TEXEL_OFFSET_EXT                    0x8904
        MAX_PROGRAM_TEXEL_OFFSET_EXT                    0x8905

    (note:  these tokens are shared with the EXT_gpu_shader4 extension.)

    Accepted by the <pname> parameter of GetProgramivARB:

        PROGRAM_ATTRIB_COMPONENTS_NV                    0x8906
        PROGRAM_RESULT_COMPONENTS_NV                    0x8907
        MAX_PROGRAM_ATTRIB_COMPONENTS_NV                0x8908
        MAX_PROGRAM_RESULT_COMPONENTS_NV                0x8909
        MAX_PROGRAM_GENERIC_ATTRIBS_NV                  0x8DA5
        MAX_PROGRAM_GENERIC_RESULTS_NV                  0x8DA6

Additions to Chapter 2 of the OpenGL 1.5 Specification (OpenGL Operation)

    (Modify "Section 2.14.1" of the ARB_vertex_program specification,
    describing program parameters.)

    Each program object has an associated array of program local parameters.
    Program local parameters are four-component vectors whose components can
    hold floating-point, signed integer, or unsigned integer values.  The data
    type of each local parameter is established when the parameter's values
    are assigned.  If a program attempts to read a local parameter using a
    data type other than the one used when the parameter is set, the values
    returned are undefined.  ... The commands

      void ProgramLocalParameter4fARB(enum target, uint index,
                                      float x, float y, float z, float w);
      void ProgramLocalParameter4fvARB(enum target, uint index,
                                       const float *params);
      void ProgramLocalParameter4dARB(enum target, uint index,
                                      double x, double y, double z, double w);
      void ProgramLocalParameter4dvARB(enum target, uint index,
                                       const double *params);

      void ProgramLocalParameterI4iNV(enum target, uint index,
                                      int x, int y, int z, int w);
      void ProgramLocalParameterI4ivNV(enum target, uint index,
                                       const int *params);
      void ProgramLocalParameterI4uiNV(enum target, uint index,
                                       uint x, uint y, uint z, uint w);
      void ProgramLocalParameterI4uivNV(enum target, uint index,
                                        const uint *params);

    update the values of the program local parameter numbered <index>
    belonging to the program object currently bound to <target>.  For the
    non-vector versions of these commands, the four components of the
    parameter are updated with the values of <x>, <y>, <z>, and <w>,
    respectively.  For the vector versions, the components of the parameter
    are updated with the array of four values pointed to by <params>.  The
    error INVALID_VALUE is generated if <index> is greater than or equal to
    the number of program local parameters supported by <target>.

    The commands

      void ProgramLocalParameters4fvNV(enum target, uint index,
                                       sizei count, const float *params);
      void ProgramLocalParametersI4ivNV(enum target, uint index,
                                        sizei count, const int *params);
      void ProgramLocalParametersI4uivNV(enum target, uint index,
                                         sizei count, const uint *params);

    update the values of the program local parameters numbered <index> through
    <index> + <count> - 1 with the array of 4 * <count> values pointed to by
    <params>.  The error INVALID_VALUE is generated if the sum of <index> and
    <count> is greater than the number of program local parameters supported
    by <target>.
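
    The two INVALID_VALUE conditions above reduce to simple range tests.  The
    C sketch below is illustrative only; the function names are hypothetical,
    and the limit max_params is assumed to come from a query such as
    GetProgramivARB with MAX_PROGRAM_LOCAL_PARAMETERS_ARB, as defined by
    ARB_vertex_program.  Treating a negative <count> as invalid is an
    assumption of this sketch; the text above does not address that case.

```c
/* Single-parameter rule:  <index> must be less than the number of
 * parameters supported by the target. */
int param_index_ok(unsigned int index, unsigned int max_params)
{
    return index < max_params;
}

/* Multi-parameter rule:  <index> + <count> must not exceed the number of
 * parameters supported.  The comparison is arranged so that the sum is
 * never formed and therefore cannot wrap around. */
int param_range_ok(unsigned int index, int count, unsigned int max_params)
{
    if (count < 0)              /* assumption:  negative counts rejected */
        return 0;
    if (index > max_params)
        return 0;
    return (unsigned int)count <= max_params - index;
}
```

    Note that under the rule as written, a zero <count> with <index> equal to
    the limit passes the range test, since the sum is not greater than the
    limit.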

    When a program local parameter is updated, the data type of its components
    is assigned according to the data type of the provided values.  If the
    values provided are of type "float" or "double", the components of the
    parameter are floating-point.  If the values provided are of type "int",
    the components of the parameter are signed integers.  If the values
    provided are of type "uint", the components of the parameter are unsigned
    integers.
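
    The typing rule above can be pictured as a tag stored alongside each
    four-component parameter and overwritten on every update.  The C model
    below is a sketch for illustration, not a description of any
    implementation; all type and function names in it are hypothetical.  Where
    the specification leaves a mismatched read undefined, this model chooses
    to report the mismatch to the caller instead.

```c
typedef enum { PARAM_FLOAT, PARAM_INT, PARAM_UINT } ParamType;

typedef struct {
    ParamType type;             /* set by the most recent update */
    union {
        float        f[4];
        int          i[4];
        unsigned int u[4];
    } v;
} ProgramParam;

/* "float" and "double" updates both produce floating-point components. */
void param_set_4f(ProgramParam *p, float x, float y, float z, float w)
{
    p->type = PARAM_FLOAT;
    p->v.f[0] = x; p->v.f[1] = y; p->v.f[2] = z; p->v.f[3] = w;
}

void param_set_4d(ProgramParam *p, double x, double y, double z, double w)
{
    param_set_4f(p, (float)x, (float)y, (float)z, (float)w);
}

/* "int" updates produce signed integer components. */
void param_set_I4i(ProgramParam *p, int x, int y, int z, int w)
{
    p->type = PARAM_INT;
    p->v.i[0] = x; p->v.i[1] = y; p->v.i[2] = z; p->v.i[3] = w;
}

/* "uint" updates produce unsigned integer components. */
void param_set_I4ui(ProgramParam *p, unsigned int x, unsigned int y,
                    unsigned int z, unsigned int w)
{
    p->type = PARAM_UINT;
    p->v.u[0] = x; p->v.u[1] = y; p->v.u[2] = z; p->v.u[3] = w;
}

/* Copies the components to `out` and returns 1 if the parameter was last
 * written as signed integers; otherwise returns 0 and leaves `out`
 * untouched. */
int param_read_4i(const ProgramParam *p, int out[4])
{
    int k;

    if (p->type != PARAM_INT)
        return 0;
    for (k = 0; k < 4; k++)
        out[k] = p->v.i[k];
    return 1;
}
```

    In this model a parameter set through the float path and then read through
    param_read_4i is flagged rather than reinterpreted, which mirrors the
    advice to read each parameter with the type used to set it.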

    Additionally, each program target has an associated array of program
    environment parameters.  Unlike program local parameters, program
    environment parameters are shared by all program objects of a given
    target.  Program environment parameters are four-component vectors whose
    components can hold floating-point, signed integer, or unsigned integer
    values.  The data type of each environment parameter is established when
    the parameter's values are assigned.  If a program attempts to read an
    environment parameter using a data type other than the one used when the
    parameter is set, the values returned are undefined.  ... The commands

      void ProgramEnvParameter4fARB(enum target, uint index,
                                    float x, float y, float z, float w);
      void ProgramEnvParameter4fvARB(enum target, uint index,
                                     const float *params);
      void ProgramEnvParameter4dARB(enum target, uint index,
                                    double x, double y, double z, double w);
      void ProgramEnvParameter4dvARB(enum target, uint index,
                                     const double *params);
      void ProgramEnvParameterI4iNV(enum target, uint index,
                                    int x, int y, int z, int w);
      void ProgramEnvParameterI4ivNV(enum target, uint index,
                                     const int *params);
      void ProgramEnvParameterI4uiNV(enum target, uint index,
                                     uint x, uint y, uint z, uint w);
      void ProgramEnvParameterI4uivNV(enum target, uint index,
                                      const uint *params);

    update the values of the program environment parameter numbered <index>
    for the given program target <target>.  For the non-vector versions of
    these commands, the four components of the parameter are updated with the
    values of <x>, <y>, <z>, and <w>, respectively.  For the vector versions,
    the four components of the parameter are updated with the array of four
    values pointed to by <params>.  The error INVALID_VALUE is generated if
    <index> is greater than or equal to the number of program environment
    parameters supported by <target>.

    The commands

      void ProgramEnvParameters4fvNV(enum target, uint index,
                                     sizei count, const float *params);
      void ProgramEnvParametersI4ivNV(enum target, uint index,
                                      sizei count, const int *params);
      void ProgramEnvParametersI4uivNV(enum target, uint index,
                                       sizei count, const uint *params);

    update the values of the program environment parameters numbered <index>
    through <index> + <count> - 1 with the array of 4 * <count> values pointed
    to by <params>.  The error INVALID_VALUE is generated if the sum of
    <index> and <count> is greater than the number of program environment
    parameters supported by <target>.

    When a program environment parameter is updated, the data type of its
    components is assigned according to the data type of the provided values.
    If the values provided are of type "float" or "double", the components of
    the parameter are floating-point.  If the values provided are of type
    "int", the components of the parameter are signed integers.  If the values
    provided are of type "uint", the components of the parameter are unsigned
    integers.

    ...


    Insert New Section 2.X between Sections 2.Y and 2.Z:

    Section 2.X, GPU Programs

    The GL provides a number of different program targets that allow an
    application to either replace certain fixed-function pipeline stages with
    a fully programmable model or use a program to control aspects of the GL
    pipeline that previously had only hard-wired behavior.

    A common base instruction set is available for all program types,
    providing both integer and floating-point operations.  Structured
    branching operations and subroutine calls are available.  Texture
    mapping (loading data from external images) is supported for all
    program types.  The main differences between the different program
    types are the set of available inputs and outputs, which are specific to
    each program type, and a few instructions that are meaningful for only
    a subset of program types.


    Section 2.X.2, Program Grammar

    GPU program strings are specified as an array of ASCII characters
    containing the program text.  When a GPU program is loaded by a call to
    ProgramStringARB, the program string is parsed into a set of tokens
    possibly separated by whitespace.  Spaces, tabs, newlines, carriage
    returns, and comments are considered whitespace.  Comments begin with the
    character "#" and are terminated by a newline, a carriage return, or the
    end of the program array.
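
    The whitespace and comment rules above can be illustrated with a small C
    pre-pass over a program string.  This is a sketch under stated
    assumptions, not how any implementation is known to tokenize programs:
    the function names are hypothetical, the input is assumed to be
    NUL-terminated, and replacing each comment with a single space is simply
    one way to model "comments are considered whitespace".

```c
#include <stddef.h>

/* Returns 1 for the whitespace characters listed in the text.  Comments
 * are handled separately by strip_comments() below. */
int is_program_whitespace(char c)
{
    return c == ' ' || c == '\t' || c == '\n' || c == '\r';
}

/* Copies `src` to `dst`, replacing each comment with a single space.  A
 * comment runs from '#' up to, but not including, the next newline or
 * carriage return, or to the end of the string.  The output is never
 * longer than the input, so `dst` needs room for the length of `src` plus
 * the terminating NUL.  Returns the number of characters written, not
 * counting the NUL. */
size_t strip_comments(const char *src, char *dst)
{
    size_t n = 0;

    while (*src != '\0') {
        if (*src == '#') {
            while (*src != '\0' && *src != '\n' && *src != '\r')
                src++;
            dst[n++] = ' ';     /* the comment counts as whitespace */
        } else {
            dst[n++] = *src++;
        }
    }
    dst[n] = '\0';
    return n;
}
```

    The line terminator that ends a comment is kept in the output, so line
    numbers in any later error report would still match the original text.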

    The Backus-Naur Form (BNF) grammar below specifies the syntactically valid
    sequences for GPU programs.  The set of valid tokens can be inferred
    from the grammar.  A line containing "/* empty */" represents an empty
    string and is used to indicate optional rules.  A program is invalid if it
    contains any tokens or characters not defined in this specification.

    Note that this extension is not a standalone extension and a small number
    of grammar rules are left to be defined in the extensions defining the
    specific vertex, fragment, and geometry program types.


    <program>               ::= <optionSequence> <declSequence>
                                <statementSequence> "END"

    <optionSequence>        ::= <option> <optionSequence>
                              | /* empty */

    <option>                ::= "OPTION" <identifier> ";"

    <declSequence>          ::= /* empty */

    <statementSequence>     ::= <statement> <statementSequence>
                              | /* empty */

    <statement>             ::= <instruction> ";"
                              | <namingStatement> ";"
                              | <instLabel> ":"

    <instruction>           ::= <ALUInstruction>
                              | <TexInstruction>
                              | <FlowInstruction>

    <ALUInstruction>        ::= <VECTORop_instruction>
                              | <SCALARop_instruction>
                              | <BINSCop_instruction>
                              | <BINop_instruction>
                              | <VECSCAop_instruction>
                              | <TRIop_instruction>
                              | <SWZop_instruction>

    <TexInstruction>        ::= <TEXop_instruction>
                              | <TXDop_instruction>

    <FlowInstruction>       ::= <BRAop_instruction>
                              | <FLOWCCop_instruction>
                              | <IFop_instruction>
                              | <REPop_instruction>
                              | <ENDFLOWop_instruction>

    <VECTORop_instruction>  ::= <VECTORop> <opModifiers> <instResult> ","
                                <instOperandV>

    <VECTORop>              ::= "ABS"
                              | "CEIL"
                              | "FLR"
                              | "FRC"
                              | "I2F"
                              | "LIT"
                              | "MOV"
                              | "NOT"
                              | "NRM"
                              | "PK2H"
                              | "PK2US"
                              | "PK4B"
                              | "PK4UB"
                              | "ROUND"
                              | "SSG"
                              | "TRUNC"

    <SCALARop_instruction>  ::= <SCALARop> <opModifiers> <instResult> ","
                                <instOperandS>

    <SCALARop>              ::= "COS"
                              | "EX2"
                              | "LG2"
                              | "RCC"
                              | "RCP"
                              | "RSQ"
                              | "SCS"
                              | "SIN"
                              | "UP2H"
                              | "UP2US"
                              | "UP4B"
                              | "UP4UB"

    <BINSCop_instruction>   ::= <BINSCop> <opModifiers> <instResult> ","
                                <instOperandS> "," <instOperandS>

    <BINSCop>               ::= "POW"

    <VECSCAop_instruction>  ::= <VECSCAop> <opModifiers> <instResult> ","
                                <instOperandV> "," <instOperandS>

    <VECSCAop>              ::= "DIV"
                              | "SHL"
                              | "SHR"
                              | "MOD"

    <BINop_instruction>     ::= <BINop> <opModifiers> <instResult> ","
                                <instOperandV> "," <instOperandV>

    <BINop>                 ::= "ADD"
                              | "AND"
                              | "DP3"
                              | "DP4"
                              | "DPH"
                              | "DST"
                              | "MAX"
                              | "MIN"
                              | "MUL"
                              | "OR"
                              | "RFL"
                              | "SEQ"
                              | "SFL"
                              | "SGE"
                              | "SGT"
                              | "SLE"
                              | "SLT"
                              | "SNE"
                              | "STR"
                              | "SUB"
                              | "XPD"
                              | "DP2"
                              | "XOR"

    <TRIop_instruction>     ::= <TRIop> <opModifiers> <instResult> ","
                                <instOperandV> "," <instOperandV> ","
                                <instOperandV>

    <TRIop>                 ::= "CMP"
                              | "DP2A"
                              | "LRP"
                              | "MAD"
                              | "SAD"
                              | "X2D"

    <SWZop_instruction>     ::= <SWZop> <opModifiers> <instResult> ","
                                <instOperandVNS> "," <extendedSwizzle>

    <SWZop>                 ::= "SWZ"

    <TEXop_instruction>     ::= <TEXop> <opModifiers> <instResult> ","
                                <instOperandV> "," <texAccess>

    <TEXop>                 ::= "TEX"
                              | "TXB"
                              | "TXF"
                              | "TXL"
                              | "TXP"
                              | "TXQ"

    <TXDop_instruction>     ::= <TXDop> <opModifiers> <instResult> ","
                                <instOperandV> "," <instOperandV> ","
                                <instOperandV> "," <texAccess>

    <TXDop>                 ::= "TXD"

    <BRAop_instruction>     ::= <BRAop> <opModifiers> <instTarget>
                                <optBranchCond>

    <BRAop>                 ::= "CAL"

    <FLOWCCop_instruction>  ::= <FLOWCCop> <opModifiers> <optBranchCond>

    <FLOWCCop>              ::= "RET"
                              | "BRK"
                              | "CONT"

    <IFop_instruction>      ::= <IFop> <opModifiers> <ccTest>

    <IFop>                  ::= "IF"

    <REPop_instruction>     ::= <REPop> <opModifiers> <instOperandV>
                              | <REPop> <opModifiers>

    <REPop>                 ::= "REP"

    <ENDFLOWop_instruction> ::= <ENDFLOWop> <opModifiers>

    <ENDFLOWop>             ::= "ELSE"
                              | "ENDIF"
                              | "ENDREP"

    <opModifiers>           ::= <opModifierItem> <opModifiers>
                              | /* empty */

    <opModifierItem>        ::= "." <opModifier>

    <opModifier>            ::= "F"
                              | "U"
                              | "S"
                              | "CC"
                              | "CC0"
                              | "CC1"
                              | "SAT"
                              | "SSAT"
                              | "NTC"
                              | "S24"
                              | "U24"
                              | "HI"
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <texAccess>             ::= <texImageUnit> "," <texTarget>
5bd8deadSopenharmony_ci                              | <texImageUnit> "," <texTarget> "," <texOffset>
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <texImageUnit>          ::= "texture" <optArrayMemAbs>
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <texTarget>             ::= "1D"
5bd8deadSopenharmony_ci                              | "2D"
5bd8deadSopenharmony_ci                              | "3D"
5bd8deadSopenharmony_ci                              | "CUBE"
5bd8deadSopenharmony_ci                              | "RECT"
5bd8deadSopenharmony_ci                              | "SHADOW1D"
5bd8deadSopenharmony_ci                              | "SHADOW2D"
5bd8deadSopenharmony_ci                              | "SHADOWRECT"
5bd8deadSopenharmony_ci                              | "ARRAY1D"
5bd8deadSopenharmony_ci                              | "ARRAY2D"
5bd8deadSopenharmony_ci                              | "SHADOWCUBE"
5bd8deadSopenharmony_ci                              | "SHADOWARRAY1D"
5bd8deadSopenharmony_ci                              | "SHADOWARRAY2D"
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <texOffset>             ::= "(" <texOffsetComp> ")"
5bd8deadSopenharmony_ci                              | "(" <texOffsetComp> "," <texOffsetComp> ")"
5bd8deadSopenharmony_ci                              | "(" <texOffsetComp> "," <texOffsetComp> ","
5bd8deadSopenharmony_ci                                <texOffsetComp> ")"
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <texOffsetComp>         ::= <optSign> <int>
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <optBranchCond>         ::= /* empty */
5bd8deadSopenharmony_ci                              | <ccMask>
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <instOperandV>          ::= <instOperandAbsV>
5bd8deadSopenharmony_ci                              | <instOperandBaseV>
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <instOperandAbsV>       ::= <operandAbsNeg> "|" <instOperandBaseV> "|"
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <instOperandBaseV>      ::= <operandNeg> <attribUseV>
5bd8deadSopenharmony_ci                              | <operandNeg> <tempUseV>
5bd8deadSopenharmony_ci                              | <operandNeg> <paramUseV>
5bd8deadSopenharmony_ci                              | <operandNeg> <bufferUseV>
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <instOperandS>          ::= <instOperandAbsS>
5bd8deadSopenharmony_ci                              | <instOperandBaseS>
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <instOperandAbsS>       ::= <operandAbsNeg> "|" <instOperandBaseS> "|"
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <instOperandBaseS>      ::= <operandNeg> <attribUseS>
5bd8deadSopenharmony_ci                              | <operandNeg> <tempUseS>
5bd8deadSopenharmony_ci                              | <operandNeg> <paramUseS>
5bd8deadSopenharmony_ci                              | <operandNeg> <bufferUseS>
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <instOperandVNS>        ::= <attribUseVNS>
5bd8deadSopenharmony_ci                              | <tempUseVNS>
5bd8deadSopenharmony_ci                              | <paramUseVNS>
5bd8deadSopenharmony_ci                              | <bufferUseVNS>
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <operandAbsNeg>         ::= <optSign>
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <operandNeg>            ::= <optSign>
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <instResult>            ::= <instResultCC>
5bd8deadSopenharmony_ci                              | <instResultBase>
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <instResultCC>          ::= <instResultBase> <ccMask>
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <instResultBase>        ::= <tempUseW>
5bd8deadSopenharmony_ci                              | <resultUseW>
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <namingStatement>       ::= <varMods> <ATTRIB_statement>
5bd8deadSopenharmony_ci                              | <varMods> <PARAM_statement>
5bd8deadSopenharmony_ci                              | <varMods> <TEMP_statement>
5bd8deadSopenharmony_ci                              | <varMods> <OUTPUT_statement>
5bd8deadSopenharmony_ci                              | <varMods> <BUFFER_statement>
5bd8deadSopenharmony_ci                              | <ALIAS_statement>
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <ATTRIB_statement>      ::= "ATTRIB" <establishName> "=" <attribUseD>
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <PARAM_statement>       ::= <PARAM_singleStmt>
5bd8deadSopenharmony_ci                              | <PARAM_multipleStmt>
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <PARAM_singleStmt>      ::= "PARAM" <establishName> <paramSingleInit>
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <PARAM_multipleStmt>    ::= "PARAM" <establishName> <optArraySize>
5bd8deadSopenharmony_ci                                <paramMultipleInit>
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <paramSingleInit>       ::= "=" <paramUseDB>
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <paramMultipleInit>     ::= "=" "{" <paramMultInitList> "}"
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <paramMultInitList>     ::= <paramUseDM>
5bd8deadSopenharmony_ci                              | <paramUseDM> "," <paramMultInitList>
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <TEMP_statement>        ::= "TEMP" <varNameList>
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <OUTPUT_statement>      ::= "OUTPUT" <establishName> "=" <resultUseD>
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <varMods>               ::= <varModifier> <varMods>
5bd8deadSopenharmony_ci                              | /* empty */
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <varModifier>           ::= "SHORT"
5bd8deadSopenharmony_ci                              | "LONG"
5bd8deadSopenharmony_ci                              | "INT"
5bd8deadSopenharmony_ci                              | "UINT"
                              | "FLOAT"

    <ALIAS_statement>       ::= "ALIAS" <establishName> "=" <establishedName>

    <BUFFER_statement>      ::= <bufferDeclType> <establishName>
                                <bufferSingleInit>
                              | <bufferDeclType> <establishName>
                                <optArraySize> <bufferMultInit>

    <bufferDeclType>        ::= "BUFFER"
                              | "BUFFER4"

    <bufferSingleInit>      ::= "=" <bufferUseDB>

    <bufferMultInit>        ::= "=" "{" <bufferMultInitList> "}"

    <bufferMultInitList>    ::= <bufferUseDM>
                              | <bufferUseDM> "," <bufferMultInitList>

    <varNameList>           ::= <establishName>
                              | <establishName> "," <varNameList>

    <attribUseV>            ::= <attribBasic> <swizzleSuffix>
                              | <attribVarName> <swizzleSuffix>
                              | <attribVarName> <arrayMem> <swizzleSuffix>
                              | <attribColor> <swizzleSuffix>
                              | <attribColor> "." <colorType> <swizzleSuffix>

    <attribUseS>            ::= <attribBasic> <scalarSuffix>
                              | <attribVarName> <scalarSuffix>
                              | <attribVarName> <arrayMem> <scalarSuffix>
                              | <attribColor> <scalarSuffix>
                              | <attribColor> "." <colorType> <scalarSuffix>

    <attribUseVNS>          ::= <attribBasic>
                              | <attribVarName>
                              | <attribVarName> <arrayMem>
                              | <attribColor>
                              | <attribColor> "." <colorType>

    <attribUseD>            ::= <attribBasic>
                              | <attribColor>
                              | <attribColor> "." <colorType>
                              | <attribMulti>

    <paramUseV>             ::= <paramVarName> <optArrayMem> <swizzleSuffix>
                              | <stateSingleItem> <swizzleSuffix>
                              | <programSingleItem> <swizzleSuffix>
                              | <constantVector> <swizzleSuffix>
                              | <constantScalar>

    <paramUseS>             ::= <paramVarName> <optArrayMem> <scalarSuffix>
                              | <stateSingleItem> <scalarSuffix>
                              | <programSingleItem> <scalarSuffix>
                              | <constantVector> <scalarSuffix>
                              | <constantScalar>

    <paramUseVNS>           ::= <paramVarName> <optArrayMem>
                              | <stateSingleItem>
                              | <programSingleItem>
                              | <constantVector>
                              | <constantScalar>

    <paramUseDB>            ::= <stateSingleItem>
                              | <programSingleItem>
                              | <constantVector>
                              | <signedConstantScalar>

    <paramUseDM>            ::= <stateMultipleItem>
                              | <programMultipleItem>
                              | <constantVector>
                              | <signedConstantScalar>

    <stateMultipleItem>     ::= <stateSingleItem>
                              | "state" "." <stateMatrixRows>

    <stateSingleItem>       ::= "state" "." <stateMaterialItem>
                              | "state" "." <stateLightItem>
                              | "state" "." <stateLightModelItem>
                              | "state" "." <stateLightProdItem>
                              | "state" "." <stateFogItem>
                              | "state" "." <stateMatrixRow>
                              | "state" "." <stateTexGenItem>
                              | "state" "." <stateClipPlaneItem>
                              | "state" "." <statePointItem>
                              | "state" "." <stateTexEnvItem>
                              | "state" "." <stateDepthItem>

    <stateMaterialItem>     ::= "material" "." <stateMatProperty>
                              | "material" "." <faceType> "."
                                <stateMatProperty>

    <stateMatProperty>      ::= "ambient"
                              | "diffuse"
                              | "specular"
                              | "emission"
                              | "shininess"

    <stateLightItem>        ::= "light" <arrayMemAbs> "." <stateLightProperty>

    <stateLightProperty>    ::= "ambient"
                              | "diffuse"
                              | "specular"
                              | "position"
                              | "attenuation"
                              | "spot" "." <stateSpotProperty>
                              | "half"

    <stateSpotProperty>     ::= "direction"

    <stateLightModelItem>   ::= "lightmodel" "." <stateLModProperty>

    <stateLModProperty>     ::= "ambient"
                              | "scenecolor"
                              | <faceType> "." "scenecolor"

    <stateLightProdItem>    ::= "lightprod" <arrayMemAbs> "."
                                <stateLProdProperty>
                              | "lightprod" <arrayMemAbs> "." <faceType> "."
                                <stateLProdProperty>

    <stateLProdProperty>    ::= "ambient"
                              | "diffuse"
                              | "specular"

    <stateFogItem>          ::= "fog" "." <stateFogProperty>

    <stateFogProperty>      ::= "color"
                              | "params"

    <stateMatrixRows>       ::= <stateMatrixItem>
                              | <stateMatrixItem> "." <stateMatModifier>
                              | <stateMatrixItem> "." "row" <arrayRange>
                              | <stateMatrixItem> "." <stateMatModifier> "."
                                "row" <arrayRange>

    <stateMatrixRow>        ::= <stateMatrixItem> "." "row" <arrayMemAbs>
                              | <stateMatrixItem> "." <stateMatModifier> "."
                                "row" <arrayMemAbs>

    <stateMatrixItem>       ::= "matrix" "." <stateMatrixName>

    <stateMatModifier>      ::= "inverse"
                              | "transpose"
                              | "invtrans"

    <stateMatrixName>       ::= "modelview" <optArrayMemAbs>
                              | "projection"
                              | "mvp"
                              | "texture" <optArrayMemAbs>
                              | "program" <arrayMemAbs>

    <stateTexGenItem>       ::= "texgen" <optArrayMemAbs> "."
                                <stateTexGenType> "." <stateTexGenCoord>

    <stateTexGenType>       ::= "eye"
                              | "object"

    <stateTexGenCoord>      ::= "s"
                              | "t"
                              | "r"
                              | "q"

    <stateClipPlaneItem>    ::= "clip" <arrayMemAbs> "." "plane"

    <statePointItem>        ::= "point" "." <statePointProperty>

    <statePointProperty>    ::= "size"
                              | "attenuation"

    <stateTexEnvItem>       ::= "texenv" <optArrayMemAbs> "."
                                <stateTexEnvProperty>

    <stateTexEnvProperty>   ::= "color"

    <stateDepthItem>        ::= "depth" "." <stateDepthProperty>

    <stateDepthProperty>    ::= "range"

    <programSingleItem>     ::= <progEnvParam>
                              | <progLocalParam>

    <programMultipleItem>   ::= <progEnvParams>
                              | <progLocalParams>

    <progEnvParams>         ::= "program" "." "env" <arrayMemAbs>
                              | "program" "." "env" <arrayRange>

    <progEnvParam>          ::= "program" "." "env" <arrayMemAbs>

    <progLocalParams>       ::= "program" "." "local" <arrayMemAbs>
                              | "program" "." "local" <arrayRange>

    <progLocalParam>        ::= "program" "." "local" <arrayMemAbs>

    <constantVector>        ::= "{" <constantVectorList> "}"

    <constantVectorList>    ::= <signedConstantScalar>
                              | <signedConstantScalar> ","
                                <signedConstantScalar>
                              | <signedConstantScalar> ","
                                <signedConstantScalar> ","
                                <signedConstantScalar>
                              | <signedConstantScalar> ","
                                <signedConstantScalar> ","
                                <signedConstantScalar> ","
                                <signedConstantScalar>

    <signedConstantScalar>  ::= <optSign> <constantScalar>

    <constantScalar>        ::= <floatConstant>
                              | <intConstant>

    <floatConstant>         ::= <float>

    <intConstant>           ::= <int>

    <tempUseV>              ::= <tempVarName> <swizzleSuffix>

    <tempUseS>              ::= <tempVarName> <scalarSuffix>

    <tempUseVNS>            ::= <tempVarName>

    <tempUseW>              ::= <tempVarName> <optWriteMask>

    <resultUseW>            ::= <resultBasic> <optWriteMask>
                              | <resultVarName> <optWriteMask>

    <resultUseD>            ::= <resultBasic>

    <bufferUseV>            ::= <bufferVarName> <optArrayMem> <swizzleSuffix>

    <bufferUseS>            ::= <bufferVarName> <optArrayMem> <scalarSuffix>

    <bufferUseVNS>          ::= <bufferVarName> <optArrayMem>

    <bufferUseDB>           ::= <bufferBinding> <arrayMemAbs>

    <bufferUseDM>           ::= <bufferBinding> <arrayMemAbs>
                              | <bufferBinding> <arrayRange>
                              | <bufferBinding>

    <bufferBinding>         ::= "program" "." "buffer" <arrayMemAbs>

    <optArraySize>          ::= "[" "]"
                              | "[" <int> "]"

    <optArrayMem>           ::= /* empty */
                              | <arrayMem>

    <arrayMem>              ::= <arrayMemAbs>
                              | <arrayMemRel>

    <optArrayMemAbs>        ::= /* empty */
                              | <arrayMemAbs>

    <arrayMemAbs>           ::= "[" <int> "]"

    <arrayMemRel>           ::= "[" <arrayMemReg> <arrayMemOffset> "]"

    <arrayMemReg>           ::= <addrUseS>

    <arrayMemOffset>        ::= /* empty */
                              | "+" <int>
                              | "-" <int>

    <arrayRange>            ::= "[" <int> ".." <int> "]"

    <addrUseS>              ::= <addrVarName> <scalarSuffix>

    <ccMask>                ::= "(" <ccTest> ")"

    <ccTest>                ::= <ccMaskRule> <swizzleSuffix>

    <ccMaskRule>            ::= "EQ"
                              | "GE"
                              | "GT"
                              | "LE"
                              | "LT"
                              | "NE"
                              | "TR"
                              | "FL"
                              | "EQ0"
                              | "GE0"
                              | "GT0"
                              | "LE0"
                              | "LT0"
                              | "NE0"
                              | "TR0"
                              | "FL0"
                              | "EQ1"
                              | "GE1"
                              | "GT1"
                              | "LE1"
                              | "LT1"
                              | "NE1"
                              | "TR1"
                              | "FL1"
                              | "NAN"
                              | "NAN0"
                              | "NAN1"
                              | "LEG"
                              | "LEG0"
                              | "LEG1"
                              | "CF"
                              | "CF0"
                              | "CF1"
                              | "NCF"
                              | "NCF0"
                              | "NCF1"
                              | "OF"
                              | "OF0"
                              | "OF1"
                              | "NOF"
                              | "NOF0"
                              | "NOF1"
                              | "AB"
                              | "AB0"
                              | "AB1"
                              | "BLE"
                              | "BLE0"
                              | "BLE1"
                              | "SF"
                              | "SF0"
                              | "SF1"
                              | "NSF"
                              | "NSF0"
                              | "NSF1"

    <optWriteMask>          ::= /* empty */
                              | <xyzwMask>
                              | <rgbaMask>

    <xyzwMask>              ::= "." "x"
                              | "." "y"
                              | "." "xy"
                              | "." "z"
                              | "." "xz"
                              | "." "yz"
                              | "." "xyz"
                              | "." "w"
                              | "." "xw"
                              | "." "yw"
                              | "." "xyw"
                              | "." "zw"
                              | "." "xzw"
                              | "." "yzw"
                              | "." "xyzw"

    <rgbaMask>              ::= "." "r"
                              | "." "g"
                              | "." "rg"
                              | "." "b"
                              | "." "rb"
                              | "." "gb"
                              | "." "rgb"
                              | "." "a"
                              | "." "ra"
                              | "." "ga"
                              | "." "rga"
                              | "." "ba"
                              | "." "rba"
                              | "." "gba"
                              | "." "rgba"

    <swizzleSuffix>         ::= /* empty */
                              | "." <component>
                              | "." <xyzwSwizzle>
                              | "." <rgbaSwizzle>

    <extendedSwizzle>       ::= <extSwizComp> "," <extSwizComp> ","
                                <extSwizComp> "," <extSwizComp>

    <extSwizComp>           ::= <optSign> <xyzwExtSwizSel>
                              | <optSign> <rgbaExtSwizSel>

    <xyzwExtSwizSel>        ::= "0"
                              | "1"
                              | <xyzwComponent>

    <rgbaExtSwizSel>        ::= <rgbaComponent>

    <scalarSuffix>          ::= "." <component>

    <component>             ::= <xyzwComponent>
                              | <rgbaComponent>

    <xyzwComponent>         ::= "x"
                              | "y"
                              | "z"
                              | "w"

    <rgbaComponent>         ::= "r"
                              | "g"
                              | "b"
                              | "a"

    <optSign>               ::= /* empty */
                              | "-"
                              | "+"

    <faceType>              ::= "front"
                              | "back"

    <colorType>             ::= "primary"
                              | "secondary"

    <instLabel>             ::= <identifier>

    <instTarget>            ::= <identifier>

    <establishedName>       ::= <identifier>

    <establishName>         ::= <identifier>
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The <int> rule matches an integer constant.  The integer consists of a
5bd8deadSopenharmony_ci    sequence of one or more digits ("0" through "9"), or a sequence in
5bd8deadSopenharmony_ci    hexadecimal form beginning with "0x" followed by a sequence of one or more
5bd8deadSopenharmony_ci    hexadecimal digits ("0" through "9", "a" through "f", "A" through "F").
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The <float> rule matches a floating-point constant consisting of an
5bd8deadSopenharmony_ci    integer part, a decimal point, a fraction part, an "e" or "E", and an
5bd8deadSopenharmony_ci    optionally signed integer exponent.  The integer and fraction parts both
5bd8deadSopenharmony_ci    consist of a sequence of one or more digits ("0" through "9").  Either the
5bd8deadSopenharmony_ci    integer part or the fraction part (not both) may be missing; either the
5bd8deadSopenharmony_ci    decimal point or the "e" (or "E") and the exponent (not both) may be
5bd8deadSopenharmony_ci    missing.  Most grammar rules that allow floating-point values also allow
5bd8deadSopenharmony_ci    integers matching the <int> rule.
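
    As an informal illustration (not part of this extension), the two
    lexical rules above can be approximated with regular expressions.  The
    names INT_RE, FLOAT_RE, is_int, and is_float are hypothetical, and the
    sketch assumes that the exponent sign may be either "+" or "-".

```python
import re

# <int>: decimal digits, or "0x" followed by hexadecimal digits.
INT_RE = re.compile(r"0x[0-9a-fA-F]+|[0-9]+")

# <float>: a decimal point and/or an exponent must be present, and at least
# one of the integer and fraction parts must be present.
FLOAT_RE = re.compile(
    r"(?:[0-9]+\.[0-9]*|\.[0-9]+)(?:[eE][+-]?[0-9]+)?"  # has a decimal point
    r"|[0-9]+[eE][+-]?[0-9]+"                           # exponent only
)


def is_int(text: str) -> bool:
    """True if the whole string matches the <int> rule as described."""
    return INT_RE.fullmatch(text) is not None


def is_float(text: str) -> bool:
    """True if the whole string matches the <float> rule as described."""
    return FLOAT_RE.fullmatch(text) is not None
```

    Under this reading, "1." and ".5" and "1e10" are floating-point
    constants, while "1" matches only the <int> rule.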
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The <identifier> rule matches a sequence of one or more letters ("A"
5bd8deadSopenharmony_ci    through "Z", "a" through "z"), digits ("0" through "9"), underscores ("_"),
5bd8deadSopenharmony_ci    or dollar signs ("$"); the first character must not be a digit.  Upper
5bd8deadSopenharmony_ci    and lower case letters are considered different (names are
5bd8deadSopenharmony_ci    case-sensitive).  The following strings are reserved keywords and may not
5bd8deadSopenharmony_ci    be used as identifiers:  "fragment" (for fragment programs only), "vertex"
5bd8deadSopenharmony_ci    (for vertex and geometry programs), "primitive" (for fragment and geometry
5bd8deadSopenharmony_ci    programs), "program", "result", "state", and "texture".
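
    The identifier rule and its reserved keywords can be modeled as follows.
    This is a sketch only; the function name and the program type strings
    "vertex", "geometry", and "fragment" are hypothetical labels chosen for
    the example.

```python
import re

# One or more letters, digits, underscores, or dollar signs; no leading digit.
IDENT_RE = re.compile(r"[A-Za-z_$][A-Za-z0-9_$]*")

# Keywords reserved for every program type.
RESERVED_ALWAYS = {"program", "result", "state", "texture"}

# Keywords reserved only for some program types.
RESERVED_BY_TYPE = {
    "vertex": {"vertex"},
    "geometry": {"vertex", "primitive"},
    "fragment": {"fragment", "primitive"},
}


def is_valid_identifier(name: str, program_type: str) -> bool:
    """Check the lexical form and the reserved-keyword list (case-sensitive)."""
    if IDENT_RE.fullmatch(name) is None:
        return False
    reserved = RESERVED_ALWAYS | RESERVED_BY_TYPE[program_type]
    return name not in reserved
```

    Because names are case-sensitive, "State" is accepted by this sketch
    even though "state" is reserved.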
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The <tempVarName>, <paramVarName>, <attribVarName>, <resultVarName>, and
5bd8deadSopenharmony_ci    <bufferName> rules match identifiers that have been previously established
5bd8deadSopenharmony_ci    as names of temporary, program parameter, attribute, result, and program
5bd8deadSopenharmony_ci    parameter buffer variables, respectively.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The <xyzwSwizzle> and <rgbaSwizzle> rules match any 4-character strings
5bd8deadSopenharmony_ci    consisting only of the characters "x", "y", "z", and "w" (<xyzwSwizzle>)
5bd8deadSopenharmony_ci    or "r", "g", "b", "a" (<rgbaSwizzle>).
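
    A minimal sketch of the swizzle rules above, using a hypothetical helper
    named classify_swizzle.  It takes the four characters without the
    leading "." and shows that the two component naming sets cannot be mixed
    within one swizzle.

```python
import re

XYZW_RE = re.compile(r"[xyzw]{4}")
RGBA_RE = re.compile(r"[rgba]{4}")


def classify_swizzle(text: str):
    """Return the name of the matching grammar rule, or None if neither matches."""
    if XYZW_RE.fullmatch(text):
        return "xyzwSwizzle"
    if RGBA_RE.fullmatch(text):
        return "rgbaSwizzle"
    return None
```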
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The error INVALID_OPERATION is generated if a program fails to load
5bd8deadSopenharmony_ci    because it is not syntactically correct or for one of the semantic
5bd8deadSopenharmony_ci    restrictions described in the following sections.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    A successfully loaded program is parsed into a sequence of instructions.
5bd8deadSopenharmony_ci    Each instruction is identified by its tokenized name.  The operation of
5bd8deadSopenharmony_ci    these instructions when executed is defined in section 2.X.4.  A
5bd8deadSopenharmony_ci    successfully loaded program string replaces the program string previously
5bd8deadSopenharmony_ci    loaded into the specified program object.  If the OUT_OF_MEMORY error is
5bd8deadSopenharmony_ci    generated by ProgramStringARB, no change is made to the previous contents
5bd8deadSopenharmony_ci    of the current program object.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.3, Program Variables
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Programs may operate on a number of different variables during their
5bd8deadSopenharmony_ci    execution.  The following sections define the different classes of
5bd8deadSopenharmony_ci    variables that can be declared and used by a program.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Some variable classes require variable bindings.  Variable classes with
5bd8deadSopenharmony_ci    bindings refer to state that is either generated or consumed outside the
5bd8deadSopenharmony_ci    program.  Examples of variable bindings include a vertex's normal, the
5bd8deadSopenharmony_ci    position of a vertex computed by a vertex program, an interpolated texture
5bd8deadSopenharmony_ci    coordinate, and the diffuse color of light 1.  Variables that are used
5bd8deadSopenharmony_ci    only during program execution do not have bindings.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Variables may be declared explicitly according to the <namingStatement>
5bd8deadSopenharmony_ci    grammar rule.  Explicit variable declarations allow a program to establish
5bd8deadSopenharmony_ci    a variable name that can be used to refer to a specified resource in
5bd8deadSopenharmony_ci    subsequent instructions.  Variables may be declared anywhere in the
5bd8deadSopenharmony_ci    program string, but must be declared prior to use.  A program will fail to
5bd8deadSopenharmony_ci    load if it declares the same variable name more than once, or if it refers
5bd8deadSopenharmony_ci    to a variable name that has not been previously declared in the program
5bd8deadSopenharmony_ci    string.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Variables may also be declared implicitly, simply by using a variable
5bd8deadSopenharmony_ci    binding as an operand in a program instruction.  Such uses are considered
5bd8deadSopenharmony_ci    to automatically create a nameless variable using the specified binding.
5bd8deadSopenharmony_ci    Only variables from classes with bindings can be declared implicitly.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.3.1, Program Variable Types
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Explicit variable declarations may include one or more modifiers that
5bd8deadSopenharmony_ci    specify additional information about the variable, such as the size and
5bd8deadSopenharmony_ci    data type of the components of the variable.  Variable modifiers are
5bd8deadSopenharmony_ci    specified according to the <varModifier> grammar rule.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    By default, variables are considered typeless.  They can be used in
5bd8deadSopenharmony_ci    instructions that read or write the variable as floating-point values,
5bd8deadSopenharmony_ci    signed integers, or unsigned integers.  If a variable is written using one
5bd8deadSopenharmony_ci    data type but then read using a different one, the results of the
5bd8deadSopenharmony_ci    operation are undefined.  Variables with bindings are considered to be
5bd8deadSopenharmony_ci    read or written when their values are produced or consumed; the data type
5bd8deadSopenharmony_ci    used by the GL is specified in the description of each binding.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Explicitly declared variables may optionally have one data type modifier,
5bd8deadSopenharmony_ci    which can be used to detect data type mismatch errors.  Type modifiers of
5bd8deadSopenharmony_ci    "INT", "UINT", and "FLOAT" indicate that the components of the variable
5bd8deadSopenharmony_ci    are stored as signed integers, unsigned integers, or floating-point
5bd8deadSopenharmony_ci    values, respectively.  A program will fail to load if it attempts to read
5bd8deadSopenharmony_ci    or write a variable using a data type other than the one indicated by the
5bd8deadSopenharmony_ci    data type modifier.  Variables without a data type modifier can be read or
5bd8deadSopenharmony_ci    written using any data type.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Explicitly declared variables may optionally have one storage size
5bd8deadSopenharmony_ci    modifier.  Variables declared as "SHORT" will be represented using at least
5bd8deadSopenharmony_ci    16 bits per component.  "SHORT" floating-point values will have at least 5
5bd8deadSopenharmony_ci    bits of exponent and 10 bits of mantissa.  Variables declared as "LONG"
5bd8deadSopenharmony_ci    will be represented with at least 32 bits per component.  "LONG"
5bd8deadSopenharmony_ci    floating-point values will have at least 8 bits of exponent and 23 bits of
5bd8deadSopenharmony_ci    mantissa.  If no size modifier is provided, the GL will automatically
5bd8deadSopenharmony_ci    select component sizes.  Implementations are not required to support more
5bd8deadSopenharmony_ci    than one component size, so "SHORT", "LONG", and the default could all
5bd8deadSopenharmony_ci    refer to the same component size.  The "LONG" modifier is supported only
5bd8deadSopenharmony_ci    for declarations of temporary variables ("TEMP").  The "SHORT" modifier is
5bd8deadSopenharmony_ci    supported only for declarations of temporary variables and result
5bd8deadSopenharmony_ci    variables ("OUTPUT").
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Each variable declaration can include at most one data type and one
5bd8deadSopenharmony_ci    storage size modifier.  A program will fail to load if it specifies
5bd8deadSopenharmony_ci    multiple data type or multiple storage size modifiers in a single variable
5bd8deadSopenharmony_ci    declaration.
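
    The modifier restrictions described above can be summarized in a small
    checker.  This is an informal sketch: check_modifiers and its argument
    conventions are hypothetical, and modifiers defined by other extensions
    are deliberately reported as unmodeled rather than validated.

```python
DATA_TYPES = {"INT", "UINT", "FLOAT"}
STORAGE_SIZES = {"SHORT", "LONG"}
KNOWN_MODIFIERS = DATA_TYPES | STORAGE_SIZES

# Declaration kinds for which each storage size modifier is supported.
SIZE_ALLOWED_FOR = {"SHORT": {"TEMP", "OUTPUT"}, "LONG": {"TEMP"}}


def check_modifiers(decl_kind, modifiers):
    """Return None if the modifier list is acceptable, else a reason string."""
    unknown = [m for m in modifiers if m not in KNOWN_MODIFIERS]
    if unknown:
        return "unmodeled modifier: " + unknown[0]
    types = [m for m in modifiers if m in DATA_TYPES]
    sizes = [m for m in modifiers if m in STORAGE_SIZES]
    if len(types) > 1:
        return "multiple data type modifiers"
    if len(sizes) > 1:
        return "multiple storage size modifiers"
    if sizes and decl_kind not in SIZE_ALLOWED_FOR[sizes[0]]:
        return sizes[0] + " not supported for " + decl_kind
    return None
```

    For example, check_modifiers("TEMP", ["INT", "LONG"]) returns None,
    while check_modifiers("PARAM", ["LONG"]) returns a reason string.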
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (NOTE:  Fragment programs also support the modifiers "FLAT", "CENTROID",
5bd8deadSopenharmony_ci    and "NOPERSPECTIVE", which control how per-fragment attribute values are
5bd8deadSopenharmony_ci    produced.  These modifiers are described in detail in the
5bd8deadSopenharmony_ci    NV_fragment_program4 specification.)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Explicitly declared variables of all types may be declared as arrays.  An
5bd8deadSopenharmony_ci    array variable has one or more members, numbered 0 through <n>-1, where
5bd8deadSopenharmony_ci    <n> is the number of entries in the array.  The total number of entries in
5bd8deadSopenharmony_ci    the array can be declared using the <optArraySize> grammar rule.  For
5bd8deadSopenharmony_ci    variable classes without bindings, an array size must be specified in the
5bd8deadSopenharmony_ci    program, and must be a positive integer.  For variable classes with
5bd8deadSopenharmony_ci    bindings, a declared size is optional, and is taken from the number of
5bd8deadSopenharmony_ci    bindings assigned in the declaration if omitted.  A program will fail to
5bd8deadSopenharmony_ci    load if the declared size of an array variable does not match the number
5bd8deadSopenharmony_ci    of assigned bindings.
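
    The array sizing rules above amount to the following decision
    procedure.  The function resolve_array_size is a hypothetical helper
    written for illustration; a ValueError stands in for a program that
    fails to load.

```python
def resolve_array_size(has_bindings, declared_size=None, num_bindings=0):
    """Return the number of array entries, or raise ValueError on a bad declaration."""
    if not has_bindings:
        # Classes without bindings: a positive size is mandatory.
        if declared_size is None or declared_size < 1:
            raise ValueError("array size must be a positive integer")
        return declared_size
    if declared_size is None:
        # Classes with bindings: the size defaults to the binding count.
        return num_bindings
    if declared_size != num_bindings:
        raise ValueError("declared size does not match number of bindings")
    return declared_size
```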
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    When a variable is declared as an array, instructions that use the
5bd8deadSopenharmony_ci    variable must specify an array member to access according to the
5bd8deadSopenharmony_ci    <arrayMem> grammar rule.  A program will fail to load if it contains an
5bd8deadSopenharmony_ci    instruction that accesses an array variable without specifying an array
5bd8deadSopenharmony_ci    member or an instruction that specifies an array member for a non-array
5bd8deadSopenharmony_ci    variable.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.3.2, Program Attribute Variables
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Program attribute variables represent per-vertex or per-fragment inputs to
5bd8deadSopenharmony_ci    the program.  All attribute variables have associated bindings, and are
5bd8deadSopenharmony_ci    read-only during program execution.  Attribute variables may be declared
5bd8deadSopenharmony_ci    explicitly via the <ATTRIB_statement> grammar rule, or implicitly by using
5bd8deadSopenharmony_ci    an attribute binding in an instruction.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The set of available attribute bindings depends on the program type, and
5bd8deadSopenharmony_ci    is enumerated in the specifications for each program type.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The set of bindings allowed for attribute array variables is limited to
5bd8deadSopenharmony_ci    attribute state grouped in arrays (e.g., texture coordinates, generic
5bd8deadSopenharmony_ci    vertex attributes).  Additionally, all bindings assigned to the array must
5bd8deadSopenharmony_ci    be of the same binding type and must increase consecutively.  Examples of
5bd8deadSopenharmony_ci    valid and invalid binding lists include:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      vertex.attrib[1], vertex.attrib[2]      # valid, 2-entry array
5bd8deadSopenharmony_ci      vertex.texcoord[0..3]                   # valid, 4-entry array
5bd8deadSopenharmony_ci      vertex.attrib[1], vertex.attrib[3]      # invalid, skipped attrib 2
5bd8deadSopenharmony_ci      vertex.attrib[2], vertex.attrib[1]      # invalid, wrong order
5bd8deadSopenharmony_ci      vertex.attrib[1], vertex.texcoord[2]    # invalid, different types
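
    The binding list examples above can be reproduced with a short checker.
    This is a sketch under stated assumptions: the helper names are
    hypothetical, only bindings of the simple form "name[i]" or "name[a..b]"
    are recognized, and a range whose first index exceeds its last is
    treated as invalid.

```python
import re

# A dotted binding name followed by a single index or an "a..b" range.
BINDING_RE = re.compile(r"([A-Za-z.]+)\[([0-9]+)(?:\.\.([0-9]+))?\]")


def expand_bindings(binding_list):
    """Expand bindings such as "vertex.texcoord[0..3]" into (type, index) pairs."""
    members = []
    for text in binding_list:
        m = BINDING_RE.fullmatch(text.strip())
        if m is None:
            raise ValueError("not an array-style binding: " + text)
        kind, first = m.group(1), int(m.group(2))
        last = int(m.group(3)) if m.group(3) is not None else first
        if first > last:
            raise ValueError("empty range: " + text)
        members.extend((kind, i) for i in range(first, last + 1))
    return members


def is_valid_attrib_array(binding_list):
    """True if all members share one binding type and increase consecutively."""
    try:
        members = expand_bindings(binding_list)
    except ValueError:
        return False
    if not members:
        return False
    kind0, index0 = members[0]
    return all(kind == kind0 and index == index0 + offset
               for offset, (kind, index) in enumerate(members))
```

    Applied to the five lists above, the checker accepts the first two and
    rejects the remaining three.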
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Additionally, attribute bindings may be used in no more than one array
5bd8deadSopenharmony_ci    variable accessed with relative addressing.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Implementations may have a limit on the total number of attribute binding
5bd8deadSopenharmony_ci    components used by each program target (MAX_PROGRAM_ATTRIB_COMPONENTS_NV).
5bd8deadSopenharmony_ci    Programs that use more attribute binding components than this limit will
5bd8deadSopenharmony_ci    fail to load.  The method of counting used attribute binding components is
5bd8deadSopenharmony_ci    implementation-dependent, but must satisfy the following properties:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      * If an attribute binding is not referenced in a program, or is
5bd8deadSopenharmony_ci        referenced only in declarations of attribute variables that are not
5bd8deadSopenharmony_ci        used, none of its components are counted.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      * An attribute binding component may be counted as used only if there
5bd8deadSopenharmony_ci        exists an instruction operand where
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci          - the component is enabled for read by the swizzle pattern (Section
5bd8deadSopenharmony_ci            2.X.4.2), and
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci          - the attribute binding is
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci              - referenced directly by the operand,
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci              - bound to a declared variable referenced by the operand, or
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci              - bound to a declared array variable where another binding in
5bd8deadSopenharmony_ci                the array satisfies one of the two previous conditions.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        Implementations are not required to optimize out unused elements of an
5bd8deadSopenharmony_ci        attribute array or components that are used in only some elements of
5bd8deadSopenharmony_ci        an array.  The last of these rules is intended to cover the case where
5bd8deadSopenharmony_ci        the same attribute binding is used in multiple variables.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        For example, an operand whose swizzle pattern selects only the x
5bd8deadSopenharmony_ci        component may result in the x component of an attribute binding being
5bd8deadSopenharmony_ci        counted, but may never result in the counting of the y, z, or w
5bd8deadSopenharmony_ci        components of any attribute binding.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      * Implementations are not required to determine that components read by
5bd8deadSopenharmony_ci        an instruction are actually unused due to:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci          - instruction write masks (for example, a component-wise ADD
5bd8deadSopenharmony_ci            operation that only writes the "x" component doesn't have to read
5bd8deadSopenharmony_ci            the "y", "z", and "w" components of its operands) or
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci          - any other properties of the instruction (for example, the DP3
5bd8deadSopenharmony_ci            instruction, which computes a 3-component dot product, doesn't
5bd8deadSopenharmony_ci            have to read the "w" component of its operands).
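
    Since the counting method is implementation-dependent, no single
    algorithm can be given.  The sketch below only computes the largest
    count that the properties above permit for bindings that are not
    members of array variables; it ignores the array rule entirely.  The
    helper names are hypothetical, and each operand is modeled as a
    (binding, swizzle) pair with the swizzle written without its leading
    ".", an empty string meaning that all four components are read.

```python
COMPONENT_INDEX = {"x": 0, "y": 1, "z": 2, "w": 3,
                   "r": 0, "g": 1, "b": 2, "a": 3}


def components_read(swizzle):
    """Set of component indices enabled for read by a swizzle pattern."""
    if swizzle == "":
        return {0, 1, 2, 3}
    return {COMPONENT_INDEX[c] for c in swizzle}


def max_countable_components(operands):
    """Upper bound on counted components for non-array attribute bindings."""
    used = {}
    for binding, swizzle in operands:
        used.setdefault(binding, set()).update(components_read(swizzle))
    # Bindings never referenced by an operand contribute nothing.
    return sum(len(components) for components in used.values())
```

    An implementation may count fewer components than this bound, for
    example by taking instruction write masks into account, but is not
    required to do so.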
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.3.3, Program Parameters
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Program parameter variables are used as constants during program
5bd8deadSopenharmony_ci    execution.  All program parameter variables have associated bindings and
5bd8deadSopenharmony_ci    are read-only during program execution.  Program parameters retain their
5bd8deadSopenharmony_ci    values across program invocations, although their values may change
5bd8deadSopenharmony_ci    between invocations due to GL state changes.  Program parameter variables
5bd8deadSopenharmony_ci    may be declared explicitly via the <PARAM_statement> grammar rule, or
5bd8deadSopenharmony_ci    implicitly by using a parameter binding in an instruction.  Except where
5bd8deadSopenharmony_ci    otherwise specified, program parameter bindings always specify
5bd8deadSopenharmony_ci    floating-point values.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    When declaring program parameter array variables, all bindings are
5bd8deadSopenharmony_ci    supported and can be assigned to array members in any order.  The only
5bd8deadSopenharmony_ci    restriction is that no parameter binding may be used more than once in
5bd8deadSopenharmony_ci    array variables accessed using relative addressing.  A program will fail
5bd8deadSopenharmony_ci    to load if any program parameter binding is used more than once in a
5bd8deadSopenharmony_ci    single array accessed using relative addressing or used at least once in
5bd8deadSopenharmony_ci    two or more arrays accessed using relative addressing.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Constant Bindings
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If a program parameter binding matches the <constantScalar> or
5bd8deadSopenharmony_ci    <signedConstantScalar> grammar rules, the corresponding program parameter
5bd8deadSopenharmony_ci    variable is bound to the vector (X,X,X,X), where X is the value of the
5bd8deadSopenharmony_ci    specified constant.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If a program parameter binding matches <constantVector>, the corresponding
5bd8deadSopenharmony_ci    program parameter variable is bound to the vector (X,Y,Z,W), where X, Y,
5bd8deadSopenharmony_ci    Z, and W are the values corresponding to the first, second, third, and
5bd8deadSopenharmony_ci    fourth match of <signedConstantScalar>.  If fewer than four constants are
5bd8deadSopenharmony_ci    specified, each of Y, Z, and W whose constant is omitted assumes the
5bd8deadSopenharmony_ci    value 0, 0, or 1, respectively.
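
    The expansion of scalar and vector constants into four-component
    vectors can be written out as a short sketch.  The function names are
    hypothetical, and the constants are assumed to have been parsed into
    numbers already.

```python
def expand_constant_scalar(x):
    """A scalar constant X is bound to the vector (X, X, X, X)."""
    return (x, x, x, x)


def expand_constant_vector(values):
    """Fill omitted Y, Z, and W components with 0, 0, and 1, respectively."""
    if not 1 <= len(values) <= 4:
        raise ValueError("a constant vector has one to four components")
    defaults = (0, 0, 0, 1)  # the first entry is never used: X is mandatory
    return tuple(values) + defaults[len(values):]
```

    For instance, a vector constant written with the two values 1 and 2
    expands to (1, 2, 0, 1).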
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Constant bindings can be interpreted as having signed integer, unsigned
5bd8deadSopenharmony_ci    integer, or floating-point values, depending on how they are used in the
5bd8deadSopenharmony_ci    program text.  For constants in variable declarations, the components of
5bd8deadSopenharmony_ci    the constant are interpreted according to the variable's component data
5bd8deadSopenharmony_ci    type modifier.  If no data type modifier is specified in a declaration,
5bd8deadSopenharmony_ci    constants are interpreted as floating-point values.  For constant bindings
5bd8deadSopenharmony_ci    used directly in an instruction, the components of the constant are
5bd8deadSopenharmony_ci    interpreted according to the required data type of the operand.  A program
5bd8deadSopenharmony_ci    will fail to load if it specifies a floating-point constant value
5bd8deadSopenharmony_ci    (matching the <floatConstant> grammar rule) that should be interpreted as
5bd8deadSopenharmony_ci    a signed or unsigned integer, or a negative integer constant value that
5bd8deadSopenharmony_ci    should be interpreted as an unsigned integer.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If the value used to specify a floating-point constant cannot be exactly
5bd8deadSopenharmony_ci    represented, the nearest floating-point value will be used.  If the value
5bd8deadSopenharmony_ci    used to specify an integer constant is too large to be represented, the
5bd8deadSopenharmony_ci    program will fail to load.
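
    The interpretation rules for constants can be sketched as below.  This
    is illustrative only: interpret_constant is a hypothetical helper, the
    input is assumed to be an optionally signed constant that already
    matches the grammar, and integer components are assumed to be 32 bits
    wide, which this specification does not guarantee.  A ValueError stands
    in for a program that fails to load.

```python
import re

INT_LITERAL_RE = re.compile(r"0x[0-9a-fA-F]+|[0-9]+")

# Assumed 32-bit component ranges (an assumption made for this example).
INT_RANGE = {"INT": (-2**31, 2**31 - 1), "UINT": (0, 2**32 - 1)}


def interpret_constant(text, data_type="FLOAT"):
    """Interpret a signed constant as FLOAT (the default), INT, or UINT."""
    negative = text.startswith("-")
    body = text[1:] if text[:1] in ("+", "-") else text
    if INT_LITERAL_RE.fullmatch(body) is None:
        # Not an integer literal, so it must be a floating-point constant.
        if data_type != "FLOAT":
            raise ValueError("floating-point constant used as an integer")
        return float(text)
    magnitude = int(body[2:], 16) if body.startswith("0x") else int(body, 10)
    value = -magnitude if negative else magnitude
    if data_type == "FLOAT":
        return float(value)
    if data_type == "UINT" and value < 0:
        raise ValueError("negative constant used as an unsigned integer")
    low, high = INT_RANGE[data_type]
    if not low <= value <= high:
        raise ValueError("integer constant too large to be represented")
    return value
```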
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Program Environment/Local Parameter Bindings
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Binding                    Components  Underlying State
5bd8deadSopenharmony_ci      -------------------------  ----------  -------------------------------
5bd8deadSopenharmony_ci      program.env[a]             (x,y,z,w)   program environment parameter a
5bd8deadSopenharmony_ci      program.local[a]           (x,y,z,w)   program local parameter a
5bd8deadSopenharmony_ci      program.env[a..b]          (x,y,z,w)   program environment parameters
5bd8deadSopenharmony_ci                                             a through b
5bd8deadSopenharmony_ci      program.local[a..b]        (x,y,z,w)   program local parameters
5bd8deadSopenharmony_ci                                             a through b
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Table X.1:  Program Environment/Local Parameter Bindings.  <a> and <b>
5bd8deadSopenharmony_ci      indicate parameter numbers, where <a> must be less than or equal to <b>.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If a program parameter binding matches "program.env[a]" or
5bd8deadSopenharmony_ci    "program.local[a]", the four components of the program parameter variable
5bd8deadSopenharmony_ci    are filled with the four components of program environment parameter <a>
5bd8deadSopenharmony_ci    or program local parameter <a> respectively.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Additionally, for program parameter array bindings, "program.env[a..b]"
5bd8deadSopenharmony_ci    and "program.local[a..b]" are equivalent to specifying program environment
5bd8deadSopenharmony_ci    or local parameters <a> through <b> in order, respectively.  A program
5bd8deadSopenharmony_ci    using any of these bindings will fail to load if <a> is greater than <b>.
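
    The range form of these bindings is shorthand for listing each
    parameter in order, as the following sketch shows.  The helper
    expand_program_params is hypothetical, and a ValueError stands in for a
    program that fails to load.

```python
import re

PARAM_RE = re.compile(r"program\.(env|local)\[([0-9]+)(?:\.\.([0-9]+))?\]")


def expand_program_params(binding):
    """Expand "program.env[a..b]" or "program.local[a..b]" into single bindings."""
    m = PARAM_RE.fullmatch(binding)
    if m is None:
        raise ValueError("not a program.env/program.local binding")
    bank, a = m.group(1), int(m.group(2))
    b = int(m.group(3)) if m.group(3) is not None else a
    if a > b:
        raise ValueError("<a> must be less than or equal to <b>")
    return ["program.%s[%d]" % (bank, i) for i in range(a, b + 1)]
```

    With this helper, "program.env[2..4]" expands to program.env[2],
    program.env[3], and program.env[4], in that order.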
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Program environment and local parameters are typeless, and may be
5bd8deadSopenharmony_ci    specified as signed integer, unsigned integer, or floating-point
5bd8deadSopenharmony_ci    variables.  If a program environment parameter is read using a data type
5bd8deadSopenharmony_ci    other than the one used to specify it, an undefined value is returned.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Material Property Bindings
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Binding                        Components  Underlying State
5bd8deadSopenharmony_ci      -----------------------------  ----------  ----------------------------
5bd8deadSopenharmony_ci      state.material.ambient         (r,g,b,a)   front ambient material color
5bd8deadSopenharmony_ci      state.material.diffuse         (r,g,b,a)   front diffuse material color
5bd8deadSopenharmony_ci      state.material.specular        (r,g,b,a)   front specular material color
5bd8deadSopenharmony_ci      state.material.emission        (r,g,b,a)   front emissive material color
5bd8deadSopenharmony_ci      state.material.shininess       (s,0,0,1)   front material shininess
5bd8deadSopenharmony_ci      state.material.front.ambient   (r,g,b,a)   front ambient material color
5bd8deadSopenharmony_ci      state.material.front.diffuse   (r,g,b,a)   front diffuse material color
5bd8deadSopenharmony_ci      state.material.front.specular  (r,g,b,a)   front specular material color
5bd8deadSopenharmony_ci      state.material.front.emission  (r,g,b,a)   front emissive material color
5bd8deadSopenharmony_ci      state.material.front.shininess (s,0,0,1)   front material shininess
5bd8deadSopenharmony_ci      state.material.back.ambient    (r,g,b,a)   back ambient material color
5bd8deadSopenharmony_ci      state.material.back.diffuse    (r,g,b,a)   back diffuse material color
5bd8deadSopenharmony_ci      state.material.back.specular   (r,g,b,a)   back specular material color
5bd8deadSopenharmony_ci      state.material.back.emission   (r,g,b,a)   back emissive material color
5bd8deadSopenharmony_ci      state.material.back.shininess  (s,0,0,1)   back material shininess
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Table X.3:  Material Property Bindings.  If a material face is not
5bd8deadSopenharmony_ci      specified in the binding, the front property is used.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If a program parameter binding matches any of the material properties
5bd8deadSopenharmony_ci    listed in Table X.3, the program parameter variable is filled according to
5bd8deadSopenharmony_ci    the table.  For ambient, diffuse, specular, or emissive colors, the "x",
5bd8deadSopenharmony_ci    "y", "z", and "w" components are filled with the "r", "g", "b", and "a"
5bd8deadSopenharmony_ci    components, respectively, of the corresponding material color.  For
5bd8deadSopenharmony_ci    material shininess, the "x" component is filled with the material's
5bd8deadSopenharmony_ci    specular exponent, and the "y", "z", and "w" components are filled with
5bd8deadSopenharmony_ci    the floating-point constants 0, 0, and 1, respectively.  Bindings
5bd8deadSopenharmony_ci    containing ".back" refer to the back material; all other bindings refer to
5bd8deadSopenharmony_ci    the front material.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Material properties can be changed inside a Begin/End pair, either
5bd8deadSopenharmony_ci    directly by calling Material, or indirectly through color material.
5bd8deadSopenharmony_ci    However, such property changes are not guaranteed to update program
5bd8deadSopenharmony_ci    parameter bindings until the following End command.  Program parameter
5bd8deadSopenharmony_ci    variables bound to material properties changed inside a Begin/End pair are
5bd8deadSopenharmony_ci    undefined until the following End command.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Light Property Bindings
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Binding                        Components  Underlying State
5bd8deadSopenharmony_ci      -----------------------------  ----------  ----------------------------
5bd8deadSopenharmony_ci      state.light[n].ambient         (r,g,b,a)   light n ambient color
5bd8deadSopenharmony_ci      state.light[n].diffuse         (r,g,b,a)   light n diffuse color
5bd8deadSopenharmony_ci      state.light[n].specular        (r,g,b,a)   light n specular color
5bd8deadSopenharmony_ci      state.light[n].position        (x,y,z,w)   light n position
5bd8deadSopenharmony_ci      state.light[n].attenuation     (a,b,c,e)   light n attenuation constants
5bd8deadSopenharmony_ci                                                 and spot light exponent
5bd8deadSopenharmony_ci      state.light[n].spot.direction  (x,y,z,c)   light n spot direction and
5bd8deadSopenharmony_ci                                                 cutoff angle cosine
5bd8deadSopenharmony_ci      state.light[n].half            (x,y,z,1)   light n infinite half-angle
5bd8deadSopenharmony_ci      state.lightmodel.ambient       (r,g,b,a)   light model ambient color
5bd8deadSopenharmony_ci      state.lightmodel.scenecolor    (r,g,b,a)   light model front scene color
5bd8deadSopenharmony_ci      state.lightmodel.              (r,g,b,a)   light model front scene color
5bd8deadSopenharmony_ci               front.scenecolor
5bd8deadSopenharmony_ci      state.lightmodel.              (r,g,b,a)   light model back scene color
5bd8deadSopenharmony_ci               back.scenecolor
5bd8deadSopenharmony_ci      state.lightprod[n].ambient     (r,g,b,a)   light n / front material
5bd8deadSopenharmony_ci                                                 ambient color product
5bd8deadSopenharmony_ci      state.lightprod[n].diffuse     (r,g,b,a)   light n / front material
5bd8deadSopenharmony_ci                                                 diffuse color product
5bd8deadSopenharmony_ci      state.lightprod[n].specular    (r,g,b,a)   light n / front material
5bd8deadSopenharmony_ci                                                 specular color product
5bd8deadSopenharmony_ci      state.lightprod[n].            (r,g,b,a)   light n / front material
5bd8deadSopenharmony_ci              front.ambient                      ambient color product
5bd8deadSopenharmony_ci      state.lightprod[n].            (r,g,b,a)   light n / front material
5bd8deadSopenharmony_ci              front.diffuse                      diffuse color product
5bd8deadSopenharmony_ci      state.lightprod[n].            (r,g,b,a)   light n / front material
5bd8deadSopenharmony_ci              front.specular                     specular color product
5bd8deadSopenharmony_ci      state.lightprod[n].            (r,g,b,a)   light n / back material
5bd8deadSopenharmony_ci              back.ambient                       ambient color product
5bd8deadSopenharmony_ci      state.lightprod[n].            (r,g,b,a)   light n / back material
5bd8deadSopenharmony_ci              back.diffuse                       diffuse color product
5bd8deadSopenharmony_ci      state.lightprod[n].            (r,g,b,a)   light n / back material
5bd8deadSopenharmony_ci              back.specular                      specular color product
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Table X.4: Light Property Bindings.  <n> indicates a light number.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If a program parameter binding matches "state.light[n].ambient",
5bd8deadSopenharmony_ci    "state.light[n].diffuse", or "state.light[n].specular", the "x", "y", "z",
5bd8deadSopenharmony_ci    and "w" components of the program parameter variable are filled with the
5bd8deadSopenharmony_ci    "r", "g", "b", and "a" components, respectively, of the corresponding
5bd8deadSopenharmony_ci    light color.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If a program parameter binding matches "state.light[n].position", the "x",
5bd8deadSopenharmony_ci    "y", "z", and "w" components of the program parameter variable are filled
5bd8deadSopenharmony_ci    with the "x", "y", "z", and "w" components, respectively, of the light
5bd8deadSopenharmony_ci    position.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If a program parameter binding matches "state.light[n].attenuation", the
5bd8deadSopenharmony_ci    "x", "y", and "z" components of the program parameter variable are filled
5bd8deadSopenharmony_ci    with the constant, linear, and quadratic attenuation parameters of the
5bd8deadSopenharmony_ci    specified light, respectively (section 2.13.1).  The "w" component of the
5bd8deadSopenharmony_ci    program parameter variable is filled with the spot light exponent of the
5bd8deadSopenharmony_ci    specified light.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If a program parameter binding matches "state.light[n].spot.direction",
5bd8deadSopenharmony_ci    the "x", "y", and "z" components of the program parameter variable are
5bd8deadSopenharmony_ci    filled with the "x", "y", and "z" components of the spot light direction
5bd8deadSopenharmony_ci    of the specified light, respectively (section 2.13.1).  The "w" component
5bd8deadSopenharmony_ci    of the program parameter variable is filled with the cosine of the spot
5bd8deadSopenharmony_ci    light cutoff angle of the specified light.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If a program parameter binding matches "state.light[n].half", the "x",
5bd8deadSopenharmony_ci    "y", and "z" components of the program parameter variable are filled with
5bd8deadSopenharmony_ci    the x, y, and z components, respectively, of the normalized infinite
5bd8deadSopenharmony_ci    half-angle vector
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      h_inf = || P + (0, 0, 1) ||.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The "w" component is filled with 1.0.  In the computation of h_inf, P
5bd8deadSopenharmony_ci    consists of the x, y, and z coordinates of the normalized vector from the
5bd8deadSopenharmony_ci    eye position P_e to the eye-space light position P_pli (section 2.13.1).
5bd8deadSopenharmony_ci    h_inf is defined to correspond to the normalized half-angle vector when
5bd8deadSopenharmony_ci    using an infinite light (w coordinate of the position is zero) and an
5bd8deadSopenharmony_ci    infinite viewer (v_bs is FALSE).  For local lights or a local viewer,
5bd8deadSopenharmony_ci    h_inf is well-defined but does not match the normalized half-angle vector,
5bd8deadSopenharmony_ci    which will vary depending on the vertex position.
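
    As an illustration only, the computation of h_inf described above can be
    sketched in Python.  The function names are hypothetical, and the sketch
    assumes an infinite light (w of the position is zero), so that P is
    simply the normalized (x, y, z) of the eye-space light position.

```python
import math


def normalize(v):
    """Scale a 3-component vector to unit length."""
    length = math.sqrt(sum(c * c for c in v))
    return tuple(c / length for c in v)


def infinite_half_vector(light_position_eye):
    """Sketch of the value bound by state.light[n].half.

    light_position_eye: eye-space light position (x, y, z, w), w == 0.
    Assumption: for w == 0, the direction toward the light is the
    normalized (x, y, z) of the position.
    """
    px, py, pz = normalize(light_position_eye[:3])
    # Add the infinite-viewer direction (0, 0, 1), then renormalize.
    hx, hy, hz = normalize((px, py, pz + 1.0))
    # The "w" component is always 1.0.
    return (hx, hy, hz, 1.0)
```

    The sketch does not handle a light direction of exactly (0, 0, -1), for
    which P + (0, 0, 1) has zero length.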
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If a program parameter binding matches "state.lightmodel.ambient", the
5bd8deadSopenharmony_ci    "x", "y", "z", and "w" components of the program parameter variable are
5bd8deadSopenharmony_ci    filled with the "r", "g", "b", and "a" components of the light model
5bd8deadSopenharmony_ci    ambient color, respectively.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If a program parameter binding matches "state.lightmodel.scenecolor" or
5bd8deadSopenharmony_ci    "state.lightmodel.front.scenecolor", the "x", "y", and "z" components of
5bd8deadSopenharmony_ci    the program parameter variable are filled with the "r", "g", and "b"
5bd8deadSopenharmony_ci    components, respectively, of the "front scene color"
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      c_scene = a_cs * a_cm + e_cm,
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    where a_cs is the light model ambient color, a_cm is the front ambient
5bd8deadSopenharmony_ci    material color, and e_cm is the front emissive material color.  The "w"
5bd8deadSopenharmony_ci    component of the program parameter variable is filled with the alpha
5bd8deadSopenharmony_ci    component of the front diffuse material color.  If a program parameter
5bd8deadSopenharmony_ci    binding matches "state.lightmodel.back.scenecolor", a similar back scene
5bd8deadSopenharmony_ci    color, computed using back-facing material properties, is used.  The front
5bd8deadSopenharmony_ci    and back scene colors match the values that would be assigned to vertices
5bd8deadSopenharmony_ci    using conventional lighting if all lights were disabled.
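
    A minimal Python sketch of the scene color computation follows.  The
    function and argument names are hypothetical; colors are assumed to be
    (r, g, b, a) tuples, and the same function would serve for the back
    scene color if back material properties were passed in.

```python
def scene_color(light_model_ambient, material_ambient,
                material_emission, material_diffuse):
    """Sketch of state.lightmodel.scenecolor: a_cs * a_cm + e_cm."""
    rgb = tuple(
        a_cs * a_cm + e_cm
        for a_cs, a_cm, e_cm in zip(light_model_ambient[:3],
                                    material_ambient[:3],
                                    material_emission[:3])
    )
    # The "w" component is the alpha of the diffuse material color.
    return rgb + (material_diffuse[3],)
```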
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If a program parameter binding matches anything beginning with
5bd8deadSopenharmony_ci    "state.lightprod[n]", the "x", "y", and "z" components of the program
5bd8deadSopenharmony_ci    parameter variable are filled with the "r", "g", and "b" components,
5bd8deadSopenharmony_ci    respectively, of the corresponding light product.  The three light product
5bd8deadSopenharmony_ci    components are the products of the corresponding color components of the
5bd8deadSopenharmony_ci    specified material property and the light color of the specified light
5bd8deadSopenharmony_ci    (see Table X.4).  The "w" component of the program parameter variable is
5bd8deadSopenharmony_ci    filled with the alpha component of the specified material property.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Light products depend on material properties, which can be changed inside
5bd8deadSopenharmony_ci    a Begin/End pair.  Such property changes are not guaranteed to take effect
5bd8deadSopenharmony_ci    until the following End command.  Program parameter variables bound to
5bd8deadSopenharmony_ci    light products whose corresponding material property changes inside a
5bd8deadSopenharmony_ci    Begin/End pair are undefined until the following End command.
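
    The light product rule can be illustrated with a short Python sketch.
    The function name is hypothetical, and both arguments are assumed to be
    (r, g, b, a) tuples holding the light color and the material property
    selected by the binding (for example, light n diffuse and front material
    diffuse for "state.lightprod[n].diffuse").

```python
def light_product(light_color, material_color):
    """Sketch of a state.lightprod[n].* value."""
    # r, g, b: component-wise product of light and material colors.
    rgb = tuple(l * m for l, m in zip(light_color[:3], material_color[:3]))
    # w: alpha of the material property; the light alpha is not used.
    return rgb + (material_color[3],)
```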
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Texture Coordinate Generation Property Bindings
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Binding                    Components  Underlying State
5bd8deadSopenharmony_ci      -------------------------  ----------  ----------------------------
5bd8deadSopenharmony_ci      state.texgen[n].eye.s      (a,b,c,d)   TexGen eye linear plane
5bd8deadSopenharmony_ci                                             coefficients, s coord, unit n
5bd8deadSopenharmony_ci      state.texgen[n].eye.t      (a,b,c,d)   TexGen eye linear plane
5bd8deadSopenharmony_ci                                             coefficients, t coord, unit n
5bd8deadSopenharmony_ci      state.texgen[n].eye.r      (a,b,c,d)   TexGen eye linear plane
5bd8deadSopenharmony_ci                                             coefficients, r coord, unit n
5bd8deadSopenharmony_ci      state.texgen[n].eye.q      (a,b,c,d)   TexGen eye linear plane
5bd8deadSopenharmony_ci                                             coefficients, q coord, unit n
5bd8deadSopenharmony_ci      state.texgen[n].object.s   (a,b,c,d)   TexGen object linear plane
5bd8deadSopenharmony_ci                                             coefficients, s coord, unit n
5bd8deadSopenharmony_ci      state.texgen[n].object.t   (a,b,c,d)   TexGen object linear plane
5bd8deadSopenharmony_ci                                             coefficients, t coord, unit n
5bd8deadSopenharmony_ci      state.texgen[n].object.r   (a,b,c,d)   TexGen object linear plane
5bd8deadSopenharmony_ci                                             coefficients, r coord, unit n
5bd8deadSopenharmony_ci      state.texgen[n].object.q   (a,b,c,d)   TexGen object linear plane
5bd8deadSopenharmony_ci                                             coefficients, q coord, unit n
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Table X.5:  Texture Coordinate Generation Property Bindings.  "[n]" is
5bd8deadSopenharmony_ci      optional -- texture unit <n> is used if specified; texture unit 0 is
5bd8deadSopenharmony_ci      used otherwise.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If a program parameter binding matches a set of TexGen plane coefficients,
5bd8deadSopenharmony_ci    the "x", "y", "z", and "w" components of the program parameter variable
5bd8deadSopenharmony_ci    are filled with the coefficients p1, p2, p3, and p4, respectively, for
5bd8deadSopenharmony_ci    object linear coefficients, and the coefficients p1', p2', p3', and p4',
5bd8deadSopenharmony_ci    respectively, for eye linear coefficients (section 2.10.4).
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Fog Property Bindings
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Binding                        Components  Underlying State
5bd8deadSopenharmony_ci      -----------------------------  ----------  ----------------------------
5bd8deadSopenharmony_ci      state.fog.color                (r,g,b,a)   RGB fog color (section 3.10)
5bd8deadSopenharmony_ci      state.fog.params               (d,s,e,r)   fog density, linear start
5bd8deadSopenharmony_ci                                                 and end, and 1/(end-start)
5bd8deadSopenharmony_ci                                                 (section 3.10)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Table X.6:  Fog Property Bindings
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If a program parameter binding matches "state.fog.color", the "x", "y",
5bd8deadSopenharmony_ci    "z", and "w" components of the program parameter variable are filled with
5bd8deadSopenharmony_ci    the "r", "g", "b", and "a" components, respectively, of the fog color
5bd8deadSopenharmony_ci    (section 3.10).
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If a program parameter binding matches "state.fog.params", the "x", "y",
5bd8deadSopenharmony_ci    and "z" components of the program parameter variable are filled with the
5bd8deadSopenharmony_ci    fog density, linear fog start, and linear fog end parameters (section
5bd8deadSopenharmony_ci    3.10), respectively.  The "w" component is filled with 1/(end-start),
5bd8deadSopenharmony_ci    where end and start are the linear fog end and start parameters,
5bd8deadSopenharmony_ci    respectively.
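
    The following Python sketch shows how the "state.fog.params" vector
    could be assembled, and one way a program might use its "w" component.
    The function names are hypothetical.  The linear fog factor
    (end - c) / (end - start), clamped to [0, 1], is the conventional linear
    fog equation; the precomputed reciprocal turns the division into a
    multiply.

```python
def fog_params(density, start, end):
    """Sketch of state.fog.params: (density, start, end, 1/(end-start))."""
    return (density, start, end, 1.0 / (end - start))


def linear_fog_factor(params, c):
    """Linear fog factor for fog coordinate c, using the packed vector."""
    _, _, end, inv_range = params
    f = (end - c) * inv_range
    return min(max(f, 0.0), 1.0)
```

    The sketch does not handle the case where end equals start.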
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Clip Plane Property Bindings
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Binding                        Components  Underlying State
5bd8deadSopenharmony_ci      -----------------------------  ----------  ----------------------------
5bd8deadSopenharmony_ci      state.clip[n].plane            (a,b,c,d)   clip plane n coefficients
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Table X.7:  Clip Plane Property Bindings.  <n> specifies the clip plane
5bd8deadSopenharmony_ci      number, and is required.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If a program parameter binding matches "state.clip[n].plane", the "x",
5bd8deadSopenharmony_ci    "y", "z", and "w" components of the program parameter variable are filled
5bd8deadSopenharmony_ci    with the coefficients p1', p2', p3', and p4', respectively, of clip plane
5bd8deadSopenharmony_ci    <n> (section 2.11).
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Point Property Bindings
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Binding                        Components  Underlying State
5bd8deadSopenharmony_ci      -----------------------------  ----------  ----------------------------
5bd8deadSopenharmony_ci      state.point.size               (s,n,x,f)   point size, min and max size
5bd8deadSopenharmony_ci                                                 clamps, and fade threshold
5bd8deadSopenharmony_ci                                                 (section 3.3)
5bd8deadSopenharmony_ci      state.point.attenuation        (a,b,c,1)   point size attenuation consts
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Table X.8:  Point Property Bindings
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If a program parameter binding matches "state.point.size", the "x", "y",
5bd8deadSopenharmony_ci    "z", and "w" components of the program parameter variable are filled with
5bd8deadSopenharmony_ci    the point size, minimum point size, maximum point size, and fade
5bd8deadSopenharmony_ci    threshold, respectively (section 3.3).
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If a program parameter binding matches "state.point.attenuation", the "x",
5bd8deadSopenharmony_ci    "y", and "z" components of the program parameter variable are filled with
5bd8deadSopenharmony_ci    the constant, linear, and quadratic point size attenuation parameters (a,
5bd8deadSopenharmony_ci    b, and c), respectively (section 3.3).  The "w" component is filled with
5bd8deadSopenharmony_ci    1.0.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Texture Environment Property Bindings
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Binding                    Components  Underlying State
5bd8deadSopenharmony_ci      -------------------------  ----------  ----------------------------
5bd8deadSopenharmony_ci      state.texenv[n].color      (r,g,b,a)   texture environment n color
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Table X.9:  Texture Environment Property Bindings.  "[n]" is optional --
5bd8deadSopenharmony_ci      texture unit <n> is used if specified; texture unit 0 is used otherwise.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If a program parameter binding matches "state.texenv[n].color", the "x",
5bd8deadSopenharmony_ci    "y", "z", and "w" components of the program parameter variable are filled
5bd8deadSopenharmony_ci    with the "r", "g", "b", and "a" components, respectively, of the
5bd8deadSopenharmony_ci    corresponding texture environment color.  Note that only "legacy" texture
5bd8deadSopenharmony_ci    units, as queried by MAX_TEXTURE_UNITS, include texture environment state.
5bd8deadSopenharmony_ci    Texture image units and texture coordinate sets do not have associated
5bd8deadSopenharmony_ci    texture environment state.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Depth Property Bindings
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Binding                      Components  Underlying State
5bd8deadSopenharmony_ci      ---------------------------  ----------  ----------------------------
5bd8deadSopenharmony_ci      state.depth.range            (n,f,d,1)   Depth range near, far, and
5bd8deadSopenharmony_ci                                               (far-near) (section 2.10.1)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Table X.10:  Depth Property Bindings
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If a program parameter binding matches "state.depth.range", the "x" and
5bd8deadSopenharmony_ci    "y" components of the program parameter variable are filled with the
5bd8deadSopenharmony_ci    mappings of near and far clipping planes to window coordinates,
5bd8deadSopenharmony_ci    respectively.  The "z" component is filled with the difference of the
5bd8deadSopenharmony_ci    mappings of near and far clipping planes, far minus near.  The "w"
5bd8deadSopenharmony_ci    component is filled with 1.0.
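
    As a sketch, the "state.depth.range" vector and one plausible use of it
    can be written in Python as follows.  The function names are
    hypothetical; the mapping from normalized device z in [-1, 1] to window
    z is the conventional depth range transform.

```python
def depth_range_binding(near, far):
    """Sketch of state.depth.range: (near, far, far - near, 1.0)."""
    return (near, far, far - near, 1.0)


def window_depth(binding, z_ndc):
    """Map normalized device z to window z using the bound vector."""
    near, far, diff, _ = binding
    return z_ndc * diff / 2.0 + (near + far) / 2.0
```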
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Matrix Property Bindings
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Binding                               Underlying State
5bd8deadSopenharmony_ci      ------------------------------------  ---------------------------
5bd8deadSopenharmony_ci      * state.matrix.modelview[n]           modelview matrix n
5bd8deadSopenharmony_ci        state.matrix.projection             projection matrix
5bd8deadSopenharmony_ci        state.matrix.mvp                    modelview-projection matrix
5bd8deadSopenharmony_ci      * state.matrix.texture[n]             texture matrix n
5bd8deadSopenharmony_ci        state.matrix.program[n]             program matrix n
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Table X.11:  Base Matrix Property Bindings.  The "[n]" syntax indicates
5bd8deadSopenharmony_ci      a specific matrix number.  For modelview and texture matrices, a matrix
5bd8deadSopenharmony_ci      number is optional, and matrix zero will be used if the matrix number is
5bd8deadSopenharmony_ci      omitted.  These base bindings may further be modified by an
5bd8deadSopenharmony_ci      inverse/transpose selector and a row selector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If the beginning of a program parameter binding matches any of the matrix
5bd8deadSopenharmony_ci    binding names listed in Table X.11, the binding corresponds to a 4x4
5bd8deadSopenharmony_ci    matrix.  If the parameter binding is followed by ".inverse", ".transpose",
5bd8deadSopenharmony_ci    or ".invtrans" (<stateMatModifier> grammar rule), the inverse, transpose,
5bd8deadSopenharmony_ci    or transpose of the inverse, respectively, of the matrix specified in
5bd8deadSopenharmony_ci    Table X.11 is selected.  Otherwise, the matrix specified in Table X.11 is
5bd8deadSopenharmony_ci    selected.  If the specified matrix is poorly-conditioned (singular or
5bd8deadSopenharmony_ci    nearly so), its inverse matrix is undefined.  The binding name
5bd8deadSopenharmony_ci    "state.matrix.mvp" refers to the product of modelview matrix zero and the
5bd8deadSopenharmony_ci    projection matrix, defined as
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci       MVP = P * M0,
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    where P is the projection matrix and M0 is modelview matrix zero.
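
    The order of the product matters.  The Python sketch below (hypothetical
    helper names, matrices stored as lists of four rows) forms MVP = P * M0
    and can be compared against the reversed product M0 * P.

```python
def mat_mul(a, b):
    """Product of two 4x4 matrices stored as lists of rows."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)]
        for i in range(4)
    ]


def mvp_matrix(projection, modelview0):
    """Sketch of state.matrix.mvp: projection times modelview matrix 0."""
    return mat_mul(projection, modelview0)
```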
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If the selected matrix is followed by ".row[<a>]" (matching the
5bd8deadSopenharmony_ci    <stateMatrixRow> grammar rule), the "x", "y", "z", and "w" components of
5bd8deadSopenharmony_ci    the program parameter variable are filled with the four entries of row <a>
5bd8deadSopenharmony_ci    of the selected matrix.  In the example,
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      PARAM m0 = state.matrix.modelview[1].row[0];
5bd8deadSopenharmony_ci      PARAM m1 = state.matrix.projection.transpose.row[3];
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    the variable "m0" is set to the first row (row 0) of modelview matrix 1
5bd8deadSopenharmony_ci    and "m1" is set to the last row (row 3) of the transpose of the projection
5bd8deadSopenharmony_ci    matrix.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    For program parameter array bindings, multiple rows of the selected matrix
5bd8deadSopenharmony_ci    can be bound via the <stateMatrixRows> grammar rule.  If the selected
5bd8deadSopenharmony_ci    matrix binding is followed by ".row[<a>..<b>]", the result is equivalent
5bd8deadSopenharmony_ci    to specifying matrix rows <a> through <b>, in order.  A program will fail
5bd8deadSopenharmony_ci    to load if <a> is greater than <b>.  If no row selection is specified
5bd8deadSopenharmony_ci    (<optMatrixRows> matches ""), matrix rows 0 through 3 are bound in order.
5bd8deadSopenharmony_ci    In the example,
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      PARAM m2[] = { state.matrix.program[0].row[1..2] };
5bd8deadSopenharmony_ci      PARAM m3[] = { state.matrix.program[0].transpose };
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    the array "m2" has two entries, containing rows 1 and 2 of program matrix
5bd8deadSopenharmony_ci    zero, and "m3" has four entries, containing all four rows of the transpose
5bd8deadSopenharmony_ci    of program matrix zero.
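
    The row selection rules can be sketched in Python as shown below.  The
    function name and argument conventions are hypothetical; the sketch
    supports only the ".transpose" modifier (an inverse would need a matrix
    inversion routine) and signals <a> greater than <b> with an exception,
    standing in for the program failing to load.

```python
def bind_matrix_rows(matrix, modifier="", first=0, last=3):
    """Return the rows bound by "<matrix><modifier>.row[first..last]".

    matrix: 4x4 matrix as a list of four rows.
    modifier: "" or "transpose" (the only modifiers this sketch handles).
    With the default first/last, all four rows are bound in order.
    """
    if first > last:
        raise ValueError("program fails to load: row range out of order")
    if modifier == "transpose":
        selected = [list(column) for column in zip(*matrix)]
    elif modifier == "":
        selected = matrix
    else:
        raise NotImplementedError(modifier)
    return [tuple(selected[row]) for row in range(first, last + 1)]
```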
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.3.4, Program Temporaries
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Program temporary variables are used to hold temporary results during
5bd8deadSopenharmony_ci    program execution.  Temporaries do not persist between program
5bd8deadSopenharmony_ci    invocations, and are undefined at the beginning of each program
5bd8deadSopenharmony_ci    invocation.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Temporary variables are declared explicitly using the <TEMP_statement>
5bd8deadSopenharmony_ci    grammar rule.  Each such statement can declare one or more temporaries.
5bd8deadSopenharmony_ci    Temporaries cannot be declared implicitly.  Temporaries can be declared
5bd8deadSopenharmony_ci    using any component size ("SHORT" or "LONG") and type ("FLOAT" or "INT")
5bd8deadSopenharmony_ci    modifier.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Temporary variables may be declared as arrays.  Temporary variables
5bd8deadSopenharmony_ci    declared as arrays may be stored in slower memory than those not declared
5bd8deadSopenharmony_ci    as arrays, and it is recommended to use non-array variables unless array
5bd8deadSopenharmony_ci    functionality is required.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.3.5, Program Results
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Program result variables represent the per-vertex or per-fragment results
5bd8deadSopenharmony_ci    of the program.  All result variables have associated bindings, are
5bd8deadSopenharmony_ci    write-only during program execution, and are undefined at the beginning of
5bd8deadSopenharmony_ci    each program invocation.  Any vertex or fragment attributes corresponding
5bd8deadSopenharmony_ci    to unwritten result variables will be undefined in subsequent stages of
5bd8deadSopenharmony_ci    the pipeline.  Result variables may be declared explicitly via the
5bd8deadSopenharmony_ci    <OUTPUT_statement> grammar rule, or implicitly by using a result binding
5bd8deadSopenharmony_ci    in an instruction.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The set of available result bindings depends on the program type, and is
5bd8deadSopenharmony_ci    enumerated in the specifications for each program type.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Result variables may generally be declared as arrays, but the set of
5bd8deadSopenharmony_ci    bindings allowed for arrays is limited to state grouped in arrays (e.g.,
5bd8deadSopenharmony_ci    texture coordinates, clip distances, colors).  Additionally, all bindings
5bd8deadSopenharmony_ci    assigned to the array must be of the same binding type and must increase
5bd8deadSopenharmony_ci    consecutively.  Examples of valid and invalid binding lists for vertex
5bd8deadSopenharmony_ci    programs include:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      result.clip[1], result.clip[2]          # valid, 2-entry array
5bd8deadSopenharmony_ci      result.texcoord[0..3]                   # valid, 4-entry array
5bd8deadSopenharmony_ci      result.texcoord[1], result.texcoord[3]  # invalid, skipped texcoord 2
5bd8deadSopenharmony_ci      result.texcoord[2], result.texcoord[1]  # invalid, wrong order
5bd8deadSopenharmony_ci      result.texcoord[1], result.clip[2]      # invalid, different types
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Additionally, result bindings may be used in no more than one array
5bd8deadSopenharmony_ci    addressed with relative addressing.
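
    The "same binding type, consecutively increasing" rule for result array
    binding lists can be sketched in Python.  The function name and the
    representation of a binding as a (type, index) pair are hypothetical.
    The sketch does not check whether the binding type is one that may be
    grouped into an array, nor the relative addressing restriction.

```python
def is_valid_result_array(bindings):
    """Check a result array binding list.

    bindings: list of (binding_type, index) pairs in declaration order,
    e.g. [("texcoord", 0), ("texcoord", 1)].
    """
    if not bindings:
        return False
    first_type, first_index = bindings[0]
    for offset, (binding_type, index) in enumerate(bindings):
        # Every entry must share the first entry's type and follow it
        # with no gaps and no reordering.
        if binding_type != first_type or index != first_index + offset:
            return False
    return True
```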
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Implementations may have a limit on the total number of result binding
5bd8deadSopenharmony_ci    components used by each program target (MAX_PROGRAM_RESULT_COMPONENTS_NV).
5bd8deadSopenharmony_ci    Programs that require more result binding components than this limit will
5bd8deadSopenharmony_ci    fail to load.  The method of counting used result binding components is
5bd8deadSopenharmony_ci    implementation-dependent, but must satisfy the following properties:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      * If a result binding is not referenced in a program, or is referenced
5bd8deadSopenharmony_ci        only in declarations of result variables that are not used, none of
5bd8deadSopenharmony_ci        its components are counted.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      * A result binding component may be counted as used only if there exists
5bd8deadSopenharmony_ci        an instruction operand where
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci          - the component is enabled in the write mask (Section 2.X.4.3), and
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci          - the result binding is either
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci              - referenced directly by the operand,
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci              - bound to a declared variable referenced by the operand, or
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci              - bound to a declared array variable where another binding in
5bd8deadSopenharmony_ci                the array satisfies one of the two previous conditions.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        Implementations are not required to optimize out unused elements of a
5bd8deadSopenharmony_ci        result array or components that are used in only some elements of an
5bd8deadSopenharmony_ci        array.  The last of these rules is intended to cover the case where
5bd8deadSopenharmony_ci        the same result binding is used in multiple variables.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        For example, an instruction whose write mask selects only the x
5bd8deadSopenharmony_ci        component may result in the x component of a result binding being
5bd8deadSopenharmony_ci        counted, but may never result in the counting of the y, z, or w
5bd8deadSopenharmony_ci        components of any result binding.
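
    Because the counting method is implementation-dependent, only an upper
    bound can be illustrated.  The Python sketch below computes, for each
    result binding, the set of components that an implementation would be
    permitted to count under the properties above.  The function name and
    data layout are hypothetical: declarations map a variable name to the
    list of result bindings it covers, and each write names either a
    declared variable or a result binding directly.

```python
def countable_components(declarations, writes):
    """Upper bound on result binding components that may be counted.

    declarations: {variable name: [result binding, ...]}
    writes: [(operand name, write mask string such as "xy"), ...]
    Returns {result binding: set of components that may be counted}.
    """
    allowed = {}
    for operand, mask in writes:
        # A declared variable (or array) covers all of its bindings;
        # anything else is treated as a direct result binding.
        bindings = declarations.get(operand, [operand])
        for binding in bindings:
            allowed.setdefault(binding, set()).update(mask)
    return allowed
```

    Bindings that are never written do not appear in the result, and an
    actual implementation may count fewer components than this bound.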
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.3.6, Program Parameter Buffers
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Program parameter buffers are arrays consisting of single-component
5bd8deadSopenharmony_ci    typeless values or four-component typeless vectors stored in a buffer
5bd8deadSopenharmony_ci    object.  The GL provides an implementation-dependent number of buffer
5bd8deadSopenharmony_ci    object binding points for each program target, to which buffer objects can
5bd8deadSopenharmony_ci    be attached.  Program parameter buffer variables can be changed either by
5bd8deadSopenharmony_ci    updating the contents of bound buffer objects, or simply by changing the
5bd8deadSopenharmony_ci    buffer object attached to a binding point.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Program parameter buffer variables are used as constants during program
5bd8deadSopenharmony_ci    execution.  All program parameter buffer variables have an associated
5bd8deadSopenharmony_ci    binding and are read-only during program execution.  Program parameter
5bd8deadSopenharmony_ci    buffers retain their values across program invocations, although their
5bd8deadSopenharmony_ci    values may change as buffer object bindings or contents change.  Program
5bd8deadSopenharmony_ci    parameter buffer variables must be declared explicitly via the
5bd8deadSopenharmony_ci    <BUFFER_statement> grammar rule.  Program parameter buffer bindings
5bd8deadSopenharmony_ci    cannot be used directly in executable instructions.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Program parameter buffer variables are treated as an array of
5bd8deadSopenharmony_ci    single-component values if the <bufferDeclType> grammar rule matches
5bd8deadSopenharmony_ci    "BUFFER" or as an array of four-component vectors if it matches "BUFFER4".
5bd8deadSopenharmony_ci    A program will fail to load if a variable declared as "BUFFER" and another
5bd8deadSopenharmony_ci    variable declared as "BUFFER4" use the same buffer binding point.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Program parameter buffer variables may be declared as arrays, but all
5bd8deadSopenharmony_ci    bindings assigned to the array must use the same binding point and must
5bd8deadSopenharmony_ci    increase consecutively.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Binding                        Components  Underlying State
5bd8deadSopenharmony_ci      -----------------------------  ----------  -----------------------------
5bd8deadSopenharmony_ci      program.buffer[a][b]           (x,x,x,x)   program parameter buffer a,
5bd8deadSopenharmony_ci                                                   element b
5bd8deadSopenharmony_ci      program.buffer[a][b..c]        (x,x,x,x)   program parameter buffer a,
5bd8deadSopenharmony_ci                                                   elements b through c
5bd8deadSopenharmony_ci      program.buffer[a]              (x,x,x,x)   program parameter buffer a,
5bd8deadSopenharmony_ci                                                   all elements
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Table X.12: Program Parameter Buffer Bindings.  <a> indicates a buffer
5bd8deadSopenharmony_ci      number, <b> and <c> indicate individual elements.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If a program parameter buffer binding matches "program.buffer[a][b]", the
5bd8deadSopenharmony_ci    program parameter variable is filled with element <b> of the buffer
5bd8deadSopenharmony_ci    object bound to binding point <a>.  Each element of the bound buffer
5bd8deadSopenharmony_ci    object is treated as one or four words of data that can hold integer or
5bd8deadSopenharmony_ci    floating-point values.  When a single-component binding is evaluated, the
5bd8deadSopenharmony_ci    selected word is broadcast to all four components of the variable.  When a
5bd8deadSopenharmony_ci    four-component binding is evaluated, the four components of the buffer
5bd8deadSopenharmony_ci    element are loaded into the variable.  If no buffer object is bound to
5bd8deadSopenharmony_ci    binding point <a>, or the bound buffer object is not large enough to hold
5bd8deadSopenharmony_ci    an element <b>, the values used are undefined.  The binding point <a> must
5bd8deadSopenharmony_ci    be a nonnegative integer constant.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    For program parameter buffer array declarations, "program.buffer[a][b..c]"
5bd8deadSopenharmony_ci    is equivalent to specifying elements <b> through <c> of the buffer object
5bd8deadSopenharmony_ci    bound to binding point <a> in order.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    For program parameter buffer array declarations, "program.buffer[a]" is
5bd8deadSopenharmony_ci    equivalent to specifying the entire buffer -- elements 0 through <n>-1,
5bd8deadSopenharmony_ci    where <n> is either the size of the array (if declared) or the
5bd8deadSopenharmony_ci    implementation-dependent maximum parameter buffer object size limit (if no
5bd8deadSopenharmony_ci    size is declared).
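
    The difference between single-component and four-component bindings can
    be sketched in Python.  The function name is hypothetical, and the
    sketch assumes 32-bit little-endian words, which this section does not
    state.  Where the specification leaves the values undefined (element
    beyond the end of the buffer), the sketch returns None.

```python
import struct

WORD_SIZE = 4  # assumption: each word is 32 bits


def fetch_element(buffer_bytes, index, four_component, as_float=True):
    """Sketch of evaluating program.buffer[a][index].

    buffer_bytes: contents of the buffer object bound to binding point a.
    four_component: True for "BUFFER4" variables, False for "BUFFER".
    as_float: interpret the typeless words as floats or signed integers.
    """
    words = 4 if four_component else 1
    offset = index * words * WORD_SIZE
    if offset + words * WORD_SIZE > len(buffer_bytes):
        return None  # values are undefined; None is a stand-in
    fmt = "<" + ("f" if as_float else "i") * words
    values = struct.unpack_from(fmt, buffer_bytes, offset)
    # A single word is broadcast to all four components.
    return values if four_component else values * 4
```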
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.3.7, Program Condition Code Registers
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The program condition code registers are four-component vectors.  Each
5bd8deadSopenharmony_ci    component of a condition code register is a collection of single-bit
5bd8deadSopenharmony_ci    flags, including a sign flag (SF), a zero flag (ZF), an overflow flag
5bd8deadSopenharmony_ci    (OF), and a carry flag (CF).  There are two condition code registers (CC0
5bd8deadSopenharmony_ci    and CC1), whose values are undefined at the beginning of program
5bd8deadSopenharmony_ci    execution.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Most program instructions can optionally update one of the condition code
5bd8deadSopenharmony_ci    registers, by designating the condition code to update in the instruction.
5bd8deadSopenharmony_ci    When a condition code component is updated, the four flags of each
5bd8deadSopenharmony_ci    component of the condition code are set according to the corresponding
5bd8deadSopenharmony_ci    component of the instruction result.  Full details on the condition code
5bd8deadSopenharmony_ci    updates and tests can be found in Section 2.X.4.3.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The value of these four flags can be combined in various condition code
5bd8deadSopenharmony_ci    tests, which can be used to mask writes to destination variables and to
5bd8deadSopenharmony_ci    perform conditional branches or other conditional operations.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.3.8, Program Aliases
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Programs can create aliases by matching the <ALIAS_statement> grammar
5bd8deadSopenharmony_ci    rule.  Aliases allow programs to use multiple variable names to refer to a
5bd8deadSopenharmony_ci    single underlying variable.  For example, the statement
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      ALIAS var1 = var0
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    establishes a variable name of "var1".  Subsequent references to "var1" in
5bd8deadSopenharmony_ci    the program text are treated as references to "var0".  The left hand side
5bd8deadSopenharmony_ci    of an ALIAS statement must be a new variable name, and the right hand side
5bd8deadSopenharmony_ci    must be an established variable name.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Aliases are not considered variable declarations, so do not count against
5bd8deadSopenharmony_ci    the limits on the number of variable declarations allowed in the program
5bd8deadSopenharmony_ci    text.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.3.9, Program Resource Limits
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (see ARB_vertex_program specification, incorporates all the different
5bd8deadSopenharmony_ci    limits on instruction counts, temporaries, attribute bindings, program
5bd8deadSopenharmony_ci    parameters, and so on)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.4, Program Execution Environment
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The set of instructions supported for GPU programs is given in Table X.13
5bd8deadSopenharmony_ci    below and is described in detail in Section 2.X.8.  An instruction can use
5bd8deadSopenharmony_ci    up to three operands when it executes, and most instructions can write a
5bd8deadSopenharmony_ci    single result vector.  Instructions may also specify one or more
5bd8deadSopenharmony_ci    modifiers, according to the <opModifiers> grammar rule.  Instruction
5bd8deadSopenharmony_ci    modifiers affect how the specified operation is performed.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    GPU programs may operate on signed integer, unsigned integer, or
5bd8deadSopenharmony_ci    floating-point values; some instructions are capable of operating on any
5bd8deadSopenharmony_ci    of the three types.  However, the data type of the operands and the result
5bd8deadSopenharmony_ci    is always determined based solely on the instruction and its modifiers.
5bd8deadSopenharmony_ci    If any of the variables used in the instruction are typeless, they will be
5bd8deadSopenharmony_ci    interpreted according to the data type derived from the instruction.  If
5bd8deadSopenharmony_ci    any variables with a conflicting data type are used in the instruction,
5bd8deadSopenharmony_ci    the program will fail to load unless the "NTC" (no type checking)
5bd8deadSopenharmony_ci    instruction modifier is specified.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci                  Modifiers
5bd8deadSopenharmony_ci      Instruction F I C S H D  Out Inputs    Description
5bd8deadSopenharmony_ci      ----------- - - - - - -  --- --------  --------------------------------
5bd8deadSopenharmony_ci      ABS         X X X X X F  v   v         absolute value
5bd8deadSopenharmony_ci      ADD         X X X X X F  v   v,v       add
5bd8deadSopenharmony_ci      AND         - X X - - S  v   v,v       bitwise and
5bd8deadSopenharmony_ci      BRK         - - - - - -  -   c         break out of loop instruction
5bd8deadSopenharmony_ci      CAL         - - - - - -  -   c         subroutine call
5bd8deadSopenharmony_ci      CEIL        X X X X X F  v   vf        ceiling
5bd8deadSopenharmony_ci      CMP         X X X X X F  v   v,v,v     compare
5bd8deadSopenharmony_ci      CONT        - - - - - -  -   c         continue with next loop iteration
5bd8deadSopenharmony_ci      COS         X - X X X F  s   s         cosine with reduction to [-PI,PI]
5bd8deadSopenharmony_ci      DIV         X X X X X F  v   v,s       divide vector components by scalar
5bd8deadSopenharmony_ci      DP2         X - X X X F  s   v,v       2-component dot product
5bd8deadSopenharmony_ci      DP2A        X - X X X F  s   v,v,v     2-comp. dot product w/scalar add
5bd8deadSopenharmony_ci      DP3         X - X X X F  s   v,v       3-component dot product
5bd8deadSopenharmony_ci      DP4         X - X X X F  s   v,v       4-component dot product
5bd8deadSopenharmony_ci      DPH         X - X X X F  s   v,v       homogeneous dot product
5bd8deadSopenharmony_ci      DST         X - X X X F  v   v,v       distance vector
5bd8deadSopenharmony_ci      ELSE        - - - - - -  -   -         start if test else block
5bd8deadSopenharmony_ci      ENDIF       - - - - - -  -   -         end if test block
5bd8deadSopenharmony_ci      ENDREP      - - - - - -  -   -         end of repeat block
5bd8deadSopenharmony_ci      EX2         X - X X X F  s   s         exponential base 2
5bd8deadSopenharmony_ci      FLR         X X X X X F  v   vf        floor
5bd8deadSopenharmony_ci      FRC         X - X X X F  v   v         fraction
5bd8deadSopenharmony_ci      I2F         - X X - - S  vf  v         integer to float
5bd8deadSopenharmony_ci      IF          - - - - - -  -   c         start of if test block
5bd8deadSopenharmony_ci      KIL         X X - - X F  -   vc        kill fragment
5bd8deadSopenharmony_ci      LG2         X - X X X F  s   s         logarithm base 2
5bd8deadSopenharmony_ci      LIT         X - X X X F  v   v         compute lighting coefficients
5bd8deadSopenharmony_ci      LRP         X - X X X F  v   v,v,v     linear interpolation
5bd8deadSopenharmony_ci      MAD         X X X X X F  v   v,v,v     multiply and add
5bd8deadSopenharmony_ci      MAX         X X X X X F  v   v,v       maximum
5bd8deadSopenharmony_ci      MIN         X X X X X F  v   v,v       minimum
5bd8deadSopenharmony_ci      MOD         - X X - - S  v   v,s       modulus vector components by scalar
5bd8deadSopenharmony_ci      MOV         X X X X X F  v   v         move
5bd8deadSopenharmony_ci      MUL         X X X X X F  v   v,v       multiply
5bd8deadSopenharmony_ci      NOT         - X X - - S  v   v         bitwise not
5bd8deadSopenharmony_ci      NRM         X - X X X F  v   v         normalize 3-component vector
5bd8deadSopenharmony_ci      OR          - X X - - S  v   v,v       bitwise or
5bd8deadSopenharmony_ci      PK2H        X X - - - F  s   vf        pack two 16-bit floats
5bd8deadSopenharmony_ci      PK2US       X X - - - F  s   vf        pack two floats as unsigned 16-bit
5bd8deadSopenharmony_ci      PK4B        X X - - - F  s   vf        pack four floats as signed 8-bit
5bd8deadSopenharmony_ci      PK4UB       X X - - - F  s   vf        pack four floats as unsigned 8-bit
5bd8deadSopenharmony_ci      POW         X - X X X F  s   s,s       exponentiate
5bd8deadSopenharmony_ci      RCC         X - X X X F  s   s         reciprocal (clamped)
5bd8deadSopenharmony_ci      RCP         X - X X X F  s   s         reciprocal
5bd8deadSopenharmony_ci      REP         X X - - X F  -   v         start of repeat block
5bd8deadSopenharmony_ci      RET         - - - - - -  -   c         subroutine return
5bd8deadSopenharmony_ci      RFL         X - X X X F  v   v,v       reflection vector
5bd8deadSopenharmony_ci      ROUND       X X X X X F  v   vf        round to nearest integer
5bd8deadSopenharmony_ci      RSQ         X - X X X F  s   s         reciprocal square root
5bd8deadSopenharmony_ci      SAD         - X X - - S  vu  v,v,vu    sum of absolute differences
5bd8deadSopenharmony_ci      SCS         X - X X X F  v   s         sine/cosine without reduction
5bd8deadSopenharmony_ci      SEQ         X X X X X F  v   v,v       set on equal
5bd8deadSopenharmony_ci      SFL         X X X X X F  v   v,v       set on false
5bd8deadSopenharmony_ci      SGE         X X X X X F  v   v,v       set on greater than or equal
5bd8deadSopenharmony_ci      SGT         X X X X X F  v   v,v       set on greater than
5bd8deadSopenharmony_ci      SHL         - X X - - S  v   v,s       shift left
5bd8deadSopenharmony_ci      SHR         - X X - - S  v   v,s       shift right
5bd8deadSopenharmony_ci      SIN         X - X X X F  s   s         sine with reduction to [-PI,PI]
5bd8deadSopenharmony_ci      SLE         X X X X X F  v   v,v       set on less than or equal
5bd8deadSopenharmony_ci      SLT         X X X X X F  v   v,v       set on less than
5bd8deadSopenharmony_ci      SNE         X X X X X F  v   v,v       set on not equal
5bd8deadSopenharmony_ci      SSG         X - X X X F  v   v         set sign
5bd8deadSopenharmony_ci      STR         X X X X X F  v   v,v       set on true
5bd8deadSopenharmony_ci      SUB         X X X X X F  v   v,v       subtract
5bd8deadSopenharmony_ci      SWZ         X - X X X F  v   v         extended swizzle
5bd8deadSopenharmony_ci      TEX         X X X X - F  v   vf        texture sample
5bd8deadSopenharmony_ci      TRUNC       X X X X X F  v   vf        truncate (round toward zero)
5bd8deadSopenharmony_ci      TXB         X X X X - F  v   vf        texture sample with bias
5bd8deadSopenharmony_ci      TXD         X X X X - F  v   vf,vf,vf  texture sample w/partials
5bd8deadSopenharmony_ci      TXF         X X X X - F  v   vs        texel fetch
5bd8deadSopenharmony_ci      TXL         X X X X - F  v   vf        texture sample w/LOD
5bd8deadSopenharmony_ci      TXP         X X X X - F  v   vf        texture sample w/projection
5bd8deadSopenharmony_ci      TXQ         - - - - - S  vs  vs        texture info query
5bd8deadSopenharmony_ci      UP2H        X X X X - F  vf  s         unpack two 16-bit floats
5bd8deadSopenharmony_ci      UP2US       X X X X - F  vf  s         unpack two unsigned 16-bit ints
5bd8deadSopenharmony_ci      UP4B        X X X X - F  vf  s         unpack four signed 8-bit ints
5bd8deadSopenharmony_ci      UP4UB       X X X X - F  vf  s         unpack four unsigned 8-bit ints
5bd8deadSopenharmony_ci      X2D         X - X X X F  v   v,v,v     2D coordinate transformation
5bd8deadSopenharmony_ci      XOR         - X X - - S  v   v,v       exclusive or
5bd8deadSopenharmony_ci      XPD         X - X X X F  v   v,v       cross product
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Table X.13:  Summary of NV_gpu_program4 instructions.  The "Modifiers"
5bd8deadSopenharmony_ci      columns specify the set of modifiers allowed for the instruction:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        F = floating-point data type modifiers
5bd8deadSopenharmony_ci        I = signed and unsigned integer data type modifiers
5bd8deadSopenharmony_ci        C = condition code update modifiers
5bd8deadSopenharmony_ci        S = clamping (saturation) modifiers
5bd8deadSopenharmony_ci        H = half-precision float data type suffix
5bd8deadSopenharmony_ci        D = default data type modifier (F, U, or S)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      The input and output columns describe the formats of the operands and
5bd8deadSopenharmony_ci      results of the instruction.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        v:  4-component vector (data type is inherited from operation)
5bd8deadSopenharmony_ci        vf: 4-component vector (data type is always floating-point)
5bd8deadSopenharmony_ci        vs: 4-component vector (data type is always signed integer)
5bd8deadSopenharmony_ci        vu: 4-component vector (data type is always unsigned integer)
5bd8deadSopenharmony_ci        s:  scalar (replicated if written to a vector destination;
5bd8deadSopenharmony_ci                    data type is inherited from operation)
5bd8deadSopenharmony_ci        c:  condition code test result (e.g., "EQ", "GT1.x")
5bd8deadSopenharmony_ci        vc: 4-component vector or condition code test
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.4.1, Program Instruction Modifiers
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    There are several types of instruction modifiers available.  A data type
5bd8deadSopenharmony_ci    modifier specifies that an instruction should operate on signed integer,
5bd8deadSopenharmony_ci    unsigned integer, or floating-point data, when multiple data types are
5bd8deadSopenharmony_ci    supported.  A clamping modifier applies to instructions with
5bd8deadSopenharmony_ci    floating-point results, and specifies the range to which the results
5bd8deadSopenharmony_ci    should be clamped.  A condition code update modifier specifies that the
5bd8deadSopenharmony_ci    instruction should update one of the condition code variables.  Several
5bd8deadSopenharmony_ci    other special modifiers are also provided.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Instruction modifiers may be specified as stand-alone modifiers or as
5bd8deadSopenharmony_ci    suffixes concatenated with the opcode name.  A program will fail to load
5bd8deadSopenharmony_ci    if it contains an instruction that
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      * specifies more than one modifier of any given type,
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      * specifies a clamping modifier on an instruction that does not produce
5bd8deadSopenharmony_ci        floating-point results, or
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      * specifies a modifier that is not supported by the instruction (see
5bd8deadSopenharmony_ci        Table X.13 and the instruction description).
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Stand-alone instruction modifiers are specified according to the
5bd8deadSopenharmony_ci    <opModifiers> grammar rule using a ".<modifier>" syntax.  Multiple
5bd8deadSopenharmony_ci    modifiers, separated by periods, may be specified.  The set of supported
5bd8deadSopenharmony_ci    modifiers is described in Table X.14.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Modifier  Description
5bd8deadSopenharmony_ci      --------  -----------------------------------------------
5bd8deadSopenharmony_ci      F         Floating-point operation
5bd8deadSopenharmony_ci      U         Fixed-point operation, unsigned operands
5bd8deadSopenharmony_ci      S         Fixed-point operation, signed operands
5bd8deadSopenharmony_ci      CC        Update condition code register zero
5bd8deadSopenharmony_ci      CC0       Update condition code register zero
5bd8deadSopenharmony_ci      CC1       Update condition code register one
5bd8deadSopenharmony_ci      SAT       Floating-point results clamped to [0,1]
5bd8deadSopenharmony_ci      SSAT      Floating-point results clamped to [-1,1]
5bd8deadSopenharmony_ci      NTC       Disable type-checking on operands/results
5bd8deadSopenharmony_ci      S24       Signed multiply (24-bit operands)
5bd8deadSopenharmony_ci      U24       Unsigned multiply (24-bit operands)
5bd8deadSopenharmony_ci      HI        Multiplies two 32-bit integer operands, returns
5bd8deadSopenharmony_ci                  the 32 MSBs of the product
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Table X.14, Instruction Modifiers.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    "F", "U", and "S" modifiers are data type modifiers and specify that the
5bd8deadSopenharmony_ci    instruction should operate on floating-point, unsigned integer, or
5bd8deadSopenharmony_ci    signed integer values, respectively.  For example, "ADD.F", "ADD.U", and
5bd8deadSopenharmony_ci    "ADD.S" specify component-wise addition of floating-point, unsigned
5bd8deadSopenharmony_ci    integer, or signed integer vectors, respectively.  These modifiers specify
5bd8deadSopenharmony_ci    a data type, but do not specify a precision at which the operation is
5bd8deadSopenharmony_ci    performed.  Floating-point operations will be carried out with an internal
5bd8deadSopenharmony_ci    precision no less than that used to represent the largest operand.
5bd8deadSopenharmony_ci    Fixed-point operations will be carried out using at least as many bits as
5bd8deadSopenharmony_ci    used to represent the largest operand.  Operands represented with fewer
5bd8deadSopenharmony_ci    bits than used to perform the instruction will be promoted to a larger
5bd8deadSopenharmony_ci    data type.  Signed integer operands will be sign-extended, where the most
5bd8deadSopenharmony_ci    significant bits are filled with ones if the operand is negative and zero
5bd8deadSopenharmony_ci    otherwise.  Unsigned integer operands will be zero-extended, where the
5bd8deadSopenharmony_ci    most significant bits are always filled with zeroes.  For some
5bd8deadSopenharmony_ci    instructions, the data type of some operands or the result is fixed; in
5bd8deadSopenharmony_ci    these cases, the data type modifier specifies the data type of the
5bd8deadSopenharmony_ci    remaining values.
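
    As an informal illustration, the promotion rules above can be sketched in
    C.  The function names and the fixed 32-bit operation width are
    assumptions made for this example only; the text above requires only that
    the operation use at least as many bits as the largest operand.

```c
#include <assert.h>
#include <stdint.h>

/* Zero-extend the low <width> bits of <bits> to 32 bits: the upper bits
 * are always filled with zeroes (unsigned integer operands). */
uint32_t zero_extend(uint32_t bits, unsigned width)
{
    assert(width >= 1 && width <= 32);
    return (width == 32) ? bits : (bits & ((UINT32_C(1) << width) - 1));
}

/* Sign-extend the low <width> bits of <bits> to 32 bits: the upper bits
 * are filled with ones if the operand is negative and zeroes otherwise
 * (signed integer operands).  The result is returned as a raw bit
 * pattern so that the fill bits are easy to inspect. */
uint32_t sign_extend(uint32_t bits, unsigned width)
{
    uint32_t sign, low;

    assert(width >= 1 && width <= 32);
    sign = UINT32_C(1) << (width - 1);   /* sign bit of the narrow type */
    low  = zero_extend(bits, width);
    return (low ^ sign) - sign;          /* unsigned math wraps mod 2^32 */
}
```

    With these helpers, the 16-bit pattern 0x8000 promotes to 0xFFFF8000 when
    treated as signed and to 0x00008000 when treated as unsigned.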
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    "CC", "CC0", and "CC1" are condition code update modifiers that specify
5bd8deadSopenharmony_ci    that one of the condition code registers should be updated based on the
5bd8deadSopenharmony_ci    result of the instruction, as described in Section 2.X.4.3.  "CC" and
5bd8deadSopenharmony_ci    "CC0" specify that the condition code register CC0 be updated; "CC1"
5bd8deadSopenharmony_ci    specifies an update to CC1.  If no condition code update modifier is
5bd8deadSopenharmony_ci    provided, the condition code registers will not be affected.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    "SAT" and "SSAT" are clamping modifiers that specify that the
5bd8deadSopenharmony_ci    floating-point components of the instruction result should be clamped to
5bd8deadSopenharmony_ci    [0,1] or [-1,1], respectively, before updating the condition code and the
5bd8deadSopenharmony_ci    destination variable.  If no clamping suffix is specified, unclamped
5bd8deadSopenharmony_ci    results will be used for condition code updates (if any) and destination
5bd8deadSopenharmony_ci    variable writes.  Clamping modifiers are not supported on instructions
5bd8deadSopenharmony_ci    that do not produce floating-point results.
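
    The following C sketch illustrates the clamping behavior described above,
    including the fact that the clamped value feeds both the condition code
    update and the destination write.  All type and function names are
    hypothetical, and NaN inputs are simply passed through, since NaN handling
    is not covered by this paragraph.

```c
#include <assert.h>

typedef enum { CLAMP_NONE, CLAMP_SAT, CLAMP_SSAT } clamp_mode;

/* Clamp r to [lo, hi].  A NaN input compares false against both bounds
 * and is returned unchanged (an assumption of this sketch). */
float clamp_range(float r, float lo, float hi)
{
    assert(lo <= hi);
    if (r < lo) return lo;
    if (r > hi) return hi;
    return r;
}

/* SAT clamps to [0,1], SSAT clamps to [-1,1], no modifier leaves r alone. */
float apply_clamp(float r, clamp_mode mode)
{
    switch (mode) {
    case CLAMP_SAT:  return clamp_range(r,  0.0f, 1.0f);
    case CLAMP_SSAT: return clamp_range(r, -1.0f, 1.0f);
    default:         return r;
    }
}

/* Values consumed by the two later stages of an instruction. */
typedef struct {
    float cc_input;     /* value used for the condition code update */
    float dest_value;   /* value written to the destination variable */
} result_outputs;

/* Clamping happens first, so both stages see the same clamped value. */
result_outputs finish_result(float raw, clamp_mode mode)
{
    result_outputs out;
    out.cc_input = out.dest_value = apply_clamp(raw, mode);
    return out;
}
```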
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    "NTC" (no type checking) disables data type checking on the instruction,
5bd8deadSopenharmony_ci    and allows instructions to use operands or result variables whose data
5bd8deadSopenharmony_ci    types are inconsistent with the expected data types of the instruction.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    "S24", "U24", and "HI" are special modifiers that are allowed only for the
5bd8deadSopenharmony_ci    MUL instruction, and are described in detail where MUL is documented.  No
5bd8deadSopenharmony_ci    more than one such modifier may be provided for any instruction.
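
    The load-time rule that an instruction may not specify more than one
    modifier of any given type can be sketched in C as follows, using the
    modifier names from Table X.14.  The grouping of "NTC" into its own type
    and of "S24", "U24", and "HI" into a single type is an assumption based on
    the descriptions above, and all identifiers are hypothetical.  The sketch
    does not check whether a modifier is supported by a particular
    instruction.

```c
#include <assert.h>
#include <string.h>

typedef enum {
    MOD_DATA_TYPE,     /* F, U, S      */
    MOD_CC_UPDATE,     /* CC, CC0, CC1 */
    MOD_CLAMP,         /* SAT, SSAT    */
    MOD_NTC,           /* NTC          */
    MOD_MUL_SPECIAL,   /* S24, U24, HI */
    MOD_TYPE_COUNT,
    MOD_UNKNOWN = -1
} mod_type;

/* Map a stand-alone modifier name to its type. */
mod_type classify_modifier(const char *name)
{
    static const struct { const char *name; mod_type type; } table[] = {
        { "F", MOD_DATA_TYPE },  { "U", MOD_DATA_TYPE },
        { "S", MOD_DATA_TYPE },  { "CC", MOD_CC_UPDATE },
        { "CC0", MOD_CC_UPDATE }, { "CC1", MOD_CC_UPDATE },
        { "SAT", MOD_CLAMP },    { "SSAT", MOD_CLAMP },
        { "NTC", MOD_NTC },      { "S24", MOD_MUL_SPECIAL },
        { "U24", MOD_MUL_SPECIAL }, { "HI", MOD_MUL_SPECIAL },
    };
    size_t i;

    for (i = 0; i < sizeof table / sizeof table[0]; i++)
        if (strcmp(name, table[i].name) == 0)
            return table[i].type;
    return MOD_UNKNOWN;
}

/* <mods> is the text following the opcode, e.g. ".F.CC1.SAT".  Returns 1
 * if every modifier is known and no modifier type appears twice. */
int modifiers_ok(const char *mods)
{
    int seen[MOD_TYPE_COUNT] = { 0 };

    assert(mods != NULL);
    while (*mods == '.') {
        char name[8];
        size_t len;
        mod_type type;

        mods++;                          /* skip the period */
        len = strcspn(mods, ".");        /* length of this modifier name */
        if (len == 0 || len >= sizeof name)
            return 0;
        memcpy(name, mods, len);
        name[len] = '\0';

        type = classify_modifier(name);
        if (type == MOD_UNKNOWN || seen[type])
            return 0;
        seen[type] = 1;
        mods += len;
    }
    return *mods == '\0';
}
```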
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If an instruction supports data type modifiers, but none is provided, a
5bd8deadSopenharmony_ci    default data type will be chosen based on the instruction, as specified in
5bd8deadSopenharmony_ci    Table X.13 and the instruction set description (Section 2.X.8).  If
5bd8deadSopenharmony_ci    condition code update or clamping modifiers are not specified, the
5bd8deadSopenharmony_ci    corresponding operation will not be performed.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Additionally, each instruction name may have one or more suffixes,
5bd8deadSopenharmony_ci    concatenated onto the base instruction name, that operate as instruction
5bd8deadSopenharmony_ci    modifiers.  For conciseness, these suffixes are not spelled out in the
5bd8deadSopenharmony_ci    grammar -- the base opcode name is used as a placeholder for the opcode
5bd8deadSopenharmony_ci    and all of its possible suffixes.  Instruction suffixes are provided
5bd8deadSopenharmony_ci    mainly for compatibility with prior GPU program instruction sets (e.g.,
5bd8deadSopenharmony_ci    NV_vertex_program3, NV_fragment_program2, and predecessors).  The set of
5bd8deadSopenharmony_ci    allowable suffixes, and their equivalent stand-alone modifiers, are listed
5bd8deadSopenharmony_ci    in Table X.15.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Suffix  Modifier     Description
5bd8deadSopenharmony_ci      ------  ----------   ---------------------------------------------------
5bd8deadSopenharmony_ci      R       F            Floating-point operation, 32-bit precision
5bd8deadSopenharmony_ci      H       F(*)         Floating-point operation, at least 16-bit precision
5bd8deadSopenharmony_ci      C       CC0          Update condition code register zero
5bd8deadSopenharmony_ci      C0      CC0          Update condition code register zero
5bd8deadSopenharmony_ci      C1      CC1          Update condition code register one
5bd8deadSopenharmony_ci      _SAT    SAT          Floating-point results clamped to [0,1]
5bd8deadSopenharmony_ci      _SSAT   SSAT         Floating-point results clamped to [-1,1]
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Table X.15,  Instruction Suffixes.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The "R" and "H" suffixes specify floating-point operations and are
5bd8deadSopenharmony_ci    equivalent to the "F" data type modifier.  They additionally specify a
5bd8deadSopenharmony_ci    minimum precision for the operations.  Instructions with an "R" precision
5bd8deadSopenharmony_ci    modifier will be carried out at no less than IEEE single-precision
5bd8deadSopenharmony_ci    floating-point (8 bits of exponent, 23 bits of mantissa).  Instructions
5bd8deadSopenharmony_ci    with an "H" precision modifier will be carried out at no less than 16-bit
5bd8deadSopenharmony_ci    floating-point precision (5 bits of exponent, 10 bits of mantissa).
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    An instruction may have multiple suffixes, but they must appear in order,
5bd8deadSopenharmony_ci    with data type suffixes first, followed by condition code update suffixes,
5bd8deadSopenharmony_ci    followed by clamping suffixes.  For example, "ADDR" carries out an add at
5bd8deadSopenharmony_ci    32-bit precision.  "ADDH_SAT" carries out an add at 16-bit precision (or
5bd8deadSopenharmony_ci    better) and clamps the results to [0,1].  "ADDRC1_SSAT" carries out an add
5bd8deadSopenharmony_ci    at 32-bit floating-point precision, clamps the results to [-1,1], and
5bd8deadSopenharmony_ci    updates condition code one based on the clamped result.
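
    The suffix ordering rule can be illustrated with a small C parser.  This
    is a sketch under stated assumptions: the caller has already separated the
    base opcode from its suffix string (opcode names such as "SHR" or "RCC"
    themselves end in suffix-like letters, so this split cannot be done
    blindly), and the structure and function names are hypothetical.  Whether
    a given suffix is legal for a particular instruction is not checked.

```c
#include <assert.h>
#include <string.h>

typedef struct {
    char precision;   /* 'R', 'H', or 0 if no data type suffix        */
    int  cc;          /* 0 or 1 for CC0/CC1, -1 if no update requested */
    int  clamp;       /* 0 = none, 1 = _SAT, 2 = _SSAT                 */
} suffix_info;

/* Parse the suffix text that follows a base opcode name, for example
 * "RC1_SSAT" from "ADDRC1_SSAT".  Suffixes must appear in the order
 * data type, condition code update, clamping.  Returns 1 on success. */
int parse_suffixes(const char *s, suffix_info *out)
{
    assert(s != NULL && out != NULL);
    out->precision = 0;
    out->cc = -1;
    out->clamp = 0;

    /* 1. optional data type suffix */
    if (*s == 'R' || *s == 'H')
        out->precision = *s++;

    /* 2. optional condition code suffix: C, C0, or C1 */
    if (*s == 'C') {
        s++;
        out->cc = 0;
        if (*s == '0') {
            s++;
        } else if (*s == '1') {
            out->cc = 1;
            s++;
        }
    }

    /* 3. optional clamping suffix, which must end the string */
    if (strcmp(s, "_SAT") == 0) {
        out->clamp = 1;
        s += 4;
    } else if (strcmp(s, "_SSAT") == 0) {
        out->clamp = 2;
        s += 5;
    }

    return *s == '\0';   /* leftover text means a bad or misordered suffix */
}
```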
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.4.2, Program Operands
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Most program instructions operate on one or more scalar or vector
5bd8deadSopenharmony_ci    operands.  Each operand specifies an operand variable, which is either the
5bd8deadSopenharmony_ci    name of a previously declared variable or an implicit variable declaration
5bd8deadSopenharmony_ci    created by using a variable binding in the instruction.  Attribute,
5bd8deadSopenharmony_ci    parameter, or parameter buffer variables can be declared implicitly by
5bd8deadSopenharmony_ci    using a valid binding name in an operand.  Instruction operands are
5bd8deadSopenharmony_ci    specified by the <instOperandV>, <instOperandS>, or <instOperandVNS>
5bd8deadSopenharmony_ci    grammar rules.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If the operand variable is not an array, its contents are loaded directly.
5bd8deadSopenharmony_ci    If the operand variable is an array, a single element of the array is
5bd8deadSopenharmony_ci    loaded according to the <arrayMem> grammar rule.  The elements of an array
5bd8deadSopenharmony_ci    are numbered from 0 to <n>-1, where <n> is the number of entries in the
5bd8deadSopenharmony_ci    array.  Array members can be accessed using either absolute or relative
5bd8deadSopenharmony_ci    addressing.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Absolute array addressing is used when the <arrayMemAbs> grammar rule is
5bd8deadSopenharmony_ci    matched; the array member to load is specified by the matching integer.
5bd8deadSopenharmony_ci    Out-of-bounds absolute array accesses are not allowed.  If the specified
5bd8deadSopenharmony_ci    member number is greater than or equal to the size of the array, the
5bd8deadSopenharmony_ci    program will fail to load.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Relative array addressing is used when the <arrayMemRel> grammar rule is
5bd8deadSopenharmony_ci    matched.  This grammar rule allows the program to specify a scalar integer
5bd8deadSopenharmony_ci    operand and an optional constant offset, according to the <arrayMemReg>
5bd8deadSopenharmony_ci    and <arrayMemOffset> grammar rules.  When performing relative addressing,
5bd8deadSopenharmony_ci    the GL evaluates the specified integer scalar operand (according to the
5bd8deadSopenharmony_ci    rules specified in this section) and adds the constant offset.  The array
5bd8deadSopenharmony_ci    member loaded is given by this sum.  The constant offset is considered
5bd8deadSopenharmony_ci    zero if an offset is omitted.  If the sum is negative or greater than or
5bd8deadSopenharmony_ci    equal to the size of the array, the results of the access are undefined,
5bd8deadSopenharmony_ci    but must not lead to program or GL termination.  The set of constant
5bd8deadSopenharmony_ci    offsets supported for relative addressing is limited to values in the
5bd8deadSopenharmony_ci    range [0,<n>-1], where <n> is the size of the array.  A program will fail
5bd8deadSopenharmony_ci    to load if it specifies an offset outside that range.  If offsets outside
5bd8deadSopenharmony_ci    that range are required, they can be applied by using an integer ADD
5bd8deadSopenharmony_ci    instruction writing to a temporary variable.
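
    The two checks involved in relative addressing, one at program load time
    and one at run time, can be sketched in C as shown below.  The function
    names are hypothetical.  An out-of-range run-time index is reported here
    by a return value of zero; an actual implementation leaves the result of
    such an access undefined.

```c
#include <assert.h>
#include <stddef.h>

/* Load-time check: a constant offset is accepted only if it lies in
 * [0, n-1], where n is the size of the array. */
int offset_is_loadable(int offset, int n)
{
    assert(n > 0);
    return offset >= 0 && offset <= n - 1;
}

/* Run-time computation: the member accessed is reg_value + offset.
 * Returns 1 and stores the member number in *index if it lies within
 * the array; returns 0 if the access would have undefined results. */
int resolve_relative(int reg_value, int offset, int n, int *index)
{
    long long sum;

    assert(n > 0 && index != NULL);
    sum = (long long)reg_value + offset;   /* widened to avoid overflow */
    if (sum < 0 || sum >= n)
        return 0;
    *index = (int)sum;
    return 1;
}
```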
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    After the operand is loaded, its components can be rearranged according to
5bd8deadSopenharmony_ci    the <swizzleSuffix> grammar rule, or it can be converted to a scalar
5bd8deadSopenharmony_ci    operand according to the <scalarSuffix> grammar rule.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The <swizzleSuffix> grammar rule rearranges the components of a loaded
5bd8deadSopenharmony_ci    vector to produce another vector.  If the <swizzleSuffix> rule matches the
5bd8deadSopenharmony_ci    <xyzwSwizzle> or <rgbaSwizzle> grammar rule, a pattern of the form ".????"
5bd8deadSopenharmony_ci    is used, where each question mark is replaced with one of "x", "y", "z",
5bd8deadSopenharmony_ci    "w", "r", "g", "b", or "a".  For such patterns, the x, y, z, and w
5bd8deadSopenharmony_ci    components of the operand are taken from the vector components named by
5bd8deadSopenharmony_ci    the first, second, third, and fourth character of the pattern,
5bd8deadSopenharmony_ci    respectively.  Swizzle components of "r", "g", "b", and "a" are equivalent
5bd8deadSopenharmony_ci    to "x", "y", "z", and "w", respectively.  For example, if the swizzle
5bd8deadSopenharmony_ci    suffix is ".yzzx" or ".gbbr" and the specified source contains {2,8,9,0},
5bd8deadSopenharmony_ci    the result is the vector {8,9,9,2}.  If the <swizzleSuffix> matches the
5bd8deadSopenharmony_ci    <component> grammar rule, a pattern of the form ".?" is used.  For this
5bd8deadSopenharmony_ci    pattern, all four components of the operand are taken from the single
5bd8deadSopenharmony_ci    component identified by the pattern.  If the swizzle suffix is omitted,
5bd8deadSopenharmony_ci    components are not rearranged and swizzling has no effect, as though
5bd8deadSopenharmony_ci    ".xyzw" were specified.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The swizzle suffix rules do not allow mixing "x", "y", "z", or "w"
5bd8deadSopenharmony_ci    selectors with "r", "g", "b", or "a" selectors.  A program will fail to
5bd8deadSopenharmony_ci    load if it contains a swizzle suffix with selectors from both of these
5bd8deadSopenharmony_ci    sets.
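
    The swizzle rules above, including the restriction on mixing selector
    sets, can be sketched in C.  The function names are hypothetical, the
    pattern is passed without its leading period, and an empty pattern stands
    for an omitted swizzle suffix.

```c
#include <assert.h>
#include <string.h>

/* Map one selector character to a component number 0..3 and report which
 * selector set it belongs to.  Returns -1 for an invalid selector. */
static int selector_index(char c, int *is_rgba)
{
    static const char xyzw[] = "xyzw";
    static const char rgba[] = "rgba";
    const char *p;

    if (c == '\0')
        return -1;
    if ((p = strchr(xyzw, c)) != NULL) { *is_rgba = 0; return (int)(p - xyzw); }
    if ((p = strchr(rgba, c)) != NULL) { *is_rgba = 1; return (int)(p - rgba); }
    return -1;
}

/* Apply a swizzle pattern to src, writing the result to dst.  Returns 1 on
 * success and 0 if the pattern would cause the program to fail to load. */
int apply_swizzle(const float src[4], const char *pattern, float dst[4])
{
    float tmp[4];
    int idx[4] = { 0, 1, 2, 3 };   /* identity, as though ".xyzw" were given */
    int set = -1;                  /* -1 = none yet, 0 = xyzw, 1 = rgba */
    size_t len, i;

    assert(src != NULL && pattern != NULL && dst != NULL);
    len = strlen(pattern);
    if (len != 0 && len != 1 && len != 4)
        return 0;

    for (i = 0; i < len; i++) {
        int is_rgba = 0;
        idx[i] = selector_index(pattern[i], &is_rgba);
        if (idx[i] < 0 || (set >= 0 && set != is_rgba))
            return 0;              /* bad selector or mixed selector sets */
        set = is_rgba;
    }
    if (len == 1)                  /* ".?" replicates a single component */
        idx[1] = idx[2] = idx[3] = idx[0];

    for (i = 0; i < 4; i++) tmp[i] = src[idx[i]];
    for (i = 0; i < 4; i++) dst[i] = tmp[i];   /* safe even if dst == src */
    return 1;
}
```

    Applied to the source {2,8,9,0}, both "yzzx" and "gbbr" produce
    {8,9,9,2}, while a mixed pattern such as "xgzw" is rejected.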
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The <scalarSuffix> grammar rule converts a vector to a scalar by selecting
5bd8deadSopenharmony_ci    a single component.  The <scalarSuffix> rule is similar to the swizzle
5bd8deadSopenharmony_ci    selector, except that only a single component is selected.  If the scalar
5bd8deadSopenharmony_ci    suffix is ".y" and the specified source contains {2,8,9,0}, the value is
5bd8deadSopenharmony_ci    the scalar value 8.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Next, a component-wise negate operation is performed on the operand if the
5bd8deadSopenharmony_ci    <operandNeg> grammar rule matches "-".  Negation is not performed if the
5bd8deadSopenharmony_ci    operand has no sign prefix, or is prefixed with "+".  For unsigned integer
5bd8deadSopenharmony_ci    operands, negation performs a two's complement operation.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Next, a component-wise absolute value operation is performed on the
5bd8deadSopenharmony_ci    operand if the <instOperandAbsV> or <instOperandAbsS> grammar rule is
5bd8deadSopenharmony_ci    matched, by surrounding the operand with two "|" characters.  The result
5bd8deadSopenharmony_ci    is optionally negated if the <operandAbsNeg> grammar rule matches "-".
5bd8deadSopenharmony_ci    For unsigned integer operands, the absolute value operation has no effect.
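
    The order of the negate and absolute value steps can be sketched in C for
    a single component.  The function and parameter names are hypothetical:
    <neg> models <operandNeg>, <use_abs> models the "|" delimiters, and
    <abs_neg> models <operandAbsNeg>.  Treating the outer negate of an
    unsigned operand as a two's complement operation as well is an assumption
    of this sketch.

```c
#include <assert.h>
#include <stdint.h>

/* Floating-point component: negate, then absolute value, then the
 * optional negate applied to the absolute value. */
float modify_float(float v, int neg, int use_abs, int abs_neg)
{
    assert(use_abs || !abs_neg);   /* outer negate needs an |operand| */
    if (neg)     v = -v;
    if (use_abs) v = (v < 0.0f) ? -v : v;
    if (abs_neg) v = -v;
    return v;
}

/* Unsigned integer component: negation is a two's complement operation
 * (wrapping modulo 2^32) and absolute value has no effect. */
uint32_t modify_unsigned(uint32_t v, int neg, int use_abs, int abs_neg)
{
    assert(use_abs || !abs_neg);
    (void)use_abs;                 /* no effect on unsigned operands */
    if (neg)     v = UINT32_C(0) - v;
    if (abs_neg) v = UINT32_C(0) - v;
    return v;
}
```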
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.4.3, Program Destination Variable Update
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Most program instructions perform computations that produce a result,
5bd8deadSopenharmony_ci    which will be written to a variable.  Each instruction that computes a
5bd8deadSopenharmony_ci    result specifies a destination variable, which is either the name of a
5bd8deadSopenharmony_ci    previously declared variable or an implicit variable declaration created
5bd8deadSopenharmony_ci    by using a variable binding in the instruction.  Result variables can be
5bd8deadSopenharmony_ci    declared implicitly by using a valid program result binding name in the
5bd8deadSopenharmony_ci    result portion of the instruction.  Instruction results are specified
5bd8deadSopenharmony_ci    according to the <instResult> grammar rule.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The destination variable may be a single member of an array.  In this
5bd8deadSopenharmony_ci    case, a single array member is specified using the <arrayMem> grammar
5bd8deadSopenharmony_ci    rule, and the array member to update is computed in the exact same manner
5bd8deadSopenharmony_ci    as done for operand loads.  If the array member is computed at run time,
5bd8deadSopenharmony_ci    and is negative or greater than or equal to the size of the array, the
5bd8deadSopenharmony_ci    results of the destination variable update are undefined and could result
5bd8deadSopenharmony_ci    in overwriting other program variables.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The results of the operation may be obtained at a different precision than
5bd8deadSopenharmony_ci    that used to store the destination variable.  If so, the results are
5bd8deadSopenharmony_ci    converted to match the size of the destination variable.  For
5bd8deadSopenharmony_ci    floating-point values, the results are rounded to the nearest
5bd8deadSopenharmony_ci    floating-point value that can be represented in the destination variable.
5bd8deadSopenharmony_ci    If a result component is larger in magnitude than the largest
5bd8deadSopenharmony_ci    representable floating-point value in the data type of the destination
5bd8deadSopenharmony_ci    variable, an infinity encoding (+/-INF) is used.  Signed or unsigned
5bd8deadSopenharmony_ci    integer values are sign-extended or zero-extended, respectively, if the
5bd8deadSopenharmony_ci    destination variable has more bits than the result, and have their most
5bd8deadSopenharmony_ci    significant bits discarded if the destination variable has fewer bits.
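
    The integer conversion rule above can be sketched as follows.  This is
    an illustrative model only, not part of the extension; the function
    name and the explicit bit-width parameters are assumptions made for the
    example.

```python
def convert_int(value: int, src_bits: int, dst_bits: int, signed: bool) -> int:
    """Convert a src_bits-wide integer bit pattern to a dst_bits-wide one."""
    value &= (1 << src_bits) - 1              # raw source bit pattern
    if dst_bits > src_bits and signed and value >> (src_bits - 1):
        # Sign extension: replicate the set sign bit into the new high bits.
        value |= ((1 << dst_bits) - 1) & ~((1 << src_bits) - 1)
    # Zero extension needs no extra work; narrowing discards the high bits.
    return value & ((1 << dst_bits) - 1)
```

    For instance, under these assumptions the 8-bit pattern 0xFF widens to
    0xFFFF when treated as signed and to 0x00FF when treated as unsigned.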
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Writes to individual components of a vector destination variable can be
5bd8deadSopenharmony_ci    controlled at compile time by individual component write masks specified
5bd8deadSopenharmony_ci    in the instruction.  The component write mask is specified by the
5bd8deadSopenharmony_ci    <optWriteMask> grammar rule, and is a string of up to four characters,
5bd8deadSopenharmony_ci    naming the components to enable for writing.  If no write mask is
5bd8deadSopenharmony_ci    specified, all components are enabled for writing.  The characters "x",
5bd8deadSopenharmony_ci    "y", "z", and "w" match the x, y, z, and w components respectively.  For
5bd8deadSopenharmony_ci    example, a write mask of ".xzw" indicates that the x, z, and w
5bd8deadSopenharmony_ci    components should be enabled for writing but the y component should not be
5bd8deadSopenharmony_ci    written.  The grammar requires that the destination register mask
5bd8deadSopenharmony_ci    components must be listed in "xyzw" order.  Additionally, write mask
5bd8deadSopenharmony_ci    components of "r", "g", "b", and "a" are equivalent to "x", "y", "z", and
5bd8deadSopenharmony_ci    "w", respectively.  The grammar does not allow mixing "x", "y", "z", or
5bd8deadSopenharmony_ci    "w" components with "r", "g", "b", and "a" ones.
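
    The write mask rules above can be sketched as a small parser.  This is
    a hypothetical helper for illustration (the name parse_write_mask and
    the use of ValueError are assumptions); a real implementation enforces
    these rules through the <optWriteMask> grammar instead.

```python
def parse_write_mask(mask: str) -> tuple:
    """Return four booleans (x, y, z, w) for a mask such as ".xzw"."""
    if mask == "":
        return (True, True, True, True)      # no mask: all components enabled
    body = mask[1:] if mask.startswith(".") else mask
    for names in ("xyzw", "rgba"):           # the two sets may not be mixed
        if body and all(c in names for c in body):
            order = [names.index(c) for c in body]
            # Components must be unique and listed in "xyzw"/"rgba" order.
            if order != sorted(set(order)):
                raise ValueError("components repeated or out of order: " + mask)
            return tuple(i in order for i in range(4))
    raise ValueError("invalid or mixed write mask: " + mask)
```

    With this sketch, ".xzw" enables x, z, and w but not y, and ".rb" is
    treated the same as ".xz"; ".xg" and ".zx" are rejected.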
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Writes to individual components of a vector destination variable, or to a
5bd8deadSopenharmony_ci    scalar destination variable, can also be controlled at run time using
5bd8deadSopenharmony_ci    condition code write masks.  The condition code write mask is specified by
5bd8deadSopenharmony_ci    the <ccMask> grammar rule.  If a mask is specified, a condition code
5bd8deadSopenharmony_ci    variable is loaded according to the <ccMaskRule> grammar rule and tested
5bd8deadSopenharmony_ci    as described in Table X.16 to produce a four-component vector of TRUE/FALSE
5bd8deadSopenharmony_ci    values.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci         mask rule         test name                condition
5bd8deadSopenharmony_ci         ---------------   ----------------------   -----------------
5bd8deadSopenharmony_ci         EQ,  EQ0,  EQ1    equal                    !SF && ZF
5bd8deadSopenharmony_ci         GE,  GE0,  GE1    greater than or equal    !(SF ^ OF)
5bd8deadSopenharmony_ci         GT,  GT0,  GT1    greater than             (!SF ^ OF) && !ZF
5bd8deadSopenharmony_ci         LE,  LE0,  LE1    less than or equal       SF ^ (ZF || OF)
5bd8deadSopenharmony_ci         LT,  LT0,  LT1    less than                (SF && !ZF) ^ OF
5bd8deadSopenharmony_ci         NE,  NE0,  NE1    not equal                SF || !ZF
5bd8deadSopenharmony_ci         FL,  FL0,  FL1    false                    always false
5bd8deadSopenharmony_ci         TR,  TR0,  TR1    true                     always true
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci         NAN, NAN0, NAN1   not a number             SF && ZF
5bd8deadSopenharmony_ci         LEG, LEG0, LEG1   less, equal, or greater  !SF || !ZF
5bd8deadSopenharmony_ci                             (anything but a NaN)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci         CF,  CF0,  CF1    carry flag               CF
5bd8deadSopenharmony_ci         NCF, NCF0, NCF1   no carry flag            !CF
5bd8deadSopenharmony_ci         OF,  OF0,  OF1    overflow flag            OF
5bd8deadSopenharmony_ci         NOF, NOF0, NOF1   no overflow flag         !OF
5bd8deadSopenharmony_ci         SF,  SF0,  SF1    sign flag                SF
5bd8deadSopenharmony_ci         NSF, NSF0, NSF1   no sign flag             !SF
5bd8deadSopenharmony_ci         AB,  AB0,  AB1    above                    CF && !ZF
5bd8deadSopenharmony_ci         BLE, BLE0, BLE1   below or equal           !CF || ZF
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Table X.16, Condition Code Tests.  The allowed rules are specified in
5bd8deadSopenharmony_ci      the "mask rule" column.  If "0" or "1" is appended to the rule name
5bd8deadSopenharmony_ci      (e.g., "EQ1"), the corresponding condition code register (CC1 in this
5bd8deadSopenharmony_ci      example) is loaded, otherwise CC0 is loaded.  After loading, each
5bd8deadSopenharmony_ci      component is tested, using the expression listed in the "condition"
5bd8deadSopenharmony_ci      column.
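
    The tests in Table X.16 can be modeled directly.  The sketch below is
    illustrative only; the Flags tuple, the CC_TESTS dictionary, and the
    cc_test function are names invented for this example, and each
    condition code register is represented as a sequence of four Flags
    values.

```python
from collections import namedtuple

# One condition code component: sign, zero, overflow, and carry flags.
Flags = namedtuple("Flags", "SF ZF OF CF")

# Conditions transcribed from the "condition" column of Table X.16.
CC_TESTS = {
    "EQ":  lambda f: (not f.SF) and f.ZF,
    "GE":  lambda f: not (f.SF ^ f.OF),
    "GT":  lambda f: ((not f.SF) ^ f.OF) and not f.ZF,
    "LE":  lambda f: f.SF ^ (f.ZF or f.OF),
    "LT":  lambda f: (f.SF and not f.ZF) ^ f.OF,
    "NE":  lambda f: f.SF or not f.ZF,
    "FL":  lambda f: False,
    "TR":  lambda f: True,
    "NAN": lambda f: f.SF and f.ZF,
    "LEG": lambda f: (not f.SF) or (not f.ZF),
    "CF":  lambda f: f.CF,
    "NCF": lambda f: not f.CF,
    "OF":  lambda f: f.OF,
    "NOF": lambda f: not f.OF,
    "SF":  lambda f: f.SF,
    "NSF": lambda f: not f.SF,
    "AB":  lambda f: f.CF and not f.ZF,
    "BLE": lambda f: (not f.CF) or f.ZF,
}

def cc_test(rule, cc0, cc1):
    """Evaluate a mask rule such as "GT" or "EQ1" on each CC component."""
    reg = cc1 if rule.endswith("1") else cc0          # default is CC0
    name = rule[:-1] if rule[-1] in "01" else rule    # strip register suffix
    return tuple(bool(CC_TESTS[name](f)) for f in reg)
```

    Note that a component with both SF and ZF set (the NaN encoding) fails
    every ordered comparison (EQ, GE, GT, LE, LT) in this model but passes
    NE and NAN.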
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    After the condition code tests are performed, the four-component result
5bd8deadSopenharmony_ci    can be swizzled according to the <swizzleSuffix> grammar rule.  Individual
5bd8deadSopenharmony_ci    components of the destination variable are written only if the
5bd8deadSopenharmony_ci    corresponding component of the swizzled condition code test result is
5bd8deadSopenharmony_ci    TRUE.  If both a (compile-time) component write mask and a condition code
5bd8deadSopenharmony_ci    write mask are specified, destination variable components are written only
5bd8deadSopenharmony_ci    if the corresponding component is enabled in both masks.
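
    The interaction of the two masks can be sketched as follows.  This is
    an illustrative model with invented names (masked_write, SWIZZLE_INDEX);
    it additionally assumes that a single-component swizzle such as "x" is
    replicated to all four components, which is not stated in this section.

```python
SWIZZLE_INDEX = {"x": 0, "y": 1, "z": 2, "w": 3}

def masked_write(dest, result, write_mask, cc_tests=None, swizzle="xyzw"):
    """Return dest with result written where both masks allow it.

    write_mask -- four booleans from the compile-time component write mask
    cc_tests   -- four booleans from the condition code test, or None if
                  the instruction has no condition code write mask
    """
    if len(swizzle) == 1:
        swizzle = swizzle * 4            # assumed replication of ".x" etc.
    out = list(dest)
    for i in range(4):
        cc_ok = True if cc_tests is None else cc_tests[SWIZZLE_INDEX[swizzle[i]]]
        if write_mask[i] and cc_ok:
            out[i] = result[i]
    return tuple(out)
```

    A component is therefore left untouched if either mask disables it.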
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    A program instruction can also optionally update one of the two condition
5bd8deadSopenharmony_ci    code registers if one of the "CC", "CC0", or "CC1" instruction modifiers
5bd8deadSopenharmony_ci    is specified.  These instruction modifiers update condition code register
5bd8deadSopenharmony_ci    CC0, CC0, or CC1, respectively.  The instructions "ADD.CC" or "ADD.CC0"
5bd8deadSopenharmony_ci    will perform an add and update condition code zero, "ADD.CC1" will add and
5bd8deadSopenharmony_ci    update condition code one, and "ADD" will simply perform the add without a
5bd8deadSopenharmony_ci    condition code update.  A component of the selected condition code
5bd8deadSopenharmony_ci    register is updated if and only if the corresponding component of the
5bd8deadSopenharmony_ci    destination variable is enabled by both write masks.  For the purposes of
5bd8deadSopenharmony_ci    condition code update, a scalar destination variable is treated as a
5bd8deadSopenharmony_ci    vector where the scalar result is written to "x" (if enabled in the write
5bd8deadSopenharmony_ci    mask), and writes to the "y", "z", and "w" components are disabled.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    When condition code components are written, the condition code flags are
5bd8deadSopenharmony_ci    updated based on the corresponding component of the result.  If a
5bd8deadSopenharmony_ci    component of the destination register is not enabled for writes, the
5bd8deadSopenharmony_ci    corresponding condition code component is also unchanged.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    For floating-point results, the sign flag (SF) is set if the result is
5bd8deadSopenharmony_ci    less than zero or is a NaN (not a number) value.  The zero flag (ZF) is
5bd8deadSopenharmony_ci    set if the result is equal to zero or is a NaN.
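
    A minimal sketch of the floating-point flag rule, using an invented
    helper name (float_flags) and covering only the two flags described in
    the paragraph above:

```python
import math

def float_flags(result: float):
    """Return (SF, ZF) for one floating-point result component."""
    is_nan = math.isnan(result)
    sf = is_nan or result < 0.0      # negative, or NaN
    zf = is_nan or result == 0.0     # zero (of either sign), or NaN
    return sf, zf
```

    A NaN result sets both flags, which is the combination the "NAN" test
    in Table X.16 looks for.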
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    For signed and unsigned integer results, the sign flag (SF) is set if the
5bd8deadSopenharmony_ci    most significant bit of the value written to the result variable is set
5bd8deadSopenharmony_ci    and the zero flag (ZF) is set if the result written is zero.  For
5bd8deadSopenharmony_ci    instructions other than those performing an integer add or subtract (ADD,
5bd8deadSopenharmony_ci    MAD, SAD, SUB), the overflow and carry flags (OF and CF) are cleared.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    For integer add or subtract operations, the overflow and carry flags are
5bd8deadSopenharmony_ci    computed by doing both signed and unsigned adds/subtracts as follows:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      The overflow flag (OF) is set by interpreting the two operands as signed
5bd8deadSopenharmony_ci      integers and performing a signed add or subtract.  If the result is
5bd8deadSopenharmony_ci      representable as a signed integer (i.e., doesn't overflow), the overflow
5bd8deadSopenharmony_ci      flag is cleared; otherwise, it is set.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      The carry flag (CF) is set by interpreting the two operands as unsigned
5bd8deadSopenharmony_ci      integers and performing an unsigned add or subtract.  If the result of
5bd8deadSopenharmony_ci      an add is representable as an unsigned integer (i.e., doesn't overflow),
5bd8deadSopenharmony_ci      the carry flag is cleared; otherwise, it is set.  If the result of a
5bd8deadSopenharmony_ci      subtract is greater than or equal to zero, the carry flag is set;
5bd8deadSopenharmony_ci      otherwise, it is cleared.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    For the purposes of condition code setting, negation modifiers turn add
5bd8deadSopenharmony_ci    operations into subtracts and vice versa.  If the operation is equivalent
5bd8deadSopenharmony_ci    to an add with both operands negated (-A-B), the carry and overflow flags
5bd8deadSopenharmony_ci    are both undefined.
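
    The integer flag rules above can be modeled as follows.  This is a
    sketch under stated assumptions: the function name, the default 32-bit
    width, and the tuple return value are choices made for the example, and
    the undefined (-A-B) case is not modeled.

```python
def int_add_sub_flags(a: int, b: int, subtract: bool = False, bits: int = 32):
    """Return (result, SF, ZF, OF, CF) for an integer add or subtract."""
    mask = (1 << bits) - 1
    ua, ub = a & mask, b & mask                    # operands as unsigned

    def to_signed(u):
        return u - (1 << bits) if u >> (bits - 1) else u

    sa, sb = to_signed(ua), to_signed(ub)          # operands as signed

    exact_signed = sa - sb if subtract else sa + sb
    exact_unsigned = ua - ub if subtract else ua + ub
    result = exact_unsigned & mask                 # bits actually written

    # OF: set when the signed result is not representable.
    of = not (-(1 << (bits - 1)) <= exact_signed <= (1 << (bits - 1)) - 1)
    if subtract:
        cf = exact_unsigned >= 0                   # set when result >= 0
    else:
        cf = exact_unsigned > mask                 # set when the add overflows
    sf = bool(result >> (bits - 1))                # most significant bit
    zf = result == 0
    return result, sf, zf, of, cf
```

    For example, adding 1 to 0x7FFFFFFF sets OF but not CF in this model,
    while adding 1 to 0xFFFFFFFF sets CF and ZF but not OF.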
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.4.4, Program Texture Access
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Certain program instructions may access texture images, as described in
5bd8deadSopenharmony_ci    section 3.8.  The coordinates, level-of-detail, and partial derivatives
5bd8deadSopenharmony_ci    used for performing the texture lookup are derived from values provided in
5bd8deadSopenharmony_ci    the program as described in the various sub-sections of Section 2.X.8.
5bd8deadSopenharmony_ci    These descriptions use the function
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      result_t_vec
5bd8deadSopenharmony_ci        TextureSample(float_vec coord, float lod, float_vec ddx,
5bd8deadSopenharmony_ci                      float_vec ddy, int_vec offset);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    which obtains a filtered texel value <tau> as described in Section 3.8.8
5bd8deadSopenharmony_ci    and returns a 4-component vector (R,G,B,A) according to the format
5bd8deadSopenharmony_ci    conversions specified in Table 3.21.  The result vector is interpreted as
5bd8deadSopenharmony_ci    floating-point, signed integer, or unsigned integer, according to the data
5bd8deadSopenharmony_ci    type modifier of the instruction.  If the internal format of the texture
5bd8deadSopenharmony_ci    does not match the instruction's data type modifier, the results of the
5bd8deadSopenharmony_ci    texture lookup are undefined.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (Note:  For unextended OpenGL 2.0, all supported texture internal formats
5bd8deadSopenharmony_ci    store integer values but return floating-point results in the range [0,1]
5bd8deadSopenharmony_ci    on a texture lookup.  The ARB_texture_float extension introduces
5bd8deadSopenharmony_ci    floating-point internal formats where components are both stored and
5bd8deadSopenharmony_ci    returned as floating-point values.  The EXT_texture_integer extension
5bd8deadSopenharmony_ci    introduces formats that both store and return either signed or unsigned
5bd8deadSopenharmony_ci    integer values.)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <coord> is a four-component floating-point vector from which the (s,t,r)
5bd8deadSopenharmony_ci    texture coordinates used for the texture access, the layer used for array
5bd8deadSopenharmony_ci    textures, and the reference value used for depth comparisons (section
5bd8deadSopenharmony_ci    3.8.14) are extracted according to Table X.17.  If the texture is a cube
5bd8deadSopenharmony_ci    map, (s,t,r) is projected to one of the six cube faces to produce a new
5bd8deadSopenharmony_ci    (s,t) vector according to Section 3.8.6.  For array textures, the layer
5bd8deadSopenharmony_ci    used is derived by rounding the extracted floating-point component to the
5bd8deadSopenharmony_ci    nearest integer and clamping the result to the range [0,<n>-1], where <n>
5bd8deadSopenharmony_ci    is the number of layers in the texture.
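
    The layer selection for array textures can be sketched as below.  The
    helper name is invented, the input is assumed to be finite, and the
    handling of exact ties (x.5) is an assumption, since the text above
    only says "nearest integer".

```python
import math

def select_layer(coord_component: float, num_layers: int) -> int:
    """Round to the nearest integer, then clamp to [0, num_layers - 1]."""
    nearest = math.floor(coord_component + 0.5)   # ties round up (assumed)
    return max(0, min(num_layers - 1, nearest))
```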
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <lod> specifies the level of detail parameter and replaces the value
5bd8deadSopenharmony_ci    computed in equation 3.18.  <ddx> and <ddy> specify partial derivatives
5bd8deadSopenharmony_ci    (ds/dx, dt/dx, dr/dx, ds/dy, dt/dy, and dr/dy) for the texture
5bd8deadSopenharmony_ci    coordinates, and may be used to derive footprint shapes for anisotropic
5bd8deadSopenharmony_ci    texture filtering.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <offset> is a constant 3-component signed integer vector specified
5bd8deadSopenharmony_ci    according to the <texOffset> grammar rule, which is added to the computed
5bd8deadSopenharmony_ci    <u>, <v>, and <w> texel locations prior to sampling.  One, two, or three
5bd8deadSopenharmony_ci    components may be specified in the instruction; if fewer than three are
5bd8deadSopenharmony_ci    specified, the remaining offset components are zero.  A limited range of
5bd8deadSopenharmony_ci    offset values is supported; the minimum and maximum <texOffset> values
5bd8deadSopenharmony_ci    are implementation-dependent and given by MIN_PROGRAM_TEXEL_OFFSET_EXT and
5bd8deadSopenharmony_ci    MAX_PROGRAM_TEXEL_OFFSET_EXT, respectively.  A program will fail to load:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      * if the texture target specified in the instruction is 1D, ARRAY1D,
5bd8deadSopenharmony_ci        SHADOW1D, or SHADOWARRAY1D, and the second or third component of the
5bd8deadSopenharmony_ci        offset vector is non-zero,
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      * if the texture target specified in the instruction is 2D, RECT,
5bd8deadSopenharmony_ci        ARRAY2D, SHADOW2D, SHADOWRECT, or SHADOWARRAY2D, and the third
5bd8deadSopenharmony_ci        component of the offset vector is non-zero,
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      * if the texture target is CUBE or SHADOWCUBE, and any component of the
5bd8deadSopenharmony_ci        offset vector is non-zero -- texel offsets are not supported for cube
5bd8deadSopenharmony_ci        map or buffer textures, or
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      * if any component of the offset vector is less than
5bd8deadSopenharmony_ci        MIN_PROGRAM_TEXEL_OFFSET_EXT or greater than
5bd8deadSopenharmony_ci        MAX_PROGRAM_TEXEL_OFFSET_EXT.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (NOTE:  Texel offsets are a new feature provided by this extension and are
5bd8deadSopenharmony_ci    described in more detail in edits to Section 3.8 below.)
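
    The load-time offset checks listed above can be sketched as a single
    predicate.  The function and set names are invented for illustration,
    and min_offset and max_offset stand in for the implementation-dependent
    MIN_PROGRAM_TEXEL_OFFSET_EXT and MAX_PROGRAM_TEXEL_OFFSET_EXT values.

```python
ONE_D = {"1D", "ARRAY1D", "SHADOW1D", "SHADOWARRAY1D"}
TWO_D = {"2D", "RECT", "ARRAY2D", "SHADOW2D", "SHADOWRECT", "SHADOWARRAY2D"}
NO_OFFSET = {"CUBE", "SHADOWCUBE", "BUFFER"}   # offsets not supported at all

def offset_is_loadable(target, offset, min_offset, max_offset):
    """Return True if <texOffset> passes the load-time checks for target."""
    # One, two, or three components may be given; the rest default to zero.
    ox, oy, oz = (tuple(offset) + (0, 0, 0))[:3]
    if target in ONE_D and (oy != 0 or oz != 0):
        return False
    if target in TWO_D and oz != 0:
        return False
    if target in NO_OFFSET and (ox, oy, oz) != (0, 0, 0):
        return False
    return all(min_offset <= c <= max_offset for c in (ox, oy, oz))
```

    The limits passed in by a caller would be queried from the GL; any
    specific numbers used with this sketch are placeholders.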
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The texture used by TextureSample() is one of the textures bound to the
5bd8deadSopenharmony_ci    texture image unit whose number is specified in the instruction according
5bd8deadSopenharmony_ci    to the <texImageUnit> grammar rule.  The texture target accessed is
5bd8deadSopenharmony_ci    specified according to the <texTarget> grammar rule and Table X.17.
5bd8deadSopenharmony_ci    Fixed-function texture enables are always ignored when determining the
5bd8deadSopenharmony_ci    texture to access in a program.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci                                                     coordinates used
5bd8deadSopenharmony_ci      texTarget          Texture Type               s t r  layer  shadow
5bd8deadSopenharmony_ci      ----------------   ---------------------      -----  -----  ------
5bd8deadSopenharmony_ci      1D                 TEXTURE_1D                 x - -    -      -
5bd8deadSopenharmony_ci      2D                 TEXTURE_2D                 x y -    -      -
5bd8deadSopenharmony_ci      3D                 TEXTURE_3D                 x y z    -      -
5bd8deadSopenharmony_ci      CUBE               TEXTURE_CUBE_MAP           x y z    -      -
5bd8deadSopenharmony_ci      RECT               TEXTURE_RECTANGLE_ARB      x y -    -      -
5bd8deadSopenharmony_ci      ARRAY1D            TEXTURE_1D_ARRAY_EXT       x - -    y      -
5bd8deadSopenharmony_ci      ARRAY2D            TEXTURE_2D_ARRAY_EXT       x y -    z      -
5bd8deadSopenharmony_ci      SHADOW1D           TEXTURE_1D                 x - -    -      z
5bd8deadSopenharmony_ci      SHADOW2D           TEXTURE_2D                 x y -    -      z
5bd8deadSopenharmony_ci      SHADOWRECT         TEXTURE_RECTANGLE_ARB      x y -    -      z
5bd8deadSopenharmony_ci      SHADOWCUBE         TEXTURE_CUBE_MAP           x y z    -      w
5bd8deadSopenharmony_ci      SHADOWARRAY1D      TEXTURE_1D_ARRAY_EXT       x - -    y      z
5bd8deadSopenharmony_ci      SHADOWARRAY2D      TEXTURE_2D_ARRAY_EXT       x y -    z      w
5bd8deadSopenharmony_ci      BUFFER             TEXTURE_BUFFER_EXT           <not supported>
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Table X.17:  Texture types accessed for each <texTarget> value, and
5bd8deadSopenharmony_ci      coordinate mappings.  The "SHADOW" and "ARRAY" targets are special
5bd8deadSopenharmony_ci      pseudo-targets described below.  The "coordinates used" columns indicate
5bd8deadSopenharmony_ci      the input values used for each coordinate of the texture lookup, the
5bd8deadSopenharmony_ci      layer selector for array textures, and the reference value for texture
5bd8deadSopenharmony_ci      comparisons.  Buffer textures are not supported by normal texture lookup
5bd8deadSopenharmony_ci      functions, but are supported by TXF and TXQ, described below.
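
    The coordinate mappings of Table X.17 can be expressed as a lookup
    table.  The dictionary and function below are illustrative names only;
    the BUFFER target is omitted because it is not supported by
    TextureSample().

```python
# texTarget -> (components used for s/t/r, layer component, shadow component)
TABLE_X17 = {
    "1D":            ("x",   None, None),
    "2D":            ("xy",  None, None),
    "3D":            ("xyz", None, None),
    "CUBE":          ("xyz", None, None),
    "RECT":          ("xy",  None, None),
    "ARRAY1D":       ("x",   "y",  None),
    "ARRAY2D":       ("xy",  "z",  None),
    "SHADOW1D":      ("x",   None, "z"),
    "SHADOW2D":      ("xy",  None, "z"),
    "SHADOWRECT":    ("xy",  None, "z"),
    "SHADOWCUBE":    ("xyz", None, "w"),
    "SHADOWARRAY1D": ("x",   "y",  "z"),
    "SHADOWARRAY2D": ("xy",  "z",  "w"),
}

def extract_coords(target, coord):
    """Split a four-component <coord> into (s,t,r), layer, and reference."""
    comp = dict(zip("xyzw", coord))
    str_names, layer, ref = TABLE_X17[target]
    return {
        "str": tuple(comp[c] for c in str_names),
        "layer": comp[layer] if layer else None,
        "ref": comp[ref] if ref else None,
    }
```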
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Texture targets with "SHADOW" are used to access textures with a
5bd8deadSopenharmony_ci    DEPTH_COMPONENT base internal format using depth comparisons (Section
5bd8deadSopenharmony_ci    3.8.14).  Results of a texture access are undefined:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      * if a "SHADOW" target is used, and the corresponding texture has a base
5bd8deadSopenharmony_ci        internal format other than DEPTH_COMPONENT or a TEXTURE_COMPARE_MODE
5bd8deadSopenharmony_ci        of NONE, or
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      * if a non-"SHADOW" target is used, and the corresponding texture has a
5bd8deadSopenharmony_ci        base internal format of DEPTH_COMPONENT and a TEXTURE_COMPARE_MODE
5bd8deadSopenharmony_ci        other than NONE.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If the texture being accessed is not complete (or cube complete for
5bd8deadSopenharmony_ci    cubemap textures), no texture access is performed and the result is
5bd8deadSopenharmony_ci    undefined.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    A program will fail to load if it attempts to sample from multiple texture
5bd8deadSopenharmony_ci    targets (including the SHADOW pseudo-targets) on the same texture image
5bd8deadSopenharmony_ci    unit.  For example, a program containing any two of the following
5bd8deadSopenharmony_ci    instructions will fail to load:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      TEX out, coord, texture[0], 1D;
5bd8deadSopenharmony_ci      TEX out, coord, texture[0], 2D;
5bd8deadSopenharmony_ci      TEX out, coord, texture[0], ARRAY2D;
5bd8deadSopenharmony_ci      TEX out, coord, texture[0], SHADOW2D;
5bd8deadSopenharmony_ci      TEX out, coord, texture[0], 3D;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Additionally, multiple texture targets for a single texture image unit may
5bd8deadSopenharmony_ci    not be used at the same time by the GL.  The error INVALID_OPERATION is
5bd8deadSopenharmony_ci    generated by Begin, RasterPos, or any command that performs an implicit
5bd8deadSopenharmony_ci    Begin if an enabled program accesses one texture target for a texture unit
5bd8deadSopenharmony_ci    while another enabled program or fixed-function fragment processing
5bd8deadSopenharmony_ci    accesses a different texture target for the same texture image unit.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Some texture instructions use standard methods to compute partial
5bd8deadSopenharmony_ci    derivatives and/or the level-of-detail used to perform texture accesses.
5bd8deadSopenharmony_ci    For fragment programs, the functions
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      float_vec ComputePartialsX(float_vec coord);
5bd8deadSopenharmony_ci      float_vec ComputePartialsY(float_vec coord);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    compute approximate component-wise partial derivatives of the
5bd8deadSopenharmony_ci    floating-point vector <coord> relative to the X and Y coordinates,
5bd8deadSopenharmony_ci    respectively.  For vertex and geometry programs, these functions always
5bd8deadSopenharmony_ci    return (0,0,0,0).  The function
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      float ComputeLOD(float_vec ddx, float_vec ddy);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    maps partial derivative vectors <ddx> and <ddy> to ds/dx, dt/dx, dr/dx,
5bd8deadSopenharmony_ci    ds/dy, dt/dy, and dr/dy and computes lambda_base(x,y) according to
5bd8deadSopenharmony_ci    equation 3.18.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The TXF instruction provides the ability to extract a single texel from a
5bd8deadSopenharmony_ci    specified texture image using the function
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      result_t_vec TexelFetch(int_vec coord, int_vec offset);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The extracted texel is converted to an (R,G,B,A) vector according to Table
5bd8deadSopenharmony_ci    3.21.  The result vector is interpreted as floating-point, signed integer,
5bd8deadSopenharmony_ci    or unsigned integer, according to the data type modifier of the
5bd8deadSopenharmony_ci    instruction.  If the internal format of the texture is not compatible with
5bd8deadSopenharmony_ci    the instruction's data type modifier, the extracted texel value is
5bd8deadSopenharmony_ci    undefined.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    <coord> is a four-component signed integer vector used to identify the
5bd8deadSopenharmony_ci    single texel accessed.  The (i,j,k) coordinates of the texel and the layer
5bd8deadSopenharmony_ci    used for array textures are extracted according to Table X.18.  The level
5bd8deadSopenharmony_ci    of detail accessed is obtained by adding the w component of <coord> to the
5bd8deadSopenharmony_ci    base level (level_base).  <offset> is a constant 3-component signed
5bd8deadSopenharmony_ci    integer vector added to the texel coordinates prior to the texel fetch as
5bd8deadSopenharmony_ci    described above.  In addition to the restrictions described above,
5bd8deadSopenharmony_ci    non-zero offset components are also not supported for BUFFER targets.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The texture used by TexelFetch() is specified by the image unit and target
5bd8deadSopenharmony_ci    parameters provided in the instruction, as for TextureSample() above.
5bd8deadSopenharmony_ci    Single texel fetches cannot perform depth comparisons or access cubemaps.
5bd8deadSopenharmony_ci    If a program contains a TXF instruction specifying one of the "SHADOW" or
5bd8deadSopenharmony_ci    "CUBE" targets, it will fail to load.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci                                      coordinates used
5bd8deadSopenharmony_ci      texTarget          supported      i j k  layer  lod
5bd8deadSopenharmony_ci      ----------------   ---------      -----  -----  ---
5bd8deadSopenharmony_ci      1D                    yes         x - -    -     w
5bd8deadSopenharmony_ci      2D                    yes         x y -    -     w
5bd8deadSopenharmony_ci      3D                    yes         x y z    -     w
5bd8deadSopenharmony_ci      CUBE                  no          - - -    -     -
5bd8deadSopenharmony_ci      RECT                  yes         x y -    -     w
5bd8deadSopenharmony_ci      ARRAY1D               yes         x - -    y     w
5bd8deadSopenharmony_ci      ARRAY2D               yes         x y -    z     w
5bd8deadSopenharmony_ci      SHADOW1D              no          - - -    -     -
5bd8deadSopenharmony_ci      SHADOW2D              no          - - -    -     -
5bd8deadSopenharmony_ci      SHADOWRECT            no          - - -    -     -
5bd8deadSopenharmony_ci      SHADOWCUBE            no          - - -    -     -
5bd8deadSopenharmony_ci      SHADOWARRAY1D         no          - - -    -     -
5bd8deadSopenharmony_ci      SHADOWARRAY2D         no          - - -    -     -
5bd8deadSopenharmony_ci      BUFFER                yes         x - -    -     -
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Table X.18, Mappings of texel fetch coordinates to texel location.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Single-texel fetches do not support LOD clamping or any texture wrap mode,
5bd8deadSopenharmony_ci    and require a mipmapped minification filter to access any level of detail
5bd8deadSopenharmony_ci    other than the base level.  The results of the texel fetch are undefined:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      * if the computed LOD is less than the texture's base level (level_base)
5bd8deadSopenharmony_ci        or greater than the maximum level (level_max),
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      * if the computed LOD is not the texture's base level and the texture's
5bd8deadSopenharmony_ci        minification filter is NEAREST or LINEAR,
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      * if the layer specified for array textures is negative or greater than
5bd8deadSopenharmony_ci        the number of layers in the array texture,
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      * if the (i,j,k) coordinates refer to a border texel outside
5bd8deadSopenharmony_ci        the defined extents of the specified LOD, where
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci         i < -b_s, j < -b_s, k < -b_s,
5bd8deadSopenharmony_ci         i >= w_s - b_s, j >= h_s - b_s, or k >= d_s - b_s,
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        where the size parameters (w_s, h_s, d_s, and b_s) refer to the width,
5bd8deadSopenharmony_ci        height, depth, and border size of the image, as in equations 3.15,
5bd8deadSopenharmony_ci        3.16, and 3.17, or
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      * if the texture being accessed is not complete (or cube complete for
5bd8deadSopenharmony_ci        cubemaps).
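
    Several of the conditions above can be combined into one predicate.
    This sketch uses invented names and covers only the LOD, minification
    filter, coordinate range, and completeness conditions; the layer check
    is left out.  The sizes argument is assumed to hold (w_s, h_s, d_s) for
    the selected LOD, truncated to the dimensions the target actually uses.

```python
def texel_in_bounds(coords, sizes, border):
    """True if every coordinate c satisfies -b_s <= c < size - b_s."""
    return all(-border <= c < s - border for c, s in zip(coords, sizes))

def fetch_is_defined(coords, sizes, border, lod, level_base, level_max,
                     min_filter, complete=True):
    """Return False when the texel fetch result would be undefined."""
    if not complete:
        return False
    if lod < level_base or lod > level_max:
        return False
    # Levels other than the base level require a mipmapped filter.
    if lod != level_base and min_filter in ("NEAREST", "LINEAR"):
        return False
    return texel_in_bounds(coords, sizes, border)
```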
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.5, Program Flow Control
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    In addition to basic arithmetic, logical, and texture instructions, a
5bd8deadSopenharmony_ci    number of flow control instructions are provided, which are described in
5bd8deadSopenharmony_ci    detail in Section 2.X.8.  Programs can contain several types of
5bd8deadSopenharmony_ci    instruction blocks:  IF/ELSE/ENDIF blocks, REP/ENDREP blocks, and
5bd8deadSopenharmony_ci    subroutine blocks.  IF/ELSE/ENDIF blocks are a set of instructions
5bd8deadSopenharmony_ci    beginning with an "IF" instruction, ending with an "ENDIF" instruction,
5bd8deadSopenharmony_ci    and possibly containing an optional "ELSE" instruction.  REP/ENDREP blocks
5bd8deadSopenharmony_ci    are a set of instructions beginning with a "REP" instruction and ending
5bd8deadSopenharmony_ci    with an "ENDREP" instruction.  Subroutine blocks begin with an instruction
5bd8deadSopenharmony_ci    label identifying the name of the subroutine and end just before the
5bd8deadSopenharmony_ci    next instruction label or the end of the program.  Examples include the
5bd8deadSopenharmony_ci    following:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        MOVC CC, R0;
5bd8deadSopenharmony_ci        IF GT.x;
5bd8deadSopenharmony_ci          MOV R0, R1;     # executes if R0.x > 0
5bd8deadSopenharmony_ci        ELSE;
5bd8deadSopenharmony_ci          MOV R0, R2;     # executes if R0.x <= 0
5bd8deadSopenharmony_ci        ENDIF;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        REP repCount;
5bd8deadSopenharmony_ci        ADD R0, R0, R1;
5bd8deadSopenharmony_ci        ENDREP;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      square:             # subroutine to compute R0^2
5bd8deadSopenharmony_ci        MUL R0, R0, R0;
5bd8deadSopenharmony_ci        RET;
5bd8deadSopenharmony_ci      main:
5bd8deadSopenharmony_ci        MOV R0, 9.0;
5bd8deadSopenharmony_ci        CAL square;       # compute 9.0^2 in R0
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    IF/ELSE/ENDIF and REP/ENDREP blocks may be nested inside each other, and
5bd8deadSopenharmony_ci    inside subroutines.  In all cases, each instruction block must be
5bd8deadSopenharmony_ci    terminated with the appropriate instruction (ENDIF for IF, ENDREP for
5bd8deadSopenharmony_ci    REP).  Nested instruction blocks must be wholly contained within a block
5bd8deadSopenharmony_ci    -- if a REP instruction is found between an IF and ELSE instruction, the
5bd8deadSopenharmony_ci    corresponding ENDREP must also be present between the IF and ELSE.
5bd8deadSopenharmony_ci    Subroutines may not be nested inside IF/ELSE/ENDIF or REP/ENDREP blocks,
5bd8deadSopenharmony_ci    or inside other subroutines.  A program will fail to load if any
5bd8deadSopenharmony_ci    instruction block is terminated by an incorrect instruction, is not
5bd8deadSopenharmony_ci    terminated before the block containing it, or contains an instruction
5bd8deadSopenharmony_ci    label.
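
    The block structure rules above amount to a stack discipline, sketched
    below.  The token format is hypothetical: each element is an opcode
    name such as "IF" or "ENDREP", or an instruction label written with a
    trailing colon; operands and all other opcodes are ignored.

```python
def blocks_are_well_formed(tokens):
    """Check IF/ELSE/ENDIF and REP/ENDREP nesting for a token list."""
    stack = []                                   # currently open blocks
    for tok in tokens:
        if tok in ("IF", "REP"):
            stack.append(tok)
        elif tok == "ELSE":
            # ELSE must directly belong to an open IF with no ELSE yet.
            if not stack or stack[-1] != "IF":
                return False
            stack[-1] = "IF_ELSE"
        elif tok == "ENDIF":
            if not stack or stack.pop() not in ("IF", "IF_ELSE"):
                return False
        elif tok == "ENDREP":
            if not stack or stack.pop() != "REP":
                return False
        elif tok.endswith(":"):
            # A label starts a new subroutine; no block may still be open.
            if stack:
                return False
    return not stack                             # everything terminated
```

    Nesting depth limits (MAX_PROGRAM_IF_DEPTH_NV and
    MAX_PROGRAM_LOOP_DEPTH_NV) are not checked by this sketch.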
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    IF/ELSE/ENDIF blocks evaluate a condition to determine which instructions
5bd8deadSopenharmony_ci    to execute.  If the condition is true, all instructions between the IF and
5bd8deadSopenharmony_ci    ELSE are executed.  If the condition is false, all instructions between
5bd8deadSopenharmony_ci    the ELSE and ENDIF are executed.  The ELSE instruction is optional.  If
5bd8deadSopenharmony_ci    the ELSE is omitted, all instructions between the IF and ENDIF are
5bd8deadSopenharmony_ci    executed if the condition is true, or skipped if the condition is false.
5bd8deadSopenharmony_ci    A limited amount of nesting is supported -- a program will fail to load if
5bd8deadSopenharmony_ci    an IF instruction is nested inside MAX_PROGRAM_IF_DEPTH_NV or more
5bd8deadSopenharmony_ci    IF/ELSE/ENDIF blocks.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    REP/ENDREP blocks are used to execute a sequence of instructions multiple
5bd8deadSopenharmony_ci    times.  The REP instruction includes an optional scalar operand to specify
5bd8deadSopenharmony_ci    a loop count indicating the number of times the block of instructions
5bd8deadSopenharmony_ci    should be repeated.  If the loop count is omitted, the contents of a
5bd8deadSopenharmony_ci    REP/ENDREP block will be repeated indefinitely until the loop is
5bd8deadSopenharmony_ci    explicitly terminated.  A limited amount of nesting is supported -- a
5bd8deadSopenharmony_ci    program will fail to load if a REP instruction is nested inside
5bd8deadSopenharmony_ci    MAX_PROGRAM_LOOP_DEPTH_NV or more REP/ENDREP blocks.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Within a REP/ENDREP block, the CONT instruction can be used to terminate
5bd8deadSopenharmony_ci    the current iteration of the loop by effectively jumping to the ENDREP
5bd8deadSopenharmony_ci    instruction.  The BRK instruction can be used to terminate the entire loop
5bd8deadSopenharmony_ci    by effectively jumping to the instruction immediately following the ENDREP
5bd8deadSopenharmony_ci    instruction.  If CONT and BRK instructions are found inside multiply
5bd8deadSopenharmony_ci    nested REP/ENDREP blocks, they apply to the innermost block.  A program
5bd8deadSopenharmony_ci    will fail to load if it includes a CONT or BRK instruction that is not
5bd8deadSopenharmony_ci    contained inside a REP/ENDREP block.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    A REP/ENDREP block without a specified loop count can result in an
5bd8deadSopenharmony_ci    infinite loop.  To prevent obvious infinite loops, a program will fail to
5bd8deadSopenharmony_ci    load if it contains a REP/ENDREP block that contains neither a BRK
5bd8deadSopenharmony_ci    instruction at the current nesting level nor a RET instruction at any
5bd8deadSopenharmony_ci    nesting level.
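
    The check for obvious infinite loops can be sketched as follows.  This is
    a minimal Python model under stated assumptions: the function name
    has_obvious_infinite_loop is hypothetical, a program is represented as a
    flat list of strings in which "REP" has no loop count and "REP 4" has one,
    REP/ENDREP pairs are assumed to be balanced, and "current nesting level"
    is read as "not inside a more deeply nested REP/ENDREP block".

```python
def has_obvious_infinite_loop(program):
    """Return True if a count-less REP block has no BRK at its own loop
    nesting level and no RET at any nesting level inside it."""
    # One entry per open REP block:
    # [has_loop_count, saw_brk_at_this_level, saw_ret_anywhere_inside]
    stack = []
    for line in program:
        tokens = line.split()
        op = tokens[0]
        if op == "REP":
            stack.append([len(tokens) > 1, False, False])
        elif op == "BRK" and stack:
            stack[-1][1] = True      # BRK applies to the innermost loop only
        elif op == "RET":
            for entry in stack:      # RET leaves every enclosing loop
                entry[2] = True
        elif op == "ENDREP":
            has_count, saw_brk, saw_ret = stack.pop()
            if not has_count and not saw_brk and not saw_ret:
                return True
    return False


rejected = has_obvious_infinite_loop(["REP", "REP 2", "BRK", "ENDREP", "ENDREP"])
accepted = has_obvious_infinite_loop(["REP", "REP 2", "RET", "ENDREP", "ENDREP"])
```

    In the first program the BRK belongs to the inner loop, so the outer
    count-less loop has no exit and the program is rejected.  In the second,
    the RET counts for both loops.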
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Subroutines are supported via the CAL and RET instructions.  A subroutine
5bd8deadSopenharmony_ci    block is identified by an instruction label, which can be any valid
5bd8deadSopenharmony_ci    identifier according to the <instLabel> grammar rule.  The CAL instruction
5bd8deadSopenharmony_ci    identifies a subroutine name to call according to the <instTarget> grammar
5bd8deadSopenharmony_ci    rule.
5bd8deadSopenharmony_ci    Instruction labels used in CAL instructions do not need to be defined in
5bd8deadSopenharmony_ci    the program text that precedes the instruction, but a program will fail to
5bd8deadSopenharmony_ci    load if it includes a CAL instruction that references an instruction label
5bd8deadSopenharmony_ci    that is not defined anywhere in the program.  When a CAL instruction is
5bd8deadSopenharmony_ci    executed, it transfers control to the instruction immediately following
5bd8deadSopenharmony_ci    the specified instruction label.  Subsequent instructions in that
5bd8deadSopenharmony_ci    subroutine are executed until a RET instruction is executed, or until
5bd8deadSopenharmony_ci    program execution reaches another instruction label or the end of the
5bd8deadSopenharmony_ci    program text.  After the subroutine finishes, execution continues with the
5bd8deadSopenharmony_ci    instruction immediately following the CAL instruction.  When a RET
5bd8deadSopenharmony_ci    instruction is issued, it will break out of any IF/ELSE/ENDIF or
5bd8deadSopenharmony_ci    REP/ENDREP blocks that contain it.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Subroutines may call other subroutines before completing, up to an
5bd8deadSopenharmony_ci    implementation-dependent maximum depth of MAX_PROGRAM_CALL_DEPTH_NV calls.
5bd8deadSopenharmony_ci    Subroutines may call any subroutine in the program, including themselves,
5bd8deadSopenharmony_ci    as long as the call depth limit is obeyed.  The results of issuing a CAL
5bd8deadSopenharmony_ci    instruction while MAX_PROGRAM_CALL_DEPTH_NV subroutines have not completed
5bd8deadSopenharmony_ci    are undefined, and may include program termination.
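
    The subroutine rules above can be illustrated with a toy interpreter.
    This is a minimal Python sketch, not the GL's behavior in every corner
    case.  The function name run, the list-of-strings program format (labels
    end in ":"), and the limit value of 4 are assumptions made for
    illustration.  Condition code tests are not modeled, call stack overflow
    is reported as an exception although the extension leaves it undefined,
    and reaching a label with an empty call stack is assumed to end the
    program.

```python
MAX_PROGRAM_CALL_DEPTH_NV = 4  # illustrative value only; query the GL for the real limit


def run(program):
    """Execute a toy program and return the ordinary opcodes in execution order."""
    labels = {item[:-1]: i for i, item in enumerate(program) if item.endswith(":")}
    # Start after "main:" if present, otherwise at the first instruction.
    pc = labels["main"] + 1 if "main" in labels else 0
    call_stack = []
    trace = []
    while True:
        at_end = pc >= len(program)
        item = None if at_end else program[pc]
        if at_end or item.endswith(":") or item == "RET":
            # A label or the end of the program text acts like an unconditional RET.
            if not call_stack:
                break                  # return from the main subroutine
            pc = call_stack.pop()      # resume after the matching CAL
        elif item.startswith("CAL "):
            if len(call_stack) >= MAX_PROGRAM_CALL_DEPTH_NV:
                raise RuntimeError("call stack overflow: results are undefined")
            call_stack.append(pc + 1)
            pc = labels[item[4:]] + 1  # first instruction after the label
        else:
            trace.append(item)
            pc += 1
    return trace


program = [
    "scale:", "MUL", "RET",    # subroutine defined before its caller
    "main:", "MOV",
    "CAL scale",
    "CAL offset",              # label defined later in the program text
    "ADD", "RET",
    "offset:", "SUB",          # no RET: returns at the end of the program text
]
trace = run(program)           # ["MOV", "MUL", "SUB", "ADD"]
```

    The second CAL shows that a label may be defined after the instruction
    that references it, and the "offset" subroutine shows the implicit return
    at the end of the program text.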
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Several flow control instructions include condition code tests.  The IF
5bd8deadSopenharmony_ci    instruction requires a condition test to determine what instructions are
5bd8deadSopenharmony_ci    executed.  The CONT, BRK, CAL, and RET instructions have an optional
5bd8deadSopenharmony_ci    condition code test; if the test fails, the instructions are not executed.
5bd8deadSopenharmony_ci    Condition code tests are specified by the <ccTest> grammar rule.  The test
5bd8deadSopenharmony_ci    is evaluated like the condition code write mask (section 2.X.4.3), and
5bd8deadSopenharmony_ci    passes if and only if any of the four components passes.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If an instruction label named "main" is specified, GPU program execution
5bd8deadSopenharmony_ci    begins with the instruction immediately following that label.  Otherwise,
5bd8deadSopenharmony_ci    it begins with the first instruction of the program.  Instructions are
5bd8deadSopenharmony_ci    executed in sequence until either a RET instruction is issued in the main
5bd8deadSopenharmony_ci    subroutine or the end of the program text is reached.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.6, Program Options
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Programs may specify a number of options to indicate that one or more
5bd8deadSopenharmony_ci    extended language features are used by the program.  All program options
5bd8deadSopenharmony_ci    used by the program must be declared at the beginning of the program
5bd8deadSopenharmony_ci    string.  Each program option specified in a program string will modify the
5bd8deadSopenharmony_ci    syntactic or semantic rules used to interpret the program and the execution
5bd8deadSopenharmony_ci    environment used to execute the program.  Features in program options
5bd8deadSopenharmony_ci    not declared by the program are ignored, even if the option is otherwise
5bd8deadSopenharmony_ci    supported by the GL.  Each option declaration consists of two tokens: the
5bd8deadSopenharmony_ci    keyword "OPTION" and an identifier.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The set of available options depends on the program type, and is
5bd8deadSopenharmony_ci    enumerated in the specifications for each program type.  Some program
5bd8deadSopenharmony_ci    types may not provide any options.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.7, Program Declarations
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Programs may include a number of declaration statements to specify
5bd8deadSopenharmony_ci    characteristics of the program.  Each declaration statement is followed by
5bd8deadSopenharmony_ci    one or more arguments, separated by commas.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The set of available declarations depends on the program type, and is
5bd8deadSopenharmony_ci    enumerated in the specifications for each program type.  Some program
5bd8deadSopenharmony_ci    types may not provide declarations.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8, Program Instruction Set
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The following sections enumerate the set of instructions supported for GPU
5bd8deadSopenharmony_ci    programs.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Some instructions allow the use of one of the three basic data type
5bd8deadSopenharmony_ci    modifiers (floating point, signed integer, and unsigned integer).  Unless
5bd8deadSopenharmony_ci    otherwise mentioned:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      * the result and all of the operands will be interpreted according to
5bd8deadSopenharmony_ci        the specified data type, and
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      * if no data type modifier is specified, the instruction will operate as
5bd8deadSopenharmony_ci        though a floating-point modifier ("F") were specified.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Some instructions will override one or both of these rules.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, ABS:  Absolute Value
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The ABS instruction performs a component-wise absolute value operation on
5bd8deadSopenharmony_ci    the single operand to yield a result vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp = VectorLoad(op0);
5bd8deadSopenharmony_ci      result.x = abs(tmp.x);
5bd8deadSopenharmony_ci      result.y = abs(tmp.y);
5bd8deadSopenharmony_ci      result.z = abs(tmp.z);
5bd8deadSopenharmony_ci      result.w = abs(tmp.w);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    ABS supports all three data type modifiers.  Taking the absolute value of
5bd8deadSopenharmony_ci    an unsigned integer is not a useful operation, but is not illegal.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, ADD:  Add
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The ADD instruction performs a component-wise add of the two operands to
5bd8deadSopenharmony_ci    yield a result vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = VectorLoad(op1);
5bd8deadSopenharmony_ci      result.x = tmp0.x + tmp1.x;
5bd8deadSopenharmony_ci      result.y = tmp0.y + tmp1.y;
5bd8deadSopenharmony_ci      result.z = tmp0.z + tmp1.z;
5bd8deadSopenharmony_ci      result.w = tmp0.w + tmp1.w;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    ADD supports all three data type modifiers.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, AND:  Bitwise AND
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The AND instruction performs a bitwise AND operation on the components of
5bd8deadSopenharmony_ci    the two source vectors to yield a result vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = VectorLoad(op1);
5bd8deadSopenharmony_ci      result.x = tmp0.x & tmp1.x;
5bd8deadSopenharmony_ci      result.y = tmp0.y & tmp1.y;
5bd8deadSopenharmony_ci      result.z = tmp0.z & tmp1.z;
5bd8deadSopenharmony_ci      result.w = tmp0.w & tmp1.w;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    AND supports only signed and unsigned integer data type modifiers.  If no
5bd8deadSopenharmony_ci    type modifier is specified, both operands and the result are treated as
5bd8deadSopenharmony_ci    signed integers.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, BRK:  Break out of Loop Instruction
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The BRK instruction conditionally transfers control to the instruction
5bd8deadSopenharmony_ci    immediately following the next ENDREP instruction.  A BRK instruction has
5bd8deadSopenharmony_ci    no effect if the condition code test evaluates to FALSE.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The following pseudocode describes the operation of the instruction:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      if (TestCC(cc.c***) || TestCC(cc.*c**) ||
5bd8deadSopenharmony_ci          TestCC(cc.**c*) || TestCC(cc.***c)) {
5bd8deadSopenharmony_ci        continue execution at instruction following the next ENDREP;
5bd8deadSopenharmony_ci      }
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, CAL:  Subroutine Call
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The CAL instruction conditionally transfers control to the instruction
5bd8deadSopenharmony_ci    following the label specified in the instruction.  It also pushes a
5bd8deadSopenharmony_ci    reference to the instruction immediately following the CAL instruction
5bd8deadSopenharmony_ci    onto the call stack, where execution will continue after executing the
5bd8deadSopenharmony_ci    matching RET instruction.  The following pseudocode describes the
5bd8deadSopenharmony_ci    operation of the instruction:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      if (TestCC(cc.c***) || TestCC(cc.*c**) ||
5bd8deadSopenharmony_ci          TestCC(cc.**c*) || TestCC(cc.***c)) {
5bd8deadSopenharmony_ci        if (callStackDepth >= MAX_PROGRAM_CALL_DEPTH_NV) {
5bd8deadSopenharmony_ci          // undefined results
5bd8deadSopenharmony_ci        } else {
5bd8deadSopenharmony_ci          callStack[callStackDepth] = nextInstruction;
5bd8deadSopenharmony_ci          callStackDepth++;
5bd8deadSopenharmony_ci        }
5bd8deadSopenharmony_ci        // continue execution at instruction following <instTarget>
5bd8deadSopenharmony_ci      } else {
5bd8deadSopenharmony_ci        // do nothing
5bd8deadSopenharmony_ci      }
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    In the pseudocode, <instTarget> is the label specified in the instruction
5bd8deadSopenharmony_ci    (matching the <instTarget> grammar rule), <callStackDepth> is the current
5bd8deadSopenharmony_ci    depth of the call stack, <callStack> is an array holding the call stack,
5bd8deadSopenharmony_ci    and <nextInstruction> is a reference to the instruction immediately
5bd8deadSopenharmony_ci    following the CAL instruction in the program string.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If the call stack overflows, the results of the CAL instruction are
5bd8deadSopenharmony_ci    undefined, and can result in immediate program termination.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    An instruction label signifies the beginning of a new subroutine.
5bd8deadSopenharmony_ci    Subroutines may not nest or overlap.  If a CAL instruction is executed and
5bd8deadSopenharmony_ci    subsequent program execution reaches an instruction label before a
5bd8deadSopenharmony_ci    corresponding RET instruction is executed, the subroutine call returns
5bd8deadSopenharmony_ci    immediately, as though an unconditional RET instruction were inserted
5bd8deadSopenharmony_ci    immediately before the instruction label.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (Note:  On previous vertex program extensions -- NV_vertex_program2 and
5bd8deadSopenharmony_ci    NV_vertex_program3 -- instruction labels were also used as targets for
5bd8deadSopenharmony_ci    branch (BRA) instructions.  This unstructured branching functionality has
5bd8deadSopenharmony_ci    been replaced with the structured branching constructs found in this
5bd8deadSopenharmony_ci    instruction set.)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, CEIL:  Ceiling
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The CEIL instruction loads a single vector operand and performs a
5bd8deadSopenharmony_ci    component-wise ceiling operation to generate a result vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp = VectorLoad(op0);
5bd8deadSopenharmony_ci      result.x = ceil(tmp.x);
5bd8deadSopenharmony_ci      result.y = ceil(tmp.y);
5bd8deadSopenharmony_ci      result.z = ceil(tmp.z);
5bd8deadSopenharmony_ci      result.w = ceil(tmp.w);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The ceiling operation returns the nearest integer greater than or equal to
5bd8deadSopenharmony_ci    the operand.  For example, ceil(-1.7) = -1.0, ceil(+1.0) = +1.0, and
5bd8deadSopenharmony_ci    ceil(+3.7) = +4.0.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    CEIL supports all three data type modifiers.  The single operand is always
5bd8deadSopenharmony_ci    treated as a floating-point vector, but the result is written as a
5bd8deadSopenharmony_ci    floating-point value, a signed integer, or an unsigned integer, as
5bd8deadSopenharmony_ci    specified by the data type modifier.  If a value is not exactly
5bd8deadSopenharmony_ci    representable using the data type of the result (e.g., an overflow or
5bd8deadSopenharmony_ci    writing a negative value to an unsigned integer), the result is undefined.
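
    The effect of the data type modifier on a single component can be
    sketched in Python.  The function name ceil_component and the use of None
    to mark an undefined result are assumptions made for illustration, as is
    the 32-bit width of the integer results.  NaN and infinite operands are
    not handled.

```python
import math


def ceil_component(x, modifier="F"):
    """Model one component of CEIL for modifier "F", "S" or "U".

    The operand is always a float.  Returns None where the result would be
    undefined because the value does not fit the result type.
    """
    value = math.ceil(x)                   # exact integer ceiling
    if modifier == "F":
        return float(value)
    if modifier == "S":
        low, high = -2**31, 2**31 - 1      # assumed 32-bit signed range
    else:
        low, high = 0, 2**32 - 1           # assumed 32-bit unsigned range
    return value if low <= value <= high else None


as_float = ceil_component(-1.7)            # -1.0
as_signed = ceil_component(-1.7, "S")      # -1
as_unsigned = ceil_component(-1.7, "U")    # None: negative value, undefined
```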
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, CMP:  Compare
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The CMP instruction performs a component-wise comparison of the first
5bd8deadSopenharmony_ci    operand against zero, and copies the values of the second or third
5bd8deadSopenharmony_ci    operands based on the results of the compare.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = VectorLoad(op1);
5bd8deadSopenharmony_ci      tmp2 = VectorLoad(op2);
5bd8deadSopenharmony_ci      result.x = (tmp0.x < 0) ? tmp1.x : tmp2.x;
5bd8deadSopenharmony_ci      result.y = (tmp0.y < 0) ? tmp1.y : tmp2.y;
5bd8deadSopenharmony_ci      result.z = (tmp0.z < 0) ? tmp1.z : tmp2.z;
5bd8deadSopenharmony_ci      result.w = (tmp0.w < 0) ? tmp1.w : tmp2.w;
5bd8deadSopenharmony_ci
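5bd8deadSopenharmony_ci    CMP supports all three data type modifiers.  CMP with an unsigned data
5bd8deadSopenharmony_ci    type modifier is not a useful operation (no component of the first operand
5bd8deadSopenharmony_ci    is ever less than zero, so the third operand is always selected), but is
5bd8deadSopenharmony_ci    not illegal.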
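
    The CMP pseudocode amounts to a component-wise select, sketched here in
    Python.  The function name cmp_instruction and the use of tuples for
    four-component vectors are assumptions made for illustration.

```python
def cmp_instruction(op0, op1, op2):
    """Component-wise select: op1 where op0 < 0, otherwise op2."""
    return tuple(b if a < 0 else c for a, b, c in zip(op0, op1, op2))


# Signed or floating-point first operand: negative components pick op1.
selected = cmp_instruction((-1.0, 0.0, 2.0, -0.5), (10, 20, 30, 40), (1, 2, 3, 4))

# Unsigned first operand: nothing is below zero, so op2 is always picked.
unsigned = cmp_instruction((0, 5, 7, 4294967295), (10, 20, 30, 40), (1, 2, 3, 4))
```

    Note that a zero component selects the third operand, since the
    comparison is strictly "less than".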
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, CONT:  Continue with Next Loop Iteration
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The CONT instruction conditionally transfers control to the next ENDREP
5bd8deadSopenharmony_ci    instruction.  A CONT instruction has no effect if the condition code test
5bd8deadSopenharmony_ci    evaluates to FALSE.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The following pseudocode describes the operation of the instruction:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      if (TestCC(cc.c***) || TestCC(cc.*c**) ||
5bd8deadSopenharmony_ci          TestCC(cc.**c*) || TestCC(cc.***c)) {
5bd8deadSopenharmony_ci        continue execution at the next ENDREP;
5bd8deadSopenharmony_ci      }
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, COS:  Cosine with Reduction to [-PI,PI]
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The COS instruction approximates the trigonometric cosine of the angle
5bd8deadSopenharmony_ci    specified by the scalar operand and replicates it to all four components
5bd8deadSopenharmony_ci    of the result vector.  The angle is specified in radians and does not have
5bd8deadSopenharmony_ci    to be in the range [-PI,PI].
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp = ScalarLoad(op0);
5bd8deadSopenharmony_ci      result.x = ApproxCosine(tmp);
5bd8deadSopenharmony_ci      result.y = ApproxCosine(tmp);
5bd8deadSopenharmony_ci      result.z = ApproxCosine(tmp);
5bd8deadSopenharmony_ci      result.w = ApproxCosine(tmp);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    COS supports only floating-point data type modifiers.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, DDX:  Partial Derivative Relative to X
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The DDX instruction computes approximate partial derivatives of a vector
5bd8deadSopenharmony_ci    operand with respect to the X window coordinate, and is only available to
5bd8deadSopenharmony_ci    fragment programs.  See the NV_fragment_program4 specification for more
5bd8deadSopenharmony_ci    details.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, DDY:  Partial Derivative Relative to Y
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The DDY instruction computes approximate partial derivatives of a vector
5bd8deadSopenharmony_ci    operand with respect to the Y window coordinate, and is only available to
5bd8deadSopenharmony_ci    fragment programs.  See the NV_fragment_program4 specification for more
5bd8deadSopenharmony_ci    details.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, DIV:  Divide Vector Components by Scalar
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The DIV instruction performs a component-wise divide of the first vector
5bd8deadSopenharmony_ci    operand by the second scalar operand to produce a 4-component result
5bd8deadSopenharmony_ci    vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = ScalarLoad(op1);
5bd8deadSopenharmony_ci      result.x = tmp0.x / tmp1;
5bd8deadSopenharmony_ci      result.y = tmp0.y / tmp1;
5bd8deadSopenharmony_ci      result.z = tmp0.z / tmp1;
5bd8deadSopenharmony_ci      result.w = tmp0.w / tmp1;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    DIV supports all three data type modifiers.  For floating-point division,
5bd8deadSopenharmony_ci    this instruction is not guaranteed to produce results identical to a
5bd8deadSopenharmony_ci    RCP/MUL instruction sequence.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The results of a signed or unsigned integer division by zero are
5bd8deadSopenharmony_ci    undefined.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, DP2:  2-Component Dot Product
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The DP2 instruction computes a two-component dot product of the two
5bd8deadSopenharmony_ci    operands (using the first two components) and replicates the dot product
5bd8deadSopenharmony_ci    to all four components of the result vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = VectorLoad(op1);
5bd8deadSopenharmony_ci      dot = (tmp0.x * tmp1.x) + (tmp0.y * tmp1.y);
5bd8deadSopenharmony_ci      result.x = dot;
5bd8deadSopenharmony_ci      result.y = dot;
5bd8deadSopenharmony_ci      result.z = dot;
5bd8deadSopenharmony_ci      result.w = dot;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    DP2 supports only floating-point data type modifiers.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, DP2A:  2-Component Dot Product with Scalar Add
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The DP2A instruction computes a two-component dot product of the two
5bd8deadSopenharmony_ci    operands (using the first two components), adds the x component of the
5bd8deadSopenharmony_ci    third operand, and replicates the result to all four components of the
5bd8deadSopenharmony_ci    result vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = VectorLoad(op1);
5bd8deadSopenharmony_ci      tmp2 = VectorLoad(op2);
5bd8deadSopenharmony_ci      dot = (tmp0.x * tmp1.x) + (tmp0.y * tmp1.y) + tmp2.x;
5bd8deadSopenharmony_ci      result.x = dot;
5bd8deadSopenharmony_ci      result.y = dot;
5bd8deadSopenharmony_ci      result.z = dot;
5bd8deadSopenharmony_ci      result.w = dot;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    DP2A supports only floating-point data type modifiers.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, DP3:  3-Component Dot Product
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The DP3 instruction computes a three-component dot product of the two
5bd8deadSopenharmony_ci    operands (using the x, y, and z components) and replicates the dot product
5bd8deadSopenharmony_ci    to all four components of the result vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = VectorLoad(op1);
5bd8deadSopenharmony_ci      dot = (tmp0.x * tmp1.x) + (tmp0.y * tmp1.y) +
5bd8deadSopenharmony_ci            (tmp0.z * tmp1.z);
5bd8deadSopenharmony_ci      result.x = dot;
5bd8deadSopenharmony_ci      result.y = dot;
5bd8deadSopenharmony_ci      result.z = dot;
5bd8deadSopenharmony_ci      result.w = dot;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    DP3 supports only floating-point data type modifiers.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, DP4:  4-Component Dot Product
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The DP4 instruction computes a four-component dot product of the two
5bd8deadSopenharmony_ci    operands and replicates the dot product to all four components of the
5bd8deadSopenharmony_ci    result vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = VectorLoad(op1);
5bd8deadSopenharmony_ci      dot = (tmp0.x * tmp1.x) + (tmp0.y * tmp1.y) +
5bd8deadSopenharmony_ci            (tmp0.z * tmp1.z) + (tmp0.w * tmp1.w);
5bd8deadSopenharmony_ci      result.x = dot;
5bd8deadSopenharmony_ci      result.y = dot;
5bd8deadSopenharmony_ci      result.z = dot;
5bd8deadSopenharmony_ci      result.w = dot;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    DP4 supports only floating-point data type modifiers.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, DPH:  Homogeneous Dot Product
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The DPH instruction computes a three-component dot product of the two
5bd8deadSopenharmony_ci    operands (using the x, y, and z components), adds the w component of the
5bd8deadSopenharmony_ci    second operand, and replicates the sum to all four components of the
5bd8deadSopenharmony_ci    result vector.  This is equivalent to a four-component dot product where
5bd8deadSopenharmony_ci    the w component of the first operand is forced to 1.0.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = VectorLoad(op1);
5bd8deadSopenharmony_ci      dot = (tmp0.x * tmp1.x) + (tmp0.y * tmp1.y) +
5bd8deadSopenharmony_ci            (tmp0.z * tmp1.z) + tmp1.w;
5bd8deadSopenharmony_ci      result.x = dot;
5bd8deadSopenharmony_ci      result.y = dot;
5bd8deadSopenharmony_ci      result.z = dot;
5bd8deadSopenharmony_ci      result.w = dot;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    DPH supports only floating-point data type modifiers.
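
    The stated equivalence between DPH and a four-component dot product can
    be checked with a small Python sketch.  The function names dp4 and dph
    and the sample vectors are assumptions made for illustration.  Each
    function returns the scalar that the instruction would replicate to all
    four result components.

```python
def dp4(a, b):
    """Four-component dot product."""
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2] + a[3] * b[3]


def dph(a, b):
    """Homogeneous dot product: a.w is ignored and treated as 1.0."""
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2] + b[3]


point = (2.0, 3.0, 4.0, 9.0)          # w = 9.0 is ignored by DPH
plane = (1.0, 0.0, -1.0, 5.0)
forced_w = point[:3] + (1.0,)         # same point with w forced to 1.0

homogeneous = dph(point, plane)       # 2 + 0 - 4 + 5 = 3.0
reference = dp4(forced_w, plane)      # 3.0
```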
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, DST:  Distance Vector
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The DST instruction computes a distance vector from two specially-
5bd8deadSopenharmony_ci    formatted operands.  The first operand should be of the form [NA, d^2,
5bd8deadSopenharmony_ci    d^2, NA] and the second operand should be of the form [NA, 1/d, NA, 1/d],
5bd8deadSopenharmony_ci    where NA values are not relevant to the calculation and d is a vector
5bd8deadSopenharmony_ci    length.  If both vectors satisfy these conditions, the result vector will
5bd8deadSopenharmony_ci    be of the form [1.0, d, d^2, 1/d].
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The exact behavior is specified in the following pseudocode:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = VectorLoad(op1);
5bd8deadSopenharmony_ci      result.x = 1.0;
5bd8deadSopenharmony_ci      result.y = tmp0.y * tmp1.y;
5bd8deadSopenharmony_ci      result.z = tmp0.z;
5bd8deadSopenharmony_ci      result.w = tmp1.w;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Given an arbitrary vector, d^2 can be obtained using the DP3 instruction
5bd8deadSopenharmony_ci    (using the same vector for both operands) and 1/d can be obtained from d^2
5bd8deadSopenharmony_ci    using the RSQ instruction.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    This distance vector is useful for per-vertex light attenuation
5bd8deadSopenharmony_ci    calculations:  a DP3 operation using the distance vector and an
5bd8deadSopenharmony_ci    attenuation constants vector as operands will yield the attenuation
5bd8deadSopenharmony_ci    factor.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    DST supports only floating-point data type modifiers.
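
    The DP3/RSQ/DST sequence described above can be worked through in Python
    for the vector (3, 0, 4), whose length d is 5.  The function names dp3
    and dst, the attenuation constants, and the use of 0.0 for the
    "not relevant" (NA) components are assumptions made for illustration.
    RSQ is modeled with an exact reciprocal square root, whereas the real
    instruction is an approximation.

```python
import math


def dp3(a, b):
    """Three-component dot product (x, y, z only)."""
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]


def dst(op0, op1):
    """Distance vector, following the DST pseudocode."""
    return (1.0, op0[1] * op1[1], op0[2], op1[3])


NA = 0.0                              # placeholder for components DST ignores
v = (3.0, 0.0, 4.0)
d2 = dp3(v, v)                        # d^2 = 25.0
inv_d = 1.0 / math.sqrt(d2)           # models RSQ: 1/d = 0.2

dist = dst((NA, d2, d2, NA), (NA, inv_d, NA, inv_d))   # (1.0, d, d^2, 1/d)

atten_constants = (1.0, 0.5, 0.25)    # hypothetical (k0, k1, k2)
poly = dp3(dist, atten_constants)     # k0 + k1*d + k2*d^2 = 9.75
```

    The y component is d^2 * (1/d) = d, which is how DST recovers the length
    without a separate square root.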
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, ELSE:  Start of If Test Else Block
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The ELSE instruction signifies the end of the "execute if true" portion of
5bd8deadSopenharmony_ci    an IF/ELSE/ENDIF block and the beginning of the "execute if false"
5bd8deadSopenharmony_ci    portion.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If the condition evaluated at the IF statement was TRUE, when a program
5bd8deadSopenharmony_ci    reaches the ELSE statement, it has completed the entire "execute if true"
5bd8deadSopenharmony_ci    portion of the IF/ELSE/ENDIF block.  Execution will continue at the
5bd8deadSopenharmony_ci    corresponding ENDIF instruction.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If the condition evaluated at the IF statement was FALSE, program
5bd8deadSopenharmony_ci    execution would skip over the entire "execute if true" portion of the
5bd8deadSopenharmony_ci    IF/ELSE/ENDIF block, including the ELSE instruction.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, EMIT:  Emit Vertex
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The EMIT instruction emits a new vertex to be added to the current output
5bd8deadSopenharmony_ci    primitive generated by a geometry program, and is only available to
5bd8deadSopenharmony_ci    geometry programs.  See the NV_geometry_program4 specification for more
5bd8deadSopenharmony_ci    details.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, ENDIF:  End of If Test Block
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The ENDIF instruction signifies the end of an IF/ELSE/ENDIF block.  It has
5bd8deadSopenharmony_ci    no other effect on program execution.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, ENDPRIM:  End of Primitive
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    A geometry program can emit multiple primitives in a single invocation.
5bd8deadSopenharmony_ci    The ENDPRIM instruction is used in a geometry program to signify the end
5bd8deadSopenharmony_ci    of the current primitive and the beginning of a new primitive of the same
5bd8deadSopenharmony_ci    type.  It is only available to geometry programs.  See the
5bd8deadSopenharmony_ci    NV_geometry_program4 specification for more details.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, ENDREP:  End of Repeat Block
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The ENDREP instruction specifies the end of a REP block.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    When used in conjunction with a REP instruction with a loop count,
5bd8deadSopenharmony_ci    ENDREP decrements the loop counter.  If the decremented loop counter is
5bd8deadSopenharmony_ci    greater than zero, ENDREP transfers control to the instruction immediately
5bd8deadSopenharmony_ci    after the corresponding REP instruction.  If the loop counter is less than
5bd8deadSopenharmony_ci    or equal to zero, execution continues at the instruction following the
5bd8deadSopenharmony_ci    ENDREP instruction.  When used in conjunction with a REP instruction
5bd8deadSopenharmony_ci    without loop count, ENDREP always transfers control to the instruction
5bd8deadSopenharmony_ci    immediately after the REP instruction.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      if (REP instruction includes a loop count) {
5bd8deadSopenharmony_ci        LoopCount--;
5bd8deadSopenharmony_ci        if (LoopCount > 0) {
5bd8deadSopenharmony_ci          continue execution at instruction following corresponding REP
5bd8deadSopenharmony_ci            instruction;
5bd8deadSopenharmony_ci        }
5bd8deadSopenharmony_ci      } else {
5bd8deadSopenharmony_ci        continue execution at instruction following corresponding REP
5bd8deadSopenharmony_ci          instruction;
5bd8deadSopenharmony_ci      }
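
    The ENDREP pseudocode can be modeled in Python to see how many times a
    loop body runs.  The function names endrep and count_iterations and the
    use of None for "no loop count" are assumptions made for illustration.
    The sketch assumes the loop body has already been entered; how REP itself
    treats a non-positive initial count is described with the REP instruction
    and is not modeled here.

```python
def endrep(loop_count):
    """Model ENDREP.  loop_count is None for a REP without a loop count.

    Returns (updated_loop_count, jump_back_to_loop_body).
    """
    if loop_count is None:
        return None, True              # always repeat; only BRK or RET exits
    loop_count -= 1
    return loop_count, loop_count > 0


def count_iterations(initial_count):
    """Count body executions for a REP with a positive loop count."""
    iterations, count = 0, initial_count
    while True:
        iterations += 1                # the loop body executes here
        count, jump_back = endrep(count)
        if not jump_back:
            return iterations


three = count_iterations(3)            # body runs 3 times
```

    Because the counter is decremented before it is tested, a loop count of N
    (N >= 1) executes the body exactly N times.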
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, EX2:  Exponential Base 2
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The EX2 instruction approximates 2 raised to the power of the scalar
5bd8deadSopenharmony_ci    operand and replicates the approximation to all four components of the
5bd8deadSopenharmony_ci    result vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp = ScalarLoad(op0);
5bd8deadSopenharmony_ci      result.x = Approx2ToX(tmp);
5bd8deadSopenharmony_ci      result.y = Approx2ToX(tmp);
5bd8deadSopenharmony_ci      result.z = Approx2ToX(tmp);
5bd8deadSopenharmony_ci      result.w = Approx2ToX(tmp);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    EX2 supports only floating-point data type modifiers.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, FLR:  Floor
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The FLR instruction loads a single vector operand and performs a
5bd8deadSopenharmony_ci    component-wise floor operation to generate a result vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp = VectorLoad(op0);
5bd8deadSopenharmony_ci      result.x = floor(tmp.x);
5bd8deadSopenharmony_ci      result.y = floor(tmp.y);
5bd8deadSopenharmony_ci      result.z = floor(tmp.z);
5bd8deadSopenharmony_ci      result.w = floor(tmp.w);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The floor operation returns the nearest integer less than or equal to the
5bd8deadSopenharmony_ci    operand.  For example, floor(-1.7) = -2.0, floor(+1.0) = +1.0, and floor(+3.7)
5bd8deadSopenharmony_ci    = +3.0.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    FLR supports all three data type modifiers.  The single operand is always
5bd8deadSopenharmony_ci    treated as a floating-point value, but the result is written as a
5bd8deadSopenharmony_ci    floating-point value, a signed integer, or an unsigned integer, as
5bd8deadSopenharmony_ci    specified by the data type modifier.  If a value is not exactly
5bd8deadSopenharmony_ci    representable using the data type of the result (e.g., an overflow or
5bd8deadSopenharmony_ci    writing a negative value to an unsigned integer), the result is undefined.
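
    The interaction between the floor operation and the data type modifier
    might be sketched in Python as follows.  The function name, the
    single-letter modifier strings, and the use of None to stand for an
    undefined result are illustrative assumptions and not part of this
    extension; 32-bit integer results and finite inputs are also assumed.

```python
import math


def flr(value, modifier="F"):
    """Floor one finite component and write it with the given modifier.

    'F' yields a float, 'S' a signed and 'U' an unsigned 32-bit integer.
    None marks results the specification leaves undefined.
    """
    floored = math.floor(value)
    if modifier == "F":
        return float(floored)
    if modifier == "S":
        lo, hi = -2**31, 2**31 - 1
    else:
        lo, hi = 0, 2**32 - 1
    return floored if lo <= floored <= hi else None
```

    For example, flr(-1.7) gives -2.0, while flr(-1.7, "U") is undefined
    because -2 cannot be written to an unsigned integer.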
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, FRC:  Fraction
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The FRC instruction extracts the fractional portion of each component of
5bd8deadSopenharmony_ci    the operand to generate a result vector.  The fractional portion of a
5bd8deadSopenharmony_ci    component is defined as the result after subtracting off the floor of the
5bd8deadSopenharmony_ci    component (see FLR), and is always in the range [0.0, 1.0).
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    For negative values, the fractional portion is NOT the number written to
5bd8deadSopenharmony_ci    the right of the decimal point -- the fractional portion of -1.7 is not
5bd8deadSopenharmony_ci    0.7 -- it is 0.3.  0.3 is produced by subtracting the floor of -1.7 (-2.0)
5bd8deadSopenharmony_ci    from -1.7.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp = VectorLoad(op0);
5bd8deadSopenharmony_ci      result.x = fraction(tmp.x);
5bd8deadSopenharmony_ci      result.y = fraction(tmp.y);
5bd8deadSopenharmony_ci      result.z = fraction(tmp.z);
5bd8deadSopenharmony_ci      result.w = fraction(tmp.w);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    FRC supports only floating-point data type modifiers.
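
    The definition above can be checked with a short Python sketch.  The
    function name is hypothetical, and Python's double-precision arithmetic
    stands in for the implementation's floating-point format.

```python
import math


def frc(x):
    """Fractional portion as defined for FRC: x minus floor(x)."""
    return x - math.floor(x)
```

    With this definition, frc(-1.7) is approximately 0.3 and frc(-0.25) is
    0.75, matching the treatment of negative values described above.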
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, I2F:  Integer to Float
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The I2F instruction converts the components of an integer vector operand
5bd8deadSopenharmony_ci    to floating-point to produce a floating-point result vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp = VectorLoad(op0);
5bd8deadSopenharmony_ci      result.x = (float) tmp.x;
5bd8deadSopenharmony_ci      result.y = (float) tmp.y;
5bd8deadSopenharmony_ci      result.z = (float) tmp.z;
5bd8deadSopenharmony_ci      result.w = (float) tmp.w;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    I2F supports only signed and unsigned integer data type modifiers.  The
5bd8deadSopenharmony_ci    single operand is interpreted according to the data type modifier.  If no
5bd8deadSopenharmony_ci    data type modifier is specified, the operand is treated as a signed
5bd8deadSopenharmony_ci    integer vector.  The result is always written as a float.
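
    The effect of the data type modifier on the same operand bits can be
    illustrated with a Python sketch.  The helper names and the modifier
    strings are assumptions made for this example; the operand is passed as a
    raw 32-bit pattern and the result is rounded to a 32-bit float.

```python
import struct


def to_float32(value):
    """Round a number to the nearest 32-bit float."""
    return struct.unpack("<f", struct.pack("<f", float(value)))[0]


def i2f(bits, modifier="S"):
    """Convert one 32-bit component to float; signed unless modifier is 'U'."""
    bits &= 0xFFFFFFFF
    if modifier == "S" and bits & 0x80000000:
        bits -= 1 << 32  # reinterpret the pattern as two's complement
    return to_float32(bits)
```

    The bit pattern 0xFFFFFFFF converts to -1.0 when treated as signed and to
    4294967296.0 (the nearest 32-bit float to 4294967295) when treated as
    unsigned.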
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, IF:  Start of If Test Block
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The IF instruction performs a condition code test to determine what
5bd8deadSopenharmony_ci    instructions inside an IF/ELSE/ENDIF block are executed.  If the test
5bd8deadSopenharmony_ci    passes, execution continues at the instruction immediately following the
5bd8deadSopenharmony_ci    IF instruction.  If the test fails, IF transfers control to the
5bd8deadSopenharmony_ci    instruction immediately following the corresponding ELSE instruction (if
5bd8deadSopenharmony_ci    present) or the ENDIF instruction (if no ELSE is present).
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Implementations may have a limited ability to nest IF blocks in any
5bd8deadSopenharmony_ci    subroutine.  If the number of IF/ENDIF blocks nested inside each other is
5bd8deadSopenharmony_ci    MAX_PROGRAM_IF_DEPTH_NV or higher, a program will fail to compile.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      // Evaluate the condition.  If the condition is true, continue at the
5bd8deadSopenharmony_ci      // next instruction.  Otherwise, continue at the instruction following
5bd8deadSopenharmony_ci      // the corresponding ELSE (if present) or ENDIF.
5bd8deadSopenharmony_ci      if (TestCC(cc.c***) || TestCC(cc.*c**) ||
5bd8deadSopenharmony_ci          TestCC(cc.**c*) || TestCC(cc.***c)) {
5bd8deadSopenharmony_ci        continue execution at the next instruction;
5bd8deadSopenharmony_ci      } else if (IF block contains an ELSE statement) {
5bd8deadSopenharmony_ci        continue execution at instruction following corresponding ELSE;
5bd8deadSopenharmony_ci      } else {
5bd8deadSopenharmony_ci        continue execution at instruction following corresponding ENDIF;
5bd8deadSopenharmony_ci      }
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (Note:  Unlike the NV_fragment_program2 extension, there is no run-time
5bd8deadSopenharmony_ci    limit on the maximum overall depth of IF/ENDIF nesting.  As long as each
5bd8deadSopenharmony_ci    individual subroutine of the program obeys the static nesting limits,
5bd8deadSopenharmony_ci    there will be no run-time errors in the program.  With the
5bd8deadSopenharmony_ci    NV_fragment_program2 extension, a program could terminate abnormally if it
5bd8deadSopenharmony_ci    called a subroutine inside a very deeply nested set of IF/ENDIF blocks and
5bd8deadSopenharmony_ci    the called subroutine also contained deeply nested IF/ENDIF blocks.  Such
5bd8deadSopenharmony_ci    an error could occur even if neither subroutine exceeded static limits.)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, KIL:  Kill Fragment
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The KIL instruction conditionally kills a fragment, and is only available
5bd8deadSopenharmony_ci    to fragment programs.  See the NV_fragment_program4 specification for more
5bd8deadSopenharmony_ci    details.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, LG2:  Logarithm Base 2
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The LG2 instruction approximates the base 2 logarithm of the scalar
5bd8deadSopenharmony_ci    operand and replicates it to all four components of the result vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp = ScalarLoad(op0);
5bd8deadSopenharmony_ci      result.x = ApproxLog2(tmp);
5bd8deadSopenharmony_ci      result.y = ApproxLog2(tmp);
5bd8deadSopenharmony_ci      result.z = ApproxLog2(tmp);
5bd8deadSopenharmony_ci      result.w = ApproxLog2(tmp);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If the scalar operand is zero or negative, the result is undefined.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    LG2 supports only floating-point data type modifiers.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, LIT:  Compute Lighting Coefficients
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The LIT instruction accelerates lighting computations by computing
5bd8deadSopenharmony_ci    lighting coefficients for ambient, diffuse, and specular light
5bd8deadSopenharmony_ci    contributions.  The "x" component of the single operand is assumed to hold
5bd8deadSopenharmony_ci    a diffuse dot product (n dot VP_pli, as in the vertex lighting equations
5bd8deadSopenharmony_ci    in Section 2.13.1).  The "y" component of the operand is assumed to hold a
5bd8deadSopenharmony_ci    specular dot product (n dot h_i).  The "w" component of the operand is
5bd8deadSopenharmony_ci    assumed to hold the specular exponent of the material (s_rm), and is
5bd8deadSopenharmony_ci    clamped to the range (-128, +128) exclusive.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The "x" component of the result vector receives the value that should be
5bd8deadSopenharmony_ci    multiplied by the ambient light/material product (always 1.0).  The "y"
5bd8deadSopenharmony_ci    component of the result vector receives the value that should be
5bd8deadSopenharmony_ci    multiplied by the diffuse light/material product (n dot VP_pli).  The "z"
5bd8deadSopenharmony_ci    component of the result vector receives the value that should be
5bd8deadSopenharmony_ci    multiplied by the specular light/material product (f_i * (n dot h_i) ^
5bd8deadSopenharmony_ci    s_rm).  The "w" component of the result is the constant 1.0.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Negative diffuse and specular dot products are clamped to 0.0, as is done
5bd8deadSopenharmony_ci    in the standard per-vertex lighting operations.  In addition, if the
5bd8deadSopenharmony_ci    diffuse dot product is zero or negative, the specular coefficient is
5bd8deadSopenharmony_ci    forced to zero.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp = VectorLoad(op0);
5bd8deadSopenharmony_ci      if (tmp.x < 0) tmp.x = 0;
5bd8deadSopenharmony_ci      if (tmp.y < 0) tmp.y = 0;
5bd8deadSopenharmony_ci      if (tmp.w < -(128.0-epsilon)) tmp.w = -(128.0-epsilon);
5bd8deadSopenharmony_ci      else if (tmp.w > 128-epsilon) tmp.w = 128-epsilon;
5bd8deadSopenharmony_ci      result.x = 1.0;
5bd8deadSopenharmony_ci      result.y = tmp.x;
5bd8deadSopenharmony_ci      result.z = (tmp.x > 0) ? RoughApproxPower(tmp.y, tmp.w) : 0.0;
5bd8deadSopenharmony_ci      result.w = 1.0;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Since 0^0 is defined to be 1, RoughApproxPower(0.0, 0.0) will produce 1.0.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    LIT supports only floating-point data type modifiers.
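
    A minimal Python sketch of the LIT computation follows.  The value of
    EPSILON is an arbitrary assumption, math.pow stands in for
    RoughApproxPower, and the handling of a zero base with a negative exponent
    is a choice made only so the sketch does not raise an error.

```python
import math

EPSILON = 2.0 ** -16  # assumed value; not taken from the specification


def lit(x, y, w):
    """Return the (ambient, diffuse, specular, 1.0) coefficients.

    x is the diffuse dot product, y the specular dot product and w the
    specular exponent; the z component of the operand is not used.
    """
    x = max(x, 0.0)
    y = max(y, 0.0)
    limit = 128.0 - EPSILON
    w = min(max(w, -limit), limit)
    if x <= 0.0:
        specular = 0.0
    elif y == 0.0 and w < 0.0:
        specular = math.inf  # assumption: case not covered by the text above
    else:
        specular = math.pow(y, w)  # math.pow(0.0, 0.0) is 1.0
    return (1.0, x, specular, 1.0)
```

    For instance, lit(0.5, 0.5, 2.0) yields (1.0, 0.5, 0.25, 1.0), and a
    negative diffuse dot product forces both the diffuse and specular
    coefficients to zero.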
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, LRP:  Linear Interpolation
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The LRP instruction performs a component-wise linear interpolation between
5bd8deadSopenharmony_ci    the second and third operands using the first operand as the blend factor.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = VectorLoad(op1);
5bd8deadSopenharmony_ci      tmp2 = VectorLoad(op2);
5bd8deadSopenharmony_ci      result.x = tmp0.x * tmp1.x + (1 - tmp0.x) * tmp2.x;
5bd8deadSopenharmony_ci      result.y = tmp0.y * tmp1.y + (1 - tmp0.y) * tmp2.y;
5bd8deadSopenharmony_ci      result.z = tmp0.z * tmp1.z + (1 - tmp0.z) * tmp2.z;
5bd8deadSopenharmony_ci      result.w = tmp0.w * tmp1.w + (1 - tmp0.w) * tmp2.w;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    LRP supports only floating-point data type modifiers.
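
    The operand order is easy to misread, so a short Python sketch may help:
    a blend factor of 1 selects the second operand and a blend factor of 0
    selects the third.  The function name and the use of tuples for vectors
    are assumptions of this example.

```python
def lrp(blend, a, b):
    """Component-wise blend * a + (1 - blend) * b over 4-component tuples."""
    return tuple(t * p + (1.0 - t) * q for t, p, q in zip(blend, a, b))
```

    Blending (10, 10, 10, 10) and (20, 20, 20, 20) with factors
    (0, 1, 0.5, 0.25) produces (20, 10, 15, 17.5).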
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, MAD:  Multiply and Add
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The MAD instruction performs a component-wise multiply of the first two
5bd8deadSopenharmony_ci    operands, and then does a component-wise add of the product to the third
5bd8deadSopenharmony_ci    operand to yield a result vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = VectorLoad(op1);
5bd8deadSopenharmony_ci      tmp2 = VectorLoad(op2);
5bd8deadSopenharmony_ci      result.x = tmp0.x * tmp1.x + tmp2.x;
5bd8deadSopenharmony_ci      result.y = tmp0.y * tmp1.y + tmp2.y;
5bd8deadSopenharmony_ci      result.z = tmp0.z * tmp1.z + tmp2.z;
5bd8deadSopenharmony_ci      result.w = tmp0.w * tmp1.w + tmp2.w;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The multiplication and addition operations in this instruction are subject
5bd8deadSopenharmony_ci    to the same rules as described for the MUL and ADD instructions.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    MAD supports all three data type modifiers.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, MAX:  Maximum
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The MAX instruction computes component-wise maximums of the values in the
5bd8deadSopenharmony_ci    two operands to yield a result vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = VectorLoad(op1);
5bd8deadSopenharmony_ci      result.x = (tmp0.x > tmp1.x) ? tmp0.x : tmp1.x;
5bd8deadSopenharmony_ci      result.y = (tmp0.y > tmp1.y) ? tmp0.y : tmp1.y;
5bd8deadSopenharmony_ci      result.z = (tmp0.z > tmp1.z) ? tmp0.z : tmp1.z;
5bd8deadSopenharmony_ci      result.w = (tmp0.w > tmp1.w) ? tmp0.w : tmp1.w;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    MAX supports all three data type modifiers.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, MIN:  Minimum
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The MIN instruction computes component-wise minimums of the values in the
5bd8deadSopenharmony_ci    two operands to yield a result vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = VectorLoad(op1);
5bd8deadSopenharmony_ci      result.x = (tmp0.x > tmp1.x) ? tmp1.x : tmp0.x;
5bd8deadSopenharmony_ci      result.y = (tmp0.y > tmp1.y) ? tmp1.y : tmp0.y;
5bd8deadSopenharmony_ci      result.z = (tmp0.z > tmp1.z) ? tmp1.z : tmp0.z;
5bd8deadSopenharmony_ci      result.w = (tmp0.w > tmp1.w) ? tmp1.w : tmp0.w;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    MIN supports all three data type modifiers.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, MOD:  Modulus
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The MOD instruction performs a component-wise modulus operation on the first
5bd8deadSopenharmony_ci    vector operand by the second scalar operand to produce a 4-component result
5bd8deadSopenharmony_ci    vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = ScalarLoad(op1);
5bd8deadSopenharmony_ci      result.x = tmp0.x % tmp1;
5bd8deadSopenharmony_ci      result.y = tmp0.y % tmp1;
5bd8deadSopenharmony_ci      result.z = tmp0.z % tmp1;
5bd8deadSopenharmony_ci      result.w = tmp0.w % tmp1;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    MOD supports both signed and unsigned integer data type modifiers.  If no
5bd8deadSopenharmony_ci    data type modifier is specified, both operands and the result are treated
5bd8deadSopenharmony_ci    as signed integers.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    A result component is undefined if the corresponding component of the
5bd8deadSopenharmony_ci    first operand is negative or if the second operand is less than or equal
5bd8deadSopenharmony_ci    to zero.
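
    The vector-by-scalar form and the undefined cases can be sketched in
    Python.  The function name and the use of None for undefined result
    components are assumptions of this example.

```python
def mod(vec, scalar):
    """Component-wise vec % scalar; None marks undefined components."""
    if scalar <= 0:
        return (None,) * len(vec)
    return tuple(c % scalar if c >= 0 else None for c in vec)
```

    Here mod((7, 8, 9, -1), 4) yields (3, 0, 1, None), since the last
    component of the first operand is negative.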
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, MOV:  Move
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The MOV instruction copies the value of the operand to yield a result
5bd8deadSopenharmony_ci    vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      result = VectorLoad(op0);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    MOV supports all three data type modifiers.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, MUL:  Multiply
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The MUL instruction performs a component-wise multiply of the two operands
5bd8deadSopenharmony_ci    to yield a result vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = VectorLoad(op1);
5bd8deadSopenharmony_ci      result.x = tmp0.x * tmp1.x;
5bd8deadSopenharmony_ci      result.y = tmp0.y * tmp1.y;
5bd8deadSopenharmony_ci      result.z = tmp0.z * tmp1.z;
5bd8deadSopenharmony_ci      result.w = tmp0.w * tmp1.w;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    MUL supports all three data type modifiers.  The MUL instruction
5bd8deadSopenharmony_ci    additionally supports three special modifiers.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The "S24" and "U24" modifiers specify "fast" signed or unsigned integer
5bd8deadSopenharmony_ci    multiplies of 24-bit quantities, respectively.  The results of such
5bd8deadSopenharmony_ci    multiplies are undefined if either operand is outside the range
5bd8deadSopenharmony_ci    [-2^23,+2^23-1] for S24 or [0,2^24-1] for U24.  If "S24" or "U24" is
5bd8deadSopenharmony_ci    specified, the data type is implied and normal data type modifiers may not
5bd8deadSopenharmony_ci    be provided.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The "HI" modifier specifies a 32-bit integer multiply that returns the 32
5bd8deadSopenharmony_ci    most significant bits of the 64-bit product.  Integer multiplies without
5bd8deadSopenharmony_ci    the "HI" modifier normally return the least significant bits of the
5bd8deadSopenharmony_ci    product.  If "HI" is specified, one of the "S" or "U" integer data type
5bd8deadSopenharmony_ci    modifiers must also be specified.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Note that if condition code updates are performed on integer multiplies,
5bd8deadSopenharmony_ci    the overflow or carry flags are always cleared, even if the product
5bd8deadSopenharmony_ci    overflowed.  If it is necessary to determine if the results of an integer
5bd8deadSopenharmony_ci    multiply overflowed, the MUL.HI instruction may be used.
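
    One way to picture the overflow check for unsigned operands is the
    following Python sketch.  The function names are hypothetical, and both
    inputs are assumed to lie in the range [0, 2^32-1].

```python
MASK32 = 0xFFFFFFFF


def mul_u(a, b):
    """Unsigned multiply without HI: the 32 least significant bits."""
    return (a * b) & MASK32


def mul_hi_u(a, b):
    """Unsigned multiply with HI: the 32 most significant bits of 64."""
    return ((a * b) >> 32) & MASK32


def mul_u_overflowed(a, b):
    """The unsigned product overflowed 32 bits if its high half is non-zero."""
    return mul_hi_u(a, b) != 0
```

    Multiplying 0x10000 by 0x10000 gives a low half of 0 and a high half of 1,
    so the overflow is only visible through the high half.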
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, NOT:  Bitwise Not
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The NOT instruction performs a component-wise bitwise NOT operation on the
5bd8deadSopenharmony_ci    source vector to produce a result vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp = VectorLoad(op0);
5bd8deadSopenharmony_ci      result.x = ~tmp.x;
5bd8deadSopenharmony_ci      result.y = ~tmp.y;
5bd8deadSopenharmony_ci      result.z = ~tmp.z;
5bd8deadSopenharmony_ci      result.w = ~tmp.w;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    NOT supports only integer data type modifiers.  If no type modifier is
5bd8deadSopenharmony_ci    specified, the operand and the result are treated as signed integers.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, NRM:  Normalize 3-Component Vector
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The NRM instruction normalizes the vector given by the x, y, and z
5bd8deadSopenharmony_ci    components of the vector operand to produce the x, y, and z components of
5bd8deadSopenharmony_ci    the result vector.  The w component of the result is undefined.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp = VectorLoad(op0);
5bd8deadSopenharmony_ci      scale = ApproxRSQ(tmp.x * tmp.x + tmp.y * tmp.y + tmp.z * tmp.z);
5bd8deadSopenharmony_ci      result.x = tmp.x * scale;
5bd8deadSopenharmony_ci      result.y = tmp.y * scale;
5bd8deadSopenharmony_ci      result.z = tmp.z * scale;
5bd8deadSopenharmony_ci      result.w = undefined;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    NRM supports only floating-point data type modifiers.
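
    A Python sketch of the normalization follows, with an exact reciprocal
    square root standing in for ApproxRSQ.  The function name is an
    assumption, the undefined w component is simply omitted, and a
    zero-length input raises ZeroDivisionError in this sketch.

```python
import math


def nrm(x, y, z):
    """Scale (x, y, z) to unit length; the w component is not modeled."""
    scale = 1.0 / math.sqrt(x * x + y * y + z * z)
    return (x * scale, y * scale, z * scale)
```

    For the input (3, 0, 4), the result is approximately (0.6, 0.0, 0.8).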
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, OR:  Bitwise Or
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The OR instruction performs a bitwise OR operation on the components of
5bd8deadSopenharmony_ci    the two source vectors to yield a result vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = VectorLoad(op1);
5bd8deadSopenharmony_ci      result.x = tmp0.x | tmp1.x;
5bd8deadSopenharmony_ci      result.y = tmp0.y | tmp1.y;
5bd8deadSopenharmony_ci      result.z = tmp0.z | tmp1.z;
5bd8deadSopenharmony_ci      result.w = tmp0.w | tmp1.w;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    OR supports only integer data type modifiers.  If no type modifier is
5bd8deadSopenharmony_ci    specified, both operands and the result are treated as signed integers.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, PK2H:  Pack Two 16-bit Floats
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The PK2H instruction converts the "x" and "y" components of the single
5bd8deadSopenharmony_ci    floating-point vector operand into 16-bit floating-point format, packs the
5bd8deadSopenharmony_ci    bit representation of these two floats into a 32-bit unsigned integer, and
5bd8deadSopenharmony_ci    replicates that value to all four components of the result vector.  The
5bd8deadSopenharmony_ci    PK2H instruction can be reversed by the UP2H instruction below.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      /* result obtained by combining raw bits of tmp0.x, tmp0.y after
5bd8deadSopenharmony_ci         conversion to 16-bit floating-point */
5bd8deadSopenharmony_ci      result.x = RawBits(tmp0.x) | (RawBits(tmp0.y) << 16);
5bd8deadSopenharmony_ci      result.y = RawBits(tmp0.x) | (RawBits(tmp0.y) << 16);
5bd8deadSopenharmony_ci      result.z = RawBits(tmp0.x) | (RawBits(tmp0.y) << 16);
5bd8deadSopenharmony_ci      result.w = RawBits(tmp0.x) | (RawBits(tmp0.y) << 16);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    PK2H supports all three data type modifiers.  The single operand is always
5bd8deadSopenharmony_ci    treated as a floating-point value, but the result is written as a
5bd8deadSopenharmony_ci    floating-point value, a signed integer, or an unsigned integer, as
5bd8deadSopenharmony_ci    specified by the data type modifier.  For integer results, the bits can be
5bd8deadSopenharmony_ci    interpreted as described above.  For floating-point result variables, the
5bd8deadSopenharmony_ci    packed results do not constitute a meaningful floating-point variable and
5bd8deadSopenharmony_ci    should only be used to feed future unpack instructions.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    A program will fail to load if it contains a PK2H instruction that writes
5bd8deadSopenharmony_ci    its results to a variable declared as "SHORT".
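
    The packing step can be sketched in Python using the "e" format of the
    standard struct module, which encodes IEEE 754 half precision.  Treating
    the 16-bit format used by PK2H as identical to that encoding, including
    its rounding, is an assumption of this example, as is the function name.
    struct.pack raises OverflowError for values too large for the format.

```python
import struct


def pk2h(x, y):
    """Pack x into bits 0-15 and y into bits 16-31 as 16-bit floats."""
    (lo,) = struct.unpack("<H", struct.pack("<e", x))
    (hi,) = struct.unpack("<H", struct.pack("<e", y))
    return lo | (hi << 16)
```

    Packing (1.0, -2.0) produces 0xC0003C00: 0x3C00 encodes 1.0 in the low
    half and 0xC000 encodes -2.0 in the high half.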
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, PK2US:  Pack Two Floats as Unsigned 16-bit
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The PK2US instruction converts the "x" and "y" components of the single
5bd8deadSopenharmony_ci    floating-point vector operand into a packed pair of 16-bit unsigned
5bd8deadSopenharmony_ci    scalars.  The scalars are represented in a bit pattern where all '0' bits
5bd8deadSopenharmony_ci    correspond to 0.0 and all '1' bits correspond to 1.0.  The bit
5bd8deadSopenharmony_ci    representations of the two converted components are packed into a 32-bit
5bd8deadSopenharmony_ci    unsigned integer, and that value is replicated to all four components of
5bd8deadSopenharmony_ci    the result vector.  The PK2US instruction can be reversed by the UP2US
5bd8deadSopenharmony_ci    instruction below.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      if (tmp0.x < 0.0) tmp0.x = 0.0;
5bd8deadSopenharmony_ci      if (tmp0.x > 1.0) tmp0.x = 1.0;
5bd8deadSopenharmony_ci      if (tmp0.y < 0.0) tmp0.y = 0.0;
5bd8deadSopenharmony_ci      if (tmp0.y > 1.0) tmp0.y = 1.0;
5bd8deadSopenharmony_ci      us.x = round(65535.0 * tmp0.x);  /* us is a ushort vector */
5bd8deadSopenharmony_ci      us.y = round(65535.0 * tmp0.y);
5bd8deadSopenharmony_ci      /* result obtained by combining raw bits of us. */
5bd8deadSopenharmony_ci      result.x = ((us.x) | (us.y << 16));
5bd8deadSopenharmony_ci      result.y = ((us.x) | (us.y << 16));
5bd8deadSopenharmony_ci      result.z = ((us.x) | (us.y << 16));
5bd8deadSopenharmony_ci      result.w = ((us.x) | (us.y << 16));
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    PK2US supports all three data type modifiers.  The single operand is
5bd8deadSopenharmony_ci    always treated as a floating-point value, but the result is written as a
5bd8deadSopenharmony_ci    floating-point value, a signed integer, or an unsigned integer, as
5bd8deadSopenharmony_ci    specified by the data type modifier.  For integer result variables, the
5bd8deadSopenharmony_ci    bits can be interpreted as described above.  For floating-point result
5bd8deadSopenharmony_ci    variables, the packed results do not constitute a meaningful
5bd8deadSopenharmony_ci    floating-point variable and should only be used to feed future unpack
5bd8deadSopenharmony_ci    instructions.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    A program will fail to load if it contains a PK2US instruction that writes
5bd8deadSopenharmony_ci    its results to a variable declared as "SHORT".
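
    A Python sketch of the clamp, scale, and pack steps is given below.  The
    function name is hypothetical, and rounding halfway cases upward is an
    assumption, since the pseudocode above does not say how round() treats
    ties.

```python
def pk2us(x, y):
    """Pack x and y, clamped to [0, 1], as two unsigned 16-bit values."""

    def to_ushort(v):
        v = min(max(v, 0.0), 1.0)
        return int(65535.0 * v + 0.5)  # round to nearest, ties upward

    return to_ushort(x) | (to_ushort(y) << 16)
```

    Out-of-range inputs are clamped first, so pk2us(2.0, -1.0) gives the same
    result as pk2us(1.0, 0.0), namely 0x0000FFFF.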
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, PK4B:  Pack Four Floats as Signed 8-bit
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The PK4B instruction converts the four components of the single
5bd8deadSopenharmony_ci    floating-point vector operand into 8-bit signed quantities.  The signed
5bd8deadSopenharmony_ci    quantities are represented in a bit pattern where all '0' bits correspond
5bd8deadSopenharmony_ci    to -128/127 and all '1' bits correspond to +127/127.  The bit
5bd8deadSopenharmony_ci    representations of the four converted components are packed into a 32-bit
5bd8deadSopenharmony_ci    unsigned integer, and that value is replicated to all four components of
5bd8deadSopenharmony_ci    the result vector.  The PK4B instruction can be reversed by the UP4B
5bd8deadSopenharmony_ci    instruction below.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      if (tmp0.x < -128/127) tmp0.x = -128/127;
5bd8deadSopenharmony_ci      if (tmp0.y < -128/127) tmp0.y = -128/127;
5bd8deadSopenharmony_ci      if (tmp0.z < -128/127) tmp0.z = -128/127;
5bd8deadSopenharmony_ci      if (tmp0.w < -128/127) tmp0.w = -128/127;
5bd8deadSopenharmony_ci      if (tmp0.x > +127/127) tmp0.x = +127/127;
5bd8deadSopenharmony_ci      if (tmp0.y > +127/127) tmp0.y = +127/127;
5bd8deadSopenharmony_ci      if (tmp0.z > +127/127) tmp0.z = +127/127;
5bd8deadSopenharmony_ci      if (tmp0.w > +127/127) tmp0.w = +127/127;
5bd8deadSopenharmony_ci      ub.x = round(127.0 * tmp0.x + 128.0);  /* ub is a ubyte vector */
5bd8deadSopenharmony_ci      ub.y = round(127.0 * tmp0.y + 128.0);
5bd8deadSopenharmony_ci      ub.z = round(127.0 * tmp0.z + 128.0);
5bd8deadSopenharmony_ci      ub.w = round(127.0 * tmp0.w + 128.0);
5bd8deadSopenharmony_ci      /* result obtained by combining raw bits of ub. */
5bd8deadSopenharmony_ci      result.x = ((ub.x) | (ub.y << 8) | (ub.z << 16) | (ub.w << 24));
5bd8deadSopenharmony_ci      result.y = ((ub.x) | (ub.y << 8) | (ub.z << 16) | (ub.w << 24));
5bd8deadSopenharmony_ci      result.z = ((ub.x) | (ub.y << 8) | (ub.z << 16) | (ub.w << 24));
5bd8deadSopenharmony_ci      result.w = ((ub.x) | (ub.y << 8) | (ub.z << 16) | (ub.w << 24));
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    PK4B supports all three data type modifiers.  The single operand is always
5bd8deadSopenharmony_ci    treated as a floating-point value, but the result is written as a
5bd8deadSopenharmony_ci    floating-point value, a signed integer, or an unsigned integer, as
5bd8deadSopenharmony_ci    specified by the data type modifier.  For integer result variables, the
5bd8deadSopenharmony_ci    bits can be interpreted as described above.  For floating-point result
5bd8deadSopenharmony_ci    variables, the packed results do not constitute a meaningful
5bd8deadSopenharmony_ci    floating-point variable and should only be used to feed future unpack
5bd8deadSopenharmony_ci    instructions.  A program will fail to load if it contains a PK4B
5bd8deadSopenharmony_ci    instruction that writes its results to a variable declared as "SHORT".
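
    The biased encoding may be easier to follow in a Python sketch.  The
    function name is hypothetical, and rounding halfway cases upward is an
    assumption, since the pseudocode above does not say how round() treats
    ties.

```python
def pk4b(x, y, z, w):
    """Pack four floats, clamped to [-128/127, 1], as biased 8-bit values."""

    def to_ubyte(v):
        v = min(max(v, -128.0 / 127.0), 1.0)
        return int(127.0 * v + 128.0 + 0.5)  # round to nearest, ties upward

    return (to_ubyte(x) | (to_ubyte(y) << 8)
            | (to_ubyte(z) << 16) | (to_ubyte(w) << 24))
```

    An input of 0.0 maps to the byte 0x80, so pk4b(0.0, 0.0, 0.0, 0.0) is
    0x80808080, while pk4b(1.0, -2.0, 0.0, 1.0) is 0xFF8000FF.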
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, PK4UB:  Pack Four Floats as Unsigned 8-bit
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The PK4UB instruction converts the four components of the single
5bd8deadSopenharmony_ci    floating-point vector operand into a packed grouping of 8-bit unsigned
5bd8deadSopenharmony_ci    scalars.  The scalars are represented in a bit pattern where all '0' bits
5bd8deadSopenharmony_ci    correspond to 0.0 and all '1' bits correspond to 1.0.  The bit
5bd8deadSopenharmony_ci    representations of the four converted components are packed into a 32-bit
5bd8deadSopenharmony_ci    unsigned integer, and that value is replicated to all four components of
5bd8deadSopenharmony_ci    the result vector.  The PK4UB instruction can be reversed by the UP4UB
5bd8deadSopenharmony_ci    instruction below.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      if (tmp0.x < 0.0) tmp0.x = 0.0;
5bd8deadSopenharmony_ci      if (tmp0.x > 1.0) tmp0.x = 1.0;
5bd8deadSopenharmony_ci      if (tmp0.y < 0.0) tmp0.y = 0.0;
5bd8deadSopenharmony_ci      if (tmp0.y > 1.0) tmp0.y = 1.0;
5bd8deadSopenharmony_ci      if (tmp0.z < 0.0) tmp0.z = 0.0;
5bd8deadSopenharmony_ci      if (tmp0.z > 1.0) tmp0.z = 1.0;
5bd8deadSopenharmony_ci      if (tmp0.w < 0.0) tmp0.w = 0.0;
5bd8deadSopenharmony_ci      if (tmp0.w > 1.0) tmp0.w = 1.0;
5bd8deadSopenharmony_ci      ub.x = round(255.0 * tmp0.x);  /* ub is a ubyte vector */
5bd8deadSopenharmony_ci      ub.y = round(255.0 * tmp0.y);
5bd8deadSopenharmony_ci      ub.z = round(255.0 * tmp0.z);
5bd8deadSopenharmony_ci      ub.w = round(255.0 * tmp0.w);
5bd8deadSopenharmony_ci      /* result obtained by combining raw bits of ub. */
5bd8deadSopenharmony_ci      result.x = ((ub.x) | (ub.y << 8) | (ub.z << 16) | (ub.w << 24));
5bd8deadSopenharmony_ci      result.y = ((ub.x) | (ub.y << 8) | (ub.z << 16) | (ub.w << 24));
5bd8deadSopenharmony_ci      result.z = ((ub.x) | (ub.y << 8) | (ub.z << 16) | (ub.w << 24));
5bd8deadSopenharmony_ci      result.w = ((ub.x) | (ub.y << 8) | (ub.z << 16) | (ub.w << 24));
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    PK4UB supports all three data type modifiers.  The single operand is
5bd8deadSopenharmony_ci    always treated as a floating-point value, but the result is written as a
5bd8deadSopenharmony_ci    floating-point value, a signed integer, or an unsigned integer, as
5bd8deadSopenharmony_ci    specified by the data type modifier.  For integer result variables, the
5bd8deadSopenharmony_ci    bits can be interpreted as described above.  For floating-point result
5bd8deadSopenharmony_ci    variables, the packed results do not constitute a meaningful
5bd8deadSopenharmony_ci    floating-point variable and should only be used to feed future unpack
5bd8deadSopenharmony_ci    instructions.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    A program will fail to load if it contains a PK4UB instruction that writes
5bd8deadSopenharmony_ci    its results to a variable declared as "SHORT".
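
    A Python sketch of the same steps for four unsigned bytes follows.  The
    function name is hypothetical, and rounding halfway cases upward is an
    assumption, since the pseudocode above does not say how round() treats
    ties.

```python
def pk4ub(x, y, z, w):
    """Pack four floats, clamped to [0, 1], as unsigned 8-bit values."""

    def to_ubyte(v):
        v = min(max(v, 0.0), 1.0)
        return int(255.0 * v + 0.5)  # round to nearest, ties upward

    return (to_ubyte(x) | (to_ubyte(y) << 8)
            | (to_ubyte(z) << 16) | (to_ubyte(w) << 24))
```

    The x component lands in the least significant byte, so
    pk4ub(1.0, 0.0, 0.0, 0.0) is 0x000000FF and pk4ub(0.0, 1.0, 0.0, 1.0) is
    0xFF00FF00.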
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, POW:  Exponentiate
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The POW instruction approximates the value of the first scalar operand
5bd8deadSopenharmony_ci    raised to the power of the second scalar operand and replicates it to all
5bd8deadSopenharmony_ci    four components of the result vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = ScalarLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = ScalarLoad(op1);
5bd8deadSopenharmony_ci      result.x = ApproxPower(tmp0, tmp1);
5bd8deadSopenharmony_ci      result.y = ApproxPower(tmp0, tmp1);
5bd8deadSopenharmony_ci      result.z = ApproxPower(tmp0, tmp1);
5bd8deadSopenharmony_ci      result.w = ApproxPower(tmp0, tmp1);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The exponentiation approximation function may be implemented using the
5bd8deadSopenharmony_ci    base 2 exponentiation and logarithm approximation operations in the EX2
5bd8deadSopenharmony_ci    and LG2 instructions.  In particular,
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      ApproxPower(a,b) = ApproxExp2(b * ApproxLog2(a)).
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Note that a logarithm may be involved even for cases where the exponent is
5bd8deadSopenharmony_ci    an integer.  This means that it may not be possible to exponentiate
5bd8deadSopenharmony_ci    correctly with a negative base.  In contrast, it is possible in a
5bd8deadSopenharmony_ci    "normal" mathematical formulation to raise negative numbers to integral
5bd8deadSopenharmony_ci    powers (e.g., (-3)^2 == 9, and (-0.5)^-2 == 4).
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    POW supports only floating-point data type modifiers.
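
    The following Python sketch models the exp2/log2 formulation above and
    shows why a negative base cannot be handled.  It is an illustration under
    stated assumptions:  approx_power is a hypothetical name, exact library
    functions stand in for the approximations, and returning NaN for zero and
    negative bases is a choice made by this sketch, not behavior required by
    the extension.

```python
import math

def approx_power(a, b):
    """Model ApproxPower(a, b) = ApproxExp2(b * ApproxLog2(a))."""
    if a <= 0.0:
        # log2 has no real value for a negative base; this sketch also
        # returns NaN for a zero base (an assumption, not spec behavior).
        return math.nan
    return 2.0 ** (b * math.log2(a))
```

    With this model, approx_power(2.0, 3.0) is 8.0, while
    approx_power(-3.0, 2.0) is NaN even though (-3)^2 == 9 mathematically.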
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, RCC:  Reciprocal (Clamped)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The RCC instruction approximates the reciprocal of the scalar operand,
5bd8deadSopenharmony_ci    clamps the result to one of two ranges, and replicates the clamped result
5bd8deadSopenharmony_ci    to all four components of the result vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If the approximated reciprocal is greater than zero, the result is clamped
5bd8deadSopenharmony_ci    to the range [2^-64, 2^+64].  If the approximated reciprocal is not greater
5bd8deadSopenharmony_ci    than zero, the result is clamped to the range [-2^+64, -2^-64].
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp = ScalarLoad(op0);
5bd8deadSopenharmony_ci      result.x = ClampApproxReciprocal(tmp);
5bd8deadSopenharmony_ci      result.y = ClampApproxReciprocal(tmp);
5bd8deadSopenharmony_ci      result.z = ClampApproxReciprocal(tmp);
5bd8deadSopenharmony_ci      result.w = ClampApproxReciprocal(tmp);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    RCC supports only floating-point data type modifiers.
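
    ClampApproxReciprocal is not spelled out in the pseudocode above.  One
    possible reading of the clamping rule is sketched below in Python.  The
    function name is hypothetical, an exact division stands in for the
    approximation, and treating a zero operand as an infinite reciprocal with
    the sign of the zero is an assumption of this sketch.

```python
import math

def clamp_approx_reciprocal(x):
    """Model RCC: reciprocal clamped to +/-[2^-64, 2^+64]."""
    lo, hi = 2.0 ** -64, 2.0 ** 64
    # Assumption: 1/(+0) is +infinity and 1/(-0) is -infinity.
    r = math.copysign(math.inf, x) if x == 0.0 else 1.0 / x
    if r > 0.0:
        return min(max(r, lo), hi)     # positive range [2^-64, 2^+64]
    return max(min(r, -lo), -hi)       # negative range [-2^+64, -2^-64]
```

    Because the result magnitude never falls outside [2^-64, 2^+64], a
    program can multiply by it without producing an infinity from a zero
    operand.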
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, RCP:  Reciprocal
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The RCP instruction approximates the reciprocal of the scalar operand and
5bd8deadSopenharmony_ci    replicates it to all four components of the result vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp = ScalarLoad(op0);
5bd8deadSopenharmony_ci      result.x = ApproxReciprocal(tmp);
5bd8deadSopenharmony_ci      result.y = ApproxReciprocal(tmp);
5bd8deadSopenharmony_ci      result.z = ApproxReciprocal(tmp);
5bd8deadSopenharmony_ci      result.w = ApproxReciprocal(tmp);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    RCP supports only floating-point data type modifiers.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, REP:  Start of Repeat Block
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The REP instruction begins a REP/ENDREP block.  The REP instruction
5bd8deadSopenharmony_ci    supports an optional operand whose x component specifies the initial value
5bd8deadSopenharmony_ci    for the loop count.  The loop count indicates the number of times the
5bd8deadSopenharmony_ci    instructions between the REP and corresponding ENDREP instruction will be
5bd8deadSopenharmony_ci    executed.  If the initial value of the loop count is not positive, the
5bd8deadSopenharmony_ci    entire block is skipped and execution continues at the instruction
5bd8deadSopenharmony_ci    following the corresponding ENDREP instruction.  If the loop count is
5bd8deadSopenharmony_ci    specified as a floating-point value, it is converted to the largest
5bd8deadSopenharmony_ci    integer less than or equal to the specified value (i.e., taking its
5bd8deadSopenharmony_ci    floor).
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If no operand is provided to REP, the loop count is ignored and the
5bd8deadSopenharmony_ci    corresponding ENDREP instruction unconditionally transfers control to the
5bd8deadSopenharmony_ci    instruction immediately following the REP instruction.  The only way to
5bd8deadSopenharmony_ci    exit such a loop is with the BRK instruction.  To prevent obvious infinite
5bd8deadSopenharmony_ci    loops, a program that includes a REP/ENDREP block with no loop count will
5bd8deadSopenharmony_ci    fail to compile unless it contains either a BRK instruction at the current
5bd8deadSopenharmony_ci    nesting level or a RET instruction at any nesting level.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Implementations may have a limited ability to nest REP/ENDREP blocks.  If
5bd8deadSopenharmony_ci    the number of REP/ENDREP blocks nested inside each other is
5bd8deadSopenharmony_ci    MAX_PROGRAM_LOOP_DEPTH_NV or higher, a program will fail to compile.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      // Set up loop information for the new nesting level.
5bd8deadSopenharmony_ci      tmp = VectorLoad(op0);
5bd8deadSopenharmony_ci      LoopCount = floor(tmp.x);
5bd8deadSopenharmony_ci      if (LoopCount <= 0) {
5bd8deadSopenharmony_ci        continue execution at the corresponding ENDREP;
5bd8deadSopenharmony_ci      }
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    REP supports all three data type modifiers.  The single operand is
5bd8deadSopenharmony_ci    interpreted according to the data type modifier.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (Note:  Unlike the NV_fragment_program2 extension, REP blocks in this
5bd8deadSopenharmony_ci    extension support fully general looping; the specified loop count can be
5bd8deadSopenharmony_ci    computed in the program itself.  Additionally, there is no run-time limit
5bd8deadSopenharmony_ci    on the maximum overall depth of REP/ENDREP nesting.  As long as each
5bd8deadSopenharmony_ci    individual subroutine of the program obeys the static nesting limits,
5bd8deadSopenharmony_ci    there will be no run-time errors in the program.  With the
5bd8deadSopenharmony_ci    NV_fragment_program2 extension, a program could terminate abnormally if it
5bd8deadSopenharmony_ci    called a subroutine inside a deeply nested set of REP/ENDREP blocks and
5bd8deadSopenharmony_ci    the called subroutine also contained deeply nested REP/ENDREP blocks.
5bd8deadSopenharmony_ci    Such an error could occur even if neither subroutine exceeded static
5bd8deadSopenharmony_ci    limits.)
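
    As an illustration of the loop count rules, the Python sketch below
    computes how many times the body of a REP/ENDREP block with an explicit
    loop count would execute, assuming no BRK or RET instruction exits the
    loop early.  The name rep_iterations is hypothetical.

```python
import math

def rep_iterations(count_x):
    """Number of times a REP/ENDREP body runs for a given loop count."""
    loop_count = math.floor(count_x)   # floating-point counts take their floor
    # A non-positive count skips the block entirely.
    return loop_count if loop_count > 0 else 0
```

    For example, a floating-point loop count of 3.7 executes the body three
    times, and a loop count of 0.9 skips the block.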
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, RET:  Subroutine Return
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The RET instruction conditionally returns from a subroutine initiated by a
5bd8deadSopenharmony_ci    CAL instruction by popping an instruction reference off the top of the
5bd8deadSopenharmony_ci    call stack and transferring control to the referenced instruction.  The
5bd8deadSopenharmony_ci    following pseudocode describes the operation of the instruction:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      if (TestCC(cc.c***) || TestCC(cc.*c**) ||
5bd8deadSopenharmony_ci          TestCC(cc.**c*) || TestCC(cc.***c)) {
5bd8deadSopenharmony_ci        if (callStackDepth <= 0) {
5bd8deadSopenharmony_ci          // terminate program
5bd8deadSopenharmony_ci        } else {
5bd8deadSopenharmony_ci          callStackDepth--;
5bd8deadSopenharmony_ci          instruction = callStack[callStackDepth];
5bd8deadSopenharmony_ci        }
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        // continue execution at <instruction>
5bd8deadSopenharmony_ci      } else {
5bd8deadSopenharmony_ci        // do nothing
5bd8deadSopenharmony_ci      }
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    In the pseudocode, <callStackDepth> is the depth of the call stack,
5bd8deadSopenharmony_ci    <callStack> is an array holding the call stack, and <instruction> is a
5bd8deadSopenharmony_ci    reference to an instruction previously pushed onto the call stack.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If the call stack is empty when RET executes, the program terminates
5bd8deadSopenharmony_ci    normally.
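
    The control flow of RET can be modeled with an ordinary list used as the
    call stack.  The Python sketch below is illustrative only:  execute_ret
    and its parameters are hypothetical names, cc_passes stands for the
    combined condition code test, and None is used here to signal program
    termination.

```python
def execute_ret(cc_passes, call_stack, next_instruction):
    """Model RET; return the instruction to run next, or None to terminate."""
    if not cc_passes:
        return next_instruction    # condition failed: RET does nothing
    if not call_stack:
        return None                # empty call stack: normal termination
    return call_stack.pop()        # resume at the reference pushed by CAL
```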
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, RFL:  Reflection Vector
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The RFL instruction computes the reflection of the second vector operand
5bd8deadSopenharmony_ci    (the "direction" vector) about the vector specified by the first vector
5bd8deadSopenharmony_ci    operand (the "axis" vector).  Both operands are treated as 3D vectors (the
5bd8deadSopenharmony_ci    w components are ignored).  The result vector is another 3D vector (the
5bd8deadSopenharmony_ci    "reflected direction" vector).  The length of the result vector, ignoring
5bd8deadSopenharmony_ci    rounding errors, should equal that of the second operand.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      axis = VectorLoad(op0);
5bd8deadSopenharmony_ci      direction = VectorLoad(op1);
5bd8deadSopenharmony_ci      tmp.w = (axis.x * axis.x + axis.y * axis.y + axis.z * axis.z);
5bd8deadSopenharmony_ci      tmp.x = (axis.x * direction.x + axis.y * direction.y +
5bd8deadSopenharmony_ci               axis.z * direction.z);
5bd8deadSopenharmony_ci      tmp.x = 2.0 * tmp.x;
5bd8deadSopenharmony_ci      tmp.x = tmp.x / tmp.w;
5bd8deadSopenharmony_ci      result.x = tmp.x * axis.x - direction.x;
5bd8deadSopenharmony_ci      result.y = tmp.x * axis.y - direction.y;
5bd8deadSopenharmony_ci      result.z = tmp.x * axis.z - direction.z;
5bd8deadSopenharmony_ci      result.w = undefined;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    RFL supports only floating-point data type modifiers.
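
    The RFL pseudocode translates directly into the Python sketch below,
    which may help in checking results by hand.  The name reflect is
    hypothetical, and the sketch does not guard against a zero-length axis,
    which would cause a division by zero.

```python
def reflect(axis, direction):
    """Reflect a 3D direction about an axis, following the RFL pseudocode."""
    ax, ay, az = axis
    dx, dy, dz = direction
    axis_len_sq = ax * ax + ay * ay + az * az
    scale = 2.0 * (ax * dx + ay * dy + az * dz) / axis_len_sq
    return (scale * ax - dx, scale * ay - dy, scale * az - dz)
```

    Note that the axis does not need to be normalized, since the division by
    the squared axis length compensates for its magnitude.  For example,
    reflect((0.0, 2.0, 0.0), (1.0, 1.0, 0.0)) gives (-1.0, 1.0, 0.0).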
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, ROUND:  Round to Nearest Integer
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The ROUND instruction loads a single vector operand and performs a
5bd8deadSopenharmony_ci    component-wise round operation to generate a result vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp = VectorLoad(op0);
5bd8deadSopenharmony_ci      result.x = round(tmp.x);
5bd8deadSopenharmony_ci      result.y = round(tmp.y);
5bd8deadSopenharmony_ci      result.z = round(tmp.z);
5bd8deadSopenharmony_ci      result.w = round(tmp.w);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The round operation returns the nearest integer to the operand.  If the
5bd8deadSopenharmony_ci    fractional portion of the operand is 0.5, round() selects the nearest even
5bd8deadSopenharmony_ci    integer.  For example, round(-1.7) = -2.0, round(+1.0) = +1.0,
5bd8deadSopenharmony_ci    round(+3.7) = +4.0, and round(+2.5) = +2.0.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    ROUND supports all three data type modifiers.  The single operand is
5bd8deadSopenharmony_ci    always treated as a floating-point value, but the result is written as a
5bd8deadSopenharmony_ci    floating-point value, a signed integer, or an unsigned integer, as
5bd8deadSopenharmony_ci    specified by the data type modifier.  If a value is not exactly
5bd8deadSopenharmony_ci    representable using the data type of the result (e.g., an overflow or
5bd8deadSopenharmony_ci    writing a negative value to an unsigned integer), the result is undefined.
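
    The round-to-nearest-even rule matches the behavior of Python's built-in
    round() for finite inputs, which makes it convenient for checking
    expected values.  The sketch below is illustrative only; the name
    round_components is hypothetical, and the sketch covers only the
    floating-point result case.

```python
def round_components(vec):
    """Component-wise round to nearest, ties to even, returned as floats."""
    return tuple(float(round(c)) for c in vec)
```

    For example, 0.5 and 2.5 round to 0.0 and 2.0, while 1.5 rounds to 2.0.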
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, RSQ:  Reciprocal Square Root
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The RSQ instruction approximates the reciprocal of the square root of the
5bd8deadSopenharmony_ci    scalar operand and replicates it to all four components of the result
5bd8deadSopenharmony_ci    vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp = ScalarLoad(op0);
5bd8deadSopenharmony_ci      result.x = ApproxRSQRT(tmp);
5bd8deadSopenharmony_ci      result.y = ApproxRSQRT(tmp);
5bd8deadSopenharmony_ci      result.z = ApproxRSQRT(tmp);
5bd8deadSopenharmony_ci      result.w = ApproxRSQRT(tmp);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If the operand is less than or equal to zero, the results of the
5bd8deadSopenharmony_ci    instruction are undefined.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    RSQ supports only floating-point data type modifiers.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Note that this instruction differs from the RSQ instruction in
5bd8deadSopenharmony_ci    ARB_vertex_program in that it does not implicitly take the absolute value
5bd8deadSopenharmony_ci    of its operand.  The |abs| operator can be used to achieve equivalent
5bd8deadSopenharmony_ci    semantics.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, SAD:  Sum of Absolute Differences
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The SAD instruction performs a component-wise difference of the first two
5bd8deadSopenharmony_ci    integer operands (subtracting the second from the first), and then does a
5bd8deadSopenharmony_ci    component-wise add of the absolute value of the difference to the third
5bd8deadSopenharmony_ci    unsigned integer operand to yield an unsigned integer result vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = VectorLoad(op1);
5bd8deadSopenharmony_ci      tmp2 = VectorLoad(op2);
5bd8deadSopenharmony_ci      result.x = abs(tmp0.x - tmp1.x) + tmp2.x;
5bd8deadSopenharmony_ci      result.y = abs(tmp0.y - tmp1.y) + tmp2.y;
5bd8deadSopenharmony_ci      result.z = abs(tmp0.z - tmp1.z) + tmp2.z;
5bd8deadSopenharmony_ci      result.w = abs(tmp0.w - tmp1.w) + tmp2.w;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    SAD supports signed and unsigned integer data type modifiers.  The first
5bd8deadSopenharmony_ci    two operands are interpreted according to the data type modifier.  The
5bd8deadSopenharmony_ci    third operand and the result are always unsigned integers.
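
    A per-component model of SAD is sketched below in Python.  The names sad
    and MASK32 are hypothetical.  The sketch assumes 32-bit components and
    wraps the sum modulo 2^32; the wrap-around is an assumption of this
    sketch, since overflow behavior is not described above.

```python
MASK32 = 0xFFFFFFFF

def sad(a, b, acc):
    """abs(a - b) + acc for one component, wrapped to 32 bits (assumed)."""
    # a and b are the operand values as interpreted by the data type
    # modifier; acc is the unsigned third operand.
    return (abs(a - b) + acc) & MASK32
```

    A typical use is accumulating a running total across several SAD
    instructions by feeding each result back in as the third operand.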
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, SCS:  Sine/Cosine without Reduction
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The SCS instruction approximates the trigonometric sine and cosine of the
5bd8deadSopenharmony_ci    angle specified by the scalar operand and places the cosine in the x
5bd8deadSopenharmony_ci    component and the sine in the y component of the result vector.  The z and
5bd8deadSopenharmony_ci    w components of the result vector are undefined.  The angle is specified
5bd8deadSopenharmony_ci    in radians and must be in the range [-PI,PI].
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp = ScalarLoad(op0);
5bd8deadSopenharmony_ci      result.x = ApproxCosine(tmp);
5bd8deadSopenharmony_ci      result.y = ApproxSine(tmp);
5bd8deadSopenharmony_ci      result.z = undefined;
5bd8deadSopenharmony_ci      result.w = undefined;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If the scalar operand is not in the range [-PI,PI], the result vector is
5bd8deadSopenharmony_ci    undefined.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    SCS supports only floating-point data type modifiers.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, SEQ:  Set on Equal
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The SEQ instruction performs a component-wise comparison of the two
5bd8deadSopenharmony_ci    operands.  Each component of the result vector returns a TRUE value
5bd8deadSopenharmony_ci    (described below) if the corresponding component of the first operand is
5bd8deadSopenharmony_ci    equal to that of the second, and a FALSE value otherwise.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = VectorLoad(op1);
5bd8deadSopenharmony_ci      result.x = (tmp0.x == tmp1.x) ? TRUE : FALSE;
5bd8deadSopenharmony_ci      result.y = (tmp0.y == tmp1.y) ? TRUE : FALSE;
5bd8deadSopenharmony_ci      result.z = (tmp0.z == tmp1.z) ? TRUE : FALSE;
5bd8deadSopenharmony_ci      result.w = (tmp0.w == tmp1.w) ? TRUE : FALSE;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    SEQ supports all data type modifiers.  For floating-point data types, the
5bd8deadSopenharmony_ci    TRUE value is 1.0 and the FALSE value is 0.0.  For signed integer data
5bd8deadSopenharmony_ci    types, the TRUE value is -1 and the FALSE value is 0.  For unsigned
5bd8deadSopenharmony_ci    integer data types, the TRUE value is the maximum integer value (all bits
5bd8deadSopenharmony_ci    are ones) and the FALSE value is zero.
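
    The TRUE and FALSE encodings are shared by the other "Set on"
    instructions.  The Python sketch below models them for SEQ; the names
    TRUE_FALSE and seq are hypothetical, and the unsigned TRUE value assumes
    32-bit components.

```python
TRUE_FALSE = {
    "float": (1.0, 0.0),
    "signed": (-1, 0),
    "unsigned": (0xFFFFFFFF, 0),   # all bits set, assuming 32-bit components
}

def seq(op0, op1, data_type="float"):
    """Component-wise Set on Equal for two 4-component vectors."""
    true_val, false_val = TRUE_FALSE[data_type]
    return tuple(true_val if a == b else false_val for a, b in zip(op0, op1))
```

    The signed and unsigned TRUE values have the same bit pattern (all ones),
    so an integer comparison result can be used directly as a mask in a
    bitwise AND.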
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, SFL:  Set on False
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The SFL instruction is a degenerate case of the other "Set on"
5bd8deadSopenharmony_ci    instructions that sets all components of the result vector to a FALSE
5bd8deadSopenharmony_ci    value (described below).
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      result.x = FALSE;
5bd8deadSopenharmony_ci      result.y = FALSE;
5bd8deadSopenharmony_ci      result.z = FALSE;
5bd8deadSopenharmony_ci      result.w = FALSE;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    SFL supports all data type modifiers.  For floating-point data types, the
5bd8deadSopenharmony_ci    FALSE value is 0.0.  For signed and unsigned integer data types, the FALSE
5bd8deadSopenharmony_ci    value is zero.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, SGE:  Set on Greater Than or Equal
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The SGE instruction performs a component-wise comparison of the two
5bd8deadSopenharmony_ci    operands.  Each component of the result vector returns a TRUE value
5bd8deadSopenharmony_ci    (described below) if the corresponding component of the first operand is
5bd8deadSopenharmony_ci    greater than or equal to that of the second, and a FALSE value otherwise.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = VectorLoad(op1);
5bd8deadSopenharmony_ci      result.x = (tmp0.x >= tmp1.x) ? TRUE : FALSE;
5bd8deadSopenharmony_ci      result.y = (tmp0.y >= tmp1.y) ? TRUE : FALSE;
5bd8deadSopenharmony_ci      result.z = (tmp0.z >= tmp1.z) ? TRUE : FALSE;
5bd8deadSopenharmony_ci      result.w = (tmp0.w >= tmp1.w) ? TRUE : FALSE;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    SGE supports all data type modifiers.  For floating-point data types, the
5bd8deadSopenharmony_ci    TRUE value is 1.0 and the FALSE value is 0.0.  For signed integer data
5bd8deadSopenharmony_ci    types, the TRUE value is -1 and the FALSE value is 0.  For unsigned
5bd8deadSopenharmony_ci    integer data types, the TRUE value is the maximum integer value (all bits
5bd8deadSopenharmony_ci    are ones) and the FALSE value is zero.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, SGT:  Set on Greater Than
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The SGT instruction performs a component-wise comparison of the two
5bd8deadSopenharmony_ci    operands.  Each component of the result vector returns a TRUE value
5bd8deadSopenharmony_ci    (described below) if the corresponding component of the first operand is
5bd8deadSopenharmony_ci    greater than that of the second, and a FALSE value otherwise.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = VectorLoad(op1);
5bd8deadSopenharmony_ci      result.x = (tmp0.x > tmp1.x) ? TRUE : FALSE;
5bd8deadSopenharmony_ci      result.y = (tmp0.y > tmp1.y) ? TRUE : FALSE;
5bd8deadSopenharmony_ci      result.z = (tmp0.z > tmp1.z) ? TRUE : FALSE;
5bd8deadSopenharmony_ci      result.w = (tmp0.w > tmp1.w) ? TRUE : FALSE;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    SGT supports all data type modifiers.  For floating-point data types, the
5bd8deadSopenharmony_ci    TRUE value is 1.0 and the FALSE value is 0.0.  For signed integer data
5bd8deadSopenharmony_ci    types, the TRUE value is -1 and the FALSE value is 0.  For unsigned
5bd8deadSopenharmony_ci    integer data types, the TRUE value is the maximum integer value (all bits
5bd8deadSopenharmony_ci    are ones) and the FALSE value is zero.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, SHL:  Shift Left
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The SHL instruction performs a component-wise left shift of the bits of
5bd8deadSopenharmony_ci    the first operand by the value of the second scalar operand to produce a
5bd8deadSopenharmony_ci    result vector.  The bits vacated during the shift operation are filled
5bd8deadSopenharmony_ci    with zeroes.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = ScalarLoad(op1);
5bd8deadSopenharmony_ci      result.x = tmp0.x << tmp1;
5bd8deadSopenharmony_ci      result.y = tmp0.y << tmp1;
5bd8deadSopenharmony_ci      result.z = tmp0.z << tmp1;
5bd8deadSopenharmony_ci      result.w = tmp0.w << tmp1;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The results of a shift operation ("<<") are undefined if the value of the
5bd8deadSopenharmony_ci    second operand is negative, or greater than or equal to the number of bits
5bd8deadSopenharmony_ci    in the first operand.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    SHL supports both signed and unsigned integer data type modifiers.  If no
5bd8deadSopenharmony_ci    modifier is provided, the operands and the result are treated as signed
5bd8deadSopenharmony_ci    integers.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, SHR:  Shift Right
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The SHR instruction performs a component-wise right shift of the bits of
5bd8deadSopenharmony_ci    the first operand by the value of the second scalar operand to produce a
5bd8deadSopenharmony_ci    result vector.  The bits vacated during the shift operation are filled
5bd8deadSopenharmony_ci    with zeroes if the corresponding component of the first operand is
5bd8deadSopenharmony_ci    non-negative and with ones otherwise.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = ScalarLoad(op1);
5bd8deadSopenharmony_ci      result.x = tmp0.x >> tmp1;
5bd8deadSopenharmony_ci      result.y = tmp0.y >> tmp1;
5bd8deadSopenharmony_ci      result.z = tmp0.z >> tmp1;
5bd8deadSopenharmony_ci      result.w = tmp0.w >> tmp1;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The results of a shift operation (">>") are undefined if the value of the
5bd8deadSopenharmony_ci    second operand is negative, or greater than or equal to the number of bits
5bd8deadSopenharmony_ci    in the first operand.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    SHR supports both signed and unsigned integer data type modifiers.  If no
5bd8deadSopenharmony_ci    modifiers are provided, the operands and the result are treated as signed
5bd8deadSopenharmony_ci    integers.
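
    The difference between signed and unsigned right shifts can be seen in
    the Python sketch below, which operates on one component and returns the
    resulting bit pattern.  The name shr is hypothetical, components are
    assumed to be 32 bits wide, and raising an exception for an out-of-range
    shift count is a choice of this sketch (the extension leaves such results
    undefined).

```python
def shr(value, shift, signed=True):
    """Right-shift one 32-bit component; returns the result bit pattern."""
    if not 0 <= shift < 32:
        raise ValueError("shift count outside [0, 31]: result is undefined")
    bits = value & 0xFFFFFFFF
    if signed and bits & 0x80000000:
        bits -= 1 << 32            # reinterpret as a negative value
    # Python's >> on a negative integer fills vacated bits with ones.
    return (bits >> shift) & 0xFFFFFFFF
```

    For example, shifting 0x80000000 right by 4 gives 0xF8000000 when the
    operand is treated as signed and 0x08000000 when it is treated as
    unsigned.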
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, SIN:  Sine with Reduction to [-PI,PI]
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The SIN instruction approximates the trigonometric sine of the angle
5bd8deadSopenharmony_ci    specified by the scalar operand and replicates it to all four components
5bd8deadSopenharmony_ci    of the result vector.  The angle is specified in radians and does not have
5bd8deadSopenharmony_ci    to be in the range [-PI,PI].
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp = ScalarLoad(op0);
5bd8deadSopenharmony_ci      result.x = ApproxSine(tmp);
5bd8deadSopenharmony_ci      result.y = ApproxSine(tmp);
5bd8deadSopenharmony_ci      result.z = ApproxSine(tmp);
5bd8deadSopenharmony_ci      result.w = ApproxSine(tmp);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    SIN supports only floating-point data type modifiers.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, SLE:  Set on Less Than or Equal
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The SLE instruction performs a component-wise comparison of the two
5bd8deadSopenharmony_ci    operands.  Each component of the result vector returns a TRUE value
5bd8deadSopenharmony_ci    (described below) if the corresponding component of the first operand is
5bd8deadSopenharmony_ci    less than or equal to that of the second, and a FALSE value otherwise.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = VectorLoad(op1);
5bd8deadSopenharmony_ci      result.x = (tmp0.x <= tmp1.x) ? TRUE : FALSE;
5bd8deadSopenharmony_ci      result.y = (tmp0.y <= tmp1.y) ? TRUE : FALSE;
5bd8deadSopenharmony_ci      result.z = (tmp0.z <= tmp1.z) ? TRUE : FALSE;
5bd8deadSopenharmony_ci      result.w = (tmp0.w <= tmp1.w) ? TRUE : FALSE;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    SLE supports all data type modifiers.  For floating-point data types, the
5bd8deadSopenharmony_ci    TRUE value is 1.0 and the FALSE value is 0.0.  For signed integer data
5bd8deadSopenharmony_ci    types, the TRUE value is -1 and the FALSE value is 0.  For unsigned
5bd8deadSopenharmony_ci    integer data types, the TRUE value is the maximum integer value (all bits
5bd8deadSopenharmony_ci    are ones) and the FALSE value is zero.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, SLT:  Set on Less Than
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The SLT instruction performs a component-wise comparison of the two
5bd8deadSopenharmony_ci    operands.  Each component of the result vector returns a TRUE value
5bd8deadSopenharmony_ci    (described below) if the corresponding component of the first operand is
5bd8deadSopenharmony_ci    less than that of the second, and a FALSE value otherwise.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = VectorLoad(op1);
5bd8deadSopenharmony_ci      result.x = (tmp0.x < tmp1.x) ? TRUE : FALSE;
5bd8deadSopenharmony_ci      result.y = (tmp0.y < tmp1.y) ? TRUE : FALSE;
5bd8deadSopenharmony_ci      result.z = (tmp0.z < tmp1.z) ? TRUE : FALSE;
5bd8deadSopenharmony_ci      result.w = (tmp0.w < tmp1.w) ? TRUE : FALSE;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    SLT supports all data type modifiers.  For floating-point data types, the
5bd8deadSopenharmony_ci    TRUE value is 1.0 and the FALSE value is 0.0.  For signed integer data
5bd8deadSopenharmony_ci    types, the TRUE value is -1 and the FALSE value is 0.  For unsigned
5bd8deadSopenharmony_ci    integer data types, the TRUE value is the maximum integer value (all bits
5bd8deadSopenharmony_ci    are ones) and the FALSE value is zero.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, SNE:  Set on Not Equal
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The SNE instruction performs a component-wise comparison of the two
5bd8deadSopenharmony_ci    operands.  Each component of the result vector returns a TRUE value
5bd8deadSopenharmony_ci    (described below) if the corresponding component of the first operand is
5bd8deadSopenharmony_ci    not equal to that of the second, and a FALSE value otherwise.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = VectorLoad(op1);
5bd8deadSopenharmony_ci      result.x = (tmp0.x != tmp1.x) ? TRUE : FALSE;
5bd8deadSopenharmony_ci      result.y = (tmp0.y != tmp1.y) ? TRUE : FALSE;
5bd8deadSopenharmony_ci      result.z = (tmp0.z != tmp1.z) ? TRUE : FALSE;
5bd8deadSopenharmony_ci      result.w = (tmp0.w != tmp1.w) ? TRUE : FALSE;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    SNE supports all data type modifiers.  For floating-point data types, the
5bd8deadSopenharmony_ci    TRUE value is 1.0 and the FALSE value is 0.0.  For signed integer data
5bd8deadSopenharmony_ci    types, the TRUE value is -1 and the FALSE value is 0.  For unsigned
5bd8deadSopenharmony_ci    integer data types, the TRUE value is the maximum integer value (all bits
5bd8deadSopenharmony_ci    are ones) and the FALSE value is zero.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, SSG:  Set Sign
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The SSG instruction generates a result vector containing the signs of
5bd8deadSopenharmony_ci    each component of the single vector operand.  Each component of the
5bd8deadSopenharmony_ci    result vector is 1.0 if the corresponding component of the operand
5bd8deadSopenharmony_ci    is greater than zero, 0.0 if the corresponding component of the
5bd8deadSopenharmony_ci    operand is equal to zero, and -1.0 if the corresponding component
5bd8deadSopenharmony_ci    of the operand is less than zero.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp = VectorLoad(op0);
5bd8deadSopenharmony_ci      result.x = SetSign(tmp.x);
5bd8deadSopenharmony_ci      result.y = SetSign(tmp.y);
5bd8deadSopenharmony_ci      result.z = SetSign(tmp.z);
5bd8deadSopenharmony_ci      result.w = SetSign(tmp.w);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    SSG supports only floating-point data type modifiers.
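
    SetSign is not defined in the pseudocode above; following the prose
    description, it can be modeled per component as in the Python sketch
    below.  The name set_sign is hypothetical, and the handling of NaN
    (which falls through to 0.0 here) is an artifact of this sketch rather
    than specified behavior.

```python
def set_sign(c):
    """Return 1.0, 0.0, or -1.0 according to the sign of c."""
    if c > 0.0:
        return 1.0
    if c < 0.0:
        return -1.0
    return 0.0
```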
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, STR:  Set on True
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The STR instruction is a degenerate case of the other "Set on"
5bd8deadSopenharmony_ci    instructions that sets all components of the result vector to a TRUE value
5bd8deadSopenharmony_ci    (described below).
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      result.x = TRUE;
5bd8deadSopenharmony_ci      result.y = TRUE;
5bd8deadSopenharmony_ci      result.z = TRUE;
5bd8deadSopenharmony_ci      result.w = TRUE;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    STR supports all data type modifiers.  For floating-point data types, the
5bd8deadSopenharmony_ci    TRUE value is 1.0.  For signed integer data types, the TRUE value is -1.
5bd8deadSopenharmony_ci    For unsigned integer data types, the TRUE value is the maximum integer
5bd8deadSopenharmony_ci    value (all bits are ones).
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, SUB:  Subtract
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The SUB instruction performs a component-wise subtraction of the second
5bd8deadSopenharmony_ci    operand from the first to yield a result vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = VectorLoad(op1);
5bd8deadSopenharmony_ci      result.x = tmp0.x - tmp1.x;
5bd8deadSopenharmony_ci      result.y = tmp0.y - tmp1.y;
5bd8deadSopenharmony_ci      result.z = tmp0.z - tmp1.z;
5bd8deadSopenharmony_ci      result.w = tmp0.w - tmp1.w;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    SUB supports all three data type modifiers.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, SWZ:  Extended Swizzle
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The SWZ instruction loads the single vector operand and performs a
5bd8deadSopenharmony_ci    swizzle operation more powerful than that provided for loading normal
5bd8deadSopenharmony_ci    vector operands to yield a result vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    After the operand is loaded, the "x", "y", "z", and "w" components of the
5bd8deadSopenharmony_ci    result vector are selected by the first, second, third, and fourth matches
5bd8deadSopenharmony_ci    of the <extSwizComp> pattern in the <extendedSwizzle> rule.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    A result component can be selected from any of the four components of the
5bd8deadSopenharmony_ci    operand or the constants 0.0 and 1.0.  The result component can also be
5bd8deadSopenharmony_ci    optionally negated.  The following pseudocode describes the component
5bd8deadSopenharmony_ci    selection method.  "operand" refers to the vector operand, "select" is an
5bd8deadSopenharmony_ci    enumerant where the values ZERO, ONE, X, Y, Z, and W correspond to the
5bd8deadSopenharmony_ci    <extSwizSel> rule matching "0", "1", "x", "y", "z", and "w", respectively.
5bd8deadSopenharmony_ci    "negate" is TRUE if and only if the <optionalSign> rule in <extSwizComp>
5bd8deadSopenharmony_ci    matches "-".
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      float ExtSwizComponent(floatVec operand, enum select, boolean negate)
5bd8deadSopenharmony_ci      {
5bd8deadSopenharmony_ci          float result;
5bd8deadSopenharmony_ci          switch (select) {
5bd8deadSopenharmony_ci            case ZERO:  result = 0.0; break;
5bd8deadSopenharmony_ci            case ONE:   result = 1.0; break;
5bd8deadSopenharmony_ci            case X:     result = operand.x; break;
5bd8deadSopenharmony_ci            case Y:     result = operand.y; break;
5bd8deadSopenharmony_ci            case Z:     result = operand.z; break;
5bd8deadSopenharmony_ci            case W:     result = operand.w; break;
5bd8deadSopenharmony_ci          }
5bd8deadSopenharmony_ci          if (negate) {
5bd8deadSopenharmony_ci            result = -result;
5bd8deadSopenharmony_ci          }
5bd8deadSopenharmony_ci          return result;
5bd8deadSopenharmony_ci      }
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The entire extended swizzle operation is then defined using the following
5bd8deadSopenharmony_ci    pseudocode:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp = VectorLoad(op0);
5bd8deadSopenharmony_ci      result.x = ExtSwizComponent(tmp, xSelect, xNegate);
5bd8deadSopenharmony_ci      result.y = ExtSwizComponent(tmp, ySelect, yNegate);
5bd8deadSopenharmony_ci      result.z = ExtSwizComponent(tmp, zSelect, zNegate);
5bd8deadSopenharmony_ci      result.w = ExtSwizComponent(tmp, wSelect, wNegate);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    "xSelect", "xNegate", "ySelect", "yNegate", "zSelect", "zNegate",
5bd8deadSopenharmony_ci    "wSelect", and "wNegate" correspond to the "select" and "negate" values
5bd8deadSopenharmony_ci    above for the four <extSwizComp> matches.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Since this instruction allows for component selection and negation for
5bd8deadSopenharmony_ci    each individual component, the grammar does not allow the use of the
5bd8deadSopenharmony_ci    normal swizzle and negation operations allowed for vector operands in
5bd8deadSopenharmony_ci    other instructions.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    SWZ supports only floating-point data type modifiers.
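
    The pseudocode above can be turned into a small C sketch.  The Vec4,
    Select, and SwizComp types and both function names are illustrative
    assumptions, not part of this extension.

```c
/* Assumed four-component float vector; not defined by the extension. */
typedef struct { float x, y, z, w; } Vec4;

/* One value per <extSwizSel> match: "0", "1", "x", "y", "z", "w". */
typedef enum { SEL_ZERO, SEL_ONE, SEL_X, SEL_Y, SEL_Z, SEL_W } Select;

/* One <extSwizComp> match: a selector plus an optional negation. */
typedef struct { Select select; int negate; } SwizComp;

/* Select one result component and optionally negate it. */
float ext_swiz_component(Vec4 operand, SwizComp c)
{
    float r = 0.0f;
    switch (c.select) {
    case SEL_ZERO: r = 0.0f;      break;
    case SEL_ONE:  r = 1.0f;      break;
    case SEL_X:    r = operand.x; break;
    case SEL_Y:    r = operand.y; break;
    case SEL_Z:    r = operand.z; break;
    case SEL_W:    r = operand.w; break;
    }
    return c.negate ? -r : r;
}

/* Apply the four <extSwizComp> matches in x, y, z, w order. */
Vec4 ext_swizzle(Vec4 operand, const SwizComp comp[4])
{
    Vec4 r;
    r.x = ext_swiz_component(operand, comp[0]);
    r.y = ext_swiz_component(operand, comp[1]);
    r.z = ext_swiz_component(operand, comp[2]);
    r.w = ext_swiz_component(operand, comp[3]);
    return r;
}
```

    Under these assumptions, an extended swizzle of "-y, x, 0, 1" applied to
    the operand (1, 2, 3, 4) would yield (-2, 1, 0, 1).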
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, TEX:  Texture Sample
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The TEX instruction takes the four components of a single floating-point
5bd8deadSopenharmony_ci    source vector and performs a filtered texture access as described in
5bd8deadSopenharmony_ci    Section 2.X.4.4.  The returned (R,G,B,A) value is written to the
5bd8deadSopenharmony_ci    floating-point result vector.  Partial derivatives and the level of detail
5bd8deadSopenharmony_ci    are computed automatically.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp = VectorLoad(op0);
5bd8deadSopenharmony_ci      ddx = ComputePartialsX(tmp);
5bd8deadSopenharmony_ci      ddy = ComputePartialsY(tmp);
5bd8deadSopenharmony_ci      lambda = ComputeLOD(ddx, ddy);
5bd8deadSopenharmony_ci      result = TextureSample(tmp, lambda, ddx, ddy, texelOffset);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    TEX supports all three data type modifiers.  The single operand is always
5bd8deadSopenharmony_ci    treated as a floating-point vector; the results are interpreted according
5bd8deadSopenharmony_ci    to the data type modifier.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, TRUNC:  Truncate (Round Toward Zero)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The TRUNC instruction loads a single vector operand and performs a
5bd8deadSopenharmony_ci    component-wise truncate operation to generate a result vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp = VectorLoad(op0);
5bd8deadSopenharmony_ci      result.x = trunc(tmp.x);
5bd8deadSopenharmony_ci      result.y = trunc(tmp.y);
5bd8deadSopenharmony_ci      result.z = trunc(tmp.z);
5bd8deadSopenharmony_ci      result.w = trunc(tmp.w);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The truncate operation returns the integer nearest to the operand whose
5bd8deadSopenharmony_ci    magnitude is no larger than that of the operand.  For example,
5bd8deadSopenharmony_ci    trunc(-1.7) = -1.0, trunc(+1.0) = +1.0, and trunc(+3.7) = +3.0.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    TRUNC supports all three data type modifiers.  The single operand is
5bd8deadSopenharmony_ci    always treated as a floating-point value, but the result is written as a
5bd8deadSopenharmony_ci    floating-point value, a signed integer, or an unsigned integer, as
5bd8deadSopenharmony_ci    specified by the data type modifier.  If a value is not exactly
5bd8deadSopenharmony_ci    representable using the data type of the result (e.g., an overflow or
5bd8deadSopenharmony_ci    writing a negative value to an unsigned integer), the result is undefined.
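
    As a rough illustration of the floating-point case, round-toward-zero
    can be sketched in C without any library calls.  The function name is
    hypothetical, and the sketch assumes 32-bit IEEE floats and an "int" of
    at least 32 bits.

```c
/* Round a float toward zero.  Unlike truncf() from <math.h>, this sketch
 * returns +0.0 rather than -0.0 for operands in the range (-1, 0]. */
float trunc_toward_zero(float x)
{
    if (x != x) {
        return x;               /* NaN passes through unchanged */
    }
    if (x >= 8388608.0f || x <= -8388608.0f) {
        return x;               /* |x| >= 2^23: already an integer (or Inf) */
    }
    return (float)(int)x;       /* float-to-int conversion drops the fraction */
}
```

    With this sketch, the three examples above evaluate to -1.0, +1.0, and
    +3.0, respectively.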
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, TXB:  Texture Sample with Bias
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The TXB instruction takes the four components of a single floating-point
5bd8deadSopenharmony_ci    source vector and performs a filtered texture access as described in
5bd8deadSopenharmony_ci    Section 2.X.4.4.  The returned (R,G,B,A) value is written to the
5bd8deadSopenharmony_ci    floating-point result vector.  Partial derivatives and the level of detail
5bd8deadSopenharmony_ci    are computed automatically, but the fourth component of the source vector
5bd8deadSopenharmony_ci    is added to the computed LOD prior to sampling.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp = VectorLoad(op0);
5bd8deadSopenharmony_ci      ddx = ComputePartialsX(tmp);
5bd8deadSopenharmony_ci      ddy = ComputePartialsY(tmp);
5bd8deadSopenharmony_ci      lambda = ComputeLOD(ddx, ddy);
5bd8deadSopenharmony_ci      result = TextureSample(tmp, lambda + tmp.w, ddx, ddy, texelOffset);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The single source vector in the TXB instruction does not have enough
5bd8deadSopenharmony_ci    coordinates to specify a lookup into a two-dimensional array texture or
5bd8deadSopenharmony_ci    cube map texture with both an LOD bias and an explicit reference value for
5bd8deadSopenharmony_ci    depth comparison.  A program will fail to load if it contains a TXB
5bd8deadSopenharmony_ci    instruction with a target of SHADOWCUBE or SHADOWARRAY2D.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    TXB supports all three data type modifiers.  The single operand is always
5bd8deadSopenharmony_ci    treated as a floating-point vector; the results are interpreted according
5bd8deadSopenharmony_ci    to the data type modifier.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, TXD:  Texture Sample with Partials
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The TXD instruction takes the four components of the first floating-point
5bd8deadSopenharmony_ci    source vector and performs a filtered texture access as described in
5bd8deadSopenharmony_ci    Section 2.X.4.4.  The returned (R,G,B,A) value is written to the
5bd8deadSopenharmony_ci    floating-point result vector.  The partial derivatives of the texture
5bd8deadSopenharmony_ci    coordinates with respect to X and Y are specified by the second and third
5bd8deadSopenharmony_ci    floating-point source vectors.  The level of detail is computed
5bd8deadSopenharmony_ci    automatically using the provided partial derivatives.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Note that for cube map texture targets, the provided partial derivatives
5bd8deadSopenharmony_ci    are in the coordinate system used before texture coordinates are projected
5bd8deadSopenharmony_ci    onto the appropriate cube face.  The partial derivatives of the
5bd8deadSopenharmony_ci    post-projection texture coordinates, which are used for level-of-detail
5bd8deadSopenharmony_ci    and anisotropic filtering calculations, are derived from the original
5bd8deadSopenharmony_ci    coordinates and partial derivatives in an implementation-dependent manner.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = VectorLoad(op1);
5bd8deadSopenharmony_ci      tmp2 = VectorLoad(op2);
5bd8deadSopenharmony_ci      lambda = ComputeLOD(tmp1, tmp2);
5bd8deadSopenharmony_ci      result = TextureSample(tmp0, lambda, tmp1, tmp2, texelOffset);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    TXD supports all three data type modifiers.  All three operands are always
5bd8deadSopenharmony_ci    treated as floating-point vectors; the results are interpreted according
5bd8deadSopenharmony_ci    to the data type modifier.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, TXF:  Texel Fetch
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The TXF instruction takes the four components of a single signed integer
5bd8deadSopenharmony_ci    source vector and performs a single texel fetch as described in Section
5bd8deadSopenharmony_ci    2.X.4.4.  The first three components provide the <i>, <j>, and <k> values
5bd8deadSopenharmony_ci    for the texel fetch, and the fourth component is used to determine the LOD
5bd8deadSopenharmony_ci    to access.  The returned (R,G,B,A) value is written to the floating-point
5bd8deadSopenharmony_ci    result vector.  Partial derivatives are irrelevant for single texel
5bd8deadSopenharmony_ci    fetches.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp = VectorLoad(op0);
5bd8deadSopenharmony_ci      result = TexelFetch(tmp, texelOffset);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    TXF supports all three data type modifiers.  The single vector operand is
5bd8deadSopenharmony_ci    treated as a signed integer vector; the results are interpreted according
5bd8deadSopenharmony_ci    to the data type modifier.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, TXL:  Texture Sample with LOD
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The TXL instruction takes the four components of a single floating-point
5bd8deadSopenharmony_ci    source vector and performs a filtered texture access as described in
5bd8deadSopenharmony_ci    Section 2.X.4.4.  The returned (R,G,B,A) value is written to the
5bd8deadSopenharmony_ci    floating-point result vector.  The level of detail is taken from the
5bd8deadSopenharmony_ci    fourth component of the source vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Partial derivatives are not computed by the TXL instruction and
5bd8deadSopenharmony_ci    anisotropic filtering is not performed.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp = VectorLoad(op0);
5bd8deadSopenharmony_ci      ddx = (0,0,0);
5bd8deadSopenharmony_ci      ddy = (0,0,0);
5bd8deadSopenharmony_ci      result = TextureSample(tmp, tmp.w, ddx, ddy, texelOffset);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The single source vector in the TXL instruction does not have enough
5bd8deadSopenharmony_ci    coordinates to specify a lookup into a 2D array or cube map texture with
5bd8deadSopenharmony_ci    both an explicit LOD and a reference value for depth comparison.  A
5bd8deadSopenharmony_ci    program will fail to load if it contains a TXL instruction with a target
5bd8deadSopenharmony_ci    of SHADOWCUBE or SHADOWARRAY2D.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    TXL supports all three data type modifiers.  The single vector operand is
5bd8deadSopenharmony_ci    treated as a floating-point vector; the results are interpreted according
5bd8deadSopenharmony_ci    to the data type modifier.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, TXP:  Texture Sample with Projection
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The TXP instruction divides the first three components of its single
5bd8deadSopenharmony_ci    floating-point source vector by its fourth component, maps the results to
5bd8deadSopenharmony_ci    s, t, and r, and performs a filtered texture access as described in
5bd8deadSopenharmony_ci    Section 2.X.4.4.  The returned (R,G,B,A) value is written to the
5bd8deadSopenharmony_ci    floating-point result vector.  Partial derivatives and the level of detail
5bd8deadSopenharmony_ci    are computed automatically.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp0.x = tmp0.x / tmp0.w;
5bd8deadSopenharmony_ci      tmp0.y = tmp0.y / tmp0.w;
5bd8deadSopenharmony_ci      tmp0.z = tmp0.z / tmp0.w;
5bd8deadSopenharmony_ci      ddx = ComputePartialsX(tmp0);
5bd8deadSopenharmony_ci      ddy = ComputePartialsY(tmp0);
5bd8deadSopenharmony_ci      lambda = ComputeLOD(ddx, ddy);
5bd8deadSopenharmony_ci      result = TextureSample(tmp0, lambda, ddx, ddy, texelOffset);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The single source vector in the TXP instruction does not have enough
5bd8deadSopenharmony_ci    coordinates to specify a lookup into a 2D array or cube map texture with
5bd8deadSopenharmony_ci    both a Q coordinate and an explicit reference value for depth comparison.
5bd8deadSopenharmony_ci    A program will fail to load if it contains a TXP instruction with a target
5bd8deadSopenharmony_ci    of SHADOWCUBE or SHADOWARRAY2D.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    TXP supports all three data type modifiers.  The single vector operand is
5bd8deadSopenharmony_ci    treated as a floating-point vector; the results are interpreted according
5bd8deadSopenharmony_ci    to the data type modifier.
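
    The projection step that distinguishes TXP from TEX can be sketched in
    C as follows.  The Vec4 type and the function name are assumptions made
    for illustration; the texture access itself is not modeled.

```c
/* Assumed four-component float vector; not defined by the extension. */
typedef struct { float x, y, z, w; } Vec4;

/* Divide the first three components by the fourth, as TXP does before
 * sampling.  The fourth component is left unchanged. */
Vec4 txp_project(Vec4 coord)
{
    coord.x = coord.x / coord.w;
    coord.y = coord.y / coord.w;
    coord.z = coord.z / coord.w;
    return coord;
}
```

    For example, the source vector (2, 4, 6, 2) would be sampled at
    (s, t, r) = (1, 2, 3).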
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, TXQ:  Texture Size Query
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The TXQ instruction takes the first component of the single integer vector
5bd8deadSopenharmony_ci    operand, adds the number of the base level of the specified texture to
5bd8deadSopenharmony_ci    determine a texture image level, and returns an integer result vector
5bd8deadSopenharmony_ci    containing the size of the image at that level of the texture.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    For one-dimensional and one-dimensional array textures, the "x" component
5bd8deadSopenharmony_ci    of the result vector is filled with the width of the image(s).  For
5bd8deadSopenharmony_ci    two-dimensional, rectangle, cube map, and two-dimensional array textures,
5bd8deadSopenharmony_ci    the "x" and "y" components are filled with the width and height of the
5bd8deadSopenharmony_ci    image(s).  For three-dimensional textures, the "x", "y", and "z"
5bd8deadSopenharmony_ci    components are filled with the width, height, and depth of the image.
5bd8deadSopenharmony_ci    Additionally, the number of layers in an array texture is returned in the
5bd8deadSopenharmony_ci    "y" component of the result for one-dimensional array textures or the "z"
5bd8deadSopenharmony_ci    component for two-dimensional array textures.  All other components of the
5bd8deadSopenharmony_ci    result vector are undefined.  For the purposes of this instruction, the
5bd8deadSopenharmony_ci    width, height, and depth of a texture do NOT include any border.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp0.x = tmp0.x + texture[op1].target[op2].base_level;
5bd8deadSopenharmony_ci      result.x = texture[op1].target[op2].level[tmp0.x].width;
5bd8deadSopenharmony_ci      result.y = texture[op1].target[op2].level[tmp0.x].height;
5bd8deadSopenharmony_ci      result.z = texture[op1].target[op2].level[tmp0.x].depth;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If the level computed by adding the operand to the base level of the
5bd8deadSopenharmony_ci    texture is less than the base level number or greater than the maximum
5bd8deadSopenharmony_ci    level number, the results are undefined.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    TXQ supports no data type modifiers; the scalar operand and the result
5bd8deadSopenharmony_ci    vector are both interpreted as signed integers.
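
    The level selection and range rule can be sketched in C.  The Texture
    and LevelSize structures are hypothetical stand-ins for texture object
    state, and returning 0 is only this sketch's way of flagging the case
    that the extension leaves undefined.

```c
/* Hypothetical per-level image size (border excluded). */
typedef struct { int width, height, depth; } LevelSize;

/* Hypothetical texture state; levels[] is indexed by absolute level. */
typedef struct {
    int base_level;
    int max_level;
    const LevelSize *levels;
} Texture;

/* Returns 1 and fills result[] if the requested level is in range;
 * returns 0 where the extension says the results are undefined. */
int txq(const Texture *tex, int lod, int result[3])
{
    int level = lod + tex->base_level;   /* operand is relative to base */
    if (level < tex->base_level || level > tex->max_level) {
        return 0;
    }
    result[0] = tex->levels[level].width;
    result[1] = tex->levels[level].height;
    result[2] = tex->levels[level].depth;
    return 1;
}
```

    With a base level of 1, an operand of 1 queries absolute level 2 of the
    texture.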
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, UP2H:  Unpack Two 16-bit Floats
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The UP2H instruction unpacks two 16-bit floats stored together in a 32-bit
5bd8deadSopenharmony_ci    scalar operand.  The first 16-bit float (stored in the 16 least
5bd8deadSopenharmony_ci    significant bits) is written into the "x" and "z" components of the result
5bd8deadSopenharmony_ci    vector; the second is written into the "y" and "w" components of the
5bd8deadSopenharmony_ci    result vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    This operation undoes the type conversion and packing performed by
5bd8deadSopenharmony_ci    the PK2H instruction.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp = ScalarLoad(op0);
5bd8deadSopenharmony_ci      result.x = (fp16) (RawBits(tmp) & 0xFFFF);
5bd8deadSopenharmony_ci      result.y = (fp16) ((RawBits(tmp) >> 16) & 0xFFFF);
5bd8deadSopenharmony_ci      result.z = (fp16) (RawBits(tmp) & 0xFFFF);
5bd8deadSopenharmony_ci      result.w = (fp16) ((RawBits(tmp) >> 16) & 0xFFFF);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    UP2H supports all three data type modifiers.  The single operand is read
5bd8deadSopenharmony_ci    as a floating-point value, a signed integer, or an unsigned integer, as
5bd8deadSopenharmony_ci    specified by the data type modifier; the 32 least significant bits of the
5bd8deadSopenharmony_ci    encoding are used for unpacking.  For floating-point operand variables, it
5bd8deadSopenharmony_ci    is expected (but not required) that the operand was produced by a previous
5bd8deadSopenharmony_ci    pack instruction.  The result is always written as a floating-point
5bd8deadSopenharmony_ci    vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    A program will fail to load if it contains a UP2H instruction whose
5bd8deadSopenharmony_ci    operand is a variable declared as "SHORT".
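
    A C sketch of the unpacking follows.  It assumes the 16-bit float
    layout of 1 sign bit, 5 exponent bits (bias 15), and 10 mantissa bits,
    and a 32-bit IEEE "float"; both function names are hypothetical.

```c
#include <stdint.h>
#include <string.h>

/* Decode one 16-bit float (assumed s1e5m10 layout) into a 32-bit float. */
float half_to_float(uint16_t h)
{
    uint32_t sign = (uint32_t)(h & 0x8000u) << 16;
    uint32_t exp  = (h >> 10) & 0x1Fu;
    uint32_t man  = h & 0x3FFu;
    uint32_t bits;
    float f;

    if (exp == 0x1Fu) {                 /* Inf or NaN */
        bits = sign | 0x7F800000u | (man << 13);
    } else if (exp != 0) {              /* normal: rebias 15 -> 127 */
        bits = sign | ((exp + 112u) << 23) | (man << 13);
    } else if (man != 0) {              /* denormal: normalize mantissa */
        exp = 113u;
        while ((man & 0x400u) == 0) {
            man <<= 1;
            exp--;
        }
        bits = sign | (exp << 23) | ((man & 0x3FFu) << 13);
    } else {                            /* signed zero */
        bits = sign;
    }
    memcpy(&f, &bits, sizeof f);        /* reinterpret the bit pattern */
    return f;
}

/* Low half goes to x and z; high half goes to y and w. */
void up2h(uint32_t raw, float result[4])
{
    float lo = half_to_float((uint16_t)(raw & 0xFFFFu));
    float hi = half_to_float((uint16_t)((raw >> 16) & 0xFFFFu));
    result[0] = lo;
    result[1] = hi;
    result[2] = lo;
    result[3] = hi;
}
```

    Under these assumptions, the raw operand 0xC0003C00 unpacks to
    (1.0, -2.0, 1.0, -2.0).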
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, UP2US:  Unpack Two Unsigned 16-bit Integers
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The UP2US instruction unpacks two 16-bit unsigned values packed
5bd8deadSopenharmony_ci    together in a 32-bit scalar operand.  The unsigned quantities are
5bd8deadSopenharmony_ci    encoded where a bit pattern of all '0' bits corresponds to 0.0 and
5bd8deadSopenharmony_ci    a pattern of all '1' bits corresponds to 1.0.  The "x" and "z"
5bd8deadSopenharmony_ci    components of the result vector are obtained from the 16 least
5bd8deadSopenharmony_ci    significant bits of the operand; the "y" and "w" components are
5bd8deadSopenharmony_ci    obtained from the 16 most significant bits.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    This operation undoes the type conversion and packing performed by
5bd8deadSopenharmony_ci    the PK2US instruction.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp = ScalarLoad(op0);
5bd8deadSopenharmony_ci      result.x = ((RawBits(tmp) >> 0)  & 0xFFFF) / 65535.0;
5bd8deadSopenharmony_ci      result.y = ((RawBits(tmp) >> 16) & 0xFFFF) / 65535.0;
5bd8deadSopenharmony_ci      result.z = ((RawBits(tmp) >> 0)  & 0xFFFF) / 65535.0;
5bd8deadSopenharmony_ci      result.w = ((RawBits(tmp) >> 16) & 0xFFFF) / 65535.0;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    UP2US supports all three data type modifiers.  The single operand is read
5bd8deadSopenharmony_ci    as a floating-point value, a signed integer, or an unsigned integer, as
5bd8deadSopenharmony_ci    specified by the data type modifier; the 32 least significant bits of the
5bd8deadSopenharmony_ci    encoding are used for unpacking.  For floating-point operand variables, it
5bd8deadSopenharmony_ci    is expected (but not required) that the operand was produced by a previous
5bd8deadSopenharmony_ci    pack instruction.  The result is always written as a floating-point
5bd8deadSopenharmony_ci    vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    A GPU program will fail to load if it contains a UP2US instruction
5bd8deadSopenharmony_ci    whose operand is a variable declared as "SHORT".
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, UP4B:  Unpack Four Signed 8-bit Integers
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The UP4B instruction unpacks four 8-bit signed values packed together
5bd8deadSopenharmony_ci    in a 32-bit scalar operand.  The signed quantities are encoded where
5bd8deadSopenharmony_ci    a bit pattern of all '0' bits corresponds to -128/127 and a pattern
5bd8deadSopenharmony_ci    of all '1' bits corresponds to +127/127.  The "x" component of the
5bd8deadSopenharmony_ci    result vector is the converted value corresponding to the 8 least
5bd8deadSopenharmony_ci    significant bits of the operand; the "w" component corresponds to
5bd8deadSopenharmony_ci    the 8 most significant bits.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    This operation undoes the type conversion and packing performed by
5bd8deadSopenharmony_ci    the PK4B instruction.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp = ScalarLoad(op0);
5bd8deadSopenharmony_ci      result.x = (((RawBits(tmp) >> 0) & 0xFF) - 128) / 127.0;
5bd8deadSopenharmony_ci      result.y = (((RawBits(tmp) >> 8) & 0xFF) - 128) / 127.0;
5bd8deadSopenharmony_ci      result.z = (((RawBits(tmp) >> 16) & 0xFF) - 128) / 127.0;
5bd8deadSopenharmony_ci      result.w = (((RawBits(tmp) >> 24) & 0xFF) - 128) / 127.0;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    UP4B supports all three data type modifiers.  The single operand is read
5bd8deadSopenharmony_ci    as a floating-point value, a signed integer, or an unsigned integer, as
5bd8deadSopenharmony_ci    specified by the data type modifier; the 32 least significant bits of the
5bd8deadSopenharmony_ci    encoding are used for unpacking.  For floating-point operand variables, it
5bd8deadSopenharmony_ci    is expected (but not required) that the operand was produced by a previous
5bd8deadSopenharmony_ci    pack instruction.  The result is always written as a floating-point
5bd8deadSopenharmony_ci    vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    A program will fail to load if it contains a UP4B instruction whose
5bd8deadSopenharmony_ci    operand is a variable declared as "SHORT".
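
    The biased signed encoding can be sketched in C as follows, taking the
    raw 32-bit operand as input.  The function name is hypothetical.

```c
#include <stdint.h>

/* Unpack four biased signed bytes; byte i maps to (byte - 128) / 127. */
void up4b(uint32_t raw, float result[4])
{
    for (int i = 0; i < 4; i++) {
        int byte = (int)((raw >> (8 * i)) & 0xFFu);
        result[i] = (float)(byte - 128) / 127.0f;
    }
}
```

    Note that a byte of 0x80 decodes to 0.0 and a byte of 0x01 decodes to
    -1.0, while a byte of 0x00 decodes to -128/127, slightly below -1.0.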
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, UP4UB:  Unpack Four Unsigned 8-bit Integers
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The UP4UB instruction unpacks four 8-bit unsigned values packed
5bd8deadSopenharmony_ci    together in a 32-bit scalar operand.  The unsigned quantities are
5bd8deadSopenharmony_ci    encoded where a bit pattern of all '0' bits corresponds to 0.0 and a
5bd8deadSopenharmony_ci    pattern of all '1' bits corresponds to 1.0.  The "x" component of the
5bd8deadSopenharmony_ci    result vector is obtained from the 8 least significant bits of the
5bd8deadSopenharmony_ci    operand; the "w" component is obtained from the 8 most significant
5bd8deadSopenharmony_ci    bits.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    This operation undoes the type conversion and packing performed by
5bd8deadSopenharmony_ci    the PK4UB instruction.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp = ScalarLoad(op0);
5bd8deadSopenharmony_ci      result.x = ((RawBits(tmp) >> 0)  & 0xFF) / 255.0;
5bd8deadSopenharmony_ci      result.y = ((RawBits(tmp) >> 8)  & 0xFF) / 255.0;
5bd8deadSopenharmony_ci      result.z = ((RawBits(tmp) >> 16) & 0xFF) / 255.0;
5bd8deadSopenharmony_ci      result.w = ((RawBits(tmp) >> 24) & 0xFF) / 255.0;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    UP4UB supports all three data type modifiers.  The single operand is read
5bd8deadSopenharmony_ci    as a floating-point value, a signed integer, or an unsigned integer, as
5bd8deadSopenharmony_ci    specified by the data type modifier; the 32 least significant bits of the
5bd8deadSopenharmony_ci    encoding are used for unpacking.  For floating-point operand variables, it
5bd8deadSopenharmony_ci    is expected (but not required) that the operand was produced by a previous
5bd8deadSopenharmony_ci    pack instruction.  The result is always written as a floating-point
5bd8deadSopenharmony_ci    vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    A program will fail to load if it contains a UP4UB instruction whose
5bd8deadSopenharmony_ci    operand is a variable declared as "SHORT".
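
    For comparison with the signed variant, a C sketch of the unsigned
    unpacking follows.  The function name is hypothetical, and the raw
    32-bit operand is passed in directly.

```c
#include <stdint.h>

/* Unpack four unsigned bytes; byte i maps to byte / 255. */
void up4ub(uint32_t raw, float result[4])
{
    for (int i = 0; i < 4; i++) {
        result[i] = (float)((raw >> (8 * i)) & 0xFFu) / 255.0f;
    }
}
```

    A typical use would be expanding a packed 8-bit-per-channel color into
    four floating-point channels in the range [0.0, 1.0].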
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, X2D:  2D Coordinate Transformation
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The X2D instruction multiplies the 2D offset vector specified by the
5bd8deadSopenharmony_ci    "x" and "y" components of the second vector operand by the 2x2 matrix
5bd8deadSopenharmony_ci    specified by the four components of the third vector operand, and adds
5bd8deadSopenharmony_ci    the transformed offset vector to the 2D vector specified by the "x"
5bd8deadSopenharmony_ci    and "y" components of the first vector operand.  The first component
5bd8deadSopenharmony_ci    of the sum is written to the "x" and "z" components of the result;
5bd8deadSopenharmony_ci    the second component is written to the "y" and "w" components of
5bd8deadSopenharmony_ci    the result.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = VectorLoad(op1);
5bd8deadSopenharmony_ci      tmp2 = VectorLoad(op2);
5bd8deadSopenharmony_ci      result.x = tmp0.x + tmp1.x * tmp2.x + tmp1.y * tmp2.y;
5bd8deadSopenharmony_ci      result.y = tmp0.y + tmp1.x * tmp2.z + tmp1.y * tmp2.w;
5bd8deadSopenharmony_ci      result.z = tmp0.x + tmp1.x * tmp2.x + tmp1.y * tmp2.y;
5bd8deadSopenharmony_ci      result.w = tmp0.y + tmp1.x * tmp2.z + tmp1.y * tmp2.w;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    X2D supports only floating-point data type modifiers.
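
    The same computation can be written as a short C sketch.  The Vec4
    type, the function name, and the parameter names are illustrative
    assumptions.

```c
/* Assumed four-component float vector; not defined by the extension. */
typedef struct { float x, y, z, w; } Vec4;

/* base + M * offset, where M is the 2x2 matrix
 *   | m.x  m.y |
 *   | m.z  m.w |
 * and only the x and y components of base and offset are used. */
Vec4 x2d(Vec4 base, Vec4 offset, Vec4 m)
{
    Vec4 r;
    r.x = base.x + offset.x * m.x + offset.y * m.y;
    r.y = base.y + offset.x * m.z + offset.y * m.w;
    r.z = r.x;   /* first component of the sum is replicated to z */
    r.w = r.y;   /* second component of the sum is replicated to w */
    return r;
}
```

    For example, with the matrix (0, -1, 1, 0), which rotates by 90
    degrees, the offset (1, 2) added to the base (10, 20) gives (8, 21).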
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, XOR:  Exclusive Or
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The XOR instruction performs a bitwise XOR operation on the components of
5bd8deadSopenharmony_ci    the two source vectors to yield a result vector.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = VectorLoad(op1);
5bd8deadSopenharmony_ci      result.x = tmp0.x ^ tmp1.x;
5bd8deadSopenharmony_ci      result.y = tmp0.y ^ tmp1.y;
5bd8deadSopenharmony_ci      result.z = tmp0.z ^ tmp1.z;
5bd8deadSopenharmony_ci      result.w = tmp0.w ^ tmp1.w;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    XOR supports only integer data type modifiers.  If no type modifier is
5bd8deadSopenharmony_ci    specified, both operands and the result are treated as signed integers.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Section 2.X.8.Z, XPD:  Cross Product
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The XPD instruction computes the cross product using the first three
5bd8deadSopenharmony_ci    components of its two vector operands to generate the x, y, and z
5bd8deadSopenharmony_ci    components of the result vector.  The w component of the result vector is
5bd8deadSopenharmony_ci    undefined.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      tmp0 = VectorLoad(op0);
5bd8deadSopenharmony_ci      tmp1 = VectorLoad(op1);
5bd8deadSopenharmony_ci      result.x = tmp0.y * tmp1.z - tmp0.z * tmp1.y;
5bd8deadSopenharmony_ci      result.y = tmp0.z * tmp1.x - tmp0.x * tmp1.z;
5bd8deadSopenharmony_ci      result.z = tmp0.x * tmp1.y - tmp0.y * tmp1.x;
5bd8deadSopenharmony_ci      result.w = undefined;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    XPD supports only floating-point data type modifiers.
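
    A C sketch of the cross product follows.  The Vec4 type and function
    name are assumptions; because the w component is undefined, the sketch
    arbitrarily sets it to zero.

```c
/* Assumed four-component float vector; not defined by the extension. */
typedef struct { float x, y, z, w; } Vec4;

/* Cross product of the xyz parts of a and b. */
Vec4 xpd(Vec4 a, Vec4 b)
{
    Vec4 r;
    r.x = a.y * b.z - a.z * b.y;
    r.y = a.z * b.x - a.x * b.z;
    r.z = a.x * b.y - a.y * b.x;
    r.w = 0.0f;   /* arbitrary: the extension leaves w undefined */
    return r;
}
```

    As a sanity check, the cross product of the unit X and unit Y vectors
    is the unit Z vector.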
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciAdditions to Chapter 3 of the OpenGL 2.0 Specification (Rasterization)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Modify Section 3.8.1, Texture Image Specification, p. 150
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (modify 4th paragraph, p. 151 -- add cubemaps to the list of texture
5bd8deadSopenharmony_ci    targets that can be used with DEPTH_COMPONENT textures) Textures with a
5bd8deadSopenharmony_ci    base internal format of DEPTH_COMPONENT are supported by texture image
5bd8deadSopenharmony_ci    specification commands only if <target> is TEXTURE_1D, TEXTURE_2D,
5bd8deadSopenharmony_ci    TEXTURE_CUBE_MAP, TEXTURE_RECTANGLE_ARB, TEXTURE_1D_ARRAY_EXT,
5bd8deadSopenharmony_ci    TEXTURE_2D_ARRAY_EXT, PROXY_TEXTURE_1D, PROXY_TEXTURE_2D,
5bd8deadSopenharmony_ci    PROXY_TEXTURE_CUBE_MAP, PROXY_TEXTURE_RECTANGLE_ARB,
5bd8deadSopenharmony_ci    PROXY_TEXTURE_1D_ARRAY_EXT, or PROXY_TEXTURE_2D_ARRAY_EXT.  Using this
5bd8deadSopenharmony_ci    format in conjunction with any other target will result in an
5bd8deadSopenharmony_ci    INVALID_OPERATION error.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Delete Section 3.8.7, Texture Wrap Modes.  (The language in this section
5bd8deadSopenharmony_ci    is folded into updates to the following section, and is no longer needed
5bd8deadSopenharmony_ci    here.)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Modify Section 3.8.8, Texture Minification:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (replace the last paragraph, p. 171):  Let s(x,y) be the function that
5bd8deadSopenharmony_ci    associates an s texture coordinate with each set of window coordinates
5bd8deadSopenharmony_ci    (x,y) that lie within a primitive; define t(x,y) and r(x,y) analogously.
5bd8deadSopenharmony_ci    Let
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      u(x,y) = w_t * s(x,y) + offsetu_shader,
5bd8deadSopenharmony_ci      v(x,y) = h_t * t(x,y) + offsetv_shader, and
5bd8deadSopenharmony_ci      w(x,y) = d_t * r(x,y) + offsetw_shader,
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    where w_t, h_t, and d_t are as defined by equations 3.15, 3.16, and 3.17
5bd8deadSopenharmony_ci    with w_s, h_s, and d_s equal to the width, height, and depth of the image
5bd8deadSopenharmony_ci    array whose level is level_base.  (offsetu_shader, offsetv_shader,
5bd8deadSopenharmony_ci    offsetw_shader) is the texel offset specified in the vertex, geometry, or
5bd8deadSopenharmony_ci    fragment program instruction used to perform the access.  For
5bd8deadSopenharmony_ci    fixed-function texture accesses, all three shader offsets are taken to be
5bd8deadSopenharmony_ci    zero.  For a one-dimensional texture, define v(x,y) == 0 and w(x,y) == 0;
5bd8deadSopenharmony_ci    for two-dimensional textures, define w(x,y) == 0.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    After u(x,y), v(x,y), and w(x,y) are generated, they are clamped if the
5bd8deadSopenharmony_ci    corresponding texture wrap modes are CLAMP or MIRROR_CLAMP_EXT.  Let
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      u'(x,y) = clamp(u(x,y), 0, w_t),      if TEXTURE_WRAP_S is CLAMP
5bd8deadSopenharmony_ci                clamp(u(x,y), -w_t, w_t),   if TEXTURE_WRAP_S is
5bd8deadSopenharmony_ci                                              MIRROR_CLAMP_EXT, or
5bd8deadSopenharmony_ci                u(x,y),                     otherwise
5bd8deadSopenharmony_ci      v'(x,y) = clamp(v(x,y), 0, h_t),      if TEXTURE_WRAP_T is CLAMP
5bd8deadSopenharmony_ci                clamp(v(x,y), -h_t, h_t),   if TEXTURE_WRAP_T is
5bd8deadSopenharmony_ci                                              MIRROR_CLAMP_EXT, or
5bd8deadSopenharmony_ci                v(x,y),                     otherwise
5bd8deadSopenharmony_ci      w'(x,y) = clamp(w(x,y), 0, d_t),      if TEXTURE_WRAP_R is CLAMP
5bd8deadSopenharmony_ci                clamp(w(x,y), -d_t, d_t),   if TEXTURE_WRAP_R is
5bd8deadSopenharmony_ci                                              MIRROR_CLAMP_EXT, or
5bd8deadSopenharmony_ci                w(x,y),                     otherwise,
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    where clamp(<a>,<b>,<c>) returns <b> if <a> is less than <b>, <c> if <a>
5bd8deadSopenharmony_ci    is greater than <c>, and <a> otherwise.
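
    The pre-clamp step above can be sketched in C as follows.  This is an
    illustration only: the enum and function names are hypothetical, and the
    sketch assumes each coordinate is clamped against its own dimension
    (w_t for u, h_t for v, d_t for w).

```c
/* Hypothetical enum for this sketch; real code would switch on the GL
 * tokens CLAMP and MIRROR_CLAMP_EXT instead. */
typedef enum {
    PRECLAMP_CLAMP,         /* TEXTURE_WRAP_* is CLAMP */
    PRECLAMP_MIRROR_CLAMP,  /* TEXTURE_WRAP_* is MIRROR_CLAMP_EXT */
    PRECLAMP_NONE           /* any other wrap mode */
} preclamp_mode;

/* clamp(a, b, c): b if a < b, c if a > c, a otherwise. */
static float clampf(float a, float b, float c)
{
    return (a < b) ? b : (a > c) ? c : a;
}

/* Pre-clamp one unnormalized coordinate (u, v, or w) against the
 * matching texture dimension (w_t, h_t, or d_t). */
static float pre_clamp(float coord, float size, preclamp_mode mode)
{
    switch (mode) {
    case PRECLAMP_CLAMP:        return clampf(coord, 0.0f, size);
    case PRECLAMP_MIRROR_CLAMP: return clampf(coord, -size, size);
    default:                    return coord;
    }
}
```

    For a 64-texel-wide texture, pre_clamp(70.0f, 64.0f, PRECLAMP_CLAMP)
    yields 64, while the same coordinate passes through unchanged for wrap
    modes that are not pre-clamped.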
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (start a new paragraph with "For a polygon, rho is given at a fragment
5bd8deadSopenharmony_ci    with window coordinates...", and then continue with the original spec
5bd8deadSopenharmony_ci    text.)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (replace text starting with the last paragraph on p. 172, continuing to
5bd8deadSopenharmony_ci    the end of p. 174)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    When lambda indicates minification, the value assigned to
5bd8deadSopenharmony_ci    TEXTURE_MIN_FILTER is used to determine how the texture value for a
5bd8deadSopenharmony_ci    fragment is selected.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    When TEXTURE_MIN_FILTER is NEAREST, the texel in the image array of level
5bd8deadSopenharmony_ci    level_base that is nearest (in Manhattan distance) to that specified by
5bd8deadSopenharmony_ci    (s,t,r) is obtained.  Let i, j, and k be integers such that:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      i = apply_wrap(floor(u'(x,y))),
5bd8deadSopenharmony_ci      j = apply_wrap(floor(v'(x,y))), and
5bd8deadSopenharmony_ci      k = apply_wrap(floor(w'(x,y))),
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    where the coordinate returned by apply_wrap() is as defined by Table X.19.
5bd8deadSopenharmony_ci    The values of i, j, and k are then modified according to the texture wrap
5bd8deadSopenharmony_ci    modes, as described in Table 3.19, to produce new values (i', j', and k').
5bd8deadSopenharmony_ci    For a three-dimensional texture, the texel at location (i,j,k) becomes the
5bd8deadSopenharmony_ci    texture value.  For a two-dimensional texture, k is irrelevant, and the
5bd8deadSopenharmony_ci    texel at location (i,j) becomes the texture value.  For a one-dimensional
5bd8deadSopenharmony_ci    texture, j and k are irrelevant, and the texel at location i becomes the
5bd8deadSopenharmony_ci    texture value.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Wrap mode                   Result
5bd8deadSopenharmony_ci      --------------------------  ------------------------------------------
5bd8deadSopenharmony_ci      CLAMP_TO_EDGE               clamp(coord, 0, size-1)
5bd8deadSopenharmony_ci      CLAMP_TO_BORDER             clamp(coord, -1, size)
5bd8deadSopenharmony_ci      CLAMP                       { clamp(coord, 0, size-1),
5bd8deadSopenharmony_ci                                  {         for NEAREST filtering
5bd8deadSopenharmony_ci                                  { clamp(coord, -1, size),
5bd8deadSopenharmony_ci                                  {         for LINEAR filtering
5bd8deadSopenharmony_ci      REPEAT                      mod(coord, size)
5bd8deadSopenharmony_ci      MIRROR_CLAMP_TO_EDGE_EXT    clamp(mirror(coord), 0, size-1)
5bd8deadSopenharmony_ci      MIRROR_CLAMP_TO_BORDER_EXT  clamp(mirror(coord), 0, size)
5bd8deadSopenharmony_ci      MIRROR_CLAMP_EXT            { clamp(mirror(coord), 0, size-1),
5bd8deadSopenharmony_ci                                  {         for NEAREST filtering
5bd8deadSopenharmony_ci                                  { clamp(mirror(coord), 0, size),
5bd8deadSopenharmony_ci                                  {         for LINEAR filtering
5bd8deadSopenharmony_ci      MIRRORED_REPEAT             (size-1) - mirror(mod(coord, 2*size)-size)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Table X.19:  Texel location wrap mode application.  mod(<a>,<b>) is
5bd8deadSopenharmony_ci      defined to return <a>-<b>*floor(<a>/<b>), and mirror(<a>) is defined to
5bd8deadSopenharmony_ci      return <a> if <a> is greater than or equal to zero or -(1+<a>)
5bd8deadSopenharmony_ci      otherwise.  The values of "wrap mode" and size are TEXTURE_WRAP_S and
5bd8deadSopenharmony_ci      w_t, TEXTURE_WRAP_T and h_t, and TEXTURE_WRAP_R and d_t, for i, j, and k
5bd8deadSopenharmony_ci      coordinates, respectively.  The result for CLAMP and MIRROR_CLAMP_EXT
5bd8deadSopenharmony_ci      depends on the filtering mode (NEAREST or LINEAR).
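
    As an illustration of Table X.19, the following C sketch applies a wrap
    mode to an integer texel coordinate.  The enum and function names are
    hypothetical stand-ins for the GL tokens, and the sketch assumes that
    the mirrored clamp modes apply mirror() to the coordinate rather than
    to the size.

```c
/* Hypothetical enums for this sketch; real code would switch on the GL
 * wrap-mode and filter tokens instead. */
typedef enum {
    WRAP_CLAMP_TO_EDGE, WRAP_CLAMP_TO_BORDER, WRAP_CLAMP, WRAP_REPEAT,
    WRAP_MIRROR_CLAMP_TO_EDGE, WRAP_MIRROR_CLAMP_TO_BORDER,
    WRAP_MIRROR_CLAMP, WRAP_MIRRORED_REPEAT
} wrap_mode;

typedef enum { FILTER_NEAREST, FILTER_LINEAR } filter_mode;

static int clampi(int a, int b, int c)
{
    return (a < b) ? b : (a > c) ? c : a;
}

/* mod(a, b) = a - b*floor(a/b); the result is non-negative for b > 0. */
static int modi(int a, int b)
{
    int r = a % b;
    return (r < 0) ? r + b : r;
}

/* mirror(a) = a if a >= 0, -(1 + a) otherwise. */
static int mirrori(int a)
{
    return (a >= 0) ? a : -(1 + a);
}

/* size is w_t, h_t, or d_t for the i, j, or k coordinate. */
static int apply_wrap(int coord, int size, wrap_mode wrap, filter_mode filter)
{
    switch (wrap) {
    case WRAP_CLAMP_TO_EDGE:
        return clampi(coord, 0, size - 1);
    case WRAP_CLAMP_TO_BORDER:
        return clampi(coord, -1, size);
    case WRAP_CLAMP:
        return (filter == FILTER_NEAREST) ? clampi(coord, 0, size - 1)
                                          : clampi(coord, -1, size);
    case WRAP_REPEAT:
        return modi(coord, size);
    case WRAP_MIRROR_CLAMP_TO_EDGE:
        return clampi(mirrori(coord), 0, size - 1);
    case WRAP_MIRROR_CLAMP_TO_BORDER:
        return clampi(mirrori(coord), 0, size);
    case WRAP_MIRROR_CLAMP:
        return (filter == FILTER_NEAREST)
                   ? clampi(mirrori(coord), 0, size - 1)
                   : clampi(mirrori(coord), 0, size);
    case WRAP_MIRRORED_REPEAT:
        return (size - 1) - mirrori(modi(coord, 2 * size) - size);
    }
    return coord; /* not reached for valid wrap modes */
}
```

    With a size of 8, REPEAT maps coordinate -1 to 7, and MIRRORED_REPEAT
    maps both -1 and 16 to 0.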
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If the selected (i,j,k), (i,j), or i location refers to a border texel
5bd8deadSopenharmony_ci    that satisfies any of the following conditions:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      i < -b_s,
5bd8deadSopenharmony_ci      j < -b_s,
5bd8deadSopenharmony_ci      k < -b_s,
5bd8deadSopenharmony_ci      i >= w_t + b_s,
5bd8deadSopenharmony_ci      j >= h_t + b_s, or
5bd8deadSopenharmony_ci      k >= d_t + b_s,
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    then the border values defined by TEXTURE_BORDER_COLOR are used in place
5bd8deadSopenharmony_ci    of the non-existent texel. If the texture contains color components, the
5bd8deadSopenharmony_ci    values of TEXTURE_BORDER_COLOR are interpreted as an RGBA color to match
5bd8deadSopenharmony_ci    the texture's internal format in a manner consistent with table 3.15. If
5bd8deadSopenharmony_ci    the texture contains depth components, the first component of
5bd8deadSopenharmony_ci    TEXTURE_BORDER_COLOR is interpreted as a depth value.
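
    The border test above can be written as a single predicate.  The C
    sketch below is illustrative: the function name is hypothetical, the
    depth condition is assumed to test k against d_t, and passing 0 for
    unused indices and 1 for unused sizes on one- and two-dimensional
    textures is a convention of this sketch only.

```c
#include <stdbool.h>

/* Returns true when texel (i, j, k) lies outside the image plus its
 * b_s-texel border, so TEXTURE_BORDER_COLOR is used in its place.
 * w_t, h_t, and d_t are the width, height, and depth of the level. */
static bool uses_border_color(int i, int j, int k,
                              int w_t, int h_t, int d_t, int b_s)
{
    return i < -b_s || j < -b_s || k < -b_s ||
           i >= w_t + b_s || j >= h_t + b_s || k >= d_t + b_s;
}
```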
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    When TEXTURE_MIN_FILTER is LINEAR, a 2x2x2 cube of texels in the image
5bd8deadSopenharmony_ci    array of level level_base is selected.  Let:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      i_0   = apply_wrap(floor(u' - 0.5)),
5bd8deadSopenharmony_ci      j_0   = apply_wrap(floor(v' - 0.5)),
5bd8deadSopenharmony_ci      k_0   = apply_wrap(floor(w' - 0.5)),
5bd8deadSopenharmony_ci      i_1   = apply_wrap(floor(u' - 0.5) + 1),
5bd8deadSopenharmony_ci      j_1   = apply_wrap(floor(v' - 0.5) + 1),
5bd8deadSopenharmony_ci      k_1   = apply_wrap(floor(w' - 0.5) + 1),
5bd8deadSopenharmony_ci      alpha = frac(u' - 0.5),
5bd8deadSopenharmony_ci      beta  = frac(v' - 0.5),
5bd8deadSopenharmony_ci      gamma = frac(w' - 0.5),
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    where frac(<x>) denotes the fractional part of <x>.
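
    Along a single axis, the LINEAR selection above reduces to two texel
    indices and one blend weight.  The following C sketch is illustrative:
    the struct and function names are hypothetical, and the wrap step
    (apply_wrap in the text) is left to the caller.

```c
typedef struct {
    int   i0, i1;   /* unwrapped texel indices along one axis */
    float weight;   /* alpha, beta, or gamma: blend weight toward i1 */
} linear_axis;

/* floor() for values that fit in an int, without needing libm. */
static int floor_to_int(float x)
{
    int t = (int)x;                 /* truncates toward zero */
    return (x < (float)t) ? t - 1 : t;
}

/* coord is the pre-clamped coordinate u', v', or w'. */
static linear_axis linear_footprint(float coord)
{
    float shifted = coord - 0.5f;
    linear_axis r;
    r.i0     = floor_to_int(shifted);
    r.i1     = r.i0 + 1;
    r.weight = shifted - (float)r.i0;   /* frac(coord - 0.5) */
    return r;
}
```

    For u' = 2.75 this yields i_0 = 2, i_1 = 3, and alpha = 0.25, before
    any wrap mode is applied.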
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    For a three-dimensional texture, the texture value tau is found as...
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (replace last paragraph, p.174) For any texel in the equation above that
5bd8deadSopenharmony_ci    refers to a border texel outside the defined range of the image, the texel
5bd8deadSopenharmony_ci    value is taken from the texture border color as with NEAREST filtering.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Modify Section 3.8.14, Texture Comparison Modes (p. 185)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (modify 2nd paragraph, p. 188, indicating that the Q texture coordinate is
5bd8deadSopenharmony_ci    used for depth comparisons on cubemap textures)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Let D_t be the depth texture value, in the range [0, 1].  For
5bd8deadSopenharmony_ci    fixed-function texture lookups, let R be the interpolated <r> texture
5bd8deadSopenharmony_ci    coordinate, clamped to the range [0, 1].  For texture lookups generated by
5bd8deadSopenharmony_ci    a program instruction, let R be the reference value for depth comparisons
5bd8deadSopenharmony_ci    provided in the instruction, also clamped to [0, 1].  Then the effective
5bd8deadSopenharmony_ci    texture value L_t, I_t, or A_t is computed as follows:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciAdditions to Chapter 4 of the OpenGL 1.5 Specification (Per-Fragment
5bd8deadSopenharmony_ciOperations and the Frame Buffer)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    None.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciAdditions to Chapter 5 of the OpenGL 1.5 Specification (Special Functions)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    None.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciAdditions to Chapter 6 of the OpenGL 1.5 Specification (State and
5bd8deadSopenharmony_ciState Requests)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Modify Section 6.1.12 of the ARB_vertex_program specification.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (Add new integer program parameter queries, plus language that program
5bd8deadSopenharmony_ci    environment or local parameter query results are undefined if the query
5bd8deadSopenharmony_ci    specifies a data type incompatible with the data type of the parameter
5bd8deadSopenharmony_ci    being queried.)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The commands
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      void GetProgramEnvParameterdvARB(enum target, uint index,
5bd8deadSopenharmony_ci                                       double *params);
5bd8deadSopenharmony_ci      void GetProgramEnvParameterfvARB(enum target, uint index,
5bd8deadSopenharmony_ci                                       float *params);
5bd8deadSopenharmony_ci      void GetProgramEnvParameterIivNV(enum target, uint index,
5bd8deadSopenharmony_ci                                       int *params);
5bd8deadSopenharmony_ci      void GetProgramEnvParameterIuivNV(enum target, uint index,
5bd8deadSopenharmony_ci                                        uint *params);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    obtain the current value for the program environment parameter numbered
5bd8deadSopenharmony_ci    <index> for the given program target <target>, and place the information
5bd8deadSopenharmony_ci    in the array <params>.  The values returned are undefined if the data type
5bd8deadSopenharmony_ci    of the components of the parameter is not compatible with the data type of
5bd8deadSopenharmony_ci    <params>.  Floating-point components are compatible with "double" or
5bd8deadSopenharmony_ci    "float"; signed and unsigned integer components are compatible with "int"
5bd8deadSopenharmony_ci    and "uint", respectively.  The error INVALID_ENUM is generated if <target>
5bd8deadSopenharmony_ci    specifies a nonexistent program target or a program target that does not
5bd8deadSopenharmony_ci    support program environment parameters.  The error INVALID_VALUE is
5bd8deadSopenharmony_ci    generated if <index> is greater than or equal to the
5bd8deadSopenharmony_ci    implementation-dependent number of supported program environment
5bd8deadSopenharmony_ci    parameters for the program target.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    ...
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The commands
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      void GetProgramLocalParameterdvARB(enum target, uint index,
5bd8deadSopenharmony_ci                                         double *params);
5bd8deadSopenharmony_ci      void GetProgramLocalParameterfvARB(enum target, uint index,
5bd8deadSopenharmony_ci                                         float *params);
5bd8deadSopenharmony_ci      void GetProgramLocalParameterIivNV(enum target, uint index,
5bd8deadSopenharmony_ci                                         int *params);
5bd8deadSopenharmony_ci      void GetProgramLocalParameterIuivNV(enum target, uint index,
5bd8deadSopenharmony_ci                                          uint *params);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    obtain the current value for the program local parameter numbered <index>
5bd8deadSopenharmony_ci    belonging to the program object currently bound to <target>, and place
5bd8deadSopenharmony_ci    the information in the array <params>.  The values returned are undefined
5bd8deadSopenharmony_ci    if the data type of the components of the parameter is not compatible with
5bd8deadSopenharmony_ci    the data type of <params>.  Floating-point components are compatible with
5bd8deadSopenharmony_ci    "double" or "float"; signed and unsigned integer components are compatible
5bd8deadSopenharmony_ci    with "int" and "uint", respectively.  The error INVALID_ENUM is generated
5bd8deadSopenharmony_ci    if <target> specifies a nonexistent program target or a program target
5bd8deadSopenharmony_ci    that does not support program local parameters.  The error INVALID_VALUE
5bd8deadSopenharmony_ci    is generated if <index> is greater than or equal to the
5bd8deadSopenharmony_ci    implementation-dependent number of supported program local parameters for
5bd8deadSopenharmony_ci    the program target.
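
    The data type compatibility rule shared by the environment and local
    parameter queries can be summarized as a small predicate.  This C
    sketch is illustrative only; the enum and function names are
    hypothetical and are not part of the GL API.

```c
#include <stdbool.h>

/* Data type of the components currently stored in a parameter. */
typedef enum { COMPONENT_FLOAT, COMPONENT_INT, COMPONENT_UINT } component_type;

/* Data type of <params> implied by the query command suffix:
 * dvARB -> double, fvARB -> float, IivNV -> int, IuivNV -> uint. */
typedef enum { QUERY_DOUBLE, QUERY_FLOAT, QUERY_INT, QUERY_UINT } query_type;

/* Returns true when the queried values are well defined; an
 * incompatible pairing returns undefined values. */
static bool query_result_is_defined(component_type stored, query_type query)
{
    switch (stored) {
    case COMPONENT_FLOAT:
        return query == QUERY_DOUBLE || query == QUERY_FLOAT;
    case COMPONENT_INT:
        return query == QUERY_INT;
    case COMPONENT_UINT:
        return query == QUERY_UINT;
    }
    return false;
}
```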
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    ...
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The command
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      void GetProgramivARB(enum target, enum pname, int *params);
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    obtains program state for the program target <target>, writing ...
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (add new paragraphs describing the new supported queries)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If <pname> is PROGRAM_ATTRIB_COMPONENTS_NV or
5bd8deadSopenharmony_ci    PROGRAM_RESULT_COMPONENTS_NV, GetProgramivARB returns a single integer
5bd8deadSopenharmony_ci    holding the number of active attribute or result variable components,
5bd8deadSopenharmony_ci    respectively, used by the program object currently bound to <target>.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If <pname> is MAX_PROGRAM_ATTRIB_COMPONENTS_NV or
5bd8deadSopenharmony_ci    MAX_PROGRAM_RESULT_COMPONENTS_NV, GetProgramivARB returns a single integer
5bd8deadSopenharmony_ci    holding the maximum number of active attribute or result variable
5bd8deadSopenharmony_ci    components, respectively, supported for programs of type <target>.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciAdditions to Appendix A of the OpenGL 1.5 Specification (Invariance)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    None.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciAdditions to the AGL/GLX/WGL Specifications
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    None.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciGLX Protocol
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The following new rendering commands are sent to the server as part
5bd8deadSopenharmony_ci    of a glXRender request.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    ProgramLocalParameterI4ivNV
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        2           28               rendering command length
5bd8deadSopenharmony_ci        2           4303             rendering command opcode
5bd8deadSopenharmony_ci        4           ENUM             target
5bd8deadSopenharmony_ci        4           CARD32           index
5bd8deadSopenharmony_ci        4           INT32            params[0]
5bd8deadSopenharmony_ci        4           INT32            params[1]
5bd8deadSopenharmony_ci        4           INT32            params[2]
5bd8deadSopenharmony_ci        4           INT32            params[3]
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    ProgramLocalParameterI4uivNV
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        2           28               rendering command length
5bd8deadSopenharmony_ci        2           4305             rendering command opcode
5bd8deadSopenharmony_ci        4           ENUM             target
5bd8deadSopenharmony_ci        4           CARD32           index
5bd8deadSopenharmony_ci        4           CARD32           params[0]
5bd8deadSopenharmony_ci        4           CARD32           params[1]
5bd8deadSopenharmony_ci        4           CARD32           params[2]
5bd8deadSopenharmony_ci        4           CARD32           params[3]
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    ProgramEnvParameterI4ivNV
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        2           28               rendering command length
5bd8deadSopenharmony_ci        2           4307             rendering command opcode
5bd8deadSopenharmony_ci        4           ENUM             target
5bd8deadSopenharmony_ci        4           CARD32           index
5bd8deadSopenharmony_ci        4           INT32            params[0]
5bd8deadSopenharmony_ci        4           INT32            params[1]
5bd8deadSopenharmony_ci        4           INT32            params[2]
5bd8deadSopenharmony_ci        4           INT32            params[3]
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    ProgramEnvParameterI4uivNV
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        2           28               rendering command length
5bd8deadSopenharmony_ci        2           4309             rendering command opcode
5bd8deadSopenharmony_ci        4           ENUM             target
5bd8deadSopenharmony_ci        4           CARD32           index
5bd8deadSopenharmony_ci        4           CARD32           params[0]
5bd8deadSopenharmony_ci        4           CARD32           params[1]
5bd8deadSopenharmony_ci        4           CARD32           params[2]
5bd8deadSopenharmony_ci        4           CARD32           params[3]
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The following new rendering commands can be sent as part of either a
5bd8deadSopenharmony_ci    glXRender or a glXRenderLarge request.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    ProgramLocalParametersI4ivNV
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        2           16+count*4*4     rendering command length
5bd8deadSopenharmony_ci        2           4304             rendering command opcode
5bd8deadSopenharmony_ci        4           ENUM             target
5bd8deadSopenharmony_ci        4           CARD32           index
5bd8deadSopenharmony_ci        4           CARD32           count
5bd8deadSopenharmony_ci        4*count*4   LISTofINT32      params
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If the command is encoded in a glXRenderLarge request, the
5bd8deadSopenharmony_ci    command opcode and command length fields above are expanded to
5bd8deadSopenharmony_ci    4 bytes each:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        4           20+count*4*4     rendering command length
5bd8deadSopenharmony_ci        4           4304             rendering command opcode
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    ProgramLocalParametersI4uivNV
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        2           16+count*4*4     rendering command length
5bd8deadSopenharmony_ci        2           4306             rendering command opcode
5bd8deadSopenharmony_ci        4           ENUM             target
5bd8deadSopenharmony_ci        4           CARD32           index
5bd8deadSopenharmony_ci        4           CARD32           count
5bd8deadSopenharmony_ci        4*count*4   LISTofCARD32     params
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If the command is encoded in a glXRenderLarge request, the
5bd8deadSopenharmony_ci    command opcode and command length fields above are expanded to
5bd8deadSopenharmony_ci    4 bytes each:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        4           20+count*4*4     rendering command length
5bd8deadSopenharmony_ci        4           4306             rendering command opcode
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    ProgramEnvParametersI4ivNV
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        2           16+count*4*4     rendering command length
5bd8deadSopenharmony_ci        2           4308             rendering command opcode
5bd8deadSopenharmony_ci        4           ENUM             target
5bd8deadSopenharmony_ci        4           CARD32           index
5bd8deadSopenharmony_ci        4           CARD32           count
5bd8deadSopenharmony_ci        4*count*4   LISTofINT32      params
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If the command is encoded in a glXRenderLarge request, the
5bd8deadSopenharmony_ci    command opcode and command length fields above are expanded to
5bd8deadSopenharmony_ci    4 bytes each:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        4           20+count*4*4     rendering command length
5bd8deadSopenharmony_ci        4           4308             rendering command opcode
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    ProgramEnvParametersI4uivNV
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        2           16+count*4*4     rendering command length
5bd8deadSopenharmony_ci        2           4310             rendering command opcode
5bd8deadSopenharmony_ci        4           ENUM             target
5bd8deadSopenharmony_ci        4           CARD32           index
5bd8deadSopenharmony_ci        4           CARD32           count
5bd8deadSopenharmony_ci        4*count*4   LISTofCARD32     params
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If the command is encoded in a glXRenderLarge request, the
5bd8deadSopenharmony_ci    command opcode and command length fields above are expanded to
5bd8deadSopenharmony_ci    4 bytes each:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        4           20+count*4*4     rendering command length
5bd8deadSopenharmony_ci        4           4310             rendering command opcode
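
    To make the variable-length layout concrete, the following C sketch
    packs a ProgramLocalParametersI4ivNV command in its small (glXRender)
    form.  It is a sketch under stated assumptions, not a GLX
    implementation: the function name is hypothetical, fields are written
    in host byte order, and only the width of the 16-bit length field is
    checked, not any server-imposed request size limit.

```c
#include <stdint.h>
#include <string.h>

/* Rendering command opcode from the table above. */
#define OPCODE_PROGRAM_LOCAL_PARAMETERS_I4IV_NV 4304u

/* Writes the command into buf and returns the number of bytes written,
 * or 0 if the command does not fit in buf or in the 16-bit length field.
 * params must hold 4*count values. */
static size_t encode_local_params_i4iv(uint8_t *buf, size_t cap,
                                       uint32_t target, uint32_t index,
                                       uint32_t count, const int32_t *params)
{
    /* Largest count whose total length still fits in 16 bits. */
    const uint32_t max_count = (UINT16_MAX - 16u) / 16u;
    if (count > max_count)
        return 0;

    size_t len = 16u + (size_t)count * 4u * 4u;   /* 16+count*4*4 */
    if (len > cap)
        return 0;

    uint16_t len16 = (uint16_t)len;
    uint16_t op16  = (uint16_t)OPCODE_PROGRAM_LOCAL_PARAMETERS_I4IV_NV;
    memcpy(buf + 0,  &len16,  2);   /* rendering command length */
    memcpy(buf + 2,  &op16,   2);   /* rendering command opcode */
    memcpy(buf + 4,  &target, 4);
    memcpy(buf + 8,  &index,  4);
    memcpy(buf + 12, &count,  4);
    if (count > 0)
        memcpy(buf + 16, params, (size_t)count * 16u);
    return len;
}
```

    With count = 2, the function writes 16 + 2*4*4 = 48 bytes.  The
    glXRenderLarge form differs only in widening the length and opcode
    fields to 4 bytes each, which adds 4 bytes to the total.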
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The remaining commands are non-rendering commands.  These commands
5bd8deadSopenharmony_ci    are sent separately (i.e., not as part of a glXRender or
5bd8deadSopenharmony_ci    glXRenderLarge request), using the glXVendorPrivateWithReply
5bd8deadSopenharmony_ci    request:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    GetProgramLocalParameterIivNV
5bd8deadSopenharmony_ci        1           CARD8            opcode (X assigned)
5bd8deadSopenharmony_ci        1           17               GLX opcode (X_GLXVendorPrivateWithReply)
5bd8deadSopenharmony_ci        2           5                request length
5bd8deadSopenharmony_ci        4           1365             vendor specific opcode
5bd8deadSopenharmony_ci        4           GLX_CONTEXT_TAG  context tag
5bd8deadSopenharmony_ci        4           ENUM             target
5bd8deadSopenharmony_ci        4           CARD32           index
5bd8deadSopenharmony_ci      =>
5bd8deadSopenharmony_ci        1           1                reply
5bd8deadSopenharmony_ci        1           CARD8            unused
5bd8deadSopenharmony_ci        2           CARD16           sequence number
5bd8deadSopenharmony_ci        4           4                reply length
5bd8deadSopenharmony_ci        24          CARD32           unused
5bd8deadSopenharmony_ci        16          INT32            params
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    GetProgramLocalParameterIuivNV
5bd8deadSopenharmony_ci        1           CARD8            opcode (X assigned)
5bd8deadSopenharmony_ci        1           17               GLX opcode (X_GLXVendorPrivateWithReply)
5bd8deadSopenharmony_ci        2           5                request length
5bd8deadSopenharmony_ci        4           1366             vendor specific opcode
5bd8deadSopenharmony_ci        4           GLX_CONTEXT_TAG  context tag
5bd8deadSopenharmony_ci        4           ENUM             target
5bd8deadSopenharmony_ci        4           CARD32           index
5bd8deadSopenharmony_ci      =>
5bd8deadSopenharmony_ci        1           1                reply
5bd8deadSopenharmony_ci        1           CARD8            unused
5bd8deadSopenharmony_ci        2           CARD16           sequence number
5bd8deadSopenharmony_ci        4           4                reply length
5bd8deadSopenharmony_ci        24          CARD32           unused
5bd8deadSopenharmony_ci        16          CARD32           params
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    GetProgramEnvParameterIivNV
5bd8deadSopenharmony_ci        1           CARD8            opcode (X assigned)
5bd8deadSopenharmony_ci        1           17               GLX opcode (X_GLXVendorPrivateWithReply)
5bd8deadSopenharmony_ci        2           5                request length
5bd8deadSopenharmony_ci        4           1367             vendor specific opcode
5bd8deadSopenharmony_ci        4           GLX_CONTEXT_TAG  context tag
5bd8deadSopenharmony_ci        4           ENUM             target
5bd8deadSopenharmony_ci        4           CARD32           index
5bd8deadSopenharmony_ci      =>
5bd8deadSopenharmony_ci        1           1                reply
5bd8deadSopenharmony_ci        1           CARD8            unused
5bd8deadSopenharmony_ci        2           CARD16           sequence number
5bd8deadSopenharmony_ci        4           4                reply length
5bd8deadSopenharmony_ci        24          CARD32           unused
5bd8deadSopenharmony_ci        16          INT32            params
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    GetProgramEnvParameterIuivNV
5bd8deadSopenharmony_ci        1           CARD8            opcode (X assigned)
5bd8deadSopenharmony_ci        1           17               GLX opcode (X_GLXVendorPrivateWithReply)
5bd8deadSopenharmony_ci        2           5                request length
5bd8deadSopenharmony_ci        4           1368             vendor specific opcode
5bd8deadSopenharmony_ci        4           GLX_CONTEXT_TAG  context tag
5bd8deadSopenharmony_ci        4           ENUM             target
5bd8deadSopenharmony_ci        4           CARD32           index
5bd8deadSopenharmony_ci      =>
5bd8deadSopenharmony_ci        1           1                reply
5bd8deadSopenharmony_ci        1           CARD8            unused
5bd8deadSopenharmony_ci        2           CARD16           sequence number
5bd8deadSopenharmony_ci        4           4                reply length
5bd8deadSopenharmony_ci        24          CARD32           unused
5bd8deadSopenharmony_ci        16          CARD32           params
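
    The request and reply layouts above map onto fixed-size C structures,
    which makes the length fields easy to verify.  The sketch below is
    illustrative: the type, field, and function names are hypothetical,
    and the reply shown is the signed (IivNV) variant; the unsigned
    variants differ only in the type of the params field.

```c
#include <stdint.h>

/* Request shared by the four queries; only the vendor opcode differs
 * (1365, 1366, 1367, 1368 in the order listed above). */
typedef struct {
    uint8_t  major_opcode;   /* X-assigned GLX major opcode */
    uint8_t  glx_opcode;     /* 17: X_GLXVendorPrivateWithReply */
    uint16_t length;         /* request length in 4-byte units */
    uint32_t vendor_code;    /* vendor specific opcode */
    uint32_t context_tag;
    uint32_t target;
    uint32_t index;
} get_param_req;

typedef struct {
    uint8_t  type;           /* 1: reply */
    uint8_t  unused0;
    uint16_t sequence;
    uint32_t length;         /* data beyond 32 bytes, in 4-byte units */
    uint8_t  unused1[24];
    int32_t  params[4];
} get_param_int_reply;

/* The byte counts in the tables fix both sizes. */
_Static_assert(sizeof(get_param_req) == 5 * 4, "request is 5 words");
_Static_assert(sizeof(get_param_int_reply) == 32 + 4 * 4,
               "32-byte reply header plus 4 words of params");

static get_param_req make_get_param_req(uint8_t major_opcode,
                                        uint32_t vendor_code,
                                        uint32_t context_tag,
                                        uint32_t target, uint32_t index)
{
    get_param_req r;
    r.major_opcode = major_opcode;
    r.glx_opcode   = 17;
    r.length       = (uint16_t)(sizeof r / 4);   /* 5 */
    r.vendor_code  = vendor_code;
    r.context_tag  = context_tag;
    r.target       = target;
    r.index        = index;
    return r;
}
```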
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciErrors
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The error INVALID_VALUE is generated by ProgramLocalParameter4fARB,
5bd8deadSopenharmony_ci    ProgramLocalParameter4fvARB, ProgramLocalParameter4dARB,
5bd8deadSopenharmony_ci    ProgramLocalParameter4dvARB, ProgramLocalParameterI4iNV,
5bd8deadSopenharmony_ci    ProgramLocalParameterI4ivNV, ProgramLocalParameterI4uiNV,
5bd8deadSopenharmony_ci    ProgramLocalParameterI4uivNV, GetProgramLocalParameterfvARB,
5bd8deadSopenharmony_ci    GetProgramLocalParameterdvARB, GetProgramLocalParameterIivNV, and
5bd8deadSopenharmony_ci    GetProgramLocalParameterIuivNV if <index> is greater than or equal to the
5bd8deadSopenharmony_ci    number of program local parameters supported by <target>.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The error INVALID_VALUE is generated by ProgramEnvParameter4fARB,
5bd8deadSopenharmony_ci    ProgramEnvParameter4fvARB, ProgramEnvParameter4dARB,
5bd8deadSopenharmony_ci    ProgramEnvParameter4dvARB, ProgramEnvParameterI4iNV,
5bd8deadSopenharmony_ci    ProgramEnvParameterI4ivNV, ProgramEnvParameterI4uiNV,
5bd8deadSopenharmony_ci    ProgramEnvParameterI4uivNV, GetProgramEnvParameterfvARB,
5bd8deadSopenharmony_ci    GetProgramEnvParameterdvARB, GetProgramEnvParameterIivNV, and
5bd8deadSopenharmony_ci    GetProgramEnvParameterIuivNV if <index> is greater than or equal to the
5bd8deadSopenharmony_ci    number of program environment parameters supported by <target>.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The error INVALID_VALUE is generated by ProgramLocalParameters4fvNV,
5bd8deadSopenharmony_ci    ProgramLocalParametersI4ivNV, and ProgramLocalParametersI4uivNV if the sum
5bd8deadSopenharmony_ci    of <index> and <count> is greater than the number of program local
5bd8deadSopenharmony_ci    parameters supported by <target>.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    The error INVALID_VALUE is generated by ProgramEnvParameters4fvNV,
5bd8deadSopenharmony_ci    ProgramEnvParametersI4ivNV, and ProgramEnvParametersI4uivNV if the sum of
5bd8deadSopenharmony_ci    <index> and <count> is greater than the number of program environment
5bd8deadSopenharmony_ci    parameters supported by <target>.
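
    The two range checks above differ at the boundary: single-parameter
    commands reject <index> equal to the limit, while multi-parameter
    commands reject only when <index> + <count> exceeds it.  The C sketch
    below illustrates the difference; the function names are hypothetical,
    and <count> is treated as unsigned for simplicity.

```c
#include <stdbool.h>
#include <stdint.h>

/* Single-parameter setters and queries:
 * INVALID_VALUE when index >= max_params. */
static bool single_index_is_invalid(uint32_t index, uint32_t max_params)
{
    return index >= max_params;
}

/* Multi-parameter setters: INVALID_VALUE when index + count > max_params.
 * The sum is formed in 64 bits so that large arguments cannot wrap. */
static bool range_is_invalid(uint32_t index, uint32_t count,
                             uint32_t max_params)
{
    return (uint64_t)index + (uint64_t)count > (uint64_t)max_params;
}
```

    With 256 supported parameters, updating 6 parameters starting at index
    250 is accepted, while updating 7 from the same index is not.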
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciDependencies on NV_parameter_buffer_object
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If NV_parameter_buffer_object is not supported, references to program
5bd8deadSopenharmony_ci    parameter buffer variables and bindings should be removed.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciDependencies on ARB_texture_rectangle
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If ARB_texture_rectangle is not supported, references to rectangle
5bd8deadSopenharmony_ci    textures and the RECT and SHADOWRECT texture target identifiers should be
5bd8deadSopenharmony_ci    removed.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciDependencies on EXT_gpu_program_parameters
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If EXT_gpu_program_parameters is not supported, references to the
5bd8deadSopenharmony_ci    Program{Local,Env}Parameters4fvNV commands, which set multiple program
5bd8deadSopenharmony_ci    local or environment parameters in a single call, should be removed.
5bd8deadSopenharmony_ci    These prototypes were included in this spec for completeness only.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciDependencies on EXT_texture_integer
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If EXT_texture_integer is not supported, references to texture lookups
5bd8deadSopenharmony_ci    returning integer values in Section 2.X.4.4 (Texture Access) should be
5bd8deadSopenharmony_ci    removed, and all texture formats are considered to produce floating-point
5bd8deadSopenharmony_ci    values.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciDependencies on EXT_texture_array
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If EXT_texture_array is not supported, references to array textures in
5bd8deadSopenharmony_ci    Section 2.X.4.4 (Texture Access) and elsewhere should be removed, as
5bd8deadSopenharmony_ci    should all references to the "ARRAY1D", "ARRAY2D", "SHADOWARRAY1D", and
5bd8deadSopenharmony_ci    "SHADOWARRAY2D" tokens.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciDependencies on EXT_texture_buffer_object
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If EXT_texture_buffer_object is not supported, references to buffer
5bd8deadSopenharmony_ci    textures in Section 2.X.4.4 (Texture Access) and elsewhere should be
5bd8deadSopenharmony_ci    removed, as should all references to the "BUFFER" token.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciDependencies on NV_primitive_restart
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    If NV_primitive_restart is supported, index values causing a primitive
5bd8deadSopenharmony_ci    restart are not considered as specifying an End command, followed by
5bd8deadSopenharmony_ci    another Begin.  Primitive restart is therefore not guaranteed to
5bd8deadSopenharmony_ci    immediately update bindings for material properties changed inside a
5bd8deadSopenharmony_ci    Begin/End.  The spec language says they "are not guaranteed to update
5bd8deadSopenharmony_ci    program parameter bindings until the following End command."
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciNew State
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci                                                         Initial
5bd8deadSopenharmony_ci    Get Value                     Type  Get Command       Value  Description             Sec     Attrib
5bd8deadSopenharmony_ci    ----------------------------  ----  ---------------  ------- ----------------------  ------  ------
5bd8deadSopenharmony_ci    PROGRAM_ATTRIB_COMPONENTS_NV  Z+    GetProgramivARB     -    number of components    6.1.12   -
5bd8deadSopenharmony_ci                                                                 used for attributes
5bd8deadSopenharmony_ci    PROGRAM_RESULT_COMPONENTS_NV  Z+    GetProgramivARB     -    number of components    6.1.12   -
5bd8deadSopenharmony_ci                                                                 used for results
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Table X.20.  New Program Object State.  Program object queries return
5bd8deadSopenharmony_ci    attributes of the program object currently bound to the program target
5bd8deadSopenharmony_ci    <target>.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciNew Implementation Dependent State
5bd8deadSopenharmony_ci
                                                             Minimum
    Get Value                         Type  Get Command       Value   Description            Sec.     Attrib
    --------------------------------  ----  ---------------  -------  ---------------------  -------  ------
    MIN_PROGRAM_TEXEL_OFFSET_EXT      Z     GetIntegerv        -8     minimum texel offset   2.X.4.4    -
                                                                      allowed in lookup
    MAX_PROGRAM_TEXEL_OFFSET_EXT      Z     GetIntegerv        +7     maximum texel offset   2.X.4.4    -
                                                                      allowed in lookup
    MAX_PROGRAM_ATTRIB_COMPONENTS_NV  Z+    GetProgramivARB    (*)    maximum number of      6.1.12     -
                                                                      components allowed
                                                                      for attributes
    MAX_PROGRAM_RESULT_COMPONENTS_NV  Z+    GetProgramivARB    (*)    maximum number of      6.1.12     -
                                                                      components allowed
                                                                      for results
    MAX_PROGRAM_GENERIC_ATTRIBS_NV    Z+    GetProgramivARB    (*)    number of generic      6.1.12     -
                                                                      attribute vectors
                                                                      supported
    MAX_PROGRAM_GENERIC_RESULTS_NV    Z+    GetProgramivARB    (*)    number of generic      6.1.12     -
                                                                      result vectors
                                                                      supported
    MAX_PROGRAM_CALL_DEPTH_NV         Z+    GetProgramivARB     4     maximum program        2.X.5      -
                                                                      call stack depth
    MAX_PROGRAM_IF_DEPTH_NV           Z+    GetProgramivARB     48    maximum program        2.X.5      -
                                                                      if nesting
    MAX_PROGRAM_LOOP_DEPTH_NV         Z+    GetProgramivARB     4     maximum program        2.X.5      -
                                                                      loop nesting
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Table X.21:  New Implementation-Dependent Values Introduced by
5bd8deadSopenharmony_ci    NV_gpu_program4.  (*) means that the required minimum is program
5bd8deadSopenharmony_ci    type-specific.  There are separate limits for each program type.
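
    As an illustration only, the required minimums in Table X.21 can be
    captured in a small C helper that checks whether a program's texel
    offsets and nesting depths stay within what every implementation must
    support.  The struct and function names below are hypothetical and are
    not part of this extension; a real application would query the actual
    limits with GetIntegerv and GetProgramivARB rather than hard-coding them.

```c
#include <stdbool.h>

/* Hypothetical container for limits from Table X.21 (not a GL type). */
struct program_limits {
    int min_texel_offset;   /* MIN_PROGRAM_TEXEL_OFFSET_EXT */
    int max_texel_offset;   /* MAX_PROGRAM_TEXEL_OFFSET_EXT */
    int max_call_depth;     /* MAX_PROGRAM_CALL_DEPTH_NV    */
    int max_if_depth;       /* MAX_PROGRAM_IF_DEPTH_NV      */
    int max_loop_depth;     /* MAX_PROGRAM_LOOP_DEPTH_NV    */
};

/* Minimum values every implementation must support, per Table X.21. */
static const struct program_limits required_minimums = { -8, 7, 4, 48, 4 };

/* True if a texel offset lies within the limits (inclusive). */
static bool texel_offset_ok(const struct program_limits *lim, int offset)
{
    return offset >= lim->min_texel_offset && offset <= lim->max_texel_offset;
}

/* True if the given nesting depths all fit within the limits. */
static bool nesting_ok(const struct program_limits *lim,
                       int call_depth, int if_depth, int loop_depth)
{
    return call_depth <= lim->max_call_depth &&
           if_depth   <= lim->max_if_depth &&
           loop_depth <= lim->max_loop_depth;
}
```

    A program that passes both checks against required_minimums stays within
    the limits listed with fixed values in the table.  The entries marked (*)
    are program type-specific and are not covered by this sketch.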
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciIssues
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (1) How does this extension differ from previous NV_vertex_program and
5bd8deadSopenharmony_ci    NV_fragment_program extensions?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      RESOLVED:
5bd8deadSopenharmony_ci
        - This extension provides a uniform set of instructions and bindings.
          Unlike previous extensions, the set of instructions and bindings
          available is generally the same for all program types.  The only
          exceptions are a small number of instructions and bindings that
          make sense for only one specific program type.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        - This extension supports integer data types and provides a
5bd8deadSopenharmony_ci          full-fledged integer instruction set.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        - This extension supports array variables of all types, including
5bd8deadSopenharmony_ci          temporaries.  Array variables can be accessed directly or indirectly
5bd8deadSopenharmony_ci          (using integer temporaries as indices).
5bd8deadSopenharmony_ci
        - This extension provides a uniform set of structured branching
          constructs (if tests, loops, subroutines) that fully support
          run-time condition testing.  Previous versions of NV_vertex_program
          provided unstructured branching.  Previous versions of
          NV_fragment_program provided structured branching constructs, but
          the support was more limited -- for example, looping constructs
          couldn't specify loop counts with values computed at run time.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        - This extension supports geometry programs, which are described in
5bd8deadSopenharmony_ci          more detail in the NV_geometry_program4 extension.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        - This extension provides the ability to specify and use cubemap
5bd8deadSopenharmony_ci          textures with a DEPTH_COMPONENT internal format.  Shadow mapping is
5bd8deadSopenharmony_ci          supported; the Q texture coordinate is used as the reference value
5bd8deadSopenharmony_ci          for comparisons.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (2) Is this extension backward-compatible with previous NV_vertex_program
5bd8deadSopenharmony_ci    and NV_fragment_program extensions?  If not, what support has been
5bd8deadSopenharmony_ci    removed?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      RESOLVED:  This extension is largely, but not completely,
5bd8deadSopenharmony_ci      backward-compatible.  Functionality removed includes:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        - Unstructured branching:  NV_vertex_program2 included a general
5bd8deadSopenharmony_ci          branch instruction "BRA" that could be used to jump to an arbitrary
5bd8deadSopenharmony_ci          instruction.  The "CAL" instruction could "call" to an arbitrary
5bd8deadSopenharmony_ci          instruction into code that was not necessarily structured as simple
5bd8deadSopenharmony_ci          subroutine blocks.  Arbitrary unstructured branching can be
5bd8deadSopenharmony_ci          difficult to implement efficiently on highly parallel GPU
5bd8deadSopenharmony_ci          architectures, while basic structured branching is not nearly as
5bd8deadSopenharmony_ci          difficult.
5bd8deadSopenharmony_ci
          This extension retains the "CAL" instruction but treats each block
          of code between instruction labels as a separate subroutine.  The
          "BRA" instruction and arbitrary branching have been removed.  The
          structured branching constructs in this extension are sufficient to
          implement almost all of the looping/branching support in high-level
          languages ("goto" being the most obvious exception).
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        - Address registers:  NV_vertex_program added the notion of address
5bd8deadSopenharmony_ci          registers, which were effectively under-powered integer temporaries.
5bd8deadSopenharmony_ci          The set of instructions used to manipulate address registers was
5bd8deadSopenharmony_ci          severely limited.  NV_vertex_program[23] extended the original
5bd8deadSopenharmony_ci          scalars to vectors and added a few more instructions to manipulate
5bd8deadSopenharmony_ci          address registers.  Fragment programs had no address registers until
5bd8deadSopenharmony_ci          NV_fragment_program2 added the loop counter, which was very similar
5bd8deadSopenharmony_ci          in functionality to vertex program address registers, but even more
5bd8deadSopenharmony_ci          limited.  This extension adds true integer temporaries, which can
5bd8deadSopenharmony_ci          accomplish everything old address registers could do, and much more.
5bd8deadSopenharmony_ci          Address register support was removed to simplify the API.
5bd8deadSopenharmony_ci
        - NV_fragment_program2 LOOP construct:  NV_fragment_program2 added a
          LOOP instruction, which let you repeat a block of code <N> times,
          with a parallel loop counter that started at <A> and stepped by <B>
          on each iteration.  This construct was significantly limited in
          several ways -- the loop count had to be constant, and you could
          only access the innermost loop counter in a nested loop.  This
          extension discards that support and retains the simpler "REP"
          construct to implement loops.  If desired, a loop counter can be
          implemented by manipulating an integer temporary.  The "BRK"
          instruction (conditional break) is retained, and a "CONT"
          instruction (conditional continue) is added.  Additionally, the loop
          count need not be a constant.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        - NV_vertex_program and ARB_vertex_program EXP and LOG instructions:
5bd8deadSopenharmony_ci          NV_vertex_program provided EXP and LOG instructions that computed a
5bd8deadSopenharmony_ci          rough approximation of 2^x or log_2(x) and provided some additional
5bd8deadSopenharmony_ci          values that could help refine the approximation.  Those opcodes were
5bd8deadSopenharmony_ci          carried forward into ARB_vertex_program.  Both ARB_vertex_program
5bd8deadSopenharmony_ci          and NV_vertex_program2 provided EX2 and LG2 instructions that
5bd8deadSopenharmony_ci          computed a better approximation.  All fragment program extensions
5bd8deadSopenharmony_ci          also provided EX2 and LG2, but did not bother to include EXP and
5bd8deadSopenharmony_ci          LOG.  On the hardware targeted by this extension, there is no
5bd8deadSopenharmony_ci          advantage to using EXP and LOG, so these opcodes have been removed
5bd8deadSopenharmony_ci          for simplicity.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        - NV_vertex_program3 and NV_fragment_program2 provide the ability to
5bd8deadSopenharmony_ci          do indirect addressing of inputs/outputs when using bindings in
5bd8deadSopenharmony_ci          instructions -- for example:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci            MOV R0, vertex.attrib[A0.x+2];      # vertex
5bd8deadSopenharmony_ci            MOV result.texcoord[A0.y], R1;      # vertex
5bd8deadSopenharmony_ci            MOV R2, fragment.texcoord[A0.x];    # fragment
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci          This extension provides indexing capability, but using named array
5bd8deadSopenharmony_ci          variables instead.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci            ATTRIB attribs[] = { vertex.attrib[2..5] };
5bd8deadSopenharmony_ci            MOV R0, attribs[A0.x];
5bd8deadSopenharmony_ci            OUTPUT outcoords[] = { result.texcoord[0..3] };
5bd8deadSopenharmony_ci            MOV outcoords[A0.y], R1;
5bd8deadSopenharmony_ci            ATTRIB texcoords[] = { fragment.texcoord[0..2] };
5bd8deadSopenharmony_ci            MOV R2, texcoords[A0.x];
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci          This approach makes the set of attribute and result bindings more
5bd8deadSopenharmony_ci          regular.  Additionally, it helps the assembler determine which
5bd8deadSopenharmony_ci          vertex/fragment attributes are actually needed -- when the assembler
5bd8deadSopenharmony_ci          sees constructs like "fragment.texcoord[A0.x]", it must treat *all*
5bd8deadSopenharmony_ci          texture coordinates as live unless it can determine the range of
5bd8deadSopenharmony_ci          values used for indexing.  The named array variable approach
5bd8deadSopenharmony_ci          explicitly identifies which attributes are needed when indexing is
5bd8deadSopenharmony_ci          used.
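
      The liveness argument above can be sketched in C.  This is only an
      illustration of the bookkeeping an assembler might do, not a
      description of any actual implementation; the function names and the
      assumed count of 16 attribute slots are hypothetical.

```c
#include <stdint.h>

/* Assumed number of attribute slots, for illustration only. */
#define NUM_SLOTS 16

/* Liveness mask for a named array bound to slots first..last (inclusive). */
static uint32_t live_mask_for_range(unsigned first, unsigned last)
{
    uint32_t mask = 0;
    for (unsigned i = first; i <= last && i < NUM_SLOTS; i++)
        mask |= (uint32_t)1 << i;
    return mask;
}

/* Liveness mask when an index into the raw binding cannot be bounded. */
static uint32_t live_mask_unbounded(void)
{
    return ((uint32_t)1 << NUM_SLOTS) - 1;
}

/* Slot read by element <index> of a named array whose first slot is <first>. */
static unsigned array_element_slot(unsigned first, unsigned index)
{
    return first + index;
}
```

      With the "attribs" array from the example, bound to vertex.attrib[2..5],
      live_mask_for_range(2, 5) marks only slots 2 through 5 as live, while an
      unbounded index forces every slot to be treated as live.  Element 0 of
      the array corresponds to slot 2, which is why "attribs[A0.x]" reads the
      same attribute as the older "vertex.attrib[A0.x+2]".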
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Functionality altered includes:
5bd8deadSopenharmony_ci
        - The RSQ instruction in the original NV_vertex_program and
          ARB_vertex_program extensions implicitly took the absolute value of
          its operand.  Since the ARB extensions don't have numerics
          guarantees, computing the reciprocal square root of a negative value
          was not meaningful.  To allow for the possibility of taking the
          reciprocal square root of a negative value (which should yield NaN
          -- "not a number"), the RSQ instruction in this extension no
          longer implicitly takes the absolute value of its operand.
          Equivalent functionality can be achieved using the explicit |abs|
          absolute value operator on the operand to RSQ.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        - The results of texture lookups accessing inconsistent textures are
5bd8deadSopenharmony_ci          now undefined, instead of producing a fixed constant vector.
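
      For the LOOP construct removed above, the replacement pattern (a plain
      repeat count plus an explicitly maintained integer counter) can be
      emulated in C.  This is a host-side sketch of the semantics only, not
      program assembly; the function name and the output buffer are
      hypothetical.

```c
#include <stddef.h>

/*
 * Emulates "repeat n times with a counter starting at a and stepping by b"
 * using a repeat count and a separate integer variable, the way a REP loop
 * with an integer temporary would.  Visited counter values are stored in
 * out[] (at most cap of them); the number stored is returned.
 */
static size_t rep_with_counter(int n, int a, int b, int *out, size_t cap)
{
    size_t written = 0;
    int counter = a;                     /* integer temporary, set before REP */

    for (int rep = 0; rep < n; rep++) {  /* REP ... ENDREP                    */
        if (written == cap)
            break;                       /* plays the role of a BRK           */
        out[written++] = counter;        /* loop body reads the counter       */
        counter += b;                    /* explicit step at end of the body  */
    }
    return written;
}
```

      Because the counter is an ordinary variable, nested loops can each keep
      their own counter, and the repeat count n can be a value computed at
      run time.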
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (3) What should this set of extensions be called?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      RESOLVED:  NV_gpu_program4, NV_vertex_program4, NV_fragment_program4,
5bd8deadSopenharmony_ci      and NV_geometry_program4.  Only NV_gpu_program4 will appear in the
5bd8deadSopenharmony_ci      extension string; the other three specifications exist simply to define
5bd8deadSopenharmony_ci      vertex, fragment, and geometry program-specific features.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      The "gpu_program" name was chosen due to the common instruction set
5bd8deadSopenharmony_ci      intended to run on GPUs.  On previous chip generations, the vertex and
5bd8deadSopenharmony_ci      fragment instruction sets were similar, but there were enough
5bd8deadSopenharmony_ci      differences to package them separately.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      The choice of "4" indicates that this is the fourth generation of
5bd8deadSopenharmony_ci      programmable hardware from NVIDIA.  The GeForce3 and GeForce4 series
5bd8deadSopenharmony_ci      supported NV_vertex_program.  The GeForce FX series supported
5bd8deadSopenharmony_ci      NV_vertex_program2 and added fragment programmability with
5bd8deadSopenharmony_ci      NV_fragment_program.  Around this time, the OpenGL Architecture Review
5bd8deadSopenharmony_ci      Board (ARB) approved ARB_vertex_program and ARB_fragment_program
5bd8deadSopenharmony_ci      extensions, and NVIDIA added NV_vertex_program2_option and
5bd8deadSopenharmony_ci      NV_fragment_program_option extensions exposing GeForce FX features using
5bd8deadSopenharmony_ci      the ARB extensions' instruction set.  The GeForce6 and GeForce7 series
5bd8deadSopenharmony_ci      brought the NV_vertex_program3 and NV_fragment_program2 extensions,
5bd8deadSopenharmony_ci      which extend the ARB extensions further.  This extension adds geometry
5bd8deadSopenharmony_ci      programs, and brings the "version number" for each of these extensions
5bd8deadSopenharmony_ci      up to "4".
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
    (4) This extension adds integer data type support in programmable
    shaders that were previously float-centric.  Should applications be able
    to pass integer values directly to the shaders, and if so, how does it
    work?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      RESOLVED:  The diagram at the bottom of this issue depicts data flows in
5bd8deadSopenharmony_ci      the GL, as extended by this and related extensions.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      This extension generalizes some state to be "typeless", instead of being
5bd8deadSopenharmony_ci      strongly typed (and almost invariably floating-point) as in the core
5bd8deadSopenharmony_ci      specification.  We introduce a new set of functions to specify GL state
5bd8deadSopenharmony_ci      as signed or unsigned integer values, instead of floating point values.
5bd8deadSopenharmony_ci      These functions include:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        * VertexAttribI*{i,ui}() -- Specify generic vertex attributes as
5bd8deadSopenharmony_ci          integers.  This extension does not create "integer" versions for
5bd8deadSopenharmony_ci          fixed-function attribute functions (e.g., glColor, glTexCoord),
5bd8deadSopenharmony_ci          which remain fully floating-point.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        * Program{Env,Local}ParameterI*{i,ui}() -- Specify environment and
5bd8deadSopenharmony_ci          local parameters as integers.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        * TexImage*() with EXT_texture_integer internal formats -- Specify
5bd8deadSopenharmony_ci          texture images as containing integer data whose values are not
5bd8deadSopenharmony_ci          converted to floating-point values.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        * EXT_parameter_buffer_object functions -- Bind (typeless) buffer
5bd8deadSopenharmony_ci          object data stores for use as program parameters.  These buffer
5bd8deadSopenharmony_ci          objects can be loaded with either integer or floating-point data.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        * EXT_texture_buffer_object functions -- Bind (typeless) buffer object
5bd8deadSopenharmony_ci          data stores for use as textures.  These buffer objects can be loaded
5bd8deadSopenharmony_ci          with either integer or floating-point data.
5bd8deadSopenharmony_ci
      Each type of program (using NV_gpu_program4 and related extensions) can
      read attributes using any data type (float, signed integer, unsigned
      integer) and write result values used by subsequent stages using any
      data type.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Finally, there are several new places where integer data can be
5bd8deadSopenharmony_ci      consumed by the GL:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        * NV_transform_feedback -- Stream transformed vertex attribute
5bd8deadSopenharmony_ci          components to a (typeless) buffer object.  The transformed
5bd8deadSopenharmony_ci          attributes can be written as signed or unsigned integers in vertex
5bd8deadSopenharmony_ci          and geometry programs.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        * EXT_texture_integer internal formats and framebuffer objects --
5bd8deadSopenharmony_ci          Provide support for rendering to integer texture formats, where
5bd8deadSopenharmony_ci          final fragment values are treated as signed or unsigned integers,
5bd8deadSopenharmony_ci          rather than floating-point values.
5bd8deadSopenharmony_ci
      The diagram below represents a substantial portion of the GL pipeline.
      Each line connecting blocks represents an interface where data is
      "produced" from the GL state or by fixed-function or programmable
      pipeline stages and "consumed" by another pipeline stage.  Each producer
      and consumer is labeled with a data type.  For producers, the
      "(typeless)" designation generally means that the state and/or output
      can be written as floating-point values or as signed or unsigned
      integers.  "(float)" means that the outputs are always written as
      floating-point.  The same distinction applies to consumers --
      "(typeless)" means that the consumer is capable of reading inputs using
      any data type, and "(float)" means that the consumer always reads inputs
      as floating-point values.
5bd8deadSopenharmony_ci
      To get sane results, applications must ensure that each value passed
      between pipeline stages is produced and consumed using the same data
      type.  If a value is written in one stage as a floating-point value, it
      must be read as a floating-point value as well.  If such a value is read
      as a signed or unsigned integer, its value is considered undefined.  In
      practice, the raw bits used to represent the floating-point value (IEEE
      single-precision floating-point encoding in the initial implementation
      of this spec) will be treated as an integer.
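
      The "raw bits" behavior can be reproduced on the host CPU with a short
      C sketch, assuming a platform where float is a 32-bit IEEE
      single-precision value.  The helper names are hypothetical, and the
      sketch shows only what a mismatched read tends to look like in
      practice; the spec still leaves the result undefined.

```c
#include <stdint.h>
#include <string.h>

/* Returns the raw 32-bit encoding of a float as an unsigned integer. */
static uint32_t float_bits_as_uint(float f)
{
    uint32_t u;
    memcpy(&u, &f, sizeof u);   /* well-defined reinterpretation in C */
    return u;
}

/* Returns the float whose raw encoding is the given 32-bit pattern. */
static float uint_bits_as_float(uint32_t u)
{
    float f;
    memcpy(&f, &u, sizeof f);
    return f;
}
```

      For example, a value written as the float 1.0 and read back as an
      unsigned integer appears as 0x3F800000 (1065353216), not as 1.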
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Type matching between stages is not enforced by the GL, because the
5bd8deadSopenharmony_ci      overhead of doing so would be substantial.  Such overhead would include:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        * matching the inputs and outputs of each pipeline stage
5bd8deadSopenharmony_ci          (fixed-function or programmable) every time the program
5bd8deadSopenharmony_ci          configuration or fixed-function state changes,
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        * tracking the data type of each generic vertex attribute and checking
5bd8deadSopenharmony_ci          it against the vertex program's inputs,
5bd8deadSopenharmony_ci
        * tracking the data type of each program parameter and checking it
          against the manner in which the parameters are used in programs,
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        * matching color buffers against fragment program outputs.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Such error checking is certainly valuable, but the additional CPU
5bd8deadSopenharmony_ci      overhead cost is substantial.  Given that current CPUs often have a hard
5bd8deadSopenharmony_ci      time keeping up with high-end GPUs, adding more overhead is a step in
5bd8deadSopenharmony_ci      the wrong direction.  We expect developer tools, such as instrumented
5bd8deadSopenharmony_ci      drivers, to be able to provide type checking on most interfaces.
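
      The kind of check a developer tool might perform on one interface can
      be sketched in C as follows.  The enum and function are hypothetical
      and do not correspond to any GL API; they only show the comparison of
      produced and consumed types, slot by slot.

```c
#include <stddef.h>

/* Hypothetical type tags for values crossing a pipeline interface. */
enum value_type { TYPE_FLOAT, TYPE_INT, TYPE_UINT };

/*
 * Compares the type each slot was produced with against the type it is
 * consumed with.  Returns the index of the first mismatch, or -1 if every
 * slot matches.
 */
static int first_type_mismatch(const enum value_type *produced,
                               const enum value_type *consumed,
                               size_t count)
{
    for (size_t i = 0; i < count; i++)
        if (produced[i] != consumed[i])
            return (int)i;
    return -1;
}
```

      Running such a comparison whenever the program configuration or
      fixed-function state changes is the overhead this issue chooses not to
      impose on the GL itself.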
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      The diagram below depicts assembly programmability.  Using vertex,
5bd8deadSopenharmony_ci      geometry, and fragment shaders provided by the OpenGL Shading Language
5bd8deadSopenharmony_ci      (GLSL) isn't substantially different from the assembly interface, except
5bd8deadSopenharmony_ci      that the interfaces between programmable pipeline stages are more
5bd8deadSopenharmony_ci      tightly coupled in GLSL (vertex, geometry, and fragment shaders are
5bd8deadSopenharmony_ci      linked together into a single program object), and that shader variables
5bd8deadSopenharmony_ci      are more strongly typed in GLSL than in the assembly interface.
5bd8deadSopenharmony_ci
      In the figure below, the first programmable stage is vertex program
      execution.  All inputs read by the vertex program must be specified in
      the GL vertex APIs (immediate mode or vertex arrays) using a data type
      matching the data type read by the shader.  Additionally, vertex
      programs (and all other program types) can read program parameters,
      parameter buffers, and textures.  In all cases the parameter, buffer,
      or texture data must be accessed in the shader using the same data type
      used to specify the data.  If vertex programs are disabled,
      fixed-function vertex processing is used.  Fixed-function vertex
      processing is fully floating-point, and all the conventional vertex
      attributes and state used by fixed-function are floating-point values.
5bd8deadSopenharmony_ci
      After vertex processing, an optional geometry program can be executed,
      which reads attributes written by vertex programs (or fixed-function)
      and writes out new vertex attributes.  The vertex attributes it reads
      must have been written by the vertex program (or fixed-function) using a
      matching data type.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      After geometry program execution, vertex attributes can optionally be
5bd8deadSopenharmony_ci      written out to buffer objects using the NV_transform_feedback extension.
5bd8deadSopenharmony_ci      The vertex attributes are written by the GL to the buffer objects using
5bd8deadSopenharmony_ci      the same data type used to write the attribute in the geometry program
5bd8deadSopenharmony_ci      (or vertex program if geometry programs are disabled).
5bd8deadSopenharmony_ci
      Then, rasterization generates fragments based on transformed vertices.
      Most attributes written by vertex or geometry programs can be read by
      fragment programs, after the rasterization hardware "interpolates" them.
      This extension allows fragment programs to control how each attribute is
      interpolated.  If an attribute is flat-shaded, it will be taken from the
      output attribute of the provoking vertex of the primitive using the same
      data type.  If an attribute is smooth-shaded, the per-vertex attributes
      will be interpreted as floating-point values, and interpolation will
      produce a floating-point result.  One necessary consequence of this is
      that any integer per-fragment attributes must be flat-shaded.  To
      prevent some interpolation type errors, assembly and GLSL fragment
      shaders will not compile if they declare an integer fragment attribute
      that is not flat shaded.  [NOTE:  While point primitives generally have
      constant attributes, any integer attributes must still be flat-shaded;
      point rasterization may perform (degenerate) floating-point
      interpolation.]
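
      The difference between the two interpolation modes can be sketched in
      C for a single triangle.  This is a simplified model under stated
      assumptions: the provoking vertex index and the barycentric weights are
      supplied by the caller, perspective correction is ignored, and the
      function names are hypothetical.

```c
#include <stdint.h>

/* Flat shading: copy the provoking vertex's value without any conversion. */
static int32_t flat_shade_int(const int32_t v[3], int provoking)
{
    return v[provoking];
}

/*
 * Smooth shading: weighted sum of the per-vertex values, treated as floats.
 * The weights w[] are assumed to be barycentric (summing to 1).
 */
static float smooth_shade_float(const float v[3], const float w[3])
{
    return v[0] * w[0] + v[1] * w[1] + v[2] * w[2];
}
```

      Flat shading passes an integer through bit-for-bit, so an integer
      attribute survives intact.  Smooth shading performs floating-point
      arithmetic on its inputs, which is why integer attributes cannot use it.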
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Fragment programs must read attributes using data types matching the
5bd8deadSopenharmony_ci      outputs of the interpolation or flat-shading operations.  They may write
5bd8deadSopenharmony_ci      one or more color outputs using any data type, but the data type used
5bd8deadSopenharmony_ci      must match the corresponding framebuffer attachments.  Outputs directed
5bd8deadSopenharmony_ci      at signed or unsigned integer textures (EXT_texture_integer) must be
5bd8deadSopenharmony_ci      written using the appropriate integer data type; all other outputs must
5bd8deadSopenharmony_ci      be written as floating-point values.  Note that some of the
5bd8deadSopenharmony_ci      fixed-function per-fragment operations (e.g., blending, alpha test) are
5bd8deadSopenharmony_ci      specified as floating-point operations and are skipped when directed at
5bd8deadSopenharmony_ci      signed or unsigned integer color buffers.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci                                     generic               conventional
5bd8deadSopenharmony_ci                                     vertex                  vertex
5bd8deadSopenharmony_ci                                    attributes              attributes
5bd8deadSopenharmony_ci                                       | (typeless)             | (float)
5bd8deadSopenharmony_ci                                       |                        |
5bd8deadSopenharmony_ci                                       |                        |
5bd8deadSopenharmony_ci                                       | +----------------------+
5bd8deadSopenharmony_ci         program                       | |                      |
5bd8deadSopenharmony_ci        parameters ----+               | |                      |
5bd8deadSopenharmony_ci        (typeless)     |               | | (typeless)           | (float)
5bd8deadSopenharmony_ci                       |               V V                      V
5bd8deadSopenharmony_ci         constant      +-+----------> vertex              fixed-function
5bd8deadSopenharmony_ci         buffers   ----+ |(typeless)  program                 vertex
5bd8deadSopenharmony_ci        (typeless)     | |              |                       |
5bd8deadSopenharmony_ci                       | |              | (typeless)            | (float)
5bd8deadSopenharmony_ci         textures  ----+ |              V                       |
5bd8deadSopenharmony_ci        (typeless)       |              |<----------------------+
5bd8deadSopenharmony_ci            |            |              |
5bd8deadSopenharmony_ci            |            |              +---------------+
5bd8deadSopenharmony_ci            |            |              |               |
5bd8deadSopenharmony_ci            |            |              | (typeless)    |
5bd8deadSopenharmony_ci            |            |              V               |
5bd8deadSopenharmony_ci            |            +---------> geometry           |
5bd8deadSopenharmony_ci            |            |(typeless) program            |
5bd8deadSopenharmony_ci            |            |              |               |
5bd8deadSopenharmony_ci            |            |              | (typeless)    |
5bd8deadSopenharmony_ci            |            |              V               |
5bd8deadSopenharmony_ci            |            |              |<--------------+
5bd8deadSopenharmony_ci            |            |              |
5bd8deadSopenharmony_ci            |            |              |
5bd8deadSopenharmony_ci            |            |              +-----------------+
5bd8deadSopenharmony_ci            |            |              |                 |(typeless)
5bd8deadSopenharmony_ci            |            |              |                 v
5bd8deadSopenharmony_ci            |            |              |             transform
5bd8deadSopenharmony_ci            |            |              |             feedback
5bd8deadSopenharmony_ci            |            |              |              buffers
5bd8deadSopenharmony_ci            |            |              |
5bd8deadSopenharmony_ci            |            |              |
5bd8deadSopenharmony_ci            |            |              +-----------------------+
5bd8deadSopenharmony_ci            |            |              |                       |
5bd8deadSopenharmony_ci            |            |              | (float)               | (typeless)
5bd8deadSopenharmony_ci            |            |              V                       V
5bd8deadSopenharmony_ci            |            |         interpolated               flat
5bd8deadSopenharmony_ci            |            |          attributes             attributes
5bd8deadSopenharmony_ci            |            |              |                       |
5bd8deadSopenharmony_ci            |            |              | (float)               | (typeless)
5bd8deadSopenharmony_ci            |            |              V                       |
5bd8deadSopenharmony_ci            |            |              |<----------------------+
5bd8deadSopenharmony_ci            |            |              |
5bd8deadSopenharmony_ci            |            |              +-----------------------+
5bd8deadSopenharmony_ci            |            |              |                       |
5bd8deadSopenharmony_ci            |            |              | (typeless)            | (float)
5bd8deadSopenharmony_ci            |            |(typeless)    V                       V
5bd8deadSopenharmony_ci            |            +---------> fragment     +------> fixed-function
5bd8deadSopenharmony_ci            |                        program      |(float)   fragment
5bd8deadSopenharmony_ci            |                           |         |             |
5bd8deadSopenharmony_ci            +--------------------------/|/--------+             |
5bd8deadSopenharmony_ci                                        |                       |
5bd8deadSopenharmony_ci                                        | (typeless)            | (float)
5bd8deadSopenharmony_ci                                        V                       |
5bd8deadSopenharmony_ci                                        |<----------------------+
5bd8deadSopenharmony_ci                                        |
5bd8deadSopenharmony_ci                                        +-----------------------+------ ....
5bd8deadSopenharmony_ci                                        |                       |
5bd8deadSopenharmony_ci                                        | (typeless)            | (typeless)
5bd8deadSopenharmony_ci                                        V                       V
5bd8deadSopenharmony_ci                                      color                   color
5bd8deadSopenharmony_ci                                    attachment              attachment
5bd8deadSopenharmony_ci                                        0                       1
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci
    (5) Instructions can operate on signed integer, unsigned integer, and
    floating-point values.  Some operations make sense on all three data
    types.  How is this supported, and what type checking support is provided
    by the assembler?
5bd8deadSopenharmony_ci
      RESOLVED:  One important property of the instruction set is that the
      data type for all operands and the result is fully specified by the
      instructions themselves.  For instructions (such as ADD) that make sense
      for both integer and floating-point values, an optional data type
      modifier is provided to indicate which type of operation should be
      performed.  For example, "ADD.S", "ADD.U", and "ADD.F" add signed
      integers, unsigned integers, and floating-point values, respectively.
      If no data type modifier is provided, ".F" is assumed if the instruction
      can apply to floating-point values and ".S" is assumed otherwise.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      To help identify errors where the wrong data type is used -- for
5bd8deadSopenharmony_ci      example, adding integer values in an ADD instruction that omits a data
5bd8deadSopenharmony_ci      type modifier and thus defaults to "ADD.F" -- variables may be declared
5bd8deadSopenharmony_ci      with optional data type modifiers.  In the following code:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        INT TEMP a;
5bd8deadSopenharmony_ci        UINT TEMP b;
5bd8deadSopenharmony_ci        FLOAT TEMP c;
5bd8deadSopenharmony_ci        TEMP d;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      "a", "b", "c", and "d" are declared as temporary variables holding
5bd8deadSopenharmony_ci      signed integer, unsigned integer, floating-point, and typeless values.
5bd8deadSopenharmony_ci      Since each instruction fully specifies the data type of each operand and
5bd8deadSopenharmony_ci      its result, these data types can be checked against the data type
5bd8deadSopenharmony_ci      assigned to the variables operated on.  If the types don't match, and
5bd8deadSopenharmony_ci      the variable is not typeless, an error is reported.  The opcode modifier
5bd8deadSopenharmony_ci      ".NTC" can be used to ignore such errors on a per-opcode basis, if
5bd8deadSopenharmony_ci      required.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Note that when bindings are used directly in instructions, they are
5bd8deadSopenharmony_ci      always considered typeless for simplicity.  Some fixed-function bindings
5bd8deadSopenharmony_ci      have an obvious data type, but other bindings (e.g., program parameters)
5bd8deadSopenharmony_ci      can hold either integer or floating-point values, depending on how they
5bd8deadSopenharmony_ci      were specified.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Variable data types are optional.  Typeless variables are provided
5bd8deadSopenharmony_ci      because some programs may want to reuse the same variable in several
5bd8deadSopenharmony_ci      places with different data types.
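
      As an illustration only, the following C sketch models one typeless
      32-bit component and shows how the opcode modifier, rather than the
      variable, selects the interpretation of the bits.  The names used here
      (gp4_word, gp4_add_u, gp4_add_f, and the two conversion helpers) are
      hypothetical and are not part of this extension; a 32-bit IEEE "float"
      is assumed.

```c
#include <stdint.h>
#include <string.h>

/* One typeless 32-bit register component (an assumption of this sketch). */
typedef uint32_t gp4_word;

/* Reinterpret the same 32 bits as a float and back, without conversion. */
float gp4_as_float(gp4_word w)
{
    float f;
    memcpy(&f, &w, sizeof f);
    return f;
}

gp4_word gp4_from_float(float f)
{
    gp4_word w;
    memcpy(&w, &f, sizeof w);
    return w;
}

/* Models "ADD.U": the operands are treated as unsigned integers. */
gp4_word gp4_add_u(gp4_word a, gp4_word b)
{
    return a + b;
}

/* Models "ADD.F" (also the default for a plain "ADD"): the operands are
 * treated as floating-point values. */
gp4_word gp4_add_f(gp4_word a, gp4_word b)
{
    return gp4_from_float(gp4_as_float(a) + gp4_as_float(b));
}
```

      The bit pattern of 1.0 is 0x3F800000.  Adding it to itself with
      gp4_add_f yields 0x40000000 (2.0), while gp4_add_u yields 0x7F000000.
      Mixing the two by accident is the kind of mistake that typed variable
      declarations are meant to catch.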
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (6) Should both signed (INT) and unsigned integer (UINT) data types be
5bd8deadSopenharmony_ci    provided?
5bd8deadSopenharmony_ci
      RESOLVED:  Yes.  Signed and unsigned integer operations are supported.
      Providing both "INT" and "UINT" variable modifiers distinguishes between
      signed and unsigned values for type checking purposes, to ensure that
      unsigned values aren't read as signed values and vice versa.

      This specification says that if a value is read as a signed integer but
      was written as an unsigned integer, the value returned is undefined.
      However, signed and unsigned integers are interchangeable in practice,
      except for very large unsigned integers (which can't be represented as
      signed values of the equivalent size) or negative signed integers.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      If programs know that they won't generate negative or very large values,
5bd8deadSopenharmony_ci      signed and unsigned integers can be used interchangeably.  To avoid type
5bd8deadSopenharmony_ci      errors in the assembler in this case, typeless variables can be used.
5bd8deadSopenharmony_ci      Or the ".NTC" modifier can be used when appropriate.
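
      The "in practice" behavior can be sketched in C as follows.  This is
      only a model of a 32-bit two's complement machine; the specification
      itself leaves the cross-typed read undefined, and the function names
      are hypothetical.

```c
#include <stdbool.h>
#include <stdint.h>
#include <string.h>

/* Read a 32-bit pattern as a two's complement signed integer. */
int32_t gp4_read_signed(uint32_t bits)
{
    int32_t s;
    memcpy(&s, &bits, sizeof s);
    return s;
}

/* True when the pattern denotes the same number whether it is read as
 * INT or as UINT, i.e. when the value lies in [0, 2^31 - 1]. */
bool gp4_sign_agnostic(uint32_t bits)
{
    return (bits & 0x80000000u) == 0u;
}
```

      Any pattern with the most significant bit set is either a very large
      unsigned value or a negative signed value, which are exactly the two
      cases excluded above.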
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (7) Integer and floating-point constants are supported in the instruction
5bd8deadSopenharmony_ci    set.  Integer constants might be interpreted to mean either "real integer"
5bd8deadSopenharmony_ci    values or floating-point values.  How are they supported?
5bd8deadSopenharmony_ci
      RESOLVED:  When an obvious floating-point constant is specified (e.g.,
      "3.0"), the developer's intent is clear.  If you try to use a
      floating-point value in an instruction that wants an integer operand, or
      in a declaration of an integer parameter variable, the program will fail
      to load.  The meaning of an integer constant used in an instruction
      isn't quite as clear, but it can be easily inferred because the operand
      types of instructions are well-known at compile time.  An integer
      multiply involving the constant "2" will interpret the "2" as an
      integer.  A floating-point multiply involving the same constant "2" will
      interpret it as a floating-point value.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      The only real problem is for a parameter declaration that is typeless.
5bd8deadSopenharmony_ci      For typed variables, the intent is clear:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        INT PARAM two = 2;               # use integer 2
5bd8deadSopenharmony_ci        FLOAT PARAM twoPt0 = 2;          # use floating-point 2.0
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      For typeless variables, there's no context to go on:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        PARAM two = 2;                   # 2?  2.0?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      This extension is intended to be largely upward-compatible with
5bd8deadSopenharmony_ci      ARB_vertex_program, ARB_fragment_program, and the other extensions built
5bd8deadSopenharmony_ci      on top of them.  In all of these, the previous declaration is legal and
5bd8deadSopenharmony_ci      means "2.0".  For compatibility, we choose to interpret integer
5bd8deadSopenharmony_ci      constants in this case as floating-point values.  The assembler in the
5bd8deadSopenharmony_ci      NVIDIA implementation will issue a warning if this case ever occurs.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      This extension does not provide decoration of integer constant values --
5bd8deadSopenharmony_ci      we considered adding suffixed integers such as "2U" to mean "2, and
5bd8deadSopenharmony_ci      don't even think about converting me to a float!".  We expect that it
5bd8deadSopenharmony_ci      will be sufficient to use the "INT" or "FLOAT" modifiers to disambiguate
5bd8deadSopenharmony_ci      effectively.
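
      The resolution rule for an integer literal in a PARAM declaration can
      be sketched in C as shown here.  The enum and function names are
      hypothetical, and the sketch assumes 32-bit components and IEEE floats;
      it is not the assembler's actual implementation.

```c
#include <stdint.h>
#include <string.h>

/* Declared type of the PARAM variable that receives the literal. */
typedef enum { GP4_TYPELESS, GP4_INT, GP4_UINT, GP4_FLOAT } gp4_decl_type;

/* Bits stored for an integer literal such as "2". */
uint32_t gp4_encode_int_literal(gp4_decl_type type, int32_t literal)
{
    if (type == GP4_INT || type == GP4_UINT) {
        return (uint32_t)literal;   /* stays a "real integer" */
    }
    /* FLOAT, and typeless for compatibility with ARB_vertex_program:
     * the literal is converted to floating-point (2 becomes 2.0). */
    float f = (float)literal;
    uint32_t bits;
    memcpy(&bits, &f, sizeof bits);
    return bits;
}
```

      With this model, "INT PARAM two = 2;" stores 0x00000002, while both
      "FLOAT PARAM twoPt0 = 2;" and the typeless "PARAM two = 2;" store
      0x40000000.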
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (8) Should hexadecimal constants (e.g., 0x87A3 or 0xFFFFFFFF) be supported?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      RESOLVED:  Yes.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (9) Should we provide data type modifiers with explicit component sizes?
5bd8deadSopenharmony_ci    For example, "INT8", "FLOAT16", or "INT32".  If so, should we provide a
5bd8deadSopenharmony_ci    mechanism to query the size (in bits) of a variable, or of different
5bd8deadSopenharmony_ci    variable types/qualifiers?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      RESOLVED:  No.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (10) Should this extension provide better support for array variables?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      RESOLVED:  Yes; array variables of all types are allowed.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      In ARB_vertex_program, program parameter (constant) variables could be
5bd8deadSopenharmony_ci      addressed as arrays.  Temporary variables, vertex attributes, and vertex
5bd8deadSopenharmony_ci      results could not be declared as arrays.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      In NV_vertex_program3 and NV_fragment_program2, relative addressing was
5bd8deadSopenharmony_ci      supported in program bindings:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        MOV R0, vertex.attrib[A0.x];            # vertex
5bd8deadSopenharmony_ci        MOV result.texcoord[A0.x], R0;          # vertex
5bd8deadSopenharmony_ci        MOV R0, fragment.texcoord[A0.x];        # fragment -- inside LOOP
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Explicitly declared attribute or result arrays were not supported, and
5bd8deadSopenharmony_ci      temporaries could also not be arrays.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      This extension allows users to declare attribute, result, and temporary
5bd8deadSopenharmony_ci      arrays such as:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        ATTRIB attribs[] = { vertex.attrib[7..11] };
5bd8deadSopenharmony_ci        TEMP scratch[10];
5bd8deadSopenharmony_ci        RESULT texcoords[] = { result.texcoord[0..3] };
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Additionally, the relative addressing mechanisms provided by
5bd8deadSopenharmony_ci      NV_vertex_program3 and NV_fragment_program2 are NOT supported in this
5bd8deadSopenharmony_ci      extension -- instead, declared array variables are the only way to get
5bd8deadSopenharmony_ci      relative addressing.  Using declared arrays allows the assembler to
5bd8deadSopenharmony_ci      identify which attributes will actually be used.  An expression like
5bd8deadSopenharmony_ci      "vertex.texcoord[A0.x]" doesn't identify which texture coordinates are
5bd8deadSopenharmony_ci      referenced, and the assembler must be conservative in this case and
5bd8deadSopenharmony_ci      assume that they all are.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (11) Is relative addressing of temporaries allowed?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      RESOLVED:  Yes.  However, arrays of temporaries may end up being stored
5bd8deadSopenharmony_ci      in off-chip memory, and may be slower to access than non-array
5bd8deadSopenharmony_ci      temporaries.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (12) Should this extension add bindings to pass generic attributes between
5bd8deadSopenharmony_ci    vertex, geometry, and fragment programs, or are texture coordinates
5bd8deadSopenharmony_ci    sufficient?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      RESOLVED:  While texture coordinates have been used in the past, generic
5bd8deadSopenharmony_ci      attributes should be provided.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      The assembler provides a large set of bindings and automatically
5bd8deadSopenharmony_ci      eliminates generic attributes or components that are unused.  At each
5bd8deadSopenharmony_ci      interface between programs, there is an implementation-dependent limit
5bd8deadSopenharmony_ci      on the number of attribute components that can be passed.
5bd8deadSopenharmony_ci
      There are several reasons that this approach was chosen.  First, if the
      number of attributes that can be passed between program stages exceeds
      the number of existing texture coordinate sets supported when specifying
      vertices, a second implementation-dependent number of texture
      coordinates would need to be exposed to cover the number supported
      between stages.  Second, the mechanisms described above reduce or
      eliminate the need to pack attributes into four-component vectors.
      Third, "texture coordinates" that have been historically used for
      texture lookups don't need to be used to pass values that aren't used
      this way.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (13) The structured branching support in NV_fragment_program2 provides a
5bd8deadSopenharmony_ci    REP instruction that says to repeat a block of code <N> times, as well as
5bd8deadSopenharmony_ci    a LOOP instruction that does the same, but also provides a special loop
5bd8deadSopenharmony_ci    counter variable.  What sort of looping mechanism should we provide here?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      RESOLVED:  Provide only the REP instruction.  The functionality provided
5bd8deadSopenharmony_ci      by the LOOP instruction can be easily achieved by using an integer
5bd8deadSopenharmony_ci      temporary as the loop index.  This avoids two annoyances of the old LOOP
5bd8deadSopenharmony_ci      models:  (a) the loop index (A0.x) is a special variable name, while all
5bd8deadSopenharmony_ci      other variables are declared normally and (b) instructions can only
5bd8deadSopenharmony_ci      access the loop index of the innermost loop -- loop indices at higher
5bd8deadSopenharmony_ci      nesting levels are not accessible.
5bd8deadSopenharmony_ci
      One other option was considered -- a "LOOPV" instruction (LOOP with a
      variable), where the program specifies a variable name and component to
      hold the loop index instead of using the implicit variable name "A0.x".
      In the end, it was decided that using an integer temporary as a loop
      counter was sufficient.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (14) The structured branching support in NV_fragment_program2 provides a
5bd8deadSopenharmony_ci    REP instruction that requires a loop count.  Some looping constructs may
5bd8deadSopenharmony_ci    not have a definite loop count, such as a "while" statement in C.  Should
5bd8deadSopenharmony_ci    this construct be supported, and if so, how?
5bd8deadSopenharmony_ci
      RESOLVED:  The REP instruction is extended to make the loop count
      optional.  If no loop count is provided, the REP instruction specifies a
      loop that can only be exited using the BRK (break) or RET instructions.
      To avoid obvious infinite loops, an error will be reported if a
      REP/ENDREP block contains no BRK instruction at the current nesting
      level and no RET instruction at any nesting level.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      To implement a loop like "while (value < 7.0) ...", code such as the
5bd8deadSopenharmony_ci      following can be used:
5bd8deadSopenharmony_ci
        TEMP cc;                        # dummy variable
        REP;
          SLT.CC cc.x, value.x, 7.0;    # compare value.x to 7.0, set CC0
          BRK EQ.x;                     # SLT wrote 0.0 (not true): break out
          ...
          ...                           # presumably update value!
          ...
        ENDREP;
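
      For comparison, the same control flow is sketched in C: an unbounded
      loop whose first statement breaks out when the comparison is false.
      The function name and the "step" parameter are hypothetical; "step" is
      assumed to be positive so that the loop terminates.

```c
/* Counts how many times the body of "while (value < 7.0)" runs. */
int gp4_while_lt7(float value, float step)
{
    int iterations = 0;
    for (;;) {                      /* REP with no loop count          */
        if (!(value < 7.0f)) {      /* the comparison plus the BRK     */
            break;
        }
        value += step;              /* loop body updates value         */
        iterations++;
    }                               /* ENDREP                          */
    return iterations;
}
```

      Because the test sits at the top of the body, a value that already
      fails the comparison executes the body zero times, as a C "while"
      loop would.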
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (15) The structured branching support in NV_fragment_program2 provides a
5bd8deadSopenharmony_ci    BRK instruction that operates like C's "break" statement.  Should we
5bd8deadSopenharmony_ci    provide something similar to C's "continue" statement, which skips to the
5bd8deadSopenharmony_ci    next iteration of the loop?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      RESOLVED:  Yes, a new CONT opcode is provided for this purpose.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (16) Can the BRK or CONT instructions break out of multiple levels of
5bd8deadSopenharmony_ci    nested loops at once?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      RESOLVED:  No.  BRK and CONT only exit the current nesting level.  To
5bd8deadSopenharmony_ci      break out of multiple levels of nested loops, multiple BRK/CONT
5bd8deadSopenharmony_ci      instructions are required.
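
      The usual pattern for leaving two nested loops is sketched in C here:
      the inner exit records why it left, and the outer loop performs a
      second, explicit exit.  The function and the 3x3 grid are hypothetical
      and serve only to give the loops something to do.

```c
/* Returns the row-major index of the first cell equal to target, or -1. */
int gp4_find_cell(int grid[3][3], int target)
{
    int found = -1;
    for (int row = 0; row < 3; row++) {
        for (int col = 0; col < 3; col++) {
            if (grid[row][col] == target) {
                found = row * 3 + col;
                break;          /* inner exit: leaves only this loop  */
            }
        }
        if (found >= 0) {
            break;              /* outer exit: a second, separate one */
        }
    }
    return found;
}
```

      In a program, the "found" flag would correspond to a temporary (or a
      condition code) that the outer loop tests before its own BRK.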
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (17) For REP instructions, is the loop counter reloaded on each iteration
5bd8deadSopenharmony_ci    of the loop?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      RESOLVED:  No.  The loop counter is loaded once at the top of the loop,
5bd8deadSopenharmony_ci      compared to zero at the top of the loop, and decremented when each loop
5bd8deadSopenharmony_ci      iteration completes.  A program may overwrite the variable used to
5bd8deadSopenharmony_ci      specify the initial value of the loop counter inside the loop without
5bd8deadSopenharmony_ci      affecting the number of times the loop body is executed.
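
      A minimal C model of this behavior follows.  The function name is
      hypothetical, and the sketch assumes the loop is skipped when the
      count is zero; the authoritative rules are in the REP instruction
      description.

```c
/* Runs a counted loop whose body overwrites the variable that supplied
 * the count.  Returns the number of iterations actually executed. */
int gp4_rep_iterations(int *count_var)
{
    int counter = *count_var;   /* loaded once, before the first test  */
    int iterations = 0;
    while (counter > 0) {       /* compared to zero at the top         */
        *count_var = 100;       /* body clobbers the source variable   */
        iterations++;
        counter--;              /* decremented when the iteration ends */
    }
    return iterations;
}
```

      Starting with a count of 3, the body runs three times even though the
      source variable holds 100 after the first iteration.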
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (18) How are floating-point values represented in this extension?  What
5bd8deadSopenharmony_ci    about floating-point arithmetic operations?
5bd8deadSopenharmony_ci
      RESOLVED:  In the initial hardware implementation of this extension,
      floating-point values are represented using the standard 32-bit IEEE
      single-precision encoding, consisting of a sign bit, 8 exponent bits,
      and 23 mantissa bits.  Special encodings for NaN (not a number), +/-INF
      (infinity), and positive and negative zero are supported.  Denorms
      (nonzero values with a magnitude less than 2^-126, which have an
      exponent encoding of "0" and no implied leading one) are supported, but
      may be flushed to zero, preserving the sign bit of the original value.
      Arithmetic operations are carried out at single-precision using normal
      IEEE floating-point rules, including special rules for generating
      infinities, NaNs, and zeros of each sign.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Floating-point temporaries declared as "SHORT" may be, but are not
5bd8deadSopenharmony_ci      necessarily, stored as 16-bit "fp16" values (sign bit, five exponent
5bd8deadSopenharmony_ci      bits, ten mantissa bits), as specified in the NV_float_buffer and
5bd8deadSopenharmony_ci      ARB_half_float_pixel extensions.
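
      The single-precision layout and the optional denorm flush can be
      sketched in C on raw bit patterns.  The function names are
      hypothetical; whether a given implementation actually flushes is not
      specified here.

```c
#include <stdint.h>

/* Field extraction for the 1/8/23-bit single-precision layout. */
uint32_t gp4_f32_sign(uint32_t bits)     { return bits >> 31; }
uint32_t gp4_f32_exponent(uint32_t bits) { return (bits >> 23) & 0xFFu; }
uint32_t gp4_f32_mantissa(uint32_t bits) { return bits & 0x007FFFFFu; }

/* Flush a denorm (exponent field 0, non-zero mantissa) to a zero of the
 * same sign; every other encoding is returned unchanged. */
uint32_t gp4_flush_denorm(uint32_t bits)
{
    if (gp4_f32_exponent(bits) == 0u && gp4_f32_mantissa(bits) != 0u) {
        return bits & 0x80000000u;
    }
    return bits;
}
```

      The smallest normal value, 2^-126, is encoded as 0x00800000 and is left
      untouched, while 0x80000001 (the smallest-magnitude negative denorm)
      becomes 0x80000000, which is negative zero.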
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (19) Should we provide a method to declare how fragment attributes are
5bd8deadSopenharmony_ci    interpolated?  It is possible to have flat-shaded attributes,
5bd8deadSopenharmony_ci    perspective-corrected attributes, and centroid-sampled attributes.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      RESOLVED:  Yes.  Fragment program attribute variable declarations may
5bd8deadSopenharmony_ci      specify the "FLAT", "NOPERSPECTIVE", and "CENTROID" modifiers.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      These modifiers are documented in detail in the NV_fragment_program4
5bd8deadSopenharmony_ci      specification.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (20) Should vertex and primitive identifiers be supported?  If so, how?
5bd8deadSopenharmony_ci
      RESOLVED:  A vertex identifier is available as "vertex.id" in a vertex
      program.  The vertex ID is equal to the value effectively passed to
      ArrayElement when the vertex is specified, and is defined only if vertex
      arrays are used with buffer objects (VBOs).
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      A primitive identifier is available as "primitive.id" in a geometry or
5bd8deadSopenharmony_ci      fragment program.  The primitive ID is equal to the number of primitives
5bd8deadSopenharmony_ci      processed since the last implicit or explicit call to glBegin().
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      See the NV_vertex_program4 spec for more information on vertex IDs, and
5bd8deadSopenharmony_ci      the NV_geometry_program4 or NV_fragment_program4 specs for more
5bd8deadSopenharmony_ci      information on primitive IDs.
5bd8deadSopenharmony_ci
    (21) For integer opcodes, should a bitwise inversion operator "~" be
    provided, analogous to the existing negation operator?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      RESOLVED:  No.  If this operator were provided, it might allow a program
5bd8deadSopenharmony_ci      to evaluate the expression "a&(~b)" using a single instruction:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        AND.U a, a, ~b;
5bd8deadSopenharmony_ci
      Instead, it is necessary to do something like:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        UINT TEMP t;
5bd8deadSopenharmony_ci        NOT.U t, b;
5bd8deadSopenharmony_ci        AND.U a, a, t;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      If necessary, this functionality could be added in a subsequent
5bd8deadSopenharmony_ci      extension.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (22) What happens if you negate or take the absolute value of the
5bd8deadSopenharmony_ci    biggest-magnitude negative integer?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      RESOLVED:  Signed integers are represented using two's complement
5bd8deadSopenharmony_ci      representation.  For 32-bit integers, the largest possible value is
5bd8deadSopenharmony_ci      2^31-1; the smallest possible value is -2^31.  There is no way to
5bd8deadSopenharmony_ci      represent 2^31, which is what these operators "should" return.  The
5bd8deadSopenharmony_ci      value returned in this case is the original value of -2^31.
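
      The wraparound can be reproduced in C by negating the raw 32-bit
      pattern with unsigned arithmetic (negating INT32_MIN directly as a
      signed value is undefined behavior in C).  The function names are
      hypothetical.

```c
#include <stdint.h>

/* Two's complement negation performed on the raw bits. */
uint32_t gp4_neg_s(uint32_t bits)
{
    return 0u - bits;
}

/* Absolute value: negate only when the sign bit is set. */
uint32_t gp4_abs_s(uint32_t bits)
{
    return (bits & 0x80000000u) ? gp4_neg_s(bits) : bits;
}
```

      Both functions map 0x80000000 (-2^31) back to 0x80000000, matching the
      result described above.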
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (23) How do condition codes work?  How are they different from those
5bd8deadSopenharmony_ci    provided in previous NVIDIA extensions?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      RESOLVED:  There are two condition codes -- CC0 and CC1 -- each of which
5bd8deadSopenharmony_ci      is a four-component vector.  The condition codes are set based on the
5bd8deadSopenharmony_ci      result of an instruction that specifies a condition code update
5bd8deadSopenharmony_ci      modifier.  Examples include:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        ADD.S.CC  R0, R1, R2;       # add signed integers R1 and R2, update
5bd8deadSopenharmony_ci                                    #   CC0 based on the result, write the
5bd8deadSopenharmony_ci                                    #   final value to R0
5bd8deadSopenharmony_ci        ADD.F.CC1 R3, R4, R5;       # add floats R4 and R5, update CC1 based
5bd8deadSopenharmony_ci                                    #   on the result, write the final value
5bd8deadSopenharmony_ci                                    #   to R3
5bd8deadSopenharmony_ci        ADD.U.CC0 R6.xy, R7, R8;    # add unsigned integers R7 and R8, update
5bd8deadSopenharmony_ci                                    #   CC0 (x and y components) based on the
5bd8deadSopenharmony_ci                                    #   result, write the final value to R6
5bd8deadSopenharmony_ci                                    #   (x and y components)
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Condition codes can be used for conditional writes, conditional
5bd8deadSopenharmony_ci      branches, or other operations.  The condition codes aren't used
5bd8deadSopenharmony_ci      directly, but are instead used with a condition code test such as "LT"
5bd8deadSopenharmony_ci      (less than) or "EQ" (equal to).  Examples include:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        MOV R0 (GT.x), R1;          # move R1 to R0 only if the x component of
5bd8deadSopenharmony_ci                                    #   CC0 indicates a result of ">0"
5bd8deadSopenharmony_ci        MOV R2 (NE1), R3;           # component-wise move of R3 to R2 if the
5bd8deadSopenharmony_ci                                    #   corresponding component of CC1
5bd8deadSopenharmony_ci                                    #   indicates a result of "!=0"
5bd8deadSopenharmony_ci        IF LE0.xyxy;                # execute the block of code if the x or
5bd8deadSopenharmony_ci          ...                       #   y components of CC0 indicate a result
5bd8deadSopenharmony_ci        ENDIF;                      #   of "<=0"
5bd8deadSopenharmony_ci        REP;
5bd8deadSopenharmony_ci          ...
5bd8deadSopenharmony_ci          BRK EQ1.xyzx;             # break out of loop if the x, y, or z
5bd8deadSopenharmony_ci        ENDREP;                     #   components of CC1 indicate a result of
5bd8deadSopenharmony_ci                                    #   "==0".
5bd8deadSopenharmony_ci
      Previous NVIDIA extensions provide eight tests, which are still
      supported here.  The tests "EQ" (equal), "GE" (greater/equal), "GT"
      (greater than), "LE" (less/equal), "LT" (less than), and "NE" (not
      equal) can be used to determine the relation of the result used to set
      the condition code with zero.  The tests "TR" (true) and "FL" (false)
      are special tests that always evaluate to true or false, respectively.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      For floating-point results, a NaN (not a number) encoding causes the
5bd8deadSopenharmony_ci      "NE" condition to evaluate to TRUE and all other conditions to evaluate
5bd8deadSopenharmony_ci      to FALSE.  IEEE encodings for "negative" and "positive" zero are both
5bd8deadSopenharmony_ci      treated as equal to zero.
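
      These rules happen to coincide with the behavior of C's comparison
      operators on IEEE floats, so they can be sketched directly.  The
      function names are hypothetical, and the sketch covers only
      floating-point results.

```c
#include <math.h>       /* NAN, for callers that want to try the NaN case */
#include <stdbool.h>

/* Each test compares the result that set the condition code with zero. */
bool gp4_cc_eq(float r) { return r == 0.0f; }
bool gp4_cc_ne(float r) { return r != 0.0f; }   /* the only test NaN passes */
bool gp4_cc_lt(float r) { return r <  0.0f; }
bool gp4_cc_le(float r) { return r <= 0.0f; }
bool gp4_cc_gt(float r) { return r >  0.0f; }
bool gp4_cc_ge(float r) { return r >= 0.0f; }
```

      A NaN result passes gp4_cc_ne and fails the other five, and a result of
      negative zero passes gp4_cc_eq.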
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Condition codes are implemented as a set of flags, which are set
5bd8deadSopenharmony_ci      depending on the type of operation, as described in the spec.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      For instructions that return floating-point or signed integer values,
5bd8deadSopenharmony_ci      the normal condition code tests reliably indicate the relationship of
5bd8deadSopenharmony_ci      the result to zero.  For instructions that return unsigned values, the
5bd8deadSopenharmony_ci      condition codes are a bit more complicated.  For example, the sign flag
5bd8deadSopenharmony_ci      is set if the most significant bit of the result written is set.  As a
5bd8deadSopenharmony_ci      result, very large unsigned integer values (e.g., 0x80000000 -
5bd8deadSopenharmony_ci      0xFFFFFFFF) are effectively treated as negative values.  Condition code
5bd8deadSopenharmony_ci      tests should be used with care with unsigned results -- to test if an
5bd8deadSopenharmony_ci      unsigned integer is ">0", use a sequence like:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        MOV.U.CC R0, R1;            # move R1 to R0, set condition code
5bd8deadSopenharmony_ci        IF NE;                      # test if the result is "!=0", a very
5bd8deadSopenharmony_ci          ...                       #   large value might fail "GT"!
5bd8deadSopenharmony_ci        ENDIF;
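
      The pitfall can be sketched with a small Python model.  It assumes a
      simplified flag model in which only the sign and zero flags matter
      (as after a MOV, with no overflow); the function names are
      hypothetical and do not come from the spec.

```python
def flags_for_result(bits):
    """Return (sign_flag, zero_flag) for a 32-bit result.

    The sign flag mirrors the most significant bit, whether or not the
    value was meant to be unsigned.
    """
    bits &= 0xFFFFFFFF
    return bool(bits & 0x80000000), bits == 0

def cc_gt(bits):
    # Simplified "GT": neither negative (sign flag) nor zero.
    sign, zero = flags_for_result(bits)
    return not sign and not zero

def cc_ne(bits):
    # "NE": the result is anything other than zero.
    _, zero = flags_for_result(bits)
    return not zero
```

      With this model, cc_gt(0x80000000) is False even though the unsigned
      value is greater than zero, while cc_ne(0x80000000) is True.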
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      This extension provides a number of additional condition code tests
5bd8deadSopenharmony_ci      useful for different floating-point or integer operations:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        * NAN (not a number) is true if a floating-point result is a NaN.  LEG
5bd8deadSopenharmony_ci          (less, equal to, or greater) is the opposite of NAN.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        * CF (carry flag) is true if an unsigned add overflows, or if an
5bd8deadSopenharmony_ci          unsigned subtract produces a non-negative value.  NCF (no carry
5bd8deadSopenharmony_ci          flag) is the opposite of CF.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        * OF (overflow flag) is true if a signed add or subtract overflows.
5bd8deadSopenharmony_ci          NOF (no overflow flag) is the opposite of OF.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        * SF (sign flag) is true if the sign flag is set.  NSF (no sign flag)
5bd8deadSopenharmony_ci          is the opposite of SF.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        * AB (above) is true if an unsigned subtract produces a positive
5bd8deadSopenharmony_ci          result.  BLE (below or equal) is the opposite of AB, and is true if
5bd8deadSopenharmony_ci          an unsigned subtract produces a negative result or zero.  Note that
5bd8deadSopenharmony_ci          CF can be used to test if the result is greater than or equal to
5bd8deadSopenharmony_ci          zero, and NCF can be used to test if the result is less than zero.
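
      The carry, overflow, and above tests listed above can be sketched in
      Python for 32-bit operands.  This is a model of the wording in this
      issue only, not of any particular hardware; add_flags and sub_flags
      are hypothetical helper names.

```python
MASK = 0xFFFFFFFF

def add_flags(a, b):
    """32-bit add: return (result, carry, overflow)."""
    a &= MASK
    b &= MASK
    full = a + b
    result = full & MASK
    carry = full > MASK  # CF: the unsigned add overflowed
    # OF: operands share a sign bit and the result's sign bit differs.
    overflow = bool(~(a ^ b) & (a ^ result) & 0x80000000)
    return result, carry, overflow

def sub_flags(a, b):
    """32-bit unsigned subtract a - b: return (result, carry, above)."""
    a &= MASK
    b &= MASK
    result = (a - b) & MASK
    carry = a >= b  # CF: the true difference is non-negative
    above = a > b   # AB: the true difference is positive; BLE is "not above"
    return result, carry, above
```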
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (24) How do the "set on" instructions (SEQ, SGE, SGT, SLE, SLT, SNE) work
5bd8deadSopenharmony_ci    with integer values and/or condition codes?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      RESOLVED:  "Set on" instructions comparing signed or unsigned integer
5bd8deadSopenharmony_ci      values return zero if the condition is false, and an integer with all
5bd8deadSopenharmony_ci      bits set if the condition is true.  If the result is signed, it is
5bd8deadSopenharmony_ci      interpreted as -1.  If the result is unsigned, it is interpreted as the
5bd8deadSopenharmony_ci      largest unsigned value (0xFFFFFFFF for 32-bit integers).  This is
5bd8deadSopenharmony_ci      different from the floating-point "set on", which is defined to return
5bd8deadSopenharmony_ci      1.0.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      This specific result encoding was chosen so that bitwise operators (NOT,
5bd8deadSopenharmony_ci      AND, OR, XOR) can be used to evaluate boolean expressions.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      When performing condition code tests on the results of an integer "set
5bd8deadSopenharmony_ci      on" instruction, keep in mind that a TRUE result has the most
5bd8deadSopenharmony_ci      significant bit set and will be interpreted as a negative value.  To
5bd8deadSopenharmony_ci      test if a condition is true, use "NE" (!=0).  A condition code test of
5bd8deadSopenharmony_ci      "GT" will always fail if the condition code was written by an integer
5bd8deadSopenharmony_ci      "set on" instruction.
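
      A short Python sketch of why the all-bits-set encoding composes with
      bitwise operators.  The helper names are hypothetical, and 32-bit
      integers are assumed.

```python
ALL_ONES = 0xFFFFFFFF

def set_on(condition):
    """Integer "set on" result: all bits set if true, zero if false."""
    return ALL_ONES if condition else 0

def as_signed(bits):
    """Reinterpret a 32-bit pattern as a two's complement integer."""
    return bits - (1 << 32) if bits & 0x80000000 else bits

def bit_not(bits):
    """32-bit bitwise NOT."""
    return ~bits & ALL_ONES

a, b, c, d = 7, 3, 2, 9
# (a > b) AND NOT (c > d), evaluated with bitwise operators only.
combined = set_on(a > b) & bit_not(set_on(c > d))
```

      Here combined is 0xFFFFFFFF (TRUE).  Read as a signed value it is -1,
      so a "GT" test would fail while "NE" succeeds.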
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (25) What new texture functionality is provided?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      RESOLVED:  Several new features are provided.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      First, the TXF (texel fetch) instruction allows programs to access a
5bd8deadSopenharmony_ci      texture map like a normal array.  Integer coordinates identifying an
5bd8deadSopenharmony_ci      individual texel and LOD are provided, and the corresponding texture
5bd8deadSopenharmony_ci      data is returned without filtering of any type.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Second, the TXQ (texture size query) instruction allows programs to
5bd8deadSopenharmony_ci      query the size of a specified level of detail of a texture.  This
5bd8deadSopenharmony_ci      feature allows programs to perform computations dependent on the size of
5bd8deadSopenharmony_ci      the texture without having to pass the size as a program parameter or
5bd8deadSopenharmony_ci      via some other mechanism.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Third, applications may specify a constant texel offset in a texture
5bd8deadSopenharmony_ci      instruction that moves the texture sample point by the specified number
5bd8deadSopenharmony_ci      of texels.  This offset can be used to perform custom texture filtering,
5bd8deadSopenharmony_ci      and is also independent of the size of the texture LOD -- the same
5bd8deadSopenharmony_ci      offsets are applied, regardless of the mipmap level.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Fourth, shadow mapping is supported for cube map textures.  The first
5bd8deadSopenharmony_ci      three coordinates are the normal (s,t,r) coordinates for a cube map
5bd8deadSopenharmony_ci      texture lookup, and the fourth component is a depth reference value that
5bd8deadSopenharmony_ci      can be compared to the depth value stored in the texture.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (26) What "consistency" requirements are in effect for textures accessed
5bd8deadSopenharmony_ci    via the TXF (texel fetch) instruction?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      UNRESOLVED:  The texture must be usable for regular texture mapping
5bd8deadSopenharmony_ci      operations -- if texture sizes or formats are inconsistent and a
5bd8deadSopenharmony_ci      mipmapped min filter is used, the results are undefined.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (27) How does the TXF instruction work with bordered textures?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      RESOLVED:  The entire image can be accessed, including the border
5bd8deadSopenharmony_ci      texels.  For a 64x64 2D texture plus border (66x66 overall), the lower
5bd8deadSopenharmony_ci      left border texel is accessed using the coordinates (-1,-1); the upper
5bd8deadSopenharmony_ci      right border texel is accessed using the coordinates (64,64).
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (28) What should TXQ (texture size query) return for "irrelevant" texture
5bd8deadSopenharmony_ci    sizes (e.g., height of a 1D texture)?  Should it return any other
5bd8deadSopenharmony_ci    information at the same time?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      RESOLVED:  This specification leaves all "extra" components undefined.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (29) How do texture offsets interact with cubemap textures?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      RESOLVED:  They are not supported in this extension.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (30) How do texture offsets interact with mipmapped textures?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      RESOLVED:  The texture offsets are added after the (s,t,r) coordinates
5bd8deadSopenharmony_ci      have been divided by q (if applicable) and converted to (u,v,w)
5bd8deadSopenharmony_ci      coordinates by multiplying by the size of the selected texture level.
5bd8deadSopenharmony_ci      The offsets are added to the (u,v,w) coordinates, and always move the
5bd8deadSopenharmony_ci      sample point by an integral number of texel coordinates.  If multiple
5bd8deadSopenharmony_ci      mipmaps are accessed, the sample point in each mipmap level is moved by
5bd8deadSopenharmony_ci      an identical offset.  The applied offsets are independent of the
5bd8deadSopenharmony_ci      selected mipmap level.
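
      The order of operations can be sketched in Python for a 2D texture.
      This is a simplified model (no wrap modes or filtering), and the
      function and parameter names are hypothetical.

```python
def offset_sample_point(s, t, q, level_width, level_height, offset):
    """Return the (u, v) sample point for one mipmap level.

    The projective divide and the scale by the level size happen first;
    the integer texel offset is added last, so it is the same number of
    texels at every level.
    """
    u = (s / q) * level_width + offset[0]
    v = (t / q) * level_height + offset[1]
    return u, v

# Same coordinates and offset applied to a 64x64 level and a 32x32 level.
level0 = offset_sample_point(0.5, 0.5, 1.0, 64, 64, (1, -1))
level1 = offset_sample_point(0.5, 0.5, 1.0, 32, 32, (1, -1))
```

      In both levels the sample point moves by exactly (1,-1) texels
      relative to the unoffset point: (32,32) becomes (33,31) in the 64x64
      level and (16,16) becomes (17,15) in the 32x32 level.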
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (31) How do shadow cube maps work?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      UNRESOLVED:  An application can define a cube map texture with a
5bd8deadSopenharmony_ci      DEPTH_COMPONENT internal format, and then render a scene using the cube
5bd8deadSopenharmony_ci      map faces as the depth buffer(s).  When rendering, the projection
5bd8deadSopenharmony_ci      should be set up using the "center" of the cubemap as the eye, and
5bd8deadSopenharmony_ci      using a normal projection matrix.  When applying the shadow map, the
5bd8deadSopenharmony_ci      fragment program should read the (x,y,z) eye coordinates, compute the
5bd8deadSopenharmony_ci      length of the major axis (MAX(|x|,|y|,|z|)), and then transform this
5bd8deadSopenharmony_ci      coordinate to [0,1] space using the same parameters used to derive Z in
5bd8deadSopenharmony_ci      the projection matrix.  A 4-component vector consisting of x, y, z, and
5bd8deadSopenharmony_ci      this computed depth value should be passed to the texture lookup, and
5bd8deadSopenharmony_ci      normal shadow mapping operations will be performed.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      This issue should include the math needed to do this computation and
5bd8deadSopenharmony_ci      sample code.
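
      One possible form of that computation is sketched below in Python.
      It is not taken from the spec and rests on stated assumptions: each
      cube face was rendered with a standard OpenGL perspective projection
      using near and far plane distances "near" and "far", the default
      depth range of [0,1] is in effect, and (x,y,z) is the position
      relative to the cube map center.  The function name is hypothetical.

```python
def cube_shadow_coord(x, y, z, near, far):
    """Build the 4-component coordinate for a shadow cube map lookup."""
    # Distance along the major axis; this is the eye-space depth as seen
    # by whichever cube face the direction (x, y, z) selects.
    major = max(abs(x), abs(y), abs(z))
    # Normalized device depth in [-1, 1] for a standard perspective matrix.
    z_ndc = (far + near) / (far - near) \
        - (2.0 * far * near) / ((far - near) * major)
    # Map to window depth in [0, 1] (assumes the default depth range).
    depth = 0.5 * z_ndc + 0.5
    return (x, y, z, depth)
```

      Under these assumptions a point on the near plane (major axis length
      equal to near) yields a depth of 0, and a point on the far plane
      yields a depth of 1.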
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (32) Integer multiplies can overflow by a lot.  Should there be some way
5bd8deadSopenharmony_ci    to return the high part of both unsigned and signed integer multiplies?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      RESOLVED:  Yes.  The ".HI" modifier is provided to return the 32 MSBs
5bd8deadSopenharmony_ci      of the 64-bit result of a 32x32 integer multiply.  The instruction
5bd8deadSopenharmony_ci      sequence:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        INT TEMP R0, R1, R2, R3;
5bd8deadSopenharmony_ci        MUL.S    R0, R2, R3;
5bd8deadSopenharmony_ci        MUL.S.HI R1, R2, R3;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      will do a 32x32 signed integer multiply of R2 and R3, with the 32 LSBs
5bd8deadSopenharmony_ci      of the 64-bit result in R0 and the 32 MSBs in R1.
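
      The split into low and high words can be checked with a short Python
      model.  Python integers are arbitrary precision, so the full product
      is exact; the function name is hypothetical.

```python
def mul_signed_lo_hi(a, b):
    """Model MUL.S and MUL.S.HI for signed 32-bit operands a and b.

    Returns (lo, hi) as unsigned 32-bit bit patterns: lo holds the 32
    LSBs and hi the 32 MSBs of the 64-bit two's complement product.
    """
    product = a * b
    lo = product & 0xFFFFFFFF
    hi = (product >> 32) & 0xFFFFFFFF  # >> is arithmetic for negative ints
    return lo, hi

def join_signed64(lo, hi):
    """Reassemble (lo, hi) into a signed 64-bit integer."""
    bits = (hi << 32) | lo
    return bits - (1 << 64) if bits & (1 << 63) else bits
```

      For example, multiplying 0x40000000 by 8 gives lo = 0 and hi = 2,
      since the product 2^33 does not fit in 32 bits.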
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (33) Should there be any other special multiplication modifiers?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      RESOLVED:  Yes.  The ".S24" and ".U24" modifiers allow for signed and
5bd8deadSopenharmony_ci      unsigned integer multiplies where both operands are guaranteed to fit in
5bd8deadSopenharmony_ci      the least significant 24 bits.  On some architectures supporting this
5bd8deadSopenharmony_ci      extension, ".S24" and ".U24" integer multiplies may be faster than
5bd8deadSopenharmony_ci      general-purpose ".S" and ".U" multiplies.  If either value doesn't fit
5bd8deadSopenharmony_ci      in 24 bits, the results of the operation are undefined --
5bd8deadSopenharmony_ci      implementations may, but are not required to, ignore the MSBs of the
5bd8deadSopenharmony_ci      operands if ".S24" or ".U24" is specified.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (34) This extension provides subroutines, but doesn't provide a stack to
5bd8deadSopenharmony_ci    push and pop parameters.  How do we deal with this?  NV_vertex_program3
5bd8deadSopenharmony_ci    supported PUSHA/POPA instructions to push and pop address registers.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      RESOLVED:  No explicit stack is required.  A program can implement a
5bd8deadSopenharmony_ci      stack by allocating a temporary array plus a single integer temporary to
5bd8deadSopenharmony_ci      use as the stack "pointer".  For example:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        TEMP stack[256];                # 256 4-component vectors
5bd8deadSopenharmony_ci        INT TEMP sp;                    # sp.x == stack pointer
5bd8deadSopenharmony_ci        INT TEMP cc;                    # condition code results
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        function:
5bd8deadSopenharmony_ci          SGE.S.CC cc.x, sp.x, 256;     # compute stackPointer >= 256
5bd8deadSopenharmony_ci          RET NE.x;                     # return if TRUE
5bd8deadSopenharmony_ci          MOV stack[sp.x], R0;          # push R0 onto the stack
5bd8deadSopenharmony_ci          ADD.S sp.x, sp.x, 1;
5bd8deadSopenharmony_ci          ...
5bd8deadSopenharmony_ci          SUB.S sp.x, sp.x, 1;          # pop R0 off the stack
5bd8deadSopenharmony_ci          MOV R0, stack[sp.x];
5bd8deadSopenharmony_ci          RET;
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (35) Should we provide new vector semantics for previously-defined opcodes
5bd8deadSopenharmony_ci    (e.g., LG2 computes a component-wise logarithm)?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      RESOLVED:  Not in this extension.  The instructions we define here are
5bd8deadSopenharmony_ci      compatible with the vector or scalar nature of previously defined
5bd8deadSopenharmony_ci      opcodes.  This simplifies the implementation of an assembler that needs
5bd8deadSopenharmony_ci      to support both old and new instruction sets.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (36) Should it really be undefined to read from a register storing data of
5bd8deadSopenharmony_ci    one type with an instruction of the other type (e.g., to read the bits of
5bd8deadSopenharmony_ci    a floating-point number as an unsigned integer)?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      RESOLVED:  The spec describes undefined results for simplicity.  In
5bd8deadSopenharmony_ci      practice, mixing data types can be done, where signed integers are
5bd8deadSopenharmony_ci      represented as two's complement integers and floating-point numbers are
5bd8deadSopenharmony_ci      represented using IEEE single-precision representation.  For example:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        TEMP R0, R1;                    # typeless
5bd8deadSopenharmony_ci        MOV.U R0, 0x3F800000;           # R0 = 1.0
5bd8deadSopenharmony_ci        MOV.U R1, 0xBF800000;           # R1 = -1.0
5bd8deadSopenharmony_ci        MUL.F R0, R0, R1;               # R0 = -1 * 1 = -1 (0xBF800000)
5bd8deadSopenharmony_ci        XOR.U R0, R0, R1;               # R0 = 0xBF800000 ^ 0xBF800000 = 0
5bd8deadSopenharmony_ci        NOT.U R0, R0;                   # R0 = 0xFFFFFFFF
5bd8deadSopenharmony_ci        I2F.S R0, R0;                   # R0 = -1.0 (0xFFFFFFFF = -1 signed)
5bd8deadSopenharmony_ci        SEQ.F R0, R0, R1;               # R0 = 1.0 (-1.0 == -1.0)
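
      The same sequence can be traced in Python, using the struct module to
      reinterpret bits between IEEE single-precision floats and 32-bit
      integers.  This is only a model of the "in practice" behavior
      described above; the helper names are hypothetical.

```python
import struct

def f2bits(value):
    """Bit pattern of a float stored as IEEE single precision."""
    return struct.unpack("<I", struct.pack("<f", value))[0]

def bits2f(bits):
    """Float value of a 32-bit IEEE single-precision bit pattern."""
    return struct.unpack("<f", struct.pack("<I", bits & 0xFFFFFFFF))[0]

def run_sequence():
    """Return the bits of R0 after each instruction from MUL.F onward."""
    trace = []
    r0 = 0x3F800000                                # MOV.U: R0 = 1.0
    r1 = 0xBF800000                                # MOV.U: R1 = -1.0
    r0 = f2bits(bits2f(r0) * bits2f(r1))           # MUL.F
    trace.append(r0)
    r0 = r0 ^ r1                                   # XOR.U
    trace.append(r0)
    r0 = ~r0 & 0xFFFFFFFF                          # NOT.U
    trace.append(r0)
    signed = r0 - (1 << 32) if r0 & 0x80000000 else r0
    r0 = f2bits(float(signed))                     # I2F.S
    trace.append(r0)
    r0 = f2bits(1.0 if bits2f(r0) == bits2f(r1) else 0.0)  # SEQ.F
    trace.append(r0)
    return trace
```

      The final entry of the trace is 0x3F800000, the encoding of 1.0.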
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (37) Buffer objects can be sourced as program parameters using the
5bd8deadSopenharmony_ci    NV_parameter_buffer_object extension.  How are they accessed in a program?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      RESOLVED:  The instruction set and existing program environment and
5bd8deadSopenharmony_ci      local parameter bindings operate largely on four-component vectors.
5bd8deadSopenharmony_ci      However, NV_parameter_buffer_object exposes the ability to reach into
5bd8deadSopenharmony_ci      buffers consisting of user-generated data or data written to the buffer
5bd8deadSopenharmony_ci      object by the GPU.  Such data sets may not consist entirely of
5bd8deadSopenharmony_ci      four-component floating-point vectors, so a four-component vector API
5bd8deadSopenharmony_ci      may be unnatural.  An application might need to reformat its data set to
5bd8deadSopenharmony_ci      deal with this issue.  Or it might generate odd code to compensate for
5bd8deadSopenharmony_ci      mis-alignment -- for example, reading an array of 3-component vectors by
5bd8deadSopenharmony_ci      doing two four-component vector accesses and then rotating based on
5bd8deadSopenharmony_ci      alignment.  Neither approach is particularly satisfying.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Instead, this extension takes the approach of treating parameter buffers
5bd8deadSopenharmony_ci      as arrays of scalar words.  When an individual buffer element is read,
5bd8deadSopenharmony_ci      the single word is replicated to produce a four-component vector.  To
5bd8deadSopenharmony_ci      access an array of 3-component vectors, code like the following can be
5bd8deadSopenharmony_ci      used:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        PARAM buffer[] = { program.buffer[0] };
5bd8deadSopenharmony_ci        INT TEMP index;
5bd8deadSopenharmony_ci        TEMP R0;
5bd8deadSopenharmony_ci        ...
5bd8deadSopenharmony_ci        MUL.S index, index, 3;          # to read "vec3" #X, compute 3*X
5bd8deadSopenharmony_ci        MOV R0.x, buffer[index.x+0];
5bd8deadSopenharmony_ci        MOV R0.y, buffer[index.x+1];
5bd8deadSopenharmony_ci        MOV R0.z, buffer[index.x+2];
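
      The scalar-word model can be illustrated in Python, treating the
      parameter buffer as a flat list of words.  The function names are
      hypothetical; only the replication rule comes from the text above.

```python
def read_buffer_word(buffer, i):
    """Read one scalar word, replicated to a four-component vector."""
    word = buffer[i]
    return (word, word, word, word)

def read_vec3(buffer, n):
    """Read "vec3" number n from tightly packed 3-component data."""
    base = 3 * n  # same index computation as the MUL.S above
    # Each MOV with a write mask keeps one component of a replicated read.
    x = read_buffer_word(buffer, base + 0)[0]
    y = read_buffer_word(buffer, base + 1)[1]
    z = read_buffer_word(buffer, base + 2)[2]
    return (x, y, z)

packed = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]  # two vec3 values, no padding
```

      With this data, read_vec3(packed, 1) returns (4.0, 5.0, 6.0) without
      any reformatting or realignment of the buffer contents.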
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (38) Should recursion be allowed?  If so, how is the total amount of
5bd8deadSopenharmony_ci    recursion limited?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      RESOLVED:  Recursion is allowed, and a call stack is provided by the
5bd8deadSopenharmony_ci      implementation.  The size of the call stack is limited to the
5bd8deadSopenharmony_ci      implementation-dependent constant MAX_PROGRAM_CALL_DEPTH, and when the
5bd8deadSopenharmony_ci      call stack is full, the results of further CAL instructions are
5bd8deadSopenharmony_ci      undefined.  In the initial implementation of this extension, such
5bd8deadSopenharmony_ci      instructions will have no effect.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Note that no stack is provided to hold local registers; a program may
5bd8deadSopenharmony_ci      implement its own via a temporary array and integer stack "pointer".
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (39) Variables are all four-component vectors in previous extensions.
5bd8deadSopenharmony_ci    Should scalar or small-vector variables be provided?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      RESOLVED:  It would be a useful feature, but it was left out for
5bd8deadSopenharmony_ci      simplicity.  In practice, a variable where only the X component is used
5bd8deadSopenharmony_ci      will be equivalent to a scalar.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (40) The PK* (pack) and UP* (unpack) instructions allow packing multiple
5bd8deadSopenharmony_ci    components of data into a single component.  The bit packing is
5bd8deadSopenharmony_ci    well-defined.  Should we require specific data types (e.g., unsigned
5bd8deadSopenharmony_ci    integer) to hold packed values?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      RESOLVED:  No.  Previous instruction sets only allowed programs to write
5bd8deadSopenharmony_ci      packed values to a floating-point variable (the only data type
5bd8deadSopenharmony_ci      provided).  We will allow packed results to be written to a variable of
5bd8deadSopenharmony_ci      any data type.  Integer instructions can be used to manipulate bits of
5bd8deadSopenharmony_ci      packed data in place.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (41) What happens when converting integers to floats or vice versa if
5bd8deadSopenharmony_ci    there is insufficient precision or range to represent the result?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      RESOLVED:  For integer-to-float conversions, the nearest representable
5bd8deadSopenharmony_ci      floating-point value is used, and the least significant bits of the
5bd8deadSopenharmony_ci      original integer value are lost.  For float-to-integer conversions,
5bd8deadSopenharmony_ci      out-of-range values are clamped to the nearest representable integer.
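
      Both directions can be sketched in Python for 32-bit signed integers
      and single-precision floats.  The sketch assumes truncation toward
      zero for in-range float-to-integer conversions (the actual rounding
      depends on the instruction used) and does not model NaN inputs; the
      function names are hypothetical.

```python
import struct

INT32_MIN = -(1 << 31)
INT32_MAX = (1 << 31) - 1

def int_to_float32(i):
    """Nearest single-precision value to a 32-bit integer."""
    # Packing as "f" rounds the value to single precision.
    return struct.unpack("<f", struct.pack("<f", float(i)))[0]

def float_to_int32(f):
    """Convert to a signed 32-bit integer, clamping out-of-range values."""
    if f >= INT32_MAX:
        return INT32_MAX
    if f <= INT32_MIN:
        return INT32_MIN
    return int(f)  # assumption: truncate toward zero
```

      For example, 16777217 (2^24 + 1) has no exact single-precision
      representation and converts to 16777216.0, losing its least
      significant bit.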
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (42) Why are some of the grammar rules so bizarre (e.g., attribUseD,
5bd8deadSopenharmony_ci    attribUseV, attribUseS, attribUseVNS)?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      RESOLVED:  This grammar is based upon the original ARB_vertex_program
5bd8deadSopenharmony_ci      grammar, which has a number of "interesting" characteristics.  For
5bd8deadSopenharmony_ci      example, some of the bindings provided by ARB_vertex_program naturally
5bd8deadSopenharmony_ci      require some amount of lookahead.  A vertex program can write an output
5bd8deadSopenharmony_ci      color using any of the following:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        MOV result.color, 0;            # primary color
5bd8deadSopenharmony_ci        MOV result.color.primary, 0;    # primary color again
5bd8deadSopenharmony_ci        MOV result.color.secondary, 0;  # secondary color this time
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      The pieces of the color binding are separated by "." tokens.  However,
5bd8deadSopenharmony_ci      writemasks are also supported, which also use "." before the write
5bd8deadSopenharmony_ci      mask.  So, we could also have something like:
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci        MOV result.color.xyz, 0;        # primary color with W masked off
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      In this form, a parser needs to look at both the "." and the "xyz" to
5bd8deadSopenharmony_ci      determine that the binding being used is "result.color" (and not
5bd8deadSopenharmony_ci      "result.color.secondary").
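
      The lookahead can be sketched with a toy Python parser for the color
      binding.  It is deliberately simplified (only the "primary" and
      "secondary" selectors, no face selectors, no swizzles) and is not the
      grammar from this specification; all names are hypothetical.

```python
COLOR_SELECTORS = {"primary", "secondary"}

def split_color_binding(operand):
    """Split an operand such as "result.color.xyz" into (binding, mask)."""
    parts = operand.split(".")
    if parts[:2] != ["result", "color"]:
        raise ValueError("not a color binding: " + operand)
    rest = parts[2:]
    binding = "result.color"
    # The token after the "." decides whether it extends the binding name
    # or starts a write mask; the "." alone is not enough to tell.
    if rest and rest[0] in COLOR_SELECTORS:
        binding += "." + rest.pop(0)
    mask = rest.pop(0) if rest else None
    if rest or (mask is not None and not set(mask) <= set("xyzw")):
        raise ValueError("malformed operand: " + operand)
    return binding, mask
```

      For "result.color.xyz" this returns ("result.color", "xyz"), while
      "result.color.secondary" returns ("result.color.secondary", None).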
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      Additionally, some checks that should probably be semantic errors (e.g.,
5bd8deadSopenharmony_ci      allowing different swizzle or scalar operand selectors per instruction,
5bd8deadSopenharmony_ci      or disallowing both in the case of SWZ) were specified in the original
5bd8deadSopenharmony_ci      grammar.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      ARB_fragment_program and subsequent NVIDIA extensions built upon this,
5bd8deadSopenharmony_ci      and the grammar for this extension was rewritten in the current form so
5bd8deadSopenharmony_ci      it could be validated more easily.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (43) This is an NV extension (NV_gpu_program4).  Why does the
5bd8deadSopenharmony_ci    MAX_PROGRAM_TEXEL_OFFSET_EXT token have an "EXT" suffix?
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      RESOLVED:  This token is shared between this extension and the
5bd8deadSopenharmony_ci      comparable high-level GLSL programmability extension (EXT_gpu_shader4).
5bd8deadSopenharmony_ci      Rather than provide a duplicate set of token names, we simply use the
5bd8deadSopenharmony_ci      EXT version here.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    (44) For the purposes of determining the number of attribute and result
5bd8deadSopenharmony_ci    components, how are "scalar" attributes counted?  For example, only the x
5bd8deadSopenharmony_ci    component of the "pointsize" per-vertex output is actually relevant.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      RESOLVED:  Implementations are allowed to count all inputs and outputs
5bd8deadSopenharmony_ci      as full four-component vectors.  To avoid this, apply appropriate write
5bd8deadSopenharmony_ci      masks or swizzles.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci      For example, writing to "result.pointsize" may count as four components.
5bd8deadSopenharmony_ci      Consistently writing to "result.pointsize.x" may only count as one.
5bd8deadSopenharmony_ci      Similarly, reading a fragment's fog coordinate as "fragment.fogcoord"
5bd8deadSopenharmony_ci      may count as four components; "fragment.fogcoord.x" will only count as
5bd8deadSopenharmony_ci      one.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ciRevision History
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    Rev.    Date    Author    Changes
5bd8deadSopenharmony_ci    ----  --------  --------  --------------------------------------------
5bd8deadSopenharmony_ci    11    09/11/14  pbrown    Fix cut-and-paste error in PK2US section.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci    10    12/14/09  mgodse    Added GLX protocol.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     9    10/29/09  pbrown    Add language for previously undocumented errors
5bd8deadSopenharmony_ci                              when using "SHORT" and "LONG" modifiers on
5bd8deadSopenharmony_ci                              variable declarations.  They're allowed only on
5bd8deadSopenharmony_ci                              "TEMP" statements, except that "SHORT" is
5bd8deadSopenharmony_ci                              allowed for "OUTPUT" as well.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     8    08/11/08  jbreton   Clarified that when a MOD instruction is
5bd8deadSopenharmony_ci                              performed on negative operands the result is
5bd8deadSopenharmony_ci                              undefined.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     7    07/29/08  pbrown    Discovered additional issues with texture wrap
5bd8deadSopenharmony_ci                              handling, replaced with logic that applies wrap
5bd8deadSopenharmony_ci                              modes per sample.  Add a few instruction
5bd8deadSopenharmony_ci                              pseudo-code lines explicitly identifying
5bd8deadSopenharmony_ci                              undefined components.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     6    05/02/08  pbrown    Fix the prototype for the internal TexelFetch()
5bd8deadSopenharmony_ci                              function used in the spec language; texel
5bd8deadSopenharmony_ci                              coordinates are signed integers.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     5    02/22/08  pbrown    Clarified that when counting attribute/result
5bd8deadSopenharmony_ci                              components, irrelevant/undefined components
5bd8deadSopenharmony_ci                              can still count against the limits.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     4    02/04/08  pbrown    Fix errors in texture wrap mode handling.
5bd8deadSopenharmony_ci                              Added a missing clamp to avoid sampling border
5bd8deadSopenharmony_ci                              in REPEAT mode.  Fixed incorrectly specified
5bd8deadSopenharmony_ci                              weights for LINEAR filtering.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     3    02/09/07  pbrown    Updated status section (now released).
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     2    10/19/06  pbrown    Change the token suffix for maximum texel offset
5bd8deadSopenharmony_ci                              values from NV to EXT, since it is shared with
5bd8deadSopenharmony_ci                              EXT_gpu_shader4.  Clarify what happens on a
5bd8deadSopenharmony_ci                              negate of an unsigned value.  Fix typo in data
5bd8deadSopenharmony_ci                              type modifier description.  Add missing
5bd8deadSopenharmony_ci                              description of the "BUFFER4" declaration
5bd8deadSopenharmony_ci                              keyword.
5bd8deadSopenharmony_ci
5bd8deadSopenharmony_ci     1              pbrown    Internal spec development.