# Connecting NNRt to an AI Inference Framework

## When to Use

As a bridge between the AI inference engine and the acceleration chip, Neural Network Runtime (NNRt) provides simplified native APIs that allow the AI inference engine to perform end-to-end inference through the acceleration chip.

This topic uses the `Add` single-operator model shown in Figure 1 as an example to describe the NNRt development process. The `Add` operator has two inputs, one parameter, and one output, where the `activation` parameter specifies the type of the activation function applied in the `Add` operator.

**Figure 1** Add single-operator model<br>


## Preparing the Environment

### Environment Requirements

The environment requirements for NNRt are as follows:

- Development environment: Ubuntu 18.04 or later.
- Access device: a standard device whose built-in hardware accelerator driver has been connected to NNRt.

NNRt opens its capabilities to external systems through native APIs, so you need the native development suite to build NNRt applications. Download the ohos-sdk package of the required version from the daily build in the OpenHarmony community, and decompress it to obtain the native development suite for your platform. On Linux, for example, the native development suite package is named `native-linux-{version number}.zip`.
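The suite provides the cross-compiler, the sysroot, and the NNRt headers and libraries used throughout this topic. As an illustration only, after the suite is decompressed (see the setup steps below), building an NNRt sample for an aarch64 device might look like the following sketch. The target triple, library name, and paths here are assumptions based on the directory layout shown later; adjust them for your SDK version and platform.

```shell
# Hypothetical build invocation; paths and target triple depend on your SDK version.
./native/llvm/bin/clang++ \
    --target=aarch64-linux-ohos \
    --sysroot=./native/sysroot \
    nnrt_example.cpp \
    -lneural_network_runtime \
    -o nnrt_example
```

The resulting binary can then be pushed to the device (for example, with `hdc file send`) and executed there.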

### Environment Setup

1. Start the Ubuntu server.
2. Copy the downloaded package of the native development suite to the root directory of the current user.
3. Decompress the package of the native development suite.
    ```shell
    unzip native-linux-{version number}.zip
    ```

    The directory structure after decompression is as follows. The content in the directory may vary depending on the version. Use the native APIs of the latest version.
    ```text
    native/
    ├── build // Cross-compilation toolchain
    ├── build-tools // Compilation and build tools
    ├── docs
    ├── llvm
    ├── nativeapi_syscap_config.json
    ├── ndk_system_capability.json
    ├── NOTICE.txt
    ├── oh-uni-package.json
    └── sysroot // Native API header files and libraries
    ```

## Available APIs

This section describes the common APIs used in the NNRt development process.

### Structs

| Name| Description|
| --------- | ---- |
| typedef struct OH_NNModel OH_NNModel | Model handle of NNRt. It is used to construct a model.|
| typedef struct OH_NNCompilation OH_NNCompilation | Compiler handle of NNRt. It is used to compile an AI model.|
| typedef struct OH_NNExecutor OH_NNExecutor | Executor handle of NNRt. It is used to perform inference computing on a specified device.|
| typedef struct NN_QuantParam NN_QuantParam | Quantization parameter handle, which is used to specify the quantization parameters of a tensor during model construction.|
| typedef struct NN_TensorDesc NN_TensorDesc | Tensor description handle, which is used to describe tensor attributes, such as the data format, data type, and shape.|
| typedef struct NN_Tensor NN_Tensor | Tensor handle, which is used to set the inference input and output tensors of the executor.|

### Model Construction APIs

| Name| Description|
| ------- | --- |
| OH_NNModel_Construct() | Creates a model instance of the OH_NNModel type.|
| OH_NN_ReturnCode OH_NNModel_AddTensorToModel(OH_NNModel *model, const NN_TensorDesc *tensorDesc) | Adds a tensor to a model instance.|
| OH_NN_ReturnCode OH_NNModel_SetTensorData(OH_NNModel *model, uint32_t index, const void *dataBuffer, size_t length) | Sets the tensor value.|
| OH_NN_ReturnCode OH_NNModel_AddOperation(OH_NNModel *model, OH_NN_OperationType op, const OH_NN_UInt32Array *paramIndices, const OH_NN_UInt32Array *inputIndices, const OH_NN_UInt32Array *outputIndices) | Adds an operator to a model instance.|
| OH_NN_ReturnCode OH_NNModel_SpecifyInputsAndOutputs(OH_NNModel *model, const OH_NN_UInt32Array *inputIndices, const OH_NN_UInt32Array *outputIndices) | Sets an index value for the input and output tensors of a model.|
| OH_NN_ReturnCode OH_NNModel_Finish(OH_NNModel *model) | Completes model composition.|
| void OH_NNModel_Destroy(OH_NNModel **model) | Destroys a model instance.|


### Model Compilation APIs

| Name| Description|
| ------- | --- |
| OH_NNCompilation *OH_NNCompilation_Construct(const OH_NNModel *model) | Creates an **OH_NNCompilation** instance based on the specified model instance.|
| OH_NNCompilation *OH_NNCompilation_ConstructWithOfflineModelFile(const char *modelPath) | Creates an **OH_NNCompilation** instance based on the specified offline model file path.|
| OH_NNCompilation *OH_NNCompilation_ConstructWithOfflineModelBuffer(const void *modelBuffer, size_t modelSize) | Creates an **OH_NNCompilation** instance based on the specified offline model buffer.|
| OH_NNCompilation *OH_NNCompilation_ConstructForCache() | Creates an empty model building instance for later recovery from the model cache.|
| OH_NN_ReturnCode OH_NNCompilation_ExportCacheToBuffer(OH_NNCompilation *compilation, const void *buffer, size_t length, size_t *modelSize) | Writes the model cache to the specified buffer.|
| OH_NN_ReturnCode OH_NNCompilation_ImportCacheFromBuffer(OH_NNCompilation *compilation, const void *buffer, size_t modelSize) | Reads the model cache from the specified buffer.|
| OH_NN_ReturnCode OH_NNCompilation_AddExtensionConfig(OH_NNCompilation *compilation, const char *configName, const void *configValue, const size_t configValueSize) | Adds extended configurations for custom device attributes. For details about the extended attribute names and values, see the documentation that comes with the device.|
| OH_NN_ReturnCode OH_NNCompilation_SetDevice(OH_NNCompilation *compilation, size_t deviceID) | Sets the device for model building and computing. The device ID can be obtained through the device management APIs.|
| OH_NN_ReturnCode OH_NNCompilation_SetCache(OH_NNCompilation *compilation, const char *cachePath, uint32_t version) | Sets the cache directory and version for model building.|
| OH_NN_ReturnCode OH_NNCompilation_SetPerformanceMode(OH_NNCompilation *compilation, OH_NN_PerformanceMode performanceMode) | Sets the performance mode for model computing.|
| OH_NN_ReturnCode OH_NNCompilation_SetPriority(OH_NNCompilation *compilation, OH_NN_Priority priority) | Sets the priority for model computing.|
| OH_NN_ReturnCode OH_NNCompilation_EnableFloat16(OH_NNCompilation *compilation, bool enableFloat16) | Enables float16 computing.|
| OH_NN_ReturnCode OH_NNCompilation_Build(OH_NNCompilation *compilation) | Performs model building.|
| void OH_NNCompilation_Destroy(OH_NNCompilation **compilation) | Destroys a model building instance.|

### Tensor Description APIs

| Name| Description|
| ------- | --- |
| NN_TensorDesc *OH_NNTensorDesc_Create() | Creates an **NN_TensorDesc** instance for creating an **NN_Tensor** instance at a later time.|
| OH_NN_ReturnCode OH_NNTensorDesc_SetName(NN_TensorDesc *tensorDesc, const char *name) | Sets the name of the **NN_TensorDesc** instance.|
| OH_NN_ReturnCode OH_NNTensorDesc_GetName(const NN_TensorDesc *tensorDesc, const char **name) | Obtains the name of the **NN_TensorDesc** instance.|
| OH_NN_ReturnCode OH_NNTensorDesc_SetDataType(NN_TensorDesc *tensorDesc, OH_NN_DataType dataType) | Sets the data type of the **NN_TensorDesc** instance.|
| OH_NN_ReturnCode OH_NNTensorDesc_GetDataType(const NN_TensorDesc *tensorDesc, OH_NN_DataType *dataType) | Obtains the data type of the **NN_TensorDesc** instance.|
| OH_NN_ReturnCode OH_NNTensorDesc_SetShape(NN_TensorDesc *tensorDesc, const int32_t *shape, size_t shapeLength) | Sets the shape of the **NN_TensorDesc** instance.|
| OH_NN_ReturnCode OH_NNTensorDesc_GetShape(const NN_TensorDesc *tensorDesc, int32_t **shape, size_t *shapeLength) | Obtains the shape of the **NN_TensorDesc** instance.|
| OH_NN_ReturnCode OH_NNTensorDesc_SetFormat(NN_TensorDesc *tensorDesc, OH_NN_Format format) | Sets the data format of the **NN_TensorDesc** instance.|
| OH_NN_ReturnCode OH_NNTensorDesc_GetFormat(const NN_TensorDesc *tensorDesc, OH_NN_Format *format) | Obtains the data format of the **NN_TensorDesc** instance.|
| OH_NN_ReturnCode OH_NNTensorDesc_GetElementCount(const NN_TensorDesc *tensorDesc, size_t *elementCount) | Obtains the number of elements in the **NN_TensorDesc** instance.|
| OH_NN_ReturnCode OH_NNTensorDesc_GetByteSize(const NN_TensorDesc *tensorDesc, size_t *byteSize) | Obtains the number of bytes occupied by the tensor data, calculated from the shape and data type of an **NN_TensorDesc** instance.|
| OH_NN_ReturnCode OH_NNTensorDesc_Destroy(NN_TensorDesc **tensorDesc) | Destroys an **NN_TensorDesc** instance.|

### Tensor APIs

| Name| Description|
| ------- | --- |
| NN_Tensor* OH_NNTensor_Create(size_t deviceID, NN_TensorDesc *tensorDesc) | Creates an **NN_Tensor** instance based on the specified tensor description. This API requests device shared memory.|
| NN_Tensor* OH_NNTensor_CreateWithSize(size_t deviceID, NN_TensorDesc *tensorDesc, size_t size) | Creates an **NN_Tensor** instance based on the specified memory size and tensor description. This API requests device shared memory.|
| NN_Tensor* OH_NNTensor_CreateWithFd(size_t deviceID, NN_TensorDesc *tensorDesc, int fd, size_t size, size_t offset) | Creates an **NN_Tensor** instance based on the specified file descriptor of the shared memory and tensor description. This way, the device shared memory of other tensors can be reused.|
| NN_TensorDesc* OH_NNTensor_GetTensorDesc(const NN_Tensor *tensor) | Obtains the pointer to the **NN_TensorDesc** instance of a tensor to read tensor attributes, such as the data type and shape.|
| void* OH_NNTensor_GetDataBuffer(const NN_Tensor *tensor) | Obtains the memory address of tensor data to read or write tensor data.|
| OH_NN_ReturnCode OH_NNTensor_GetFd(const NN_Tensor *tensor, int *fd) | Obtains the file descriptor of the shared memory where the tensor data is located. A file descriptor corresponds to one device shared memory block.|
| OH_NN_ReturnCode OH_NNTensor_GetSize(const NN_Tensor *tensor, size_t *size) | Obtains the size of the shared memory where the tensor data is located.|
| OH_NN_ReturnCode OH_NNTensor_GetOffset(const NN_Tensor *tensor, size_t *offset) | Obtains the offset of the tensor data in the shared memory. The available size of the tensor data is the size of the shared memory minus the offset.|
| OH_NN_ReturnCode OH_NNTensor_Destroy(NN_Tensor **tensor) | Destroys an **NN_Tensor** instance.|

### Inference APIs

| Name| Description|
| ------- | --- |
| OH_NNExecutor *OH_NNExecutor_Construct(OH_NNCompilation *compilation) | Creates an **OH_NNExecutor** instance.|
| OH_NN_ReturnCode OH_NNExecutor_GetOutputShape(OH_NNExecutor *executor, uint32_t outputIndex, int32_t **shape, uint32_t *shapeLength) | Obtains the dimension information about the output tensor. This API is applicable only if the output tensor has a dynamic shape.|
| OH_NN_ReturnCode OH_NNExecutor_GetInputCount(const OH_NNExecutor *executor, size_t *inputCount) | Obtains the number of input tensors.|
| OH_NN_ReturnCode OH_NNExecutor_GetOutputCount(const OH_NNExecutor *executor, size_t *outputCount) | Obtains the number of output tensors.|
| NN_TensorDesc* OH_NNExecutor_CreateInputTensorDesc(const OH_NNExecutor *executor, size_t index) | Creates an **NN_TensorDesc** instance for an input tensor based on the specified index value. The instance is used to read tensor attributes or create **NN_Tensor** instances.|
| NN_TensorDesc* OH_NNExecutor_CreateOutputTensorDesc(const OH_NNExecutor *executor, size_t index) | Creates an **NN_TensorDesc** instance for an output tensor based on the specified index value. The instance is used to read tensor attributes or create **NN_Tensor** instances.|
| OH_NN_ReturnCode OH_NNExecutor_GetInputDimRange(const OH_NNExecutor *executor, size_t index, size_t **minInputDims, size_t **maxInputDims, size_t *shapeLength) | Obtains the dimension range of all input tensors. If an input tensor has a dynamic shape, the dimension range supported by the tensor may vary by device.|
| OH_NN_ReturnCode OH_NNExecutor_SetOnRunDone(OH_NNExecutor *executor, NN_OnRunDone onRunDone) | Sets the callback function invoked when asynchronous inference ends. For the definition of the callback function, see the *API Reference*.|
| OH_NN_ReturnCode OH_NNExecutor_SetOnServiceDied(OH_NNExecutor *executor, NN_OnServiceDied onServiceDied) | Sets the callback function invoked when the device driver service terminates unexpectedly during asynchronous inference. For the definition of the callback function, see the *API Reference*.|
| OH_NN_ReturnCode OH_NNExecutor_RunSync(OH_NNExecutor *executor, NN_Tensor *inputTensor[], size_t inputCount, NN_Tensor *outputTensor[], size_t outputCount) | Performs synchronous inference.|
| OH_NN_ReturnCode OH_NNExecutor_RunAsync(OH_NNExecutor *executor, NN_Tensor *inputTensor[], size_t inputCount, NN_Tensor *outputTensor[], size_t outputCount, int32_t timeout, void *userData) | Performs asynchronous inference.|
| void OH_NNExecutor_Destroy(OH_NNExecutor **executor) | Destroys an **OH_NNExecutor** instance.|

### Device Management APIs

| Name| Description|
| ------- | --- |
| OH_NN_ReturnCode OH_NNDevice_GetAllDevicesID(const size_t **allDevicesID, uint32_t *deviceCount) | Obtains the IDs of the devices connected to NNRt.|
| OH_NN_ReturnCode OH_NNDevice_GetName(size_t deviceID, const char **name) | Obtains the name of the specified device.|
| OH_NN_ReturnCode OH_NNDevice_GetType(size_t deviceID, OH_NN_DeviceType *deviceType) | Obtains the type of the specified device.|


## How to Develop

The development process of NNRt consists of three phases: model construction, model compilation, and inference execution. The following uses the `Add` single-operator model as an example to describe how to call NNRt APIs during application development.

1. Create an application sample file.

    Create the source file of the NNRt application sample. Run the following commands in the project directory to create the `nnrt_example/` directory and create the `nnrt_example.cpp` source file in the directory:

    ```shell
    mkdir ~/nnrt_example && cd ~/nnrt_example
    touch nnrt_example.cpp
    ```

2. Import the NNRt module.

    Add the following code at the beginning of the `nnrt_example.cpp` file to import NNRt. The `<vector>` header is included because `std::vector` is used in later steps.

    ```cpp
    #include <iostream>
    #include <cstdarg>
    #include <vector>
    #include "neural_network_runtime/neural_network_runtime.h"
    ```

3. Define auxiliary functions, such as those for checking return values, setting input data, and printing output data.

    ```cpp
    // Macro for checking the return value
    #define CHECKNEQ(realRet, expectRet, retValue, ...) \
        do { \
            if ((realRet) != (expectRet)) { \
                printf(__VA_ARGS__); \
                return (retValue); \
            } \
        } while (0)

    #define CHECKEQ(realRet, expectRet, retValue, ...) \
        do { \
            if ((realRet) == (expectRet)) { \
                printf(__VA_ARGS__); \
                return (retValue); \
            } \
        } while (0)

    // Set the input data for inference.
    OH_NN_ReturnCode SetInputData(NN_Tensor* inputTensor[], size_t inputSize)
    {
        OH_NN_DataType dataType(OH_NN_FLOAT32);
        OH_NN_ReturnCode ret{OH_NN_FAILED};
        size_t elementCount = 0;
        for (size_t i = 0; i < inputSize; ++i) {
            // Obtain the data memory of the tensor.
            auto data = OH_NNTensor_GetDataBuffer(inputTensor[i]);
            CHECKEQ(data, nullptr, OH_NN_FAILED, "Failed to get data buffer.");
            // Obtain the tensor description.
            auto desc = OH_NNTensor_GetTensorDesc(inputTensor[i]);
            CHECKEQ(desc, nullptr, OH_NN_FAILED, "Failed to get desc.");
            // Obtain the data type of the tensor.
            ret = OH_NNTensorDesc_GetDataType(desc, &dataType);
            CHECKNEQ(ret, OH_NN_SUCCESS, OH_NN_FAILED, "Failed to get data type.");
            // Obtain the number of elements in the tensor.
            ret = OH_NNTensorDesc_GetElementCount(desc, &elementCount);
            CHECKNEQ(ret, OH_NN_SUCCESS, OH_NN_FAILED, "Failed to get element count.");
            switch (dataType) {
                case OH_NN_FLOAT32: {
                    float* floatValue = reinterpret_cast<float*>(data);
                    for (size_t j = 0; j < elementCount; ++j) {
                        floatValue[j] = static_cast<float>(j);
                    }
                    break;
                }
                case OH_NN_INT32: {
                    int* intValue = reinterpret_cast<int*>(data);
                    for (size_t j = 0; j < elementCount; ++j) {
                        intValue[j] = static_cast<int>(j);
                    }
                    break;
                }
                default:
                    return OH_NN_FAILED;
            }
        }
        return OH_NN_SUCCESS;
    }

    // Print the output data after inference.
    OH_NN_ReturnCode Print(NN_Tensor* outputTensor[], size_t outputSize)
    {
        OH_NN_DataType dataType(OH_NN_FLOAT32);
        OH_NN_ReturnCode ret{OH_NN_FAILED};
        size_t elementCount = 0;
        for (size_t i = 0; i < outputSize; ++i) {
            auto data = OH_NNTensor_GetDataBuffer(outputTensor[i]);
            CHECKEQ(data, nullptr, OH_NN_FAILED, "Failed to get data buffer.");
            auto desc = OH_NNTensor_GetTensorDesc(outputTensor[i]);
            CHECKEQ(desc, nullptr, OH_NN_FAILED, "Failed to get desc.");
            ret = OH_NNTensorDesc_GetDataType(desc, &dataType);
            CHECKNEQ(ret, OH_NN_SUCCESS, OH_NN_FAILED, "Failed to get data type.");
            ret = OH_NNTensorDesc_GetElementCount(desc, &elementCount);
            CHECKNEQ(ret, OH_NN_SUCCESS, OH_NN_FAILED, "Failed to get element count.");
            switch (dataType) {
                case OH_NN_FLOAT32: {
                    float* floatValue = reinterpret_cast<float*>(data);
                    for (size_t j = 0; j < elementCount; ++j) {
                        std::cout << "Output index: " << j << ", value is: " << floatValue[j] << "." << std::endl;
                    }
                    break;
                }
                case OH_NN_INT32: {
                    int* intValue = reinterpret_cast<int*>(data);
                    for (size_t j = 0; j < elementCount; ++j) {
                        std::cout << "Output index: " << j << ", value is: " << intValue[j] << "." << std::endl;
                    }
                    break;
                }
                default:
                    return OH_NN_FAILED;
            }
        }

        return OH_NN_SUCCESS;
    }
    ```

4. Construct a model.

    Use the model construction APIs to construct a single `Add` operator model.

    ```cpp
    OH_NN_ReturnCode BuildModel(OH_NNModel** pmodel)
    {
        // Create a model instance and construct the model.
        OH_NNModel* model = OH_NNModel_Construct();
        CHECKEQ(model, nullptr, OH_NN_FAILED, "Create model failed.");

        // Add the first input tensor of the float32 type for the Add operator. The tensor shape is [1, 2, 2, 3].
        NN_TensorDesc* tensorDesc = OH_NNTensorDesc_Create();
        CHECKEQ(tensorDesc, nullptr, OH_NN_FAILED, "Create TensorDesc failed.");

        int32_t inputDims[4] = {1, 2, 2, 3};
        auto returnCode = OH_NNTensorDesc_SetShape(tensorDesc, inputDims, 4);
        CHECKNEQ(returnCode, OH_NN_SUCCESS, OH_NN_FAILED, "Set TensorDesc shape failed.");

        returnCode = OH_NNTensorDesc_SetDataType(tensorDesc, OH_NN_FLOAT32);
        CHECKNEQ(returnCode, OH_NN_SUCCESS, OH_NN_FAILED, "Set TensorDesc data type failed.");

        returnCode = OH_NNTensorDesc_SetFormat(tensorDesc, OH_NN_FORMAT_NONE);
        CHECKNEQ(returnCode, OH_NN_SUCCESS, OH_NN_FAILED, "Set TensorDesc format failed.");

        returnCode = OH_NNModel_AddTensorToModel(model, tensorDesc);
        CHECKNEQ(returnCode, OH_NN_SUCCESS, OH_NN_FAILED, "Add first TensorDesc to model failed.");

        returnCode = OH_NNModel_SetTensorType(model, 0, OH_NN_TENSOR);
        CHECKNEQ(returnCode, OH_NN_SUCCESS, OH_NN_FAILED, "Set model tensor type failed.");

        // Add the second input tensor of the float32 type for the Add operator. The tensor shape is [1, 2, 2, 3].
        tensorDesc = OH_NNTensorDesc_Create();
        CHECKEQ(tensorDesc, nullptr, OH_NN_FAILED, "Create TensorDesc failed.");

        returnCode = OH_NNTensorDesc_SetShape(tensorDesc, inputDims, 4);
        CHECKNEQ(returnCode, OH_NN_SUCCESS, OH_NN_FAILED, "Set TensorDesc shape failed.");

        returnCode = OH_NNTensorDesc_SetDataType(tensorDesc, OH_NN_FLOAT32);
        CHECKNEQ(returnCode, OH_NN_SUCCESS, OH_NN_FAILED, "Set TensorDesc data type failed.");

        returnCode = OH_NNTensorDesc_SetFormat(tensorDesc, OH_NN_FORMAT_NONE);
        CHECKNEQ(returnCode, OH_NN_SUCCESS, OH_NN_FAILED, "Set TensorDesc format failed.");

        returnCode = OH_NNModel_AddTensorToModel(model, tensorDesc);
        CHECKNEQ(returnCode, OH_NN_SUCCESS, OH_NN_FAILED, "Add second TensorDesc to model failed.");

        returnCode = OH_NNModel_SetTensorType(model, 1, OH_NN_TENSOR);
        CHECKNEQ(returnCode, OH_NN_SUCCESS, OH_NN_FAILED, "Set model tensor type failed.");

        // Add the parameter tensor of the int8 type for the Add operator. The parameter tensor is used to specify the type of the activation function.
        tensorDesc = OH_NNTensorDesc_Create();
        CHECKEQ(tensorDesc, nullptr, OH_NN_FAILED, "Create TensorDesc failed.");

        int32_t activationDims = 1;
        returnCode = OH_NNTensorDesc_SetShape(tensorDesc, &activationDims, 1);
        CHECKNEQ(returnCode, OH_NN_SUCCESS, OH_NN_FAILED, "Set TensorDesc shape failed.");

        returnCode = OH_NNTensorDesc_SetDataType(tensorDesc, OH_NN_INT8);
        CHECKNEQ(returnCode, OH_NN_SUCCESS, OH_NN_FAILED, "Set TensorDesc data type failed.");

        returnCode = OH_NNTensorDesc_SetFormat(tensorDesc, OH_NN_FORMAT_NONE);
        CHECKNEQ(returnCode, OH_NN_SUCCESS, OH_NN_FAILED, "Set TensorDesc format failed.");

        returnCode = OH_NNModel_AddTensorToModel(model, tensorDesc);
        CHECKNEQ(returnCode, OH_NN_SUCCESS, OH_NN_FAILED, "Add third TensorDesc to model failed.");

        returnCode = OH_NNModel_SetTensorType(model, 2, OH_NN_ADD_ACTIVATIONTYPE);
        CHECKNEQ(returnCode, OH_NN_SUCCESS, OH_NN_FAILED, "Set model tensor type failed.");

        // Set the type of the activation function to OH_NN_FUSED_NONE, indicating that no activation function is added to the operator.
        int8_t activationValue = OH_NN_FUSED_NONE;
        returnCode = OH_NNModel_SetTensorData(model, 2, &activationValue, sizeof(int8_t));
        CHECKNEQ(returnCode, OH_NN_SUCCESS, OH_NN_FAILED, "Set model tensor data failed.");

        // Add the output tensor of the float32 type for the Add operator. The tensor shape is [1, 2, 2, 3].
        tensorDesc = OH_NNTensorDesc_Create();
        CHECKEQ(tensorDesc, nullptr, OH_NN_FAILED, "Create TensorDesc failed.");

        returnCode = OH_NNTensorDesc_SetShape(tensorDesc, inputDims, 4);
        CHECKNEQ(returnCode, OH_NN_SUCCESS, OH_NN_FAILED, "Set TensorDesc shape failed.");

        returnCode = OH_NNTensorDesc_SetDataType(tensorDesc, OH_NN_FLOAT32);
        CHECKNEQ(returnCode, OH_NN_SUCCESS, OH_NN_FAILED, "Set TensorDesc data type failed.");

        returnCode = OH_NNTensorDesc_SetFormat(tensorDesc, OH_NN_FORMAT_NONE);
        CHECKNEQ(returnCode, OH_NN_SUCCESS, OH_NN_FAILED, "Set TensorDesc format failed.");

        returnCode = OH_NNModel_AddTensorToModel(model, tensorDesc);
        CHECKNEQ(returnCode, OH_NN_SUCCESS, OH_NN_FAILED, "Add fourth TensorDesc to model failed.");

        returnCode = OH_NNModel_SetTensorType(model, 3, OH_NN_TENSOR);
        CHECKNEQ(returnCode, OH_NN_SUCCESS, OH_NN_FAILED, "Set model tensor type failed.");

        // Specify the index values of the input tensors, parameter tensor, and output tensor for the Add operator.
        uint32_t inputIndicesValues[2] = {0, 1};
        uint32_t paramIndicesValues = 2;
        uint32_t outputIndicesValues = 3;
        OH_NN_UInt32Array paramIndices = {&paramIndicesValues, 1};
        OH_NN_UInt32Array inputIndices = {inputIndicesValues, 2};
        OH_NN_UInt32Array outputIndices = {&outputIndicesValues, 1};

        // Add the Add operator to the model instance.
        returnCode = OH_NNModel_AddOperation(model, OH_NN_OPS_ADD, &paramIndices, &inputIndices, &outputIndices);
        CHECKNEQ(returnCode, OH_NN_SUCCESS, OH_NN_FAILED, "Add operation to model failed.");

        // Set the index values of the input tensors and output tensor for the model instance.
        returnCode = OH_NNModel_SpecifyInputsAndOutputs(model, &inputIndices, &outputIndices);
        CHECKNEQ(returnCode, OH_NN_SUCCESS, OH_NN_FAILED, "Specify model inputs and outputs failed.");

        // Complete the model instance construction.
        returnCode = OH_NNModel_Finish(model);
        CHECKNEQ(returnCode, OH_NN_SUCCESS, OH_NN_FAILED, "Build model failed.");

        // Return the model instance.
        *pmodel = model;
        return OH_NN_SUCCESS;
    }
    ```

5. Query the AI acceleration chips connected to NNRt.

    NNRt can connect to multiple AI acceleration chips through HDIs. Before model building, you need to query the AI acceleration chips connected to NNRt on the current device.
    Each AI acceleration chip has a unique ID. In the compilation phase, you need to specify the chip for model compilation based on its ID.
    ```cpp
    void GetAvailableDevices(std::vector<size_t>& availableDevice)
    {
        availableDevice.clear();

        // Obtain the available hardware IDs.
        const size_t* devices = nullptr;
        uint32_t deviceCount = 0;
        OH_NN_ReturnCode ret = OH_NNDevice_GetAllDevicesID(&devices, &deviceCount);
        if (ret != OH_NN_SUCCESS) {
            std::cout << "GetAllDevicesID failed, get no available device." << std::endl;
            return;
        }

        for (uint32_t i = 0; i < deviceCount; i++) {
            availableDevice.emplace_back(devices[i]);
        }
    }
    ```

6. Compile a model on the specified device.

    NNRt uses abstract model expressions to describe the topology of an AI model. Before inference is executed on an AI acceleration chip, the build module provided by NNRt needs to deliver the abstract model expressions to the chip driver layer and convert them into a format that supports inference and computing.
    ```cpp
    OH_NN_ReturnCode CreateCompilation(OH_NNModel* model, const std::vector<size_t>& availableDevice,
                                       OH_NNCompilation** pCompilation)
    {
        // Create an OH_NNCompilation instance and pass the constructed model instance or the MindSpore Lite model instance to it.
        OH_NNCompilation* compilation = OH_NNCompilation_Construct(model);
        CHECKEQ(compilation, nullptr, OH_NN_FAILED, "OH_NNCompilation_Construct failed.");

        // Set compilation options, such as the compilation hardware, cache path, performance mode, computing priority, and whether to enable float16 low-precision computing.
        // Choose to perform model compilation on the first device.
        auto returnCode = OH_NNCompilation_SetDevice(compilation, availableDevice[0]);
        CHECKNEQ(returnCode, OH_NN_SUCCESS, OH_NN_FAILED, "OH_NNCompilation_SetDevice failed.");

        // Have the model compilation result cached in the /data/local/tmp directory, with the version number set to 1.
        returnCode = OH_NNCompilation_SetCache(compilation, "/data/local/tmp", 1);
        CHECKNEQ(returnCode, OH_NN_SUCCESS, OH_NN_FAILED, "OH_NNCompilation_SetCache failed.");

        // Set the performance mode of the device.
        returnCode = OH_NNCompilation_SetPerformanceMode(compilation, OH_NN_PERFORMANCE_EXTREME);
        CHECKNEQ(returnCode, OH_NN_SUCCESS, OH_NN_FAILED, "OH_NNCompilation_SetPerformanceMode failed.");

        // Set the inference priority.
        returnCode = OH_NNCompilation_SetPriority(compilation, OH_NN_PRIORITY_HIGH);
        CHECKNEQ(returnCode, OH_NN_SUCCESS, OH_NN_FAILED, "OH_NNCompilation_SetPriority failed.");

        // Specify whether to enable FP16 computing.
        returnCode = OH_NNCompilation_EnableFloat16(compilation, false);
        CHECKNEQ(returnCode, OH_NN_SUCCESS, OH_NN_FAILED, "OH_NNCompilation_EnableFloat16 failed.");

        // Perform model building.
        returnCode = OH_NNCompilation_Build(compilation);
        CHECKNEQ(returnCode, OH_NN_SUCCESS, OH_NN_FAILED, "OH_NNCompilation_Build failed.");

        *pCompilation = compilation;
        return OH_NN_SUCCESS;
    }
    ```

7. Create an executor.

    After the model building is complete, you need to call the NNRt execution module to create an executor. In the inference phase, operations such as setting the model input, obtaining the model output, and triggering inference computing are performed through the executor.
    ```cpp
    OH_NNExecutor* CreateExecutor(OH_NNCompilation* compilation)
    {
        // Create an executor based on the specified OH_NNCompilation instance.
        OH_NNExecutor *executor = OH_NNExecutor_Construct(compilation);
        CHECKEQ(executor, nullptr, nullptr, "OH_NNExecutor_Construct failed.");
        return executor;
    }
    ```

8.
Perform inference computing and print the inference result.

    Pass the input data required for inference to the executor through the APIs provided by the execution module, trigger inference computing once, and then obtain and print the result.
    ```cpp
    OH_NN_ReturnCode Run(OH_NNExecutor* executor, const std::vector<size_t>& availableDevice)
    {
        // Obtain information about the input and output tensors from the executor.
        // Obtain the number of input tensors.
        size_t inputCount = 0;
        auto returnCode = OH_NNExecutor_GetInputCount(executor, &inputCount);
        CHECKNEQ(returnCode, OH_NN_SUCCESS, OH_NN_FAILED, "OH_NNExecutor_GetInputCount failed.");
        std::vector<NN_TensorDesc*> inputTensorDescs;
        NN_TensorDesc* tensorDescTmp = nullptr;
        for (size_t i = 0; i < inputCount; ++i) {
            // Create the description of the input tensor.
            tensorDescTmp = OH_NNExecutor_CreateInputTensorDesc(executor, i);
            CHECKEQ(tensorDescTmp, nullptr, OH_NN_FAILED, "OH_NNExecutor_CreateInputTensorDesc failed.");
            inputTensorDescs.emplace_back(tensorDescTmp);
        }
        // Obtain the number of output tensors.
        size_t outputCount = 0;
        returnCode = OH_NNExecutor_GetOutputCount(executor, &outputCount);
        CHECKNEQ(returnCode, OH_NN_SUCCESS, OH_NN_FAILED, "OH_NNExecutor_GetOutputCount failed.");
        std::vector<NN_TensorDesc*> outputTensorDescs;
        for (size_t i = 0; i < outputCount; ++i) {
            // Create the description of the output tensor.
            tensorDescTmp = OH_NNExecutor_CreateOutputTensorDesc(executor, i);
            CHECKEQ(tensorDescTmp, nullptr, OH_NN_FAILED, "OH_NNExecutor_CreateOutputTensorDesc failed.");
            outputTensorDescs.emplace_back(tensorDescTmp);
        }

        // Create input and output tensors.
        NN_Tensor* inputTensors[inputCount];
        NN_Tensor* tensor = nullptr;
        for (size_t i = 0; i < inputCount; ++i) {
            tensor = nullptr;
            tensor = OH_NNTensor_Create(availableDevice[0], inputTensorDescs[i]);
            CHECKEQ(tensor, nullptr, OH_NN_FAILED, "OH_NNTensor_Create failed.");
            inputTensors[i] = tensor;
        }
        NN_Tensor* outputTensors[outputCount];
        for (size_t i = 0; i < outputCount; ++i) {
            tensor = nullptr;
            tensor = OH_NNTensor_Create(availableDevice[0], outputTensorDescs[i]);
            CHECKEQ(tensor, nullptr, OH_NN_FAILED, "OH_NNTensor_Create failed.");
            outputTensors[i] = tensor;
        }

        // Set the data of the input tensor.
        returnCode = SetInputData(inputTensors, inputCount);
        CHECKNEQ(returnCode, OH_NN_SUCCESS, OH_NN_FAILED, "SetInputData failed.");

        // Perform inference.
        returnCode = OH_NNExecutor_RunSync(executor, inputTensors, inputCount, outputTensors, outputCount);
        CHECKNEQ(returnCode, OH_NN_SUCCESS, OH_NN_FAILED, "OH_NNExecutor_RunSync failed.");

        // Print the data of the output tensor.
        Print(outputTensors, outputCount);

        // Clear the input and output tensors and tensor descriptions.
        for (size_t i = 0; i < inputCount; ++i) {
            returnCode = OH_NNTensor_Destroy(&inputTensors[i]);
            CHECKNEQ(returnCode, OH_NN_SUCCESS, OH_NN_FAILED, "OH_NNTensor_Destroy failed.");
            returnCode = OH_NNTensorDesc_Destroy(&inputTensorDescs[i]);
            CHECKNEQ(returnCode, OH_NN_SUCCESS, OH_NN_FAILED, "OH_NNTensorDesc_Destroy failed.");
        }
        for (size_t i = 0; i < outputCount; ++i) {
            returnCode = OH_NNTensor_Destroy(&outputTensors[i]);
            CHECKNEQ(returnCode, OH_NN_SUCCESS, OH_NN_FAILED, "OH_NNTensor_Destroy failed.");
            returnCode = OH_NNTensorDesc_Destroy(&outputTensorDescs[i]);
            CHECKNEQ(returnCode, OH_NN_SUCCESS, OH_NN_FAILED, "OH_NNTensorDesc_Destroy failed.");
        }

        return OH_NN_SUCCESS;
    }
    ```

9.
Build an end-to-end process from model construction to model compilation and execution.

    Steps 4 to 8 implement the model construction, compilation, and execution processes and encapsulate them into separate functions to facilitate modular development. The following sample code shows how to combine these functions into a complete NNRt development process.
    ```cpp
    int main(int argc, char** argv)
    {
        OH_NNModel* model = nullptr;
        OH_NNCompilation* compilation = nullptr;
        OH_NNExecutor* executor = nullptr;
        std::vector<size_t> availableDevices;

        // Construct a model.
        OH_NN_ReturnCode ret = BuildModel(&model);
        if (ret != OH_NN_SUCCESS) {
            std::cout << "BuildModel failed." << std::endl;
            OH_NNModel_Destroy(&model);
            return -1;
        }

        // Obtain the available devices.
        GetAvailableDevices(availableDevices);
        if (availableDevices.empty()) {
            std::cout << "No available device." << std::endl;
            OH_NNModel_Destroy(&model);
            return -1;
        }

        // Build the model.
        ret = CreateCompilation(model, availableDevices, &compilation);
        if (ret != OH_NN_SUCCESS) {
            std::cout << "CreateCompilation failed."
                      << std::endl;
            OH_NNModel_Destroy(&model);
            OH_NNCompilation_Destroy(&compilation);
            return -1;
        }

        // Destroy the model instance.
        OH_NNModel_Destroy(&model);

        // Create an inference executor for the model.
        executor = CreateExecutor(compilation);
        if (executor == nullptr) {
            std::cout << "CreateExecutor failed, no executor is created." << std::endl;
            OH_NNCompilation_Destroy(&compilation);
            return -1;
        }

        // Destroy the model building instance.
        OH_NNCompilation_Destroy(&compilation);

        // Use the created executor to perform inference.
        ret = Run(executor, availableDevices);
        if (ret != OH_NN_SUCCESS) {
            std::cout << "Run failed." << std::endl;
            OH_NNExecutor_Destroy(&executor);
            return -1;
        }

        // Destroy the executor instance.
        OH_NNExecutor_Destroy(&executor);

        return 0;
    }
    ```

## Verification

1. Prepare the compilation configuration file of the application sample.

    Create a `CMakeLists.txt` file, and add compilation configurations for the application sample file `nnrt_example.cpp`. The following is a simple example of the `CMakeLists.txt` file:
    ```text
    cmake_minimum_required(VERSION 3.16)
    project(nnrt_example C CXX)

    add_executable(nnrt_example
        ./nnrt_example.cpp
    )

    target_link_libraries(nnrt_example
        neural_network_runtime
        neural_network_core
    )
    ```

2. Compile the application sample.

    Create the **build/** directory in the current directory, and compile `nnrt_example.cpp` in the **build/** directory to obtain the binary file `nnrt_example`:
    ```shell
    mkdir build && cd build
    cmake -DCMAKE_TOOLCHAIN_FILE={Path of the cross-compilation toolchain}/build/cmake/ohos.toolchain.cmake -DOHOS_ARCH=arm64-v8a -DOHOS_PLATFORM=OHOS -DOHOS_STL=c++_static ..
    make
    ```

3. Push the application sample to the device for execution.
    ```shell
    # Push the nnrt_example binary obtained through compilation to the device.
    hdc_std file send ./nnrt_example /data/local/tmp/.

    # Grant required permissions to the executable file of the test case.
    hdc_std shell "chmod +x /data/local/tmp/nnrt_example"

    # Execute the test case.
    hdc_std shell "/data/local/tmp/nnrt_example"
    ```

    If the execution is normal, information similar to the following is displayed:
    ```text
    Output index: 0, value is: 0.000000.
    Output index: 1, value is: 2.000000.
    Output index: 2, value is: 4.000000.
    Output index: 3, value is: 6.000000.
    Output index: 4, value is: 8.000000.
    Output index: 5, value is: 10.000000.
    Output index: 6, value is: 12.000000.
    Output index: 7, value is: 14.000000.
    Output index: 8, value is: 16.000000.
    Output index: 9, value is: 18.000000.
    Output index: 10, value is: 20.000000.
    Output index: 11, value is: 22.000000.
    ```

4. (Optional) Check the model cache.

    If the HDI service connected to NNRt supports the model cache function, you can find the generated cache files in the `/data/local/tmp` directory after `nnrt_example` is executed successfully.

    > **NOTE**
    >
    > The IR graphs of the model need to be passed to the hardware driver layer, so that the HDI service can compile them into a computing graph dedicated to the hardware. The compilation process is time-consuming. NNRt supports the computing graph cache feature.
    > It can cache the computing graphs compiled by the HDI service to the device storage. If the same model is compiled on the same acceleration chip next time, you can specify the cache path so that NNRt directly loads the computing graphs from the cache file, reducing the compilation time.

    Check the cached files in the cache directory.
    ```shell
    ls /data/local/tmp
    ```

    The command output is as follows:
    ```text
    # 0.nncache 1.nncache 2.nncache cache_info.nncache
    ```

    If the cache is no longer used, manually delete the cache files.
    ```shell
    rm /data/local/tmp/*nncache
    ```