QNN HTP¶
The following tutorial demonstrates running the Llama 2 7B model on the QNN HTP backend using genie-t2t-run.
Note
This section assumes that the QNN HTP context binaries have been obtained via the QNN workflow.
An example backend_ext_config.json can be found at
${QNN_SDK_ROOT}/examples/Genie/configs/htp_backend_ext_config.json.
For more information on the QNN HTP backend extension configurations options, please refer to
${QNN_SDK_ROOT}/docs/QNN/general/htp/htp_backend.html.
Please select your target platform: