2.39.0 |
Genie C API 1.13.0:
Introduced new profiling option for collecting detailed trace events.
Added the GenieDialog_embeddingTokenQuery API.
Added the GenieDialog_setMaxNumTokens API.
Added the GENIE_STATUS_WARNING_CONTEXT_EXCEEDED status code to
provide a specific status when a prompt exceeds the model’s context
length limit.
Bugfixes:
Fixed an issue that caused incorrect calculation of KV cache tensor
sizes on the HTP backend, which could lead to segmentation faults.
Fixed an issue where no output was generated for certain models when
the prompt prefill phase required multiple graph executions.
|
2.38.0 |
|
2.37.0 |
|
2.36.0 |
|
2.35.0 |
|
2.34.0 |
Genie C API 1.8.0:
Added GenieEngine.h, GenieDialog_getEngine, and
GenieDialog_bindEngine APIs.
Added GenieSampler_registerUserDataCallback API which adds a userData
argument to the sampler custom callback.
Added token-acceptance-rate to the GenieProfile output for some
dialog types.
Added the Eaglet dialog type.
SDK:
Bugfixes:
Fixed genie-t2t-run issue where dialog de-init data was not saved.
Fixed issue where GenieEmbedding_generate would return a rank of 0.
Fixed issue where quantized value may overflow or underflow.
|
2.33.0 |
Genie C API 1.7.0:
Added the GenieLog.h API.
Added LoRA adapter switch latency to GenieProfile output.
Allow sampler type to be changed in GenieSampler_applyConfig.
Bugfixes:
|
2.32.0 |
Genie C API 1.6.0:
SDK:
Added Windows build support for the source code examples.
Reorganized the Genie SDK documentation.
Removed shift concat and pointer shift KV cache update methods in
lieu of smart mask.
Bugfixes:
Fixed issue where SPD token rate is incorrectly reported when the
query is aborted.
Fixed issue where multi-token stop sequences were not fully omitted
in queryCallback and KV$.
Fixed issue where tokenizer state is corrupted after a query abort.
Fixed issue where a Gen AI Transformer dialog attempts to double free
memory.
Fixed a qnn-genai-transformer-composer failure when preparing LoRA
adapters.
Fixed a performance regression for kv-share dialogs using the token
query API.
|
2.31.0 |
|
2.30.0 |
Genie C API 1.4.0:
Added GenieProfile.h APIs.
Added GENIE_DIALOG_SENTENCE_REWIND sentence code option.
Added support for dialog custom sampler implementations.
Added GenieDialog_setStopSequence API to allow updating the stop
sequence configuration between dialog queries.
Bugfixes:
|
2.29.0 |
|
2.28.0 |
|
2.27.0 |
|
2.26.0 |
|
2.25.0 |
|
2.23.0 |
|