ipex-llm/python/llm/example
Ruonan Wang e9aa2bd890 LLM: reduce GPU 1st token latency and update example (#8763)
* reduce 1st token latency

* update example

* fix

* fix style

* update readme of gpu benchmark
2023-08-16 18:01:23 +08:00
cpp-python LLM: update langchain and cpp-python style API examples (#8456) 2023-07-06 14:36:42 +08:00
langchain LLM: fix langchain native int4 voiceasistant example (#8750) 2023-08-14 17:23:33 +08:00
transformers LLM: reduce GPU 1st token latency and update example (#8763) 2023-08-16 18:01:23 +08:00