ipex-llm/python/llm/src/ipex_llm/transformers/npu_models
2024-07-04 16:31:04 +08:00
..
__init__.py optimize llama npu perf (#11426) 2024-06-25 17:43:20 +08:00
common.py optimize npu llama perf again (#11431) 2024-06-26 10:52:54 +08:00
convert.py add minicpm 1B/2B npu support (#11507) 2024-07-04 16:31:04 +08:00
llama.py optimize npu llama long context performance (#11478) 2024-07-01 16:49:23 +08:00
minicpm.py add minicpm 1B/2B npu support (#11507) 2024-07-04 16:31:04 +08:00
qwen2.py add qwen2 npu support (#11504) 2024-07-04 11:01:25 +08:00