| __init__.py | optimize llama npu perf (#11426) | 2024-06-25 17:43:20 +08:00 |
| common.py | optimize npu llama perf again (#11431) | 2024-06-26 10:52:54 +08:00 |
| convert.py | add minicpm 1B/2B npu support (#11507) | 2024-07-04 16:31:04 +08:00 |
| minicpm.py | add minicpm 1B/2B npu support (#11507) | 2024-07-04 16:31:04 +08:00 |
| qwen2.py | add qwen2 npu support (#11504) | 2024-07-04 11:01:25 +08:00 |