This website requires JavaScript.
Explore
Help
Sign In
ayo
/
ipex-llm
Watch
1
Fork
You've already forked ipex-llm
0
Code
Issues
Pull requests
Projects
Releases
Packages
Wiki
Activity
Actions
1
39bcb33a67
ipex-llm
/
python
/
llm
/
src
/
ipex_llm
/
transformers
/
npu_models
History
Zhao Changmin
cf8eb7b128
Init NPU quantize method and support q8_0_rtn (
#11452
)
...
* q8_0_rtn * fix float point
2024-07-01 13:45:07 +08:00
..
__init__.py
optimize llama npu perf (
#11426
)
2024-06-25 17:43:20 +08:00
common.py
optimize npu llama perf again (
#11431
)
2024-06-26 10:52:54 +08:00
convert.py
Init NPU quantize method and support q8_0_rtn (
#11452
)
2024-07-01 13:45:07 +08:00
llama.py
fix npu llama2 (
#11471
)
2024-07-01 10:14:11 +08:00