ipex-llm/python/llm/test/inference_gpu
Yuwen Hu c6d4f91777 [LLM] Add UTs of load_low_bit for transformers-style API (#10001)
* Add uts for transformers api load_low_bit generation

* Small fixes

* Remove replit-code for CPU tests due to current load_low_bit issue on MPT

* Small change

* Small reorganization to llm unit tests on CPU

* Small fixes
2024-01-29 10:18:23 +08:00
..
test_layer_fast_rope.py Support fast rope for training (#9745) 2024-01-17 15:51:38 +08:00
test_transformers_api.py [LLM] Add UTs of load_low_bit for transformers-style API (#10001) 2024-01-29 10:18:23 +08:00
test_transformers_api_disable_xmx.py Disable llama2 optimize model true or false test for now in Arc UTs (#10008) 2024-01-26 14:42:11 +08:00