ipex-llm/python/llm/test/inference
Yuwen Hu c6d4f91777 [LLM] Add UTs of load_low_bit for transformers-style API (#10001)
* Add uts for transformers api load_low_bit generation

* Small fixes

* Remove replit-code for CPU tests due to current load_low_bit issue on MPT

* Small change

* Small reorganization to llm unit tests on CPU

* Small fixes
2024-01-29 10:18:23 +08:00
..
test_call_models.py [LLM] Unify Transformers and Native API (#8713) 2023-08-11 19:45:47 +08:00
test_optimize_model_api.py [LLM] Add UTs of load_low_bit for transformers-style API (#10001) 2024-01-29 10:18:23 +08:00
test_transformers_api.py [LLM] Add UTs of load_low_bit for transformers-style API (#10001) 2024-01-29 10:18:23 +08:00
test_transformesr_api_434.py [LLM] Add UTs of load_low_bit for transformers-style API (#10001) 2024-01-29 10:18:23 +08:00