ipex-llm/python/llm/test/inference_gpu
Cheen Hau, 俊豪 cee9eaf542 [LLM] Fix llm arc ut oom (#9300)
* Move model to cpu after testing so that gpu memory is deallocated

* Add code comment

---------

Co-authored-by: sgwhat <ge.song@intel.com>
2023-10-30 14:38:34 +08:00
..
test_optimize_model.py [LLM] Fix llm arc ut oom (#9300) 2023-10-30 14:38:34 +08:00
test_transformers_api.py [LLM] Fix llm arc ut oom (#9300) 2023-10-30 14:38:34 +08:00