ipex-llm/python/llm/test/inference_gpu
SONG Ge dfb00e37e9 [LLM] Add model correctness test on ARC for llama and falcon (#9347)
* add correctness test on arc for llama model

* modify layer name

* add falcon ut

* refactor and add ut for falcon model

* modify lambda positions and update docs

* replace loading pre input with last decodelayer output

* switch lower bound to single model instead of using the common one

* make the code implementation simple

* fix gpu action allocation memory issue
2023-11-10 13:48:57 +08:00
..
test_optimize_model.py [LLM] Add model correctness test on ARC for llama and falcon (#9347) 2023-11-10 13:48:57 +08:00
test_transformers_api.py Add inference test for Whisper model on Arc (#9330) 2023-11-03 10:15:52 +08:00