ipex-llm

SONG Ge dfb00e37e9 [LLM] Add model correctness test on ARC for llama and falcon (#9347)
* add correctness test on arc for llama model
* modify layer name
* add falcon ut
* refactor and add ut for falcon model
* modify lambda positions and update docs
* replace loading pre input with last decodelayer output
* switch lower bound to single model instead of using the common one
* make the code implementation simple
* fix gpu action allocation memory issue
2023-11-10 13:48:57 +08:00
test_optimize_model.py	[LLM] Add model correctness test on ARC for llama and falcon (#9347)	2023-11-10 13:48:57 +08:00
test_transformers_api.py	Add inference test for Whisper model on Arc (#9330)	2023-11-03 10:15:52 +08:00
