ipex-llm

ayo/ipex-llm

Fork 0

Commit graph

Author	SHA1	Message	Date
SONG Ge	dfb00e37e9	[LLM] Add model correctness test on ARC for llama and falcon (#9347 ) * add correctness test on arc for llama model * modify layer name * add falcon ut * refactor and add ut for falcon model * modify lambda positions and update docs * replace loading pre input with last decodelayer output * switch lower bound to single model instead of using the common one * make the code implementation simple * fix gpu action allocation memory issue	2023-11-10 13:48:57 +08:00
Cheen Hau, 俊豪	8f23fb04dc	Add inference test for Whisper model on Arc (#9330 ) * Add inference test for Whisper model * Remove unnecessary inference time measurement	2023-11-03 10:15:52 +08:00
Cheen Hau, 俊豪	ab40607b87	Enable unit test workflow on Arc (#9213 ) * Add gpu workflow and a transformers API inference test * Set device-specific env variables in script instead of workflow * Fix status message --------- Co-authored-by: sgwhat <ge.song@intel.com>	2023-10-25 15:17:18 +08:00

Author

SHA1

Message

Date

SONG Ge

dfb00e37e9

[LLM] Add model correctness test on ARC for llama and falcon (#9347 )

* add correctness test on arc for llama model

* modify layer name

* add falcon ut

* refactor and add ut for falcon model

* modify lambda positions and update docs

* replace loading pre input with last decodelayer output

* switch lower bound to single model instead of using the common one

* make the code implementation simple

* fix gpu action allocation memory issue

2023-11-10 13:48:57 +08:00

Cheen Hau, 俊豪

8f23fb04dc

Add inference test for Whisper model on Arc (#9330 )

* Add inference test for Whisper model

* Remove unnecessary inference time measurement

2023-11-03 10:15:52 +08:00

Cheen Hau, 俊豪

ab40607b87

Enable unit test workflow on Arc (#9213 )

* Add gpu workflow and a transformers API inference test

* Set device-specific env variables in script instead of workflow

* Fix status message

---------

Co-authored-by: sgwhat <ge.song@intel.com>

2023-10-25 15:17:18 +08:00

3 commits