ipex-llm

History

Yuwen Hu c6d4f91777 [LLM] Add UTs of load_low_bit for transformers-style API (#10001 ) * Add uts for transformers api load_low_bit generation * Small fixes * Remove replit-code for CPU tests due to current load_low_bit issue on MPT * Small change * Small reorganization to llm unit tests on CPU * Small fixes		2024-01-29 10:18:23 +08:00
..
benchmark	Update perf test all-in-one config for batch_size arg (#10012 )	2024-01-26 16:46:36 +08:00
convert	LLM: Adapt transformers models for `optimize model` SL (#9022 )	2023-10-09 11:13:44 +08:00
inference	[LLM] Add UTs of load_low_bit for transformers-style API (#10001 )	2024-01-29 10:18:23 +08:00
inference_gpu	[LLM] Add UTs of load_low_bit for transformers-style API (#10001 )	2024-01-29 10:18:23 +08:00
install	[LLM] Refactor LLM Linux tests (#8349 )	2023-06-16 15:22:48 +08:00
langchain	[LLM] Unify Langchain Native and Transformers LLM API (#8752 )	2023-08-25 11:14:21 +08:00
win	[LLM] Remove old windows nightly test code (#8668 )	2023-08-03 17:12:23 +09:00
__init__.py	[LLM] Enable UT workflow logics for LLM (#8243 )	2023-06-02 17:06:35 +08:00
run-llm-convert-tests.sh	[LLM] Change default runner for LLM Linux tests to the ones with AVX512 (#8448 )	2023-07-04 14:53:03 +08:00
run-llm-example-tests-gpu.sh	LLM: reorganize GPU finetuning examples (#9952 )	2024-01-25 19:02:38 +08:00
run-llm-inference-tests-gpu.sh	LLM: separate arc ut for disable XMX (#9953 )	2024-01-23 19:04:47 +08:00
run-llm-inference-tests.sh	[LLM] Add UTs of load_low_bit for transformers-style API (#10001 )	2024-01-29 10:18:23 +08:00
run-llm-install-tests.sh	[LLM] Refactor LLM Linux tests (#8349 )	2023-06-16 15:22:48 +08:00
run-llm-langchain-tests.sh	[LLM] langchain bloom, UT's, default parameters (#8357 )	2023-06-25 17:38:00 +08:00
run-llm-windows-tests.sh	LLM: fix langchain windows failure (#8417 )	2023-06-30 09:59:10 +08:00