..
benchmark
Update perf test all-in-one config for batch_size arg ( #10012 )
2024-01-26 16:46:36 +08:00
convert
LLM: Adapt transformers models for optimize model SL ( #9022 )
2023-10-09 11:13:44 +08:00
inference
[LLM] Add UTs of load_low_bit for transformers-style API ( #10001 )
2024-01-29 10:18:23 +08:00
inference_gpu
[LLM] Add UTs of load_low_bit for transformers-style API ( #10001 )
2024-01-29 10:18:23 +08:00
install
[LLM] Refactor LLM Linux tests ( #8349 )
2023-06-16 15:22:48 +08:00
langchain
[LLM] Unify Langchain Native and Transformers LLM API ( #8752 )
2023-08-25 11:14:21 +08:00
win
[LLM] Remove old windows nightly test code ( #8668 )
2023-08-03 17:12:23 +09:00
__init__.py
[LLM] Enable UT workflow logics for LLM ( #8243 )
2023-06-02 17:06:35 +08:00
run-llm-convert-tests.sh
[LLM] Change default runner for LLM Linux tests to the ones with AVX512 ( #8448 )
2023-07-04 14:53:03 +08:00
run-llm-example-tests-gpu.sh
LLM: reorganize GPU finetuning examples ( #9952 )
2024-01-25 19:02:38 +08:00
run-llm-inference-tests-gpu.sh
LLM: separate arc ut for disable XMX ( #9953 )
2024-01-23 19:04:47 +08:00
run-llm-inference-tests.sh
[LLM] Add UTs of load_low_bit for transformers-style API ( #10001 )
2024-01-29 10:18:23 +08:00
run-llm-install-tests.sh
[LLM] Refactor LLM Linux tests ( #8349 )
2023-06-16 15:22:48 +08:00
run-llm-langchain-tests.sh
[LLM] langchain bloom, UT's, default parameters ( #8357 )
2023-06-25 17:38:00 +08:00
run-llm-windows-tests.sh
LLM: fix langchain windows failure ( #8417 )
2023-06-30 09:59:10 +08:00