Song Jiaming
c06f1ca93e
[LLM] auto perf test to output to csv ( #8846 )
2023-09-01 10:48:00 +08:00
Song Jiaming
b8b1b6888b
[LLM] Performance test ( #8796 )
2023-08-25 14:31:45 +08:00
Ruonan Wang
e9aa2bd890
LLM: reduce GPU 1st token latency and update example ( #8763 )
...
* reduce 1st token latency
* update example
* fix
* fix style
* update readme of gpu benchmark
2023-08-16 18:01:23 +08:00
Song Jiaming
c1f9af6d97
[LLM] chatglm example and transformers low-bit examples ( #8751 )
2023-08-16 11:41:44 +08:00
Ruonan Wang
8805186f2f
LLM: add benchmark tool for gpu ( #8760 )
...
* add benchmark tool for gpu
* update
2023-08-16 11:22:10 +08:00
Song Jiaming
e717e304a6
LLM first example test and template ( #8658 )
2023-08-10 10:03:11 +08:00
Ruonan Wang
64b38e1dc8
llm: benchmark tool for transformers int4 (separate 1st token and rest) ( #8460 )
...
* add benchmark utils
* fix
* fix bug and add readme
* hidden latency data
2023-07-06 09:49:52 +08:00
Junwei Deng
2fd751de7a
LLM: add a dev tool for getting glibc/glibcxx requirement ( #8399 )
...
* add a dev tool
* pep8 change
2023-06-30 11:09:50 +08:00
Shengsheng Huang
02c583144c
[LLM] langchain integrations and examples ( #8256 )
...
* langchain intergrations and examples
* add licences and rename
* add licences
* fix license issues and change backbone to model_family
* update examples to use model_family param
* fix linting
* fix code style
* exclude langchain integration from stylecheck
* update langchain examples and update integrations based on latets changes
* update simple llama-cpp-python style API example
* remove bloom in README
* change default n_threads to 2 and remove redundant code
---------
Co-authored-by: leonardozcm <changmin.zhao@intel.com>
2023-06-12 19:22:07 +08:00
Pingchuan Ma (Henry)
773255e009
[LLM] Add dev wheel building and basic UT script for LLM package on Linux ( #8264 )
...
* add wheel build for linux
* test fix
* test self-hosted runner
* test fix
* update runner
* update runner
* update fix
* init cicd
* init cicd
* test conda
* update fix
* update no need manual python deps
* test fix bugs
* test fix bugs
* test fix bugs
* fix bugs
2023-06-08 00:49:57 +08:00
Pingchuan Ma (Henry)
2ed5842448
[LLM] add convert's python deps for LLM ( #8260 )
...
* add python deps for LLM
* update release.sh
* change deps group name
* update all
* fix update
* test fix
* update
2023-06-06 16:01:17 +08:00
Pingchuan Ma (Henry)
c48d5f7cff
[LLM] Enable UT workflow logics for LLM ( #8243 )
...
* check push connection
* enable UT workflow logics for LLM
* test fix
* add licenses
* test fix according to suggestions
* test fix
* update changes
2023-06-02 17:06:35 +08:00
Pingchuan Ma (Henry)
141febec1f
Add dev wheel building script for LLM package on Windows ( #8238 )
...
* Add dev wheel building script for LLM package on Windows
* delete conda
* delete python version check
* minor adjust
* wheel name fixed
* test check
* test fix
* change wheel name
2023-06-01 11:55:26 +08:00
binbin Deng
8421af51ae
LLM: support converting to ggml format ( #8235 )
...
* add convert
* fix
* fix
* fix
* try
* test
* update check
* fix
* fix
2023-05-31 15:20:06 +08:00
Pingchuan Ma (Henry)
1f913a6941
[LLM] Add LLM pep8 coding style checking ( #8233 )
...
* add LLM pep8 coding checking
* resolve bugs in testing scripts and code style revision
2023-05-30 15:58:14 +08:00