Commit graph

6 commits

Author SHA1 Message Date
Yuwen Hu
e290660b20 [LLM] Add so shared library for Bloom family models (#8258)
* Add so file downloading for bloom family models

* Supports selecting of avx2/avx512 so for bloom
2023-06-02 17:39:40 +08:00
Yina Chen
91a1528fce [LLM]Support for linux package (llama, NeoX) & quantize (llama) (#8246)
* temp

* update

* update

* remove cmake

* runtime get platform  ->  change platform name using sed

* update

* update

* add platform flags(default: current platform) & delete legacy libs & add neox quantize
2023-06-02 13:51:35 +08:00
Ruonan Wang
3fd716d422 LLM: update setup.py to add a missing data(#8240) 2023-06-01 10:25:43 +08:00
Ruonan Wang
c890609d1e LLM: Support package/quantize for llama.cpp/redpajama.cpp on Windows (#8236)
* support windows of llama.cpp

* update quantize

* update version of llama.cp submodule

* add gptneox.dll

* add quantize-gptneox.exe
2023-05-31 14:47:12 +08:00
Ruonan Wang
4638b85f3e [llm] Initial support of package and quantize (#8228)
* first commit of CMakeFiles.txt to include llama & gptneox

* initial support of quantize

* update cmake for only consider linux now

* support quantize interface

* update based on comment
2023-05-26 16:36:46 +08:00
Junwei Deng
ea22416525 LLM: add first round files (#8225) 2023-05-25 11:29:18 +08:00