Commit graph

1511 commits

Author SHA1 Message Date
Yina Chen
91a1528fce [LLM]Support for linux package (llama, NeoX) & quantize (llama) (#8246)
* temp

* update

* update

* remove cmake

* runtime get platform  ->  change platform name using sed

* update

* update

* add platform flags(default: current platform) & delete legacy libs & add neox quantize
2023-06-02 13:51:35 +08:00
Junwei Deng
350d31a472 LLM: first push gptneox pybinding (#8234)
* first push gptneox pybinding

* fix

* fix code style and add license

---------

Co-authored-by: binbin <binbin1.deng@intel.com>
2023-06-02 09:28:00 +08:00
binbin Deng
3a9aa23835 LLM: fix and update related license in llama pybinding (#8250) 2023-06-01 17:09:15 +08:00
Pingchuan Ma (Henry)
141febec1f Add dev wheel building script for LLM package on Windows (#8238)
* Add dev wheel building script for LLM package on Windows

* delete conda

* delete python version check

* minor adjust

* wheel name fixed

* test check

* test fix

* change wheel name
2023-06-01 11:55:26 +08:00
binbin Deng
e56f24b424 LLM: first push llama pybinding (#8241)
* first push llama binding

* update dll
2023-06-01 10:59:15 +08:00
Ruonan Wang
3fd716d422 LLM: update setup.py to add a missing data(#8240) 2023-06-01 10:25:43 +08:00
binbin Deng
8421af51ae LLM: support converting to ggml format (#8235)
* add convert

* fix

* fix

* fix

* try

* test

* update check

* fix

* fix
2023-05-31 15:20:06 +08:00
Ruonan Wang
c890609d1e LLM: Support package/quantize for llama.cpp/redpajama.cpp on Windows (#8236)
* support windows of llama.cpp

* update quantize

* update version of llama.cp submodule

* add gptneox.dll

* add quantize-gptneox.exe
2023-05-31 14:47:12 +08:00
Pingchuan Ma (Henry)
1f913a6941 [LLM] Add LLM pep8 coding style checking (#8233)
* add LLM pep8 coding checking

* resolve bugs in testing scripts and code style revision
2023-05-30 15:58:14 +08:00
Ruonan Wang
4638b85f3e [llm] Initial support of package and quantize (#8228)
* first commit of CMakeFiles.txt to include llama & gptneox

* initial support of quantize

* update cmake for only consider linux now

* support quantize interface

* update based on comment
2023-05-26 16:36:46 +08:00
Junwei Deng
ea22416525 LLM: add first round files (#8225) 2023-05-25 11:29:18 +08:00