Commit graph

1523 commits

Author SHA1 Message Date
Junwei Deng
2d14e593f0 LLM: Support generate(max_new_tokens=...), tokenize and decode for transformers-like API (#8283)
* first push

* fix pep8
2023-06-07 11:50:35 +08:00
Yina Chen
11cd2a07e0 [LLM] llm transformers format interface first part (#8276)
* llm-transformers-format

* update

* fix style
2023-06-06 17:17:37 +08:00
Pingchuan Ma (Henry)
a3f353b939 [LLM] add long time loading disclaimer for LLM model converting (#8279) 2023-06-06 17:15:13 +08:00
Yuwen Hu
64bc123dd3 [LLM] Add transformers-like API from_pretrained (#8271)
* Init commit for bigdl.llm.transformers.AutoModelForCausalLM

* Temp change to avoid name conflicts with external transformers lib

* Support downloading model from huggingface

* Small python style fix

* Change location of transformers to avoid library conflicts

* Add return value for converted ggml binary ckpt path for convert_model

* Avoid repeated loading of shared library and adding some comments

* Small fix

* Path type fix and docstring fix

* Small fix

* Small fix

* Change cache dir to pwd
2023-06-06 17:04:16 +08:00
Pingchuan Ma (Henry)
2ed5842448 [LLM] add convert's python deps for LLM (#8260)
* add python deps for LLM

* update release.sh

* change deps group name

* update all

* fix update

* test fix

* update
2023-06-06 16:01:17 +08:00
xingyuan li
38be471140 [LLM] convert_model bug fix (#8274)
* Renamed all bloomz to bloom in ggml/model & utils/convert_util.py
* Add an optional parameter for specifying the model conversion path to avoid running out of disk space
2023-06-06 15:16:42 +08:00
Ruonan Wang
8bd2992a8d LLM: accelerate sample of gptneox and update quantize (#8262)
* update quantize & accelerate sample

* fix style check

* fix style error
2023-06-05 15:36:00 +08:00
Jun Wang
2bc0e7abbb [llm] Add convert_model api (#8244)
* add convert_model api

* change the model_path to input_path

* map int4 to q4_0

* fix blank line

* change bloomz to bloom

* remove default model_family

* change dtype to lower first
2023-06-03 10:18:29 +08:00
Yuwen Hu
e290660b20 [LLM] Add so shared library for Bloom family models (#8258)
* Add so file downloading for bloom family models

* Supports selecting of avx2/avx512 so for bloom
2023-06-02 17:39:40 +08:00
Pingchuan Ma (Henry)
c48d5f7cff [LLM] Enable UT workflow logics for LLM (#8243)
* check push connection

* enable UT workflow logics for LLM

* test fix

* add licenses

* test fix according to suggestions

* test fix

* update changes
2023-06-02 17:06:35 +08:00
Yina Chen
657ea0ee50 [LLM] Fix linux load libs for NeoX and llama (#8257)
* init

* add license

* fix style
2023-06-02 17:03:17 +08:00
Yuwen Hu
286b010bf1 [LLM] First push for Bloomz pybinding (#8252)
* Initial commit to move bloom pybinding to bigdl-llm

* Revise path for shared library

* Small fix
2023-06-02 14:41:04 +08:00
Yina Chen
91a1528fce [LLM] Support for linux package (llama, NeoX) & quantize (llama) (#8246)
* temp

* update

* update

* remove cmake

* runtime get platform -> change platform name using sed

* update

* update

* add platform flags(default: current platform) & delete legacy libs & add neox quantize
2023-06-02 13:51:35 +08:00
Junwei Deng
350d31a472 LLM: first push gptneox pybinding (#8234)
* first push gptneox pybinding

* fix

* fix code style and add license

---------

Co-authored-by: binbin <binbin1.deng@intel.com>
2023-06-02 09:28:00 +08:00
binbin Deng
3a9aa23835 LLM: fix and update related license in llama pybinding (#8250) 2023-06-01 17:09:15 +08:00
Pingchuan Ma (Henry)
141febec1f Add dev wheel building script for LLM package on Windows (#8238)
* Add dev wheel building script for LLM package on Windows

* delete conda

* delete python version check

* minor adjust

* wheel name fixed

* test check

* test fix

* change wheel name
2023-06-01 11:55:26 +08:00
binbin Deng
e56f24b424 LLM: first push llama pybinding (#8241)
* first push llama binding

* update dll
2023-06-01 10:59:15 +08:00
Ruonan Wang
3fd716d422 LLM: update setup.py to add missing data (#8240) 2023-06-01 10:25:43 +08:00
binbin Deng
8421af51ae LLM: support converting to ggml format (#8235)
* add convert

* fix

* fix

* fix

* try

* test

* update check

* fix

* fix
2023-05-31 15:20:06 +08:00
Ruonan Wang
c890609d1e LLM: Support package/quantize for llama.cpp/redpajama.cpp on Windows (#8236)
* support windows of llama.cpp

* update quantize

* update version of llama.cpp submodule

* add gptneox.dll

* add quantize-gptneox.exe
2023-05-31 14:47:12 +08:00
Pingchuan Ma (Henry)
1f913a6941 [LLM] Add LLM pep8 coding style checking (#8233)
* add LLM pep8 coding checking

* resolve bugs in testing scripts and code style revision
2023-05-30 15:58:14 +08:00
Ruonan Wang
4638b85f3e [llm] Initial support of package and quantize (#8228)
* first commit of CMakeFiles.txt to include llama & gptneox

* initial support of quantize

* update cmake to only consider linux for now

* support quantize interface

* update based on comment
2023-05-26 16:36:46 +08:00
Junwei Deng
ea22416525 LLM: add first round files (#8225) 2023-05-25 11:29:18 +08:00