ipex-llm/python/llm
Yuwen Hu c1d07bc626
Support streaming for lookup generation (#11922)
* Support streaming for lookup generation

* Small update

* Style fixes

* Add origin generate full back for batch inference and beam search; support input length threshold judgement for directly input with input_ids

* Fix lookup stream generate with eos token

* Small fixes

* Small fix

* index fix

* Small fix
2024-08-26 19:33:31 +08:00
..
dev All-in-one benchmark update regarding performance mode for input length threshold (#11920) 2024-08-26 18:52:13 +08:00
example Add troubleshooting about transpose value setting 2024-08-26 16:06:32 +08:00
portable-zip Fix null pointer dereferences error. (#11125) 2024-05-30 16:16:10 +08:00
scripts fix typo in python/llm/scripts/README.md (#11536) 2024-07-09 09:53:14 +08:00
src/ipex_llm Support streaming for lookup generation (#11922) 2024-08-26 19:33:31 +08:00
test update mlp of llama (#11897) 2024-08-22 20:34:53 +08:00
tpp OSPDT: add tpp licenses (#11165) 2024-06-06 10:59:06 +08:00
.gitignore [LLM] add chatglm pybinding binary file release (#8677) 2023-08-04 11:45:27 +08:00
setup.py Update pypi tag to 2.2.0.dev0 (#11895) 2024-08-22 16:48:09 +08:00
version.txt Update pypi tag to 2.2.0.dev0 (#11895) 2024-08-22 16:48:09 +08:00