* first commit * update example * fix style * update example * embedding as const * fix generate * code refactor * meet code review * fix style * change max_output_len to max_context_len * fix all-in-one * fix example * add check for new tokens |
||
|---|---|---|
| .. | ||
| cli | ||
| ggml | ||
| gptq | ||
| langchain | ||
| llamaindex | ||
| serving | ||
| transformers | ||
| utils | ||
| vllm | ||
| __init__.py | ||
| convert_model.py | ||
| format.sh | ||
| llm_patching.py | ||
| models.py | ||
| optimize.py | ||