* first commit * update example * fix style * update example * embedding as const * fix generate * code refactor * meet code review * fix style * change max_output_len to max_context_len * fix all-in-one * fix example * add check for new tokens
* fix * fix * fix * fix stype * fix style * fix style
* add initial support for minicpm-llama-v2.5 * update impl * add minicpm-llama3-v2.5 example