* Update 8192 prompt in all-in-one * Add cpu_embedding param for linux api * Update run.py * Update README.md