* add langchain rag on gpu
* add rag example in readme
* add trust_remote_code in TransformersEmbeddings.from_model_id
* add trust_remote_code in TransformersEmbeddings.from_model_id in cpu
* remove typing extension downgrade in readme; minor fixes of code
* fix typos in README
* change default question of docqa.py