Orca: add grpc error to orca known issues (#5309)
* feat: add grpc error to orca known issues
* refactor: update short name and style.
* refactor: refine error explanation
* Update known_issues.md
* Update known_issues.md
parent 41f602fcec
commit 985aec4425
1 changed files with 17 additions and 3 deletions
@@ -12,7 +12,7 @@ To solve this issue, you need to set the path of `libhdfs.so` in Cloudera to the
 3. If you are using `init_orca_context(cluster_mode="yarn-client")`:
 ```
 conf = {"spark.executorEnv.ARROW_LIBHDFS_DIR": "/opt/cloudera/parcels/CDH-5.15.2-1.cdh5.15.2.p0.3/lib64"}
-init_orca_context(cluster_mode="yarn", conf=conf)
+init_orca_context(cluster_mode="yarn-client", conf=conf)
 ```
 If you are using `init_orca_context(cluster_mode="spark-submit")`:
 ```
@@ -24,6 +24,20 @@ To solve this issue, you need to set the path of `libhdfs.so` in Cloudera to the
 --conf spark.yarn.appMasterEnv.ARROW_LIBHDFS_DIR=/opt/cloudera/parcels/CDH-5.15.2-1.cdh5.15.2.p0.3/lib64
 ```
 
+### UnknownError: Could not start gRPC server
+
+This error can occur when running the Orca TF2 Estimator with the Spark backend, possibly because the previous PySpark TensorFlow job was not cleaned up completely. You can retry later, or set the Spark config `spark.python.worker.reuse=false` in your application.
+
+If you are using `init_orca_context(cluster_mode="yarn-client")`:
+```
+conf = {"spark.python.worker.reuse": "false"}
+init_orca_context(cluster_mode="yarn-client", conf=conf)
+```
+If you are using `init_orca_context(cluster_mode="spark-submit")`:
+```
+spark-submit --conf spark.python.worker.reuse=false
+```
+
 ## **Orca Context Issues**
 
 ### **Exception: Failed to read dashbord log: [Errno 2] No such file or directory: '/tmp/ray/.../dashboard.log'**