Doc: add yarn cluster running results (#7562)
* feat(yarn.md): add yarn cluster run results * fix(yarn.md): rm extra | * docs(yarn.md): perf description * docs(yarn.md): perf * feat(yarn.md): how to see logs * fix(yarn.md): perf: time * perf(yarn.md): rm which python * fix(yarn.md): hide internal info * docs(yarn.md): perf * fix(yarn.md): url
This commit is contained in:
parent
213cfd985b
commit
479042a93a
1 changed files with 23 additions and 3 deletions
|
|
@ -43,7 +43,27 @@ The difference between yarn-client mode and yarn-cluster mode is where you run y
|
||||||
|
|
||||||
For yarn-client, the Spark driver runs in the client process, and the application master is only used for requesting resources from YARN, while for yarn-cluster the Spark driver runs inside an application master process which is managed by YARN in the cluster.
|
For yarn-client, the Spark driver runs in the client process, and the application master is only used for requesting resources from YARN, while for yarn-cluster the Spark driver runs inside an application master process which is managed by YARN in the cluster.
|
||||||
|
|
||||||
For more details, please see [Launching Spark on YARN](https://spark.apache.org/docs/latest/running-on-yarn.html#launching-spark-on-yarn).
|
Please see more details in [Launching Spark on YARN](https://spark.apache.org/docs/latest/running-on-yarn.html#launching-spark-on-yarn).
|
||||||
|
|
||||||
|
For **yarn-client** mode, you can directly find the driver logs in the console.
|
||||||
|
|
||||||
|
For **yarn-cluster** mode, an `application_time_id` will be returned (`application_1668477395550_1045` in the following log) when the application master process is completed.
|
||||||
|
|
||||||
|
```bash
|
||||||
|
23/02/15 15:30:26 INFO yarn.Client: Application report for application_1668477395550_1045 (state: FINISHED)
|
||||||
|
23/02/15 15:30:26 INFO yarn.Client:
|
||||||
|
client token: N/A
|
||||||
|
diagnostics: N/A
|
||||||
|
ApplicationMaster host: ...
|
||||||
|
ApplicationMaster RPC port: 46652
|
||||||
|
queue: ...
|
||||||
|
start time: 1676446090408
|
||||||
|
final status: SUCCEEDED
|
||||||
|
tracking URL: http://.../application_1668477395550_1045/
|
||||||
|
user: ...
|
||||||
|
```
|
||||||
|
|
||||||
|
Visit the tracking URL and then click `logs` in the table `ApplicationMaster` to see the driver logs.
|
||||||
|
|
||||||
### 1.3 Distributed storage on YARN
|
### 1.3 Distributed storage on YARN
|
||||||
__Note__:
|
__Note__:
|
||||||
|
|
@ -255,7 +275,7 @@ bigdl-submit \
|
||||||
--driver-memory 2g \
|
--driver-memory 2g \
|
||||||
--py-files model.py \
|
--py-files model.py \
|
||||||
--archives /path/to/environment.tar.gz#environment \
|
--archives /path/to/environment.tar.gz#environment \
|
||||||
--conf spark.pyspark.driver.python=/path/to/python \
|
--conf spark.pyspark.driver.python=python \
|
||||||
--conf spark.pyspark.python=environment/bin/python \
|
--conf spark.pyspark.python=environment/bin/python \
|
||||||
train.py --cluster_mode bigdl-submit --data_dir hdfs://path/to/remote/data
|
train.py --cluster_mode bigdl-submit --data_dir hdfs://path/to/remote/data
|
||||||
```
|
```
|
||||||
|
|
@ -345,7 +365,7 @@ ${SPARK_HOME}/bin/spark-submit \
|
||||||
--driver-memory 2g \
|
--driver-memory 2g \
|
||||||
--archives /path/to/environment.tar.gz#environment \
|
--archives /path/to/environment.tar.gz#environment \
|
||||||
--properties-file ${BIGDL_HOME}/conf/spark-bigdl.conf \
|
--properties-file ${BIGDL_HOME}/conf/spark-bigdl.conf \
|
||||||
--conf spark.pyspark.driver.python=/path/to/python \
|
--conf spark.pyspark.driver.python=python \
|
||||||
--conf spark.pyspark.python=environment/bin/python \
|
--conf spark.pyspark.python=environment/bin/python \
|
||||||
--py-files ${BIGDL_HOME}/python/bigdl-spark_${SPARK_VERSION}-${BIGDL_VERSION}-python-api.zip,model.py \
|
--py-files ${BIGDL_HOME}/python/bigdl-spark_${SPARK_VERSION}-${BIGDL_VERSION}-python-api.zip,model.py \
|
||||||
--jars ${BIGDL_HOME}/jars/bigdl-assembly-spark_${SPARK_VERSION}-${BIGDL_VERSION}-jar-with-dependencies.jar \
|
--jars ${BIGDL_HOME}/jars/bigdl-assembly-spark_${SPARK_VERSION}-${BIGDL_VERSION}-jar-with-dependencies.jar \
|
||||||
|
|
|
||||||
Loading…
Reference in a new issue