我對一份工作有困難(父)觸發多個並行運行的另一份工作(孩子)批次(例如10並行運行每批)。
偶爾的一些並行的“孩子”工作將崩潰幾分鍾,期間或之後立即集群初始化。墜毀運行終止與一個“取消”的結果狀態。
看似相關摘錄log4j的輸出:
引起的:java.sql。SQLNonTransientConnectionException:太多的連接在org.mariadb.jdbc.internal.util.exceptions.ExceptionMapper.get (ExceptionMapper.java: 175) org.mariadb.jdbc.internal.util.exceptions.ExceptionMapper.getException (ExceptionMapper.java: 110) org.mariadb.jdbc.internal.protocol.AbstractConnectProtocol.connectWithoutProxy (AbstractConnectProtocol.java: 1107) org.mariadb.jdbc.internal.util.Utils.retrieveProxy (Utils.java: 502) org.mariadb.jdbc.MariaDbConnection.newConnection (MariaDbConnection.java: 155) org.mariadb.jdbc.Driver.connect (Driver.java: 86) java.sql.DriverManager.getConnection (DriverManager.java: 664) java.sql.DriverManager.getConnection (DriverManager.java: 208) com.jolbox.bonecp.BoneCP.obtainRawInternalConnection (BoneCP.java: 361) com.jolbox.bonecp.BoneCP。< init > (BoneCP.java: 416)…116多所造成的:java.sql。SQLException異常:太多的連接在org.mariadb.jdbc.internal.protocol.AbstractConnectProtocol.authentication (AbstractConnectProtocol.java: 856) org.mariadb.jdbc.internal.protocol.AbstractConnectProtocol.handleConnectionPhases (AbstractConnectProtocol.java: 777) org.mariadb.jdbc.internal.protocol.AbstractConnectProtocol.connect (AbstractConnectProtocol.java: 451) org.mariadb.jdbc.internal.protocol.AbstractConnectProtocol.connectWithoutProxy (AbstractConnectProtocol.java: 1103)…123多22/01/14 21:24:42警告PythonDriverWrapper: setupRepl: replid - 409 - cf - 88936 - 53 - fe7 8:最後,狀態是錯誤(replid - 409 - cf - 88936 - 53 - fe7 - 8, org.apache.spark.sql。AnalysisException: . lang。RuntimeException: . lang。RuntimeException:無法實例化org.apache.hadoop.hive.metastore.HiveMetaStoreClient) 22/01/14 21:24:42信息DriverCorral美元:清洗包裝replid - 409 - cf - 88936 - 53 - fe7 - 8(目前狀態停止(replid - 409 - cf - 88936 - 53 - fe7 - 8)) 22/01/14 21:24:42信息DriverCorral美元:發送關閉信號REPL replid - 409 - cf - 88936 - 53 - fe7 - 8 22/01/14 21:24:42警告PythonDriverWrapper: REPL replid - 409 - cf - 88936 - 53 - fe7 - 8已經關閉:停止(replid - 409 - cf - 88936 - 53 - fe7 - 8) 22/01/14 21:24:42信息DriverCorral美元:發送中斷信號REPL replid - 409 - cf - 88936 - 53 - fe7 - 8 22/01/14 21:24:42信息DriverCorral美元:localThread停止等待REPL replid - 409 - cf - 88936 - 53 - fe7 - 8 22/01/14 21:24:42信息DriverCorral美元:replid - 409 - cf - 88936 - 53 - fe7 - 8成功地丟棄
滿log4j-active輸出連接。