ignite經(jīng)常每隔數(shù)小時(shí)就掛掉,手動(dòng)重啟也經(jīng)常出現(xiàn)無法啟動(dòng)成功
手動(dòng)重啟報(bào)錯(cuò)日志:
Caused by: class org.apache.ignite.spi.IgniteSpiException: Node with the same ID was found in node IDs history or existing node in topology has the same ID
(fix configuration and restart local node)
運(yùn)行數(shù)小時(shí)后失敗日志:
[06:15:36,458][WARNING][tcp-disco-ip-finder-cleaner-#33-#92][TcpDiscoverySpi] Timed out waiting for message to be read (most probably, the reason is long GC pauses on remote node) [curTimeout=9998, rmtAddr=/10.42.2.181:47500, rmtPort=47500]
[06:15:36,458][WARNING][tcp-disco-ip-finder-cleaner-#33-#92][TcpDiscoverySpi] Failed to ping node [nodeId=null]. Reached the timeout 10000ms. Cause: Failed to deserialize object with given class loader: sun.misc.Launcher$AppClassLoader@75b84c92
[06:16:28,666][WARNING][tcp-disco-msg-worker-[1483ea3b 10.42.5.149:47500 crd]-#2-#60][TcpDiscoverySpi] Node is out of topology (probably, due to short-time network problems).
初步判斷,ignite中數(shù)據(jù)量暴增后出現(xiàn)網(wǎng)絡(luò)問題,增加超時(shí)時(shí)間
在ignite的配置文件中,在IgniteConfiguration bean,中,添加以下參數(shù):
<property name="failureDetectionTimeout" value="30000"/>
<property name="clientFailureDetectionTimeout" value="50000"/>