PM861 communication lost
we have 30 redundant PM861 on our 30 well heads and are facing below issue on regular basis.
Controller is healthy and suddenly communication drops (controller becomes inaccessible on ABB OPC utility)
Scenario # 1 – Control builder unable to go online and HMI communication lost, but ping to controller is available. I/O scanning and logic execution is normal. Controller has to be downloaded again to make communication healthy. Log attached.
Scenario # 2 – Controller becomes inaccessible at all, no ping, no I/O scanning, no logic execution, no download availability. Controller has to give power cycle to make it available for download.
Thanks & Regards,
Rizwan ul Hassan.
Voted best answer
I recommend that you check the Ethernet connections.
Your comment about observing storm warning concur with the controller log(s). There are communication errors with peer controllers and hardware warnings from both Ethernet ports (0.1 and 0.2).
The AC 800M is designed to shut down if subjected to too high load, e.g. due to incoming Ethernet telegrams. The Network Storm Protection (NSP) might not be fast enough to save the day.
The close (in time) messages from primary and secondary Ethernet ports may indicate an unfavorable installation or wiring. ABB recommend having primary and secondary Ethernet connections fully separated (do not use VLAN or other muxing equipment resulting in a blend of the traffic in some common media/fiber/radio/etc).
If you need further assistance, please contact your regional ABB support or sales representative.
E 1979-12-31 00:00:07.280 RNRP Config error: Socket send queue full to 172.17.80.177. Short sendPeriod?
E 1979-12-31 00:00:07.766 RNRP Config error: Socket send queue full to 172.16.80.165. Short sendPeriod?
E 1979-12-31 00:00:06.770 RNRP Config error: Socket send queue full to 172.16.80.170. Short sendPeriod?
E 1979-12-31 00:00:07.266 RNRP Config error: Socket send queue full to 172.17.80.178. Short sendPeriod?
W 2017-09-09 01:01:13.045 On Unit= 0.1 HWError Contro~_RTU_12 0000 See HWTree Error 16#40020000 16#00000000
W 2017-09-09 01:01:13.046 On Unit= 0.2 HWError Contro~_RTU_12 0000 See HWTree Error 16#40020000 16#00000000