RNRP Routers resundancy network error Causes IAC lost
Hi, I had a strange problem concerning the RNRP we have 3 pairs of redundant RNRP Routers. (Attached in details)
1st Pair (2 RNRP Routers for Control Network)
2nd Pair (2 RNRP Routers For Safety Control Network)
3rd Pair (2 RNRP Routers for Gateway Network)
If we we have the 1st Pair powered off, we have IAC communication running between Control Network and Safety network without any problems.
Once we disconnect the Primary cable from Control Network from one of the 2 Gateway routers then we lose IAC communication between Safety and control network (We suppose to have redundant control network cable in the router and another router full redundant).
This case happens only when we disconnect the primary control cable from the 1st router only, if we disconnect any other cable nothing happens even if we disconnect the power for this router the IAC is working normally!.
Any explanation ?
Thanks
Waleed Yehia
1st Pair (2 RNRP Routers for Control Network)
2nd Pair (2 RNRP Routers For Safety Control Network)
3rd Pair (2 RNRP Routers for Gateway Network)
If we we have the 1st Pair powered off, we have IAC communication running between Control Network and Safety network without any problems.
Once we disconnect the Primary cable from Control Network from one of the 2 Gateway routers then we lose IAC communication between Safety and control network (We suppose to have redundant control network cable in the router and another router full redundant).
This case happens only when we disconnect the primary control cable from the 1st router only, if we disconnect any other cable nothing happens even if we disconnect the power for this router the IAC is working normally!.
Any explanation ?
Thanks
Waleed Yehia
Answers
Please provide a better network map.
Also, attach the C:\ProgramData\rnrpevent.log log file from a computer connected to the network. Describe what actions you took and at what time (so that I can correlate with what RNRP logged).
Also, attach the C:\ProgramData\rnrpevent.log log file from a computer connected to the network. Describe what actions you took and at what time (so that I can correlate with what RNRP logged).
Hi, Kindly find attached the configuration for the routers, Events and network topology
We make the same test today and it works as it should ! but if you can support me with explanation for this case as we have to report a reason for it.
Date and time of tests was 14-May around 13:00
Thanks for your support in advance
We make the same test today and it works as it should ! but if you can support me with explanation for this case as we have to report a reason for it.
Date and time of tests was 14-May around 13:00
Thanks for your support in advance
I've downloaded the files and attempted to follow your tests made at 1300 hours on the 14th.
In the attached Excel-file I have isolated all recorded "router" events (R141, R142, R143, R144, R145 and R146).
As per your description, having R141 and R142 powered down, then removing Path 0 Area 20 from R145 triggers the problem?
I have not been able to find that particular sequence in the log file? See my coloring of R145 Area 20 Path 0 events.
Maybe I failed because the RNRP events were recorded at 172.16.8.84 that need R141 or R142, or a combination of R143/R144+R145/R146 to track what takes place on Area 20?
Can you fill in some more comments, when the IAC communication was lost, etc?
Also, I suggest that you connect a regular PC with RNRP service "as a status monitor" to Area 20 and/or Area 21 while testing this; when IAC breaks down, use the RNRP Utility's function Get Network Map From One Node to query the two disconnected IAC peers for their network status. The RNRP Network Status tool should also provide what the measuring PC experiences (it should reach the nodes on the other side of R145 or R146 for IAC to be operational).
PS. The firewall seem to have "redundant" rules in some routers.
In the attached Excel-file I have isolated all recorded "router" events (R141, R142, R143, R144, R145 and R146).
As per your description, having R141 and R142 powered down, then removing Path 0 Area 20 from R145 triggers the problem?
I have not been able to find that particular sequence in the log file? See my coloring of R145 Area 20 Path 0 events.
Maybe I failed because the RNRP events were recorded at 172.16.8.84 that need R141 or R142, or a combination of R143/R144+R145/R146 to track what takes place on Area 20?
Can you fill in some more comments, when the IAC communication was lost, etc?
Also, I suggest that you connect a regular PC with RNRP service "as a status monitor" to Area 20 and/or Area 21 while testing this; when IAC breaks down, use the RNRP Utility's function Get Network Map From One Node to query the two disconnected IAC peers for their network status. The RNRP Network Status tool should also provide what the measuring PC experiences (it should reach the nodes on the other side of R145 or R146 for IAC to be operational).
PS. The firewall seem to have "redundant" rules in some routers.
filter allow src 172.16.0.0/15 dst 172.16.0.0/15 proto icmp
filter allow src 172.16.0.0/15 dst 172.16.0.0/15 proto udp
filter allow src 172.16.0.0/15 dst 172.16.0.0/15 proto tcp
filter allow src 172.16.0.0/22 dst 172.16.0.0/22 proto icmp
filter allow src 172.16.0.0/22 dst 172.16.0.0/22 proto tcp
filter allow src 172.16.0.0/22 dst 172.16.0.0/22 proto udp
Add new comment