| Author |
Message |
kian_hong2000
Joined: 08 Nov 2006
Posts: 8
|
Posted:
Sat Aug 25, 2007 2:35 am Post subject:
Early discover for faulty Switches |
|
|
Normally we will get to the location to check if something went wrong through the feedback from user that their LAN ports is not working. Is there any ways to know in advance that this switches need to be replaced instead of waiting for users to complaint?
Sometimes the switches is working and nothing seemed to be wrong. I vefiy it using "Show post" command. However, within 3 days this switches is failure and need to be replaced. Thanks! |
|
| Back to top |
|
 |
bitgod
Joined: 13 Nov 2007
Posts: 9
|
Posted:
Tue Nov 13, 2007 5:56 am Post subject:
|
|
|
| Thats tricky because there are so many ways a switch can fail. You best best is using a network monitoring tool. There are lot of open source options out there like Zenoss, Cacti, OpenNMS... just to name a small fraction of them. They use SNMP to check on health status and can be configured to alert based on anomalies in traffic, errors, or unresponsiveness. Of coarse this is really more reactive than proactive, its still good to get the alert from a monitoring tool before getting surprised by an angry end user. |
|
| Back to top |
|
 |
kian_hong2000
Joined: 08 Nov 2006
Posts: 8
|
Posted:
Sat Nov 17, 2007 2:23 am Post subject:
|
|
|
| However, most of the monitoring tools gives alarm only when that switches is down which is too late to discover and plan in advance. |
|
| Back to top |
|
 |
bitgod
Joined: 13 Nov 2007
Posts: 9
|
Posted:
Mon Nov 26, 2007 4:19 am Post subject:
|
|
|
yep.. predicting (rather than detecting) the failure of a switch or a specific port is difficult to do if its not a feature already built into the switch by the manufacturer. If you find a reliable one, let me know! As far as your third party options go, you can setup threshold alerting with most network monitoring tools to get a warning alert based on a certain number of errors, etc. Do some research on what kind of snmp data you can gather form your switch. It *may* have something already built in to alert you via a snmp trap.
With todays tools the best way to truly ensure no one ever goes down is a solid redundant infrastructure. Obviously providing redundancy all the way down through the access layer to the desktop can be extraordinarily expensive. Very rarely will you see a company willing to spend the kind of money it takes to get all the way down to redundant network cards in a computer which are then connected to two separate switches.
The realistic solution for this issue is developing a proper service level agreement with your customers or end users. If your systems alert you of a down switch within 5 or 10 minutes of it dying, you should have the appropriate tools to re-kick a new switch based on a backup configuration of the old one that is now nonoperational for whatever reason. If you have the right resources, including personnel, you may have the switch replaced and back online in 30 minutes or less.
A mere 30 or less minutes of downtime for the average end user generally sounds better to the finance department... You can always build a highly redundant environment out to your more important users. |
|
| Back to top |
|
 |
|
|
|
|