Advertisement
If you have a new account but are having problems posting or verifying your account, please email us on hello@boards.ie for help. Thanks :)
Hello all! Please ensure that you are posting a new thread or question in the appropriate forum. The Feedback forum is overwhelmed with questions that are having to be moved elsewhere. If you need help to verify your account contact hello@boards.ie

Best practices in web server uptime monitoring

Options
  • 16-01-2012 11:04pm
    #1
    Registered Users Posts: 16,413 ✭✭✭✭


    I was wondering what you folks use or see as best practices when monitoring your sites, or clients sites, for uptime? Or more importantly, downtime :)


Comments

  • Closed Accounts Posts: 1 xaban


    I've been using www.website-monitoring.com, which I have chosen after reading this article: "15 website monitoring services compared".

    Cheers,


  • Registered Users Posts: 3,140 ✭✭✭ocallagh


    Pingdom are good - free SMS/Emails etc. I can tell it works as I had my first server go down on me last week.. got the texts from pingdom within about 20 seconds of it going down.

    edit: Best practices:

    3rd Party SMS/Email monitoring like above (check different ports, response times etc)
    Cron which checks Apache+MySql are up and txtx/emails me with issues
    Cron which checks a few DB timestamps to ensure the app is still ticking along


  • Registered Users Posts: 2,781 ✭✭✭amen


    don't forget though that if you are in a clustered environment you also need to monitor each of the nodes in the cluster, the response time of each node,

    Also random sample log files and and have a history of response/access time on parts of the application/files.

    Handy to see if something has changed (maybe a page displaying historical data and the query is no longer efficient that sort of thing)

    If there are logged in users envolved also track response times vs number of simultaneous users


  • Registered Users Posts: 16,413 ✭✭✭✭Trojan


    Are you guys just checking HTTP is up, calling a static HTML page, calling a script that does other tests?

    What about false positives? Some of the services I've seen are prone to false positives due to lack of multiple sites.


Advertisement