Hi all - I normally check the site every morning when I get up, but this morning I didn't. When I got to work, I tried to pull up the site, and couldn't get anything. I tried to ping and ssh to the server, and still nothing. So I left work to drive home and check the server (luckily I have the flexibility to work from home when I need to), and it was simply hung, so I bounced the server.
After that, everything is working fine again. This isn't the first time that the server has flat out hung. The last time was when I restarted the mysqld service.
So, I obviously don't like the fact that this new server is even the slightest bit unreliable. I am looking into why the server is haning, and will post any details if I find any.
The server that the site was on before was slackware, and I've been told that it is more reliable than RH9. I'm sure that this is debatable, but I know that the site was never down due to server/OS issues when it was on the old server.
That said, I have three physical servers for the ZUG. All three servers are Dell Poweredge 350's, PIII 800mhz with 1gb of RAM each.
(1) zaurususergroup.com site
(2) downloads.zaurususergroup.com site
(3) spare server - I was planning on making this server an exact duplicate of the zaurususergroup.com site, with the site and db being rsync'd on a regular basis in case something happens with the primary server.
I am now going to build out the spare server (3) with slackware, and get the site ready to move to this server. I will then rebuild the existing zaurususergroup.com site server (1) to be the secondary mirror server.
I do appologize for the outage, and I will do everything within my means to make the site as reliable as possible.