instability

My server, which runs Planet Lisp, xach.com, etc, has been flaking out with increasing frequency lately.

Here's the setup:

  • Relion 1XT 1U Pentium 4 3.0Ghz from Penguin Computing
  • 2GB ECC memory
  • Two 80GB SATA drives in md-raid 1 mounted on /
  • Fedora Core 4, kernel 2.6.13-1.1532_FC4

It's gotten to the point where it is locking up every few days. I can't even compile a kernel; it either segfaults or I get this in random include files:

error: static or type qualifiers in non-parameter array declarator

Everything screams "hardware problem". I'm extremely bummed about it. I didn't save any of the material to ship the unit back, if that proves necessary, and I don't really want to have weeks of downtime waiting for some resolution. Anyone have any comments or suggestions?

UPDATE The server will be going down for overnight maintenance today. Planet Lisp should be back up sometime on Thursday.

Comments

(Anonymous)

Re: Memory or disk?

Since you run ECC RAM it's probably not the RAM, maybe the mainboard?
Not so. If the RAM is faulty, it won't correct properly. I've seen failures in ECC with the memory tester I recommended below.

September 2014

S M T W T F S
 123456
78910111213
14151617181920
21222324252627
282930    
Powered by LiveJournal.com