Unexplained kernel panic on FAI client
Chris Jewell
chris.jewell at warwick.ac.uk
Sat Apr 3 09:08:24 CEST 2010
Hi All,
First, I am new to this list, so let me introduce myself. I am
primarily a statistician who runs a small high performance cluster in an
academic department. Because I am first and foremost an academic, I
have tried to automate as much of the cluster maintenance as I can, so
that I can spend as little time tinkering with it as possible. Hence I
decided to use FAI!
The original cluster was composed of Dell PE1950 servers (with a PE2950
fileserver). This all went together very well, and FAI performed really
well for installing the 5 execution nodes. However, I have just
recently purchased two more execution nodes, this time Dell R410s as the
1950s are now obsolete. As you will see from the log below, the PXE
boot works fine, up until the root filesystem is mounted. I then get a
kernel panic. Can anyone suggest why?
Cheers,
Chris
Log snippet: (full log at
http://www2.warwick.ac.uk/fac/sci/statistics/staff/research/jewell/fai-fail-log.txt)
Begin: Loading essential drivers...
...
[ 5.369859] md: linear personality registered for level
-1
[ 5.381653] Fusion MPT base driver
3.04.10
[ 5.384557] md: multipath personality registered for level
-4
[ 5.386658] md: raid0 personality registered for level
0
[ 5.389990] md: raid1 personality registered for level
1
[ 5.392157] xor: automatically using best checksumming function:
generic_sse
[ 5.392448] Adding 4114496k swap on /dev/ramzswap0. Priority:100
extents:1 across:4114496k SSD
[ 5.417716] Copyright (c) 1999-2008 LSI
Corporation
[ 5.433996] Fusion MPT SAS Host driver
3.04.10
[ 5.439527] generic_sse: 5934.800
MB/sec
[ 5.444072] xor: using function: generic_sse (5934.800
MB/sec)
[ 5.452725] mptsas 0000:02:00.0: PCI INT A -> GSI 32 (level, low) ->
IRQ 32
[ 5.460888] async_tx: api initialized
(async)
[ 5.519340] usb 5-1: new low speed USB device using uhci_hcd and
address 2
[ 5.629048] raid6: int64x1 1416
MB/s
[ 5.712992] usb 5-1: configuration #1 chosen from 1
choice
[ 5.744145] usbcore: registered new interface driver
hiddev
[ 5.762954] input: Avocent Dell 03R874 as
/devices/pci0000:00/0000:00:1d.0/usb5/5-1/5-1:1.0/input/input2
[ 5.772446] generic-usb 0003:0624:0294.0001: input,hidraw0: USB HID
v1.10 Keyboard [Avocent Dell 03R874] on
usb-0000:00:1d.0-1/input0
[ 5.798580] raid6: int64x2 1825
MB/s
[ 5.803894] input: Avocent Dell 03R874 as
/devices/pci0000:00/0000:00:1d.0/usb5/5-1/5-1:1.1/input/input3
[ 5.813402] generic-usb 0003:0624:0294.0002: input,hidraw1: USB HID
v1.10 Mouse [Avocent Dell 03R874] on
usb-0000:00:1d.0-1/input1
[ 5.825095] usbcore: registered new interface driver
usbhid
[ 5.830642] usbhid: v2.6:USB HID core
driver
[ 5.968148] raid6: int64x4 1229
MB/s
[ 6.137669] raid6: int64x8 1845
MB/s
[ 6.307209] raid6: sse2x1 7953
MB/s
[ 6.476759] raid6: sse2x2 9290
MB/s
[ 6.646301] raid6: sse2x4 10607
MB/s
[ 6.650291] raid6: using algorithm sse2x4 (10607
MB/s)
[ 6.655448] mptbase: ioc0: Initiating
bringup
[ 6.663059] md: raid6 personality registered for level
6
[ 6.668376] md: raid5 personality registered for level
5
[ 6.673661] md: raid4 personality registered for level
4
[ 6.683360] md: raid10 personality registered for level
10
Done.
Begin: Running /scripts/init-premount
...
Done.
Begin: Mounting root file system...
...
Begin: Running /scripts/live-premount
...
Done.
[ 8.221965] ioc0: LSISAS1068E B3: Capabilities={Initiator}
[ 25.878441] scsi4 : ioc0: LSISAS1068E B3, FwRev=00192f00h, Ports=1,
MaxQ=266, IRQ=32
[ 25.922536] mptsas: ioc0: attaching sata device: fw_channel 0, fw_id
0, phy 0, sas_addr 0x1221000000000000
[ 25.934580] scsi 4:0:0:0: Direct-Access ATA WDC WD1602ABKS-1
3B04 PQ: 0 ANSI: 5
[ 25.944316] sd 4:0:0:0: Attached scsi generic sg1 type 0
[ 25.950718] sd 4:0:0:0: [sda] 312500000 512-byte logical blocks: (160
GB/149 GiB)
[ 25.966492] sd 4:0:0:0: [sda] Write Protect is off
[ 25.973310] sd 4:0:0:0: [sda] Write cache: enabled, read cache:
enabled, doesn't support DPO or FUA
[ 25.993370] sda: sda1 sda2
[ 26.017709] sd 4:0:0:0: [sda] Attached SCSI disk
ata_id[534]: HDIO_GET_IDENTITY failed for '/dev/.tmp-block-8:0'
[ 26.125158] Kernel panic - not syncing: Attempted to kill init!
[ 26.131052] Pid: 1, comm: init Not tainted 2.6.31-20-generic #58-Ubuntu
[ 26.137640] Call Trace:
[ 26.140082] [<ffffffff8152a0dd>] panic+0x73/0x12b
[ 26.144857] [<ffffffff81120fe4>] ? __fput+0x194/0x210
[ 26.149971] [<ffffffff8106039b>] find_new_reaper+0x9b/0xa0
[ 26.155520] [<ffffffff81060f7d>] forget_original_parent+0x3d/0x290
[ 26.161756] [<ffffffff8106058c>] ? put_files_struct+0xbc/0xe0
[ 26.167562] [<ffffffff810611e6>] exit_notify+0x16/0x1c0
[ 26.172848] [<ffffffff810619c5>] do_exit+0x1c5/0x360
[ 26.177876] [<ffffffff81061ba9>] do_group_exit+0x49/0xc0
[ 26.183249] [<ffffffff81061c32>] sys_exit_group+0x12/0x20
[ 26.188712] [<ffffffff81012082>] system_call_fastpath+0x16/0x1b
More information about the linux-fai
mailing list