Unexplained kernel panic on FAI client

Chris Jewell chris.jewell at warwick.ac.uk
Sat Apr 3 09:08:24 CEST 2010


Hi All,

First, I am new to this list, so let me introduce myself.  I am
primarily a statistician who runs a small high performance cluster in an
academic department.  Because I am first and foremost an academic, I
have tried to automate as much of the cluster maintenance as I can, so
that I can spend as little time tinkering with it as possible.  Hence I
decided to use FAI!

The original cluster was composed of Dell PE1950 servers (with a PE2950
fileserver).  This all went together very well, and FAI performed really
well for installing the 5 execution nodes.  However, I have just
recently purchased two more execution nodes, this time Dell R410s as the
1950s are now obsolete.  As you will see from the log below, the PXE
boot works fine, up until the root filesystem is mounted. I then get a
kernel panic.  Can anyone suggest why?

Cheers,

Chris

Log snippet: (full log at
http://www2.warwick.ac.uk/fac/sci/statistics/staff/research/jewell/fai-fail-log.txt)

Begin: Loading essential drivers...
...                                                            
[    5.369859] md: linear personality registered for level
-1                                      
[    5.381653] Fusion MPT base driver
3.04.10                                                      
[    5.384557] md: multipath personality registered for level
-4                                   
[    5.386658] md: raid0 personality registered for level
0                                        
[    5.389990] md: raid1 personality registered for level
1                                        
[    5.392157] xor: automatically using best checksumming function:
generic_sse                    
[    5.392448] Adding 4114496k swap on /dev/ramzswap0.  Priority:100
extents:1 across:4114496k SSD 
[    5.417716] Copyright (c) 1999-2008 LSI
Corporation                                             
[    5.433996] Fusion MPT SAS Host driver
3.04.10                                                  
[    5.439527]    generic_sse:  5934.800
MB/sec                                                    
[    5.444072] xor: using function: generic_sse (5934.800
MB/sec)                                  
[    5.452725] mptsas 0000:02:00.0: PCI INT A -> GSI 32 (level, low) ->
IRQ 32                     
[    5.460888] async_tx: api initialized
(async)                                                   
[    5.519340] usb 5-1: new low speed USB device using uhci_hcd and
address 2                      
[    5.629048] raid6: int64x1   1416
MB/s                                                          
[    5.712992] usb 5-1: configuration #1 chosen from 1
choice                                      
[    5.744145] usbcore: registered new interface driver
hiddev                                     
[    5.762954] input: Avocent Dell 03R874 as
/devices/pci0000:00/0000:00:1d.0/usb5/5-1/5-1:1.0/input/input2                                                                                            

[    5.772446] generic-usb 0003:0624:0294.0001: input,hidraw0: USB HID
v1.10 Keyboard [Avocent Dell 03R874] on
usb-0000:00:1d.0-1/input0                                                               

[    5.798580] raid6: int64x2   1825
MB/s                                                          
[    5.803894] input: Avocent Dell 03R874 as
/devices/pci0000:00/0000:00:1d.0/usb5/5-1/5-1:1.1/input/input3                                                                                            

[    5.813402] generic-usb 0003:0624:0294.0002: input,hidraw1: USB HID
v1.10 Mouse [Avocent Dell 03R874] on
usb-0000:00:1d.0-1/input1                                                                  

[    5.825095] usbcore: registered new interface driver
usbhid                                     
[    5.830642] usbhid: v2.6:USB HID core
driver                                                    
[    5.968148] raid6: int64x4   1229
MB/s                                                          
[    6.137669] raid6: int64x8   1845
MB/s                                                          
[    6.307209] raid6: sse2x1    7953
MB/s                                                          
[    6.476759] raid6: sse2x2    9290
MB/s                                                          
[    6.646301] raid6: sse2x4   10607
MB/s                                                          
[    6.650291] raid6: using algorithm sse2x4 (10607
MB/s)                                          
[    6.655448] mptbase: ioc0: Initiating
bringup                                                   
[    6.663059] md: raid6 personality registered for level
6                                        
[    6.668376] md: raid5 personality registered for level
5                                        
[    6.673661] md: raid4 personality registered for level
4                                        
[    6.683360] md: raid10 personality registered for level
10                                      
Done.                                                                                              

Begin: Running /scripts/init-premount
...                                                          
Done.                                                                                              

Begin: Mounting root file system...
...                                                            
Begin: Running /scripts/live-premount
...                                                          
Done.
[    8.221965] ioc0: LSISAS1068E B3: Capabilities={Initiator}
[   25.878441] scsi4 : ioc0: LSISAS1068E B3, FwRev=00192f00h, Ports=1,
MaxQ=266, IRQ=32
[   25.922536] mptsas: ioc0: attaching sata device: fw_channel 0, fw_id
0, phy 0, sas_addr 0x1221000000000000
[   25.934580] scsi 4:0:0:0: Direct-Access     ATA      WDC WD1602ABKS-1
3B04 PQ: 0 ANSI: 5
[   25.944316] sd 4:0:0:0: Attached scsi generic sg1 type 0
[   25.950718] sd 4:0:0:0: [sda] 312500000 512-byte logical blocks: (160
GB/149 GiB)
[   25.966492] sd 4:0:0:0: [sda] Write Protect is off
[   25.973310] sd 4:0:0:0: [sda] Write cache: enabled, read cache:
enabled, doesn't support DPO or FUA
[   25.993370]  sda: sda1 sda2
[   26.017709] sd 4:0:0:0: [sda] Attached SCSI disk
ata_id[534]: HDIO_GET_IDENTITY failed for '/dev/.tmp-block-8:0'

[   26.125158] Kernel panic - not syncing: Attempted to kill init!
[   26.131052] Pid: 1, comm: init Not tainted 2.6.31-20-generic #58-Ubuntu
[   26.137640] Call Trace:
[   26.140082]  [<ffffffff8152a0dd>] panic+0x73/0x12b
[   26.144857]  [<ffffffff81120fe4>] ? __fput+0x194/0x210
[   26.149971]  [<ffffffff8106039b>] find_new_reaper+0x9b/0xa0
[   26.155520]  [<ffffffff81060f7d>] forget_original_parent+0x3d/0x290
[   26.161756]  [<ffffffff8106058c>] ? put_files_struct+0xbc/0xe0
[   26.167562]  [<ffffffff810611e6>] exit_notify+0x16/0x1c0
[   26.172848]  [<ffffffff810619c5>] do_exit+0x1c5/0x360
[   26.177876]  [<ffffffff81061ba9>] do_group_exit+0x49/0xc0
[   26.183249]  [<ffffffff81061c32>] sys_exit_group+0x12/0x20
[   26.188712]  [<ffffffff81012082>] system_call_fastpath+0x16/0x1b




More information about the linux-fai mailing list