FAI and NFS... please help troubleshoot

Yaroslav Halchenko yoh at psychology.rutgers.edu
Thu May 25 23:51:24 CEST 2006


Dear FAI users,

I have come back to FAI to install a few additional nodes with amd64
Debian. Let me put some specifics aside (I was trying to install different
releases, i.e. sarge, sid, etc.). The main problem now, I think, is a
misbehaving NFS server, although it seems to work fine when accessed from
the other nodes. I thought it might be due to the bonding of interfaces on
the server, so I added another interface and gave it a separate IP: the
same luck. I thought it might be the server kernel (it was 2.6.16.18), so
I downgraded to 2.6.15-1-amd64-k8-smp from backports.org: the same story.

On the client I tried switching to TCP and NFS v3. At one point it
succeeded after a few "still trying" messages, but it failed on the next
attempt.

Now some details.

On the server side I see the requests being granted:

May 25 17:44:08 raider in.tftpd[4421]: connect from 10.0.0.26 (10.0.0.26)
May 25 17:44:08 raider in.tftpd[4422]: RRQ from 10.0.0.26 filename pxelinux.0
May 25 17:44:08 raider in.tftpd[4422]: tftp: client does not accept options
May 25 17:44:08 raider in.tftpd[4423]: RRQ from 10.0.0.26 filename pxelinux.0
May 25 17:44:08 raider in.tftpd[4424]: RRQ from 10.0.0.26 filename pxelinux.cfg/01-00-17-31-2b-55-85
May 25 17:44:08 raider in.tftpd[4425]: RRQ from 10.0.0.26 filename pxelinux.cfg/0A00001A
May 25 17:44:08 raider in.tftpd[4426]: RRQ from 10.0.0.26 filename vmlinuz-install
May 25 17:44:18 raider mountd[4145]: authenticated mount request from node26.ravana.rutgers.edu:1023 for /home/fai/nfsroot/amd64.sid (/home/fai/nfsroot)

As I said, NFS seems to be working fine:

raider:/boot/fai/pxelinux.cfg# rpcinfo -p localhost
   program vers proto   port
    100000    2   tcp    111  portmapper
    100000    2   udp    111  portmapper
    100021    1   udp  32768  nlockmgr
    100021    3   udp  32768  nlockmgr
    100021    4   udp  32768  nlockmgr
    100021    1   tcp  57106  nlockmgr
    100021    3   tcp  57106  nlockmgr
    100021    4   tcp  57106  nlockmgr
    100007    2   udp    935  ypbind
    100007    1   udp    935  ypbind
    100007    2   tcp    938  ypbind
    100007    1   tcp    938  ypbind
    100024    1   udp    950  status
    100024    1   tcp    953  status
    300019    1   tcp    958  amd
    300019    1   udp    959  amd
    100003    2   udp   2049  nfs
    100003    2   tcp   2049  nfs
    100005    1   udp    649  mountd
    100005    2   udp    649  mountd
    100005    1   tcp    652  mountd
    100005    2   tcp    652  mountd
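
In the listing above, nfs is registered only as version 2 and mountd as versions 1 and 2, while the client log further down looks up RPC 100003/3 and 100005/3. Whether that mismatch explains the timeouts is not established here, but it is easy to check mechanically. The following is a minimal sketch, assuming the `rpcinfo -p` output has been saved as text; the function name `registered_versions` and the `SAMPLE` constant are made up for illustration.

```python
from collections import defaultdict


def registered_versions(rpcinfo_text):
    """Map (service name, protocol) to the sorted list of registered versions."""
    versions = defaultdict(set)
    for line in rpcinfo_text.splitlines():
        fields = line.split()
        # Data rows have five columns: program, version, proto, port, name.
        # The header row has four and is skipped here.
        if len(fields) != 5 or not fields[0].isdigit():
            continue
        _program, vers, proto, _port, name = fields
        versions[(name, proto)].add(int(vers))
    return {key: sorted(found) for key, found in versions.items()}


# The nfs and mountd rows copied from the listing above.
SAMPLE = """\
   program vers proto   port
    100003    2   udp   2049  nfs
    100003    2   tcp   2049  nfs
    100005    1   udp    649  mountd
    100005    2   udp    649  mountd
    100005    1   tcp    652  mountd
    100005    2   tcp    652  mountd
"""

if __name__ == "__main__":
    table = registered_versions(SAMPLE)
    for (name, proto), found in sorted(table.items()):
        # Flag services for which version 3 is not registered.
        note = "" if 3 in found else "  (no version 3)"
        print(f"{name:8s} {proto:4s} {found}{note}")
```

Running the same function on `rpcinfo -p` output from a server that does offer NFS v3 would give a direct comparison.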

The FAI mounts are exported as:

/usr/local/share/fai 10.0.0.0/24(async,ro)
/home/fai/nfsroot 10.0.0.0/24(async,ro,no_root_squash)
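
To double-check that these two export lines really cover the client (10.0.0.26) and the path it requests, the matching can be sketched in a few lines. This is only an approximation under stated assumptions: it handles the simple `path network(options)` form used above, not the full exports(5) syntax, and it ignores everything else the server considers when granting a mount. The names `parse_exports` and `covering_exports` are hypothetical.

```python
import ipaddress
from pathlib import PurePosixPath


def parse_exports(text):
    """Yield (path, network, options) for lines of the form 'path net(opts)'."""
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        path, client = line.split(None, 1)
        spec, _, opts = client.partition("(")
        yield (
            PurePosixPath(path),
            ipaddress.ip_network(spec.strip()),
            opts.rstrip(")").split(","),
        )


def covering_exports(text, client_ip, wanted_path):
    """Return exports whose network contains the client and whose path
    equals the wanted path or is one of its parent directories."""
    addr = ipaddress.ip_address(client_ip)
    wanted = PurePosixPath(wanted_path)
    hits = []
    for path, net, opts in parse_exports(text):
        if addr in net and (wanted == path or path in wanted.parents):
            hits.append((str(path), opts))
    return hits


# The two export lines quoted above.
EXPORTS = """\
/usr/local/share/fai 10.0.0.0/24(async,ro)
/home/fai/nfsroot 10.0.0.0/24(async,ro,no_root_squash)
"""

if __name__ == "__main__":
    # Path from the nfsroot= kernel parameter, then the rootpath from DHCP.
    for wanted in ("/home/fai/nfsroot/amd64.sid", "/usr/lib/fai/nfsroot"):
        print(wanted, "->", covering_exports(EXPORTS, "10.0.0.26", wanted))
```

With the lines above, the path from the kernel command line falls under the `/home/fai/nfsroot` export, while the `rootpath` reported by DHCP in the client log falls under neither export.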


FAI's output (from a telnet session; the ends of long lines were cut off):

IP-Config: Got DHCP answer from 10.0.0.249, my address is 10.0.0.26
IP-Config: Complete:
      device=eth1, addr=10.0.0.26, mask=255.255.255.0, gw=10.0.0.1,
      host=node26, domain=ravana.rutgers.edu rutgers.edu, nis-domain=ravana.rutgers.edu,
      bootserver=10.0.0.249, rootserver=10.0.0.249, rootpath=/usr/lib/fai/nfsroot
Looking up port of RPC 100003/3 on 10.0.0.249
Looking up port of RPC 100005/3 on 10.0.0.249
VFS: Mounted root (nfs filesystem) readonly.
Freeing unused kernel memory: 208k freed
INIT: version 2.86 booting
Kernel parameters: ip=dhcp devfs=nomount FAI_ACTION=install console=ttyS0,115200n8 root=/dev/nfs nfsroot=/home/fai/nfsroot/amd64.sid,tcp,v3 FAI_FLAGS=verbose,sshd,reb 
             -----------------------------------------------------
               Fully Automatic Installation for Debian GNU/Linux
               FAI 2.10.1, 20 Apr 2006    Copyright (c) 1999-2006

               Thomas Lange      <lange at informatik.uni-koeln.de>
             -----------------------------------------------------
Calling task_confdir
Kernel parameters: ip=dhcp devfs=nomount FAI_ACTION=install console=ttyS0,115200n8 root=/dev/nfs nfsroot=/home/fai/nfsroot/amd64.sid,tcp,v3 FAI_FLAGS=verbose,sshd,reb 
nfs: server 10.0.0.249 not responding, still trying
nfs: server 10.0.0.249 not responding, still trying
/usr/lib/fai/get-boot-info: line 66: killall: command not found
/usr/lib/fai/get-boot-info: line 124: dmesg: command not found
/usr/lib/fai/get-boot-info: line 124: grep: com
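
The log above carries two different root paths: IP-Config reports rootpath=/usr/lib/fai/nfsroot from DHCP, while the kernel command line has nfsroot=/home/fai/nfsroot/amd64.sid,tcp,v3. To compare them side by side, the command line can be split into its parts. This is a minimal sketch, assuming the `nfsroot=[server:]path[,options]` layout; the helper names `parse_cmdline` and `parse_nfsroot` are invented for illustration, and the FAI_FLAGS value is left out because it is cut off in the log.

```python
def parse_cmdline(cmdline):
    """Split a kernel command line into a dict; bare flags map to None."""
    params = {}
    for token in cmdline.split():
        key, sep, value = token.partition("=")
        params[key] = value if sep else None
    return params


def parse_nfsroot(value):
    """Split '[server:]path[,options]' into (server, path, option list)."""
    location, _, opts = value.partition(",")
    server, sep, path = location.rpartition(":")
    if not sep:
        # No 'server:' prefix; the whole location is the path.
        server, path = None, location
    return server, path, (opts.split(",") if opts else [])


# Kernel parameters as printed in the log, without the truncated FAI_FLAGS.
CMDLINE = (
    "ip=dhcp devfs=nomount FAI_ACTION=install console=ttyS0,115200n8 "
    "root=/dev/nfs nfsroot=/home/fai/nfsroot/amd64.sid,tcp,v3"
)

if __name__ == "__main__":
    params = parse_cmdline(CMDLINE)
    server, path, options = parse_nfsroot(params["nfsroot"])
    print("server :", server)
    print("path   :", path)
    print("options:", options)
```

The path extracted this way matches the one in the mountd line on the server, which suggests the command-line value is the one the client actually requested.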


Could you please point me in the right direction on how to troubleshoot
this beastie?

Thank you in advance for any ideas.

-- 
Yaroslav Halchenko
Research Assistant, Psychology Department, Rutgers-Newark
Office: (973) 353-5440x263 | FWD: 82823 | Fax: (973) 353-1171
        101 Warren Str, Smith Hall, Rm 4-105, Newark NJ 07102
Student  Ph.D. @ CS Dept. NJIT


