OpenSolaris

Printable Version Enter a New Search
Bug ID 6707555
Synopsis stmsboot is lost in a ZFS root world
State 11-Closed:Verified (Closed)
Category:Subcategory kernel:driver-mpt-x86
Keywords snv | stmsboot | zfs
Responsible Engineer James Mcpherson
Reported Against s10u6_03 , s10u6_05
Duplicate Of
Introduced In solaris_nevada
Commit to Fix snv_99
Fixed In snv_99
Release Fixed solaris_nevada(snv_99) , solaris_10u7(s10u7_02) (Bug ID:2166003)
Related Bugs 6673278 , 6673281 , 6691090 , 6714681 , 6761587 , 6775167 , 6781590
Submit Date 27-May-2008
Last Update Date 6-October-2008
Description
After installing snv_90 and boot from a zfs root file system using SB2500 internal disk,
attempting to enable MPxIO on the system with the 'stmsboot -e or -d' commands resulted in the following errors.

"ERROR: stmsboot: failed to mount the root filesystem.
Instructions to recover your previous STMS configuration (if in case the system does not boot)"

d17b-root@[/root]>stmsboot -e

WARNING: stmsboot operates on each supported multipath-capable controller
         detected in a host. In your system, these controllers are

/devices/pci@1c,600000/ xxxxx W,qlc@2/fp@0,0
/devices/pci@1c,600000/ xxxxx W,qlc@2,1/fp@0,0
/devices/pci@1d,700000/ xxxxx W,emlxs@2/fp@0,0
/devices/pci@1d,700000/ xxxxx W,emlxs@2/fp@1,0
/devices/pci@1d,700000/ xxxxx W,emlxs@2,1/fp@0,0
/devices/pci@1d,700000/ xxxxx W,emlxs@2,1/fp@1,0

If you do NOT wish to operate on these controllers, please quit stmsboot
and re-invoke with -D { fp | mpt } to specify which controllers you wish
to modify your multipathing configuration for.

Do you wish to continue? [y/n] (default: y)
WARNING: This operation will require a reboot.
Do you want to continue ? [y/n] (default: y)
The changes will come into effect after rebooting the system.
Reboot the system now ? [y/n] (default: y)
May 27 13:05:43 d17b reboot: initiated by root on /dev/console
updating /platform/sun4u/boot_archive
May 27 13:05:51 /usr/lib/snmp/snmpdx: received signal 15
syncing file systems... done
rebooting...

SC Alert: Host System has Reset

Sun Fire V250, No Keyboard
Copyright 2005 Sun Microsystems, Inc.  All rights reserved.
OpenBoot 4.17.1, 8192 MB memory installed, Serial #58641225.
Ethernet address 0:3:ba:7e:cb:49, Host ID: 837ecb49.



Rebooting with command: boot
Boot device: /pci@1d,700000/scsi@4/disk@0,0:a  File and args:
SunOS Release 5.11 Version snv_90 64-bit
Copyright 1983-2008 Sun Microsystems, Inc.  All rights reserved.
Use is subject to license terms.
Hardware watchdog enabled
mount: Mount point cannot be determined

ERROR: stmsboot: failed to mount the root filesystem.
Instructions to recover your previous STMS configuration (if in case the system does not boot):

boot net (or from a cd/dvd/another disk)
fsck <your-root-device>
mount <your-root-device> /mnt
cp /mnt/etc/mpxio/mpt.conf /mnt/kernel/drv/mpt.conf
cp /mnt/etc/mpxio/vfstab /mnt/etc/vfstab
/usr/sbin/svccfg -f /mnt/etc/mpxio/svccfg_recover
umount /mnt
reboot

rpool/ROOT/zfsBE was your root device,
but it could be named differently after you boot net.
These instructions were also logged to the file /etc/mpxio/recover_instructions

Hostname: d17b
NIS domain name is cst.sfbay.sun.com
Reading ZFS config: done.
Mounting ZFS filesystems: (5/5)

d17b console login:


Config:
------

- Jumpstart profile:
install_type    initial_install
system_type     standalone
partitioning    explicit
cluster  xxxxx WCXall add
# ZFS boot device
pool rpool auto 2g 2g mirror c1t0d0s0
bootenv installbe bename zfsBE

- d17b-root@[/net/csstinstall/export/Jumpstart/csst]>zpool status
  pool: rpool
 state: ONLINE
 scrub: none requested
config:

        NAME        STATE     READ WRITE CKSUM
        rpool       ONLINE       0     0     0
          c1t0d0s0  ONLINE       0     0     0

errors: No known data errors
d17b-root@[/net/csstinstall/export/Jumpstart/csst]>zfs list
NAME                USED  AVAIL  REFER  MOUNTPOINT
rpool              9.77G  57.2G    63K  /rpool
rpool/ROOT         5.77G  57.2G    18K  legacy
rpool/ROOT/zfsBE   5.77G  57.2G  5.77G  /
rpool/dump         2.00G  57.2G  2.00G  -
rpool/export         38K  57.2G    20K  /export
rpool/export/home    18K  57.2G    18K  /export/home
rpool/swap            2G  59.2G  2.68M  -
d17b-root@[/net/csstinstall/export/Jumpstart/csst]>


- vfstab:
d17b-root@[/net/csstinstall/export/Jumpstart/csst]>more /etc/vfstab
#device         device          mount           FS      fsck    mount   mount
#to mount       to fsck         point           type    pass    at boot options
#
fd      -       /dev/fd fd      -       no      -
/proc   -       /proc   proc    -       no      -
/dev/zvol/dsk/rpool/swap        -       -       swap    -       no      -
/devices        -       /devices        devfs   -       no      -
sharefs -       /etc/dfs/sharetab       sharefs -       no      -
ctfs    -       /system/contract        ctfs    -       no      -
objfs   -       /system/object  objfs   -       no      -
swap    -       /tmp    tmpfs   -       yes     -
# File modification by Jumpstart installation
### csstdist-240:/usr/dist      -       /usr/dist       nfs     -       yes     -
# End file modificaiton
d17b-root@[/net/csstinstall/export/Jumpstart/csst]>

xxxxxxxxxxxxxxxxxxxxxxxxxx Platform Information xxxxxxxxxxxxxxxxxxxxxxxxxxxxxx

 O/S: Solaris Express Community Edition snv_90 SPARC

SunOS d17b 5.11 snv_90 sun4u sparc  xxxxx W,Sun-Fire-V250

OBP 4.17.1 2005/04/11 14:24 Sun Blade 2500
 Boot Drive             size   used  avail capacity  Mounted on
rpool/ROOT/zfsBE        67G   5.8G    57G    10%    /

Number of Processors: 2 sparcv9 (2 Online)

xxxxxxxxxxxxxxxxxxxxxxxxxxx Driver Levels xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
  scsi_vhci (SCSI VHCI Driver 1.68)
  scsi_vhci_f_asym_sun (f_asym_sun 1.8)
  scsi_vhci_f_asym_lsi (f_asym_lsi 1.24)
  scsi_vhci_f_asym_emc (f_asym_emc 1.3)
  scsi_vhci_f_sym_emc (f_sym_emc 1.1)
  scsi_vhci_f_sym_hds (f_sym_hds 1.1)
  scsi_vhci_f_sym (f_sym 1.13)
  scsi_vhci_f_tpgs (f_tpgs 1.9)
  sd (SCSI Disk Driver 1.579)
  glm (GLM SCSI HBA Driver 1.208.)
  qlc (SunFC Qlogic FCA v20080221-2.27)
  fctl (SunFC Transport v20080512-1.67)
  qlc_fw_2300 (SunFC Qlogic FCA-FW 3.03.27)
  fp (SunFC Port v20080512-1.93)
  emlxs (SunFC emlxs FCA v20080326-2.31c)
  fcp (SunFC FCP v20080512-1.177)
  ssd (SCSI SSA/FCAL Disk Driver 1.579)
  mpt (MPT HBA Driver v1.136)
  st (SCSI tape Driver 1.279)
  qus (isp10160 HBA Driver 1.12)
  fcip (SunFC FCIP v20080512-1.60)
  fcsm (SunFC FCSM v20080512-1.24)
xxxxxxxxxxxxxxxxxxxxxxxxxx Layered SW xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
 SunOS d17b 5.11 snv_90 sun4u sparc  xxxxx W,Sun-Fire-V250
  Solaris Express Community Edition snv_90 SPARC
  Patches: None
  VXVM: NOT installed.
   SVM: NOT configured.
 ZONES: NOT configured.
ZPOOLS: CONFIGURED

NAME    SIZE   USED  AVAIL    CAP  HEALTH  ALTROOT
rpool    68G  7.77G  60.2G    11%  ONLINE  -

 TSlvm: NOT installed.
 Mizer: NOT installed.
  DDCG: NOT installed.
   DEX: NOT installed.
 MPxIO: ENABLED

xxxxxxxxxxxxxxxxxxxxxxxxxx HBA & Storage Info xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
get_dualpath called by mistake. Nothing to check cause bootdrive is blank at /net/csstdata-240/export/storage/slafond/get_rootvols.pl line 178.
 Using current config data file.
 Using current zone data file.
 DATE: .27May2008:14:01:24
  :banner-name   Sun Fire V250
 HOST: d17b:  xxxxx W,Sun-Fire-V250  Solaris Express Community Edition snv_90 SPARC

c0                             connected    configured   unknown
c0  qlc0   210000e08b0910aa  Crystal-2A      1.14.15  CONNECTED  /pci@1c,600000/ xxxxx W,qlc@2
c0  204400a0b8117690           connected    configured   unknown    SE6540    STK Universal Xport
c1                             connected    configured   unknown
c1  glm INTL-LVD  NA            /pci@1d,700000/scsi@4:    NA
c1  dsk/c1t0d0                 connected    configured   unknown    SEAGATE ST -->BOOT DISK
373307L xxxxx 72G
c1  dsk/c1t8d0                 connected    configured   unknown    SEAGATE ST373307L xxxxx 72G
c10                            connected    unconfigured unknown
c10 qus Jasper    QLGC,ISP10160 /pci@1e,600000/pci@4/scsi@5: 1.0.2
c11                            connected    unconfigured unknown
c11 glm Cauld/S   NA            /pci@1f,700000/pci@2/scsi@2: NA
c12                            connected    unconfigured unknown
c12 glm Cauld/S   NA            /pci@1f,700000/pci@2/scsi@2,1: NA
c2                             connected    configured   unknown
c2  emlxs1 10000000c9488851  Crystal-2R      1.50a8   CONNECTED  /pci@1d,700000/ xxxxx W,emlxs@2,1
c2  204500a0b8117690           connected    configured   unknown    SE6540    STK Universal Xport
c3                             connected    configured   unknown
c3  dsk/c3t2d0                 connected    configured   unknown    JLMS XJ-HD166S
c4                             connected    configured   unknown
c4  mpt Jasper320 LSI,1030      /pci@1e,600000/scsi@3:    LSI1030 FCode: 1.14 07/10/02 Release 4.27.5 
c4  dsk/c4t0d0                 connected    configured   unknown     xxxxx  StorEdge 3320
c5                             connected    unconfigured unknown
c5  qlc1   210100e08b2910aa  Crystal-2A      1.14.15  NOT CONNECTED /pci@1c,600000/ xxxxx W,qlc@2,1
c6                             connected    unconfigured unknown
c6  glm Unknown   NA            /pci@1d,700000/scsi@4,1:  NA
c7                             connected    unconfigured unknown
c7  emlxs0 10000000c9488850  Crystal-2R      1.50a8   NOT CONNECTED /pci@1d,700000/ xxxxx W,emlxs@2
c8                             connected    unconfigured unknown
c8  mpt Jasper320 LSI,1030      /pci@1e,600000/scsi@3,1:  LSI1030 FCode: 1.14 07/10/02 Release 4.27.5 
c9                             connected    unconfigured unknown
c9  qus Jasper    QLGC,ISP10160 /pci@1e,600000/pci@4/scsi@4: 1.0.2
cfgpath is     set               to:             Not Applicable, and:       Not Applicable
Please attach a tar'd up copy of your /etc/mpxio directory, as well as the
output from "prtconf -v" and "prtpicl -v"
Work Around
On reboot, login and run  

svcadm disable mpxio-upgrade

then reboot again.
Comments
N/A