Sun & FreeBSD Probleme

From: Andreas Zymny <andreas(at)zymny.de>
Date: Sat, 06 Jan 2007 13:25:59 +0100

Hallo zusammen,

ich habe hier eine 420R stehen, auf der ein FreeBSD laeuft.

Seit einigen Tagen verhaelt sie sich etwas merkwuerdig. Sie rebootet
spontan. Bisher dachte ich, das laeg an einem Wackler in einer Steckdose
ist, weil das immer nur dann passierte, wenn ich an den Kabeln
rumgefingert habe (absichtlich oder unabsichtlich). In der Regel faehrt
sie danach einfach wieder hoch, aber heute hatte sie ihre Probleme
damit. In den Logfiles ist nichts zu finden - dazu scheint der syslogd
nicht mehr zu kommen.

Auf der Konsole hat sie heute die Meldung

panic: trap: data access exception

geschmissen, und danach resettet. Das passierte rein zufaellig. Mal wenn
FreeBSD versucht hatte das GEOM zu starten, mal wenn gerade der sendmail
gestartet wurde, oder die Netzwerkinterfaces konfiguriert wurden.

Ich hatte sie jetzt mal eine gute Stunde abgeschaltet, und sie kam
spontan hoch. und laeuft jetzt auch wieder seit einer guten Stunde.

Das hier war der Konsolenoutput des letzten Reboots, die anderen Male
kam der Panic nur zu einer anderen Gelegenheit.

Hit [Enter] to boot immediately, or any other key for command prompt.

Booting [/boot/kernel/kernel]...

nothing to autoload yet.

jumping to kernel entry at 0xc0060000.

GDB: no destray vector interrupt 2029

bug ports present

KDB: debugger backends: ddb

KDB: current backend: ddb

Copyright (c) 1992-2006 The FreeBSD Project.

Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994

        The Regents of the University of California. All rights
reserved.
FreeBSD 6.2-PRERELEASE #6: Sat Sep 23 02:51:58 CEST 2006

    root@:/usr/obj/usr/src/sys/SERVER

real memory = 2147483648 (2048 MB)

avail memory = 2067505152 (1971 MB)

cpu0: Sun Microsystems UltraSparc-II Processor (450.03 MHz CPU)

cpu1: Sun Microsystems UltraSparc-II Processor (450.03 MHz CPU)

FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs

nexus0: <Open Firmware Nexus device>

pcib0: <U2P UPA-PCI bridge> on nexus0

pcib0: Psycho, impl 0, version 4, ign 0x7c0, bus B

pcib0: [FAST]

pcib0: [FAST]

pcib0: [GIANT-LOCKED]

pcib0: [GIANT-LOCKED]

pcib0: [FAST]

initializing counter-timer

Timecounter "counter-timer" frequency 1000000 Hz quality 100

pcib0 dvma: DVMA map: 0xfc000000 to 0xffffffff

pci0: <OFW PCI bus> on pcib0

ebus0: <PCI-EBus2 bridge> mem
0x70000000-0x70ffffff,0x71000000-0x717fffff at de0
auxio0: <Sun Auxiliary I/O> addr
0x1400726000-0x1400726003,0x1400728000-0x140070
ebus0: <power> addr 0x1400724000-0x1400724003 (no driver attached)

ebus0: <SUNW,pll> addr 0x1400504000-0x1400504002 (no driver attached)

ebus0: <sc> addr 0x1400500000-0x1400500007 (no driver attached)

puc0: <Siemens SAB 82532 dual channel SCC> addr
0x1400400000-0x140040007f irq 40
uart0: <SAB 82532 v3.2, channel A> on puc0

uart0: CTS oflow

uart0: console (9600,n,8,1)

uart1: <SAB 82532 v3.2, channel B> on puc0

uart1: CTS oflow

uart2: <16550 or compatible> addr 0x14003083f8-0x14003083ff irq 41 on
ebus0
uart2: keyboard (1200,n,8,1)

uart2: keyboard not present

uart3: <16550 or compatible> addr 0x14003062f8-0x14003062ff irq 42 on
ebus0
ebus0: <ecpp> addr
0x14003043bc-0x14003043cb,0x1400300398-0x1400300399,0x140070)
ebus0: <fdthree> addr
0x14003023f0-0x14003023f7,0x1400706000-0x140070600f,0x140)
eeprom0: <EEPROM/clock> addr 0x1400000000-0x1400001fff on ebus0

eeprom0: model mk48t59

eeprom0: hostid 83010c9d

ebus0: <flashprom> addr 0x1000000000-0x10000fffff (no driver attached)

hme0: <Sun HME 10/100 Ethernet> mem 0x100000-0x107fff at device 1.1 on
pci0
miibus0: <MII bus> on hme0

ukphy0: <Generic IEEE 802.3u media interface> on miibus0

ukphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto

hme0: Ethernet address: 00:03:ba:01:0c:9d

sym0: <875> port 0x1000-0x10ff mem 0x108000-0x1080ff,0x10a000-0x10afff
at devic0
sym0: No NVRAM, ID 7, Fast-20, SE, parity checking

sym0: [GIANT-LOCKED]

sym1: <875> port 0x1400-0x14ff mem 0x10c000-0x10c0ff,0x10e000-0x10efff
at devic0
sym1: No NVRAM, ID 7, Fast-20, SE, parity checking

sym1: [GIANT-LOCKED]

pci0: <display> at device 5.0 (no driver attached)

pcib1: <U2P UPA-PCI bridge> on nexus0

pcib1: Psycho, impl 0, version 4, ign 0x7c0, bus A

pcib1: [FAST]

pci1: <OFW PCI bus> on pcib1

pcib2: <OFW PCI-PCI bridge> at device 1.0 on pci1

pci2: <OFW PCI bus> on pcib2

pci2: <bridge> at device 0.0 (no driver attached)

hme1: <Sun HME 10/100 Ethernet> mem 0x4000000-0x4007fff at device 0.1 on
pci2
miibus1: <MII bus> on hme1

ukphy1: <Generic IEEE 802.3u media interface> on miibus1

ukphy1: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto

hme1: Ethernet address: 08:00:20:f1:0c:e4

pci2: <bridge> at device 1.0 (no driver attached)

hme2: <Sun HME 10/100 Ethernet> mem 0x8000000-0x8007fff at device 1.1 on
pci2
miibus2: <MII bus> on hme2

ukphy2: <Generic IEEE 802.3u media interface> on miibus2

ukphy2: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto

hme2: Ethernet address: 08:00:20:f1:0c:e5

pci2: <bridge> at device 2.0 (no driver attached)

hme3: <Sun HME 10/100 Ethernet> mem 0xc000000-0xc007fff at device 2.1 on
pci2
miibus3: <MII bus> on hme3

ukphy3: <Generic IEEE 802.3u media interface> on miibus3

ukphy3: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto

hme3: Ethernet address: 08:00:20:f1:0c:e6

pci2: <bridge> at device 3.0 (no driver attached)

hme4: <Sun HME 10/100 Ethernet> mem 0x10000000-0x10007fff at device 3.1
on pci2
miibus4: <MII bus> on hme4

ukphy4: <Generic IEEE 802.3u media interface> on miibus4

ukphy4: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto

hme4: Ethernet address: 08:00:20:f1:0c:e7

nexus0: <syscons>, type (unknown) (no driver attached)

Timecounters tick every 1.000 msec

ipfw2 (+ipv6) initialized, divert loadable, rule-based forwarding
enabled, defad
Waiting 5 seconds for SCSI devices to settle

SMP: AP CPU #1 Launched!

cd0 at sym0 bus 0 target 6 lun 0

cd0: <TOSHIBA DVD-ROM SD-M1401 1009> Removable CD-ROM SCSI-2 device

cd0: 20.000MB/s transfers (20.000MHz, offset 16)

cd0: Attempt to query device size failed: NOT READY, Medium not present

da0 at sym0 bus 0 target 0 lun 0

da0: <FUJITSU MAT3073NC 0108> Fixed Direct Access SCSI-3 device

da0: 40.000MB/s transfers (20.000MHz, offset 16, 16bit), Tagged Queueing
Enabled
da0: 70136MB (143638992 512 byte sectors: 255H 63S/T 8941C)

GEOM_MIRROR: Device gm0 created (id=1247000163).

GEOM_MIRROR: Device gm0: provider da0 detected.

Root mount waiting for: GMIRROR

Root mount waiting for: GMIRROR

Root mount waiting for: GMIRROR

Root mount waiting for: GMIRROR

GEOM_MIRROR: Force device gm0 start due to timeout.

GEOM_MIRROR: Device gm0: provider da0 activated.

GEOM_MIRROR: Device gm0: provider mirror/gm0 launched.

Trying to mount root from ufs:/dev/mirror/gm0a

WARNING: / was not properly dismounted

Loading configuration files.

Loading configuration files.

Entropy harvesting: interrupts ethernet point_to_point kickstart.

swapon: adding /dev/mirror/gm0b as swap device

Starting file system checks:

/dev/mirror/gm0a: 1217 files, 30826 used, 225853 free (1413 frags, 28055
blocks)
/dev/mirror/gm0d: DEFER FOR BACKGROUND CHECKING

/dev/mirror/gm0e: DEFER FOR BACKGROUND CHECKING

Mounting local file systems:WARNING: /usr was not properly dismounted

WARNING: /var was not properly dismounted

.

Setting hostname: server.zymny.de.

hme1: link state changed to UP

hme2: link state changed to UP

lo0: flags=8049<UP,LOOPBhme0: link state changed to UP

ACK,RUNNING,MULTICAST> mtu 16384

        inet6 ::1 prefixlen 128

        inet6 fe80::1%lo0 prefixlen 64 scopeid 0x6

        inet 127.0.0.1 netmask 0xff000000

hme0: flags=8843<UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST> mtu 1500

        options=b<RXCSUM,TXCSUM,VLAN_MTU>

        inet6 fe80::203:baff:fe01:c9d%hme0 prefixlen 64 tentative
scopeid 0x1
        ether 00:03:ba:01:0c:9d

        media: Ethernet autoselect (10baseT/UTP <full-duplex>)

        status: active

hme1: flags=8843<UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST> mtu 1500

        options=b<RXCSUM,TXCSUM,VLAN_MTU>

        inet6 fe80::a00:20ff:fef1:ce4%hme1 prefixlen 64 tentative
scopeid 0x2
        inet 192.168.0.1 netmask 0xfffffff0 broadcast 192.168.0.15

        ether 08:00:20:f1:0c:e4

        media: Ethernet 100baseTX <full-duplex>

        status: active

hme2: flags=8843<UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST> mtu 1500

        options=b<RXCSUM,TXCSUM,VLAN_MTU>

        inet6 fe80::a00:20ff:fef1:ce5%hme2 prefixlen 64 tentative
scopeid 0x3
        inet 192.168.1.1 netmask 0xfffffff8 broadcast 192.168.1.7

        ether 08:00:20:f1:0c:e5

        media: Ethernet 10baseT/UTP

        status: active

hme3: flags=8802<BROADCAST,SIMPLEX,MULTICAST> mtu 1500

        options=b<RXCSUM,TXCSUM,VLAN_MTU>

        inet6 fe80::a00:20ff:fef1:ce6%hme3 prefixlen 64 tentative
scopeid 0x4
        ether 08:00:20:f1:0c:e6

        media: Ethernet autoselect

hme4: flags=8802<BROADCAST,SIMPLEX,MULTICAST> mtu 1500

        options=b<RXCSUM,TXCSUM,VLAN_MTU>

        inet6 fe80::a00:20ff:fef1:ce7%hme4 prefixlen 64 tentative
scopeid 0x5
        ether 08:00:20:f1:0c:e7

        media: Ethernet autoselect

Starting devd.

hme3: flags=8802<BROADCAST,SIMPLEX,MULTICAST> mtu 1500

        options=b<RXCSUM,TXCSUM,VLAN_MTU>

        inet6 fe80::a00:20ff:fef1:ce6%hme3 prefixlen 64 tentative
scopeid 0x4
        ether 08:00:20:f1:0c:e6

        media: Ethernet autoselect

hme4: flags=8802<BROADCAST,SIMPLEX,MULTICAST> mtu 1500

        options=b<RXCSUM,TXCSUM,VLAN_MTU>

        inet6 fe80::a00:20ff:fef1:ce7%hme4 prefixlen 64 tentative
scopeid 0x5
        ether 08:00:20:f1:0c:e7

        media: Ethernet autoselect

Starting ppp.

Starting divert daemons:Flushed all rules.

00001 check-state

65533 allow ip from any to any keep-state

Additional routing options: IP gateway=YES.

Starting routed.

Mounting NFS file systems:.

ELF ldconfig path: /lib /usr/lib /usr/lib/compat /usr/local/lib
/usr/local/lib/l
Creating and/or trimming log files:.

Starting syslogd.

Initial sparc64 initialization:.

Additional ABI support:.

Starting named.

Recovering vi editor sessions:.

Starting smartd.

Starting saslauthd.

Starting local daemons:.

Updating motd.

Starting apache2.

Starting dhcpd.

Starting mysql.

Starting cyrus_imapd.

Starting sshd.

Starting sendmail.

panic: trap: data access exception

cpuid = 0

Uptime: 33s

Cannot dump. No dump device defined.

Automatic reboot in 15 seconds - press a key on the console to abort

Rebooting...

Resetting ...

Sun Enterprise 220R (2 X UltraSPARC-II 450MHz), No Keyboard

OpenBoot 3.31, 2048 MB memory installed, Serial #50400413.

Ethernet address 0:3:ba:1:c:9d, Host ID: 83010c9d.

Debug SYmbole waeren vermutlich eine gute Idee...und das Dump Device...

Das Logfile dazu sieht folgendermassen aus:

[bis hier ist noch nichts spannendes passiert, der named wurde
gestartet, der ppp hat eine Verbindung aufgemacht...]

Jan 6 11:22:08 server kernel: Starting mysql.
Jan 6 11:22:09 server kernel: Starting cyrus_imapd.
Jan 6 11:22:09 server master[1062]: process started
Jan 6 11:22:09 server master[1069]: about to exec
/usr/local/cyrus/bin/ctl_cyrusdb
Jan 6 11:22:09 server ctl_cyrusdb[1069]: recovering cyrus databases
Jan 6 11:22:09 server kernel: Starting sshd.
Jan 6 11:22:09 server sshd[1089]: Server listening on :: port 22.
Jan 6 11:22:09 server sshd[1089]: Server listening on 0.0.0.0 port 22.
Jan 6 11:22:09 server kernel: Starting sendmail.
Jan 6 12:14:55 server syslogd: restart
Jan 6 12:14:55 server syslogd: kernel boot file is /boot/kernel/kernel

Spontan wuerde ich auf ein Problem mit dem Speicher der SUN tippen. Gibt
es eine Moeglichkeit diesen irgendwie zu testen?

Dass die nach einer guten Stunde wieder problemlos arbeitet, koennte
auch auf ein Temperaturproblem hindeuten. Gibt es eine Moeglichkeit bei
einer SUN verschiedene Hardwarekomponenten dahingehend abzufragen?

Weder an der Hardware noch an dem FreeBSD wurde etwas geaendert..

Mit freundlichem Grusse,

Andreas

To Unsubscribe: send mail to majordomo(at)de.FreeBSD.org
with "unsubscribe de-bsd-questions" in the body of the message
Received on Sat 06 Jan 2007 - 13:29:30 CET

search this site