Exadata shows error on kfod

The following is an example of a configuration issue on Exadata.

kfod crashes with a segmentation fault.

Dump file /oraTB/base/diag/kfod/user_oracle/host_1620649457_93/incident/incdir_81/ora_95934_140163329665856_i81.trc
[TOC00000]
Jump to table of contents
Dump continued from file: /oraTB/base/diag/kfod/user_oracle/host_1620649457_93/trace/ora_95934_140163329665856.trc
[TOC00001]
oci-24550 [11] [[si_signo=11] [si_errno=0] [si_code=1] [si_int=1364841332] [si_ptr=0x7f7a5159d374] [si_addr=0x18]] [] [] [] [] [] [] [] [] [] []
[TOC00001-END]
[TOC00002]
========= Dump for incident 81 (oci 24550) ========
Starting a Diag Context default dump (level=3)

----- Incident Context Dump -----
Address: 0x7fff8c1392f0
Incident ID: 81
Problem Key: oci 24550
Error: oci-24550 [11] [[si_signo=11] [si_errno=0] [si_code=1] [si_int=1364841332] [si_ptr=0x7f7a5159d374] [si_addr=0x18]] [] [] [] [] [] [] [] [] [] []
[00]: dbgePostErrorDirectVaList_int [diag_dde]
[01]: dbgePostErrorDirect [diag_dde]
[02]: kpeDbgSignalHandler []<-- Signaling
[03]: skgesig_sigactionHandler []
[04]: __sighandler []
[05]: kgfdTerm [KGF]
[06]: kfodddTerm []
[07]: kfodddInit []
[08]: kfodExecute []
[09]: kfod_main []
[10]: lpmcall []
[11]: lpmpmai []
[12]: main []
[13]: __libc_start_main []

Check the kfod trace file:
Trace file /oraTB/base/diag/kfod/user_oracle/host_1620649457_93/trace/ora_95934_140163329665856.trc
OSSIPC:SKGXP:[1378fb0.0]{0}: ifconfig_int(3047a8c0) failed with 0.
OSSIPC:SKGXP:[1378fb0.1]{0}: skgxpcini: Invalid IP address [192.168.71.48] provided
OS system dependent operation:if_not_found failed with status: 0
OS failure message: Error 0
failure occurred at: skgxpvaddr9
additional information: requested interface 192.1xx.71.48 not found. Check output from ifconfig command
DDE: Flood control is not active
2015-06-22T22:39:25.137052+18:00
Incident 81 created, dump file: /oraTB/base/diag/kfod/user_oracle/host_1620649457_93/incident/incdir_81/ora_95934_140163329665856_i81.trc
oci-24550 [11] [[si_signo=11] [si_errno=0] [si_code=1] [si_int=1364841332] [si_ptr=0x7f7a5159d374] [si_addr=0x18]] [] [] [] [] [] [] [] [] [] []
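The useful line in this trace is the SKGXP message naming the rejected address. As a rough sketch (the function name is made up for illustration, and the pattern only covers the exact message wording shown above), the address can be pulled out of a trace file like this:

```python
import re

# Matches the SKGXP message as it appears in the trace above.
INVALID_IP = re.compile(r"skgxpcini: Invalid IP address \[([0-9.]+)\] provided")


def rejected_addresses(trace_text):
    """Return the IP addresses SKGXP reported as invalid, in order, without duplicates."""
    seen = []
    for match in INVALID_IP.finditer(trace_text):
        ip = match.group(1)
        if ip not in seen:
            seen.append(ip)
    return seen
```

Reading the trace file into a string and passing it to `rejected_addresses` gives the address to compare with the node's real interface configuration.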

 

Fix:
The address 192.168.71.48 belongs to the first node; this node's address should be 192.168.71.49.

Change the address in the following cellinit.ora entry so that it ends in .49:

#cat /etc/oracle/cell/network-config/cellinit.ora
ipaddress1=192.168.71.48/21
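To catch this kind of mismatch before kfod trips over it, one could compare the addresses in cellinit.ora with the addresses actually configured on the host. The following is only a sketch: the helper names are invented for illustration, and it assumes a Linux host where `ip -o -4 addr show` is available.

```python
import ipaddress
import subprocess


def cellinit_addresses(text):
    """Parse ipaddressN=<ip>/<prefix> lines from cellinit.ora content."""
    result = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, value = line.split("=", 1)
        if key.strip().lower().startswith("ipaddress"):
            result[key.strip()] = ipaddress.ip_interface(value.strip())
    return result


def local_ipv4_addresses():
    """Collect this host's IPv4 addresses via `ip -o -4 addr show` (assumes iproute2)."""
    out = subprocess.run(["ip", "-o", "-4", "addr", "show"],
                         capture_output=True, text=True, check=True).stdout
    addrs = set()
    for line in out.splitlines():
        fields = line.split()
        # Each line looks like: "<idx>: <ifname> inet <ip>/<prefix> ..."
        if "inet" in fields:
            addrs.add(ipaddress.ip_interface(fields[fields.index("inet") + 1]).ip)
    return addrs


def not_on_this_host(cellinit_text, local_ips):
    """Return the cellinit.ora entries whose address is not configured locally."""
    return {key: iface for key, iface in cellinit_addresses(cellinit_text).items()
            if iface.ip not in local_ips}
```

On the affected node, `not_on_this_host(open("/etc/oracle/cell/network-config/cellinit.ora").read(), local_ipv4_addresses())` would be expected to flag `ipaddress1`, because 192.168.71.48 lives on the other node.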

ASMLib support on RedHat EL6

Oracle recently added ASMLib support for the RHEL 6 kernel.

It seems some policies have changed.

If you are interested, see:

http://www.oracle.com/technetwork/server-storage/linux/asmlib/rhel6-1940776.html

To enable ASMLib support, download the required files from the Oracle website

and get 'kmod-oracleasm' from Red Hat RHN or your related support channel.

Good luck.

Use kfed to read ASM disk metadata

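It has been a while since my last update, so I am squeezing in another post.
kfed supports read and merge operations, which lets you display ASM disk metadata in a readable form.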
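As a small illustration, the sketch below runs `kfed read` against a device and turns the output into a dictionary. It assumes the usual `name: value ; offset: detail` line layout of kfed output, which may differ between versions; the function names are made up for this example, and the device path is whatever disk you want to inspect.

```python
import subprocess


def parse_kfed_output(text):
    """Turn `kfed read` output into {field_name: (value, detail)}."""
    fields = {}
    for line in text.splitlines():
        if ":" not in line or ";" not in line:
            continue  # skip blank lines and anything that is not a field line
        name, rest = line.split(":", 1)
        value, _, tail = rest.partition(";")
        # tail looks like " 0x002: KFBTYP_DISKHEAD"; keep the part after the offset
        detail = tail.split(":", 1)[1].strip() if ":" in tail else tail.strip()
        fields[name.strip()] = (value.strip(), detail)
    return fields


def kfed_read(device):
    """Run `kfed read <device>`; kfed must be on PATH and the device readable."""
    out = subprocess.run(["kfed", "read", device],
                         capture_output=True, text=True, check=True).stdout
    return parse_kfed_output(out)
```

With the fields in a dictionary it is easy to look at entries such as `kfbh.type` or `kfdhdb.grpname` without scrolling through the whole dump.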

Building good habits is still worth it.

ERROR: no PST quorum in group: required 2, found 0

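Today I made a small mistake on Exadata and spent the whole morning on it.
Several disk groups could not be mounted, and the error output was hard to interpret. It looked like a missing disk, but it did not say which disk,
and MOUNT FORCE did not help either.

After a long search I found that the discovery_string had been accidentally changed when I restarted the instance.
It was a value I had used earlier as a filter when creating OCRVD, because there were too many disk groups, so I had to change it back.

After that the mount went through without any problems. Small details really cannot be ignored.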
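To see why a leftover filter hides disks, here is a rough model of discovery-string matching. It is only an approximation using Python's `fnmatch` wildcards, not ASM's actual matching code, and the disk names and both patterns are hypothetical.

```python
from fnmatch import fnmatchcase


def discovered(disk_paths, discovery_string):
    """Return the paths matched by any comma-separated pattern in the discovery string."""
    patterns = [p.strip() for p in discovery_string.split(",") if p.strip()]
    return [d for d in disk_paths if any(fnmatchcase(d, p) for p in patterns)]


# Hypothetical Exadata-style grid disk names, for illustration only.
disks = [
    "o/192.168.10.1/OCRVD_CD_00_cell01",
    "o/192.168.10.1/DATA_CD_00_cell01",
    "o/192.168.10.2/DATA_CD_00_cell02",
]

narrow = discovered(disks, "o/*/OCRVD*")  # filter left over from creating OCRVD
full = discovered(disks, "o/*/*")         # pattern that sees every grid disk
```

With the narrow pattern only the OCRVD disk is visible, so the disks of every other group are never discovered and those groups cannot mount, however the mount is forced.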