专注业务连续性支持与数据保护
2014-04-15技术合集

HP SuperDome CELL 微码同步已关闭评论

HP SuperDome CELL 微码同步

当同属于一个nPar下的CELL板IPF FW版本不一致时,启动会出现BIB BLOCK错误,导致自检无法通过,同时日志出现如下提示:

1
2
3
4
5
6
7
8
9
10
11
12
13
16698 SFW  0,0,0 *5 a38001f900e00fed 0000000000000000 RESET_FOR_RECONFIG
16694 SFW  0,1,0 *5 a38001f910e00fe5 0000000000000001 RESET_FOR_RECONFIG
16690 SFW  0,7,0 *5 a38001f970e00fdd 0000000000000007 RESET_FOR_RECONFIG
16686 SFW  0,2,0 *5 a38001f920e00fd5 0000000000000002 RESET_FOR_RECONFIG
16682 SFW  0,3,0 *5 a38001f930e00fcd 0000000000000003 RESET_FOR_RECONFIG
16678 SFW  0,5,0 *5 a38001f950e00fc5 0000000000000005 RESET_FOR_RECONFIG
16674 SFW  0,1,0 *5 ae800c8710e00fbd 0000000000000000 PD_INCOMPATIBLE_FW_REVS
16673 SFW  0,0,0 *5 ae800c8700e00fbb 0000000000000001 PD_INCOMPATIBLE_FW_REVS
16672 SFW  0,2,0 *5 ae800c8720e00fb9 0000000000000000 PD_INCOMPATIBLE_FW_REVS
16671 SFW  0,7,0 *5 ae800c8770e00fb7 0000000000000000 PD_INCOMPATIBLE_FW_REVS
16670 SFW  0,3,0 *5 ae800c8730e00fb5 0000000000000000 PD_INCOMPATIBLE_FW_REVS
16670 SFW  0,3,0 *5 ae800c8730e00fb5 0000000000000000 PD_INCOMPATIBLE_FW_REVS
16669 SFW  0,5,0 *5 ae800c8750e00fb3 0000000000000000 PD_INCOMPATIBLE_FW_REVS

进入MP-CM-sysrev查看版本
继续阅读 »

2014-03-15技术合集

Cacti和Nagios因目录占满引起的问题已关闭评论

Cacti和Nagios因目录占满引起的问题

进行系统 “df -lh” 发现根分区100%。伴随cacti无图像,nagios无数据。

检查数据库日志发现很多:
[ERROR] /usr/libexec/mysqld: Table ‘./xxx’ is marked as crashed and should be repaired

检查Cacti日志发现很多,文件增长迅速:
CMDPHP: Poller[0] ERROR: SQL Assoc Failed!, Error:’1017′,

先修复mysql文件:
myisamchk -c -r /var/lib/mysql/cacti/host_snmp_cache.MYI
myisamchk -c -r /var/lib/mysql/nagios/nagios_logentries.MYI

针对cacti日志出现表查询报错进行修复:
mysql> truncate table poller_output;

之后查询Cacti日志发现如下报错:
CMDPHP: Poller[0] ERROR: SQL Cell Failed!, Error:’2006′

进行cacti控制台,点击“Console-System Utilities-Rebuild Poller Cache” 重建缓存。

Cacti和Nagios恢复正常。

2014-02-26技术合集

一则关于SAN Switch奇怪的故障已关闭评论

一则关于SAN Switch奇怪的故障

SAN Switch设备型号:EMC DS-4700M
存储型号:HITACHI VSP
主机型号:HP SuperDome

故障描述:

按照端口进行zone划分后,存储端无法扫描到主机wwn,之后检查光纤交换机日志,无异常。

因为交换机端口速率为2Gb,存储端口为8Gb,推测在速率协商时出现问题,之后通过将端口模式从自协商改为固定速率,观察状况,仍无效。

之后尝试在存储端手动增加wwn进行磁盘Mapping,但是无效。

之后尝试通过wwn进行划分zone,生效。

总结:

基于端口划分zone是交换机的基本功能,很难解释。

2014-01-17生活琐记

糟心的Lotus Notes已关闭评论

糟心的Lotus Notes

IBM Lotus Notes 是我使用过的最糟糕的邮件系统,没有之一。

最近更新到Lotus Notes 8.5.3后,发现D盘多了一目录:D:\My Documents\SametimeTranscripts,从名字上推断应该是Lotus Notes的Sametime组件的保存聊天记录或者脚本的目录,于是就去配置里修改,成功应用配置。于是,删除目录,重启,然后,然后还是出现了这个目录。

对于我一个处女座的人,这点是不能忍受的。

于是查官方文档,看到com.ibm.collaboration.realtime.chat.logging中有个字段定义了这个目录,好吧,修改这个字段,完成。

删除目录,重启,然后,然后还是出现了这个目录。

想到去看看注册表里是不是有定义,搜索,木有。
继续阅读 »

2014-01-08技术合集

HP-UX操作系统备份已关闭评论

HP-UX操作系统备份

make_tape_recovery命令的格式和简单说明

#/opt/ignite/bin/make_tape_recovery [ -AprvC] [-d destination] [-b boot_destination] –make_tape_recovery命令所在的位置

-A : 指定要备份整个root disk /

-p : 预览备份过程,并不创建 Bootable DDS Tape

a. 确认 /var/opt/ignite/recovery/mkrec.append 文件

b. 创建 /var/opt/ignite/recovery/arch.include 文件
继续阅读 »

2013-12-08技术合集

LINKLOOP二层协议连通性测试已关闭评论

LINKLOOP二层协议连通性测试

linkloop命令是HP-UX独有的命令,所以在使用的时候对方也一定要是HP-UX主机
该命令是用来测试第二层(网卡MAC地址)的连通性,该命令是向远程HP-UX主机发送一个测试包来测试MAC地址的连通性.
用法:
linkloop 0x00248177E460 #0x00248177E460为远程主机的MAC地址.
该命令要在root用户下才能执行.
-n 设置传输的桢数
-i 指定使用哪个PPA(physical poit of attachment ,指示网卡的唯一标志)默认使用它在内部数据结构中遇到的第一个PPA(lan0)
-t 以秒为单位,指定等到(超时)时间
-s 指定数据报的大小
-v 详细信息选项
继续阅读 »

2013-11-17技术合集

关于VIP的奇怪故障一例已关闭评论

关于VIP的奇怪故障一例

OS:HP-UX 11.31 DB:Oracle 10.0.2.5 RAC (2nodes)

问题描述:节点二在服务启动后,经过1分钟至3分钟会发生VIP导致的服务切换,切换后节点二VIP在节点一上启动,检查日志主要报错为:

1
Invalid parameters, or failed to bring up VIP (host=essrzc2)

经过检查,系统网关配置正常,响应速度正常,系统日志无异常。开启服务debug后收获如下日志:

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
2013-12-16 14:57:08.192: [    RACG][1] [21721][1][ora.essrzc2.vip]: Mon Dec 16 14:57:03 EAT 2013 [ 21730 ] Checking interface existance
Mon Dec 16 14:57:03 EAT 2013 [ 21730 ] Calling getifbyip
Mon Dec 16 14:57:03 EAT 2013 [ 21730 ] getifbyip:  started for 132.42.37.144
 
2013-12-16 14:57:08.192: [    RACG][1] [21721][1][ora.essrzc2.vip]: Mon Dec 16 14:57:03 EAT 2013 [ 21730 ] Completed getifbyip lan900:801
Mon Dec 16 14:57:03 EAT 2013 [ 21730 ] Completed with initial interface test
Mon Dec 16 14:57:03 EAT 2013 [ 21730 ] Broadcast = 132.42.37.255
<span id="more-707"></span>
2013-12-16 14:57:08.192: [    RACG][1] [21721][1][ora.essrzc2.vip]: Mon Dec 16 14:57:03 EAT 2013 [ 21730 ] checkIf: start for if=lan900
Mon Dec 16 14:57:07 EAT 2013 [ 21730 ] checkIf: RX packets checked if=lan900 ok
Mon Dec 16 14:57:07 EAT 2013 [ 21730 ] checkIf: end for if=lan900
 
2013-12-16 14:57:08.192: [    RACG][1] [21721][1][ora.essrzc2.vip]: Mon Dec 16 14:57:07 EAT 2013 [ 21730 ] main: interface is okay, exit success
 
2013-12-16 14:57:08.192: [    RACG][1] [21721][1][ora.essrzc2.vip]: clsrcexecut: env ORACLE_CONFIG_HOME=/oracle/product/10.2/crs
 
2013-12-16 14:57:08.192: [    RACG][1] [21721][1][ora.essrzc2.vip]: clsrcexecut: cmd = /oracle/product/10.2/crs/bin/racgeut -e _USR_ORA_DEBUG=5 54 /oracle/product/10.2/crs/bin/racgvip check essrzc2
 
2013-12-16 14:57:08.192: [    RACG][1] [21721][1][ora.essrzc2.vip]: clsrcexecut: rc = 0, time = 4.624s
 
2013-12-16 14:57:08.192: [    RACG][1] [21721][1][ora.essrzc2.vip]: clsrcaction: restyp=0 act_typ=2 stat=0
 
2013-12-16 14:57:08.192: [    RACG][1] [21721][1][ora.essrzc2.vip]: clsrcaction: return CLSR_CAA_SUCCESS
 
2013-12-16 14:57:08.192: [    RACG][1] [21721][1][ora.essrzc2.vip]: clsrcaction:  init 0.000s
 
2013-12-16 14:57:08.192: [    RACG][1] [21721][1][ora.essrzc2.vip]: clsrcaction:  action ok, 4.653s
 
2013-12-16 14:57:08.192: [    RACG][1] [21721][1][ora.essrzc2.vip]: clsrcaction:  post 0.000s
 
2013-12-16 14:57:08.192: [    RACG][1] [21721][1][ora.essrzc2.vip]: clsrcaction: all 4.653s
 
2013-12-16 14:57:08.192: [    RACG][1] [21721][1][ora.essrzc2.vip]: end for resource = ora.essrzc2.vip, action = check, status = 0, time = 4.692s
 
2013-12-16 14:57:08.192: [    RACG][1] [21721][1][ora.essrzc2.vip]: clsrccln: exiting ora.essrzc2.vip refcount=1
 
2013-12-16 14:57:08.192: [    RACG][1] [21721][1][ora.essrzc2.vip]: clsrcprsrgter: gctx-&gt;prsrcfgref_clsrcgctx = 0
2013-12-16 14:57:38.420: [    RACG][1] [21974][1][ora.essrzc2.vip]: clsrcprsrgini: gctx-&gt;prsrcfgref_clsrcgctx = 0
2013-12-16 14:57:38.420: [    RACG][1] [21974][1][ora.essrzc2.vip]: clsrcini_ext: starting ora.essrzc2.vip refcount=1 global
 
2013-12-16 14:57:38.421: [    RACG][1] [21974][1][ora.essrzc2.vip]: begin for resource = ora.essrzc2.vip, action = check
 
2013-12-16 14:57:38.428: [    RACG][1] [21974][1][ora.essrzc2.vip]: clsrccssgetctx: all 0.007s
 
2013-12-16 14:57:38.436: [    RACG][1] [21974][1][ora.essrzc2.vip]: clsrcgetprsrctx: all 0.015s
 
2013-12-16 14:57:38.448: [    RACG][1] [21974][1][ora.essrzc2.vip]: clsrcnodeapp: prsr num_env = 0
 
2013-12-16 14:57:38.449: [    RACG][1] [21974][1][ora.essrzc2.vip]: clsrcnodeapp: setting ORACLE_CONFIG_HOME=/oracle/product/10.2/crs
 
2013-12-16 14:57:43.181: [    RACG][1] [21974][1][ora.essrzc2.vip]: Unable to get IPv6 interface information.
Permission denied to 127.0.0.1
cmviewcl: Cannot view the cluster configuration: Permission denied.
This user doesn't have access to view the cluster configuration.
 
2013-12-16 14:57:43.181: [    RACG][1] [21974][1][ora.essrzc2.vip]: Mon Dec 16 14:57:38 EAT 2013 [ 21979 ] Checking interface existance
Mon Dec 16 14:57:38 EAT 2013 [ 21979 ] Calling getifbyip
Mon Dec 16 14:57:38 EAT 2013 [ 21979 ] getifbyip:  started for 132.42.37.144
 
2013-12-16 14:57:43.181: [    RACG][1] [21974][1][ora.essrzc2.vip]: Mon Dec 16 14:57:38 EAT 2013 [ 21979 ] Completed getifbyip lan900:801
Mon Dec 16 14:57:38 EAT 2013 [ 21979 ] Completed with initial interface test
Mon Dec 16 14:57:38 EAT 2013 [ 21979 ] Broadcast = 132.42.37.255
 
2013-12-16 14:57:43.181: [    RACG][1] [21974][1][ora.essrzc2.vip]: Mon Dec 16 14:57:38 EAT 2013 [ 21979 ] checkIf: start for if=lan900
Mon Dec 16 14:57:42 EAT 2013 [ 21979 ] checkIf: RX packets checked if=lan900 failed
Interface lan900 checked failed (host=essrzc2)
 
2013-12-16 14:57:43.181: [    RACG][1] [21974][1][ora.essrzc2.vip]: Mon Dec 16 14:57:42 EAT 2013 [ 21979 ] checkIf: end for if=lan900
Mon Dec 16 14:57:42 EAT 2013 [ 21979 ] Performing CRS_STAT testing
Mon Dec 16 14:57:42 EAT 2013 [ 21979 ] Completed CRS_STAT testing
 
2013-12-16 14:57:43.181: [    RACG][1] [21974][1][ora.essrzc2.vip]: Mon Dec 16 14:57:43 EAT 2013 [ 21979 ] Completed second gateway test
Mon Dec 16 14:57:43 EAT 2013 [ 21979 ] Interface tests
Mon Dec 16 14:57:43 EAT 2013 [ 21979 ] DEBUG: FAIL_WHEN_ALL_LINK_DOWN = 1 and IF_USING = lan900
 
2013-12-16 14:57:43.181: [    RACG][1] [21974][1][ora.essrzc2.vip]: Invalid parameters, or failed to bring up VIP (host=essrzc2)
 
2013-12-16 14:57:43.181: [    RACG][1] [21974][1][ora.essrzc2.vip]: clsrcexecut: env ORACLE_CONFIG_HOME=/oracle/product/10.2/crs
 
2013-12-16 14:57:43.181: [    RACG][1] [21974][1][ora.essrzc2.vip]: clsrcexecut: cmd = /oracle/product/10.2/crs/bin/racgeut -e _USR_ORA_DEBUG=5 54 /oracle/product/10.2/crs/bin/racgvip check essrzc2
 
2013-12-16 14:57:43.181: [    RACG][1] [21974][1][ora.essrzc2.vip]: clsrcexecut: rc = 1, time = 4.731s
 
2013-12-16 14:57:43.182: [    RACG][1] [21974][1][ora.essrzc2.vip]: clsrcaction: restyp=0 act_typ=2 stat=1
 
2013-12-16 14:57:43.182: [    RACG][1] [21974][1][ora.essrzc2.vip]: clsrcaction:  init 0.000s
 
2013-12-16 14:57:43.182: [    RACG][1] [21974][1][ora.essrzc2.vip]: clsrcaction:  action failed, 4.761s
 
2013-12-16 14:57:43.182: [    RACG][1] [21974][1][ora.essrzc2.vip]: clsrcaction:  post 0.000s
 
2013-12-16 14:57:43.182: [    RACG][1] [21974][1][ora.essrzc2.vip]: clsrcaction: all 4.761s
 
2013-12-16 14:57:43.182: [    RACG][1] [21974][1][ora.essrzc2.vip]: end for resource = ora.essrzc2.vip, action = check, status = 1, time = 4.801s
 
2013-12-16 14:57:43.182: [    RACG][1] [21974][1][ora.essrzc2.vip]: clsrccln: exiting ora.essrzc2.vip refcount=1
 
2013-12-16 14:57:43.182: [    RACG][1] [21974][1][ora.essrzc2.vip]: clsrcprsrgter: gctx-&gt;prsrcfgref_clsrcgctx = 0
2013-12-16 14:57:43.434: [    RACG][1] [22058][1][ora.essrzc2.vip]: clsrcprsrgini: gctx-&gt;prsrcfgref_clsrcgctx = 0
2013-12-16 14:57:43.434: [    RACG][1] [22058][1][ora.essrzc2.vip]: clsrcini_ext: starting ora.essrzc2.vip refcount=1 global
 
2013-12-16 14:57:43.435: [    RACG][1] [22058][1][ora.essrzc2.vip]: begin for resource = ora.essrzc2.vip, action = stop
 
2013-12-16 14:57:43.466: [    RACG][1] [22058][1][ora.essrzc2.vip]: clsrccssgetnodename: all 0.030s
 
2013-12-16 14:57:43.466: [    RACG][1] [22058][1][ora.essrzc2.vip]: clsrcnodeapps: calling FAILSRVSA
2013-12-16 14:57:43.483: [    RACG][1] [22058][1][ora.essrzc2.vip]: clsrcrundetach: cmd = /oracle/product/10.2/crs/bin/racgmain ora.essrzc2.vip rundetach  1 failsrvsa essrzc2, rc = 0, time = 0.016s
 
2013-12-16 14:57:43.483: [    RACG][1] [22058][1][ora.essrzc2.vip]: clsrcnodeapps: Posting PNWDOWN_EVENT
 
2013-12-16 14:57:43.484: [    RACG][1] [22058][1][ora.essrzc2.vip]: clsrccssgetnodename: all 0.000s
 
2013-12-16 14:57:43.484: [    RACG][1] [22058][1][ora.essrzc2.vip]: clsrccssgetnodename: all 0.000s
 
2013-12-16 14:57:43.484: [    RACG][1] [22058][1][ora.essrzc2.vip]: clsrcssgetrhost: using cached local host name
 
2013-12-16 14:57:43.486: [    RACG][1] [22058][1][ora.essrzc2.vip]: clsrccssgetlhost: all 0.002s
 
2013-12-16 14:57:43.493: [    RACG][1] [22058][1][ora.essrzc2.vip]: clsrcpostevt: EvmEventPost 1 0.001s
 
2013-12-16 14:57:43.493: [    RACG][1] [22058][1][ora.essrzc2.vip]: clsrcposthaevt: forward to EVM