Exadata遭遇ORA-27603和ORA-27626
2012-10-17 17:37
260 查看
今天CRM反应说他们有个作业今天没跑出来报错了,一查发现Exadata的节点2上的alert日志里面在同一时间点果然报错了:
WedOct1706:22:512012
Errorsinfile/u01/app/oracle/diag/rdbms/srcbfin/SRCBFIN2/trace/SRCBFIN2_arc2_72209.trc:
ORA-27603:CellstorageI/Oerror,I/Ofailedondisko/172.11.211.9/DATA_DM01_CD_01_dm01cel01atoffset464519168fordatalength1048576
ORA-27626:Exadataerror:201(GenericI/Oerror)
WARNING:ReadFailed.group:1disk:25AU:110offset:3145728size:1048576
WARNING:failedtoreadmirrorside1ofvirtualextent29logicalextent0offile266ingroup[1.2063103479]fromdiskDATA_DM01_CD_01_DM01CEL01allocationunit110reasonerror;ifpossible,willtryanothermirrorside
NOTE:successfullyreadmirrorside2ofvirtualextent29logicalextent1offile266ingroup[1.2063103479]fromdiskDATA_DM01_CD_10_DM01CEL03allocationunit113
相应的那个trace日志文件里面也是只用相同的寥寥几行:
***2012-10-1706:22:51.655
ORA-27626:Exadataerror:201(GenericI/Oerror)
WARNING:ReadFailed.group:1disk:25AU:110offset:3145728size:1048576
path:o/172.11.211.9/DATA_DM01_CD_01_dm01cel01
incarnation:0xe9688586asynchronousresult:'I/Oerror'
subsys:OSSiop:0x2b58752a2680bufp:0x2b5879517000osderr:0xc9osderr1:0x0
Exadataerror:'GenericI/Oerror'
IOelapsedtime:12334426usecTimewaitedonI/O:12334426usec
WARNING:failedtoreadmirrorside1ofvirtualextent29logicalextent0offile266ingroup[1.2063103479]fromdiskDATA_DM01_CD_01_DM01CEL01allocationunit110reasonerror;ifpossible,willtryanothermirrorside
NOTE:successfullyreadmirrorside2ofvirtualextent29logicalextent1offile266ingroup[1.2063103479]fromdiskDATA_DM01_CD_10_DM01CEL03allocationunit113
在MOS上查了一下发现这可能是Exadata的一个Bug,无语了,上线两个月各种Bug各种宕机,官方还吹嘘的那么牛叉!!!
Thecontentwaslastupdatedon:17-JUN-2011
Clickherefordetailsofeachofthesectionsbelow.
Bug:8782572(ThislinkwillonlyworkforPUBLISHEDbugs)
Note:245840.1Informationonthesectionsinthisarticle
这个Bug可以早到相应的Patch:
另一篇文章,不过好像不是这么一回事,应该是个Bug,不过还是把文章附上:
Informationinthisdocumentappliestoanyplatform.
Errorsinfile/u01/app/oracle/diag/rdbms/test/test2/trace/test2_ora_22485.trc(incident=242913):
ORA-00600:internalerrorcode,arguments:[kssadpm1],[],[],[],[],[],[],[],[],[],[],[]
Incidentdetailsin:/u01/app/oracle/diag/rdbms/test/test2/incident/incdir_242913/test2_ora_22485_i242913.trc
Errorsinfile/u01/app/oracle/diag/rdbms/test/test2/trace/test2_ora_22485.trc(incident=242914):
ORA-00600:internalerrorcode,arguments:[kfddsGet03],[56861],[],[],[],[],[],[],[],[],[],[]
Incidentdetailsin:/u01/app/oracle/diag/rdbms/test/test2/incident/incdir_242914/test2_ora_22485_i242914.trc
ORA-600[kssadpm1]israisedbecauseofbug9750033
Theseconderror,ORA-600[kfddsGet03],iscausedbyORA-600[kssadpm1].
Wecanconcludeonthebug9750033basedonthefollowingcriteria:
1.Callstackmatchesasfollows:
kssadpm
ksz_gen_reid
kfddsGet
kfioTranslateIO
kfioRqSetPrepare
2.Problematicstateobjectis'kszparent'
Thiscanbeverifiedfromthetracefile.
Example:
SO:0x6cbc55f40,type:22,owner:(nil),flag:-/FLST/-/0x00if:0x0c:0x0
proc=(nil),name=kszparent,file=ksz2.hLINE:394,pg=0
Dumpofmemoryfrom0x00000006CBC55F40to0x00000006CBC55F98
Onlytheprocessencounteringtheerrorwillbeterminated.Alsotheerrorisduringnormalserverprocessexit.So,theimpactisveryminimal.
Fixincludedin11.2.0.1BP12.
Otheroptionistoupgradeto11.2.0.2orabovewhichincludesthefix.
WedOct1706:22:512012
Errorsinfile/u01/app/oracle/diag/rdbms/srcbfin/SRCBFIN2/trace/SRCBFIN2_arc2_72209.trc:
ORA-27603:CellstorageI/Oerror,I/Ofailedondisko/172.11.211.9/DATA_DM01_CD_01_dm01cel01atoffset464519168fordatalength1048576
ORA-27626:Exadataerror:201(GenericI/Oerror)
WARNING:ReadFailed.group:1disk:25AU:110offset:3145728size:1048576
WARNING:failedtoreadmirrorside1ofvirtualextent29logicalextent0offile266ingroup[1.2063103479]fromdiskDATA_DM01_CD_01_DM01CEL01allocationunit110reasonerror;ifpossible,willtryanothermirrorside
NOTE:successfullyreadmirrorside2ofvirtualextent29logicalextent1offile266ingroup[1.2063103479]fromdiskDATA_DM01_CD_10_DM01CEL03allocationunit113
相应的那个trace日志文件里面也是只用相同的寥寥几行:
***2012-10-1706:22:51.655
ORA-27626:Exadataerror:201(GenericI/Oerror)
WARNING:ReadFailed.group:1disk:25AU:110offset:3145728size:1048576
path:o/172.11.211.9/DATA_DM01_CD_01_dm01cel01
incarnation:0xe9688586asynchronousresult:'I/Oerror'
subsys:OSSiop:0x2b58752a2680bufp:0x2b5879517000osderr:0xc9osderr1:0x0
Exadataerror:'GenericI/Oerror'
IOelapsedtime:12334426usecTimewaitedonI/O:12334426usec
WARNING:failedtoreadmirrorside1ofvirtualextent29logicalextent0offile266ingroup[1.2063103479]fromdiskDATA_DM01_CD_01_DM01CEL01allocationunit110reasonerror;ifpossible,willtryanothermirrorside
NOTE:successfullyreadmirrorside2ofvirtualextent29logicalextent1offile266ingroup[1.2063103479]fromdiskDATA_DM01_CD_10_DM01CEL03allocationunit113
在MOS上查了一下发现这可能是Exadata的一个Bug,无语了,上线两个月各种Bug各种宕机,官方还吹嘘的那么牛叉!!!
Bug8782572ARCHfencedinASMdiskscausinginternalerrorsatshutdown
Thisnotegivesabriefoverviewofbug8782572.Thecontentwaslastupdatedon:17-JUN-2011
Click
Affects:
Product(Component) | OracleServer(Rdbms) |
Rangeofversionsbelievedtobeaffected | VersionsBELOW12.1 |
Versionsconfirmedasbeingaffected | |
Platformsaffected | Generic(all/mostplatformsaffected) |
Fixed:
Thisissueisfixedin | |
Symptoms: | RelatedTo: |
ORA-27603 Dumpinorunder |
Description
ThisbugcausesARCHprocessestokeepissuingIO's(orarchivelogs)
evenafteranRDBMSinstancehasbeendismounted.SuchIO'sare
fencedoffinASMdisksafterinstanceisnolongerpartofthecluster
(i.e.afterdismount),andtheASMdiskgroupcanbedismounted
asaresultoftheseIOs.
HereisanexampleexcerptfromalertlogshowingtheIOerrorsduetofence:
ORA-27603:CellstorageI/Oerror,I/Ofailedondisko/<IPAddress>/<ASMDisk>atoffset<offset#>fordatalength<length>WARNING:IOFailed.group:<group#>disk(number.incarnation):<number.inc>disk_path:o/<IPAddress>/<ASMdisk>
AU:<AU>disk_offset(bytes):<bytes>io_size:<IOsize>operation:Readtype:asynchronous
result:I/Oerrorprocess_id:<pid>
Exadataerror:221(I/Orequestfenced)
Anotherexamplefromacellalertlog:
Information:CellsrvcancelingOSSMSG_COMMAND_BREADrequestfromhost
xxxx[pid:<pid>]forfencing,sendport<port#>openfd2
Pleasenote:Theaboveisasummarydescriptiononly.Actualsymptomscanvary.Matchingtoanysymptomsheredoesnotconfirmthatyouareencounteringthisproblem.ForquestionsaboutthisbugpleaseconsultOracleSupport. |
References
这个Bug可以早到相应的Patch:
另一篇文章,不过好像不是这么一回事,应该是个Bug,不过还是把文章附上:
Exadata/Rac-Ora-27603:CellStorageI/OError[ID1445223.1] |
修改时间:2012-5-30 类型:PROBLEM 状态:PUBLISHED 优先级:3 |
|
Appliesto:
OracleExadataHardware-Version11.2.0.1to11.2.0.1[Release11.2]Informationinthisdocumentappliestoanyplatform.
Symptoms
Gettingfollowingerrorsintheexadataenvironmentrunning11.2.0.1database:Errorsinfile/u01/app/oracle/diag/rdbms/test/test2/trace/test2_ora_22485.trc(incident=242913):
ORA-00600:internalerrorcode,arguments:[kssadpm1],[],[],[],[],[],[],[],[],[],[],[]
Incidentdetailsin:/u01/app/oracle/diag/rdbms/test/test2/incident/incdir_242913/test2_ora_22485_i242913.trc
Errorsinfile/u01/app/oracle/diag/rdbms/test/test2/trace/test2_ora_22485.trc(incident=242914):
ORA-00600:internalerrorcode,arguments:[kfddsGet03],[56861],[],[],[],[],[],[],[],[],[],[]
Incidentdetailsin:/u01/app/oracle/diag/rdbms/test/test2/incident/incdir_242914/test2_ora_22485_i242914.trc
Changes
NorecentchangesCause
IOsarefencedevenbeforethetxnstateobjectgetsdeletedwhichneedstoperformsIOsinordertodothetxnrollback.Thisiscausingtheerror.ORA-600[kssadpm1]israisedbecauseofbug9750033
Theseconderror,ORA-600[kfddsGet03],iscausedbyORA-600[kssadpm1].
Wecanconcludeonthebug9750033basedonthefollowingcriteria:
1.Callstackmatchesasfollows:
kssadpm
ksz_gen_reid
kfddsGet
kfioTranslateIO
kfioRqSetPrepare
2.Problematicstateobjectis'kszparent'
Thiscanbeverifiedfromthetracefile.
Example:
SO:0x6cbc55f40,type:22,owner:(nil),flag:-/FLST/-/0x00if:0x0c:0x0
proc=(nil),name=kszparent,file=ksz2.hLINE:394,pg=0
Dumpofmemoryfrom0x00000006CBC55F40to0x00000006CBC55F98
Solution
Impactofthebugistheprocessfailure.Astheerrorisfromserverprocess(notbackgroundprocess),noeffectattheinstancelevelandnocorruptionaswell.Onlytheprocessencounteringtheerrorwillbeterminated.Alsotheerrorisduringnormalserverprocessexit.So,theimpactisveryminimal.
Fixincludedin11.2.0.1BP12.
Otheroptionistoupgradeto11.2.0.2orabovewhichincludesthefix.
相关文章推荐
- ORA-27603: Cell storage I/O error, I/O failed on disk ORA-27626: Exadata error:
- 数据导入时遭遇 ORA-01187 ORA-01110
- oracle 10g dataguard修改保护模式遭遇ORA-03113
- 遭遇Ora-02041:客户端数据库未启动一个事务,好在摆平了
- RMAN duplicate from active 时遭遇 ORA-17627 ORA-12154
- impdp遭遇ORA-39029、ORA-31671、ORA-06512
- 遭遇到ORA-12560: TNS: 协议适配器错误
- Oracle异常关机后启动时遭遇ORA-00600,ORA-00471
- Oracle 操作系统(外部用户)验证 登陆database,遭遇 ora-27121
- 非归档恢复遭遇ORA-01190 和 ORA-600 [krhpfh_03-1202]–恢复小记
- 数据库通过数据泵由10.2.0.5到11.2.0.4遭遇ORA-39126、ORA-01555
- imp导入数据到ORACLE遭遇ORA-12899错误
- 安装Oracle10g后在服务器上命令行下使用sqlplus遭遇ORA-12560: TNS: 协议适配器错误
- 遭遇ORA-01552错误
- 非归档恢复遭遇ORA-01190 和 ORA-600 [krhpfh_03-1202]–恢复小记
- [原]第一次遭遇Oracle的Bug,纪念一下 |ORA-00600 kmgs_pre_process_request_6|
- Exp時遭遇 EXP-00008 ORA-00942 EXP-00024 EXP-00000
- 数据导入时遭遇 ORA-01187 ORA-01110
- impdp遭遇ORA-39001、ORA-39000,ORA-39142
- RMAN还原遭遇ORA-32006&ORA-27102错误