【翻译自mos文章】在win2003 sp1下,遇到无法解释的数据库性能下降
2015-04-25 19:33
344 查看
翻译自mos文章:在win2003 sp1下,遇到无法解释的数据库性能下降
Unexplained Database Slowdown Seen on Windows 2003 Service Pack 1 (Doc ID 464683.1)
Applies to:
Oracle Server - Enterprise Edition - Version 9.2.0.1 to 11.1.0.6 [Release 9.2 to 11.1]
z*OBSOLETE: Microsoft Windows Server 2003
Microsoft Windows Itanium (64-bit)
z*OBSOLETE: Microsoft Windows XP (64-bit AMD64 and Intel EM64T)
Microsoft Windows Server 2003 (64-bit Itanium)Microsoft Windows Server 2003Microsoft Windows XP (64-bit AMD64 and Intel EM64T)
1. General slowdowns in the database, which may appear to be hung, even though systemstate dumps may not show evidence of hangs or locking contention. Sometimes, the database may recover from these slowdowns on its own, and begin responding with no user
intervention. At other times, the database may appear to be hung for long periods of time.
2. Other symptoms may include slow disk I/O - i.e. a large increase in average I/O times may be observed when reviewing AWR reports covering the time period in question.
3. It may also be observed that network latency increases dramatically. If running in a RAC environment, you may notice an increase in cluster latency - i.e. an increase in the wait times for global cache requests and messages (again, as observed in AWR
reports or in some cases as seen via real-time monitoring tools).
4. If running in a RAC environment, we may see IPC timeouts in the ipcdbg logfiles. On Oracle 10g, the ipcdbg logs are found in the $ORACLE_HOME\rdbms\log subdirectory. In Oracle 9i, the ipcdbg logs are found in C:\WINDOWS\system32\osd9i. Examples
of entries in the ipcdbg logfiles that indicate a slowdown would be entries such as the following:
5. Monitoring of system resources via Perfmon or other monitoring utilities may also reveal an increase in system time, indicating a large amount of work being done by the OS kernel.
In some cases, this problem may be diagnosed by verifying via Perfmon stats, AWR reports, and ipcdbg logs as noted above. In order to get confirmation of the problem, as noted above, systemstate dumps may not show any symptoms other than slow processing, so
it may be necessary to run adplus and take user level dumps of the oracle.exe process, in addition to collecting the normal systemstate dumps and hanganalyze dumps. Having a combination of systemsate dumps and adplus dumps of the oracle.exe process may make
this issue easier to debug, but in some cases it may also be necessary to get a kernel dump to confirm the exact cause. The OS or Hardware vendor should be consulted for assistance in obtaining a kernel dump.
In order to take an adplus dump of the oracle.exe process, at a high level, you must install the 'Debugging Tools for Windows' and then run adplus as such:
adplus -hang -p <PID for oracle.exe>
Be sure to use the correct PID corresponding to the correct oracle.exe. If you are running an ASM instance, you will most likely have at least two oracle.exe processes, but you will most likely want to run adplus and dump the oracle.exe
associated with the database instance, NOT the ASM instance
For a complete description of what adplus is and how to run it, please refer to the following Microsoft Support note.
http://support.microsoft.com/kb/286350/en-us
@ For the issue described in the alert, the Adplus dumps of the oracle process may show threads
@ kiswapcontext
@ As noted above, in some cases, it may be necessary to take a
http://support.microsoft.com/kb/919341
http://support.microsoft.com/kb/922658
This issue is known to be a problem in Windows 2003 with Service Pack 1, and can affect any application with large memory requirements, including single-instance Oracle database, Oracle RAC Databases and non-Oracle databases. It is strongly advised that
this Windows hotfix be installed proactively on all Oracle installations running on Windows 2003 with Service Pack 1, or that those installations be upgraded to Service Pack 2 proactively.
http://support.microsoft.com/kb/919341/en-us
Unexplained Database Slowdown Seen on Windows 2003 Service Pack 1 (Doc ID 464683.1)
Applies to:
Oracle Server - Enterprise Edition - Version 9.2.0.1 to 11.1.0.6 [Release 9.2 to 11.1]
z*OBSOLETE: Microsoft Windows Server 2003
Microsoft Windows Itanium (64-bit)
z*OBSOLETE: Microsoft Windows XP (64-bit AMD64 and Intel EM64T)
Microsoft Windows Server 2003 (64-bit Itanium)Microsoft Windows Server 2003Microsoft Windows XP (64-bit AMD64 and Intel EM64T)
Description
Periodic Database slowdowns seen in Windows 2003 environments with lots of memory, running Service Pack 1Occurrence
Environments with a high level of concurrency, many CPU's and large amounts of physical ram are prone to this type of problem.Symptoms
Symptoms
Symptoms include any or all of the following:1. General slowdowns in the database, which may appear to be hung, even though systemstate dumps may not show evidence of hangs or locking contention. Sometimes, the database may recover from these slowdowns on its own, and begin responding with no user
intervention. At other times, the database may appear to be hung for long periods of time.
2. Other symptoms may include slow disk I/O - i.e. a large increase in average I/O times may be observed when reviewing AWR reports covering the time period in question.
3. It may also be observed that network latency increases dramatically. If running in a RAC environment, you may notice an increase in cluster latency - i.e. an increase in the wait times for global cache requests and messages (again, as observed in AWR
reports or in some cases as seen via real-time monitoring tools).
4. If running in a RAC environment, we may see IPC timeouts in the ipcdbg logfiles. On Oracle 10g, the ipcdbg logs are found in the $ORACLE_HOME\rdbms\log subdirectory. In Oracle 9i, the ipcdbg logs are found in C:\WINDOWS\system32\osd9i. Examples
of entries in the ipcdbg logfiles that indicate a slowdown would be entries such as the following:
Thread(06c4): 10/25/07 05:59:56 TCP/IP IPC ErrorLog, version[ 10.2.0.2.0.99 ] 10/25/07 05:59:58 | INFO | 02dc | Initialized proc [oracle : opprodrc1 :####] from interface at####### 10/25/07 07:35:27 | ERROR | 1ae8 | IPC_TCPConnect: async connect err 10048 to [opp :####:###:#####] 10/25/07 07:38:44 | ERROR | 1c6c | IPC_TCPConnect: async connect err 10048 to [opp :###:###:#####] 10/25/07 07:43:37 | ERROR | 1414 | [ipcosd.c :####] IPC_TCPListen2: Timeout 63050 exceeds max 25000 + margin 5000 10/25/07 07:43:37 | ERROR | 1414 | [ipcosd.c :####] IPCInitPort: Timeout 63140 exceeds max 25000 + margin 5000 10/25/07 07:43:40 | ERROR | 15f0 | [ipcosd.c :####] IPC_TCPListen2: Timeout 64950 exceeds max 25000 + margin 5000
5. Monitoring of system resources via Perfmon or other monitoring utilities may also reveal an increase in system time, indicating a large amount of work being done by the OS kernel.
Diagnosing the problem:
This problem can be difficult to diagnose, and it is therefore advised that the solution recommended below be implemented proactively. The symptoms listed in this note will not always manifest in the same way.In some cases, this problem may be diagnosed by verifying via Perfmon stats, AWR reports, and ipcdbg logs as noted above. In order to get confirmation of the problem, as noted above, systemstate dumps may not show any symptoms other than slow processing, so
it may be necessary to run adplus and take user level dumps of the oracle.exe process, in addition to collecting the normal systemstate dumps and hanganalyze dumps. Having a combination of systemsate dumps and adplus dumps of the oracle.exe process may make
this issue easier to debug, but in some cases it may also be necessary to get a kernel dump to confirm the exact cause. The OS or Hardware vendor should be consulted for assistance in obtaining a kernel dump.
In order to take an adplus dump of the oracle.exe process, at a high level, you must install the 'Debugging Tools for Windows' and then run adplus as such:
adplus -hang -p <PID for oracle.exe>
Be sure to use the correct PID corresponding to the correct oracle.exe. If you are running an ASM instance, you will most likely have at least two oracle.exe processes, but you will most likely want to run adplus and dump the oracle.exe
associated with the database instance, NOT the ASM instance
For a complete description of what adplus is and how to run it, please refer to the following Microsoft Support note.
http://support.microsoft.com/kb/286350/en-us
@ For the issue described in the alert, the Adplus dumps of the oracle process may show threads
@ kiswapcontext
@ As noted above, in some cases, it may be necessary to take a
Workaround
The cause of this problem may be the following Microsoft bug, which is described in the following links:http://support.microsoft.com/kb/919341
http://support.microsoft.com/kb/922658
This issue is known to be a problem in Windows 2003 with Service Pack 1, and can affect any application with large memory requirements, including single-instance Oracle database, Oracle RAC Databases and non-Oracle databases. It is strongly advised that
this Windows hotfix be installed proactively on all Oracle installations running on Windows 2003 with Service Pack 1, or that those installations be upgraded to Service Pack 2 proactively.
Patches
Microsoft has a hotfix available on top of Service Pack 1, and the issue should also be resolved by upgrading to Service Pack 2. As noted above, see the following referenced Microsoft article for information on this fix:http://support.microsoft.com/kb/919341/en-us
相关文章推荐
- [翻译自mos文章]不完全恢复之后,open resetlogs之前,怎么快速的检查数据库是否处于一致性的状态?
- 【翻译自mos文章】使用buffer memory 参数来调整rman的性能。
- 【翻译自mos文章】在数据库实例的alert日志中出现“WARNING: No cluster interconnect has been specified”
- 【翻译自mos文章】在一次失败的 'Shutdown Immediate'之后,数据库job 不能执行。
- 【翻译自mos文章】对于oracle 数据库来说,OGG的抽取进程什么时候到database中获取数据?
- 【翻译自mos文章】改变数据库用户sysman(该用户是DB Control Repository 的schema)password的方法
- 【翻译自mos文章】解释 datafile name中的MISSING关键字
- 【翻译自mos文章】在alter/drop表空间时遇到错误ORA-38301,ORA-00604,purge dba_recyclebin 也不行
- 【翻译自mos文章】对dba_users视图中,account_status列的解释
- 【翻译自mos文章】在一次失败的 'Shutdown Immediate'之后,数据库job 不能运行。
- 【翻译自mos文章】rac数据库中,HC_<SID>.dat文件被其他Oracle_Home下的实例所使用。
- 【翻译自mos文章】DBA_JOBS 和 DBA_JOBS_RUNNING 不同的结果的解释
- 【翻译自mos文章】在重建控制文件之后,丢失了数据库补充日志信息(Missed Database Supplemental Log Information)
- 【翻译自mos文章】使用aum( Automatic Undo Management) 时遇到 ORA-01555错误--- 原因和解决方案。
- 【翻译自MOS文章】对于使用dblink的并且数据库版本小于等于11.2.0.3的数据库需要的强制性补丁
- 【翻译自mos文章】当在Windows 2008 server 上安装OGG Director Server之后,Director Web无法访问
- 翻译mos文章 磁盘组的错误依赖关系导致数据库启动失败
- 【翻译自mos文章】解释直方图信息
- 【翻译自mos文章】改变数据库用户sysman(该用户是DB Control Repository 的schema)密码的方法
- 【从翻译mos文章】rac数据库,HC_<SID>.dat其他文件Oracle_Home用例下。