Oracle undo坏块测试和修复

伊文硕Oracle 2017-01-05

UNDO段头块损坏测试与修复,本次案例通过BBED工具模拟UNDO段头坏块,并在没有备份情况下启动数据库;

1 查看UNDO段头块位置

select header_file, header_block

  from dba_segments

 where segment_name like '_SYSSMU%'

 order by 2;

2 通过BBED工具,破坏UNDO某一段的段头块(file=3 block=280)

破坏的方式是直接将其他的数据块覆盖段头块

[Oracle11@primary ~]$ bbed parfile=bbed.par

Password:

BBED: Release 2.0.0.0.0 - Limited Production on Sat Jul 30 18:00:26 2016

Copyright (c) 1982, 2011, Oracle and/or its affiliates.  All rights reserved.

************* !!! For Oracle Internal Use only !!! ***************

BBED> copy dba 1,1 to dba 3,280

BBED> sum apply;

Check value for File 3, Block 280:

current = 0x599e, required = 0x599e
 

BBED> verify

DBVERIFY - Verification starting

FILE = /u02/app/oracle/oradata/orcl11/undotbs01.dbf

BLOCK = 280

Block 280 is corrupt

Corrupt block relative dba: 0x00400118 (file 0, block 280)

Bad header found during verification

Data in bad block:

 type: 11 format: 2 rdba: 0x00400001

 last change scn: 0x0000.00000000 seq: 0x1 flg: 0x04

 spare1: 0x0 spare2: 0x0 spare3: 0x0

 consistency value in tail: 0x00000b01

 check value in block header: 0xc8c7

 computed block checksum: 0x0

DBVERIFY - Verification complete

Total Blocks Examined        : 1

Total Blocks Processed (Data) : 0

Total Blocks Failing  (Data) : 0

Total Blocks Processed (Index): 0

Total Blocks Failing  (Index): 0

Total Blocks Empty            : 0

Total Blocks Marked Corrupt  : 1

Total Blocks Influx          : 0

Message 531 not found;  product=RDBMS; facility=BBED

---通过BBED和DBV检查结果都是file3,block 280损坏

[oracle11@primary orcl11]$ dbv file=undotbs01.dbf 

DBVERIFY: Release 11.2.0.4.0 - Production on Sat Jul 30 18:01:38 2016

Copyright (c) 1982, 2011, Oracle and/or its affiliates.  All rights reserved.

DBVERIFY - Verification starting : FILE = /u02/app/oracle/oradata/orcl11/undotbs01.dbf

Page 280 is marked corrupt

Corrupt block relative dba: 0x00c00118 (file 3, block 280)

Bad header found during dbv: 

Data in bad block:

 type: 11 format: 2 rdba: 0x00400001

 last change scn: 0x0000.00000000 seq: 0x1 flg: 0x04

 spare1: 0x0 spare2: 0x0 spare3: 0x0

 consistency value in tail: 0x00000b01

 check value in block header: 0xc8c7

 computed block checksum: 0x0

DBVERIFY - Verification complete

Total Pages Examined        : 392

Total Pages Processed (Data) : 0

Total Pages Failing  (Data) : 0

Total Pages Processed (Index): 0

Total Pages Failing  (Index): 0

Total Pages Processed (Other): 45

Total Pages Processed (Seg)  : 23

Total Pages Failing  (Seg)  : 0

Total Pages Empty            : 346

Total Pages Marked Corrupt  : 1

Total Pages Influx          : 0

Total Pages Encrypted        : 0

Highest block SCN            : 1283208 (0.1283208)

---模拟异常断电

SQL> shutdown abort

ORACLE instance shut down.

---启动数据库,报错ORA-01578

SQL> startup

ORACLE instance started.

Total System Global Area  784998400 bytes

Fixed Size                  2257352 bytes

Variable Size            515903032 bytes

Database Buffers          264241152 bytes

Redo Buffers                2596864 bytes

Database mounted.

ORA-01092: ORACLE instance terminated. Disconnection forced

ORA-01578: ORACLE data block corrupted (file # 3, block # 280)

ORA-01110: data file 3: '/u02/app/oracle/oradata/orcl11/undotbs01.dbf'

Process ID: 8265

Session ID: 1 Serial number: 5

通常UNDO损坏,在没有备份的情况下,可以通过以下方式启动数据库

#*.undo_tablespace='UNDOTBS1'      ----注释原UNDO表空间

#*.undo_management=AUTO            ----UNDO管理方式改为手动

*.undo_management='MANUAL'

*.undo_tablespace='SYSTEM'            ---将UNDO表空间改成SYSTEM

*._corrupted_rollback_segments=损坏的回滚段    ---屏蔽损坏的UNDO段

创建新的回滚段:

create undo tablespace UNDOTBS2 datafile '/u02/app/oracle/oradata/orcl11/undotbs02.dbf' size 1M autoextend on;

删除旧的回滚段:

drop tablespace UNDOTBS1 including contents and datafiles;

*.undo_tablespace='UNDOTBS2'

*.undo_management=AUTO

但是在mount状态下无法查询(创建或删除)回滚段

SQL> select * from v$rollname;

select * from v$rollname

              *

ERROR at line 1:

ORA-01219: database not open: queries allowed on fixed tables/views only

无法创建新的UNDO表空间

SQL> create undo tablespace UNDOTBS2 datafile '/u02/app/oracle/oradata/orcl11/undotbs02.dbf' size 1M autoextend on;

create undo tablespace UNDOTBS2 datafile '/u02/app/oracle/oradata/orcl11/undotbs02.dbf' size 1M autoextend on

*

ERROR at line 1:

ORA-01109: database not open

无法删除旧的UNDO表空间

SQL> drop tablespace UNDOTBS1 including contents and datafiles;

drop tablespace UNDOTBS1 including contents and datafiles

*

ERROR at line 1:

ORA-01109: database not open

在数据库不能OPEN情况下,有两种方式可以查询数据库部分信息;

1:strings命令可以查询所有的UNDO回滚段名,包括已经删除的回滚段;

[oracle11@primary orcl11]$ strings system01.dbf | grep _SYSSMU | cut -d $ -f 1 | sort -u > listSMU

[oracle11@primary orcl11]$ vim listSMU

_SYSSMU20_3293637928$

_SYSSMU20_379396250$

_SYSSMU20_379396250$

_SYSSMU13_811223436$

........

2:BBED工具也可以查询UNDO段名;

BBED> set file 1 block 225  -----Oracle 11g版本,undo$表信息一般位于1号文件第225个数据块中

        FILE#          1

        BLOCK#        225

BBED> map

 File: /u02/app/oracle/oradata/orcl11/system01.dbf (1)

 Block: 225                                  Dba:0x004000e1

------------------------------------------------------------

 KTB Data Block (Table/Cluster)

 struct kcbh, 20 bytes                      @0       

 struct ktbbh, 48 bytes                    @20     

 struct kdbh, 14 bytes                      @68     

 struct kdbt[1], 4 bytes                    @82     

 sb2 kdbr[25]                              @86      -------含有25个UNDO段

 ub1 freespace[6402]                        @136     

 ub1 rowdata[1650]                          @6538   

 ub4 tailchk                                @8188   

BBED> p kdbr

sb2 kdbr[0]                                @86      8078

sb2 kdbr[1]                                @88      8011

sb2 kdbr[2]                                @90      7944

......

sb2 kdbr[22]                                @130      6603

sb2 kdbr[23]                                @132      6537

sb2 kdbr[24]                                @134      6470

BBED> x /rnc *kdbr[0]        ----查看0号UNDO段名称

col    1[6] @8151: SYSTEM

BBED> x /rnc *kdbr[1]        ----查看1号UNDO段名称

col  1[20] @8085: _SYSSMU1_4115952380$


如果UNDO段特别多,可以通过EXECL,自动生成多个x /rnc *kdbr[0]......*kdbr[n]命令,再将命令复制粘贴到BBED中,同时获取多个UNDO段名;


 x /rnc *kdbr[0]
 

 x /rnc *kdbr[1]
 

 x /rnc *kdbr[2]
 

 x /rnc *kdbr[3]
 

......

 x /rnc *kdbr[24]
 

如果不能判断具体哪个回滚段出现问题,可以跳过所有的回滚段

*._corrupted_rollback_segments='_SYSSMU1_4115952380$','_SYSSMU2_3882698531$','_SYSSMU3_1780844141$','_SYSSMU4_1137450214$','_SYSSMU5_2972601029$','_SYSSMU6_2318781079$','_SYSSMU7_1865616030$','_SYSSMU8_4279519761$','_SYSSMU9_1551968587$','_SYSSMU10_2324134815$','_SYSSMU11_2069826877$','_SYSSMU12_2242918609$','_SYSSMU13_811223436$','_SYSSMU14_1093125402$','_SYSSMU15_2825991097$','_SYSSMU16_252471872$','_SYSSMU17_3347133763$','_SYSSMU18_1765883319$','_SYSSMU19_1005333767$','_SYSSMU20_3293637928$','_SYSSMU21_3641740596$','_SYSSMU22_3421614834$','_SYSSMU23_138031739$'

参数文件:

#*.undo_tablespace='UNDOTBS1'

#*.undo_management=AUTO

*.undo_tablespace='SYSTEM'

*.undo_management='MANUAL'

*._corrupted_rollback_segments='_SYSSMU1_4115952380$','_SYSSMU2_3882698531$','_SYSSMU3_1780844141$','_SYSSMU4_1137450214$','_SYSSMU5_2972601029$','_SYSSMU6_2318781079$','_SYSSMU7_1865616030$','_SYSSMU8_4279519761$','_SYSSMU9_1551968587$','_SYSSMU10_2324134815$','_SYSSMU11_2069826877$','_SYSSMU12_2242918609$','_SYSSMU13_811223436$','_SYSSMU14_1093125402$','_SYSSMU15_2825991097$','_SYSSMU16_252471872$','_SYSSMU17_3347133763$','_SYSSMU18_1765883319$','_SYSSMU19_1005333767$','_SYSSMU20_3293637928$','_SYSSMU21_3641740596$','_SYSSMU22_3421614834$','_SYSSMU23_138031739$'                   
 

SQL> shutdown immediate
 

SQL> startup

ORACLE instance started.

Total System Global Area  784998400 bytes

Fixed Size                  2257352 bytes

Variable Size            515903032 bytes

Database Buffers          264241152 bytes

Redo Buffers                2596864 bytes

Database mounted.

Database opened.

创建新的UNDO表空间

create undo tablespace UNDOTBS2 datafile '/u02/app/oracle/oradata/orcl11/undotbs02.dbf' size 1M autoextend on;

删除旧的UNDO表空间

drop tablespace UNDOTBS1 including contents and datafiles;

修改参数文件

*.undo_tablespace='UNDOTBS2'

*.undo_management=AUTO

#*.undo_tablespace='SYSTEM'

#*.undo_management='MANUAL'

#*._corrupted_rollback_segments='_SYSSMU1_4115952380$','_SYSSMU2_3882698531$','_SYSSMU3_1780844141$','_SYSSMU4_1137450214$','_SYSSMU5_2972601029$','_SYSSMU6_2318781079$','_SYSSMU7_1865616030$','_SYSSMU8_4279519761$','_SYSSMU9_1551968587$','_SYSSMU10_2324134815$','_SYSSMU11_2069826877$','_SYSSMU12_2242918609$','_SYSSMU13_811223436$','_SYSSMU14_1093125402$','_SYSSMU15_2825991097$','_SYSSMU16_252471872$','_SYSSMU17_3347133763$','_SYSSMU18_1765883319$','_SYSSMU19_1005333767$','_SYSSMU20_3293637928$','_SYSSMU21_3641740596$','_SYSSMU22_3421614834$','_SYSSMU23_138031739$'

SQL> shutdown immediate

Database closed.

Database dismounted.

ORACLE instance shut down.

SQL> startup

ORACLE instance started.

Total System Global Area  784998400 bytes

Fixed Size                  2257352 bytes

Variable Size            515903032 bytes

Database Buffers          264241152 bytes

Redo Buffers                2596864 bytes

Database mounted.

Database opened.

SQL> show parameter undo

NAME                                TYPE        VALUE

------------------------------------ ----------- ------------------------------

undo_management                      string      AUTO

undo_retention                      integer    900

undo_tablespace                      string      UNDOTBS2

BBED修改数据块是比较危险的操作,如果某个修改操作有误,可以通过revert或undo命令回退BBED的修改操作;

例如:BBED回退3,280块上所有修改

BBED> revert dba 3,280

All changes made to this block will be rolled back. Proceed? (Y/N) y

Reverted file '/u02/app/oracle/oradata/orcl11/undotbs01.dbf', block 280

BBED> sum apply;

Check value for File 3, Block 280:

current = 0x3f90, required = 0x3f90

UNDO非段头(文件头)块损坏测试与修复

undo非段头(文件头)损坏,数据库可以正常启动,在没有备份的情况下,可以通过alert报错信息,找到并删除受损的回滚段

SQL> insert into t values(1);      -----插入一条数据,不提交

SQL> select usn,status,xacts from v$rollstat;

      USN STATUS              XACTS

    ---------- ---------------          ----------

        0 ONLINE                  0

        8 ONLINE                  0

        9 ONLINE                  1          ----9号回滚段存在活动事物

        10 ONLINE                  0

        11 ONLINE                  0

        12 ONLINE                  0

        24 ONLINE                  0

        25 ONLINE                  0

        26 ONLINE                  0

        27 ONLINE                  0

        28 ONLINE                  0

11 rows selected.

---查看回滚段头块位置

SQL> SET LINE 100

SQL> col segment_name for a30

SQL> select segment_name,header_file,header_block from dba_segments where segment_name like '_SYSSMU%' order by 3;

SEGMENT_NAME                  HEADER_FILE HEADER_BLOCK

------------------------------ ----------- ------------

_SYSSMU8_4161384913$                    3            8

_SYSSMU9_1458183674$                    3          24

_SYSSMU10_2644453179$                    3          40

_SYSSMU11_4737420$                      3          56

_SYSSMU12_392022772$                    3          72

_SYSSMU24_4044825012$                    3          88

_SYSSMU25_2098992521$                    3          104

_SYSSMU26_2158116475$                    3          120

_SYSSMU27_4048022843$                    3          136

_SYSSMU28_1413754230$                    3          152

10 rows selected.

通过BBED工具,手动破坏9号回滚段非头块;

[oracle11@primary ~]$ bbed parfile=bbed.par

Password: 

BBED: Release 2.0.0.0.0 - Limited Production on Sat Aug 13 22:35:38 2016

Copyright (c) 1982, 2011, Oracle and/or its affiliates.  All rights reserved.

************* !!! For Oracle Internal Use only !!! ***************

BBED> copy dba 1,1 to dba 3,25

BBED> sum apply;

Check value for File 3, Block 25:

current = 0xae9a, required = 0xae9a

BBED> verify

DBVERIFY - Verification starting

FILE = /u02/app/oracle/oradata/orcl11/undotbs01.dbf

BLOCK = 25

Block 25 is corrupt

Corrupt block relative dba: 0x00400019 (file 3, block 25)

Bad header found during verification

Data in bad block:

 type: 11 format: 2 rdba: 0x00400001

 last change scn: 0x0000.00000000 seq: 0x1 flg: 0x04

 spare1: 0x0 spare2: 0x0 spare3: 0x0

 consistency value in tail: 0x00000b01

 check value in block header: 0xae9a

 computed block checksum: 0x0

DBVERIFY - Verification complete

Total Blocks Examined        : 1

Total Blocks Processed (Data) : 0

Total Blocks Failing  (Data) : 0

Total Blocks Processed (Index): 0

Total Blocks Failing  (Index): 0

Total Blocks Empty            : 0

Total Blocks Marked Corrupt  : 1

Total Blocks Influx          : 0

Message 531 not found;  product=RDBMS; facility=BBED

[oracle11@primary orcl11]$ dbv file=undotbs01.dbf 

DBVERIFY: Release 11.2.0.4.0 - Production on Wed Aug 17 11:39:35 2016

Copyright (c) 1982, 2011, Oracle and/or its affiliates.  All rights reserved.

DBVERIFY - Verification starting : FILE = /u02/app/oracle/oradata/orcl11/undotbs01.dbf

Page 25 is marked corrupt

Corrupt block relative dba: 0x00c00019 (file 3, block 25)

Bad header found during dbv: 

Data in bad block:

 type: 11 format: 2 rdba: 0x00400001

 last change scn: 0x0000.00000000 seq: 0x1 flg: 0x04

 spare1: 0x0 spare2: 0x0 spare3: 0x0

 consistency value in tail: 0x00000b01

 check value in block header: 0xae9a

 computed block checksum: 0x0

DBVERIFY - Verification complete

Total Pages Examined        : 208

Total Pages Processed (Data) : 0

Total Pages Failing  (Data) : 0

Total Pages Processed (Index): 0

Total Pages Failing  (Index): 0

Total Pages Processed (Other): 88

Total Pages Processed (Seg)  : 10

Total Pages Failing  (Seg)  : 0

Total Pages Empty            : 119

Total Pages Marked Corrupt  : 1

Total Pages Influx          : 0

Total Pages Encrypted        : 0

Highest block SCN            : 1570655 (0.1570655)

SQL> shutdown abort

ORACLE instance shut down.

SQL> startup

ORACLE instance started.

Total System Global Area  784998400 bytes

Fixed Size                  2257352 bytes

Variable Size            515903032 bytes

Database Buffers          264241152 bytes

Redo Buffers                2596864 bytes

Database mounted.

Database opened.

数据库可以正常启动,后台alert日志也没有报错,通过dbv或者bbed工具检查出坏块后,可以手动删除坏块对应的undo段:

(1):select * from dba_extents where file_id=xx and xxx between block_id and block_id+blocks-1;

(2):DROP ROLLBACK SEGMENT rollback_segment; 

或者直接新建UNDO表空间:

(1):创建新的UNDO表空间

create undo tablespace UNDOTBS2 datafile '/u02/app/oracle/oradata/orcl11/undotbs02.dbf' size 1M autoextend on;

(2):删除旧的UNDO表空间

drop tablespace UNDOTBS1 including contents and datafiles;

UNDO文件头块损坏测试与修复

UNDO文件头损坏,无法正常open数据库;

SQL> shutdown abort

ORACLE instance shut down.

SQL> startup

ORACLE instance started.

Total System Global Area  784998400 bytes

Fixed Size                  2257352 bytes

Variable Size            515903032 bytes

Database Buffers          264241152 bytes

Redo Buffers                2596864 bytes

Database mounted.

ORA-01122: database file 3 failed verification check

ORA-01110: data file 3: '/u02/app/oracle/oradata/orcl11/undotbs01.dbf'

ORA-01210: data file header is media corrupt


在没有备份的情况下,需要通过BBED工具进行修复损坏的文件头;

修复的方式是通过复制其他数据文件头,并手动修改文件头中相关信息;

1、修改数据的DBA,rdba_kcbh

 2、修改文件的大小,kccfhfsz

 3、修改文件号,kccfhfno

 4、修改文件创建时SCN,kcvfhcrs

 5、修改文件创建时间,kcvfhcrt

 6、修改表空间号,kcvfhtsn 

 7、修改相对文件号,kcvfhrfn 

 8、修改表空间的名称, kcvfhtnm

 9、修改表空间的长度,kcvfhtln     

 10、修改检查点的SCN,kcvfhckp

 11、修改检查点的时间,kcvcptim 

 12、修改检查点的计数器,kcvfhcpc

 13、修改检查点的控制文件备份的计数器, kcvfhccc

 14、如果你修改是1号文件的1号块,他的root rdba的地针是指向了bootstrap$

通过BBED修复UNDO文件头坏块过程比较复制,并且BBED工具并不对外公开,也不提供技术支持,使用过程中很容易出现问题,建议在正式环境尽量避免使用BBED工具;

可以通过下面网站查看具体修改过程;

http://blog.csdn.net/guoyjoe/article/details/31018075

BBED工具的安装

Oracle 11g版本和以后的版本已经不提供bbed工具了,11g数据库如果需要使用bbed工具,可以拷贝10g或之前版本数据库上的三个文件:

[oracle11@primary ~]$ ll -rth bbed_install/

total 20K

-rw-r--r-- 1 root root 8.5K Sep  8  2012 bbedus.msb

-rw-r--r-- 1 root root 1.9K Sep  8  2012 sbbdpt.o

-rw-r--r-- 1 root root 1.2K Sep  8  2012 ssbbded.o

将文件拷贝到指定目录

[oracle11@primary ~]$ cp /home/oracle11/bbed_install/bbedus.msb /u02/app/oracle/

product/11.2.0/rdbms/mesg/

[oracle11@primary ~]$ cp /home/oracle11/bbed_install/ssbbded.o /u02/app/oracle/product/11.2.0/rdbms/lib/

[oracle11@primary ~]$ cp /home/oracle11/bbed_install/sbbdpt.o /u02/app/oracle/product/11.2.0/rdbms/lib/

编译

[oracle11@primary ~]$ make -f /u02/app/oracle/product/11.2.0/rdbms/lib/ins_rdbms

.mk BBED=$ORACLE_HOME/bin/bbed $ORACLE_HOME/bin/bbed

bbed默认密码"blockedit"

[oracle11@primary ~]$ bbed

Password: 

BBED: Release 2.0.0.0.0 - Limited Production on Sat Jul 30 14:22:17 2016

Copyright (c) 1982, 2011, Oracle and/or its affiliates.  All rights reserved.

************* !!! For Oracle Internal Use only !!! ***************

BBED>

使用BBED工具之前需要创建filelist文件

SQL> set linesize 100

SQL> col name for a45

SQL> spool /home/oracle11/filelist.txt

SQL> select file#,name,bytes from v$datafile order by 1;

    FILE# NAME                                              BYTES

---------- --------------------------------------------- ----------

        1 /u02/app/oracle/oradata/orcl11/system01.dbf    775946240

        2 /u02/app/oracle/oradata/orcl11/sysaux01.dbf    545259520

        3 /u02/app/oracle/oradata/orcl11/undotbs01.dbf    73400320

        4 /u02/app/oracle/oradata/orcl11/users01.dbf      5242880

        5 /u02/app/oracle/oradata/orcl11/chen01.dbf        1048576

SQL> spool off

[oracle11@primary ~]$ touch bbed.par

[oracle11@primary ~]$ vim bbed.par

blocksize=8192

listfile=/home/oracle11/filelist.txt

mode=edit

[oracle11@primary ~]$ bbed parfile=bbed.par

Password: 

BBED: Release 2.0.0.0.0 - Limited Production on Sat Jul 30 14:36:34 2016

Copyright (c) 1982, 2011, Oracle and/or its affiliates.  All rights reserved.

************* !!! For Oracle Internal Use only !!! ***************

BBED> show 

        FILE#          1

        BLOCK#          1

        OFFSET          0

        DBA            0x00400001 (4194305 1,1)

        FILENAME        /u02/app/oracle/oradata/orcl11/system01.dbf

        BIFILE          bifile.bbd

        LISTFILE        /home/oracle11/filelist.txt

        BLOCKSIZE      8192

        MODE            Edit

        EDIT            Unrecoverable

        IBASE          Dec

        OBASE          Dec

        WIDTH          80

        COUNT          512

        LOGFILE        log.bbd

        SPOOL          No

相关推荐