- 招聘 : Linux運(yùn)維
- 論壇徽章:
- 0
|
接到case,用戶刪除了一個(gè)數(shù)據(jù)庫表,幾百M(fèi)左右。(為什么會(huì)刪除就不是咱該多問的了)\r\n沒說的,通用做法:先不完全恢復(fù)到輔助庫,然后導(dǎo)出導(dǎo)入。\r\n\r\n1.環(huán)境介紹\r\nInformix數(shù)據(jù)庫,Netbackup 5.1,做的Onbar備份。\r\n1個(gè)Master server(server6)\r\n幾個(gè)Media server(SSO 共享磁帶庫,后面提到的server1,server4都即為client也是media server)\r\n\r\n用戶誤刪除了server4服務(wù)器上db_glxt數(shù)據(jù)庫中一個(gè)表,幾百M(fèi)左右。\r\n\r\n準(zhǔn)備要用server1服務(wù)器作為輔助服務(wù)器\r\n將server4上的數(shù)據(jù)庫恢復(fù)到server1上,server1、server4都是生產(chǎn)用機(jī),\r\nserver4的servername是aa_ol_yingye數(shù)據(jù)庫為db_glxt\r\nserver1的servername是ol_aa_sett 數(shù)據(jù)庫名為db_sett\r\n\r\n2.在server1上準(zhǔn)備Informix環(huán)境\r\n1) 建立chunk:\r\n在目錄/data目錄下創(chuàng)建文件\r\ntouch rlv_phychunk \r\ntouch rlv_logchunk01 \r\ntouch rlv_logchunk02 \r\ntouch rlv_logchunk03 \r\ntouch rlv_logchunk04 \r\ntouch rlv_logchunk05 \r\ntouch rlv_logchunk06 \r\n...\r\ntouch rlv_datachunk15\r\ntouch rlv_datachunk16\r\ntouch rlv_datachunk17\r\ntouch rlv_rootchunk\r\n鏈接文件(目的是為了跟原來庫的環(huán)境相同,源庫是在/informix/dbs_glxt/下)\r\nln -s /data/rlv_phychunk /informix/dbs_glxt/phychk\r\nln -s /data/rlv_logchunk01 /informix/dbs_glxt/logchk1 \r\nln -s /data/rlv_logchunk02 /informix/dbs_glxt/logchk2 \r\nln -s /data/rlv_logchunk03 /informix/dbs_glxt/logchk3 \r\nln -s /data/rlv_logchunk04 /informix/dbs_glxt/logchk4 \r\nln -s /data/rlv_logchunk05 /informix/dbs_glxt/logchk5 \r\nln -s /data/rlv_logchunk06 /informix/dbs_glxt/logchk6\r\n...\r\nln -s /data/rlv_datachunk15 /informix/dbs_glxt/datachk15\r\nln -s /data/rlv_datachunk16 /informix/dbs_glxt/datachk16\r\nln -s /data/rlv_datachunk17 /informix/dbs_glxt/datachk17\r\nln -s /data/rlv_rootchunk /informix/dbs_glxt/rootchk\r\n\r\n2) 復(fù)制源庫的相關(guān)文件\r\n將server4: infomix/etc/ixbar.135 拷到server1:/informix/ixbar.135\r\nserver4: infomix/etc/oncfg_ol_aa_yingye.135 拷到server1:/informix/etc/oncfg_ol_aa_yingye.135\r\nserver4: infomix/etc/onconfig.glxt_p 拷到server1:/informix/etc/onconfig.glxt_p\r\n\r\n3.準(zhǔn)備Netbackup:\r\n1) To remove restrictions for all clients, create the following file on the NetBackup master server:\r\n/usr/openv/netbackup/db/altnames/No.Restrictions\r\n2) 檢查了前一天晚上的備份情況\r\n\r\n4.su - informix然后export下面的環(huán)境變量\r\nexport ONCONFIG=onconfig.glxt_p\r\nexport INFORMIXSERVER=ol_aa_yingye\r\nexport INFXBSA_CLIENT=server4\r\n\r\n5.執(zhí)行恢復(fù):\r\nonbar -r -t \'2008-12-30 8:0:0\'\r\n報(bào)錯(cuò)了,呵呵。\r\n狀態(tài)到開始mouting 介質(zhì)了,過程中status code先59,后25.\r\n148769 Restore Done 25 server1 \r\n148770 Restore Done 25 (這里其實(shí)是先報(bào)59然后最后才25) server1\r\n\r\n(NetBackup 狀態(tài)碼: 25\r\n消息:無法連接到套接字上\r\n解釋:在某個(gè)進(jìn)程連接到另一個(gè)進(jìn)程進(jìn)行特定操作時(shí),該進(jìn)程超時(shí)。如果某個(gè)進(jìn)程嘗試連\r\n接到 NetBackup 請求后臺(tái)駐留程序 (bprd) 或數(shù)據(jù)庫管理器后臺(tái)駐留程序 (bpdbm),而該\r\n后臺(tái)駐留程序沒有運(yùn)行,則可能會(huì)發(fā)生此問題。(在 Windows 上,這些后臺(tái)駐留程序是\r\nNetBackup 請求管理器服務(wù)和 NetBackup 數(shù)據(jù)庫管理器服務(wù)。)如果網(wǎng)絡(luò)或服務(wù)器負(fù)載過\r\n重且響應(yīng)時(shí)間很長,或者 NetBackup 的試用許可證密鑰已失效,則也可能會(huì)發(fā)生此問題。\r\n但是,該錯(cuò)誤的最常見原因是主機(jī)名解析問題。\r\nNetBackup 狀態(tài)碼: 59\r\n消息:不允許訪問客戶機(jī)\r\n解釋:主服務(wù)器或介質(zhì)服務(wù)器試圖訪問客戶機(jī),但客戶機(jī)未將該服務(wù)器識(shí)別為有效的服\r\n務(wù)器。)\r\n\r\n貌似都是bp.conf的問題,檢查server1上的bp.conf:\r\nSERVER = server6\r\nSERVER = server1\r\nCLIENT_NAME = server1\r\nCLIENT_READ_TIMEOUT = 7200\r\nINFORMIX_HOME = /informix\r\nMEDIA_UNMOUNT_DELAY = 200\r\nMEDIA_REQUEST_DELAY = 30\r\nREQUIRED_INTERFACE = server1\r\n\r\n好像沒問題,于是在set1上創(chuàng)建/usr/openv/netbackup/logs/bpcd打開調(diào)試日志,在備份一次從日志里看到點(diǎn)疑點(diǎn):\r\n15:12:37.501 [25938] <2> bpcd main: setup_sockopts complete\r\n15:12:37.512 [25938] <2> bpcd peer_hostname: Connection from host server4 (10.64.0.33) port 898\r\n15:12:37.514 [25938] <2> bpcd valid_server: comparing server6 and server4\r\n15:12:37.516 [25938] <2> bpcd valid_server: comparing server1 and server4\r\n15:12:37.518 [25938] <16> bpcd valid_server: server4 is not a server\r\n15:12:37.518 [25938] <16> bpcd valid_server: server4 is not a media server\r\n15:12:37.519 [25938] <2> bpcd main: output socket port number = 628\r\n15:12:37.519 [25938] <2> bpcd peer_hostname: Connection from host server4 (10.64.0.33) port 898\r\n15:12:37.519 [25938] <2> bpcd main: Peer hostname is server4\r\n15:12:37.519 [25938] <2> bpcd main: Got socket for output 5, lport = 866\r\n15:12:37.520 [25938] <2> bpcd main: Connected on output socket\r\n15:12:37.520 [25938] <2> bpcd main: Duplicated socket on stderr\r\n這一段之前還有一段驗(yàn)證了server1和server6,都是valid_server,但是這里卻server4 is not a server。\r\n\r\n懷疑:雖然是在server上做恢復(fù),但是因?yàn)楫?dāng)初備份是以server4做為Media server備份,恢復(fù)的時(shí)候也要用server4當(dāng)media server\r\n這個(gè)在后面會(huì)給出證據(jù)。\r\n\r\n在server1上增加了一行:\r\nSERVER = server4\r\n\r\n再執(zhí)行onbar -r -t \'2008-12-30 8:0:0\' 沒有問題了,開始讀數(shù)據(jù)\r\n\r\n6.監(jiān)控恢復(fù)進(jìn)度和restore完成:\r\nbpdbjobs輸出\r\n148769 Restore Done 25 server1\r\n148770 Restore Done 25 server1\r\n148771 Restore Done 0 server1\r\n148772 Restore Done 0 server1\r\n148773 Restore Done 0 server1\r\n148774 Restore Active server1\r\n\r\nserver1#[/usr/openv/netbackup/logs/bpcd]bpdbjobs -jobid 148771 -all_columns\r\n148771,2,3,0,,,server1,server4,1230622460,0000000352,1230622812,,1,,74112,0,,100,24198,informix,,,,,informix,server6,,,,,,1,/ol_aa_yingye/rootdbs/0,1,24198,,,1230622460,0000000352,1230622812,0,the requested operation was successfully completed,12,12/30/08 15:34:20 - begin Restore operation,12/30/08 15:34:21 - 1 images required,12/30/08 15:34:21 - media F805L2 required,12/30/08 15:34:27 - started process bptm (15302),12/30/08 15:34:27 - mounting F805L2,12/30/08 15:34:30 - connected,12/30/08 15:35:24 - mounted; mount time: 000:00:57,12/30/08 15:35:25 - positioning F805L2 to file 13,12/30/08 15:36:32 - positioned; position time: 000:01:07,12/30/08 15:36:32 - begin reading,12/30/08 15:40:11 - end reading; read time: 000:03:39,12/30/08 15:40:12 - end Restore operation; operation time: 000:05:52,74112,0,,63,,,,,,,,server4,,,,,,,,,,,\r\n\r\nserver1#[/usr/openv/netbackup/logs/bpcd]bpdbjobs -jobid 148772 -all_columns\r\n148772,2,3,0,,,server1,server4,1230622818,0000000022,1230622840,,1,,96,0,,100,27150,informix,,,,,informix,server6,,,,,,1,/ol_aa_yingye/logdbs/0,1,27150,,,1230622818,0000000022,1230622840,0,the requested operation was successfully completed,10,12/30/08 15:40:18 - begin Restore operation,12/30/08 15:40:20 - 1 images required,12/30/08 15:40:20 - media F805L2 required,12/30/08 15:40:34 - mounted,12/30/08 15:40:34 - positioning F805L2 to file 14,12/30/08 15:40:34 - positioned; position time: 000:00:00,12/30/08 15:40:34 - begin reading,12/30/08 15:40:38 - connected,12/30/08 15:40:38 - end reading; read time: 000:00:04,12/30/08 15:40:39 - end Restore operation; operation time: 000:00:21,96,0,,2823,,,,,,,,server4,,,,,,,,,,,\r\n\r\nserver1#[/usr/openv/netbackup/logs/bpcd]bpdbjobs -jobid 148773 -all_columns\r\n148773,2,3,0,,,server1,server4,1230622893,0000000016,1230622909,,1,,96,0,,100,27519,informix,,,,,informix,server6,,,,,,1,/ol_aa_yingye/phydbs/0,1,27519,,,1230622893,0000000016,1230622909,0,the requested operation was successfully completed,10,12/30/08 15:41:33 - begin Restore operation,12/30/08 15:41:34 - 1 images required,12/30/08 15:41:34 - media F805L2 required,12/30/08 15:41:42 - mounted,12/30/08 15:41:42 - positioning F805L2 to file 15,12/30/08 15:41:42 - positioned; position time: 000:00:00,12/30/08 15:41:42 - begin reading,12/30/08 15:41:43 - connected,12/30/08 15:41:44 - end reading; read time: 000:00:02,12/30/08 15:41:49 - end Restore operation; operation time: 000:00:16,96,0,,2823,,,,,,,,server4,,,,,,,,,,,\r\n\r\nserver1#[/usr/openv/netbackup/logs/bpcd]bpdbjobs -jobid 148774 -all_columns\r\n148774,2,1,,,,server1,server4,1230622926,0000007274,0000000000,,1,0,35192320,0,/ol_aa_yingye/datadbs1/0,0,27696,informix,,,,,informix,server6,,,,0,1,1,/ol_aa_yingye/datadbs1/0,1,27696,,,1230622926,0000007274,0000000000,,,11,12/30/08 15:42:06 - begin Restore operation,12/30/08 15:42:07 - 1 images required,12/30/08 15:42:07 - media F805L2 required,12/30/08 15:42:07 - media F815L2 required,12/30/08 15:42:20 - started process bptm (15585),12/30/08 15:42:20 - mounting F805L2,12/30/08 15:42:22 - connected,12/30/08 15:43:33 - mounted; mount time: 000:01:13,12/30/08 15:43:33 - positioning F805L2 to file 16,12/30/08 15:44:49 - positioned; position time: 000:01:16,12/30/08 15:44:49 - begin reading,35192320,0,,9532,,,,,,,,server4,,,,0,0,,,,,1,\r\n\r\n顯然:client=server1,server=server4,master,server=server6每個(gè)chunk對應(yīng)了一個(gè)restore的job.\r\n恢復(fù)完成到此NBU相關(guān)的工作完成,dbaccess導(dǎo)出導(dǎo)入是用戶的工作了。\n\n[ 本帖最后由 天涯明月刀 于 2009-1-23 15:58 編輯 ] |
|