Oracle大数据量更新方法

创建时间：2014-04-10 投稿人：浏览次数：3528

实际工作中我们有时会遇到，需要对一些超级大（BT）的表进行数据更新（更悲剧的是有时需要关联更新，就是依据一张大表的数据来更新另一张表的数据）我们通常的更新方法可能在性能效率上遭遇瓶颈，下面就笔者遇到的一个案列进行记录分享。

首先我们来看下需求：

一张玩家首登表：account_first_login（此表中有account,login_date,login_region字段）一张玩家每日活跃表：account_active_day（有account,active_date,login_region）

现在的需求是需要把首登表account_first_login中的login_region的值更新为对应的每日活跃表：account_active_day中的login_region值，有同事最初的写法是：

1 2 3 4 5 6 7 alter session enable parallel dml; update /*+ parallel(t1,4) */ account_first_login t1 set login_region= (select login_region from account_active_day t2 where t1.account = t2.account and t1.login_date = t2.active_date);

结果5,6个小时过去了更新还没有完成，后来优化更改为关联update

1 2 3 4 5 6 7 update (select /*+ bypass_ujvc */ t1.login_region old_id, t2.login_region new_id from account_first_login t1, account_active_day t2 where t1.account = t2.account and t1.login_date = t2.active_date) r set r.old_id = r.new_id;

修改后的写法效率有了一定的提升，2个多小时完成，可是时间还是不能接受，继续优化，想到了通过rowid批量进行更新

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 declare maxrows number default 5000; row_id_table dbms_sql.Urowid_Table; p_id_table dbms_sql.Number_Table; cursor acnt_first_cur is select /*+ use_hash(t1,t2) parallel(t1,4) parallel(t2,4) */ t2.login_region, t1.rowid from account_first_login t1, account_active_day t2 where t1.account = t2.account and t1.login_date = t2.login_date order by t1.rowid; begin open acnt_first_cur; loop exit when acnt_first_cur%notfound; fetch acnt_first_cur bulk collect into p_id_table, row_id_table limit maxrows; forall i in 1 .. row_id_table.count update account_first_login set login_region = p_id_table(i) where rowid = row_id_table(i); commit; end loop; end;

结果优化后的更新速度是相当神速啊：247s完成！

结论：当更新大数据量时可以通过rowid进行排序后再批量更新，因为通过rowid进行排序后想应的被更新数据多数在一个数据块上，这样按块进行批量更新速度应该能够大大提高。

from：http://hi.baidu.com/songyunkui/item/04aba589f7ffa705100ef3c1

声明：该文观点仅代表作者本人，牛骨文系教育信息发布平台，牛骨文仅提供信息存储空间服务。

上一篇：数据库存储大数据量(千万条记录级别)数据的考虑要点
下一篇：项目开发中，大数据量的增删改查的若干解决方法

热门文章: CTF writeup 2_南邮网络攻防训...; SSM框架——详细整合教程（...; Linux Shell脚本编程－－curl命...; HttpClient使用详解; Java面试题全集（上）; JAVA设计模式之单例模式; java.lang.OutOfMemoryError: PermGen ...; TCP协议中的三次握手和四次...; form表单的两种提交方式，su...; String,StringBuffer与StringBuilder...

最新文章: Java之品优购课程讲义_day20（7）; 剑指 Offer - 8：跳台阶; Netty权威指南_札记02_NIO编程; mysql时间属性之时间戳和datetime之...; 虚拟现实或许可以拯救古埃及的“...; spring cloud服务注册中心eureka---集群...; Java SE 第六章; HTTP请求+数据库; HIDL学习笔记之HIDL C++（第二天）; ubuntu系统下指定tomcat运行时为JDK1.8...