首页 > 代码库 > Oracle Sql Loader的学习使用

Oracle Sql Loader的学习使用

 

最近由于遇到oracle控制文件的使用,虽然不是很复杂,但是从来没有用过,专门花点时间看看。点击 这里 查看详细

1,概述:

Sql Loader: 一个批量工具,将文件数据导入到数据库。可以导入一个表或者多个表,甚至可以在导入时修改数据。

2,使用

a,你电脑需要装Oracle,不然你是找不到Sqlldr 这个命令的。

在控制输入台输入 sqlldr:

会列出相关的参数介绍。

> sqlldr
.
.
.
Usage: SQLLDR keyword=value [,keyword=value,...]

Valid Keywords:

    userid -- ORACLE username/password           
   control -- control file name                  
       log -- log file name                      
       bad -- bad file name                      
      data -- data file name                     
   discard -- discard file name                  
discardmax -- number of discards to allow          (Default all)
      skip -- number of logical records to skip    (Default 0)
      load -- number of logical records to load    (Default all)
    errors -- number of errors to allow            (Default 50)
      rows -- number of rows in conventional path bind array or between direct
path data saves
               (Default: Conventional path 64, Direct path all)
  bindsize -- size of conventional path bind array in bytes  (Default 256000)
    silent -- suppress messages during run (header,feedback,errors,discards,
partitions)
    direct -- use direct path                      (Default FALSE)
   parfile -- parameter file: name of file that contains parameter specifications
  parallel -- do parallel load                     (Default FALSE)
      file -- file to allocate extents from      
skip_unusable_indexes -- disallow/allow unusable indexes or index partitions
(Default FALSE)
skip_index_maintenance -- do not maintain indexes, mark affected indexes as 
unusable  (Default FALSE)
commit_discontinued -- commit loaded rows when load is discontinued  (Default
FALSE)
  readsize -- size of read buffer                  (Default 1048576)
external_table -- use external table for load; NOT_USED, GENERATE_ONLY, EXECUTE 
 (Default NOT_USED)
columnarrayrows -- number of rows for direct path column array  (Default 5000)
streamsize -- size of direct path stream buffer in bytes  (Default 256000)
multithreading -- use multithreading in direct path  
resumable -- enable or disable resumable for current session  (Default FALSE)
resumable_name -- text string to help identify resumable statement
resumable_timeout -- wait time (in seconds) for RESUMABLE  (Default 7200)
date_cache -- size (in entries) of date conversion cache  (Default 1000)

PLEASE NOTE: Command-line parameters may be specified either by position or by keywords.
An example of the former case is sqlldr scott/tiger foo; an example of the latter 
is sqlldr control=foo userid=scott/tiger.One may specify parameters by position before
but not after parameters specified by keywords.For example, sqlldr scott/tiger control=foo
logfile=log is allowed, but sqlldr scott/tiger control=foo log is not, even though the
position of the parameter log is correct.
 

 

 

b, sqlldr 将文本文件的导入到数据库

这里看个简单例子。看看sqlldr到底怎么工作的。

1,准备数据文件,例如input.txt.这个文件将导入到数据库中。

首先查看我们数据库的表格式。

create table student(
SNAME VARCHAR(20),
SAGE INTEGER,
SEMAIL VARCHAR(20),
SPHONE VARCHAR(20),
SADDRESS VARCHAR(20)
)

input.txt 文件

12,12,abc@gmail.com,12,address
13,13,abc@gmail.com,13,address
14,14,abc@gmail.com,14,address
15,15,abc@gmail.com,15,address
16,16,abc@gmail.com,16,address
17,17,abc@gmail.com,17,address
18,18,abc@gmail.com,18,address
19,19,abc@gmail.com,19,address

2,控制文件input.ctl

load data
infile input.txt
append into table student   --这里用的Append.
fields terminated by ","   --这里表示逗号分割。
(SNAME,SAGE,SEMAIL,SPHONE,SADDRESS)

这里用的Append, 追加数据,还有几个其他的参数:

   a,insert,为缺省方式,在数据装载开始时要求表为空  

   b,append,在表中追加新记录   

   c ,replace,删除旧记录,替换成新装载的记录

   d,truncate,同上  

 

3,sqlldr 调用控制文件

sqlldr username/password@Database control =input.ctl             //input.ctl 为控制文件

在这里需要提下,这里是会生成日志文件,默认为文件名文件名+.log. 当前为 input.log

如果执行失败了,会生成bad file. 如果在当前执行中错误,会生成input.bad file。 

 下面指定Log 和bad 文件,当然可以加上路径

sqlldr userid=username/password@database control=input.ctl log=input.log bad=input.bad  SILENT=(HEADER, FEEDBACK)

 

SILENT=(HEADER, FEEDBACK) 控制端不显示信息,例如下面的信息将不再控制端显示。只在日志文件中

Record 4: Rejected - Error on table EMP
ORA-00001: unique constraint <name> violated

 

 

当然是可以显示指定的。

load data
infile input.txt
badfile  t.bad
discardfile t.dsc
append into table student
fields terminated by ","
(SNAME,SAGE,SEMAIL,SPHONE,SADDRESS)

 

 看看日志文件:input.log

SQL*Loader: Release 10.2.0.1.0 - Production on Tue May 20 17:36:52 2014

Copyright (c) 1982, 2005, Oracle.  All rights reserved.

Control File:   input1.ctl
Data File:      input1.ctl
  Bad File:     input1.bad
  Discard File:  none specified
 
 (Allow all discards)

Number to load: ALL
Number to skip: 0
Errors allowed: 50
Bind array:     64 rows, maximum of 256000 bytes
Continuation:    none specified
Path used:      Conventional

Table STUDENT, loaded from every logical record.
Insert option in effect for this table: APPEND

   Column Name                  Position   Len  Term Encl Datatype
------------------------------ ---------- ----- ---- ---- ---------------------
SNAME                               FIRST     *   ,       CHARACTER            
SAGE                                 NEXT     *   ,       CHARACTER            
SEMAIL                               NEXT     *   ,       CHARACTER            
SPHONE                               NEXT     *   ,       CHARACTER            
SADDRESS                             NEXT     *   ,       CHARACTER            


Table STUDENT:
  1 Row successfully loaded.
  0 Rows not loaded due to data errors.
  0 Rows not loaded because all WHEN clauses were failed.
  0 Rows not loaded because all fields were null.


Space allocated for bind array:                  82560 bytes(64 rows)
Read   buffer bytes: 1048576

Total logical records skipped:          0
Total logical records read:             1
Total logical records rejected:         0
Total logical records discarded:        0

Run began on Tue May 20 17:36:52 2014
Run ended on Tue May 20 17:36:52 2014

Elapsed time was:     00:00:00.05
CPU time was:         00:00:00.04
View Code

 

4,查看数据库

 

到此一个简单的例子完成,从一个文本文件导入到数据库。

文件可以为不同格式文件,.dat,.csv都可以的。

 

C,sqlldr直接在控制文件中导入数据。

 

load data
infile *
append into table student
fields terminated by ","
(SNAME,SAGE,SEMAIL,SPHONE,SADDRESS)
begindata
20,20,abc@gmail.com,20,address  --这里是数据

 

 

D,当文件数据是以绝对位置分开的,我们可以直接截取。当然,截取的开始与结束必须小心了。

load data
infile t.dat
append into table student
(SNAME  position(01:20),
 SAGE position(21:23) ,
 SEMAIL position(41:60),
 SPHONE position(61:80),
 SADDRESS position(81:100)
 )

 

t.dat 文件

Jack                12                  abc@gmail.com       134998879           Singapore
Jack2               12                  abc@gmail.com       134998879           Singapore
Jack3               12                  abc@gmail.com       134998879           Singapore
Jack4               12                  abc@gmail.com       134998879           Singapore
Jack5               12                  abc@gmail.com       134998879           Singapore
Jack6               12                  abc@gmail.com       134998879           Singapore
Jack7               12                  abc@gmail.com       134998879           Singapore

 

还数据在Load to database 的时候,load的数据是可以改变的。

LOAD DATA
  INFILE *
  INTO TABLE modified_data
  (  rec_no                      "my_db_sequence.nextval",
     region                      CONSTANT 31,
     time_loaded                 "to_char(SYSDATE, HH24:MI)",
     data1        POSITION(1:5)  ":data1/100",
     data2        POSITION(6:15) "upper(:data2)",
     data3        POSITION(16:22)"to_date(:data3, YYMMDD)"
  )
BEGINDATA
11111AAAAAAAAAA991201
22222BBBBBBBBBB990112

 

 

这里有很多命令的解释

这里有很多问题的回答(FAQ)

简单实现几个例子,稍后有时间添加多点理论知识,再边学习边完善了。