首页 > 代码库 > Python 文件操作
Python 文件操作
一、Python文件I/O
打印到屏幕
>>> print("Python 是一个非常棒的语言,不是吗?") Python 是一个非常棒的语言,不是吗? >>> >>> import sys >>> sys.stdout.write("Python 是一个非常棒的语言,不是吗?") Python 是一个非常棒的语言,不是吗?21 >>>
读取键盘输入
>>> str = input(‘please input something here: ‘) # python3.x 相当于Python2.x的raw_input() please input something here: something >>> print(‘the content you put is : ‘, str) the content you put is : something >>>
打开和关闭文件
open 函数
你必须先用Python内置的open()函数打开一个文件,创建一个file对象,相关的方法才可以调用它进行读写。
语法:
file object = open(file_name [, access_mode][, buffering])
各个参数的细节如下:
- file_name:file_name变量是一个包含了你要访问的文件名称的字符串值。
- access_mode:access_mode决定了打开文件的模式:只读,写入,追加等。所有可取值见如下的完全列表。这个参数是非强制的,默认文件访问模式为只读(r)。
- buffering:如果buffering的值被设为0,就不会有寄存。如果buffering的值取1,访问文件时会寄存行。如果将buffering的值设为大于1的整数,表明了这就是的寄存区的缓冲大小。如果取负值,寄存区的缓冲大小则为系统默认。
1 Somehow, it seems the love I knew was always the most destructive kind 2 不知为何,我经历的爱情总是最具毁灭性的的那种 3 Yesterday when I was young 4 昨日当我年少轻狂 5 The taste of life was sweet 6 生命的滋味是甜的 7 As rain upon my tongue 8 就如舌尖上的雨露 9 I teased at life as if it were a foolish game 10 我戏弄生命 视其为愚蠢的游戏 11 The way the evening breeze 12 就如夜晚的微风 13 May tease the candle flame 14 逗弄蜡烛的火苗 15 The thousand dreams I dreamed 16 我曾千万次梦见 17 The splendid things I planned 18 那些我计划的绚丽蓝图 19 I always built to last on weak and shifting sand 20 但我总是将之建筑在易逝的流沙上 21 I lived by night and shunned the naked light of day 22 我夜夜笙歌 逃避白昼赤裸的阳光 23 And only now I see how the time ran away 24 事到如今我才看清岁月是如何匆匆流逝 25 Yesterday when I was young 26 昨日当我年少轻狂 27 So many lovely songs were waiting to be sung 28 有那么多甜美的曲儿等我歌唱 29 So many wild pleasures lay in store for me 30 有那么多肆意的快乐等我享受 31 And so much pain my eyes refused to see 32 还有那么多痛苦 我的双眼却视而不见 33 I ran so fast that time and youth at last ran out 34 我飞快地奔走 最终时光与青春消逝殆尽 35 I never stopped to think what life was all about 36 我从未停下脚步去思考生命的意义 37 And every conversation that I can now recall 38 如今回想起的所有对话 39 Concerned itself with me and nothing else at all 40 除了和我相关的 什么都记不得了 41 The game of love I played with arrogance and pride 42 我用自负和傲慢玩着爱情的游戏 43 And every flame I lit too quickly, quickly died 44 所有我点燃的火焰都熄灭得太快 45 The friends I made all somehow seemed to slip away 46 所有我交的朋友似乎都不知不觉地离开了 47 And only now I‘m left alone to end the play, yeah 48 只剩我一个人在台上来结束这场闹剧 49 Oh, yesterday when I was young 50 噢 昨日当我年少轻狂 51 So many, many songs were waiting to be sung 52 有那么那么多甜美的曲儿等我歌唱 53 So many wild pleasures lay in store for me 54 有那么多肆意的快乐等我享受 55 And so much pain my eyes refused to see 56 还有那么多痛苦 我的双眼却视而不见 57 There are so many songs in me that won‘t be sung 58 我有太多歌曲永远不会被唱起 59 I feel the bitter taste of tears upon my tongue 60 我尝到了舌尖泪水的苦涩滋味 61 The time has come for me to pay for yesterday 62 终于到了付出代价的时间 为了昨日 63 When I was young 64 当我年少轻狂
#!/usr/bin/evn python # -*- coding:utf-8 -*- # Author: antcolonies # data = http://www.mamicode.com/open(‘yesterday1.lrc‘) # 默认为系统的编码格式 # UnicodeDecodeError: ‘gbk‘ codec can‘t decode byte 0x80 in position 106: illegal multibyte sequence # data = http://www.mamicode.com/open(‘yesterday1.lrc‘,encoding=‘utf-8‘) # data: <_io.TextIOWrapper name=‘yesterday1.lrc‘ mode=‘r‘ encoding=‘utf-8‘> f = open(‘yesterday1.lrc‘,encoding=‘utf-8‘) # 内存对象f,在文件操作中称为句柄 print(f) # f: <_io.TextIOWrapper name=‘yesterday1.lrc‘ mode=‘r‘ encoding=‘utf-8‘> # 默认打开的模式为‘读‘ data = http://www.mamicode.com/f.read() # 读取整个文件 data2 = f.read() print(id(data),id(data2)) # 42019856 6715464 f.write(‘Love u‘) # io.UnsupportedOperation: not writable f = open(‘yesterday2.lrc‘,‘w‘,encoding=‘utf-8‘) # print(f) <_io.TextIOWrapper name=‘yesterday2.lrc‘ mode=‘w‘ encoding=‘utf-8‘> # ‘w‘模式下打开会创建一个yesterday2.lrc名称的文件,若文件存在,则覆盖原文件 data = http://www.mamicode.com/f.read() # io.UnsupportedOperation: not readable f.write(‘Love u\n‘) f.write(‘Hate u‘) f = open(‘yesterday2.lrc‘,‘a‘,encoding=‘utf-8‘) print(f) # f: <_io.TextIOWrapper name=‘yesterday2.lrc‘ mode=‘a‘ encoding=‘utf-8‘> # a(append) 末尾追加, 若文件不存在,则先创建,再追加 f.write(‘Love u\n‘) f.write(‘Hate u\n‘) f.write(‘when i was yong i listen to the radio\n‘) f.read() # io.UnsupportedOperation: not readable # 读取前5行 f = open(‘yesterday1.lrc‘,‘r‘,encoding=‘utf-8‘) for I in range(5): print(f.readline()) # f.readline() 读取一行 # 读取整个文件,第十行不打印 f = open(‘yesterday1.lrc‘,‘r‘,encoding=‘utf-8‘) print(f.readlines()) #读取整个文件,并以行为元素,按照文件内容顺序组成一个列表,比较适合读取小文件 [‘Somehow, it seems the love I knew was always the most destructive kind\n‘, ‘不知为何,我经历的爱情总是最具毁灭性的的那种\n‘, ‘Yesterday when I was young\n‘, ‘昨日当我年少轻狂\n‘, ‘The taste of life was sweet\n‘, ‘生命的滋味是甜的\n‘, ‘As rain upon my tongue\n‘, ‘就如舌尖上的雨露\n‘, ‘I teased at life as if it were a foolish game\n‘, ‘我戏弄生命 视其为愚蠢的游戏\n‘, ‘The way the evening breeze\n‘, ‘就如夜晚的微风\n‘, ‘May tease the candle flame\n‘, ‘逗弄蜡烛的火苗\n‘, ‘The thousand dreams I dreamed\n‘, ‘我曾千万次梦见\n‘, ‘The splendid things I planned\n‘, ‘那些我计划的绚丽蓝图\n‘, ‘I always built to last on weak and shifting sand\n‘, ‘但我总是将之建筑在易逝的流沙上\n‘, ‘I lived by night and shunned the naked light of day\n‘, ‘我夜夜笙歌 逃避白昼赤裸的阳光\n‘, ‘And only now I see how the time ran away\n‘, ‘事到如今我才看清岁月是如何匆匆流逝\n‘, ‘Yesterday when I was young\n‘, ‘昨日当我年少轻狂\n‘, ‘So many lovely songs were waiting to be sung\n‘, ‘有那么多甜美的曲儿等我歌唱\n‘, ‘So many wild pleasures lay in store for me\n‘, ‘有那么多肆意的快乐等我享受\n‘, ‘And so much pain my eyes refused to see\n‘, ‘还有那么多痛苦 我的双眼却视而不见\n‘, ‘I ran so fast that time and youth at last ran out\n‘, ‘我飞快地奔走 最终时光与青春消逝殆尽\n‘, ‘I never stopped to think what life was all about\n‘, ‘我从未停下脚步去思考生命的意义\n‘, ‘And every conversation that I can now recall\n‘, ‘如今回想起的所有对话\n‘, ‘Concerned itself with me and nothing else at all\n‘, ‘除了和我相关的 什么都记不得了\n‘, ‘The game of love I played with arrogance and pride\n‘, ‘我用自负和傲慢玩着爱情的游戏\n‘, ‘And every flame I lit too quickly, quickly died\n‘, ‘所有我点燃的火焰都熄灭得太快\n‘, ‘The friends I made all somehow seemed to slip away\n‘, ‘所有我交的朋友似乎都不知不觉地离开了\n‘, "And only now I‘m left alone to end the play, yeah\n", ‘只剩我一个人在台上来结束这场闹剧\n‘, ‘Oh, yesterday when I was young\n‘, ‘噢 昨日当我年少轻狂\n‘, ‘So many, many songs were waiting to be sung\n‘, ‘有那么那么多甜美的曲儿等我歌唱\n‘, ‘So many wild pleasures lay in store for me\n‘, ‘有那么多肆意的快乐等我享受\n‘, ‘And so much pain my eyes refused to see\n‘, ‘还有那么多痛苦 我的双眼却视而不见\n‘, "There are so many songs in me that won‘t be sung\n", ‘我有太多歌曲永远不会被唱起\n‘, ‘I feel the bitter taste of tears upon my tongue\n‘, ‘我尝到了舌尖泪水的苦涩滋味\n‘, ‘The time has come for me to pay for yesterday\n‘, ‘终于到了付出代价的时间 为了昨日\n‘, ‘When I was young\n‘, ‘当我年少轻狂‘] for index,line in enumerate(f.readlines()): # 枚举 if index == 9: print(‘-------------------优美的分割线-----------‘) continue print(line.strip()) # 对于处理大文件(比如大小为4G),f.readline()或f.readlines(),会将文件读出并存储于内存中,严重的消耗内存 f = open(‘yesterday1.lrc‘,‘r‘,encoding=‘utf-8‘) count = 0 for line in f: # 内存中始终存储一行,效率最高,此时f对象被视为迭代器 if count == 9: print(‘-------------------优美的分割线-----------‘) count += 1 continue print(line.strip()) count += 1 # f = open(‘yesterday2.lrc‘,‘r+‘,encoding=‘utf-8‘) # 读写 # f = open(‘yesterday2.lrc‘,‘w+‘,encoding=‘utf-8‘) # 写读 # f = open(‘yesterday2.lrc‘,‘a+‘,encoding=‘utf-8‘) # 追加读 # print(f.readline()) # print(f.readline()) # print(f.readline()) # print(f.tell()) # f.write(‘---------------perfect---------------\n‘) # f.write(‘---------------perfect---------------\n‘) # f.write(‘---------------perfect---------------\n‘) # f.write(‘---------------perfect---------------\n‘) # print(f.tell()) # f.seek(10) # print(f.readline()) # f.write(‘should be at the begining of the second line.\n‘) # f = open(‘yesterday2.lrc‘,‘rb‘,encoding=‘utf-8‘) # 二进制读 # ValueError: binary mode doesn‘t take an encoding argument # f = open(‘yesterday2.lrc‘,‘rb‘) # 二进制读 (网络传输) # print(f.readline()) # print(f.readline()) # print(f.readline()) ‘‘‘ b‘---------------perfect---------------\r\n‘ b‘---------------perfect---------------\r\n‘ b‘---------------perfect---------------\r\n‘ ‘‘‘ f = open(‘yesterday2.lrc‘,‘wb‘) # f.write(‘hello, binary‘) # TypeError: ‘str‘ does not support the buffer interface f.write(‘hello, binary\n‘.encode()) # 默认为‘utf-8‘ f.close() # 手动关闭文件
close()方法
File 对象的 close()方法刷新缓冲区里任何还没写入的信息,并关闭该文件,这之后便不能再进行写入。
当一个文件对象的引用被重新指定给另一个文件时,Python 会关闭之前的文件。用 close()方法关闭文件是一个很好的习惯。
File对象的属性
一个文件被打开后,你有一个file对象,你可以得到有关该文件的各种信息。
以下是和file对象相关的所有属性的列表:
属性 描述
file.closed 返回true如果文件已被关闭,否则返回false。
file.mode 返回被打开文件的访问模式。
file.name 返回文件的名称。
f = open(‘yesterday2.lrc‘,‘wb‘) print(f.closed) print(f.mode) print(f.name) ‘‘‘ False wb yesterday2.lrc ‘‘‘
读写文件
write()方法
write()方法可将任何字符串写入一个打开的文件。需要重点注意的是,Python字符串可以是二进制数据,而不是仅仅是文字。write()方法不会在字符串的结尾添加换行符(‘\n‘):
#!/usr/bin/evn python # -*- coding:utf-8 -*- # Author: antcolonies f = open(‘foo.txt‘,‘wb‘) # f.write(‘www.runoob.com!\nVery good site!\n‘) # TypeError: ‘str‘ does not support the buffer interface f.write(‘www.runoob.com!\nVery good site!\n‘.encode()) f.close()
read()方法
read()方法从一个打开的文件中读取一个字符串。需要重点注意的是,Python字符串可以是二进制数据,而不是仅仅是文字。
语法:
fileObject.read([count]);
在这里,被传递的参数是要从已打开文件中读取的字节计数。该方法从文件的开头开始读入,如果没有传入count,它会尝试尽可能多地读取更多的内容,很可能是直到文件的末尾。
#!/usr/bin/evn python # -*- coding:utf-8 -*- # Author: antcolonies f = open(‘foo.txt‘,‘r‘) print(f.read(10)) # www.runoob f.close()
文件定位
tell()方法告诉你文件内的当前位置;换句话说,下一次的读写会发生在文件开头这么多字节之后。
seek(offset)方法改变当前文件的位置。Offset变量表示相对于文件开始(0)要移动的字节数。
例子:
就用我们上面创建的文件foo.txt。
#!/usr/bin/evn python # -*- coding:utf-8 -*- # Author: antcolonies f = open(‘foo.txt‘,‘r‘,encoding=‘utf-8‘) print(f.tell()) print(f.read(10)) # www.runoob print(f.tell()) f.close()
重命名和删除文件
Python的os模块提供了帮你执行文件处理操作的方法,比如重命名和删除文件。
要使用这个模块,你必须先导入它,然后才可以调用相关的各种功能。
rename()方法
rename()方法需要两个参数,当前的文件名和新文件名。
语法:
os.rename(current_file_name, new_file_name)
例子:
#!/usr/bin/evn python # -*- coding:utf-8 -*- # Author: antcolonies import os # 重命名文件foo.txt到foo.lrc os.rename(‘foo.txt‘,‘foo.lrc‘)
remove()方法
你可以用remove()方法删除文件,需要提供要删除的文件名作为参数。
#!/usr/bin/evn python # -*- coding:utf-8 -*- # Author: antcolonies import os # 删除一个已经存在的文件 os.remove(‘foo.lrc‘)
Python目录的创建、修改和删除,目录的切换和当前目录的获取
os.mkdir()
os.rename()
os.getcwd()
os.chdir()
os.rmdir()
#!/usr/bin/evn python # -*- coding:utf-8 -*- # Author: antcolonies import os os.mkdir(r‘e:/python‘) # make directory in current directory os.rename(r‘e:/python‘,r‘e:/Python‘) print(os.getcwd()) # E:\python14_workspace\s14\day03 os.chdir(r‘F:\Music‘) # change directory print(os.getcwd()) # F:\Music (getcwd--get current working directory) os.rmdir(r‘e:/Python‘) # remove directory
二、Python file 方法
reference to : http://www.runoob.com/python/file-methods.html
1 #!/usr/bin/evn python 2 # -*- coding:utf-8 -*- 3 # Author: antcolonies 4 5 f = open(‘yesterday1.lrc‘,‘r‘,encoding=‘utf-8‘) 6 print(f.tell()) # f.tell() 查看目前光标所在位置 7 print(f.read()) # 默认读取整个文件(不包括换行符) 8 print(f.read(10)) # 从第一个字符开始,连续读取5个字符 9 print(f.tell()) 10 f.seek(0) 11 print(f.readline()) 12 print(f.tell()) 13 f.seek(10) # 跳转至第10个字符处 14 print(f.tell()) 15 print(f.readline()) # t seems the love I knew was always the most destructive kind 16 17 f.detach() 18 print(f.encoding) 19 print(f.fileno()) # 3,返回该文件在内存的编号(不是内存地址) 20 print(f.name) # yesterday1.lrc 21 print(f.seekable()) # 文件是否可跳转 22 print(f.readable()) 23 print(f.writable()) 24 25 print(dir(f.buffer)) 26 f.flush() # 将写入给文件(尚在缓存)的内容强制刷新写入硬盘 27 import sys,time 28 for i in range(20): 29 sys.stdout.write(‘#‘) # 标准输出(往控制台打印且不换行) 30 sys.stdout.flush() 31 time.sleep(0.1) 32 33 print(f.closed) # 文件是否关闭 34 35 f = open(‘yesterday2.lrc‘,‘a‘,encoding=‘utf-8‘) 36 # io.UnsupportedOperation: File not open for writing 37 f.truncate() # 默认清空文件 38 f.seek(20) 39 print(f.tell()) 40 f.truncate(40) # 从文件0处截取40个字符 41 print(f.tell())
对文件操作流程
- 打开文件,得到文件句柄并赋值给一个变量
- 通过句柄对文件进行操作
- 关闭文件
打开文件的模式有:
- r,只读模式(默认)。
- w,只写模式。【不可读;不存在则创建;存在则删除内容;】
- a,追加模式。【可读; 不存在则创建;存在则只追加内容;】
"+" 表示可以同时读写某个文件
- r+,可读写文件。【可读;可写;可追加】
- w+,写读
- a+,同a
"U"表示在读取时,可以将 \r \n \r\n自动转换成 \n (与 r 或 r+ 模式同使用)
- rU
- r+U
"b"表示处理二进制文件(如:FTP发送上传ISO镜像文件,linux可忽略,windows处理二进制文件时需标注)
- rb
- wb
- ab
with语句
为了避免打开文件后忘记关闭,可以通过管理上下文,即:
with open(‘log‘,‘r‘) as f: ...
如此方式,当with代码块执行完毕时,内部会自动关闭并释放文件资源。
在Python 2.7 后,with又支持同时对多个文件的上下文进行管理,即:
with open(‘log1‘) as obj1, open(‘log2‘) as obj2: pass
1 #!/usr/bin/env python 2 # -*- coding:utf-8 -*- 3 # Author: antcolonies 4 5 import sys 6 7 f = open(‘yesterday2.lrc‘,‘r‘,encoding=‘utf-8‘) 8 f_new = open(‘yesterday2.bak‘,‘w‘,encoding=‘utf-8‘) 9 10 search_str = sys.argv[1] 11 replace_str = sys.argv[2] 12 13 for line in f: 14 if search_str in line: 15 line = line.replace(search_str,replace_str) 16 f_new.write(line) 17 18 f.close() 19 f_new.close()
1 #!/usr/bin/env python 2 # -*- coding:utf-8 -*- 3 # Author: antcolonies 4 5 import sys 6 7 search_str = sys.argv[1] 8 replace_str = sys.argv[2] 9 10 with open(‘yesterday2.lrc‘,‘r‘,encoding=‘utf-8‘) as f,11 open(‘yesterday2.lrc.bak‘,‘w‘,encoding=‘utf-8‘) as f_new: 12 for line in f: 13 if search_str in line: 14 line = line.replace(search_str,replace_str) 15 f_new.write(line)
程序练习
程序1: 实现简单的shell sed替换功能
1 #!/usr/local/Python-3.5.2/python 2 # -*- coding:utf-8 -*- 3 # antcolonies 4 5 import sys,os 6 7 search_str = sys.argv[1] 8 ori_str = sys.argv[2] 9 replace_str = sys.argv[3] 10 filename = sys.argv[4] 11 filename_1 = filename + ‘_1‘ 12 13 with open(filename,‘r‘,encoding=‘utf-8‘) as f ,14 open(filename_1,‘w‘,encoding=‘utf-8‘) as f_new: 15 for line in f: 16 if search_str in line: 17 line = line.replace(ori_str,replace_str) 18 f_new.write(line) 19 20 os.rename(filename_1,filename)
程序2:修改haproxy配置文件
1 1、查 2 输入:www.oldboy.org 3 获取当前backend下的所有记录 4 5 2、新建 6 输入: 7 arg = { 8 ‘backend‘: ‘www.oldboy.org‘, 9 ‘record‘:{ 10 ‘server‘: ‘100.1.7.9‘, 11 ‘weight‘: 20, 12 ‘maxconn‘: 30 13 } 14 } 15 16 3、删除 17 输入: 18 arg = { 19 ‘backend‘: ‘www.oldboy.org‘, 20 ‘record‘:{ 21 ‘server‘: ‘100.1.7.9‘, 22 ‘weight‘: 20, 23 ‘maxconn‘: 30 24 } 25 }
1 global 2 log 127.0.0.1 local2 3 daemon 4 maxconn 256 5 log 127.0.0.1 local2 info 6 defaults 7 log global 8 mode http 9 timeout connect 5000ms 10 timeout client 50000ms 11 timeout server 50000ms 12 option dontlognull 13 14 listen stats :8888 15 stats enable 16 stats uri /admin 17 stats auth admin:1234 18 19 frontend oldboy.org 20 bind 0.0.0.0:80 21 option httplog 22 option httpclose 23 option forwardfor 24 log global 25 acl www hdr_reg(host) -i www.oldboy.org 26 use_backend www.oldboy.org if www 27 28 backend www.oldboy.org 29 server 100.1.7.9 100.1.7.9 weight 20 maxconn 3000
1 #!/usr/local/Python-3.5.2/python 2 # -*- coding:utf-8 -*- 3 # antcolonies 4 5 import sys,os 6 7 filename = sys.argv[1] 8 9 def search(string): 10 flag = ‘‘ 11 with open(filename,‘r‘,encoding=‘utf-8‘) as f: 12 for line in f: 13 if string in line: 14 if len(line)==len(line.lstrip()): 15 flag = True 16 if flag: 17 print(line.rstrip()) 18 if line.strip() == ‘‘: 19 break 20 21 def create(arg): 22 with open(filename,‘r‘,encoding=‘utf-8‘) as f: 23 listfile = f.readlines() 24 lastline = listfile[-1] 25 arg_str = ‘%s %s %s %d %s %d‘ %(‘server‘,arg[‘record‘][‘server‘],‘weight‘,26 arg[‘record‘][‘weight‘],‘maxconn‘,arg[‘record‘][‘maxconn‘]) 27 if lastline.strip() == arg_str: 28 print(‘the record you put has already existed!‘) 29 return False 30 indent = len(lastline) - len(lastline.lstrip()) 31 indentstr = ‘‘ 32 for i in range(indent): 33 indentstr += ‘ ‘ 34 arg_str = ‘%s%s‘ %(indentstr,arg_str) 35 with open(filename,‘a‘,encoding=‘utf-8‘) as f: 36 f.write(‘%s%s %s %s %d %s %d‘ %(indentstr,‘server‘,arg[‘record‘][‘server‘],37 ‘weight‘,arg[‘record‘][‘weight‘],‘maxconn‘,arg[‘record‘][‘maxconn‘])) 38 39 40 def delete(arg): 41 filename_1 = filename + ‘_1‘ 42 record = ‘%s %s %s %d %s %d‘ %(‘server‘,arg[‘record‘][‘server‘],‘weight‘,43 arg[‘record‘][‘weight‘],‘maxconn‘,arg[‘record‘][ ‘maxconn‘]) 44 with open(filename,‘r‘,encoding=‘utf-8‘) as f,45 open(filename_1,‘w‘,encoding=‘utf-8‘) as f_new: 46 for line in f: 47 if record == line.strip(): 48 continue 49 f_new.write(line) 50 os.rename(filename_1,filename) 51 52 53 while True: 54 action = input(‘(search|create|delete)‘) 55 if action == ‘search‘: 56 record = input(‘select a title: ‘) 57 search(record) 58 elif action == ‘create‘: 59 arg = input(‘input an argv: ‘) 60 arg_list = eval(arg) 61 create(arg_list) 62 elif action == ‘delete‘: 63 arg = input(‘input an argv: ‘) 64 arg_list = eval(arg) 65 delete(arg_list) 66 else: 67 print(‘you put a wrong action!‘) 68 stat = input("q(quit) or any other keywords to continue: ") 69 if stat == ‘q‘: 70 break
Python 文件操作