处理音频--pyaudio

首页 > 代码库 > 处理音频--pyaudio

2024-09-21 02:40:11 224人阅读

- 前言
- 安装
- 读写音频文件
  - play
  - record
  - wired
  - playcallback
  - Wirecallback
- 外部应用
- 总结

前言

很久之前写过一个将文本转成语音的，借助了一个名为pyttsx的库。具体使用可以参考下面的链接。

http://blog.csdn.net/marksinoberg/article/details/52137547

今天再来分享一个处理音频的博文。接住百度的语音接口，差不多可以方便的将音频转成文字了。

安装

安装的过程比较麻烦一点，不是说安装的步骤，而是找到能用的库不是很容易。

目标库： pyaudio。

但是奈何我的Python版本是36，而pip是安装不了的。找了很多资料，最后还是在pypi上找到了兼容的版本。

Python36版本： https://pypi.python.org/pypi/PyAudio/0.2.11
Python35 Python2：http://people.csail.mit.edu/hubert/pyaudio/packages/

读写音频文件

官网上给了几个小例子，个人觉得不错。拿来分享一下。

play

"""PyAudio Example: Play a WAVE file."""

import pyaudio
import wave
import sys

CHUNK = 1024

if len(sys.argv) < 2:
    print("Plays a wave file.\n\nUsage: %s filename.wav" % sys.argv[0])
    sys.exit(-1)

wf = wave.open(sys.argv[1], ‘rb‘)

p = pyaudio.PyAudio()

stream = p.open(format=p.get_format_from_width(wf.getsampwidth()),
                channels=wf.getnchannels(),
                rate=wf.getframerate(),
                output=True)

data = http://www.mamicode.com/wf.readframes(CHUNK)"hljs-keyword">while data != ‘‘:
    stream.write(data)
    data = http://www.mamicode.com/wf.readframes(CHUNK)>



保存为play.py  

然后可以再terminal中来尝试一下。当然了，前提是先准备一个 .wav音频文件。



Python play.py 3.12.wav

然后不出意外的话，就可以听到电脑播放的音频了。



record

有了读的，那么再来个记录的吧。



"""PyAudio example: Record a few seconds of audio and save to a WAVE file."""

import pyaudio
import wave

CHUNK = 1024
FORMAT = pyaudio.paInt16
CHANNELS = 2
RATE = 44100
RECORD_SECONDS = 5
WAVE_OUTPUT_FILENAME = "output.wav"

p = pyaudio.PyAudio()

stream = p.open(format=FORMAT,
                channels=CHANNELS,
                rate=RATE,
                input=True,
                frames_per_buffer=CHUNK)

print("* recording")

frames = []

for i in range(0, int(RATE / CHUNK * RECORD_SECONDS)):
    data = http://www.mamicode.com/stream.read(CHUNK)"hljs-string">"* done recording")

stream.stop_stream()
stream.close()
p.terminate()

wf = wave.open(WAVE_OUTPUT_FILENAME, ‘wb‘)
wf.setnchannels(CHANNELS)
wf.setsampwidth(p.get_sample_size(FORMAT))
wf.setframerate(RATE)
wf.writeframes(b‘‘.join(frames))
wf.close()

保存为record.py 

然后运行下面的命令。



Python record.py

代码中声明的是5秒的记录时长，这一点可以根据自己的需要来进行动态的修改。然后程序运行结束之后，就会在同一级目录下得到一个output.wav 的音频文件。



wired

刚才那俩小例子要么一个读，要么一个记录。那么要是既想读，然后再看下结果的需求呢？可以这么来实现。



"""
PyAudio Example: Make a wire between input and output (i.e., record a
few samples and play them back immediately).
"""

import pyaudio

CHUNK = 1024
WIDTH = 2
CHANNELS = 2
RATE = 44100
RECORD_SECONDS = 5

p = pyaudio.PyAudio()

stream = p.open(format=p.get_format_from_width(WIDTH),
                channels=CHANNELS,
                rate=RATE,
                input=True,
                output=True,
                frames_per_buffer=CHUNK)

print("* recording")

for i in range(0, int(RATE / CHUNK * RECORD_SECONDS)):
    data = http://www.mamicode.com/stream.read(CHUNK)"hljs-string">"* done")

stream.stop_stream()
stream.close()

p.terminate()

保存为wire.py 

然后运行下面的命令



Python wire.py

就可以记录一个5秒的实现了。



play(callback)

搞定了上面三个小例子，做出自己的东西也不是什么难事了。基本上可以满足自己的需求。但是官网上还给了更加优雅的方式，那就是使用回调函数。除此之外很重要的一点就是callback方式是noblocking的。 

官网的api解释如下：



Note that in “blocking mode”, each pyaudio.Stream.write() or pyaudio.Stream.read() blocks until all the given/requested frames have been played/recorded. Alternatively, to generate audio data on the fly or immediately process recorded audio data, use the “callback mode” outlined below.



"""
PyAudio Example: Make a wire between input and output (i.e., record a
few samples and play them back immediately).
"""

import pyaudio

CHUNK = 1024
WIDTH = 2
CHANNELS = 2
RATE = 44100
RECORD_SECONDS = 5

p = pyaudio.PyAudio()

stream = p.open(format=p.get_format_from_width(WIDTH),
                channels=CHANNELS,
                rate=RATE,
                input=True,
                output=True,
                frames_per_buffer=CHUNK)

print("* recording")

for i in range(0, int(RATE / CHUNK * RECORD_SECONDS)):
    data = http://www.mamicode.com/stream.read(CHUNK)"hljs-string">"* done")

stream.stop_stream()
stream.close()

p.terminate()

其实也没啥特殊的地方了，就是代码看起来更加精简了。



Wire(callback)



"""
PyAudio Example: Make a wire between input and output (i.e., record a
few samples and play them back immediately).

This is the callback (non-blocking) version.
"""

import pyaudio
import time

WIDTH = 2
CHANNELS = 2
RATE = 44100

p = pyaudio.PyAudio()

def callback(in_data, frame_count, time_info, status):
    return (in_data, pyaudio.paContinue)

stream = p.open(format=p.get_format_from_width(WIDTH),
                channels=CHANNELS,
                rate=RATE,
                input=True,
                output=True,
                stream_callback=callback)

stream.start_stream()

while stream.is_active():
    time.sleep(0.1)

stream.stop_stream()
stream.close()

p.terminate()
License



外部应用

下面拿一个小例子入手，实时的测试一下音频转文字。因为本人没有百度语音服务的权限，所以在网上搜索了一个key。在这里感谢下面的这个链接。

https://github.com/luyishisi/python_yuyinduihua

话不多说， 上例子吧。



# coding: utf8

# @Author: 郭 璞
# @File: baiduyuyinshibie.py                                                                 
# @Time: 2017/5/10                                   
# @Contact: 1064319632@qq.com
# @blog: http://blog.csdn.net/marksinoberg
# @Description: 百度语音识别接口调用
import wave
import requests
import json

def get_token():
    apiKey = "。。。GBOtpg22ZSGAU"
    secretKey = "44。。。e34936227d4a19dc2"

    auth_url = "https://openapi.baidu.com/oauth/2.0/token?grant_type=client_credentials&client_id=" + apiKey + "&client_secret=" + secretKey
    response = requests.get(url=auth_url)
    jsondata = http://www.mamicode.com/response.text"hljs-keyword">return json.loads(jsondata)[‘access_token‘]

def use_cloud(token, wavefile):
    fp = wave.open(wavefile, ‘rb‘)
    # 已经录好音的音频片段内容
    nframes = fp.getnframes()
    filelength = nframes*2
    audiodata = http://www.mamicode.com/fp.readframes(nframes)"hljs-comment"># 百度语音接口的产品ID
    cuid = ‘71XXXX663‘
    server_url = ‘http://vop.baidu.com/server_api‘ + ‘?cuid={}&token={}‘.format(cuid, token)
    headers = {
        ‘Content-Type‘: ‘audio/pcm; rete=8000‘,
        ‘Content-Length‘: ‘{}‘.format(filelength),
    }

    response = requests.post(url=server_url, headers=headers, data=http://www.mamicode.com/audiodata)"hljs-keyword">return response.text if response.status_code==200 else ‘Something Wrong!‘



if __name__ == ‘__main__‘:
    access_token = get_token()
    print(access_token)
    result = use_cloud(token=access_token, wavefile=‘./output.wav‘)
    print(result)



音频文件借助了上面第二个例子中录制的音频。



我说的话是： 345
# 由于在图书馆，所以不敢太大声，要是用标准的普通话，相信准确度可能会更高一点。

然后运行的结果就是下面这样的了。 



可以看出请求成功，返回的结果里面包含了相应的文本内容。虽然不是很准确，但是也还算可以吧。



总结

最后来总结一下，今天貌似净拿人家的东西了，虽然自己整合了一下。但是还远远不够啊。其实结合这个语音接口可以做出很多更加好玩的功能的。



参考链接：

聊天机器人

GitHub机器人聊天参考源码

pyaudio官方文档

pyaudio官网

PyPI宝库

        <script  type="text/javascript">            $(function () {                $(‘pre.prettyprint code‘).each(function () {                    var lines = $(this).text().split(‘\n‘).length;                    var $numbering = $(‘‘).addClass(‘pre-numbering‘).hide();                    $(this).addClass(‘has-numbering‘).parent().append($numbering);                    for (i = 1; i <= lines; i++) {                        $numbering.append($(‘‘).text(i));                    };                    $numbering.fadeIn(1700);                });            });        </script>处理音频--pyaudio



script art asc .com llb 


 声明：以上内容来自用户投稿及互联网公开渠道收集整理发布，本网站不拥有所有权，未作人工编辑处理，也不承担相关法律责任，若内容有误或涉及侵权可进行投诉： 投诉/举报 工作人员会在5个工作日内联系你，一经查实，本站将立刻删除涉嫌侵权内容。









 看完仍有疑问？有类似问题直接问程序猿









  
    
      
        投诉/举报
        
      
      
        




       
          
            
            
            
            
          
        
       
           您的姓名
          
            
            
            
          
          
        
        
           反馈内容






 相关代码解决方案


 音频处理 音频处理平台 python的音频处理库 python音频处理用到的操作 python音频处理相关类库 音频处理之语音加速播放 数字信号处理与音频处理（使用Audition） 介绍几个python的音频处理库 python处理音频文件(mp3) 如何把视频中的音频处理掉 音视频开发音频处理技术 python处理音频文件(mp3) 音频 PCM音频编码格式详解 *音频-2 音频播放 移动音频 音频控制 HTML 音频 音频测试 Android音频文件浏览+音频播放 (转)同步音频--同步音频到视频 音频特征提取——常用音频特征 Cocos2d-x中使用音频CocosDenshion引擎介绍与音频文件的预处理 Cocos2d-x中使用音频CocosDenshion引擎介绍与音频文件的预处理 Cocos2d-x中使用音频CocosDenshion引擎介绍与音频文件的预处理 Android中的音频处理------SoundPool,MediaRecorder,MediaPlayer以及RingStone总结 iOS开发拓展篇—音频处理（音乐播放器3） iOS开发拓展篇—音频处理（音乐播放器4） iOS开发拓展篇—音频处理（音乐播放器5）


当日更新
 Spring-MVC理解之一：应用上下
 绘制SVG内容到Canvas的HTML5
 Linux运维常用的 MySQL基础
 [SAP ABAP开发技术总结]权限
 推荐一款分页插件
 苹果手机  disabled 的背景
 C#程序遍历数组A中所有元素
 Linux suse初安装
 Evaluate Reverse Polish No
 wxWidgets第十四课 wxTimer