Python中BeautifulSoup中对HTML标签的提取

首页 > 代码库 > Python中BeautifulSoup中对HTML标签的提取

Python中BeautifulSoup中对HTML标签的提取

2024-08-31 17:08:33 219人阅读

一开始使用了beautifulSoup的get_text()进行字符串的提取，后来一直提取失败，并提示错误为TypeError: ‘NoneType‘ object is not callable

返回了none类型，可能是对Span标签内容的提取产生错误，于是采用name.string进行字符的提取，成功。

# -*- coding: utf-8 -*-
"""
Created on Wed Jan 11 17:21:54 2017

@author: PE-Monitor
"""
import urllib2
import BeautifulSoup
import sys

reload(sys)
sys.setdefaultencoding(‘utf-8‘)
responce = urllib2.urlopen("http://www.pythonscraping.com/pages/warandpeace.html")
html =BeautifulSoup.BeautifulSoup(responce)
nameList=html.findAll(‘span‘,{‘class‘:{‘green‘}})
for name in nameList:
     print(name.string)

Python中BeautifulSoup中对HTML标签的提取

声明：以上内容来自用户投稿及互联网公开渠道收集整理发布，本网站不拥有所有权，未作人工编辑处理，也不承担相关法律责任，若内容有误或涉及侵权可进行投诉：投诉/举报工作人员会在5个工作日内联系你，一经查实，本站将立刻删除涉嫌侵权内容。

联系
我们

首页 > 代码库 > Python中BeautifulSoup中对HTML标签的提取

Python中BeautifulSoup中对HTML标签的提取

看完仍有疑问？有类似问题直接问程序猿