python beautifulsoup bs4爬虫爬取糗事百科 - 程序员工具箱

2000万优秀解决方案库，覆盖所有编程及软件开发类，极速查询

今日已更新 352 篇代码解决方案

首页 > 代码库 > python beautifulsoup bs4爬虫爬取糗事百科

python beautifulsoup bs4爬虫爬取糗事百科

2024-09-08 22:51:12 216人阅读

声明：仅用于学习语法，请勿用于非法用途
import urllib.request
import re
from bs4 import BeautifulSoup
# -*- coding:utf-8 -*-
url = ‘http://www.qiushibaike.com/hot/‘
user_agent=‘Mozilla/4.0 (compatible; MSIE 5.5; Windows NT)‘
headers={‘User-Agent‘:user_agent}
request = urllib.request.Request(url=url,headers=headers)
response = urllib.request.urlopen(request)
bsobj = BeautifulSoup(response.read(), "html5lib")
#content = response.read().decode(‘utf-8‘)
#print(bsobj)
nameList = bsobj.find_all("div", {"class":"content"})
for name in nameList:
print(name.get_text())
input_enter = str(input())
if input_enter ==‘‘:
continue

本文出自 “净空蓝星” 博客，谢绝转载！

python beautifulsoup bs4爬虫爬取糗事百科

声明：以上内容来自用户投稿及互联网公开渠道收集整理发布，本网站不拥有所有权，未作人工编辑处理，也不承担相关法律责任，若内容有误或涉及侵权可进行投诉：投诉/举报工作人员会在5个工作日内联系你，一经查实，本站将立刻删除涉嫌侵权内容。

联系
我们