python：lxml中etree方法获取中文乱码的问题_wx_1993_see

http://blog.sina.com.cn/u/2651863955

首页博文目录关于我

个人资料

微博

加好友发纸条

写留言加关注

博客等级：
博客积分：

博客访问：
关注人气：
获赠金笔：0支
赠出金笔：0支
荣誉徽章：

正文字体大小：大中小

python：lxml中etree方法获取中文乱码的问题

(2017-06-07 21:04:17)

分类： python

如果响应html文件中存在中文，那么下面的代码运行就会输出乱码

html 
= 
etree.HTML(text)

result 
= 
etree.tostring(html)

改成以下即可

html 
= 
etree.HTML(text)


result 
= 
etree.tostring(html，encoding="utf-8",pretty_print=True,method="html")


其中pretty_print是关于输出格式的参数，encoding定义了编码方式

阅读┊ 收藏 ┊ 喜欢 ▼ ┊打印┊举报/Report

前一篇：[转载]关于IEEE文章格式问题的解决方法

后一篇：编码问题：UnicodeEncodeError: 'gbk' codec can't encode c

新浪BLOG意见反馈留言板　欢迎批评指正