Python web scraping: parsel, Scrapy's built-in parsing library, for parsing XML and HTML with CSS and XPath
Documentation
- https://pypi.org/project/parsel/
- https://github.com/scrapy/parsel
Installation
pip install parsel
Code example
from parsel import Selector

# Build a selector from an HTML string
selector = Selector(text="""<html>
    <body>
        <h1>Hello, Parsel!</h1>
        <ul>
            <li><a href="http://example.com">Link 1</a></li>
            <li><a href="http://scrapy.org">Link 2</a></li>
        </ul>
    </body>
</html>""")

# CSS selector: text of the first <h1>
print(selector.css('h1::text').get())
# Hello, Parsel!

# XPath plus a regular expression applied to the matched text
print(selector.xpath('//h1/text()').re(r'\w+'))
# ['Hello', 'Parsel']

# CSS and XPath can be chained: iterate over <li> nodes, then read each href
for li in selector.css('ul > li'):
    print(li.xpath('.//@href').get())
# http://example.com
# http://scrapy.org
