Python爬虫:scrapy内置网页解析库parsel-通过css和xpath解析xml、html
导读:文档https://pypi.org/project/parsel/https://github.com/scrapy/parsel安装pip install parsel代码示例from parsel import Selector s...
文档
- https://pypi.org/project/parsel/
- https://github.com/scrapy/parsel
安装
pip install parsel
代码示例
from parsel import Selector selector = Selector(text="""html> body> h1> Hello, Parsel!/h1> ul> li> a href="http://example.com"> Link 1/a> /li> li> a href="http://scrapy.org"> Link 2/a> /li> /ul> /body> /html> """) selector.css('h1::text').get() 'Hello, Parsel!' selector.xpath('//h1/text()').re(r'\w+') ['Hello', 'Parsel'] for li in selector.css('ul > li'): print(li.xpath('.//@href').get()) http://example.com http://scrapy.org
声明:本文内容由网友自发贡献,本站不承担相应法律责任。对本内容有异议或投诉,请联系2913721942#qq.com核实处理,我们将尽快回复您,谢谢合作!
若转载请注明出处: Python爬虫:scrapy内置网页解析库parsel-通过css和xpath解析xml、html
本文地址: https://pptw.com/jishu/7922.html