实时奥运金牌榜"/>
5行代码获取实时奥运金牌榜
如有需要:
pip install requests
pip install beautifulsoup4
代码如下:
from bs4 import BeautifulSoup
import re
import requests
for i in BeautifulSoup(requests.get(r'/%E5%A5%96%E7%89%8C%E6%A6%9C').text, "html.parser").select_one('.rank-list').select('a')[:10]:print(re.split(r'(\d+)\s+([\u4e00-\u9fa5]+)\s+(\d+)\s+(\d+)\s+(\d+)\s+(\d+)', i.text.replace('\n', ' '))[1:-1])
写成5行不利于阅读:
展开如下:
from bs4 import BeautifulSoup
import re
import requestsurl = r'/%E5%A5%96%E7%89%8C%E6%A6%9C'
bs = BeautifulSoup(requests.get(url).text, "html.parser")
rank_list = bs.select_one('.rank-list')pattern = r'(\d+)\s+([\u4e00-\u9fa5]+)\s+(\d+)\s+(\d+)\s+(\d+)\s+(\d+)'
for i in rank_list.select('a')[:10]:print(re.split(pattern, i.text.replace('\n', ' '))[1:-1])
数据引用百度体育:
/%E5%A5%96%E7%89%8C%E6%A6%9C/from/pc
更多推荐
5行代码获取实时奥运金牌榜
发布评论