Scrape Center爬虫平台之spa6案例

编程入门 行业动态 更新时间:2024-10-11 13:19:40

Scrape Center<a href=https://www.elefans.com/category/jswz/34/1770264.html style=爬虫平台之spa6案例"/>

Scrape Center爬虫平台之spa6案例

先看明白Scrape Center爬虫平台之spa2案例




import requests
import time
import hashlib
import base64
def getHTMLText(url):try:r=requests.get(url,timeout=60)r.raise_for_status()r.encoding='utf-8'return r.json()except:pass
for j in range(10):#1:时间戳取整    t=int(time.time())#2:SHA1加密    s1 = f"/api/movie,{t}"o = hashlib.sha1(s1.encode("utf-8")).hexdigest()    s2=f'{o},{t}'s3=s2.encode('utf-8')#3:Base64加密token=base64.b64encode(s3)#4:bytes转strtoken=token.decode()url=f"/api/movie/?limit=10&offset={j*10}&token={token}"html=getHTMLText(url)for i in range(10):print(html['results'][i]['id'],html['results'][i]['name'])
#异步爬取详情页
import requests
import time
t1=time.time()
import hashlib
import base64
import asyncio
import aiohttp
def getURL(b):a="ef34#teuq0btua#(-57w1q5o5--j@98xygimlyfxs*-!i-0-mb"c=f"{a}{b}"url_id=base64.b64encode(c.encode('utf-8'))url_id=url_id.decode()#1:时间戳取整    t=int(time.time())#2:SHA1加密    s1 = f"/api/movie/{url_id},{t}"o = hashlib.sha1(s1.encode("utf-8")).hexdigest()    s2=f'{o},{t}'s3=s2.encode('utf-8')#3:Base64加密token=base64.b64encode(s3)#4:bytes转strtoken=token.decode()url=f"/api/movie/{url_id}/?token={token}"return(url)
async def get(session, queue):while True:try:page = queue.get_nowait()except asyncio.QueueEmpty:returnurl = getURL(page)resp = await session.get(url,timeout=60)html=await resp.json(encoding='utf-8')print(html['id'],html['drama'])        
async def main():async with aiohttp.ClientSession() as session:queue = asyncio.Queue()for page in range(1,101):queue.put_nowait(page)tasks = []for _ in range(100):task = get(session, queue)tasks.append(task)await asyncio.wait(tasks)
loop = asyncio.get_event_loop()
loop.run_until_complete(main())
print(time.time()-t1)#约6秒

更多推荐

Scrape Center爬虫平台之spa6案例

本文发布于:2024-02-07 06:27:32,感谢您对本站的认可!
本文链接:https://www.elefans.com/category/jswz/34/1754155.html
版权声明:本站内容均来自互联网,仅供演示用,请勿用于商业和其他非法用途。如果侵犯了您的权益请与我们联系,我们将在24小时内删除。
本文标签:爬虫   案例   平台   Scrape   Center

发布评论

评论列表 (有 0 条评论)
草根站长

>www.elefans.com

编程频道|电子爱好者 - 技术资讯及电子产品介绍!