[Python Crawler] Scraping Reviews for a Specific App from the iOS App Store

Updated: 2024-10-26 09:22:07
import time
from random import randint

import pandas as pd
import requests


### Settings
app_id='1288515333'  # ID of the app whose reviews are scraped
# app_id='1419975581'
reviews_count=450  # number of reviews to collect


### Scraping
all_list=[]
# Advance the offset by 10 per request until reviews_count is reached
for offset in range(0,reviews_count,10):
    print('*'*60)
    print('Fetching reviews {}-{}'.format(offset+1,offset+10))
    api='https://amp-api.apps.apple.com/v1/catalog/cn/apps/{}/reviews?l=zh-Hans-CN&offset={}&platform=web&additionalPlatforms=appletv%2Cipad%2Ciphone%2Cmac'.format(app_id,offset)
    headers={
        'user-agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/86.0.4240.183 Safari/537.36',
        'authorization': 'Bearer eyJhbGciOiJFUzI1NiIsInR5cCI6IkpXVCIsImtpZCI6IlU4UlRZVjVaRFMifQ.eyJpc3MiOiI3TktaMlZQNDhaIiwiaWF0IjoxNjEzMTY5Mzg0LCJleHAiOjE2MTYxOTMzODR9.AyRdwIabKUIAOzUjwhw4tdWdfj4BSkUxpctFNg19A87VmOe1iplzpvcKlIJhb5mfOwi27UdVfDCRzSn6iT8PoA'
        # The token above is a JWT whose exp claim is 1616193384 (March 2021), so it has expired;
        # replace it with a current token, e.g. one copied from the authorization header of a
        # request that the App Store web page sends to this API
    }
    
    # Retry until the response contains a 'data' list
    while True:
        try:
            # A timeout is set so that a stalled connection raises instead of hanging
            response=requests.get(url=api,headers=headers,timeout=10)
            info_list=response.json()['data']
            break
        except (requests.exceptions.Timeout,
                requests.exceptions.ConnectionError,
                requests.exceptions.ChunkedEncodingError):
            # Network error: wait 1-3 seconds, then retry
            time.sleep(randint(1,3))
        except KeyError:
            # No 'data' key in the response (for example an error payload): wait, then retry
            time.sleep(2)

    for info in info_list:
        attrs=info['attributes']
        # One row per review: rating, title, date, review text
        all_list.append([attrs['rating'],attrs['title'],attrs['date'],attrs['review']])
    time.sleep(1)  # pause between requests

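### Saving
columns=['Rating','Title','Date','Review']
df=pd.DataFrame(data=all_list,columns=columns)
# utf-8-sig writes a BOM so that Excel detects the encoding of non-ASCII review text
df.to_csv(r'C:\Users\kww\Desktop\移动端爬取\{}.csv'.format(app_id),index=False,encoding='utf-8-sig')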
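
The `authorization` header in the script carries a JWT, and a token of this kind stops working once its `exp` claim has passed. The following is a minimal sketch, assuming the token keeps the standard three-part JWT layout, of how one might read the expiry before starting a long crawl. The helper names `jwt_expiry` and `is_expired` are hypothetical, the signature is not verified, and the demo token is built locally only to exercise the helpers.

```python
import base64
import json
import time


def jwt_expiry(token):
    """Return the `exp` claim (Unix seconds) of a JWT without verifying its signature."""
    if token.startswith('Bearer '):
        token = token[len('Bearer '):]
    payload_b64 = token.split('.')[1]  # header.payload.signature
    payload_b64 += '=' * (-len(payload_b64) % 4)  # restore the stripped base64 padding
    payload = json.loads(base64.urlsafe_b64decode(payload_b64))
    return payload['exp']


def is_expired(token, now=None):
    """Return True if the token's `exp` claim is not later than `now` (default: current time)."""
    now = time.time() if now is None else now
    return jwt_expiry(token) <= now


# Hypothetical unsigned demo token, built only to exercise the helpers
demo_payload = base64.urlsafe_b64encode(
    json.dumps({'exp': 1616193384}).encode()
).decode().rstrip('=')
demo_token = 'Bearer header.{}.signature'.format(demo_payload)

print(jwt_expiry(demo_token))
print(is_expired(demo_token))
```

Calling `is_expired(headers['authorization'])` before the scraping loop would make it possible to stop early rather than retry indefinitely against a token the server no longer accepts.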

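Result: the script writes a CSV file named after the app ID, with one row per review (rating, title, date, and review text).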
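
To get a quick look at the collected data, one option is to count the reviews per star rating. The sketch below uses only the standard library and assumes that the first column of the CSV holds an integer rating, as in the file the script writes. The function name `rating_summary` is hypothetical, and the sample rows are placeholders for illustration, not real reviews.

```python
import csv
import io
from collections import Counter


def rating_summary(csv_file):
    """Return ({rating: count}, mean rating) for a CSV whose first column is the rating."""
    reader = csv.reader(csv_file)
    next(reader)  # skip the header row
    ratings = [int(row[0]) for row in reader if row]
    counts = dict(Counter(ratings))
    mean = sum(ratings) / len(ratings) if ratings else 0.0
    return counts, mean


# Placeholder rows for illustration only
sample_csv = io.StringIO(
    "rating,title,date,review\n"
    "5,t1,2021-02-01T00:00:00Z,r1\n"
    "1,t2,2021-02-02T00:00:00Z,r2\n"
    "5,t3,2021-02-03T00:00:00Z,r3\n"
)
counts, mean = rating_summary(sample_csv)
print(counts, round(mean, 2))
```

For the real output file, pass a handle opened with `open(path, newline='', encoding='utf-8-sig')`, so that reviews containing line breaks are parsed as single fields and a leading byte order mark, if present, is ignored.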

Published: 2023-06-14 06:21:00
Article link: https://www.elefans.com/category/jswz/34/1446005.html