Scraping Douyin Profile Videos with Python

枫
2023-02-28

Introduction

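  My skills are limited, so this uses only the requests module. The scraping process is a bit involved, and each run fetches about 20 videos. This post is only meant to share what I learned.
  Update: a batch-download version is now available; see the post 批量爬取抖音 (batch scraping Douyin).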

Steps

Code

import requests

# The response to this URL must be JSON that contains an 'aweme_list' field.
url = input('Enter the Douyin profile URL: ').strip()


headers ={
    # Cookie copied from a logged-in browser session. It expires, so replace it with your own.
    'Cookie': 'ttwid=1%7CNYBsOJjLZ2AsV5W4Iz4ZzspVZTkn2KyMX7W0XiixtQQ%7C1667296084%7Cd2307abe2d42dca1611e1830098224cad0edc815aeb9b025161e95d3bc2302cc; passport_csrf_token=925a434d5c62ddccb4b938b7fda2d766; passport_csrf_token_default=925a434d5c62ddccb4b938b7fda2d766; s_v_web_id=verify_le3vazcc_eKdzRY18_VbQm_4b0c_Bqjq_OMI7a8xreQcv; tt_scid=BnKWC8SzLa7R2CY4QfSzwRW1JqcNfsG3h1FzEnkhBsb7s773aVwkWE3VnK6xT2Fjfc8b; SEARCH_RESULT_LIST_TYPE=%22single%22; download_guide=%223%2F20230224%22; strategyABtestKey=%221677550434.211%22; n_mh=FskDsKYpXNjKBP0t6Gko5ZTJydFA7S4umyMW4C5obHE; passport_auth_status=2bb728821b62c7a71d1073a297196011%2C; passport_auth_status_ss=2bb728821b62c7a71d1073a297196011%2C; FOLLOW_LIVE_POINT_INFO=%22MS4wLjABAAAAsPPiLKnMyCVF_v9F8i8KJQ7ftqflDrcZRkYRM_MqTRk%2F1677600000000%2F0%2F1677550508469%2F0%22; store-region=cn-hn; store-region-src=uid; LOGIN_STATUS=0; sid_guard=cf9d3b21a4a97290783acfc26a21bb7d%7C1677556074%7C21600%7CTue%2C+28-Feb-2023+09%3A47%3A54+GMT; uid_tt=1b092a049e575cc02094b4ebcd067e49; uid_tt_ss=1b092a049e575cc02094b4ebcd067e49; sid_tt=cf9d3b21a4a97290783acfc26a21bb7d; sessionid=cf9d3b21a4a97290783acfc26a21bb7d; sessionid_ss=cf9d3b21a4a97290783acfc26a21bb7d; sid_ucp_v1=1.0.0-KDNmZjg5MmIxODUxNjVjNWQzNDRiMWFiMzA4ZDAwNzhjOTgwYTUwNTIKCBDq8vWfBhgNGgJobCIgY2Y5ZDNiMjFhNGE5NzI5MDc4M2FjZmMyNmEyMWJiN2Q; ssid_ucp_v1=1.0.0-KDNmZjg5MmIxODUxNjVjNWQzNDRiMWFiMzA4ZDAwNzhjOTgwYTUwNTIKCBDq8vWfBhgNGgJobCIgY2Y5ZDNiMjFhNGE5NzI5MDc4M2FjZmMyNmEyMWJiN2Q; VIDEO_FILTER_MEMO_SELECT=%7B%22expireTime%22%3A1678172191557%2C%22type%22%3A1%7D; home_can_add_dy_2_desktop=%221%22; __ac_nonce=063fdb7ac006195a4e835; __ac_signature=_02B4Z6wo00f010S0ygAAAIDDyDaEjsgEw3dEhM6AALLbRvWRtTjaphY65KhY1LF3sMoI.W00lFeOONJBRI3kfNWjEVqP2Ile4WG9a30aSj2F4ieYktiLBKGOZxABE0FoWqP-7auPzTk42JO.15; msToken=GF3L_CnlFMvFZqTifNPlPYaldF-_wjhYnKOkEy7X2GLfhpa0LP3UbvzERsgfyq0XzOWyedW9IsRBoLOu0RwSedpls81d7uuBviHSIDOcgQWHQ3RAxsB5GixrpEU7UCM=; msToken=7PJMuDzhOpSyMzTM12rynp74R9GHeaZSF_GxEDzr-F3IajHONmF4YV_Al31OyhInsseEIVudv7lTeGWO2SDyNiWeP4QyVc_eLcEuUE8Uao93mce3I8EpB39le8dyvBU=',
    'User-Agent': 'Mozilla/5.0 (Linux; Android 6.0; Nexus 5 Build/MRA58N) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/109.0.0.0 Mobile Safari/537.36'
}

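# Request the video list; the JSON response holds the videos under 'aweme_list'.
rsq = requests.get(url=url, headers=headers).json()
data = rsq['aweme_list']
len_num = len(data)
for i, item in enumerate(data):
    title = item['desc']
    # Take the first address in the play_addr_lowbr URL list.
    video_url = item['video']['play_addr_lowbr']['url_list'][0]

    print(title, video_url)
    video = requests.get(url=video_url, headers=headers).content
    # Drop characters that Windows forbids in file names; fall back to the index if nothing is left.
    safe_title = ''.join(c for c in title if c not in '\\/:*?"<>|\r\n').strip() or f'video_{i}'
    with open(f'C:/Users/枫/Desktop/img/imgs/{safe_title}.mp4', mode='wb') as f:
        f.write(video)
print('Downloaded:', len_num)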
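
The script assumes every entry in aweme_list has a playable address and that no two videos share a description; otherwise it raises a KeyError or overwrites a file. Below is a minimal sketch of one way to prepare the downloads more defensively. The field names (aweme_list, desc, video, play_addr_lowbr, url_list) come from the script; the helpers safe_filename and plan_downloads, the ILLEGAL_CHARS pattern, and the 80-character limit are my own illustrative choices, not part of any Douyin or requests API.

```python
import re
from pathlib import Path

# Characters that Windows rejects in file names, plus control characters.
ILLEGAL_CHARS = re.compile(r'[\\/:*?"<>|\x00-\x1f]')


def safe_filename(title, fallback, max_len=80):
    """Turn a video description into a file name stem; use fallback if nothing is left."""
    name = ILLEGAL_CHARS.sub('', title or '')[:max_len].strip(' .')
    return name or fallback


def plan_downloads(payload, out_dir):
    """Return (video_url, destination_path) pairs for a response shaped like the script's."""
    out = Path(out_dir)
    used = set()
    plan = []
    for i, item in enumerate(payload.get('aweme_list') or []):
        video = item.get('video') or {}
        urls = (video.get('play_addr_lowbr') or {}).get('url_list') or []
        if not urls:
            continue  # no playable address in this entry, so skip it
        stem = safe_filename(item.get('desc'), f'video_{i}')
        candidate, n = stem, 1
        while candidate in used:  # avoid overwriting a video with the same description
            candidate = f'{stem}_{n}'
            n += 1
        used.add(candidate)
        plan.append((urls[0], out / f'{candidate}.mp4'))
    return plan
```

Each (video_url, destination_path) pair can then go through the same requests.get(...).content and f.write(...) steps as in the script, with rsq passed in as payload.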