renyijiu / douyin_downloader Goto Github PK

View Code? Open in Web Editor NEW

469.0 16.0 104.0 26.88 MB

👏Download all douyin videos of user(including favorites) , 下载指定用户的所有抖音视频以及收藏的视频(无水印)

License: MIT License

Python 81.64% HTML 18.36%

douyin downloader download-videos

douyin_downloader's Introduction

⚠️️务必阅读

最近查看网页版本发现项目所使用的部分接口参数有所修改，因此项目可能已经无法使用，保留仅供参考学习使用。目前本人暂时没有时间更新，欢迎有兴趣且有能力的人修改提交你的方案，谢谢！

抖音寻找漂亮小姐姐or帅气小哥哥，并下载她（他）们的所有作品

⚠️ 因为依赖关系，只支持Python3.6且仅在此版本上进行过测试，其他版本无法保证相同效果，欢迎提交PR支持。

原本只是写了一个下载抖音无水印视频的小脚本，后面突然想到了Douyin-Bot这个项目，觉得是可以结合操作的，达到完全自动化，所以就引入了相关的代码，并进行了一定的逻辑修改，实现了目前的流程。

Python + ADB实现抖音的控制浏览 -> 复制视频链接 -> 提取用户信息 -> 程序下载用户所有视频

⏰ 如果你只需要下载功能，可以直接查看 DOWNLOAD.md，无需查看后续内容

环境安装

请在使用项目之前确保你的手机可以正常使用adb控制，相关信息可以网上搜索。另外复制内容需要使用clipper.apk，在apks中有提供，项目地址，可自行查看，另外 请允许此app后台运行，自测发现未后台运行会导致获取不到剪贴板内容

$ git clone [email protected]:renyijiu/douyin_downloader.git
$ cd douyin_downloader
$ virtualenv -p python3.6 douyin
$ source douyin/bin/activate
$ pip install -r requirements.txt

使用

打开抖音app
执行 python douyin-bot.py

⚠注意️

具体Python + ADB实现抖音的控制浏览，可以查看Douyin-Bot去了解，这里不做介绍了
目前ADB获取剪贴板操作，通过 clipper.apk实现，如果你有更好的方案，欢迎提出更改，感谢🙏！
目前提供的配置是基于自己的 魅族pro5 测试机，不同机型请自行修改（欢迎提供你的配置）

config.json配置文件参考：
- center_point: 屏幕中心点(x, y)，区域范围(rx, ry)，主要翻页使用
- left_swipe_point: 起始点坐标(x, y)，区域范围(rx, ry)，分享按钮时活动获取复制链接使用
- follow_bottom: 关注按钮坐标(x, y), 区域范围(rx, ry)
- star_bottom: 点赞按钮坐标(x, y)，区域范围(rx, ry)
- share_bootom: 分享按钮坐标(x, y)，区域范围(rx, ry)
- copy_link_bottom: 复制链接按钮（分享按钮点击后弹出）(x, y)，区域范围(rx, ry)
- crop_img: 截图范围起始点坐标(x, y)，区域范围(width, height), 从页面截图裁剪部分（为了去除头像之类的干扰信息），另外范围过大可能导致图像过大使接口报错，请自行增加压缩操作

感谢

站在巨人的肩膀

建议反馈

请直接在Github上开新的issue，描述清楚你的问题需求即可。

CHANGELOG

changelog

赞赏

douyin_downloader's People

Contributors

Stargazers

Watchers

Forkers

sublime-cn jiaosl impteam foxgeek36 dbdoer awesome-archive loongws lesbian406 arnoldzhou zhaoxianjin blue-skycat zszen snamper vonboe gregoriusxu kingking888 nyon-one 547555909 icehell leaf918 quincyc379 generalbao yang8807 yang123vc beifenku expressgit visionandy chen19921212 ascat xuliang2018 gongxiaoze wind959 littlesogo floridexkj null-bot9875 russ168 weihli mockerpeking kyxkbbs wherego wavetry jamessunxx 0xczer0 binnarylee davidftv mrhaozi mr1128 susithrupasinghe wanan-s ephao weiling103 viponedream jimmy2012 konessyu hao0oah 1002753959 cnbillow tianqiyuan crackercat orjuly taowin 9allenzhao xuezh01 chuanyu1 hdzhangjl hwanghakbeom jkh5 tomgou xdyb flyfire chinanala lcklozz13 gloomymay tcjj3 shijie32177 whoerau 552301 jsevenk yeyeyeid boiio lightflyer pisethmk zgpxgame direct1986 zhuyoucai168 thewheatbran github9110 oldcai youxianbo fenkyoo git-hash sewiahho tiepbm breezevn phearun008 wei168hua chadwick-hu jiapengwei assassindesign

douyin_downloader's Issues

下载出错了

Traceback (most recent call last):
File "douyin.py", line 539, in
CrawlerScheduler(content, favorite)
File "douyin.py", line 341, in init
self.scheduling(favorite)
File "douyin.py", line 350, in scheduling
self.download_user_videos(user_id, favorite)
File "douyin.py", line 368, in download_user_videos
self.push_download_job(uid, dytk, 0, favorite)
File "douyin.py", line 379, in push_download_job
list_json = get_list_by_uid(user_id, dytk, cursor, favorite)
File "douyin.py", line 95, in get_list_by_uid
signature = FREEZE_SIGNATURE if FREEZE_SIGNATURE else get_signature(user_id)
File "douyin.py", line 70, in get_signature
r.html.render()
File "D:\Python\Python36\lib\site-packages\requests_html.py", line 654, in html
self._html = HTML(session=self.session, url=self.url, html=self.content, default_encoding=self.encoding)
File "D:\Python\Python36\lib\site-packages\requests_html.py", line 421, in init
element=PyQuery(html)('html') or PyQuery(f'{html}')('html'),
File "D:\Python\Python36\lib\site-packages\pyquery\pyquery.py", line 266, in init
raise TypeError(context)
TypeError: None
他提到的requests_html和pyquery我重新卸载安装过，问题依旧，是哪出问题了？

TypeError: 'NoneType' object is not subscriptable

Traceback (most recent call last):
File "douyin.py", line 574, in
CrawlerScheduler(content, favorite)
File "douyin.py", line 370, in init
self.scheduling(favorite)
File "douyin.py", line 379, in scheduling
self.download_user_videos(user_id, favorite)
File "douyin.py", line 396, in download_user_videos
uid, dytk = get_user_info(user_id)
File "douyin.py", line 78, in get_user_info
uid = r.html.search('uid: "{uid}"')['uid']
TypeError: 'NoneType' object is not subscriptable

ssl problem

When I use the scripts python douyin.py --url=https://v.douyin.com/JRoydED/, I jsut encountered the problem following
urllib3.exceptions.MaxRetryError: HTTPSConnectionPool(host='storage.googleapis.com', port=443): Max retries exceeded with url: /chromium-browser-snapshots/Mac/575458/chrome-mac.zip (Caused by SSLError(SSLError("bad handshake: Error([('SSL routines', 'tls_process_server_certificate', 'certificate verify failed')])"))). I don't know how to solve it.

取list不行

去水印怎么做的

通过分享链接，如何能做到去水印呢？我看渲染出来的video标签内的链接并没有水印的参数，是在那步修改的url呢？

添加了链接无法下载东西

我在里面添加了链接，但是下载的时候什么都没有

可以下载用户收藏视频，但是会一直retry

get empty list, {'has_more': 1, 'status_code': 0, 'min_cursor': 0, 'aweme_list': []}
retry...

不错，可以下载高清长视频

当一个视频有点击查看完整版的时候无法下载完整版视频

这个人的主页第一条视频就是这样
http://v.douyin.com/Pqgn6Q/
视频是这个
http://v.douyin.com/Pqbf7r/

原视频有几分钟，下载的只有18秒

无法 git clone

我得到这个错误。请帮帮我

Traceback (most recent call last):
File "douyin.py", line 11, in
from requests_html import HTMLSession
File "C:\Python\lib\site-packages\requests_html.py", line 9, in
import pyppeteer
File "C:\Python\lib\site-packages\pyppeteer_init_.py", line 30, in
from pyppeteer.launcher import connect, launch, executablePath # noqa: E402
File "C:\Python\lib\site-packages\pyppeteer\launcher.py", line 24, in
from pyppeteer.browser import Browser
File "C:\Python\lib\site-packages\pyppeteer\browser.py", line 13, in
from pyppeteer.connection import Connection
File "C:\Python\lib\site-packages\pyppeteer\connection.py", line 12, in
import websockets
File "C:\Python\lib\site-packages\websockets_init_.py", line 3, in
from .auth import *
File "C:\Python\lib\site-packages\websockets\auth.py", line 15, in
from .server import HTTPResponse, WebSocketServerProtocol
File "C:\Python\lib\site-packages\websockets\server.py", line 49, in
from .protocol import WebSocketCommonProtocol
File "C:\Python\lib\site-packages\websockets\protocol.py", line 18, in
from typing import (
ImportError: cannot import name 'Deque'

下载单个分享视频的时候，TypeError: 'NoneType' object is not subscriptable

无法一次下载一个账号的所有视频

python douyin.py --urls="http://v.douyin.com/moUEXM/"
terminal中打印出

(douyin_env) F:\工作 2\一>python douyin.py --urls="http://v.douyin.com/moUEXM/"
Traceback (most recent call last):
File "douyin.py", line 539, in
CrawlerScheduler(content, favorite)
File "douyin.py", line 341, in init
self.scheduling(favorite)
File "douyin.py", line 350, in scheduling
self.download_user_videos(user_id, favorite)
File "douyin.py", line 368, in download_user_videos
self.push_download_job(uid, dytk, 0, favorite)
File "douyin.py", line 379, in push_download_job
list_json = get_list_by_uid(user_id, dytk, cursor, favorite)
File "douyin.py", line 95, in get_list_by_uid
signature = FREEZE_SIGNATURE if FREEZE_SIGNATURE else get_signature(user_id)
File "douyin.py", line 70, in get_signature
r.html.render()
File "F:\工作 2\一\douyin_env\lib\site-packages\requests_html.py", line 654, in html
self._html = HTML(session=self.session, url=self.url, html=self.content, default_encoding=self.encoding)
File "F:\工作 2\一\douyin_env\lib\site-packages\requests_html.py", line 421, in init
element=PyQuery(html)('html') or PyQuery(f'{html}')('html'),
File "F:\工作 2一\douyin_env\lib\site-packages\pyquery\pyquery.py", line 266, in init
raise TypeError(context)
TypeError: None

python douyin.py -s -u 分享短链接/长链接可以下载单个视频的

Cannot get list video

I only got response: get empty list, {'has_more': 0, 'status_code': 0, 'aweme_list': []}

使用方法写的太随意了

不知道写的啥，不认真。看不懂是我水平低了

ItemId: 0, uid: 0

why i got itemId = 0 and, uid =0, i get douyin link in vietnam

here the link:

https://v.douyin.com/cTgexD/

请问无水印视频是修改视频URL的哪个参数呢？

目前从HTML解析出来类似这样：https://aweme.snssdk.com/aweme/v1/playwm/?s_vid=93f1b41336a8b7a442dbf1c29c6bbc563cee1a2ae9eee323b4f0155dba57fdd61ae2c983b1893ec7074426e7c178662f668166c7cfb860c615ca576132238e0b&line=0

这个链接的怎么替换才是无水印的视频链接

无法获取视频，报错如下

get empty list, {'has_more': 0, 'status_code': 0, 'aweme_list': []}
retry...
get empty list, {'has_more': 0, 'status_code': 0, 'aweme_list': []}
retry...
get empty list, {'has_more': 0, 'status_code': 0, 'aweme_list': []}
retry...
get empty list, {'has_more': 0, 'status_code': 0, 'aweme_list': []}
retry...
get empty list, {'has_more': 0, 'status_code': 0, 'aweme_list': []}
retry...
get empty list, {'has_more': 0, 'status_code': 0, 'aweme_list': []}
retry...
get empty list, {'has_more': 0, 'status_code': 0, 'aweme_list': []}
retry...
get empty list, {'has_more': 0, 'status_code': 0, 'aweme_list': []}
retry...
get empty list, {'has_more': 0, 'status_code': 0, 'aweme_list': []}
retry...
get empty list, {'has_more': 0, 'status_code': 0, 'aweme_list': []}
retry...
Traceback (most recent call last):
File "douyin.py", line 539, in
CrawlerScheduler(content, favorite)
File "douyin.py", line 341, in init
self.scheduling(favorite)
File "douyin.py", line 350, in scheduling
self.download_user_videos(user_id, favorite)
File "douyin.py", line 368, in download_user_videos
self.push_download_job(uid, dytk, 0, favorite)
File "douyin.py", line 379, in push_download_job
list_json = get_list_by_uid(user_id, dytk, cursor, favorite)
File "douyin.py", line 116, in get_list_by_uid
res_json = json.loads(r.html.text)
File "/usr/lib64/python3.6/json/init.py", line 354, in loads
return _default_decoder.decode(s)
File "/usr/lib64/python3.6/json/decoder.py", line 339, in decode
obj, end = self.raw_decode(s, idx=_w(s, 0).end())
File "/usr/lib64/python3.6/json/decoder.py", line 357, in raw_decode
raise JSONDecodeError("Expecting value", s, err.value) from None
json.decoder.JSONDecodeError: Expecting value: line 1 column 1 (char 0)