后面打算搞nas,先整理出可以查找电影海报、主演、字母、简介等描述的网站和接口,后续做个爬虫插件,nice。

imdb,离线库,官方每天更新:
https://www.imdb.com/interfaces/

name.basics.tsv.gz
https://datasets.imdbws.com/name.basics.tsv.gz

title.akas.tsv.gz
https://datasets.imdbws.com/title.akas.tsv.gz

title.basics.tsv.gz
https://datasets.imdbws.com/title.basics.tsv.gz

title.crew.tsv.gz
https://datasets.imdbws.com/title.crew.tsv.gz

title.episode.tsv.gz
https://datasets.imdbws.com/title.episode.tsv.gz

title.principals.tsv.gz
https://datasets.imdbws.com/title.principals.tsv.gz

title.ratings.tsv.gz
https://datasets.imdbws.com/title.ratings.tsv.gz

title.akas.tsv.gz - Contains the following information for titles:
titleId (string) - a tconst, an alphanumeric unique identifier of the title
ordering (integer) – a number to uniquely identify rows for a given titleId
title (string) – the localized title
region (string) - the region for this version of the title
language (string) - the language of the title
types (array) - Enumerated set of attributes for this alternative title. One or more of the following: "alternative", "dvd", "festival", "tv", "video", "working", "original", "imdbDisplay". New values may be added in the future without warning
attributes (array) - Additional terms to describe this alternative title, not enumerated
isOriginalTitle (boolean) – 0: not original title; 1: original title

title.basics.tsv.gz - Contains the following information for titles:
tconst (string) - alphanumeric unique identifier of the title
titleType (string) – the type/format of the title (e.g. movie, short, tvseries, tvepisode, video, etc)
primaryTitle (string) – the more popular title / the title used by the filmmakers on promotional materials at the point of release
originalTitle (string) - original title, in the original language
isAdult (boolean) - 0: non-adult title; 1: adult title
startYear (YYYY) – represents the release year of a title. In the case of TV Series, it is the series start year
endYear (YYYY) – TV Series end year. ‘\N’ for all other title types
runtimeMinutes – primary runtime of the title, in minutes
genres (string array) – includes up to three genres associated with the title

title.crew.tsv.gz – Contains the director and writer information for all the titles in IMDb. Fields include:
tconst (string) - alphanumeric unique identifier of the title
directors (array of nconsts) - director(s) of the given title
writers (array of nconsts) – writer(s) of the given title

title.episode.tsv.gz – Contains the tv episode information. Fields include:
tconst (string) - alphanumeric identifier of episode
parentTconst (string) - alphanumeric identifier of the parent TV Series
seasonNumber (integer) – season number the episode belongs to
episodeNumber (integer) – episode number of the tconst in the TV series

title.principals.tsv.gz – Contains the principal cast/crew for titles
tconst (string) - alphanumeric unique identifier of the title
ordering (integer) – a number to uniquely identify rows for a given titleId
nconst (string) - alphanumeric unique identifier of the name/person
category (string) - the category of job that person was in
job (string) - the specific job title if applicable, else '\N'
characters (string) - the name of the character played if applicable, else '\N'

title.ratings.tsv.gz – Contains the IMDb rating and votes information for titles
tconst (string) - alphanumeric unique identifier of the title
averageRating – weighted average of all the individual user ratings
numVotes - number of votes the title has received

name.basics.tsv.gz – Contains the following information for names:
nconst (string) - alphanumeric unique identifier of the name/person
primaryName (string)– name by which the person is most often credited
birthYear – in YYYY format
deathYear – in YYYY format if applicable, else '\N'
primaryProfession (array of strings)– the top-3 professions of the person
knownForTitles (array of tconsts) – titles the person is known for

imdb搜索API,直接返回json:
https://sg.media-imdb.com/suggests/a/关键词.json
https://v2.sg.media-imdb.com/suggests/h/关键词.json (备用)

imdb搜索页面,返回html:
https://www.imdb.com/search/title/?title=Thor:%20Love%20and%20Thunder&title_type=feature,tv_movie,tv_series,tv_miniseries,short,video

tmdb,搜索接口:
https://www.themoviedb.org/settings/api

API 密钥 (v3 auth)
b0ba92e82215d7dcfae50eefb217ddba
范例 API 请求
https://api.themoviedb.org/3/movie/550?api_key=b0ba92e82215d7dcfae50eefb217ddba
API 读访问令牌 (v4 auth)
eyJhbGciOiJIUzI1NiJ9.eyJhdWQiOiJiMGJhOTJlODIyMTVkN2RjZmFlNTBlZWZiMjE3ZGRiYSIsInN1YiI6IjYzNDhjMzUyMzk2ZTk3MDA3ZTBjZmYwMSIsInNjb3BlcyI6WyJhcGlfcmVhZCJdLCJ2ZXJzaW9uIjoxfQ.IF8DBvJ5ZuBjB4DRYNOjexQ4nWCoJzcNPCVz6esIXAY

video station 自定义搜刮器插件,官方开发文档:
https://download.synology.com/download/Document/Software/DeveloperGuide/Package/VideoStation/All/enu/Synology_Video_Station_API_enu.pdf

查询电视剧信息:
免费,快速且可靠的。直接返回json格式
https://www.tvmaze.com/api#show-search

imdb,开源python库,可查询电影各种信息:
https://github.com/cinemagoer/cinemagoer

还可以将自己开发的插件放到dsm 套件中心,官方插件开发文档:

https://download.synology.com/download/Document/Software/DeveloperGuide/Package/VideoStation/All/enu/Synology_Video_Station_API_enu.pdf

群辉资源整理教程:

群晖DS Video(Station)自动同步视频简介和海报(最新官方解决方案)_程序员Daddy的博客-CSDN博客_群晖videostation自动海报

群晖-VideoStation-(TMDB刮削器)api申请_Adair~Z的博客-CSDN博客_tmdb的api怎么申请

【nas电影整理】可做爬虫的网站工具和api整理相关推荐

  1. paypal开发整理文档(14)——PayPal Orders API整理

    官方文档地址:https://developer.paypal.com/docs/api/orders/v2/ 目录 Orders 1. Create Order - 创建订单 Header para ...

  2. 最全的Python网站开发库!整理出来了!

    用了 2 周的时间整理了 Python 中所有的网站开发库(下文简称:Web 框架),供大家学习参考. Q:Web 框架到底是什么? A:Web 框架主要用于网站开发.开发者在基于 Web 框架实现自 ...

  3. python 爬虫视频网站

    一 前言 暑假在家,精神状态一直不是很好,闲来无事,便学起了python.借助这一平台,想谈谈自己在学习中的心得与困惑,也算是记录暑期生活的一种方式.从6月底开始自学python爬虫,到今天断断续续学 ...

  4. 做三年地方网站不如别人打一场麻将

    做网站三年了,坚持了三年,也宅了三年. 三年前,看到这个40万人口的县城,没有个像样的门户网站和论坛,真是太不像话.像我这样的技术出身,又能够花极大的精力去运营的本地人,不做本地门户网站,人民都不答应 ...

  5. php和python写爬虫-为什么选择用python做爬虫

    什么是网络爬虫? 网络爬虫是一个自动提取网页的程序,它为搜索引擎从万维网上下载网页,是搜索引擎的重要组成.传统爬虫从一个或若干初始网页的URL开始,获得初始网页上的URL,在抓取网页的过程中,不断从当 ...

  6. python的爬虫库_python做爬虫常用库

    python做爬虫常用库 一.总结 一句话总结: urllib:一系列用于操作URL的功能. requests:基于 urllib 编写的,阻塞式 HTTP 请求库,发出一个请求,一直等待服务器响应后 ...

  7. 爬虫python下载视频_用python做爬虫下载视频

    用python有一段时间了,对python是十二分喜爱,在我看来python是个好工具,可以轻松简洁的帮我完成一些我想要完成的工作.下面和大家分享一下我用python爬取某网站视频的案例.用pytho ...

  8. python做爬虫 怎么样效率最高_公认8个效率最高的爬虫框架

    一些较为高效的Python爬虫框架.分享给大家. 1.Scrapy Scrapy是一个为了爬取网站数据,提取结构性数据而编写的应用框架. 可以应用在包括数据挖掘,信息处理或存储历史数据等一系列的程序中 ...

  9. python爬虫获取网站销售情况(内置源码)

    在现在这个信息爆炸的时代,要想高效的获取数据,爬虫是非常好用的.而用python做爬虫也十分简单方便,下面通过一个简单的小爬虫程序来看一看写爬虫的基本过程: 注:此处猫咪销售网站中的内容本来就可以免费 ...

最新文章

  1. linux之history命令
  2. splay详解(三)
  3. Emacs基本操作说明
  4. 动态生成实物文件思路
  5. AIDE MD主题配置
  6. 双物块弹簧阻尼系统模型,李雅普诺夫稳定性分析,matlab仿真验证
  7. matlab地震动,MATLAB在结构地震动力分析中的应用
  8. pnp型三极管 饱和 截至_截至2013年核心Java帖子
  9. 万豪国际12家餐厅再登米其林指南;五大高端品牌酒店签约港珠澳口岸城 | 中国酒店业周刊...
  10. 网络号、主机号、子网号--例题
  11. zookeeper--模拟买票代码
  12. AndroidStudio导入arr包
  13. WLST 命令和变量
  14. 语音相关的数据集-5个数据集
  15. Day15-servlet
  16. Prometheus 告警收敛
  17. tcpdump抓包神器详细介绍
  18. Bugku CTF web 你必须让他停下来 解题思路
  19. [python] 个人日常python工具代码
  20. 我们为什么不用Kubernetes?

热门文章

  1. 头条搜索Bytespider蜘蛛说明
  2. 华为云人脸识别SDK踩坑日记
  3. 笔记本计算机管理打不开怎么办,NVIDIA控制面板打不开怎么办解决方案
  4. charles2 重写
  5. 自定义menu菜单布局
  6. 2.1 rtthread pin设备详解
  7. 嵌入式学习笔记7 WIFI相关1——WIFI模组选型
  8. 【深度学习】预训练语言模型-BERT
  9. 考完软考中级,还可以向高级进阶
  10. Mac 自动化自动操作工作流程(Workflow)完成日常软件文件网页一键打开