[NAS movie organization] Websites, tools, and APIs suitable for scraping

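I plan to set up a NAS later, so first I am collecting the websites and APIs where you can look up movie posters, lead actors, subtitles, synopses, and other metadata. The next step is to write a scraper plugin on top of them.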

IMDb, offline datasets, officially refreshed daily:
https://www.imdb.com/interfaces/

name.basics.tsv.gz
https://datasets.imdbws.com/name.basics.tsv.gz

title.akas.tsv.gz
https://datasets.imdbws.com/title.akas.tsv.gz

title.basics.tsv.gz
https://datasets.imdbws.com/title.basics.tsv.gz

title.crew.tsv.gz
https://datasets.imdbws.com/title.crew.tsv.gz

title.episode.tsv.gz
https://datasets.imdbws.com/title.episode.tsv.gz

title.principals.tsv.gz
https://datasets.imdbws.com/title.principals.tsv.gz

title.ratings.tsv.gz
https://datasets.imdbws.com/title.ratings.tsv.gz

title.akas.tsv.gz - Contains the following information for titles:
titleId (string) - a tconst, an alphanumeric unique identifier of the title
ordering (integer) – a number to uniquely identify rows for a given titleId
title (string) – the localized title
region (string) - the region for this version of the title
language (string) - the language of the title
types (array) - Enumerated set of attributes for this alternative title. One or more of the following: "alternative", "dvd", "festival", "tv", "video", "working", "original", "imdbDisplay". New values may be added in the future without warning
attributes (array) - Additional terms to describe this alternative title, not enumerated
isOriginalTitle (boolean) – 0: not original title; 1: original title
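
For a NAS library, the title.akas file is mostly useful for finding a localized title. Below is a minimal sketch using only the field names listed above. The helper name `localized_titles` is hypothetical, and the region code you pass in (for example a two-letter country code) is an assumption to check against the actual data.

```python
import csv
import gzip


def localized_titles(akas_lines, title_id, region):
    """Return every title listed for `title_id` in the given `region`.

    `akas_lines` is any iterable of text lines from title.akas.tsv
    (an open file works), with the header row included.
    """
    # QUOTE_NONE: the files are plain tab-separated text, so quote
    # characters inside titles must not be interpreted by the csv module.
    reader = csv.DictReader(akas_lines, delimiter="\t", quoting=csv.QUOTE_NONE)
    return [
        row["title"]
        for row in reader
        if row["titleId"] == title_id and row["region"] == region
    ]


def open_dataset(path):
    """Open a gzip-compressed dataset file as UTF-8 text."""
    return gzip.open(path, "rt", encoding="utf-8", newline="")
```

A call would look like `with open_dataset("title.akas.tsv.gz") as f: localized_titles(f, some_tconst, some_region)`. This scans the whole file on every call, so for repeated lookups it is probably better to load the rows into a database first.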

title.basics.tsv.gz - Contains the following information for titles:
tconst (string) - alphanumeric unique identifier of the title
titleType (string) – the type/format of the title (e.g. movie, short, tvSeries, tvEpisode, video, etc)
primaryTitle (string) – the more popular title / the title used by the filmmakers on promotional materials at the point of release
originalTitle (string) - original title, in the original language
isAdult (boolean) - 0: non-adult title; 1: adult title
startYear (YYYY) – represents the release year of a title. In the case of TV Series, it is the series start year
endYear (YYYY) – TV Series end year. '\N' for all other title types
runtimeMinutes – primary runtime of the title, in minutes
genres (string array) – includes up to three genres associated with the title

title.crew.tsv.gz – Contains the director and writer information for all the titles in IMDb. Fields include:
tconst (string) - alphanumeric unique identifier of the title
directors (array of nconsts) - director(s) of the given title
writers (array of nconsts) – writer(s) of the given title

title.episode.tsv.gz – Contains the tv episode information. Fields include:
tconst (string) - alphanumeric identifier of episode
parentTconst (string) - alphanumeric identifier of the parent TV Series
seasonNumber (integer) – season number the episode belongs to
episodeNumber (integer) – episode number of the tconst in the TV series

title.principals.tsv.gz – Contains the principal cast/crew for titles
tconst (string) - alphanumeric unique identifier of the title
ordering (integer) – a number to uniquely identify rows for a given titleId
nconst (string) - alphanumeric unique identifier of the name/person
category (string) - the category of job that person was in
job (string) - the specific job title if applicable, else '\N'
characters (string) - the name of the character played if applicable, else '\N'

title.ratings.tsv.gz – Contains the IMDb rating and votes information for titles
tconst (string) - alphanumeric unique identifier of the title
averageRating – weighted average of all the individual user ratings
numVotes - number of votes the title has received

name.basics.tsv.gz – Contains the following information for names:
nconst (string) - alphanumeric unique identifier of the name/person
primaryName (string) – name by which the person is most often credited
birthYear – in YYYY format
deathYear – in YYYY format if applicable, else '\N'
primaryProfession (array of strings) – the top-3 professions of the person
knownForTitles (array of tconsts) – titles the person is known for
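
The schemas above can be combined by joining on `tconst`. The following sketch reads the gzip-compressed TSV files with the standard library and attaches ratings to movies. It assumes that missing values appear as the literal text `\N` and that array fields such as `genres` are comma-separated. The function names and the `min_votes` threshold are illustrative, not part of any official tooling.

```python
import csv
import gzip

MISSING = r"\N"  # assumed marker for missing values in the dataset files


def read_tsv(path):
    """Yield one dict per row of a gzip-compressed TSV dataset file."""
    with gzip.open(path, "rt", encoding="utf-8", newline="") as handle:
        yield from csv.DictReader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)


def clean(value):
    """Map the missing-value marker to None, pass other values through."""
    return None if value == MISSING else value


def rated_movies(basics_rows, ratings_rows, min_votes=1000):
    """Join title.basics rows with title.ratings rows, best rated first."""
    ratings = {
        row["tconst"]: (float(row["averageRating"]), int(row["numVotes"]))
        for row in ratings_rows
    }
    movies = []
    for row in basics_rows:
        rating = ratings.get(row["tconst"])
        if row["titleType"] != "movie" or rating is None or rating[1] < min_votes:
            continue
        genres = clean(row["genres"])
        movies.append({
            "tconst": row["tconst"],
            "title": row["primaryTitle"],
            "year": clean(row["startYear"]),
            "genres": genres.split(",") if genres else [],
            "rating": rating[0],
            "votes": rating[1],
        })
    return sorted(movies, key=lambda movie: movie["rating"], reverse=True)
```

Usage would be along the lines of `rated_movies(read_tsv("title.basics.tsv.gz"), read_tsv("title.ratings.tsv.gz"))`. The ratings file is held in memory as a dict while the much larger basics file is streamed row by row.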

IMDb search (suggestion) API, returns JSON directly:
https://sg.media-imdb.com/suggests/a/<keyword>.json
https://v2.sg.media-imdb.com/suggests/h/<keyword>.json (fallback)
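
A small sketch of how a scraper might build the suggestion URL and decode the response. The URL pattern is copied from the lines above. The JSONP handling is a defensive assumption: if the endpoint wraps its JSON in a callback such as `name(...)`, the wrapper is stripped before decoding. The payload is returned as-is because its field names are not documented here.

```python
import json
import re
from urllib.parse import quote

# Matches an optional JSONP wrapper: identifier, "(", payload, ")".
_JSONP = re.compile(r"^\s*[\w$]+\((.*)\)\s*;?\s*$", re.DOTALL)


def suggest_url(keyword, base="https://sg.media-imdb.com/suggests/a/"):
    """Build the suggestion URL for a keyword (percent-encoded)."""
    return f"{base}{quote(keyword)}.json"


def parse_suggestions(body):
    """Decode a suggestion response given as text, plain JSON or JSONP."""
    match = _JSONP.match(body)
    return json.loads(match.group(1) if match else body)
```

Fetch the URL with any HTTP client, pass the response text to `parse_suggestions`, and inspect the resulting dict to see which keys the service actually returns.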

IMDb search page, returns HTML:
https://www.imdb.com/search/title/?title=Thor:%20Love%20and%20Thunder&title_type=feature,tv_movie,tv_series,tv_miniseries,short,video
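
The search URL above can be generated from a title instead of being assembled by hand. This sketch only reproduces the two query parameters visible in that URL (`title` and `title_type`); the function name is hypothetical, and parsing the returned HTML is left out because the page layout is not described here.

```python
from urllib.parse import quote, urlencode

# Title types taken from the example URL above.
TITLE_TYPES = ["feature", "tv_movie", "tv_series", "tv_miniseries", "short", "video"]


def imdb_search_url(title, title_types=TITLE_TYPES):
    """Build an IMDb title-search URL for the given title."""
    params = {"title": title, "title_type": ",".join(title_types)}
    # quote (not quote_plus) encodes spaces as %20; ":" and "," stay
    # literal so the result matches the hand-written URL.
    query = urlencode(params, quote_via=quote, safe=":,")
    return "https://www.imdb.com/search/title/?" + query
```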

TMDB, search API (the API key is issued on the settings page):
https://www.themoviedb.org/settings/api

API key (v3 auth)
b0ba92e82215d7dcfae50eefb217ddba
Example API request
https://api.themoviedb.org/3/movie/550?api_key=b0ba92e82215d7dcfae50eefb217ddba
API read access token (v4 auth)
eyJhbGciOiJIUzI1NiJ9.eyJhdWQiOiJiMGJhOTJlODIyMTVkN2RjZmFlNTBlZWZiMjE3ZGRiYSIsInN1YiI6IjYzNDhjMzUyMzk2ZTk3MDA3ZTBjZmYwMSIsInNjb3BlcyI6WyJhcGlfcmVhZCJdLCJ2ZXJzaW9uIjoxfQ.IF8DBvJ5ZuBjB4DRYNOjexQ4nWCoJzcNPCVz6esIXAY
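
The two credentials are used differently. A rough sketch, assuming the v3 key goes in the `api_key` query parameter (as in the example request above) and the v4 read access token is sent as a Bearer token in the `Authorization` header. Function names are illustrative, and the credential strings are placeholders for your own.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request, urlopen

API_BASE = "https://api.themoviedb.org/3"


def movie_url(movie_id, api_key):
    """URL for a movie lookup authenticated with the v3 API key."""
    return f"{API_BASE}/movie/{movie_id}?{urlencode({'api_key': api_key})}"


def bearer_request(path, token):
    """Request object authenticated with the v4 read access token."""
    headers = {"Authorization": f"Bearer {token}", "Accept": "application/json"}
    return Request(f"{API_BASE}{path}", headers=headers)


def fetch_json(url_or_request, timeout=10):
    """Perform the HTTP call and decode the JSON body."""
    with urlopen(url_or_request, timeout=timeout) as response:
        return json.load(response)
```

For example, `fetch_json(movie_url(550, "YOUR_API_KEY"))` or `fetch_json(bearer_request("/movie/550", "YOUR_READ_TOKEN"))`. Loading the credentials from an environment variable or a config file keeps them out of the plugin source.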

Video Station custom scraper plugin, official developer documentation:
https://download.synology.com/download/Document/Software/DeveloperGuide/Package/VideoStation/All/enu/Synology_Video_Station_API_enu.pdf

Looking up TV series information:
Free, fast, and reliable. Returns JSON directly.
https://www.tvmaze.com/api#show-search
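
A minimal sketch of a show search against TVmaze. The endpoint `https://api.tvmaze.com/search/shows?q=...` and the response shape (a list of items, each with a `show` object) are written from memory of the page linked above and should be checked there. The helper names and the choice of fields are illustrative.

```python
from urllib.parse import urlencode

SEARCH_ENDPOINT = "https://api.tvmaze.com/search/shows"  # assumed, see the API page


def tvmaze_search_url(query):
    """Build the show-search URL for a free-text query."""
    return f"{SEARCH_ENDPOINT}?{urlencode({'q': query})}"


def summarize_results(results):
    """Reduce decoded search results to the few fields a scraper needs."""
    summaries = []
    for item in results:
        show = item.get("show", {})
        summaries.append({
            "id": show.get("id"),
            "name": show.get("name"),
            "premiered": show.get("premiered"),
        })
    return summaries
```

The decoded JSON from the endpoint is passed straight to `summarize_results`. Using `.get` keeps the sketch tolerant of missing fields.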

IMDb, open-source Python library that can query all kinds of movie information:
https://github.com/cinemagoer/cinemagoer

You can also publish a plugin you developed yourself to the DSM Package Center. Official plugin development documentation:

https://download.synology.com/download/Document/Software/DeveloperGuide/Package/VideoStation/All/enu/Synology_Video_Station_API_enu.pdf

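Synology media organization tutorials:

Synology DS Video (Video Station): automatically syncing video synopses and posters (latest official solution), 程序员Daddy's blog on CSDN

Synology Video Station (TMDB scraper): applying for an API key, Adair～Z's blog on CSDN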
