pdf2htmlEX命令行参数大全
Provided by: pdf2htmlex_0.14.6+ds-2build1_amd64
原文链接:Ubuntu Manpage: pdf2htmlEX - converts PDF to HTML without losing text and format. http://manpages.ubuntu.com/manpages/xenial/man1/pdf2htmlEX.1.html
NAME
pdf2htmlEX - converts PDF to HTML without losing text and format.
USAGE
pdf2htmlEX [options] <input-filename> [<output-filename>]
DESCRIPTION
pdf2htmlEX is a utility that converts PDF files to HTML files.pdf2htmlEX tries its best to render the PDF precisely, maintain proper styling, whileretaining text and optimizing for Web.Fonts are extracted form PDF and then embedded into HTML, text in the converted HTML fileis usually selectable and copyable.Other objects are rendered as images and also embedded.
OPTIONS
Pages-f, --first-page <num> (Default: 1)Specify the first page to process-l, --last-page <num> (Default: last page)Specify the last page to processDimensions--zoom <ratio>, --fit-width <width>, --fit-height <height>--zoom specifies the zoom factor directly; --fit-width/height specifies the maximumwidth/height of a page, the values are in pixels.If multiple values are specified, the minimum one will be used.If none is specified, pages will be rendered as 72DPI.--use-cropbox <0|1> (Default: 1)Use CropBox instead of MediaBox for output.--hdpi <dpi>, --vdpi <dpi> (Default: 144)Specify the horizontal and vertical DPI for imagesOutput--embed <string>--embed-css <0|1> (Default: 1)--embed-font <0|1> (Default: 1)--embed-image <0|1> (Default: 1)--embed-javascript <0|1> (Default: 1)--embed-outline <0|1> (Default: 1)Specify which elements should be embedded into the output HTML file.If switched off, separated files will be generated along with the HTML file for thecorresponding elements.--embed accepts a string as argument. Each letter of the string must be one of`cCfFiIjJoO`, which corresponds to one of the --embed-*** switches. Lower caseletters for 0 and upper case letters for 1. For example, `--embed cFIJo` means toembed everything but CSS files and outlines.--split-pages <0|1> (Default: 0)If turned on, the content of each page is stored in a separated file.This switch is useful if you want pages to be loaded separately & dynamically -- asupporting server might be necessary.Also see --page-filename.--dest-dir <dir> (Default: .)Specify destination folder.--css-filename <filename> (Default: <none>)Specify the filename of the generated css file, if not embedded.If it's empty, the file name will be determined automatically.--page-filename <filename> (Default: <none>)Specify the filename template for pages when --split-pages is 1A %d placeholder may be included in `filename` to indicate where the page numbershould be placed. The placeholder supports a limited subset of normal numericalplaceholders, including specified width and zero padding.If `filename` does not contain a placeholder for the page number, the page numberwill be inserted directly before the file extension. If the filename does not havean extension, the page number will be placed at the end of the file name.If --page-filename is not specified, <input-filename> will be used for the outputfilename, replacing the extension with .page and adding the page number directlybefore the extension.Examplespdf2htmlEX --split-pages 1 foo.pdfYields page files foo1.page, foo2.page, etc.pdf2htmlEX --split-pages 1 foo.pdf --page-filename bar.bazYields page files bar1.baz, bar2.baz, etc.pdf2htmlEX --split-pages 1 foo.pdf --page-filename page%dbar.bazYields page files page1bar.baz, page2bar.baz, etc.pdf2htmlEX --split-pages 1 foo.pdf --page-filename bar%03d.bazYields page files bar001.baz, bar002.baz, etc.--outline-filename <filename> (Default: <none>)Specify the filename of the generated outline file, if not embedded.If it's empty, the file name will be determined automatically.--process-nontext <0|1> (Default: 1)Whether to process non-text objects (as images)--process-outline <0|1> (Default: 1)Whether to show outline in the generated HTML--process-annotation <0|1> (Default: 0)Whether to show annotation in the generated HTML--process-form <0|1> (Default: 0)Whether to include text fields and radio buttons in the generated HTML--printing <0|1> (Default: 1)Enable printing support. Disabling this option may reduce the size of CSS.--fallback <0|1> (Default: 0)Output in fallback mode, for better accuracy and browser compatibility, but thesize becomes larger.--tmp-file-size-limit <limit> (Default: -1)This limits the total size (in KB) of the temporary files which will also limit thetotal size of the output file. This is an estimate and it will stop after a page,once the total temporary files size is greater than this number.-1 means no limit and is the default.Fonts--embed-external-font <0|1> (Default: 1)Specify whether the local matched fonts, for fonts not embedded in PDF, should beembedded into HTML.If this switch is off, only font names are exported such that web browsers may tryto find proper fonts themselves, and that might cause issues about incorrect fontmetrics.--font-format <format> (Default: woff)Specify the format of fonts extracted from the PDF file.--decompose-ligature <0|1> (Default: 0)Decompose ligatures. For example 'fi' -> 'f''i'.--auto-hint <0|1> (Default: 0)If set to 1, hints will be generated for the fonts using FontForge.This may be preceded by --external-hint-tool.--external-hint-tool <tool> (Default: <none>)If specified, the tool will be called in order to enhanced hinting for fonts, thiswill precede --auto-hint.The tool will be called as '<tool> <in.suffix> <out.suffix>', where suffix will bethe same as specified for --font-format.--stretch-narrow-glyph <0|1> (Default: 0)If set to 1, glyphs narrower than described in PDF will be stretched; otherwisespace will be padded to the right of the glyphs--squeeze-wide-glyph <0|1> (Default: 1)If set to 1, glyphs wider than described in PDF will be squeezed; otherwise it willbe truncated.--override-fstype <0|1> (Default: 0)Clear the fstype bits in TTF/OTF fonts.Turn this on if Internet Explorer complains about 'Permission must be Installable'AND you have permission to do so.--process-type3 <0|1> (Default: 0)If turned on, pdf2htmlEX will try to convert Type 3 fonts such that text can berendered natively in HTML. Otherwise all text with Type 3 fonts will be renderedas image.This feature is highly experimental.Text--heps <len>, --veps <len> (Default: 1)Specify the maximum tolerable horizontal/vertical offset (in pixels).pdf2htmlEX would try to optimize the generated HTML file moving Text within thisdistance.--space-threshold <ratio> (Default: 0.125)pdf2htmlEX would insert a whitespace character ' ' if the distance between twoconsecutive letters in the same line is wider than ratio * font_size.--font-size-multiplier <ratio> (Default: 4.0)Many web browsers limit the minimum font size, and many would round the given fontsize, which results in incorrect rendering.Specify a ratio greater than 1 would resolve this issue, however it might freezesome browsers.For some versions of Firefox, however, there will be a problem when the font sizeis too large, in which case a smaller value should be specified here.--space-as-offset <0|1> (Default: 0)If set to 1, space characters will be treated as offsets, which allows a betteroptimization.For PDF files with bad encodings, turning on this option may cause losingcharacters.--tounicode <-1|0|1> (Default: 0)A ToUnicode map may be provided for each font in PDF which indicates the 'meaning'of the characters. However often there is better "ToUnicode" info in Type 0/1fonts, and sometimes the ToUnicode map provided is wrong. If this value is set to1, the ToUnicode Map is always applied, if provided in PDF, and characters may notrender correctly in HTML if there are collisions.If set to -1, a customized map is used such that rendering will be correct in HTML(visually the same), but you may not get correct characters by select & copy &paste.If set to 0, pdf2htmlEX would try its best to balance the two methods above.--optimize-text <0|1> (Default: 0)If set to 1, pdf2htmlEX will try to reduce the number of HTML elements used fortext. Turn it off if anything goes wrong.--correct-text-visibility <0|1> (Default: 0)If set to 1, pdf2htmlEX will try to detect texts covered by other graphics andproperly arrange them, i.e. covered texts are made transparent in text layer, andare drawn on background layer.Background Image--bg-format <format> (Default: png)Specify the background image format. Run `pdf2htmlEX -v` to check all supportedformats.--svg-node-count-limit <limit> (Default: -1)If node count in a svg background image exceeds this limit, fall back this page tobitmap background; negative value means no limit. This option is only useful when'--bg-format svg' is specified. Note that node count in svg is just calculatedapproximately.--svg-embed-bitmap <0|1> (Default: 1)Whether embed bitmaps in svg background image. 1: embed bitmaps in svg background;0: dump bitmaps to external files if possible.This option is only useful when '--bg-format svg' is specified and '--embed-image'is off.Currently, RGB or Gray JPEG bitmaps in a PDF can be dumped, while those in otherformats or colorspaces are still embedded. If bitmaps are not dumped as expected,try pre-processing your PDF by ghostscript or acrobat and make sure bitmaps in itare converted to RGB/Gray JPEG format. See the project wiki for more details.PDF Protection-o, --owner-password <password>Specify owner password-u, --user-password <password>Specify user password--no-drm <0|1> (Default: 0)Override document DRM settingsTurn this on only when you have permission.Misc.--clean-tmp <0|1> (Default: 1)If switched off, intermediate files won't be cleaned in the end.--data-dir <dir> (Default: /usr/share/pdf2htmlEX)Specify the folder holding the manifest and other files (see below for the manifestfile)`--tmp-dir <dir> (Default: /tmp or $TMPDIR if set)Specify the temporary folder to use for temporary files--css-draw <0|1> (Default: 0)Experimental and unsupported CSS drawing--debug <0|1> (Default: 0)Print debug information.--proof <0|1|2> (Default: 0)Output a proof version. If a positive value is specified, texts are drawn on bothtext layer and background image for comparison. If 2 is specified, texts onbackground are in different colors. If png/jpg background format is used, a higherhdpi/vdpi (e.g. 288) is recommended for legibility.Meta-v, --versionPrint copyright and version info--help Print usage information
MANIFEST and DATA-DIR
When split-pages is 0, the manifest file describes how the final html page should begenerated.By default, pdf2htmlEX will use the manifest in the default data-dir (run `pdf2htmlEX -v`to check), which gives a simple demo of its syntax.You can modify the default one, or you can create a new one and specify the correctdata-dir in the command line.All files referred by the manifest must be located in the data-dir.
EXAMPLE
pdf2htmlEX /path/to/file.pdfConvert file.pdf into file.htmlpdf2htmlEX --clean-tmp 0 --debug 1 /path/to/file.pdfConvert file.pdf and leave all intermediate files.pdf2htmlEX --dest-dir out --embed fi /path/to/file.pdfConvert file.pdf into out/file.html and leave font/image files separated.
COPYRIGHT
Copyright 2012,2013 Lu Wang <coolwanglu@gmail.com>pdf2htmlEX is licensed under GPLv3 with additional terms, read LICENSE for details.
AUTHOR
pdf2htmlEX is written by Lu Wang <coolwanglu@gmail.com>
pdf2htmlEX命令行参数大全相关推荐
- x265 命令行参数大全
除非一个参数被标记为 CLI ONLY,否则该参数也被x265_param_parse() 支持.CLI使用getopt 函数来解释命令行参数,长短版本的参数都可以使用,长参数在不引起歧义的情况下可以 ...
- 最新VLC命令行参数大全(一)
用法: vlc [选项] [流] ... 您可以在命令行中指定多个流. 它们将被加入播放列表队列. 指定的首个项目将被首先播放. 选项风格: --选项 用于设置程序执行期间的全局选项. - ...
- pytest合集(3)— 命令行参数
1.命令行参数大全 使用 pytest -h 可以查看 pytest 的命令行参数,有 10 大类共 132 个. 详见:Python pytest 132 个命令行参数用法 - 习久性成 - 博客园 ...
- python 代码命令大全-Python 命令行参数
Python 命令行参数 Python 提供了 getopt 模块来获取命令行参数. $ python test.py arg1 arg2 arg3 Python 中也可以使用 sys 的 sys.a ...
- JVM调优系列--Java命令选项(参数)--大全/详解/常用
原文网址:JVM调优系列--Java命令选项(参数)--大全/详解/常用_IT利刃出鞘的博客-CSDN博客 简介 说明 本文介绍Java的java命令用法,包括:常用用法.选项大全. J ...
- cmd中mysql的操作语句,mysql命令行操作大全 mysql命令行操作总结
摘要 腾兴网为您分享:mysql命令行操作大全 mysql命令行操作总结,悦作业,英语字典,银万财富,小书亭等软件知识,以及webex,过租号,宁夏教育云,百货通,准妈妈课堂,京客隆网上超市,壁纸生成 ...
- kafka 命令行操作大全
kafka 命令行操作大全 一.集群相关常用命令 二.topic相关常用命令 2.1 脚本&参数简介 2.2 举例 三.生产者命令行常用操作 3.1 脚本&参数简介 3.2 举例发送消 ...
- Go 学习笔记(45)— Go 标准库之 flag(命令行参数解析)
1. 参数解析说明 import "flag" flag 包实现了命令行参数的解析.每个参数认为一条记录,根据实际进行定义,到一个 set 集合.每条都有各自的状态参数. 使用 f ...
- INNO SETUP 获得命令行参数
INNO SETUP 获得命令行参数 原文 http://www.cnblogs.com/ahuo/archive/2009/07/30/1534998.html [Code] function Ge ...
最新文章
- 多花5美元提速500%,树莓派新品Zero 2 W发布
- centos java程序_刚得到一台centos7服务器,作为Java程序员应该做的事
- [翻译]XNA建设者俱乐部在线网站上的社区新亮点
- wpspbc按钮是什么意思_抖音私密账号什么意思 抖音热评私密账号什么梗怎么设置?...
- baseresponse响应类_Java response响应体和文件下载实现原理
- Android广播(笔记)
- getclass方法_防止数据重复提交的6种方法(超简单)!
- 当汽车工业遇见 AI,开发者的时代到来
- 381. O(1) 时间插入、删除和获取随机元素 - 允许重复
- 程序员桌面都这么秀?网友:用砖头当杯垫这样合适吗?留着吓谁
- 创客教室-中小学创客教育课程介绍
- 云计算中的第二个boss——网络虚拟化
- 金山词霸 2009 专业版
- Java—计算长方体、四棱锥的表面积和体积
- 降雪致国道315线部分道路通行受阻公路交警部门昼夜坚守
- 黑洞同样释放能量--霍金修正黑洞理论
- firstvalue函数mysql_MySQL 窗口函数之头尾函数
- TPS和QPS的区别
- DirectShow开发
- 多人使用服务器,如何开个人账户?以及 个人账户如何操作服务器?