Provided by: pdf2htmlex_0.14.6+ds-2build1_amd64

原文链接:Ubuntu Manpage: pdf2htmlEX - converts PDF to HTML without losing text and format. http://manpages.ubuntu.com/manpages/xenial/man1/pdf2htmlEX.1.html

NAME

         pdf2htmlEX - converts PDF to HTML without losing text and format.

USAGE

         pdf2htmlEX [options] <input-filename> [<output-filename>]

DESCRIPTION

       pdf2htmlEX is a utility that converts PDF files to HTML files.pdf2htmlEX  tries  its  best  to  render the PDF precisely, maintain proper styling, whileretaining text and optimizing for Web.Fonts are extracted form PDF and then embedded into HTML, text in the converted HTML  fileis usually selectable and copyable.Other objects are rendered as images and also embedded.

OPTIONS

   Pages-f, --first-page <num> (Default: 1)Specify the first page to process-l, --last-page <num> (Default: last page)Specify the last page to processDimensions--zoom <ratio>, --fit-width <width>, --fit-height <height>--zoom specifies the zoom factor directly; --fit-width/height specifies the maximumwidth/height of a page, the values are in pixels.If multiple values are specified, the minimum one will be used.If none is specified, pages will be rendered as 72DPI.--use-cropbox <0|1> (Default: 1)Use CropBox instead of MediaBox for output.--hdpi <dpi>, --vdpi <dpi> (Default: 144)Specify the horizontal and vertical DPI for imagesOutput--embed <string>--embed-css <0|1> (Default: 1)--embed-font <0|1> (Default: 1)--embed-image <0|1> (Default: 1)--embed-javascript <0|1> (Default: 1)--embed-outline <0|1> (Default: 1)Specify which elements should be embedded into the output HTML file.If switched off, separated files will be generated along with the HTML file for thecorresponding elements.--embed  accepts  a  string  as  argument. Each letter of the string must be one of`cCfFiIjJoO`, which corresponds to one of  the  --embed-***  switches.  Lower  caseletters  for  0 and upper case letters for 1. For example, `--embed cFIJo` means toembed everything but CSS files and outlines.--split-pages <0|1> (Default: 0)If turned on, the content of each page is stored in a separated file.This switch is useful if you want pages to be loaded separately & dynamically --  asupporting server might be necessary.Also see --page-filename.--dest-dir <dir> (Default: .)Specify destination folder.--css-filename <filename> (Default: <none>)Specify the filename of the generated css file, if not embedded.If it's empty, the file name will be determined automatically.--page-filename <filename> (Default: <none>)Specify the filename template for pages when --split-pages is 1A  %d  placeholder  may be included in `filename` to indicate where the page numbershould be placed. The placeholder supports a limited  subset  of  normal  numericalplaceholders, including specified width and zero padding.If  `filename`  does not contain a placeholder for the page number, the page numberwill be inserted directly before the file extension. If the filename does not  havean extension, the page number will be placed at the end of the file name.If  --page-filename  is not specified, <input-filename> will be used for the outputfilename, replacing the extension with .page and adding the  page  number  directlybefore the extension.Examplespdf2htmlEX --split-pages 1 foo.pdfYields page files foo1.page, foo2.page, etc.pdf2htmlEX --split-pages 1 foo.pdf --page-filename bar.bazYields page files bar1.baz, bar2.baz, etc.pdf2htmlEX --split-pages 1 foo.pdf --page-filename page%dbar.bazYields page files page1bar.baz, page2bar.baz, etc.pdf2htmlEX --split-pages 1 foo.pdf --page-filename bar%03d.bazYields page files bar001.baz, bar002.baz, etc.--outline-filename <filename> (Default: <none>)Specify the filename of the generated outline file, if not embedded.If it's empty, the file name will be determined automatically.--process-nontext <0|1> (Default: 1)Whether to process non-text objects (as images)--process-outline <0|1> (Default: 1)Whether to show outline in the generated HTML--process-annotation <0|1> (Default: 0)Whether to show annotation in the generated HTML--process-form <0|1> (Default: 0)Whether to include text fields and radio buttons in the generated HTML--printing <0|1> (Default: 1)Enable printing support. Disabling this option may reduce the size of CSS.--fallback <0|1> (Default: 0)Output  in  fallback  mode,  for better accuracy and browser compatibility, but thesize becomes larger.--tmp-file-size-limit <limit> (Default: -1)This limits the total size (in KB) of the temporary files which will also limit thetotal  size of the output file.  This is an estimate and it will stop after a page,once the total temporary files size is greater than this number.-1 means no limit and is the default.Fonts--embed-external-font <0|1> (Default: 1)Specify whether the local matched fonts, for fonts not embedded in PDF,  should  beembedded into HTML.If  this switch is off, only font names are exported such that web browsers may tryto find proper fonts themselves, and that might cause issues about  incorrect  fontmetrics.--font-format <format> (Default: woff)Specify the format of fonts extracted from the PDF file.--decompose-ligature <0|1> (Default: 0)Decompose ligatures. For example 'fi' -> 'f''i'.--auto-hint <0|1> (Default: 0)If set to 1, hints will be generated for the fonts using FontForge.This may be preceded by --external-hint-tool.--external-hint-tool <tool> (Default: <none>)If  specified, the tool will be called in order to enhanced hinting for fonts, thiswill precede --auto-hint.The tool will be called as '<tool> <in.suffix> <out.suffix>', where suffix will  bethe same as specified for --font-format.--stretch-narrow-glyph <0|1> (Default: 0)If  set  to  1,  glyphs narrower than described in PDF will be stretched; otherwisespace will be padded to the right of the glyphs--squeeze-wide-glyph <0|1> (Default: 1)If set to 1, glyphs wider than described in PDF will be squeezed; otherwise it willbe truncated.--override-fstype <0|1> (Default: 0)Clear the fstype bits in TTF/OTF fonts.Turn  this on if Internet Explorer complains about 'Permission must be Installable'AND you have permission to do so.--process-type3 <0|1> (Default: 0)If turned on, pdf2htmlEX will try to convert Type 3 fonts such  that  text  can  berendered  natively  in HTML.  Otherwise all text with Type 3 fonts will be renderedas image.This feature is highly experimental.Text--heps <len>, --veps <len> (Default: 1)Specify the maximum tolerable horizontal/vertical offset (in pixels).pdf2htmlEX would try to optimize the generated HTML file moving  Text  within  thisdistance.--space-threshold <ratio> (Default: 0.125)pdf2htmlEX  would  insert  a  whitespace  character ' ' if the distance between twoconsecutive letters in the same line is wider than ratio * font_size.--font-size-multiplier <ratio> (Default: 4.0)Many web browsers limit the minimum font size, and many would round the given  fontsize, which results in incorrect rendering.Specify  a  ratio  greater than 1 would resolve this issue, however it might freezesome browsers.For some versions of Firefox, however, there will be a problem when the  font  sizeis too large, in which case a smaller value should be specified here.--space-as-offset <0|1> (Default: 0)If  set  to  1,  space characters will be treated as offsets, which allows a betteroptimization.For PDF files  with  bad  encodings,  turning  on  this  option  may  cause  losingcharacters.--tounicode <-1|0|1> (Default: 0)A  ToUnicode map may be provided for each font in PDF which indicates the 'meaning'of the characters. However often there is  better  "ToUnicode"  info  in  Type  0/1fonts,  and sometimes the ToUnicode map provided is wrong.  If this value is set to1, the ToUnicode Map is always applied, if provided in PDF, and characters may  notrender correctly in HTML if there are collisions.If  set to -1, a customized map is used such that rendering will be correct in HTML(visually the same), but you may not get correct characters  by  select  &  copy  &paste.If set to 0, pdf2htmlEX would try its best to balance the two methods above.--optimize-text <0|1> (Default: 0)If  set  to  1,  pdf2htmlEX will try to reduce the number of HTML elements used fortext. Turn it off if anything goes wrong.--correct-text-visibility <0|1> (Default: 0)If set to 1, pdf2htmlEX will try to detect texts  covered  by  other  graphics  andproperly  arrange  them, i.e. covered texts are made transparent in text layer, andare drawn on background layer.Background Image--bg-format <format> (Default: png)Specify the background image format. Run `pdf2htmlEX -v`  to  check  all  supportedformats.--svg-node-count-limit <limit> (Default: -1)If  node count in a svg background image exceeds this limit, fall back this page tobitmap background; negative value means no limit.  This option is only useful  when'--bg-format  svg'  is  specified.  Note  that node count in svg is just calculatedapproximately.--svg-embed-bitmap <0|1> (Default: 1)Whether embed bitmaps in svg background image. 1: embed bitmaps in svg  background;0: dump bitmaps to external files if possible.This  option is only useful when '--bg-format svg' is specified and '--embed-image'is off.Currently, RGB or Gray JPEG bitmaps in a PDF can be dumped, while  those  in  otherformats  or colorspaces are still embedded.  If bitmaps are not dumped as expected,try pre-processing your PDF by ghostscript or acrobat and make sure bitmaps  in  itare converted to RGB/Gray JPEG format. See the project wiki for more details.PDF Protection-o, --owner-password <password>Specify owner password-u, --user-password <password>Specify user password--no-drm <0|1> (Default: 0)Override document DRM settingsTurn this on only when you have permission.Misc.--clean-tmp <0|1> (Default: 1)If switched off, intermediate files won't be cleaned in the end.--data-dir <dir> (Default: /usr/share/pdf2htmlEX)Specify the folder holding the manifest and other files (see below for the manifestfile)`--tmp-dir <dir> (Default: /tmp or $TMPDIR if set)Specify the temporary folder to use for temporary files--css-draw <0|1> (Default: 0)Experimental and unsupported CSS drawing--debug <0|1> (Default: 0)Print debug information.--proof <0|1|2> (Default: 0)Output a proof version. If a positive value is specified, texts are drawn  on  bothtext  layer  and  background  image  for  comparison.   If 2 is specified, texts onbackground are in different colors. If png/jpg background format is used, a  higherhdpi/vdpi (e.g. 288) is recommended for legibility.Meta-v, --versionPrint copyright and version info--help Print usage information

MANIFEST and DATA-DIR

       When  split-pages  is  0,  the  manifest  file describes how the final html page should begenerated.By default, pdf2htmlEX will use the manifest in the default data-dir (run `pdf2htmlEX  -v`to check), which gives a simple demo of its syntax.You  can  modify  the  default  one,  or  you can create a new one and specify the correctdata-dir in the command line.All files referred by the manifest must be located in the data-dir.

EXAMPLE

       pdf2htmlEX /path/to/file.pdfConvert file.pdf into file.htmlpdf2htmlEX --clean-tmp 0 --debug 1 /path/to/file.pdfConvert file.pdf and leave all intermediate files.pdf2htmlEX --dest-dir out --embed fi /path/to/file.pdfConvert file.pdf into out/file.html and leave font/image files separated.

COPYRIGHT

       Copyright 2012,2013 Lu Wang <coolwanglu@gmail.com>pdf2htmlEX is licensed under GPLv3 with additional terms, read LICENSE for details.

AUTHOR

       pdf2htmlEX is written by Lu Wang <coolwanglu@gmail.com>

pdf2htmlEX命令行参数大全相关推荐

  1. x265 命令行参数大全

    除非一个参数被标记为 CLI ONLY,否则该参数也被x265_param_parse() 支持.CLI使用getopt 函数来解释命令行参数,长短版本的参数都可以使用,长参数在不引起歧义的情况下可以 ...

  2. 最新VLC命令行参数大全(一)

    用法: vlc [选项] [流] ... 您可以在命令行中指定多个流. 它们将被加入播放列表队列. 指定的首个项目将被首先播放. 选项风格:   --选项  用于设置程序执行期间的全局选项.    - ...

  3. pytest合集(3)— 命令行参数

    1.命令行参数大全 使用 pytest -h 可以查看 pytest 的命令行参数,有 10 大类共 132 个. 详见:Python pytest 132 个命令行参数用法 - 习久性成 - 博客园 ...

  4. python 代码命令大全-Python 命令行参数

    Python 命令行参数 Python 提供了 getopt 模块来获取命令行参数. $ python test.py arg1 arg2 arg3 Python 中也可以使用 sys 的 sys.a ...

  5. JVM调优系列--Java命令选项(参数)--大全/详解/常用

    原文网址:JVM调优系列--Java命令选项(参数)--大全/详解/常用_IT利刃出鞘的博客-CSDN博客 简介 说明        本文介绍Java的java命令用法,包括:常用用法.选项大全. J ...

  6. cmd中mysql的操作语句,mysql命令行操作大全 mysql命令行操作总结

    摘要 腾兴网为您分享:mysql命令行操作大全 mysql命令行操作总结,悦作业,英语字典,银万财富,小书亭等软件知识,以及webex,过租号,宁夏教育云,百货通,准妈妈课堂,京客隆网上超市,壁纸生成 ...

  7. kafka 命令行操作大全

    kafka 命令行操作大全 一.集群相关常用命令 二.topic相关常用命令 2.1 脚本&参数简介 2.2 举例 三.生产者命令行常用操作 3.1 脚本&参数简介 3.2 举例发送消 ...

  8. Go 学习笔记(45)— Go 标准库之 flag(命令行参数解析)

    1. 参数解析说明 import "flag" flag 包实现了命令行参数的解析.每个参数认为一条记录,根据实际进行定义,到一个 set 集合.每条都有各自的状态参数. 使用 f ...

  9. INNO SETUP 获得命令行参数

    INNO SETUP 获得命令行参数 原文 http://www.cnblogs.com/ahuo/archive/2009/07/30/1534998.html [Code] function Ge ...

最新文章

  1. 多花5美元提速500%,树莓派新品Zero 2 W发布
  2. centos java程序_刚得到一台centos7服务器,作为Java程序员应该做的事
  3. [翻译]XNA建设者俱乐部在线网站上的社区新亮点
  4. wpspbc按钮是什么意思_抖音私密账号什么意思 抖音热评私密账号什么梗怎么设置?...
  5. baseresponse响应类_Java response响应体和文件下载实现原理
  6. Android广播(笔记)
  7. getclass方法_防止数据重复提交的6种方法(超简单)!
  8. 当汽车工业遇见 AI,开发者的时代到来
  9. 381. O(1) 时间插入、删除和获取随机元素 - 允许重复
  10. 程序员桌面都这么秀?网友:用砖头当杯垫这样合适吗?留着吓谁
  11. 创客教室-中小学创客教育课程介绍
  12. 云计算中的第二个boss——网络虚拟化
  13. 金山词霸 2009 专业版
  14. Java—计算长方体、四棱锥的表面积和体积
  15. 降雪致国道315线部分道路通行受阻公路交警部门昼夜坚守
  16. 黑洞同样释放能量--霍金修正黑洞理论
  17. firstvalue函数mysql_MySQL 窗口函数之头尾函数
  18. TPS和QPS的区别
  19. DirectShow开发
  20. 多人使用服务器,如何开个人账户?以及 个人账户如何操作服务器?

热门文章

  1. 《代码整洁之道》读后感及总结
  2. linux脚本:每天晚上 12 点,打包站点目录/var/www/html 备份到/data 目录下
  3. android 夜间模式功能,Android 夜间模式的三种实现
  4. 无线网卡wifi无法启用
  5. iOS 10诸如相机、相册、通讯录、麦克风、定位权限设置,防止奔溃或上架被拒
  6. 第十六天(配置BPDU,TCN BPDU)
  7. python如何把矩阵转换为图片_如何将numpy数组转换为(并显示)图片
  8. 快速排序与冒泡排序的效率对比
  9. CSDN怎么加粗字体
  10. 云计算供应商在合同谈判时可能拒绝的三个事项以及要求