Question

我正在寻找一种使用Java库将HTML文件转换为PDF的方法，该库最好是免费的。我已经在网上搜索了一些工具来寻找可以使用的工具，但是还没有找到一个突出的解决方案（我已经看到一些提到的iText，但看起来这样会收取费用）。是否有可用于完成HTML转换为PDF的现有库？

Answer 1

更新：

我最终使用了Maven回购中的Flying-Saucer：https://mvnrepository.com/artifact/org.xhtmlrenderer/flying-saucer-pdf

让这个为我工作非常简单，这是我创建的一个使用它的方法：

public static void generatePDF(String inputHtmlPath, String outputPdfPath)
{
    try {
        String url = new File(inputHtmlPath).toURI().toURL().toString();
        System.out.println("URL: " + url);

        OutputStream out = new FileOutputStream(outputPdfPath);

        //Flying Saucer part
        ITextRenderer renderer = new ITextRenderer();

        renderer.setDocument(url);
        renderer.layout();
        renderer.createPDF(out);

        out.close();
    } catch (DocumentException | IOException e) {
        // TODO Auto-generated catch block
        e.printStackTrace();
    }
}

以下是用法：

public static void main(String[] args){
    String inputFile = "C:/Users/jrothst/Desktop/TestHtml.htm";
    String outputFile = "C:/Users/jrothst/Desktop/TestPdf.pdf";

    generatePDF(inputFile, outputFile);

    System.out.println("Done!");
}

输出PDF效果非常好，使用起来非常简单。它还很好地处理了html中的CSS。没有将它用于外部CSS，但我相信这也是可能的。

Answer 2

您有几个选择：

openhtmltopdf - 新代码，仍在酿造，但效果不错
Apache FOP - 可以转换XML，而不是HTML，但可能很有用
itext旧版本（第2版）
Wkhtmltopdf - 可以通过外部流程从Java调用它，到目前为止使用它非常成功

Answer 3

我使用javascript在我的应用程序中打印HTML表。

单击“打印”按钮执行以下功能。我有'printElement'功能用于打印。

$("#btnlwPrint").off('click').on('click',function() { 
    var cboVendorName ;
    cboVendorName= $('#cboVendorName').combogrid('textbox').val();
     var tbodylength=$('#ssgGrid tbody tr').length;
    if (cboVendorName =="" || tbodylength <= 1){
     $('#warPrintEmptyRow').modal('toggle');
     } else {

         $('#lWVendor').text(cboVendorName);
         printElement(document.getElementById("printHeader"));
         printElement(document.getElementById("ssgGrid"),true);     
         window.print();
     }
 });

//Below function prints the grid
function printElement(elem, append, delimiter) {
    var domClone = elem.cloneNode(true);
    var $printSection = document.getElementById("printSection");
    if (!$printSection) {
        var $printSection = document.createElement("div");
        $printSection.id = "printSection";
        document.body.appendChild($printSection);
    }

    if (append !== true) {
        $printSection.innerHTML = "";
    }

    else if (append === true) {
        if (typeof(delimiter) === "string") {
            $printSection.innerHTML += delimiter;
        }
        else if (typeof(delimiter) === "object") {
            $printSection.appendChlid(delimiter);
        }
    }

    $printSection.appendChild(domClone);
 }

Answer 4

以下是将html文件完整转换为pdf文件的工作示例。

import com.itextpdf.text.Document;
import com.itextpdf.text.html.simpleparser.HTMLWorker;
import com.itextpdf.text.pdf.PdfWriter;
import java.io.IOException;
import java.io.FileReader;
import java.io.Reader;
import java.io.BufferedReader;
import java.io.File;
import java.io.FileOutputStream;
import java.io.OutputStream;
import java.io.StringReader;
import org.jsoup.Jsoup;

public class Html2pdf2 {
private Html2pdf2() {}

public static String extractText(Reader reader) throws IOException {
StringBuilder sb = new StringBuilder();
BufferedReader br = new BufferedReader(reader);
String line;
while ( (line=br.readLine()) != null) {
  sb.append(line);
}
String textOnly = Jsoup.parse(sb.toString()).text();
return textOnly;
}

public final static void main(String[] args) throws Exception{
FileReader reader = new FileReader
      ("example.html");

 try {

OutputStream file = new FileOutputStream(new File("D:\\Test.pdf"));
Document document = new Document();
PdfWriter.getInstance(document, file);
document.open();
HTMLWorker htmlWorker = new HTMLWorker(document);
htmlWorker.parse(new StringReader(ht));
document.close();
file.close();

} catch (Exception e) {
e.printStackTrace();
}

System.out.println("finished converting");
}
}

使用Java将HTML文件转换为PDF

4 个答案: