Is Flying Saucer cell imaging a mistake?

I managed to shorten my example to the following (it uses some reasonable data URLs, but stackoverflow should distinguish them from the side ...):

static { String oldValue = System.getProperty("java.protocol.handler.pkgs"); if (oldValue == null) { System.setProperty("java.protocol.handler.pkgs", "org.xhtmlrenderer.protocols"); } else if (!oldValue.contains("org.xhtmlrenderer.protocols")) { System.setProperty("java.protocol.handler.pkgs", oldValue + "|org.xhtmlrenderer.protocols"); } } @Test public void testLegacyCellSpacing() throws Exception { ITextRenderer renderer = new ITextRenderer(); renderer.setDocumentFromString("<html>\n" + " <head>\n" + " <title>Some title</title>\n" + " </head>\n" + " <body>\n" + " <div>\n" + "<table cellpadding=\"0\" cellspacing=\"0\">\n" + "<tr>\n" + "<td>\n" + " <img src=\"\" width=\"185\" height=\"83\"/>\n" + "</td>\n" + "</tr>\n" + "<tr>\n" + "<td>\n" + " <img src=\"\" width=\"185\" height=\"84\"/>\n" + "</td>\n" + "</tr>\n" + "<tr>\n" + "<td>\n" + " <img src=\"\" width=\"185\" height=\"83\"/>\n" + "</td>\n" + "</tr>\n" + "</table>\n" + " </div>\n" + " </body>\n" + "</html>", null); renderer.layout(); File pdfFile = File.createTempFile("TestFlyingSaucer", ".pdf"); OutputStream output = new BufferedOutputStream(new FileOutputStream(pdfFile)); try { renderer.createPDF(output, true); } finally { output.close(); } System.out.println(); // breakpointing here to look at the output. no good assertions available for checking this problem yet. :( } 

This code runs and generates a PDF file that has the wrong spaces between each image. I cannot understand what causes the spaces, but I cannot understand if this is a mistake. What i know:

  • Web browsers do not show spaces between images.
  • If you delete the images and replace them with something like a fixed div size, the problem will magically disappear (this proves that the flying saucer does the right thing with the inherited attributes of cellpadding and cellspacing, which was my first concern).
  • If you add a cell border to the cells ( td { border: 1px solid red; } ), this indicates that the extra space is outside the td elements.
  • If you remove the space between <td> and <img , the problem will disappear.

It seems strange to me that the extra space inside the element can lead to the addition of additional fields / fields outside the element. Maybe I'm completely mistaken in HTML, so I hope someone installs me right here before I delay the problem in my tracker.

(Also: interrupting the typical response to the knee-jerk reflex β€œdo not use tables for this” is not my HTML, I'm trying to make the real world of HTML into PDF after cleaning it. In this example, the message -cleanup HTML is shown to eliminate the problem in the cleaning code. )

+4
source share
1 answer

Browsers are sensitive to white space in HTML in many places, including inside and between elements, and even in places you don't expect. For example, if you have ul elements with li , if you use CSS to style elements with li {display:inline} to create a horizontal menu, any free space between </li> and the next <li> will contain the menu elements separately!

In this particular case, seeing the empty space departing from the first child is certainly unusual, but I can’t say categorically that this will be considered an error in accordance with HTML standards. The best solution is to simply eliminate the empty space in which it causes problems.

+1
source

All Articles