An exploration of how pdftopdf treats rotation of pages in a PDF file.
The purpose of most of this account is to examine and explain the treatment of what are often called portrait and landscape pages in a PDF file when they are handled by the filtering subsystem on a jessie/stretch/buster CUPS printing system. On wheezy the page management of rotation by pdftopdf is significantly different.
Portrait and landscape are not clearly defined technical terms. Generally speaking, they refer to the way a rectangular object is normally orientated so that sense can easily be made of it. An A4 sheet of paper with text parallel to the short edge would held in portrait mode, short edge horizontal. With text parallel to the long edge, a 90 or 270 degree rotation would take the sheet to landscape mode for ease of reading.
Portrait and landscape are conventions to describe what we see displayed. Beware of using the phrase "landscape printing". This openSUSE page explains why.
The Role of pdftopdf
Printers, even modern IPP printers, have limitations. Overall, they cannot directly scale the text in a PDF, directly do N-up printing, directly rotate a page or directly use a specific font such as freefont when printing a text file. It is not that they could not be manufactured to do such things; they just don't (in general).
So we need a filter, pdftopdf, to apply options such as these to a PDF file it receives. pdftopdf is a page management filter that handles a wide range of options, including ones that are specific to CUPS and cups-filters and others that are better not left to being dealt with by an an IPP printer. We will look at how it manages rotation of pages in a PDF file.
Portrait and Landscape Pages in a PDF
A PDF file with mixed orientation pages would be useful to have. mixed.pdf can be downloaded for you to use.
The file has three document pages. Information on each page can be displayed with pdfinfo:
pdfinfo -f 1 -l 3 mixed.pdf
For the first page:
Page 1 size: 595 x 842 pts (A4) Page 1 rot: 0
The page width is less than its height; it is a portrait page. rot: 0 tells a PDF viewer not to rotate the page. The page would be easily viewed without any rotation.
The second page has:
Page 2 size: 841.89 x 595.28 pts (A4) Page 2 rot: 0
This is a landscape page because its width is greater than its height. There is no rotation done by the viewer. This is another easily viewable page.
Page 3 size: 595 x 842 pts (A4) Page 3 rot: 90
This is a portrait designed page but the viewer rotates it 90 degrees clockwise for easy viewing, making it a landscape page.
Printing Portrait and Landscape Pages
Many printers take paper and feed it through the printer short edge first so that text is printed parallel to this edge. For suitably sized paper, page 1 should produce a satisfactory printout. On the other hand, pages 2 and 3 will probably have some part of the right-hand portion of the image not printed if they are sent as-is to the printer.
There are two solutions the printing system could adopt to overcome this problem. One is to rotate document pages 2 and 3 relative to the printer page before printing them. The second is to alter the images of document pages 2 and 3 to make them fit the width of the medium they are being printed on.
pdftopdf and Autorotation
A good way to appreciate what pdftopdf on jessie/stretch does to fit an image on a page of paper is to use cupsfilter. A PPD file, test.ppd, is available for this test and later ones; or you can provide your own.
/usr/sbin/cupsfilter -p test.ppd -m application/vnd.cups-pdf mixed.pdf > out.pdf
mixed.pdf can be compared with out.pdf and it can be seen that pages 2 and 3 have been autorotated to fit the images on paper which goes through a printer short edge first.
All print jobs sent from an application or via lp/lpr will undergo selective rotation of landscape pages by pdftopdf.
Let's see what pdfinfo reports for out.pdf and compare it with what was given for mixed.pdf:
Page 1 size: 595 x 842 pts (A4) Page 1 rot: 0 Page 2 size: 841.89 x 595.28 pts (A4) Page 2 rot: 270 Page 3 size: 595 x 842 pts (A4) Page 3 rot: 0
Page 1 hasn't been touched; the rotation field still has 0. Page 2 would now be rotated 270 degrees clockwise (90 degrees anticlockwise) when viewed. Page 3 has also been rotated 90 degrees anticlockwise and this cancels out its existing 90 degree clockwise rotation.
The autorotate feature is the only way a PDF with mixed orientation pages will be printed correctly when sent to a printer with lp/lpr or from some applications.
pdftopdf will autorotate appropriate pages by 90 degrees anticlockwise unless the PPD file specifies *LandscapeOrientation: Minus90.
Autorotation will not take place when the PDF workflow is not used.
Edit test.ppd and alter the line with *LandscapeOrientation to read
and repeat the previous cupsfilter command. You should observe that the rotation is now 90 degrees clockwise. Deleting or commenting out the line gives the same result.
Autorotation can be dispensed with by using -o nopdfAutoRotate-default or -o pdfAutoRotate-default=off when setting up a print queue or, on a job-to-job basis with lp/lpr, by -o nopdfAutoRotate or -o pdfAutoRotate=off. There is no way to turn it off from an application. Test with
/usr/sbin/cupsfilter -p test.ppd -m application/vnd.cups-pdf -o nopdfAutoRotate mixed.pdf > out.pdf
The landscape and orientation-requested Options
CUPS has a help page on Command-Line Printing and Options. It says there that
The -o landscape option will rotate the page 90 degrees....
The direction of rotation is governed by *LandscapeOrientation in the PPD file. Minus90 rotates all pages in a PDF file 90 degrees clockwise; Plus90 rotates by the same amount but anticlockwise. *LandscapeOrientation is part of Adobe's PostScript Printer Description File Format Specification.
On the same page we see:
The -o orientation-requested=N option rotates the page depending on the value of N: -o orientation-requested=4 - landscape orientation (90 degrees)
However, the orientation-requested=N option always rotates every page in a file in an anticlockwise sense. *LandscapeOrientation is not consulted by it. orientation-requested=N is an IPP attribute specified in rfc8011.
pdftopdf acts on the landscape and orientation-requested=N options whether or not autorotation (which is applied after either of these two options) is operative. The following command with *LandscapeOrientation as Plus90 in test.ppd might prove instructive:
/usr/sbin/cupsfilter -p test.ppd -m application/vnd.cups-pdf -o orientation-requested=4 mixed.pdf > out.pdf
Page 1 is first rotated 90 degrees anticlockwise by orientation-requested=4 and then autorotation applies a further 90 degrees anticlockwise rotation. The other two pages are rotated 90 degrees anticlockwise by orientation-requested=4 but then autorotation has nothing to do, so no further rotation takes place.
The fit-to-page Option
Suppose you want pages 2 and 3 of mixed.pdf to be printed with the text parallel to the short edge of the paper and also for them to fit on the sheet. Autorotation will have to be off for that.
/usr/sbin/cupsfilter -p test.ppd -m application/vnd.cups-pdf -o 'fit-to-page nopdfAutoRotate' mixed.pdf > out.pdf
For fitting all pages on the sheet of paper and having text parallel to its long edge:
/usr/sbin/cupsfilter -p test.ppd -m application/vnd.cups-pdf -o 'fit-to-page nopdfAutoRotate landscape' mixed.pdf > out.pdf
The Landscape and Portrait Options in Applications
The printing dialogue of a graphical application usully offers Landscape and Portrait options. The meaning attached to these two words when printing a PDF can differ from what CUPS means and also have different meanings for different applications. Three illustrations:
Portrait with Firefox/Iceweasel and Evince effectively means Do not rotate any page in this PDF. Landscape effectively means Rotate only pages with width less than height by 90 degrees anticlockwise. These instructions are used by Cairo to produce a PDF from the submitted one. The Portrait and Landscape options are not sent to CUPS when the Cairo-produced PDF is sent to the printing system.
Okular converts a PDF to PostScript before sending it to CUPS. Its Portrait option sends -o portrait to CUPS. This option is unknown to CUPS (which ignores it) and there is no rotation of any page. In principle its Landscape option sends -o landscape to CUPS; in practice it may send -o portrait with certain PostScript files. The Portrait and Landscape instructions play no part in the production of the PostScript file.
- Qpdfview does not modify the original PDF file. It simply sends it off to be processed by the printing system with the options selected in qpdfview's print dialog.
Autorotation with pdftopdf still functions as usual no matter how the applications themselves deal with their Portrait and Landscape options. This could be a factor in the apparent decrease in bug reports involving applications and page rotation.
A pdftopdf Bonus
Very simply - try this:
/usr/sbin/cupsfilter -p test.ppd -m application/vnd.cups-pdf -o 'fit-to-page booklet' mixed.pdf > out.pdf
Available only through the PDF workflow
Tobias Hoffmann <smilingthax SPAMFREE AT googlemail DOT com> and Till Kamppeter <till.kamppeter SPAMFREE AT gmail DOT com> for writing and improving the pdftopdf filter.