PDF hacks

internetarchivepdf.pdfhacks.fast_insert_image(page, rect=None, width=0, height=0, stream=None, mask=None, stream_fmt='jpeg2000', mask_fmt='jbig2', gray=True)[source]

Fast image insertion

Args:

  • page: output fitz.Page

  • rect: rectangle to use

  • width: image width

  • height: image height

  • stream: image stream

  • mask: mask image stream (if any)

  • stream_fmt: COMPRESSOR_JPEG2000 or COMPRESSOR_JPEG

  • mask_fmt: COMPRESSOR_JBIG2 or None

  • gray: if the image is grayscale (otherwise RGB is assumed)

internetarchivepdf.pdfhacks.write_metadata(from_pdf, to_pdf, extra_metadata)[source]

Write document and XMP metadata.

Args:

  • from_pdf (fitz.Document or None): metadata to copy from input PDF, can be omitted

  • to_pdf: (fitz.Document): PDF to write metadata to

  • extra_metadata (dict): dictionary with extra metadata values

Allowed values for extra_metadata:

  • ‘url’

  • ‘title’

  • ‘author’

  • ‘creator’

  • ‘subject’

  • ‘creatortool’

  • ‘language’ (can be a list)