DEALING WITH PDFS
©2020 by Donald R. Snow
This page was last updated 2020-01-19.
Return to
the Utah Valley Technology
and Genealogy Group Home Page or
Don's
Class Listings Page .
ABSTRACT: PDF stands for Portable
Document Format and was developed and released to the public without copyright by Adobe. A pdf page displays exactly the same on any device or computer, regardless of the operating system. Because of this many organizations, including the Church of Jesus Christ of Latter-day Saints, have adopted it for their manuals, handbooks, magazines, documents, and reports. Thus, the need to knowhow to work with pdfs, editing, separating into parts, rearranging pages, and making them searchable, if they aren't already. Many forms are in pdf and free programs allow you to fill in the forms electronically. There are many programs to do these tasks, including many free ones, and we will discuss some of these. The class notes and related articles,
all with active Internet links, are on Don's
website
http://uvtagg.org/classes/dons/dons-classes.html
.
WELCOME AND INTRODUCTION
- Instructor is Donald R. Snow of St. George and
Provo, Utah ( snowd@math.byu.edu
).
- These notes, with active Internet links and other
related articles, are on Don's website http://uvtagg.org/classes/dons/dons-classes.html
.
- Tips: (1) To put an icon on your desktop
for the URL for these notes, or any webpage, just drag
the icon that is in front of the address in your browser to
your desktop. (2) To open a link while
keeping your place on this page, hold down the
Control key while clicking the link, so it opens in a
new tab.
- The problem for today: What are pdfs and what are
some free programs to work with them?
DOCUMENTS AND IMAGES OF DOCUMENTS
- Documents can be in many formats: .odt,
.doc, .docx, .txt, etc., but these formats don't display the same on all computers.
- Images of documents (pictures of the text) can be in
many formats: .jpg, .tif, png, etc., and these don't display the same on all computers, either.
- .pdf is a format for images of documents that do display the same on all computers and all operating systems; hence the benefit of pdfs
- Remember: pdf's are PICTURES of the document pages, so they look exactly the
same on any computer whether Windows, Apple, Linux, or anything else
- pdf was developed by Adobe in the 1990s and released to the public with no copyright restrictions so everyone could use them as they wanted -- see
the history of pdf in Wikipedia -- many organizations world-wide have adopted this format, including the Church of Jesus Christ of Latter-day Saints
- Types of pdf's
- pdf's with no text layer -- These are images (pictures) of the text page and are not searchable.
- pdf's with text layer -- these have a layer over the picture of the page that identifies the symbols as words in some language and hence are searchable; adding this text layer is called OCR'ing (Optical Character Recognition)
- There are several new pdf formats with additional features such as encryption and password protection and we won't discuss those here.
- pdfs are in different resolutions, e.g. 150 dpi (dots per inch), 300
dpi, or 600 dpi , and the higher the resolution, the more accurate, but also the larger the file size
- pdfs can be read, searched, printed, edited, split, merged, rotated, annotated, emailed, uploaded to FamilySearch, etc.
SOURCES OF PDFS
- Text editors (Word, LibreOffice, WordPerfect, etc.) can save pages as pdfs; this is usually in an Export menu
- Windows 10 has a built-in "printer" to print anything to pdf -- Print to Microsoft PDF -- I set that as my default printer since I usually only want a pdf and not a printed hardcopy
- Flatbed and other types of scanners produce pdfs, usually without the text layer, so they are not searchable without OCR'ing them; as an example, you can scan the forms your doctor sends and write on them electronically and print a copy to take to your appointment
- Screen capture programs such as FastStone Capture (old free version is available from FastStone
Capture 5.3) can save as pdf
- Books downloaded from
FamilySearch
, Google , Internet
Archive , and
HeritageQuest
Online are usually in pdf
- Downloading from websites such as -- https://www.churchofjesuschrist.org/?lang=eng -- Church manuals, handbooks, magazines, conference reports are
all pdf
- Ordinance forms from FamilySearchae are pdfs.
- Can convert from many formats by just
printing the file to a pdf printer and two free conversion programs are
IrfanView
and Hexonic
PDF READERS
- There are many free pdf reader programs for computers, tablets, and
smartphones; they have various settings, e.g. to show only one page or two pages side-by-side, as in a book
- List of pdf programs in various categories -- Wikipedia
- Adobe Reader is a free pdf reader and (partial) editor --this is NOT the full and expensive Adobe program; the free Adobe Reader is available from many websites including https://acrobat.adobe.com/us/en/acrobat/pdf-reader.html -- (download the free version, not the commercial trial version)
- Sumatra
PDF -- freeware, very fast reader, easy to use
- Gizmo's
List of top 5 pdf Readers
- Calibre -- good book
cataloger, reader, organizer, and converter -- drag-and-drop pdfs and ePubs and it organizes and catalogs them; allows reading and conversion to and from various formats
PDF EDITORS
- Printing to pdf -- Windows 10 has a built-in Print To PDF which you can set as your default printer so any program you want to print from will go to pdf; there are also many free pdf printer programs you can download and install
- Good list and description of free pdf programs -- Gizmo's
Review of Best Free PDF Tools
- Two helpful and free pdf editors are -- https://www.tracker-software.com/product/downloads/enduser /pdf-xchange-editor and https://acrobat.adobe.com/us/en/acrobat/pdf-reader.html
- Writing on pdfs
- Can write on, annotate, put sticky notes on, etc., with the above two free programs
- LibreOffice -- freeware,
can import pdf's into its DRAW program, edit them, and export as
edited pdf's, but it's not as simple as writing text
- For forms to fill out you can scan them to pdf and fill them in with an editor,; then save the filled in form and print it, if you need to, but you always have the original copy on your computer
- Spliting, merging, rearranging or rotating pages
- Split, merge, and rotate pdf's -- pdfsam
and 7-PDF Maker
and PDFill
Free PDF Tools -- create several pdf's from
one or create a single pdf from several --
7-PDF Split and Merge -- https://www.7-pdf.com/downloads -- see review and link on dotTech
- Splitting out every page -- sometimes called "bursting"
the file -- can usually set the numbering so the file number contains the page number so they sort in order
- To reassemble a split pdf you can open a copy of the original file in another window so you can see which pages are the ones you want to combine.
- Renaming
- Use Windows Explorer with Preview Panel (right side) open to show
what's in the pdf without having to open it -- can then rename it without opening it
- May need to add leading 0's to make the pages sort in order, e.g. 001, 002, ...
- To sort pages with same name use numbering as ###a, ###b, etc.
- Sometimes helps to write page number in pencil on oridinal paper before scanning so you know which page it is
- Bulk
File Namer -- very useful to rename collections of files
- See other class notes for my file naming system to make files findable and sort
in order regardless of location
SEARCHING AND OCR'ING PDF'S
- To be searchable pdf's must have the text layer from OCR (Optical
Character Recognition) ; books from FamilySearch
Books and Internet
Archive already have this, but books from Google do not
- Adobe Acrobat Reader will now do OCR -- see link above
- Freeware program with built-in OCR is PDF-XChangeEditor from https://www.tracker-software.com/product/downloads/enduser/pdf-xchange-editor -- open the pdf, click on OCR; the new file is searchable; save it with a new name so you don't overwrite your old file; PDF-XChangeEditor does a reasonable job, but is not as good as the full and expensive commercial OCR programs
CONCLUSIONS
- Working with pdf's is important in family history and in everyday computer work.
- This has been just an introduction to dealing with pdf's and there is much more. Some additional information is in other class notes and Freeware Corner articles on my webpage.
Return to the Utah
Valley Technology and Genealogy Group Home Page or Don's
Class Listings Page .