A DocBook XML document always starts with this header:
<?xml version="1.0"?> <!DOCTYPE article PUBLIC "-//OASIS//DTD DocBook XML V4.1.2//EN" "http://www.oasis-open.org/docbook/xml/4.1.2/docbookx.dtd">
If working offline, the DTD URL may have to be changed to a local path to docbookx.dtd
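A minimal sketch of the header with a local DTD, shown as a header fragment only. The path /usr/share/xml/docbook/4.1.2/docbookx.dtd is a hypothetical location, not something these notes specify; substitute wherever the DTD is actually installed.

```xml
<?xml version="1.0"?>
<!-- Assumption: the DTD path below is hypothetical; point it at your local copy -->
<!DOCTYPE article PUBLIC "-//OASIS//DTD DocBook XML V4.1.2//EN"
  "/usr/share/xml/docbook/4.1.2/docbookx.dtd">
```

The public identifier stays the same; only the system identifier (the second quoted string) changes.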
The root element will usually be <book> or <article>; the name after <!DOCTYPE in the header must be changed to match it
If <book>, it is composed of multiple <chapter> elements, each with <section> elements inside
If <article>, it is composed of multiple <section> elements
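The two layouts might be sketched as follows. All titles and paragraph text are placeholders made up for illustration. First a book, with the DOCTYPE name changed to book:

```xml
<?xml version="1.0"?>
<!DOCTYPE book PUBLIC "-//OASIS//DTD DocBook XML V4.1.2//EN"
  "http://www.oasis-open.org/docbook/xml/4.1.2/docbookx.dtd">
<book>
  <title>Example Book</title>
  <chapter>
    <title>First Chapter</title>
    <section>
      <title>First Section</title>
      <para>Placeholder text for the first section.</para>
    </section>
  </chapter>
  <chapter>
    <title>Second Chapter</title>
    <section>
      <title>Another Section</title>
      <para>Placeholder text.</para>
    </section>
  </chapter>
</book>
```

An article drops the chapter layer and holds its sections directly:

```xml
<?xml version="1.0"?>
<!DOCTYPE article PUBLIC "-//OASIS//DTD DocBook XML V4.1.2//EN"
  "http://www.oasis-open.org/docbook/xml/4.1.2/docbookx.dtd">
<article>
  <title>Example Article</title>
  <section>
    <title>First Section</title>
    <para>Placeholder text.</para>
  </section>
  <section>
    <title>Second Section</title>
    <para>Placeholder text.</para>
  </section>
</article>
```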
Indentation and extra whitespace are not significant, except inside verbatim elements such as <programlisting>, <screen> and <literallayout>
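A small fragment to illustrate the difference, assuming it sits inside an article or chapter; the text is made up, and the exact rendering depends on the stylesheets used:

```xml
<section>
  <title>Whitespace</title>
  <!-- Ordinary element: line breaks and runs of spaces are not preserved -->
  <para>
    These     words   are
    reflowed into one paragraph.
  </para>
  <!-- Verbatim element: line breaks and indentation are kept as typed -->
  <programlisting>
line one
    indented line two
</programlisting>
</section>
```

Because whitespace inside the verbatim element is kept, its contents start at the left margin rather than following the indentation of the surrounding markup.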