You can inspect individual XML files with an XML file viewer. XML files consist of text only, so a text editor such as the Microsoft Windows Notepad utility will do. Here is a small portion of the USPTO’s XML file for US patent no. 8329177 as viewed in Notepad. You can see the descriptive tag pairs (e.g. <document-id></document-id>) encapsulating the information content, but the tags’ hierarchical structure isn’t readily apparent via Notepad.


A web browser can also be used to inspect an XML file. Here is the same small portion of the XML file for US 8329177 shown above in Notepad, as viewed in Microsoft Internet Explorer. In this case, the tags’ hierarchical structure is made apparent by color highlighting and by hierarchical indentation levels. The encapsulated information is bolded. This makes it somewhat easier to browse through the contents of a single XML file—if that is what you want to do.
None of this is very helpful, unless you are only interested in a particular XML file’s bibliographic patent data content and don’t mind manually inspecting and deciphering the file’s tagged information as outlined above. More generally, what we want to do is examine the information content of a number (preferably a very large number) of bibliographic patent data XML files in parallel. Future posts will delve into that topic.