Additional File 1.

Neoplasia classification structure (XML version). Neoclxml.gz is a compressed (gzipped) XML file. Rename the downloaded file to neoclxml.gz so that unzip utilities can recognize the .gz suffix. Unzip the file using a free, open source utility such as gunzip.exe [23] or a proprietary utility such as WinZip. Once unzipped, rename the file to neocl.xml so that it has an .xml suffix. If the file is too large to view in your web browser, it can be opened in a plain-text editor.
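For readers who prefer a script to a desktop utility, the unzip-and-rename steps can be sketched in Python using only the standard library. This is a minimal illustration, not part of the original distribution: the helper name gunzip_to is hypothetical, and the file names are the ones given above and are assumed to sit in the current working directory.

```python
import gzip
import shutil
from pathlib import Path


def gunzip_to(src: str, dst: str) -> Path:
    """Decompress the gzipped file at src and write the result to dst.

    gunzip_to is a hypothetical helper name chosen for this sketch.
    """
    # Stream the data in chunks so a large XML file is never held
    # in memory all at once.
    with gzip.open(src, "rb") as compressed, open(dst, "wb") as out:
        shutil.copyfileobj(compressed, out)
    return Path(dst)


# Example call (assumes neoclxml.gz is in the current directory):
# gunzip_to("neoclxml.gz", "neocl.xml")
```

Writing directly to neocl.xml combines the unzip and rename steps, so the output already carries the .xml suffix.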

