The scraper currently stores the DAC XML and XLS, but only uses the XLS for generating CSV.
However, some stuff is present in the XML but not the XLS (e.g. sector category descriptions), and some mistakes have been corrected in the XML (see e.g. #53). We should find a way to merge the two sources.