For sure. Seems like a great case for automated tests, imo. One could imagine systems that automatically write out both formats and read them in in various ways to ensure continuous availability of that functionality (if it's really necessary). Defining a representative set of documents will also help, so when something does inevitably go wrong, you can identify regressions to discover what you didn't expect, and grow that set of documents/tests over time to improve coverage. Good luck.