I recommend taking a look at William Underwood’s article on the extraction of metadata that was published in the The International Journal of Digital Curation. The case study focused on automating the extraction of metadata from a collection of university records.