====== Machine Identification Code (MIC) Dataset ======

This dataset was generated in a joint effort by the Electronic Frontier Foundation ([[http://www.eff.org|EFF]]) and the Multimedia Analysis and Data Mining (MADM) Group at the German Research Center for Artificial Intelligence (DFKI). Its purpose is to provide researchers with a wide variety of machine identification codes for development and evaluation. The documents were collected by the EFF, then scanned and ground-truthed at DFKI.

An overview of the printer samples is given [[https://madm.dfki.de/files/downloads/mic-dataset-overview.txt|here]].

Two versions are available:

  * [[https://madm.dfki.de/files/downloads/mic.zip|full color 600 dpi scans]] (32 GByte)
  * [[https://madm.dfki.de/files/downloads/mic-bw.zip|b/w images]] containing only the extracted dots (139 MByte); see the [[http://www.dfki.uni-kl.de/~beusekom/PDFs/beusekom--optical-document-security-in-high-volume-office-environments--thesis--2010.pdf|thesis]] for the binarization method

Contact: Faisal Shafait, Joost van Beusekom