In some embodiments, the system may be configured to implement one or more CNN architectures that are capable of retaining the sequential representation of the one or more high resolution images associated with the one or more documents in addition to extracting the one or more features from the one or more high resolution images associated with the one or more documents. For example, as the high resolution images are processed, the one or more IBE algorithms may be configured to traverse the one or more high resolution images associated with the one or more documents to extract features from various portions of the high resolution images associated with the one or more documents. For example, a high resolution image of the document may be a letterhead which has portions such as a heading, signature block, subject line, greeting, body, and/or the like. By implementing IBE algorithms such as WaveNet, the system may be configured to retain the sequence in which the features are extracted from the one or more high resolution images associated with the one or more documents.
Next, as shown in block 212, the process flow includes storing the one or more features extracted from the one or more high resolution images in a feature repository.