DIVAServices

A RESTful Web Service Framework for Document Image Analysis Methods

Find it on Github

State-of-the-Art
Document Analysis Methods

Currently 25 different methods available

RESTful Architecture

Execute methods with a simple HTTP requests

Micro Service Architecture

One Docker Image per method

SDK Reinvented: Document Image Analysis Methods as RESTful Web Services

Abstract. Document Image Analysis (DIA) systems become ever more advanced, but also more complex — computationally, and logically. This increases the difficulty of integrating existing state-of-the-art approaches into new research or into practical workflows. The current approach to sharing software is publishing source code — leaving the burden to the integrator — or creating a Software Development Kit (SDK) which is often restricted to one programming language. We present DIVAServices a framework for sharing and accessing DIA methods within the research community and beyond. Using a RESTful web service architecture we provide access to the methods, leading to only one system on which the binaries of methods need to be maintained. All it takes for a developer to use an algorithm is a simple HTTP request with the image data and parameters for the method and they will receive the computed results in a format that allows for seamless integration into any kind of workflow or for further processing. Furthermore, DIVAServices is open-source, enabling other research groups or libraries to host their own instance in their environment. Using this framework, future DIA systems can be built on the shoulders of well tested algorithms, accessible to everyone.

Web Services in Document Image Analysis - Recent Developments and the Importance of Building an Ecosystem

Abstract. Web Services are being adapted into the workflows of many Document Image Analysis researchers. However, so far, there is no common platform for providing access to algorithms in the community. DIVAServices aims at becoming this by providing a platform that is open to the whole community to provide their own methods as Web Services. In this paper we present updates and enhancements of the existing DIVAServices platform. These includes a new computational backend, a revamped execution workflow based on asynchronous communication, and the possibility for methods to specify their outputs. Furthermore we discuss the importance of an “ecosystem” for such a platform. We argue that only providing a RESTful API is not enough. Users need tools and services around the framework that supports them in adapting the Web Services and we introduce some of the tools that we built around DIVAServices.