@@ -6,83 +6,61 @@ Opaque is designed as a web application which can be easily used by researchers
In particular, researchers can use Opaque to start OCR jobs for digitized sources. The text output of these OCR jobs can then be used as input for tagging processes (POS, NER, etc.).
As a last step, texts can be loaded into an information retrieval system to query for specific words or phrases in connection with POS tags.
As a last step, texts can be loaded into an information retrieval system to query for specific words or phrases in connection with linguistic features.
## Dependencies
- Docker: https://www.docker.com/
- Python 3.5+
- cifs-utils
- Docker
- Docker Compose
## Setup
0. **Create log files**
``` bash
mkdir /logs
```
## Configuration and startup
1. **Create Docker swarm:**
The generated computational workload is handled by a [Docker](https://docs.docker.com/) swarm. A swarm is a group of machines that are running Docker and joined into a cluster. It consists of two kinds of members: managers and workers. Currently it is not possible to specify a dedicated Docker host; instead, Opaque expects the executing system to be a swarm manager of a swarm with at least one dedicated worker machine. The [swarm setup](https://docs.docker.com/engine/swarm/swarm-tutorial/) process is best described in the Docker documentation.
2. Create a dedicated user `opaque` on all swarm members with `sudo useradd opaque`.
The generated computational workload is handled by a [Docker](https://docs.docker.com/) swarm. A swarm is a group of machines that are running Docker and joined into a cluster. It consists of two kinds of members: managers and workers. Currently it is not possible to specify a dedicated Docker host; instead, Opaque expects the executing system to be a swarm manager of a cluster with at least one dedicated worker machine. The swarm setup process is best described in the [Docker documentation](https://docs.docker.com/engine/swarm/swarm-tutorial/).
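
As a rough sketch of the swarm setup (not a replacement for the Docker tutorial), the commands could look like the following. The address `192.0.2.10` is a hypothetical placeholder for the manager's IP, and the `run` helper is only an illustration device: by default it prints each command instead of executing it, so the script is safe to try on any machine.

```bash
#!/usr/bin/env bash
set -eu

# Hypothetical placeholder; replace with the real address of the manager.
MANAGER_IP="${MANAGER_IP:-192.0.2.10}"
# Dry run by default: commands are printed, not executed.
# Set DRY_RUN=0 to actually run them.
DRY_RUN="${DRY_RUN:-1}"

run() {
  if [ "$DRY_RUN" = "1" ]; then
    echo "+ $*"
  else
    "$@"
  fi
}

# On the machine that will run Opaque: create the swarm.
# This node becomes a swarm manager.
run docker swarm init --advertise-addr "$MANAGER_IP"

# Still on the manager: print the `docker swarm join ...` command
# (including the join token) that has to be executed on each worker.
run docker swarm join-token worker

# After the workers have joined: list all swarm members.
run docker node ls
```

The output of `docker node ls` should list the manager and at least one worker before Opaque is started.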
3. Create shared network storage
A shared network space is necessary so that all swarm members have access to all the data. To achieve this, a [Samba](https://www.samba.org/) share is used.
2. **Create a network storage**
A shared network space is necessary so that all swarm members have access to all the data. To achieve this, a [Samba](https://www.samba.org/) share can be used.
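
A minimal sketch of mounting such a share on a swarm member, assuming the share is mounted through `cifs-utils` (`mount -t cifs`). The server name, share name, mount point and credentials file are all hypothetical placeholders, and the `uid=opaque` option assumes a local user named `opaque` exists on the machine. As a precaution, the `run` helper only prints the commands unless `DRY_RUN=0` is set.

```bash
#!/usr/bin/env bash
set -eu

# All values below are hypothetical placeholders; adjust them to your setup.
SHARE="${SHARE:-//fileserver.example.org/opaque}"
MOUNT_POINT="${MOUNT_POINT:-/mnt/opaque}"
CREDENTIALS="${CREDENTIALS:-/etc/opaque-smb.credentials}"
# Dry run by default: commands are printed, not executed.
DRY_RUN="${DRY_RUN:-1}"

run() {
  if [ "$DRY_RUN" = "1" ]; then
    echo "+ $*"
  else
    "$@"
  fi
}

# Create the mount point (same path on every swarm member).
run sudo mkdir -p "$MOUNT_POINT"

# Mount the Samba share; the credentials file holds username= and password= lines.
run sudo mount -t cifs "$SHARE" "$MOUNT_POINT" \
  -o "credentials=${CREDENTIALS},uid=opaque"
```

Using the same mount point on every swarm member keeps file paths identical across managers and workers.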