Install Tesseract dependency
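Changes the Dockerfile of the container used to run the tests. Tesseract
must be installed for the OCR to work, so the image now installs the
libtesseract4 and tesseract-ocr-por packages. The Apache Tika download
also moves above the step that copies the code into the image.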

Signed-off-by: José Guilherme Vanz <[email protected]>
jvanz committed Dec 3, 2020
1 parent 98fc326 commit bf65f15
Showing 1 changed file with 5 additions and 5 deletions.
10 changes: 5 additions & 5 deletions Dockerfile
@@ -3,7 +3,11 @@ FROM python:3.8
 RUN adduser --system gazette

 RUN apt-get update -y
-RUN apt-get -y install default-jre wait-for-it
+RUN apt-get -y install default-jre wait-for-it libtesseract4 tesseract-ocr-por

+# install Apache Tika
+ADD http://archive.apache.org/dist/tika/tika-app-1.24.jar /tika-app.jar
+RUN chmod 755 /tika-app.jar
+
 RUN mkdir /mnt/code
 COPY . /mnt/code
@@ -12,9 +16,5 @@ ENV PYTHONPATH=/mnt/code

 RUN pip install --no-cache-dir -r requirements.txt

-# install Apache Tika
-ADD http://archive.apache.org/dist/tika/tika-app-1.24.jar /tika-app.jar
-RUN chmod 755 /tika-app.jar
-
 USER gazette
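
One way to confirm that the packages added here are actually present in the built image is a small smoke check run inside the container. The following is a minimal sketch and not part of this commit: the helper names are hypothetical, and the tessdata path is an assumption about where Debian-based images place Tesseract 4 language files, so it may need adjusting.

```python
import ctypes.util
from pathlib import Path

# Assumed location of Tesseract 4 language data on Debian-based images;
# adjust this if the image stores tessdata elsewhere.
DEFAULT_TESSDATA_DIR = Path("/usr/share/tesseract-ocr/4.00/tessdata")


def has_language_data(lang, tessdata_dir=DEFAULT_TESSDATA_DIR):
    """Return True if <lang>.traineddata exists in tessdata_dir."""
    return (Path(tessdata_dir) / f"{lang}.traineddata").is_file()


def has_tesseract_library():
    """Return True if a libtesseract shared library can be located."""
    return ctypes.util.find_library("tesseract") is not None


if __name__ == "__main__":
    # Report what the running environment provides.
    print("libtesseract found:", has_tesseract_library())
    print("Portuguese data found:", has_language_data("por"))
```

Running the script in the test container reports whether the shared library and the Portuguese language data can be found, which may help tell a missing package apart from an OCR failure in the tests themselves.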
