To run the application locally from scratch, follow these steps:
-
Clone the Repository: Clone the repository onto your local machine.
git clone https://github.com/BigDataIA-Spring2024-Sec1-Team4/Assignment3
-
Create a Virtual Environment: Set up a virtual environment to isolate project dependencies.
python -m venv venv
-
Activate the Virtual Environment: Activate the virtual environment.
-
Windows:
venv\Scripts\activate
-
Unix or MacOS:
source venv/bin/activate
-
-
Host Grobid Server and Run Airflow: Open Docker Desktop and host the Grobid server. (Run this in a separate terminal)
docker run -t --rm -p 8070:8070 lfoppiano/grobid:0.8.0 docker-compose up -d