|
A batch-oriented workflow can be developed, scheduled, and monitored using Apache Airflow, an open-source platform. You can integrate Airflow with virtually any technology thanks to its Python extension framework. Workflows can be managed using a web interface. Airflow is deployable in many ways, from simple processes running on laptops to distributed setups that can support even a huge flow of data. Why Choose Airflow?The Airflow framework can be easily extended to connect to new technology if your workflows have a clear start and end, and run at regular intervals. It is a batch workflow orchestration platform. If your workflows have a clear start and end and are scheduled to run at regular intervals, you can create Airflow DAGs. Features:
Advantages:
Disadvantages:
Installation for Apache Airflow:For Apache Airflow installation you should have pip installed first. Step 1: Install pip first, in case you have already installed move to Step 3. $ sudo apt-get install python3-pip Step 2: Set the location $ export AIRFLOW_HOME=~/airflow Step 3: Install Apache Airflow using pip $ pip3 install apache-airflow Output:
Step 4: Backend initialization to maintain workflow $ airflow initdb Step 5: Run the below command to start the web server or Apache user interface $ airflow webserver -p 8080 Step 6: Airflow scheduler to monitor workflow $ airflow scheduler |
Reffered: https://www.geeksforgeeks.org
Technical Scripter |
Type: | Geek |
Category: | Coding |
Sub Category: | Tutorial |
Uploaded by: | Admin |
Views: | 11 |