Thursday 15 February 2018

Apache NiFi - I

In this post, we will look at another tool, Apache NiFi, which is touted as supporting powerful and scalable directed graphs of data routing, transformation, and system mediation logic. More details on Apache NiFi can be found here. One of the outstanding features of this tool is its web interface, which can be used for design, control, feedback, and monitoring. ETL experts coming from a "small data" background who are familiar with tools like Informatica PowerCenter, Oracle Data Integrator Enterprise Edition, Microsoft SSIS, IBM InfoSphere DataStage, etc. will enjoy working with Apache NiFi. Apache NiFi is based on technology previously called "Niagara Files", hence the name NiFi, which is short for "Niagara Files".

The first post on this topic is dedicated to installation. This is in line with our strategy: we try to understand tools by starting from scratch and do not overwhelm readers with complex configurations on the first run. We will then add more concepts and features of the product in later posts. We will install Apache NiFi on Windows and use this environment for all the work in this post. One of the prerequisites for installing NiFi is Java. Make sure the JAVA_HOME and PATH environment variables are set correctly to point to the Java installation. You can confirm that Java is installed by running the command below:

java -version
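
If you would rather script this prerequisite check, the sketch below shows one possible way to do it in Python. The function name check_java_env is our own and is not part of NiFi or Java; the sketch only checks that JAVA_HOME points to an existing directory and that an executable named java can be found on PATH, which is a rough approximation of "set correctly".

```python
import os
import shutil


def check_java_env(env, which=shutil.which):
    """Return a list of problems found; an empty list means the basics look fine.

    check_java_env is a hypothetical helper written for this post.
    """
    problems = []
    java_home = env.get("JAVA_HOME", "")
    if not java_home:
        problems.append("JAVA_HOME is not set")
    elif not os.path.isdir(java_home):
        problems.append("JAVA_HOME does not point to a directory: " + java_home)
    # shutil.which searches the given PATH string for an executable named "java"
    if which("java", path=env.get("PATH", "")) is None:
        problems.append("java was not found on PATH")
    return problems


if __name__ == "__main__":
    # Check the environment of the current process and print what was found
    for message in check_java_env(os.environ) or ["JAVA_HOME and PATH look fine"]:
        print(message)
```

An empty result does not guarantee that the Java version is one NiFi supports, so running java -version as shown above is still worthwhile.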

The binaries for Apache NiFi can be downloaded here. We will work with release 1.5.0, the latest at the time of writing. On this page, click here to download the binaries. After downloading nifi-1.5.0-bin.zip, place it in a folder called nifi.

Right-click the file and click Extract All...

Change the destination directory to D:\ and click Extract.

The files are extracted to D:\nifi-1.5.0. Navigate to that folder to see the installed folders.
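
For readers who prefer to script the extraction steps above instead of using the Explorer dialog, here is a minimal sketch using Python's standard zipfile module. The function name extract_nifi is our own, and the archive location used in the comment (D:\nifi\nifi-1.5.0-bin.zip) is an assumption, since the post only says the zip sits in a folder called nifi.

```python
import zipfile
from pathlib import Path


def extract_nifi(archive, destination):
    """Unpack the archive into destination and return the top-level folders created.

    extract_nifi is a hypothetical helper written for this post.
    """
    destination = Path(destination)
    with zipfile.ZipFile(archive) as bundle:
        bundle.extractall(destination)
        # Zip entries use "/" as separator; keep the first component of each name
        top_level = sorted({name.split("/")[0] for name in bundle.namelist() if name})
    return [destination / name for name in top_level]


# Example call (assumed paths, adjust to where the zip was saved):
#   extract_nifi(r"D:\nifi\nifi-1.5.0-bin.zip", "D:\\")
```

The returned list makes it easy to confirm that the archive unpacked into a single versioned folder before moving on to the bin directory.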

On the command line, enter the commands below to run NiFi:

D:\>
D:\>cd D:\nifi-1.5.0\bin\
D:\nifi-1.5.0\bin>run-nifi

A series of output statements follows, ending with a line like "INFO [main] org.apache.nifi.bootstrap.Command Launched Apache NiFi with Process ID 9048".

This means that NiFi is up and running. You can verify this by running the status command from a second command window as follows:

D:\>
D:\>cd D:\nifi-1.5.0\bin\
D:\nifi-1.5.0\bin>status-nifi

A series of output statements follows, ending with a line like "[main] INFO org.apache.nifi.bootstrap.Command - Apache NiFi is currently running, listening to Bootstrap on port 49643, PID=9048".
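
If you capture this output in a script, the process ID and bootstrap port can be pulled out of the two log lines quoted above. The sketch below is one way to do that in Python; the patterns are written against the exact wording of those two sample lines and are an assumption about the format, so they may need adjusting for other NiFi versions. The function names are our own.

```python
import re

# Patterns based only on the two sample lines quoted in this post
LAUNCH_PATTERN = re.compile(r"Launched Apache NiFi with Process ID (\d+)")
STATUS_PATTERN = re.compile(
    r"Apache NiFi is currently running, listening to Bootstrap on port (\d+), PID=(\d+)"
)


def parse_launch_pid(line):
    """Return the process ID from a run-nifi launch line, or None if absent."""
    match = LAUNCH_PATTERN.search(line)
    return int(match.group(1)) if match else None


def parse_status(line):
    """Return bootstrap port and PID from a status-nifi line, or None if absent."""
    match = STATUS_PATTERN.search(line)
    if match is None:
        return None
    return {"bootstrap_port": int(match.group(1)), "pid": int(match.group(2))}


launch = "INFO [main] org.apache.nifi.bootstrap.Command Launched Apache NiFi with Process ID 9048"
status = ("[main] INFO org.apache.nifi.bootstrap.Command - Apache NiFi is currently running, "
          "listening to Bootstrap on port 49643, PID=9048")
print(parse_launch_pid(launch))  # 9048
print(parse_status(status))      # {'bootstrap_port': 49643, 'pid': 9048}
```

Comparing the PID from both commands is a quick way to confirm that the status output refers to the instance you just started.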

Lastly, navigate to http://localhost:8080/ in a browser to open the NiFi Designer window.

Click on nifi as a final step to see the application.

Alternatively, you can navigate to http://localhost:8080/nifi/ directly.
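
NiFi can take a little while to start, so the page may not respond immediately after run-nifi is launched. The sketch below shows one possible way to poll the URL above from Python until it answers with HTTP 200. The function names, the 30 attempts and the 2 second delay are our own assumptions rather than NiFi recommendations.

```python
import time
import urllib.error
import urllib.request


def http_status(url, timeout=5):
    """Return the HTTP status code for url, or None if the server is unreachable."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as error:
        # The server answered, but with an error status such as 404 or 503
        return error.code
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, DNS failure and similar
        return None


def wait_for_nifi(url="http://localhost:8080/nifi/", attempts=30, delay=2.0,
                  fetch=http_status, sleep=time.sleep):
    """Poll url until it returns 200; return the attempt number, or None on give-up."""
    for attempt in range(1, attempts + 1):
        if fetch(url) == 200:
            return attempt
        if attempt < attempts:
            sleep(delay)
    return None


# Example call: wait_for_nifi() returns the attempt number once the page is up
```

The fetch and sleep parameters exist only so the loop can be exercised without a running NiFi instance; in normal use the defaults are enough.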

To bring down the application, press Ctrl+C in the window where the application was started.

This completes the installation steps for Apache NiFi on Windows.