This article presents information and scripts that can be used to get started with Cloudera using Docker. Please feel free to comment or suggest if I have missed one or more important points.
Following are the key points described later in this article:
To run Cloudera in a Docker container, the Docker machine needs the following configuration. Open Oracle VM VirtualBox Manager and stop the default machine. Then change the settings as shown below.
If this is not done, running “cloudera-manager --express” throws the following error:
docker pull cloudera/quickstart:latest
FROM cloudera/quickstart:latest
Save the file as cloudera.df, then use the following command to build the image:
docker build -t cloudera -f cloudera.df .
The image is tagged as cloudera.
tar xzf cloudera-quickstart-vm-*-docker.tar.gz
docker import - cloudera/quickstart:latest < cloudera-quickstart-vm-*-docker/*.tar
docker run --privileged=true -ti -d -p 8888:8888 -p 80:80 -p 7180:7180 --name cdh --hostname=quickstart.cloudera -v /c/Users:/mnt/Users cloudera /usr/bin/docker-quickstart
Note that the image is tagged as “cloudera”. You can also run the “docker images” command to find the tag of your Cloudera image and use it in place of “cloudera”. Also note that ports such as 7180 and 8888 are published from the container to the host.
Execute the following command to start the Cloudera service, assuming you started the container with the name “cdh”. The script below can be used to start the “cdh” Cloudera container.
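The image check described above can also be scripted. The following is a minimal sketch, assuming the image is tagged “cloudera”; the function name image_exists is hypothetical and not part of Docker.

```shell
#!/bin/sh
# image_exists NAME
# Succeeds if a local image matching the given repository[:tag] exists.
# `docker images -q` prints only image IDs, so the output is empty
# when nothing matches.
image_exists() {
    [ -n "$(docker images -q "$1")" ]
}
```

For example, “image_exists cloudera && echo found” prints “found” only when the image is present. Once the container is running, “docker port cdh” lists the ports published to the host.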
docker exec -ti cdh /home/cloudera/cloudera-manager --express
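The exec command above only works if the container is already running. A minimal sketch of a guard for this, assuming the container is named “cdh”; the function name container_running is hypothetical:

```shell
#!/bin/sh
# container_running NAME
# Succeeds if the named container exists and is currently running.
# `docker inspect -f '{{.State.Running}}'` prints "true" or "false";
# errors (for example, no such container) are discarded.
container_running() {
    [ "$(docker inspect -f '{{.State.Running}}' "$1" 2>/dev/null)" = "true" ]
}
```

A possible use is “container_running cdh && docker exec -ti cdh /home/cloudera/cloudera-manager --express”.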
With the above command, Cloudera starts as shown in the following diagram.
Open a browser and go to the following URL: http://192.168.99.100:7180/. This opens the login page for Cloudera Manager. Enter cloudera/cloudera as the login/password and you are all set!
The following Dockerfile (cloudera.df) and shell script (runCloudera.sh) can be used to build the image and run the Cloudera container.
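Cloudera Manager can take a while to start, so the page may not respond immediately. The following is a rough sketch of a polling helper, assuming curl is installed on the host; the function name wait_for_url and its default attempt count and delay are illustrative choices, not values from Cloudera.

```shell
#!/bin/sh
# wait_for_url URL [ATTEMPTS] [DELAY_SECONDS]
# Polls URL until curl gets a non-error response, or gives up
# after ATTEMPTS tries. Defaults are arbitrary illustrative values.
wait_for_url() {
    url=$1
    attempts=${2:-30}
    delay=${3:-10}
    i=0
    while [ "$i" -lt "$attempts" ]; do
        # -s: silent, -f: treat HTTP errors (>= 400) as failure,
        # -o /dev/null: discard the response body
        if curl -sf -o /dev/null "$url"; then
            return 0
        fi
        i=$((i + 1))
        sleep "$delay"
    done
    return 1
}
```

For example: “wait_for_url http://192.168.99.100:7180/ && echo "Cloudera Manager is up"”.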
FROM cloudera/quickstart:latest
#!/bin/sh
# Usage: ./runCloudera.sh <container-name>
if [ $# -eq 0 ]; then
    echo "This script expects a container name argument. Example: ./runCloudera.sh cdh"
    exit 100
fi
# Stop and remove any existing container with the same name
docker stop "$1"; docker rm "$1"
#
# Build the Cloudera image if it does not exist
#
cd_image="cloudera"
cd_df="cloudera.df"
# "docker images" prints a header line, so fewer than 2 lines means no image
if [ "$(docker images "$cd_image" | wc -l)" -lt 2 ]; then
    echo "Docker image $cd_image does not exist..."
    echo "Building docker image $cd_image"
    if [ -f "$cd_df" ]; then
        docker build -t "$cd_image" -f "$cd_df" .
    else
        echo "Can't find Dockerfile $cd_df in the current location"
        exit 200
    fi
fi
# Start the container in the background and publish the Hue (8888), HTTP (80) and Cloudera Manager (7180) ports
docker run --privileged=true -ti -d -p 8888:8888 -p 80:80 -p 7180:7180 --name "$1" --hostname=quickstart.cloudera -v /c/Users:/mnt/Users "$cd_image" /usr/bin/docker-quickstart
Open a Docker terminal, place both files in the same folder and execute a command such as “./runCloudera.sh cdh”. This builds the image and starts a container named “cdh”.