Download Hadoop 3.1 2 For Mac

Hadoop performs best on a cluster of multiple nodes, but it also runs fine on a single machine, even a Mac, which makes it convenient for development. Spark is a popular tool for processing data stored in Hadoop. This post walks through the steps to install Hadoop and Spark on a Mac.

Operating System: Mac OS X El Capitan 10.11.3
Hadoop Version 2.7.2
Spark 1.6.1

Pre-requisites

1. Install Java

Open a terminal window to check what Java version is installed.
$ java -version

If Java is not installed, go to https://java.com/en/download/ to download and install the latest JDK. If Java is installed, use the following command in a terminal window to find the Java home path:
$ /usr/libexec/java_home

Next, set the JAVA_HOME environment variable (note the plain double quotes; curly quotes will break the command):
$ echo export "JAVA_HOME=$(/usr/libexec/java_home)" >> ~/.bash_profile
$ source ~/.bash_profile
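
Before moving on, it can help to confirm that JAVA_HOME really points at a Java installation. The following is a minimal sketch; the helper name check_java_home is made up for this post and is not part of Java or Hadoop.

```shell
# Hypothetical helper: succeeds if the given directory looks like a Java home,
# i.e. it contains an executable bin/java.
check_java_home() {
  local dir="$1"
  [ -n "$dir" ] && [ -x "$dir/bin/java" ]
}

# Report on the current shell's JAVA_HOME without changing anything.
if check_java_home "${JAVA_HOME:-}"; then
  echo "JAVA_HOME looks valid: $JAVA_HOME"
else
  echo "JAVA_HOME is unset or does not contain bin/java"
fi
```

If the second message appears, open a new terminal window (or source ~/.bash_profile again) and re-check.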

2. Enable SSH as Hadoop requires it.

Go to System Preferences -> Sharing and check “Remote Login”.

Generate SSH Keys
$ ssh-keygen -t rsa -P ""
$ cat ~/.ssh/id_rsa.pub >> ~/.ssh/authorized_keys

Open a terminal window and make sure you can log in to localhost without being asked for a password:
$ ssh localhost
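
If ssh localhost still asks for a password, one common cause is that ~/.ssh or authorized_keys is writable by other users, which sshd typically refuses to trust. Here is a small sketch for tightening those permissions; tighten_ssh_perms is a hypothetical helper name, and it takes the directory as an argument so you can try it on a copy first.

```shell
# Hypothetical helper: restrict an SSH directory and its authorized_keys file
# to the owning user. Takes the directory as an argument (normally ~/.ssh).
tighten_ssh_perms() {
  local ssh_dir="$1"
  [ -d "$ssh_dir" ] || return 1
  chmod 700 "$ssh_dir"
  if [ -f "$ssh_dir/authorized_keys" ]; then
    chmod 600 "$ssh_dir/authorized_keys"
  fi
}

# Usage (uncomment to apply to your own account):
# tighten_ssh_perms "$HOME/.ssh"
# ssh -o BatchMode=yes localhost true && echo "passwordless SSH works"
```

BatchMode=yes makes ssh fail instead of prompting, so the last line only prints when key-based login actually works.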

Download Hadoop Distribution

Download the latest Hadoop distribution (2.7.2 at the time of writing):
http://www.apache.org/dyn/closer.cgi/hadoop/common/hadoop-2.7.2/hadoop-2.7.2.tar.gz

Create Hadoop Folder

Open a new terminal window, go to the download folder (let’s use “~/Downloads”), and find hadoop-2.7.2.tar.gz. If your browser already unzipped it to hadoop-2.7.2.tar, use that file name instead. Then extract it and move it to /usr/local/hadoop:

$ cd ~/Downloads
$ tar xzvf hadoop-2.7.2.tar.gz
$ mv hadoop-2.7.2 /usr/local/hadoop

Hadoop Configuration Files

Go to the directory where your hadoop distribution is installed.
$ cd /usr/local/hadoop

Then edit the following files.

$ vi etc/hadoop/hdfs-site.xml

<configuration>
  <property>
    <name>dfs.replication</name>
    <value>1</value>
  </property>
</configuration>

$ vi etc/hadoop/core-site.xml

<configuration>
  <property>
    <name>fs.defaultFS</name>
    <value>hdfs://localhost:9000</value>
  </property>
</configuration>

$ vi etc/hadoop/yarn-site.xml

<configuration>
  <property>
    <name>yarn.nodemanager.aux-services</name>
    <value>mapreduce_shuffle</value>
  </property>
</configuration>

$ vi etc/hadoop/mapred-site.xml

<configuration>
  <property>
    <name>mapreduce.framework.name</name>
    <value>yarn</value>
  </property>
</configuration>
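
A typo in any of these four files usually only shows up later, when a service refuses to start, so a quick check can save time. The sketch below assumes the property names from the standard Hadoop single-node setup (dfs.replication, fs.defaultFS, yarn.nodemanager.aux-services, mapreduce.framework.name). The helper names has_property and check_hadoop_conf are made up for this post, and the check is a plain text match rather than real XML parsing.

```shell
# Hypothetical helper: succeeds if FILE contains a <name> element for PROPERTY.
# This is a plain text match, not an XML parser.
has_property() {
  local file="$1" property="$2"
  grep -qF "<name>${property}</name>" "$file" 2>/dev/null
}

# Hypothetical helper: check the four edited files inside a config directory.
# The property names are assumed from the standard single-node setup.
check_hadoop_conf() {
  local conf_dir="$1" status=0 file property
  while read -r file property; do
    if has_property "$conf_dir/$file" "$property"; then
      echo "ok       $file: $property"
    else
      echo "MISSING  $file: $property"
      status=1
    fi
  done <<EOF
hdfs-site.xml dfs.replication
core-site.xml fs.defaultFS
yarn-site.xml yarn.nodemanager.aux-services
mapred-site.xml mapreduce.framework.name
EOF
  return "$status"
}

# Usage: check_hadoop_conf /usr/local/hadoop/etc/hadoop
```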

Start Hadoop Services

Format HDFS
$ cd /usr/local/hadoop
$ bin/hdfs namenode -format

Start HDFS
$ sbin/start-dfs.sh

Start YARN
$ sbin/start-yarn.sh
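
To confirm that everything came up, you can list the running Java processes with jps (it ships with the JDK). After both scripts, we expect NameNode, DataNode and SecondaryNameNode from HDFS, plus ResourceManager and NodeManager from YARN. The sketch below reports which of those are absent; missing_daemons is a hypothetical helper name.

```shell
# Hypothetical helper: reads `jps` output on stdin and prints the name of each
# expected daemon that is not listed. Prints nothing if all five are running.
missing_daemons() {
  local listing daemon
  listing="$(cat)"
  for daemon in NameNode DataNode SecondaryNameNode ResourceManager NodeManager; do
    # jps prints "<pid> <ClassName>"; compare the second field exactly so that
    # "NameNode" does not match "SecondaryNameNode".
    if ! printf '%s\n' "$listing" | awk -v d="$daemon" '$2 == d { found = 1 } END { exit !found }'; then
      echo "$daemon"
    fi
  done
}

# Usage: jps | missing_daemons
```

If a daemon is reported as missing, its log file under /usr/local/hadoop/logs is the first place to look.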

Validation

Check the HDFS root directory
$ bin/hdfs dfs -ls /

If you don’t want to type bin/ every time you run a Hadoop command, add the Hadoop bin folder to your PATH:

$ vi ~/.bash_profile
append this line to the end of the file: export PATH=$PATH:/usr/local/hadoop/bin
$ source ~/.bash_profile
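
If you would rather not edit the profile by hand, the same change can be scripted. The sketch below appends a line only when it is not already present, so running it twice does not duplicate the PATH entry. append_line_once is a hypothetical helper name, not a standard command.

```shell
# Hypothetical helper: append LINE to FILE only if an identical line is not
# already there, so re-running the setup does not duplicate PATH entries.
append_line_once() {
  local file="$1" line="$2"
  if ! grep -qxF -- "$line" "$file" 2>/dev/null; then
    printf '%s\n' "$line" >> "$file"
  fi
}

# Usage (single quotes keep $PATH unexpanded until the profile is sourced):
# append_line_once ~/.bash_profile 'export PATH=$PATH:/usr/local/hadoop/bin'
```

The same helper works for any other bin folder you want to add to the profile.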

Now add the following two folders in HDFS, which are needed for MapReduce jobs. This time, leave out the bin/ prefix.

$ hdfs dfs -mkdir /user
$ hdfs dfs -mkdir /user/{your username}
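
The two commands can also be collapsed into one that fills in the user name for you. This is a sketch, assuming your HDFS user name is the same as your Mac login name; hdfs_home_dir is a hypothetical helper.

```shell
# Hypothetical helper: print the HDFS home directory for a user
# (defaults to the current login name).
hdfs_home_dir() {
  printf '/user/%s\n' "${1:-$(whoami)}"
}

# Create it only when the hdfs command is on PATH; -p also creates /user
# and does not fail if the directory already exists.
if command -v hdfs >/dev/null 2>&1; then
  hdfs dfs -mkdir -p "$(hdfs_home_dir)"
fi
```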

You can also open a browser and access the Hadoop web UI at the following URL:
http://localhost:50070/

Next: Spark

Installing Spark is a little easier. You can download the latest Spark here:
http://spark.apache.org/downloads.html

Choosing the package type is a little tricky. We want the “Pre-built with user-provided Hadoop [can use with most Hadoop distributions]” type; the downloaded file is named spark-1.6.1-bin-without-hadoop.tgz

After Spark is downloaded, we need to untar it. Open a terminal window and do the following:

$ cd ~/Downloads
$ tar xzvf spark-1.6.1-bin-without-hadoop.tgz
$ mv spark-1.6.1-bin-without-hadoop /usr/local/spark

Add spark bin folder to PATH

$ vi ~/.bash_profile
append this line to the end of the file: export PATH=$PATH:/usr/local/spark/bin
$ source ~/.bash_profile
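
Because this Spark package does not bundle Hadoop, Spark has to be told where the Hadoop jars live. The usual way is to set SPARK_DIST_CLASSPATH to the output of hadoop classpath. Below is a minimal sketch; set_spark_dist_classpath is a hypothetical helper name, and it assumes the hadoop command is already on your PATH as set up earlier.

```shell
# Hypothetical helper: export SPARK_DIST_CLASSPATH from the output of
# "<hadoop command> classpath". The command name defaults to "hadoop".
set_spark_dist_classpath() {
  local hadoop_cmd="${1:-hadoop}"
  if ! command -v "$hadoop_cmd" >/dev/null 2>&1; then
    echo "cannot find $hadoop_cmd on PATH" >&2
    return 1
  fi
  SPARK_DIST_CLASSPATH="$("$hadoop_cmd" classpath)"
  export SPARK_DIST_CLASSPATH
}

# Usage: set_spark_dist_classpath && spark-shell
```

To make the setting permanent, the equivalent export line can go into /usr/local/spark/conf/spark-env.sh instead.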

What about Scala?

Spark is written in Scala, so even though we can use Java to write Spark code, we want to install Scala as well.

Download Scala from here: http://www.scala-lang.org/download/
Choose the first option to download the Scala binaries; the downloaded file is scala-2.11.8.tar

Untar Scala and move it to a dedicated folder

$ cd ~/Downloads
$ tar xzvf scala-2.11.8.tar
$ mv scala-2.11.8 /usr/local/scala

Add Scala bin folder to PATH

$ vi ~/.bash_profile
append this line to the end of the file: export PATH=$PATH:/usr/local/scala/bin
$ source ~/.bash_profile

Now you should be able to start the Spark shell for Scala:

$ spark-shell

That’s it! Happy coding!

Hadoop is released as source code tarballs with corresponding binary tarballs for convenience. The downloads are distributed via mirror sites and should be checked for tampering using GPG or SHA-512.

Version | Release date | Source download | Binary download | Release notes
2.10.1 | 2020 Sep 21 | source (checksum, signature) | binary (checksum, signature) | Announcement
3.1.4 | 2020 Aug 3 | source (checksum, signature) | binary (checksum, signature) | Announcement
3.3.0 | 2020 Jul 14 | source (checksum, signature) | binary (checksum, signature); binary-aarch64 (checksum, signature) | Announcement
3.2.1 | 2019 Sep 22 | source (checksum, signature) | binary (checksum, signature) | Announcement
2.9.2 | 2018 Nov 19 | source (checksum, signature) | binary (checksum, signature) | Announcement

To verify Hadoop releases using GPG:

  1. Download the release hadoop-X.Y.Z-src.tar.gz from a mirror site.
  2. Download the signature file hadoop-X.Y.Z-src.tar.gz.asc from Apache.
  3. Download the Hadoop KEYS file.
  4. gpg --import KEYS
  5. gpg --verify hadoop-X.Y.Z-src.tar.gz.asc

To perform a quick check using SHA-512:

  1. Download the release hadoop-X.Y.Z-src.tar.gz from a mirror site.
  2. Download the checksum hadoop-X.Y.Z-src.tar.gz.sha512 or hadoop-X.Y.Z-src.tar.gz.mds from Apache.
  3. shasum -a 512 hadoop-X.Y.Z-src.tar.gz

All previous releases of Hadoop are available from the Apache release archive site.

Many third parties distribute products that include Apache Hadoop and related tools. Some of these are listed on the Distributions wiki page.

License

The software is licensed under the Apache License 2.0.