hadoop集群搭建步骤

实验介绍

下面将要在三台linux虚拟机上搭建hadoop集群。

知识点

linux基本命令

集群安装

完成实验需要以下相关知识

解压命令

tar -zxvf XX.tar.gz -C dist

vi编辑器的使用

vi + file 打开一个文件,要想了解更多请了解vi编辑器的使用

远程拷贝

scp -r srcfile user@hostName:distpath

实验前准备

准备三台linux虚拟机

配置ip和host 下面表格是本次实验的配置情况

iphost软件名192.168.1.111linux1java8、hadoop

192.168.1.112linux2java8,hadoop

192.168.1.113linux3java8,hadoop

配置免密登录,免密登录方案 linux1免密登录linux2和linux3

安装jdk8

准备hadoop2.7.7版本的安装包

下面开始进行实验。

hadoop集群搭建实验

上传hadoop安装文件到 /root/apps/srcclauster

进入主节点创建一个目录apps就作为安装目录

[root@linux1 ~]# mkdir /root/apps

复制代码

解压hadoop

[root@linux1 ~]#tar –zxvf /root/srcclauster/hadoop-2.7.7.tar.gz -C /root/apps

复制代码

配置hadoop

进入hadoop配置目录打开hadoop-env.sh文件 配置一下JAVA_HOME

[root@linux1 ~]#cd /root/srcclauster/hadoop-2.7.7/etc/hadoop

[root@linux1 hadoop]#

[root@linux1 hadoop]# vi hadoop-env.sh

复制代码

# Licensed to the Apache Software Foundation (ASF) under one

# or more contributor license agreements. See the NOTICE file

# distributed with this work for additional information

# regarding copyright ownership. The ASF licenses this file

# to you under the Apache License, Version 2.0 (the

# "License"); you may not use this file except in compliance

# with the License. You may obtain a copy of the License at

#

# http://www.apache.org/licenses/LICENSE-2.0

#

# Unless required by applicable law or agreed to in writing, software

# distributed under the License is distributed on an "AS IS" BASIS,

# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.

# See the License for the specific language governing permissions and

# limitations under the License.

# Set Hadoop-specific environment variables here.

# The only required environment variable is JAVA_HOME. All others are

# optional. When running a distributed configuration it is best to

# set JAVA_HOME in this file, so that it is correctly defined on

# remote nodes.

# The java implementation to use.

export JAVA_HOME=/root/appstest1/jdk1.8.0_101

# The jsvc implementation to use. Jsvc is required to run secure datanodes

# that bind to privileged ports to provide authentication of data transfer

# protocol. Jsvc is not required if SASL is configured for authentication of

# data transfer protocol using non-privileged ports.

#export JSVC_HOME=${JSVC_HOME}

export HADOOP_CONF_DIR=${HADOOP_CONF_DIR:-"/etc/hadoop"}

# Extra Java CLASSPATH elements. Automatically insert capacity-scheduler.

for f in $HADOOP_HOME/contrib/capacity-scheduler/*.jar; do

if [ "$HADOOP_CLASSPATH" ]; then

export HADOOP_CLASSPATH=$HADOOP_CLASSPATH:$f

else

export HADOOP_CLASSPATH=$f

fi

done

# The maximum amount of heap to use, in MB. Default is 1000.

#export HADOOP_HEAPSIZE=

#export HADOOP_NAMENODE_INIT_HEAPSIZE=""

# Extra Java runtime options. Empty by default.

export HADOOP_OPTS="$HADOOP_OPTS -Djava.net.preferIPv4Stack=true"

# Command specific options appended to HADOOP_OPTS when specified

export HADOOP_NAMENODE_OPTS="-Dhadoop.security.logger=${HADOOP_SECURITY_LOGGER:-INFO,RFAS} -Dhdfs.audit.logger=${HDFS_AUDIT_LOGGER:-INFO,NullAppender} $HADOOP_NAMENODE_OPTS"

export HADOOP_DATANODE_OPTS="-Dhadoop.security.logger=ERROR,RFAS $HADOOP_DATANODE_OPTS"

export HADOOP_SECONDARYNAMENODE_OPTS="-Dhadoop.security.logger=${HADOOP_SECURITY_LOGGER:-INFO,RFAS} -Dhdfs.audit.logger=${HDFS_AUDIT_LOGGER:-INFO,NullAppender} $HADOOP_SECONDARYNAMENODE_OPTS"

export HADOOP_NFS3_OPTS="$HADOOP_NFS3_OPTS"

export HADOOP_PORTMAP_OPTS="-Xmx512m $HADOOP_PORTMAP_OPTS"

# The following applies to multiple commands (fs, dfs, fsck, distcp etc)

export HADOOP_CLIENT_OPTS="-Xmx512m $HADOOP_CLIENT_OPTS"

#HADOOP_JAVA_PLATFORM_OPTS="-XX:-UsePerfData $HADOOP_JAVA_PLATFORM_OPTS"

# On secure datanodes, user to run the datanode as after dropping privileges.

# This **MUST** be uncommented to enable secure HDFS if using privileged ports

# to provide authentication of data transfer protocol. This **MUST NOT** be

# defined if SASL is configured for authentication of data transfer protocol

# using non-privileged ports.

export HADOOP_SECURE_DN_USER=${HADOOP_SECURE_DN_USER}

# Where log files are stored. $HADOOP_HOME/logs by default.

#export HADOOP_LOG_DIR=${HADOOP_LOG_DIR}/$USER

# Where log files are stored in the secure data environment.

export HADOOP_SECURE_DN_LOG_DIR=${HADOOP_LOG_DIR}/${HADOOP_HDFS_USER}

###

# HDFS Mover specific parameters

###

# Specify the JVM options to be used when starting the HDFS Mover.

# These options will be appended to the options specified as HADOOP_OPTS

# and therefore may override any similar flags set in HADOOP_OPTS

#

# export HADOOP_MOVER_OPTS=""

###

# Advanced Users Only!

###

# The directory where pid files are stored. /tmp by default.

# NOTE: this should be set to a directory that can only be written to by

# the user that will run the hadoop daemons. Otherwise there is the

# potential for a symlink attack.

export HADOOP_PID_DIR=${HADOOP_PID_DIR}

export HADOOP_SECURE_DN_PID_DIR=${HADOOP_PID_DIR}

# A string representing this instance of hadoop. $USER by default.

export HADOOP_IDENT_STRING=$USER

复制代码

打开core-site.xml文件配置一下主节点和工作目录

[root@linux1 hadoop]# vi core-site.xml

复制代码

fs.defaultFS

hdfs://linux1:9000

hadoop.tmp.dir

/root/appstest1/appdata

复制代码

打开mapred-site.xml配置MR运行方式

[root@linux1 hadoop]# vi mapred-site.xm

复制代码

mapreduce.framework.name

yarn

复制代码

打开yarn-site.xml文件配置yarn的主节点

[root@linux1 hadoop]# vi yarn-site.xml

复制代码

yarn.resourcemanager.hostname

linux1

yarn.nodemanager.aux-services

mapreduce_shuffle

复制代码

配置slaves

[root@linux1 hadoop]# vi slaves

复制代码

linux2

linux3

复制代码

格式化hdfs

[root@linux1 ~]#/root/hadoop-2.7.7/bin/hadoop namenode -format

复制代码

启动hadoop集群

进入linux1

[root@linux1 apps]# /root/apps/hadoop-2.7.7/sbin/start-dfs.sh

20/04/27 16:14:50 WARN util.NativeCodeLoader: Unable to load native-hadoop library for your platform... using builtin-java classes where applicable

Starting namenodes on [linux1]

linux1: starting namenode, logging to /root/apps/hadoop-2.7.7/logs/hadoop-root-namenode-linux1.out

linux3: datanode running as process 1618. Stop it first.

linux2: datanode running as process 1617. Stop it first.

Starting secondary namenodes [0.0.0.0]

0.0.0.0: starting secondarynamenode, logging to /root/apps/hadoop-2.7.7/logs/hadoop-root-secondarynamenode-linux1.out

20/04/27 16:15:08 WARN util.NativeCodeLoader: Unable to load native-hadoop library for your platform... using builtin-java classes where applicable

[root@linux1 apps]#

复制代码

测试是否启动成功

总结

配置核心4个文件 ,hadoop-env.sh配置JAVA_HOME,core-site.xml配置主节点,mapred-site.xm配置MR运行方式, yarn-site.xml配置yarn的主节点。

linux hadoop集群搭建,hadoop集群搭建相关推荐

  1. Linux搭建hadoop高性能集群(一)

    文章目录 关于此篇(题外话) Linux系统搭建hadoop高性能集群(一) 第一步 虚拟机的安装 第二步 创建一个虚拟机 第三步 虚拟机启动初始化 第四步 虚拟机克隆 第五步 Linux系统网络配置 ...

  2. lxd linux系统视频教程,Linux 下 LXD 容器搭建 Hadoop 集群

    配置要求 主机内存 4GB . 磁盘 100 GB 以上. HOST 机安装常用 Linux 发行版. Linux Container ( LXD ) 以主机 Ubuntu 16.04 为例. 安装 ...

  3. docker 安装oracle_阿里云使用Docker搭建Hadoop集群

    摘要 吐血整理,搭建了两遍,亲测可用!!! 我买的是阿里云2C4G的服务器,使用的是CentOS 7.7版本.在搭建过程中踩了不少坑,本篇文章希望对大家有用 CentOS 7.7安装Docker 查看 ...

  4. hadoop集群_Ambari搭建hadoop集群

    Ambari搭建hadoop集群 Ambari提供了对Hadoop更加方便快捷的管理功能,主要包含: 通过一步一步的安装向导简化了集群供应. 预先配置好关键的运维指标(metrics),可以直接查看H ...

  5. hadoop 查看节点主备装填_基于ZooKeeper搭建Hadoop高可用集群

    作者博客:https://blog.csdn.net/m0_37809146/ 一.高可用简介 Hadoop 高可用 (High Availability) 分为 HDFS 高可用和 YARN 高可用 ...

  6. CentOS7上搭建Hadoop集群(入门级)

    场景 Hadoop Apache Hadoop是一款支持数据密集型分布式应用并以Apache 2.0许可协议发布的开源软 件框架,支持在商品硬件构建的大型集群上运行应用程序.Hadoop是根据Goog ...

  7. 华为云 和 阿里云 跨服务器搭建Hadoop集群

    目录 华为云 和 阿里云 跨服务器搭建Hadoop集群 说明 期间遇到的问题 CentOS6 7 8更换阿里yum源 修改服务器名称 安装JDK 安装Hadoop 编写集群分发脚本 xsync scp ...

  8. Hadoop集群环境搭建之伪分布式搭建流程

    关防火墙 1.首先关闭window防火墙 service iptables status 查看Linux 防火墙状态 service iptables stop 关闭Linux防火墙 chkconfi ...

  9. hadoop搭建_阿里云搭建Hadoop集群

    1 前言 在讲述Hadoop集群搭建之前,首先要了解Hadoop和集群两个名词,Hadoop是由Apache基金会开发的分布式系统基础架构,简单理解就是大数据技术应用的基础.集群可以理解为多台装有ha ...

最新文章

  1. GPU高效通信算法-Ring Allreduce
  2. Mac OS X snow leopard 10.6 VMware安装
  3. datax底层原理_Datax 插件加载原理
  4. 树莓派与笔记本用远程桌面连接(Xrdp远程桌面服务)
  5. 线性规划中的单纯形法与内点法(原理、步骤以及matlab实现)(二)
  6. CMMI认证多少钱?
  7. 电子邮件--详解SMTP和POP3协议
  8. 【HDU 5145】 NPY and girls(组合+莫队)
  9. 为什么程序员喜欢用dark mode深色模式
  10. 安装Mathtype之后,打开word出现错误的解决方法
  11. android开发教程!看懂这些帮你轻松解决就业问题!知乎上转疯了!
  12. 10个H5页面制作工具,功能全面评测
  13. Python:nii格式的核磁共振图像(MRI)的读取、处理、显示操作(以缩放操作举例)
  14. Gstore官网学习三:快速入门
  15. 非完备信息的机器博弈-麻将篇
  16. 厉害了,可以不重启JVM就替换掉已经加载的类?
  17. 解决薪资倒挂的最佳方式!
  18. Android 百分比布局、权重、隐藏TitleBar、引入自定义控件
  19. 益智游戏网开发之路二(网址yzyx.info)
  20. Linux下使用云笔记及OneNote

热门文章

  1. Bzoj3261 最大异或和
  2. 【转载】使用Nexus搭建Maven服务器详细配置
  3. 海洋女神建新installshield交流群了,原来的老群都满了,请加新群哦,记得认真填写验证信息...
  4. PortSentry
  5. 剑指Offer:包含main函数的栈(借助辅助栈)
  6. linux内存释放和使用限制
  7. 【Python学习系列十九】基于scikit-learn库进行特征选择
  8. 机器学习知识点(二十)矩阵奇异值分解基础知识及Java实现
  9. Android异步加载
  10. 存储引擎——概述|| 各种存储引擎的特性(InnoDB||MyISAM||MEMORY||MERGE) ||存储引擎的选择