My current company uses Alibaba's internal version of Dubbo, i.e., EDAS, built on Alibaba's own EDAS services. If you have used Dubbo before, you probably know ZooKeeper, which is widely used in big data and in RPC communication. Whether you have used ZooKeeper or not, this post introduces ZooKeeper and walks through cluster deployment. You really need to do this hands-on to understand it deeply.
(1) Introducing ZooKeeper
ZooKeeper originated in a research group at Yahoo! Research. At the time, the researchers noticed that many large systems inside Yahoo depended on some similar system for distributed coordination, but those systems tended to suffer from single points of failure. So Yahoo's developers set out to build a general-purpose distributed coordination framework with no single point of failure, letting application developers concentrate on business logic.
There is an anecdote behind the project's name. Early on, since many internal projects had been named after animals (the famous Pig project, for example), Yahoo's engineers wanted an animal name for this one too. Raghu Ramakrishnan, then chief scientist of the research lab, joked: "At this rate, this place will turn into a zoo!" Everyone immediately agreed it should be called the zookeeper: with all those animal-named distributed components, Yahoo's distributed systems already looked like one big zoo, and this project was precisely meant to coordinate that distributed environment. Thus the name ZooKeeper was born.
In the 1960s the mainframe was invented. With its enormous computing and I/O capacity, stability, and security, it became the mainstream worldwide. But mainframes had some fatal drawbacks: high cost, operational complexity, single points of failure, and in particular the very high cost of training mainframe specialists. As these problems mounted, they held back the mainframe's development. Meanwhile, as PC performance kept improving and networks spread, people turned to minicomputers and commodity PC servers to build distributed computing systems and cut costs. Distributed computing has been heating up ever since.
Official site: https://zookeeper.apache.org/
Download: https://www-eu.apache.org/dist/zookeeper/
Source code: https://github.com/apache/zookeeper
(2) Cluster Deployment
Clusters come in two flavors: fully distributed and pseudo-distributed.
Distributed: each instance runs on its own independent host, and all instances use the same port.
Pseudo-distributed: multiple instances run on one host, distinguished by port. Pseudo-distributed clusters are rare in real production environments.
Operationally, the pseudo-distributed cluster is actually the trickier one to set up.
Installing Vagrant on macOS: https://idig8.com/2018/07/29/Docker-zhongji-07/
Installing Vagrant on Windows: https://idig8.com/2018/07/29/docker-zhongji-08/
OS       IP address        Node role                 CPU   Memory   Hostname
CentOS7  192.168.69.100    pseudo-distributed        2     2G       zookeeper-virtua
CentOS7  192.168.69.101    distributed - Leader      2     2G       zookeeper-Leader
CentOS7  192.168.69.102    distributed - Follower 1  2     2G       zookeeper-Follower1
CentOS7  192.168.69.103    distributed - Follower 2  2     2G       zookeeper-Follower2
A small tip: marking command listings as src blocks gives them syntax highlighting. I had always ignored this and the plain text strained my eyes; with the colors it reads much better.
As usual I use Vagrant; since getting comfortable with it I have basically never created a virtual machine by hand.
(2.1.1) Basic setup
su                              # password: vagrant
cd ~
vi /etc/ssh/sshd_config
sudo systemctl restart sshd
vi /etc/resolv.conf             # set the nameserver to 8.8.8.8
service network restart
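The two vi steps above do not show the actual edits. As a rough sketch (the exact lines depend on how your box image is provisioned, so treat these as assumptions), the typical changes on a Vagrant CentOS 7 box are:

# /etc/ssh/sshd_config: allow password login over SSH (assumed edit)
PasswordAuthentication yes
PermitRootLogin yes

# /etc/resolv.conf: point DNS at Google's public resolver
nameserver 8.8.8.8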
(2.1.2) JDK installation
The script is in my source repo.
vi pro.sh
sh pro.sh
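The real pro.sh lives in the repo; as a hedged sketch of what a JDK install script for CentOS 7 typically contains (the package and path names below are assumptions; ZooKeeper 3.4.x just needs a Java runtime on the PATH):

#!/bin/sh
# Hypothetical JDK install sketch (the actual pro.sh is in the repo)
yum install -y java-1.8.0-openjdk java-1.8.0-openjdk-devel
# Make JAVA_HOME and java available in every shell
echo 'export JAVA_HOME=/usr/lib/jvm/java-1.8.0-openjdk' >> /etc/profile
echo 'export PATH=$JAVA_HOME/bin:$PATH' >> /etc/profile
source /etc/profile
java -version   # verify the install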
(2.1.3) Download ZooKeeper
A note on versions: the latest release at the time of writing is already 3.5.4, but I am sticking with 3.4.10.
wget https://www-eu.apache.org/dist/zookeeper/zookeeper-3.4.10/zookeeper-3.4.10.tar.gz
(2.1.4) Extract ZooKeeper
tar zxvf zookeeper-3.4.10.tar.gz
(2.1.5) Go into the conf directory and make three copies of the sample config
cd /root/zookeeper-3.4.10/conf
cp zoo_sample.cfg zoo1.cfg
cp zoo_sample.cfg zoo2.cfg
cp zoo_sample.cfg zoo3.cfg
(2.1.6) Edit the three files zoo1.cfg, zoo2.cfg, zoo3.cfg
(2.1.6.1) Edit zoo1.cfg
vi zoo1.cfg
dataDir=/apps/servers/data/d_1
dataLogDir=/apps/servers/logs/logs_1
clientPort=2181
# The number of milliseconds of each tick
tickTime=2000
# The number of ticks that the initial
# synchronization phase can take
initLimit=10
# The number of ticks that can pass between
# sending a request and getting an acknowledgement
syncLimit=5
# the directory where the snapshot is stored.
# do not use /tmp for storage, /tmp here is just
# example sakes.
dataDir=/apps/servers/data/d_1
dataLogDir=/apps/servers/logs/logs_1
# the port at which the clients will connect
clientPort=2181
# the maximum number of client connections.
# increase this if you need to handle more clients
#maxClientCnxns=60
#
# Be sure to read the maintenance section of the
# administrator guide before turning on autopurge.
#
# http://zookeeper.apache.org/doc/current/zookeeperAdmin.html#sc_maintenance
#
# The number of snapshots to retain in dataDir
#autopurge.snapRetainCount=3
# Purge task interval in hours
# Set to "0" to disable auto purge feature
#autopurge.purgeInterval=1
server.1=localhost:2187:2887
server.2=localhost:2188:2888
server.3=localhost:2189:2889
(2.1.6.2) Edit zoo2.cfg
vi zoo2.cfg
dataDir=/apps/servers/data/d_2
dataLogDir=/apps/servers/logs/logs_2
clientPort=2182
# The number of milliseconds of each tick
tickTime=2000
# The number of ticks that the initial
# synchronization phase can take
initLimit=10
# The number of ticks that can pass between
# sending a request and getting an acknowledgement
syncLimit=5
# the directory where the snapshot is stored.
# do not use /tmp for storage, /tmp here is just
# example sakes.
dataDir=/apps/servers/data/d_2
dataLogDir=/apps/servers/logs/logs_2
# the port at which the clients will connect
clientPort=2182
# the maximum number of client connections.
# increase this if you need to handle more clients
#maxClientCnxns=60
#
# Be sure to read the maintenance section of the
# administrator guide before turning on autopurge.
#
# http://zookeeper.apache.org/doc/current/zookeeperAdmin.html#sc_maintenance
#
# The number of snapshots to retain in dataDir
#autopurge.snapRetainCount=3
# Purge task interval in hours
# Set to "0" to disable auto purge feature
#autopurge.purgeInterval=1
server.1=localhost:2187:2887
server.2=localhost:2188:2888
server.3=localhost:2189:2889
(2.1.6.3) Edit zoo3.cfg
vi zoo3.cfg
dataDir=/apps/servers/data/d_3
dataLogDir=/apps/servers/logs/logs_3
clientPort=2183
# The number of milliseconds of each tick
tickTime=2000
# The number of ticks that the initial
# synchronization phase can take
initLimit=10
# The number of ticks that can pass between
# sending a request and getting an acknowledgement
syncLimit=5
# the directory where the snapshot is stored.
# do not use /tmp for storage, /tmp here is just
# example sakes.
dataDir=/apps/servers/data/d_3
dataLogDir=/apps/servers/logs/logs_3
# the port at which the clients will connect
clientPort=2183
# the maximum number of client connections.
# increase this if you need to handle more clients
#maxClientCnxns=60
#
# Be sure to read the maintenance section of the
# administrator guide before turning on autopurge.
#
# http://zookeeper.apache.org/doc/current/zookeeperAdmin.html#sc_maintenance
#
# The number of snapshots to retain in dataDir
#autopurge.snapRetainCount=3
# Purge task interval in hours
# Set to "0" to disable auto purge feature
#autopurge.purgeInterval=1
server.1=localhost:2187:2887
server.2=localhost:2188:2888
server.3=localhost:2189:2889
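Since the three files differ only in the instance number (dataDir, dataLogDir, clientPort), they could also be generated in one loop instead of edited by hand. A small sketch, assuming the paths used above:

# Generate zoo1.cfg..zoo3.cfg from the sample config (sketch)
cd /root/zookeeper-3.4.10/conf
for i in 1 2 3; do
  cp zoo_sample.cfg zoo$i.cfg
  sed -i "s|^dataDir=.*|dataDir=/apps/servers/data/d_$i|" zoo$i.cfg
  sed -i "s|^clientPort=.*|clientPort=218$i|" zoo$i.cfg
  echo "dataLogDir=/apps/servers/logs/logs_$i" >> zoo$i.cfg
  printf "server.1=localhost:2187:2887\nserver.2=localhost:2188:2888\nserver.3=localhost:2189:2889\n" >> zoo$i.cfg
done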
(2.1.7) Create the data and log directories
mkdir -p /apps/servers/data/d_1
mkdir -p /apps/servers/data/d_2
mkdir -p /apps/servers/data/d_3
mkdir -p /apps/servers/logs/logs_1
mkdir -p /apps/servers/logs/logs_2
mkdir -p /apps/servers/logs/logs_3
echo "1" > /apps/servers/data/d_1/myid
echo "2" > /apps/servers/data/d_2/myid
echo "3" > /apps/servers/data/d_3/myid
(2.1.8) Go into the bin directory and start each instance
cd /root/zookeeper-3.4.10/bin
sh zkServer.sh start ../conf/zoo1.cfg
sh zkServer.sh start ../conf/zoo2.cfg
sh zkServer.sh start ../conf/zoo3.cfg
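Before connecting clients, it is worth checking which instance won the election. In 3.4.x, zkServer.sh should accept the config file as a second argument for status as well (if not, check zookeeper.out):

# One instance should report "Mode: leader", the other two "Mode: follower"
sh zkServer.sh status ../conf/zoo1.cfg
sh zkServer.sh status ../conf/zoo2.cfg
sh zkServer.sh status ../conf/zoo3.cfg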
(2.1.9) Connect to each instance and check the result
source /etc/profile
sh zkCli.sh -server localhost:2181
sh zkCli.sh -server localhost:2182
sh zkCli.sh -server localhost:2183
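A quick smoke test to confirm the three instances really behave as one cluster: write a znode through one and read it through another (/demo and its value are placeholder names):

# In the zkCli session connected to localhost:2181
create /demo hello
# Then in the session connected to localhost:2182
ls /
get /demo    # should return "hello": the write was replicated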
And that is all it takes to set up the pseudo-distributed cluster. The real focus is the fully distributed setup, which follows.
(2.2.1) Basic setup (needed on all three machines)
su                              # password: vagrant
cd ~
vi /etc/ssh/sshd_config
sudo systemctl restart sshd
vi /etc/resolv.conf             # set the nameserver to 8.8.8.8
service network restart
(2.2.2) JDK installation (needed on all three machines)
The script is in my source repo.
vi pro.sh
sh pro.sh
(2.2.3) Download ZooKeeper (needed on all three machines)
As above: the latest release is already 3.5.4, but I am still using 3.4.10.
Why three machines? ZooKeeper needs a majority (quorum) of servers alive to keep serving. With 3 servers the quorum is 2, so the ensemble tolerates 1 failure; with 4 servers the quorum is 3, which still only tolerates 1 failure. An even-sized ensemble adds cost without adding fault tolerance, which is why odd sizes are preferred.
wget https://www-eu.apache.org/dist/zookeeper/zookeeper-3.4.10/zookeeper-3.4.10.tar.gz
(2.2.4) Extract ZooKeeper
tar zxvf zookeeper-3.4.10.tar.gz
(2.2.5) Copy the cfg file (needed on all three machines)
cd ~
cd zookeeper-3.4.10/
cd conf
cp zoo_sample.cfg zoo.cfg
(2.2.6) Edit the cfg file. The configuration is identical on all three machines, so I will show it just once below.
vi zoo.cfg

# The number of milliseconds of each tick
tickTime=2000
# The number of ticks that the initial
# synchronization phase can take
initLimit=10
# The number of ticks that can pass between
# sending a request and getting an acknowledgement
syncLimit=5
# the directory where the snapshot is stored.
# do not use /tmp for storage, /tmp here is just
# example sakes.
dataDir=/tmp/zookeeper
dataLogDir=/tmp/zookeeper/logs
# the port at which the clients will connect
clientPort=2181
# the maximum number of client connections.
# increase this if you need to handle more clients
#maxClientCnxns=60
#
# Be sure to read the maintenance section of the
# administrator guide before turning on autopurge.
#
# http://zookeeper.apache.org/doc/current/zookeeperAdmin.html#sc_maintenance
#
# The number of snapshots to retain in dataDir
#autopurge.snapRetainCount=3
# Purge task interval in hours
# Set to "0" to disable auto purge feature
#autopurge.purgeInterval=1
server.0=192.168.69.101:2888:3888
server.1=192.168.69.102:2888:3888
server.2=192.168.69.103:2888:3888
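One pitfall worth hedging against: depending on how the CentOS 7 boxes are provisioned, firewalld may block the client and peer ports, and the three machines will not find each other. Opening the ports (or stopping firewalld for a quick test) looks like this:

# Run on each machine; alternatively: systemctl stop firewalld (test only)
firewall-cmd --permanent --add-port=2181/tcp   # client port
firewall-cmd --permanent --add-port=2888/tcp   # leader-follower port
firewall-cmd --permanent --add-port=3888/tcp   # election port
firewall-cmd --reload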
(2.2.7) Configure myid
The myid file must be created under the dataDir configured above, i.e. /tmp/zookeeper.
cd /
cd tmp
mkdir zookeeper
cd zookeeper
(2.2.7.1) Configure myid on 192.168.69.101
echo '0' > myid
cat myid
(2.2.7.2) Configure myid on 192.168.69.102
echo '1' > myid
cat myid
(2.2.7.3) Configure myid on 192.168.69.103
echo '2' > myid
cat myid
(2.2.8) Start ZooKeeper on all three virtual machines
cd ~/zookeeper-3.4.10/bin
sh zkServer.sh start
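With all three started, each machine can report its role; since zoo.cfg is the default config name here, status needs no argument:

# Run on each of the three machines
sh zkServer.sh status
# Expected: "Mode: leader" on one machine, "Mode: follower" on the other two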
(3) Concepts Review
Parameter        Meaning
tickTime         Basic time unit in milliseconds (2000 = one tick is 2s); heartbeats are sent on this interval
initLimit        Max time for followers to connect and sync with the leader at startup: initLimit * tickTime
syncLimit        Max delay for leader-follower communication (request to acknowledgement): syncLimit * tickTime
dataDir          Data (snapshot) directory
dataLogDir       Transaction log directory
clientPort       Port the server listens on for client connections
server.A=B:C:D   A: server id (matches myid); B: server IP; C: leader-follower communication port; D: leader election port
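Decoding one real line from the distributed config above against that last row:

# server.A=B:C:D, applied to a line from zoo.cfg
server.1=192.168.69.102:2888:3888
# A=1: the server whose myid file contains 1
# B=192.168.69.102: that server's IP address
# C=2888: port for follower <-> leader communication
# D=3888: port used during leader election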
Leader:
As the master node of the ZooKeeper ensemble, the leader handles every request that changes ZooKeeper state. It sequences and numbers each state-update request to guarantee FIFO processing of messages across the cluster; all writes go through the leader.
Follower:
The follower's logic is simpler. Besides serving the read requests sent to it, a follower processes the leader's proposals and commits them locally when the leader commits. Note also that the leader and followers together form the ZooKeeper cluster's quorum: only they take part in electing a new leader and in acknowledging the leader's proposals.
Observer:
If the cluster's read load is very high, or clients are spread across data centers, you can add observer servers to raise read throughput. Observers resemble followers, with a few differences: first, an observer is not part of the quorum, so it neither votes in elections nor acknowledges proposals; second, an observer does not persist transactions to disk, so once an observer is restarted it must resync the entire namespace from the leader.
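For completeness, a sketch of how an observer would be declared, following the ZooKeeper admin docs (the fourth machine below is hypothetical and not part of this walkthrough):

# In the observer machine's own zoo.cfg:
peerType=observer
# And in every server's zoo.cfg, tag the observer's line:
server.3=192.168.69.104:2888:3888:observer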
ZooKeeper is a service composed of multiple servers:
1. The cluster has one leader and multiple followers.
2. Every server keeps a replica of the data.
3. Global data consistency: reads can be served by any server (followers included), while writes are carried out by the leader; update requests are forwarded to the leader, applied strictly in order, and updates from the same client execute in the order they were sent.
4. Atomic updates: a data update either succeeds or fails as a whole.
5. Single system image: whichever server a client connects to, the data view is consistent; and timeliness: within a bounded time window, a client can read the latest data.
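One way to see the atomicity of item 4 in action is zkCli's conditional set, which takes an expected version number (/demo2 is a placeholder path):

# In any zkCli session
create /demo2 v1     # a new znode starts at dataVersion 0
set /demo2 v2 0      # succeeds: expected version 0 matches, version becomes 1
set /demo2 v3 0      # rejected as a whole: version is now 1, nothing is written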
PS: This post focused on ZooKeeper's principles and cluster deployment without going deep into the internals; next time I will talk about actually using ZooKeeper.