sunzxh 2019-12-12
前面使用了哨兵集群搭建单个分片高可用的结构,实现了主从复制和故障转移,但是没有实现分布式。接下来使用Redis集群搭建,实现哨兵集群没有实现的分布式高可用。
下面准备6个redis节点,搭建如图所示的三主三从分布式redis集群,实现两两互联,以master8001为例,它可以连接另外两个主节点,以及三个从节点。搭建过程可以使用原生redis命令,也可以使用ruby自带的脚本(需要先安装ruby)。
使用ruby脚本辅助安装,即使用redis根目录/src/redis-trib.rb脚本来完成,需要先安装ruby。
-rwxrwxr-x. 1 root root 60852 Sep 21 2017 redis-trib.rb
(1)安装ruby,这里解压了压缩包,进行编译和安装。但是安装完只是最基本的一步,后面还需要对ruby进行配置,比较麻烦。
# 解压 [ /home/software]# tar -zxvf ruby-2.3.1.tar.gz # 检查 [ /home/software/ruby-2.3.1]# . /configure # 编译,安装 [ /home/software/ruby-2.3.1]# make && make install # 省略具体安装日志,安装完成后使用ruby -v如果能查看到版本号,就安装ok [ /home/software/ruby-2.3.1]# ruby -v ruby 2.3.1p112 (2016-04-26 revision 54768) [x86_4-linux]
(2)先准备一个redis-cluster.conf启动加载文件,基本配置参考前面博文https://www.cnblogs.com/youngchaolin/p/11983705.html#_label0 ,主要是bind设置不绑定ip、protected mode设置为no、daemonize设置为yes、其他还有pid、logfile、rdb等,保持原设置不变。
(3)配置appendonly为yes,启动aof持久化方式,这是除了rdb外redis提供的第二种持久化方式。
# AOF and RDB persistence can be enabled at the same time without problems. # If the AOF is enabled on startup Redis will load the AOF, that is the file # with the better durability guarantees. # # Please check http://redis.io/topics/persistence for more information. # 配置第二种持久化方式aof appendonly yes 597 # The name of the append only file (default: "appendonly.aof") # 标识aof持久化文件名,以端口号区分 appendfilename "appendonly6379.aof"
其中rdb和aof两种持久化的主要区别为:
a.rdb保存的是具体的key-value数据,如name-messi,aof保存的是操作记录,如set name messi。
b.因为aof保存的数据更全,redis启动默认加载的是aof,rdb启动可以单独开启。
c.rdb持久化频率是按照持久化策略来的,容易造成数据丢失,而aof是每秒保存一次数据,数据不容易丢失。
d.如果对数据的可靠性要求高,使用aof持久化,如果需要一定的数据恢复能力,但是又不需要很高的可靠性,就选择rdb。
(4)开启集群模式,如果不开启无法计算槽道号。
# Normal Redis instances can‘t be part of a Redis Cluster; only nodes that are # started as cluster nodes can. In order to start a Redis instance as a # cluster node enable the cluster support uncommenting the following: # 需设置yes,否则无法计算槽道号,无法创建集群 cluster-enabled yes
(5)需要开启集群节点状态记录文件,这个文件会自动更新,每个redis节点都需要一个这样的文件。
# Every cluster node has a cluster configuration file. This file is not # intended to be edited by hand. It is created and updated by Redis nodes. # Every Redis Cluster node requires a different cluster configuration file. # Make sure that instances running in the same system do not have # overlapping cluster configuration file names. # 开启集群节点状态记录文件 cluster-config-file nodes-6379.conf
(6)将上面的配置文件保存,接下来复制到6个不同的文件夹下,其中文件夹的名字就是redis节点的端口号,根据上图准备8000、8001、8002、8003、8004和8005六个文件夹。复制完成后,修改对应端口号,如8000文件夹的配置文件,将里面全是6379的部分替换为8000,其他类似修改即可。
# 创建目录 [ /home/software/redis-3.2.11]# mkdir 8000 8001 8002 8003 8004 8005 # 复制文件 [ /home/software/redis-3.2.11]# cp redis.conf 8000/ [ /home/software/redis-3.2.11]# cp redis.conf 8001/ [ /home/software/redis-3.2.11]# cp redis.conf 8002/ [ /home/software/redis-3.2.11]# cp redis.conf 8003/ [ /home/software/redis-3.2.11]# cp redis.conf 8004/ [ /home/software/redis-3.2.11]# cp redis.conf 8005/ # 递归查看是否copy成功 [ /home/software/redis-3.2.11]# ls -R 800* 8000: redis.conf 8001: redis.conf 8002: redis.conf 8003: redis.conf 8004: redis.conf 8005: redis.conf# 修改端口略
(7)启动各个节点,发现均是cluster的方式启动,这是创建集群的基础,另外查看集群为fail,是因为槽道还没分配的原因,且每个节点cluster nodes查看集群节点信息都只能找到自己一个节点,两两互联还没有建立。
# 加载修改后的配置文件启动各个节点 [ /home/software/redis-3.2.11]# redis-server 8000/redis.conf [ /home/software/redis-3.2.11]# redis-server 8001/redis.conf [ /home/software/redis-3.2.11]# redis-server 8002/redis.conf [ /home/software/redis-3.2.11]# redis-server 8003/redis.conf [ /home/software/redis-3.2.11]# redis-server 8004/redis.conf [ /home/software/redis-3.2.11]# redis-server 8005/redis.conf # 查看是否以cluster方式启动,ok [ /home/software/redis-3.2.11]# ps -ef|grep redis root 16028 1 0 15:43 ? 00:00:00 redis-server *:8000 [cluster] root 16033 1 0 15:43 ? 00:00:00 redis-server *:8001 [cluster] root 16037 1 0 15:43 ? 00:00:00 redis-server *:8002 [cluster] root 16041 1 0 15:43 ? 00:00:00 redis-server *:8003 [cluster] root 16045 1 0 15:43 ? 00:00:00 redis-server *:8004 [cluster] root 16049 1 0 15:43 ? 00:00:00 redis-server *:8005 [cluster] root 16053 1409 0 15:43 pts/0 00:00:00 grep redis # 登录一个节点,查看集群信息 [ /home/software/redis-3.2.11]# redis-cli -c -p 8000 127.0.0.1:8000> cluster info cluster_state:fail # 只要有一个槽道没分配,就是fail cluster_slots_assigned:0 cluster_slots_ok:0 cluster_slots_pfail:0 cluster_slots_fail:0 cluster_known_nodes:1 # 只能发现自己一个节点,两两互联暂未建立 cluster_size:0 cluster_current_epoch:0 # 集群的纪元 cluster_my_epoch:0 cluster_stats_messages_sent:0 # 两两互联发送数据量 cluster_stats_messages_received:0 # 两两互联接受数据量 # 查看集群节点,发现只有自己一个 127.0.0.1:8000> cluster nodes dc0e2e16888426089de31466ff398679eec81b86 :8000 myself,master - 0 0 0 connected
(8)启动src/redis-trib.rb命令后想查看帮忙信息发现报错。这是正常的,这意味着踏入了一个大坑,接下来的需要填坑了,参考文末博文,完成了报错解决。
# 报错 /usr/local/lib/ruby/2.3.0/rubygems/core_ext/kernel_require.rb:55:in `require‘: cannot load such file -- redis (LoadError) from /usr/local/lib/ruby/2.3.0/rubygems/core_ext/kernel_require.rb:55:in `require‘ from src/redis-trib.rb:25:in `<main>‘
(9)上面报错是需要redis的gem包来安装ruby和redis的接口,先安装gem包,网址https://rubygems.org/gems/redis/,选择对应的版本下载,我这里是选择3.2.1版本,下载好后上传到了redis根目录/src目录下了,具体位置放哪不清楚,放src下是可以验证通过的。
-rw-r--r--. 1 root root 73728 Dec 12 16:51 redis-3.2.1.gem
(10)准备使用gem install redis命令安装接口,发现报错,需要安装zlib,网址http://www.zlib.net,本次选择了1.2.11版本。
ERROR: Loading command: install (LoadError) cannot load such file -- zlib ERROR: While executing gem ... (NoMethodError) undefined method `invoke_with_build_args‘ for nil:NilClass
(11)解压zlib压缩包后,进入解压目录,先指定安装目录,后编译安装zlib。
# 指定安装目录/usr/local/zlib [ /home/software/zlib-1.2.11]# ./configure --prefix=/usr/local/zlib Checking for gcc... Checking for shared library support... Building shared library libz.so.1.2.11 with gcc. Checking for size_t... Yes. Checking for off64_t... Yes. Checking for fseeko... Yes. Checking for strerror... Yes. Checking for unistd.h... Yes. Checking for stdarg.h... Yes. Checking whether to use vs[n]printf() or s[n]printf()... using vs[n]printf(). Checking for vsnprintf() in stdio.h... Yes. Checking for return value of vsnprintf()... Yes. Checking for attribute(visibility) support... Yes. You have new mail in /var/spool/mail/root # 编译后安装 [ /home/software/zlib-1.2.11]# make && make install ...省略 chmod 644 /usr/local/zlib/include/zlib.h /usr/local/zlib/include/zconf.h
(12)进入ruby根目录/ext/zlib,先使用ruby extconf.rb命令,执行完后会在当前目录下生成Makefile文件。
[ /home/software/ruby-2.3.1/ext/zlib]# ll total 892 -rw-r--r--. 1 root root 146 Apr 16 2013 depend -rw-r--r--. 1 root root 1447 Dec 16 2015 extconf.rb# 使用ruby extconf.rb命令后会生成一个Makefile文件 -rw-r--r--. 1 root root 7656 Dec 12 17:14 Makefile -rw-r--r--. 1 root root 7090 Dec 12 17:14 mkmf.log -rw-r--r--. 1 root root 124168 Dec 19 2015 zlib.c -rw-r--r--. 1 root root 418576 Dec 12 17:18 zlib.o -rwxr-xr-x. 1 root root 336032 Dec 12 17:18 zlib.so# 安装时需添加配置信息,指定zlib目录下的include和lib,zlib目录就是前面编译指定安装的目录 [ /home/software/ruby-2.3.1/ext/zlib]# ruby extconf.rb --with-zlib-include=/usr/local/zlib/include/ --with-zlib-lib=/usr/local/zlib/lib # 省略 [ /home/software/ruby-2.3.1/ext/zlib]# make && make install # 省略
接下来编译,本次编译顺利通过了,如果没有通过出现如下报错,还需要修改Makefile内容。
make: *** No rule to make target `/include/ruby.h‘, needed by `zlib.o‘. Stop
修改Makefile内容,将$(top_srcdir)整个换成绝对路径,我这里没有报错,因此没有修改。
zlib.o: $(top_srcdir)/include/ruby.h 改成:zlib.o: ../../include/ruby.h
最后编译通过会提示如下内容。
[ /home/software/ruby-2.3.1/ext/zlib]# make && make install compiling zlib.c linking shared-object zlib.so /usr/bin/install -c -m 0755 zlib.so /usr/local/lib/ruby/site_ruby/2.4.0/x86_64-linux
(13)继续安装openssl,如果不安装切换到redis目录使用gem install redis会报‘‘unable to require openssl...‘‘的报错,因此需要继续像安装zlib一样安装openssl。
网址https://www.openssl.org/source/,解压后进入安装目录,执行以下命令,编译安装执行时间本机比较长。
[ /home/software/openssl-1.0.2t]# ./config -fPIC --prefix=/usr/local/openssl enable-shared [ /home/software/openssl-1.0.2t]# ./config -t # 这里会编译很久 [ /home/software/openssl-1.0.2t]# $ make && make install
(14)进入ruby根目录/ext/openssl下,执行ruby extconf.rb命令,也会在当前目录下生成Makefile文件,虽然编译和安装,光荣的报错了。
[ /home/software/ruby-2.3.1/ext/openssl]# ruby extconf.rb --with-openssl-include=/usr/local/openssl/include/ --with-openssl-lib=/usr/local/openssl/lib [ /home/software/ruby-2.3.1/ext/openssl]# make && make install
(15)类似上面安装zlib,也将Makefile文件的\$(top_srcdir)全部改成"../..",这里在使用了":%s/\${top_srcdir}/..\/../g"进行了全局替换。再次编译安装就通过了。
(16)到这里,就差不多可以了,切换目录到redis根目录,然后执行gem install redis,再次报错。没辙了,网上找了一种土办法,先让安装成功。
[ /home/software/redis-3.2.11]# gem install redis ERROR: Could not find a valid gem ‘redis‘ (>= 0), here is why: Unable to download data from https://rubygems.org/ - SSL_connect returned=1 errno=0 state=error: certificate verify failed (https://api.rubygems.org/specs.4.8.gz)
临时办法,大概意思让https连接改成http连接。
[ /home/software/redis-3.2.11]# gem sources -r https://rubygems.org source https://rubygems.org not present in cache [ /home/software/redis-3.2.11]# gem sources -a http://rubygems.org https://rubygems.org is recommended for security over http://rubygems.org Do you want to add this insecure source? [yn] y http://rubygems.org added to sources You have new mail in /var/spool/mail/root
继续执行命令,终于ok了,可谓一波三折,这个方法只能作为参考,换一个redis或ruby又不知道会发生什么报错。
# 继续 [ /home/software/redis-3.2.11/src]# gem install redis Successfully installed redis-3.2.1 Parsing documentation for redis-3.2.1 Installing ri documentation for redis-3.2.1 Done installing documentation for redis after 0 seconds #成功了 WARNING: Unable to pull data from ‘https://rubygems.org/‘: SSL_connect returned=1 errno=0 state=error: certificate verify failed (https://api.rubygems.org/specs.4.8.gz) 1 gem installed You have new mail in /var/spool/mail/root # 试运行redis-trib.rb脚本,终于ok了 [ /home/software/redis-3.2.11]# src/redis-trib.rb Usage: redis-trib <command> <options> <arguments ...> create host1:port1 ... hostN:portN --replicas <arg> check host:port info host:port fix host:port --timeout <arg> reshard host:port --from <arg> --to <arg> --slots <arg> --yes --timeout <arg> --pipeline <arg> rebalance host:port --weight <arg> --auto-weights --use-empty-masters --timeout <arg> ...省略
(17)前面准备好了后,就可以使用ruby脚本来开始构建集群了。上面启动后的redis节点都是各自为政互不联系,接下来先将8000、8001、8002作为主节点连起来,并分配槽道号,本次使用默认分配。
# 使用create [ /home/software/redis-3.2.11]# src/redis-trib.rb create 192.168.200.140:8000 192.168.200.140:8001 192.168.200.140:8002 >>> Creating cluster >>> Performing hash slots allocation on 3 nodes... Using 3 masters: 192.168.200.140:8000 192.168.200.140:8001 192.168.200.140:8002 M: dc0e2e16888426089de31466ff398679eec81b86 192.168.200.140:8000 slots:0-5460 (5461 slots) master M: aa79ace502e5369236b62ed61c0eb43733ddcbde 192.168.200.140:8001 slots:5461-10922 (5462 slots) master M: 719d1dd412faf6b4e1eb348c65c329a262e393e1 192.168.200.140:8002 slots:10923-16383 (5461 slots) master Can I set the above configuration? (type ‘yes‘ to accept): yes >>> Nodes configuration updated >>> Assign a different config epoch to each node >>> Sending CLUSTER MEET messages to join the cluster Waiting for the cluster to join >>> Performing Cluster Check (using node 192.168.200.140:8000) M: dc0e2e16888426089de31466ff398679eec81b86 192.168.200.140:8000 slots:0-5460 (5461 slots) master 0 additional replica(s) M: aa79ace502e5369236b62ed61c0eb43733ddcbde 192.168.200.140:8001 slots:5461-10922 (5462 slots) master 0 additional replica(s) M: 719d1dd412faf6b4e1eb348c65c329a262e393e1 192.168.200.140:8002 slots:10923-16383 (5461 slots) master 0 additional replica(s) [OK] All nodes agree about slots configuration. >>> Check for open slots... >>> Check slots coverage... [OK] All 16384 slots covered. You have new mail in /var/spool/mail/root [ /home/software/redis-3.2.11]# redis-cli -c -p 8000 # 查看发现集群状态已变成ok,说明槽道没有一个没分配 127.0.0.1:8000> cluster into (error) ERR Wrong CLUSTER subcommand or number of arguments 127.0.0.1:8000> cluster info cluster_state:ok cluster_slots_assigned:16384 cluster_slots_ok:16384 cluster_slots_pfail:0 cluster_slots_fail:0 cluster_known_nodes:3 cluster_size:3 cluster_current_epoch:3 cluster_my_epoch:1 cluster_stats_messages_sent:68 cluster_stats_messages_received:68 # 三个主节点都分配了槽道 127.0.0.1:8000> cluster nodes# 节点信息# 第一个为节点id,用40位的十六进制表示# 第二个为节点ip+端口# 第三个为角色# 第四个为主节点id,没有主节点就用‘-‘表示# 第五个是与操作相关的时间戳# 第六个代表序号# 第七个代表连接状态# 第八个代表槽道号区间 aa79ace502e5369236b62ed61c0eb43733ddcbde 192.168.200.140:8001 master - 0 1576145625541 2 connected 5461-10922 719d1dd412faf6b4e1eb348c65c329a262e393e1 192.168.200.140:8002 master - 0 1576145623522 3 connected 10923-16383 dc0e2e16888426089de31466ff398679eec81b86 192.168.200.140:8000 myself,master - 0 0 1 connected 0-5460
(18)将8003挂接到8000,8004挂接到8001,8005挂接到8002成为从节点,使用add node命令,这里需要用到两个选项,一个是--salve,代表新加的节点是从,一个是--master-id,指定需要添加的主是谁,最后分别添加slaveHost:slavePort existHost:existPort,后面的节点可以是集群中任意一个加入集群的节点。
# 以添加8003为例 ,注意主节点可以是任意一个主节点,不一定是当前从的主节点[ /home/software/redis-3.2.11]# src/redis-trib.rb add-node --slave --master-id dc0e2e16888426089de31466ff398679eec81b86 192.168.200.140:8003 192.168.200.140:8000 >>> Adding node 192.168.200.140:8003 to cluster 192.168.200.140:8000 >>> Performing Cluster Check (using node 192.168.200.140:8000) M: dc0e2e16888426089de31466ff398679eec81b86 192.168.200.140:8000 slots:0-5460 (5461 slots) master 0 additional replica(s) M: aa79ace502e5369236b62ed61c0eb43733ddcbde 192.168.200.140:8001 slots:5461-10922 (5462 slots) master 0 additional replica(s) M: 719d1dd412faf6b4e1eb348c65c329a262e393e1 192.168.200.140:8002 slots:10923-16383 (5461 slots) master 0 additional replica(s) [OK] All nodes agree about slots configuration. >>> Check for open slots... >>> Check slots coverage... [OK] All 16384 slots covered.# 底层还是调用redis原生命令cluster meet >>> Send CLUSTER MEET to node 192.168.200.140:8003 to make it join the cluster. Waiting for the cluster to join. >>> Configure node as replica of 192.168.200.140:8000. [OK] New node added correctly. You have new mail in /var/spool/mail/root
登录集群客户端查看集群信息,发现搭建成功,实现了三主三从redis集群的搭建。
127.0.0.1:8000> cluster nodes aa79ace502e5369236b62ed61c0eb43733ddcbde 192.168.200.140:8001 master - 0 1576152242486 2 connected 5461-10922 dc0e2e16888426089de31466ff398679eec81b86 192.168.200.140:8000 myself,master - 0 0 1 connected 0-5460 824a999176c12eb6ebd944b75944fd97c65b579d 192.168.200.140:8005 slave 719d1dd412faf6b4e1eb348c65c329a262e393e1 0 1576152244002 3 connected 312accb9be27c48c107a19ab501a75a8c5d321c6 192.168.200.140:8004 slave aa79ace502e5369236b62ed61c0eb43733ddcbde 0 1576152239461 2 connected ada1903bcb5744900916151dd385936ceb6ce049 192.168.200.140:8003 slave dc0e2e16888426089de31466ff398679eec81b86 0 1576152243496 1 connected 719d1dd412faf6b4e1eb348c65c329a262e393e1 192.168.200.140:8002 master - 0 1576152244507 3 connected 10923-16383
可以看出,这个搭建比较繁琐, 如果解决了ruby的报错问题,只需要一个命令就可以完成三主三从redis集群的搭建,下面记录一下。
(1)停止redis所有进程。
[ /home/software/redis-3.2.11]# redis-cli -c -p 8000 shutdown [ /home/software/redis-3.2.11]# redis-cli -c -p 8001 shutdown [ /home/software/redis-3.2.11]# redis-cli -c -p 8002 shutdown [ /home/software/redis-3.2.11]# redis-cli -c -p 8003 shutdown [ /home/software/redis-3.2.11]# redis-cli -c -p 8004 shutdown [ /home/software/redis-3.2.11]# redis-cli -c -p 8005 shutdown [ /home/software/redis-3.2.11]# ps -ef|grep redis root 3657 18054 0 20:13 pts/2 00:00:00 grep redis
(2)删除rdb和aof持久化文件,删除集群节点状态记录文件。
[ /home/software/redis-3.2.11]# rm -rf appendonly800* [ /home/software/redis-3.2.11]# rm -rf dump800* [ /home/software/redis-3.2.11]# rm -rf nodes-800*
(3)再次启动6个redis节点。
# 启动 [ /home/software/redis-3.2.11]# redis-server 8000/redis.conf [ /home/software/redis-3.2.11]# redis-server 8001/redis.conf [ /home/software/redis-3.2.11]# redis-server 8002/redis.conf [ /home/software/redis-3.2.11]# redis-server 8003/redis.conf [ /home/software/redis-3.2.11]# redis-server 8004/redis.conf [ /home/software/redis-3.2.11]# redis-server 8005/redis.conf [ /home/software/redis-3.2.11]# redis-cli -c -p 8000 # 还没分配槽道,redis集群状态fail 127.0.0.1:8000> cluster info cluster_state:fail cluster_slots_assigned:0 cluster_slots_ok:0 cluster_slots_pfail:0 cluster_slots_fail:0 cluster_known_nodes:1 cluster_size:0 cluster_current_epoch:0 cluster_my_epoch:0 cluster_stats_messages_sent:0 cluster_stats_messages_received:0
(4)使用ruby脚本src/redis-trib.rb create --replicas 1 六个的节点ip:端口,这一条命令就可以自动搭建了三主三从。
# 一个命令 [ /home/software/redis-3.2.11]# src/redis-trib.rb create --replicas 1 192.168.200.140:8000 192.168.200.140:8001 192.168.200.140:8002 192.168.200.140:8003 192.168.200.140:8004 192.168.200.140:8005 >>> Creating cluster >>> Performing hash slots allocation on 6 nodes... Using 3 masters: 192.168.200.140:8000 192.168.200.140:8001 192.168.200.140:8002 Adding replica 192.168.200.140:8003 to 192.168.200.140:8000 Adding replica 192.168.200.140:8004 to 192.168.200.140:8001 Adding replica 192.168.200.140:8005 to 192.168.200.140:8002 M: 6537ac1cb42f209600ea20b1d6310e7b789257d4 192.168.200.140:8000 slots:0-5460 (5461 slots) master M: 5da79737cf493085724a689bdb5bd878c9121c07 192.168.200.140:8001 slots:5461-10922 (5462 slots) master M: 49f40898d2d18cc162b00e5f69593f0b91af5766 192.168.200.140:8002 slots:10923-16383 (5461 slots) master S: a92a3eb2cfff488e0af6ca07d9b612f2f3595c0a 192.168.200.140:8003 replicates 6537ac1cb42f209600ea20b1d6310e7b789257d4 S: ea41d1e7a41e5c5b19baa8e6cbac15b7d4d2bdb8 192.168.200.140:8004 replicates 5da79737cf493085724a689bdb5bd878c9121c07 S: bede8e5c11fe947cbaf0e435d822057a7b0fe414 192.168.200.140:8005 replicates 49f40898d2d18cc162b00e5f69593f0b91af5766 Can I set the above configuration? (type ‘yes‘ to accept): yes >>> Nodes configuration updated >>> Assign a different config epoch to each node >>> Sending CLUSTER MEET messages to join the cluster Waiting for the cluster to join... >>> Performing Cluster Check (using node 192.168.200.140:8000) M: 6537ac1cb42f209600ea20b1d6310e7b789257d4 192.168.200.140:8000 slots:0-5460 (5461 slots) master 1 additional replica(s) M: 49f40898d2d18cc162b00e5f69593f0b91af5766 192.168.200.140:8002 slots:10923-16383 (5461 slots) master 1 additional replica(s) S: bede8e5c11fe947cbaf0e435d822057a7b0fe414 192.168.200.140:8005 slots: (0 slots) slave replicates 49f40898d2d18cc162b00e5f69593f0b91af5766 M: 5da79737cf493085724a689bdb5bd878c9121c07 192.168.200.140:8001 slots:5461-10922 (5462 slots) master 1 additional replica(s) S: a92a3eb2cfff488e0af6ca07d9b612f2f3595c0a 192.168.200.140:8003 slots: (0 slots) slave replicates 6537ac1cb42f209600ea20b1d6310e7b789257d4 S: ea41d1e7a41e5c5b19baa8e6cbac15b7d4d2bdb8 192.168.200.140:8004 slots: (0 slots) slave replicates 5da79737cf493085724a689bdb5bd878c9121c07 [OK] All nodes agree about slots configuration. >>> Check for open slots... >>> Check slots coverage... [OK] All 16384 slots covered. You have new mail in /var/spool/mail/root [ /home/software/redis-3.2.11]# redis-cli -c -p 8000 # 结果一样 127.0.0.1:8000> cluster nodes 49f40898d2d18cc162b00e5f69593f0b91af5766 192.168.200.140:8002 master - 0 1576153283083 3 connected 10923-16383 bede8e5c11fe947cbaf0e435d822057a7b0fe414 192.168.200.140:8005 slave 49f40898d2d18cc162b00e5f69593f0b91af5766 0 1576153285101 6 connected 5da79737cf493085724a689bdb5bd878c9121c07 192.168.200.140:8001 master - 0 1576153286109 2 connected 5461-10922 6537ac1cb42f209600ea20b1d6310e7b789257d4 192.168.200.140:8000 myself,master - 0 0 1 connected 0-5460 a92a3eb2cfff488e0af6ca07d9b612f2f3595c0a 192.168.200.140:8003 slave 6537ac1cb42f209600ea20b1d6310e7b789257d4 0 1576153283585 4 connected ea41d1e7a41e5c5b19baa8e6cbac15b7d4d2bdb8 192.168.200.140:8004 slave 5da79737cf493085724a689bdb5bd878c9121c07 0 1576153284091 5 connected
以上两种方法,都是依赖ruby脚本,其实它还是基于redis cluster的命令来搭建完成,使用这种方式搭建能更好的理解搭建中经历了什么。
(1)提前准备6个启动的节点,前提也需删除持久化文件,以及节点状态记录文件,跟上面一样。
(2)登录任何一个节点,和其他5个进行meet,这里使用8000的客户端来完成,meet完后任意一个节点都认为自己是主节点。
# 使用cluster meet命令192.168.200.140:8000> CLUSTER MEET 192.168.200.140 8001 OK 192.168.200.140:8000> CLUSTER MEET 192.168.200.140 8002 OK 192.168.200.140:8000> CLUSTER MEET 192.168.200.140 8003 OK 192.168.200.140:8000> CLUSTER MEET 192.168.200.140 8004 OK 192.168.200.140:8000> CLUSTER MEET 192.168.200.140 8005 OK# 登录任何一个节点查看,都能得到如下信息 192.168.200.140:8000> cluster nodes fd4c160edc74536d79ea29d239dca43275ec6b5a 192.168.200.140:8004 master - 0 1576155518818 0 connected ced952bc6a47c38756bbc48d85a5f601db620b79 192.168.200.140:8000 myself,master - 0 0 4 connected 2c52d95c3d6d4c396469a81edfc1493d984e0f2d 192.168.200.140:8005 master - 0 1576155517810 5 connected 231fe9df31dc1ccf7cca5ae2fb2313979cd6fa83 192.168.200.140:8001 master - 0 1576155514284 1 connected 7ce388bde879f686fc3c8491175397ca20405565 192.168.200.140:8002 master - 0 1576155515791 2 connected 2e0f23d703874db80373f28b1be8c13f9de4fe6b 192.168.200.140:8003 master - 0 1576155516800 3 connected
(3)给8000,8001和8002分配槽道号,让其作为主节点。分配槽道号的命令为cluster addslots 槽道号...。分配完成后不管登录哪个节点都能看到槽道号信息,
# 槽道号之间有多个需要用空格隔开 [ /home/software/redis-3.2.11]# redis-cli -c -p 8000 -h 192.168.200.140 192.168.200.140:8000> CLUSTER ADDSLOTS 0 1 2 3 4 5 OK# 可以看到8000的节点分配了0-5这6个槽道号 192.168.200.140:8000> cluster nodes fd4c160edc74536d79ea29d239dca43275ec6b5a 192.168.200.140:8004 master - 0 1576155898158 0 connected ced952bc6a47c38756bbc48d85a5f601db620b79 192.168.200.140:8000 myself,master - 0 0 4 connected 0-5 2c52d95c3d6d4c396469a81edfc1493d984e0f2d 192.168.200.140:8005 master - 0 1576155896137 5 connected 231fe9df31dc1ccf7cca5ae2fb2313979cd6fa83 192.168.200.140:8001 master - 0 1576155894119 1 connected 7ce388bde879f686fc3c8491175397ca20405565 192.168.200.140:8002 master - 0 1576155897147 2 connected 2e0f23d703874db80373f28b1be8c13f9de4fe6b 192.168.200.140:8003 master - 0 1576155895128 3 connected
一个个写来分配会比较费力气,可以写一个shell脚本来批量分配槽道号,使用for循环来实现。
shell脚本
#分配槽道号的脚本 #!/bin/bash # 8000节点 for slot in {6..5460} do redis-cli -c -p 8000 -h 192.168.200.140 cluster addslots $slot done # 8001节点 for slot in {5461..10922} do redis-cli -c -p 8001 -h 192.168.200.140 cluster addslots $slot done # 8002节点 for slot in {10923..16383} do redis-cli -c -p 8002 -h 192.168.200.140 cluster addslots $slot done # 执行完打印出来结果 echo "分配槽道号结束"
运行脚本执行完结果,ok。
登录一个节点查看集群槽道信息,发现跟分配的预想一样。
[ /home/software/redis-3.2.11]# redis-cli -c -p 8001 -h 192.168.200.140 192.168.200.140:8001> cluster nodes# 8001结果,与上面脚本一样结果 231fe9df31dc1ccf7cca5ae2fb2313979cd6fa83 192.168.200.140:8001 myself,master - 0 0 1 connected 5461-10922 fd4c160edc74536d79ea29d239dca43275ec6b5a 192.168.200.140:8004 master - 0 1576157229386 0 connected ced952bc6a47c38756bbc48d85a5f601db620b79 192.168.200.140:8000 master - 0 1576157228376 4 connected 0-5460 7ce388bde879f686fc3c8491175397ca20405565 192.168.200.140:8002 master - 0 1576157229386 2 connected 10923-16383 2e0f23d703874db80373f28b1be8c13f9de4fe6b 192.168.200.140:8003 master - 0 1576157227368 3 connected 2c52d95c3d6d4c396469a81edfc1493d984e0f2d 192.168.200.140:8005 master - 0 1576157230393 5 connected
(4)将8003、8004和8005角色转换为从节点,使用cluster replicate 主节点id 命令。最后查看集群信息,发现成功的分配了槽道号。
[ /home/software/redis-3.2.11]# redis-cli -c -p 8003 -h 192.168.200.140 # 成为8000的从节点 192.168.200.140:8003> cluster replicate ced952bc6a47c38756bbc48d85a5f601db620b79 OK 192.168.200.140:8003> quit [ /home/software/redis-3.2.11]# redis-cli -c -p 8004 -h 192.168.200.140 # 成为8001的从节点 192.168.200.140:8004> cluster replicate 231fe9df31dc1ccf7cca5ae2fb2313979cd6fa83 OK 192.168.200.140:8004> quit You have new mail in /var/spool/mail/root [ /home/software/redis-3.2.11]# redis-cli -c -p 8005 -h 192.168.200.140 # 成为8002的从节点 192.168.200.140:8005> cluster replicate 7ce388bde879f686fc3c8491175397ca20405565 OK # 结果ok 192.168.200.140:8005> cluster nodes 2c52d95c3d6d4c396469a81edfc1493d984e0f2d 192.168.200.140:8005 myself,slave 7ce388bde879f686fc3c8491175397ca20405565 0 0 5 connected 7ce388bde879f686fc3c8491175397ca20405565 192.168.200.140:8002 master - 0 1576157733223 2 connected 10923-16383 2e0f23d703874db80373f28b1be8c13f9de4fe6b 192.168.200.140:8003 slave ced952bc6a47c38756bbc48d85a5f601db620b79 0 1576157732216 4 connected ced952bc6a47c38756bbc48d85a5f601db620b79 192.168.200.140:8000 master - 0 1576157735238 4 connected 0-5460 231fe9df31dc1ccf7cca5ae2fb2313979cd6fa83 192.168.200.140:8001 master - 0 1576157736242 1 connected 5461-10922 fd4c160edc74536d79ea29d239dca43275ec6b5a 192.168.200.140:8004 slave 231fe9df31dc1ccf7cca5ae2fb2313979cd6fa83 0 1576157734230 1 connected
最后集群状态为ok,槽道全部分配完成,形成了三主三从的结构。
192.168.200.140:8005> cluster info cluster_state:ok cluster_slots_assigned:16384 cluster_slots_ok:16384 cluster_slots_pfail:0 cluster_slots_fail:0# 6个节点 cluster_known_nodes:6 # 3个主cluster_size:3 cluster_current_epoch:5 cluster_my_epoch:2 cluster_stats_messages_sent:5093 cluster_stats_messages_received:5093
现在在8000节点上保存数据,然后将8000的主节点宕机,8003从节点会顶替上来成为主节点,并获取到8000的槽道信息。
[ /home/software/redis-3.2.11]# redis-cli -c -p 8000 -h 192.168.200.140 192.168.200.140:8000> set age 28 OK 192.168.200.140:8000> get age "28" 192.168.200.140:8000> quit [ /home/software/redis-3.2.11]# redis-cli -c -p 8003 -h 192.168.200.140 192.168.200.140:8003> get age -> Redirected to slot [741] located at 192.168.200.140:8000 "28" 192.168.200.140:8000> shutdown not connected> quit You have new mail in /var/spool/mail/root [ /home/software/redis-3.2.11]# redis-cli -c -p 8003 -h 192.168.200.140 # 8000宕机后,8003暂时还是从 192.168.200.140:8003> cluster nodes fd4c160edc74536d79ea29d239dca43275ec6b5a 192.168.200.140:8004 slave 231fe9df31dc1ccf7cca5ae2fb2313979cd6fa83 0 1576158134892 1 connected 2e0f23d703874db80373f28b1be8c13f9de4fe6b 192.168.200.140:8003 myself,slave ced952bc6a47c38756bbc48d85a5f601db620b79 0 0 3 connected ced952bc6a47c38756bbc48d85a5f601db620b79 192.168.200.140:8000 master - 1576158120865 1576158119755 4 disconnected 0-5460 2c52d95c3d6d4c396469a81edfc1493d984e0f2d 192.168.200.140:8005 slave 7ce388bde879f686fc3c8491175397ca20405565 0 1576158132873 5 connected 231fe9df31dc1ccf7cca5ae2fb2313979cd6fa83 192.168.200.140:8001 master - 0 1576158133883 1 connected 5461-10922 7ce388bde879f686fc3c8491175397ca20405565 192.168.200.140:8002 master - 0 1576158131864 2 connected 10923-16383 # 过一会,8003变为主,并且获得槽道信息 192.168.200.140:8003> cluster nodes fd4c160edc74536d79ea29d239dca43275ec6b5a 192.168.200.140:8004 slave 231fe9df31dc1ccf7cca5ae2fb2313979cd6fa83 0 1576158149019 1 connected 2e0f23d703874db80373f28b1be8c13f9de4fe6b 192.168.200.140:8003 myself,master - 0 0 6 connected 0-5460 ced952bc6a47c38756bbc48d85a5f601db620b79 192.168.200.140:8000 master,fail - 1576158120865 1576158119755 4 disconnected 2c52d95c3d6d4c396469a81edfc1493d984e0f2d 192.168.200.140:8005 slave 7ce388bde879f686fc3c8491175397ca20405565 0 1576158148008 5 connected 231fe9df31dc1ccf7cca5ae2fb2313979cd6fa83 192.168.200.140:8001 master - 0 1576158146999 1 connected 5461-10922 7ce388bde879f686fc3c8491175397ca20405565 192.168.200.140:8002 master - 0 1576158150028 2 connected 10923-16383 # 数据也备份了 192.168.200.140:8003> get age "28"
再补充添加和删除节点的操作,关于删除节点,它只能删除从节点或没有槽道管理权的节点,由于8000已经没有了槽道管理权,将8000重启后,下面使用ruby脚本删除,命令为del-node ip:port 节点id。
# 确认8000开启[ /home/software/redis-3.2.11]# ps -ef|grep redis root 4170 1 0 20:51 ? 00:00:13 redis-server *:8001 [cluster] root 4174 1 0 20:51 ? 00:00:11 redis-server *:8002 [cluster] root 4178 1 0 20:51 ? 00:00:08 redis-server *:8003 [cluster] root 4182 1 0 20:51 ? 00:00:08 redis-server *:8004 [cluster] root 4186 1 0 20:51 ? 00:00:08 redis-server *:8005 [cluster] root 7231 1 0 22:09 ? 00:00:00 redis-server *:8000 [cluster] root 7298 18054 0 22:13 pts/2 00:00:00 grep redis# 删除8000 [ /home/software/redis-3.2.11]# src/redis-trib.rb del-node 192.168.200.140:8000 94f634b2d364d560d3f8fb6e88fa874cf2b493f6 >>> Removing node 94f634b2d364d560d3f8fb6e88fa874cf2b493f6 from cluster 192.168.200.140:8000 >>> Sending CLUSTER FORGET messages to the cluster...# 关闭节点 >>> SHUTDOWN the node. You have new mail in /var/spool/mail/root
发现删除节点后,还自动将节点关闭了,因为8000节点的状态记录文件nodes-8000.conf还记录着8000是连接状态,但是其他节点的这个文件已经没了8000的信息,为了集群数据的不混乱,需要将8000立即关闭。
既然删除了,尝试启动8000后添加到集群,需要使用add-node newHost:newPort existHost:existPort命令,类似添加从节点命令只是没有选项。发现启动8000后再添加进集群会报错。
# 确认开启8000 [ /home/software/redis-3.2.11]# ps -ef|grep redis root 4170 1 0 20:51 ? 00:00:14 redis-server *:8001 [cluster] root 4174 1 0 20:51 ? 00:00:11 redis-server *:8002 [cluster] root 4178 1 0 20:51 ? 00:00:08 redis-server *:8003 [cluster] root 4182 1 0 20:51 ? 00:00:08 redis-server *:8004 [cluster] root 4186 1 0 20:51 ? 00:00:08 redis-server *:8005 [cluster] root 7400 1 0 22:21 ? 00:00:00 redis-server *:8000 [cluster] root 7422 18054 0 22:22 pts/2 00:00:00 grep redis You have new mail in /var/spool/mail/root # 添加8000到集群 [ /home/software/redis-3.2.11]# src/redis-trib.rb add-node 192.168.200.140:8000 192.168.200.140:8001 >>> Adding node 192.168.200.140:8000 to cluster 192.168.200.140:8001 >>> Performing Cluster Check (using node 192.168.200.140:8001) M: 231fe9df31dc1ccf7cca5ae2fb2313979cd6fa83 192.168.200.140:8001 slots:5461-10922 (5462 slots) master 1 additional replica(s) S: fd4c160edc74536d79ea29d239dca43275ec6b5a 192.168.200.140:8004 slots: (0 slots) slave replicates 231fe9df31dc1ccf7cca5ae2fb2313979cd6fa83 M: 7ce388bde879f686fc3c8491175397ca20405565 192.168.200.140:8002 slots:10923-16383 (5461 slots) master 1 additional replica(s) M: 2e0f23d703874db80373f28b1be8c13f9de4fe6b 192.168.200.140:8003 slots:0-5460 (5461 slots) master 0 additional replica(s) S: 2c52d95c3d6d4c396469a81edfc1493d984e0f2d 192.168.200.140:8005 slots: (0 slots) slave replicates 7ce388bde879f686fc3c8491175397ca20405565 [OK] All nodes agree about slots configuration. >>> Check for open slots... >>> Check slots coverage... [OK] All 16384 slots covered. # 提示报错,提示8000可能有数据,或者8000里有其他集群的信息 [ERR] Node 192.168.200.140:8000 is not empty. Either the node already knows other nodes (check with CLUSTER NODES) or contains some key in database 0.
这里删除了8000的rdb和aof持久化文件,以及状态文件后,重启8000节点再次尝试添加进去集群发现成功,但是添加进去的节点为没有槽道管理权的主节点。
[ /home/software/redis-3.2.11]# src/redis-trib.rb add-node 192.168.200.140:8000 192.168.200.140:8001 >>> Adding node 192.168.200.140:8000 to cluster 192.168.200.140:8001 >>> Performing Cluster Check (using node 192.168.200.140:8001) M: 231fe9df31dc1ccf7cca5ae2fb2313979cd6fa83 192.168.200.140:8001 slots:5461-10922 (5462 slots) master 1 additional replica(s) S: fd4c160edc74536d79ea29d239dca43275ec6b5a 192.168.200.140:8004 slots: (0 slots) slave replicates 231fe9df31dc1ccf7cca5ae2fb2313979cd6fa83 M: 7ce388bde879f686fc3c8491175397ca20405565 192.168.200.140:8002 slots:10923-16383 (5461 slots) master 1 additional replica(s) M: 2e0f23d703874db80373f28b1be8c13f9de4fe6b 192.168.200.140:8003 slots:0-5460 (5461 slots) master 0 additional replica(s) S: 2c52d95c3d6d4c396469a81edfc1493d984e0f2d 192.168.200.140:8005 slots: (0 slots) slave replicates 7ce388bde879f686fc3c8491175397ca20405565 [OK] All nodes agree about slots configuration. >>> Check for open slots... >>> Check slots coverage... [OK] All 16384 slots covered. >>> Send CLUSTER MEET to node 192.168.200.140:8000 to make it join the cluster. [OK] New node added correctly.
8000的信息
fd4c160edc74536d79ea29d239dca43275ec6b5a 192.168.200.140:8004 slave 231fe9df31dc1ccf7cca5ae2fb2313979cd6fa83 0 1576161337438 1 connected # 没有槽道管理权的主节点 c41dbe9595ae83725d1322b032736fd198b26c49 192.168.200.140:8000 myself,master - 0 0 0 connected 2c52d95c3d6d4c396469a81edfc1493d984e0f2d 192.168.200.140:8005 slave 7ce388bde879f686fc3c8491175397ca20405565 0 1576161336432 2 connected 231fe9df31dc1ccf7cca5ae2fb2313979cd6fa83 192.168.200.140:8001 master - 0 1576161336432 1 connected 5461-10922 2e0f23d703874db80373f28b1be8c13f9de4fe6b 192.168.200.140:8003 master - 0 1576161339456 6 connected 0-5460 7ce388bde879f686fc3c8491175397ca20405565 192.168.200.140:8002 master - 0 1576161338448 2 connected 10923-16383
以上是redis-cluster集群搭建的基本知识,后面继续补充槽道知识。
参考博文:
(1)《Redis设计与实践》
(2)https://blog.csdn.net/qq_26710443/article/details/82724268 ruby安装
(3)https://www.jianshu.com/p/c38369097448
(4)https://www.cnblogs.com/xuliangxing/p/7146868.html ruby脚本不能执行解决,报错主要参考文
(5)https://blog.csdn.net/ck3207/article/details/90404952 openssl无法编译