Galeon
13.11.13, 10:44
Guten Morgen liebes Forum,
ich hab mich jetzt zwei diverse Stunden mit folgendem Problem rumgeschlagen, komme aber keinen Schritt weiter:
Ich versuche einen Server in einen Cluster mit Corosync und Pacemaker einzubinden, scheitere jedoch beim Start von Corosync 1.4.2
Nov 13 09:58:22 corosync [MAIN ] Corosync Cluster Engine ('1.4.2'): started and ready to provide service.
Nov 13 09:58:22 corosync [MAIN ] Corosync built-in features: nss
Nov 13 09:58:22 corosync [MAIN ] Successfully read config from /etc/cluster/cluster.conf
Nov 13 09:58:22 corosync [MAIN ] Successfully parsed cman config
Nov 13 09:58:22 corosync [MAIN ] Successfully configured openais services to load
Nov 13 09:58:22 corosync [TOTEM ] Token Timeout (10000 ms) retransmit timeout (2380 ms)
Nov 13 09:58:22 corosync [TOTEM ] token hold (1894 ms) retransmits before loss (4 retrans)
Nov 13 09:58:22 corosync [TOTEM ] join (60 ms) send_join (0 ms) consensus (12000 ms) merge (200 ms)
Nov 13 09:58:22 corosync [TOTEM ] downcheck (1000 ms) fail to recv const (2500 msgs)
Nov 13 09:58:22 corosync [TOTEM ] seqno unchanged const (30 rotations) Maximum network MTU 1402
Nov 13 09:58:22 corosync [TOTEM ] window size per rotation (50 messages) maximum messages per rotation (17 messages)
Nov 13 09:58:22 corosync [TOTEM ] missed count const (5 messages)
Nov 13 09:58:22 corosync [TOTEM ] send threads (0 threads)
Nov 13 09:58:22 corosync [TOTEM ] RRP token expired timeout (2380 ms)
Nov 13 09:58:22 corosync [TOTEM ] RRP token problem counter (2000 ms)
Nov 13 09:58:22 corosync [TOTEM ] RRP threshold (10 problem count)
Nov 13 09:58:22 corosync [TOTEM ] RRP multicast threshold (100 problem count)
Nov 13 09:58:22 corosync [TOTEM ] RRP automatic recovery check timeout (1000 ms)
Nov 13 09:58:22 corosync [TOTEM ] RRP mode set to none.
Nov 13 09:58:22 corosync [TOTEM ] heartbeat_failures_allowed (0)
Nov 13 09:58:22 corosync [TOTEM ] max_network_delay (50 ms)
Nov 13 09:58:22 corosync [TOTEM ] HeartBeat is Disabled. To enable set heartbeat_failures_allowed > 0
Nov 13 09:58:22 corosync [TOTEM ] Initializing transport (UDP/IP Multicast).
Nov 13 09:58:22 corosync [TOTEM ] Initializing transmit/receive security: libtomcrypt SOBER128/SHA1HMAC (mode 0).
Nov 13 09:58:22 corosync [IPC ] you are using ipc api v2
Nov 13 09:58:22 corosync [TOTEM ] Unable to bind the socket to send multicast packets: Cannot assign requested address (99)
Nov 13 09:58:22 corosync [TOTEM ] Could not set traffic priority: Socket operation on non-socket (88)
Nov 13 09:58:22 corosync [TOTEM ] The network interface [10.8.0.0] is now up.
Das Interface existiert
tun0 Link encap:UNSPEC HWaddr 00-00-00-00-00-00-00-00-00-00-00-00-00-00-00-00
inet addr:10.8.0.11 P-t-P:10.8.0.0 Mask:255.255.255.255
UP POINTOPOINT RUNNING NOARP MULTICAST MTU:1500 Metric:1
RX packets:881906 errors:0 dropped:0 overruns:0 frame:0
TX packets:1659174 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:100
RX bytes:50026638 (50.0 MB) TX bytes:2294220609 (2.2 GB)
und auch lauscht nichts auf dem Port
sudo netstat -tulpen
Active Internet connections (only servers)
Proto Recv-Q Send-Q Local Address Foreign Address State User Inode PID/Program name
tcp 0 0 0.0.0.0:3306 0.0.0.0:* LISTEN 107 14222 2477/mysqld
tcp 0 0 127.0.0.1:5900 0.0.0.0:* LISTEN 104 12623 3000/kvm
tcp 0 0 127.0.0.1:5901 0.0.0.0:* LISTEN 104 14325 3026/kvm
tcp 0 0 127.0.0.1:5902 0.0.0.0:* LISTEN 104 15282 3051/kvm
tcp 0 0 10.8.0.11:9102 0.0.0.0:* LISTEN 0 11690 2513/bacula-fd
tcp 0 0 127.0.0.1:5903 0.0.0.0:* LISTEN 104 15291 3075/kvm
tcp 0 0 192.168.122.1:53 0.0.0.0:* LISTEN 0 13821 1926/dnsmasq
tcp 0 0 0.0.0.0:22 0.0.0.0:* LISTEN 0 11570 1167/sshd
tcp 0 0 0.0.0.0:4567 0.0.0.0:* LISTEN 107 14972 2477/mysqld
tcp6 0 0 :::22 :::* LISTEN 0 11572 1167/sshd
udp 0 0 0.0.0.0:57514 0.0.0.0:* 0 14774 1714/openvpn
udp 0 0 192.168.122.1:53 0.0.0.0:* 0 13820 1926/dnsmasq
udp 0 0 0.0.0.0:67 0.0.0.0:* 0 13811 1926/dnsmasq
udp 0 0 0.0.0.0:123 0.0.0.0:* 0 10945 2316/chronyd
udp 0 0 0.0.0.0:323 0.0.0.0:* 0 10947 2316/chronyd
udp6 0 0 :::123 :::* 0 10946 2316/chronyd
udp6 0 0 :::323 :::* 0 10948 2316/chronyd
Die Ports sind auch in der UFW freigegeben (deaktivieren selbiger bringt auch nichts)
sudo ufw status
Status: active
To Action From
-- ------ ----
22 ALLOW Anywhere
3306 ALLOW Anywhere
4444 ALLOW Anywhere
4567 ALLOW Anywhere
4568 ALLOW Anywhere
80 ALLOW Anywhere
9102 ALLOW Anywhere
443 ALLOW Anywhere
4949/tcp on tun0 ALLOW Anywhere
5405 ALLOW Anywhere
5404 ALLOW Anywhere
/etc/cluster/cluster.conf
<?xml version="1.0" ?>
<cluster config_version="2" name="pcmk_esp">
<logging debug="on"/>
<clusternodes>
<clusternode name="cluster_esp_prod" nodeid="1">
<fence>
<method name="pcmk-redirect">
<device name="pcmk" port="cluster_esp_prod"/>
</method>
</fence>
</clusternode>
<clusternode name="cluster_esp_monitor" nodeid="2">
<fence>
<method name="pcmk-redirect">
<device name="pcmk" port="cluster_esp_monitor"/>
</method>
</fence>
</clusternode>
<clusternode name="cluster_esp_fallback" nodeid="3">
<fence>
<method name="pcmk-redirect">
<device name="pcmk" port="cluster_esp_fallback"/>
</method>
</fence>
</clusternode>
</clusternodes>
<fencedevices>
<fencedevice agent="fence_pcmk" name="pcmk"/>
</fencedevices>
</cluster>
In der Hosts sind die Namen entsprechend hinterlegt
10.8.0.21 cluster_esp_prod
10.8.0.11 cluster_esp_fallback
10.8.0.16 cluster_esp_monitor
Und in /etc/corosync/corosync.conf ein entsprechendes Setup hinterlegt
...
interface {
member {
memberaddr: 10.8.0.21
}
member {
memberaddr: 10.8.0.11
}
member {
memberaddr: 10.8.0.16
}
# The following values need to be set based on your environment
ringnumber: 0
bindnetaddr: 10.8.0.0
mcastport: 5405
}
transport: udpu
...
Das selbe Setup läuft auf den beiden anderen Kisten wie erwartet, aber diese eine will einfach nicht und ich komme nicht hinter den Grund der Fehlermeldung.
Ich habe auch schon einen simplen Echoserver auf der selben 10.8.0.11 und 5405 problemfrei gestartet und getestet.
Momentan bin ich etwas Ratlos, hat einer von Euch eine schlaue Idee?
ich hab mich jetzt zwei diverse Stunden mit folgendem Problem rumgeschlagen, komme aber keinen Schritt weiter:
Ich versuche einen Server in einen Cluster mit Corosync und Pacemaker einzubinden, scheitere jedoch beim Start von Corosync 1.4.2
Nov 13 09:58:22 corosync [MAIN ] Corosync Cluster Engine ('1.4.2'): started and ready to provide service.
Nov 13 09:58:22 corosync [MAIN ] Corosync built-in features: nss
Nov 13 09:58:22 corosync [MAIN ] Successfully read config from /etc/cluster/cluster.conf
Nov 13 09:58:22 corosync [MAIN ] Successfully parsed cman config
Nov 13 09:58:22 corosync [MAIN ] Successfully configured openais services to load
Nov 13 09:58:22 corosync [TOTEM ] Token Timeout (10000 ms) retransmit timeout (2380 ms)
Nov 13 09:58:22 corosync [TOTEM ] token hold (1894 ms) retransmits before loss (4 retrans)
Nov 13 09:58:22 corosync [TOTEM ] join (60 ms) send_join (0 ms) consensus (12000 ms) merge (200 ms)
Nov 13 09:58:22 corosync [TOTEM ] downcheck (1000 ms) fail to recv const (2500 msgs)
Nov 13 09:58:22 corosync [TOTEM ] seqno unchanged const (30 rotations) Maximum network MTU 1402
Nov 13 09:58:22 corosync [TOTEM ] window size per rotation (50 messages) maximum messages per rotation (17 messages)
Nov 13 09:58:22 corosync [TOTEM ] missed count const (5 messages)
Nov 13 09:58:22 corosync [TOTEM ] send threads (0 threads)
Nov 13 09:58:22 corosync [TOTEM ] RRP token expired timeout (2380 ms)
Nov 13 09:58:22 corosync [TOTEM ] RRP token problem counter (2000 ms)
Nov 13 09:58:22 corosync [TOTEM ] RRP threshold (10 problem count)
Nov 13 09:58:22 corosync [TOTEM ] RRP multicast threshold (100 problem count)
Nov 13 09:58:22 corosync [TOTEM ] RRP automatic recovery check timeout (1000 ms)
Nov 13 09:58:22 corosync [TOTEM ] RRP mode set to none.
Nov 13 09:58:22 corosync [TOTEM ] heartbeat_failures_allowed (0)
Nov 13 09:58:22 corosync [TOTEM ] max_network_delay (50 ms)
Nov 13 09:58:22 corosync [TOTEM ] HeartBeat is Disabled. To enable set heartbeat_failures_allowed > 0
Nov 13 09:58:22 corosync [TOTEM ] Initializing transport (UDP/IP Multicast).
Nov 13 09:58:22 corosync [TOTEM ] Initializing transmit/receive security: libtomcrypt SOBER128/SHA1HMAC (mode 0).
Nov 13 09:58:22 corosync [IPC ] you are using ipc api v2
Nov 13 09:58:22 corosync [TOTEM ] Unable to bind the socket to send multicast packets: Cannot assign requested address (99)
Nov 13 09:58:22 corosync [TOTEM ] Could not set traffic priority: Socket operation on non-socket (88)
Nov 13 09:58:22 corosync [TOTEM ] The network interface [10.8.0.0] is now up.
Das Interface existiert
tun0 Link encap:UNSPEC HWaddr 00-00-00-00-00-00-00-00-00-00-00-00-00-00-00-00
inet addr:10.8.0.11 P-t-P:10.8.0.0 Mask:255.255.255.255
UP POINTOPOINT RUNNING NOARP MULTICAST MTU:1500 Metric:1
RX packets:881906 errors:0 dropped:0 overruns:0 frame:0
TX packets:1659174 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:100
RX bytes:50026638 (50.0 MB) TX bytes:2294220609 (2.2 GB)
und auch lauscht nichts auf dem Port
sudo netstat -tulpen
Active Internet connections (only servers)
Proto Recv-Q Send-Q Local Address Foreign Address State User Inode PID/Program name
tcp 0 0 0.0.0.0:3306 0.0.0.0:* LISTEN 107 14222 2477/mysqld
tcp 0 0 127.0.0.1:5900 0.0.0.0:* LISTEN 104 12623 3000/kvm
tcp 0 0 127.0.0.1:5901 0.0.0.0:* LISTEN 104 14325 3026/kvm
tcp 0 0 127.0.0.1:5902 0.0.0.0:* LISTEN 104 15282 3051/kvm
tcp 0 0 10.8.0.11:9102 0.0.0.0:* LISTEN 0 11690 2513/bacula-fd
tcp 0 0 127.0.0.1:5903 0.0.0.0:* LISTEN 104 15291 3075/kvm
tcp 0 0 192.168.122.1:53 0.0.0.0:* LISTEN 0 13821 1926/dnsmasq
tcp 0 0 0.0.0.0:22 0.0.0.0:* LISTEN 0 11570 1167/sshd
tcp 0 0 0.0.0.0:4567 0.0.0.0:* LISTEN 107 14972 2477/mysqld
tcp6 0 0 :::22 :::* LISTEN 0 11572 1167/sshd
udp 0 0 0.0.0.0:57514 0.0.0.0:* 0 14774 1714/openvpn
udp 0 0 192.168.122.1:53 0.0.0.0:* 0 13820 1926/dnsmasq
udp 0 0 0.0.0.0:67 0.0.0.0:* 0 13811 1926/dnsmasq
udp 0 0 0.0.0.0:123 0.0.0.0:* 0 10945 2316/chronyd
udp 0 0 0.0.0.0:323 0.0.0.0:* 0 10947 2316/chronyd
udp6 0 0 :::123 :::* 0 10946 2316/chronyd
udp6 0 0 :::323 :::* 0 10948 2316/chronyd
Die Ports sind auch in der UFW freigegeben (deaktivieren selbiger bringt auch nichts)
sudo ufw status
Status: active
To Action From
-- ------ ----
22 ALLOW Anywhere
3306 ALLOW Anywhere
4444 ALLOW Anywhere
4567 ALLOW Anywhere
4568 ALLOW Anywhere
80 ALLOW Anywhere
9102 ALLOW Anywhere
443 ALLOW Anywhere
4949/tcp on tun0 ALLOW Anywhere
5405 ALLOW Anywhere
5404 ALLOW Anywhere
/etc/cluster/cluster.conf
<?xml version="1.0" ?>
<cluster config_version="2" name="pcmk_esp">
<logging debug="on"/>
<clusternodes>
<clusternode name="cluster_esp_prod" nodeid="1">
<fence>
<method name="pcmk-redirect">
<device name="pcmk" port="cluster_esp_prod"/>
</method>
</fence>
</clusternode>
<clusternode name="cluster_esp_monitor" nodeid="2">
<fence>
<method name="pcmk-redirect">
<device name="pcmk" port="cluster_esp_monitor"/>
</method>
</fence>
</clusternode>
<clusternode name="cluster_esp_fallback" nodeid="3">
<fence>
<method name="pcmk-redirect">
<device name="pcmk" port="cluster_esp_fallback"/>
</method>
</fence>
</clusternode>
</clusternodes>
<fencedevices>
<fencedevice agent="fence_pcmk" name="pcmk"/>
</fencedevices>
</cluster>
In der Hosts sind die Namen entsprechend hinterlegt
10.8.0.21 cluster_esp_prod
10.8.0.11 cluster_esp_fallback
10.8.0.16 cluster_esp_monitor
Und in /etc/corosync/corosync.conf ein entsprechendes Setup hinterlegt
...
interface {
member {
memberaddr: 10.8.0.21
}
member {
memberaddr: 10.8.0.11
}
member {
memberaddr: 10.8.0.16
}
# The following values need to be set based on your environment
ringnumber: 0
bindnetaddr: 10.8.0.0
mcastport: 5405
}
transport: udpu
...
Das selbe Setup läuft auf den beiden anderen Kisten wie erwartet, aber diese eine will einfach nicht und ich komme nicht hinter den Grund der Fehlermeldung.
Ich habe auch schon einen simplen Echoserver auf der selben 10.8.0.11 und 5405 problemfrei gestartet und getestet.
Momentan bin ich etwas Ratlos, hat einer von Euch eine schlaue Idee?