hdc: AOPEN DVD1648/LKY, ATAPI CD/DVD-ROM drive
hdd: IOMEGA ZIP 250 ATAPI, ATAPI FLOPPY drive
ide1 at 0x170-0x177,0x376 on irq 15
Probing IDE interface ide2...

# As a result
# we have a separate dead time for when things first come up.
# It should be at least twice the normal dead time.
#
initdead 120
#

Without much clue on how to tune the two parameters relative to each other, I simply followed the hack mentioned in a rather informative discussion, modifying /etc/ha.d/resource.d/drbddisk.

T R O U B L E S H O O T I N G

The steps taken (to the point of failure) are as follows:
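The ha.cf comment quoted above says that initdead should be at least twice the normal dead time. A minimal sketch of that check follows, assuming both values are written as plain seconds and that the normal dead time is set with a `deadtime` directive; the sample values and the function names are hypothetical, not taken from any configuration in this thread.

```python
def parse_ha_cf(text):
    """Return a dict of single-valued ha.cf directives, ignoring comments."""
    directives = {}
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()
        if not line:
            continue
        parts = line.split(None, 1)
        directives[parts[0]] = parts[1].strip() if len(parts) > 1 else ""
    return directives


def initdead_is_safe(text):
    """True when initdead is at least twice deadtime (both assumed to be seconds)."""
    d = parse_ha_cf(text)
    return float(d["initdead"]) >= 2 * float(d["deadtime"])


# Hypothetical sample values, for illustration only.
SAMPLE = """\
keepalive 2
deadtime 30
initdead 120
"""

if __name__ == "__main__":
    print(initdead_is_safe(SAMPLE))
```

The parser keeps only the last value seen for a directive, so it is not suitable for directives that may appear several times, such as ucast.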

Note that the two directives are mutually exclusive.

hda: Maxtor 32049H2, ATA DISK drive
hdb: Maxtor 6Y060L0, ATA DISK drive
Using cfq io scheduler
ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
Probing IDE interface ide1...

The argument after drbddisk:: is the name of the resource you defined in drbd.conf, NOT the actual device name.
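As a rough illustration of the point about drbddisk:: arguments, the sketch below splits one haresources line into its resource entries and flags any drbddisk argument that looks like a device path. The "starts with /dev/" test is only a heuristic assumed here, and the function names are hypothetical.

```python
def split_resources(line):
    """Split a haresources line into (node, [[script, arg, ...], ...])."""
    node, *resources = line.split()
    return node, [r.split("::") for r in resources]


def suspicious_drbddisk_args(line):
    """Return drbddisk arguments that look like device paths rather than resource names."""
    _, resources = split_resources(line)
    bad = []
    for name, *args in resources:
        if name == "drbddisk":
            bad.extend(a for a in args if a.startswith("/dev/"))
    return bad


if __name__ == "__main__":
    good = "server01 drbddisk::r0 Filesystem::/dev/drbd0::/mnt/drbd::ext3 smb"
    wrong = "server01 drbddisk::/dev/drbd0 Filesystem::/dev/drbd0::/mnt/drbd::ext3 smb"
    print(suspicious_drbddisk_args(good))
    print(suspicious_drbddisk_args(wrong))
```

A name such as drbd0 passes this check, which is correct when the resource in drbd.conf really is called drbd0.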

I started the httpd and openfiler services

Just trying to eliminate the obvious, but is there a \ following your first line in

heartbeat[3725]: 2006/08/30_10:41:00 info: Status update for node test02: status ping
heartbeat[3725]: 2006/08/30_10:41:08 info: Status update for node test01: status init
heartbeat[3725]: 2006/08/30_10:41:08 info: Status update for node test01: status up
harc[3740]: Retrying: Permission denied

Oct 07 18:46:15 db1.snarvaez.com.ar heartbeat: [5366]: ERROR: glib: ucast: error binding socket.

May 13 11:15:42 server02 ResourceManager[2824]: ERROR: Return code 1 from /etc/ha.d/resource.d/drbddisk
May 13 11:15:42 server02 ResourceManager[2824]: CRIT: Giving up resources due to failure of drbddisk::r0
May 13 11:15:42 server02 ResourceManager[2824]: info: Releasing resource group:

Such messages were repeated 5 times in the log, indicating repeated attempts to run /etc/ha.d/resource.d/drbddisk.

heartbeat[3725]: 2006/08/30_10:43:42 WARN: 1 lost packet(s) for [test01] [98:100]
heartbeat[3725]: 2006/08/30_10:43:42 info: remote resource transition completed.
heartbeat[3725]: 2006/08/30_10:42:13 info: other_holds_resources: 3
heartbeat[3725]: 2006/08/30_10:42:13 ERROR: Both machines own our resources!
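When the same "Return code N from ..." error repeats several times, it can help to pull just those entries out of the log. The following is a small sketch that matches only the message wording seen in the excerpts in this thread; the function name is hypothetical, and other heartbeat versions may word the message differently.

```python
import re

# Matches the "ERROR: Return code N from <script>" wording quoted in this thread.
RC_PATTERN = re.compile(r"ERROR: Return code (\d+) from (\S+)")


def failed_scripts(log_text):
    """Return (script_path, return_code) for every matching error in the log text."""
    return [(m.group(2), int(m.group(1))) for m in RC_PATTERN.finditer(log_text)]


if __name__ == "__main__":
    sample = (
        "May 13 11:15:42 server02 ResourceManager[2824]: ERROR: Return code 1 "
        "from /etc/ha.d/resource.d/drbddisk\n"
        "May 13 11:15:42 server02 ResourceManager[2824]: CRIT: Giving up resources "
        "due to failure of drbddisk::r0\n"
    )
    for script, rc in failed_scripts(sample):
        print(script, rc)
```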

The simplest way to configure STONITH with crm is still to use /usr/lib64/heartbeat/haresources2cib.py to automatically generate /var/lib/heartbeat/crm/cib.xml from /etc/ha.d/ha.cf.

After some time, the virtual IP disappears from the secondary server (giving me the impression that it takes a while for heartbeat to get settled), but then the windows system is not

#6 dagreen, Newbie, Posted 11 October 2010 - 04:03 PM

I figured out what my error was.

Just found haresources in your first post:

Code:
test02 IPaddr::
test02 drbddisk::r0 Filesystem::/dev/drbd0 smb

This should be just one line.
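To spot a resource group that has been split across two haresources lines, as in the post quoted above, something like the following sketch could be used. It assumes that a trailing backslash continues a line and that lines starting with # are comments; the function names are hypothetical. A node that appears on more than one logical line is only reported, since whether the lines should be merged is a judgment call.

```python
from collections import Counter


def logical_lines(text):
    """Join backslash-continued lines and drop blank and comment lines."""
    lines, pending = [], ""
    for raw in text.splitlines():
        if raw.lstrip().startswith("#"):
            continue
        line = raw.rstrip()
        if line.endswith("\\"):
            pending += line[:-1] + " "
            continue
        full = (pending + line).strip()
        pending = ""
        if full:
            lines.append(full)
    if pending.strip():
        lines.append(pending.strip())
    return lines


def nodes_with_split_groups(text):
    """Return node names that start more than one logical haresources line."""
    counts = Counter(line.split()[0] for line in logical_lines(text))
    return sorted(node for node, count in counts.items() if count > 1)


if __name__ == "__main__":
    split = "test02 IPaddr::\ntest02 drbddisk::r0 Filesystem::/dev/drbd0 smb\n"
    print(nodes_with_split_groups(split))
```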

This bug would result in STONITH being disabled in the generated /var/lib/heartbeat/crm/cib.xml:

Fix either Python or XML to enable STONITH with crm.

heartbeat[6618]: 2006/08/30_10:43:42 info: go_standby: who: 1 resource set: foreign
heartbeat[6618]: 2006/08/30_10:43:42 info: go_standby: (query/action): (otherkeys/givegroup)
heartbeat[6618]: 2006/08/30_10:43:42 info: foreign HA resource release completed (standby).
heartbeat[3725]: 2006/08/30_10:41:29 info: AnnounceTakeover(local 1, foreign 1, reason 'mach_down' (1))
heartbeat[3725]: 2006/08/30_10:41:29 info: STATE 1 => 3
heartbeat[3725]: 2006/08/30_10:41:29 info: Exiting status process 3767 returned rc 0.
ResourceManager[6652]: 2006/08/30_10:45:21 info: Running /etc/ha.d/resource.d/drbddisk r0 stop
ResourceManager[6652]: 2006/08/30_10:45:21 info: Running /etc/ha.d/resource.d/IPaddr stop
IPaddr[7687]: 2006/08/30_10:45:21 INFO: IPaddr Success
heartbeat[6642]: 2006/08/30_10:45:21 info: All HA resources relinquished.

http://snarvaez.com.ar/libertad/index.php/2012/11/07/install-heartbeat-3-on-gnulinux-centos-6-3/

heartbeat[3725]: 2006/08/30_10:42:13 info: AnnounceTakeover(local 1, foreign 1, reason 'T_RESOURCES(us)' (1))
heartbeat[3725]: 2006/08/30_10:42:13 ERROR: Both machines own our resources!

Moreover, I am extremely glad that you are willing to offer your guidance on this problem.

falko, Aug 31, 2006 #10
djalex New Member

/var/log/acpid (test02)
==========
[Wed Aug 30 10:50:21 2006] starting up
[Wed Aug 30 10:50:21 2006] 1 rule loaded

/var/log/boot.log (test02)
============
Aug 30

Filesystem[5828]: 2009/01/12_22:35:29 ERROR: Generic error .......

falko, Aug 26, 2006 #6
djalex New Member

Hi Falko, I haven't been able to find any particular errors that point to the cause of the crash.

Basically, to configure heartbeat it is necessary to add 3 files in the folder /etc/ha.d:

/etc/ha.d/ha.cf
/etc/ha.d/haresources
/etc/ha.d/authkeys

The 3 files should be exactly the same in both servers running heartbeat,
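One way to confirm that the three files match on both servers is to compare checksums. The sketch below assumes the other server's /etc/ha.d has already been copied to a local directory (how it gets there is outside this example); the function names and the demo directories are hypothetical.

```python
import hashlib
import tempfile
from pathlib import Path

HA_FILES = ("ha.cf", "haresources", "authkeys")


def digests(config_dir):
    """Map each heartbeat config file name to its SHA-256 digest, or None if missing."""
    out = {}
    for name in HA_FILES:
        path = Path(config_dir) / name
        out[name] = hashlib.sha256(path.read_bytes()).hexdigest() if path.exists() else None
    return out


def differing_files(dir_a, dir_b):
    """Return the names of config files whose contents differ between two directories."""
    a, b = digests(dir_a), digests(dir_b)
    return [name for name in HA_FILES if a[name] != b[name]]


if __name__ == "__main__":
    # Demo with two temporary directories standing in for the two servers.
    with tempfile.TemporaryDirectory() as a, tempfile.TemporaryDirectory() as b:
        for d in (a, b):
            for name in HA_FILES:
                (Path(d) / name).write_text("placeholder\n")
        (Path(b) / "haresources").write_text("changed\n")
        print(differing_files(a, b))
```

A file that is missing on one side and present on the other is reported as differing.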

Past the obvious, can you do a manual fail over?

Oct 10 06:31:10 db1.snarvaez.com.ar heartbeat: [28945]: info: Status update for node db2.snarvaez.com.ar: status active
Oct 10 06:31:10 db1.snarvaez.com.ar heartbeat: [28945]: info: AnnounceTakeover(local 0, foreign 1, reason 'HB_R_BOTH STARTING' (0))
Oct 10

May 13 11:15:29 server02 heartbeat: [2762]: WARN: Shared disks are not protected.

RC=20
Jan 28 12:04:21 localhost heartbeat: ERROR: Return code 20 from /etc/ha.d/resource.d/drbddisk
Jan 28 12:04:22 localhost heartbeat: info: Running /etc/ha.d/resource.d/Filesystem /dev/drbd0 /mnt/debian ext3 start
Jan 28 12:04:22 localhost heartbeat: debug: Starting

SELinux: Starting in permissive mode
There is already a security framework initialized, register_security failed.

I have been using this as my primary guide in order to set this up: Installing and Configuring Openfiler with DRBD and Heartbeat http://www.howtoforg...d-and-heartbeat

Are there any other guides more recent

I have noticed that when I start up heartbeat, both the primary and secondary server have the virtual IP address for the initial period.

heartbeat[3725]: 2006/08/30_10:43:41 info: test02 wants to go standby [foreign]
heartbeat[3725]: 2006/08/30_10:43:41 info: i_hold_resources: 1
heartbeat[3725]: 2006/08/30_10:43:41 info: New standby state: 1
heartbeat[3725]: 2006/08/30_10:43:42 info: standby: test01 can take our foreign resources

That normally works fine. See the example of ucast and ha.cf config in the following sections.

After the primary server crashes, I ran the command drbdadm primary all on the secondary server, in order to mount the virtual block.

Oct 10 06:31:20 db1.snarvaez.com.ar heartbeat: [28945]: info: AnnounceTakeover(local 1, foreign 1, reason 'T_RESOURCES(us)' (0))
Oct 10 06:31:20 db1.snarvaez.com.ar heartbeat: [28945]: info: Initial resource acquisition complete (T_RESOURCES(us))
Oct 10 06:31:20

More details about cib.xml can be found in /usr/lib64/heartbeat/crm.dtd.

harc[2953]: 2009/05/13_11:14:52 info: Running /etc/ha.d/rc.d/ip-request-resp ip-request-resp
ip-request-resp[2953]: 2009/05/13_11:14:52 received ip-request-resp drbddisk::r0 OK yes
ResourceManager[2974]: 2009/05/13_11:14:52 info: Acquiring resource group: server01 drbddisk::r0 Filesystem::/dev/drbd0::/mnt/drbd::ext3 smb
ResourceManager[2974]: 2009/05/13_11:14:52 info: Running /etc/ha.d/resource.d/drbddisk r0 start

Oct 10 06:31:20 db1.snarvaez.com.ar heartbeat: [28991]: info: FIFO message [type resource] written rc=81
Oct 10 06:31:20 db1.snarvaez.com.ar heartbeat: [28945]: info: AnnounceTakeover(local 1, foreign 1, reason 'T_RESOURCES(us)' (1))
Oct 10 06:31:20 db1.snarvaez.com.ar

heartbeat[3724]: 2006/08/30_10:40:58 ERROR: Invalid user id name [hacluster]
heartbeat[3724]: 2006/08/30_10:40:58 ERROR: Bad uid list [hacluster]
heartbeat[3724]: 2006/08/30_10:40:58 ERROR: Invalid apiauth directive [ccm uid=hacluster]
heartbeat[3724]: 2006/08/30_10:40:58 info: Syntax: apiauth client [uid=uidlist] [gid=gidlist]

checking if image is initramfs...

RC=1
Jan 28 12:04:22 localhost heartbeat: ERROR: Return code 1 from /etc/ha.d/resource.d/Filesystem

The resources are:

debian2 drbddisk::drbd0 Filesystem::/dev/drbd0::/mnt/debian::ext3 apache

and the apache failover works but the drbd0 always comes up

ResourceManager[23386]: 2009/07/12_10:04:21 WARN: it (LVM::vg2drbd) MUST succeed on a stop when already stopped
ResourceManager[23386]: 2009/07/12_10:04:21 WARN: Machine reboot narrowly avoided!
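The "Invalid user id name [hacluster]" errors in this excerpt suggest that the account named in an apiauth directive does not exist on that machine. A hedged sketch of a pre-flight check follows. It relies on the syntax printed in the log (apiauth client [uid=uidlist] [gid=gidlist]) and assumes the uid list is comma-separated; the function names are hypothetical, and the pwd module is available on Unix only.

```python
import pwd


def system_user_exists(name):
    """True if the local passwd database has an account with this name."""
    try:
        pwd.getpwnam(name)
        return True
    except KeyError:
        return False


def missing_apiauth_users(ha_cf_text, user_exists=system_user_exists):
    """Return user names from apiauth uid= lists for which user_exists() is false."""
    missing = []
    for raw in ha_cf_text.splitlines():
        fields = raw.split("#", 1)[0].split()
        if len(fields) < 2 or fields[0] != "apiauth":
            continue
        for field in fields[2:]:
            if not field.startswith("uid="):
                continue
            # Assumption: several users are separated by commas.
            for user in field[len("uid="):].split(","):
                if user and not user_exists(user) and user not in missing:
                    missing.append(user)
    return missing


if __name__ == "__main__":
    print(missing_apiauth_users("apiauth ccm uid=hacluster\n"))
```

The lookup is passed in as a parameter so the parsing can be exercised without touching the real account database.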

When I try to simulate the failure of node1 by shutting down the heartbeat service on server01, the heartbeat on server02 did not kick in and did not take over and

heartbeat[3725]: 2006/08/30_10:41:38 ERROR: Both machines own foreign resources!

May 13 11:15:29 server02 heartbeat: [2762]: info: Resources being acquired from server01
May 13 11:15:29 server02 heartbeat: [2770]: info: No local resources [/usr/share/heartbeat/ResourceManager listkeys server02] to acquire.

Today I tried to stop heartbeat manually on both servers for testing:

/etc/init.d/heartbeat stop

Then I noticed these errors in /var/log/ha-log (on both servers):
---
heartbeat[2834]: 2009/01/12_22:35:08 info: Heartbeat shutdown in

The file /etc/ha.d/ha.cf

Here is the ha.cf file.

If there are any specific logs which you require, I am willing to post them for your review.

Started drbd on test01 (secondary)

Regards, Alex

djalex, Aug 25, 2006 #5
falko Super Moderator ISPConfig Developer

djalex said: However, primary linux server crashes when windows client tries to access the files with Samba.

On node 0 totalpages: 131056
DMA zone: 4096 pages, LIFO batch:1
Normal zone: 126960 pages, LIFO batch:16
HighMem zone: 0 pages, LIFO batch:1
DMI 2.3 present.

my haresources file:
---
th-dus-mqm drbddisk::drbd0 Filesystem::/dev/drbd0::/dus::ext3 drbddisk::drbd2 Filesystem::/dev/drbd2::/home/tbmx/dus::ext3 mqm_dus
th-fra-mqm drbddisk::drbd1 Filesystem::/dev/drbd1::/fra::ext3 drbddisk::drbd3 Filesystem::/dev/drbd3::/home/tbmx/fra::ext3 mqm_fra
---
my ha.cf:
---
node th-dus-mqm th-fra-mqm
ucast bond0.121
ucast

Warning: The resulting partition is not properly aligned for best performance.