1
* Fri Mar 08 2013 Andrew Beekhof <andrew@beekhof.net> Pacemaker-1.1.9-1
2
- Update source tarball to revision: 7e42d77
5
Diff: 1301 files changed, 92909 insertions(+), 57455 deletions(-)
7
- Features added in Pacemaker-1.1.9
8
+ corosync: Allow cman and corosync 2.0 nodes to use a name other than uname()
9
+ corosync: Use queues to avoid blocking when sending CPG messages
10
+ ipc: Compress messages that exceed the configured IPC message limit
11
+ ipc: Use queues to prevent slow clients from blocking the server
12
+ ipc: Use shared memory by default
13
+ lrmd: Support nagios remote monitoring
14
+ lrmd: Pacemaker Remote Daemon for extending pacemaker functionality outside corosync cluster.
15
+ pengine: Check for master/slave resources that are not OCF agents
16
+ pengine: Support a 'requires' resource meta-attribute for controlling whether it needs quorum, fencing or nothing
17
+ pengine: Support for resource container
18
+ pengine: Support resources that require unfencing before start
20
- Changes since Pacemaker-1.1.8
21
+ attrd: Correctly handle deletion of non-existant attributes
22
+ Bug cl#5135 - Improved detection of the active cluster type
23
+ Bug rhbz#913093 - Use crm_node instead of uname
24
+ cib: Avoid use-after-free by correctly support cib_no_children for non-xpath queries
25
+ cib: Correctly process XML diff's involving element removal
26
+ cib: Performance improvements for non-DC nodes
27
+ cib: Prevent error message by correctly handling peer replies
28
+ cib: Prevent ordering changes when applying xml diffs
29
+ cib: Remove text nodes from cib replace operations
30
+ cluster: Detect node name collisions in corosync
31
+ cluster: Preserve corosync membership state when matching node name/id entries
32
+ cman: Force fenced to terminate on shutdown
33
+ cman: Ignore qdisk 'nodes'
34
+ core: Drop per-user core directories
35
+ corosync: Avoid errors when closing failed connections
36
+ corosync: Ensure peer state is preserved when matching names to nodeids
37
+ corosync: Clean up CMAP connections after querying node name
38
+ corosync: Correctly detect corosync 2.0 clusters even if we don't have permission to access it
39
+ crmd: Bug cl#5144 - Do not updated the expected status of failed nodes
40
+ crmd: Correctly determin if cluster disconnection was abnormal
41
+ crmd: Correctly relay messages for remote clients (bnc#805626, bnc#804704)
42
+ crmd: Correctly stall the FSA when waiting for additional inputs
43
+ crmd: Detect and recover when we are evicted from CPG
44
+ crmd: Differentiate between a node that is up and coming up in peer_update_callback()
45
+ crmd: Have cib operation timeouts scale with node count
46
+ crmd: Improved continue/wait logic in do_dc_join_finalize()
47
+ crmd: Prevent election storms caused by getrusage() values being too close
48
+ crmd: Prevent timeouts when performing pacemaker level membership negotiation
49
+ crmd: Prevent use-after-free of fsa_message_queue during exit
50
+ crmd: Store all current actions when stalling the FSA
51
+ crm_mon: Do not try to render a blank cib and indicate the previous output is now stale
52
+ crm_mon: Fixes crm_mon crash when using snmp traps.
53
+ crm_mon: Look for the correct error codes when applying configuration updates
54
+ crm_report: Ensure policy engine logs are found
55
+ crm_report: Fix node list detection
56
+ crm_resource: Have crm_resource generate a valid transition key when sending resource commands to the crmd
57
+ date/time: Bug cl#5118 - Correctly convert seconds-since-epoch to the current time
58
+ fencing: Attempt to provide more information that just 'generic error' for failed actions
59
+ fencing: Correctly record completed but previously unknown fencing operations
60
+ fencing: Correctly terminate when all device options have been exhausted
61
+ fencing: cov#739453 - String not null terminated
62
+ fencing: Do not merge new fencing requests with stale ones from dead nodes
63
+ fencing: Do not start fencing until entire device topology is found or query results timeout.
64
+ fencing: Do not wait for the query timeout if all replies have arrived
65
+ fencing: Fix passing of parameters from CMAN containing '='
66
+ fencing: Fix non-comparison when sorting devices by priority
67
+ fencing: On failure, only try a topology device once from the remote level.
68
+ fencing: Only try peers for non-topology based operations once
69
+ fencing: Retry stonith device for duration of action's timeout period.
70
+ heartbeat: Remove incorrect assert during cluster connect
71
+ ipc: Bug cl#5110 - Prevent 100% CPU usage when looking for synchronous replies
72
+ ipc: Use 50k as the default compression threshold
73
+ legacy: Prevent assertion failure on routing ais messages (bnc#805626)
74
+ legacy: Re-enable logging from the pacemaker plugin
75
+ legacy: Relax the 'active' check for plugin based clusters to avoid false negatives
76
+ legacy: Skip peer process check if the process list is empty in crm_is_corosync_peer_active()
77
+ mcp: Only define HA_DEBUGLOG to avoid agent calls to ocf_log printing everything twice
78
+ mcp: Re-attach to existing pacemaker components when mcp fails
79
+ pengine: Any location constraint for the slave role applies to all roles
80
+ pengine: Avoid leaking memory when cleaning up failcounts and using containers
81
+ pengine: Bug cl#5101 - Ensure stop order is preserved for partially active groups
82
+ pengine: Bug cl#5140 - Allow set members to be stopped when the subseqent set has require-all=false
83
+ pengine: Bug cl#5143 - Prevent shuffling of anonymous master/slave instances
84
+ pengine: Bug rhbz#880249 - Ensure orphan masters are demoted before being stopped
85
+ pengine: Bug rhbz#880249 - Teach the PE how to recover masters into primitives
86
+ pengine: cl#5025 - Automatically clear failcount for start/monitor failures after resource parameters change
87
+ pengine: cl#5099 - Probe operation uses the timeout value from the minimum interval monitor by default (#bnc776386)
88
+ pengine: cl#5111 - When clone/master child rsc has on-fail=stop, insure all children stop on failure.
89
+ pengine: cl#5142 - Do not delete orphaned children of an anonymous clone
90
+ pengine: Correctly unpack active anonymous clones
91
+ pengine: Ensure previous migrations are closed out before attempting another one
92
+ pengine: Introducing the whitebox container resources feature
93
+ pengine: Prevent double-free for cloned primitive from template
94
+ pengine: Process rsc_ticket dependencies earlier for correctly allocating resources (bnc#802307)
95
+ pengine: Remove special cases for fencing resources
96
+ pengine: rhbz#902459 - Remove rsc node status for orphan resources
97
+ systemd: Gracefully handle unexpected DBus return types
98
+ Replace the use of the insecure mktemp(3) with mkstemp(3)
100
* Thu Sep 20 2012 Andrew Beekhof <andrew@beekhof.net> Pacemaker-1.1.8-1
102
- Update source tarball to revision: 1a5341f
105
Diff: 2107 files changed, 117258 insertions(+), 73606 deletions(-)
107
- All APIs have been cleaned up and reduced to essentials
108
- Pacemaker now includes a replacement lrmd that supports systemd and upstart agents
109
- Config and state files (cib.xml, PE inputs and core files) have moved to new locations
110
- The crm shell has become a separate project and no longer included with Pacemaker
111
- All daemons/tools now have a unified set of error codes based on errno.h (see crm_error)
113
- Changes since Pacemaker-1.1.7
114
+ Core: Bug cl#5032 - Rewrite the iso8601 date handling code
115
+ Core: Correctly extract the version details from a diff
116
+ Core: Log blackbox contents, if enabled, when an error occurs
117
+ Core: Only LOG_NOTICE and higher are sent to syslog
118
+ Core: Replace use of IPC from clplumbing with IPC from libqb
119
+ Core: SIGUSR1 now enables blackbox logging, SIGTRAP to write out
120
+ Core: Support a blackbox for additional logging detail after crashes/errors
121
+ Promote support for advanced fencing logic to the stable schema
122
+ Promote support for node starting scores to the stable schema
123
+ Promote support for service and systemd to the stable schema
125
+ attrd: Differentiate between updating all our attributes and everybody updating all theirs too
126
+ attrd: Have single-shot clients wait for an ack before disconnecting
127
+ cib: cl#5026 - Synced cib updates should not return until the cpg broadcast is complete.
128
+ corosync: Detect when the first corosync has not yet formed and handle it gracefully
129
+ corosync: Obtain a full list of configured nodes, including their names, when we connect to the quorum API
130
+ corosync: Obtain a node name from DNS if one was not already known
131
+ corosync: Populate the cib nodelist from corosync if available
132
+ corosync: Use the CFG API and DNS to determine node names if not configured in corosync.conf
133
+ crmd: Block after 10 failed fencing attempts for a node
134
+ crmd: cl#5051 - Fixes file leak in pe ipc connection initialization.
135
+ crmd: cl#5053 - Fixes fail-count not being updated properly.
136
+ crmd: cl#5057 - Restart sub-systems correctly (bnc#755671)
137
+ crmd: cl#5068 - Fixes crm_node -R option so it works with corosync 2.0
138
+ crmd: Correctly re-establish failed attrd connections
139
+ crmd: Detect when the quorum API isn't configured for corosync 2.0
140
+ crmd: Do not overwrite any configured node type (eg. quorum node)
141
+ crmd: Enable use of new lrmd daemon and client library in crmd.
142
+ crmd: Overhaul the way node state is recorded and updated in the CIB
143
+ fencing: Bug rhbz#853537 - Prevent use-of-NULL when the cib libraries are not available
144
+ fencing: cl#5073 - Add 'off' as an valid value for stonith-action option.
145
+ fencing: cl#5092 - Always timeout stonith operations if timeout period expires.
146
+ fencing: cl#5093 - Stonith per device timeout option
147
+ fencing: Clean up if we detect a failed connection
148
+ fencing: Delegate complex self fencing requests - we wont be around to see it to completion
149
+ fencing: Ensure all peers are notified of complex fencing op completion
150
+ fencing: Fix passing of fence_legacy parameters containing '='
151
+ fencing: Gracefully handle metadata requests for unknown agents
152
+ fencing: Return cached dynamic target list for busy devices.
153
+ fencing: rhbz#801355 - Abort transition on DC when external fencing operation is detected
154
+ fencing: rhbz#801355 - Merge fence requests for identical operations already in progress.
155
+ fencing: rhbz#801355 - Report fencing operations external of pacemaker to cib
156
+ fencing: Specify the action to perform using action= instead of the older option=
157
+ fencing: Stop building fake metadata for broken agents
158
+ fencing: Tolerate agents that report empty metadata in the admin tool
159
+ mcp: Correctly retry the connection to corosync on failure
160
+ mcp: Do not shut down IPC until the last client exits
161
+ mcp: Prevent use-after-free when running against corosync 1.x
162
+ pengine: Bug cl#5059 - Use the correct action's status when calculating required actions for interleaved clones
163
+ pengine: Bypass online/offline checking resource detection for ping/quorum nodes
164
+ pengine: cl#5044 - migrate_to no longer requires load_stopped for avoiding possible transition loop
165
+ pengine: cl#5069 - Honor 'on-fail=ignore' even when operation is disabled.
166
+ pengine: cl#5070 - Allow influence of promotion score when multistate rsc is left hand of colocation
167
+ pengine: cl#5072 - Fixes monitor op stopping after rsc promotion.
168
+ pengine: cl#5072 - Fixes pengine regression test failures
169
+ pengine: Correctly set the status for nodes not intended to run Pacemaker
170
+ pengine: Do not append instance numbers to anonymous clones
171
+ pengine: Fix failcount expiration
172
+ pengine: Fix memory leaks found by valgrind
173
+ pengine: Fix use-after-free and use-of-NULL errors detected by coverity
174
+ pengine: Fixes use of colocation scores other than +/- INFINITY
175
+ pengine: Improve detection of rejoining nodes
176
+ pengine: Prevent use-of-NULL when tracing is enabled
177
+ pengine: Stonith resources are allowed to start even if their probes haven't completed on partially active nodes
178
+ services: New class called 'service' which expands to the correct (LSB/systemd/upstart) standard
179
+ services: Support Asynchronous systemd/upstart actions
180
+ Tools: crm_shadow - Bug cl#5062 - Correctly set argv[0] when forking a shell process
181
+ Tools: crm_report: Always include system logs (if we can find them)
1
183
* Wed Mar 28 2012 Andrew Beekhof <andrew@beekhof.net> Pacemaker-1.1.7-1
2
184
- Update source tarball to revision: bc7ff2c
5
187
Diff: 1171 files changed, 90472 insertions(+), 19368 deletions(-)
7
189
- Changes since Pacemaker-1.1.6.1
8
High: ais: Prepare for corosync versions using IPC from libqb
9
High: cib: Correctly shutdown in the presence of peers without relying on timers
10
High: cib: Don't halt disk writes if the previous digest is missing
11
High: cib: Determine when there are no peers to respond to our shutdown request and exit
12
High: cib: Ensure no additional messages are processed after we begin terminating
13
High: Cluster: Hook up the callbacks to the corosync quorum notifications
14
High: Core: basename() may modify its input, do not pass in a constant
15
High: Core: Bug cl#5016 - Prevent failures in recurring ops from being lost
16
High: Core: Bug rhbz#800054 - Correctly retrieve heartbeat uuids
17
High: Core: Correctly determine when an XML file should be decompressed
18
High: Core: Correctly track the length of a string without reading from uninitialzied memory (valgrind)
19
High: Core: Ensure signals are handled eventually in the absense of timer sources or IPC messages
20
High: Core: Prevent use-of-NULL in crm_update_peer()
21
High: Core: Strip text nodes from on disk xml files
22
High: Core: Support libqb for logging
23
High: corosync: Consistently set the correct uuid with get_node_uuid()
24
High: Corosync: Correctly disconnect from corosync variants
25
High: Corosync: Correctly extract the node id from membership udpates
26
High: corosync: Correctly infer lost members from the quorum API
27
High: Corosync: Default to using the nodeid as the node's uuid (instead of uname)
28
High: corosync: Ensure we catch nodes that leave the membership, even if the ringid doesn't change
29
High: corosync: Hook up CPG membership
30
High: corosync: Relax a development assert and gracefully handle the error condition
31
High: corosync: Remove deprecated member of the CFG API
32
High: corosync: Treat CS_ERR_QUEUE_FULL the same as CS_ERR_TRY_AGAIN
33
High: corosync: Unset the process list when nodes dissappear on us
34
High: crmd: Also purge fencing results when we enter S_NOT_DC
35
High: crmd: Bug cl#5015 - Remove the failed operation as well as the resulting fail-count and last-failure attributes
36
High: crmd: Correctly determine when a node can suicide with fencing
37
High: crmd: Election - perform the age comparison only once
38
High: crmd: Fast-track shutdown if we couldn't request it via attrd
39
High: crmd: Leave it up to the PE to decide which ops can/cannot be reload
40
High: crmd: Prevent use-after-free when calling delete_resource due to CRM_OP_REPROBE
41
High: crmd: Supply format arguments in the correct order
42
High: Fencing: Add missing format parameter
43
High: fencing: Add the fencing topology section to the 1.1 configuration schema
44
High: fencing: fence_legacy - Drop spurilous host argument from status query
45
High: Fencing: fence_legacy - Ensure port is available as an environment variable when calling monitor
46
High: Fencing: fence_pcmk - don't block if nothing is specified on stdin
47
High: fencing: Fix log format error
48
High: Fencing: Fix segfault caused by passing garbage to dlsym()
49
High: Fencing: Fix use-of-NULL in process_remote_stonith_query()
50
High: Fencing: Fix use-of-NULL when listing installed devices
51
High: Fencing: Implement support for advanced fencing topologies: eg. kdump || (network && disk) || power
52
High: fencing: More gracefully handle failed 'list' operations for devices that only support a single connection
53
High: Fencing: Prevent duplicate free when listing devices
54
High: Fencing: Prevent uninitialized pointers being passed to free
55
High: Fencing: Prevent use-after-free, we may need the query result for subsequent operations
56
High: fencing: Provide enough data to construct an entry in the node's fencing history
57
High: Fencing: Standardize on /one/ method for clients to request members be fenced
58
High: Fencing: Supress errors when listing all registered devices
59
High: mcp: corosync_cfg_state_track was removed from the corosync API, luckily we didnt use it for anything
60
High: mcp: Do not specify a WorkingDirectory in the systemd unit file - startup fails if its not available
61
High: mcp: Set the HA_quorum_type env variable consistently with our corosync plugin
62
High: mcp: Shut down if one of our child processes can/should not be respawned
63
High: PE: Bug cl#5000 - Ensure ordering is preserved when depending on partial sets
64
High: PE: Bug cl#5028 - Unmanaged services should block shutdown unless in maintainence mode
65
High: PE: Bug cl#5038 - Prevent restart of anonymous clones when clone-max decreases
66
High: PE: Bug cl#5007 - Fixes use of colocation constraints with multi-state resources
67
High: PE: Bug cl#5014 - Prevent asymmetrical order constraints from causing resource stops
68
High: PE: Bug cl#5000 - Implements ability to create rsc_order constraint sets such that A can start after B or C has started.
69
High: PE: Correctly migrate a resource that has just migrated
70
High: PE: Correct return from error path
71
High: PE: Detect reloads of previously migrated resources
72
High: PE: Ensure post-migration stop actions occur before node shutdown
73
High: PE: Log as loudly as possible when we cannot shut down a cluster node
74
High: PE: Reload of a resource no longer causes a restart of dependant resources
75
High: PE: Support limiting the number of concurrent live migrations
76
High: PE: Support referencing templates in constraints
77
High: PE: Support of referencing resource templates in resource sets
78
High: PE: Support to make tickets standby for relinquishing tickets gracefully
79
High: stonith: A "start" operation of a stonith resource does a "monitor" on the device beyond registering it
80
High: stonith: Bug rhbz#745526 - Ensure stonith_admin actually gets called by fence_pcmk
81
High: Stonith: Ensure all nodes receive and deliver notifications of the manual override
82
High: stonith: Fix the stonith timeout issue (cl#5009, bnc#727498)
83
High: Stonith: Implement a manual override for when nodes are known to be safely off
84
High: Tools: Bug cl#5003 - Prevent use-after-free in crm_simlate
85
High: Tools: crm_mon - Support to display tickets (based on Yuusuke Iida's work)
86
High: Tools: crm_simulate - Support to grant/revoke/standby/activate tickets from the new ticket state section
87
High: Tools: Implement crm_node functionality for native corosync
88
High: Fix a number of potential problems reported by coverity
190
+ ais: Prepare for corosync versions using IPC from libqb
191
+ cib: Correctly shutdown in the presence of peers without relying on timers
192
+ cib: Don't halt disk writes if the previous digest is missing
193
+ cib: Determine when there are no peers to respond to our shutdown request and exit
194
+ cib: Ensure no additional messages are processed after we begin terminating
195
+ Cluster: Hook up the callbacks to the corosync quorum notifications
196
+ Core: basename() may modify its input, do not pass in a constant
197
+ Core: Bug cl#5016 - Prevent failures in recurring ops from being lost
198
+ Core: Bug rhbz#800054 - Correctly retrieve heartbeat uuids
199
+ Core: Correctly determine when an XML file should be decompressed
200
+ Core: Correctly track the length of a string without reading from uninitialzied memory (valgrind)
201
+ Core: Ensure signals are handled eventually in the absense of timer sources or IPC messages
202
+ Core: Prevent use-of-NULL in crm_update_peer()
203
+ Core: Strip text nodes from on disk xml files
204
+ Core: Support libqb for logging
205
+ corosync: Consistently set the correct uuid with get_node_uuid()
206
+ Corosync: Correctly disconnect from corosync variants
207
+ Corosync: Correctly extract the node id from membership udpates
208
+ corosync: Correctly infer lost members from the quorum API
209
+ Corosync: Default to using the nodeid as the node's uuid (instead of uname)
210
+ corosync: Ensure we catch nodes that leave the membership, even if the ringid doesn't change
211
+ corosync: Hook up CPG membership
212
+ corosync: Relax a development assert and gracefully handle the error condition
213
+ corosync: Remove deprecated member of the CFG API
214
+ corosync: Treat CS_ERR_QUEUE_FULL the same as CS_ERR_TRY_AGAIN
215
+ corosync: Unset the process list when nodes dissappear on us
216
+ crmd: Also purge fencing results when we enter S_NOT_DC
217
+ crmd: Bug cl#5015 - Remove the failed operation as well as the resulting fail-count and last-failure attributes
218
+ crmd: Correctly determine when a node can suicide with fencing
219
+ crmd: Election - perform the age comparison only once
220
+ crmd: Fast-track shutdown if we couldn't request it via attrd
221
+ crmd: Leave it up to the PE to decide which ops can/cannot be reload
222
+ crmd: Prevent use-after-free when calling delete_resource due to CRM_OP_REPROBE
223
+ crmd: Supply format arguments in the correct order
224
+ fencing: Add missing format parameter
225
+ fencing: Add the fencing topology section to the 1.1 configuration schema
226
+ fencing: fence_legacy - Drop spurilous host argument from status query
227
+ fencing: fence_legacy - Ensure port is available as an environment variable when calling monitor
228
+ fencing: fence_pcmk - don't block if nothing is specified on stdin
229
+ fencing: Fix log format error
230
+ fencing: Fix segfault caused by passing garbage to dlsym()
231
+ fencing: Fix use-of-NULL in process_remote_stonith_query()
232
+ fencing: Fix use-of-NULL when listing installed devices
233
+ fencing: Implement support for advanced fencing topologies: eg. kdump || (network && disk) || power
234
+ fencing: More gracefully handle failed 'list' operations for devices that only support a single connection
235
+ fencing: Prevent duplicate free when listing devices
236
+ fencing: Prevent uninitialized pointers being passed to free
237
+ fencing: Prevent use-after-free, we may need the query result for subsequent operations
238
+ fencing: Provide enough data to construct an entry in the node's fencing history
239
+ fencing: Standardize on /one/ method for clients to request members be fenced
240
+ fencing: Supress errors when listing all registered devices
241
+ mcp: corosync_cfg_state_track was removed from the corosync API, luckily we didnt use it for anything
242
+ mcp: Do not specify a WorkingDirectory in the systemd unit file - startup fails if its not available
243
+ mcp: Set the HA_quorum_type env variable consistently with our corosync plugin
244
+ mcp: Shut down if one of our child processes can/should not be respawned
245
+ pengine: Bug cl#5000 - Ensure ordering is preserved when depending on partial sets
246
+ pengine: Bug cl#5028 - Unmanaged services should block shutdown unless in maintainence mode
247
+ pengine: Bug cl#5038 - Prevent restart of anonymous clones when clone-max decreases
248
+ pengine: Bug cl#5007 - Fixes use of colocation constraints with multi-state resources
249
+ pengine: Bug cl#5014 - Prevent asymmetrical order constraints from causing resource stops
250
+ pengine: Bug cl#5000 - Implements ability to create rsc_order constraint sets such that A can start after B or C has started.
251
+ pengine: Correctly migrate a resource that has just migrated
252
+ pengine: Correct return from error path
253
+ pengine: Detect reloads of previously migrated resources
254
+ pengine: Ensure post-migration stop actions occur before node shutdown
255
+ pengine: Log as loudly as possible when we cannot shut down a cluster node
256
+ pengine: Reload of a resource no longer causes a restart of dependant resources
257
+ pengine: Support limiting the number of concurrent live migrations
258
+ pengine: Support referencing templates in constraints
259
+ pengine: Support of referencing resource templates in resource sets
260
+ pengine: Support to make tickets standby for relinquishing tickets gracefully
261
+ stonith: A "start" operation of a stonith resource does a "monitor" on the device beyond registering it
262
+ stonith: Bug rhbz#745526 - Ensure stonith_admin actually gets called by fence_pcmk
263
+ Stonith: Ensure all nodes receive and deliver notifications of the manual override
264
+ stonith: Fix the stonith timeout issue (cl#5009, bnc#727498)
265
+ Stonith: Implement a manual override for when nodes are known to be safely off
266
+ Tools: Bug cl#5003 - Prevent use-after-free in crm_simlate
267
+ Tools: crm_mon - Support to display tickets (based on Yuusuke Iida's work)
268
+ Tools: crm_simulate - Support to grant/revoke/standby/activate tickets from the new ticket state section
269
+ Tools: Implement crm_node functionality for native corosync
270
+ Fix a number of potential problems reported by coverity
90
272
* Wed Aug 31 2011 Andrew Beekhof <andrew@beekhof.net> 1.1.6-1
91
273
- Update source tarball to revision: 676e5f25aa46 tip
94
276
Diff: 1761 files changed, 36259 insertions(+), 140578 deletions(-)
96
278
- Changes since Pacemaker-1.1.5
97
+ High: ais: check for retryable errors when dispatching AIS messages
98
+ High: ais: Correctly disconnect from Corosync and Cman based clusters
99
+ High: ais: Followup to previous patch - Ensure we drain the corosync queue of messages when Glib tells us there is input
100
+ High: ais: Handle IPC error before checking for NULL data (bnc#702907)
101
+ High: cib: Check the validation version before adding the originator details of a CIB change
102
+ High: cib: Remove disconnected remote connections from mainloop
103
+ High: cman: Correctly override existing fenced operations
104
+ High: cman: Dequeue all the cman emitted events and not only the first one leaving the others in the event's queue.
105
+ High: cman: Don't call fenced_join and fenced_leave when notifying cman of a fencing event.
106
+ High: cman: We need to run the crmd as root for CMAN so that we can ACK fencing operations
107
+ High: Core: Cancelled and pending operations do not count as failed
108
+ High: Core: Ensure there is sufficient space for EOS when building short-form option strings
109
+ High: Core: Fix variable expansion in pkg-config files
110
+ High: Core: Partial revert of accidental commit in previous patch
111
+ High: Core: Use dlopen to load heartbeat libraries on-demand
112
+ High: crmd: Bug lf#2509 - Watch for config option changes from the CIB even if we're not the DC
113
+ High: crmd: Bug lf#2528 - Introduce a slight delay when creating a transition to allow attrd time to perform its updates
114
+ High: crmd: Bug lf#2559 - Fail actions that were scheduled for a failed/fenced node
115
+ High: crmd: Bug lf#2584 - Allow nodes to fence themselves if they're the last one standing
116
+ High: crmd: Bug lf#2632 - Correctly handle nodes that return faster than stonith
117
+ High: crmd: Cancel timers for actions that were pending on dead nodes
118
+ High: crmd: Catch fence operations that claim to succeed but did not really
119
+ High: crmd: Do not wait for actions that were pending on dead nodes
120
+ High: crmd: Ensure we do not attempt to perform action on failed nodes
121
+ High: crmd: Prevent use-of-NULL by g_hash_table_iter_next()
122
+ High: crmd: Recurring actions shouldn't cause the last non-recurring action to be forgotten
123
+ High: crmd: Store only the last and last failed operation in the CIB
124
+ High: mcp: dirname() modifies the input path - pass in a copy of the logfile path
125
+ High: mcp: Enable stack detection logic instead of forcing 'corosync'
126
+ High: mcp: Fix spelling mistake in systemd service script that prevents shutdown
127
+ High: mcp: Shut down if corosync becomes unavailable
128
+ High: mcp: systemd control file is now functional
129
+ High: PE: Before migrating an utilization-using resource to a node, take off the load which will no longer run there (lf#2599, bnc#695440)
130
+ High: PE: Before migrating an utilization-using resource to a node, take off the load which will no longer run there (regression tests) (lf#2599, bnc#695440)
131
+ High: PE: Bug lf#2574 - Prevent shuffling by choosing the correct clone instance to stop
132
+ High: PE: Bug lf#2575 - Use uname for migration variables, id is a UUID on heartbeat
133
+ High: PE: Bug lf#2581 - Avoid group restart when clone (re)starts on an unrelated node
134
+ High: PE: Bug lf#2613, lf#2619 - Group migration after failures and non-default utilization policies
135
+ High: PE: Bug suse#707150 - Prevent services being active if dependancies on clones are not satisfied
136
+ High: PE: Correctly recognise which recurring operations are currently active
137
+ High: PE: Demote from Master does not clear previous errors
138
+ High: PE: Ensure restarts due to definition changes cause the start action to be re-issued not probes
139
+ High: PE: Ensure role is preserved for unmanaged resources
140
+ High: PE: Ensure unmanaged resources have the correct role set so the correct monitor operation is chosen
141
+ High: PE: Fix memory leak for re-allocated resources reported by valgrind
142
+ High: PE: Implement cluster ticket and deadman
143
+ High: PE: Implement resource template
144
+ High: pengine: Correctly determine the state of multi-state resources with a partial operation history
145
+ High: PE: Only allocate master/slave resources once
146
+ High: PE: Partial revert of 'Minor code cleanup CS: cf6bca32376c On: 2011-08-15'
147
+ High: PE: Resolve memory leak reported by valgrind
148
+ High: PE: Restore the ability to save inputs to disk
149
+ High: Shell: implement -w,--wait option to wait for the transition to finish
150
+ High: Shell: repair template list command
151
+ High: Shell: set of commands to examine logs, reports, etc
152
+ High: Stonith: Consolidate pcmk_host_map into run_stonith_agent so that it is applied consistently
153
+ High: Stonith: Deprecate pcmk_arg_map for the saner pcmk_host_argument
154
+ High: Stonith: Fix use-of-NULL by g_hash_table_lookup
155
+ High: Stonith: Improved pcmk_host_map parsing
156
+ High: Stonith: Prevent use-of-NULL by g_hash_table_lookup
157
+ High: Stonith: Prevent use-of-NULL when no Linux-HA stonith agents are present
158
+ High: stonith: Add missing entries to stonith_error2string()
159
+ High: Stonith: Correctly finish sending agent options if the initial write is interrupted
160
+ High: stonith: Correctly handle synchronous calls
161
+ High: stonith: Coverity - Correctly construct result list for the query API call
162
+ High: stonith: Coverity - Remove badly constructed memory allocation from the query API call
163
+ High: stonith: Ensure completed operations are recorded as such in the history
164
+ High: Stonith: Ensure device parameters are passed to the daemon during registration
165
+ High: stonith: Fix use-of-NULL in stonith_api_device_list()
166
+ High: stonith: stonith_admin - Prevent use of uninitialized pointer by --history command
167
+ High: Tools: Bug lf#2528 - Make progress when attrd_updater is called repeatedly within the dampen interval but with the same value
168
+ High: Tools: crm_report - Correctly extract data from the local node
169
+ High: Tools: crm_report - Remove newlines when detecting the node list
170
+ High: Tools: crm_report - Repair the ability to extract data from the local machine
171
+ High: Tools: crm_report - Report on all detected backtraces
279
+ ais: check for retryable errors when dispatching AIS messages
280
+ ais: Correctly disconnect from Corosync and Cman based clusters
281
+ ais: Followup to previous patch - Ensure we drain the corosync queue of messages when Glib tells us there is input
282
+ ais: Handle IPC error before checking for NULL data (bnc#702907)
283
+ cib: Check the validation version before adding the originator details of a CIB change
284
+ cib: Remove disconnected remote connections from mainloop
285
+ cman: Correctly override existing fenced operations
286
+ cman: Dequeue all the cman emitted events and not only the first one leaving the others in the event's queue.
287
+ cman: Don't call fenced_join and fenced_leave when notifying cman of a fencing event.
288
+ cman: We need to run the crmd as root for CMAN so that we can ACK fencing operations
289
+ Core: Cancelled and pending operations do not count as failed
290
+ Core: Ensure there is sufficient space for EOS when building short-form option strings
291
+ Core: Fix variable expansion in pkg-config files
292
+ Core: Partial revert of accidental commit in previous patch
293
+ Core: Use dlopen to load heartbeat libraries on-demand
294
+ crmd: Bug lf#2509 - Watch for config option changes from the CIB even if we're not the DC
295
+ crmd: Bug lf#2528 - Introduce a slight delay when creating a transition to allow attrd time to perform its updates
296
+ crmd: Bug lf#2559 - Fail actions that were scheduled for a failed/fenced node
297
+ crmd: Bug lf#2584 - Allow nodes to fence themselves if they're the last one standing
298
+ crmd: Bug lf#2632 - Correctly handle nodes that return faster than stonith
299
+ crmd: Cancel timers for actions that were pending on dead nodes
300
+ crmd: Catch fence operations that claim to succeed but did not really
301
+ crmd: Do not wait for actions that were pending on dead nodes
302
+ crmd: Ensure we do not attempt to perform action on failed nodes
303
+ crmd: Prevent use-of-NULL by g_hash_table_iter_next()
304
+ crmd: Recurring actions shouldn't cause the last non-recurring action to be forgotten
305
+ crmd: Store only the last and last failed operation in the CIB
306
+ mcp: dirname() modifies the input path - pass in a copy of the logfile path
307
+ mcp: Enable stack detection logic instead of forcing 'corosync'
308
+ mcp: Fix spelling mistake in systemd service script that prevents shutdown
309
+ mcp: Shut down if corosync becomes unavailable
310
+ mcp: systemd control file is now functional
311
+ pengine: Before migrating an utilization-using resource to a node, take off the load which will no longer run there (lf#2599, bnc#695440)
312
+ pengine: Before migrating an utilization-using resource to a node, take off the load which will no longer run there (regression tests) (lf#2599, bnc#695440)
313
+ pengine: Bug lf#2574 - Prevent shuffling by choosing the correct clone instance to stop
314
+ pengine: Bug lf#2575 - Use uname for migration variables, id is a UUID on heartbeat
315
+ pengine: Bug lf#2581 - Avoid group restart when clone (re)starts on an unrelated node
316
+ pengine: Bug lf#2613, lf#2619 - Group migration after failures and non-default utilization policies
317
+ pengine: Bug suse#707150 - Prevent services being active if dependancies on clones are not satisfied
318
+ pengine: Correctly recognise which recurring operations are currently active
319
+ pengine: Demote from Master does not clear previous errors
320
+ pengine: Ensure restarts due to definition changes cause the start action to be re-issued not probes
321
+ pengine: Ensure role is preserved for unmanaged resources
322
+ pengine: Ensure unmanaged resources have the correct role set so the correct monitor operation is chosen
323
+ pengine: Fix memory leak for re-allocated resources reported by valgrind
324
+ pengine: Implement cluster ticket and deadman
325
+ pengine: Implement resource template
326
+ pengine: Correctly determine the state of multi-state resources with a partial operation history
327
+ pengine: Only allocate master/slave resources once
328
+ pengine: Partial revert of 'Minor code cleanup CS: cf6bca32376c On: 2011-08-15'
329
+ pengine: Resolve memory leak reported by valgrind
330
+ pengine: Restore the ability to save inputs to disk
331
+ Shell: implement -w,--wait option to wait for the transition to finish
332
+ Shell: repair template list command
333
+ Shell: set of commands to examine logs, reports, etc
334
+ Stonith: Consolidate pcmk_host_map into run_stonith_agent so that it is applied consistently
335
+ Stonith: Deprecate pcmk_arg_map for the saner pcmk_host_argument
336
+ Stonith: Fix use-of-NULL by g_hash_table_lookup
337
+ Stonith: Improved pcmk_host_map parsing
338
+ Stonith: Prevent use-of-NULL by g_hash_table_lookup
339
+ Stonith: Prevent use-of-NULL when no Linux-HA stonith agents are present
340
+ stonith: Add missing entries to stonith_error2string()
341
+ Stonith: Correctly finish sending agent options if the initial write is interrupted
342
+ stonith: Correctly handle synchronous calls
343
+ stonith: Coverity - Correctly construct result list for the query API call
344
+ stonith: Coverity - Remove badly constructed memory allocation from the query API call
345
+ stonith: Ensure completed operations are recorded as such in the history
346
+ Stonith: Ensure device parameters are passed to the daemon during registration
347
+ stonith: Fix use-of-NULL in stonith_api_device_list()
348
+ stonith: stonith_admin - Prevent use of uninitialized pointer by --history command
349
+ Tools: Bug lf#2528 - Make progress when attrd_updater is called repeatedly within the dampen interval but with the same value
350
+ Tools: crm_report - Correctly extract data from the local node
351
+ Tools: crm_report - Remove newlines when detecting the node list
352
+ Tools: crm_report - Repair the ability to extract data from the local machine
353
+ Tools: crm_report - Report on all detected backtraces
173
355
* Fri Feb 11 2011 Andrew Beekhof <andrew@beekhof.net> 1.1.5-1
174
356
- Update source tarball to revision: baad6636a053
177
359
Diff: 605 files changed, 46103 insertions(+), 26417 deletions(-)
179
361
- Changes since Pacemaker-1.1.4
180
+ High: Add the ability to delegate sub-sections of the cluster to non-root users via ACLs
362
+ Add the ability to delegate sub-sections of the cluster to non-root users via ACLs
181
363
Needs to be enabled at compile time, not enabled by default.
182
+ High: ais: Bug lf#2550 - Report failed processes immediately
183
+ High: Core: Prevent recently introduced use-after-free in replace_xml_child()
184
+ High: Core: Reinstate the logic that skips past non-XML_ELEMENT_NODE children
185
+ High: Core: Remove extra calls to xmlCleanupParser resulting in use-after-free
186
+ High: Core: Repair reference to child-of-child after removal of xml_child_iter_filter from get_message_xml()
187
+ High: crmd: Bug lf#2545 - Ensure notify variables are accurate for stop operations
188
+ High: crmd: Cancel recurring operations while we're still connected to the lrmd
189
+ High: crmd: Reschedule the PE_START action if its not already running when we try to use it
190
+ High: crmd: Update failcount for failed promote and demote operations
191
+ High: PE: Bug lf#2445 - Avoid relying on stickness for stable clone placement
192
+ High: PE: Bug lf#2445 - Do not override configured clone stickiness values
193
+ High: PE: Bug lf#2493 - Don't imply colocation requirements when applying ordering constraints with clones
194
+ High: PE: Bug lf#2495 - Prevent segfault by validating the contents of ordering sets
195
+ High: PE: Bug lf#2508 - Correctly reconstruct the status of anonymous cloned groups
196
+ High: PE: Bug lf#2518 - Avoid spamming the logs with errors for orphan resources
197
+ High: PE: Bug lf#2544 - Prevent unstable clone placement by factoring in the current node's score before all others
198
+ High: PE: Bug lf#2554 - target-role alone is not sufficient to promote resources
199
+ High: PE: Correct target_rc for probes of inactive resources (fix regression introduced by cs:ac3f03006e95)
200
+ High: PE: Ensure that fencing has completed for stop actions on stonith-dependent resources (lf#2551)
201
+ High: PE: Only update the node's promotion score if the resource is active there
202
+ High: PE: Only use the promotion score from the current clone instance
203
+ High: PE: Prevent use-of-NULL resulting from variable shadowing spotted by Coverity
204
+ High: PE: Prevent use-of-NULL when there is status for an undefined node
205
+ High: PE: Prevet use-after-free resulting from unintended recursion when chosing a node to promote master/slave resources
206
+ High: Shell: don't create empty optional sections (bnc#665131)
207
+ High: Stonith: Teach stonith_admin to automagically obtain the current node attributes for the target from the CIB
208
+ High: tools: Bug lf#2527 - Prevent use-of-NULL in crm_simulate
209
+ High: Tools: Prevent crm_resource commands from being lost due to the use of cib_scope_local
364
+ ais: Bug lf#2550 - Report failed processes immediately
365
+ Core: Prevent recently introduced use-after-free in replace_xml_child()
366
+ Core: Reinstate the logic that skips past non-XML_ELEMENT_NODE children
367
+ Core: Remove extra calls to xmlCleanupParser resulting in use-after-free
368
+ Core: Repair reference to child-of-child after removal of xml_child_iter_filter from get_message_xml()
369
+ crmd: Bug lf#2545 - Ensure notify variables are accurate for stop operations
370
+ crmd: Cancel recurring operations while we're still connected to the lrmd
371
+ crmd: Reschedule the PE_START action if its not already running when we try to use it
372
+ crmd: Update failcount for failed promote and demote operations
373
+ pengine: Bug lf#2445 - Avoid relying on stickness for stable clone placement
374
+ pengine: Bug lf#2445 - Do not override configured clone stickiness values
375
+ pengine: Bug lf#2493 - Don't imply colocation requirements when applying ordering constraints with clones
376
+ pengine: Bug lf#2495 - Prevent segfault by validating the contents of ordering sets
377
+ pengine: Bug lf#2508 - Correctly reconstruct the status of anonymous cloned groups
378
+ pengine: Bug lf#2518 - Avoid spamming the logs with errors for orphan resources
379
+ pengine: Bug lf#2544 - Prevent unstable clone placement by factoring in the current node's score before all others
380
+ pengine: Bug lf#2554 - target-role alone is not sufficient to promote resources
381
+ pengine: Correct target_rc for probes of inactive resources (fix regression introduced by cs:ac3f03006e95)
382
+ pengine: Ensure that fencing has completed for stop actions on stonith-dependent resources (lf#2551)
383
+ pengine: Only update the node's promotion score if the resource is active there
384
+ pengine: Only use the promotion score from the current clone instance
385
+ pengine: Prevent use-of-NULL resulting from variable shadowing spotted by Coverity
386
+ pengine: Prevent use-of-NULL when there is status for an undefined node
387
+ pengine: Prevet use-after-free resulting from unintended recursion when chosing a node to promote master/slave resources
388
+ Shell: don't create empty optional sections (bnc#665131)
389
+ Stonith: Teach stonith_admin to automagically obtain the current node attributes for the target from the CIB
390
+ tools: Bug lf#2527 - Prevent use-of-NULL in crm_simulate
391
+ Tools: Prevent crm_resource commands from being lost due to the use of cib_scope_local
211
393
* Wed Oct 20 2010 Andrew Beekhof <andrew@beekhof.net> 1.1.4-1
212
394
- Update source tarball to revision: 75406c3eb2c1 tip
217
399
- Changes since Pacemaker-1.1.3
218
400
+ Italian translation of Clusters from Scratch
219
401
+ Significant performance enhancements to the Policy Engine and CIB
220
+ High: cib: Bug lf#2506 - Don't remove client's when notifications fail, they might just be too big
221
+ High: cib: Drop invalid/failed connections from the client hashtable
222
+ High: cib: Ensure all diffs sent to peers have sufficient ordering information
223
+ High: cib: Ensure non-change diffs can preserve the ordering on the other side
224
+ High: cib: Fix the feature set check
225
+ High: cib: Include version information on our synthesised diffs when nothing changed
226
+ High: cib: Optimize the way we detect group/set ordering changes - 15% speedup
227
+ High: cib: Prevent false detection of config updates with the new diff format
228
+ High: cib: Reduce unnecessary copying when comparing xml objects
229
+ High: cib: Repair the processing of updates sent from peer nodes
230
+ High: cib: Revert part of a recent commit that purged still valid connections
231
+ High: cib: The feature set version check is only valid if the current value is non-NULL
232
+ High: Core: Actually removing diff markers is necessary
233
+ High: Core: Bug lf#2506 - Drop the compression limit because Heartbeat's IPC code sucks
234
+ High: Core: Cache Relax-NG schemas - profiling indicates many cycles are wasted needlessly re-parsing them
235
+ High: Core: Correctly compare against crm_log_level in the logging macros
236
+ High: Core: Correctly extract the version details from a diff
237
+ High: Core: Correctly hook up the RNG schema cache
238
+ High: Core: Correctly use lazy_xml_sort() for v2 digests
239
+ High: Core: Don't compress large payload elements unless we're approaching message limits
240
+ High: Core: Don't insert empty ID tags when applying diffs
241
+ High: Core: Enable the improve v2 digests
242
+ High: Core: Ensure ordering is preserved when applying diffs
243
+ High: Core: Fix the CRM_CHECK macro
244
+ High: Core: Modify the v2 digest algorithm so that some fields are sorted
245
+ High: Core: Prevent use-after-free when creating a CIB update for a timed out action
246
+ High: Core: Prevent use-of-NULL when cleaning up RelaxNG data structures
247
+ High: Core: Provide significant performance improvements by implementing versioned diffs and digests
248
+ High: crmd: All pending operations should be recorded, even recurring ones with high start delays
249
+ High: crmd: Don't abort transitions when probes are completed on a node
250
+ High: crmd: Don't hide stop events that time out - allowing faster recovery in the presence of overloaded hosts
251
+ High: crmd: Ensure the CIB is always writable on the DC by removing a timing hole
252
+ High: crmd: Include the correct transition details for timed out operations
253
+ High: crmd: Prevent use of NULL by making copies of the operation's hash table
254
+ High: crmd: There's no need to check the cib version from the 'added' part of diff updates
255
+ High: crmd: Use the supplied timeout for stop actions
256
+ High: mcp: Ensure valgrind is able to log its output somewhere
257
+ High: mcp: Use 99/01 for the start/stop sequence to avoid problems with services (such as libvirtd) started by init - Patch from Vladislav Bogdanov
258
+ High: PE: Ensure fencing of the DC preceeds the STONITH_DONE operation
259
+ High: PE: Fix memory leak introduced as part of the conversion to GHashTables
260
+ High: PE: Fix memory leak when processing completed migration actions
261
+ High: PE: Fix typo leading to use-of-NULL in the new ordering code
262
+ High: PE: Free memory in recently introduced helper function
263
+ High: PE: lf#2478 - Implement improved handling and recovery of atomic resource migrations
264
+ High: PE: Obtain massive speedup by prepending to the list of ordering constraints (which can grow quite large)
265
+ High: PE: Optimize the logic for deciding which non-grouped anonymous clone instances to probe for
266
+ High: PE: Prevent clones from being stopped because resources colocated with them cannot be active
267
+ High: PE: Try to ensure atomic migration ops occur within a single transition
268
+ High: PE: Use hashtables instead of linked lists for performance sensitive datastructures
269
+ High: PE: Use the original digest algorithm for parameter lists
270
+ High: stonith: cleanup children on timeout in fence_legacy
271
+ High: Stonith: Fix two memory leaks
272
+ High: Tools: crm_shadow - Avoid replacing the entire configuration (including status)
402
+ cib: Bug lf#2506 - Don't remove client's when notifications fail, they might just be too big
403
+ cib: Drop invalid/failed connections from the client hashtable
404
+ cib: Ensure all diffs sent to peers have sufficient ordering information
405
+ cib: Ensure non-change diffs can preserve the ordering on the other side
406
+ cib: Fix the feature set check
407
+ cib: Include version information on our synthesised diffs when nothing changed
408
+ cib: Optimize the way we detect group/set ordering changes - 15% speedup
409
+ cib: Prevent false detection of config updates with the new diff format
410
+ cib: Reduce unnecessary copying when comparing xml objects
411
+ cib: Repair the processing of updates sent from peer nodes
412
+ cib: Revert part of a recent commit that purged still valid connections
413
+ cib: The feature set version check is only valid if the current value is non-NULL
414
+ Core: Actually removing diff markers is necessary
415
+ Core: Bug lf#2506 - Drop the compression limit because Heartbeat's IPC code sucks
416
+ Core: Cache Relax-NG schemas - profiling indicates many cycles are wasted needlessly re-parsing them
417
+ Core: Correctly compare against crm_log_level in the logging macros
418
+ Core: Correctly extract the version details from a diff
419
+ Core: Correctly hook up the RNG schema cache
420
+ Core: Correctly use lazy_xml_sort() for v2 digests
421
+ Core: Don't compress large payload elements unless we're approaching message limits
422
+ Core: Don't insert empty ID tags when applying diffs
423
+ Core: Enable the improve v2 digests
424
+ Core: Ensure ordering is preserved when applying diffs
425
+ Core: Fix the CRM_CHECK macro
426
+ Core: Modify the v2 digest algorithm so that some fields are sorted
427
+ Core: Prevent use-after-free when creating a CIB update for a timed out action
428
+ Core: Prevent use-of-NULL when cleaning up RelaxNG data structures
429
+ Core: Provide significant performance improvements by implementing versioned diffs and digests
430
+ crmd: All pending operations should be recorded, even recurring ones with high start delays
431
+ crmd: Don't abort transitions when probes are completed on a node
432
+ crmd: Don't hide stop events that time out - allowing faster recovery in the presence of overloaded hosts
433
+ crmd: Ensure the CIB is always writable on the DC by removing a timing hole
434
+ crmd: Include the correct transition details for timed out operations
435
+ crmd: Prevent use of NULL by making copies of the operation's hash table
436
+ crmd: There's no need to check the cib version from the 'added' part of diff updates
437
+ crmd: Use the supplied timeout for stop actions
438
+ mcp: Ensure valgrind is able to log its output somewhere
439
+ mcp: Use 99/01 for the start/stop sequence to avoid problems with services (such as libvirtd) started by init - Patch from Vladislav Bogdanov
440
+ pengine: Ensure fencing of the DC preceeds the STONITH_DONE operation
441
+ pengine: Fix memory leak introduced as part of the conversion to GHashTables
442
+ pengine: Fix memory leak when processing completed migration actions
443
+ pengine: Fix typo leading to use-of-NULL in the new ordering code
444
+ pengine: Free memory in recently introduced helper function
445
+ pengine: lf#2478 - Implement improved handling and recovery of atomic resource migrations
446
+ pengine: Obtain massive speedup by prepending to the list of ordering constraints (which can grow quite large)
447
+ pengine: Optimize the logic for deciding which non-grouped anonymous clone instances to probe for
448
+ pengine: Prevent clones from being stopped because resources colocated with them cannot be active
449
+ pengine: Try to ensure atomic migration ops occur within a single transition
450
+ pengine: Use hashtables instead of linked lists for performance sensitive datastructures
451
+ pengine: Use the original digest algorithm for parameter lists
452
+ stonith: cleanup children on timeout in fence_legacy
453
+ Stonith: Fix two memory leaks
454
+ Tools: crm_shadow - Avoid replacing the entire configuration (including status)
274
456
* Tue Sep 21 2010 Andrew Beekhof <andrew@beekhof.net> 1.1.3-1
275
457
- Update source tarball to revision: e3bb31c56244 tip
278
460
Diff: 481 files changed, 14130 insertions(+), 11156 deletions(-)
280
462
- Changes since Pacemaker-1.1.2.1
281
+ High: ais: Bug lf#2401 - Improved processing when the peer crmd processes join/leave
282
+ High: ais: Correct the logic for conecting to plugin based clusters
283
+ High: ais: Do not supply a process list in mcp-mode
284
+ High: ais: Drop support for whitetank in the 1.1 release series
285
+ High: ais: Get an initial dump of the node membership when connecting to quorum-based clusters
286
+ High: ais: Guard against saturated cpg connections
287
+ High: ais: Handle CS_ERR_TRY_AGAIN in more cases
288
+ High: ais: Move the code for finding uid before the fork so that the child does no logging
289
+ High: ais: Never allow quorum plugins to affect connection to the pacemaker plugin
290
+ High: ais: Sign everyone up for peer process updates, not just the crmd
291
+ High: ais: The cluster type needs to be set before initializing classic openais connections
292
+ High: cib: Also free query result for xpath operations that return more than one hit
293
+ High: cib: Attempt to resolve memory corruption when forking a child to write the cib to disk
294
+ High: cib: Correctly free memory when writing out the cib to disk
295
+ High: cib: Fix the application of unversioned diffs
296
+ High: cib: Remove old developmental error logging
297
+ High: cib: Restructure the 'valid peer' check for deciding which instructions to ignore
298
+ High: cman: Correctly process membership/quorum changes from the pcmk plugin. Allow other message types through untouched
299
+ High: cman: Filter directed messages not intended for us
300
+ High: cman: Grab the initial membership when we connect
301
+ High: cman: Keep the list of peer processes up-to-date
302
+ High: cman: Make sure our common hooks are called after a cman membership update
303
+ High: cman: Make sure we can compile without cman present
304
+ High: cman: Populate sender details for cpg messages
305
+ High: cman: Update the ringid for cman based clusters
306
+ High: Core: Correctly unpack HA_Messages containing multiple entries with the same name
307
+ High: Core: crm_count_member() should only track nodes that have the full stack up
308
+ High: Core: New developmental logging system inspired by the kernel and a PoC from Lars Ellenberg
309
+ High: crmd: All nodes should see status updates, not just he DC
310
+ High: crmd: Allow non-DC nodes to clear failcounts too
311
+ High: crmd: Base DC election on process relative uptime
312
+ High: crmd: Bug lf#2439 - cancel_op() can also return HA_RSCBUSY
313
+ High: crmd: Bug lf#2439 - Handle asynchronous notification of resource deletion events
314
+ High: crmd: Bug lf#2458 - Ensure stop actions always have the relevant resource attributes
315
+ High: crmd: Disable age as a criteria for cman based clusters, its not reliable enough
316
+ High: crmd: Ensure we activate the DC timer if we detect an alternate DC
317
+ High: crmd: Factor the nanosecond component of process uptime in elections
318
+ High: crmd: Fix assertion failure when performing async resource failures
319
+ High: crmd: Fix handling of async resource deletion results
320
+ High: crmd: Include the action for crm graph operations
321
+ High: crmd: Make sure the membership cache is accurate after a sucessful fencing operation
322
+ High: crmd: Make sure we always poke the FSA after a transition to clear any TE_HALT actions
323
+ High: crmd: Offer crm-level membership once the peer starts the crmd process
324
+ High: crmd: Only need to request quorum update for plugin based clusters
325
+ High: crmd: Prevent assertion failure for stop actions resulting from cs: 3c0bc17c6daf
326
+ High: crmd: Prevent everyone from loosing DC elections by correctly initializing all relevant variables
327
+ High: crmd: Prevent segmentation fault
328
+ High: crmd: several fixes for async resource delete (thanks to beekhof)
329
+ High: crmd: Use the correct define/size for lrm resource IDs
330
+ High: Introduce two new cluster types 'cman' and 'corosync', replaces 'quorum_provider' concept
331
+ High: mcp: Add missing headers when built without heartbeat support
332
+ High: mcp: Correctly initialize the string containing the list of active daemons
333
+ High: mcp: Fix macro expansion in init script
334
+ High: mcp: Fix the expansion of the pid file in the init script
335
+ High: mcp: Handle CS_ERR_TRY_AGAIN when connecting to libcfg
336
+ High: mcp: Make sure we can compile the mcp without cman present
337
+ High: mcp: New master control process for (re)spawning pacemaker daemons
338
+ High: mcp: Read config early so we can re-initialize logging asap if daemonizing
339
+ High: mcp: Rename the mcp binary to pacemakerd and create a 'pacemaker' init script
340
+ High: mcp: Resend our process list after every CPG change
341
+ High: mcp: Tell chkconfig we need to shut down early on
342
+ High: PE: Avoid creating invalid ordering constraints for probes that are not needed
343
+ High: PE: Bug lf#1959 - Fail unmanaged resources should not prevent other services from shutting down
344
+ High: PE: Bug lf#2422 - Ordering dependencies on partially active groups not observed properly
345
+ High: PE: Bug lf#2424 - Use notify oepration definition if it exists in the configuration
346
+ High: PE: Bug lf#2433 - No services should be stopped until probes finish
347
+ High: PE: Bug lf#2453 - Enforce clone ordering in the absense of colocation constraints
348
+ High: PE: Bug lf#2476 - Repair on-fail=block for groups and primitive resources
349
+ High: PE: Correctly detect when there is a real failcount that expired and needs to be cleared
350
+ High: PE: Correctly handle pseudo action creation
351
+ High: PE: Correctly order clone startup after group/clone start
352
+ High: PE: Correct use-after-free introduced in the prior patch
353
+ High: PE: Do not demote resources because something that requires it can not run
354
+ High: PE: Fix colocation for interleaved clones
355
+ High: PE: Fix colocation with partially active groups
356
+ High: PE: Fix potential use-after-free defect from coverity
357
+ High: PE: Fix previous merge
358
+ High: PE: Fix use-after-free in order_actions() reported by valgrind
359
+ High: PE: Make the current data set a global variable so it does not need to be passed around everywhere
360
+ High: PE: Prevent endless loop when looking for operation definitions in the configuration
361
+ High: PE: Prevent segfault by ensuring the arguments to do_calculations() are initialized
362
+ High: PE: Rewrite the ordering constraint logic to be simplicity, clarity and maintainability
363
+ High: PE: Wait until stonith is available, do not fall back to shutdown for nodes requesting termination
364
+ High: Resolve coverity RESOURCE_LEAK defects
365
+ High: Shell: Complete the transition to using crm_attribute instead of crm_failcount and crm_standby
366
+ High: stonith: Advertise stonith-ng options in the metadata
367
+ High: stonith: Bug lf#2461 - Prevent segfault by not looking up operations if the hashtable has not been initialized yet
368
+ High: stonith: Bug lf#2473 - Add the timeout at the top level where the daemon is looking for it
369
+ High: Stonith: Bug lf#2473 - Ensure stonith operations complete within the timeout and are terminated if they run too long
370
+ High: stonith: Bug lf#2473 - Ensure timeouts are included for fencing operations
371
+ High: stonith: Bug lf#2473 - Gracefully handle remote operations that arrive late (after we have done notifications)
372
+ High: stonith: Correctly parse pcmk_host_list parameters that appear on a single line
373
+ High: stonith: Map poweron/poweroff back to on/off expected by the stonith tool from cluster-glue
374
+ High: stonith: pass the configuration to the stonith program via environment variables (bnc#620781)
375
+ High: Stonith: Use the timeout specified by the user
376
+ High: Support starting plugin-based Pacemaker clusters with the MCP as well
377
+ High: Tools: Bug lf#2456 - Fix assertion failure in crm_resource
378
+ High: tools: crm_node - Repair the ability to connect to openais based clusters
379
+ High: tools: crm_node - Use the correct short option for --cman
380
+ High: tools: crm_report - corosync.conf wont necessarily contain the text 'pacemaker' anymore
381
+ High: Tools: crm_simulate - Fix use-after-free in when terminating
382
+ High: tools: crm_simulate - Resolve coverity USE_AFTER_FREE defect
383
+ High: Tools: Drop the 'pingd' daemon and resource agent in favor of ocf:pacemaker:ping
384
+ High: Tools: Fix recently introduced use-of-NULL
385
+ High: Tools: Fix use-after-free defects from coverity
463
+ ais: Bug lf#2401 - Improved processing when the peer crmd processes join/leave
464
+ ais: Correct the logic for conecting to plugin based clusters
465
+ ais: Do not supply a process list in mcp-mode
466
+ ais: Drop support for whitetank in the 1.1 release series
467
+ ais: Get an initial dump of the node membership when connecting to quorum-based clusters
468
+ ais: Guard against saturated cpg connections
469
+ ais: Handle CS_ERR_TRY_AGAIN in more cases
470
+ ais: Move the code for finding uid before the fork so that the child does no logging
471
+ ais: Never allow quorum plugins to affect connection to the pacemaker plugin
472
+ ais: Sign everyone up for peer process updates, not just the crmd
473
+ ais: The cluster type needs to be set before initializing classic openais connections
474
+ cib: Also free query result for xpath operations that return more than one hit
475
+ cib: Attempt to resolve memory corruption when forking a child to write the cib to disk
476
+ cib: Correctly free memory when writing out the cib to disk
477
+ cib: Fix the application of unversioned diffs
478
+ cib: Remove old developmental error logging
479
+ cib: Restructure the 'valid peer' check for deciding which instructions to ignore
480
+ cman: Correctly process membership/quorum changes from the pcmk plugin. Allow other message types through untouched
481
+ cman: Filter directed messages not intended for us
482
+ cman: Grab the initial membership when we connect
483
+ cman: Keep the list of peer processes up-to-date
484
+ cman: Make sure our common hooks are called after a cman membership update
485
+ cman: Make sure we can compile without cman present
486
+ cman: Populate sender details for cpg messages
487
+ cman: Update the ringid for cman based clusters
488
+ Core: Correctly unpack HA_Messages containing multiple entries with the same name
489
+ Core: crm_count_member() should only track nodes that have the full stack up
490
+ Core: New developmental logging system inspired by the kernel and a PoC from Lars Ellenberg
491
+ crmd: All nodes should see status updates, not just he DC
492
+ crmd: Allow non-DC nodes to clear failcounts too
493
+ crmd: Base DC election on process relative uptime
494
+ crmd: Bug lf#2439 - cancel_op() can also return HA_RSCBUSY
495
+ crmd: Bug lf#2439 - Handle asynchronous notification of resource deletion events
496
+ crmd: Bug lf#2458 - Ensure stop actions always have the relevant resource attributes
497
+ crmd: Disable age as a criteria for cman based clusters, its not reliable enough
498
+ crmd: Ensure we activate the DC timer if we detect an alternate DC
499
+ crmd: Factor the nanosecond component of process uptime in elections
500
+ crmd: Fix assertion failure when performing async resource failures
501
+ crmd: Fix handling of async resource deletion results
502
+ crmd: Include the action for crm graph operations
503
+ crmd: Make sure the membership cache is accurate after a sucessful fencing operation
504
+ crmd: Make sure we always poke the FSA after a transition to clear any TE_HALT actions
505
+ crmd: Offer crm-level membership once the peer starts the crmd process
506
+ crmd: Only need to request quorum update for plugin based clusters
507
+ crmd: Prevent assertion failure for stop actions resulting from cs: 3c0bc17c6daf
508
+ crmd: Prevent everyone from loosing DC elections by correctly initializing all relevant variables
509
+ crmd: Prevent segmentation fault
510
+ crmd: several fixes for async resource delete (thanks to beekhof)
511
+ crmd: Use the correct define/size for lrm resource IDs
512
+ Introduce two new cluster types 'cman' and 'corosync', replaces 'quorum_provider' concept
513
+ mcp: Add missing headers when built without heartbeat support
514
+ mcp: Correctly initialize the string containing the list of active daemons
515
+ mcp: Fix macro expansion in init script
516
+ mcp: Fix the expansion of the pid file in the init script
517
+ mcp: Handle CS_ERR_TRY_AGAIN when connecting to libcfg
518
+ mcp: Make sure we can compile the mcp without cman present
519
+ mcp: New master control process for (re)spawning pacemaker daemons
520
+ mcp: Read config early so we can re-initialize logging asap if daemonizing
521
+ mcp: Rename the mcp binary to pacemakerd and create a 'pacemaker' init script
522
+ mcp: Resend our process list after every CPG change
523
+ mcp: Tell chkconfig we need to shut down early on
524
+ pengine: Avoid creating invalid ordering constraints for probes that are not needed
525
+ pengine: Bug lf#1959 - Fail unmanaged resources should not prevent other services from shutting down
526
+ pengine: Bug lf#2422 - Ordering dependencies on partially active groups not observed properly
527
+ pengine: Bug lf#2424 - Use notify oepration definition if it exists in the configuration
528
+ pengine: Bug lf#2433 - No services should be stopped until probes finish
529
+ pengine: Bug lf#2453 - Enforce clone ordering in the absense of colocation constraints
530
+ pengine: Bug lf#2476 - Repair on-fail=block for groups and primitive resources
531
+ pengine: Correctly detect when there is a real failcount that expired and needs to be cleared
532
+ pengine: Correctly handle pseudo action creation
533
+ pengine: Correctly order clone startup after group/clone start
534
+ pengine: Correct use-after-free introduced in the prior patch
535
+ pengine: Do not demote resources because something that requires it can not run
536
+ pengine: Fix colocation for interleaved clones
537
+ pengine: Fix colocation with partially active groups
538
+ pengine: Fix potential use-after-free defect from coverity
539
+ pengine: Fix previous merge
540
+ pengine: Fix use-after-free in order_actions() reported by valgrind
541
+ pengine: Make the current data set a global variable so it does not need to be passed around everywhere
542
+ pengine: Prevent endless loop when looking for operation definitions in the configuration
543
+ pengine: Prevent segfault by ensuring the arguments to do_calculations() are initialized
544
+ pengine: Rewrite the ordering constraint logic to be simplicity, clarity and maintainability
545
+ pengine: Wait until stonith is available, do not fall back to shutdown for nodes requesting termination
546
+ Resolve coverity RESOURCE_LEAK defects
547
+ Shell: Complete the transition to using crm_attribute instead of crm_failcount and crm_standby
548
+ stonith: Advertise stonith-ng options in the metadata
549
+ stonith: Bug lf#2461 - Prevent segfault by not looking up operations if the hashtable has not been initialized yet
550
+ stonith: Bug lf#2473 - Add the timeout at the top level where the daemon is looking for it
551
+ Stonith: Bug lf#2473 - Ensure stonith operations complete within the timeout and are terminated if they run too long
552
+ stonith: Bug lf#2473 - Ensure timeouts are included for fencing operations
553
+ stonith: Bug lf#2473 - Gracefully handle remote operations that arrive late (after we have done notifications)
554
+ stonith: Correctly parse pcmk_host_list parameters that appear on a single line
555
+ stonith: Map poweron/poweroff back to on/off expected by the stonith tool from cluster-glue
556
+ stonith: pass the configuration to the stonith program via environment variables (bnc#620781)
557
+ Stonith: Use the timeout specified by the user
558
+ Support starting plugin-based Pacemaker clusters with the MCP as well
559
+ Tools: Bug lf#2456 - Fix assertion failure in crm_resource
560
+ tools: crm_node - Repair the ability to connect to openais based clusters
561
+ tools: crm_node - Use the correct short option for --cman
562
+ tools: crm_report - corosync.conf wont necessarily contain the text 'pacemaker' anymore
563
+ Tools: crm_simulate - Fix use-after-free in when terminating
564
+ tools: crm_simulate - Resolve coverity USE_AFTER_FREE defect
565
+ Tools: Drop the 'pingd' daemon and resource agent in favor of ocf:pacemaker:ping
566
+ Tools: Fix recently introduced use-of-NULL
567
+ Tools: Fix use-after-free defects from coverity
387
569
* Wed May 12 2010 Andrew Beekhof <andrew@beekhof.net> 1.1.2-1
388
570
- Update source tarball to revision: c25c972a25cc tip
391
573
Diff: 708 files changed, 37918 insertions(+), 10584 deletions(-)
392
574
- Changes since Pacemaker-1.1.1
393
+ High: ais: Do not count votes from offline nodes and calculate current votes before sending quorum data
394
+ High: ais: Ensure the list of active processes sent to clients is always up-to-date
395
+ High: ais: Look for the correct conf variable for turning on file logging
396
+ High: ais: Need to find a better and thread-safe way to set core_uses_pid. Disable for now.
397
+ High: ais: Use the threadsafe version of getpwnam
398
+ High: Core: Bump the feature set due to the new failcount expiry feature
399
+ High: Core: fix memory leaks exposed by valgrind
400
+ High: Core: Bug lf#2414 - Prevent use-after-free reported by valgrind when doing xpath based deletions
401
+ High: crmd: Bug lf#2414 - Prevent use-after-free of the PE connection after it dies
402
+ High: crmd: Bug lf#2414 - Prevent use-after-free of the stonith-ng connection
403
+ High: crmd: Bug lf#2401 - Improved detection of partially active peers
404
+ High: crmd: Bug lf#2379 - Ensure the cluster terminates when the PE is not available
405
+ High: crmd: Do not allow the target_rc to be misused by resource agents
406
+ High: crmd: Do not ignore action timeouts based on FSA state
407
+ High: crmd: Ensure we dont get stuck in S_PENDING if we loose an election to someone that never talks to us again
408
+ High: crmd: Fix memory leaks exposed by valgrind
409
+ High: crmd: Remove race condition that could lead to multiple instances of a clone being active on a machine
410
+ High: crmd: Send erase_status_tag() calls to the local CIB when the DC is fenced, since there is no DC to accept them
411
+ High: crmd: Use global fencing notifications to prevent secondary fencing operations of the DC
412
+ High: PE: Bug lf#2317 - Avoid needless restart of primitive depending on a clone
413
+ High: PE: Bug lf#2361 - Ensure clones observe mandatory ordering constraints if the LHS is unrunnable
414
+ High: PE: Bug lf#2383 - Combine failcounts for all instances of an anonymous clone on a host
415
+ High: PE: Bug lf#2384 - Fix intra-set colocation and ordering
416
+ High: PE: Bug lf#2403 - Enforce mandatory promotion (colocation) constraints
417
+ High: PE: Bug lf#2412 - Correctly find clone instances by their prefix
418
+ High: PE: Do not be so quick to pull the trigger on nodes that are coming up
419
+ High: PE: Fix memory leaks exposed by valgrind
420
+ High: PE: Rewrite native_merge_weights() to avoid Fix use-after-free
421
+ High: Shell: Bug bnc#590035 - always reload status if working with the cluster
422
+ High: Shell: Bug bnc#592762 - Default to using the status section from the live CIB
423
+ High: Shell: Bug lf#2315 - edit multiple meta_attributes sets in resource management
424
+ High: Shell: Bug lf#2221 - enable comments
425
+ High: Shell: Bug bnc#580492 - implement new cibstatus interface and commands
426
+ High: Shell: Bug bnc#585471 - new cibstatus import command
427
+ High: Shell: check timeouts also against the default-action-timeout property
428
+ High: Shell: new configure filter command
429
+ High: Tools: crm_mon - fix memory leaks exposed by valgrind
575
+ ais: Do not count votes from offline nodes and calculate current votes before sending quorum data
576
+ ais: Ensure the list of active processes sent to clients is always up-to-date
577
+ ais: Look for the correct conf variable for turning on file logging
578
+ ais: Need to find a better and thread-safe way to set core_uses_pid. Disable for now.
579
+ ais: Use the threadsafe version of getpwnam
580
+ Core: Bump the feature set due to the new failcount expiry feature
581
+ Core: fix memory leaks exposed by valgrind
582
+ Core: Bug lf#2414 - Prevent use-after-free reported by valgrind when doing xpath based deletions
583
+ crmd: Bug lf#2414 - Prevent use-after-free of the PE connection after it dies
584
+ crmd: Bug lf#2414 - Prevent use-after-free of the stonith-ng connection
585
+ crmd: Bug lf#2401 - Improved detection of partially active peers
586
+ crmd: Bug lf#2379 - Ensure the cluster terminates when the PE is not available
587
+ crmd: Do not allow the target_rc to be misused by resource agents
588
+ crmd: Do not ignore action timeouts based on FSA state
589
+ crmd: Ensure we dont get stuck in S_PENDING if we loose an election to someone that never talks to us again
590
+ crmd: Fix memory leaks exposed by valgrind
591
+ crmd: Remove race condition that could lead to multiple instances of a clone being active on a machine
592
+ crmd: Send erase_status_tag() calls to the local CIB when the DC is fenced, since there is no DC to accept them
593
+ crmd: Use global fencing notifications to prevent secondary fencing operations of the DC
594
+ pengine: Bug lf#2317 - Avoid needless restart of primitive depending on a clone
595
+ pengine: Bug lf#2361 - Ensure clones observe mandatory ordering constraints if the LHS is unrunnable
596
+ pengine: Bug lf#2383 - Combine failcounts for all instances of an anonymous clone on a host
597
+ pengine: Bug lf#2384 - Fix intra-set colocation and ordering
598
+ pengine: Bug lf#2403 - Enforce mandatory promotion (colocation) constraints
599
+ pengine: Bug lf#2412 - Correctly find clone instances by their prefix
600
+ pengine: Do not be so quick to pull the trigger on nodes that are coming up
601
+ pengine: Fix memory leaks exposed by valgrind
602
+ pengine: Rewrite native_merge_weights() to avoid Fix use-after-free
603
+ Shell: Bug bnc#590035 - always reload status if working with the cluster
604
+ Shell: Bug bnc#592762 - Default to using the status section from the live CIB
605
+ Shell: Bug lf#2315 - edit multiple meta_attributes sets in resource management
606
+ Shell: Bug lf#2221 - enable comments
607
+ Shell: Bug bnc#580492 - implement new cibstatus interface and commands
608
+ Shell: Bug bnc#585471 - new cibstatus import command
609
+ Shell: check timeouts also against the default-action-timeout property
610
+ Shell: new configure filter command
611
+ Tools: crm_mon - fix memory leaks exposed by valgrind
431
613
* Tue Feb 16 2010 Andrew Beekhof <andrew@beekhof.net> - 1.1.1-1
432
614
- First public release of Pacemaker 1.1
454
636
Diff: 220 files changed, 15933 insertions(+), 8782 deletions(-)
455
637
- Changes since 1.0.5-4
456
+ High: PE: Bug 2213 - Ensure groups process location constraints so that clone-node-max works for cloned groups
457
+ High: PE: Bug lf#2153 - non-clones should not restart when clones stop/start on other nodes
458
+ High: PE: Bug lf#2209 - Clone ordering should be able to prevent startup of dependant clones
459
+ High: PE: Bug lf#2216 - Correctly identify the state of anonymous clones when deciding when to probe
460
+ High: PE: Bug lf#2225 - Operations that require fencing should wait for 'stonith_complete' not 'all_stopped'.
461
+ High: PE: Bug lf#2225 - Prevent clone peers from stopping while another is instance is (potentially) being fenced
462
+ High: PE: Correctly anti-colocate with a group
463
+ High: PE: Correctly unpack ordering constraints for resource sets to avoid graph loops
464
+ High: Tools: crm: load help from crm_cli.txt
465
+ High: Tools: crm: resource sets (bnc#550923)
466
+ High: Tools: crm: support for comments (LF 2221)
467
+ High: Tools: crm: support for description attribute in resources/operations (bnc#548690)
468
+ High: Tools: hb2openais: add EVMS2 CSM processing (and other changes) (bnc#548093)
469
+ High: Tools: hb2openais: do not allow empty rules, clones, or groups (LF 2215)
470
+ High: Tools: hb2openais: refuse to convert pure EVMS volumes
471
+ High: cib: Ensure the loop for login message terminates
472
+ High: cib: Finally fix reliability of receiving large messages over remote plaintext connections
473
+ High: cib: Fix remote notifications
474
+ High: cib: For remote connections, default to CRM_DAEMON_USER since thats the only one that the cib can validate the password for using PAM
475
+ High: cib: Remote plaintext - Retry sending parts of the message that did not fit the first time
476
+ High: crmd: Ensure batch-limit is correctly enforced
477
+ High: crmd: Ensure we have the latest status after a transition abort
478
+ High (bnc#547579,547582): Tools: crm: status section editing support
479
+ High: shell: Add allow-migrate as allowed meta-attribute (bnc#539968)
638
+ pengine: Bug 2213 - Ensure groups process location constraints so that clone-node-max works for cloned groups
639
+ pengine: Bug lf#2153 - non-clones should not restart when clones stop/start on other nodes
640
+ pengine: Bug lf#2209 - Clone ordering should be able to prevent startup of dependant clones
641
+ pengine: Bug lf#2216 - Correctly identify the state of anonymous clones when deciding when to probe
642
+ pengine: Bug lf#2225 - Operations that require fencing should wait for 'stonith_complete' not 'all_stopped'.
643
+ pengine: Bug lf#2225 - Prevent clone peers from stopping while another is instance is (potentially) being fenced
644
+ pengine: Correctly anti-colocate with a group
645
+ pengine: Correctly unpack ordering constraints for resource sets to avoid graph loops
646
+ Tools: crm: load help from crm_cli.txt
647
+ Tools: crm: resource sets (bnc#550923)
648
+ Tools: crm: support for comments (LF 2221)
649
+ Tools: crm: support for description attribute in resources/operations (bnc#548690)
650
+ Tools: hb2openais: add EVMS2 CSM processing (and other changes) (bnc#548093)
651
+ Tools: hb2openais: do not allow empty rules, clones, or groups (LF 2215)
652
+ Tools: hb2openais: refuse to convert pure EVMS volumes
653
+ cib: Ensure the loop for login message terminates
654
+ cib: Finally fix reliability of receiving large messages over remote plaintext connections
655
+ cib: Fix remote notifications
656
+ cib: For remote connections, default to CRM_DAEMON_USER since thats the only one that the cib can validate the password for using PAM
657
+ cib: Remote plaintext - Retry sending parts of the message that did not fit the first time
658
+ crmd: Ensure batch-limit is correctly enforced
659
+ crmd: Ensure we have the latest status after a transition abort
660
+ (bnc#547579,547582): Tools: crm: status section editing support
661
+ shell: Add allow-migrate as allowed meta-attribute (bnc#539968)
480
662
+ Medium: Build: Do not automatically add -L/lib, it could cause 64-bit arches to break
481
+ Medium: PE: Bug lf#2206 - rsc_order constraints always use score at the top level
482
+ Medium: PE: Only complain about target-role=master for non m/s resources
483
+ Medium: PE: Prevent non-multistate resources from being promoted through target-role
484
+ Medium: PE: Provide a default action for resource-set ordering
485
+ Medium: PE: Silently fix requires=fencing for stonith resources so that it can be set in op_defaults
663
+ Medium: pengine: Bug lf#2206 - rsc_order constraints always use score at the top level
664
+ Medium: pengine: Only complain about target-role=master for non m/s resources
665
+ Medium: pengine: Prevent non-multistate resources from being promoted through target-role
666
+ Medium: pengine: Provide a default action for resource-set ordering
667
+ Medium: pengine: Silently fix requires=fencing for stonith resources so that it can be set in op_defaults
486
668
+ Medium: Tools: Bug lf#2286 - Allow the shell to accept template parameters on the command line
487
669
+ Medium: Tools: Bug lf#2307 - Provide a way to determin the nodeid of past cluster members
488
670
+ Medium: Tools: crm: add update method to template apply (LF 2289)
523
705
- Exclude documentation that is included elsewhere in the package
525
707
- Update the tarball from upstream to version ee19d8e83c2a
526
+ High: cib: Correctly clean up when both plaintext and tls remote ports are requested
527
+ High: PE: Bug bnc#515172 - Provide better defaults for lt(e) and gt(e) comparisions
528
+ High: PE: Bug lf#2197 - Allow master instances placemaker to be influenced by colocation constraints
529
+ High: PE: Make sure promote/demote pseudo actions are created correctly
530
+ High: PE: Prevent target-role from promoting more than master-max instances
531
+ High: ais: Bug lf#2199 - Prevent expected-quorum-votes from being populated with garbage
532
+ High: ais: Prevent deadlock - dont try to release IPC message if the connection failed
533
+ High: cib: For validation errors, send back the full CIB so the client can display the errors
534
+ High: cib: Prevent use-after-free for remote plaintext connections
535
+ High: crmd: Bug lf#2201 - Prevent use-of-NULL when running heartbeat
708
+ cib: Correctly clean up when both plaintext and tls remote ports are requested
709
+ pengine: Bug bnc#515172 - Provide better defaults for lt(e) and gt(e) comparisions
710
+ pengine: Bug lf#2197 - Allow master instances placemaker to be influenced by colocation constraints
711
+ pengine: Make sure promote/demote pseudo actions are created correctly
712
+ pengine: Prevent target-role from promoting more than master-max instances
713
+ ais: Bug lf#2199 - Prevent expected-quorum-votes from being populated with garbage
714
+ ais: Prevent deadlock - dont try to release IPC message if the connection failed
715
+ cib: For validation errors, send back the full CIB so the client can display the errors
716
+ cib: Prevent use-after-free for remote plaintext connections
717
+ crmd: Bug lf#2201 - Prevent use-of-NULL when running heartbeat
537
719
* Wed Oct 13 2009 Andrew Beekhof <andrew@beekhof.net> - 1.0.5-3
538
720
- Update the tarball from upstream to version 38cd629e5c3c
539
+ High: Core: Bug lf#2169 - Allow dtd/schema validation to be disabled
540
+ High: PE: Bug lf#2106 - Not all anonymous clone children are restarted after configuration change
541
+ High: PE: Bug lf#2170 - stop-all-resources option had no effect
542
+ High: PE: Bug lf#2171 - Prevent groups from starting if they depend on a complex resource which can not
543
+ High: PE: Disable resource management if stonith-enabled=true and no stonith resources are defined
544
+ High: PE: do not include master score if it would prevent allocation
545
+ High: ais: Avoid excessive load by checking for dead children every 1s (instead of 100ms)
546
+ High: ais: Bug rh#525589 - Prevent shutdown deadlocks when running on CoroSync
547
+ High: ais: Gracefully handle changes to the AIS nodeid
548
+ High: crmd: Bug bnc#527530 - Wait for the transition to complete before leaving S_TRANSITION_ENGINE
549
+ High: crmd: Prevent use-after-free with LOG_DEBUG_3
721
+ Core: Bug lf#2169 - Allow dtd/schema validation to be disabled
722
+ pengine: Bug lf#2106 - Not all anonymous clone children are restarted after configuration change
723
+ pengine: Bug lf#2170 - stop-all-resources option had no effect
724
+ pengine: Bug lf#2171 - Prevent groups from starting if they depend on a complex resource which can not
725
+ pengine: Disable resource management if stonith-enabled=true and no stonith resources are defined
726
+ pengine: do not include master score if it would prevent allocation
727
+ ais: Avoid excessive load by checking for dead children every 1s (instead of 100ms)
728
+ ais: Bug rh#525589 - Prevent shutdown deadlocks when running on CoroSync
729
+ ais: Gracefully handle changes to the AIS nodeid
730
+ crmd: Bug bnc#527530 - Wait for the transition to complete before leaving S_TRANSITION_ENGINE
731
+ crmd: Prevent use-after-free with LOG_DEBUG_3
550
732
+ Medium: xml: Mask the "symmetrical" attribute on rsc_colocation constraints (bnc#540672)
551
733
+ Medium (bnc#520707): Tools: crm: new templates ocfs2 and clvm
552
734
+ Medium: Build: Invert the disable ais/heartbeat logic so that --without (ais|heartbeat) is available to rpmbuild
553
+ Medium: PE: Bug lf#2178 - Indicate unmanaged clones
554
+ Medium: PE: Bug lf#2180 - Include node information for all failed ops
555
+ Medium: PE: Bug lf#2189 - Incorrect error message when unpacking simple ordering constraint
556
+ Medium: PE: Correctly log resources that would like to start but can not
557
+ Medium: PE: Stop ptest from logging to syslog
735
+ Medium: pengine: Bug lf#2178 - Indicate unmanaged clones
736
+ Medium: pengine: Bug lf#2180 - Include node information for all failed ops
737
+ Medium: pengine: Bug lf#2189 - Incorrect error message when unpacking simple ordering constraint
738
+ Medium: pengine: Correctly log resources that would like to start but can not
739
+ Medium: pengine: Stop ptest from logging to syslog
558
740
+ Medium: ais: Include version details in plugin name
559
741
+ Medium: crmd: Requery the resource metadata after every start operation
623
805
Diff: 266 files changed, 12010 insertions(+), 8276 deletions(-)
624
806
- Changes since Pacemaker-1.0.3
625
+ High (bnc#488291): ais: do not rely on byte endianness on ptr cast
626
+ High (bnc#507255): Tools: crm: delete rsc/op_defaults (these meta_attributes are killing me)
627
+ High (bnc#507255): Tools: crm: import properly rsc/op_defaults
628
+ High (LF 2114): Tools: crm: add support for operation instance attributes
629
+ High: ais: Bug lf#2126 - Messages replies cannot be routed to transient clients
630
+ High: ais: Fix compilation for the latest Corosync API (v1719)
631
+ High: attrd: Do not perform all updates as complete refreshes
632
+ High: cib: Fix huge memory leak affecting heartbeat-based clusters
633
+ High: Core: Allow xpath queries to match attributes
634
+ High: Core: Generate the help text directly from a tool options struct
635
+ High: Core: Handle differences in 0.6 messaging format
636
+ High: crmd: Bug lf#2120 - All transient node attribute updates need to go via attrd
637
+ High: crmd: Correctly calculate how long an FSA action took to avoid spamming the logs with errors
638
+ High: crmd: Fix another large memory leak affecting Heartbeat based clusters
639
+ High: lha: Restore compatability with older versions
640
+ High: PE: Bug bnc#495687 - Filesystem is not notified of successful STONITH under some conditions
641
+ High: PE: Make running a cluster with STONITH enabled but no STONITH resources an error and provide details on resolutions
642
+ High: PE: Prevent use-ofNULL when using resource ordering sets
643
+ High: PE: Provide inter-notification ordering guarantees
644
+ High: PE: Rewrite the notification code to be understanable and extendable
645
+ High: Tools: attrd - Prevent race condition resulting in the cluster forgetting the node wishes to shut down
646
+ High: Tools: crm: regression tests
647
+ High: Tools: crm_mon - Fix smtp notifications
648
+ High: Tools: crm_resource - Repair the ability to query meta attributes
807
+ (bnc#488291): ais: do not rely on byte endianness on ptr cast
808
+ (bnc#507255): Tools: crm: delete rsc/op_defaults (these meta_attributes are killing me)
809
+ (bnc#507255): Tools: crm: import properly rsc/op_defaults
810
+ (LF 2114): Tools: crm: add support for operation instance attributes
811
+ ais: Bug lf#2126 - Messages replies cannot be routed to transient clients
812
+ ais: Fix compilation for the latest Corosync API (v1719)
813
+ attrd: Do not perform all updates as complete refreshes
814
+ cib: Fix huge memory leak affecting heartbeat-based clusters
815
+ Core: Allow xpath queries to match attributes
816
+ Core: Generate the help text directly from a tool options struct
817
+ Core: Handle differences in 0.6 messaging format
818
+ crmd: Bug lf#2120 - All transient node attribute updates need to go via attrd
819
+ crmd: Correctly calculate how long an FSA action took to avoid spamming the logs with errors
820
+ crmd: Fix another large memory leak affecting Heartbeat based clusters
821
+ lha: Restore compatability with older versions
822
+ pengine: Bug bnc#495687 - Filesystem is not notified of successful STONITH under some conditions
823
+ pengine: Make running a cluster with STONITH enabled but no STONITH resources an error and provide details on resolutions
824
+ pengine: Prevent use-ofNULL when using resource ordering sets
825
+ pengine: Provide inter-notification ordering guarantees
826
+ pengine: Rewrite the notification code to be understanable and extendable
827
+ Tools: attrd - Prevent race condition resulting in the cluster forgetting the node wishes to shut down
828
+ Tools: crm: regression tests
829
+ Tools: crm_mon - Fix smtp notifications
830
+ Tools: crm_resource - Repair the ability to query meta attributes
649
831
+ Low Build: Bug lf#2105 - Debian package should contain pacemaker doc and crm templates
650
832
+ Medium (bnc#507255): Tools: crm: handle empty rsc/op_defaults properly
651
833
+ Medium (bnc#507255): Tools: crm: use the right obj_type when creating objects from xml nodes
677
859
Diff: 329 files changed, 15471 insertions(+), 15119 deletions(-)
678
860
- Changes since Pacemaker-1.0.2
679
861
+ Added tag SLE11-HAE-GMC for changeset 9196be9830c2
680
+ High: ais plugin: Fix quorum calculation (bnc#487003)
681
+ High: ais: Another memory fix leak in error path
682
+ High: ais: Bug bnc#482847, bnc#482905 - Force a clean exit of OpenAIS once Pacemaker has finished unloading
683
+ High: ais: Bug bnc#486858 - Fix update_member() to prevent spamming clients with membership events containing no changes
684
+ High: ais: Centralize all quorum calculations in the ais plugin and allow expected votes to be configured int he cib
685
+ High: ais: Correctly handle a return value of zero from openais_dispatch_recv()
686
+ High: ais: Disable logging to a file
687
+ High: ais: Fix memory leak in error path
688
+ High: ais: IPC messages are only in scope until a response is sent
689
+ High: All signal handlers used with CL_SIGNAL() need to be as minimal as possible
690
+ High: cib: Bug bnc#482885 - Simplify CIB disk-writes to prevent data loss. Required a change to the backup filename format
691
+ High: cib: crmd: Revert part of 9782ab035003. Complex shutdown routines need G_main_add_SignalHandler to avoid race coditions
692
+ High: crm: Avoid infinite loop during crm configure edit (bnc#480327)
693
+ High: crmd: Avoid a race condition by waiting for the attrd update to trigger a transition automatically
694
+ High: crmd: Bug bnc#480977 - Prevent extra, partial, shutdown when a node restarts too quickly
695
+ High: crmd: Bug bnc#480977 - Prevent extra, partial, shutdown when a node restarts too quickly (verified)
696
+ High: crmd: Bug bnc#489063 - Ensure the DC is always unset after we 'loose' an election
697
+ High: crmd: Bug BSC#479543 - Correctly find the migration source for timed out migrate_from actions
698
+ High: crmd: Call crm_peer_init() before we start the FSA - prevents a race condition when used with Heartbeat
699
+ High: crmd: Erasing the status section should not be forced to the local node
700
+ High: crmd: Fix memory leak in cib notication processing code
701
+ High: crmd: Fix memory leak in transition graph processing
702
+ High: crmd: Fix memory leaks found by valgrind
703
+ High: crmd: More memory leaks fixes found by valgrind
704
+ High: fencing: stonithd: is_heartbeat_cluster is a no-no if there is no heartbeat support
705
+ High: PE: Bug bnc#466788 - Exclude nodes that can not run resources
706
+ High: PE: Bug bnc#466788 - Make colocation based on node attributes work
707
+ High: PE: Bug BNC#478687 - Do not crash when clone-max is 0
708
+ High: PE: Bug bnc#488721 - Fix id-ref expansion for clones, the doc-root for clone children is not the cib root
709
+ High: PE: Bug bnc#490418 - Correctly determine node state for nodes wishing to be terminated
710
+ High: PE: Bug LF#2087 - Correctly parse the state of anonymous clones that have multiple instances on a given node
711
+ High: PE: Bug lf#2089 - Meta attributes are not inherited by clone children
712
+ High: PE: Bug lf#2091 - Correctly restart modified resources that were found active by a probe
713
+ High: PE: Bug lf#2094 - Fix probe ordering for cloned groups
714
+ High: PE: Bug LF:2075 - Fix large pingd memory leaks
715
+ High: PE: Correctly attach orphaned clone children to their parent
716
+ High: PE: Correctly handle terminate node attributes that are set to the output from time()
717
+ High: PE: Ensure orphaned clone members are hooked up to the parent when clone-max=0
718
+ High: PE: Fix memory leak in LogActions
719
+ High: PE: Fix the determination of whether a group is active
720
+ High: PE: Look up the correct promotion preference for anonymous masters
721
+ High: PE: Simplify handling of start failures by changing the default migration-threshold to INFINITY
722
+ High: PE: The ordered option for clones no longer causes extra start/stop operations
723
+ High: RA: Bug bnc#490641 - Shut down dlm_controld with -TERM instead of -KILL
724
+ High: RA: pingd: Set default ping interval to 1 instead of 0 seconds
725
+ High: Resources: pingd - Correctly tell the ping daemon to shut down
726
+ High: Tools: Bug bnc#483365 - Ensure the command from cluster_test includes a value for --log-facility
727
+ High: Tools: cli: fix and improve delete command
728
+ High: Tools: crm: add and implement templates
729
+ High: Tools: crm: add support for command aliases and some common commands (i.e. cd,exit)
730
+ High: Tools: crm: create top configuration nodes if they are missing
731
+ High: Tools: crm: fix parsing attributes for rules (broken by the previous changeset)
732
+ High: Tools: crm: new ra set of commands
733
+ High: Tools: crm: resource agents information management
734
+ High: Tools: crm: rsc/op_defaults
735
+ High: Tools: crm: support for no value attribute in nvpairs
736
+ High: Tools: crm: the new configure monitor command
737
+ High: Tools: crm: the new configure node command
738
+ High: Tools: crm_mon - Prevent use-of-NULL when summarizing an orphan
739
+ High: Tools: hb2openais: create clvmd clone for respawn evmsd in ha.cf
740
+ High: Tools: hb2openais: fix a serious recursion bug in xml node processing
741
+ High: Tools: hb2openais: fix ocfs2 processing
742
+ High: Tools: pingd - prevent double free of getaddrinfo() output in error path
743
+ High: Tools: The default re-ping interval for pingd should be 1s not 1ms
862
+ ais plugin: Fix quorum calculation (bnc#487003)
863
+ ais: Another memory fix leak in error path
864
+ ais: Bug bnc#482847, bnc#482905 - Force a clean exit of OpenAIS once Pacemaker has finished unloading
865
+ ais: Bug bnc#486858 - Fix update_member() to prevent spamming clients with membership events containing no changes
866
+ ais: Centralize all quorum calculations in the ais plugin and allow expected votes to be configured int he cib
867
+ ais: Correctly handle a return value of zero from openais_dispatch_recv()
868
+ ais: Disable logging to a file
869
+ ais: Fix memory leak in error path
870
+ ais: IPC messages are only in scope until a response is sent
871
+ All signal handlers used with CL_SIGNAL() need to be as minimal as possible
872
+ cib: Bug bnc#482885 - Simplify CIB disk-writes to prevent data loss. Required a change to the backup filename format
873
+ cib: crmd: Revert part of 9782ab035003. Complex shutdown routines need G_main_add_SignalHandler to avoid race coditions
874
+ crm: Avoid infinite loop during crm configure edit (bnc#480327)
875
+ crmd: Avoid a race condition by waiting for the attrd update to trigger a transition automatically
876
+ crmd: Bug bnc#480977 - Prevent extra, partial, shutdown when a node restarts too quickly
877
+ crmd: Bug bnc#480977 - Prevent extra, partial, shutdown when a node restarts too quickly (verified)
878
+ crmd: Bug bnc#489063 - Ensure the DC is always unset after we 'loose' an election
879
+ crmd: Bug BSC#479543 - Correctly find the migration source for timed out migrate_from actions
880
+ crmd: Call crm_peer_init() before we start the FSA - prevents a race condition when used with Heartbeat
881
+ crmd: Erasing the status section should not be forced to the local node
882
+ crmd: Fix memory leak in cib notication processing code
883
+ crmd: Fix memory leak in transition graph processing
884
+ crmd: Fix memory leaks found by valgrind
885
+ crmd: More memory leaks fixes found by valgrind
886
+ fencing: stonithd: is_heartbeat_cluster is a no-no if there is no heartbeat support
887
+ pengine: Bug bnc#466788 - Exclude nodes that can not run resources
888
+ pengine: Bug bnc#466788 - Make colocation based on node attributes work
889
+ pengine: Bug BNC#478687 - Do not crash when clone-max is 0
890
+ pengine: Bug bnc#488721 - Fix id-ref expansion for clones, the doc-root for clone children is not the cib root
891
+ pengine: Bug bnc#490418 - Correctly determine node state for nodes wishing to be terminated
892
+ pengine: Bug LF#2087 - Correctly parse the state of anonymous clones that have multiple instances on a given node
893
+ pengine: Bug lf#2089 - Meta attributes are not inherited by clone children
894
+ pengine: Bug lf#2091 - Correctly restart modified resources that were found active by a probe
895
+ pengine: Bug lf#2094 - Fix probe ordering for cloned groups
896
+ pengine: Bug LF:2075 - Fix large pingd memory leaks
897
+ pengine: Correctly attach orphaned clone children to their parent
898
+ pengine: Correctly handle terminate node attributes that are set to the output from time()
899
+ pengine: Ensure orphaned clone members are hooked up to the parent when clone-max=0
900
+ pengine: Fix memory leak in LogActions
901
+ pengine: Fix the determination of whether a group is active
902
+ pengine: Look up the correct promotion preference for anonymous masters
903
+ pengine: Simplify handling of start failures by changing the default migration-threshold to INFINITY
904
+ pengine: The ordered option for clones no longer causes extra start/stop operations
905
+ RA: Bug bnc#490641 - Shut down dlm_controld with -TERM instead of -KILL
906
+ RA: pingd: Set default ping interval to 1 instead of 0 seconds
907
+ Resources: pingd - Correctly tell the ping daemon to shut down
908
+ Tools: Bug bnc#483365 - Ensure the command from cluster_test includes a value for --log-facility
909
+ Tools: cli: fix and improve delete command
910
+ Tools: crm: add and implement templates
911
+ Tools: crm: add support for command aliases and some common commands (i.e. cd,exit)
912
+ Tools: crm: create top configuration nodes if they are missing
913
+ Tools: crm: fix parsing attributes for rules (broken by the previous changeset)
914
+ Tools: crm: new ra set of commands
915
+ Tools: crm: resource agents information management
916
+ Tools: crm: rsc/op_defaults
917
+ Tools: crm: support for no value attribute in nvpairs
918
+ Tools: crm: the new configure monitor command
919
+ Tools: crm: the new configure node command
920
+ Tools: crm_mon - Prevent use-of-NULL when summarizing an orphan
921
+ Tools: hb2openais: create clvmd clone for respawn evmsd in ha.cf
922
+ Tools: hb2openais: fix a serious recursion bug in xml node processing
923
+ Tools: hb2openais: fix ocfs2 processing
924
+ Tools: pingd - prevent double free of getaddrinfo() output in error path
925
+ Tools: The default re-ping interval for pingd should be 1s not 1ms
744
926
+ Medium (bnc#479049): Tools: crm: add validation of resource type for the configure primitive command
745
927
+ Medium (bnc#479050): Tools: crm: add help for RA parameters in tab completion
746
928
+ Medium (bnc#479050): Tools: crm: add tab completion for primitive params/meta/op
797
979
Diff: 639 files changed, 20871 insertions(+), 21594 deletions(-)
798
980
- Changes since Pacemaker-1.0.1
799
+ High (bnc#450815): Tools: crm cli: do not generate id for the operations tag
800
+ High: ais: Add support for the new AIS IPC layer
801
+ High: ais: Always set header.error to the correct default: SA_AIS_OK
802
+ High: ais: Bug BNC#456243 - Ensure the membership cache always contains an entry for the local node
803
+ High: ais: Bug BNC:456208 - Prevent deadlocks by not logging in the child process before exec()
804
+ High: ais: By default, disable supprt for the WIP openais IPC patch
805
+ High: ais: Detect and handle situations where ais and the crm disagree on the node name
806
+ High: ais: Ensure crm_peer_seq is updated after a membership update
807
+ High: ais: Make sure all IPC header fields are set to sane defaults
808
+ High: ais: Repair and streamline service load now that whitetank startup functions correctly
809
+ High: build: create and install doc files
810
+ High: cib: Allow clients without mainloop to connect to the cib
811
+ High: cib: CID:18 - Fix use-of-NULL in cib_perform_op
812
+ High: cib: CID:18 - Repair errors introduced in b5a18704477b - Fix use-of-NULL in cib_perform_op
813
+ High: cib: Ensure diffs contain the correct values of admin_epoch
814
+ High: cib: Fix four moderately sized memory leaks detected by Valgrind
815
+ High: Core: CID:10 - Prevent indexing into an array of schemas with a negative value
816
+ High: Core: CID:13 - Fix memory leak in log_data_element
817
+ High: Core: CID:15 - Fix memory leak in crm_get_peer
818
+ High: Core: CID:6 - Fix use-of-NULL in copy_ha_msg_input
819
+ High: Core: Fix crash in the membership code preventing node shutdown
820
+ High: Core: Fix more memory leaks foudn by valgrind
821
+ High: Core: Prevent unterminated strings after decompression
822
+ High: crmd: Bug BNC:467995 - Delay marking STONITH operations complete until STONITH tells us so
823
+ High: crmd: Bug LF:1962 - Do not NACK peers because they are not (yet) in our membership. Just ignore them.
824
+ High: crmd: Bug LF:2010 - Ensure fencing cib updates create the node_state entry if needed to preent re-fencing during cluster startup
825
+ High: crmd: Correctly handle reconnections to attrd
826
+ High: crmd: Ensure updates for lost migrate operations indicate which node it tried to migrating to
827
+ High: crmd: If there are no nodes to finalize, start an election.
828
+ High: crmd: If there are no nodes to welcome, start an election.
829
+ High: crmd: Prevent node attribute loss by detecting attrd disconnections immediately
830
+ High: crmd: Prevent node re-probe loops by ensuring manditory actions always complete
831
+ High: PE: Bug 2005 - Fix startup ordering of cloned stonith groups
832
+ High: PE: Bug 2006 - Correctly reprobe cloned groups
833
+ High: PE: Bug BNC:465484 - Fix the no-quorum-policy=suicide option
834
+ High: PE: Bug LF:1996 - Correctly process disabled monitor operations
835
+ High: PE: CID:19 - Fix use-of-NULL in determine_online_status
836
+ High: PE: Clones now default to globally-unique=false
837
+ High: PE: Correctly calculate the number of available nodes for the clone to use
838
+ High: PE: Only shoot online nodes with no-quorum-policy=suicide
839
+ High: PE: Prevent on-fail settings being ignored after a resource is successfully stopped
840
+ High: PE: Prevent use-of-NULL for failed migrate actions in process_rsc_state()
841
+ High: PE: Remove an optimization for the terminate node attribute that caused the cluster to block indefinitly
842
+ High: PE: Repar the ability to colocate based on node attributes other than uname
843
+ High: PE: Start the correct monitor operation for unmanaged masters
844
+ High: stonith: CID:3 - Fix another case of exceptionally poor error handling by the original stonith developers
845
+ High: stonith: CID:5 - Checking for NULL and then dereferencing it anyway is an interesting approach to error handling
846
+ High: stonithd: Sending IPC to the cluster is a privileged operation
847
+ High: stonithd: wrong checks for shmid (0 is a valid id)
848
+ High: Tools: attrd - Correctly determine when an attribute has stopped changing and should be committed to the CIB
849
+ High: Tools: Bug 2003 - pingd does not correctly detect failures when the interface is down
850
+ High: Tools: Bug 2003 - pingd does not correctly handle node-down events on multi-NIC systems
851
+ High: Tools: Bug 2021 - pingd does not detect sequence wrapping correctly, incorrectly reports nodes offline
852
+ High: Tools: Bug BNC:468066 - Do not use the result of uname() when its no longer in scope
853
+ High: Tools: Bug BNC:473265 - crm_resource -L dumps core
854
+ High: Tools: Bug LF:2001 - Transient node attributes should be set via attrd
855
+ High: Tools: Bug LF:2036 - crm_resource cannot set/get parameters for cloned resources
856
+ High: Tools: Bug LF:2046 - Node attribute updates are lost because attrd can take too long to start
857
+ High: Tools: Cause the correct clone instance to be failed with crm_resource -F
858
+ High: Tools: cluster_test - Allow the user to select a stack and fix CTS invocation
859
+ High: Tools: crm cli: allow rename only if the resource is stopped
860
+ High: Tools: crm cli: catch system errors on file operations
861
+ High: Tools: crm cli: completion for ids in configure
862
+ High: Tools: crm cli: drop '-rsc' from attributes for order constraint
863
+ High: Tools: crm cli: exit with an appropriate exit code
864
+ High: Tools: crm cli: fix wrong order of action and resource in order constraint
865
+ High: Tools: crm cli: fox wrong exit code
866
+ High: Tools: crm cli: improve handling of cib attributes
867
+ High: Tools: crm cli: new command: configure rename
868
+ High: Tools: crm cli: new command: configure upgrade
869
+ High: Tools: crm cli: new command: node delete
870
+ High: Tools: crm cli: prevent key errors on missing cib attributes
871
+ High: Tools: crm cli: print long help for help topics
872
+ High: Tools: crm cli: return on syntax error when parsing score
873
+ High: Tools: crm cli: rsc_location can be without nvpairs
874
+ High: Tools: crm cli: short node preference location constraint
875
+ High: Tools: crm cli: sometimes, on errors, level would change on single shot use
876
+ High: Tools: crm cli: syntax: drop a bunch of commas (remains of help tables conversion)
877
+ High: Tools: crm cli: verify user input for sanity
878
+ High: Tools: crm: find expressions within rules (do not always skip xml nodes due to used id)
879
+ High: Tools: crm_master should not define a set id now that attrd is used. Defining one can break lookups
880
+ High: Tools: crm_mon Use the OID assigned to the project by IANA for SNMP traps
981
+ (bnc#450815): Tools: crm cli: do not generate id for the operations tag
982
+ ais: Add support for the new AIS IPC layer
983
+ ais: Always set header.error to the correct default: SA_AIS_OK
984
+ ais: Bug BNC#456243 - Ensure the membership cache always contains an entry for the local node
985
+ ais: Bug BNC:456208 - Prevent deadlocks by not logging in the child process before exec()
986
+ ais: By default, disable supprt for the WIP openais IPC patch
987
+ ais: Detect and handle situations where ais and the crm disagree on the node name
988
+ ais: Ensure crm_peer_seq is updated after a membership update
989
+ ais: Make sure all IPC header fields are set to sane defaults
990
+ ais: Repair and streamline service load now that whitetank startup functions correctly
991
+ build: create and install doc files
992
+ cib: Allow clients without mainloop to connect to the cib
993
+ cib: CID:18 - Fix use-of-NULL in cib_perform_op
994
+ cib: CID:18 - Repair errors introduced in b5a18704477b - Fix use-of-NULL in cib_perform_op
995
+ cib: Ensure diffs contain the correct values of admin_epoch
996
+ cib: Fix four moderately sized memory leaks detected by Valgrind
997
+ Core: CID:10 - Prevent indexing into an array of schemas with a negative value
998
+ Core: CID:13 - Fix memory leak in log_data_element
999
+ Core: CID:15 - Fix memory leak in crm_get_peer
1000
+ Core: CID:6 - Fix use-of-NULL in copy_ha_msg_input
1001
+ Core: Fix crash in the membership code preventing node shutdown
1002
+ Core: Fix more memory leaks foudn by valgrind
1003
+ Core: Prevent unterminated strings after decompression
1004
+ crmd: Bug BNC:467995 - Delay marking STONITH operations complete until STONITH tells us so
1005
+ crmd: Bug LF:1962 - Do not NACK peers because they are not (yet) in our membership. Just ignore them.
1006
+ crmd: Bug LF:2010 - Ensure fencing cib updates create the node_state entry if needed to preent re-fencing during cluster startup
1007
+ crmd: Correctly handle reconnections to attrd
1008
+ crmd: Ensure updates for lost migrate operations indicate which node it tried to migrating to
1009
+ crmd: If there are no nodes to finalize, start an election.
1010
+ crmd: If there are no nodes to welcome, start an election.
1011
+ crmd: Prevent node attribute loss by detecting attrd disconnections immediately
1012
+ crmd: Prevent node re-probe loops by ensuring manditory actions always complete
1013
+ pengine: Bug 2005 - Fix startup ordering of cloned stonith groups
1014
+ pengine: Bug 2006 - Correctly reprobe cloned groups
1015
+ pengine: Bug BNC:465484 - Fix the no-quorum-policy=suicide option
1016
+ pengine: Bug LF:1996 - Correctly process disabled monitor operations
1017
+ pengine: CID:19 - Fix use-of-NULL in determine_online_status
1018
+ pengine: Clones now default to globally-unique=false
1019
+ pengine: Correctly calculate the number of available nodes for the clone to use
1020
+ pengine: Only shoot online nodes with no-quorum-policy=suicide
1021
+ pengine: Prevent on-fail settings being ignored after a resource is successfully stopped
1022
+ pengine: Prevent use-of-NULL for failed migrate actions in process_rsc_state()
1023
+ pengine: Remove an optimization for the terminate node attribute that caused the cluster to block indefinitly
1024
+ pengine: Repar the ability to colocate based on node attributes other than uname
1025
+ pengine: Start the correct monitor operation for unmanaged masters
1026
+ stonith: CID:3 - Fix another case of exceptionally poor error handling by the original stonith developers
1027
+ stonith: CID:5 - Checking for NULL and then dereferencing it anyway is an interesting approach to error handling
1028
+ stonithd: Sending IPC to the cluster is a privileged operation
1029
+ stonithd: wrong checks for shmid (0 is a valid id)
1030
+ Tools: attrd - Correctly determine when an attribute has stopped changing and should be committed to the CIB
1031
+ Tools: Bug 2003 - pingd does not correctly detect failures when the interface is down
1032
+ Tools: Bug 2003 - pingd does not correctly handle node-down events on multi-NIC systems
1033
+ Tools: Bug 2021 - pingd does not detect sequence wrapping correctly, incorrectly reports nodes offline
1034
+ Tools: Bug BNC:468066 - Do not use the result of uname() when its no longer in scope
1035
+ Tools: Bug BNC:473265 - crm_resource -L dumps core
1036
+ Tools: Bug LF:2001 - Transient node attributes should be set via attrd
1037
+ Tools: Bug LF:2036 - crm_resource cannot set/get parameters for cloned resources
1038
+ Tools: Bug LF:2046 - Node attribute updates are lost because attrd can take too long to start
1039
+ Tools: Cause the correct clone instance to be failed with crm_resource -F
1040
+ Tools: cluster_test - Allow the user to select a stack and fix CTS invocation
1041
+ Tools: crm cli: allow rename only if the resource is stopped
1042
+ Tools: crm cli: catch system errors on file operations
1043
+ Tools: crm cli: completion for ids in configure
1044
+ Tools: crm cli: drop '-rsc' from attributes for order constraint
1045
+ Tools: crm cli: exit with an appropriate exit code
1046
+ Tools: crm cli: fix wrong order of action and resource in order constraint
1047
+ Tools: crm cli: fox wrong exit code
1048
+ Tools: crm cli: improve handling of cib attributes
1049
+ Tools: crm cli: new command: configure rename
1050
+ Tools: crm cli: new command: configure upgrade
1051
+ Tools: crm cli: new command: node delete
1052
+ Tools: crm cli: prevent key errors on missing cib attributes
1053
+ Tools: crm cli: print long help for help topics
1054
+ Tools: crm cli: return on syntax error when parsing score
1055
+ Tools: crm cli: rsc_location can be without nvpairs
1056
+ Tools: crm cli: short node preference location constraint
1057
+ Tools: crm cli: sometimes, on errors, level would change on single shot use
1058
+ Tools: crm cli: syntax: drop a bunch of commas (remains of help tables conversion)
1059
+ Tools: crm cli: verify user input for sanity
1060
+ Tools: crm: find expressions within rules (do not always skip xml nodes due to used id)
1061
+ Tools: crm_master should not define a set id now that attrd is used. Defining one can break lookups
1062
+ Tools: crm_mon Use the OID assigned to the project by IANA for SNMP traps
881
1063
+ Medium (bnc#445622): Tools: crm cli: improve the node show command and drop node status
882
1064
+ Medium (LF 2009): stonithd: improve timeouts for remote fencing
883
1065
+ Medium: ais: Allow dead peers to be removed from membership calculations
949
1131
Diff: 816 files changed, 7633 insertions(+), 6286 deletions(-)
950
1132
- Changes since Pacemaker-1.0.1
951
+ High: ais: Allow the crmd to get callbacks whenever a node state changes
952
+ High: ais: Create an option for starting the mgmtd daemon automatically
953
+ High: ais: Ensure HA_RSCTMP exists for use by resource agents
954
+ High: ais: Hook up the openais.conf config logging options
955
+ High: ais: Zero out the PID of disconnecting clients
956
+ High: cib: Ensure global updates cause a disk write when appropriate
957
+ High: Core: Add an extra snaity check to getXpathResults() to prevent segfaults
958
+ High: Core: Do not redefine __FUNCTION__ unnecessarily
959
+ High: Core: Repair the ability to have comments in the configuration
960
+ High: crmd: Bug:1975 - crmd should wait indefinitely for stonith operations to complete
961
+ High: crmd: Ensure PE processing does not occur for all error cases in do_pe_invoke_callback
962
+ High: crmd: Requests to the CIB should cause any prior PE calculations to be ignored
963
+ High: heartbeat: Wait for membership 'up' events before removing stale node status data
964
+ High: PE: Bug LF:1988 - Ensure recurring operations always have the correct target-rc set
965
+ High: PE: Bug LF:1988 - For unmanaged resources we need to skip the usual can_run_resources() checks
966
+ High: PE: Ensure the terminate node attribute is handled correctly
967
+ High: PE: Fix optional colocation
968
+ High: PE: Improve up the detection of 'new' nodes joining the cluster
969
+ High: PE: Prevent assert failures in master_color() by ensuring unmanaged masters are always reallocated to their current location
970
+ High: Tools: crm cli: parser: return False on syntax error and None for comments
971
+ High: Tools: crm cli: unify template and edit commands
972
+ High: Tools: crm_shadow - Show more line number information after validation failures
973
+ High: Tools: hb2openais: add option to upgrade the CIB to v3.0
974
+ High: Tools: hb2openais: add U option to getopts and update usage
975
+ High: Tools: hb2openais: backup improved and multiple fixes
976
+ High: Tools: hb2openais: fix class/provider reversal
977
+ High: Tools: hb2openais: fix testing
978
+ High: Tools: hb2openais: move the CIB update to the end
979
+ High: Tools: hb2openais: update logging and set logfile appropriately
980
+ High: Tools: LF:1969 - Attrd never sets any properties in the cib
981
+ High: Tools: Make attrd functional on OpenAIS
1133
+ ais: Allow the crmd to get callbacks whenever a node state changes
1134
+ ais: Create an option for starting the mgmtd daemon automatically
1135
+ ais: Ensure HA_RSCTMP exists for use by resource agents
1136
+ ais: Hook up the openais.conf config logging options
1137
+ ais: Zero out the PID of disconnecting clients
1138
+ cib: Ensure global updates cause a disk write when appropriate
1139
+ Core: Add an extra snaity check to getXpathResults() to prevent segfaults
1140
+ Core: Do not redefine __FUNCTION__ unnecessarily
1141
+ Core: Repair the ability to have comments in the configuration
1142
+ crmd: Bug:1975 - crmd should wait indefinitely for stonith operations to complete
1143
+ crmd: Ensure PE processing does not occur for all error cases in do_pe_invoke_callback
1144
+ crmd: Requests to the CIB should cause any prior PE calculations to be ignored
1145
+ heartbeat: Wait for membership 'up' events before removing stale node status data
1146
+ pengine: Bug LF:1988 - Ensure recurring operations always have the correct target-rc set
1147
+ pengine: Bug LF:1988 - For unmanaged resources we need to skip the usual can_run_resources() checks
1148
+ pengine: Ensure the terminate node attribute is handled correctly
1149
+ pengine: Fix optional colocation
1150
+ pengine: Improve up the detection of 'new' nodes joining the cluster
1151
+ pengine: Prevent assert failures in master_color() by ensuring unmanaged masters are always reallocated to their current location
1152
+ Tools: crm cli: parser: return False on syntax error and None for comments
1153
+ Tools: crm cli: unify template and edit commands
1154
+ Tools: crm_shadow - Show more line number information after validation failures
1155
+ Tools: hb2openais: add option to upgrade the CIB to v3.0
1156
+ Tools: hb2openais: add U option to getopts and update usage
1157
+ Tools: hb2openais: backup improved and multiple fixes
1158
+ Tools: hb2openais: fix class/provider reversal
1159
+ Tools: hb2openais: fix testing
1160
+ Tools: hb2openais: move the CIB update to the end
1161
+ Tools: hb2openais: update logging and set logfile appropriately
1162
+ Tools: LF:1969 - Attrd never sets any properties in the cib
1163
+ Tools: Make attrd functional on OpenAIS
982
1164
+ Medium: ais: Hook up the options for specifying the expected number of nodes and total quorum votes
983
1165
+ Medium: ais: Look for pacemaker options inside the service block with 'name: pacemaker' instead of creating an addtional configuration block
984
1166
+ Medium: ais: Provide better feedback when nodes change nodeids (in openais.conf)
1012
1194
Changesets: 261
1013
1195
Diff: 3021 files changed, 244985 insertions(+), 111596 deletions(-)
1014
1196
- Changes since f805e1b30103
1015
+ High: add the crm cli program
1016
+ High: ais: Move the service id definition to a common location and make sure it is always used
1017
+ High: build: rename hb2openais.sh to .in and replace paths with vars
1018
+ High: cib: Implement --create for crm_shadow
1019
+ High: cib: Remove dead files
1020
+ High: Core: Allow the expected number of quorum votes to be configrable
1021
+ High: Core: cl_malloc and friends were removed from Heartbeat
1022
+ High: Core: Only call xmlCleanupParser() if we parsed anything. Doing so unconditionally seems to cause a segfault
1023
+ High: hb2openais.sh: improve pingd handling; several bugs fixed
1024
+ High: hb2openais: fix clone creation; replace EVMS strings
1025
+ High: new hb2openais.sh conversion script
1026
+ High: PE: Bug LF:1950 - Ensure the current values for all notification variables are always set (even if empty)
1027
+ High: PE: Bug LF:1955 - Ensure unmanaged masters are unconditionally repromoted to ensure they are monitored correctly.
1028
+ High: PE: Bug LF:1955 - Fix another case of filtering causing unmanaged master failures
1029
+ High: PE: Bug LF:1955 - Umanaged mode prevents master resources from being allocated correctly
1030
+ High: PE: Bug N:420538 - Anit-colocation caused a positive node preference
1031
+ High: PE: Correctly handle unmanaged resources to prevent them from being started elsewhere
1032
+ High: PE: crm_resource - Fix the --migrate command
1033
+ High: PE: MAke stonith-enabled default to true and warn if no STONITH resources are found
1034
+ High: PE: Make sure orphaned clone children are created correctly
1035
+ High: PE: Monitors for unmanaged resources do not need to wait for start/promote/demote actions to complete
1036
+ High: stonithd (LF 1951): fix remote stonith operations
1037
+ High: stonithd: fix handling of timeouts
1038
+ High: stonithd: fix logic for stonith resource priorities
1039
+ High: stonithd: implement the fence-timeout instance attribute
1040
+ High: stonithd: initialize value before reading fence-timeout
1041
+ High: stonithd: set timeouts for fencing ops to the timeout of the start op
1042
+ High: stonithd: stonith rsc priorities (new feature)
1043
+ High: Tools: Add hb2openais - a tool for upgrading a Heartbeat cluster to use OpenAIS instead
1044
+ High: Tools: crm_verify - clean up the upgrade logic to prevent crash on invalid configurations
1045
+ High: Tools: Make pingd functional on Linux
1046
+ High: Update version numbers for 1.0 candidates
1197
+ add the crm cli program
1198
+ ais: Move the service id definition to a common location and make sure it is always used
1199
+ build: rename hb2openais.sh to .in and replace paths with vars
1200
+ cib: Implement --create for crm_shadow
1201
+ cib: Remove dead files
1202
+ Core: Allow the expected number of quorum votes to be configrable
1203
+ Core: cl_malloc and friends were removed from Heartbeat
1204
+ Core: Only call xmlCleanupParser() if we parsed anything. Doing so unconditionally seems to cause a segfault
1205
+ hb2openais.sh: improve pingd handling; several bugs fixed
1206
+ hb2openais: fix clone creation; replace EVMS strings
1207
+ new hb2openais.sh conversion script
1208
+ pengine: Bug LF:1950 - Ensure the current values for all notification variables are always set (even if empty)
1209
+ pengine: Bug LF:1955 - Ensure unmanaged masters are unconditionally repromoted to ensure they are monitored correctly.
1210
+ pengine: Bug LF:1955 - Fix another case of filtering causing unmanaged master failures
1211
+ pengine: Bug LF:1955 - Umanaged mode prevents master resources from being allocated correctly
1212
+ pengine: Bug N:420538 - Anit-colocation caused a positive node preference
1213
+ pengine: Correctly handle unmanaged resources to prevent them from being started elsewhere
1214
+ pengine: crm_resource - Fix the --migrate command
1215
+ pengine: MAke stonith-enabled default to true and warn if no STONITH resources are found
1216
+ pengine: Make sure orphaned clone children are created correctly
1217
+ pengine: Monitors for unmanaged resources do not need to wait for start/promote/demote actions to complete
1218
+ stonithd (LF 1951): fix remote stonith operations
1219
+ stonithd: fix handling of timeouts
1220
+ stonithd: fix logic for stonith resource priorities
1221
+ stonithd: implement the fence-timeout instance attribute
1222
+ stonithd: initialize value before reading fence-timeout
1223
+ stonithd: set timeouts for fencing ops to the timeout of the start op
1224
+ stonithd: stonith rsc priorities (new feature)
1225
+ Tools: Add hb2openais - a tool for upgrading a Heartbeat cluster to use OpenAIS instead
1226
+ Tools: crm_verify - clean up the upgrade logic to prevent crash on invalid configurations
1227
+ Tools: Make pingd functional on Linux
1228
+ Update version numbers for 1.0 candidates
1047
1229
+ Medium: ais: Add support for a synchronous call to retrieve the nodes nodeid
1048
1230
+ Medium: ais: Use the agreed service number
1049
1231
+ Medium: Build: Reliably detect heartbeat libraries during configure
1079
1261
Changesets: 133
1080
1262
Diff: 89 files changed, 7492 insertions(+), 1125 deletions(-)
1081
1263
- Changes since f805e1b30103
1082
+ High: Tools: add the crm cli program
1083
+ High: Core: cl_malloc and friends were removed from Heartbeat
1084
+ High: Core: Only call xmlCleanupParser() if we parsed anything. Doing so unconditionally seems to cause a segfault
1085
+ High: new hb2openais.sh conversion script
1086
+ High: PE: Bug LF:1950 - Ensure the current values for all notification variables are always set (even if empty)
1087
+ High: PE: Bug LF:1955 - Ensure unmanaged masters are unconditionally repromoted to ensure they are monitored correctly.
1088
+ High: PE: Bug LF:1955 - Fix another case of filtering causing unmanaged master failures
1089
+ High: PE: Bug LF:1955 - Umanaged mode prevents master resources from being allocated correctly
1090
+ High: PE: Bug N:420538 - Anit-colocation caused a positive node preference
1091
+ High: PE: Correctly handle unmanaged resources to prevent them from being started elsewhere
1092
+ High: PE: crm_resource - Fix the --migrate command
1093
+ High: PE: MAke stonith-enabled default to true and warn if no STONITH resources are found
1094
+ High: PE: Make sure orphaned clone children are created correctly
1095
+ High: PE: Monitors for unmanaged resources do not need to wait for start/promote/demote actions to complete
1096
+ High: stonithd (LF 1951): fix remote stonith operations
1097
+ High: Tools: crm_verify - clean up the upgrade logic to prevent crash on invalid configurations
1264
+ Tools: add the crm cli program
1265
+ Core: cl_malloc and friends were removed from Heartbeat
1266
+ Core: Only call xmlCleanupParser() if we parsed anything. Doing so unconditionally seems to cause a segfault
1267
+ new hb2openais.sh conversion script
1268
+ pengine: Bug LF:1950 - Ensure the current values for all notification variables are always set (even if empty)
1269
+ pengine: Bug LF:1955 - Ensure unmanaged masters are unconditionally repromoted to ensure they are monitored correctly.
1270
+ pengine: Bug LF:1955 - Fix another case of filtering causing unmanaged master failures
1271
+ pengine: Bug LF:1955 - Umanaged mode prevents master resources from being allocated correctly
1272
+ pengine: Bug N:420538 - Anit-colocation caused a positive node preference
1273
+ pengine: Correctly handle unmanaged resources to prevent them from being started elsewhere
1274
+ pengine: crm_resource - Fix the --migrate command
1275
+ pengine: MAke stonith-enabled default to true and warn if no STONITH resources are found
1276
+ pengine: Make sure orphaned clone children are created correctly
1277
+ pengine: Monitors for unmanaged resources do not need to wait for start/promote/demote actions to complete
1278
+ stonithd (LF 1951): fix remote stonith operations
1279
+ Tools: crm_verify - clean up the upgrade logic to prevent crash on invalid configurations
1098
1280
+ Medium: ais: Add support for a synchronous call to retrieve the nodes nodeid
1099
1281
+ Medium: ais: Use the agreed service number
1100
+ Medium: PE: Allow external entities to ask for a node to be shot by creating a terminate=true transient node attribute
1101
+ Medium: PE: Bug LF:1950 - Notifications do not contain all documented resource state fields
1102
+ Medium: PE: Bug N:417585 - Do not restart group children whos individual score drops below zero
1103
+ Medium: PE: Implement a true maintenance mode
1104
+ Medium: PE: Print the correct message when stonith is disabled
1282
+ Medium: pengine: Allow external entities to ask for a node to be shot by creating a terminate=true transient node attribute
1283
+ Medium: pengine: Bug LF:1950 - Notifications do not contain all documented resource state fields
1284
+ Medium: pengine: Bug N:417585 - Do not restart group children whos individual score drops below zero
1285
+ Medium: pengine: Implement a true maintenance mode
1286
+ Medium: pengine: Print the correct message when stonith is disabled
1105
1287
+ Medium: stonithd: exit with better code on empty host list
1106
1288
+ Medium: xml: pacemaker-0.7 is now an alias for the 1.0 schema
1197
1379
Changesets: 439
1198
1380
Diff: 676 files changed, 41310 insertions(+), 52071 deletions(-)
1199
1381
- Changes added since stable-0.6
1200
+ High: A new tool for setting up and invoking CTS
1201
+ High: Admin: All tools now use --node (-N) for specifying node unames
1202
+ High: Admin: All tools now use --xml-file (-x) and --xml-text (-X) for specifying where to find XML blobs
1203
+ High: cib: Cleanup the API - remove redundant input fields
1204
+ High: cib: Implement CIB_shadow - a facility for making and testing changes before uploading them to the cluster
1205
+ High: cib: Make registering per-op callbacks an API call and renamed (for clarity) the API call for requesting notifications
1206
+ High: Core: Add a facility for automatically upgrading old configurations
1207
+ High: Core: Adopt libxml2 as the XML processing library - all external clients need to be recompiled
1208
+ High: Core: Allow sending TLS messages larger than the MTU
1209
+ High: Core: Fix parsing of time-only ISO dates
1210
+ High: Core: Smarter handling of XML values containing quotes
1211
+ High: Core: XML memory corruption - catch, and handle, cases where we are overwriting an attribute value with itself
1212
+ High: Core: The xml ID type does not allow UUIDs that start with a number
1213
+ High: Core: Implement XPath based versions of query/delete/replace/modify
1214
+ High: Core: Remove some HA2.0.(3,4) compatability code
1215
+ High: crmd: Overhaul the detection of nodes that are starting vs. failed
1216
+ High: PE: Bug LF:1459 - Allow failures to expire
1217
+ High: PE: Have the PE do non-persistent configuration upgrades before performing calculations
1218
+ High: PE: Replace failure-stickiness with a simple 'migration-threshold'
1219
+ High: TE: Simplify the design by folding the tengine process into the crmd
1382
+ A new tool for setting up and invoking CTS
1383
+ Admin: All tools now use --node (-N) for specifying node unames
1384
+ Admin: All tools now use --xml-file (-x) and --xml-text (-X) for specifying where to find XML blobs
1385
+ cib: Cleanup the API - remove redundant input fields
1386
+ cib: Implement CIB_shadow - a facility for making and testing changes before uploading them to the cluster
1387
+ cib: Make registering per-op callbacks an API call and renamed (for clarity) the API call for requesting notifications
1388
+ Core: Add a facility for automatically upgrading old configurations
1389
+ Core: Adopt libxml2 as the XML processing library - all external clients need to be recompiled
1390
+ Core: Allow sending TLS messages larger than the MTU
1391
+ Core: Fix parsing of time-only ISO dates
1392
+ Core: Smarter handling of XML values containing quotes
1393
+ Core: XML memory corruption - catch, and handle, cases where we are overwriting an attribute value with itself
1394
+ Core: The xml ID type does not allow UUIDs that start with a number
1395
+ Core: Implement XPath based versions of query/delete/replace/modify
1396
+ Core: Remove some HA2.0.(3,4) compatability code
1397
+ crmd: Overhaul the detection of nodes that are starting vs. failed
1398
+ pengine: Bug LF:1459 - Allow failures to expire
1399
+ pengine: Have the PE do non-persistent configuration upgrades before performing calculations
1400
+ pengine: Replace failure-stickiness with a simple 'migration-threshold'
1401
+ tengine: Simplify the design by folding the tengine process into the crmd
1220
1402
+ Medium: Admin: Bug LF:1438 - Allow the list of all/active resource operations to be queried by crm_resource
1221
1403
+ Medium: Admin: Bug LF:1708 - crm_resource should print a warning if an attribute is already set as a meta attribute
1222
1404
+ Medium: Admin: Bug LF:1883 - crm_mon should display fail-count and operation history
1311
1493
Changesets: 117
1312
1494
Diff: 354 files changed, 19094 insertions(+), 11338 deletions(-)
1313
1495
- Changes since Pacemaker-0.6.2
1314
+ High: Admin: Bug LF:1848 - crm_resource - Pass set name and id to delete_resource_attr() in the correct order
1315
+ High: Build: SNMP has been moved to the management/pygui project
1316
+ High: crmd: Bug LF1837 - Unmanaged resources prevent crmd from shutting down
1317
+ High: crmd: Prevent use-after-free in lrm interface code (Patch based on work by Keisuke MORI)
1318
+ High: PE: Allow the cluster to make progress by not retrying failed demote actions
1319
+ High: PE: Anti-colocation with slave should not prevent master colocation
1320
+ High: PE: Bug LF 1768 - Wait more often for STONITH ops to complete before starting resources
1321
+ High: PE: Bug LF1836 - Allow is-managed-default=false to be overridden by individual resources
1322
+ High: PE: Bug LF185 - Prevent pointless master/slave instance shuffling by ignoring the master-pref of stopped instances
1323
+ High: PE: Bug N-191176 - Implement interleaved ordering for clone-to-clone scenarios
1324
+ High: PE: Bug N-347004 - Ensure clone notifications are always sent when an instance is stopped/started
1325
+ High: PE: Bug N-347004 - Include notification ordering is correct for interleaved clones
1326
+ High: PE: Bug PM-11 - Directly link probe_complete to starting clone instances
1327
+ High: PE: Bug PM1 - Fix setting failcounts when applied to complex resources
1328
+ High: PE: Bug PM12, LF1648 - Extensive revision of group ordering
1329
+ High: PE: Bug PM7 - Ensure masters are always demoted before they are stopped
1330
+ High: PE: Create probes after allocation to allow smarter handling of anonymous clones
1331
+ High: PE: Do not prioritize clone instances that must be moved
1332
+ High: PE: Fix error in previous commit that allowed more than the required number of masters to be promoted
1333
+ High: PE: Group start ordering fixes
1334
+ High: PE: Implement promote/demote ordering for cloned groups
1335
+ High: TE: Repair failcount updates
1336
+ High: TE: Use the correct offset when updating failcount
1496
+ Admin: Bug LF:1848 - crm_resource - Pass set name and id to delete_resource_attr() in the correct order
1497
+ Build: SNMP has been moved to the management/pygui project
1498
+ crmd: Bug LF1837 - Unmanaged resources prevent crmd from shutting down
1499
+ crmd: Prevent use-after-free in lrm interface code (Patch based on work by Keisuke MORI)
1500
+ pengine: Allow the cluster to make progress by not retrying failed demote actions
1501
+ pengine: Anti-colocation with slave should not prevent master colocation
1502
+ pengine: Bug LF 1768 - Wait more often for STONITH ops to complete before starting resources
1503
+ pengine: Bug LF1836 - Allow is-managed-default=false to be overridden by individual resources
1504
+ pengine: Bug LF185 - Prevent pointless master/slave instance shuffling by ignoring the master-pref of stopped instances
1505
+ pengine: Bug N-191176 - Implement interleaved ordering for clone-to-clone scenarios
1506
+ pengine: Bug N-347004 - Ensure clone notifications are always sent when an instance is stopped/started
1507
+ pengine: Bug N-347004 - Include notification ordering is correct for interleaved clones
1508
+ pengine: Bug PM-11 - Directly link probe_complete to starting clone instances
1509
+ pengine: Bug PM1 - Fix setting failcounts when applied to complex resources
1510
+ pengine: Bug PM12, LF1648 - Extensive revision of group ordering
1511
+ pengine: Bug PM7 - Ensure masters are always demoted before they are stopped
1512
+ pengine: Create probes after allocation to allow smarter handling of anonymous clones
1513
+ pengine: Do not prioritize clone instances that must be moved
1514
+ pengine: Fix error in previous commit that allowed more than the required number of masters to be promoted
1515
+ pengine: Group start ordering fixes
1516
+ pengine: Implement promote/demote ordering for cloned groups
1517
+ tengine: Repair failcount updates
1518
+ tengine: Use the correct offset when updating failcount
1337
1519
+ Medium: Admin: Add a summary output that can be easily parsed by CTS for audit purposes
1338
1520
+ Medium: Build: Make configure fail if bz2 or libxml2 are not present
1339
1521
+ Medium: Build: Re-instate a better default for LCRSODIR
1416
1598
+ OpenAIS #1793 - Assertion failure in memb_state_gather_enter()
1417
1599
+ OpenAIS #1796 - Cluster message corruption
1418
1600
- Changes since Heartbeat-2.1.2-24
1419
+ High: Add OpenAIS support
1420
+ High: Admin: crm_uuid - Look in the right place for Heartbeat UUID files
1421
+ High: admin: Exit and indicate a problem if the crmd exits while crmadmin is performing a query
1422
+ High: cib: Fix CIB_OP_UPDATE calls that modify the whole CIB
1423
+ High: cib: Fix compilation when supporting the heartbeat stack
1424
+ High: cib: Fix memory leaks caused by the switch to get_message_xml()
1425
+ High: cib: HA_VALGRIND_ENABLED needs to be set _and_ set to 1|yes|true
1426
+ High: cib: Use get_message_xml() in preference to cl_get_struct()
1427
+ High: cib: Use the return value from call to write() in cib_send_plaintext()
1428
+ High: Core: ccm nodes can legitimately have a node id of 0
1429
+ High: Core: Fix peer-process tracking for the Heartbeat stack
1430
+ High: Core: Heartbeat does not send status notifications for nodes that were already part of the cluster. Fake them instead
1431
+ High: CRM: Add children to HA_Messages such that the field name matches F_XML_TAGNAME
1432
+ High: crm: Adopt a more flexible appraoch to enabling Valgrind
1433
+ High: crm: Fix compilation when bzip2 is not installed
1434
+ High: CRM: Future-proof get_message_xml()
1435
+ High: crmd: Filter election responses based on time not FSA state
1436
+ High: crmd: Handle all possible peer states in crmd_ha_status_callback()
1437
+ High: crmd: Make sure the current date/time is set - prevents use-of-NULL when evaluating rules
1438
+ High: crmd: Relax an assertion regrading ccm membership instances
1439
+ High: crmd: Use (node->processes&crm_proc_ais) to accurately update the CIB after replace operations
1440
+ High: crmd: Heartbeat: Accurately record peer client status
1441
+ High: PE: Bug 1777 - Allow colocation with a resource in the Stopped state
1442
+ High: PE: Bug 1822 - Prevent use-of-NULL in PromoteRsc()
1443
+ High: PE: Implement three recovery policies based on op_status and op_rc
1444
+ High: PE: Parse fail-count correctly (it may be set to ININFITY)
1445
+ High: PE: Prevent graph-loop when stonith agents need to be moved around before a STONITH op
1446
+ High: PE: Prevent graph-loops when two operations have the same name+interval
1447
+ High: te: Cancel active timers when destroying graphs
1448
+ High: TE: Ensure failcount is set correctly for failed stops/starts
1449
+ High: TE: Update failcount for oeprations that time out
1601
+ Add OpenAIS support
1602
+ Admin: crm_uuid - Look in the right place for Heartbeat UUID files
1603
+ admin: Exit and indicate a problem if the crmd exits while crmadmin is performing a query
1604
+ cib: Fix CIB_OP_UPDATE calls that modify the whole CIB
1605
+ cib: Fix compilation when supporting the heartbeat stack
1606
+ cib: Fix memory leaks caused by the switch to get_message_xml()
1607
+ cib: HA_VALGRIND_ENABLED needs to be set _and_ set to 1|yes|true
1608
+ cib: Use get_message_xml() in preference to cl_get_struct()
1609
+ cib: Use the return value from call to write() in cib_send_plaintext()
1610
+ Core: ccm nodes can legitimately have a node id of 0
1611
+ Core: Fix peer-process tracking for the Heartbeat stack
1612
+ Core: Heartbeat does not send status notifications for nodes that were already part of the cluster. Fake them instead
1613
+ CRM: Add children to HA_Messages such that the field name matches F_XML_TAGNAME
1614
+ crm: Adopt a more flexible appraoch to enabling Valgrind
1615
+ crm: Fix compilation when bzip2 is not installed
1616
+ CRM: Future-proof get_message_xml()
1617
+ crmd: Filter election responses based on time not FSA state
1618
+ crmd: Handle all possible peer states in crmd_ha_status_callback()
1619
+ crmd: Make sure the current date/time is set - prevents use-of-NULL when evaluating rules
1620
+ crmd: Relax an assertion regrading ccm membership instances
1621
+ crmd: Use (node->processes&crm_proc_ais) to accurately update the CIB after replace operations
1622
+ crmd: Heartbeat: Accurately record peer client status
1623
+ pengine: Bug 1777 - Allow colocation with a resource in the Stopped state
1624
+ pengine: Bug 1822 - Prevent use-of-NULL in PromoteRsc()
1625
+ pengine: Implement three recovery policies based on op_status and op_rc
1626
+ pengine: Parse fail-count correctly (it may be set to ININFITY)
1627
+ pengine: Prevent graph-loop when stonith agents need to be moved around before a STONITH op
1628
+ pengine: Prevent graph-loops when two operations have the same name+interval
1629
+ tengine: Cancel active timers when destroying graphs
1630
+ tengine: Ensure failcount is set correctly for failed stops/starts
1631
+ tengine: Update failcount for oeprations that time out
1450
1632
+ Medium: admin: Prevent hang in crm_mon -1 when there is no cib connection - Patch from Junko IKEDA
1451
1633
+ Medium: cib: Require --force|-f when performing potentially dangerous commands with cibadmin
1452
1634
+ Medium: cib: Tweak the shutdown code