
Commit 161021d

Ensure recovery pause feature doesn't pause unless users can connect.

If we're not in hot standby mode, then there's no way for users to connect to reset the recoveryPause flag, so we shouldn't pause. The code was aware of this but the test to see if pausing was safe was seriously inadequate: it wasn't paying attention to reachedConsistency, and besides what it was testing was that we could legally enter hot standby, not that we have done so. Get rid of that in favor of checking LocalHotStandbyActive, which because of the coding in CheckRecoveryConsistency is tantamount to checking that we have told the postmaster to enter hot standby.

Also, move the recoveryPausesHere() call that reacts to asynchronous recoveryPause requests so that it's not in the middle of application of a WAL record. I put it next to the recoveryStopsHere() call --- in future those are going to need to interact significantly, so this seems like a good waystation.

Also, don't bother trying to read another WAL record if we've already decided not to continue recovery. This was no big deal when the code was written originally, but now that reading a record might entail actions like fetching an archive file, it seems a bit silly to do it like that.

Per report from Jeff Janes and subsequent discussion. The pause feature needs quite a lot more work, but this gets rid of some indisputable bugs, and seems safe enough to back-patch.
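To make the first point concrete, here is a minimal standalone sketch of the guard, not the actual xlog.c code. The names `hot_standby_active`, `pause_requested` and `resume_after_polls` are hypothetical stand-ins (for LocalHotStandbyActive, the shared recoveryPause flag, and a user eventually issuing a resume), and the real function sleeps between polls where this one only counts them.

```c
#include <stdbool.h>

/* Hypothetical stand-ins for backend state; none of these are PostgreSQL symbols. */
static bool hot_standby_active;   /* plays the role of LocalHotStandbyActive */
static bool pause_requested;      /* plays the role of the shared recoveryPause flag */
static int  resume_after_polls;   /* simulates a user resuming replay after N polls */

/* Report whether the pause flag is still set, clearing it once the
 * simulated user has "connected" and asked for replay to resume. */
static bool
recovery_is_paused(void)
{
    if (pause_requested && resume_after_polls == 0)
        pause_requested = false;
    else if (pause_requested)
        resume_after_polls--;
    return pause_requested;
}

/* Returns the number of polls spent waiting; 0 means we did not pause. */
static int
recovery_pauses_here(void)
{
    int polls = 0;

    /* Don't pause unless users can connect to clear the flag. */
    if (!hot_standby_active)
        return 0;

    while (recovery_is_paused())
        polls++;                  /* the real code sleeps here instead */

    return polls;
}
```

With `hot_standby_active` false, the function returns immediately and leaves the request pending; without the guard, nobody could connect to clear the flag and the loop would never end. Once `hot_standby_active` is true, it waits until the simulated resume arrives.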

1 parent d56b629, commit 161021d

`‎src/backend/access/transam/xlog.c‎`

Lines changed: 34 additions & 16 deletions

Original file line number	Diff line number	Diff line change
`@@ -5916,13 +5916,19 @@ recoveryStopsHere(XLogRecord record, bool includeThis)`
`5916`	`5916`	`}`
`5917`	`5917`
`5918`	`5918`	`/*`
`5919`		`- * Recheck shared recoveryPause by polling.`
	`5919`	`+ * Wait until shared recoveryPause flag is cleared.`
`5920`	`5920`	`*`
`5921`		`- * XXX Can also be done with shared latch.`
	`5921`	`+ * XXX Could also be done with shared latch, avoiding the pg_usleep loop.`
	`5922`	`+ * Probably not worth the trouble though. This state shouldn't be one that`
	`5923`	`+ * anyone cares about server power consumption in.`
`5922`	`5924`	`*/`
`5923`	`5925`	`static void`
`5924`	`5926`	`recoveryPausesHere(void)`
`5925`	`5927`	`{`
	`5928`	`+/* Don't pause unless users can connect! */`
	`5929`	`+if (!LocalHotStandbyActive)`
	`5930`	`+return;`
	`5931`	`+`
`5926`	`5932`	`ereport(LOG,`
`5927`	`5933`	`(errmsg("recovery has paused"),`
`5928`	`5934`	`errhint("Execute pg_xlog_replay_resume() to continue.")));`
`@@ -6650,7 +6656,6 @@ StartupXLOG(void)`
`6650`	`6656`	`{`
`6651`	`6657`	`bool recoveryContinue = true;`
`6652`	`6658`	`bool recoveryApply = true;`
`6653`		`-bool recoveryPause = false;`
`6654`	`6659`	`ErrorContextCallback errcontext;`
`6655`	`6660`	`TimestampTz xtime;`
`6656`	`6661`
`@@ -6692,22 +6697,36 @@ StartupXLOG(void)`
`6692`	`6697`	`/* Allow read-only connections if we're consistent now */`
`6693`	`6698`	`CheckRecoveryConsistency();`
`6694`	`6699`
	`6700`	`+/*`
	`6701`	`+ * Pause WAL replay, if requested by a hot-standby session via`
	`6702`	`+ * SetRecoveryPause().`
	`6703`	`+ *`
	`6704`	`+ * Note that we intentionally don't take the info_lck spinlock`
	`6705`	`+ * here. We might therefore read a slightly stale value of`
	`6706`	`+ * the recoveryPause flag, but it can't be very stale (no`
	`6707`	`+ * worse than the last spinlock we did acquire). Since a`
	`6708`	`+ * pause request is a pretty asynchronous thing anyway,`
	`6709`	`+ * possibly responding to it one WAL record later than we`
	`6710`	`+ * otherwise would is a minor issue, so it doesn't seem worth`
	`6711`	`+ * adding another spinlock cycle to prevent that.`
	`6712`	`+ */`
	`6713`	`+if (xlogctl->recoveryPause)`
	`6714`	`+recoveryPausesHere();`
	`6715`	`+`
`6695`	`6716`	`/*`
`6696`	`6717`	`* Have we reached our recovery target?`
`6697`	`6718`	`*/`
`6698`	`6719`	`if (recoveryStopsHere(record, &recoveryApply))`
`6699`	`6720`	`{`
`6700`		`-/*`
`6701`		`- * Pause only if users can connect to send a resume`
`6702`		`- * message`
`6703`		`- */`
`6704`		`-if (recoveryPauseAtTarget && standbyState == STANDBY_SNAPSHOT_READY)`
	`6721`	`+if (recoveryPauseAtTarget)`
`6705`	`6722`	`{`
`6706`	`6723`	`SetRecoveryPause(true);`
`6707`	`6724`	`recoveryPausesHere();`
`6708`	`6725`	`}`
`6709`	`6726`	`reachedStopPoint = true; /* see below */`
`6710`	`6727`	`recoveryContinue = false;`
	`6728`	`+`
	`6729`	`+/* Exit loop if we reached non-inclusive recovery target */`
`6711`	`6730`	`if (!recoveryApply)`
`6712`	`6731`	`break;`
`6713`	`6732`	`}`
`@@ -6740,15 +6759,8 @@ StartupXLOG(void)`
`6740`	`6759`	`*/`
`6741`	`6760`	`SpinLockAcquire(&xlogctl->info_lck);`
`6742`	`6761`	`xlogctl->replayEndRecPtr = EndRecPtr;`
`6743`		`-recoveryPause = xlogctl->recoveryPause;`
`6744`	`6762`	`SpinLockRelease(&xlogctl->info_lck);`
`6745`	`6763`
`6746`		`-/*`
`6747`		`- * Pause only if users can connect to send a resume message`
`6748`		`- */`
`6749`		`-if (recoveryPause && standbyState == STANDBY_SNAPSHOT_READY)`
`6750`		`-recoveryPausesHere();`
`6751`		`-`
`6752`	`6764`	`/*`
`6753`	`6765`	`* If we are attempting to enter Hot Standby mode, process`
`6754`	`6766`	`* XIDs we see`
`@@ -6792,10 +6804,16 @@ StartupXLOG(void)`
`6792`	`6804`	`xlogctl->recoveryLastRecPtr = EndRecPtr;`
`6793`	`6805`	`SpinLockRelease(&xlogctl->info_lck);`
`6794`	`6806`
	`6807`	`+/* Remember this record as the last-applied one */`
`6795`	`6808`	`LastRec = ReadRecPtr;`
`6796`	`6809`
	`6810`	`+/* Exit loop if we reached inclusive recovery target */`
	`6811`	`+if (!recoveryContinue)`
	`6812`	`+break;`
	`6813`	`+`
	`6814`	`+/* Else, try to fetch the next WAL record */`
`6797`	`6815`	`record = ReadRecord(NULL, LOG, false);`
`6798`		`-} while (record != NULL && recoveryContinue);`
	`6816`	`+} while (record != NULL);`
`6799`	`6817`
`6800`	`6818`	`/*`
`6801`	`6819`	`* end of main redo apply loop`
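The reshaped loop exit can be illustrated with a small standalone simulation. This is only a sketch under stated assumptions: WAL records are modeled as plain integers, and `ReplaySim`, `read_record`, `recovery_stops_here` and `replay` are hypothetical names, not PostgreSQL functions. It shows both exits: a non-inclusive target leaves the loop before the record is applied, and an inclusive target leaves it after applying the record but before reading another one.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical model of a WAL stream; not a PostgreSQL structure. */
typedef struct
{
    const int *records;   /* record identifiers, in replay order */
    size_t     count;     /* number of records available */
    size_t     next;      /* index of the next record to read */
    int        reads;     /* how many times a read was attempted */
    int        applied;   /* how many records were applied */
} ReplaySim;

/* Stand-in for reading a record, which may be expensive in the real system. */
static const int *
read_record(ReplaySim *sim)
{
    sim->reads++;
    if (sim->next >= sim->count)
        return NULL;
    return &sim->records[sim->next++];
}

/* Stand-in for the target check: reports a stop and whether to apply the record. */
static bool
recovery_stops_here(int record, int target, bool inclusive, bool *apply)
{
    if (record != target)
        return false;
    *apply = inclusive;
    return true;
}

static void
replay(ReplaySim *sim, int target, bool inclusive)
{
    bool       recovery_continue = true;
    bool       recovery_apply = true;
    const int *record = read_record(sim);

    if (record == NULL)
        return;

    do
    {
        if (recovery_stops_here(*record, target, inclusive, &recovery_apply))
        {
            recovery_continue = false;
            /* Exit loop if we reached a non-inclusive target */
            if (!recovery_apply)
                break;
        }

        sim->applied++;           /* "apply" the record */

        /* Exit loop if we reached an inclusive target: no extra read */
        if (!recovery_continue)
            break;

        /* Else, try to fetch the next record */
        record = read_record(sim);
    } while (record != NULL);
}
```

For records 10, 20, 30, 40 and target 20, an inclusive run applies two records and a non-inclusive run applies one, and both attempt only two reads. With the exit test only in the `while` condition, as before this commit, the inclusive case would have attempted a third read before leaving the loop.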
