NotificationsYou must be signed in to change notification settings
Fork5
Star26

Commit63f591e

committed

Add text to "Populating a Database" pointing out that bulk data load into a

table with foreign key constraints eats memory. Per off-line discussion ofbug #5480 with its reporter. Also do some minor wordsmithing elsewhere inthe same section.

1 parentd800b03 commit63f591eCopy full SHA for 63f591e

File tree

1 file changed

+36

-17

lines changed

doc/src/sgml
- perform.sgml

1 file changed

+36

-17

lines changed

`‎doc/src/sgml/perform.sgml`

Lines changed: 36 additions & 17 deletions

Original file line number	Diff line number	Diff line change
`@@ -1,4 +1,4 @@`
`1`		`-<!-- $PostgreSQL: pgsql/doc/src/sgml/perform.sgml,v 1.79 2010/04/28 21:23:29 tgl Exp $ -->`
	`1`	`+<!-- $PostgreSQL: pgsql/doc/src/sgml/perform.sgml,v 1.80 2010/05/29 21:08:04 tgl Exp $ -->`
`2`	`2`
`3`	`3`	`<chapter id="performance-tips">`
`4`	`4`	`<title>Performance Tips</title>`
`@@ -870,11 +870,11 @@ SELECT * FROM x, y, a, b, c WHERE something AND somethingelse;`
`870`	`870`
`871`	`871`	`<para>`
`872`	`872`	`If you are adding large amounts of data to an existing table,`
`873`		`- it might be a win to drop theindex,`
`874`		`- load the table, and then recreate theindex. Of course, the`
	`873`	`+ it might be a win to drop theindexes,`
	`874`	`+ load the table, and then recreate theindexes. Of course, the`
`875`	`875`	`database performance for other users might suffer`
`876`		`- during the time theindex is missing. One should also think`
`877`		`- twice before dropping uniqueindexes, since the error checking`
	`876`	`+ during the time theindexes are missing. One should also think`
	`877`	`+ twice before droppingauniqueindex, since the error checking`
`878`	`878`	`afforded by the unique constraint will be lost while the index is`
`879`	`879`	`missing.`
`880`	`880`	`</para>`
`@@ -890,6 +890,19 @@ SELECT * FROM x, y, a, b, c WHERE something AND somethingelse;`
`890`	`890`	`the constraints. Again, there is a trade-off between data load`
`891`	`891`	`speed and loss of error checking while the constraint is missing.`
`892`	`892`	`</para>`
	`893`	`+`
	`894`	`+ <para>`
	`895`	`+ What's more, when you load data into a table with existing foreign key`
	`896`	`+ constraints, each new row requires an entry in the server's list of`
	`897`	`+ pending trigger events (since it is the firing of a trigger that checks`
	`898`	`+ the row's foreign key constraint). Loading many millions of rows can`
	`899`	`+ cause the trigger event queue to overflow available memory, leading to`
	`900`	`+ intolerable swapping or even outright failure of the command. Therefore`
	`901`	`+ it may be <emphasis>necessary</>, not just desirable, to drop and re-apply`
	`902`	`+ foreign keys when loading large amounts of data. If temporarily removing`
	`903`	`+ the constraint isn't acceptable, the only other recourse may be to split`
	`904`	`+ up the load operation into smaller transactions.`
	`905`	`+ </para>`
`893`	`906`	`</sect2>`
`894`	`907`
`895`	`908`	`<sect2 id="populate-work-mem">`
`@@ -930,11 +943,11 @@ SELECT * FROM x, y, a, b, c WHERE something AND somethingelse;`
`930`	`943`	`When loading large amounts of data into an installation that uses`
`931`	`944`	`WAL archiving or streaming replication, it might be faster to take a`
`932`	`945`	`new base backup after the load has completed than to process a large`
`933`		`- amount of incremental WAL data.You might want to disable archiving`
`934`		`- and streaming replication while loading, by setting`
	`946`	`+ amount of incremental WAL data. To prevent incremental WAL logging`
	`947`	`+while loading, disable archivingand streaming replication, by setting`
`935`	`948`	`<xref linkend="guc-wal-level"> to <literal>minimal</>,`
`936`		`- <xref linkend="guc-archive-mode"> <literal>off</>, and`
`937`		`- <xref linkend="guc-max-wal-senders"> to zero).`
	`949`	`+ <xref linkend="guc-archive-mode">to<literal>off</>, and`
	`950`	`+ <xref linkend="guc-max-wal-senders"> to zero.`
`938`	`951`	`But note that changing these settings requires a server restart.`
`939`	`952`	`</para>`
`940`	`953`
`@@ -1006,7 +1019,8 @@ SELECT * FROM x, y, a, b, c WHERE something AND somethingelse;`
`1006`	`1019`	`<application>pg_dump</> dump as quickly as possible, you need to`
`1007`	`1020`	`do a few extra things manually. (Note that these points apply while`
`1008`	`1021`	`<emphasis>restoring</> a dump, not while <emphasis>creating</> it.`
`1009`		`- The same points apply when using <application>pg_restore</> to load`
	`1022`	`+ The same points apply whether loading a text dump with`
	`1023`	`+ <application>psql</> or using <application>pg_restore</> to load`
`1010`	`1024`	`from a <application>pg_dump</> archive file.)`
`1011`	`1025`	`</para>`
`1012`	`1026`
`@@ -1027,10 +1041,11 @@ SELECT * FROM x, y, a, b, c WHERE something AND somethingelse;`
`1027`	`1041`	`<listitem>`
`1028`	`1042`	`<para>`
`1029`	`1043`	`If using WAL archiving or streaming replication, consider disabling`
`1030`		`- them during the restore. To do that, set <varname>archive_mode</> off,`
	`1044`	`+ them during the restore. To do that, set <varname>archive_mode</>`
	`1045`	`+ to <literal>off</>,`
`1031`	`1046`	`<varname>wal_level</varname> to <literal>minimal</>, and`
`1032`		`- <varname>max_wal_senders</> zero before loading the dump script,`
`1033`		`-and afterwards set them back to the right values and take a fresh`
	`1047`	`+ <varname>max_wal_senders</>tozero before loading the dump.`
	`1048`	`+Afterwards, set them back to the right values and take a fresh`
`1034`	`1049`	`base backup.`
`1035`	`1050`	`</para>`
`1036`	`1051`	`</listitem>`
`@@ -1044,10 +1059,14 @@ SELECT * FROM x, y, a, b, c WHERE something AND somethingelse;`
`1044`	`1059`	`possibly discarding many hours of processing. Depending on how`
`1045`	`1060`	`interrelated the data is, that might seem preferable to manual cleanup,`
`1046`	`1061`	`or not. <command>COPY</> commands will run fastest if you use a single`
`1047`		`- transaction and have WAL archiving turned off.`
`1048`		`- <application>pg_restore</> also has a <option>--jobs</> option`
`1049`		`- which allows concurrent data loading and index creation, and has`
`1050`		`- the performance advantages of doing COPY in a single transaction.`
	`1062`	`+ transaction and have WAL archiving turned off.`
	`1063`	`+ </para>`
	`1064`	`+ </listitem>`
	`1065`	`+ <listitem>`
	`1066`	`+ <para>`
	`1067`	`+ If multiple CPUs are available in the database server, consider using`
	`1068`	`+ <application>pg_restore</>'s <option>--jobs</> option. This`
	`1069`	`+ allows concurrent data loading and index creation.`
`1051`	`1070`	`</para>`
`1052`	`1071`	`</listitem>`
`1053`	`1072`	`<listitem>`

0 commit comments

Comments

(0)

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Commit63f591e

File tree

1 file changed

1 file changed

`‎doc/src/sgml/perform.sgml`

0 commit comments