Commit a93b3b9

Fix bug in the tsvector stats collection function, which caused a crash if the sample contains just one tsvector, containing only one lexeme.
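Judging from the diff in this commit, the reads of `sort_table[num_mcelem-1]` and `sort_table[0]` were moved inside the existing `num_mcelem > 0` check, which suggests the crash came from indexing the table when it held no entries. The following is a minimal sketch of that guard pattern, not the PostgreSQL code itself: the one-field `TrackItem` and the helper `frequency_bounds` are hypothetical stand-ins introduced only for illustration.

```c
/* Hypothetical stand-in for a tracked lexeme; only the field used here. */
typedef struct
{
	int			frequency;
} TrackItem;

/*
 * Read the smallest and largest frequencies from a table that is assumed
 * to be sorted by descending frequency.
 *
 * Returns 0 and leaves the outputs untouched when the table is empty,
 * rather than evaluating sort_table[num_mcelem - 1] with num_mcelem == 0.
 * Returns 1 when the outputs were filled in.
 */
static int
frequency_bounds(TrackItem **sort_table, int num_mcelem,
				 int *minfreq, int *maxfreq)
{
	if (num_mcelem <= 0)
		return 0;

	*minfreq = sort_table[num_mcelem - 1]->frequency;
	*maxfreq = sort_table[0]->frequency;
	return 1;
}
```

A caller would check the return value before using `minfreq` and `maxfreq`, which mirrors how the commit only computes them when there is at least one element to store.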

1 parent fb645f6, commit a93b3b9


`src/backend/tsearch/ts_typanalyze.c`

Lines changed: 22 additions & 21 deletions

Original file line number	Diff line number	Diff line change
`@@ -7,7 +7,7 @@`
`7`	`7`	`*`
`8`	`8`	`*`
`9`	`9`	`* IDENTIFICATION`
`10`		`- * $PostgreSQL: pgsql/src/backend/tsearch/ts_typanalyze.c,v 1.2 2008/09/19 19:03:40 tgl Exp $`
	`10`	`+ * $PostgreSQL: pgsql/src/backend/tsearch/ts_typanalyze.c,v 1.3 2008/11/27 21:17:39 heikki Exp $`
`11`	`11`	`*`
`12`	`12`	`*-------------------------------------------------------------------------`
`13`	`13`	`*/`
`@@ -290,33 +290,34 @@ compute_tsvector_stats(VacAttrStats *stats,`
`290`	`290`	`if (num_mcelem > track_len)`
`291`	`291`	`num_mcelem = track_len;`
`292`	`292`
`293`		`-/* Grab the minimal and maximal frequencies that will get stored */`
`294`		`-minfreq = sort_table[num_mcelem - 1]->frequency;`
`295`		`-maxfreq = sort_table[0]->frequency;`
`296`		`-`
`297`		`-/*`
`298`		`- * We want to store statistics sorted on the lexeme value using first`
`299`		`- * length, then byte-for-byte comparison. The reason for doing length`
`300`		`- * comparison first is that we don't care about the ordering so long`
`301`		`- * as it's consistent, and comparing lengths first gives us a chance`
`302`		`- * to avoid a strncmp() call.`
`303`		`- *`
`304`		`- * This is different from what we do with scalar statistics -- they get`
`305`		`- * sorted on frequencies. The rationale is that we usually search`
`306`		`- * through most common elements looking for a specific value, so we can`
`307`		`- * grab its frequency. When values are presorted we can employ binary`
`308`		`- * search for that. See ts_selfuncs.c for a real usage scenario.`
`309`		`- */`
`310`		`-qsort(sort_table, num_mcelem, sizeof(TrackItem *),`
`311`		`-trackitem_compare_lexemes);`
`312`		`-`
`313`	`293`	`/* Generate MCELEM slot entry */`
`314`	`294`	`if (num_mcelem > 0)`
`315`	`295`	`{`
`316`	`296`	`MemoryContext old_context;`
`317`	`297`	`Datum *mcelem_values;`
`318`	`298`	`float4 *mcelem_freqs;`
`319`	`299`
	`300`	`+/* Grab the minimal and maximal frequencies that will get stored */`
	`301`	`+minfreq = sort_table[num_mcelem - 1]->frequency;`
	`302`	`+maxfreq = sort_table[0]->frequency;`
	`303`	`+`
	`304`	`+/*`
	`305`	`+ * We want to store statistics sorted on the lexeme value using`
	`306`	`+ * first length, then byte-for-byte comparison. The reason for`
	`307`	`+ * doing length comparison first is that we don't care about the`
	`308`	`+ * ordering so long as it's consistent, and comparing lengths first`
	`309`	`+ * gives us a chance to avoid a strncmp() call.`
	`310`	`+ *`
	`311`	`+ * This is different from what we do with scalar statistics -- they`
	`312`	`+ * get sorted on frequencies. The rationale is that we usually`
	`313`	`+ * search through most common elements looking for a specific`
	`314`	`+ * value, so we can grab its frequency. When values are presorted`
	`315`	`+ * we can employ binary search for that. See ts_selfuncs.c for a`
	`316`	`+ * real usage scenario.`
	`317`	`+ */`
	`318`	`+qsort(sort_table, num_mcelem, sizeof(TrackItem *),`
	`319`	`+trackitem_compare_lexemes);`
	`320`	`+`
`320`	`321`	`/* Must copy the target values into anl_context */`
`321`	`322`	`old_context = MemoryContextSwitchTo(stats->anl_context);`
`322`	`323`
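The comment moved by this diff describes sorting the stored elements by lexeme, length first and then byte-for-byte, so that a later lookup can use binary search. The sketch below illustrates that idea under stated assumptions. The definitions of `TrackItem` and `trackitem_compare_lexemes` are not shown in the diff, so `LexemeEntry`, `compare_lexemes`, `sort_by_lexeme`, and `lookup_frequency` are hypothetical names with a simplified layout, not the actual PostgreSQL implementation.

```c
#include <stdlib.h>
#include <string.h>

/* Hypothetical, simplified entry; the real TrackItem layout is not shown. */
typedef struct
{
	const char *lexeme;
	size_t		length;
	float		frequency;
} LexemeEntry;

/*
 * Compare two table slots (each slot holds a pointer to an entry).
 * Length is compared first, so the byte comparison only runs when the
 * lengths match. Any consistent ordering would work for binary search.
 */
static int
compare_lexemes(const void *a, const void *b)
{
	const LexemeEntry *x = *(LexemeEntry *const *) a;
	const LexemeEntry *y = *(LexemeEntry *const *) b;

	if (x->length != y->length)
		return (x->length < y->length) ? -1 : 1;
	return memcmp(x->lexeme, y->lexeme, x->length);
}

/* Sort the table once so that later lookups can use binary search. */
static void
sort_by_lexeme(LexemeEntry **table, size_t n)
{
	qsort(table, n, sizeof(LexemeEntry *), compare_lexemes);
}

/* Return the stored frequency of a lexeme, or -1 if it is not present. */
static float
lookup_frequency(LexemeEntry **table, size_t n, const char *lexeme)
{
	LexemeEntry key = {lexeme, strlen(lexeme), 0.0f};
	LexemeEntry *keyp = &key;
	LexemeEntry **hit = bsearch(&keyp, table, n, sizeof(LexemeEntry *),
								compare_lexemes);

	return hit ? (*hit)->frequency : -1.0f;
}
```

Using the same comparator for both `qsort` and `bsearch` is what keeps the lookup consistent with the stored order. The resulting order is not alphabetical, since a shorter lexeme always sorts before a longer one.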
