You signed in with another tab or window.Reload to refresh your session.You signed out in another tab or window.Reload to refresh your session.You switched accounts on another tab or window.Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: pgml-docs/docs/guides/deploying-postgresml/self-hosting/README.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -31,7 +31,7 @@ Finally, you can install PostgresML:
31
31
sudo apt install -y postgresml-14
32
32
```
33
33
34
-
Ubuntu 22.04 ships with PostgreSQL 14, but if you have a different version installed on your system, just change`14` in the package name to your Postgres version. We currently support all versions supported by the community: Postgres 12 through16.
34
+
Ubuntu 22.04 ships with PostgreSQL 14, but if you have a different version installed on your system, just change`14` in the package name to your Postgres version. We currently support all versions supported by the community: Postgres 12 through15.
Copy file name to clipboardExpand all lines: pgml-docs/docs/guides/deploying-postgresml/self-hosting/building-from-source.md
+2-2Lines changed: 2 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -40,12 +40,12 @@ For a typical deployment in production, you would need to compile and install th
40
40
41
41
####Install pgrx
42
42
43
-
`pgrx` is open source and available from crates.io. We are currently using the`0.11.0` version. It's important that your`pgrx` version matches what we're using, since there are some hard dependencies between our code and`pgrx`.
43
+
`pgrx` is open source and available from crates.io. We are currently using the`0.10.0` version. It's important that your`pgrx` version matches what we're using, since there are some hard dependencies between our code and`pgrx`.
44
44
45
45
To install`pgrx`, simply run:
46
46
47
47
```
48
-
cargo install cargo-pgrx --version "0.11.0"
48
+
cargo install cargo-pgrx --version "0.10.0"
49
49
```
50
50
51
51
Before using`pgrx`, it needs to be initialized against the installed version of PostgreSQL. In this example, we'll be using the Ubuntu 22.04 default PostgreSQL 14 installation:
Copy file name to clipboardExpand all lines: pgml-docs/docs/guides/machine-learning/supervised-learning/data-pre-processing.md
+4-4Lines changed: 4 additions & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -25,9 +25,9 @@ In this example:
25
25
26
26
There are 3 steps to preprocessing data:
27
27
28
-
*[Encoding](#categorical-encodings) categorical values into quantitative values
29
-
*[Imputing](#imputing-missing-values) NULL values to some quantitative value
30
-
*[Scaling](#scaling-values) quantitative values across all variables to similar ranges
28
+
*[Encoding](../../../../../pgml-dashboard/content/docs/guides/training/preprocessing.md#categorical-encodings) categorical values into quantitative values
29
+
*[Imputing](../../../../../pgml-dashboard/content/docs/guides/training/preprocessing.md#imputing-missing-values) NULL values to some quantitative value
30
+
*[Scaling](../../../../../pgml-dashboard/content/docs/guides/training/preprocessing.md#scaling-values) quantitative values across all variables to similar ranges
31
31
32
32
These preprocessing steps may be specified on a per-column basis to the[train()](../../../../../docs/guides/training/overview) function. By default, PostgresML does minimal preprocessing on training data, and will raise an error during analysis if NULL values are encountered without a preprocessor. All types other than`TEXT` are treated as quantitative variables and cast to floating point representations before passing them to the underlying algorithm implementations.
33
33
@@ -71,7 +71,7 @@ Encoding categorical variables is an O(N log(M)) where N is the number of rows,
|`none`|**Default** - Casts the variable to a 32-bit floating point representation compatible with numerics. This is the default for non-`TEXT` values.|
74
-
|`target`| Encodes the variable as theaverage value of the target label for all members of the category. This is the default for`TEXT` variables.|
74
+
|`target`| Encodes the variable as themean value of the target label for all members of the category. This is the default for`TEXT` variables.|
75
75
|`one_hot`| Encodes the variable as multiple independent boolean columns.|
76
76
|`ordinal`| Encodes the variable as integer values provided by their position in the input array. NULLS are always 0.|
By default the SDK uses HNSW indexes to efficiently perform vector recall. The default HNSW index sets`m` to 16 and`ef_construction` to 64. These defaults can be customized when the Pipeline is created.