|
| 1 | +--- |
| 2 | +description:What is PostgresML? |
| 3 | +coverY:0 |
| 4 | +layout: |
| 5 | +cover: |
| 6 | +visible:false |
| 7 | +size:full |
| 8 | +title: |
| 9 | +visible:true |
| 10 | +description: |
| 11 | +visible:true |
| 12 | +tableOfContents: |
| 13 | +visible:true |
| 14 | +outline: |
| 15 | +visible:true |
| 16 | +pagination: |
| 17 | +visible:true |
| 18 | +--- |
| 19 | + |
1 | 20 | #Overview |
2 | 21 |
|
3 | | -PostgresML supercharges your Postgres database into an end-to-end MLOps platform, seamlessly integrating the key components of the machine learning workflow. Without moving data outside your database, PostgresML allows Postgres to function as a feature store, model store, training engine, and inference service all in one place. This consolidation streamlines building and deploying performant, real-time AI applications for developers. |
| 22 | +[PostgresML](https://postgresml.org/) is a PostgreSQL extension that enables users to build fast, simple and powerful models right inside your database. Users can build chatbots, search engines, forecasting apps and more with the latest NLP, LLM and ML models using the simplicity and power of SQL (and our SDKs).  |
| 23 | + |
| 24 | +###Problem <ahref="#problem"id="problem"></a> |
| 25 | + |
| 26 | +Today's engineers and product teams are increasingly burdened with data hungry ML & AI apps and complex architecture. Several roundtrip network calls must be made across a web of microservices to return an accurate response. This approach to MLops is slower for users, and it’s more complex to build, manage and scale.  |
| 27 | + |
| 28 | +###Solution <ahref="#solution"id="solution"></a> |
| 29 | + |
| 30 | +PostgresML collocates data and compute, so you can save models and index data right in your Postgres database. When you need to predict or train using machine learning, you don’t have to make requests over the internet or even take your data out of your database - which is both slower and a data safety risk. We’ve even added GPUs to our databases to fully leverage the latest technology and algorithms. You can avoid the complexity and latency of microservices when you have a complete MLops platform right in your database. |
| 31 | + |
| 32 | +Instead of patching together MongoDB, Databricks, Pinecone, Huggingface, LangChain and more to get the latest algorithms and LLMs in your app – you can add one extension to your Postgres database and get to market quickly.  |
| 33 | + |
| 34 | + |
| 35 | + |
| 36 | + |
| 37 | + |
| 38 | + |
| 39 | + |
| 40 | +Why Postgres? If it’s not broke, don’t fix it. Postgres is old reliable for a reason. It’s highly efficient, scalable and open-source (Just like all of PostgresML). Plus, you can easily build your AI app using simple SQL. |
| 41 | + |
| 42 | +####➡️ Scalability <ahref="#scalability"id="scalability"></a> |
| 43 | + |
| 44 | +In ML applications, new data is constantly being generated by users and needs to be stored somewhere (feature store). For example, there are long term historical features, short term session level, and real time request level. Postgres can handle both long term with partitioning, table spaces and indexing as well as short term storage, and accept real-time features as parameters in queries, or as additional session level storage already written during the request. |
| 45 | + |
| 46 | +####➡️ Efficiency <ahref="#efficiency"id="efficiency"></a> |
| 47 | + |
| 48 | +In database-ML minimizes latency and computational cost. Postgres can also handle all types of data efficiently; including vectors, geospatial, JSON, timeseries, tabular and text. Our benchmarks show up to**40x faster** improvement over Python microservices. |
| 49 | + |
| 50 | +####➡️ All the latest LLMs, ML + AI algorithms <ahref="#all-the-latest-llms-ml-ai-algorithms"id="all-the-latest-llms-ml-ai-algorithms"></a> |
| 51 | + |
| 52 | +* Torch |
| 53 | +* Tensorflow |
| 54 | +* SCikit Learn |
| 55 | +* XGBoost |
| 56 | +* LightGBM |
| 57 | +* Pre-trained deep learning models from Hugging Face |
| 58 | +* LLama 2 |
| 59 | +* Falcon LLM |
| 60 | +* OpenAI |
| 61 | + |
| 62 | +####➡️ Open-source <ahref="#open-source"id="open-source"></a> |
4 | 63 |
|
5 | | -With PostgresML, your database becomes a full-fledged ML workbench. It supports supervised and unsupervised algorithms like regression, clustering, deep neural networks,andmore. You can build models using SQL on data insidePostgres. Models are stored back into Postgres for low-latency inferences later. |
| 64 | +We’re a totally open-source project. That includes various ML librariesandthe vastPostgres ecosystem. |
6 | 65 |
|
7 | | -PostgresML also unlocked the power of large language models like GPT-3 for your database. With just a few lines of SQL, you can leverage state-of-the-art NLP to build semantic search, analyze text, extract insights, summarize documents, translate text, and more. The possibilities are endless. |
| 66 | +###Future Development <ahref="#future-development"id="future-development"></a> |
8 | 67 |
|
9 | | -PostgresML isopen source but also offered as a fully-managed cloud service. In addition to the SQL API, it provides Javascript, Python, and Rust SDKs to quicklybuildvector search, chatbots,andother ML appsin just a few linesofcode. |
| 68 | +[PostgresML](https://postgresml.org/) isin early stages of development - but we are moving quickly. It’s our vision to help as many app developers as possiblebuild andscale AI appswithout allofthe complexity. |
10 | 69 |
|
11 | | -To scale horizontally, PostgresML utilizes PgCat, an advanced PostgreSQL proxy and load balancer. PgCat enables sharding, load balancing, failover, and mirroringtoachieve extremely high throughput and low latency. By keepingtheentire machine learning workflow within Postgres, PostgresML avoids expensive network calls between disparate systems. This allows PostgresML to handle millions of requests per second at up to 40x the speed of other platforms. PgCat and Postgres replication deliver seamless scaling while retaining transactional integrity. |
| 70 | +If you’d like to stay up-to-date with our progress or contributeto theproject (feedback, questions and comments are all welcome) you can check us out on[Github](https://github.com/postgresml/postgresml) or chat with us anytime on our[Discord](https://discord.com/invite/DmyJP3qJ7U). |