pubkey/event-reducePublic

NotificationsYou must be signed in to change notification settings
Fork23
Star749

An algorithm to optimize database queries that run multiple timeshttps://pubkey.github.io/event-reduce/

License

MIT license

749 stars 23 forks Branches Tags Activity

Star

Notifications

You must be signed in to change notification settings

Branches Tags

Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 707 Commits
.github		.github
docs		docs
examples/browser		examples/browser
javascript		javascript
orga		orga
.nvmrc		.nvmrc
LICENSE		LICENSE
README.md		README.md
renovate.json		renovate.json

Repository files navigation

Event-Reduce

An algorithm to optimize database queries that run multiple times

1. You make a query to the database which returns the result in 100 milliseconds
2. A write event occurs on the database and changes some data
3. To get the new version of the query's results you now have three options:

a. Run the query over the database again which takes another 100 milliseconds
b. Write complex code that somehow merges the incoming event with the old state
c. UseEvent-Reduce to calculate the new results on the CPU without disc-IOnearly instant

Efficiency

In thebrowser demo you can see that for randomly generated events, about94% of them could be optimized by EventReduce. In real world usage, with non-random events, this can be even higher. For the different implementations in common browser databases, we can observe an up to12 times faster displaying of new query results after a write occurred.

How they do it

EventReduce uses 18 differentstate functions to 'describe' an event+previousResults combination. A state function is a function that returns a boolean value likeisInsert(),wasResultsEmpty(),sortParamsChanged() and so on.

Also there are 16 differentaction functions. An action function gets the event+previousResults and modifies the results array in a given way likeinsertFirst(),replaceExisting(),insertAtSortPosition(),doNothing() and so on.

For each of our2^19 state combinations, we calculate which action function gives the same results that the database would return when the full query is executed again.

From this state-action combinations we create a big truth table that is used to create abinary decision diagram. The BDD is then optimized to call as fewstate functions as possible to determine the correct action of an incoming event-results combination.

The resulting optimized BDD is then shipped as the EventReduce algoritm and can be used in different programming languages and implementations. The programmer does not need to know about all this optimisation stuff and can directly use three simple functions like shown in thejavascript implementation

When to use this

You can use this to

reduce the latency until a change to the database updates your application
make observing query results more scalable by doing less disk-io
reduce the bandwith when streaming realtime query results from the backend to the client
create a better form of caching where instead of invalidating the cache on write, you update its content

Limitations

EventReduce only works with queries that have apredictable sort-order for any given documents. (you can make any query predicable by adding the primary key as last sort parameter)
EventReduce can be used with relational databases but not on relational queries that run over multiple tables/collections. (you can use views as workarround so that you can query over only one table). In theory Event-Reduce could also be used for relational queries but I did not need this for now. Also it takes about one week on an average machine to run all optimizations, and having more state functions looks like an NP problem.

Implementations

At the moment there is only theJavaScript implementation that you can use over npm. Pull requests for other languages are welcomed.

Previous Work

Meteor uses a feature calledOplogDriver that is limited on queries that do not useskip orsort. Also watchthis video to learn how OpLogDriver works.
RxDB used theQueryChangeDetection which works by many handwritten if-else comparisons. RxDB switched to EventReduce since version9.0.0.
Baqend iscreating a database that optimizes for realtime queries. Watch the videoReal-Time Databases Explained: Why Meteor, RethinkDB, Parse & Firebase Don't Scale to learn more.

FAQ

Is this something like materialized views?

Yes and no. Materialized views solve a similar problem but in a different way with different trade-offs. When you have many users, all subscribing todifferent queries, you cannot create that many views because they are all recalculated on each write access to the database. EventReduce however has better scalability because it does not affect write performance and the calculation is done when the fresh query results are requested, not beforehand.

Is this something like event sourcing or CQRS?

No, event sourcing is mostly used to calculate a current state by attaching the full event stream to the starting state. This allows for stuff like time travel and so on. EventReduce solves a completely different (performance-) problem and only shares some common keywords likeevent.

Isn't this optimization already done by database engines?

No. I tested EventReduce with many common databases like MongoDB, MySQL and Postgres. Each of them had better performance with Event-Reduce then just observing the eventstream and running the queries again. If you understand what Event-Reduce exactly does, it comes clear that this optimization can not done by pull-based databases because they have missing information.

Isn't this the same as product XY?

No. EventReduce is not a product, it is not comparable to any database or streaming backend out there. EventReduce is an algorithm with a specific input and output, nothing more, nothing less. You can use EventReduce without having to change your underlaying data infrastructure.

About

An algorithm to optimize database queries that run multiple timeshttps://pubkey.github.io/event-reduce/

pubkey.github.io/event-reduce

Releases3

5.1.0 Latest

Dec 12, 2023

+ 2 releases

Sponsor this project

Learn more about GitHub Sponsors

Packages

No packages published

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

License

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Event-Reduce

Efficiency

How they do it

When to use this

Limitations

Implementations

Previous Work

FAQ

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases3

Sponsor this project

Uh oh!

Packages

Uh oh!

Contributors5

Uh oh!

Languages

Movatterモバイル変換

Uh oh!

License

pubkey/event-reduce

Folders and files

Latest commit

History

Repository files navigation

Event-Reduce

Efficiency

How they do it

When to use this

Limitations

Implementations

Previous Work

FAQ

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases3

Sponsor this project

Uh oh!

Packages0

Uh oh!

Contributors5

Uh oh!

Languages

Packages