raystack/dagger


Dagger is an easy-to-use, configuration over code, cloud-native framework built on top of Apache Flink for stateful processing of real-time streaming data.


Dagger

Dagger, or Data Aggregator, is an easy-to-use, configuration-over-code, cloud-native framework built on top of Apache Flink for stateful processing of data. With Dagger, you don't need to write custom applications or complicated code to process data as a stream. Instead, you can write SQL queries and UDFs to do the processing and analysis on streaming data.

Key Features

Discover why you should use Dagger:

Processing: Dagger can transform, aggregate, join and enrich streaming data, both real-time and historical.
Scale: Dagger scales in an instant, both vertically and horizontally, for high-performance streaming sinks and zero data drops.
Extensibility: Add your own sink to Dagger with a clearly defined interface or choose from the ones already provided. Use Kafka and/or Parquet files as stream sources.
Flexibility: Add custom business logic in the form of plugins (UDFs, Transformers, Preprocessors and Post Processors) independent of the core logic.
Metrics: Always know what's going on with your deployment with built-in monitoring of throughput, response times, errors and more.

What problems does Dagger solve?

Map reduce -> SQL
Enrichment -> Post Processors
Aggregation -> SQL, UDFs
Masking -> Hash Transformer
Deduplication -> Deduplication Transformer
Realtime long window processing -> Longbow
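
To make the masking entry above more concrete, here is a minimal plain-Java sketch of the general idea: replacing a sensitive field with a one-way hash so the raw value never reaches the sink. This is not Dagger's Hash Transformer and does not use any Dagger or Flink API. The class name `MaskingSketch`, the helper names, the field names, and the choice of SHA-256 are all assumptions made for illustration.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.LinkedHashMap;
import java.util.Map;

// Illustrative only: a hypothetical helper, not Dagger's Hash Transformer.
class MaskingSketch {

    // Returns the lowercase hex SHA-256 digest of the given value.
    static String sha256Hex(String value) {
        try {
            MessageDigest digest = MessageDigest.getInstance("SHA-256");
            byte[] hash = digest.digest(value.getBytes(StandardCharsets.UTF_8));
            StringBuilder hex = new StringBuilder(hash.length * 2);
            for (byte b : hash) {
                hex.append(String.format("%02x", b));
            }
            return hex.toString();
        } catch (NoSuchAlgorithmException e) {
            // SHA-256 support is mandatory on every Java platform.
            throw new IllegalStateException(e);
        }
    }

    // Returns a copy of the record with the named field replaced by its hash.
    // Fields that are absent are left untouched; the input map is not modified.
    static Map<String, String> maskField(Map<String, String> record, String field) {
        Map<String, String> masked = new LinkedHashMap<>(record);
        masked.computeIfPresent(field, (key, value) -> sha256Hex(value));
        return masked;
    }

    public static void main(String[] args) {
        // Hypothetical event with one sensitive field.
        Map<String, String> event = new LinkedHashMap<>();
        event.put("order_id", "1234");
        event.put("customer_email", "user@example.com");
        System.out.println(maskField(event, "customer_email"));
    }
}
```

Because the hash is deterministic, equal inputs map to equal masked values, so masked fields can still be used for grouping or joining downstream. In Dagger itself, masking is configured through the Hash Transformer rather than written by hand; see the project documentation for its actual configuration.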

To learn more, follow the detailed documentation.

Usage

Explore the following resources to get started with Dagger:

Guides provides guidance on creating Dagger with different sinks.
Concepts describes all important Dagger concepts.
Advance contains details regarding advanced features of Dagger.
Reference contains details about configurations, metrics and other aspects of Dagger.
Contribute contains resources for anyone who wants to contribute to Dagger.
Usecase describes example use cases that can be solved with Dagger.
Examples contains tutorials to try out some of Dagger's features with real-world use cases.

Running locally

Please follow this Dagger Quickstart Guide to set up a locally running Dagger that consumes from Kafka, or to set up Docker Compose for Dagger.

Note: Sample configuration for running a basic Dagger can be found here. For detailed configurations, refer here.

Find more detailed steps on local setup here.

Running on cluster

Refer here for details regarding Dagger deployment.

Running tests

# Running unit tests
$ ./gradlew clean test

# Run code quality checks
$ ./gradlew checkstyleMain checkstyleTest

# Cleaning the build
$ ./gradlew clean

Contribute

Development of Dagger happens in the open on GitHub, and we are grateful to the community for contributing bug fixes and improvements. Read below to learn how you can take part in improving Dagger.

Read our contributing guide to learn about our development process, how to propose bug fixes and improvements, and how to build and test your changes to Dagger.

To help you get your feet wet and become familiar with our contribution process, we have a list of good first issues that contain bugs with a relatively limited scope. This is a great place to get started.

Credits

This project exists thanks to all the contributors.

License

Dagger is Apache 2.0 licensed.

About

Website: raystack.github.io/dagger/

Releases: 26 (latest: v0.7.0, Jul 15, 2023)

Contributors: 21

Languages: Java 99.9%, Other 0.1%
