-
Notifications
You must be signed in to change notification settings - Fork 130
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support for Apache Beam #50
Comments
This has been under active consideration for quite some time now. Part of the issue is that Beam's model and Trill's are substantially different in several ways, enough that it would almost require a redesign or reimplementation of Trill into something far more DataFlow-like to do, which would eliminate most of what Trill does really well. That said, there is always the possibility that we could find some innovative way to support both. Another issue has been that we've seen other implementations of Beam have remarkably bad performance over their native implementations, enough that they tend to lose interest quickly. I'd love to get a conversation going on this though. What I would like to know, if possible, is:
|
Some initial feedback...
Think this article is applicable: Why Apache Beam? A Google Perspective |
A further discussion stimulator... Batch as a Special Case of Streaming |
This is definitely good conversation fodder, and thank you. I think the biggest question here is if we go forward with a Beam API layer, where in the architecture would it sit? My immediate thought is that it would be atop Trill and not inside it, but that is certainly debatable. As for batch as a special case of streaming, you've got no argument from me there. :-) |
I would use either implementation in a large stream processing application if available today |
Consider adding support for Apache Beam's unified model for defining both batch and streaming data-parallel processing pipelines.
The text was updated successfully, but these errors were encountered: