-
Notifications
You must be signed in to change notification settings - Fork 1.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
December 2024 ASF Board Report #10157
Comments
Here is a draft report: https://docs.google.com/document/d/1b_C8uwMJVSrw9N1Oc8_fzFdpT0YExaRiuXJ8ulAXaYs/edit?tab=t.0 @andygrove is there any chance you can help with the Comet section? |
@alamb, @andygrove quick drafts summary for ballista, feel free to modify as necessary: As described in apache/datafusion-ballista#1066 and announced by @andygrove in
40+ commits later, we have API which can make datafusion applications distributed with single line change: use ballista::prelude::*;
use datafusion::prelude::*;
#[tokio::main]
async fn main() -> datafusion::error::Result<()> {
// create DataFusion SessionContext with ballista standalone cluster started
let ctx = datafusion::prelude::SessionContext::standalone();
ctx.register_csv("example", "tests/data/example.csv", CsvReadOptions::new()).await?;
let df = ctx.sql("SELECT a, MIN(b) FROM example WHERE a <= b GROUP BY a LIMIT 100").await?;
df.show().await?;
Ok(())
} and ongoing planning for next release apache/datafusion-ballista#974. Also, benchmark result has been updated, showing huge benefit keeping up with latest datafusion Short term focus would be:
|
Wow that is pretty amazing. Thank you @milenkovicm I have added a link to your update in the report. very cool |
I made a ticket to track the next release and took a pass over the document today. I plan to submit the doc tomorrow Dec 11 per the plan |
Here is the final report that I submitted. Thanks to @phillipleblanc @andygrove @timsaucer @milenkovicm for the help writing it 🙏
|
Is your feature request related to a problem or challenge?
Per https://whimsy.apache.org/roster/committee/datafusion the DataFusion ASF board report schedule is
March, June, September, December
Describe the solution you'd like
I would like to draft a board report for the ASF board meeting, ideally with community help.
The meetings are typically in the second or third week of the month
Describe alternatives you've considered
I plan to do this in the same style that worked well in Arrow (see an example from @andygrove
here https://lists.apache.org/thread/7w4mgy98qomc6drvj2fo81gvhq6p0boc) -- make a google doc (or issue) that people can add relevant content to and then the chair (me for the time being) submits it to the board
Additional context
No response
The text was updated successfully, but these errors were encountered: