TalkDate: 26.10 / Start: 00:00 – Finish: 00:00

Spark magic: How high-level pipelines become distributed hardcore

In Russian

Spark is the most popular tool for building data pipelines. Every data engineer knows Spark, blah-blah-blah… OK, but Spark is just a distributed Java Streams, right? But how does it work then? Oh, it turns out you can't just call "flatMap" or "groupBy" to a remote machine. Codegen! Interested? Come and find more!

#big data
#codegen
#kotlin

Speakers

Pasha Finkelstein
JetBrains

Invited experts

Evgeny Mandrikov
SonarSource

Schedule

Spark magic: How high-level pipelines become distributed hardcore

Speakers

Pasha Finkelstein

Invited experts

Evgeny Mandrikov