• Sessions appear in the color of their primary track
  • Sessions can be filtered using Products on the right
  • Use the Search bar for more flexibility
See this link for hints on how to search the schedule or sign up for sessions
Back To Schedule
Tuesday, March 20 • 10:10am - 11:00am
Path to Data Architect: Hadoop, Spark, Flink, and Beam Explained to Oracle DBAs
Feedback form is now closed.
Big Data is growing exponentially, requiring massive-scale infrastructure however, business analytics has shifted from reactive to proactive analysis; this is the era of streaming data (a.k.a. Fast Data). Apache Hadoop is very good for analyzing data at rest but cannot handle streaming data.
Big Data analytics needs new Big data frameworks. Apache Spark brings in-memory processing and RDD data abstraction which allow real-time processing of streaming data however its micro batch architecture incurs high latency. Apache Flink brings low latency and could address Spark limitations however it is not as mature and largely adopted as Spark.
Apache Beam promotes its portable Beam model across Big data frameworks (Spark, Flink, Dataflow).
Tis session presents and overview of the major Big Data frameworks and suggests that DBA should embrace these frameworks and expand their skills as a necessary path to becoming Data Architects.

avatar for Kuassi Mensah

Kuassi Mensah

Director Product Management, Oracle Corporation
Kuassi is Director of Product Management at Oracle. He looks after the following product areas (i) Java connectivity to DB (Cloud, on-premises), in-place processing with DB embedded JVM (ii) MicroServices and DB connectivity, and related topics (Data & Tx models, Kubernetes, SAGAs... Read More →

Tuesday March 20, 2018 10:10am - 11:00am PDT
4-Rm 104