Apache Spark Complex Event Processing, Training and SparkSQL Datawarehouse
A link sheet for those who want to learn more about using Apache Spark and data warehousing.
Join the DZone community and get the full member experience.Join For Free
Apache Spark and Machine learning can solve a lot of problems and can discover patterns leading to rules replacement. But for some simple cases and for legacy rules requires, you may still need to hand develop rules and implement in Java. For those cases, Drools can be used with Spark. Besides machine learning, CEP or complex event processing can be used as a replacement for standard IF-THEN-ELSE style hardcoded rules.
Siddhi Complex Event Processing Engine
Use Complex Event Processing (CEP) instead of rules:
Decision CEP Engine - complex event processing platform build on spark streaming
Business Rules - JRules Data Mining
There's a ton of great presentations, webinars and short online courses for learning Spark. Here are a few that I recommend.
More Free Spark Training
Build and Monitor Spark Applications
Create Data Pipelines with Spark
When implementing a SQL Datawarehouse on Hadoop with Spark, here are a few useful starters.
SQL Data Warehouse With Spark
Spark Streaming for Robust Apps
Memory-centric Distributed Storage System
Alluxio is proving to be a great general purpose in memory file system.
Opinions expressed by DZone contributors are their own.
Scaling Site Reliability Engineering (SRE) Teams the Right Way
Design Patterns for Microservices: Ambassador, Anti-Corruption Layer, and Backends for Frontends
MLOps: Definition, Importance, and Implementation
Redefining DevOps: The Transformative Power of Containerization