Project
SWRO desalination ML on Databricks
Machine learning and distributed processing applied to desalination data using Spark on Databricks.
- Context
- Public data engineering and machine learning project documented on Medium.
- Problem
- Industrial datasets require exploration, processing and modeling steps that can scale beyond local notebooks.
- Solution
- Used Apache Spark on Databricks to process desalination data and apply machine learning techniques.
- Impact
- Connects data engineering foundations with applied ML on a lakehouse platform.
Stack
DatabricksApache SparkMachine learningPython
Links
The project is a useful example of combining domain data, Spark processing and modeling work.