Skip to content
Rafael Vera Marañón Senior Data Engineer & Data Architect

Project

SWRO desalination ML on Databricks

Machine learning and distributed processing applied to desalination data using Spark on Databricks.

SWRO desalination ML on Databricks
Context
Public data engineering and machine learning project documented on Medium.
Problem
Industrial datasets require exploration, processing and modeling steps that can scale beyond local notebooks.
Solution
Used Apache Spark on Databricks to process desalination data and apply machine learning techniques.
Impact
Connects data engineering foundations with applied ML on a lakehouse platform.

Stack

DatabricksApache SparkMachine learningPython

Links

The project is a useful example of combining domain data, Spark processing and modeling work.