Which data science challenges do these two machine learning platforms solve? Download the whitepaper now!
While Databricks has support for blob storage though its Delta Lakes product, it builds on the Spark query language for accessing that data. Valohai is built for large scale processing of unstructured data with support for frameworks such as Horovod and scales from on-premises to hybrid-cloud data.
Valohai lets you run your experiments on the exact libraries and environments you need. It also helps you build ML pipeline steps that run in different environments. In Databricks you are bound to certain languages and library versions. Databricks also supports different languages in different notebook cells, which is handy in experimentation but not reproducible for more than one person working on a problem.
Both Databricks and Valohai ensure role-based access and team management. In terms of governance, Valohai also maintains an up to date audit trail so that you can trace from any experiment, through every script and notebook to the original code and datasets that were used.